[R] R on Multicore for Linux

Madana_Babu madana_babu at infosys.com
Thu Jul 21 21:20:28 CEST 2011

Hi all,

Currently i am trying to this on R which is running on multicore processor.
I am not sure how to use mclapply() function on this task. Can anyone help

# Setting up directory

# Data is available in the form of multiple structured log files (nearly 10K
log files)

# I am using the following syntax to get required fields and aggregations
from the logs and creating a file called DF (with 3 columns V2, V14 and

a <- list.files(path = ".", pattern = "2011-07-20", all.files = FALSE,
full.names = FALSE, recursive = FALSE, ignore.case = FALSE)

for (f in a)
	dat <- read.csv(f, header=FALSE, sep="\t", na.strings="",dec=".",
strip.white=TRUE, fill=TRUE)
	data_1 <- sqldf("SELECT V2, V14, MIN(V16) FROM dat WHERE V6=104 GROUP BY
V2, V14")
	DF <- rbind(DF, data_1)

# Currently this process is taking almost 3 Hrs for me.

Can anyone help me to use mclapply() on this operation and get this process
completed asap.

Request you to provide me the syntax.

Thanks in advance


View this message in context: http://r.789695.n4.nabble.com/R-on-Multicore-for-Linux-tp3682318p3684736.html
Sent from the R help mailing list archive at Nabble.com.

More information about the R-help mailing list