[R] how can I import a number of datsets in a folder in my working directory to a list in R

arun smartpink111 at yahoo.com
Thu Nov 21 15:06:20 CET 2013


Hi,
Try:
#Generating the files

fileList1 <- paste0(rep(LETTERS[1:5],each=3),1:3,".txt")
set.seed(48)
lapply(fileList1,function(x) {m1 <- matrix(sample(1:20,1686*2,replace=TRUE),nrow=1686,ncol=2); write.table(m1,paste0("/home/arunksa111/Trial6/IR/",x),row.names=FALSE,quote=FALSE)}) 


dir()
#[1] "hcluster.r" "IR"     ##created another file in working directory to mimic the situation  
 D <- dir(path="IR",all.files=F,full.names=F,recursive=T)
D
# [1] "A1.txt" "A2.txt" "A3.txt" "B1.txt" "B2.txt" "B3.txt" "C1.txt" "C2.txt"
# [9] "C3.txt" "D1.txt" "D2.txt" "D3.txt" "E1.txt" "E2.txt" "E3.txt"

res1 <- do.call(rbind,lapply(D,function(x) {x1 <- read.table(paste0("/home/arunksa111/Trial6/IR/",x),header=TRUE);x1[,2]}))  ##change the path 
 dim(res1)
#[1]   15 1686
#or
res2 <- do.call(rbind,lapply(D,function(x) {x1 <- read.table(paste0(getwd(),"/IR/",x),header=TRUE);x1[,2]}))  
 identical(res1,res2)
#[1] TRUE



A.K.



Thanks for your prompt reply AK. I tried the script and it is still not working. The situation is this: the 15 files are inside a 
folder named "IR" in my working directory. The folder is NOT my working 
directory but is INSIDE my working directory;so my working directory is 
just like this
auto.r
rangescale.r
mncn.r
hcluster.r
IR(the folder that contains the 15 files)

The 15 files 
in the folder are named 
A1.txt,A2.txt,A3.txt,B1.txt,B2.txt,B3.txt,C1.txt,C2.txt,C3.txt,D1.txt,D2.txt,D3.txt,E1.txt,E2.txt,E3.txt. The challenge is to get the files into R using the dir() ,for loop 
,read.table and rbind to form a matrix containing only the 2nd column of each files. then create a vector of the filenames. If I make the folder "IR" my working directory, I wont be able to use the functions listed 
above on the data. Please help me look at this, critically. Thank you




On Thursday, November 21, 2013 1:34 AM, arun <smartpink111 at yahoo.com> wrote:
Hi,

Suppose, if I create 15 files in my working directory.
set.seed(48)
lapply(1:15,function(i) {m1 <- matrix(sample(1:20,1686*2,replace=TRUE),nrow=1686,ncol=2); write.table(m1,paste0("file_",i,".txt"),row.names=FALSE,quote=FALSE)})

 D <-dir()
D1 <- D[order(as.numeric(gsub("\\D+","",D)))]
D1

 res <- t(sapply(D1,function(x) {x1<- read.table(x,header=TRUE); x1[,2]}))
dim(res)
#[1]   15 1686
#or
res1 <- do.call(rbind,lapply(D1,function(x) {x1<- read.table(x,header=TRUE); x1[,2]}))
 dim(res1)
#[1]   15 1686
 dimnames(res) <- dimnames(res1)
 identical(res,res1)
#[1] TRUE

A.K.


I have a folder containing 15 text files in my working directory.  I 
want to use the dir() function 
D<-dir(path="IR",all.files=F,full.names=F,recursive=T) to get the 
files in a filelist in R 
....D<-dir(path="IR",all.files=F,full.names=F,recursive=T) . the 
output is that D is a list of the names of the 15. however, the files 
are inaccessible. I want R to access to access each file which is a 
matrix of dimension 1686 by 2 so I can be able to form a new matrix (15 
by 1686) using the rbind function binding the second columns of the 15 
files together.



More information about the R-help mailing list