[R] Regular expressions on filenames
arun
smartpink111 at yahoo.com
Thu Jan 16 01:48:02 CET 2014
Hi,
Try:
FILELIST <- list.files()
FILELIST
#[1] "FILE.csv" "FILE.XXX.csv" "FILE.YYY.xls"
sub("(.*)\\..*$", "\\1", basename(FILELIST))
#[1] "FILE" "FILE.XXX" "FILE.YYY"
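The pattern above works because the first `.*` is greedy, so the literal dot ends up matching the last dot in the name. A minimal sketch of two equivalent ways to anchor on the last dot, followed by the duplicate check from the original question. The four file names are hypothetical stand-ins for the ones mentioned in this thread, and `stems`, `stems2` and `dups` are illustrative names only:

```r
# Hypothetical file names mirroring the ones in this thread
FILELIST <- c("FILE.csv", "FILE.xls", "FILE.XXX.csv", "FILE.YYY.xls")

# Anchor on the last dot: strip a final ".ext" that contains no further dots
stems <- sub("\\.[^.]*$", "", basename(FILELIST))
stems
#[1] "FILE"     "FILE"     "FILE.XXX" "FILE.YYY"

# Same result here from the helper in the 'tools' package shipped with R
stems2 <- tools::file_path_sans_ext(basename(FILELIST))

# Flag every file whose stem occurs more than once (both copies, not just the second)
dups <- duplicated(stems) | duplicated(stems, fromLast = TRUE)
FILELIST[dups]
#[1] "FILE.csv" "FILE.xls"
```

Note that tools::file_path_sans_ext() only strips extensions made of alphanumeric characters, so the two approaches can differ on unusual names.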
A.K.
On Wednesday, January 15, 2014 7:35 PM, Fisher Dennis <fisher at plessthan.com> wrote:
R 3.0.2
OS X
Colleagues
I am writing code to read a large number of files in a particular folder. In some situations, there may be two versions of the file with different extensions, e.g.:
FILE.csv
FILE.xls
I extracted the portion before the extension with:
sub("\\..*$", "", basename(FILELIST))
then used
duplicated
to find duplicates. All was well until I encountered files named:
FILE.XXX.csv
FILE.YYY.xls
My regular expression extracted only the “FILE” portion of the text and claimed that the filenames (without the extensions) matched. Can someone provide me with the appropriate regular expression to deal with this? Thanks.
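The behaviour described above can be reproduced with a minimal sketch. The two-element vector is a hypothetical stand-in for the real folder listing, and `first_dot` is an illustrative name:

```r
# Hypothetical file list matching the names in the question
FILELIST <- c("FILE.XXX.csv", "FILE.YYY.xls")

# "\\..*$" begins matching at the FIRST dot, so everything after it is dropped
first_dot <- sub("\\..*$", "", basename(FILELIST))
first_dot
#[1] "FILE" "FILE"

# Both names collapse to "FILE", so the second is reported as a duplicate
duplicated(first_dot)
#[1] FALSE  TRUE
```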
Dennis
Dennis Fisher MD
P < (The "P Less Than" Company)
Phone: 1-866-PLessThan (1-866-753-7784)
Fax: 1-866-PLessThan (1-866-753-7784)
www.PLessThan.com