[R] extracting information from txt file
David Winsemius
dwinsemius at comcast.net
Wed Oct 31 19:26:19 CET 2012
On Oct 31, 2012, at 11:11 AM, David Winsemius wrote:
>
> On Oct 31, 2012, at 9:46 AM, chuck.01 wrote:
>
>> Hello,
>>
>> Here is a link to some data:
>> http://www.epa.gov/emap/html/data/surfwatr/data/mastreams/9396/wchem/chmval.txt
>>
>> I am trying to read this in, and want to use:
>> chmval <-
>> read.table("http://www.epa.gov/emap/html/data/surfwatr/data/mastreams/9396/wchem/chmval.txt",
>> sep=",", skip= 84, header=T)
>>
>> the # 84, for 84 lines skipped needs to be derived from the 5th line of the
>> txt file
>> # Header Records: 85
>>
>> so, I need that # (-1) for input into the read.table statement above
>
> That "# (-1)" is fairly cryptic to my reading, but it appears you are seeing the
> behavior of the "3" character in terminating input for comments.
That would be the shifted-"3".
> Changing the comment character in the call to read.table will allow input from that line.
>
> ?read.table
>
> You will need to read only the first 5 or 6 lines first, then execute a separate read.table while skipping input from those lines as well as the variable list that forms a secondary header.
>
>> headfrm <- read.table( file=url( "http://www.epa.gov/emap/html/data/surfwatr/data/mastreams/9396/wchem/chmval.txt"), nrows=6, sep=":", comment.char="")
>> headfrm
> V1 V2
> 1 Dataset EMAP Stream Chemistry Data
> 2 File Name chmval
> 3 Date Created 02/22/99
> 4 # Variables 75
> 5 # Header Records 85
> 6 # Data Records 711
>
>
>
>>
>> I've tried grep but that didn't work:
>> (for this I downloaded the txt file and manually removed that hash mark!)
>>
>> grep("Header Records:", read.table("chmval.txt", header=T))
>> Error in scan(file, what, nmax, sep, dec, quote, skip, nlines, na.strings,
>> :
>> line 1 did not have 5 elements
>>
>> Any ideas?
>> Can I just extract the 5th line?
>>
>>
>>
>>
>> --
>> View this message in context: http://r.789695.n4.nabble.com/extracting-information-from-txt-file-tp4648033.html
>> Sent from the R help mailing list archive at Nabble.com.
>>
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>
> David Winsemius, MD
> Alameda, CA, USA
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
David Winsemius, MD
Alameda, CA, USA
More information about the R-help
mailing list