[R] Data import, going from 8 to 550 columns

Prof Brian Ripley ripley at stats.ox.ac.uk
Sat Apr 24 13:15:52 CEST 2004


You cannot use read.table() to read multi-line records as here.
Why?  It is not a table?

You can use scan() to do this.

On Sat, 24 Apr 2004, CG Pettersson wrote:

> Hello all!
> 
> I need to import a NIR dataset into R. It should be quite trivial, but
> I 
> can´t make it work. (No problems with the text in the beginning, as #
> is 
> recognised by read.table as the comment sign.)
> 
> The thing I can´t get around is the <CR> that ends every line after
> column eight as the line in R should be 550 columns wide (including
> the JF-number). 
> Every new line in R should begin with the "JF2455" and so on.
> Naturally it is possible to re-shape the tables in Excel before
> import, but it is quite tedious and doesn´t feel right...!
> 
> How do I make read.table to just go on reading on the next line when
> it comes to <CR>, and how do I make it use the double <CR> followed by
> a blank to begin the next line?
> 
> The data-file(s) looks like this:
> 
> 
> #ID=Samples from soil scanning
> #SAMPLE_NUMBERS_PRESENT=Y
> #NX_VARIABLES=550
> #NY_VARIABLES=0
> #FIRST_WAVELENGTH=1300.000000
> #LAST_WAVELENGTH=2398.000000
> #WAVELENGTH_INCREMENT=2.000000
> JF2455  0.4367495 0.4365539 0.4363573 0.4361560 0.4359702 0.4357788 
> 0.4355963 0.4354126 0.4352311 0.4350726 0.4349101 0.4347557 0.4346097
> 0.4344587 
> 0.4343193 0.4341759 0.4340320 0.4338984 0.4337671 0.4336369 0.4335097
> 0.4333864 
>   the original table is 8 columns wide, ended with a <CR>
>   sixty four lines removed here....
> 
> 0.5015950 0.5020472 0.5026294 0.5033303 0.5041344 0.5049909 0.5059010
> 0.5067372 
> 0.5075415 0.5082389 0.5089509 0.5095288 0.5101137 0.5106306 0.5111954
> 0.5116805 
>  
>  JF2456  0.3604568 0.3600681 0.3596676 0.3592694 0.3588919 0.3585098 
> 0.3581379 0.3577725 0.3573992 0.3570563 0.3566975 0.3563588 0.3560365
> 0.3556931 
> 0.3553730 0.3550543 0.3547286 0.3544230 0.3541073 0.3537982 0.3535004
> 0.3531921 
> 0.3529077 0.3526271 0.3523493 0.3520919 0.3518271 0.3515673 0.3513192
> 0.3510693 
> 0.3508208 0.3505693 .......
> 
> and so on
> 
> Thanks!
> /CG
> 
> 
> CG Pettersson, MSci, PhD Stud.
> Swedish University of Agricultural Sciences
> Dep. of Ecology and Crop Production. Box 7043
> SE-750 07 Uppsala
> 
> ______________________________________________
> R-help at stat.math.ethz.ch mailing list
> https://www.stat.math.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html
> 
> 

-- 
Brian D. Ripley,                  ripley at stats.ox.ac.uk
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford,             Tel:  +44 1865 272861 (self)
1 South Parks Road,                     +44 1865 272866 (PA)
Oxford OX1 3TG, UK                Fax:  +44 1865 272595




More information about the R-help mailing list