[R] Why do I have a column called row.names?

Bert Gunter gunter.berton at gene.com
Mon Jun 4 20:30:30 CEST 2012


Actually, I think it's ?data.frame that he should read.

The salient points are that:
1. All data frames must have unique row names. If not provided, they
are produced. Row numbers **are** row names.

2. The return value of read methods are data frames.

-- Bert

On Mon, Jun 4, 2012 at 11:05 AM, David L Carlson <dcarlson at tamu.edu> wrote:
> Try help("read.delim") - always a good strategy before using a function for
> the first time:
>
> In it, you will find: "Using row.names = NULL forces row numbering. Missing
> or NULL row.names generate row names that are considered to be 'automatic'
> (and not preserved by as.matrix)."
>
> ----------------------------------------------
> David L Carlson
> Associate Professor of Anthropology
> Texas A&M University
> College Station, TX 77843-4352
>
>
>> -----Original Message-----
>> From: r-help-bounces at r-project.org [mailto:r-help-bounces at r-
>> project.org] On Behalf Of Ed Siefker
>> Sent: Monday, June 04, 2012 12:47 PM
>> To: r-help at r-project.org
>> Subject: [R] Why do I have a column called row.names?
>>
>> I'm trying to read in a tab separated table with read.delim().
>> I don't particularly care what the row names are.
>> My data file looks like this:
>>
>>
>> start   stop    Symbol  Insert sequence Clone End Pair  FISH
>> 203048  67173930        ABC8-43024000D23                TI:993812543
>>  TI:993834585
>> 255176  87869359        ABC8-43034700N15                TI:995224581
>>  TI:995237913
>> 1022033 1060472 ABC27-1253C21           TI:2094436044   TI:2094696079
>> 1022033 1061172 ABC23-1388A1            TI:2120730727   TI:2121592459
>>
>>
>>
>> I have to do something with row.names because my first column has
>> duplicate entries.  So I read in the file like this:
>>
>> > BACS<-read.delim("testdata.txt", row.names=NULL, fill=TRUE)
>> > head(BACS)
>>   row.names    start             stop Symbol Insert.sequence
>> Clone.End.Pair
>> 1    203048 67173930 ABC8-43024000D23     NA    TI:993812543
>> TI:993834585
>> 2    255176 87869359 ABC8-43034700N15     NA    TI:995224581
>> TI:995237913
>> 3   1022033  1060472    ABC27-1253C21     NA   TI:2094436044
>> TI:2094696079
>> 4   1022033  1061172     ABC23-1388A1     NA   TI:2120730727
>> TI:2121592459
>>   FISH
>> 1   NA
>> 2   NA
>> 3   NA
>> 4   NA
>>
>>
>> Why is there a column named "row.names"?  I've tried a few different
>> ways of invoking this, but I always get the first column named
>> row.names,
>> and the rest of the columns shifted by one.
>>
>> Obviously I could fix this by using row.names<-, but I'd like to
>> understand
>> why this happens.  Any insight?
>>
>> ______________________________________________
>> R-help at r-project.org mailing list
>> https://stat.ethz.ch/mailman/listinfo/r-help
>> PLEASE do read the posting guide http://www.R-project.org/posting-
>> guide.html
>> and provide commented, minimal, self-contained, reproducible code.
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.



-- 

Bert Gunter
Genentech Nonclinical Biostatistics

Internal Contact Info:
Phone: 467-7374
Website:
http://pharmadevelopment.roche.com/index/pdb/pdb-functional-groups/pdb-biostatistics/pdb-ncb-home.htm



More information about the R-help mailing list