[BioC] Fwd: Limma: How to read gene list, coordinates of sport when NO GAL file available
Gordon Smyth
smyth at wehi.edu.au
Mon Apr 10 02:30:39 CEST 2006
Dear Srinivas,
Have you given all the column names which are in the dataset file?
Files from ArrayExpression usually contain columns "Name" and
"DBidentifier", in which case you would use the argument
annotation=c("Name","DBidentifier")
when using read.maimages() to read them.
Best wishes
Gordon
>To: Srinivas Iyyer <srini_iyyer_bio at yahoo.com>
>From: Gordon Smyth <smyth at wehi.edu.au>
>Subject: [BioC] Limma: How to read gene list, coordinates of sport
>when NO GAL file available
>Cc: bioconductor at stat.math.ethz.ch
>Bcc:
>X-Eudora-Signature: <<No Default>>
>Date: Sat, 08 Apr 2006 21:02:42 +1000
>
>>Date: Fri, 7 Apr 2006 16:22:23 -0700 (PDT)
>>From: Srinivas Iyyer <srini_iyyer_bio at yahoo.com>
>>Subject: [BioC] Limma: How to read gene list , coordinates of sport
>> when NO GAL file available
>>To: bioconductor at stat.math.ethz.ch
>>
>>Dera group,
>>limma is an excellent module for gene expression data
>>preprocessing and analysis.
>>however, I looked into many places i did not find a
>>good tutorial when the .gpr file is not what I it
>>should look like. Also, when GAL file is not the same
>>what it should be.
>>
>>I have a dataset downloaded from ArrayExpress and has
>>the following column names:
>>
>>[B635+1SD B635+2SD Autoflag B
>>Pixels B635 B635 CV B635
>>Mean B635 Median B635 SD Circularity Dia. F Pixels
>>F635 % Sat. F635 CV F635 Mean F635 Mean - B635 F635
>>Median F635 Median - B635 F635 SD F635 Total Intensity
>>Flags Normalize SNR 635]
>>
>>
>>The chip definition file obtained from "Array design
>>used" section of ArrayExpress has the following
>>columns:
>>
>>[MetaColumn MetaRow Column Row Reporter Identifier
>>Reporter Name Reporter Biosequence Type Reporter
>>actual Sequence Reporter Comment Reporter Group Role
>>Reporter Control Type CompositeSequence Identifier
>>CompositeSequence Name Composite Sequence Comment]
>>
>>when i did:
>>dat <- read.maimages('filename',source
>>='genepix.custom')
>>
>>I get "Error in readGPRHeader(fullname) : File is not
>>in Axon Text File (ATF) format"
>>
>>
>>my questions are:
>>
>>what should I tell read.maimages to accept my file and
>>process further.
>>
>>what should I do when I do not have GAL file. how can
>>the other file help me get genelist etc.
>>
>>Please help me.
>>
>>Thanks
>>sri
>
>Dear Sri,
>
>I am a bit puzzled why you would try to tell read.maimages() that
>you have GenePix data. Is this because the data files were
>originally GenePix according the description? Anyway, it is apparent
>that ArrayExpression changes the data format, so that using the
>GenePix setup will not work.
>
>To read the intensity data into limma, you should proceed as per the
>section of the User's Guide starting from "What should you do if
>your image analysis program is not currently supported by limma?" in
>the middle of page 14.
>
>As for reading the chip definition file, I assume that this file is
>of the same length and order as the intensity data files. If that is
>so, just read the file into R yourself using something like
>
> anndata <- read.delim(chipdeffile, as.is = TRUE, quote = "\"",
> fill = TRUE)
>
>and then
>
> RG$genes <- anndata
>
>That's all. You do not need to create your own GAL file.
>
>Best wishes
>Gordon
More information about the Bioconductor
mailing list