[BioC] gcrma problem while processing HuGene-1_0-st-v1 genechip from Affymetrix NEGATIVE CONTROL PROBES
Richard Friedman
friedman at cancercenter.columbia.edu
Mon Jun 11 17:24:27 CEST 2012
Jim and list,
Thank you for bringing the NCprobe option to my attention.
I did not know that it had been implemented.
Does anyone out there have a list of negative control probes more
Human st 1.0 and for Mouse st 1.0 ?
Thanks and best wishes,
Rich
On Jun 11, 2012, at 11:18 AM, James W. MacDonald wrote:
> Hi Suparna,
>
> On 6/11/2012 11:00 AM, suparna mitra wrote:
>> Hi,
>> I am very new to biocondunctor and microaray. I have limited
>> experience
>> with R.
>> I am trying to use biocondunctor for analyzing HuGene-1_0-st-v1
>> microarray
>> data. I selectected different normalization method (rma, gcrma and
>> mas5).
>> For my data rma worked but for for gcrma and mas5 both I have
>> problem.
>
> This is to be expected. The HuGene array is a PM-only design, so
> mas5() won't work (because the mas5 algorithm requires subtracting
> MM from PM, and there are no MM probes). In addition, the default
> for gcrma() is to estimate the background for probes based on the GC
> content, using bins of MM probes. Again, without any MM probes, this
> won't work.
>
> Note however that gcrma() has an 'NCprobe' argument that you can use
> to specify an index of negative control probes. This is a non-
> trivial thing to do, and may be beyond your abilities if you are
> very new to R and BioC.
>
> To get the index of these probes, you will need to decide which
> probes are negative control probes, and then you can use the
> indexProbes() function, passing a character vector of the negative
> control probes to the genenames argument. This will return a list of
> indices for each probeset that you can unlist() prior to feeding in
> to gcrma().
>
> Or you could just stick with rma(), like the vast majority of people
> do.
>
> Best,
>
> Jim
>
>
>> For gcrma it gives me a error like: Computing affinitiesError:
>> length(prlen) == 1 is not TRUE
>>
>> And for mas 5 it seems working but I get only a whole list of NA.
>>
>> Here is what I have done.
>>
>>> mydata<- ReadAffy()
>>> mydata
>> AffyBatch object
>> size of arrays=1050x1050 features (16 kb)
>> cdf=HuGene-1_0-st-v1 (32321 affyids)
>> number of samples=18
>> number of genes=32321
>> annotation=hugene10stv1
>>
>>> eset<- rma(mydata)
>> Background correcting
>> Normalizing
>> Calculating Expression
>>> eset_justrma=justRMA()
>>> eset_mas5<- mas5(mydata)
>> background correction: mas
>> PM/MM correction : mas
>> expression values: mas
>> background correcting...done.
>> 32321 ids to be processed
>> | |
>> |####################|
>>> eset_gcrma<- gcrma(mydata)
>> Adjusting for optical effect..................Done.
>> Computing affinitiesError: length(prlen) == 1 is not TRUE Here is
>> the
>> error
>>
>>> eset_justrma # this worked fine
>> ExpressionSet (storageMode: lockedEnvironment)
>> assayData: 32321 features, 18 samples
>> element names: exprs, se.exprs
>> protocolData
>> sampleNames: MC1_(HuGene-1_0-st-v1).CEL MC10_(HuGene-1_0-st-
>> v1).CEL ...
>> MC9_(HuGene-1_0-st-v1).CEL (18 total)
>> varLabels: ScanDate
>> varMetadata: labelDescription
>> phenoData
>> sampleNames: MC1_(HuGene-1_0-st-v1).CEL MC10_(HuGene-1_0-st-
>> v1).CEL ...
>> MC9_(HuGene-1_0-st-v1).CEL (18 total)
>> varLabels: sample
>> varMetadata: labelDescription
>> featureData: none
>> experimentData: use 'experimentData(object)'
>> Annotation: hugene10stv1
>>> eset_mas5 # this seems worked fine but resulted all NA
>> ExpressionSet (storageMode: lockedEnvironment)
>> assayData: 32321 features, 18 samples
>> element names: exprs, se.exprs
>> protocolData
>> sampleNames: MC1_(HuGene-1_0-st-v1).CEL MC10_(HuGene-1_0-st-
>> v1).CEL ...
>> MC9_(HuGene-1_0-st-v1).CEL (18 total)
>> varLabels: ScanDate
>> varMetadata: labelDescription
>> phenoData
>> sampleNames: MC1_(HuGene-1_0-st-v1).CEL MC10_(HuGene-1_0-st-
>> v1).CEL ...
>> MC9_(HuGene-1_0-st-v1).CEL (18 total)
>> varLabels: sample
>> varMetadata: labelDescription
>> featureData: none
>> experimentData: use 'experimentData(object)'
>> Annotation: hugene10stv1
>>> write.exprs(eset_justrma,file="eset_justrma.csv")
>>> write.exprs(eset_mas5,file="eset_mas5.csv")
>>> write.exprs(eset,file="eset.csv")
>> Any help in this will be really great. Being a novice, I am very
>> sorry if I
>> am doing any silly mistake.
>> Thanks a lot,
>> Suparna.
>>
>
> --
> James W. MacDonald, M.S.
> Biostatistician
> University of Washington
> Environmental and Occupational Health Sciences
> 4225 Roosevelt Way NE, # 100
> Seattle WA 98105-6099
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
More information about the Bioconductor
mailing list