[BioC] localization of mm values in affybatch exprs matrix
Kasper Daniel Hansen
khansen at stat.Berkeley.EDU
Sat Jan 27 00:04:55 CET 2007
So I was a bit quick. It seems from Karin's post that she already has
a CDF env.
Jim's statement that the CEL file is ordered PM then MM is probably
right for most chips, but in general you can only be sure that the PM
and the MM are spatially close. In general you should use the CDF
information to link the pm/mm/(x,y) position together and you cannot
a priori know what coordinate corresponds to in terms of pm/mm/probeset.
Kasper
On Jan 26, 2007, at 2:56 PM, Kasper Daniel Hansen wrote:
>
> On Jan 25, 2007, at 10:10 AM, James W. MacDonald wrote:
>
>> Hi Karin,
>>
>> Karin Lagesen wrote:
>>> I have a custom affy chip that I read into R using ReadAffy():
>>>
>>>> newdata = ReadAffy()
>>>> newdata
>>>
>>> AffyBatch object
>>> size of arrays=754x754 features (17777 kb)
>>> cdf=E_colia530222N (11378 affyids)
>>> number of samples=4
>>> number of genes=11378
>>> annotation=ecolia530222n
>>>
>>>
>>> I now want to look at different values in this object.
>>>
>>> For instance, some pm values:
>>>
>>>
>>>> pm(newdata)[1:5,]
>>>
>>> chip1.CEL chip2.CEL chip3.CEL chip4.CEL
>>> [1,] 1855.0 2180.8 1444.0 2932.0
>>> [2,] 2812.0 3451.0 2276.5 3406.0
>>> [3,] 4162.3 4301.0 2996.0 5088.0
>>> [4,] 1608.5 1758.0 1123.0 1987.0
>>> [5,] 2290.0 3189.0 2474.5 2838.3
>>>
>>>
>>> I now also look at the values in the affybatch exprs matrix:
>>>
>>>
>>>> newdata at exprs[1:5,]
>>>
>>> chip1.CEL chip2.CEL chip3.CEL chip4.CEL
>>> [1,] 942.0 776.0 281 1475
>>> [2,] 24422.0 26071.0 8914 21826
>>> [3,] 1024.5 908.8 227 1594
>>> [4,] 26267.0 27674.0 16199 22104
>>> [5,] 130.0 193.0 168 145
>>>
>>>
>>> I also notice that the dimension of the exprs matrix is such that
>>> there is one column for each chip, and as many rows as there are pm
>>> plus mm values.
>>>
>>> Are the first half of rows the pm values, with the mm values
>>> following, or are the pm values every other row with the
>>> corresponding
>>> mm value below, or is this set up in some other way? Is there any
>>> way
>>> for me to look at a value in the exprs matrix and find out which
>>> entry
>>> in the pm/mm value list it is?
>>
>> The chip is read in row-wise, and the PM probes are in a given row,
>> with
>> the MM probes in the following row. Therefore, the data (excluding
>> the
>> various QC probes) will be N PM probes followed by N MM probes,
>> where N
>> is the row length of the chip.
>
> This is not true I believe. The are no clear order of the pm and
> mm's. You need to get that information from somewhere else, usually
> from a CDF file.
>
> Karin: you will need to use the makecdfenv package to make what is
> called a CDF package - an R representation of the PM/MM/probeset
> pairs.
>
> Kasper
>
>
>> If you really want to work with the exprs matrix directly (why?), you
>> can use indexProbes() to find the indices for whatever probeset you
>> are
>> interested in, and then subset out. Alternatively you can get the
>> indices for the PM and MM probes and subset those out separately
>> (which
>> is how pm() and mm() work). You can also use pm() or mm() with an
>> optional genenames argument to get the PM or MM probe values for a
>> particular probeset or probesets.
>>
>>
>> Best,
>>
>> Jim
>>
>>
>>>
>>> TIA,
>>>
>>> Karin
>>
>>
>> --
>> James W. MacDonald, M.S.
>> Biostatistician
>> Affymetrix and cDNA Microarray Core
>> University of Michigan Cancer Center
>> 1500 E. Medical Center Drive
>> 7410 CCGC
>> Ann Arbor MI 48109
>> 734-647-5623
>>
>>
>> **********************************************************
>> Electronic Mail is not secure, may not be read every day, and
>> should not be used for urgent or sensitive issues.
>>
>> _______________________________________________
>> Bioconductor mailing list
>> Bioconductor at stat.math.ethz.ch
>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>> Search the archives: http://news.gmane.org/
>> gmane.science.biology.informatics.conductor
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/
> gmane.science.biology.informatics.conductor
More information about the Bioconductor
mailing list