[BioC] converting Affy indices to x,y coordinates
Kasper Daniel Hansen
kasperdanielhansen at gmail.com
Mon Feb 14 19:23:40 CET 2011
Todd,
This may be a bit hard to explain. There are essentially two index
numbers: the one used by Bioconductor/aff/affxparser etc. and the one
stored in the CDF file. The one stored in the CDF file (which you
will never see used in any Bioconductor documentation) is zero-based,
whereas the one "we" use is 1-based. Why the discrepancy. Well, I
cannot speak for Affymetrix (but I guess this is caused by C using
zero-based indixes), but in Bioconductor we use 1-based indexing
because if we read an entire CEL file into a vector we want to be able
to do
vector[INDEX]
and indexing is 1-based in R.
It is pretty clear the documentation in affxparser is a bit unclear
here. If you are really trying to understand the internals, you will
- aside from reading the affxparser docs - also have to do a fair
amount of experimentation and reading of the affymetrix file format
specs.
Kasper
On Mon, Feb 14, 2011 at 12:52 PM, Todd Allen <genesplicer28 at yahoo.com> wrote:
> Kasper,
>
> Let me clarify. I have opened the HGU133 Plus 2 CDF file inside Microsoft notepad, and I can visually see lists of data underneath header information. I randomly chose the value of
> 1354890, which I am confident is an authentic affymetrix index for a single, specific affymetrix probe on the chip because of the descriptive header information that is present.
>
> Assuming this value is an authentic index, I was hoping to use the formulas in the affyparser documentation to manually calculate the x & y coordinates of the probe on the affy chip. As mentioned below, the y coord is coming out correctly, but the x coordinate is off by 1.
>
> So, I am trying to understand if the problem is with something I am doing wrong, or whether the documented formulas in affyparser are somehow off.
>
> Todd
>
>
>
>
> --- On Mon, 2/14/11, Kasper Daniel Hansen <kasperdanielhansen at gmail.com> wrote:
>
>> From: Kasper Daniel Hansen <kasperdanielhansen at gmail.com>
>> Subject: Re: [BioC] converting Affy indices to x,y coordinates
>> To: "Todd Allen" <genesplicer28 at yahoo.com>
>> Cc: bioconductor at r-project.org
>> Date: Monday, February 14, 2011, 12:12 PM
>> On Mon, Feb 14, 2011 at 11:18 AM,
>> Todd Allen <genesplicer28 at yahoo.com>
>> wrote:
>> > In my own work, I am processing a HGU133Plus 2 CDF
>> file. The array dimensions are (1164, 1164) and if I take
>> the index of a specific probe listed as 1354890, I
>>
>> What exactly do you mean by "I take the index of a specific
>> probe
>> listed as 1354890"? Listed where, where do you get
>> this number and
>> how do you know what line in the CDF file corresponds to
>> this probe?
>>
>> Kasper
>>
>
>
>
>
More information about the Bioconductor
mailing list