[BioC] Drosophila Genome build used for GeneChip Drosophila-2 CDF?

James W. MacDonald jmacdon at med.umich.edu
Wed Jan 13 15:08:40 CET 2010


Hi David,

I guess it depends on what you are really after. The CDF simply maps the 
locations of the probes on a chip back to their probesets, so there is 
never any need to update that file (if Affy changed the design of the 
chip they would just release a new Drosophila chip as they did when they 
went to the Drosophila 2.0 chip).

I sort of assumed you originally meant to ask what genome build the chip 
was designed against, which is what they show in the pdf I directed you 
to. Again, this information is set in stone.

What may change is what the probes are assumed to bind to. If for 
instance an EST is later mapped to a particular gene, then the 
annotation for the probeset directed against the EST may change to 
reflect the newer annotation. These data are contained in the 
drosophila2.db package, which in turn is based on the most recent 
annotation file from Affymetrix. IIRC, Affy updates these annotation 
files quarterly, so we would have used the newest Affy annotation file 
as of October 2009 for the release version of this package.

Note that annotating the probesets is a non-trivial task, and we are 
simply taking the probeset -> Entrez Gene ID mappings from Affy and 
doing the mapping from Entrez Gene to the various other available 
annotation data. Since the annotation of any genome is a moving target, 
and it is a long time since 2003, some of the things that the probesets 
on this chip are intended to interrogate may no longer be interrogated 
as well (or at all), according to the genome as we now know it. Our 
annotation packages only take this into account to the extent that the 
annotation file from Affy does.

A different approach to this problem is taken by the folks at MBNI, who 
create their own CDF files based on the current genome build. If 
interested, you can see information about these packages at

http://brainarray.mbni.med.umich.edu/Brainarray/Database/CustomCDF/genomic_curated_CDF.asp

They have set up a CRAN-style repository that biocLite() knows about, so 
if interested, you can get their packages directly using biocLite().

Best,

Jim



Dávid Molnár wrote:
> Hi Jim,
> 
> thank you for the swift reply. I puzzled over that affymetrix
> datasheet before, and I should probably have been more specific on
> that: has affymetrix never updated this CDF based on Drosophila genome
> release 3.1? Is bioconductor using the unupdated affymetrix CDF?
> Release 3.1 of the Drosophila genome is from 2003.  Data and gene
> models have considerably improved since then. In the CDF packages on
> the bioconductor server the date of the files is from 2008. I
> speculated that somebody might have had a look on them recently. Is
> this true?
> 
> Best regards,
> David
> 
> 
> 2010/1/12 James W. MacDonald <jmacdon at med.umich.edu>:
>> Hi David,
>>
>> Look at the left margin here:
>>
>> http://www.affymetrix.com/support/technical/datasheets/drosophila2_datasheet.pdf
>>
>> Best,
>>
>> Jim
>>
>>
>>
>> Dávid Molnár wrote:
>>> Dear all,
>>>
>>> I tried to find out in vain from which genome release the current Chip
>>> Description File for GeneChip Drosophila-2 was made.
>>> Could anybody give me please a hint?
>>>
>>> Best regards,
>>> David Molnar
>>>
>>> _______________________________________________
>>> Bioconductor mailing list
>>> Bioconductor at stat.math.ethz.ch
>>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>>> Search the archives:
>>> http://news.gmane.org/gmane.science.biology.informatics.conductor
>> --
>> James W. MacDonald, M.S.
>> Biostatistician
>> Douglas Lab
>> University of Michigan
>> Department of Human Genetics
>> 5912 Buhl
>> 1241 E. Catherine St.
>> Ann Arbor MI 48109-5618
>> 734-615-7826
>> **********************************************************
>> Electronic Mail is not secure, may not be read every day, and should not be
>> used for urgent or sensitive issues
>>

-- 
James W. MacDonald, M.S.
Biostatistician
Douglas Lab
University of Michigan
Department of Human Genetics
5912 Buhl
1241 E. Catherine St.
Ann Arbor MI 48109-5618
734-615-7826
**********************************************************
Electronic Mail is not secure, may not be read every day, and should not be used for urgent or sensitive issues 



More information about the Bioconductor mailing list