[BioC] hgu133a annotation and discontinued Entrez IDs

James W. MacDonald jmacdon at med.umich.edu
Fri Jun 13 15:18:34 CEST 2008



David Fermin wrote:
> anna freni sterrantino <annafreni at ...> writes:
> 
>> Hi David ,
>> could be that you should get 
>> library(hgu133a.db)
>> form your sessionInfo it seems that you have the older version.
>> You should also update your R to 2.7 and get the Bioc package Release 2.2.
>>
>> Cheers
>>
>> A.
>>
>>
>> _______________________________________________
>> Bioconductor mailing list
>> Bioconductor at ...
>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
> 
> 
> Thank you for your reply. I know there are newer versions available, but for the 
> manuscript I'm preparing I would like to be consistent with version control 
> amongst my analyses.
> 
> The question remains, why am I getting probesets with antiquated Entrez IDs
>  when the annotation package I used (see original message) from August 2007 
> should be adequately up to date so as to exclude these discontinued IDs?

As you noted in your first post, the help page for this package says:

The annotation package was built using a downloadable R package -
AnnBuilder (download and build your own) from www.bioconductor.org
using the following public data sources:
Entrez Gene:ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/. Built: Source data
downloaded from Entrez Gene on Fri Aug 24 18:20:19 2007...

In other words, we downloaded the data from Entrez Gene on that day and 
parsed from there, so any discontinued Entrez Gene IDs in that package 
were still in the data we downloaded and parsed.

Our purpose here is to simply make annotation data available in a 
convenient form, not to check the data made available by NCBI and ensure 
it is correct (that is NCBI's job), so my recommendation would be to 
inform them of these errors.

Best,

Jim



> 
> David
> 
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor

-- 
James W. MacDonald, M.S.
Biostatistician
Affymetrix and cDNA Microarray Core
University of Michigan Cancer Center
1500 E. Medical Center Drive
7410 CCGC
Ann Arbor MI 48109
734-647-5623



More information about the Bioconductor mailing list