[Bioc-devel] Removal of Information in OrgDb generated from NCBI -- Feedback needed.
Stadler, Michael
M|ch@e|@St@d|er @end|ng |rom |m|@ch
Thu Apr 30 08:46:04 CEST 2020
Hi Lori
Just my two cents: I would not miss UNIGENE.
I use the OrgDb packages mostly to annotate primary gene identifiers (ENTREZID, ENSEMBL) with
additional human-readable information (SYMBOL, GENENAME), and to map between different
primary identifiers. The packages I mostly use, in decreasing order of importance, are:
org.Hs.eg.db, org.Mm.eg.db, org.Rn.eg.db, org.Ce.eg.db, org.Dm.eg.db, org.Sc.sgd.db
The columns I use most frequently are (again, in decreasing order of importance):
ENTREZID, ENSEMBL, SYMBOL, GENENAME, UNIPROT, GO
There may be other, indirect usage that I am not aware of.
Best wishes,
Michael
-----Original Message-----
From: Bioc-devel <bioc-devel-bounces using r-project.org> On Behalf Of Shepherd, Lori
Sent: Wednesday, 29 April 2020 19:50
To: Bioc-devel using r-project.org
Subject: [Bioc-devel] Removal of Information in OrgDb generated from NCBI -- Feedback needed.
Hello Bioconductor maintainers
The core team was made aware of an issue with one of the make orgDb functions in AnnotationForge:
https://github.com/Bioconductor/AnnotationForge/issues/13
Investigating further, we found that NCBI will no longer update the gene2unigene file. The URL has moved to an ARCHIVE directory, and an explanation and notice of retirement can be found here:
ftp://ftp.ncbi.nih.gov/repository/UniGene/README
We use this function to create OrgDbs for AnnotationHub, and it is also the recommended way for users to make custom OrgDbs from NCBI.
As a temporary measure, we are updating the URL to the new location, but we are considering removing the gene2unigene data from the OrgDbs. We would like to ask the community, especially those who use OrgDb objects frequently: is this data still necessary, and would removing UNIGENE cause a large disruption to current packages, functions, or use of the objects?
Any feedback is greatly appreciated.
Thank you
Lori Shepherd
Bioconductor Core Team
Roswell Park Comprehensive Cancer Center
Department of Biostatistics & Bioinformatics
Elm & Carlton Streets
Buffalo, New York 14263