[BioC] getting Locus Link ids from gene symbol

Nianhua Li nialicn at yahoo.com
Mon Jun 11 14:36:31 CEST 2007


Hi, Alex,

You can parse ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene_info.gz
There are 4 useful columns: tax_id (column 1), GeneID (column 2), Symbol 
(column 3), and Synonyms (column 5). You can:

1 Read in the file
2 Filter it based on tax_id
3 Match your gene symbols to the "Symbol" column and find their Gene IDs
4 Remove the matched gene symbols from your list
5 Match the remaining gene symbols to the "Synonyms" column and find their 
Gene IDs
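
The steps above could be sketched roughly as follows in Python. This is only 
an illustration, not a tested pipeline: the function name 
map_symbols_to_gene_ids is made up for this example, and it assumes the file 
is tab-delimited, that comment/header lines start with "#", and that the 
Synonyms column separates entries with "|" and uses "-" when empty. Matching 
is case-sensitive, and when a symbol or synonym occurs in more than one row, 
the first row seen wins.

```python
import csv
import gzip


def map_symbols_to_gene_ids(lines, tax_id, symbols):
    """Map gene symbols to GeneIDs, trying Symbol first, then Synonyms.

    `lines` is any iterable of tab-delimited gene_info lines (assumed layout:
    tax_id, GeneID, Symbol, LocusTag, Synonyms, ...).
    """
    wanted = set(symbols)
    by_symbol = {}   # hits on the "Symbol" column (column 3)
    by_synonym = {}  # hits on the "Synonyms" column (column 5)

    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        # Skip header/comment lines and rows that are too short.
        if len(row) < 5 or row[0].startswith("#"):
            continue
        # Step 2: keep only the organism of interest.
        if row[0] != str(tax_id):
            continue
        gene_id, symbol, synonyms = row[1], row[2], row[4]
        # Step 3: match against the official symbol.
        if symbol in wanted:
            by_symbol.setdefault(symbol, gene_id)
        # Collect synonym hits in the same pass (assumed "|"-separated).
        for synonym in synonyms.split("|"):
            if synonym != "-" and synonym in wanted:
                by_synonym.setdefault(synonym, gene_id)

    # Steps 4-5: only symbols without a "Symbol" hit fall back to synonyms.
    result = dict(by_symbol)
    for symbol in wanted - set(by_symbol):
        if symbol in by_synonym:
            result[symbol] = by_synonym[symbol]
    return result


# Hypothetical usage with a locally downloaded copy of the file:
# with gzip.open("gene_info.gz", "rt", encoding="utf-8") as handle:
#     ids = map_symbols_to_gene_ids(handle, 9606, ["TP53", "ABG"])
```

Symbols that match neither column are simply absent from the returned 
dictionary, so you can see which ones still need to be checked by hand.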

Hope this helps.

nianhua

Nianhua Li
Software Developer


