[BioC] getting Locus Link ids from gene symbol
Nianhua Li
nialicn at yahoo.com
Mon Jun 11 14:36:31 CEST 2007
Hi, Alex,
You can parse ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene_info.gz
There are 4 useful columns: tax_id (column 1), GeneID (column 2), Symbol
(column 3), and Synonyms (column 5). You can:
1 Read in the file
2 filter it based on tax_id
3 match your gene symboles to the "Symbol" column and find their Gene ID
4 removed the matched gene symboles from your list
5 match the rest of gene symboles to the "Synonyms" column and find their Gene
ID
hope this helps
nianhua
Nianhua Li
Software Developer
More information about the Bioconductor
mailing list