[BioC] Where can I download/install old packages like 'org.Rn.eg.db' based on rn4?
Hervé Pagès
hpages at fhcrc.org
Thu Jan 2 23:33:34 CET 2014
Hi ZQ,
On 01/01/2014 12:51 AM, Zhi-Qiang Ye wrote:
> Hi, thanks for your help.
>
> The information I need is the 'rn4' coordinates of Entrez Genes, so I
> finally used the 'refGene' table and 'refLink' table downloaded from UCSC
> genome browser database under the directory of 'rn4', and got what I need.
Following Steve's original suggestion (but using the refGene table
instead of ensGene):
library(GenomicFeatures)
txdb <- makeTranscriptDbFromUCSC(genome='rn4', tablename='refGene')
gn <- genes(txdb)
This gives you the 'rn4' coordinates of Entrez Genes:
> head(gn)
GRanges with 6 ranges and 1 metadata column:
seqnames ranges strand | gene_id
<Rle> <IRanges> <Rle> | <CharacterList>
100034253 chrX [ 40301219, 40334396] - | 100034253
100036582 chr8 [ 18755584, 18756808] + | 100036582
100036765 chr12 [ 32898194, 32924599] + | 100036765
100049583 chr8 [114071508, 114072510] - | 100049583
100124593 chr8 [128569564, 128570618] + | 100124593
100125361 chr10 [ 45764849, 45775258] - | 100125361
---
seqlengths:
chr1 chr2 chr3 ... chrUn chrUn_random
267910886 258207540 171063335 ... 75822765 6862066
> head(as.character(mcols(gn)$gene_id))
[1] "100034253" "100036582" "100036765" "100049583" "100124593"
"100125361"
Note that makeTranscriptDbFromUCSC() downloads and uses the 'refGene'
and 'refLink' tables from UCSC for you in order to generate the
TranscriptDb object 'txdb'. In addition to being convenient, this
method also avoids some of the common pitfalls of downloading and
processing those tables "manually".
Cheers,
H.
> As for other information not related to the chromosomal locations, I just
> used the information in the package 'org.Rn.eg.db'.
>
> Maybe the maintainner can still provide official packages based on 'rn4',
> say, 'org.Rn4.eg.db', since annotation based on 'rn5' is not as complete as
> that based on 'rn4'.
>
> Best,
> ZQ
>
> On Sat, Dec 21, 2013 at 8:53 AM, Marc Carlson <mcarlson at fhcrc.org> wrote:
>
>> Steve is right. You could go and do that. However in this case we have
>> already done the work for you. You can find a package here:
>>
>> http://www.bioconductor.org/packages/release/data/annotation/html/TxDb.
>> Rnorvegicus.UCSC.rn4.ensGene.html
>>
>> That should get you transcript locations for RN4.
>>
>> For more information about how you can make use of that resource and of
>> others like it, please be sure to also peak at our workflow:
>>
>> http://www.bioconductor.org/help/workflows/annotation/annotation/
>>
>>
>> Marc
>>
>>
>>
>>
>>
>> On 12/18/2013 10:09 PM, Steve Lianoglou wrote:
>>
>>> Hi,
>>>
>>> On Wed, Dec 18, 2013 at 6:20 PM, Zhi-Qiang Ye <yezhiqiang at gmail.com>
>>> wrote:
>>>
>>>> Hi, all
>>>>
>>>> I need to analysis genes whose chromosomal locations are based
>>>> on
>>>> the rat genome version rn4, so installed 'org.Rn.eg.db'. However, the
>>>> genome coordinate system is based on rn5.
>>>>
>>>> Is there any way to get and install old versions of
>>>> 'org.Rn.eg.db'?
>>>>
>>> You might consider building a TranscriptDb object from rn4 annotations
>>> using the GenomicFeatures package:
>>>
>>> http://bioconductor.org/packages/release/bioc/html/GenomicFeatures.html
>>>
>>> Read through its vignette to figure out how to download and build a
>>> TranscriptDb from UCSC. It's going to be something like:
>>>
>>> R> library(GenomicFeatures)
>>> R> txdb <- makeTranscriptDbFromUCSC(genome='rn4', tablename='ensGene')
>>>
>>> Keep reading to figure out how to lookup genes/transcripts by
>>> (ensembl) transcript IDs, and I think you should be good to go.
>>>
>>> If you need to do sequence related analyses using these annotations,
>>> you can use the rn4 BSgenome object, to boot:
>>>
>>> http://bioconductor.org/packages/release/data/annotation/html/BSgenome.
>>> Rnorvegicus.UCSC.rn4.html
>>>
>>> HTH,
>>> -steve
>>>
>>>
>> _______________________________________________
>> Bioconductor mailing list
>> Bioconductor at r-project.org
>> https://stat.ethz.ch/mailman/listinfo/bioconductor
>> Search the archives: http://news.gmane.org/gmane.
>> science.biology.informatics.conductor
>>
>
> [[alternative HTML version deleted]]
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at r-project.org
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
>
--
Hervé Pagès
Program in Computational Biology
Division of Public Health Sciences
Fred Hutchinson Cancer Research Center
1100 Fairview Ave. N, M1-B514
P.O. Box 19024
Seattle, WA 98109-1024
E-mail: hpages at fhcrc.org
Phone: (206) 667-5791
Fax: (206) 667-1319
More information about the Bioconductor
mailing list