[BioC] Bsgenome Zv7 sequence data
Julie Zhu
julie.zhu at umassmed.edu
Wed Jul 29 16:10:26 CEST 2009
Hi Herve,
Thank you very much!
Best regards,
Julie
On 7/29/09 12:41 AM, "hpages at fhcrc.org" <hpages at fhcrc.org> wrote:
> Hi Julie,
>
> The Zebrafish (Danio rerio) genome is now available in BioC release
> and will soon be in BioC devel:
>
>> library(BSgenome)
>> available.genomes()
> [1] "BSgenome.Amellifera.BeeBase.assembly4"
> [2] "BSgenome.Amellifera.UCSC.apiMel2"
> [3] "BSgenome.Athaliana.TAIR.01222004"
> [4] "BSgenome.Athaliana.TAIR.04232008"
> [5] "BSgenome.Btaurus.UCSC.bosTau3"
> [6] "BSgenome.Btaurus.UCSC.bosTau4"
> [7] "BSgenome.Celegans.UCSC.ce2"
> [8] "BSgenome.Cfamiliaris.UCSC.canFam2"
> [9] "BSgenome.Dmelanogaster.UCSC.dm2"
> [10] "BSgenome.Dmelanogaster.UCSC.dm3"
> [11] "BSgenome.Drerio.UCSC.danRer5"
> [12] "BSgenome.Ecoli.NCBI.20080805"
> [13] "BSgenome.Ggallus.UCSC.galGal3"
> [14] "BSgenome.Hsapiens.UCSC.hg17"
> [15] "BSgenome.Hsapiens.UCSC.hg18"
> [16] "BSgenome.Hsapiens.UCSC.hg19"
> [17] "BSgenome.Mmusculus.UCSC.mm8"
> [18] "BSgenome.Mmusculus.UCSC.mm9"
> [19] "BSgenome.Ptroglodytes.UCSC.panTro2"
> [20] "BSgenome.Rnorvegicus.UCSC.rn4"
> [21] "BSgenome.Scerevisiae.UCSC.sacCer1"
>
>> source("http://bioconductor.org/biocLite.R")
>> biocLite("BSgenome.Drerio.UCSC.danRer5")
> ...
>> library(BSgenome.Drerio.UCSC.danRer5)
>> Drerio
> Zebrafish genome
> |
> | organism: Danio rerio (Zebrafish)
> | provider: UCSC
> | provider version: danRer5
> | release date: Jul. 2007
> | release name: Sanger Institute Zv7
> |
> | single sequences (see '?seqnames'):
> | chr1 chr2 chr3 chr4 chr5 chr6 chr7 chr8 chr9
> chr10 chr11
> | chr12 chr13 chr14 chr15 chr16 chr17 chr18 chr19 chr20
> chr21 chr22
> | chr23 chr24 chr25 chrM
> |
> | multiple sequences (see '?mseqnames'):
> | Zv7_NA Zv7_scaffold upstream1000 upstream2000 upstream5000
> |
> | (use the '$' or '[[' operator to access a given sequence)
>
>> Drerio$chr1
> 56204684-letter "MaskedDNAString" instance (# for masking)
> seq:
> CACACACTCATACACTACGGCCAGTGTAGTTGATCA...GGAGGATCTGACGTCTGTGAGCAAACACAAACACAC
> masks:
> maskedwidth maskedratio active names desc
> 1 150400 2.675934e-03 TRUE AGAPS assembly gaps
> 2 288 5.124128e-06 TRUE AMB intra-contig ambiguities
> 3 26544901 4.722898e-01 FALSE RM RepeatMasker
> 4 1576324 2.804613e-02 FALSE TRF Tandem Repeats Finder [period<=12]
> all masks together:
> maskedwidth maskedratio
> 26736688 0.4757021
> all active masks together:
> maskedwidth maskedratio
> 150688 0.002681058
>
> Cheers,
> H.
>
>
> Quoting Julie Zhu <julie.zhu at umassmed.edu>:
>
>> Hi Herve,
>>
>> I need to obtain sequence data for a set of given coordinates. Do you know
>> whether Zv7 (zebrafish) sequence data will be made available for Bsgenome
>> package? Thanks!
>>
>> Best regards,
>>
>> Julie
>>
>>
>> *******************************************
>> Julie Zhu, Ph.D
>> Research Assistant Professor
>> Program Gene Function and Expression
>> University of Massachusetts Medical School
>> 364 Plantation Street, Room 613
>> Worcester, MA 01605
>> 508-856-5256
>> http://www.umassmed.edu/pgfe/faculty/zhu.cfm
>>
>>
>>
>>
>
>
>
>
More information about the Bioconductor
mailing list