[Bioc-devel] how to get genomic sequences?
Sean Davis
sdavis2 at mail.nih.gov
Wed Mar 28 01:07:12 CEST 2007
Roger Liu wrote:
> Hi,
>
> I have a set of data with chromosome number and coordinates of the sequences
> such as,chr10, start 1000, end 2000.
> I have tried using biomart to retrieve the genomic sequences for my dataset,
> but I didn't get success, I used seqType argument as:
> seqType="genomic", it reported error as"The type of sequence specified with
> seqType is not available. Please select from: cdna, peptide, 3utr, 5utr",
> but I have seen this "genomic" argument for seqType in the help file. So
> what's going on there?
>
> Or anyone can recommend a package which can help me retrieve the genomic
> sequences from hg18 with known chromosome number and sequences
> coordinates(start and end).
>
URLs like so:
http://genome.ucsc.edu/cgi-bin/das/hg18/dna?segment=chr1:1000,2000;segment=chr2:1000,2000
will get you sequences of interest. These come as simple well-formed
XML, so you can use the XML package to parse them pretty easily. You
can do one-at-a-time, or string together a bunch.
Sean
More information about the Bioc-devel
mailing list