[BioC] read.DNAStringSet
Thomas Girke
thomas.girke at ucr.edu
Mon Mar 8 18:24:04 CET 2010
Dear Ron,
Reading larger sequence sets directly from an ftp/http site into R is not a very
robust method. In this case you want to first download the sequences onto your computer
and then import them from there:
read.DNAStringSet("AE004437.ffn", "fasta")
Since you probably have this example from our workshop manual, I have now included a statement
what to do if you encounter this error (http://manuals.bioinformatics.ucr.edu/home/ht-seq).
Thomas
On Mon, Mar 08, 2010 at 03:54:04PM +0200, Ron Ophir wrote:
> I'm running Biostrings under the following session;
>
> > sessionInfo()
>
> R version 2.10.1 (2009-12-14)
>
> x86_64-unknown-linux-gnu
>
>
>
> locale:
>
> [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
>
> [3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
>
> [5] LC_MONETARY=C LC_MESSAGES=en_US.UTF-8
>
> [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
>
> [9] LC_ADDRESS=C LC_TELEPHONE=C
>
> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
>
>
>
> attached base packages:
>
> [1] stats graphics grDevices utils datasets methods base
>
>
>
> other attached packages:
>
> [1] ShortRead_1.4.0 lattice_0.17-26 BSgenome_1.14.2
> Biostrings_2.14.11
>
> [5] IRanges_1.4.11
>
>
>
> loaded via a namespace (and not attached):
>
> [1] Biobase_2.6.1 grid_2.10.1 hwriter_1.1
>
>
>
> And I get
>
>
>
> myseq1 <-
> read.DNAStringSet("ftp://ftp.ncbi.nih.gov/genbank/genomes/Bacteria/Halob
> acterium_sp/AE004437.ffn", "fasta") # Imports
>
> Error in .read.fasta.in.XStringSet(filepath, set.names, elementType,
> lkup) :
>
> cannot open file
> 'ftp://ftp.ncbi.nih.gov/genbank/genomes/Bacteria/Halobacterium_sp/AE0044
> 37.ffn'
>
>
>
> for read.DNAStringSet() whereas for readFASTA() it works well
>
>
>
> myseq1 <-
> readFASTA("ftp://ftp.ncbi.nih.gov/genbank/genomes/Bacteria/Halobacterium
> _sp/AE004437.ffn", strip.descs=T)
>
>
>
> Is it a bug?
>
> Ron
>
>
>
>
>
>
> This mail was sent via Mail-SeCure System.
>
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/bioconductor
> Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
More information about the Bioconductor
mailing list