[Bioc-devel] SRAdb missing runs
Cook, Malcolm
MEC at stowers.org
Fri Sep 9 23:14:38 CEST 2011
Hi Jack and other SRAdb users,
I find at least one SRA run missing from the sqlite database obtained from a fresh `getSRAdbFile()`
SRR074430 is present in the SRA http://trace.ncbi.nlm.nih.gov/Traces/sra/sra.cgi?cmd=viewer&m=data&s=viewer&run=SRR074430
but directly querying the sqlite3 database fails to find it:
sqlite3 -list SRAmetadb.sqlite "select study_accession, submission_accession, sample_accession, experiment_accession, run_accession, sample_alias from sra where run_accession in ('SRR031766','SRR031767','SRR074430')"
SRP001537|SRA010243|SRS008471|SRX014483|SRR031766|S2_DRSC_CG10128_RNAi-1
SRP001537|SRA010243|SRS008471|SRX014483|SRR031767|S2_DRSC_CG10128_RNAi-1
Can anyone advise me as the origin of this discrepancy, or perhaps fix a misunderstanding I may have in using this resource.
I just downloaded a fresh SRAdbFile... here is the "Metadata associate with downloaded file:"
c("schema version", "creation timestamp")c("1.0", "2011-09-03 10:38:16")
Below is a full transcript with SessionInfo(), if it helps.
Thanks!
Malcolm Cook
Computational Biology - Stowers Institute for Medical Research
> library('SRAdb')
> sqlfile <- getSRAdbFile()
sqlfile <- getSRAdbFile()
trying URL 'http://gbnci.abcc.ncifcrf.gov/backup/SRAmetadb.sqlite.gz'
Content type 'text/plain; charset=ISO-8859-1' length 38391904 bytes (36.6 Mb)
opened URL
==================================================
downloaded 36.6 Mb
Unzipping...
Metadata associate with downloaded file:
c("schema version", "creation timestamp")c("1.0", "2011-09-03 10:38:16")
> sessionInfo()
sessionInfo()
R version 2.13.1 (2011-07-08)
Platform: x86_64-apple-darwin9.8.0/x86_64 (64-bit)
locale:
[1] C
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] SRAdb_1.6.0 RCurl_1.5-0 bitops_1.0-4.1 graph_1.30.0 RSQLite_0.9-4
[6] DBI_0.2-5
loaded via a namespace (and not attached):
[1] Biobase_2.12.2 GEOquery_2.19.2 XML_3.4-0 tools_2.13.1
> q('no')
bash-3.2$ sqlite3 -list SRAmetadb.sqlite "select study_accession, submission_accession, sample_accession, experiment_accession, run_accession, sample_alias from sra where run_accession in ('SRR031766','SRR031767','SRR074430')"
sqlite3 -list SRAmetadb.sqlite "select study_accession, submission_accession, sample_accession, experiment_accession, run_accession, sample_alias from sra where run_accession in ('SRR031766','SRR031767','SRR074430')"
SRP001537|SRA010243|SRS008471|SRX014483|SRR031766|S2_DRSC_CG10128_RNAi-1
SRP001537|SRA010243|SRS008471|SRX014483|SRR031767|S2_DRSC_CG10128_RNAi-1
More information about the Bioc-devel
mailing list