[BioC] EasyRNASeq - gff file is not recognized
Nicolas Delhomme
delhomme at embl.de
Wed Mar 6 11:09:54 CET 2013
Dear Gabriela,
Given that error:
> Your file: /projects/ZmB73_5b_FGS.gff3 does not contain a gff header: '##gff-version 3' as first line. Is that really a gff3 file?
your gff3 appears not to contain a header.
Add the following line:
##gff-version 3
to the beginning of your gff3 file and that should solve the problem.
Cheers,
Nico
---------------------------------------------------------------
Nicolas Delhomme
Genome Biology Computational Support
European Molecular Biology Laboratory
Tel: +49 6221 387 8310
Email: nicolas.delhomme at embl.de
Meyerhofstrasse 1 - Postfach 10.2209
69102 Heidelberg, Germany
---------------------------------------------------------------
On 6 Mar 2013, at 10:57, Gabriela [guest] wrote:
>
> Hello,
>
> I am trying to generate a table of gene counts to use later with Deseq. However, I got an error message that the maize gff file that I am using is wrong. I downloaded this file directly from the plant ensembl website.
>
> I have to mention that I used a .gff file and a .gff3, and with both I have the same issue. Any hint in how to solve my problem.
>
> Many thanks for your help in advance,
>
> Gabriela
>
> -- output of sessionInfo():
>
>> genes_FGS1 <- easyRNASeq(filesDirectory="/projects/EASYRNASeq/",
> + gapped=F,
> + validity.check=TRUE,
> + chr.map=chr.map,
> + organism="custom",
> + annotationMethod="gff",
> + annotationFile="/projects/ZmB73_5b_FGS.gff3",
> + count="genes",
> + filenames=files,
> + summarization="geneModels",
> + outputFormat="RNAseq")
> Checking arguments...
> Fetching annotations...
> Error in .readGffGtf(filename = filename, ignoreWarnings = ignoreWarnings, :
>
>>
>
>
>
>
>
>
>
>
>> sessionInfo()
> R version 2.15.2 (2012-10-26)
> Platform: x86_64-unknown-linux-gnu (64-bit)
>
> locale:
> [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
> [3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
> [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
> [7] LC_PAPER=C LC_NAME=C
> [9] LC_ADDRESS=C LC_TELEPHONE=C
> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
>
> attached base packages:
> [1] grid parallel stats graphics grDevices utils datasets
> [8] methods base
>
> other attached packages:
> [1] VennDiagram_1.5.1 easyRNASeq_1.4.2 ShortRead_1.16.1
> [4] latticeExtra_0.6-24 RColorBrewer_1.0-5 BSgenome_1.26.1
> [7] biomaRt_2.14.0 genomeIntervals_1.14.0 intervals_0.13.3
> [10] Rsamtools_1.10.1 Biostrings_2.26.2 GenomicRanges_1.10.4
> [13] IRanges_1.16.4 edgeR_3.0.2 limma_3.14.1
> [16] pasilla_0.2.13 DESeq_1.10.1 lattice_0.20-10
> [19] locfit_1.5-8 DEXSeq_1.2.1 Biobase_2.18.0
> [22] BiocGenerics_0.4.0 pasillaBamSubset_0.0.2
>
> loaded via a namespace (and not attached):
> [1] annotate_1.34.1 AnnotationDbi_1.18.1 bitops_1.0-4.2
> [4] DBI_0.2-5 genefilter_1.38.0 geneplotter_1.34.0
> [7] hwriter_1.3 plyr_1.7.1 RCurl_1.91-1
> [10] RSQLite_0.11.1 splines_2.15.2 statmod_1.4.15
> [13] stats4_2.15.2 stringr_0.6.1 survival_2.36-14
> [16] tools_2.15.2 XML_3.9-4 xtable_1.7-0
> [19] zlibbioc_1.4.0
>
>
> --
> Sent via the guest posting facility at bioconductor.org.
More information about the Bioconductor
mailing list