[Bioc-devel] Error: file larger than 5 Mb.
Bohdan Khomtchouk
khomtchoukmed @ending from gm@il@com
Thu Sep 27 00:12:08 CEST 2018
Looks like that was the issue, thanks Vincent & Martin!
> hub = AnnotationHub()
snapshotDate(): 2018-08-20
> isDevel()
[1] TRUE
> query(hub, "Mus_musculus.GRCm38.93.chr.gtf")
AnnotationHub with 1 record
# snapshotDate(): 2018-08-20
# names(): AH63797
# $dataprovider: Ensembl
# $species: Mus musculus
# $rdataclass: GRanges
# $rdatadateadded: 2018-07-31
# $title: Mus_musculus.GRCm38.93.chr.gtf
# $description: Gene Annotation for Mus musculus
# $taxonomyid: 10090
# $genome: GRCm38
# $sourcetype: GTF
# $sourceurl: ftp://ftp.ensembl.org/pub/release-93/gtf/mus_musculus/Mus...
# $sourcesize: 29183870
# $tags: c("GTF", "ensembl", "Gene", "Transcript", "Annotation")
# retrieve record with 'object[["AH63797"]]'
On Wed, Sep 26, 2018 at 2:01 PM Martin Morgan <mtmorgan.bioc using gmail.com>
wrote:
> maybe you are not using the 'devel' version of Bioconductor?
>
> On 09/26/2018 03:48 PM, Bohdan Khomtchouk wrote:
> > Martin, I don't see it unfortunately:
> >
> > > query(hub, "Mus_musculus.GRCm38.93.chr.gtf")
> > AnnotationHub with 0 records
> > # snapshotDate(): 2018-04-30
> >
> > But perhaps it needs to be added to AnnotationHub, because I see other
> > .chr.gtf files that go up to 86 (while I'm using 93 in this original
> post):
> >
> > > query(hub, c("GTF", "Mus musculus", ".chr.gtf"))
> > AnnotationHub with 112 records
> > # snapshotDate(): 2018-04-30
> > # $dataprovider: Ensembl
> > # $species: Mus musculus
> > # $rdataclass: GRanges
> > # additional mcols(): taxonomyid, genome, description,
> > # coordinate_1_based, maintainer, rdatadateadded, preparerclass,
> > # tags, rdatapath, sourceurl, sourcetype
> > # retrieve records with, e.g., 'object[["AH50868"]]'
> >
> > title
> > AH50868 | Mus_musculus.GRCm38.84.chr.gtf
> > AH51038 | Mus_musculus.GRCm38.85.chr.gtf
> > AH51979 | Mus_musculus.GRCm38.86.chr.gtf
> > AH51983 | Mus_musculus_129s1svimj.129S1_SvImJ_v1.86.chr.gtf
> > AH51986 | Mus_musculus_aj.A_J_v1.86.chr.gtf
> > ... ...
> > AH61202 | Mus_musculus_lpj.LP_J_v1.92.chr.gtf
> > AH61205 | Mus_musculus_nodshiltj.NOD_ShiLtJ_v1.92.chr.gtf
> > AH61208 | Mus_musculus_nzohlltj.NZO_HlLtJ_v1.92.chr.gtf
> > AH61211 | Mus_musculus_pwkphj.PWK_PhJ_v1.92.chr.gtf
> > AH61214 | Mus_musculus_wsbeij.WSB_EiJ_v1.92.chr.gtf
> >
> > So it looks like "AH51979 | Mus_musculus.GRCm38.86.chr.gtf" is the
> > latest one available in AnnotationHub right now.
> >
> >
> > On Tue, Sep 25, 2018 at 10:35 PM Martin Morgan <mtmorgan.bioc using gmail.com
> > <mailto:mtmorgan.bioc using gmail.com>> wrote:
> >
> > Good news, the file is in AnnotationHub and can be used from there.
> > There is a download cost the first time but not subsequently.
> >
> > library(AnnotationHub)
> > hub = AnnotationHub()
> > query(hub, "Mus_musculus.GRCm38.93.chr.gtf.gz")
> >
> > ## note 'AH' number, then...
> > gtf = hub[["AH63797"]]
> >
> > Need the path rather than the imported path? use
> cache(hub["AH63797"])
> > instead.
> >
> > Martin
> >
> > On 09/25/2018 09:59 PM, Bohdan Khomtchouk wrote:
> > > To generate mouse.rda, I used:
> > >
> > > library(rtracklayer)
> > > mouse <- readGFF("
> > >
> >
> ftp://ftp.ensembl.org/pub/release-93/gtf/mus_musculus/Mus_musculus.GRCm38.93.chr.gtf.gz
> > > ")
> > >
> > > And then I saved the object as mouse.rda
> > >
> > >
> > > On Tue, Sep 25, 2018 at 6:33 PM Bohdan Khomtchouk
> > <khomtchoukmed using gmail.com <mailto:khomtchoukmed using gmail.com>>
> > > wrote:
> > >
> > >> Here it is:
> > >>
> >
> https://github.com/Bohdan-Khomtchouk/geneXtendeR/blob/master/data/mouse.rda
> > >>
> > >>
> > >> On Tue, Sep 25, 2018 at 6:23 PM Vincent Carey
> > <stvjc using channing.harvard.edu <mailto:stvjc using channing.harvard.edu>>
> > >> wrote:
> > >>
> > >>> (resending with reply all -- please reply to this one)
> > >>>
> > >>> Can you say some more about the content of the file? Perhaps
> > it can be
> > >>> placed in ExperimentHub. "mouse.rda" is not very descriptive.
> > is it
> > >>> in your github repo?
> > >>>
> > >>>
> > >>> On Tue, Sep 25, 2018 at 6:25 PM, Bohdan Khomtchouk <
> > >>> khomtchoukmed using gmail.com <mailto:khomtchoukmed using gmail.com>>
> wrote:
> > >>>
> > >>>> I need to include an important .rda file in the /data folder
> of my
> > >>>> Bioconductor package. However, this .rda file exceeds 5 Mb in
> > size. Do
> > >>>> I
> > >>>> have any other options besides for deleting it?
> > >>>>
> > >>>> bohdankhomtchouk$ git push origin master
> > >>>>
> > >>>> Counting objects: 65, done.
> > >>>>
> > >>>> Delta compression using up to 8 threads.
> > >>>>
> > >>>> Compressing objects: 100% (65/65), done.
> > >>>>
> > >>>> Writing objects: 100% (65/65), 13.79 MiB | 7.62 MiB/s, done.
> > >>>>
> > >>>> Total 65 (delta 30), reused 0 (delta 0)
> > >>>>
> > >>>> remote: Error: file larger than 5 Mb.
> > >>>>
> > >>>> remote:
> > >>>>
> > >>>> remote: File name: 'data/mouse.rda'
> > >>>>
> > >>>> remote: File size: 12.9 Mb
> > >>>>
> > >>>> remote:
> > >>>>
> > >>>> remote: Please see Biocondcutor guidelines
> > >>>>
> > >>>> remote:
> https://bioconductor.org/developers/package-guidelines/
> > >>>>
> > >>>> remote:
> > >>>>
> > >>>> To git.bioconductor.org:packages/geneXtendeR
> > >>>>
> > >>>> ! [remote rejected] master -> master (pre-receive hook
> declined)
> > >>>>
> > >>>> error: failed to push some refs to 'git using git.bioconductor.org
> > <mailto:git using git.bioconductor.org>:
> > >>>> packages/geneXtendeR'
> > >>>>
> > >>>> --
> > >>>> Bohdan Khomtchouk, Ph.D.
> > >>>> AHA Postdoctoral Fellow, Gozani & Assimes Labs
> > >>>> Department of Biology & Department of Medicine
> > >>>> Division of Cardiovascular Medicine
> > >>>> Stanford University
> > >>>> Stanford, CA 94305
> > >>>> https://profiles.stanford.edu/bohdan-khomtchouk
> > >>>>
> > >>>>
> > >>>> CONFIDENTIALITY NOTICE: Information contained in this
> > me...{{dropped:9}}
> > >>>>
> > >>>> _______________________________________________
> > >>>> Bioc-devel using r-project.org <mailto:Bioc-devel using r-project.org>
> > mailing list
> > >>>> https://stat.ethz.ch/mailman/listinfo/bioc-devel
> > >>>>
> > >>>
> > >>>
> > >>> The information in this e-mail is intended only for
> > t...{{dropped:17}}
> > >
> > > _______________________________________________
> > > Bioc-devel using r-project.org <mailto:Bioc-devel using r-project.org>
> > mailing list
> > > https://stat.ethz.ch/mailman/listinfo/bioc-devel
> > >
> >
>
[[alternative HTML version deleted]]
More information about the Bioc-devel
mailing list