[Bioc-devel] is it normal for makeDBPackage to take a> VERY long time?

Tim Triche, Jr. tim.triche at gmail.com
Tue Jan 18 23:14:44 CET 2011


No doubt, I was not suggesting otherwise. My wording was unclear; I had two orthogonal questions that belonged in two separate emails. 

My apologies for the ruckus. 

--t

On Jan 18, 2011, at 2:06 PM, Pan Du <dupan at northwestern.edu> wrote:

> Hi Tim
> 
> I am agree with Sean. At least, we need to have one Bioconductor annotation
> package based on manufacturer provided mapping information. Of course, you
> can add additional tables in the library. Based on my experience, looks like
> Illumina has much improved annotation maintenance now.
> 
> Thanks for your effects!
> 
> 
> Pan
> 
> 
> On 1/18/11 2:40 PM, "bioc-devel-request at r-project.org"
> <bioc-devel-request at r-project.org> wrote:
> 
>> Message: 2
>> Date: Tue, 18 Jan 2011 15:28:23 -0500
>> From: Sean Davis <sdavis2 at mail.nih.gov>
>> To: "Tim Triche, Jr." <ttriche at usc.edu>
>> Cc: "bioc-devel at r-project.org" <bioc-devel at r-project.org>
>> Subject: Re: [Bioc-devel] is it normal for makeDBPackage to take a
>> VERY long time?
>> Message-ID:
>> <AANLkTinzY8XOaFv+N4yAXrP=u6JxpwuCnvMo5sDzvwqO at mail.gmail.com>
>> Content-Type: text/plain
>> 
>> On Tue, Jan 18, 2011 at 3:13 PM, Tim Triche, Jr. <ttriche at usc.edu> wrote:
>> 
>>> Hi all,
>>> 
>>> Thanks for all the tips and suggestions!   I have updated the 27k and 450k
>>> methylation annotation packages, and will re-run the packaging script in a
>>> few days to see how much of a difference this makes.  I need to update the
>>> 27k annotations anyways, now that I've added some fields and bimaps for my
>>> own purposes (decoding IDAT files and mapping the probes properly).   But,
>>> I've got working builds of the packages now -- whee.
>>> 
>>> The new 27k and 450k packages are ready (or at least, everything works -- I
>>> am only just learning how to use RUnit properly so the formal tests have not
>>> yet been updated).  Eventually I would like to automate the merging process
>>> for information from Illumina and that which is included from other sources
>>> (Entrez, UCSC, etc).  What is the accepted way of doing this?  I think I
>>> could handle maintenance of these packages if there's a means to properly
>>> automate the merging of manufacturer-specific and assembly-specific data.
>>> 
>>> For what it's worth, I tried using CHiPpeakAnno, GenomicRanges, and some
>>> other tools (but mostly the former) to annotate a few probes, and I'm not
>>> 100% certain that I did it right.  Absent strand information, which I find
>>> almost indecipherable from Illumina's "TOP/BOT" vs the usual "+/-"
>>> watson/crick notation, the results suggest that a handful of allegedly
>>> upstream CpG island probes are... not.  The re-annotation results do
>>> correspond fairly closely to the actual assay results, though (Jean-Pierre
>>> Issa wanted to compare some Illumina methylation results with those from the
>>> MCA platform in use at MD Anderson, so I said I would try and figure out why
>>> some Illumina probes seem concordant with the MCA probes and some don't...
>>> the reason, if GRanges is correct, is that some of the Illumina probes don't
>>> seem to map where I expected them to).
>>> 
>>> What's the best way to do reannotation of a platform for specific features?
>>> It took a long time just to map the accessions on the 450k platform, I'm
>>> not real excited about BLATting everything in sight whenever a string
>>> changes somewhere, if I don't have to.
>>> 
>>> 
>> Hi, Tim.
>> 
>> I would suggest using the annotations supplied by the manufacturer without
>> modification.  Doing otherwise will very likely be a non-trivial exercise,
>> particularly with regard to documenting what has been done.  Of course, feel
>> free to do that with your own versions (or even separate parallel versions)
>> of the packages, but for those for release on bioconductor, I suggest that
>> sticking to the manufacturer's annotations is the best way to go.  Of
>> course, if you find annotation problems, bring those up with Illumina
>> directly, as they have an interest in maintaining relevant and correct
>> annotations.
>> 
>> Sean
> 
> 
> 
> 



More information about the Bioc-devel mailing list