[BioC] ChIPpeakAnno: makeVennDiagram and sampling peaks
Zhu, Lihua (Julie)
Julie.Zhu at umassmed.edu
Thu Mar 15 15:41:31 CET 2012
Ron,
If you assume 10% total histone are available for modification, then you
would set totalTest = 3 * 10^9 bp / 146 * 0.1 which is about 2 million.
For your new question, here is some code snippet that might address your
needs.
t1 =findOverlappingPeaks(peaks1, peaks2, maxgap=0,
NameOfPeaks1="TF", NameOfPeaks2="Histone", select="First")
AllPeaks = c(peaks1[!rownames(peaks1) %in% rownames(t1$Peaks1withOverlap),],
peaks2[!rownames(peaks2) %in% rownames(t1$Peaks2withOverlap),],
t1$MergedPeaks)
Totals = rownames(AllPeaks)
Sample.n = dim(t1$MergedPeaks)[1]
##### put the following code snippets in a loop allow you to sample from the
total peak population multiple times
s1 = AllPeaks[rownames(AllPeaks) %in% sample(Totals, Sample.n),]
go.s1 = getEnrichedGO(annotatePeakInBatch(s1,....), ....)
##################
Please let me know if you come up with more elegant ways to do this. Thanks!
Best regards,
Julie
On 3/14/12 5:35 PM, "Ron Hart" <rhart at rci.rutgers.edu> wrote:
> Julie,
>
> In response to my last question and your phone call, I tried several values of
> totalTest based on the recommendations but I could only get either a 0 or a 1
> value. For histone marks, I used as the largest estimate the total number of
> possible histone overlaps (3 x 10^9 bp / 146 bp per nucleosome). Then I tried
> the sum of the two sets of marks, but nothing made sense for me. So I gave
> up trying to get a p-value. It¹s really not important for my study.
>
> New question. I¹m using the overlap function to extract intersecting peaks in
> common between two marks. Everything is working great. But I¹d like to
> compare the result to a random sampling of the same number of peaks from the
> union set of both marks. I think this sort of a bootstrapping approach would
> be convincing that my enriched GO list was unique to the actual intersection
> of the two sets of peaks.
>
> Ideally, I¹d like to merge two annotated peak objects and then sample them for
> the number I observed in the intersection set (which I know). Since I¹m not
> that familiar with working with IRanges-based objects, can you suggest a code
> snippet that would work for me?
>
> Does that make sense?
>
> Ron
>
More information about the Bioconductor
mailing list