[BioC] Unequally spaced replicates in limma - "sort your MA object by gene ID"

Jakob Hedegaard Jakob.Hedegaard at agrsci.dk
Fri Sep 3 12:40:28 CEST 2004


Hi Gordon

I know it is a basic R procedure, but what command would sort the MAList object by gene ID?

> sort(MAlist, .....)?

Thanks
Jakob

------------------------------------------------------
Jakob Hedegaard

Danish Institute of Agricultural Sciences
Department of Animal Breeding and Genetics
Research Centre Foulum
P.O. Box 50
DK-8830 Tjele, Denmark



-----Original Message-----
From: bioconductor-bounces at stat.math.ethz.ch
[mailto:bioconductor-bounces at stat.math.ethz.ch]On Behalf Of Gordon Smyth
Sent: Thursday, September 02, 2004 11:45 AM
To: michael watson (IAH-C)
Cc: bioconductor at stat.math.ethz.ch
Subject: RE: [BioC] Unequally spaced replicates in limma


At 07:23 PM 2/09/2004, michael watson (IAH-C) wrote:
>Thanks Gordon
>
>Actually when I did this, I got some odd results.

The results look to me as you would hope for and expect.

>If I ran lmFit(), eBayes() and topTable() on my data set on a per-spot
>basis, I found ~800 SPOTS with a p-value <= 0.05.  Now most of my genes
>are replicated in duplicate on the arrays (within-array replicates) and
>when I averaged over those replicates, and used that data to feed into
>lmFit(), eBayes() and topTable() I got ~1100 GENES with a p-value
><=0.05.
>
>Does this suggest that after averaging over replicate spots, the
>measurements for my genes are more tightly distributed than the
>individual spots were..?

1. You've reduced the number of tests by about half, so the adjustment for 
multiple testing is about half as severe, and you end up with lower adjusted p-values.

2. You'd certainly hope that averages are more tightly distributed than the 
individual spots, that's why averaging is a good thing.
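
To illustrate point 1 in isolation, here is a minimal sketch in base R. The raw p-values and the test counts (2000 spots versus 1000 genes) are made-up numbers, and Holm is used only as an example adjustment method. In a real analysis the raw p-values would also change after averaging, so this shows only the effect of the number of tests.

```r
# Hypothetical raw p-values for five features (illustrative numbers only).
p <- c(0.00002, 0.00008, 0.0004, 0.003, 0.02)

# Adjust as if these were part of 2000 spot-level tests.
adj.spots <- p.adjust(p, method = "holm", n = 2000)

# Adjust the same raw p-values as if there were only 1000 gene-level tests.
adj.genes <- p.adjust(p, method = "holm", n = 1000)

# Side-by-side comparison: the gene-level adjustment is never larger.
print(cbind(raw = p, spots = adj.spots, genes = adj.genes))
```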

If your genes are virtually all in duplicate, and the others have an even 
number of reps, you could sort your MA object by gene ID and then use 
duplicateCorrelation() with ndups=2 and spacing=1.
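
A sketch of what that could look like, on simulated data. It assumes the gene identifiers live in a column called ID of the genes data frame (the column name will depend on your GAL or annotation file), that every gene is spotted exactly twice, and that a simple intercept-only design is appropriate. All object names here are made up for the example.

```r
library(limma)

# Simulate 100 genes spotted in duplicate on 4 arrays, in scrambled order.
set.seed(1)
ids <- rep(paste0("gene", 1:100), each = 2)
gene.effect <- matrix(rnorm(100 * 4), 100, 4)[rep(1:100, each = 2), ]
M <- gene.effect + matrix(rnorm(200 * 4, sd = 0.5), 200, 4)
A <- matrix(rnorm(200 * 4, mean = 10), 200, 4)
scramble <- sample(200)
MA <- new("MAList", list(
  M = M[scramble, ],
  A = A[scramble, ],
  genes = data.frame(ID = ids[scramble], stringsAsFactors = FALSE)
))

# Sort the whole MAList by gene ID: order() gives the row permutation,
# and subsetting an MAList reorders M, A and genes together.
o <- order(MA$genes$ID)
MA.sorted <- MA[o, ]

# Duplicates are now adjacent, so ndups=2 and spacing=1 describe the layout.
design <- matrix(1, ncol(MA.sorted$M), 1)
corfit <- duplicateCorrelation(MA.sorted, design, ndups = 2, spacing = 1)
fit <- lmFit(MA.sorted, design, ndups = 2, spacing = 1,
             correlation = corfit$consensus.correlation)
fit <- eBayes(fit)
```

Note that sort() is not the tool here; it is order() plus row subsetting that keeps the expression values and the annotation in step.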

Gordon

>Cheers
>Mick
>
>-----Original Message-----
>From: Gordon K Smyth [mailto:smyth at wehi.EDU.AU]
>Sent: 01 September 2004 23:12
>To: michael watson (IAH-C)
>Cc: bioconductor at stat.math.ethz.ch
>Subject: Re: [BioC] Unequally spaced replicates in limma
>
>
> > Hi
> >
> > As I have varying numbers of replicates, and they are not regularly
> > spaced on the array, and given that I would like a list of
> > differentially expressed genes which is averaged over replicates,
>
>I assume that these are within-array replicates.
>
> > I
> > assume the best thing to do is normalise my data, and then average
> > over replicates in the MAList object, and then pass the averaged data
> > to
> > lmFit() etc?
>
>Yes, you could do that.  It does raise subtle issues though concerning
>how the variance of the averages depends on the number of replicates.
>You might like to compute weights based on the number of replicates for
>each probe and pass that to lmFit also.
>
>Gordon
>
> > Is that right?
> >
> > Cheers
> > Mick



