[BioC] Agilent spike-in probes

Sean Davis sdavis2 at mail.nih.gov
Fri Apr 4 13:46:06 CEST 2008


On Fri, Apr 4, 2008 at 7:20 AM, Srinivas Iyyer
<srini_iyyer_bio at yahoo.com> wrote:
> Dear Naomi,
>  How can I remove control probes before doing
>  expression analysis.
>
>  i am following some steps lile the followings.
>
>  RG <- read.maimages(files$targes,'agilent')
>  RG.b <-
>  backgroundSubtraction(RG,method='normexp',offset=50)
>
>  MA <- normalizewithinarrays(RG.b,method='loess').
>
>
>  Where in these steps I can remove controls entirely.

MAList and RGList objects subset just like other matrices and
data.frames.  Assuming that you know what probes are control probes,
you can simply create a new MAList, removing the control probes.

MA2 <- MA[-controlProbes,]

where controlProbes is a vector of the indices of control probes.

Sean


>
>  --- Naomi Altman <naomi at stat.psu.edu> wrote:
>
>  > In my experience with Agilent arabdopsis arrays,
>  > some of the Agilent
>  > spike-ins bind only to one of the dyes (or bind much
>  > more strongly to
>  > one).  I always remove the controls before doing
>  > differential
>  > expression analysis.
>  >
>  > Naomi
>  >
>  >
>  > At 08:29 AM 3/30/2008, Sean Davis wrote:
>  > >On Sat, Mar 29, 2008 at 11:39 PM, Srinivas Iyyer
>  > ><srini_iyyer_bio at yahoo.com> wrote:
>  > > > dear sean,
>  > > >  i apologize for sending this email and attached
>  > > >  figures to you. I am not sure if I can send
>  > figures as
>  > > >  attachment to mailing list.  I wanted to see
>  > expert
>  > > >  opinion on this particular topic because this
>  > is first
>  > > >  time i am analyzing agilent chip data.
>  > > >  Would you please look into my design, code and
>  > figures
>  > > >  and let me know if this method okay.
>  > > >
>  > > >  Spike-in probes are for QC purposes, if so why
>  > I am
>  > > >  getting spike-in probes as top candidates. Is
>  > there a
>  > > >  way to suppress them.
>  > > >  Thank you and I appreciate your help.
>  > > >
>  > > >
>  > > >
>  > > >  dear group,
>  > > >
>  > > >  I have agilent 4x44 (G4112F) chips.  the hybs
>  > are done
>  > > >  as a paired design. sample obtained from
>  > patient
>  > > >  before and after treatment.  40 patient are in
>  > the
>  > > >  study. chip was hybridized with before
>  > treated(cy3)
>  > > >  and after treated (cy5) rna.
>  > > >
>  > > >  I used LIMMA for normalizing and to calcuate
>  > > >  differentially expressed.
>  > > >
>  > > >  in the first step, I did not go for background
>  > > >  subtraction and observed a blown-out ma plot.
>  > >
>  > >I'm not sure what "blown-out" means, but Agilent
>  > typically does
>  > >background subtraction automatically (you'll need
>  > to look at the
>  > >specific image extraction protocol to check).  If
>  > you use the
>  > >gProcessedSignal and rProcessedSignal (these are
>  > not the defaults in
>  > >limma), you will probably get the benefit of their
>  > spatially-detrended
>  > >loess background subtraction.
>  > >
>  > > >  when i did background subtraction, i observed a
>  > more
>  > > >  compact ma. For q-q plot points at intersection
>  > are
>  > > >  not many suggesting that many genes are
>  > differentially
>  > > >  expressed. (figures are attach
>  > > >
>  > > >  my main concern is, of top100 (from toptable
>  > > >  number=100), most of the probesets are spikein
>  > > >  probesets. (+)E1A_r60_a22 , DCP_22_6,DCP_22_7
>  > and so
>  > > >  on.
>  > >
>  > >This could be dye bias, but I'm not sure.  You
>  > didn't do dye swaps, so
>  > >you cannot separate signal from dye bias.  In any
>  > case, you will need
>  > >to do some QC.  Agilent provides a huge amount of
>  > QC and plots on the
>  > >scanner machine.  You can always look there to see
>  > what they do.
>  > >Also, their technical manuals are pretty good at
>  > giving direction
>  > >about the technology and the array data processing.
>  > >
>  > > >  These spike-in probes are highly differentlly
>  > > >  expressed.
>  > > >
>  > > >
>  > > >  my targets file
>  > > >
>  > > >  filename   cy3  cy5
>  > > >  patient1  before after
>  > > >  patient2  before after
>  > > >  ......
>  > > >  patient40 before after
>  > > >
>  > > >  my design matrix:
>  > > >  desin <- modelMatrix(targets,ref='before')
>  > > >  > desin
>  > > >       after
>  > > >   [1,]      1
>  > > >   [2,]      1
>  > > >   [3,]      1
>  > > >   [4,]      1
>  > > >   [5,]      1
>  > > >   [6,]      1
>  > > >   [7,]      1
>  > > >   [8,]      1
>  > > >
>  > > >  RG2 <- backgroundCorrect(RG,method='subtract')
>  > > >  MA2 <-
>  > normalizeWithinArrays(RG2,method='loess')
>  > > >  plotDensities(MA2)
>  > > >  boxplot(MA2$M~col(MA2$M),names=colnames(MA2$M))
>  > > >  MA2a <-
>  > normalizeBetweenArrays(MA2,method='scale')
>  > >
>  > >These are two-color arrays.  Do you really need to
>  > do the
>  > >between-array normalization?  You might, but I
>  > think you might spend
>  > >some time proving to yourself that is the case.
>  > >
>  > > >  fit.b <- lmFit(MA2a,design)
>  > > >  fit.b <- eBayes(fit.b)
>  > > >
>  >
>  topTable(fit.b,number=50,adjust.method='BH')[,c(5,9,10,11,12,13)]
>  > > >
>  > > >  my questions are:
>  > > >
>  > > >  1. for this paired sample (cy3,cy5) design, is
>  > my
>  > > >  limma model matrix okay.
>  > > >  2. how to avoid getting spike-in . I never saw
>  > > >  spike-in getting into top-table. is there some
>  > mistake
>  > > >  going on at some place. is it normal for
>  > spike-in
>  > > >  probes to come as top differentially expressed
>  > probes.
>  > >
>  > >It happens, yes.  I would definitely do some QC,
>  > though.  It doesn't
>  > >look like you have done any in your code here.
>  > >
>  > > >  3. are the attached figures (MA plot and q-q
>  > plot)
>  > > >  reflect a good normalized data.
>  > >
>  > >The qq plot does not really tell you about
>  > normalization.  The single
>  > >MA plot looks OK.  You will want to look at all of
>  > the MA plots and
>  > >some more extensive QC.
>  > >
>  > > >  4. my chip is hgug4112F. I do not see
>  > annotation file
>  > > >  on bioconductor.
>  > >
>  > >I think the hgug4112a annotation package is what
>  > you want.  You'll
>  > >want to double-check that with a few lookups to be
>  > sure.
>  > >
>  > >Sean
>  > >
>  > >_______________________________________________
>  > >Bioconductor mailing list
>  > >Bioconductor at stat.math.ethz.ch
>  > >https://stat.ethz.ch/mailman/listinfo/bioconductor
>  > >Search the archives:
>  >
>  >http://news.gmane.org/gmane.science.biology.informatics.conductor
>  >
>  > Naomi S. Altman
>  > 814-865-3791 (voice)
>  > Associate Professor
>  > Dept. of Statistics
>  > 814-863-7114 (fax)
>  > Penn State University
>  > 814-865-1348 (Statistics)
>  > University Park, PA 16802-2111
>  >
>  >
>
>
>
>       ____________________________________________________________________________________
>  [[elided Yahoo spam]]
>
>  _______________________________________________
>  Bioconductor mailing list
>  Bioconductor at stat.math.ethz.ch
>  https://stat.ethz.ch/mailman/listinfo/bioconductor
>  Search the archives: http://news.gmane.org/gmane.science.biology.informatics.conductor
>



More information about the Bioconductor mailing list