[BioC] PLIER affinities redux
Jeremy Gollub
jeremy at gollub.net
Tue Oct 18 18:31:48 CEST 2005
Hi, James -
Thanks for clarifying that.
Looking at the source code for justPlier, it seems there's a misapprehension
about the probe affinity values - the justPlier code expects to get back
a value for each probe (note, not probe set) in each sample, whereas the
algorithm/C++ code actually returns a single value for each probe pair,
which is applied to all samples. I think the padding with many extra
zeros is done by justPlier, not by the C++ code...?
Crispin, unless I'm missing something, this should probably be considered
a bug. In the meantime, I should be able to deconvolute the matrix back
into the expected vector, but I'm not sure which probe pair to associate
with each value.
- Jeremy
James W. MacDonald wrote:
>
> Hi Jeremy,
>
> Seems I misunderstood your earlier email. I somehow thought you were
> talking about expression values, not the affinities. I get the same sort
> of result for the affinities that you are reporting.
>
> However it doesn't appear to be a sparse matrix per se, but a matrix
> with a bunch of zeros padded on the end.
>
> > pset <- justPlier(dat, get.affinities = TRUE)
> > pset
> Expression Set (exprSet) with
> 54675 genes
> 10 samples
> phenoData object with 1 variables and 10 cases
> varLabels
> sample: arbitrary numbering
> > dim(pset at description@preprocessing$affinity)
> [1] 604258 10
> > sum(rowSums(pset at description@preprocessing$affinity) != 0)
> [1] 54676
> > a <- which(rowSums(pset at description@preprocessing$affinity) != 0)
> > range(a)
> [1] 1 54676
>
> This is what I get with a HG-U133Plus_2 chip. It looks to me like there
> are indeed affinities for each probeset (rather than each probe), but
> the vector of affinities that is output by the C++ code is padded with a
> bunch of zeros. Maybe the result is different for other chips?
>
> Anyway, this is probably a question for Crispin Miller, who maintains
> the package.
>
> Best,
>
> Jim
>
>
>
> Jeremy Gollub wrote:
> > Hi, All -
> >
> > James MacDonald (I think) answered my previous posting, and I promptly
> > lost the message. Thanks, James, and apologies.
> >
> > The issue at hand was strange and (I believe) incorrect reporting of
> > probe affinities from justPlier (plier package). At James' suggestion
> > I have update to R 2.2.0 and plier 1.2.0, but the affinities are still
> > coming back in a sparse <# probe pairs> X <# arrays> matrix, rather than
> > as a useful vector. The colnames of this matrix are the sampleNames from
> > the eset provided to justPlier; the rownames are the probeNames.
> >
> > James, you said this doesn't happen to you. How do you retrieve the
> > affinities? Maybe I'm just looking at the wrong slot (see below).
> > Looking at the justPlier source code, though, I don't see any other
> > way to get them.
> >
> > Also, does justPlier allow one to pass the affinities back to another
> > invocation of the method, rather than computing them from the current
> > data?
> >
> > Thanks,
> >
> > - Jeremy Gollub
> >
> >
> > The session (edited for readability):
> >
> > # ---------------------------------------------------------------------
> >
> >
> >>sessionInfo())
> >
> > R version 2.2.0, 2005-10-06, i386-pc-mingw32
> >
> > attached base packages:
> > [1] "tools" "methods" "stats" "graphics" "grDevices" "utils"
> > [7] "datasets" "base"
> >
> > other attached packages:
> > rae230acdf plier affy Biobase qvalue
> > "1.10.0" "1.2.0" "1.8.1" "1.8.0" "1.4.0"
> >
> >
> >>data <- ReadAffy()
> >>data
> >
> > AffyBatch object
> > size of arrays=602x602 features (50972 kb)
> > cdf=RAE230A (15923 affyids)
> > number of samples=18
> > number of genes=15923
> > annotation=rae230a
> >
> >
> >>res <- justPlier(data, get.affinities = TRUE)
> >>dim(res at description@preprocessing$affinity)
> >
> > [1] 175477 18
> >
> >
> >>sum(res at description@preprocessing$affinity != 0)
> >
> > [1] 175407
> >
> > # ----------------------------------------------------------------------
> >
>
>
> --
> James W. MacDonald
> Affymetrix and cDNA Microarray Core
> University of Michigan Cancer Center
> 1500 E. Medical Center Drive
> 7410 CCGC
> Ann Arbor MI 48109
> 734-647-5623
>
> _______________________________________________
> Bioconductor mailing list
> Bioconductor at stat.math.ethz.ch
> https://stat.ethz.ch/mailman/listinfo/bioconductor
>
--
Jeremy Gollub
jeremy at gollub.net
More information about the Bioconductor
mailing list