[BioC] unbalanced factorial design

Naomi Altman naomi at stat.psu.edu
Fri Feb 27 22:55:45 MET 2004


Can you all use lme() on
20,000 probe sets and live to talk about it?

It depends on how much patience you have!

I have e-talked with Doug Bates about this.  He says that a much faster 
version of lme will soon be available.

But lme can be very slow.  I generally break the data up into smaller sets 
and run 1 set per night.  If it takes 20 minutes per run, you need to 
consider using a multiprocessor unix system and splitting the sets among 
the systems.

--Naomi


At 03:13 PM 2/25/2004, Francois Collin wrote:
>Running lme on a single probe set takes about 20
>minutes computing time on my PC.  I'm running R on
>windows, which I know can run into memory management
>problems, but this problem appears to be completely
>cpu bound.
>
>The model that I'm fitting has repeated measures, 6
>Visits per Subject, for 20 Subjects.  Subjects have a
>Treatment attribute and I'm interested in the fully
>saturated model with Visit, Treatment and interaction
>effects.   The call to lme() I use is something like
>this:
>
>lme(fixed= Expr ~ Visit:Treatment,
>     random= ~ Visit | Subject)
>
>The results appear ok, but it takes 20 minutes to run.
>  Am I doing something wrong?  Can you all use lme() on
>20,000 probe sets and live to talk about it?
>
>Thanks for any insight into this problem.
>
>-francois
>
>
>--- Naomi Altman <naomi at stat.psu.edu> wrote:
> > I find that the simplest thing to do is to write my
> > own function that
> > includes the appropriate call to lme.   That way I
> > do not need to worry
> > about grabbing components from complicated objects
> > and passing arguments to
> > lme.   I do have to write my own calling function
> > for each experiment, but
> > that takes only a few minutes.
> >
> > --Naomi
> >
> > At 10:15 PM 2/4/2004, Vincent Carey 525-2265 wrote:
> >
> > > >
> > > > I am not sure about if bioconductor includes any
> > functions for
> > > > mixed-effect models. there are several packages
> > in R handles mixed-effect
> > > > models, the most complete one is nlme.
> > >
> > >it is not too difficult to run gene-specific mixed
> > >effects models using the combination of esApply (in
> > >Biobase) and lme (in nlme).  the non-trivial part
> > is
> > >to properly specify the function (esApply parameter
> > FUN)
> > >to invoke through esApply.  the design will be
> > derivable from
> > >information in the phenoData component.  all
> > variables
> > >in phenoData are visible to the FUN for esApply, so
> > the
> > >model formula can be specified fairly naturally,
> > thanks
> > >to the environment manipulations provided in
> > esApply
> > >(by RG).
> > >
> > >with appropriately structured experimental designs
> > in
> > >which expression might vary smoothly but
> > nonlinearly
> > >as a function of some design variable, nlme models
> > may
> > >be of interest to fit through esApply as well.
> > >
> > >so the question "does bioconductor include
> > functions
> > >for ... modeling" often has a negative answer -- we
> > don't
> > >aim to have functions for all conceivable
> > approaches to
> > >modeling bioinformatic data.  we prefer to have
> > interfaces
> > >that allow existing functions in R to be reused
> > conveniently
> > >and at the option of the analyst, in the
> > bioinformatic context.
> > >
> > >_______________________________________________
> > >Bioconductor mailing list
> > >Bioconductor at stat.math.ethz.ch
> >
> >https://www.stat.math.ethz.ch/mailman/listinfo/bioconductor
> >
> > Naomi S. Altman
> > 814-865-3791 (voice)
> > Associate Professor
> > Bioinformatics Consulting Center
> > Dept. of Statistics
> > 814-863-7114 (fax)
> > Penn State University
> > 814-865-1348 (Statistics)
> > University Park, PA 16802-2111
> >
> > _______________________________________________
> > Bioconductor mailing list
> > Bioconductor at stat.math.ethz.ch
> >
>https://www.stat.math.ethz.ch/mailman/listinfo/bioconductor

Naomi S. Altman                                814-865-3791 (voice)
Associate Professor
Bioinformatics Consulting Center
Dept. of Statistics                              814-863-7114 (fax)
Penn State University                         814-865-1348 (Statistics)
University Park, PA 16802-2111



More information about the Bioconductor mailing list