[R-pkgs] New package: noia

Arnaud Le Rouzic a.p.s.l.rouzic at bio.uio.no
Fri Aug 1 17:55:03 CEST 2008

Hi the list,

A new version (0.92) of my package 'noia' will be available soon on CRAN 
mirrors, and I think it might be a good opportunity to introduce it 
shortly to the R community.

In summary: 'noia' will be of absolutely no interest for 99.99% of you. 
The 0.01% remaining are quantitative geneticists who are interested in 
measuring the effect of genes in a proper way.

Since some of you may want to know why it is worth to let this 'noia' 
package take a few bytes on the CRAN server, I will try to introduce 
very shortly the scientific problem. The rules that explain the 
transmission of genes between generations are well known since the end 
of the 19th century. One important consequence of these rules is that in 
all diploid organisms (having two copies of the genomes, i.e. most of 
them, including us), it is virtually impossible to produce an equal 
number of offspring of each "type" from non-inbred parents: you end up, 
most of the time, with frequencies such as 1/4 vs 3/4, or 7/16 vs 9/16, 
or many other combinations. For quantitative traits, the link between 
the genes and the characters is complex, and the effect of genes has to 
be measured through models that look like cross-factor ANOVA designs. 
Because of the unequal segregating ratios, these ANOVAs are highly 
unbalanced, with all well-known annoying consequences: correlations 
between effect estimates, non-orthogonal variance decomposition, etc.

For a long time, comparing estimates measured in differently designed 
populations was not considered as a big issue. However, the recent need 
of effective model selection procedures lead to the proposition of new 
models aiming to a (more or less) orthogonal decomposition of genetic 
effects. This package provides tools to perform linear regressions using 
various models, to manipulate the genetic effects, and to compute 
genotype to phenotype maps, i.e. the function explaining how the genes 
expresses a given character. In addition, the 'noia' package provides a 
tool to perform 'multilinear' regression, an attempt of non-linear 
genotype-phenotype mapping.

In summary, 'noia' is a wrapper for lm() and nls() functions in a very 
specific context: estimating the effects of genes on a given trait.

NB: 'noia' is useless if the location of the genes is not known. 
Locating the genes requires a QTL detection procedure, for which 
packages already exist (see Rqtl for instance).

Much more information can be found in scientific publications. Two are 
particularly relevant:

* This package and its functions are described in a recent paper:
Arnaud Le Rouzic and José M. Álvarez-Castro, Estimation of Genetic 
Effects and Genotype-Phenotype Maps, /Evolutionary Bioinformatics 
/2008:4 225-235.  http://la-press.com/article.php?article_id=887

* The corresponding statistical framework is extensively described in:
José M. Álvarez-Castro and Örjan Carlborg, A unified model for 
functional and statistical epistasis and its application in QTL 
analysis, Genetics, Vol. 176, 1151-1167 2007. 

Given the probable low impact of the package, I am not planning to 
follow the R help mailing list. However, my email address is everywhere 
in the manual, and I would be very happy to answer questions related to 
this package; so don't hesitate to forward potential questions or to 
encourage users to contact me directly.

Arnaud Le Rouzic.

More information about the R-packages mailing list