[R] Problems with data structure when using plsr() from package pls

Bjørn-Helge Mevik b.h.mevik at usit.uio.no
Fri Jan 15 13:37:30 CET 2016


Jeff Newmiller <jdnewmil at dcn.davis.ca.us> writes:

> Using I() in the data.frame seems ill-advised to me. You complain about 96
> variables but from reading your explanation that seems to be what your data
> are.

In PSLR, it is common to regress a variable against matrices with very
many coloumns, often several thousands.  Using a data frame with one
predictor variable for each coloumn is going to make the formula
handling very slow.  And if you have several such predictor matrices, it
is very practical to keep them as single variables in the data frame, so
you easily can select/deselect which groups of variables you want in the
model.

-- 
Regards,
Bjørn-Helge Mevik



More information about the R-help mailing list