[R-sig-hpc] Handling data with thousands of variables

Mauricio Zambrano-Bigiarini mauricio.zambrano at jrc.ec.europa.eu
Tue Jun 28 09:46:35 CEST 2011


Håvard Wahl Kongsgård wrote:
> In machine learning settings it's not uncommon that the data has
> thousands of variables. The same is also the case with genetic
> studies.
> 
> In R what is the best approach for handling such data? Any personal
> experience with handling such data in R?
> 
> For my case the raw data is a response variable and a unstructured
> tuple with string keywords.
> 
> 1341,{"Harry","Larry","Kline"}
> 54232,{"Mary","Kline","Larry"}
> 54232,{"David","Line","Lars"}
> 
> 
> - Håvard
> 
> _______________________________________________
> R-sig-hpc mailing list
> R-sig-hpc at r-project.org
> https://stat.ethz.ch/mailman/listinfo/r-sig-hpc
> 
Did you have a look to Bioconductor :

http://www.bioconductor.org/

http://manuals.bioinformatics.ucr.edu/home/ht-seq#R_BACK

?

IHTH.

Kinds,

Mauricio

-- 
=======================================================
Linux user #454569 -- Ubuntu user #17469
=======================================================
"Don't wish for less problems, wish for more skills.
Don't wish it were easier, wish you were better."
(Jim Rohn)



More information about the R-sig-hpc mailing list