[R-sig-hpc] Handling data with thousands of variables
Mauricio Zambrano-Bigiarini
mauricio.zambrano at jrc.ec.europa.eu
Tue Jun 28 09:46:35 CEST 2011
Håvard Wahl Kongsgård wrote:
> In machine learning settings it's not uncommon that the data has
> thousands of variables. The same is also the case with genetic
> studies.
>
> In R what is the best approach for handling such data? Any personal
> experience with handling such data in R?
>
> In my case the raw data is a response variable and an unstructured
> tuple of string keywords.
>
> 1341,{"Harry","Larry","Kline"}
> 54232,{"Mary","Kline","Larry"}
> 54232,{"David","Line","Lars"}
>
>
> - Håvard
Have you had a look at Bioconductor?
http://www.bioconductor.org/
http://manuals.bioinformatics.ucr.edu/home/ht-seq#R_BACK
I hope this helps.
Kind regards,
Mauricio
--
=======================================================
Linux user #454569 -- Ubuntu user #17469
=======================================================
"Don't wish for less problems, wish for more skills.
Don't wish it were easier, wish you were better."
(Jim Rohn)