[R] Looking for package for data generation for classification and regression

Paul Smith
Fri Mar 4 11:41:24 CET 2022


On Fri, Mar 4, 2022 at 8:07 AM Ranjan Maitra <mlmaitra using gmx.com> wrote:
>
> > I am in need of generating artificial data for machine learning
> > classification and regression analysis. What I am looking for is
> > something similar to Python sklearn.datasets.make_classification and
> > sklearn.datasets.make_regression:
> >
> > https://scikit-learn.org/stable/modules/generated/sklearn.datasets.make_classification.html
> >
> > https://scikit-learn.org/stable/modules/generated/sklearn.datasets.make_regression.html
> >
> > I have searched CRAN for something similar, but found nothing. Could
> > someone please help me with this?
>
> Not sure if this helps, but at least for classification and clustering,
> there is the MixSim package on CRAN, which simulates datasets from
> Gaussian mixtures with a specified level of overlap between clusters.

Thanks, Ranjan, that is also quite helpful, since clustering is also a
topic of the course!
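
For the regression side, which MixSim does not cover, something along the
lines of make_regression can be sketched in base R. This is only a rough
illustration: the function name make_regression_r, its arguments and the
coefficient range are my own assumptions, not taken from any CRAN package
or from scikit-learn's implementation.

```r
# Hypothetical helper, loosely modelled on the idea behind
# sklearn.datasets.make_regression; not part of any CRAN package.
make_regression_r <- function(n = 100, p = 10, n_informative = 3, noise = 1) {
  stopifnot(n_informative <= p)

  # Features: independent standard normal columns
  X <- matrix(rnorm(n * p), nrow = n, ncol = p)

  # Only a random subset of the features gets a non-zero coefficient
  beta <- numeric(p)
  informative <- sample.int(p, n_informative)
  beta[informative] <- runif(n_informative, min = 0, max = 100)  # arbitrary range

  # Linear response plus Gaussian noise
  y <- drop(X %*% beta) + rnorm(n, sd = noise)

  list(X = X, y = y, beta = beta)
}

set.seed(42)
sim <- make_regression_r(n = 200, p = 5, n_informative = 2, noise = 0.5)

# A linear fit should roughly recover the non-zero entries of sim$beta
fit <- lm(sim$y ~ sim$X)
```

Returning the true coefficients alongside X and y makes it easy to check
in class how well a fitted model recovers them.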

Paul


