[R] variable names
Luis Miguel Almeida da Silva
lsilva at fc.up.pt
Tue Jul 22 16:40:21 CEST 2003
It worked! thank you
-----Original Message-----
From: Torsten Hothorn [mailto:hothorn at ci.tuwien.ac.at]
Sent: Tue 22/07/2003 15:16
To: Luis Miguel Almeida da Silva
Cc: r-help at stat.math.ethz.ch
Subject: RE: [R] variable names
On Tue, 22 Jul 2003, Luis Miguel Almeida da Silva wrote:
> I didn't noticed that fact. I've already found a way to do that
>
> x <- 1:40
> colnames(df.treino) <- paste("Ncp",x,sep=".")
>
> and this generates names that I can relate with the variables. Thanks anyway
>
> The problem is that I use rpart in a loop and the class labels are in the last column. For the above example I would "type"
>
> rpart(Ncp.40~.,data=df.treino)
>
> But in the next step of the loop I can have only 35 variables and the class labels would be at the Ncp.36. So I have to refresh the formula in rpart... and that is my problem
>
R> df.treino <- data.frame(matrix(rnorm(25), ncol=5))
R> thisformula <- as.formula(paste(colnames(df.treino)[ncol(df.treino)],
"~ ."))
R> thisformula
X5 ~ .
R> rpart(thisformula, data = df.treino)
n= 5
node), split, n, deviance, yval
* denotes terminal node
1) root 5 3.032904 -0.3392065 *
Torsten
> -----Original Message-----
> From: Torsten Hothorn [mailto:hothorn at ci.tuwien.ac.at]
> Sent: Tue 22/07/2003 14:57
> To: Luis Miguel Almeida da Silva
> Cc: r-help at stat.math.ethz.ch
> Subject: Re: [R] variable names
>
>
>
>
> On Tue, 22 Jul 2003, Luis Miguel Almeida da Silva wrote:
>
> > Dear helpers
> >
> > I want to use rpart several times in a loop to build a classification tree. My problem is that rpart needs a formula as argument and for that the variables need to have names and this doesn't happen in my case. Every iteration in the loop has a different dataset with several variables (ex. 38 or more) and so I can't type the names by hand every time. Is there any function that generates names for variables in a dataframe. If so, how can I use then the argument
> >
>
> If your data is organised in a data.frame, (dummy) variable names are
> available by default:
>
> R> mydata <- data.frame(matrix(rnorm(25), ncol=5))
> R> mydata
> X1 X2 X3 X4 X5
> 1 1.3806313 -0.41827136 0.9591628 -1.3351038 0.02746110
> 2 0.5114590 -1.34111439 -0.9617552 -0.8367088 -0.06913021
> 3 -1.7508089 -0.49387076 -1.7597395 2.3899490 -0.15209650
> 4 -1.6753809 -1.28381808 -1.0424903 0.1002998 0.27784949
> 5 -0.2605535 -0.09035652 -2.5786418 1.0483400 -0.70445615
> R> rpart(X1 ~ ., data = mydata)
> n= 5
>
> node), split, n, deviance, yval
> * denotes terminal node
>
> 1) root 5 7.463698 -0.3589306 *
>
> best,
>
> Torsten
>
> > rpart(classlabels~. ,.....)
> >
> > thanks
> >
> > ______________________________________________
> > R-help at stat.math.ethz.ch mailing list
> > https://www.stat.math.ethz.ch/mailman/listinfo/r-help
> >
> >
>
>
>
>
More information about the R-help
mailing list