[R] rpart unbalanced data
Dr. Diego Kuonen
kuonen at statoo.com
Fri Jul 21 14:21:44 CEST 2006
Dear Helen,
You may want to have a look at
http://www.togaware.com/datamining/survivor/Predicting_Fraud.html
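
For reference, here is a minimal sketch of how class priors are passed to rpart, in case it helps to check your own call. It is only an illustration under stated assumptions: the data are the built-in iris set, artificially thinned to 15 observations for one class (not your spectral data), and the equal priors are just one possible choice, not a recommendation taken from the page above.

```r
library(rpart)

# Illustrative unbalanced data (an assumption, not the poster's data):
# keep all setosa and virginica rows but only 15 versicolor rows.
set.seed(1)
rare <- which(iris$Species == "versicolor")
keep <- c(which(iris$Species != "versicolor"), sample(rare, 15))
dat  <- iris[keep, ]
print(table(dat$Species))

# Equal priors, given in the order of the factor levels; they must sum to 1.
k   <- nlevels(dat$Species)
fit <- rpart(Species ~ ., data = dat, method = "class",
             parms = list(prior = rep(1 / k, k)))

# Confusion table on the training data. This is optimistic, so judge the
# per-class errors on held-out data or by cross-validation in practice.
pred <- predict(fit, dat, type = "class")
conf <- table(observed = dat$Species, predicted = pred)
print(conf)
```

Reading the table row by row shows how the small class fares on its own, which the overall error rate tends to hide when the classes are badly unbalanced.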
Greets,
Diego Kuonen
helen.mills at yale.edu wrote:
> Hello all,
> I am currently working with rpart to classify vegetation types by spectral
> characteristics, and am coming up with poor classifications based on the fact
> that I have some vegetation types that have only 15 observations, while others
> have over 100. I have attempted to supply prior weights to the dataset, though
> this does not improve the classification greatly. Could anyone supply some
> hints about how to improve a classification for a badly unbalanced dataset?
>
> Thank you,
> Helen Mills Poulos
--
Dr. ès sc. Diego Kuonen, CEO phone +41 (0)21 693 5508
Statoo Consulting fax +41 (0)21 693 8765
PO Box 107 mobile +41 (0)78 709 5384
CH-1015 Lausanne 15 email kuonen at statoo.com
web http://www.statoo.info skype Kuonen.Statoo.Consulting
-----------------------------------------------------------------
| Statistical Consulting + Data Analysis + Data Mining Services |
-----------------------------------------------------------------
+ Are you drowning in information and starving for knowledge? +
+ Have you ever been Statooed? http://www.statoo.biz +