Java Mailing List Archive

http://www.r-help.com/

Home » Home (12/2007) » R Help for Statistical Computing »

[R] imbalanced data set

WeiWei Shi

2005-07-24


Hi,
I have a question of classification on imbalanced dataset. I am
wondering if there is a package which can solve this problem via
sampling approach, like one-sided selection.

A follow-up question is, how to select those 'representative' samples
and remove noise/borderlines and redundancy in order to increase
classification accuracy. Is there any work which has been implemented
in R or some GNU softwares?

Thanks,

weiwei



--
Weiwei Shi, Ph.D

"Did you always know?"
"No, I did not. But I believed..."
---Matrix III

______________________________________________
R-help@(protected)
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide! http://www.R-project.org/posting-guide.html
©2008 r-help.com - Jax Systems, LLC, U.S.A.