Discretization in OneR

View: New views
2 Messages — Rating Filter:   Alert me  

Discretization in OneR

by Polczynski, Mark :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

I am using OneR on a version of the weather dataset (outlook/temp/humid/ windy/play) which has numerical attributes for temperature and humidity.  Is there a way to see the bin ranges that OneR uses to discretize these numerical attributes?  I would like to know how many bins Weka used, and which bins the instances were put into.

Thanks!

Mark Polczynski

_______________________________________________
Wekalist mailing list
Send posts to: Wekalist@...
List info and subscription status: https://list.scms.waikato.ac.nz/mailman/listinfo/wekalist
List etiquette: http://www.cs.waikato.ac.nz/~ml/weka/mailinglist_etiquette.html

Re: Discretization in OneR

by Peter Reutemann-3 :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

> I am using OneR on a version of the weather dataset (outlook/temp/humid/ windy/play) which has numerical attributes for temperature and humidity.  Is there a way to see the bin ranges that OneR uses to discretize these numerical attributes?

If OneR chooses a numeric attribute to base its model on (the
classifiers uses only a single attribute!), then the output is as
follows (UCI dataset "balance-scale"):

left-weight:
        < 2.5 -> R
        >= 2.5 -> L

For numeric attributes, the generated rule holds the breakpoints (=
borders between bins). The above example has one breakpoint and
therefore two bins.

Cheers, Peter
--
Peter Reutemann, Dept. of Computer Science, University of Waikato, NZ
http://www.cs.waikato.ac.nz/~fracpete/           Ph. +64 (7) 858-5174

_______________________________________________
Wekalist mailing list
Send posts to: Wekalist@...
List info and subscription status: https://list.scms.waikato.ac.nz/mailman/listinfo/wekalist
List etiquette: http://www.cs.waikato.ac.nz/~ml/weka/mailinglist_etiquette.html