> I have a dataset with numeric and string attributes and want to use K-means,
> maybe later even X-means. Is there any possibility to use different metrics
> for the attributes? E.g., Euclidean for numeric and edit distance for strings?
No, you can only use a single distance function (though you can
specify which attribute ranges it uses in the calculation). But you can
always implement your own distance function that delegates to different
sub-distance functions for the different attribute types. A distance
function has to implement the interface weka.core.DistanceFunction.
See the other distance functions for implementation examples.
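
To illustrate the idea, here is a minimal sketch of the combining logic
only. It deliberately does not depend on Weka: the class name MixedDistance,
the Object[] row representation and the choice to normalize the edit
distance by the longer string's length are all assumptions made for this
example, not part of the Weka API. In a real implementation the same
per-attribute logic would sit inside your own class implementing
weka.core.DistanceFunction. The sketch also assumes the numeric values have
already been scaled to a comparable range.

```java
public class MixedDistance {

    // Levenshtein edit distance, computed with two rolling rows.
    static int editDistance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // turning "" into b[0..j) takes j insertions
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i; // turning a[0..i) into "" takes i deletions
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1),
                                   prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    // Edit distance scaled to [0, 1] so that it is comparable to
    // pre-scaled numeric differences (an assumption of this sketch).
    static double normalizedEditDistance(String a, String b) {
        int maxLen = Math.max(a.length(), b.length());
        return maxLen == 0 ? 0.0 : (double) editDistance(a, b) / maxLen;
    }

    // Combines the per-attribute distances Euclidean-style: numeric
    // attributes contribute their difference, string attributes their
    // normalized edit distance.
    static double distance(Object[] x, Object[] y) {
        if (x.length != y.length) {
            throw new IllegalArgumentException("rows differ in length");
        }
        double sumSq = 0.0;
        for (int i = 0; i < x.length; i++) {
            double d;
            if (x[i] instanceof Number && y[i] instanceof Number) {
                d = ((Number) x[i]).doubleValue() - ((Number) y[i]).doubleValue();
            } else if (x[i] instanceof String && y[i] instanceof String) {
                d = normalizedEditDistance((String) x[i], (String) y[i]);
            } else {
                throw new IllegalArgumentException("unsupported attribute at index " + i);
            }
            sumSq += d * d;
        }
        return Math.sqrt(sumSq);
    }
}
```

Calling MixedDistance.distance(new Object[]{1.0, "foo"}, new Object[]{2.0, "bar"})
then yields one number that reflects both attribute types. How the
sub-distances should be weighted against each other depends on your data.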
Cheers, Peter
--
Peter Reutemann, Dept. of Computer Science, University of Waikato, NZ
http://www.cs.waikato.ac.nz/~fracpete/ Ph. +64 (7) 858-5174