
Re: Fwd: WEKA IBK question

by Peter Reutemann-3


> I have files containing movie reviews from the OpenLensGroup DB.
> I have a file containing user data (user Id-number, age-number, gender-M/F,
> occupation-number, zipcode-number)
> and a file containing movie review data (user id-number, movie id-number,
> grade-number)
> [in the movie review file the same user Id can repeat but with different
> movie Id in each row]
>
> In order to do that I tried the KNN algorithm, and for that I want to use
> the WEKA package.
> However, I don't know how to define the arff file based on the 2 files I
> already have (of user, movie data).

Weka only allows flat files (apart from the multi-instance
classifiers), i.e., you have to join those two files (like with an SQL
join) and create one file out of them. For details on the ARFF format,
see the following wiki article:
  http://weka.wiki.sourceforge.net/ARFF
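
The join described above could be sketched as follows. This is only a
minimal Python illustration, not part of Weka: the sample values, the
helper names (join_reviews_with_users, to_arff), the relation name and
the attribute types are all assumptions made for the example. In
particular, you may prefer to declare the ID columns as nominal rather
than numeric.

```python
import csv
import io

# Hypothetical sample data. The column layout follows the description in the
# quoted question; the values themselves are made up for illustration.
USERS_CSV = """user_id,age,gender,occupation,zipcode
1,24,M,3,85711
2,53,F,7,94043
"""
REVIEWS_CSV = """user_id,movie_id,grade
1,10,4
1,20,3
2,10,5
"""

# Column order of the flat (joined) file.
COLUMNS = ["user_id", "age", "gender", "occupation", "zipcode", "movie_id", "grade"]


def join_reviews_with_users(users_text, reviews_text):
    """Inner join on user_id: one output row per review."""
    users = {row["user_id"]: row for row in csv.DictReader(io.StringIO(users_text))}
    joined = []
    for review in csv.DictReader(io.StringIO(reviews_text)):
        user = users.get(review["user_id"])
        if user is None:
            continue  # a review without a matching user is dropped
        joined.append({**user, **review})
    return joined


def to_arff(rows, relation="user_movie_reviews"):
    """Render the joined rows as ARFF text (attribute types are assumptions)."""
    lines = [f"@relation {relation}", ""]
    for name in COLUMNS:
        # gender is the only column declared nominal in this sketch
        arff_type = "{M,F}" if name == "gender" else "numeric"
        lines.append(f"@attribute {name} {arff_type}")
    lines += ["", "@data"]
    for row in rows:
        lines.append(",".join(row[name] for name in COLUMNS))
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(to_arff(join_reviews_with_users(USERS_CSV, REVIEWS_CSV)))
```

Note that each user's attributes are repeated once per review, which
is what a flat file implies.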

> I don't know what are the attributes that the generic IBK classifier can
> take

Assuming that you use 3.6.x or 3.7.x of Weka, bring up the
GenericObjectEditor dialog for IBk (e.g., in the Explorer) and click
on the "Capabilities" button. That tells you what attribute and class
types the classifier can handle.

> and based on them
> give me a list of movie Id 's from the K-nearest users.

The nearest neighbor search that IBk uses works on a whole instance.
If you give this search an Instance containing a specific user-movie
relation, then it will return the k-nearest other user-movie relations
(but not the movie IDs from the k-nearest users!).
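
To illustrate the difference, here is a small Python sketch of a
nearest neighbor search over joined rows. It is not Weka's
implementation: it uses plain Euclidean distance on raw values, whereas
Weka's distance functions may normalize attributes and treat nominal
values differently. The rows, the reduced column set and the function
names are made up for the example.

```python
import math

# Hypothetical joined rows: (user_id, age, movie_id, grade). Values are made up.
ROWS = [
    (1, 24, 10, 4),
    (1, 24, 20, 3),
    (2, 53, 10, 5),
    (3, 25, 30, 4),
]


def euclidean(a, b):
    """Plain Euclidean distance over all columns of two rows."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def k_nearest_rows(query, rows, k):
    """Return the k rows closest to the query row, excluding the query itself."""
    candidates = [row for row in rows if row != query]
    return sorted(candidates, key=lambda row: euclidean(query, row))[:k]


if __name__ == "__main__":
    for row in k_nearest_rows(ROWS[0], ROWS, k=2):
        print(row)
```

With this data, the closest row to user 1's review of movie 10 is user
1's own review of movie 20. The search returns whole rows, and several
of them can belong to the same user, so getting "movies from the
k-nearest users" would need a different representation (e.g., one
instance per user).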

> I need the accuracy to be based on the movies the users have seen (meaning
> that if a user has seen movies A, B, C
> and I select him for the test data, then if the training data are movies A, B
> and IBK offers movie C, the accuracy will increase).
> Do you have an idea on how to do that using WEKA's IBK?

I leave these questions to someone else on the list.

Cheers, Peter
--
Peter Reutemann, Dept. of Computer Science, University of Waikato, NZ
http://www.cs.waikato.ac.nz/~fracpete/           Ph. +64 (7) 858-5174

