converting a dataset

View: New views
4 Messages — Rating Filter:   Alert me  

converting a dataset

by Nancy Adam :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

Some parts of this message have been removed. Learn more about Nabble's security policy.

 

Hi everyone,

Can anyone please tell me how I can convert a dataset.arff into .data?

Cheers,

Nancy




Windows Live: Make it easier for your friends to see what you’re up to on Facebook.
_______________________________________________
Wekalist mailing list
Send posts to: Wekalist@...
List info and subscription status: https://list.scms.waikato.ac.nz/mailman/listinfo/wekalist
List etiquette: http://www.cs.waikato.ac.nz/~ml/weka/mailinglist_etiquette.html

Re: converting a dataset

by Peter Reutemann-3 :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

> Can anyone please tell me how I can convert a dataset.arff into .data?

Do you mean C4.5 format?

Either use the Explorer (load the ARFF file and save it as C4.5 file)
or use the following converter:
  java weka.core.converters.C45Saver -i <the_arff_file.arff> -o
<the_c45_file.data> -c <class_index>

Don't worry, if you can't specify a .names and .data file. Both of
them will get generated automatically.

Cheers, Peter
--
Peter Reutemann, Dept. of Computer Science, University of Waikato, NZ
http://www.cs.waikato.ac.nz/~fracpete/           Ph. +64 (7) 858-5174

_______________________________________________
Wekalist mailing list
Send posts to: Wekalist@...
List info and subscription status: https://list.scms.waikato.ac.nz/mailman/listinfo/wekalist
List etiquette: http://www.cs.waikato.ac.nz/~ml/weka/mailinglist_etiquette.html

RE: converting a dataset

by Nancy Adam :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

Some parts of this message have been removed. Learn more about Nabble's security policy.

 

Hi all,

Thank you Peter so much for your reply.

Could you please tell me the difference between .data, .arff, .names and .dat? can I use any of them in java classifier program?

 

How can I partition dataset.arff into two files one for training and other one for testing?

Many thanks,

Nancy

 


> From: fracpete@...
> Date: Thu, 29 Oct 2009 09:20:13 +1300
> Subject: Re: [Wekalist] converting a dataset
> To: wekalist@...
>
> > Can anyone please tell me how I can convert a dataset.arff into .data?
>
> Do you mean C4.5 format?
>
> Either use the Explorer (load the ARFF file and save it as C4.5 file)
> or use the following converter:
> java weka.core.converters.C45Saver -i <the_arff_file.arff> -o
> <the_c45_file.data> -c <class_index>
>
> Don't worry, if you can't specify a .names and .data file. Both of
> them will get generated automatically.
>
> Cheers, Peter
> --
> Peter Reutemann, Dept. of Computer Science, University of Waikato, NZ
> http://www.cs.waikato.ac.nz/~fracpete/ Ph. +64 (7) 858-5174
>
> _______________________________________________
> Wekalist mailing list
> Send posts to: Wekalist@...
> List info and subscription status: https://list.scms.waikato.ac.nz/mailman/listinfo/wekalist
> List etiquette: http://www.cs.waikato.ac.nz/~ml/weka/mailinglist_etiquette.html


Windows Live Hotmail: Your friends can get your Facebook updates, right from Hotmail®.
_______________________________________________
Wekalist mailing list
Send posts to: Wekalist@...
List info and subscription status: https://list.scms.waikato.ac.nz/mailman/listinfo/wekalist
List etiquette: http://www.cs.waikato.ac.nz/~ml/weka/mailinglist_etiquette.html

Re: converting a dataset

by Peter Reutemann-3 :: Rate this Message:

Reply to Author | View Threaded | Show Only this Message

Please no top-posting, see mailing list etiquette why
(http://www.cs.waikato.ac.nz/~ml/weka/mailinglist_etiquette.html).

> Could you please tell me the difference between .data, .arff, .names and
> .dat? can I use any of them in java classifier program?

They're just different data formats (just open them in a text editor
and you'll see the difference). And you need specific converters for
loading/saving them. ARFF is Weka's own (and preferred) data format.

> How can I partition dataset.arff into two files one for training and other
> one for testing?

I've added a new FAQ: How do I divide a dataset into training and test set?
Link to the FAQs available from the Weka homepage.

NB: The Explorer allows you to split a the currently loaded dataset
into a training and test set on-the-fly.

Cheers, Peter
--
Peter Reutemann, Dept. of Computer Science, University of Waikato, NZ
http://www.cs.waikato.ac.nz/~fracpete/           Ph. +64 (7) 858-5174

_______________________________________________
Wekalist mailing list
Send posts to: Wekalist@...
List info and subscription status: https://list.scms.waikato.ac.nz/mailman/listinfo/wekalist
List etiquette: http://www.cs.waikato.ac.nz/~ml/weka/mailinglist_etiquette.html