[jira] Created: (NUTCH-763) Separate configuration files from resources to be included in the job file

by JIRA jira@apache.org

Separate configuration files from resources to be included in the job file
--------------------------------------------------------------------------

                 Key: NUTCH-763
                 URL: https://issues.apache.org/jira/browse/NUTCH-763
             Project: Nutch
          Issue Type: Wish
            Reporter: Julien Nioche
            Priority: Minor
             Fix For: 1.1


One of the things I found confusing when I was learning Nutch was that the conf/ directory contains two kinds of files at once:
- configuration files for Hadoop / Nutch which are put in the jar files but not used there
- resource files (e.g. filtering rules) which MUST be up to date in the job file

I would separate the conf/ directory from, say, a resources/ directory which would contain the rule files and anything else that has to go in the job file. Unless I am mistaken, none of the configuration files need to be in the job file. I know it is a very minor point, but it would probably simplify things and make it easier for beginners to understand what has to be modified where.
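
As a rough illustration of the proposed split, the job file could be assembled from the compiled classes plus resources/ only, leaving conf/ out. This is a sketch, not Nutch's actual build.xml: the project name, property names, output file name and the resources/ directory are all assumptions made for the example.

```xml
<!-- Hypothetical sketch: names and paths below are illustrative assumptions,
     not taken from the real Nutch build file. -->
<project name="job-layout-sketch" default="job">
  <!-- Hadoop / Nutch configuration files: deliberately not packaged in the job file -->
  <property name="conf.dir" value="conf"/>
  <!-- Rule files and other resources that must be up to date in the job file -->
  <property name="resources.dir" value="resources"/>
  <property name="build.dir" value="build"/>
  <property name="build.classes" value="${build.dir}/classes"/>

  <target name="job">
    <jar destfile="${build.dir}/sketch.job">
      <!-- compiled classes -->
      <zipfileset dir="${build.classes}"/>
      <!-- resources only; conf.dir is intentionally not referenced here -->
      <zipfileset dir="${resources.dir}"/>
    </jar>
  </target>
</project>
```

With a layout like this, editing a filtering rule would mean touching resources/ and rebuilding the job file, while changes under conf/ would not require a rebuild.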
