Separate configuration files from resources to be included in the job file
--------------------------------------------------------------------------
Key: NUTCH-763
URL:
https://issues.apache.org/jira/browse/NUTCH-763 Project: Nutch
Issue Type: Wish
Reporter: Julien Nioche
Priority: Minor
Fix For: 1.1
One of the things I found confusing when I was learning Nutch was the fact that the conf/ directory contains at the same time :
- configuration files for Hadoop / Nutch which are put in the jar files but not used there
- resource files (e.g. filtering rules) which MUST be up to date in the job file
I would separate the conf/ directory from say a resources/ directory which would contain the rule files and other things to put in the job file. Unless I am mistaken none of the configuration files need to be in the job file. I know it is a very minor point, but that would probably simplify things and make it easier for beginners to understand what has to be modified where.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.