« Return to Thread: [jira] Created: (TIKA-238) Better handling of delegating parser implementations

[jira] Created: (TIKA-238) Better handling of delegating parser implementations

by JIRA jira@apache.org :: Rate this Message:

Reply to Author | View in Thread

Better handling of delegating parser implementations
----------------------------------------------------

                 Key: TIKA-238
                 URL: https://issues.apache.org/jira/browse/TIKA-238
             Project: Tika
          Issue Type: Improvement
          Components: config, parser
            Reporter: Jukka Zitting
            Assignee: Jukka Zitting
            Priority: Minor
             Fix For: 0.4


Delegating Parser implementations (i.e. ones that call another Parser instance to process some parts of the input document) like the PackageParser subclasses currently simply instantiate a new AutoDetectParser() whenever a more specific delegate parser has not been specified. This is troublesome as it 1) makes customization more difficult and 2) performs poorly since a new AutoDetectParser (with a new freshly parsed copy of the default configuration) is currently instantiated whenever such a delegating parser is used.

Instead of working around the issue with things like keeping a static copy of the pre-parsed default configuration, I'd like to solve this issue by introducing a new DelegatingParser abstraction for this and hooking it to the configuration mechanism.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

 « Return to Thread: [jira] Created: (TIKA-238) Better handling of delegating parser implementations