[ https://issues.apache.org/jira/browse/TIKA-232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jukka Zitting resolved TIKA-232.
--------------------------------
Resolution: Duplicate
Assignee: Jukka Zitting
With TIKA-238 resolved, the former case above is now the default:
Parser parser = new ZipParser();
And the latter case is much simpler:
TikaConfig config = TikaConfig.getDefaultConfig(); // without a delegate parser
Parser parser = new AutoDetectParser(config);
Resolving this as a Duplicate of TIKA-238.
> Scanning of archive files
> -------------------------
>
> Key: TIKA-232
> URL: https://issues.apache.org/jira/browse/TIKA-232
> Project: Tika
> Issue Type: New Feature
> Components: parser
> Affects Versions: 0.3
> Environment: All
> Reporter: Karl Heinz Marbaise
> Assignee: Jukka Zitting
> Priority: Minor
>
> If I parse an archive, all the files inside the archive are extracted along with their text. It would be nice to have the choice to extract only the list of files (the directory) of an archive instead of the whole contents. This seems to be useful only for zip, tar, tar.gz, tar.bz2, and .jar files. Maybe this could be realized through a different call or a run-time configuration option.
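
For illustration, the directory-only listing requested above can be sketched for the zip and jar case with the JDK's java.util.zip classes alone. This is a minimal sketch and not Tika's API: the class name ArchiveListing and the method listEntries are hypothetical, and tar, tar.gz and tar.bz2 would need a separate library.

```java
import java.io.IOException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;

// Hypothetical helper (not part of Tika): collects the entry names of a
// zip or jar stream without handing any entry content to a parser.
final class ArchiveListing {

    private ArchiveListing() {
        // static helper only, no instances
    }

    static List<String> listEntries(InputStream in) throws IOException {
        List<String> names = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(in)) {
            ZipEntry entry;
            // getNextEntry() closes the previous entry, skipping its unread
            // data, and returns null once no entries remain.
            while ((entry = zip.getNextEntry()) != null) {
                names.add(entry.getName());
            }
        }
        return names;
    }
}
```

A caller would pass any zip or jar stream, for example ArchiveListing.listEntries(new FileInputStream("example.zip")), where example.zip is a placeholder file name. Entries come back in the order they are stored in the archive, and directory entries, if present, appear with a trailing slash.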