« Return to Thread: [jira] Created: (TIKA-247) parse language and category from MS Office properties

[jira] Updated: (TIKA-247) parse language and category from MS Office properties

by JIRA jira@apache.org :: Rate this Message:

Reply to Author | View in Thread


     [ https://issues.apache.org/jira/browse/TIKA-247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Daan de Wit updated TIKA-247:
-----------------------------

    Attachment: tika-0.3_ms-office-metadata.patch

fixed npe in patch, all tests pass now

> parse language and category from MS Office properties
> -----------------------------------------------------
>
>                 Key: TIKA-247
>                 URL: https://issues.apache.org/jira/browse/TIKA-247
>             Project: Tika
>          Issue Type: Improvement
>          Components: metadata
>    Affects Versions: 0.1-incubating, 0.2, 0.3, 0.4
>            Reporter: Daan de Wit
>         Attachments: tika-0.3_ms-office-metadata.patch, tika-0.3_ms-office-metadata.patch
>
>
> The parser for MS Office documents (.doc) does not add the language and category properties to the metadata

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

 « Return to Thread: [jira] Created: (TIKA-247) parse language and category from MS Office properties