Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tika not extracting table with Content control fields from word document #125

Open
mistakenjockey opened this issue Oct 3, 2018 · 2 comments

Comments

@mistakenjockey
Copy link

Hi,
I have a word document which contains normal tables and there are some table with content control. Tika extract's the text of document and content of normal table perfectly but skip the table which has content control over it. How to extract the data from table with content control .

"Content controls are individual controls that you can add and customize for use in templates, forms, and documents. "

@KevM
Copy link
Owner

KevM commented Oct 3, 2018

Sorry you are having problems. That part of Tika (Office document extraction) is controlled by POI. I'd take a look over there to see if they support the desired capability.

@mistakenjockey
Copy link
Author

Thanks for the reply. keep posted if you find something which can resolve the issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants