Replies: 2 comments
-
Hi @NILICK, have you tried a layout-aware library such as unstructured.io for this?
-
Hi @mrm1001, thanks for your suggestion. I'll try it.
-
I'm using Haystack 2.0 and Qdrant to create a vector database for PDF files. I wrote the following code to achieve this:
However, I'm having trouble removing the "References" section before the data is added to the database. Is there a way to include only specific sections from PDFs when populating the vector database?
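For illustration, one library-agnostic way to drop the section is to cut the extracted text at the "References" heading before it is turned into documents and written to the store. The sketch below is only a rough starting point: `strip_references` is a hypothetical helper (not part of Haystack or Qdrant), and it assumes the PDF converter returns plain text in which the heading sits on its own line, optionally numbered.

```python
import re

# Assumption: the heading is alone on its line, e.g. "References" or "7. REFERENCES".
_REFERENCES_HEADING = re.compile(
    r"^[ \t]*(?:\d+\.?[ \t]*)?(?:references|bibliography)[ \t]*$",
    re.IGNORECASE | re.MULTILINE,
)


def strip_references(text: str) -> str:
    """Return the text before the last references heading, or the text unchanged."""
    matches = list(_REFERENCES_HEADING.finditer(text))
    if not matches:
        # No heading found: leave the text as it is rather than guess.
        return text
    # Cut at the last match so an earlier "References" line
    # (for example in a table of contents) does not truncate the body.
    return text[: matches[-1].start()].rstrip()
```

This would be applied to each document's text after PDF conversion and before splitting and embedding. It will miss PDFs where the heading is merged into another line during extraction, which is where a layout-aware parser is likely to do better.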