Document Search by Metadata
💡 PageIndex with metadata support is in closed beta. Fill out this form to request early access to this feature.
For documents that can be easily distinguished by metadata, we recommend using metadata to search the documents.
Example documents:
- Financial reports categorized by company and time period
- Legal documents categorized by case type
- Medical records categorized by patient or condition
- And many others
In such cases, you can search documents by leveraging their metadata. A popular method is to use “Query to SQL” for document retrieval.
Example Pipeline
1. PageIndex Tree Generation
Upload all documents into PageIndex to get their doc_id.
2. Set up SQL tables
Store documents along with their metadata and the PageIndex doc_id in a database table.
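As a rough illustration of this step, the sketch below creates a metadata table with Python's built-in sqlite3 module. The table name, column names, and sample rows (including the doc_id values) are hypothetical and chosen only for the financial-report example; adapt them to whatever metadata distinguishes your documents.

```python
import sqlite3

# Hypothetical schema and sample data, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE documents (
        doc_id      TEXT PRIMARY KEY,  -- ID returned by PageIndex after upload
        company     TEXT NOT NULL,
        doc_type    TEXT NOT NULL,
        fiscal_year INTEGER NOT NULL
    )
    """
)

rows = [
    ("doc-001", "Acme Corp", "annual_report", 2023),
    ("doc-002", "Acme Corp", "annual_report", 2024),
    ("doc-003", "Globex", "annual_report", 2024),
]
# Parameterized inserts keep metadata values out of the SQL text itself.
conn.executemany("INSERT INTO documents VALUES (?, ?, ?, ?)", rows)
conn.commit()

count = conn.execute("SELECT COUNT(*) FROM documents").fetchone()[0]
```

Keeping doc_id as the primary key means each SQL result row maps directly to one PageIndex document.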
3. Query to SQL
Use an LLM to transform a user’s retrieval request into a SQL query to fetch relevant documents.
4. Retrieve with PageIndex
Use the PageIndex doc_id of the retrieved documents to perform further retrieval via the PageIndex retrieval API.
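Steps 3 and 4 might be wired together roughly as follows. This is a minimal sketch under stated assumptions: the schema, the prompt wording, the helper names (build_prompt, is_safe_select, find_doc_ids), and the fake_llm stand-in are all hypothetical, and the PageIndex retrieval call itself is not shown because its interface is not described here. Replace fake_llm with a call to your LLM provider of choice.

```python
import re
import sqlite3
from typing import Callable, List

# Hypothetical schema description handed to the LLM.
SCHEMA = "documents(doc_id TEXT, company TEXT, doc_type TEXT, fiscal_year INTEGER)"


def build_prompt(request: str) -> str:
    """Build a prompt asking the LLM for one SELECT that returns doc_id."""
    return (
        "Translate the request into a single SQLite SELECT statement that "
        f"returns only doc_id from this table: {SCHEMA}\n"
        f"Request: {request}\nSQL:"
    )


def is_safe_select(sql: str) -> bool:
    """Accept only a single statement that starts with SELECT."""
    stripped = sql.strip().rstrip(";")
    return bool(re.match(r"(?i)^select\b", stripped)) and ";" not in stripped


def find_doc_ids(
    conn: sqlite3.Connection, request: str, llm: Callable[[str], str]
) -> List[str]:
    """Turn a natural-language request into SQL and return matching doc_ids."""
    sql = llm(build_prompt(request))
    if not is_safe_select(sql):
        raise ValueError(f"Refusing to run generated SQL: {sql!r}")
    return [row[0] for row in conn.execute(sql)]


def fake_llm(prompt: str) -> str:
    # Stand-in for a real LLM call; returns a canned query for the demo.
    return (
        "SELECT doc_id FROM documents "
        "WHERE company = 'Acme Corp' AND fiscal_year = 2024;"
    )


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE documents (doc_id TEXT PRIMARY KEY, company TEXT, "
    "doc_type TEXT, fiscal_year INTEGER)"
)
conn.executemany(
    "INSERT INTO documents VALUES (?, ?, ?, ?)",
    [
        ("doc-001", "Acme Corp", "annual_report", 2023),
        ("doc-002", "Acme Corp", "annual_report", 2024),
        ("doc-003", "Globex", "annual_report", 2024),
    ],
)

doc_ids = find_doc_ids(conn, "Acme Corp's 2024 annual report", fake_llm)
# Each doc_id would then be passed to the PageIndex retrieval API (not shown).
```

Because the SQL is model-generated, the sketch rejects anything other than a single SELECT before executing it. In production, a read-only database connection is a stronger safeguard than string checks alone.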
💬 Help & Community
Contact us if you need any advice on conducting document searches for your use case.
- 🤝 Join our Discord 
- 📨 Leave us a message 