What is a Document Family?
A Document Family is Kodexa’s core entity representing a single document with all its versions, processing history, metadata, and extracted data. Every document uploaded to a store creates a document family.
Accessing Document Families
There are two ways to access documents in Kodexa:
1. Through Stores (File System Style)
Use the Stores API when you know the document’s path.
2. Direct Access (By ID)
Use the DocumentFamilies API when you have the document family ID.
When to Use DocumentFamilies API
Use the /api/documentFamilies endpoints when you need to:
- Access a specific document by ID: when you have the UUID from processing results or webhooks
- Get external data: retrieve data from external systems associated with the document
- Check processing steps: view the complete processing pipeline and transformations
- Update document status: change workflow status (PROCESSING, COMPLETE, FAILED, etc.)
- Manage knowledge features: add or remove knowledge base entries linked to the document
- Trigger events: send document update notifications without modifying content
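
The operations listed above all address a document family by its ID under the /api/documentFamilies prefix. The following is a minimal sketch of how such requests could be assembled in Python. It only builds the request and sends nothing. The host name, the `x-access-token` header, and the sub-resource names passed as `suffix` (for example `externalData`) are assumptions made for illustration; check your Kodexa instance's API reference for the actual paths and authentication scheme.

```python
import json
import urllib.request
from typing import Any, Optional

# Assumed placeholder host; substitute the URL of your own Kodexa instance.
BASE_URL = "https://kodexa.example.com"


def family_request(
    family_id: str,
    suffix: str = "",
    method: str = "GET",
    payload: Optional[Any] = None,
    token: str = "YOUR_TOKEN",
) -> urllib.request.Request:
    """Build (but do not send) a request for one document family.

    `suffix` is a hypothetical sub-resource name such as "externalData";
    it is not confirmed by this page.
    """
    url = f"{BASE_URL}/api/documentFamilies/{family_id}"
    if suffix:
        url += f"/{suffix}"

    # Serialize the body only when a payload is supplied.
    data = json.dumps(payload).encode("utf-8") if payload is not None else None

    # The authentication header name is an assumption.
    headers = {"Accept": "application/json", "x-access-token": token}
    if data is not None:
        headers["Content-Type"] = "application/json"

    return urllib.request.Request(url, data=data, headers=headers, method=method)


# Example: prepare an update of external data for a family ID taken from a webhook.
req = family_request("abc-123", "externalData", method="PUT", payload={"erpId": "X"})
```

Passing the resulting object to `urllib.request.urlopen` would send the request. Keeping construction separate from sending makes the URL and body easy to inspect or log first.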
Key Operations
Get External Data
Documents can store data from external systems (ERP, CRM, databases).
Update External Data
Store references or metadata from external systems.
Get Processing Steps
View the complete processing pipeline.
Update Document Status
Change workflow status.
Add Knowledge Features
Link document to knowledge base.
Touch Document
Trigger events without changes.
External Data Use Cases
External data provides a bridge between Kodexa and your business systems:
ERP Integration
CRM Tracking
Workflow State
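
As an illustration of the three use cases above (ERP integration, CRM tracking, workflow state), the sketch below assembles one external data payload with a section for each. It assumes external data is a JSON object. Every key and value shown (`erp`, `purchaseOrder`, `crm`, `accountId`, `workflow`, `stage`, and so on) is hypothetical and chosen only for the example; Kodexa does not prescribe these names on this page.

```python
import json
from typing import Any, Dict


def build_external_data(existing: Dict[str, Any], **sections: Any) -> Dict[str, Any]:
    """Return a copy of `existing` with the given sections added or replaced."""
    merged = dict(existing)   # shallow copy, so the caller's dict is not mutated
    merged.update(sections)
    json.dumps(merged)        # raises TypeError early if a value is not JSON-serializable
    return merged


# Hypothetical payload covering all three use cases.
external = build_external_data(
    {},
    erp={"purchaseOrder": "PO-0001", "vendorId": "V-0001"},   # ERP integration
    crm={"accountId": "ACC-0001", "owner": "sales-team"},     # CRM tracking
    workflow={"stage": "awaiting-approval", "attempt": 1},    # workflow state
)
```

Keeping each system under its own top-level key means one integration can update its section without overwriting another's.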
Processing Steps Explained
Processing steps track every transformation.
Document Status Values
Common status values for workflow management:
Status | Description | Use Case |
---|---|---|
UPLOADED | Document uploaded, awaiting processing | Initial state |
PROCESSING | AI processing in progress | During extraction |
PROCESSED | Processing complete, data extracted | Ready for review |
REVIEW | Awaiting human review | Quality control |
APPROVED | Reviewed and approved | Ready for export |
REJECTED | Rejected during review | Needs correction |
FAILED | Processing failed | Error handling |
ARCHIVED | Archived for retention | Long-term storage |
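
When status values are set from client code, checking them before sending an update can catch typos early. The sketch below mirrors the table above as a Python enum. The class name `DocumentStatus` and the helper `parse_status` are hypothetical. Because the table lists only common values, a given deployment may accept others that this strict check would reject.

```python
from enum import Enum


class DocumentStatus(str, Enum):
    """Status values copied from the table above."""
    UPLOADED = "UPLOADED"
    PROCESSING = "PROCESSING"
    PROCESSED = "PROCESSED"
    REVIEW = "REVIEW"
    APPROVED = "APPROVED"
    REJECTED = "REJECTED"
    FAILED = "FAILED"
    ARCHIVED = "ARCHIVED"


def parse_status(value: str) -> DocumentStatus:
    """Normalize a status string and reject values not in the table."""
    try:
        return DocumentStatus(value.strip().upper())
    except ValueError:
        allowed = ", ".join(s.value for s in DocumentStatus)
        raise ValueError(
            f"Unknown status {value!r}; expected one of: {allowed}"
        ) from None


status = parse_status(" processed ")
```

Since the enum also subclasses `str`, `status.value` can be placed directly into a JSON request body.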