
Welcome to Kodexa
Kodexa is a comprehensive platform for deploying intelligent document processing solutions. Whether youโre extracting data from invoices, processing contracts, or building custom document workflows, Kodexa provides the infrastructure, tools, and AI capabilities you need.Documentation Structure
This developer portal is organized into six main sections, each designed to help you at different stages of your journey with Kodexa:Welcome
Start here for an overview of the platform and recent changes
Concepts
Understand core platform concepts: organizations, projects, models, storage, and data structures
Getting Started
Quick starts, deployment guides, notebooks, and project templates to get you building fast
CLI Tools
Command-line tools for local development, GitOps workflows, and infrastructure management
Guides
In-depth guides for knowledge systems, formulas, and data forms
API Reference
Complete REST API documentation for programmatic access to all platform features
Where to Start
New to Kodexa?
1
Understand the Basics
Start with Concepts to learn about organizations, projects, documents, and models
2
Get Your Hands Dirty
Follow Getting Started with Python to build your first document processing workflow
3
Deploy with GitOps
Learn modern deployment workflows using GitHub Actions and version control
Building Production Applications?
GitOps Deployment
Automate deployments with GitHub Actions, multi-environment promotion, and PR validation
KDX CLI
Manage resources, sync metadata, and control your Kodexa infrastructure from the command line
API Integration
Integrate Kodexa into your applications with our comprehensive REST API
Data Forms
Build custom UIs for document review, validation, and data extraction
Working with Specific Features?
Document Processing & Models
Document Processing & Models
- Documents - Understanding document structure and content
- Models - Training and deploying ML models
- Working with Models - Practical model usage
- Model Runtimes - Deployment and scaling
Storage & Data
Storage & Data
- Document Stores - Persistent document storage
- Data Stores - Structured data storage
- Storage Overview - Platform storage architecture
- Taxonomies - Classification hierarchies
Knowledge & Intelligence
Knowledge & Intelligence
- Knowledge Systems - Building intelligent rule systems
- Formulas - Dynamic calculations and transformations
- Assistants - AI-powered document assistants
Infrastructure & Deployment
Infrastructure & Deployment
- Deployment Overview - GitOps workflows
- Resource Operations - Managing platform resources
- Metadata Sync - Version control for configurations
Platform Capabilities
๐ Document Processing
Process any document typeโPDFs, images, Word documents, emailsโwith built-in OCR, layout analysis, and content extraction.๐ค AI & Machine Learning
Train custom models for classification, extraction, and validation. Leverage pre-built models or bring your own.๐ Workflow Automation
Build end-to-end document workflows with assistants, event handlers, and custom processing pipelines.๐ Data Extraction
Extract structured data from unstructured documents using ML models, rules, and hybrid approaches.๐ Validation & Review
Create custom review interfaces with data forms, exception handling, and human-in-the-loop workflows.๐ GitOps Deployment
Manage infrastructure as code with version control, code review, and multi-environment promotion.Need Help?
Support
Visit our support portal for articles, tutorials, and troubleshooting
Contact Us
Get in touch with our team for personalized assistance

