Install OCR for Development¶
This guide shows how to set up OCR for local development using pixi.
Prerequisites¶
- Python 3.12+
- Git
Steps¶
1. Install pixi¶
For other platforms, see pixi.sh.
2. Clone and install dependencies¶
3. Configure environment¶
Edit .env and set required variables:
OCR_STORAGE_ROOT— where outputs are written (e.g.,s3://your-bucket/or local path)OCR_ENVIRONMENT— environment name (QA,STAGING, orPROD)OCR_DEBUG— set to1for verbose logging
4. Verify installation¶
5. Run tests¶
Next steps¶
- Run pre-commit checks:
pre-commit run --all-files - Run integration tests with snapshots:
pixi run tests -m integration - See Data Pipeline Tutorial to run the processing pipeline
- Read Project Structure to understand the codebase