Extraction Engine
- Hybrid lattice & stream table detection
- OCR fallback for image-based PDFs
- Rule-based statement classification
Unit & Scale Detection
- Auto-detect "in thousands" vs "in millions"
- Consistent normalization across sheets
Validation Layer
- Accounting cross-checks
- Roll-forward consistency
- Confidence metrics
Excel Outputs
- Banker-friendly formatting
- Traffic-light validation sheet
- Provenance tab for audit trail
Flexible Workflows
- Upload multiple PDFs for batch processing
- Smart page selection from large CIM documents
- Consolidate pages within a single PDF (e.g., page 5 = FY2023, page 6 = FY2024)
- Side-by-side period comparisons with YoY analysis
Data Privacy
- No cloud training
- Auto-delete after processing
- Secure encrypted file storage