Back to Projects
Professional ProjectAI/ML

Azure OCR Migration & Integration

Timeline: 2021 (2 months)
Role: AI Integration Engineer

Overview

Migrated entire document processing pipeline from legacy OCR provider to Azure Document Intelligence in 2 months, avoiding 100% price increase while gaining new capabilities.

Challenge

Legacy OCR provider threatened 100% price increase (doubling costs on hundreds of thousands EUR monthly spend). Azure APIs provided different result formats and data structures. Could not disrupt service during migration. New service had to integrate into existing Invoicetrack workflow.

Solution & Approach

Evaluated Azure Forms Recognizer/Document Intelligence/Content Understanding APIs for document processing capabilities. Designed and implemented adapter layer to normalize API response differences across Azure services. Architected seamless migration strategy with zero-downtime deployment. Integrated new additional information provided by Azure OCR into workflow, enabling use in other applications. Completed full migration in 2 months.

Outcome & Impact

Avoided 100% price increase from legacy provider (hundreds of thousands EUR monthly). Zero production issues since migration. New OCR data now used across multiple applications in Invoicetrack. Future-proof solution with Microsoft continuously improving OCR and adding AI capabilities to Content Understanding.

Technologies Used

Azure Forms RecognizerAzure Document IntelligenceAzure Content UnderstandingPythonAPI integration

Key Highlights

  • Avoided 100% price increase (hundreds of thousands EUR monthly)
  • Completed complex migration in 2 months
  • Zero-downtime deployment with no service disruption
  • New OCR data integrated and used across multiple applications
  • Future-proof with Microsoft's continuous AI improvements
View All Projects