Headline Impact
10,000+
Hours of Manual Labor Eliminated
EdTech
OCR
Mathematical Content
MathML
Output in MathML & MathType for platform-native rendering
Batch
Multi-image merging to minimize API costs at scale
The Client
CollegeDoors
CollegeDoors is an EdTech platform founded by IIT and IIM graduates. It provides test preparation content to engineering and medical aspirants across India.
The Challenge
Low-Resolution Math Images Driving Customer Churn
CollegeDoors' test series relied on low-resolution images, which made the product look unprofessional and pushed customers toward competitors. The company needed to convert thousands of images containing mathematical equations and chemical formulas into clean, platform-native text, but standard OCR solutions could not handle mathematical notation.
What We Built
Cost-Optimized Mathematical OCR Pipeline
1. OCR Benchmarking
Evaluated Tesseract, Pix2Text, Mathpix, and Amazon Textract. Selected Mathpix for its superior handling of mathematical content.
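A benchmark of this kind can be approximated by scoring each engine's output against hand-checked LaTeX. The sketch below is only an illustration: the scoring metric (whitespace-insensitive string similarity via `difflib`), the function names, and the sample data are assumptions, not the evaluation method actually used in the project.

```python
from difflib import SequenceMatcher
from statistics import mean


def normalize(latex: str) -> str:
    """Drop all whitespace so spacing differences do not count as errors."""
    return "".join(latex.split())


def similarity(predicted: str, expected: str) -> float:
    """Return a 0..1 similarity ratio between two LaTeX strings."""
    return SequenceMatcher(None, normalize(predicted), normalize(expected)).ratio()


def rank_engines(ground_truth: list, outputs: dict) -> list:
    """Rank engines by mean similarity to the ground truth, best first.

    `outputs` maps a (hypothetical) engine name to its predictions,
    given in the same order as `ground_truth`.
    """
    scores = {
        engine: mean(similarity(p, t) for p, t in zip(predictions, ground_truth))
        for engine, predictions in outputs.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    truth = ["x^2 + y^2 = r^2"]
    sample = {
        "engine_a": ["x^2+y^2=r^2"],    # differs only in spacing
        "engine_b": ["x2 + y2 = r2"],   # lost the superscripts
    }
    for name, score in rank_engines(truth, sample):
        print(f"{name}: {score:.2f}")
```

String similarity is a rough proxy. A stricter benchmark might compare rendered output or parsed expression trees instead.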
2. Multi-Image Merging
Built a system that merges multiple images into a single composite, separated by placeholder text, before each API call. One request then covers several images, which significantly reduces the per-image processing cost.
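The bookkeeping behind such merging can be sketched as follows, assuming images are stacked vertically with a printed marker strip above each one and that the OCR response returns those markers as plain text. The marker format (`IMGSEP0001`), the strip height, and all function names are hypothetical. The actual image compositing, which would need an imaging library, is left out.

```python
import re

# Hypothetical marker printed above each image in the merged canvas.
MARKER = re.compile(r"IMGSEP(\d{4})")


def marker_for(index: int) -> str:
    """Text rendered in the placeholder strip above image `index`."""
    return f"IMGSEP{index:04d}"


def plan_layout(heights: list, marker_height: int = 40):
    """Compute vertical offsets for stacking images under marker strips.

    Returns a list of (marker_y, image_y) pairs and the total canvas height.
    """
    y = 0
    placements = []
    for height in heights:
        marker_y = y
        image_y = y + marker_height
        placements.append((marker_y, image_y))
        y = image_y + height
    return placements, y


def split_merged_output(text: str) -> dict:
    """Split one OCR response back into per-image results keyed by index."""
    # With a capturing group, re.split yields:
    # [text_before, idx_1, chunk_1, idx_2, chunk_2, ...]
    parts = MARKER.split(text)
    return {int(parts[i]): parts[i + 1].strip() for i in range(1, len(parts), 2)}


if __name__ == "__main__":
    print(plan_layout([100, 50]))
    print(split_merged_output("IMGSEP0000 x^2 + 1 IMGSEP0001 H_2O"))
```

A real pipeline would also need to handle markers that the OCR engine misreads, for example by retrying the affected images individually.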
3. Format Conversion
Converts Mathpix LaTeX output into MathML and MathType formats matching the platform's UI specifications.
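To show what the target format looks like, here is a toy converter for a very small LaTeX subset: single letters, numbers, operators, braces, `^`, and `_`. It is an illustrative sketch, not the project's converter. It does not handle backslash commands such as `\frac`, and a production pipeline would more likely rely on an established LaTeX-to-MathML library.

```python
import re
import xml.etree.ElementTree as ET

# Numbers, single letters, structural characters, then any other symbol.
TOKEN = re.compile(r"\d+(?:\.\d+)?|[A-Za-z]|[{}^_]|[^\s]")


def _atom(tokens, pos):
    """Parse one atom (a token or a braced group); return (element, next_pos)."""
    tok = tokens[pos]
    if tok == "{":
        row = ET.Element("mrow")
        pos += 1
        while tokens[pos] != "}":  # assumes braces are balanced
            child, pos = _term(tokens, pos)
            row.append(child)
        return row, pos + 1
    if tok[0].isdigit():
        tag = "mn"   # number
    elif tok.isalpha():
        tag = "mi"   # identifier
    else:
        tag = "mo"   # operator
    element = ET.Element(tag)
    element.text = tok
    return element, pos + 1


def _term(tokens, pos):
    """Parse an atom followed by any number of ^ or _ scripts."""
    base, pos = _atom(tokens, pos)
    while pos < len(tokens) and tokens[pos] in ("^", "_"):
        tag = "msup" if tokens[pos] == "^" else "msub"
        script, pos = _atom(tokens, pos + 1)
        wrapper = ET.Element(tag)
        wrapper.append(base)
        wrapper.append(script)
        base = wrapper
    return base, pos


def latex_to_mathml(latex: str) -> str:
    """Convert the supported LaTeX subset to a MathML string."""
    tokens = TOKEN.findall(latex)
    math = ET.Element("math", xmlns="http://www.w3.org/1998/Math/MathML")
    pos = 0
    while pos < len(tokens):
        element, pos = _term(tokens, pos)
        math.append(element)
    return ET.tostring(math, encoding="unicode")


if __name__ == "__main__":
    print(latex_to_mathml("x^2 + y_{10} = 1"))
```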
4. Automated Pipeline
Python service that fetches images, processes batches, converts formats, updates the database, and stores assets on S3.
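The overall flow can be sketched as a small orchestrator with each external step injected as a callable. Everything here is an assumption made for illustration: the class name, the callable signatures, and the default batch size. The real service's Mathpix requests, database updates, and S3 uploads would sit behind `ocr_batch` and `save`.

```python
from dataclasses import dataclass
from itertools import islice
from typing import Callable, Dict, Iterable, Iterator, List


def batched(items: Iterable, size: int) -> Iterator[list]:
    """Yield consecutive lists of at most `size` items."""
    iterator = iter(items)
    while batch := list(islice(iterator, size)):
        yield batch


@dataclass
class Pipeline:
    # Hypothetical hooks; real ones would wrap the OCR API, DB, and S3 clients.
    ocr_batch: Callable[[List[str]], Dict[str, str]]  # image ids -> LaTeX per id
    convert: Callable[[str], str]                     # LaTeX -> MathML
    save: Callable[[str, str], None]                  # persist result for an id
    batch_size: int = 5

    def run(self, image_ids: Iterable[str]) -> int:
        """Process all images in batches; return how many were saved."""
        processed = 0
        for batch in batched(image_ids, self.batch_size):
            for image_id, latex in self.ocr_batch(batch).items():
                self.save(image_id, self.convert(latex))
                processed += 1
        return processed


if __name__ == "__main__":
    store = {}
    pipeline = Pipeline(
        ocr_batch=lambda ids: {i: "x^2" for i in ids},  # fake OCR
        convert=lambda latex: f"<math>{latex}</math>",  # fake converter
        save=store.__setitem__,                         # fake DB/S3 write
        batch_size=2,
    )
    print(pipeline.run(["img1", "img2", "img3"]), store)
```

Injecting the steps keeps the batching logic testable without network access and lets each stage be swapped independently.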
Technology
Powered By
Mathpix API
LaTeX to MathML
MathType Conversion
Batch Processing
S3 Storage
Cost-Optimized API Usage
The Results
10,000+ Hours Saved — With 98%+ Accuracy on Math & Chemistry
Eliminated over 10,000 hours of manual image-to-text conversion while achieving 98%+ accuracy on mathematical equations and chemical formulas. Clean, platform-native mathematical rendering replaced the low-resolution images, removing the unprofessional appearance that was driving churn.
"Standard OCR can't handle math and chemistry. We needed something purpose-built — and 98%+ accuracy was the minimum threshold."
— CollegeDoors