Text Analytics using Gen AI
22000+ hours of manual work eliminated with 99%+ accurate classifiers
Reduce Costs
Headline Impact
22,000+
Hours of Manual Labor Eliminated with LLM-Powered Classification
FinTech Text Analytics LLM Classification
22,000+
Hours of manual labor eliminated through AI classification
2M+
Employee reviews processed and classified at scale
~20
Topic classes compressed for cost-efficient classification

Kelp — FinTech in Alternative Investments

Kelp — a FinTech company in alternative investments maintaining a proprietary database of 2M+ employee reviews. A small fraction contained investor-critical data on misconduct or workplace bias. Identifying them manually would have required 22,000+ hours of labor.

Finding the Needle in 2M+ Haystacks

Two problems: (1) Finding the needle in 2M+ haystacks — only a small fraction of employee reviews contained material information for investors, but manual identification was a 22,000+ hour impossibility. (2) Business analysts spent excessive hours extracting investment-sensitive information from news articles, market reports, and financial documents, creating delays in time-critical decisions.

LLM-Powered Classification & Extraction Pipeline

1. LLM-Powered Classification
Tested KNN with BERT embeddings, topicGPT, and OpenAI sentence embeddings before implementing GPT-4 with prompt engineering for maximum accuracy.
2. Data Cleaning Pipeline
Automated filtering to eliminate low-probability records before classification — dramatically reducing cost and processing time.
3. Topic Compression
Compressed classifications to ~20 topic classes for cost-efficient processing without sacrificing signal quality.
4. Model Evaluation Framework
Built pipelines enabling rapid iteration and easy switching to newer, cheaper LLMs as they become available.
5. Text Extraction Tools
Combined prompt tuning with post-processing to extract investment-sensitive information from unstructured documents.

Powered By

GPT-4 BERT Embeddings Prompt Engineering Topic Classification Model Evaluation Pipelines Automated Data Cleaning

22,000+ Hours Eliminated — Investor-Critical Data Surfaced at Scale

Eliminated 22,000+ hours of manual classification work. Processed 2M+ employee reviews, surfacing the investor-critical subset with high accuracy while building a reusable framework that can switch to cheaper models as they emerge.

"Tailored AI engages with Kelp on tasks involving building LLM-powered products for text analytics and extraction from unstructured datasets."

— Kelp

Ready to Transform Your Operations?

We've delivered $100M+ in business impact across IT services, healthcare, HR tech, and fintech.

Book a Scoping Call
Tailored AI Branding

We've delivered $100M+ impact across 5 industries

Let's scope what AI can do for yours

Book an Audit Today