Skip to content
AI Primer
release

lift-pdf releases 9B extractor with 90.2% accuracy and 9.5s p50

lift-pdf released an open-source 9B model for schema-constrained document extraction, with code, pip install, playground access, and a 90.2% score on the team's 225-document bench. It matters because the model claims near-Gemini 3.5 Flash accuracy at 9.5s p50, though coverage is still skewed toward Latin-language docs and commercial-use limits remain.

4 min read
lift-pdf releases 9B extractor with 90.2% accuracy and 9.5s p50
lift-pdf releases 9B extractor with 90.2% accuracy and 9.5s p50

TL;DR

You can grab the model, browse the code, and read the blog post. There is also a hosted playground path with $10 free credit, and the launch video already shows the basic invoice-to-JSON flow.

What shipped

The release is narrow and useful: a schema-constrained extractor, not a general document parser. In VikParuchuri's LiteParse clarification, he draws that line explicitly, saying LiteParse turns documents into markdown while this model extracts specific fields into a JSON schema.

The shipping surface is simple:

Benchmarks and latency

The main claim is near-frontier extraction accuracy without frontier-model latency. VikParuchuri's launch post puts the model at 90.2%, 1.1 points behind Gemini 3.5 Flash, and 8.7 points ahead of NuExtract3.

The benchmark setup in VikParuchuri's benchmark note is 225 documents across several categories, with document lengths from 6 to 64 pages and a focus on harder extraction cases. The closest like-for-like competitor in his replies is Azure Content Understanding, which VikParuchuri's Azure comparison reply lists at 83.4% accuracy and 73.7s p50 latency versus lift at 90.2% and 9.5s p50.

That speed claim is the interesting bit for engineering teams. A lot of document AI numbers look good until the pipeline turns into an async job queue.

Turbo mode

The broader product story is not just the open model. lift also shipped a turbo extraction mode through its API, and VikParuchuri's API reply says you enable it with extraction_mode: turbo.

The posted turbo numbers are:

VikParuchuri's playground note adds that turbo is the fast path, while fast and balanced modes trade speed for higher accuracy, citations, and verification.

Limits and gaps

The launch thread also surfaced the weak spots faster than the headline post did.

That makes this release feel more like a strong extraction specialist than a universal document stack. Even in the replies, VikParuchuri's model-backend reply says he is not fully sure what models some competing systems use, which is a good reminder that a lot of document-AI comparisons are still messy once you leave the benchmark table.

Further reading

Discussion across the web

Where this story is being discussed, in original context.

On X· 4 threads
TL;DR1 post
What shipped4 posts
Turbo mode2 posts
Limits and gaps4 posts
Share on X