Chroma launches Context-1, a 20B search agent with Apache 2.0 weights
Chroma released Context-1, a 20B search agent it says pushes the speed-cost-accuracy frontier for agentic search, with open weights on Hugging Face. Benchmark it against your current search stack before wiring it into production.

TL;DR
- Chroma launched Context-1, a 20B-parameter search agent, and says it "pushes the pareto frontier of agentic search" with "order of magnitude faster" and "order of magnitude cheaper" performance than prior agentic-search setups. (Context-1 launch)
- The release includes Apache 2.0 open-source weights, with the Hugging Face repo pointing engineers to a public chromadb/context-1 model package. (Hugging Face repo)
- Chroma describes Context-1 as "delivering the best accuracy, speed, and cost" across its internally generated benchmarks and public evals, including Browsecomp-Plus, SealQA, LongSealQA, and FRAMES. (Context-1 launch)
- Early practitioner reaction centers on search subagents: one researcher called them a "best case scenario for small RL'd models" because they are easy to factor out of larger traces and to generate synthetic data for. (search subagents take)
What exactly did Chroma ship?
Chroma's launch post positions Context-1 as a specialized 20B model for multi-step search, where "the output of one search often informs the next." The core claim is not just model quality but a better latency-cost tradeoff for long agentic retrieval trajectories, which Chroma says have been too expensive and slow with frontier LLMs. According to the launch thread, Context-1 is meant to improve "accuracy, speed, and cost" at the same time rather than trade one off for another.
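The multi-step pattern Chroma describes, where each search result feeds the next query, is the standard agentic retrieval loop. A minimal sketch with a stubbed model and search backend (all names here are illustrative, not Chroma's or Context-1's actual API):

```python
# Illustrative agentic search loop: the output of one search informs the next
# query. stub_model and stub_search stand in for a served model and a
# retrieval backend; Context-1's real serving interface is not shown here.

def stub_model(question: str, evidence: list[str]) -> dict:
    """Stand-in for a search agent: requests another search or answers."""
    if len(evidence) < 2:  # keep searching until there is enough evidence
        return {"action": "search", "query": f"{question} (step {len(evidence) + 1})"}
    return {"action": "answer", "text": " | ".join(evidence)}

def stub_search(query: str) -> str:
    """Stand-in for a retrieval backend (web search, vector store, etc.)."""
    return f"result for: {query}"

def agentic_search(question: str, max_steps: int = 5) -> str:
    evidence: list[str] = []
    for _ in range(max_steps):
        step = stub_model(question, evidence)
        if step["action"] == "answer":
            return step["text"]
        evidence.append(stub_search(step["query"]))  # next query uses prior results
    return " | ".join(evidence)  # step budget exhausted: return what was found

print(agentic_search("who maintains chromadb?"))
```

The latency-cost argument in the launch framing follows from this shape: every loop iteration is a full model call, so a trajectory of five or ten searches multiplies per-call price and latency, which is why a small specialized model can matter here.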
The practical implementation detail is that Chroma shipped weights under Apache 2.0, and the Hugging Face listing makes the model immediately accessible for self-hosting, benchmarking, or adapter work. Chroma also says the model was evaluated on both internal and public benchmarks, naming Browsecomp-Plus, SealQA, LongSealQA, and FRAMES in the announcement, but the evidence here does not include the full benchmark tables or serving requirements. (Context-1 launch)
How usable is it in developer workflows today?
The release is already being framed as something engineers can plug into search-heavy agent stacks rather than a research-only checkpoint. In early discussion, one practitioner argued that search subagents are a "best case scenario" for smaller RL-trained models because the task can be separated from larger agent traces and fed with effectively unbounded synthetic data. That matters if you're comparing Context-1 against a larger general-purpose model currently doing retrieval orchestration.
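The "effectively unbounded synthetic data" point can be made concrete: given a corpus of linked facts, multi-hop search questions can be composed programmatically, with the answer known by construction. A toy sketch (the corpus, relation names, and question template are invented for illustration):

```python
# Toy synthetic-data generator for training a search subagent: chain two
# linked facts into a question whose answer requires a two-step lookup.
# FACTS maps an entity to a (relation, target) pair; all data is invented.

FACTS = {
    "Chroma": ("develops", "Context-1"),
    "Context-1": ("has_size", "20B parameters"),
}

def make_two_hop_question(start: str) -> tuple[str, str]:
    """Compose a two-hop question and its ground-truth answer from FACTS."""
    rel1, mid = FACTS[start]       # hop 1: start -> mid
    rel2, answer = FACTS[mid]      # hop 2: mid -> answer
    question = f"What is the {rel2} of the entity that {start} {rel1}?"
    return question, answer

q, a = make_two_hop_question("Chroma")
print(q, "->", a)
```

The design point is that because the answer is derived from the same fact chain as the question, labels come for free, which is what makes RL-style training on search subagents cheap to scale.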
Workflow fit is still uneven. A short follow-up thread showed a minimal code demo around Chroma, while a reply asking "can I use this with claude code?" got the answer "no but you can with codex," alongside Codex advanced configuration docs. (Codex reply) That suggests the open weights are available now, but editor and coding-agent integration is still more ad hoc than turnkey in some popular agent environments.