H Company introduced Holo3, a computer-use model family with a 122B API model and an Apache 2.0 35B release on Hugging Face. Check the benchmark and pricing claims before assuming the model is ready for field deployment.

The official launch post has more useful detail than the tweet thread, including H's "agentic learning flywheel," a 486-task in-house benchmark, and an example multi-app workflow that crosses PDFs, budgets, and email. You can also inspect the open 35B model card, and H published the same writeup as a Hugging Face blog post.
H Company launched two models at once. Holo3-122B-A10B is the flagship API model at $0.40 per million input tokens and $3.00 per million output tokens, while Holo3-35B-A3B is positioned as the lighter release at $0.25 per million input and $1.80 per million output Release breakdown.
The company says all Holo3 models are available through its inference API, and the 35B weights are openly available on Hugging Face under Apache 2.0 with a free API tier in H's own stack, according to the launch post. The model card tags it as an image-text-to-text vision-language model for computer use and GUI agents, built as a finetune of Qwen3.5-35B-A3B.
The launch chart makes the headline claim simple: Holo3 is being sold as a computer-use model that reaches frontier scores without frontier pricing. H's plot places Holo3-122B-A10B at 78.9% on OSWorld-Verified and Holo3-35B-A3B at 77%, while GPT-5.4 and Opus 4.6 sit at roughly similar scores but much farther right on cost H Company launch tweet.
The fuller table adds two details that matter more than the scatter plot:
That near-tie between the two Holo3 variants is the interesting part. H is effectively arguing that most of the gain comes from specialized agent training, not just scaling the base model.
H's launch post says it built a proprietary "Synthetic Environment Factory" and a 486-task H Corporate benchmark to test enterprise workflows inside synthetic business software Benchmark table. The four categories are listed in the post and table: E-Commerce, Business Software, Collaboration, and Multi-Apps.
The results split cleanly by task shape:
The official writeup gives one concrete example of what H means by Multi-Apps: pulling equipment prices from a PDF, checking each employee's remaining budget, then sending approval or rejection emails automatically. That is a much better read on Holo3's current ceiling than the OSWorld headline score, because it shows exactly where the model still bends under longer cross-application workflows.
📊Here's what we're releasing: Holo3-122B: our most capable model, available via API at $0.40/M input · $3.00/M output Holo3-35B: a lighter, faster variant with nearly the same intelligence, fully open-source (Apache 2.0) on Hugging Face at $0.25/M input · $1.80/M output
Holo3 is here 🚀. Today, we're launching Holo3: our new series of frontier computer-use models. 78.9% on OSWorld-Verified. That puts us ahead of GPT-5.4 and Opus 4.6, at one-tenth of the cost. Weights on Hugging Face. API is live. Test it now! #Holo3 #OpenSource #ComputerUse Show more
H Company released Holo3, a new series of SOTA "Computer Use" models that outperform GPT-5.4 and Opus 4.6 on OSWorld-Verified and other benchmarks.
Holo3 is here 🚀. Today, we're launching Holo3: our new series of frontier computer-use models. 78.9% on OSWorld-Verified. That puts us ahead of GPT-5.4 and Opus 4.6, at one-tenth of the cost. Weights on Hugging Face. API is live. Test it now! #Holo3 #OpenSource #ComputerUse
H Company released Holo3, a new series of SOTA "Computer Use" models that outperform GPT-5.4 and Opus 4.6 on OSWorld-Verified and other benchmarks.
Holo3 is here 🚀. Today, we're launching Holo3: our new series of frontier computer-use models. 78.9% on OSWorld-Verified. That puts us ahead of GPT-5.4 and Opus 4.6, at one-tenth of the cost. Weights on Hugging Face. API is live. Test it now! #Holo3 #OpenSource #ComputerUse
I feel like there are too few computer use apps at this moment (being released) as everyone is building OpenClaw clones atm. They will for sure boom at some point. But if that will happen with a help of H models is unclear indeed.