H Company launched Holo3 for computer use and released a 35B model on Hugging Face. Teams get a smaller inspectable option for GUI navigation, but the benchmark and pricing claims come from vendor materials.

The useful reveals are pretty simple: the official launch page says H trained Holo3 in synthetic enterprise environments, the quickstart docs show an OpenAI-compatible chat completions API with schema-constrained outputs, and the Hugging Face model card confirms the open 35B variant is a Qwen3.5 fine-tune under Apache 2.0. If you care about desktop agents, those three details matter more than the victory-lap tweets.
H Company built the launch around one figure: 78.85 percent on OSWorld-Verified for Holo3-122B-A10B. The official post positions that as state of the art on a benchmark for desktop computer use, and the comparison graphic places Holo3 above GPT-5.4, Sonnet 4.6, Opus 4.6, Qwen3.5, and Kimi in that specific setting.
That is the most important claim in the story, but it is still a vendor claim. If you want the benchmark context, BenchLM's OSWorld-Verified explainer is a decent refresher on what these tasks are actually testing: screen perception, stepwise action, state tracking, and recovery across real software interfaces.
The bigger practical move is the open Holo3-35B-A3B release. The model card describes it as a sparse MoE vision-language model for navigation and computer-use agents, with 35B total parameters and 3B active, fine-tuned from Qwen/Qwen3.5-35B-A3B and released under Apache 2.0.
That combination is rare enough to matter. Teams that want an inspectable GUI model now have a smaller open option instead of jumping straight to an API-only frontier model. The launch tweet also notes official Transformers support community summary, which lowers the friction for trying it in existing inference stacks.
H's quickstart and Models API docs show what the company thinks developers will actually do with Holo3:
extra_bodyThat last point is easy to miss. Holo3 is presented in the docs as a native reasoning model, and the examples explicitly show message.reasoning alongside normal content. For agent builders, that is more concrete than benchmark talk because it tells you how the model is expected to be wired into automation loops.
The launch page spends a lot of time on what H calls an "agentic learning flywheel." The ingredients are synthetic navigation data, out-of-domain augmentation, and curated reinforcement learning. H also says it built a proprietary "Synthetic Environment Factory" to generate enterprise-like websites and verifiable multi-step tasks.
That is the real story behind the benchmark jump. H is making a bet that desktop-agent progress comes from better synthetic task generation and verification, not just scaling a general model and hoping grounding improves on its own.
The comparison table in the shared launch material is a good reminder that Holo3 is strongest where H optimized it. The Holo3 variants lead on OSWorld-Verified, several single-app corporate categories, and ScreenSpot-Pro, but the same table shows weaker relative results on the multi-app category, where Kimi-K2.5 and Claude Sonnet 4.6 score higher community summary.
So the clean read is not "new best model, full stop." The cleaner read is that H seems to have built a very strong specialist for GUI navigation and grounded action, and the open 35B version makes that specialization accessible enough for engineers to test in real workflows.
H Company is pushing two tracks at once. The flagship Holo3-122B-A10B is available through H's inference API, while the 35B-A3B model is both in the API and openly released on Hugging Face under Apache 2.0 pricing summary model link.
That split makes sense. The 122B model is the benchmark spearhead, but the 35B model is the adoption wedge. If Holo3 gets traction outside demos, it will probably come from teams benchmarking the open 35B model on their own UI tasks, then deciding whether the flagship is worth paying for.
Holo3 is here π. Today, we're launching Holo3: our new series of frontier computer-use models. 78.9% on OSWorld-Verified. That puts us ahead of GPT-5.4 and Opus 4.6, at one-tenth of the cost. Weights on Hugging Face. API is live. Test it now! #Holo3 #OpenSource #ComputerUse Β Show more
Ngl, reaching 78,9% on OS-world and outperforming even GPT-5.4 at 1/10 cost is a big deal
Holo3 is here π. Today, we're launching Holo3: our new series of frontier computer-use models. 78.9% on OSWorld-Verified. That puts us ahead of GPT-5.4 and Opus 4.6, at one-tenth of the cost. Weights on Hugging Face. API is live. Test it now! #Holo3 #OpenSource #ComputerUse
Holo3, new model of @hcompany_ai outperforming closed and larger open models on GUI navigation π₯ > A3B/35B based on Qwen3.5 > officially supported in transformers π€ > free license π
H Company launches Holo3 series of frontier computer-use models scoring 78.9% on OSWorld-Verified, ahead of GPT-5.4 and Opus 4.6 at one-tenth the cost, with the 122B model at $0.40/M input and $3.00/M output plus the 35B open-source variant at $0.25/M input and $1.80/M output.Β Show more
Holo3 is here π. Today, we're launching Holo3: our new series of frontier computer-use models. 78.9% on OSWorld-Verified. That puts us ahead of GPT-5.4 and Opus 4.6, at one-tenth of the cost. Weights on Hugging Face. API is live. Test it now! #Holo3 #OpenSource #ComputerUse
Holo3 is here π. Today, we're launching Holo3: our new series of frontier computer-use models. 78.9% on OSWorld-Verified. That puts us ahead of GPT-5.4 and Opus 4.6, at one-tenth of the cost. Weights on Hugging Face. API is live. Test it now! #Holo3 #OpenSource #ComputerUse Β Show more
Holo3, new model of @hcompany_ai outperforming closed and larger open models on GUI navigation π₯ > A3B/35B based on Qwen3.5 > officially supported in transformers π€ > free license π
H Company launches Holo3 series of frontier computer-use models scoring 78.9% on OSWorld-Verified, ahead of GPT-5.4 and Opus 4.6 at one-tenth the cost, with the 122B model at $0.40/M input and $3.00/M output plus the 35B open-source variant at $0.25/M input and $1.80/M output.Β Show more
Holo3 is here π. Today, we're launching Holo3: our new series of frontier computer-use models. 78.9% on OSWorld-Verified. That puts us ahead of GPT-5.4 and Opus 4.6, at one-tenth of the cost. Weights on Hugging Face. API is live. Test it now! #Holo3 #OpenSource #ComputerUse