releaseApril 1, 2026

H Company launches Holo3, claims 78.9% on OSWorld-Verified

H Company launched Holo3 for computer use and released a 35B model on Hugging Face. Teams get a smaller inspectable option for GUI navigation, but the benchmark and pricing claims come from vendor materials.

Multimodal Computer Use Benchmarks

5 min read

H Company launches Holo3, claims 78.9% on OSWorld-Verified

TL;DR

H Company launched Holo3, a computer-use model family aimed at GUI navigation and desktop agents, and the flagship 122B-A10B model claims 78.85 percent on OSWorld-Verified, according to the company launch materials launch post pricing summary.
The more practical release for many teams is the open Holo3-35B-A3B model card, which is Apache 2.0 licensed, based on Qwen3.5-35B-A3B, and already tagged for Transformers support community summary model link.
The benchmark chart matters because Holo3 is not just inching ahead, it is claiming a large jump over prior Holo and Qwen variants on OSWorld-Verified, with a much lower cost point than top proprietary models in the same chart cost-performance chart benchmark reaction.
The fine print matters too: the strongest benchmark and pricing claims in circulation come from H Company materials and reposts, while the official docs are more concrete about API shape, multimodal inputs, and structured outputs than about independent evaluation context vendor claims model link.

The useful reveals are pretty simple: the official launch page says H trained Holo3 in synthetic enterprise environments, the quickstart docs show an OpenAI-compatible chat completions API with schema-constrained outputs, and the Hugging Face model card confirms the open 35B variant is a Qwen3.5 fine-tune under Apache 2.0. If you care about desktop agents, those three details matter more than the victory-lap tweets.

OSWorld-Verified is the headline number

@hcompany_ai

·Follow

Holo3 is here 🚀. Today, we're launching Holo3: our new series of frontier computer-use models. 78.9% on OSWorld-Verified. That puts us ahead of GPT-5.4 and Opus 4.6, at one-tenth of the cost. Weights on Hugging Face. API is live. Test it now! #Holo3 #OpenSource #ComputerUse Show more

4:44 PM · Mar 31, 2026

·Follow

Ngl, reaching 78,9% on OS-world and outperforming even GPT-5.4 at 1/10 cost is a big deal

@hcompany_ai

2:31 PM · Apr 1, 2026

821

Read 20 replies

H Company built the launch around one figure: 78.85 percent on OSWorld-Verified for Holo3-122B-A10B. The official post positions that as state of the art on a benchmark for desktop computer use, and the comparison graphic places Holo3 above GPT-5.4, Sonnet 4.6, Opus 4.6, Qwen3.5, and Kimi in that specific setting.

That is the most important claim in the story, but it is still a vendor claim. If you want the benchmark context, BenchLM's OSWorld-Verified explainer is a decent refresher on what these tasks are actually testing: screen perception, stepwise action, state tracking, and recovery across real software interfaces.

The 35B open release is the real developer hook

merve

@mervenoyann

·Follow

Replying to @mervenoyann

huggingface.co/Hcompany/Holo3…

1:01 PM · Apr 1, 2026

Read 1 reply

merve

@mervenoyann

·Follow

Holo3, new model of @hcompany_ai outperforming closed and larger open models on GUI navigation 🔥 > A3B/35B based on Qwen3.5 > officially supported in transformers 🤗 > free license 👏

1:01 PM · Apr 1, 2026

166

Read 9 replies

The bigger practical move is the open Holo3-35B-A3B release. The model card describes it as a sparse MoE vision-language model for navigation and computer-use agents, with 35B total parameters and 3B active, fine-tuned from Qwen/Qwen3.5-35B-A3B and released under Apache 2.0.

That combination is rare enough to matter. Teams that want an inspectable GUI model now have a smaller open option instead of jumping straight to an API-only frontier model. The launch tweet also notes official Transformers support community summary, which lowers the friction for trying it in existing inference stacks.

The docs are more interesting than the marketing copy

Kol Tregaskes

@koltregaskes

·Follow

H Company launches Holo3 series of frontier computer-use models scoring 78.9% on OSWorld-Verified, ahead of GPT-5.4 and Opus 4.6 at one-tenth the cost, with the 122B model at $0.40/M input and $3.00/M output plus the 35B open-source variant at $0.25/M input and $1.80/M output. Show more

@hcompany_ai

11:30 PM · Apr 1, 2026

H Company's training story is synthetic environments

@hcompany_ai

·Follow

4:44 PM · Mar 31, 2026

2.0K

Read 66 replies

The launch page spends a lot of time on what H calls an "agentic learning flywheel." The ingredients are synthetic navigation data, out-of-domain augmentation, and curated reinforcement learning. H also says it built a proprietary "Synthetic Environment Factory" to generate enterprise-like websites and verifiable multi-step tasks.

That is the real story behind the benchmark jump. H is making a bet that desktop-agent progress comes from better synthetic task generation and verification, not just scaling a general model and hoping grounding improves on its own.

The benchmark spread suggests specialization, not general dominance

merve

@mervenoyann

·Follow

Holo3, new model of @hcompany_ai outperforming closed and larger open models on GUI navigation 🔥 > A3B/35B based on Qwen3.5 > officially supported in transformers 🤗 > free license 👏

1:01 PM · Apr 1, 2026

166

Read 9 replies

The comparison table in the shared launch material is a good reminder that Holo3 is strongest where H optimized it. The Holo3 variants lead on OSWorld-Verified, several single-app corporate categories, and ScreenSpot-Pro, but the same table shows weaker relative results on the multi-app category, where Kimi-K2.5 and Claude Sonnet 4.6 score higher community summary.

So the clean read is not "new best model, full stop." The cleaner read is that H seems to have built a very strong specialist for GUI navigation and grounded action, and the open 35B version makes that specialization accessible enough for engineers to test in real workflows.

Pricing and availability split the lineup in two

Kol Tregaskes

@koltregaskes

·Follow

@hcompany_ai

11:30 PM · Apr 1, 2026

🧾 More sources

TL;DR1 tweets

Core launch claims, open release, and caveats about vendor-supplied benchmark and pricing materials.