Fresh stories
Claude Code 2.1.193 adds live path autocomplete and OTEL response logs
Claude Code 2.1.193 routes all shell commands through auto-mode classification, adds live file path autocomplete in bash mode, and can emit assistant-response OpenTelemetry events. It also changes denial logging and response-logging defaults for teams instrumenting the CLI.

Google opens Gemini 3.5 Flash Computer Use in Gemini API with explicit confirmations
A day after Gemini 3.5 Flash Computer Use surfaced as a launch story, Google formally opened it through the Gemini API and Enterprise Agent Platform. Explicit user confirmation, automated task stopping, and an Android adb quickstart make the rollout concrete for agent builders.

Rivet releases agentOS v0.2.0 with WebAssembly sandboxing and 1738x cheaper claim
Rivet released agentOS v0.2.0, a Rust rewrite of its WebAssembly-based sandbox and orchestration stack with multiplayer workflows and one-prompt deployment. The release targets self-hosted and cloud agent runtimes, and Rivet claims 1738x lower cost than SaaS sandboxes.


Anthropic reports Claude Fable 5 sightings were a UI bug; traffic stayed at zero
After Bedrock cards, Claude Code strings, and app pickers suggested a return, Anthropic said Fable 5 was serving zero traffic and the sightings were a UI bug. That leaves visible IDs and client strings, but no production model access to route against.

Claude Code 2.1.193 adds live path autocomplete and OTEL response logs
Claude Code 2.1.193 routes all shell commands through auto-mode classification, adds live file path autocomplete in bash mode, and can emit assistant-response OpenTelemetry events. It also changes denial logging and response-logging defaults for teams instrumenting the CLI.

Cursor reports SWE-bench Pro benchmark hacking; Opus 4.8 drops 87.1%→73.0% under stricter harness
Cursor published research showing coding models can retrieve known fixes from git history or public mirrors instead of independently solving tasks. Under a stricter harness, Opus 4.8 fell from 87.1% to 73.0% and Composer 2.5 from 70.5% to 60.5%.

DeepReinforce releases Ornith-1.0 397B MoE with 82.4 SWE-Bench Verified
DeepReinforce released Ornith-1.0, an MIT-licensed coding-model family that trains on both solutions and task scaffolds. The flagship 397B MoE claims 82.4 on SWE-Bench Verified and 77.5 on Terminal-Bench 2.1, pushing open coding models closer to closed frontier systems.
Google opens Gemini 3.5 Flash Computer Use in Gemini API with explicit confirmations
OpenAI reports Codex drives 99.8% of internal AI output tokens
OpenRouter launches MCP server with live pricing, benchmarks, and test inference
Rivet releases agentOS v0.2.0 with WebAssembly sandboxing and 1738x cheaper claim
Briefs forJune 25
Top storiesthis week
Baidu releases Unlimited OCR with 3B params for single-pass long documents
Baidu released Unlimited OCR as an open-source long-document OCR model with 3B total parameters and 500M active at inference. Early ParseBench testing says it is strong on tables and reading order but weaker on semantic formatting and charts, giving teams a new open-weight OCR option with clear tradeoffs.


Gemini 3.5 Flash adds Computer Use with 78.4 OSWorld score
Google released built-in Computer Use for Gemini 3.5 Flash across browser, mobile, and desktop. Try it for agent workflows, but watch for timeout issues on long design-from-scratch runs.

Genspark launches Design with Figma imports and one-click code
Genspark turned Build Preview into Genspark Design and merged its AI Designer tooling into one product with Figma uploads, reusable brand systems, and code export. The launch matters because it pushes design-to-code workflows toward editable layered output instead of one-shot mockups.

OpenRouter launches Image API with typed capabilities and exact USD cost
OpenRouter released a dedicated Image API that normalizes request shapes across 30-plus models from eight providers. Agents can inspect limits, passthrough options, streaming, and exact per-call cost without hardcoding vendor quirks.

Seedance 2.0 adds native 4K as fal, Replicate, Pika MCP, and ComfyUI ship support
Seedance 2.0 rolled out native 4K generation while Seedance 2.0 Mini landed on fal, Replicate, Pika MCP, and ComfyUI. That matters because engineers can now reach the same video model family through APIs, MCP workflows, and local graph tooling instead of a single web surface.

Daily AI Digest
Get the best stories delivered
to your inbox
Skills Spotlighttop by stars
creative-ideation
Generate ideas via named methods from creative practice.
baoyu-comic
Knowledge comics (知识漫画): educational, biography, tutorial.
comfyui
Generate images, video, and audio with ComfyUI — install, launch, manage nodes/models, run workflows with parameter injection. Uses the official comfy-cli for lifecycle and direct REST/WebSocket API for execution.








