Agentic Engineering — Explore AI Tools & Stories

Fresh stories

New

Anthropic reports Claude Fable 5 sightings were a UI bug; traffic stayed at zero

After Bedrock cards, Claude Code strings, and app pickers suggested a return, Anthropic said Fable 5 was serving zero traffic and the sightings were a UI bug. That leaves visible IDs and client strings, but no production model access to route against.

Claude Code·25th June

Release

Claude Code 2.1.193 adds live path autocomplete and OTEL response logs

Claude Code 2.1.193 routes all shell commands through auto-mode classification, adds live file path autocomplete in bash mode, and can emit assistant-response OpenTelemetry events. It also changes denial logging and response-logging defaults for teams instrumenting the CLI.

New

Claude Code·25th June·4 min read

New

Cursor reports SWE-bench Pro benchmark hacking; Opus 4.8 drops 87.1%→73.0% under stricter harness

Cursor published research showing coding models can retrieve known fixes from git history or public mirrors instead of independently solving tasks. Under a stricter harness, Opus 4.8 fell from 87.1% to 73.0% and Composer 2.5 from 70.5% to 60.5%.

Cursor·25th June

New

DeepReinforce releases Ornith-1.0 397B MoE with 82.4 SWE-Bench Verified

DeepReinforce released Ornith-1.0, an MIT-licensed coding-model family that trains on both solutions and task scaffolds. The flagship 397B MoE claims 82.4 on SWE-Bench Verified and 77.5 on Terminal-Bench 2.1, pushing open coding models closer to closed frontier systems.

Release·Coding Agents·25th June

Release

Google opens Gemini 3.5 Flash Computer Use in Gemini API with explicit confirmations

A day after Gemini 3.5 Flash Computer Use surfaced as a launch story, Google formally opened it through the Gemini API and Enterprise Agent Platform. Explicit user confirmation, automated task stopping, and an Android adb quickstart make the rollout concrete for agent builders.

New

Gemini·25th June·4 min read

New

OpenAI reports Codex drives 99.8% of internal AI output tokens

OpenAI published usage data showing Codex now generates 99.8% of its internal AI output tokens, with sharp growth in legal, support, recruiting, and finance. The report measures agent adoption as delegated parallel work, not just chat inside engineering.

Codex·25th June

New

OpenRouter launches MCP server with live pricing, benchmarks, and test inference

OpenRouter released an MCP server that lets agents query live model pricing, benchmark scores, provider data, and docs, and run test inference from the CLI. That replaces stale model knowledge with current routing data inside long-running agent workflows.

Release·MCP·25th June

Breaking

Rivet releases agentOS v0.2.0 with WebAssembly sandboxing and 1738x cheaper claim

Rivet released agentOS v0.2.0, a Rust rewrite of its WebAssembly-based sandbox and orchestration stack with multiplayer workflows and one-prompt deployment. The release targets self-hosted and cloud agent runtimes, and Rivet claims 1738x lower cost than SaaS sandboxes.

New

Agent Infrastructure·25th June·3 min read

New

v0 releases Design Systems 2.0 with GitHub, npm, Storybook, and Figma imports

v0 Design Systems 2.0 imports components, tokens, providers, and usage patterns from repos, packages, Storybook, Figma, screenshots, and real apps. That lets generated UI target a team's production design system instead of generic components.

Release·DX Tooling·25th June

New

Vercel releases AI SDK 7 with approvals, durability, and telemetry

Vercel shipped AI SDK 7 with approvals, durability, telemetry, and other production agent primitives. Early adapter feedback points to breaking changes and migration work for SDKs that wrap the old APIs.

Release·Agent Infrastructure·25th June

Briefs for June 25

Top stories this week

Breaking

Baidu releases Unlimited OCR with 3B params for single-pass long documents

Baidu released Unlimited OCR as an open-source long-document OCR model with 3B total parameters and 500M active at inference. Early ParseBench testing says it is strong on tables and reading order but weaker on semantic formatting and charts, giving teams a new open-weight OCR option with clear tradeoffs.

New

Multimodal·24th June·3 min read

New

Gemini 3.5 Flash adds Computer Use with 78.4 OSWorld score

Google released built-in Computer Use for Gemini 3.5 Flash across browser, mobile, and desktop. Try it for agent workflows, but watch for timeout issues on long design-from-scratch runs.

Release·Gemini·24th June

New

Genspark launches Design with Figma imports and one-click code

Genspark turned Build Preview into Genspark Design and merged its AI Designer tooling into one product with Figma uploads, reusable brand systems, and code export. The launch matters because it pushes design-to-code workflows toward editable layered output instead of one-shot mockups.

Release·Productivity·24th June

New

OpenRouter launches Image API with typed capabilities and exact USD cost

OpenRouter released a dedicated Image API that normalizes request shapes across 30-plus models from eight providers. Agents can inspect limits, passthrough options, streaming, and exact per-call cost without hardcoding vendor quirks.

Release·Multimodal·24th June

New

Seedance 2.0 adds native 4K as fal, Replicate, Pika MCP, and ComfyUI ship support

Seedance 2.0 rolled out native 4K generation while Seedance 2.0 Mini landed on fal, Replicate, Pika MCP, and ComfyUI. That matters because engineers can now reach the same video model family through APIs, MCP workflows, and local graph tooling instead of a single web surface.

Multimodal·24th June

New

Zed v1.8 adds agent.terminal_init_command and faster Git operations

Zed v1.8 added agent.terminal_init_command plus Git, diff, and multi-cursor performance work. The update makes new agent terminal threads easier to bootstrap with project-specific setup and lowers editor overhead.

Release·DX Tooling·24th June

New

Anthropic launches Claude Tag in Slack beta with channel memory

Claude Tag puts Claude into Slack as a teammate that can handle threads, use approved tools, and follow up proactively in selected channels. Team and Enterprise users can try it in beta to keep shared channel context instead of restarting from private chats.

Release·Claude Code·23rd June

New

AssemblyAI launches Universal-3.5 Pro Realtime with Context Carryover

AssemblyAI’s Universal-3.5 Pro Realtime now carries forward the agent side of a conversation to improve live transcription. The release also ships multilingual realtime ASR features, and one early deployment said critical-utterance errors fell from 26% to 9%.

Release·Voice Agents·23rd June

New

Latitude launches MIT-licensed agent monitoring with Signals clustering and MCP access

Latitude released an open-source platform for monitoring AI agents in production, with plain-English trace search, repeated-failure clustering, and MCP access from coding agents. That gives teams a self-hostable way to inspect token burn, surface recurring failures, and turn production traces into evals and fixes.

Release·Agent Infrastructure·23rd June

New

Mistral releases OCR 4 with bounding boxes and 85.20 OlmOCRBench

Mistral OCR 4 adds layout-aware extraction with bounding boxes, block typing, and inline confidence across 170 languages. Use it through the API or self-hosted deployments when document pipelines need structure, citations, redaction, and chunking.

Release·Mistral·23rd June

Skills Spotlight · top by stars

✍️ Writing

New

creative-ideation

Generate ideas via named methods from creative practice.

by NousResearch · 2 days ago · 203.5k

🎨 Design

baoyu-comic

Knowledge comics (知识漫画): educational, biography, tutorial.

by NousResearch · 1 month ago · 203.5k

🤖 ML/AI

comfyui

Generate images, video, and audio with ComfyUI — install, launch, manage nodes/models, run workflows with parameter injection. Uses the official comfy-cli for lifecycle and direct REST/WebSocket API for execution.

by NousResearch · 1 month ago · 203.5k
