The most complete AI hub: fresh stories, workflows, prompts, and deals. Updated daily.
Compromised LiteLLM 1.82.7 and 1.82.8 wheels executed a malicious .pth file at install time to exfiltrate credentials, and PyPI quarantined the releases. Treat fresh-package installs and AI infra dependencies as supply-chain risk, and check startup hooks on affected systems.
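Executable `.pth` hooks are a legitimate CPython feature: any line in a site-packages `.pth` file that begins with `import` runs at every interpreter startup, which is what makes this vector stealthy. A minimal audit sketch (generic Python, nothing LiteLLM-specific); note that import lines are not proof of compromise, since editable installs and some tooling use them legitimately, but they are the places to inspect:

```python
import pathlib
import site

def executable_pth_lines():
    # CPython runs any .pth line starting with "import" at interpreter
    # startup -- collect those lines so they can be reviewed by hand.
    hits = []
    for d in site.getsitepackages():
        for pth in pathlib.Path(d).glob("*.pth"):
            for line in pth.read_text(errors="ignore").splitlines():
                if line.startswith("import"):
                    hits.append((str(pth), line[:120]))
    return hits

for path, line in executable_pth_lines():
    print(path, "->", line)
```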

Public Anthropic draft posts described Claude Mythos as the company's most powerful model and placed a new Capybara tier above Opus 4.6. The documents also point to cybersecurity capability and compute cost as rollout constraints.

Google Research said TurboQuant can shrink KV cache storage to 3 bits with roughly 6x less memory, and early implementations already surfaced in PyTorch, llama.cpp, and Atomic Chat. The work targets a core inference bottleneck for long-context serving on local and server hardware.
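TurboQuant's actual algorithm isn't detailed here; as a rough illustration of what "3-bit KV cache" means, here is a per-row asymmetric round-to-nearest sketch (all names are illustrative, not TurboQuant's API). Eight fp16 values (16 bytes) become eight 3-bit codes (3 bytes) plus a per-row scale and zero point, which is the ballpark of the cited memory savings before packing overhead:

```python
def quantize_3bit(row):
    # Asymmetric round-to-nearest quantization of one KV-cache row to
    # 3-bit codes (integers 0..7) plus a per-row scale and zero point.
    lo, hi = min(row), max(row)
    scale = (hi - lo) / 7 or 1.0  # avoid div-by-zero for constant rows
    codes = [min(7, max(0, round((x - lo) / scale))) for x in row]
    return codes, scale, lo

def dequantize(codes, scale, lo):
    return [c * scale + lo for c in codes]

row = [0.1, -0.4, 0.9, 0.3, -0.2, 0.7, 0.0, 0.5]
codes, scale, lo = quantize_3bit(row)
approx = dequantize(codes, scale, lo)
```

With round-to-nearest, each reconstructed value lands within half a quantization step (`scale / 2`) of the original.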


GitHub said interaction data from Copilot Free, Pro, and Pro+ will be used for model training by default starting Apr. 24 unless users opt out, while private-repo content at rest remains excluded. Teams should review per-user enforcement, enterprise coverage, and repo privacy settings before the change lands.

Anthropic confirmed new peak-time metering that burns through 5-hour Claude sessions faster, and multiple power users have reported 529 "overloaded" errors and early session exhaustion. If you rely on Max plans for coding, watch your session limits and consider moving daily work to Codex.
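If your tooling hits those overload responses, exponential backoff with jitter is the standard mitigation. A sketch under the assumption that your client surfaces overloads as an exception (the `OverloadedError` class and retry parameters here are illustrative, not Anthropic's SDK):

```python
import random
import time

class OverloadedError(Exception):
    """Stand-in for an HTTP 529 'overloaded' response."""

def call_with_backoff(fn, max_retries=5, base_delay=1.0):
    # Retry fn on overload, sleeping base_delay * 2**attempt plus a
    # small random jitter so concurrent clients don't retry in lockstep.
    for attempt in range(max_retries):
        try:
            return fn()
        except OverloadedError:
            if attempt == max_retries - 1:
                raise
            time.sleep(base_delay * 2 ** attempt + random.uniform(0, 0.1))
```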

Cline launched Kanban, a local multi-agent board that runs Claude, Codex, and Cline CLI tasks in isolated worktrees with dependency chains and diffs. Teams can use it as a visual control layer for parallel coding agents on repo chores that split cleanly.


Mistral released open-weight Voxtral TTS with low-latency streaming, voice cloning, and cross-lingual adaptation, and vLLM Omni shipped day-0 support. Voice-agent teams should compare quality, latency, and serving cost against closed APIs.

Anthropic said Free, Pro, and Max users will hit 5-hour Claude session limits faster on weekdays from 5am to 11am PT, while weekly caps stay the same. Shift long Claude Code jobs off-peak and watch for prompt-cache misses.
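A quick helper for job schedulers, assuming the window is exactly weekdays 05:00 to 11:00 America/Los_Angeles as described (the function name and boundaries are this sketch's, not Anthropic's):

```python
from datetime import datetime, time
from zoneinfo import ZoneInfo

PT = ZoneInfo("America/Los_Angeles")
PEAK_START, PEAK_END = time(5, 0), time(11, 0)

def in_peak(now=None):
    # True during the described weekday 5am-11am PT metering window.
    now = (now or datetime.now(PT)).astimezone(PT)
    return now.weekday() < 5 and PEAK_START <= now.time() < PEAK_END
```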

OpenAI rolled out Codex plugins across the app, CLI, and IDE extensions, with app auth, reusable skills, and optional MCP servers. Teams should test plugin-backed workflows and permission models before broad rollout.

Google launched Gemini 3.1 Flash Live in AI Studio, the API, and Gemini Live with stronger audio tool use, lower latency, and 128K context. Voice-agent teams should benchmark quality, latency, and thinking settings before switching.
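A model-agnostic way to run that latency comparison is to wrap each candidate call in a small percentile harness (purely illustrative; `fn` stands in for whatever client call you are benchmarking):

```python
import time

def latency_percentiles(fn, n=20):
    # Time n calls of fn and report rough p50/p95 wall-clock latencies.
    samples = []
    for _ in range(n):
        t0 = time.perf_counter()
        fn()
        samples.append(time.perf_counter() - t0)
    samples.sort()
    return {"p50": samples[n // 2], "p95": samples[min(n - 1, int(n * 0.95))]}
```

Wall-clock timing like this captures network and queueing variance, so run enough samples and compare percentiles rather than means.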
