Fresh stories
Briefs forJune 25
Top storiesthis week
OpenRouter launches Image API with typed capabilities and exact USD cost
OpenRouter released a dedicated Image API that normalizes request shapes across 30-plus models from eight providers. Agents can inspect limits, passthrough options, streaming, and exact per-call cost without hardcoding vendor quirks.


Vercel AI Gateway adds GLM-5.2 Fast at 150-250 tok/s
Vercel and Wafer launched a serverless GLM-5.2 endpoint on AI Gateway with 1M context and published pricing. Teams get a high-throughput open-model option inside an existing gateway instead of managing GLM inference directly.

GLM-5.2 adds Perplexity Agent API and Droid support on Baseten at >280 TPS
GLM-5.2 added Perplexity Agent API, Droid, and more hosting options, while Baseten reported over 280 TPS and sub-0.8s TTFT. Builders should watch the cost and benchmark data as it moves into production agent stacks.

Morph supports Qwen, GLM-5.2, MiniMax M3, DeepSeek v4 with 20-35% higher code acceptance
Morph said its code-serving stack now exposes Qwen, GLM-5.2, MiniMax M3, and DeepSeek v4 with code-tuned speculative decoding. It claims 20-35% higher acceptance than Eagle 3.1 or DFlash, plus kernels for cheaper hardware.

GLM-5.2 ships to BrowserCode, Hyper, OpenCode, and Together in 3 days
BrowserCode, Hyper, OpenCode, Together, and other vendors added GLM-5.2 soon after release. That turns the open model into a deployable option across coding, browser automation, and hosted chat.

Daily AI Digest
Get the best stories delivered
to your inbox
Skills Spotlighttop by stars
creative-ideation
Generate ideas via named methods from creative practice.
baoyu-comic
Knowledge comics (知识漫画): educational, biography, tutorial.
comfyui
Generate images, video, and audio with ComfyUI — install, launch, manage nodes/models, run workflows with parameter injection. Uses the official comfy-cli for lifecycle and direct REST/WebSocket API for execution.



