Fresh stories
Briefs forJune 25
Top storiesthis week
Baidu releases Unlimited OCR with 3B params for single-pass long documents
Baidu released Unlimited OCR as an open-source long-document OCR model with 3B total parameters and 500M active at inference. Early ParseBench testing says it is strong on tables and reading order but weaker on semantic formatting and charts, giving teams a new open-weight OCR option with clear tradeoffs.


Gemini 3.5 Flash adds Computer Use with 78.4 OSWorld score
Google released built-in Computer Use for Gemini 3.5 Flash across browser, mobile, and desktop. Try it for agent workflows, but watch for timeout issues on long design-from-scratch runs.

OpenRouter launches Image API with typed capabilities and exact USD cost
OpenRouter released a dedicated Image API that normalizes request shapes across 30-plus models from eight providers. Agents can inspect limits, passthrough options, streaming, and exact per-call cost without hardcoding vendor quirks.

Seedance 2.0 adds native 4K as fal, Replicate, Pika MCP, and ComfyUI ship support
Seedance 2.0 rolled out native 4K generation while Seedance 2.0 Mini landed on fal, Replicate, Pika MCP, and ComfyUI. That matters because engineers can now reach the same video model family through APIs, MCP workflows, and local graph tooling instead of a single web surface.

Vercel AI Gateway adds GLM-5.2 Fast at 150-250 tok/s
Vercel and Wafer launched a serverless GLM-5.2 endpoint on AI Gateway with 1M context and published pricing. Teams get a high-throughput open-model option inside an existing gateway instead of managing GLM inference directly.

Daily AI Digest
Get the best stories delivered
to your inbox
Skills Spotlighttop by stars
creative-ideation
Generate ideas via named methods from creative practice.
baoyu-comic
Knowledge comics (知识漫画): educational, biography, tutorial.
comfyui
Generate images, video, and audio with ComfyUI — install, launch, manage nodes/models, run workflows with parameter injection. Uses the official comfy-cli for lifecycle and direct REST/WebSocket API for execution.







