Cost & Operations — Explore AI Tools & Stories

Fresh stories

New

Seedance 2.0 Mini launches on Venice, ComfyUI, and Pika MCP with 15s 720p video

A day after Seedance 2.0's 4K rollout story, partners began shipping the cheaper Seedance 2.0 Mini across Venice, ComfyUI, and Pika MCP. The 15-second 720p variant with native audio gives video workflows a lower-cost path than the flagship model.

ReleaseMultimodal25th June

Seedance 2.0 Mini launches on Venice, ComfyUI, and Pika MCP with 15s 720p video

ReleaseMultimodal25th June

Briefs forJune 25

Top storiesthis week

See all →

Breaking

Vercel AI Gateway adds GLM-5.2 Fast at 150-250 tok/s

Vercel and Wafer launched a serverless GLM-5.2 endpoint on AI Gateway with 1M context and published pricing. Teams get a high-throughput open-model option inside an existing gateway instead of managing GLM inference directly.

New

GLM·24th June·3 min read

GLM-5.2 adds Perplexity Agent API and Droid support on Baseten at >280 TPS

GLM-5.2 added Perplexity Agent API, Droid, and more hosting options, while Baseten reported over 280 TPS and sub-0.8s TTFT. Builders should watch the cost and benchmark data as it moves into production agent stacks.

GLM22nd June

GLM-5.2 ranks #1 on DeepSWE with 44% pass@1

Independent results put GLM-5.2 at the top of the open-model DeepSWE board and near the top on debate and post-train evals. Watch token use and long reasoning traces, which can offset its headline price advantage.

GLM20th June

New

Wafer claims GLM-5.2 hits 222 tok/s and 12.6s end-to-end

Wafer said its GLM-5.2 deployment leads Artificial Analysis on throughput and latency, and priced usage at $1.20 input and $4.10 output per million tokens. Compare serverless and dedicated endpoints if you need speed at scale.

GLM20th June

New

ComputeSDK releases 2026 100k Scale Invitational results across 6 sandbox providers

ComputeSDK published results from its 2026 100k Scale Invitational after weeks of reruns and infra tuning across Modal, Tensorlake, Northflank, Declaw AI, E2B, and Isorun. It matters because sandbox and agent infra claims now have a shared public concurrency target instead of vendor-specific load demos.

Agent Infrastructure19th June

Engineers report GLM-5.2 matches near-Opus planning at about 1/10 the price

Independent tests put GLM-5.2 near Opus 4.8 and GPT-5.5 on planning and coding, and users shared Claude Code, BrowserCode, dcode, and local-serving recipes. It matters because many engineers are treating it as a daily-driver option for text-heavy coding, though teams still report weaker vision and provider limits.

GLM19th June

New

Kilo Code adds Terminal Bench scores and average attempt cost to model picker

Kilo Code now shows Terminal Bench completion rate and average attempt cost directly in model details inside its CLI and VS Code extension. It matters because the numbers come from Kilo's own harness and retry logic rather than public leaderboard scaffolds.

ReleaseBenchmarks19th June

See all stories →

New

ComputeSDK releases 2026 100k Scale Invitational results across 6 sandbox providers

Agent Infrastructure19th June

Engineers report GLM-5.2 matches near-Opus planning at about 1/10 the price

GLM19th June

Kilo Code adds Terminal Bench scores and average attempt cost to model picker

ReleaseBenchmarks19th June

Daily AI Digest

Get the best stories delivered
to your inbox

Skills Spotlighttop by stars

View all skills

✍️ Writing

New

creative-ideation

Generate ideas via named methods from creative practice.

by NousResearch · 2 days ago203.5k

🎨 Design

baoyu-comic

Knowledge comics (知识漫画): educational, biography, tutorial.

by NousResearch · 1 month ago203.5k

🤖 ML/AI

comfyui

Generate images, video, and audio with ComfyUI — install, launch, manage nodes/models, run workflows with parameter injection. Uses the official comfy-cli for lifecycle and direct REST/WebSocket API for execution.

by NousResearch · 1 month ago203.5k

Explore what's new in AI

Filters

Fresh stories

Seedance 2.0 Mini launches on Venice, ComfyUI, and Pika MCP with 15s 720p video

Seedance 2.0 Mini launches on Venice, ComfyUI, and Pika MCP with 15s 720p video

Briefs forJune 25

Top storiesthis week

Vercel AI Gateway adds GLM-5.2 Fast at 150-250 tok/s

GLM-5.2 adds Perplexity Agent API and Droid support on Baseten at >280 TPS

GLM-5.2 ranks #1 on DeepSWE with 44% pass@1

Wafer claims GLM-5.2 hits 222 tok/s and 12.6s end-to-end

ComputeSDK releases 2026 100k Scale Invitational results across 6 sandbox providers

Engineers report GLM-5.2 matches near-Opus planning at about 1/10 the price

Kilo Code adds Terminal Bench scores and average attempt cost to model picker

Vercel AI Gateway adds GLM-5.2 Fast at 150-250 tok/s

GLM-5.2 adds Perplexity Agent API and Droid support on Baseten at >280 TPS

GLM-5.2 ranks #1 on DeepSWE with 44% pass@1

Wafer claims GLM-5.2 hits 222 tok/s and 12.6s end-to-end

ComputeSDK releases 2026 100k Scale Invitational results across 6 sandbox providers

Engineers report GLM-5.2 matches near-Opus planning at about 1/10 the price

Kilo Code adds Terminal Bench scores and average attempt cost to model picker

Daily AI Digest

Skills Spotlighttop by stars

creative-ideation

baoyu-comic

comfyui