TOPIC35 stories

Context Engineering

How agents gather, compress, structure, and use context.

Stories

Developers launch Markdown Experience Guidelines and HTML explainer tools for agent output

Independent builders shipped new specs and tools for richer agent output, including Markdown Experience Guidelines and Plannotator’s HTML render-and-annotate flow, while practitioners shared repeatable HTML explainer workflows. The cluster matters because teams are moving beyond raw Markdown toward output surfaces that can carry charts, interaction, review context, and persistent knowledge-base structure.

WORKFLOW9th May

Claude Code adds `frontend-slides` for HTML briefs and publishable slides

A day after HTML artifacts surfaced as a Claude Code workflow, Anthropic promoted a `frontend-slides` plugin with direct install commands and artifact publishing. The rollout sharpened a real workflow split: teams are using HTML for human review and demos, while keeping markdown or MDX for token-efficient agent context.

WORKFLOW9th May

Claude Code guide fixes hallucinated SHAs with adaptive thinking off and effort=high

A Claude Code guide tied hallucinated package names, API versions, and SHAs to zero-thinking turns and recommended config changes to force fixed reasoning budgets and higher effort. HN discussion and user reports suggest the workaround is being used against a broader reliability regression, not just one bad prompt.

WORKFLOW8th May

Claude Code users report HTML artifacts improve PR review, dashboards, and visual explainers

A cluster of Claude Code users, guides, and companion tools shifted from Markdown toward HTML artifacts for code review, dashboards, and explainer pages. The pattern matters because richer outputs are easier to inspect and share during long agent workflows, though several builders note the token cost is materially higher than Markdown.

RELEASE1w ago

Anthropic launches Claude Managed Agents with Dreaming, Outcomes, and multiagent orchestration

Anthropic added Dreaming in research preview plus public-beta Outcomes, multiagent orchestration, and webhooks to Claude Managed Agents. Teams should try the new grader loops and shared-container sub-agents if they want more control over long-running agent work.

WORKFLOW1w ago

Practitioners report harness playbooks with Playwright CLI, create_agent, and MCP

Builders shared concrete Symphony, create_agent, and MCP setup guides after arguing that model switching is easy but harness switching is not. The playbooks matter because they make harness engineering more repeatable, so teams can copy tested tooling and integration patterns.

RELEASE2w ago

GitHub Copilot adds semantic indexing to all workspaces and cross-repo search in @code

GitHub expanded semantic indexing beyond GitHub and Azure DevOps remotes, so Copilot can search across more workspace types and repositories inside @code. That improves agent context retrieval in local workflows, while the same release also adds chat-history recall and prompt-eval tooling.

WORKFLOW2w ago

mattpocock/skills ranks #1 on GitHub at 28K stars with `/grill-me` and `/tdd` packs

mattpocock/skills hit the top of GitHub Trending as reusable `SKILL.md` packs for grilling specs, writing PRDs, and enforcing TDD spread across coding-agent workflows. The format is starting to look like a distribution layer for agent behavior, with faster install tooling and third-party skills shipping around the same pattern.

RELEASE3w ago

OpenAI Codex adds Chronicle screen memories in macOS Pro preview

OpenAI added Chronicle, a Codex preview that turns recent screen context into reusable memories for errors, files, docs, and workflows. The macOS Pro-only feature stores local memory unencrypted and can burn rate limits quickly, so watch prompt-injection risk before relying on it.

WORKFLOW3w ago

Claude Design users report AGENTS.md backends and Claude Code handoffs one day after launch

Builders say they are already passing Claude Design prototypes into Claude Code, wiring live backends through AGENTS.md, and recreating the flow in custom orchestrators. Try this if you want a faster handoff from mockup to full-stack build, but expect early workflow rough edges.

RELEASE4w ago

OpenClaw 2026.4.15 adds Opus 4.7 support and bounded memory reads

OpenClaw 2026.4.15 adds Anthropic Opus 4.7, bundled Gemini TTS, bounded memory reads, and transport self-heal fixes. The release targets context and reliability issues users had been reporting this week.

RELEASE4w ago

Codex 0.120 adds per-project memory extensions and Realtime V2 streaming

Codex 0.120 introduced per-project memory extension files and Realtime V2 progress streaming for background agents. Separate app findings also showed an unreleased Scratchpad view that can start parallel Codex chats from a task list, which may change how teams queue work.

NEWS1mo ago

Hermes Agent adds MiniMax M2.7 and MiMo V2 Pro through partner integrations

Nous Research added MiniMax M2.7, Xiaomi’s MiMo V2 Pro, a SuperMemory plugin, and expanded Manim support to Hermes through partner integrations. The additions give users new hosted model options, a shared memory backend, and more complete technical-animation tooling to try in workflows.

RELEASE1mo ago

OpenClaw 2026.4.7 adds a headless inference hub, memory-wiki, and webhook TaskFlows

OpenClaw 2026.4.7 adds a headless inference hub, memory-wiki, session branch and restore, and webhook-driven TaskFlows. Composio also shipped a CLI for secure app authentication, so users can expand OpenClaw from a local coding harness into a broader agent runtime.

WORKFLOW1mo ago

Bram Cohen compares vibe coding with AI Level 6 workflows after Claude Code leak

Bram Cohen used the Claude Code leak to argue that prompt-only development produces bad software, while a separate 250-hour syntaqlite build said the durable version arrived only after a Python-to-Rust rewrite. Practitioners say specs, tests, linters, repo skills, and codebase context are the controls that keep coding agents maintainable.

RELEASE1mo ago

Hermes Agent adds /claude-code orchestration and cron hooks

Hermes Agent added direct /claude-code orchestration and cron-time script hooks, and the team also shipped Hermes-focused datasets and agent-tuned model variants. The update turns Hermes into a harness that can steer Claude Code and inject recurring context automatically.

RELEASE1mo ago

Clawback releases Claude Code hook layer for stop-checks and PostToolUse enforcement

Clawback turns leaked Claude Code verification patterns into stop, pre-tool, post-tool, and post-compaction hooks. It replaces prompt-only guardrails with deterministic checks and shows how fast the source-map leak is becoming third-party control layers.

RELEASE1mo ago

Hermes Agent v0.7.0 adds memory plugins, credential pools, and Camofox browsing

Nous released Hermes Agent v0.7.0 with an extensible memory plugin system, rotating credential pools, Camofox browser support, and inline diff previews. The update pushes the framework closer to production use, so teams can test the new failure handling and interface split before adopting it.

WORKFLOW1mo ago

Claude Code adds /loop, /teleport, and /batch workflow guidance in Boris Cherny guide

A Boris Cherny guide maps Claude Code mobile sessions, /teleport, /loop, hooks, worktrees, /batch, and custom agents into one workflow set. Use it to turn scattered commands into repeatable patterns for long-running coding sessions across terminal, desktop, and cloud.

WORKFLOW1mo ago

Claude Code guides compare `.claude/` commands, agents, and global rules

Two new guides map how Claude Code teams are using `.claude/`, `CLAUDE.md`, commands, agents, skills, and global rules. The overlap matters because commenters favor short instructions and a small number of repeatable guardrails over larger prompt stacks.

NEWS1mo ago

ChatGPT adds Library tab for reusable file uploads across conversations

ChatGPT now saves uploaded and generated files into an account-level Library that can be reused across conversations from the web sidebar or recent-files picker. It removes repetitive re-uploading and makes past PDFs, spreadsheets, and images part of a persistent working context.

NEWS1mo ago

Hermes Agent reports 10,000 GitHub stars with memory and delegation tools

Nous Research said Hermes Agent crossed 10,000 stars, while users reported easy migrations from OpenClaw and stable long-running use. If you test it, focus on persistent memory, MCP browser control, and delegation behavior under real workloads.

WORKFLOW1mo ago

Agent Flywheel introduces beads-and-swarms workflow for 1,000 commits a day

Agent Flywheel lays out a planning-first workflow built on beads, agent mail, swarms, and TUI inspection for very large coding runs. It is useful because the guide exposes coordination primitives and review loops, not just benchmark screenshots.

RELEASE1mo ago

Claude adds Projects to Cowork desktop with local folders and one-click imports

Anthropic rolled Projects into Cowork on the Claude desktop app, giving each project its own local folder, persistent instructions, and import paths from existing work. It makes Cowork more practical for ongoing tasks, though teams should test current folder-location limits.

RELEASE1mo ago

Next.js 16.2 ships AGENTS.md defaults and next-browser for agent debugging

Next.js 16.2 adds version-matched AGENTS.md docs, a terminal browser for inspecting running apps, browser-error forwarding, and a dev-server lock file. It gives coding agents better frontend context and cuts duplicate-server and client-side debugging waste.

RELEASE1mo ago

LlamaIndex releases LiteParse for 50+ document types

LlamaIndex open-sourced LiteParse, a model-free local parser for 50+ document types that preserves layout well enough for agent workflows. Use it as a fast first pass before expensive OCR or VLM parsing, especially when you need table structure and local execution.

RELEASE1mo ago

Google Stitch launches an AI-native canvas with DESIGN.md and voice mode

Google rolled out a redesigned Stitch workspace that accepts text, code, PRDs, and images on a spatial canvas, then generates prototypes and portable DESIGN.md files. Teams testing AI-native UI workflows can use it to try a tighter design-to-code loop in the live product.

RELEASE1mo ago

Morph launches FlashCompact: 33k tok/s compaction from 200k to 50k in 1.5s

Morph released FlashCompact, a specialized compaction model and SDK for coding agents, claiming 33k tokens per second and near-invisible long-context compression. Use it or copy the approach if compaction latency and noisy tool output are blocking longer agent runs.

WORKFLOW2mo ago

Geoffrey Huntley compares 4 Ralph loops for porting codebases across languages

Geoffrey Huntley published a four-loop Ralph workflow for porting codebases by turning tests and source into cited specs before implementation. Try it when you need AI help translating a mature codebase across languages without losing behavioral coverage.

RELEASE2mo ago

ACE opens ace-platform for self-hosting with MCP-compatible playbooks

ACE open-sourced ace-platform so teams can self-host its playbook-based context engineering workflow instead of relying only on the hosted service. Use it if you want MCP-compatible prompting infrastructure with your own Postgres, Redis, and deployment controls.

RELEASE2mo ago

supermemory launches CLI with npx install, scoped agent access, and audit logs

supermemory launched a CLI that exposes platform actions directly to agents and added scoped agent access with tag-level permissions plus audit logs. Use it to wire memory into agent loops without granting a full account.

RELEASE2mo ago

Keep adds an in-app feed reader for saved bookmarks

Keep added an in-app feed reader so saved links can be read directly inside its bookmark store for agent workflows. Use it to turn bookmarks, RSS feeds, and markdown exports into reusable context instead of scattered tabs.

RELEASE2mo ago

Anthropic launches 1M-token context for Opus 4.6 and Sonnet 4.6 at flat pricing

Anthropic made 1M-token context generally available for Opus 4.6 and Sonnet 4.6, removed the long-context premium, and raised media limits to 600 images or PDF pages. Use it for retrieval-heavy and codebase-scale workflows that previously needed beta headers or special long-context pricing.

RELEASE2mo ago

Claude Code adds /btw for side questions without interrupting the main task

Claude Code added /btw, a side-channel prompt that can inspect the current session without interrupting the main task. Use it to ask clarifying questions mid-run without polluting history or triggering extra tool work.

WORKFLOW2mo ago

OpenAI introduces Codex skills workflows for Agents SDK OSS maintenance

OpenAI detailed how repo-local skills, AGENTS.md, and GitHub Actions now drive repeatable verification, release, and pull request workflows across its Agents SDK repositories. Maintainers can copy the pattern to reduce prompt sprawl and keep agent behavior closer to the codebase.