Context Engineering
How agents gather, compress, structure, and use context.
Stories
Independent builders shipped new specs and tools for richer agent output, including Markdown Experience Guidelines and Plannotator’s HTML render-and-annotate flow, while practitioners shared repeatable HTML explainer workflows. The cluster matters because teams are moving beyond raw Markdown toward output surfaces that can carry charts, interaction, review context, and persistent knowledge-base structure.
A day after HTML artifacts surfaced as a Claude Code workflow, Anthropic promoted a `frontend-slides` plugin with direct install commands and artifact publishing. The rollout sharpened a real workflow split: teams are using HTML for human review and demos, while keeping Markdown or MDX for token-efficient agent context.
A Claude Code guide tied hallucinated package names, API versions, and SHAs to zero-thinking turns and recommended config changes to force fixed reasoning budgets and higher effort. HN discussion and user reports suggest the workaround is being used against a broader reliability regression, not just one bad prompt.
A cluster of Claude Code users, guides, and companion tools shifted from Markdown toward HTML artifacts for code review, dashboards, and explainer pages. The pattern matters because richer outputs are easier to inspect and share during long agent workflows, though several builders note the token cost is materially higher than Markdown.
Anthropic added Dreaming in research preview plus public-beta Outcomes, multiagent orchestration, and webhooks to Claude Managed Agents. Teams should try the new grader loops and shared-container sub-agents if they want more control over long-running agent work.
Builders shared concrete Symphony, create_agent, and MCP setup guides after arguing that model switching is easy but harness switching is not. The playbooks matter because they make harness engineering more repeatable, so teams can copy tested tooling and integration patterns.
GitHub expanded semantic indexing beyond GitHub and Azure DevOps remotes, so Copilot can search across more workspace types and repositories inside VS Code. That improves agent context retrieval in local workflows, while the same release also adds chat-history recall and prompt-eval tooling.
mattpocock/skills hit the top of GitHub Trending as reusable `SKILL.md` packs for grilling specs, writing PRDs, and enforcing TDD spread across coding-agent workflows. The format is starting to look like a distribution layer for agent behavior, with faster install tooling and third-party skills shipping around the same pattern.
OpenAI added Chronicle, a Codex preview that turns recent screen context into reusable memories for errors, files, docs, and workflows. The macOS Pro-only feature stores local memory unencrypted and can burn rate limits quickly, so weigh those costs and the prompt-injection risk before relying on it.
Builders say they are already passing Claude Design prototypes into Claude Code, wiring live backends through AGENTS.md, and recreating the flow in custom orchestrators. Try this if you want a faster handoff from mockup to full-stack build, but expect early workflow rough edges.
OpenClaw 2026.4.15 adds Anthropic Opus 4.7, bundled Gemini TTS, bounded memory reads, and transport self-heal fixes. The release targets context and reliability issues users had been reporting this week.
Codex 0.120 introduced per-project memory extension files and Realtime V2 progress streaming for background agents. Separate app findings also showed an unreleased Scratchpad view that can start parallel Codex chats from a task list, which may change how teams queue work.
Nous Research added MiniMax M2.7, Xiaomi’s MiMo V2 Pro, a SuperMemory plugin, and expanded Manim support to Hermes through partner integrations. The additions give users new hosted model options, a shared memory backend, and more complete technical-animation tooling to try in workflows.
OpenClaw 2026.4.7 adds a headless inference hub, memory-wiki, session branch and restore, and webhook-driven TaskFlows. Composio also shipped a CLI for secure app authentication, so users can expand OpenClaw from a local coding harness into a broader agent runtime.
Bram Cohen used the Claude Code leak to argue that prompt-only development produces bad software, while the builder behind a separate 250-hour syntaqlite project said the durable version arrived only after a Python-to-Rust rewrite. Practitioners say specs, tests, linters, repo skills, and codebase context are the controls that keep coding agents maintainable.
Hermes Agent added direct /claude-code orchestration and cron-time script hooks, and the team also shipped Hermes-focused datasets and agent-tuned model variants. The update turns Hermes into a harness that can steer Claude Code and inject recurring context automatically.
Clawback turns leaked Claude Code verification patterns into stop, pre-tool, post-tool, and post-compaction hooks. It replaces prompt-only guardrails with deterministic checks and shows how quickly the source-map leak is being turned into third-party control layers.
Nous released Hermes Agent v0.7.0 with an extensible memory plugin system, rotating credential pools, Camofox browser support, and inline diff previews. The update pushes the framework closer to production use, so teams can test the new failure handling and interface split before adopting it.
A Boris Cherny guide maps Claude Code mobile sessions, /teleport, /loop, hooks, worktrees, /batch, and custom agents into one workflow set. Use it to turn scattered commands into repeatable patterns for long-running coding sessions across terminal, desktop, and cloud.
Two new guides map how Claude Code teams are using `.claude/`, `CLAUDE.md`, commands, agents, skills, and global rules. The overlap matters because commenters favor short instructions and a small number of repeatable guardrails over larger prompt stacks.
ChatGPT now saves uploaded and generated files into an account-level Library that can be reused across conversations from the web sidebar or recent-files picker. It removes repetitive re-uploading and makes past PDFs, spreadsheets, and images part of a persistent working context.
Nous Research said Hermes Agent crossed 10,000 stars, while users reported easy migrations from OpenClaw and stable long-running use. If you test it, focus on persistent memory, MCP browser control, and delegation behavior under real workloads.
Agent Flywheel lays out a planning-first workflow built on beads, agent mail, swarms, and TUI inspection for very large coding runs. It is useful because the guide exposes coordination primitives and review loops, not just benchmark screenshots.
Anthropic rolled Projects into Cowork on the Claude desktop app, giving each project its own local folder, persistent instructions, and import paths from existing work. It makes Cowork more practical for ongoing tasks, though teams should test current folder-location limits.
Next.js 16.2 adds version-matched AGENTS.md docs, a terminal browser for inspecting running apps, browser-error forwarding, and a dev-server lock file. It gives coding agents better frontend context and cuts duplicate-server and client-side debugging waste.
LlamaIndex open-sourced LiteParse, a model-free local parser for 50+ document types that preserves layout well enough for agent workflows. Use it as a fast first pass before expensive OCR or VLM parsing, especially when you need table structure and local execution.
Google rolled out a redesigned Stitch workspace that accepts text, code, PRDs, and images on a spatial canvas, then generates prototypes and portable DESIGN.md files. Teams testing AI-native UI workflows can use it to try a tighter design-to-code loop in the live product.
Morph released FlashCompact, a specialized compaction model and SDK for coding agents, claiming 33k tokens per second and near-invisible long-context compression. Use it or copy the approach if compaction latency and noisy tool output are blocking longer agent runs.
Geoffrey Huntley published a four-loop Ralph workflow for porting codebases by turning tests and source into cited specs before implementation. Try it when you need AI help translating a mature codebase across languages without losing behavioral coverage.
ACE open-sourced ace-platform so teams can self-host its playbook-based context engineering workflow instead of relying only on the hosted service. Use it if you want MCP-compatible prompting infrastructure with your own Postgres, Redis, and deployment controls.
supermemory launched a CLI that exposes platform actions directly to agents and added scoped agent access with tag-level permissions plus audit logs. Use it to wire memory into agent loops without granting full account access.
Keep added an in-app feed reader so saved links can be read directly inside its bookmark store for agent workflows. Use it to turn bookmarks, RSS feeds, and markdown exports into reusable context instead of scattered tabs.
Anthropic made 1M-token context generally available for Opus 4.6 and Sonnet 4.6, removed the long-context premium, and raised media limits to 600 images or PDF pages. Use it for retrieval-heavy and codebase-scale workflows that previously needed beta headers or special long-context pricing.
Claude Code added /btw, a side-channel prompt that can inspect the current session without interrupting the main task. Use it to ask clarifying questions mid-run without polluting history or triggering extra tool work.
OpenAI detailed how repo-local skills, AGENTS.md, and GitHub Actions now drive repeatable verification, release, and pull request workflows across its Agents SDK repositories. Maintainers can copy the pattern to reduce prompt sprawl and keep agent behavior closer to the codebase.