Skip to content
AI Primer
TOPIC50 stories

Context Engineering

How agents gather, compress, structure, and use context.

RELEASE27th June
Junior adds memory and cuts one analytics task from 3m to 1m

Junior’s first memory system cut one analytics task from about 3 minutes to 1 minute in early tests, with tokens down two-thirds and tool calls down 60%. The feature moves persistent task learning into the agent loop, though the results are still internal.

WORKFLOW1w ago
Developers publish loop libraries and control-loop guides for long-running agents

Builders released reusable loop artifacts this week, including a Loop Library Skill, repo templates, and published control-loop definitions for docs sweeps, onboarding checks, and error triage. It matters because teams are turning one-shot prompting into persistent agent runs with explicit stop conditions and shared repo state.

RELEASE1w ago
Plannotator 0.21.0 adds direct file editing, embedded CLI agents, and Bedrock support

Plannotator 0.21.0 added direct document editing, an embedded CLI agent for co-context, first-class HTML handling, and Bedrock and Vertex support. It matters because the tool is moving from annotation-only review toward persistent IDE-like agent sessions with shared file diffs and cloud model backends.

NEWS1w ago
Perplexity Computer adds Brain context graph with +25% correctness on memory-heavy tasks

Perplexity rolled out Brain, a self-updating context graph that carries prior sessions, files, and decisions into new Computer tasks. In research preview for Max users, it matters because Perplexity says the memory layer improves correctness and recall while lowering per-task cost on history-dependent work.

RELEASE1w ago
Factory adds AutoWiki: /wiki generates repo docs on every push

Two days after Factory 2.0 launched, Factory added AutoWiki, which turns a repo into structured engineering docs that update on every push and can be generated from Droid with /wiki. The feature keeps architecture and request-path context aligned with the live codebase.

RELEASE1w ago
Z.ai releases GLM-5.2 open weights with 1M context and 46.2% DeepSWE

Z.ai released GLM-5.2 MIT-licensed open weights with 1M context and broad runtime support. Vendor and arena results put it near frontier closed models on long-horizon coding.

WORKFLOW2w ago
Codex supports agent-written `/goal` prompts for spawned threads

Codex users are having the agent write its own `/goal` and sub-agent goals, with OpenAI-side commentary describing that as a built-in meta-prompting pattern. The workflow turns long autonomous runs into a tighter control loop, but users still review goals first so a bad objective does not burn tokens for hours.

WORKFLOW2w ago
Practitioners report Fable 5 planner workflows with Opus, Codex, and HTML logs

Users are using Fable 5 as a planner and long-run orchestrator while pushing implementation and heavy reasoning to Opus and Codex. The setup keeps Fable on supervision and planning, so teams can track execution through live status pages on larger tasks.

WORKFLOW2w ago
Hyperbrowser, InsForge, and Higgsfield release Fable 5 harnesses and MCP workflows

Hyperbrowser shipped a Claude Code harness, InsForge showed a Fable run drop from 5.5M to 2.3M tokens, and Higgsfield published new MCP workflows. These tools add reusable harness, context, and interface layers around Fable for more controlled runs.

WORKFLOW2w ago
/teach adds npx install and primary-source lessons

Matt Pocock's /teach skill installs with npx skills add mattpocock/skills --skill teach and runs structured strategy lessons inside a Claude agent. Follow-up posts add primary-source reading to the lessons and point to a larger dedicated repo.

WORKFLOW2w ago
Anthropic updates Fable prompting with /model, high effort, and /goal loops

Anthropic published Fable-specific guidance for Claude Code and API, emphasizing the /model switch, higher default effort, simpler prompts, and /goal-style verification loops. The Claude Code team says older prompt scaffolds can work against Fable's longer sessions.

WORKFLOW3w ago
Claude Code users report auto mode, dynamic workflows, and critique loops finding 144 bugs

Practitioners shared repeatable setups for multi-hour Claude runs using auto approvals, dynamic workflows, cloud sessions, and critique loops. One large-codebase sweep reported 144 bugs fixed in about four hours with fewer false positives under model critique.

WORKFLOW3w ago
Codex users compare iOS dictation, multi-thread UX, and long-context prompts

Codex usage moved further into phone-first workflows, with iOS dictation loops, background voice capture, and app updates like searchable settings and restored state. The comparisons still flag rough spots in multi-thread UX, Windows support, and cases where CLI tabs or cloud agents are easier to manage.

WORKFLOW3w ago
Codex /goal template adds 6 fields for verification commands and stop conditions

A community workflow broke long-running Codex goals into six required fields, then added an eight-item preflight checklist and helper tools. The structure is meant to reduce runs that drift, stop early, or claim completion without an objective verification step.

NEWS3w ago
ChatGPT adds memory summaries and 2x memory in Dreaming V3 rollout

OpenAI rolled out a more capable ChatGPT memory system that keeps context across conversations, shows a reviewable memory summary, and doubles memory for US Plus and Pro users. The change matters because persistent context becomes a first-class product feature with explicit controls instead of a static saved-memories note list.

NEWS3w ago
Weaviate launches Engram memory service with async writes

Weaviate introduced Engram, a dedicated agent memory service with async writes, semantic topic grouping, tenant scopes, and composable pipelines. It matters because teams can add a hosted memory layer for agent stacks without stitching custom memory workflows into each application.

NEWS3w ago
GitHub Copilot adds 1M context window and reasoning levels

GitHub Copilot now supports a 1M-token context window and configurable reasoning levels in VS Code, Copilot CLI, and the Copilot app. It matters because larger repo context and explicit effort controls arrive in the same surfaces teams already use for agent-assisted development.

WORKFLOW4w ago
Codex community ships /dynamic swarms, session lifecycles, and model routing

Builders added /dynamic orchestration, custom-model routing, and repo runbooks around Codex as users exposed new session lifecycle controls in the app. That makes Codex a better fit for long-running, multi-context coding work.

NEWS4w ago
Claude Opus 4.8 adds mid-conversation system messages without breaking prompt cache

Opus 4.8 can accept new system-role instructions after a user turn while keeping earlier prompt segments cacheable. That lets long-running agents update constraints mid-loop without replaying the full system prompt on every call.

WORKFLOW4w ago
Developers ship Chrome MCP, repo-graph search, and token compression for Claude Code and Codex

Independent developers released browser-control MCP tooling, repo-context graphing and packaging utilities, and token-compression helpers for coding agents. The cluster matters because agent workflows are now adding browser control, context packing, and cost controls as external infrastructure instead of waiting on raw model upgrades alone.

WORKFLOW4w ago
Researchers and builders ship external memory layers with recipe stores and 33% cheaper updates

A new MeMo paper and several community memory systems converged on keeping knowledge outside the base model through recipe files, semantic and autobiographical stores, and background reconsolidation. The pattern matters because engineers are treating context loss as a systems problem instead of only asking for larger context windows.

WORKFLOW1mo ago
Codex users share /goal audits, mobile delegation, and Raspberry Pi workflows

Practitioners published reusable Codex workflows for project audits, memory-driven skill packaging, mobile delegation, and remote computer use. Try the prompt-and-steps patterns if you want to adapt Codex across repos and devices.

WORKFLOW1mo ago
Agent Skills ecosystem ships handoff docs, htmx v4 packs, and Project Think support

Independent builders published reusable skills infrastructure across coding agents, including Project Think preview support, handoff docs, and an htmx v4 skill pack. That matters because skills are starting to work like portable workflow units instead of one-off prompt snippets inside a single tool.

WORKFLOW1mo ago
Codex users ship durable-memory workspaces and auto-triage flows

Independent Codex users published Obsidian memory setups, reusable skill prompts, auto-triage flows, and Cloudflare-backed runners for longer jobs. That matters because Codex is being wrapped into persistent workspaces and operator-defined subagents instead of one-shot chats.

WORKFLOW1mo ago
Codex users report better compaction and Colab control after v0.133.0

Developers say Codex v0.133.0 improved compaction, remote-control workflows, and Chrome-driven Colab runs after `/goal` became default. The same update window also brought easier skill discovery and new diff options, though some users saw approval-pause regressions in full-access mode.

NEWS1mo ago
Claude Console adds prompt cache-miss diagnostics with per-message and per-tool token costs

Claude Console now shows which message, system prompt, tool, or model change caused a cache miss and how many tokens it cost. That matters because teams can trace prompt-cost regressions to specific edits instead of debugging cache churn blind.

RELEASE1mo ago
Notion launches Developer Platform with External Agents API and Workers

Notion opened a developer platform with an External Agents API plus Workers, webhooks, and a headless CLI. The release lets external agents query Notion, extend workflows, and stay in sync with other systems.

WORKFLOW1mo ago
Developers launch Markdown Experience Guidelines and HTML explainer tools for agent output

Independent builders shipped new specs and tools for richer agent output, including Markdown Experience Guidelines and Plannotator’s HTML render-and-annotate flow, while practitioners shared repeatable HTML explainer workflows. The cluster matters because teams are moving beyond raw Markdown toward output surfaces that can carry charts, interaction, review context, and persistent knowledge-base structure.

WORKFLOW1mo ago
Claude Code adds `frontend-slides` for HTML briefs and publishable slides

A day after HTML artifacts surfaced as a Claude Code workflow, Anthropic promoted a `frontend-slides` plugin with direct install commands and artifact publishing. The rollout sharpened a real workflow split: teams are using HTML for human review and demos, while keeping markdown or MDX for token-efficient agent context.

WORKFLOW1mo ago
Claude Code users report HTML artifacts improve PR review, dashboards, and visual explainers

A cluster of Claude Code users, guides, and companion tools shifted from Markdown toward HTML artifacts for code review, dashboards, and explainer pages. The pattern matters because richer outputs are easier to inspect and share during long agent workflows, though several builders note the token cost is materially higher than Markdown.

RELEASE1mo ago
Anthropic launches Claude Managed Agents with Dreaming, Outcomes, and multiagent orchestration

Anthropic added Dreaming in research preview plus public-beta Outcomes, multiagent orchestration, and webhooks to Claude Managed Agents. Teams should try the new grader loops and shared-container sub-agents if they want more control over long-running agent work.

WORKFLOW1mo ago
Practitioners report harness playbooks with Playwright CLI, create_agent, and MCP

Builders shared concrete Symphony, create_agent, and MCP setup guides after arguing that model switching is easy but harness switching is not. The playbooks matter because they make harness engineering more repeatable, so teams can copy tested tooling and integration patterns.

RELEASE2mo ago
GitHub Copilot adds semantic indexing to all workspaces and cross-repo search in @code

GitHub expanded semantic indexing beyond GitHub and Azure DevOps remotes, so Copilot can search across more workspace types and repositories inside @code. That improves agent context retrieval in local workflows, while the same release also adds chat-history recall and prompt-eval tooling.

WORKFLOW2mo ago
mattpocock/skills ranks #1 on GitHub at 28K stars with `/grill-me` and `/tdd` packs

mattpocock/skills hit the top of GitHub Trending as reusable `SKILL.md` packs for grilling specs, writing PRDs, and enforcing TDD spread across coding-agent workflows. The format is starting to look like a distribution layer for agent behavior, with faster install tooling and third-party skills shipping around the same pattern.

RELEASE2mo ago
OpenAI Codex adds Chronicle screen memories in macOS Pro preview

OpenAI added Chronicle, a Codex preview that turns recent screen context into reusable memories for errors, files, docs, and workflows. The macOS Pro-only feature stores local memory unencrypted and can burn rate limits quickly, so watch prompt-injection risk before relying on it.

WORKFLOW2mo ago
Claude Design users report AGENTS.md backends and Claude Code handoffs one day after launch

Builders say they are already passing Claude Design prototypes into Claude Code, wiring live backends through AGENTS.md, and recreating the flow in custom orchestrators. Try this if you want a faster handoff from mockup to full-stack build, but expect early workflow rough edges.

RELEASE2mo ago
OpenClaw 2026.4.15 adds Opus 4.7 support and bounded memory reads

OpenClaw 2026.4.15 adds Anthropic Opus 4.7, bundled Gemini TTS, bounded memory reads, and transport self-heal fixes. The release targets context and reliability issues users had been reporting this week.

RELEASE2mo ago
Codex 0.120 adds per-project memory extensions and Realtime V2 streaming

Codex 0.120 introduced per-project memory extension files and Realtime V2 progress streaming for background agents. Separate app findings also showed an unreleased Scratchpad view that can start parallel Codex chats from a task list, which may change how teams queue work.

NEWS2mo ago
Hermes Agent adds MiniMax M2.7 and MiMo V2 Pro through partner integrations

Nous Research added MiniMax M2.7, Xiaomi’s MiMo V2 Pro, a SuperMemory plugin, and expanded Manim support to Hermes through partner integrations. The additions give users new hosted model options, a shared memory backend, and more complete technical-animation tooling to try in workflows.

RELEASE2mo ago
OpenClaw 2026.4.7 adds a headless inference hub, memory-wiki, and webhook TaskFlows

OpenClaw 2026.4.7 adds a headless inference hub, memory-wiki, session branch and restore, and webhook-driven TaskFlows. Composio also shipped a CLI for secure app authentication, so users can expand OpenClaw from a local coding harness into a broader agent runtime.

WORKFLOW2mo ago
Bram Cohen compares vibe coding with AI Level 6 workflows after Claude Code leak

Bram Cohen used the Claude Code leak to argue that prompt-only development produces bad software, while a separate 250-hour syntaqlite build said the durable version arrived only after a Python-to-Rust rewrite. Practitioners say specs, tests, linters, repo skills, and codebase context are the controls that keep coding agents maintainable.

RELEASE2mo ago
Hermes Agent adds /claude-code orchestration and cron hooks

Hermes Agent added direct /claude-code orchestration and cron-time script hooks, and the team also shipped Hermes-focused datasets and agent-tuned model variants. The update turns Hermes into a harness that can steer Claude Code and inject recurring context automatically.

RELEASE2mo ago
Clawback releases Claude Code hook layer for stop-checks and PostToolUse enforcement

Clawback turns leaked Claude Code verification patterns into stop, pre-tool, post-tool, and post-compaction hooks. It replaces prompt-only guardrails with deterministic checks and shows how fast the source-map leak is becoming third-party control layers.

RELEASE2mo ago
Hermes Agent v0.7.0 adds memory plugins, credential pools, and Camofox browsing

Nous released Hermes Agent v0.7.0 with an extensible memory plugin system, rotating credential pools, Camofox browser support, and inline diff previews. The update pushes the framework closer to production use, so teams can test the new failure handling and interface split before adopting it.

WORKFLOW3mo ago
Claude Code adds /loop, /teleport, and /batch workflow guidance in Boris Cherny guide

A Boris Cherny guide maps Claude Code mobile sessions, /teleport, /loop, hooks, worktrees, /batch, and custom agents into one workflow set. Use it to turn scattered commands into repeatable patterns for long-running coding sessions across terminal, desktop, and cloud.

WORKFLOW3mo ago
Claude Code guides compare `.claude/` commands, agents, and global rules

Two new guides map how Claude Code teams are using `.claude/`, `CLAUDE.md`, commands, agents, skills, and global rules. The overlap matters because commenters favor short instructions and a small number of repeatable guardrails over larger prompt stacks.

NEWS3mo ago
ChatGPT adds Library tab for reusable file uploads across conversations

ChatGPT now saves uploaded and generated files into an account-level Library that can be reused across conversations from the web sidebar or recent-files picker. It removes repetitive re-uploading and makes past PDFs, spreadsheets, and images part of a persistent working context.

NEWS3mo ago
Hermes Agent reports 10,000 GitHub stars with memory and delegation tools

Nous Research said Hermes Agent crossed 10,000 stars, while users reported easy migrations from OpenClaw and stable long-running use. If you test it, focus on persistent memory, MCP browser control, and delegation behavior under real workloads.

WORKFLOW3mo ago
Agent Flywheel introduces beads-and-swarms workflow for 1,000 commits a day

Agent Flywheel lays out a planning-first workflow built on beads, agent mail, swarms, and TUI inspection for very large coding runs. It is useful because the guide exposes coordination primitives and review loops, not just benchmark screenshots.

RELEASE3mo ago
Claude adds Projects to Cowork desktop with local folders and one-click imports

Anthropic rolled Projects into Cowork on the Claude desktop app, giving each project its own local folder, persistent instructions, and import paths from existing work. It makes Cowork more practical for ongoing tasks, though teams should test current folder-location limits.

AI PrimerAI Primer

Your daily guide to AI tools, workflows, and creative inspiration.

© 2026 AI Primer. All rights reserved.