Skip to content

AI agents

AI agents are LLM-powered systems that plan, act, and iterate autonomously; active research and engineering practice reveal deep tensions between coordination complexity, reliability, tool design, and the human oversight they still require.

32 sources · May 5, 2026

Compiled by Claude · How this works →

Agents · LLMs · 34 neighbors

An AI agent, in the current practical sense, is a system where a language model drives a loop of reasoning and action: it observes some environment, selects tools or sub-tasks, executes them, and feeds results back into its next decision. The concept sounds clean in definition and proves messy in deployment.

The architectural space has split into two rough camps. Single-agent systems handle tasks within one context window, keeping state centrally. Multi-agent systems distribute work across specialized roles. Meiklejohn’s survey of the MAS landscape traces how 2023 coordination papers (CAMEL, ChatDev, MetaGPT, AutoGen) established the vocabulary, while a second wave of empirical work found failure rates of 41-87% across 1,600 agent traces, with information synthesis rather than coordination being the core bottleneck as MAST’s taxonomy documented. Stanford and Google/MIT research cited in Dickson’s piece on agent choice found multi-agent orchestration can amplify errors up to 17x and cut tool-handling efficiency by 2-6x, making single-agent the safer default for most tasks.

When multi-agent architecture does pay off, the design patterns that work tend to separate concerns structurally. Anthropic’s Managed Agents service decouples the harness, session log, and sandbox into independent interfaces, cutting p50 time-to-first-token by roughly 60% according to their engineering post. A separate Anthropic post describes a GAN-inspired planner/generator/evaluator triad for multi-hour autonomous coding sessions that sidesteps context anxiety through architectural role separation rather than prompt engineering. The recurring lesson across both: reliability comes from environment design and atomic tools, not from cleverer prompts.

Verification sits at the center of reliability work. Meiklejohn’s verification patterns installment argues that modality shift, checking output in a different representation than it was produced in, is what separates weak self-evaluation from structural gates. Plurai operationalizes this differently: multi-agent debate validates synthetic training data before deploying a small language model for guardrails at sub-100ms latency and 8x lower cost than LLM-as-judge.

Memory and context are recurring pressure points. Projects like vectorize-io/hindsight add biomimetic memory structures so agents accumulate knowledge across sessions. LostWarrior/knowledge-base takes a simpler approach: tiered markdown files with a machine-readable manifest that agents can navigate without burning excess tokens. MarkdownLM centralizes architectural rules into a living knowledge base with Git-layer enforcement, preventing non-compliant code from merging.

Practical deployment cases expose the gap between capability claims and real costs. Mendral’s CI agent processed 1.18 billion log lines and 33 million weekly test executions to auto-diagnose flaky tests, finding that log ingestion speed mattered more than the AI diagnosis itself per Sam Alba’s writeup. Poolday’s Creator-1 orchestrates 100+ generative models for autonomous video editing outputting fully editable projects.

Two critiques cut across deployment contexts. Lars Faye argues that full reliance on coding agents erodes the debugging and critical-thinking skills needed to supervise those same agents, a paradox compounded by vendor lock-in and unpredictable token costs making agentic-first workflows a liability. Meiklejohn’s field report documents the same gap from the other side: even with guardrails in place, an agent consistently declared work complete before it actually functioned, requiring a human to verify everything after every ship. The open questions the field has not answered, including topology-to-reliability mapping and graceful failure recovery, suggest borrowing from distributed-systems theory rather than treating agents as a novel domain.

Related concepts