Vercel Introduces Eve, an Open-Source Framework for Building AI Agents
Vercel's Eve organizes agent instructions, tools, and skills with a filesystem-based project structure aimed at building and operating agents in production.
13 articles · 4 categories
The finishable daily brief
Friday, Jun 26, 2026
13 articles · 4 categories
read top to bottom · then stop
In 30 seconds
Friday was about hardening the agent stack rather than any single launch. New building blocks landed for agent builders — Vercel open-sourced its Eve framework, BetterDB shipped a Valkey-native context layer for memory, and LangChain's Deep Agents leaned on prompt caching to cut token costs — while Stripe and InfoQ surfaced what it actually takes to run agents in production.
The louder thread, though, was trust and security. Google Cloud extended VPC Service Controls to fence in agentic traffic, Dapr 1.18 added cryptographically verifiable execution, and Simon Willison reported on 2,000 people trying to phish an AI assistant. Even OpenAI's GPT-5.6 Sol preview led with cybersecurity and its safety stack.
A wave of new primitives for agent builders — a production framework, a memory/context layer, cheaper inference, and local coding-agent tooling.
Vercel's Eve organizes agent instructions, tools, and skills with a filesystem-based project structure aimed at building and operating agents in production.
An open, Valkey-native context layer providing agent memory, semantic plus multi-tier caching, and typed retrieval that runs on any Valkey instance.
LangChain shows how Deep Agents uses prompt caching to cut LLM token costs by up to 80% across major providers with no extra configuration.
A coding-agent multiplexer built on the tenet that everything a user can do manually must also be exposed via CLI for agents and automation.
Real-world deployments and the friction they create: a regulated production architecture, and the review bottleneck AI-generated code is opening up.
Stripe's ReAct-based agent system for financial compliance, including the technical architecture and infrastructure decisions behind running it in production.
Michael Webster on how headless agents generate massive pull requests that bottleneck human reviewers and strain software delivery pipelines.
A sharp hypothetical incident report by Andrew Nesbitt in which two competing AI review agents collide on a downstream pull request — a cautionary tale for agent-driven CI.
The day's dominant thread: perimeter controls, verifiable execution, and hard data on whether agents can be phished — the trust layer around agents is filling in.
Simon Willison covers Fernando Irarrázaval's challenge: 2,000 people tried to leak secrets from an AI assistant via email, with surprising results on injection resistance.
Google Cloud extends VPC Service Controls so teams can put network-level perimeter guardrails around autonomous agents connecting across tools and datasets.
Dapr 1.18 adds verifiable execution — cryptographic trust, provenance, and tamper-evident records for distributed agents and workflows.
A look at constraining offensive security agents — where guardrails matter most as agents take on active, adversarial tasks.
A next-gen model preview that itself leans on security, plus fresh research on how easily agent behavior can be steered.
OpenAI previews GPT-5.6 Sol with stronger coding, science, and cybersecurity capabilities, paired with what it calls its most advanced safety stack.
A PNAS study finding that agent behavior shifts measurably in response to small nudges — a reliability signal worth weighing when designing agent prompts and environments.
You are caught up for this edition