LLM Digest

Live feed

AI news for platform & agent engineers

Ranked signal · finite reading

The AI brief that ends.

One shared ranking. Scan what changed, save what matters, and stop when the finish line appears.

Today's top signals

langchain.com · 2026-07-02

OpenWiki: Open Source Repo Documentation for Coding Agents

OpenWiki generates and maintains codebase documentation so coding agents can find the repo context they need without loading everything into one instruction file. Context & related coverage →

simonwillison.net · 2026-07-05

sqlite-utils 4.0rc2, mostly written by Claude Fable (for about $149.25)

I wrote about the sqlite-utils 4.0rc1 release a couple of weeks ago. Since we only have Claude Fable on our Max subscriptions for a few more days, I decided to see if it could help me get to a 4.0 stable release that... Context & related coverage →

arxiv.org · 2026-07-02

TestEvo-Bench: An Executable and Live Benchmark for Test and Code Co-Evolution

Software tests and code evolve together: a code change should be followed by new or updated tests that record the new software behavior. Yet existing test generation and update benchmarks often isolate the test from t... Context & related coverage →

arxiv.org · 2026-07-02

Understanding Agent-Based Patching of Compiler Missed Optimizations

Compiler missed optimizations refer to cases in which compilers failed to optimize certain code. It takes many compiler developers' efforts to implement or patch such missed optimizations. In this paper, we present a... Context & related coverage →

simonwillison.net · 2026-07-02

llm-coding-agent 0.1a0

Release: llm-coding-agent 0.1a0 Another Fable 5 experiment. Now that my LLM library has evolved into more of an agent framework it's time to see what a simple coding agent would look like built on it. I started a new... Context & related coverage →

langchain.com · 2026-06-30

Harbor x LangChain: A Unified Stack for Evaluating Agents

Evaluating long-running, stateful agents needs a new kind of runner. Here's how Deep Agents, LangSmith sandboxes, and observability plug into Harbor. Context & related coverage →

arxiv.org · 2026-07-02

DemoPSD: Disagreement-Modulated Policy Self-Distillation

On-policy self-distillation (OPSD) has emerged as a practical method for training large language models (LLMs) to reason, where a single model acts as both the teacher and the student with different levels of informat... Context & related coverage →

huggingface.co · 2026-06-30

ScarfBench: Benchmarking AI Agents for Enterprise Java Framework Migration

Context & related coverage →

infoq.com · 2026-07-03

Cloudflare Details Unified Data Platform Where Billing Workloads Account for 53% of Queries

Cloudflare details Town Lake, an internal unified data platform, and Skipper, an AI analytics agent unifying access to operational, billing, security, and business data. The platform processed ~91K billing queries, wi... Context & related coverage →

github.com · 2026-07-03

claude-code v2.1.200

Changed AskUserQuestion dialogs to no longer auto-continue by default; opt into an idle timeout via /config · Changed the "default" permission mode to "Manual" across the CLI, --help , VS Code, and JetBrains; --permis... Context & related coverage →

anthropic.com · 2026-06-30

Introducing Claude Sonnet 5

Our most agentic Sonnet yet, with top-tier intelligence for coding and everyday professional work. Context & related coverage →

infoq.com · 2026-07-03

Presentation: Fine Tuning the Enterprise: Reinforcement Learning in Practice

The speakers discuss Agent RFT, OpenAI’s platform for fine-tuning reasoning models via real-time tool interactions and custom reward signals. They explain how reinforcement learning solves complex credit assignment ch... Context & related coverage →

Prefer it summarized? Read the daily recap →

The finishable AI feed for platform & agent engineers LLM Digest is a low-hype, ranked daily brief of AI news for platform and infrastructure engineers — model releases, frontier-lab research, inference and serving updates, agent tooling, and selected papers. One shared, transparent ranking for everyone; no engagement-optimized infinite scroll. Above is a static snapshot of the current top items; the live, filterable feed needs JavaScript. These pages are fully readable without it:

Daily recap — what changed in AI today, in 10 minutes.

Weekly recap — what you missed this week.

Storylines — follow a developing story day by day.

Playbook — actionable cards: the problem, what to apply, the expected result.

Knowledge map — agent-engineering obstacles mapped to solutions.

Foundations — evidence-tiered explanations behind agent-building practice.

Voices — influential AI engineers and their writing.

Email digest · JSON feed