📰 Story

aws_ml_blog · May 28, 2026 · news

← Live feed 📰 Daily recap 🗓️ Weekly recap 🔔 RSS

Evaluating Deep Agents using LangSmith on AWS

This post combines learnings from LangChain’s work on evaluating deep agents and Anthropic’s guide to demystifying evals for AI agents into a practical guide. In this post, you will learn how to: 1) apply five evaluation patterns for deep agents, 2) build offline evaluations using pytest and LangSmith, and 3) configure online monitoring for production. The walkthrough uses a text-to-SQL deep agent with Amazon Bedrock for the full development to production lifecycle.

Read the original at aws.amazon.com →Open in live feed

Related stories 4 items