Book: Evaluation-Driven Agentic Systems, Ethan Tyson

Evaluation-Driven Agentic Systems

From Design to Deployment

Author: Ethan Tyson
Language: English
Binding: Paperback
Availability: Available from supplier
Ships in 9-15 days
79.59

Book information

Author: Ethan Tyson
Language: English
Binding: Paperback
Publication date: 2025
Pages: 188
EAN: 9798271152535
Enbook ID: 50577957
Weight: 337
Dimensions: 178 x 254 x 10

Full description

Evaluation-Driven Agentic Systems: From Design to Deployment equips AI practitioners, engineers, and product leaders with the tools, frameworks, and workflows to build autonomous agents that perform reliably, safely, and efficiently. In a landscape where agentic systems are tasked with planning, tool usage, multi-step workflows, and continuous adaptation, how can you ensure they meet business objectives, align with human expectations, and maintain operational integrity? This book provides a systematic, practical answer.

Through clear, tutorial-driven guidance, you will learn to implement Evaluation-Driven Development (EDD): a methodology that embeds evaluation at every stage of agent creation and deployment. From defining business-aligned evaluation goals to constructing scenario sets, designing metrics matrices, setting thresholds, and integrating evaluation into CI/CD pipelines, this book ensures agents are rigorously assessed before reaching production. It also covers advanced practices such as monitoring live agents, detecting drift, handling multi-agent interactions, and applying ethical and safety checks, ensuring your systems remain accountable and aligned over time.

Readers will gain practical skills and actionable insights to:

  • Translate business objectives and user requirements into measurable evaluation goals and success criteria.

  • Design comprehensive evaluation suites with normal, edge, adversarial, and load-testing scenarios.

  • Implement multi-dimensional metrics, dashboards, and thresholds to measure task success, planning efficiency, tool usage, and user alignment.

  • Integrate automated evaluation pipelines into CI/CD workflows for continuous monitoring and regression detection.

  • Handle agent updates, versioning, and emerging behaviors while maintaining alignment, safety, and governance.

  • Scale evaluation from single agents to multi-agent systems, ensuring robustness and reliability across complex workflows.

Each chapter combines hands-on code examples, templates, rubrics, and checklists with expert commentary, making it immediately applicable in real-world development and operational environments. The book empowers readers to confidently deploy agents that are tested, traceable, and consistently performant, avoiding common pitfalls and operational risk.

If you are designing autonomous systems, managing AI deployments, or building agentic workflows that require reliability, safety, and measurable impact, Evaluation-Driven Agentic Systems: From Design to Deployment is your essential, practical guide to building agents that meet today's complex requirements while preparing for the AI challenges of tomorrow.
