Media Summary: LLM Agents are one of the most in-demand uses of large language models. This workshop, led by the expert founders of ... In this talk, Hugo Bowne-Anderson, an independent data and AI consultant, educator, and host of the podcasts Vanishing ... With nearly two-thirds of enterprise developers planning production deployments of large language models this year, LLM ...

How To Build Evaluate And - Detailed Analysis & Overview

LLM Agents are one of the most in-demand uses of large language models. This workshop, led by the expert founders of ... In this talk, Hugo Bowne-Anderson, an independent data and AI consultant, educator, and host of the podcasts Vanishing ... With nearly two-thirds of enterprise developers planning production deployments of large language models this year, LLM ... Want to learn real AI Engineering? Go here: Want to start freelancing? Let me help: ... Today we learn how to easily and professionally Ready to become a certified watsonx AI Assistant Engineer? Register now and use code IBMTechYT20 for 20% off of your exam ...

Welcome to an in-depth tutorial on RAGAS, your go-to framework for How do you test if an LLM is actually "friendly"? You can't run pytest.assert(email.is_friendly). This is the Hamel Husain and Shreya Shankar are back with the definitive guide to AI evals. Step-by-step walkthrough using real production ... This video introduces a new series on testing AI agents, focusing on why traditional This is a full walkthrough covering a typical prompt improvement workflow in Latitude.so. YC Group Partner Jared Friedman shares a framework for how to get and

Photo Gallery

How to Build, Evaluate, and Iterate on LLM Agents
How to Build and Evaluate AI systems in the Age of LLMs - Hugo Bowne-Anderson
Lessons from the Trenches: Building LLM Evals That Work IRL: Aparna Dhinkaran
How to Systematically Setup LLM Evals (Metrics, Unit Tests, LLM-as-a-Judge)
Evaluate LLMs in Python with DeepEval
LLM as a Judge: Scaling AI Evaluation Strategies
RAGAS: How to Evaluate a RAG Application Like a Pro for Beginners
Building an AI Judge: The Most Powerful (and Dangerous) Way to Evaluate LLMs
Evals 101 — Doug Guthrie, Braintrust
How to Build AI Evals in 2026 (Step-by-Step, No Hype)
The agent evaluation revolution
How to build, evaluate, and refine prompts with AI — Latitude
View Detailed Profile
How to Build, Evaluate, and Iterate on LLM Agents

How to Build, Evaluate, and Iterate on LLM Agents

LLM Agents are one of the most in-demand uses of large language models. This workshop, led by the expert founders of ...

How to Build and Evaluate AI systems in the Age of LLMs - Hugo Bowne-Anderson

How to Build and Evaluate AI systems in the Age of LLMs - Hugo Bowne-Anderson

In this talk, Hugo Bowne-Anderson, an independent data and AI consultant, educator, and host of the podcasts Vanishing ...

Lessons from the Trenches: Building LLM Evals That Work IRL: Aparna Dhinkaran

Lessons from the Trenches: Building LLM Evals That Work IRL: Aparna Dhinkaran

With nearly two-thirds of enterprise developers planning production deployments of large language models this year, LLM ...

How to Systematically Setup LLM Evals (Metrics, Unit Tests, LLM-as-a-Judge)

How to Systematically Setup LLM Evals (Metrics, Unit Tests, LLM-as-a-Judge)

Want to learn real AI Engineering? Go here: https://go.datalumina.com/iIO93Ps Want to start freelancing? Let me help: ...

Evaluate LLMs in Python with DeepEval

Evaluate LLMs in Python with DeepEval

Today we learn how to easily and professionally

LLM as a Judge: Scaling AI Evaluation Strategies

LLM as a Judge: Scaling AI Evaluation Strategies

Ready to become a certified watsonx AI Assistant Engineer? Register now and use code IBMTechYT20 for 20% off of your exam ...

RAGAS: How to Evaluate a RAG Application Like a Pro for Beginners

RAGAS: How to Evaluate a RAG Application Like a Pro for Beginners

Welcome to an in-depth tutorial on RAGAS, your go-to framework for

Building an AI Judge: The Most Powerful (and Dangerous) Way to Evaluate LLMs

Building an AI Judge: The Most Powerful (and Dangerous) Way to Evaluate LLMs

How do you test if an LLM is actually "friendly"? You can't run pytest.assert(email.is_friendly). This is the

Evals 101 — Doug Guthrie, Braintrust

Evals 101 — Doug Guthrie, Braintrust

Attendees will

How to Build AI Evals in 2026 (Step-by-Step, No Hype)

How to Build AI Evals in 2026 (Step-by-Step, No Hype)

Hamel Husain and Shreya Shankar are back with the definitive guide to AI evals. Step-by-step walkthrough using real production ...

The agent evaluation revolution

The agent evaluation revolution

This video introduces a new series on testing AI agents, focusing on why traditional

How to build, evaluate, and refine prompts with AI — Latitude

How to build, evaluate, and refine prompts with AI — Latitude

This is a full walkthrough covering a typical prompt improvement workflow in Latitude.so.

How to Get and Evaluate Startup Ideas | Startup School

How to Get and Evaluate Startup Ideas | Startup School

YC Group Partner Jared Friedman shares a framework for how to get and