Media Summary: Join the Blog and follow on social handles for engaging conversations about Software Architecture and Tech. Shishir Patal, a Research Scientist at Meta, delivered a presentation on For more information about Stanford's graduate programs, visit: November 21, ...

How To Evaluate Ai Agents - Detailed Analysis & Overview

Join the Blog and follow on social handles for engaging conversations about Software Architecture and Tech. Shishir Patal, a Research Scientist at Meta, delivered a presentation on For more information about Stanford's graduate programs, visit: November 21, ... Today, I want to share a new episode with Aman Khan. The best way to learn about Description Today we explore multi-layered In this video we take a look at Ragas, a Python package made for

Photo Gallery

LLM as a Judge: Scaling AI Evaluation Strategies
How to Evaluate AI Agents ?
AI Agents, Clearly Explained
Agentic Evals by Shishir Patil
Stanford CME295 Transformers & LLMs | Autumn 2025 | Lecture 8 - LLM Evaluation
The 100% EASIEST Way to Test LLMs & AI Agents (Seriously)
Evaluating and Debugging Non-Deterministic AI Agents
Complete Beginner's Course on AI Evaluations in 50 Minutes (2025) | Aman Khan
How to Systematically Setup LLM Evals (Metrics, Unit Tests, LLM-as-a-Judge)
Practical Techniques for Evaluating AI Agents - May 26, 2026
How to evaluate agents in practice
Evaluate AI Agents in  Python with Ragas
View Detailed Profile
LLM as a Judge: Scaling AI Evaluation Strategies

LLM as a Judge: Scaling AI Evaluation Strategies

Ready to become a certified watsonx

How to Evaluate AI Agents ?

How to Evaluate AI Agents ?

Join the Blog and follow on social handles for engaging conversations about Software Architecture and Tech.

AI Agents, Clearly Explained

AI Agents, Clearly Explained

My

Agentic Evals by Shishir Patil

Agentic Evals by Shishir Patil

Shishir Patal, a Research Scientist at Meta, delivered a presentation on

Stanford CME295 Transformers & LLMs | Autumn 2025 | Lecture 8 - LLM Evaluation

Stanford CME295 Transformers & LLMs | Autumn 2025 | Lecture 8 - LLM Evaluation

For more information about Stanford's graduate programs, visit: https://online.stanford.edu/graduate-education November 21, ...

The 100% EASIEST Way to Test LLMs & AI Agents (Seriously)

The 100% EASIEST Way to Test LLMs & AI Agents (Seriously)

Learn how to professionally

Evaluating and Debugging Non-Deterministic AI Agents

Evaluating and Debugging Non-Deterministic AI Agents

Evaluate

Complete Beginner's Course on AI Evaluations in 50 Minutes (2025) | Aman Khan

Complete Beginner's Course on AI Evaluations in 50 Minutes (2025) | Aman Khan

Today, I want to share a new episode with Aman Khan. The best way to learn about

How to Systematically Setup LLM Evals (Metrics, Unit Tests, LLM-as-a-Judge)

How to Systematically Setup LLM Evals (Metrics, Unit Tests, LLM-as-a-Judge)

Want to learn real

Practical Techniques for Evaluating AI Agents - May 26, 2026

Practical Techniques for Evaluating AI Agents - May 26, 2026

Description Today we explore multi-layered

How to evaluate agents in practice

How to evaluate agents in practice

Evaluating Agents

Evaluate AI Agents in  Python with Ragas

Evaluate AI Agents in Python with Ragas

In this video we take a look at Ragas, a Python package made for

Beginner's Guide to Agent Evaluations

Beginner's Guide to Agent Evaluations

When companies deploy their