Arize AI

Deep Papers

Explore insightful discussions on pivotal AI research and innovations in this podcast hosted by Arize AI founders. Discover the minds and methods shaping machine learning.

Recent Episodes

Dec 23, 2024

LLMs as Judges: A Comprehensive Survey on LLM-Based Evaluation Methods

29 mins

Dec 10, 2024

Merge, Ensemble, and Cooperate! A Survey on Collaborative LLM Strategies

29 mins

Nov 23, 2024

Agent-as-a-Judge: Evaluate Agents with Agents

25 mins

Nov 12, 2024

Introduction to OpenAI's Realtime API

30 mins

Oct 29, 2024

Swarm: OpenAI's Experimental Approach to Multi-Agent Systems

47 mins

Language: English
Country: Argentina
Feed Host