Arize AI

Deep Papers

Explore insightful discussions on pivotal AI research and innovations in this podcast hosted by Arize AI founders. Discover the minds and methods shaping machine learning.

LLMs as Judges: A Comprehensive Survey on LLM-Based Evaluation Methods

29 mins • Dec 23, 2024

Charts

Apple Podcasts – United States – Mathematics: 9 (new)
Apple Podcasts – United Kingdom – Mathematics: 4 (new)
Apple Podcasts – Canada – Mathematics: 12 (decreased by 3)
Apple Podcasts – Australia – Mathematics: 7 (decreased by 1)
Apple Podcasts – Germany – Mathematics: 26 (new)

Recent Episodes

Dec 23, 2024

LLMs as Judges: A Comprehensive Survey on LLM-Based Evaluation Methods

29 mins

Dec 10, 2024

Merge, Ensemble, and Cooperate! A Survey on Collaborative LLM Strategies

29 mins

Nov 23, 2024

Agent-as-a-Judge: Evaluate Agents with Agents

25 mins

Nov 12, 2024

Introduction to OpenAI's Realtime API

30 mins

Oct 29, 2024

Swarm: OpenAI's Experimental Approach to Multi-Agent Systems

47 mins

Language

English

Country

Argentina
