overview
What is LMSys Chatbot Arena?
LMSys Chatbot Arena is an AI evaluation tool developed by LMSYS and UC Berkeley SkyLab that enables AI enthusiasts, developers, and researchers to evaluate and compare large language models through crowdsourced battles. It provides an open, community-driven platform for live LLM evaluation through anonymous, randomized pairwise comparisons by human users. The platform, now rebranded as Arena (arena.ai), facilitates blind, pairwise comparisons of AI chatbots through user votes, generating a dynamic leaderboard based on an Elo-like rating system. This web-based interface allows users to interact with two anonymous LLMs simultaneously, posing prompts and then voting for the better response or declaring a tie, thereby gathering human preferences to assess conversational quality and helpfulness.