Skip to content
KI-Werkzeug

LMSys Chatbot-Arena

Eine offene Plattform zur Bewertung und zum Vergleich großer Sprachmodelle durch crowdsourced Duelle. Vergleichen Sie GPT-4, Claude, Gemini und weitere Modelle direkt nebeneinander.

shipped 25. Nov. 2025chatbotfreemium
LMSys Chatbot Arena - AI tool hero image
1Sure! Please provide the text you would like me to translate into German.
2Of course! Please provide the text you would like me to translate into German.
3The term "benchmark" can be translated into German as "Referenzwert" or "Benchmark." The choice depends on the context in which it is used. If you have a specific sentence or context in mind, feel free to share!

LMSys Chatbot Arena at a Glance

Best For
chatbot, LLM, benchmark
Pricing
freemium
Key Features
Rebranded to LMArena in January 2026, consolidating evaluation projects under lmarena.ai. · GPT-5.4 achieved an Elo rating of 1502 on March 5, 2026, marking significant leaderboard shifts. · Collected over 6.3 million votes across more than 200 models by March 2025.
Alternatives
Arena AI, ChatComparison.ai, Hugging Face Open LLM Leaderboard, LiveBench

Ähnliche Tools

Alternativen vergleichen

Andere Tools, die Sie in Betracht ziehen könnten

1

Arena AI

It provides an official AI ranking and LLM leaderboard shaped by a community that chats, compares, and votes on AI models through real-world evaluation.

Besuchen
2

ChatComparison.ai

It allows users to instantly view side-by-side pricing, speed, and performance of various AI models to pick the best fit for their use case.

Besuchen
3

Hugging Face Open LLM Leaderboard

It serves as a central, transparent platform for independently evaluating and benchmarking open-weights AI models against rigorous frameworks.

Besuchen
4

LiveBench

It offers a contamination-free LLM benchmark with regularly released new questions that have verifiable, objective ground-truth answers, removing the need for an LLM judge.

Besuchen

overview

Überblick

Eine offene Plattform zur Bewertung und zum Vergleich großer Sprachmodelle durch crowdsourcing-basierte Wettbewerbe. Vergleichen Sie GPT-4, Claude, Gemini und weitere Modelle nebeneinander.

competitors

Alternatives & Competitors

1
Arena AI

It provides an official AI ranking and LLM leaderboard shaped by a community that chats, compares, and votes on AI models through real-world evaluation.

Similar to LMSys Chatbot Arena, Arena AI focuses on crowdsourced evaluation and a public leaderboard, but it also extends to image and code models, not just chatbots.

2
ChatComparison.ai

It allows users to instantly view side-by-side pricing, speed, and performance of various AI models to pick the best fit for their use case.

Unlike LMSys Chatbot Arena's 'battle' format, ChatComparison.ai emphasizes direct side-by-side comparison of model outputs, pricing, and performance metrics, helping users optimize their workflows and reduce AI costs.

3
Hugging Face Open LLM Leaderboard

It serves as a central, transparent platform for independently evaluating and benchmarking open-weights AI models against rigorous frameworks.

While both provide LLM rankings, Hugging Face's leaderboard focuses on standardized, framework-based evaluation of open-source models, whereas LMSys Chatbot Arena primarily uses crowdsourced human preference battles for a broader range of models.

4
LiveBench

It offers a contamination-free LLM benchmark with regularly released new questions that have verifiable, objective ground-truth answers, removing the need for an LLM judge.

LiveBench differentiates from LMSys Chatbot Arena by focusing on objective, ground-truth based evaluation and regularly updated, contamination-free benchmarks, rather than subjective crowdsourced human preferences.

For builders

This page is doing a job for someone else’s tool.

AI agents read it. Buyers find it. Backlinks accrue. Your tool can have one too — live in 24 hours, indexed by Claude, ChatGPT, and Perplexity, queryable via MCP.