HELM Benchmark
Shares tags: build, data, eval datasets
Computer vision eval datasets with leaderboards.
Similar Tools
Other tools you might consider
HELM Benchmark
Shares tags: build, data, eval datasets
LMSYS Arena Hard
Shares tags: build, data, eval datasets
Lamini Eval Sets
Shares tags: build, data, eval datasets
Labelbox AI
Shares tags: build, data
overview
Computer vision eval datasets with leaderboards.
More on Stork
Other tools in this category, ranked by community signal
Lamini Eval Sets
🧩 Build
Vertical-specific prompts + answers for evals.
pgvector
🧩 Build
Postgres extension for vector indexes.
Faiss
🧩 Build
Library for building custom vector DB backends.
Datasaur
🧩 Build
Collaborative labeling for text, audio, and documents.
SuperAnnotate
🧩 Build
Annotation suite with QA and workforce tools.
SageMaker Feature Store
🧩 Build
Built-in feature store for Amazon SageMaker.
For builders
AI agents read it. Buyers find it. Backlinks accrue. Your tool can have one too — live in 24 hours, indexed by Claude, ChatGPT, and Perplexity, queryable via MCP.