Open Agent Leaderboard
Open Agent Leaderboard
Corpus Map
Display a treemap of languages and datasets
SmolAgents DA
Analyze your dataset with guided tools
CLIP Benchmarks
Display CLIP benchmark results for inference performance
Data Visualization Ai Excel Togetherai E2b
Analyze and visualize your dataset using AI
Tf Xla Generate Benchmarks
Generate benchmark plots for text generation models
WebApp1K Models Leaderboard
View and compare pass@k metrics for AI models
Mobile-MMLU-Challenge
Evaluate model predictions and update leaderboard
Github Repo To Spaces
Transfer GitHub repositories to Hugging Face Spaces
UnlearnDiffAtk Benchmark
Browse and filter AI model evaluation results
Bvid2acid
Parse bilibili bvid to aid / cid
Easy Analysis
Analyze and compare datasets, upload reports to Hugging Face
GGUF Parser Web
This project is a GUI for the gpustack/gguf-parser-go
Kaz LLM Leaderboard
Evaluate LLMs using Kazakh MC tasks
Nieman Lab 2025 Predictions Visualization
Mapping Nieman Lab's 2025 Journalism Predictions
Merve Data Report
Create detailed data reports
Characters Tag
Search for tagged characters in Animagine datasets
M-RewardBench
M-RewardBench Leaderboard
dataset-worldviews
Explore how datasets shape classifier biases
Autism
Analyze autism data and generate detailed reports