Open Agent Leaderboard
Display a treemap of languages and datasets
Analyze your dataset with guided tools
Display CLIP benchmark results for inference performance
Analyze and visualize your dataset using AI
Generate benchmark plots for text generation models
View and compare pass@k metrics for AI models
Evaluate model predictions and update leaderboard
Transfer GitHub repositories to Hugging Face Spaces
Browse and filter AI model evaluation results
Parse bilibili bvid to aid / cid
Analyze and compare datasets, upload reports to Hugging Face
This project is a GUI for the gpustack/gguf-parser-go
Evaluate LLMs using Kazakh MC tasks
Mapping Nieman Lab's 2025 Journalism Predictions
Create detailed data reports
Search for tagged characters in Animagine datasets
M-RewardBench Leaderboard
Explore how datasets shape classifier biases
Analyze autism data and generate detailed reports