Can You Run It? LLM version
Determine GPU requirements for large language models
Model Memory Utility
Calculate memory needed to train AI models
GAIA Leaderboard
Submit models for evaluation and view leaderboard
LLM Performance Leaderboard
View LLM Performance Leaderboard
mergekit-gui
Merge machine learning models using a YAML configuration file
Low-bit Quantized Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
Open Object Detection Leaderboard
Request model evaluation on COCO val 2017 dataset
Vidore Leaderboard
Explore and benchmark visual document retrieval models
Modelcard Creator
Create and upload a Hugging Face model card
MTEB Arena
Teach, test, evaluate language models with MTEB Arena
European Leaderboard
Benchmark LLMs in accuracy and translation across languages
Nexus Function Calling Leaderboard
Visualize model performance on function calling tasks
LLM Safety Leaderboard
View and submit machine learning model evaluations
Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
La Leaderboard
Evaluate open LLMs in the languages of LATAM and Spain.
SD To Diffusers
Convert Stable Diffusion checkpoint to Diffusers and open a PR
Export to ONNX
Export Hugging Face models to ONNX
GIFT Eval
GIFT-Eval: A Benchmark for General Time Series Forecasting
Open Persian LLM Leaderboard
Open Persian LLM Leaderboard
WebGPU Embedding Benchmark
Measure execution times of BERT models using WebGPU and WASM