Model Benchmarking | Free AI Tools by Category

🚀

Can You Run It? LLM version

Determine GPU requirements for large language models

950

🚀

Model Memory Utility

Calculate memory needed to train AI models

922

🦾

GAIA Leaderboard

Submit models for evaluation and view leaderboard

360

🐨

LLM Performance Leaderboard

View LLM Performance Leaderboard

296

🔀

mergekit-gui

Merge machine learning models using a YAML configuration file

271

🏆

Low-bit Quantized Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

166

🏆

Open Object Detection Leaderboard

Request model evaluation on COCO val 2017 dataset

158

🥇

Vidore Leaderboard

Explore and benchmark visual document retrieval models

124

⚡

Modelcard Creator

Create and upload a Hugging Face model card

110

⚔

MTEB Arena

Teach, test, evaluate language models with MTEB Arena

103

🌍

European Leaderboard

Benchmark LLMs in accuracy and translation across languages

🐠

Nexus Function Calling Leaderboard

Visualize model performance on function calling tasks

🥇

LLM Safety Leaderboard

View and submit machine learning model evaluations

🏆

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

🌸

La Leaderboard

Evaluate open LLMs in the languages of LATAM and Spain.

🎨

SD To Diffusers

Convert Stable Diffusion checkpoint to Diffusers and open a PR

🏎

Export to ONNX

Export Hugging Face models to ONNX

🥇

GIFT Eval

GIFT-Eval: A Benchmark for General Time Series Forecasting

🏅

Open Persian LLM Leaderboard

🐠

WebGPU Embedding Benchmark

Measure execution times of BERT models using WebGPU and WASM

…