Model Benchmarking | Free AI Tools by Category

🏆

Nucleotide Transformer Benchmark

Generate leaderboard comparing DNA models

🚀

DGEB

Display genomic embedding leaderboard

🧐

InspectorRAGet

Evaluate RAG systems with visual analytics

🏛

CaselawQA leaderboard (WIP)

Browse and submit evaluations for CaselawQA benchmarks

🐨

Robotics Model Playground

Benchmark AI models by comparison

🧠

Guerra LLM AI Leaderboard

Compare and rank LLMs using benchmark scores

📊

ARCH

Compare audio representation models using benchmark results

♻

Converter

Convert and upload model files for Stable Diffusion

🐠

PaddleOCRModelConverter

Convert PaddleOCR models to ONNX format

🥇

Deepfake Detection Arena Leaderboard

Submit deepfake detection models for evaluation

🏋

OpenVINO Benchmark

Benchmark models using PyTorch and OpenVINO

🏆

OR-Bench Leaderboard

Measure over-refusal in LLMs using OR-Bench

📊

Llm Memory Requirement

Calculate memory usage for LLM models

🚀

stm32 model zoo app

Explore and manage STM32 ML models with the STM32AI Model Zoo dashboard

📈

Building And Deploying A Machine Learning Models Using Gradio Application

Predict customer churn based on input details

🧘

Zenml Server

Create and manage ML pipelines with ZenML Dashboard

🚀

EdgeTA

Retrain models for new data at edge devices

🏷

ExplaiNER

Analyze model errors with interactive pages

🏢

Trulens

Evaluate model predictions with TruLens

🚀

Can You Run It? LLM version

Calculate GPU requirements for running LLMs