Nucleotide Transformer Benchmark
Generate leaderboard comparing DNA models
DGEB
Display genomic embedding leaderboard
InspectorRAGet
Evaluate RAG systems with visual analytics
CaselawQA leaderboard (WIP)
Browse and submit evaluations for CaselawQA benchmarks
Robotics Model Playground
Benchmark AI models by comparison
Guerra LLM AI Leaderboard
Compare and rank LLMs using benchmark scores
ARCH
Compare audio representation models using benchmark results
Converter
Convert and upload model files for Stable Diffusion
PaddleOCRModelConverter
Convert PaddleOCR models to ONNX format
Deepfake Detection Arena Leaderboard
Submit deepfake detection models for evaluation
OpenVINO Benchmark
Benchmark models using PyTorch and OpenVINO
OR-Bench Leaderboard
Measure over-refusal in LLMs using OR-Bench
Llm Memory Requirement
Calculate memory usage for LLM models
stm32 model zoo app
Explore and manage STM32 ML models with the STM32AI Model Zoo dashboard
Building And Deploying A Machine Learning Models Using Gradio Application
Predict customer churn based on input details
Zenml Server
Create and manage ML pipelines with ZenML Dashboard
EdgeTA
Retrain models for new data at edge devices
ExplaiNER
Analyze model errors with interactive pages
Trulens
Evaluate model predictions with TruLens
Can You Run It? LLM version
Calculate GPU requirements for running LLMs