Generate a leaderboard comparing DNA models
Display genomic embedding leaderboard
Evaluate RAG systems with visual analytics
Browse and submit evaluations for CaselawQA benchmarks
Benchmark AI models by comparison
Compare and rank LLMs using benchmark scores
Compare audio representation models using benchmark results
Convert and upload model files for Stable Diffusion
Convert PaddleOCR models to ONNX format
Submit deepfake detection models for evaluation
Benchmark models using PyTorch and OpenVINO
Measure over-refusal in LLMs using OR-Bench
Calculate memory usage for LLMs
Explore and manage STM32 ML models with the STM32AI Model Zoo dashboard
Predict customer churn based on input details
Create and manage ML pipelines with ZenML Dashboard
Retrain models on new data at edge devices
Analyze model errors with interactive pages
Evaluate model predictions with TruLens
Calculate GPU requirements for running LLMs