Request model evaluation on COCO val 2017 dataset
The Open Object Detection Leaderboard is a benchmarking platform designed to evaluate and compare different object detection models. It provides a standardized framework for assessing model performance using the COCO (Common Objects in Context) val 2017 dataset. This leaderboard is a community-driven tool that allows researchers and developers to submit their model results and view how they stack up against others in the field.
What metrics are used for evaluation?
The leaderboard primarily uses the standard COCO metric: mean Average Precision (mAP), averaged over IoU thresholds from 0.50 to 0.95 in steps of 0.05 and over all object categories, computed on objects of all sizes.
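To make the metric more concrete, here is a minimal Python sketch of the intersection-over-union (IoU) test that COCO-style mAP is built on. COCO averages AP over ten IoU thresholds (0.50, 0.55, ..., 0.95), so a tighter box counts as a correct detection at more of them. The function names `box_iou` and `thresholds_matched` are illustrative, not part of the leaderboard or any library, and the sketch omits the precision-recall step of the full evaluation.

```python
def box_iou(a, b):
    """IoU of two boxes given in COCO [x, y, width, height] format."""
    ax1, ay1, ax2, ay2 = a[0], a[1], a[0] + a[2], a[1] + a[3]
    bx1, by1, bx2, by2 = b[0], b[1], b[0] + b[2], b[1] + b[3]
    # Width and height of the overlap region (zero if the boxes are disjoint).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = a[2] * a[3] + b[2] * b[3] - inter
    return inter / union if union > 0 else 0.0


# The ten IoU thresholds that COCO-style AP is averaged over.
IOU_THRESHOLDS = [round(0.50 + 0.05 * i, 2) for i in range(10)]


def thresholds_matched(pred_box, gt_box):
    """Thresholds at which pred_box overlaps gt_box enough to count as a match."""
    iou = box_iou(pred_box, gt_box)
    return [t for t in IOU_THRESHOLDS if iou >= t]
```

For example, a prediction shifted by 2 pixels against a 10x10 ground-truth box has an IoU of about 0.67, so it matches at the four loosest thresholds only.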
How can I submit my model results?
To submit your model, evaluate it on the COCO val 2017 dataset and follow the submission guidelines provided on the leaderboard's website.
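The leaderboard's exact submission format is not described here, so the following is only a sketch under one assumption: that results are expressed in the standard COCO detection results format, a JSON list of records with `image_id`, `category_id`, `bbox` as `[x, y, width, height]`, and `score`. The input layout (a `box_xyxy` corner-format key) and the helper names `to_coco_results` and `save_results` are hypothetical.

```python
import json


def to_coco_results(detections):
    """Convert detections with corner-format boxes to COCO results records.

    Each input item is assumed (hypothetically) to be a dict with keys
    image_id, category_id, box_xyxy = [x1, y1, x2, y2], and score.
    """
    results = []
    for det in detections:
        x1, y1, x2, y2 = det["box_xyxy"]
        results.append({
            "image_id": int(det["image_id"]),
            "category_id": int(det["category_id"]),
            # COCO expects [x, y, width, height], not corner coordinates.
            "bbox": [float(x1), float(y1), float(x2 - x1), float(y2 - y1)],
            "score": float(det["score"]),
        })
    return results


def save_results(detections, path):
    """Write the converted detections to a JSON results file."""
    with open(path, "w") as f:
        json.dump(to_coco_results(detections), f)
```

A file written this way can be loaded with pycocotools via `COCO.loadRes` and scored with `COCOeval` against the val 2017 annotations; whether the leaderboard accepts such a file directly should be checked against its own guidelines.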
Can I update my model's entry after submission?
Yes, you can update your model's entry by resubmitting the results. The leaderboard will reflect the latest submission for your model.