Request model evaluation on COCO val 2017 dataset
Evaluate reward models for math reasoning
View LLM Performance Leaderboard
Upload a machine learning model to Hugging Face Hub
Browse and submit evaluations for CaselawQA benchmarks
Compare and rank LLMs using benchmark scores
Benchmark LLMs in accuracy and translation across languages
Quantize a model for faster inference
Evaluate AI-generated results for accuracy
Launch web-based model application
Download a TriplaneGaussian model checkpoint
Upload ML model to Hugging Face Hub
Submit deepfake detection models for evaluation
The Open Object Detection Leaderboard is a benchmarking platform designed to evaluate and compare different object detection models. It provides a standardized framework for assessing model performance using the COCO (Common Objects in Context) val 2017 dataset. This leaderboard is a community-driven tool that allows researchers and developers to submit their model results and view how they stack up against others in the field.
What metrics are used for evaluation?
The leaderboard primarily uses the standard COCO metric: mean Average Precision (mAP), averaged over IoU thresholds from 0.50 to 0.95 and over all object categories and sizes.
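For reference, here is a minimal sketch of how this metric is commonly computed with the pycocotools library. The file paths are placeholders and not part of the leaderboard's own tooling.

```python
# Minimal sketch: computing the COCO bbox mAP with pycocotools.
# Paths are placeholders; adapt them to your setup.
from pycocotools.coco import COCO
from pycocotools.cocoeval import COCOeval

# Ground-truth annotations for COCO val 2017
coco_gt = COCO("annotations/instances_val2017.json")

# Model detections in the COCO results format (see the submission sketch below)
coco_dt = coco_gt.loadRes("predictions.json")

# Evaluate bounding-box detections
coco_eval = COCOeval(coco_gt, coco_dt, iouType="bbox")
coco_eval.evaluate()
coco_eval.accumulate()
coco_eval.summarize()  # prints AP@[0.50:0.95], AP50, AP75, and size-specific APs

map_all = coco_eval.stats[0]  # primary COCO metric: mAP averaged over IoU 0.50:0.95
print(f"mAP: {map_all:.3f}")
```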
How can I submit my model results?
To submit your model, evaluate it on the COCO val 2017 dataset and follow the submission guidelines provided on the leaderboard's website.
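Evaluation tools such as pycocotools expect detections in the standard COCO results format, one JSON record per detection. The sketch below illustrates that format; `iterate_val2017_images` and `run_model` are hypothetical stand-ins for your own data loading and inference code.

```python
# Minimal sketch: exporting detections to the COCO results format for val 2017.
# `iterate_val2017_images` and `run_model` are hypothetical placeholders.
import json

results = []
for image_id, image in iterate_val2017_images():
    for box, score, category_id in run_model(image):
        x_min, y_min, x_max, y_max = box
        results.append({
            "image_id": image_id,        # COCO image id from val 2017
            "category_id": category_id,  # COCO category id
            "bbox": [x_min, y_min, x_max - x_min, y_max - y_min],  # [x, y, width, height]
            "score": float(score),
        })

with open("predictions.json", "w") as f:
    json.dump(results, f)
```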
Can I update my model's entry after submission?
Yes, you can update your model's entry by resubmitting the results. The leaderboard will reflect the latest submission for your model.