Request model evaluation on COCO val 2017 dataset
Merge machine learning models using a YAML configuration file
Download a TriplaneGaussian model checkpoint
Browse and submit evaluations for CaselawQA benchmarks
Benchmark AI models by comparison
Find and download models from Hugging Face
Evaluate reward models for math reasoning
Launch web-based model application
View LLM Performance Leaderboard
Display model benchmark results
Measure over-refusal in LLMs using OR-Bench
Create and upload a Hugging Face model card
Calculate survival probability based on passenger details
The Open Object Detection Leaderboard is a benchmarking platform designed to evaluate and compare different object detection models. It provides a standardized framework for assessing model performance using the COCO (Common Objects in Context) val 2017 dataset. This leaderboard is a community-driven tool that allows researchers and developers to submit their model results and view how they stack up against others in the field.
What metrics are used for evaluation?
The leaderboard primarily uses the standard COCO metric: mean Average Precision (mAP), averaged over all categories and over IoU thresholds from 0.50 to 0.95, computed across all object sizes.
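For reference, this score can be reproduced locally with pycocotools once your model's detections have been exported. The sketch below is a minimal example, assuming the official val 2017 annotation file and a detections.json file in the standard COCO results format (see the next question); paths are placeholders.

```python
# Minimal sketch: computing the COCO mAP locally with pycocotools.
# Assumes annotations/instances_val2017.json (official annotations) and
# detections.json (model predictions in the standard COCO results format).
from pycocotools.coco import COCO
from pycocotools.cocoeval import COCOeval

coco_gt = COCO("annotations/instances_val2017.json")  # ground-truth annotations
coco_dt = coco_gt.loadRes("detections.json")          # model detections

evaluator = COCOeval(coco_gt, coco_dt, iouType="bbox")
evaluator.evaluate()
evaluator.accumulate()
evaluator.summarize()

# stats[0] is the primary COCO metric: AP averaged over IoU 0.50:0.95,
# all categories, all object sizes, up to 100 detections per image.
print(f"mAP (IoU=0.50:0.95): {evaluator.stats[0]:.3f}")
```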
How can I submit my model results?
To submit your model, evaluate it on the COCO val 2017 dataset and follow the submission guidelines provided on the leaderboard's website.
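As a rough illustration only (the exact submission payload is defined by the leaderboard's own guidelines), predictions are typically exported in the standard COCO detection results format before scoring. The values below are hypothetical.

```python
import json

# Hypothetical example of the standard COCO detection results format:
# one record per predicted box, with bbox given as [x, y, width, height] in pixels.
detections = [
    {
        "image_id": 139,                       # id of a COCO val 2017 image
        "category_id": 1,                      # COCO category id (1 = person)
        "bbox": [412.8, 157.6, 53.1, 138.0],   # [x, y, width, height]
        "score": 0.93,                         # model confidence
    },
    # ... one entry per detection, covering every image in val 2017
]

with open("detections.json", "w") as f:
    json.dump(detections, f)
```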
Can I update my model's entry after submission?
Yes, you can update your model's entry by resubmitting the results. The leaderboard will reflect the latest submission for your model.