Browse and submit LLM evaluations
Track, rank and evaluate open LLMs and chatbots
Evaluate model predictions with TruLens
Explore and benchmark visual document retrieval models
Optimize and train foundation models using IBM's FMS
Upload ML model to Hugging Face Hub
Explain GPU usage for model training
Predict customer churn based on input details
Teach, test, evaluate language models with MTEB Arena
Leaderboard of information retrieval models in French
Browse and submit evaluations for CaselawQA benchmarks
Compare LLM performance across benchmarks
Display genomic embedding leaderboard
The Open Medical-LLM Leaderboard is a comprehensive platform designed for benchmarking and comparing large language models (LLMs) specifically tailored for medical and healthcare applications. It provides a centralized hub where users can browse, evaluate, and submit their own model evaluations, fostering transparency and collaboration in the development of AI for medical use cases.
What types of medical applications are supported?
The Open Medical-LLM Leaderboard supports a wide range of medical applications, including clinical text analysis, medical question answering, and healthcare document summarization.
How do I submit my own model evaluation?
To submit your model evaluation, follow these steps:
Is the leaderboard open to non-experts?
Yes, the leaderboard is designed to be accessible to both experts and non-experts. Researchers, developers, and healthcare professionals can all benefit from the platform's resources and tools.