Browse and submit LLM evaluations
Retrain models for new data at edge devices
Measure BERT model performance using WASM and WebGPU
Export Hugging Face models to ONNX
Predict customer churn based on input details
Evaluate model predictions with TruLens
Multilingual Text Embedding Model Pruner
Rank machines based on LLaMA 7B v2 benchmark results
Create and manage ML pipelines with ZenML Dashboard
Track, rank and evaluate open LLMs and chatbots
Evaluate code generation with diverse feedback types
Open Persian LLM Leaderboard
Compare LLM performance across benchmarks
The Open Medical-LLM Leaderboard is a platform for benchmarking and comparing large language models (LLMs) tailored for medical and healthcare applications. It provides a central hub where users can browse results, evaluate models, and submit their own model evaluations, which supports transparency and collaboration in developing AI for medical use cases.
What types of medical applications are supported?
The Open Medical-LLM Leaderboard supports a wide range of medical applications, including clinical text analysis, medical question answering, and healthcare document summarization.
How do I submit my own model evaluation?
To submit your model evaluation, follow these steps:
Is the leaderboard open to non-experts?
Yes, the leaderboard is designed to be accessible to both experts and non-experts. Researchers, developers, and healthcare professionals can all benefit from the platform's resources and tools.