Embedding Leaderboard
The MTEB (Massive Text Embedding Benchmark) Leaderboard is a platform for evaluating and comparing text embedding models across a wide range of benchmarks and languages. It provides a standardized evaluation framework, enabling researchers and developers to identify the embedding models best suited to their specific use cases.
What benchmarks are available on the MTEB Leaderboard?
The MTEB Leaderboard covers benchmarks spanning eight task categories: classification, clustering, pair classification, reranking, retrieval, semantic textual similarity (STS), summarization, and bitext mining.
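If you want to browse the available benchmarks programmatically, the mteb Python package exposes a task registry. The snippet below is a minimal sketch; the exact API (get_tasks and its filter arguments) reflects recent versions of the package and may differ in older releases.

```python
# Minimal sketch: listing MTEB benchmarks with the `mteb` package
# (pip install mteb). The filter arguments shown here follow recent
# versions of the package and may vary across releases.
import mteb

# Restrict the registry to English classification tasks as an example.
tasks = mteb.get_tasks(task_types=["Classification"], languages=["eng"])
for task in tasks:
    print(task.metadata.name)
```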
How do I interpret the scores on the leaderboard?
Scores are reported as task-appropriate metrics, depending on the benchmark: for example, accuracy or F1-score for classification, nDCG@10 for retrieval, and Spearman correlation between embedding similarities and human judgments for STS. Higher scores indicate better performance on that task.
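To make the STS metric concrete, the toy example below computes a Spearman correlation between cosine similarities of sentence-pair embeddings and human similarity ratings. The embeddings and gold scores here are synthetic placeholders, not real benchmark data.

```python
# Illustration of an STS-style score: Spearman correlation between
# cosine similarities of sentence-pair embeddings and human ratings.
# All values below are toy data for demonstration only.
import numpy as np
from scipy.stats import spearmanr

rng = np.random.default_rng(0)
emb_a = rng.normal(size=(5, 8))   # embeddings for the first sentence of each pair
emb_b = rng.normal(size=(5, 8))   # embeddings for the second sentence of each pair
gold = np.array([4.2, 1.0, 3.5, 0.5, 2.8])  # human similarity ratings

# Cosine similarity for each sentence pair.
cos = np.sum(emb_a * emb_b, axis=1) / (
    np.linalg.norm(emb_a, axis=1) * np.linalg.norm(emb_b, axis=1)
)
score, _ = spearmanr(cos, gold)
print(f"Spearman correlation: {score:.3f}")
```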
Can I evaluate my custom model on the MTEB Leaderboard?
Yes, you can evaluate custom models by generating embeddings for the selected benchmarks and languages, and then uploading the results to the leaderboard for comparison.
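A typical workflow uses the mteb package to run the evaluation locally and write out a results folder, which can then be submitted following the leaderboard's current instructions. The sketch below follows the usage pattern in the MTEB documentation; the model name and task are placeholders, and the API may differ across package versions.

```python
# Hedged sketch: evaluating a model on an MTEB task with the `mteb`
# package. Model name and task are illustrative placeholders; the
# resulting output folder is what gets submitted to the leaderboard.
import mteb
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")
tasks = mteb.get_tasks(tasks=["Banking77Classification"])
evaluation = mteb.MTEB(tasks=tasks)
results = evaluation.run(model, output_folder="results/all-MiniLM-L6-v2")
```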