Embedding Leaderboard
Generate topics from text data with BERTopic
Retrieve news articles based on a query
Provide feedback on text content
Track, rank and evaluate open Arabic LLMs and chatbots
Check text for moderation flags
Experiment with and compare different tokenizers
Predict NCM codes from product descriptions
Predict song genres from lyrics
A benchmark for open-source multi-dialect Arabic ASR models
Compare LLMs by role stability
Explore BERT model interactions
Fake news detection using DistilBERT trained on the LIAR dataset
The MTEB (Massive Text Embedding Benchmark) Leaderboard evaluates and compares text embedding models across benchmarks, tasks, and languages. It provides a standardized evaluation framework, so researchers and developers can compare embedding models on equal terms and identify the one that best fits their use case.
What benchmarks are available on the MTEB Leaderboard?
The MTEB Leaderboard covers benchmarks for a range of text embedding tasks, including text classification, clustering, and information retrieval.
How do I interpret the scores on the leaderboard?
Each score is a task-specific performance metric, such as accuracy, F1-score, or Spearman correlation, depending on the benchmark. For these metrics, a higher score indicates better performance on that task.
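To make the Spearman correlation metric concrete, here is a minimal pure-Python sketch of how a similarity-style score could be computed: cosine similarity between paired embeddings, rank-correlated against human similarity ratings. The function names and the toy data are illustrative assumptions, not the leaderboard's actual implementation.

```python
import math
from typing import List, Sequence


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def ranks(values: Sequence[float]) -> List[float]:
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation of two equal-length, non-constant sequences."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def similarity_score(emb_a, emb_b, gold: Sequence[float]) -> float:
    """Rank-correlate model cosine similarities with gold ratings."""
    predicted = [cosine(a, b) for a, b in zip(emb_a, emb_b)]
    return spearman(predicted, gold)


# Toy data (made up): three sentence pairs with hypothetical 2-d embeddings
# and hypothetical human similarity ratings.
toy_a = [[1.0, 0.0], [1.0, 0.0], [1.0, 0.0]]
toy_b = [[1.0, 0.0], [1.0, 1.0], [0.0, 1.0]]
toy_gold = [5.0, 3.0, 0.5]
toy_score = similarity_score(toy_a, toy_b, toy_gold)
```

In this toy case the model orders the pairs exactly as the ratings do, so the correlation is at its maximum. A model that reversed the ordering would score at the minimum, which is why a higher value reads as better agreement with human judgment.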
Can I evaluate my custom model on the MTEB Leaderboard?
Yes. Generate embeddings with your model for the selected benchmarks and languages, then upload the results to the leaderboard for comparison.
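As a rough sketch of the evaluation step, the function below runs a model on selected tasks with the open-source `mteb` Python package and `sentence-transformers`. It assumes the API documented for the 1.x releases of `mteb` (`get_tasks`, `MTEB`, `run`); newer releases may differ, so check the package documentation for your installed version. The function name `evaluate_custom_model`, its default task, and the output folder are illustrative choices, and the upload step itself is not shown.

```python
from typing import List, Optional


def evaluate_custom_model(
    model_name: str,
    task_names: Optional[List[str]] = None,
    output_folder: str = "results",
):
    """Run a model on selected MTEB tasks and write result files to disk.

    Hypothetical helper, assuming the mteb 1.x API.
    """
    # Imported lazily so this module loads even if the packages are missing.
    import mteb
    from sentence_transformers import SentenceTransformer

    if task_names is None:
        # One small classification task as an illustrative default.
        task_names = ["Banking77Classification"]

    # Any model that sentence-transformers can load may be used here.
    model = SentenceTransformer(model_name)

    # Select the tasks by name and run the evaluation.
    tasks = mteb.get_tasks(tasks=task_names)
    evaluation = mteb.MTEB(tasks=tasks)
    return evaluation.run(model, output_folder=output_folder)
```

Calling `evaluate_custom_model("sentence-transformers/all-MiniLM-L6-v2")` would download the model and the task data, so it needs network access and can take a while. The result files written to `output_folder` are what you would then submit for comparison.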