Embedding Leaderboard
Encode and decode Hindi text using BPE
Predict NCM codes from product descriptions
Detect emotions in text sentences
Retrieve news articles based on a query
Track, rank and evaluate open Arabic LLMs and chatbots
Detect harms and risks with Granite Guardian 3.1 8B
Search for similar AI-generated patent abstracts
Explore and interact with HuggingFace LLM APIs using Swagger UI
Generate answers by querying text in uploaded documents
Classify patent abstracts into subsectors
A benchmark for open-source multi-dialect Arabic ASR models
Demo emotion detection
The MTEB Leaderboard is a comprehensive platform designed for evaluating and comparing text embeddings across various models, benchmarks, and languages. It provides a standardized framework for assessing the performance of different embedding techniques, enabling researchers and developers to identify the most effective solutions for their specific use cases.
What benchmarks are available on the MTEB Leaderboard?
The MTEB Leaderboard supports a wide range of benchmarks tailored for specific tasks in text analysis, including but not limited to text classification, clustering, and information retrieval.
How do I interpret the scores on the leaderboard?
Scores are typically represented as performance metrics (e.g., accuracy, F1-score, or Spearman correlation) depending on the benchmark. Higher scores generally indicate better performance for the specific task.
Can I evaluate my custom model on the MTEB Leaderboard?
Yes, you can evaluate custom models by generating embeddings for the selected benchmarks and languages, and then uploading the results to the leaderboard for comparison.