Persian Text Embedding Benchmark
Teach, test, evaluate language models with MTEB Arena
Benchmark AI models by comparison
Request model evaluation on COCO val 2017 dataset
Evaluate code generation with diverse feedback types
View NSQL Scores for Models
Quantize a model for faster inference
Browse and submit model evaluations in LLM benchmarks
Calculate GPU requirements for running LLMs
Open Persian LLM Leaderboard
View and compare language model evaluations
Display model benchmark results
Measure execution times of BERT models using WebGPU and WASM
The PTEB Leaderboard is a benchmarking platform designed to evaluate and compare the performance of Persian text embedding models. It provides a comprehensive framework for assessing how well these models handle Persian language tasks, making it an essential tool for researchers and developers in the NLP community. The leaderboard allows users to view and analyze the results of various models across different metrics and datasets.
• Comprehensive Benchmarking: Evaluates models on multiple Persian language tasks and datasets.
• Model Comparison: Enables side-by-side comparison of different embedding models.
• Customizable Metrics: Supports a variety of evaluation metrics tailored for Persian text.
• Interactive Visualizations: Presents results in easy-to-understand charts and graphs.
• Regular Updates: Maintains up-to-date results as new models are released.
What is the purpose of the PTEB Leaderboard?
The PTEB Leaderboard is designed to provide standardized benchmarks for Persian text embedding models, helping researchers and developers identify top-performing models for their specific use cases.
Can I add my own model to the leaderboard?
Yes, the PTEB Leaderboard allows submissions of new models. Visit the official documentation for guidelines on how to prepare and submit your model for evaluation.
How often are the benchmarks updated?
The benchmarks are updated regularly as new models are released and existing models are fine-tuned. Follow the leaderboard for the latest updates and improvements.