Persian Text Embedding Benchmark
View LLM Performance Leaderboard
View RL Benchmark Reports
Convert PyTorch models to waifu2x-ios format
Browse and filter ML model leaderboard data
Retrain models for new data at edge devices
Compare code model performance on benchmarks
Evaluate LLM over-refusal rates with OR-Bench
Rank machines based on LLaMA 7B v2 benchmark results
Evaluate model predictions with TruLens
View and submit LLM benchmark evaluations
Benchmark LLMs in accuracy and translation across languages
Evaluate and submit AI model results for Frugal AI Challenge
The PTEB Leaderboard is a benchmarking platform designed to evaluate and compare the performance of Persian text embedding models. It provides a comprehensive framework for assessing how well these models handle Persian language tasks, making it an essential tool for researchers and developers in the NLP community. The leaderboard allows users to view and analyze the results of various models across different metrics and datasets.
• Comprehensive Benchmarking: Evaluates models on multiple Persian language tasks and datasets.
• Model Comparison: Enables side-by-side comparison of different embedding models.
• Customizable Metrics: Supports a variety of evaluation metrics tailored for Persian text.
• Interactive Visualizations: Presents results in easy-to-understand charts and graphs.
• Regular Updates: Maintains up-to-date results as new models are released.
What is the purpose of the PTEB Leaderboard?
The PTEB Leaderboard is designed to provide standardized benchmarks for Persian text embedding models, helping researchers and developers identify top-performing models for their specific use cases.
Can I add my own model to the leaderboard?
Yes, the PTEB Leaderboard allows submissions of new models. Visit the official documentation for guidelines on how to prepare and submit your model for evaluation.
How often are the benchmarks updated?
The benchmarks are updated regularly as new models are released and existing models are fine-tuned. Follow the leaderboard for the latest updates and improvements.