Open Persian LLM Leaderboard
Browse and submit model evaluations in LLM benchmarks
Measure over-refusal in LLMs using OR-Bench
Submit deepfake detection models for evaluation
View and submit machine learning model evaluations
Explain GPU usage for model training
Evaluate and submit AI model results for Frugal AI Challenge
View and submit LLM benchmark evaluations
Create demo spaces for models on Hugging Face
Evaluate reward models for math reasoning
Browse and submit evaluations for CaselawQA benchmarks
Display benchmark results
Explore GenAI model efficiency on ML.ENERGY leaderboard
The Open Persian LLM Leaderboard is a comprehensive platform designed to benchmark and evaluate Persian language models. It provides a detailed comparison of various models based on their performance on diverse Persian language tasks. The leaderboard aims to promote transparency and advance research in Persian natural language processing by offering standardized metrics and rankings.
• Model Performance Tracking: Compare the performance of different Persian language models across various tasks.
• Task-Specific Benchmarks: Evaluate models on text classification, machine translation, summarization, and more.
• Standardized Metrics: Access clear and consistent evaluation metrics for fair comparison.
• Community Contributions: Submit your own models or datasets to the leaderboard.
• Regular Updates: Stay informed with the latest developments in Persian NLP through frequent leaderboard updates.
What models are included in the Open Persian LLM Leaderboard?
The leaderboard includes a variety of Persian language models, ranging from small-scale models to state-of-the-art architectures. It also features community-submitted models.
How often is the leaderboard updated?
The leaderboard is updated regularly to reflect new models, datasets, and advancements in Persian NLP. Users are encouraged to check back frequently for the latest rankings.
Can I submit my own model to the leaderboard?
Yes, the Open Persian LLM Leaderboard is open to community contributions. Visit the platform's documentation to learn about submission guidelines and requirements.