Evaluate Persian LLMs on various tasks
Create and manage ML pipelines with ZenML Dashboard
Track, rank and evaluate open LLMs and chatbots
Launch web-based model application
Evaluate and submit AI model results for Frugal AI Challenge
Measure execution times of BERT models using WebGPU and WASM
Compare code model performance on benchmarks
Convert and upload model files for Stable Diffusion
Submit models for evaluation and view leaderboard
Persian Text Embedding Benchmark
View NSQL Scores for Models
Calculate VRAM requirements for LLM models
Evaluate model predictions with TruLens
The Persian LLM Leaderboard is a comprehensive platform designed to evaluate and compare Persian language models across various tasks. It provides a centralized hub for researchers and developers to assess the performance of different models in the Persian language, fostering innovation and transparency in the field of natural language processing.
• Model Comparison: Evaluate and compare the performance of multiple Persian LLMs on different tasks.
• Task-Specific Benchmarks: Assess models on a variety of tasks tailored to the Persian language, such as text classification, summarization, and translation.
• Detailed Metrics: Access detailed performance metrics to understand model strengths and weaknesses.
• Visualizations: Interactive charts and graphs to visualize model performance and trends over time.
• Regular Updates: Stay informed with the latest developments and updates in Persian LLMs.
• Community-Driven: Submit your own models or results to contribute to the leaderboard.
What models are included on the leaderboard?
The leaderboard features a variety of Persian language models, including both state-of-the-art and emerging models from researchers and developers.
How are models evaluated?
Models are evaluated based on their performance on a range of tasks specific to the Persian language, using standard benchmarks and metrics.
Can I submit my own model?
Yes, you can submit your Persian LLM for evaluation. Follow the submission guidelines provided on the platform to include your model on the leaderboard.