Pergel: A Unified Benchmark for Evaluating Turkish LLMs
Rank machines based on LLaMA 7B v2 benchmark results
Generate leaderboard comparing DNA models
Convert Stable Diffusion checkpoint to Diffusers and open a PR
Display and filter leaderboard models
Evaluate LLM over-refusal rates with OR-Bench
Compare code model performance on benchmarks
Download a TriplaneGaussian model checkpoint
Convert a Stable Diffusion XL checkpoint to Diffusers and open a PR
Compare and rank LLMs using benchmark scores
Evaluate model predictions with TruLens
Multilingual Text Embedding Model Pruner
Browse and filter ML model leaderboard data
Cetvel is a benchmarking tool designed to evaluate the performance of Turkish Large Language Models (LLMs). It provides a comprehensive framework for assessing model capabilities across various natural language processing tasks. Cetvel automates the evaluation process, enabling users to compare and analyze the performance of different models efficiently.
• Task Coverage: Evaluate models on a wide range of Turkish NLP tasks, including text classification, summarization, and question answering.
• Customizable Benchmarks: Tailor evaluation metrics and tasks to specific use cases.
• Detailed Performance Reports: Generate in-depth analysis of model strengths and weaknesses.
• Cross-Model Comparison: Compare multiple models side-by-side to identify the best performer for your needs.
• Easy Integration: Seamlessly integrate with popular Turkish LLMs for quick and accurate benchmarking.
What models does Cetvel support?
Cetvel supports a wide range of Turkish LLMs, including popular models like BERTurk, TTUM, and others. For the full list of supported models, refer to the official documentation.
How do I customize the benchmarking tasks?
Customization options are available through the Cetvel interface, where you can select specific tasks, datasets, and evaluation metrics to tailor the benchmarking process to your needs.
Where can I find Cetvel?
Cetvel is available for download on its official GitHub repository. Ensure you follow the installation instructions carefully to set up the tool correctly.