Request evaluation of a speech recognition model
MP-SENet is a speech enhancement model.
Generate audio from text with adjustable speed
IndicParler_TTS for Urdu_Punjabi & Sindhi
Convertir texto a audio
Transcribe or translate audio and YouTube videos
Transcribe Persian audio files into text
Generate audio from text
Spanish finetune for the original F5 model.
Cloning Voice tokoh Indonesia - Bahasa Indonesia
Belarusian TTS
Generate audio from text input
Open ASR Leaderboard is a platform designed to evaluate and benchmark speech recognition models. It provides a centralized location for developers and researchers to assess the performance of their automatic speech recognition (ASR) systems against established standards and compare them with other models.
• Comprehensive evaluation metrics: The leaderboard provides detailed performance metrics, including word error rate (WER), character error rate (CER), and real-time factor (RTF).
• Multi-language support: It supports evaluation across multiple languages and accents, making it a versatile tool for diverse datasets.
• Benchmark datasets: Access to standardized test datasets for consistent and fair model comparison.
• Customizable evaluation: Users can define specific test scenarios or use predefined configurations.
• Visualization tools: Results are presented in interactive charts and tables for easy analysis.
• Community collaboration: A forum for sharing insights, best practices, and model improvements.
What types of speech recognition models can I evaluate?
You can evaluate any automatic speech recognition model, including deep learning-based models, traditional HMM-based systems, or hybrid approaches.
How often are the leaderboards updated?
The leaderboards are updated regularly as new models are submitted and evaluated. Updates are typically announced in the community forum.
Can I use custom datasets for evaluation?
Yes, you can upload custom test datasets for evaluation, provided they meet the platform's formatting requirements.