Speech Corpus Creation Tool
Train a model using custom data
Colabora para conseguir un Carnaval de Cádiz más accesible
Create datasets with FAQs and SFT prompts
Data annotation for Sparky
Browse and extract data from Hugging Face datasets
Manage and label datasets for your projects
Speech Corpus Creation Tool
List of French datasets not referenced on the Hub
Upload files to a Hugging Face repository
Convert PDFs to a dataset and upload to Hugging Face
Access NLPre-PL dataset and pre-trained models
Explore and edit JSON datasets
Dhravani is a Speech Corpus Creation Tool designed for creating high-quality speech datasets. It allows users to record voices and transcribe them efficiently, making it an essential tool for dataset creation in various applications like speech recognition, voice assistants, and language research.
• Multi-Language Support: Record and transcribe speech in multiple languages, catering to diverse linguistic needs. • AI-Powered Transcription: Utilizes advanced AI algorithms for accurate and rapid transcription of recorded audio. • Collaborative Workspace: Enables team collaboration for efficient dataset creation and management. • Audio Quality Control: Includes tools to analyze and enhance audio quality for optimal dataset performance. • Customizable Metadata: Allows users to add and manage metadata for better organization and search functionality.
What languages does Dhravani support?
Dhravani supports a wide range of languages, including popular ones like English, Spanish, Mandarin, and many others. Check the app for the full list of supported languages.
Do I need an internet connection to use Dhravani?
Yes, an internet connection is required for AI transcription and feature updates, but audio recording can be done offline.
What formats can I export my dataset in?
Dhravani allows you to export your speech corpus in common formats such as WAV, MP3, and XML, depending on your project requirements.