Speech Corpus Creation Tool
Support by Parquet, CSV, Jsonl, XLS
Browse and search datasets
Validate JSONL format for fine-tuning
Explore and edit JSON datasets
Organize and process datasets using AI
sign in to receive news on the iPhone app
Generate synthetic datasets for AI training
Explore datasets on a Nomic Atlas map
Browse and extract data from Hugging Face datasets
Display instructional dataset
Convert PDFs to a dataset and upload to Hugging Face
Dhravani is a Speech Corpus Creation Tool designed for creating high-quality speech datasets. It allows users to record voices and transcribe them efficiently, making it an essential tool for dataset creation in various applications like speech recognition, voice assistants, and language research.
• Multi-Language Support: Record and transcribe speech in multiple languages, catering to diverse linguistic needs. • AI-Powered Transcription: Utilizes advanced AI algorithms for accurate and rapid transcription of recorded audio. • Collaborative Workspace: Enables team collaboration for efficient dataset creation and management. • Audio Quality Control: Includes tools to analyze and enhance audio quality for optimal dataset performance. • Customizable Metadata: Allows users to add and manage metadata for better organization and search functionality.
What languages does Dhravani support?
Dhravani supports a wide range of languages, including popular ones like English, Spanish, Mandarin, and many others. Check the app for the full list of supported languages.
Do I need an internet connection to use Dhravani?
Yes, an internet connection is required for AI transcription and feature updates, but audio recording can be done offline.
What formats can I export my dataset in?
Dhravani allows you to export your speech corpus in common formats such as WAV, MP3, and XML, depending on your project requirements.