Speech Corpus Creation Tool
Display instructional dataset
Create a domain-specific dataset project
Organize and process datasets using AI
Generate dataset for machine learning
Explore and edit JSON datasets
Browse and search datasets
Create a large, deduplicated dataset for LLM pre-training
Annotation Tool
Manage and label data for machine learning projects
Upload files to a Hugging Face repository
Manage and orchestrate AI workflows and datasets
Explore datasets on a Nomic Atlas map
Dhravani is a speech corpus creation tool that helps users record voices and transcribe them into a structured format. It is built for dataset creation and is aimed at researchers, developers, and anyone working on AI and machine learning applications that require high-quality speech data.
What languages does Dhravani support?
Dhravani supports a wide range of languages, including English, Spanish, Mandarin, and Hindi. Refer to the official documentation for the complete list.
How secure is my data?
Dhravani prioritizes data security. All recordings and transcriptions are stored with encryption and protected by access controls.
What are the system requirements for running Dhravani?
Dhravani is compatible with Windows, macOS, and Linux. Ensure your system meets the minimum specifications for smooth performance.