Speech Corpus Creation Tool
Dhravani is a speech corpus creation tool for recording voices and transcribing them into a structured format. It is built specifically for dataset creation and is aimed at researchers, developers, and anyone else working on AI and machine learning applications that require high-quality speech data.
What languages does Dhravani support?
Dhravani supports a wide range of languages, including English, Spanish, Mandarin, and Hindi. Refer to the official documentation for the complete list.
How secure is my data?
Dhravani prioritizes data security: recordings and transcriptions are stored with encryption and access controls.
What are the system requirements for running Dhravani?
Dhravani is compatible with Windows, macOS, and Linux. Ensure your system meets the minimum specifications for smooth performance.