Transcribe audio with emotions and events
"Designed for all users, including those with disabilities."
Explore and analyze audio data with AudioBench Leaderboard
Ebook2audiobook docker space beta
Generate audio from text or modify voice pitch
Generate speech using a speaker's voice
Convert text to speech with voice customization
Generate audio from text for anime characters
Moonshine ASR models running on-device, in your web browser.
Generate speech from text or files
Accessibility PDF & pasted text to speech converter w/ gTTs
Kokoro is an open-weight TTS model with 82 million parameters.
Generate audio from text with adjustable speed
SenseVoice is an advanced speech synthesis and transcription tool designed to analyze audio data with remarkable accuracy. It specializes in identifying and transcribing emotions, events, and key points within audio content, making it a powerful solution for understanding spoken data at a deeper level.
What languages does SenseVoice support?
SenseVoice supports multiple languages, including English, Spanish, French, Mandarin, and several others, making it accessible to a wide range of users.
Can I use SenseVoice for real-time transcription?
Yes, SenseVoice offers real-time transcription capabilities, allowing users to transcribe audio as it is being spoken.
Is SenseVoice free to use?
SenseVoice offers a free trial version with basic features. For advanced capabilities, users may need to subscribe to a paid plan.