Simple Space for the Kokoro Model
✨[With v1.0.0] Accelerated TTS on Kokoro-82M
Transcribe or translate audio files
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Generate speech from text with adjustable speed
Generate edited English speech from audio and text
Sound effect from description
Request evaluation of a speech recognition model
Convert text to speech with different voices
Generate Vietnamese speech from text and reference audio
Generate audiobooks giving each character a unique voice
I built an AI speech synthesis model of Hestia.
Kokoro is a speech synthesis tool that converts text into natural-sounding speech. It provides a simple interface for generating audio from written content, using its underlying speech models and engines to produce the voice output.
• Multiple Voice Options: Choose from a variety of voices to match your needs.
• Language Support: Generate speech in multiple languages for global accessibility.
• Engine Flexibility: Utilize different speech synthesis engines for varying output styles.
• SSML Support: Customize speech patterns, pitch, and speed using Speech Synthesis Markup Language.
• Real-Time Generation: Quickly convert text to speech with minimal processing time.
What engines does Kokoro support?
Kokoro supports a range of engines, including Google Text-to-Speech, Amazon Polly, and others, depending on your setup.
Can I customize the speech output?
Yes, Kokoro allows you to customize speech using SSML, enabling control over pitch, speed, and emphasis.
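As a rough illustration of that kind of customization, the sketch below builds a small SSML document using only the Python standard library. The `speak`, `prosody`, and `emphasis` elements come from the W3C SSML specification; the helper name `build_ssml` and its parameters are hypothetical, and whether a particular Kokoro setup accepts this markup as-is is an assumption to check against your engine.

```python
import xml.etree.ElementTree as ET


def build_ssml(text, pitch="+0%", rate="medium", emphasized=None):
    """Build an SSML string (hypothetical helper, not part of Kokoro).

    pitch and rate are written to a <prosody> element; if `emphasized`
    occurs in `text`, its first occurrence is wrapped in <emphasis>.
    """
    speak = ET.Element("speak")
    prosody = ET.SubElement(speak, "prosody", {"pitch": pitch, "rate": rate})

    if emphasized and emphasized in text:
        # Split around the first occurrence of the emphasized phrase.
        before, after = text.split(emphasized, 1)
        prosody.text = before
        emphasis = ET.SubElement(prosody, "emphasis", {"level": "strong"})
        emphasis.text = emphasized
        emphasis.tail = after
    else:
        prosody.text = text

    # ElementTree escapes characters such as & and < in the text for us.
    return ET.tostring(speak, encoding="unicode")


ssml = build_ssml(
    "Welcome to the demo.", pitch="+10%", rate="slow", emphasized="demo"
)
print(ssml)
```

The resulting string would then be passed to whichever engine you have configured, in place of plain text. Building the markup with an XML library rather than string concatenation keeps the document well-formed when the input text contains reserved characters.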
Is Kokoro free to use?
Kokoro offers a free tier with basic features, but advanced options may require a subscription or payment.