Simple Space for the Kokoro Model
CPU powered, low RTF, emotional, multilingual TTS
StyleTTS2 trained on ukrainian dataset
Converse with Claude Play.ai and WebRTC ⚡️
Moonshine ASR models running on-device, in your web browser.
Generate natural-sounding speech from text using OpenAI's API
High-fidelity Text-To-Speech
Cloning Voice tokoh Indonesia - Bahasa Indonesia
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Kokoro is an open-weight TTS model with 82 million parameters.
MaskGCT TTS Demo
Convert text to speech with Next-gen Kaldi
A demo of Indic Parler-TTS
Kokoro is a speech synthesis tool designed to convert text into natural-sounding speech. It provides a simple and intuitive interface for generating audio from written content, leveraging advanced models and engines to deliver high-quality voice outputs.
• Multiple Voice Options: Choose from a variety of voices to match your needs.
• Language Support: Generate speech in multiple languages for global accessibility.
• Engine Flexibility: Utilize different speech synthesis engines for varying output styles.
• SSML Support: Customize speech patterns, pitch, and speed using Speech Synthesis Markup Language.
• Real-Time Generation: Quickly convert text to speech with minimal processing time.
What engines does Kokoro support?
Kokoro supports a range of engines, including Google Text-to-Speech, Amazon Polly, and others, depending on your setup.
Can I customize the speech output?
Yes, Kokoro allows you to customize speech using SSML, enabling control over pitch, speed, and emphasis.
Is Kokoro free to use?
Kokoro offers a free tier with basic features, but advanced options may require a subscription or payment.