Simple Space for the Kokoro Model
Generate audio from text input
Fast, efficient, & multilingual text-to-speech
CPU powered, low RTF, emotional, multilingual TTS
Talk to Qwen2Audio with Gradio and WebRTC ⚡️
Generate text from audio input
Transcribe voice to text
Generate sexual voice sounds from text
Transcribe or translate audio and YouTube videos
Transcribe Persian audio to text
IndicParler_TTS for Urdu_Punjabi & Sindhi
Transcribe YouTube videos to text
Accessibility PDF & pasted text to speech converter w/ gTTs
Kokoro is a speech synthesis tool designed to convert text into natural-sounding speech. It provides a simple and intuitive interface for generating audio from written content, leveraging advanced models and engines to deliver high-quality voice outputs.
• Multiple Voice Options: Choose from a variety of voices to match your needs.
• Language Support: Generate speech in multiple languages for global accessibility.
• Engine Flexibility: Utilize different speech synthesis engines for varying output styles.
• SSML Support: Customize speech patterns, pitch, and speed using Speech Synthesis Markup Language.
• Real-Time Generation: Quickly convert text to speech with minimal processing time.
What engines does Kokoro support?
Kokoro supports a range of engines, including Google Text-to-Speech, Amazon Polly, and others, depending on your setup.
Can I customize the speech output?
Yes, Kokoro allows you to customize speech using SSML, enabling control over pitch, speed, and emphasis.
Is Kokoro free to use?
Kokoro offers a free tier with basic features, but advanced options may require a subscription or payment.