Simple Space for the Kokoro Model
Sound effect from description
Generate natural-sounding speech from text using OpenAI's API
Generate text from audio input
WebGPU text-to-Speech powered by OuteTTS and Transformers.js
audio-arena
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Accessibility PDF & pasted text to speech converter w/ gTTs
Generate anime character speech from text
Transcribe Persian audio to text
CPU powered, low RTF, emotional, multilingual TTS
Generate realistic voices from text
Efficient, fast, and natural text to speech with StyleTTS 2!
Kokoro is a speech synthesis tool designed to convert text into natural-sounding speech. It provides a simple and intuitive interface for generating audio from written content, leveraging advanced models and engines to deliver high-quality voice outputs.
• Multiple Voice Options: Choose from a variety of voices to match your needs.
• Language Support: Generate speech in multiple languages for global accessibility.
• Engine Flexibility: Utilize different speech synthesis engines for varying output styles.
• SSML Support: Customize speech patterns, pitch, and speed using Speech Synthesis Markup Language.
• Real-Time Generation: Quickly convert text to speech with minimal processing time.
What engines does Kokoro support?
Kokoro supports a range of engines, including Google Text-to-Speech, Amazon Polly, and others, depending on your setup.
Can I customize the speech output?
Yes, Kokoro allows you to customize speech using SSML, enabling control over pitch, speed, and emphasis.
Is Kokoro free to use?
Kokoro offers a free tier with basic features, but advanced options may require a subscription or payment.