Simple Space for the Kokoro Model
V1.0Convert any Ebook to AudioBook with Xtts + VoiceCloning!
Generate speech from text with reference audio
Moonshine ASR models running on-device, in your web browser.
Transcribe Persian audio files into text
Generate realistic audio from text
Convert text to speech with Next-gen Kaldi
Convert text to speech effortlessly
Voice Clone Multilingual TTS
ヘスティアのAI音声合成モデルを作りました。
Generate speech from text
Lunch web-based text-to-speech interface
Convert audio to text and summarize highlights
Kokoro is a speech synthesis tool designed to convert text into natural-sounding speech. It provides a simple and intuitive interface for generating audio from written content, leveraging advanced models and engines to deliver high-quality voice outputs.
• Multiple Voice Options: Choose from a variety of voices to match your needs.
• Language Support: Generate speech in multiple languages for global accessibility.
• Engine Flexibility: Utilize different speech synthesis engines for varying output styles.
• SSML Support: Customize speech patterns, pitch, and speed using Speech Synthesis Markup Language.
• Real-Time Generation: Quickly convert text to speech with minimal processing time.
What engines does Kokoro support?
Kokoro supports a range of engines, including Google Text-to-Speech, Amazon Polly, and others, depending on your setup.
Can I customize the speech output?
Yes, Kokoro allows you to customize speech using SSML, enabling control over pitch, speed, and emphasis.
Is Kokoro free to use?
Kokoro offers a free tier with basic features, but advanced options may require a subscription or payment.