WebGPU text-to-speech powered by OuteTTS and Transformers.js
Generate text transcripts with timestamps from audio or video
Generate speech using a speaker's voice
Spanish finetune for the original F5 model.
MaskGCT TTS Demo
Generate anime character speech from text
Transcribe audio or YouTube videos into text
Transcribe audio from microphone, file, or YouTube link
Generate audio from text or file
Request evaluation of a speech recognition model
Turn text into speech with customizable voice, rate, and pitch
Text-to-Speech WebGPU is a text-to-speech tool powered by OuteTTS and Transformers.js. It uses WebGPU to run speech synthesis in the browser, turning text input into natural-sounding speech. It is intended for applications ranging from voice assistants to content creation.
What browsers are supported?
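Text-to-Speech WebGPU works in browsers that support WebGPU, such as recent desktop versions of Chrome and Edge. WebGPU support in Firefox depends on the version and platform, so confirm that WebGPU is available in your browser before use.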
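One way to check whether a browser can run the tool is to probe the standard WebGPU entry point, `navigator.gpu`, and request an adapter. The sketch below is a minimal illustration, not the demo's actual code. The helper names `getGpu` and `chooseBackend` are hypothetical, and the `"wasm"` fallback label is an assumption about how an app might degrade when WebGPU is missing.

```typescript
// Minimal structural type so the sketch compiles without extra WebGPU typings.
interface GpuLike {
  requestAdapter(): Promise<unknown>;
}

// Hypothetical backend labels for illustration; "wasm" is an assumed fallback.
type Backend = "webgpu" | "wasm";

// Returns the WebGPU entry point if the given navigator-like object exposes one.
function getGpu(nav: unknown): GpuLike | undefined {
  if (typeof nav !== "object" || nav === null) return undefined;
  const gpu = (nav as { gpu?: unknown }).gpu;
  if (typeof gpu !== "object" || gpu === null) return undefined;
  const request = (gpu as { requestAdapter?: unknown }).requestAdapter;
  return typeof request === "function" ? (gpu as GpuLike) : undefined;
}

// Picks "webgpu" only when an adapter can actually be obtained.
// requestAdapter() resolves to null when no suitable GPU is available.
async function chooseBackend(nav: unknown): Promise<Backend> {
  const gpu = getGpu(nav);
  if (!gpu) return "wasm";
  try {
    const adapter = await gpu.requestAdapter();
    return adapter ? "webgpu" : "wasm";
  } catch {
    return "wasm";
  }
}
```

In a browser, calling `chooseBackend(navigator)` resolves to `"webgpu"` when an adapter is available. Passing the navigator object in as a parameter, rather than reading the global directly, keeps the check easy to exercise outside a browser.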
Can I use it offline?
Yes. Once the model has been downloaded and loaded in your browser, Text-to-Speech WebGPU generates speech locally on your device, so you can keep using it offline.
Are there any usage limits?
Usage limits depend on the hosting platform. By default, the demo version may have constraints for free-tier users.