Transcribe Persian audio files into text
Transcribe spoken Russian into text
Generate audio from text with adjustable speed
Convert text to speech in multiple languages
Convert text to speech effortlessly
Generate audio from text or file
✨[With v1.0.0] Accelerated TTS on Kokoro-82M
Transcribe audio from microphone, file, or YouTube link
MaskGCT TTS Demo
Better AI powered platform to purify your speech signal
Text to Audio (Sound SFX) Generator
Moonshine ASR models running on-device, in your web browser.
Talk to Qwen2Audio with Gradio and WebRTC ⚡️
Gooya v1.4 Persian Speech Recognition is a cutting-edge tool designed to transcribe Persian audio files into text. It leverages advanced AI technology to accurately recognize and convert spoken Persian language into written format, enabling efficient processing of audio data.
• Support for Persian Language: Tailored to recognize and transcribe Persian speech with high accuracy.
• Multi-Language Support: Capable of handling other languages commonly spoken in Iran, including English.
• High Accuracy: Advanced algorithms ensure precise transcription of spoken words.
• Real-Time Processing: Allows for immediate transcription of audio input.
• Support for Various Audio Formats: Compatible with popular formats such as WAV, MP3, and more.
• Speaker Identification: Can distinguish between multiple speakers in an audio file.
• Custom Vocabulary Support: Enables users to add specific terms or phrases for better accuracy.
• Cross-Platform Compatibility: Available for use on Windows, macOS, and Linux systems.
What languages does Gooya v1.4 support?
Gooya v1.4 primarily supports Persian but can also handle other languages commonly spoken in Iran, including English.
Is Gooya v1.4 compatible with all audio formats?
Gooya v1.4 supports popular audio formats such as WAV, MP3, and FLAC. For less common formats, conversion may be required before transcription.
Can Gooya v1.4 transcribe speech in real-time?
Yes, Gooya v1.4 offers real-time transcription capabilities, making it ideal for live audio processing.