Generate a summary of any text
Generate detailed speaker diarization from text inputπ¬
Upscale images to x4
Generate a talking face video from an image and audio
Process audio to denoise or extract noise
Mark attendance using face recognition
Use AI to summarize, answer questions, translate, fill blanks, and paraphrase text
Generate sound effects for silent videos
Detect 3D object poses in images
Benchmark AI models by comparison
Translate text into different languages
Turn images grayscale
Detect objects in your image
Explore and benchmark visual document retrieval models
Separate instrumental and vocal tracks from audio files
Launch web-based model application
Swap a face from one image to another
Track, rank and evaluate open LLMs and chatbots
Create 3D reconstructions from videos or images
Restore and colorize old photos