Try on clothes virtually with an image
tryitout-HD-virtual-tryon that works with a single image
Qwen2-VL is a vision-language model that performs OCR
Extract text from images
Detect and estimate human poses in images
Restore degraded audio using a Transformer-based model
Process video to detect specified objects
Generate captions for images
Ultra-high resolution image synthesis
Generate lip-synced talking head video from audio
Forecast future values from time series data
Create realistic images from prompts and images
Pull the best trade setups based on RSI conditions
Extract text and search keywords from images
Quickly edit the expression of a face
Transcribe Persian audio files into text
Clone a voice with text input
Generate and export filtered syndical news reports to PDF
Generate a 3D face model from an image or webcam
Generate videos from an image and text prompt