Enhance and upscale images for better resolution
Extract and visualize text with Florence 2 OCR
Generate answers to questions about images
Describe images using text
A tiny vision language model
Generate text responses in a chat interface
Rank images based on text similarity
Generate music from an image
Generate captions for your images
Generate music with chord progression and MIDI prompts
Generate a detailed song in MIDI and WAV formats
Age estimation
Swap faces in photos and videos
Generate animated characters from images
Get a personalized recommendation using AI
Detect and mark facial landmarks in photos
Generate audio effects from video using image captions
Upload an image, detect objects, hear descriptions
Generate a video with text synchronized to audio
Convert your audio to 8D