Highlight objects in images using text prompts
Generate saliency maps from RGB and depth images
Visual Retrieval with ColPali and Vespa
Detect if a person in a picture is a Host from Westworld
Tag images with NSFW labels
Generate depth map from an image
Complete depth for images using sparse depth maps
Generate mask from image
Apply ZCA Whitening to images
Simulate wearing clothes on images
Interact with Florence-2 to analyze images and generate descriptions
Convert images of screens to structured elements
Enhance faces in old or AI-generated photos
Florence2 + SAM2 Masking is an advanced AI tool designed to highlight specific objects within images using text prompts. By combining the powerful vision capabilities of Florence2 and the language processing strengths of SAM2, this tool enables users to precisely mask and isolate objects in images based on descriptive text inputs.
• Object Highlighting: Accurately identifies and highlights objects in images using text prompts.
• Multi-Object Support: Can process and mask multiple objects within a single image.
• Text-Based Control: Uses natural language descriptions to guide the masking process.
• Model Synergy: Leverages the strengths of both Florence2 and SAM2 for enhanced accuracy and versatility.
What file formats does Florence2 + SAM2 Masking support?
Florence2 + SAM2 Masking supports common image formats such as PNG, JPG, and JPEG.
Can I use complex text prompts?
Yes, you can use complex and detailed text prompts to achieve more precise masking results.
How long does the masking process take?
The processing time depends on the image size and complexity. Typically, it takes a few seconds for standard images.