Analyze images to generate captions, detect objects, or perform OCR
Facial expressions, 3D landmarks, embeddings, recognition.
Complete depth for images using sparse depth maps
Segment objects in images and videos using text prompts
Detect if a person in a picture is a Host from Westworld
Detect lines in images using a transformer-based model
Segment body parts in images
Search for illustrations using descriptions or images
Swap faces in images
Simulate wearing clothes on images
Test
Analyze images to identify marine species and objects
Enhance and upscale images, especially faces
Florence 2 is an AI-powered image analysis tool designed to process and understand visual data. It can generate captions, detect objects, and perform OCR (Optical Character Recognition) on images, making it versatile for various applications.
What formats does Florence 2 support?
Florence 2 supports common image formats such as JPG, PNG, and BMP.
How accurate is Florence 2?
Accuracy depends on the quality of the input image and the complexity of the task. Generally, it achieves high precision in ideal conditions.
Can Florence 2 process handwritten text?
Yes, Florence 2 can extract handwritten text using OCR, though accuracy may vary based on handwriting legibility.