Florence 2

Florence 2 Model (Lightweight)

Florence-2 is a lightweight vision-language model open-sourced by Microsoft under the MIT license. The model demonstrates strong zero-shot and fine-tuning capabilities across tasks such as captioning, object detection, grounding, and segmentation.

Florence-2, released by Microsoft in June 2024, is a foundation vision-language model. This model is very attractive because of its small size (0.2B and 0.7B) and strong performance on a variety of computer vision and vision-language tasks.

Articles

2024/07/26 Florence-2: Microsoft's Latest Vision-Language Model
2024/07/17 Launch: Deploy Florence-2 with Roboflow
2024/07/14 Fine-Tuning Florence 2: Training a Vision Language Model Step-by-Step
2024/07/02 Fine-tuning Florence-2 for VQA (Visual Question Answering) using the Azure ML Python SDK and MLflow
2024/06/27 Unveiling Microsoft Florence 2: The Next Leap in Vision Language Models
2024/06/27 Microsoft Florence-2 - A Tiny Titan in Computer Vision…….
2024/06/26 Florence-2: Revolutionizing Vision-Language Models
2024/06/26 All You Need to Know About Florence-2!
2024/06/25 Hand-on Guide to Vision Language Tasks using Microsoft Florence-2
2024/06/25 How to Fine-tune Florence-2 for Object Detection Tasks
2024/06/24 Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models 🤗
2024/06/23 Is This the Future of Vision? Florence-2 by Microsoft Redefines AI Perception
2024/06/20 Florence-2: Open Source Vision Foundation Model by Microsoft
2024/06/20 Microsoft Unveils Florence-2: A Unified Model for Vision Tasks
2024/02/06 Brief Review — Florence: A New Foundation Model for Computer Vision - Florence, Pretrained Using Image Captioning Dataset
2023/11/23 Microsoft Research Introduces Florence-2: A Novel Vision Foundation Model with a Unified Prompt-based Representation for a Variety of Computer Vision and Vision-Language Tasks
2023/11/23 Florence-2: A Novel Vision Foundation Model with a Unified Prompt-based Representation for a Variety of Computer Vision and Vision-Language Tasks
2023/11/14 Florence-2: Microsoft's New Foundation Model Explained
2023/11/13 Florence-2: Novel Vision Language Model by Microsoft
2023/07/18 Azure AI Vision - Large Foundation Model
2023/03/07 Announcing a renaissance in computer vision AI with Microsoft’s Florence foundation model
2021/12/09 Florence: A New Foundation for Computer Vision
2021/11/29 Microsoft’s ‘Florence’ General-Purpose Foundation Model Achieves SOTA Results on Dozens of CV Benchmarks

Articles (Korean)

Information

2024/06/19 roboflow Models: Florence 2
- How to label data for Florence 2 training
Project Florence (AI) - Advancing the state-of-the-art computer vision technologies
FLD-5B Dataset
ComfyUI Extension: ComfyUI-Florence2

People

Bin Xiao - Principal Researcher at Microsoft GenAI

Open Source

https://github.com/uncbiag/Awesome-Foundation-Models - A curated list of foundation models for vision and language tasks
https://github.com/autodistill/autodistill-florence-2 - Use Florence 2 to auto-label data for use in training fine-tuned object detection models.
https://github.com/kijai/ComfyUI-Florence2 - Inference Microsoft Florence2 VLM
https://github.com/spacepxl/ComfyUI-Florence-2 - Florence-2 image captioning and tasks
https://github.com/andimarafioti/florence2-finetuning - Quick exploration into fine tuning florence 2
https://github.com/xenova/transformers.js - State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
https://github.com/nvidia-holoscan/holohub/tree/main/applications/florence-2-vision
https://github.com/skye0402/florence-2-large-playground - Microsoft Florence-2 Vision Model
https://github.com/SangbumChoi/florence2-triton - Unofficial repository for building Florence-2 in Microsoft Azure

sikang99/Florence-2.md

Florence 2 Model (Lightweight)

Articles

Articles (Korean)

Information

People

Videos

Papers

Hugging Face

Open Source