Florence-2 is a lightweight foundation vision-language model that Microsoft open-sourced under the MIT license in June 2024. It is attractive for its small size (0.2B and 0.7B parameters) and its strong zero-shot and fine-tuning performance across computer vision and vision-language tasks such as captioning, object detection, grounding, and segmentation.
- 2024/07/26 Florence-2: Microsoft's Latest Vision-Language Model
- 2024/07/17 Launch: Deploy Florence-2 with Roboflow
- 2024/07/14 Fine-Tuning Florence 2: Training a Vision Language Model Step-by-Step
- 2024/07/02 Fine-tuning Florence-2 for VQA (Visual Question Answering) using the Azure ML Python SDK and MLflow
- 2024/06/27 Unveiling Microsoft Florence 2: The Next Leap in Vision Language Models
- 2024/06/27 Microsoft Florence-2 - A Tiny Titan in Computer Vision
- 2024/06/26 Florence-2: Revolutionizing Vision-Language Models
- 2024/06/26 All You Need to Know About Florence-2!
- 2024/06/25 Hands-on Guide to Vision Language Tasks using Microsoft Florence-2
- 2024/06/25 How to Fine-tune Florence-2 for Object Detection Tasks
- 2024/06/24 Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models 🤗
- 2024/06/23 Is This the Future of Vision? Florence-2 by Microsoft Redefines AI Perception
- 2024/06/20 Florence-2: Open Source Vision Foundation Model by Microsoft
- 2024/06/20 Microsoft Unveils Florence-2: A Unified Model for Vision Tasks
- 2024/02/06 Brief Review — Florence: A New Foundation Model for Computer Vision - Florence, Pretrained Using Image Captioning Dataset
- 2023/11/23 Microsoft Research Introduces Florence-2: A Novel Vision Foundation Model with a Unified Prompt-based Representation for a Variety of Computer Vision and Vision-Language Tasks
- 2023/11/23 Florence-2: A Novel Vision Foundation Model with a Unified Prompt-based Representation for a Variety of Computer Vision and Vision-Language Tasks
- 2023/11/14 Florence-2: Microsoft's New Foundation Model Explained
- 2023/11/13 Florence-2: Novel Vision Language Model by Microsoft
- 2023/07/18 Azure AI Vision - Large Foundation Model
- 2023/03/07 Announcing a renaissance in computer vision AI with Microsoft’s Florence foundation model
- 2021/12/09 Florence: A New Foundation for Computer Vision
- 2021/11/29 Microsoft’s ‘Florence’ General-Purpose Foundation Model Achieves SOTA Results on Dozens of CV Benchmarks
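- 2024/06/28 Review of Microsoft's Latest Vision Model Florence-2: The Innovation of a 0.7B Vision Model (in Korean)
- 2024/06/25 Microsoft's Florence-2 Vision - An Open-Source Vision Model (in Korean)
- 2024/06/21 MS Releases Multimodal AI 'Florence-2'...Unified Speech and Vision Processing (in Korean)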
- 2024/06/19 roboflow Models: Florence 2
- Project Florence (AI) - Advancing the state-of-the-art computer vision technologies
- FLD-5B Dataset
- ComfyUI Extension: ComfyUI-Florence2
- Bin Xiao - Principal Researcher at Microsoft GenAI
- 2023/11/20 Microsoft Introduces Florence 2 Computer Vision
- 2023/10/27 Make Azure AI Real: Apply Azure AI Services for Vision and Florence to Your Image Data
- 2023/04/21 Azure Cognitive Service for Vision with the Florence foundation model
- 2023 Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks - CVPR 2024
- 2021 Florence: A New Foundation Model for Computer Vision
- microsoft's Collections: Phi-3
- microsoft's Collections: Florence
- Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
- HuggingFaceM4
- https://github.com/uncbiag/Awesome-Foundation-Models - A curated list of foundation models for vision and language tasks
- https://github.com/autodistill/autodistill-florence-2 - Use Florence 2 to auto-label data for use in training fine-tuned object detection models.
- https://github.com/kijai/ComfyUI-Florence2 - Inference Microsoft Florence2 VLM
- https://github.com/spacepxl/ComfyUI-Florence-2 - Florence-2 image captioning and tasks
- https://github.com/andimarafioti/florence2-finetuning - Quick exploration into fine tuning florence 2
- https://github.com/xenova/transformers.js - State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
- https://github.com/nvidia-holoscan/holohub/tree/main/applications/florence-2-vision
- https://github.com/skye0402/florence-2-large-playground - Microsoft Florence-2 Vision Model
- https://github.com/SangbumChoi/florence2-triton - Unofficial repository for building Florence-2 in Microsoft Azure