Current best open-source AI models

Task	Model	Params (billions)	Notes
Image Object Detection	DETR-DC5 R101	0.607
Image Masking	Segment Anything + ViT Huge	0.641
Image Depth Map Creation	Depth Anything v2 Huge	1.3	not released yet, only Small to Large
Caption to Image	Stable Diffusion XL	2.6
Caption to Video	Open-Sora
Image to Video	Stable Video Diffusion XT
Caption to Sound
Image to 3D	InstantMesh		taken from 3D Arena / maybe SV3D
Coding LLM (Instruct)	DeepSeek Coder v2 Instruct	236
Code LLM (Completion / Filling holes)	DeepSeek Coder v2 Base	236
General LLM (Instruct)	Llama 3	70.6
General LLM (Completion / Filling holes)	Roberta Large	0.355
Text to Speech 🇺🇸	Parler	0.6
Speech to Text 🇺🇸	Whisper Large v3	1.54
Text to Speech 🇩🇪
Speech to Text 🇩🇪
Speech to Language	Lang ID
Speaking animation for portraits	Hallo

Jaid/best-open-source-ai-models.md

No results found