Skip to content

Instantly share code, notes, and snippets.

@Jaid
Last active June 20, 2024 00:15
Show Gist options
  • Save Jaid/5ddee6cf20145da1b1b8745ef23b59e1 to your computer and use it in GitHub Desktop.
Save Jaid/5ddee6cf20145da1b1b8745ef23b59e1 to your computer and use it in GitHub Desktop.
Current best open-source AI models

Current best open-source AI models

Task Model Params (billions) Notes
Image Object Detection DETR-DC5 R101 0.607
Image Masking Segment Anything + ViT Huge 0.641
Image Depth Map Creation Depth Anything v2 Huge 1.3 not released yet, only Small to Large
Caption to Image Stable Diffusion XL 2.6
Caption to Video Open-Sora
Image to Video Stable Video Diffusion XT
Caption to Sound
Image to 3D InstantMesh taken from 3D Arena / maybe SV3D
Coding LLM (Instruct) DeepSeek Coder v2 Instruct 236
Code LLM (Completion / Filling holes) DeepSeek Coder v2 Base 236
General LLM (Instruct) Llama 3 70.6
General LLM (Completion / Filling holes) Roberta Large 0.355
Text to Speech 🇺🇸 Parler 0.6
Speech to Text 🇺🇸 Whisper Large v3 1.54
Text to Speech 🇩🇪
Speech to Text 🇩🇪
Speech to Language Lang ID
Speaking animation for portraits Hallo
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment