Skip to content

Instantly share code, notes, and snippets.

@pydemo
Created July 23, 2024 14:05
Show Gist options
  • Select an option

  • Save pydemo/aeb091f09cdde1078a75cf3eca495a8f to your computer and use it in GitHub Desktop.

Select an option

Save pydemo/aeb091f09cdde1078a75cf3eca495a8f to your computer and use it in GitHub Desktop.
Model Description
BERT Introduced by Google in 2018, BERT is a transformer-based model with 342 million parameters. It's used for query understanding in Google Search.
Claude Created by Anthropic, Claude focuses on constitutional AI. The latest iteration, Claude 3.5 Sonnet, excels in understanding nuance, humor, and complex instructions. Available for free via Claude.ai and the Claude iOS app.
Cohere Cohere provides several custom-trainable LLMs like Command, Rerank, and Embed. Founded by an author of "Attention Is All You Need," it operates on multiple clouds.
Ernie Baidu’s Ernie 4.0 chatbot, with rumored 10 trillion parameters, has over 45 million users. It's primarily effective in Mandarin but capable in other languages.
Falcon 40B Developed by the Technology Innovation Institute, Falcon 40B is a causal decoder-only model available in smaller variants (1B, 7B). It's open source and available on Amazon SageMaker and GitHub.
Gemini Google's multimodal LLM, replacing Palm in Bard, now Gemini. Comes in Ultra, Pro, and Nano sizes, and outperforms GPT-4 on most benchmarks. Integrated into many Google products.
Gemma Google's open-source models, Gemma 2B and 7B, trained on the same resources as Gemini, can run locally and surpass similarly sized Llama 2 models on several benchmarks.
GPT-3 OpenAI's 175 billion parameter model, GPT-3 uses a decoder-only transformer architecture and is integrated exclusively into Microsoft products.
GPT-3.5 An upgraded version of GPT-3, fine-tuned with reinforcement learning from human feedback. Powers ChatGPT and is integrated into Bing Search, replaced by GPT-4.
GPT-4 OpenAI's 2023 release, GPT-4, is a multimodal model rumored to have 170 trillion parameters. It can process both text and images, demonstrating human-level performance in multiple exams. Powers Bing Search and ChatGPT Plus.
GPT-4o Successor to GPT-4, offering faster, more natural interactions and multimodal capabilities. Free for developer and customer products.
Lamda Google's Lamda, announced in 2021, is a decoder-only transformer model pre-trained on a large corpus. Known for generating claims of sentience from a Google engineer.
Llama Meta's LLM, released in 2023, available in sizes up to 65 billion parameters. Initially for approved researchers, now open source. Trained on public data like CommonCrawl and Wikipedia.
Mistral A 7 billion parameter model that outperforms Llama's similar-sized models. It is self-hostable and suitable for business purposes, released under the Apache 2.0 license.
Orca Developed by Microsoft with 13 billion parameters, Orca aims to improve on other open source models, achieving similar performance to GPT-4 with fewer parameters. Built on LLaMA 13B.
Palm Google's 540 billion parameter model powering Bard, specializing in reasoning tasks like coding and math. Part of the Pathways initiative for diverse applications.
Phi-1 Microsoft’s transformer-based model with 1.3 billion parameters, trained on high-quality data. Specializes in Python coding.
StableLM Stability AI's series of open source models, ranging from 3 billion to 175 billion parameters. Aims to be transparent and accessible.
Vicuna 33B Derived from Llama, developed by LMSYS, with 33 billion parameters. Performs well for its size, though smaller and less capable than GPT-4.
Seq2Seq Google's deep learning approach for machine translation and NLP, underlying modern models like LaMDA and AlexaTM 20B. Uses a mix of encoders and decoders.
Eliza An early NLP program from 1966, simulating conversation using pattern matching. One of the first language models, demonstrating basic interaction principles.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment