Skip to content

Instantly share code, notes, and snippets.

@wilmoore
Last active March 11, 2024 00:44
Show Gist options
  • Select an option

  • Save wilmoore/2e9faa4ae03eece4bcb57545a37e664a to your computer and use it in GitHub Desktop.

Select an option

Save wilmoore/2e9faa4ae03eece4bcb57545a37e664a to your computer and use it in GitHub Desktop.
AI :: LLM :: InstructGPT

AI :: LLM :: InstructGPT

⪼ Made with 💜 by Polyglot.

About

GPT-3 models aren’t trained to follow user instructions. Our InstructGPT models (highlighted) generate much more helpful outputs in response to user instructions. GPT-3 is trained to predict the next word on a large dataset of Internet text, rather than to safely perform the language task that the user wants. In other words, these models aren’t aligned with their users.


Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment