wilmoore/readme.md Secret

Last active March 11, 2024 00:44

AI :: LLM :: InstructGPT

⪼ Made with 💜 by Polyglot.

About

GPT-3 models aren’t trained to follow user instructions. Our InstructGPT models generate much more helpful outputs in response to user instructions. GPT-3 is trained to predict the next word on a large dataset of Internet text, rather than to safely perform the language task that the user wants. In other words, these models aren’t aligned with their users.
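
To make the "predict the next word" objective concrete, here is a minimal sketch using a toy bigram counter. It is not how GPT-3 works internally (GPT-3 is a large neural network, not a count table), and the corpus, function names, and prompt are all hypothetical. It only illustrates the behavior described above: a pure next-word predictor continues the text it is given rather than carrying out the instruction in it.

```python
from collections import Counter, defaultdict
from typing import Dict, Optional

# Hypothetical toy corpus, standing in for "a large dataset of Internet text".
TOY_CORPUS = (
    "please summarize this article . "
    "please summarize this book . "
    "please summarize this article ."
)


def train_bigram(corpus: str) -> Dict[str, Counter]:
    """Count, for each word, which words follow it in the corpus."""
    counts: Dict[str, Counter] = defaultdict(Counter)
    words = corpus.lower().split()
    for prev, nxt in zip(words, words[1:]):
        counts[prev][nxt] += 1
    return counts


def predict_next(counts: Dict[str, Counter], word: str) -> Optional[str]:
    """Return the most frequent follower of `word`, or None if unseen."""
    followers = counts.get(word.lower())
    if not followers:
        return None
    return followers.most_common(1)[0][0]


def continue_text(counts: Dict[str, Counter], prompt: str, max_words: int = 3) -> str:
    """Greedily append the most likely next word, up to `max_words` times."""
    words = prompt.lower().split()
    for _ in range(max_words):
        nxt = predict_next(counts, words[-1])
        if nxt is None:
            break
        words.append(nxt)
    return " ".join(words)


if __name__ == "__main__":
    counts = train_bigram(TOY_CORPUS)
    # The "model" does not summarize anything; it just extends the pattern.
    print(continue_text(counts, "please summarize this"))
    # prints "please summarize this article . please"
```

The prompt reads like an instruction, but the predictor treats it as text to extend. Closing that gap between "likely continuation" and "what the user wants" is the alignment problem InstructGPT is meant to address.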

AI :: LLM :: RLHF
