@ruvnet
Created February 9, 2025 15:11
Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO Dataset