@danbri
Forked from ruvnet/Notebook.ipynb
Created February 17, 2025 21:15
Applying GRPO to DeepSeek-R1-Distill-Qwen-1.5B with LIMO Dataset