Duration: 14h 5m | Messages: 625 | Phases: 10
Session Overview
Duration: 13h 6m | Messages: 388 | Phases: 6
what's still pending to do here. do you ahve plans on how to improve this significantly. what would make it useful. why would people want to use it. how do you think they'd use it. what are their motivations. are we meeting them. are we helping them look good?
Duration: 12h 55m | Messages: 328 | Phases: 3
what's still pending to do here. do you ahve plans on how to improve this significantly. what would make it useful. why would people want to use it. how do you think they'd use it. what are their motivations. are we meeting them. are we helping them look good?
| # train_grpo.py | |
| import re | |
| import torch | |
| from datasets import load_dataset, Dataset | |
| from transformers import AutoTokenizer, AutoModelForCausalLM | |
| from peft import LoraConfig | |
| from trl import GRPOConfig, GRPOTrainer | |
| # Load and prep dataset |
| Verifying my Blockstack ID is secured with the address 1M7sco3kdZy2qiSLBkRWQg2XKJaMDHHvQe https://explorer.blockstack.org/address/1M7sco3kdZy2qiSLBkRWQg2XKJaMDHHvQe |
I hereby claim:
To claim this, I am signing this object: