Jonathan Kim miknoj

  • US
@miknoj
miknoj / fairdeck.py
Last active November 22, 2017 04:33
from collections import OrderedDict
from random import shuffle


class FairDeck(object):
    def __init__(self):
        ''' Default constructor.
        '''
        self.deck = []
miknoj / normcore-llm.md
Created September 9, 2023 14:30 — forked from veekaybee/normcore-llm.md
Normcore LLM Reads
miknoj / chatgpt.md
Created September 11, 2023 02:10 — forked from veekaybee/chatgpt.md
Everything I understand about chatgpt

ChatGPT Resources

Context

ChatGPT appeared like an explosion on all my social media timelines in early December 2022. While I keep up with machine learning as an industry, I wasn't focused so much on this particular corner, and all the screenshots seemed like they came out of nowhere. What was this model? How did the chat prompting work? What was the context of OpenAI doing this work and collecting my prompts for training data?

I decided to do a quick investigation. Here's all the information I've found so far. I'm aggregating and synthesizing it as I go, so it's currently changing pretty frequently.

Model Architecture

miknoj / grpo_demo.py
Created February 8, 2025 22:35 — forked from willccbb/grpo_demo.py
GRPO Llama-1B
# train_grpo.py
import re
import torch
from datasets import load_dataset, Dataset
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import LoraConfig
from trl import GRPOConfig, GRPOTrainer
# Load and prep dataset