- Run `python train_gpt2.py`
- Copy and paste the .cu snippet below right above this line here:
- Use https://tiktokenizer.vercel.app/?model=gpt2 to generate tokens
- Copy and paste the values into the `tokens` array. (Keep the -1 at the end; it marks the end of the array.)
- Run the .sh snippet listed below, linking your model binary
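The token-pasting step above can be sketched in Python. This is a minimal illustration, not part of the original snippets: `format_tokens_array` and `count_tokens` are hypothetical helper names, the example IDs are placeholders rather than a verified tokenization, and the exact declaration of the `tokens` array in the .cu snippet is not shown here, so only the comma-separated values are produced.

```python
def format_tokens_array(token_ids):
    """Return comma-separated token IDs with the -1 terminator appended.

    Hypothetical helper: the result is meant to be pasted between the
    braces of the `tokens` array.
    """
    ids = [int(t) for t in token_ids]
    if any(t < 0 for t in ids):
        # -1 is reserved as the end-of-array marker, so real IDs must be >= 0.
        raise ValueError("token IDs must be non-negative")
    return ", ".join(str(t) for t in ids + [-1])


def count_tokens(tokens):
    """Count entries before the -1 terminator, the way a sentinel-based reader would."""
    n = 0
    for t in tokens:
        if t == -1:
            break
        n += 1
    return n


if __name__ == "__main__":
    example_ids = [15496, 11, 995]  # placeholder IDs copied from the tokenizer page
    print(format_tokens_array(example_ids))
    print(count_tokens(example_ids + [-1]))
```

The second helper shows why the trailing -1 matters: without it, code that scans for the terminator has no way to know where the array ends.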