MattPitlyk/fine-tuning-gpt-2-on-a-custom-dataset.ipynb

Created February 14, 2020 19:14

Star (12) You must be signed in to star a gist
Fork (1) You must be signed in to fork a gist

Learn more about clone URLs
Clone this repository at <script src="https://gist.github.com/MattPitlyk/45541145ad48b93da395f0a72ec2e7dc.js"></script>
Save MattPitlyk/45541145ad48b93da395f0a72ec2e7dc to your computer and use it in GitHub Desktop.

Download ZIP

Fine-Tuning GPT-2 on a Custom Dataset

Raw

fine-tuning-gpt-2-on-a-custom-dataset.ipynb

Sorry, something went wrong. Reload?

Sorry, we cannot display this file.

Sorry, this file is invalid so it cannot be displayed.

Author

MattPitlyk commented Aug 16, 2022

Just a text file with each example sentence on its own line. No json needed.

seungjun-green commented Jan 20, 2023

If I want to create a text summary ml model by fine tuning GPT2, then how the text file should be formatted?

anujsahani01 commented May 20, 2023

the text will get tokenized in its own?
we just have to pass the text file, what kind of formatting should be done,

Question: 'the ques'

Answer: 'the answer'

will this format work?
And secondly my colab session is crashing when we train the model, what can be the solution to this?

AhmedAskar12 commented Jul 11, 2023

It requires >20gb memory so you can subscribe to colab plus or use a free trial virtual machine.

AhmedAskar12 commented Jul 11, 2023 •

edited

Loading

Is that gpt2 fine tuning approach effective btw?

MattPitlyk/fine-tuning-gpt-2-on-a-custom-dataset.ipynb

MattPitlyk commented Aug 16, 2022

Uh oh!

seungjun-green commented Jan 20, 2023

Uh oh!

anujsahani01 commented May 20, 2023

Uh oh!

AhmedAskar12 commented Jul 11, 2023

Uh oh!

AhmedAskar12 commented Jul 11, 2023 •

edited

Loading

Uh oh!

MattPitlyk/fine-tuning-gpt-2-on-a-custom-dataset.ipynb

MattPitlyk commented Aug 16, 2022

Uh oh!

seungjun-green commented Jan 20, 2023

Uh oh!

anujsahani01 commented May 20, 2023

Question: 'the ques'

Answer: 'the answer'

Uh oh!

AhmedAskar12 commented Jul 11, 2023

Uh oh!

AhmedAskar12 commented Jul 11, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AhmedAskar12 commented Jul 11, 2023 •

edited

Loading