Wanna create and play with an AI clone of yourself or someone else (my lawyer says please don't)[^1] like this one? You're in luck because it's super easy!
This step really varies depending on your data sources, but the end goal is to turn some of real-you's conversations (from your platforms of choice) into a ShareGPT format dataset with you as the gpt
. Here's what your (json) file should end up looking like:
{"conversations": [{"from": "human", "value": "Hi"}, {"from": "gpt", "value": "Hello"}]}
{"conversations": [{"from": "human", "value": "What's up "}, {"from": "gpt", "value": "not much, you?"}, {"from": "human", "value": "Just thinking, what if you're a robot and I don't realize it?"}, {"from": "gpt", "value": "hahaha don't be crazy"}]}
...
NOTE: Make sure every line starts with a message from the other person ("human")