This guide is meant for Windows users who wish to run Facebook's Llama AI language model on their own PC locally. Our focus will be on character chats, reminiscent of platforms like character.ai / c.ai, using Llama architecture models. Most recently, in late 2023 and early 2024, Mistral AI has released high quality models that are based of the Llama architecture, and will work in the same way if you choose to use them.
-
- Windows operating system (may make Mac version of the guide later)
- GPU with at least a few gigabytes of VRAM (NVIDIA graphics cards recommended)
-
Sufficient regular RAM for a model (system memory)