This guide shows how to deploy an uncensored DeepSeek R1 Distill model to Google Cloud Run with GPU support and how to perform a basic, functional fine-tuning process. The tutorial is split into:
- Environment Setup
- FastAPI Inference Server
- Docker Configuration
- Google Cloud Run Deployment
- Fine-Tuning Pipeline (Cold Start, Reasoning RL, Data Collection, Final RL Phase)