Abhinav blacksmithop

DeepSeek R1 Distill: Complete Tutorial for Deployment & Fine-Tuning

This guide shows how to deploy an uncensored DeepSeek R1 Distill model to Google Cloud Run with GPU support and how to perform a basic, functional fine-tuning process. The tutorial is split into:

Environment Setup
FastAPI Inference Server
Docker Configuration
Google Cloud Run Deployment
Fine-Tuning Pipeline (Cold Start, Reasoning RL, Data Collection, Final RL Phase)

I thought I should post my guide on how to do this, as I've struggled with it for quite some time. I wanted something that bypasses the NAT/router, since port forwarding was not an option for me. I will also be moving quite a bit in the upcoming year, so a dynamic IP was almost a given. And obviously I wanted it to be cheap. This does it all.

This is free and requires neither port forwarding nor a static IP. Wildcard domains and SSL certificates are supported. It works with Cloudflare tunnels, Cloudflare DNS, Nginx Proxy Manager and obviously TrueNAS SCALE.

I am not a professional, so if you see a flaw in this design, please let me know!

Requirements:

Domain name at Cloudflare

Set up an Always-on VSCode Tunnel on an Oracle Cloud Always-free Instance

Access your Oracle Cloud VM from any browser on any device

Install VSCode for arm64

wget -O vscode.deb https://code.visualstudio.com/sha/download\?build\=stable\&os\=linux-deb-arm64 && sudo apt install ./vscode.deb --fix-broken -y && rm vscode.deb

Requires Python 3.6+

The only third-party dependency is the Requests library. Install it with: python3 -m pip install requests
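The two requirements above (Python 3.6+ and the Requests library) can be verified before running anything else. The following is a minimal sketch using only the standard library; the function name `check_environment` and its parameters are hypothetical and are not part of the original script.

```python
import importlib.util
import sys

MIN_PYTHON = (3, 6)  # minimum version stated in the requirements


def check_environment(min_version=MIN_PYTHON, package="requests"):
    """Return a list of human-readable problems; an empty list means ready.

    `check_environment` is a hypothetical helper for illustration only.
    str.format is used instead of f-strings so the file still parses on
    interpreters older than 3.6 and can report the version problem.
    """
    problems = []
    if sys.version_info < min_version:
        problems.append(
            "Python {}.{}+ is required, found {}.{}".format(
                min_version[0], min_version[1],
                sys.version_info[0], sys.version_info[1],
            )
        )
    # find_spec returns None when a top-level package is not installed
    if importlib.util.find_spec(package) is None:
        problems.append(
            "Missing dependency '{0}'; install it with: "
            "python3 -m pip install {0}".format(package)
        )
    return problems


if __name__ == "__main__":
    for problem in check_environment():
        print(problem)
```

Running the file directly prints one line per unmet requirement and prints nothing when the environment is ready.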

	<#meta#>
	- Date: 2023-10-05
	- Task: summary
	<#system#>
	Your main objective is to condense the content of the document into a concise summary, capturing the main points and themes.
	<#chat#>
	<#user#>
	Please read the provided Original section to understand the context and content. Use this understanding to generate a summary of the Original section, incorporating relevant details and maintaining coherence with the Prior Summary.

	Notes:

	"""This is an example of how to use async langchain with fastapi and return a streaming response.
	The latest version of Langchain has improved its compatibility with asynchronous FastAPI,
	making it easier to implement streaming functionality in your applications.
	"""
	import asyncio
	import os
	from typing import AsyncIterable, Awaitable

	import uvicorn
	from dotenv import load_dotenv
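The snippet above is cut off before its streaming logic, so the author's actual implementation is not shown. As a rough illustration of the general pattern the docstring describes (an async generator that yields tokens while a producer task runs concurrently), here is a standard-library-only sketch. It does not use Langchain or FastAPI: `produce_tokens` is a hypothetical stand-in for a model call, and `stream_tokens` and `collect` are likewise assumed names.

```python
import asyncio
from typing import AsyncIterable, List

_END = object()  # sentinel marking the end of the stream


async def produce_tokens(prompt: str, queue: asyncio.Queue) -> None:
    """Hypothetical stand-in for a model call: emits one word at a time."""
    try:
        for token in prompt.split():
            await asyncio.sleep(0)  # hand control back to the event loop
            await queue.put(token)
    finally:
        # Always signal completion, even if the producer raised
        await queue.put(_END)


async def stream_tokens(prompt: str) -> AsyncIterable[str]:
    """Yield tokens as soon as the producer task makes them available."""
    queue: asyncio.Queue = asyncio.Queue()
    task = asyncio.create_task(produce_tokens(prompt, queue))
    while True:
        item = await queue.get()
        if item is _END:
            break
        yield item
    await task  # surface any exception raised by the producer


async def collect(prompt: str) -> List[str]:
    """Drain the stream into a list, mainly for demonstration."""
    return [token async for token in stream_tokens(prompt)]


if __name__ == "__main__":
    print(asyncio.run(collect("hello streaming world")))
```

In a FastAPI application, an async generator of this shape is the kind of object that can be handed to `StreamingResponse`, so the client receives each token as it is produced instead of waiting for the full reply.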


	/****************
	* drones101.js *
	****************
	*
	* Do you remember, my dear Professor, a certain introductory
	* computational rationality class you taught long ago? Assignment
	* #2, behavior functions of autonomous agents? I remember that one
	* fondly - but attack drones are so much easier to reason about
	* when they're not staring you in the face, I would imagine!

	#!/usr/bin/env python3.8
	"""
	# ------------------------------------------------------
	#
	# Hive/Steem RPC get_block benchmarker
	# Written by Someguy123 (github.com/Someguy123)
	#
	# (C) 2020 Someguy123 / Privex || License: X11 / MIT
	# Buy a server from Privex! https://www.privex.io
	#

	/*!
	* Gist DarkCode ver 0.2.1
	* Update 03/12/2021 www.adiman.web.id
	*/
	.gist{font-size: 18px}.gist-meta, .gist-file, .octotree_toggle, ul.comparison-list > li.title,button.button, a.button, span.button, button.minibutton, a.minibutton,span.minibutton, .clone-url-button > .clone-url-link{background: linear-gradient(#202020, #181818) !important;border-color: #383838 !important;border-radius: 0 0 3px 3px !important;text-shadow: none !important;color: #b5b5b5 !important}.markdown-format pre, .markdown-body pre, .markdown-format .highlight pre,.markdown-body .highlight pre, body.blog pre, #facebox pre, .blob-expanded,.terminal, .copyable-terminal, #notebook .input_area, .blob-code-context,.markdown-format code, body.blog pre > code, .api pre, .api code,.CodeMirror,.highlight{background-color: #1D1F21!important;color: #C5C8C6!important}.gist .blob-code{padding: 1px 10px !important;text-align: left;background: #000;border: 0}::selection{background: #24890d;color: #fff;text-shadow: none}::-moz-selection{background: