These instructions are for a Windows host and assume that you're using Ollama. If you aren't using Ollama, you can skip the proxy setup entirely.
Ensure Ollama is running on the host:
$env:OLLAMA_CONTEXT_LENGTH = 65536
ollama serve
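
Once `ollama serve` is up, a quick reachability check can confirm the host is actually listening before you go on to the proxy. The following is only a minimal sketch: it assumes Ollama is on its default address `http://127.0.0.1:11434` (change it if you have set `OLLAMA_HOST`), and the helper name `ollama_is_running` is made up for illustration.

```python
import urllib.request
import urllib.error

# Assumption: Ollama's default listen address; adjust if OLLAMA_HOST is set.
OLLAMA_URL = "http://127.0.0.1:11434"


def ollama_is_running(base_url: str = OLLAMA_URL, timeout: float = 2.0) -> bool:
    """Return True if an HTTP GET on the server root answers with status 200."""
    try:
        with urllib.request.urlopen(base_url, timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure, etc. all count as "not running".
        return False


if __name__ == "__main__":
    print("Ollama reachable:", ollama_is_running())
```

If this prints `False`, check that the `ollama serve` window is still open and that nothing else is bound to the port.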

#!/usr/bin/env python
import tornado.web
import tornado.gen
import time
from functools import partial
import os
from concurrent.futures import ThreadPoolExecutor