Skip to content

Instantly share code, notes, and snippets.

@jfjensen
Last active May 15, 2025 16:45
Show Gist options
  • Save jfjensen/b7cc8ba29b72740f2d4461260147361d to your computer and use it in GitHub Desktop.
Save jfjensen/b7cc8ba29b72740f2d4461260147361d to your computer and use it in GitHub Desktop.
Using the Guidance AI to access an LLM on a vLLM server
import guidance
import requests
base_url = "http://localhost:9999/v1"
x = requests.get(base_url + "/models")
model = str(x.json()["data"][0]["id"])
guidance.llm = guidance.llms.OpenAI(
model=model,
endpoint=base_url,
api_key="***",
chat_mode=False,
encoding_name="cl100k_base"
)
program = guidance(
"""What are the top five fun facts about {{country}}? Provide a one-liner with a description of each of the facts.
Here are the fun facts:
{{~#geneach 'facts' num_iterations=5}}
[{{@index}}]: "{{gen 'this' stop='"' max_tokens=50 ~}}"
{{/geneach}}
"""
)
result = program(country="France")
print(result["facts"])
print("===========================================")
print(result)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment