Forked by @sugatoray from awni/mlx_lm_openai.md, created August 16, 2024.
# MLX LM with the OpenAI Python Package

## 1. Install

Install MLX LM and the OpenAI Python package:

```shell
pip install mlx-lm openai
```

## 2. Run the MLX LM server

Start the MLX LM server, which by default listens on `localhost:8080`:

```shell
mlx_lm.server
```
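You can also point the server at a specific model and address up front. A sketch, assuming a recent `mlx-lm` release (the model name is just an example; the model is downloaded from the Hugging Face Hub on first use):

```shell
# Serve a specific model on an explicit port.
# --model and --port are mlx_lm.server flags; adjust as needed.
mlx_lm.server --model mlx-community/Meta-Llama-3-8B-Instruct-4bit --port 8080
```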

## 3. Make the HTTP request

Create a Python script (e.g. `test.py`) with the following:

```python
import openai

# Point the OpenAI client at the local MLX LM server.
# The server ignores the API key, but the client requires one;
# the base URL must include the /v1 prefix the server routes under.
openai_client = openai.OpenAI(
    api_key="placeholder-api", base_url="http://localhost:8080/v1"
)

response = openai_client.chat.completions.create(
    model="mlx-community/Meta-Llama-3-8B-Instruct-4bit",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Say this is a test!"},
    ],
)

# Print the generated reply.
print(response.choices[0].message.content)
```

Run the script with `python test.py`.
