This document provides PromQL queries for monitoring LLM-D deployments using Prometheus metrics. The provided load generation script will populate error metrics for testing.
| Desired Information | PromQL Query |
#!/bin/bash | |
# Load generation script with malformed requests to trigger error metrics | |
# Usage: ./generate-load-llmd.sh [duration_minutes] | |
set -e | |
ENDPOINT="http://localhost:8080/v1" | |
DURATION_MINUTES=${1:-5} | |
MODEL_NAME="Qwen/Qwen3-0.6B" |
This document provides PromQL queries for monitoring LLM-D deployments using Prometheus metrics. The provided load generation script will populate error metrics for testing.
| Desired Information | PromQL Query |
This document provides an overview of all metrics generated by the llm-d
components.
The llm-d
system uses Prometheus as the primary metrics collection framework, with metrics covering inference performance, resource utilization,
error rates, and energy consumption across multiple components.
See the tutorial below. |
#!/usr/bin/env bash | |
set -o errexit | |
# Create a container | |
container=$(buildah from alpine) | |
# Run this from wherever the built binaries are available | |
buildah config --label maintainer="Sally O'Malley <[email protected]>" $container |
apiVersion: v1 | |
kind: Namespace | |
metadata: | |
name: rekor-system | |
labels: | |
openshift.io/cluster-monitoring: true | |
--- | |
apiVersion: rbac.authorization.k8s.io/v1 | |
kind: Role | |
metadata: |
$ df -Th | |
Filesystem Type Size Used Avail Use% Mounted on | |
devtmpfs devtmpfs 3.8G 0 3.8G 0% /dev | |
tmpfs tmpfs 3.8G 0 3.8G 0% /dev/shm | |
tmpfs tmpfs 3.8G 25M 3.8G 1% /run | |
/dev/mapper/rhel-root xfs 70G 5.4G 64G 8% / | |
/dev/vda1 xfs 794M 214M 581M 27% /boot | |
/dev/vda2 vfat 200M 8.0K 200M 1% /boot/efi | |
tmpfs tmpfs 777M 0 777M 0% /run/user/1000 | |
---------------------------------------------------------- |
#!/bin/bash | |
### Run `make sigstore-testenv-up` from local checkout of containers/skopeo before running this script. | |
### This script must run from local checkout of sigstore/sigstore | |
set -ex | |
echo "running tests" | |
export VAULT_TOKEN=testtoken |
# This is an example configTarget that is used in testing. | |
# Harpoon will start with this config then will load targets from ./examples/config-reload.yaml | |
targets: | |
- name: config | |
methods: | |
configTarget: | |
configUrl: https://raw.githubusercontent.com/sallyom/harpoon/config-upload/examples/config-reload.yaml | |
schedule: "*/1 * * * *" |
# Helper function for fancy git prompt. | |
# Place this in ~/.bashrc | |
# Then, `source ~/.bashrc` will execute the prompt function w/out having to reboot | |
# (otherwise, any ~/.bashrc settings take effect with every reboot. | |
# Lines 7-38 go in ~/.bashrc | |
function parse_git_branch { | |
git branch --no-color 2> /dev/null | sed -e '/^[^*]/d' -e 's/* \(.*\)/(\1)/' | |
} |