This document provides PromQL queries for monitoring LLM-D deployments using Prometheus metrics. The provided load generation script will populate error metrics for testing.
| Metric Need | PromQL Query | 
|---|---|
| Overall Error Rate (Platform-wide) | sum(rate(inference_model_request_error_total[5m])) / sum(rate(inference_model_request_total[5m])) |