This document provides PromQL queries for monitoring LLM-D deployments with Prometheus metrics. The provided load generation script populates the error metrics, so the queries below can be tested against real data.

| Metric Need | PromQL Query |
|---|---|
| Overall Error Rate (Platform-wide) | `sum(rate(inference_model_request_error_total[5m])) / sum(rate(inference_model_request_total[5m]))` |
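
The platform-wide error rate query can also be evaluated programmatically through Prometheus' HTTP API (`/api/v1/query`). The following is a minimal sketch, not part of the LLM-D tooling: the helper names (`build_query_url`, `parse_scalar_result`, `fetch_error_rate`) are hypothetical, and the base URL `http://localhost:9090` assumes a Prometheus server reachable on its default port.

```python
import json
import urllib.parse
import urllib.request

# The platform-wide error rate query from the table above.
ERROR_RATE_QUERY = (
    "sum(rate(inference_model_request_error_total[5m])) "
    "/ sum(rate(inference_model_request_total[5m]))"
)


def build_query_url(base_url: str, query: str) -> str:
    """Build an instant-query URL for the Prometheus HTTP API."""
    params = urllib.parse.urlencode({"query": query})
    return f"{base_url.rstrip('/')}/api/v1/query?{params}"


def parse_scalar_result(payload: dict):
    """Return the first sample value as a float, or None if no series matched."""
    if payload.get("status") != "success":
        raise RuntimeError(f"query failed: {payload.get('error', 'unknown error')}")
    result = payload["data"]["result"]
    if not result:
        # An empty vector typically means no samples in the window.
        return None
    # Each sample looks like {"metric": {...}, "value": [timestamp, "0.025"]}.
    return float(result[0]["value"][1])


def fetch_error_rate(base_url: str = "http://localhost:9090"):
    """Query Prometheus (assumed base URL) and return the error ratio."""
    url = build_query_url(base_url, ERROR_RATE_QUERY)
    with urllib.request.urlopen(url, timeout=10) as resp:
        return parse_scalar_result(json.load(resp))
```

Calling `fetch_error_rate()` returns the ratio as a fraction between 0 and 1 (multiply by 100 for a percentage), or `None` when the query matches no series, for example before the load generation script has produced any requests.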