Based on your observability needs, here's a comprehensive mapping of PromQL queries and complementary trace spans:
Metric Need | PromQL Query | Trace Spans to Enhance Insight |
---|---|---|
Overall Error Rate (Platform-wide) | sum(rate(inference_model_request_error_total[5m])) / sum(rate(inference_model_request_total[5m])) |
gateway.request with error status codes and error messages |
Per-Model Error Rate | sum by(model) (rate(inference_model_request_error_total[5m])) / sum by(model) (rate(inference_model_request_total[5m])) |
gateway.request with gen_ai.request.model attribute |