Inference Dashboards

Requests: 3
Error rate: 0.0%
Avg latency: 38989 ms
p95 latency: 81043 ms
Tokens: 5,059
Pipeline lag: 40009 ms

Latency (ms) — successful requests, per minute

Throughput — requests/min (errors highlighted)

Tokens by provider

Requests by provider