Inference Dashboards

← back to chat
Requests
5
Error rate
0.0%
Avg latency
25415 ms
p95 latency
75123 ms
Tokens
5,273
Pipeline lag
26450 ms

Latency (ms) — successful requests, per minute

Throughput — requests/min (errors highlighted)

Tokens by provider

Requests by provider