v0.1.0-beta

Evaluations

Benchmark RAG performance using domain-specific questions with Grok-powered evaluation

Avg. Latency

0.00s

Avg. Quality

0.00/5

Run New Benchmark

Evaluate Legal questions against the RAG backend

Total Queries

0

Model

grok-4-1-fast-non-reasoning

Active Domain

Legal

Input Query | Model | Latency | Quality Score

No evaluations yet

Click "Run New Benchmark" above to start evaluating Legal questions

Quality Guide:
Excellent (≥4.5)
Good (4.0 to <4.5)
Fair (<4.0)
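
The quality bands and the two averages shown on this page can be sketched in a few lines of Python. This is only an illustration, not the dashboard's actual code: the names EvalResult, quality_label, and summarize are hypothetical, and treating every score from 4.0 up to (but not including) 4.5 as Good is an assumption about how the bands meet. Returning zeros for an empty run is likewise assumed, chosen to match the 0.00s and 0.00/5 shown when Total Queries is 0.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class EvalResult:
    """One row of the results table (hypothetical structure)."""
    query: str
    model: str
    latency_s: float  # seconds
    quality: float    # score on a 0 to 5 scale


def quality_label(score: float) -> str:
    """Map a quality score to the band names used in the Quality Guide."""
    if score >= 4.5:
        return "Excellent"
    if score >= 4.0:  # assumption: covers everything from 4.0 up to 4.5
        return "Good"
    return "Fair"


def summarize(results: list[EvalResult]) -> dict:
    """Compute the headline numbers: total queries and the two averages."""
    if not results:
        # Assumed behavior: an empty run reports zeros rather than failing.
        return {"total_queries": 0, "avg_latency_s": 0.0, "avg_quality": 0.0}
    return {
        "total_queries": len(results),
        "avg_latency_s": round(mean(r.latency_s for r in results), 2),
        "avg_quality": round(mean(r.quality for r in results), 2),
    }
```

Checking the upper bound first keeps each band to a single comparison, so a score such as 4.45 cannot fall between two bands.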