Evaluations
Benchmark RAG performance using domain-specific questions with Grok-powered evaluation
Avg. Latency: 0.00s
Avg. Quality: 0.00/5
Run New Benchmark
Evaluate Legal questions against the RAG backend
Total Queries: 0
Model: grok-4-1-fast-non-reasoning
Active Domain: Legal
| Input Query | Model | Latency | Quality Score |
|---|---|---|---|
| No evaluations yet. Click "Run Benchmark" above to start evaluating Legal questions. | | | |
Quality Guide:
Excellent (≥4.5)
Good (4.0 to <4.5)
Fair (<4.0)
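
The Quality Guide thresholds and the two averages shown at the top of the page can be expressed in a few lines. The following is a minimal sketch, not the dashboard's actual implementation: `EvalResult`, `quality_label`, and `summarize` are hypothetical names, the field layout simply mirrors the table columns, and the sketch assumes that any score from 4.0 up to (but not including) 4.5 counts as Good. It also assumes that an empty run reports zeros, matching the 0.00s and 0.00/5 shown before any benchmark has been run.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class EvalResult:
    """One benchmark row (hypothetical shape mirroring the table columns)."""
    query: str
    model: str
    latency_s: float  # response time in seconds
    quality: float    # judge score on a 0-5 scale


def quality_label(score: float) -> str:
    """Map a 0-5 quality score to the Quality Guide bucket.

    Assumption: scores in [4.0, 4.5) are treated as Good.
    """
    if score >= 4.5:
        return "Excellent"
    if score >= 4.0:
        return "Good"
    return "Fair"


def summarize(results: list[EvalResult]) -> dict:
    """Compute the headline numbers: query count and the two averages.

    Assumption: with no results, both averages are reported as 0.0.
    """
    if not results:
        return {"total_queries": 0, "avg_latency_s": 0.0, "avg_quality": 0.0}
    return {
        "total_queries": len(results),
        "avg_latency_s": mean(r.latency_s for r in results),
        "avg_quality": mean(r.quality for r in results),
    }
```

Keeping the bucket boundaries in one function means the legend and the per-row labels cannot drift apart if the thresholds change.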