
Trust Score
80
Top 15% of models
Live Benchmark Scores
HELM Overall
87.3 (+2.1)
MMLU
84.6 (+0.8)
TruthfulQA
79.2 (+1.4)
GSM8K
91.5 (+3.2)
HumanEval
73.8 (-0.5)
LMArena ELO
1247 (+18)
Specialty & Key Metrics
Specialty
Academic Citation Reasoning
Primary KPIs
Accuracy, Rationale Quality
About This Model
Specializes in academic citation reasoning, with an emphasis on high accuracy.
Trust Score
80
Predictability
77
Difficulty
61
Surprise Index
36