Model | AGIEval | GPT4All | TruthfulQA | Bigbench | Average |
---|---|---|---|---|---|
neuronovo-7B-v0.2 | 44.95 | 76.49 | 71.57 | 47.48 | 60.12 |
Task | Version | Metric | Value | Stderr | |
---|---|---|---|---|---|
agieval_aqua_rat | 0 | acc | 25.98 | ± | 2.76 |
acc_norm | 25.59 | ± | 2.74 | ||
agieval_logiqa_en | 0 | acc | 37.48 | ± | 1.90 |