Apertus 8B Instruct

apertus-8b-instruct · 33 samples · combined / raw

Scoring

Choose the quality benchmark and final score formula for this model.

Cost adjustment

Tune formulas that account for benchmark cost.

Score over time

[Chart: Score per run. X: fetch time, 2026-05-11 23:17 to 2026-05-13 17:00. Y: score, axis 3.4 to 4.0. All 33 plotted runs (#1 through #43) score 3.7; per-run values are listed in the table below.]

Cost over time

[Chart: Cost per run. X: fetch time, 2026-05-11 23:17 to 2026-05-13 17:00. Y: cost, axis $0.08 to $0.12. All 33 plotted runs (#1 through #43) cost $0.10; per-run values are listed in the table below.]

Run Fetched Score Quality Cost Intel Code Agent MMMU%
#43 2026-05-13 17:00:49 3.7 3.7 $0.10 5.9 1.4 3.8 -
#42 2026-05-13 16:00:49 3.7 3.7 $0.10 5.9 1.4 3.8 -
#41 2026-05-13 15:00:49 3.7 3.7 $0.10 5.9 1.4 3.8 -
#35 2026-05-13 09:00:49 3.7 3.7 $0.10 5.9 1.4 3.8 -
#34 2026-05-13 08:00:49 3.7 3.7 $0.10 5.9 1.4 3.8 -
#33 2026-05-13 07:00:20 3.7 3.7 $0.10 5.9 1.4 3.8 -
#32 2026-05-13 06:00:23 3.7 3.7 $0.10 5.9 1.4 3.8 -
#31 2026-05-13 05:00:25 3.7 3.7 $0.10 5.9 1.4 3.8 -
#30 2026-05-13 04:00:30 3.7 3.7 $0.10 5.9 1.4 3.8 -
#29 2026-05-13 03:00:35 3.7 3.7 $0.10 5.9 1.4 3.8 -
#28 2026-05-13 02:00:24 3.7 3.7 $0.10 5.9 1.4 3.8 -
#27 2026-05-13 01:00:29 3.7 3.7 $0.10 5.9 1.4 3.8 -
#26 2026-05-13 00:00:26 3.7 3.7 $0.10 5.9 1.4 3.8 -
#25 2026-05-12 23:00:22 3.7 3.7 $0.10 5.9 1.4 3.8 -
#24 2026-05-12 22:00:27 3.7 3.7 $0.10 5.9 1.4 3.8 -
#23 2026-05-12 21:00:26 3.7 3.7 $0.10 5.9 1.4 3.8 -
#22 2026-05-12 20:00:23 3.7 3.7 $0.10 5.9 1.4 3.8 -
#21 2026-05-12 19:00:24 3.7 3.7 $0.10 5.9 1.4 3.8 -
#20 2026-05-12 18:00:35 3.7 3.7 $0.10 5.9 1.4 3.8 -
#19 2026-05-12 17:00:36 3.7 3.7 $0.10 5.9 1.4 3.8 -
#18 2026-05-12 16:00:32 3.7 3.7 $0.10 5.9 1.4 3.8 -
#16 2026-05-12 14:00:12 3.7 3.7 $0.10 5.9 1.4 3.8 -
#15 2026-05-12 13:00:16 3.7 3.7 $0.10 5.9 1.4 3.8 -
#12 2026-05-12 10:00:12 3.7 3.7 $0.10 5.9 1.4 3.8 -
#11 2026-05-12 09:00:15 3.7 3.7 $0.10 5.9 1.4 3.8 -
#9 2026-05-12 07:00:22 3.7 3.7 $0.10 5.9 1.4 3.8 -
#8 2026-05-12 06:00:21 3.7 3.7 $0.10 5.9 1.4 3.8 -
#7 2026-05-12 05:00:17 3.7 3.7 $0.10 5.9 1.4 3.8 -
#5 2026-05-12 03:00:11 3.7 3.7 $0.10 5.9 1.4 3.8 -
#4 2026-05-12 02:00:11 3.7 3.7 $0.10 5.9 1.4 3.8 -
#3 2026-05-12 01:00:09 3.7 3.7 $0.10 5.9 1.4 3.8 -
#2 2026-05-12 00:00:09 3.7 3.7 $0.10 5.9 1.4 3.8 -
#1 2026-05-11 23:17:33 3.7 3.7 $0.10 5.9 1.4 3.8 -