Skip to content

Instantly share code, notes, and snippets.

@de-wim
de-wim / day5.adb
Created December 12, 2025 17:55
day5.adb
with Ada.Text_IO; use Ada.Text_IO;
with Ada.Containers.Vectors;
with Ada.Containers.Generic_Sort;
with Ada.Strings; use Ada.Strings;
with Ada.Strings.Fixed; use Ada.Strings.Fixed;
procedure Day5 is
type FreshRange is record
Start : Long_Long_Integer;
Stop : Long_Long_Integer;
/opt/llama.cpp/vulkan/bin/llama-bench -ub 2048 -b 2048 -ctk q8_0 -ctv q8_0 -ngl 999 -fa 1 -d 8000 -m /home/wim/.cache/llama.cpp/bartowski_Qwen_Qwen3-Coder-Next-GGUF_Qwen_Qwen3-Coder-Next-Q8_0_Qwen_Qwen3-Coder-Next-Q8_0-00001-of-00003.gguf
ggml_vulkan: Found 1 Vulkan devices:
ggml_vulkan: 0 = Radeon 8060S Graphics (RADV GFX1151) (radv) | uma: 1 | fp16: 1 | bf16: 0 | warp size: 64 | shared memory: 65536 | int dot: 1 | matrix cores: KHR_coopmat
| model | size | params | backend | threads | n_ubatch | type_k | type_v | fa | test | t/s |
| ------------------------------ | ---------: | ---------: | ---------- | ------: | -------: | -----: | -----: | -: | --------------: | -------------------: |
| qwen3next 80B.A3B Q8_0 | 78.98 GiB | 79.67 B | Vulkan,BLAS | 16 | 2048 | q8_0 | q8_0 | 1 | pp512 @ d8000 | 358.92 ± 2.25 |
| qwen3next 80B.A3B Q8_0 | 78.98 GiB | 79.67 B | Vulkan,BLAS | 16 | 2048 | q8