AMD Ryzen AI MAX+ 395 "Strix Halo" — Llama.cpp Backend Performance Comparison

Compare model throughput across backends (pp512 & tg128). Repo: kyuz0/amd-strix-halo-toolboxes

Model size filter: 4B to 235B (slider ticks at 4B, 50B, 96B, 143B, 189B, 235B)
Winner = every selected backend whose result falls within the best result's uncertainty range, where that range combines the ± errors of both results being compared.
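
The winner rule can be sketched in a few lines. This is a minimal illustration, not the page's actual code: the page does not say how the two ± errors are combined, so the sketch assumes they are simply added (combining in quadrature would be a stricter alternative). The function name `winners`, the backend names, and the numbers are all hypothetical.

```python
from typing import Dict, List, Tuple


def winners(results: Dict[str, Tuple[float, float]]) -> List[str]:
    """Return every backend that ties with the best one.

    `results` maps a backend name to (mean tokens/second, ± error).
    """
    if not results:
        return []

    # The best backend is the one with the highest mean throughput.
    best_name = max(results, key=lambda name: results[name][0])
    best_mean, best_err = results[best_name]

    tied = []
    for name, (mean, err) in results.items():
        # ASSUMPTION: the two ± errors are combined by plain addition.
        # A backend is a winner if its gap to the best fits in that range.
        if best_mean - mean <= best_err + err:
            tied.append(name)
    return tied


# Hypothetical numbers for illustration only, not measured results.
example = {
    "backend_a": (1000.0, 5.0),
    "backend_b": (992.0, 4.0),   # gap 8 <= 5 + 4, so it ties
    "backend_c": (900.0, 10.0),  # gap 100 > 5 + 10, so it does not
}
print(winners(example))  # ['backend_a', 'backend_b']
```

Under this assumption the best backend is always a winner, and several backends can share the win when their measurements overlap.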

Prompt Processing (pp512) — tokens/second

Text Generation (tg128) — tokens/second