AMD Ryzen AI Max "Strix Halo" — ds4 Benchmark Grid

ds4 upstream ROCm performance

Comparing quantizations on the ds4 ROCm backend

Environment Specifications

Prefill Throughput (Tokens/s)

Prompt processing speed, in tokens per second, as context length increases.

Generation Throughput (Tokens/s)

Token generation speed, in tokens per second, as context length increases.
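
As a rough illustration of how the two metrics above are usually derived, the sketch below divides a token count by the wall-clock time of the matching phase. It assumes prefill is timed over the prompt tokens and generation over the newly produced tokens. The function names and the sample numbers are hypothetical, chosen only for the example; they are not taken from the ds4 benchmark harness or from any measured result.

```python
def tokens_per_second(num_tokens: int, elapsed_seconds: float) -> float:
    """Return throughput in tokens/s for one timed phase."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return num_tokens / elapsed_seconds


def summarize_run(prompt_tokens: int, prefill_seconds: float,
                  generated_tokens: int, generation_seconds: float) -> dict:
    """Compute prefill and generation throughput for a single run.

    Hypothetical helper: the two phases are assumed to be timed separately.
    """
    return {
        "prefill_tps": tokens_per_second(prompt_tokens, prefill_seconds),
        "generation_tps": tokens_per_second(generated_tokens, generation_seconds),
    }


# Illustrative numbers only, not measured results.
example = summarize_run(prompt_tokens=2048, prefill_seconds=4.0,
                        generated_tokens=128, generation_seconds=8.0)
print(example)
```

Reporting the two phases separately matters because prefill processes the whole prompt in parallel while generation emits one token at a time, so the two figures can differ by an order of magnitude or more on the same hardware.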

Raw Benchmark Data

Detailed breakdown of tokens/s across selected models.