Single GPU Performance Comparison

vLLM Decoding Throughput (Tokens/s)

Metric: Raw Tokens/s
