GPU Comparisons

180 head-to-head GPU inference benchmark comparisons across DeepSeek V4 Pro 1.6T, DeepSeek R1, Kimi K2.5/K2.6 1T, GLM 5/5.1, MiniMax M2.5/M2.7, Qwen 3.5 397B-A17B, gpt-oss 120B, and Llama 3.3 70B. Each comparison page includes interactive charts for latency, throughput, and cost metrics, plus an interpolated comparison table.
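Two GPUs are rarely benchmarked at exactly the same operating points, so a comparison table usually has to interpolate each GPU's measured curve to a shared target before the values can be set side by side. The sketch below illustrates one way that could work, using plain linear interpolation. It is not the method these pages are confirmed to use. The function name `interpolate_at`, the choice of axes (per-user interactivity against per-GPU throughput), and all sample numbers are hypothetical.

```python
from bisect import bisect_left


def interpolate_at(points, x_target):
    """Linearly interpolate y at x_target from (x, y) measurements.

    Returns None when x_target lies outside the measured range, so the
    table never shows an extrapolated value.
    """
    pts = sorted(points)
    if not pts:
        return None
    xs = [x for x, _ in pts]
    if x_target < xs[0] or x_target > xs[-1]:
        return None
    i = bisect_left(xs, x_target)
    if xs[i] == x_target:
        return pts[i][1]  # exact measurement, no interpolation needed
    (x0, y0), (x1, y1) = pts[i - 1], pts[i]
    return y0 + (y1 - y0) * (x_target - x0) / (x1 - x0)


# Hypothetical measurements: (tokens/s per user, tokens/s per GPU).
gpu_a = [(20, 1000.0), (40, 600.0), (80, 200.0)]
gpu_b = [(30, 900.0), (60, 500.0)]

target = 50  # shared interactivity level at which to compare both GPUs
a = interpolate_at(gpu_a, target)
b = interpolate_at(gpu_b, target)
if a is not None and b is not None:
    print(f"GPU A: {a:.1f}  GPU B: {b:.1f}  B/A: {b / a:.2f}x")
```

Returning `None` outside the measured range is a deliberate choice in this sketch: a blank table cell is easier to interpret than a value extrapolated beyond what was actually benchmarked.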

DeepSeek R1

36 GPU pairs with benchmark data on DeepSeek R1.

MiniMax M2.5/M2.7

36 GPU pairs with benchmark data on MiniMax M2.5/M2.7.