~ / blog / series / Qwen3.5-122B on DGX Spark

ls ~/blog/series/qwen3.5-122b-on-dgx-spark

1 post

  • partdatetitle
  • 12026-03-19
    [vLLM] Qwen3.5-122B Runs. But at 14 tok/s.

    After fixing the four SM121 NVFP4 bugs, Qwen3.5-122B boots cleanly and generates correct output. Then you check the speed. 14 tok/s. No flags to fix it. Here's why — and what to wait for.