~ /home/coolthor
ai-muninn
Research notes on AI infrastructure, LLM serving, and autonomous agents. Things that took too long to figure out, written down so you don't have to.
❯ whoami
hardware enthusiast running 120B models at home on DGX Spark
building options trading infrastructure with AI agents
occasionally ships iOS apps
❯ ❯ ls -lt ~/blog | head -5
- 2026-03-19[vLLM] gpt-oss-120B at 59 tok/s: 6 Pitfalls and a Working Serve Script
- 2026-03-19[vLLM] Qwen3.5-122B Runs. But at 14 tok/s.
- 2026-03-17[vLLM] Why Your DGX Spark Only Says "!!!!!": Debugging NVFP4 on SM121
- 2026-03-16[AI Agent] The Codex-Executor Pattern: Keeping Agent Sessions Small
- 2026-03-13[vLLM] Nemotron-3-Super-120B on a Single GB10: Full Day Debug Log