#local-llm — Blog — ai-muninn

~ / blog / tag / local-llm

❯ grep -r "#local-llm" ~/blog

6 matches

datereadtitle
2026-06-2213m
[Just for Fun — Advanced] I Scored a 22GB-Modded 2080 Ti for ~$340 All-In — Just Enough to Keep a 27B Agent Running at Home
#rtx-2080-ti #gpu-mod #local-llm #ai-agent
2026-06-1310m
[vLLM] DiffusionGemma 26B NVFP4 on a DGX Spark: 158 tok/s, and why diffusion tok/s lies
#dgx-spark #gb10 #diffusiongemma #diffusion-llm
2026-06-127m
[Local LLM] Weights win: a 284B crushed to 2-bit still beats the small model that fits
#dgx-spark #gb10 #deepseek-v4-flash #quantization
2026-06-1215m
[Local LLM] Running a 15 tok/s 284B as your daily agent brain — the settings that make it bearable
#dgx-spark #gb10 #deepseek-v4-flash #kv-cache
2026-06-1210m
[Local LLM] My first Q2 model looked broken on a 128GB box — the real culprit was a parser that couldn't read DSML, not the quantization
#dgx-spark #gb10 #deepseek-v4-flash #ds4
2026-06-0212m
[AI Agent] My Local Agent Flailed at Image Gen — It Was the Harness, Not the Weights
#ai-agent #aci #harness #comfyui

← back to all posts