~/ai-muninn
tag / local-llm
❯ grep -r "#local-llm" ~/blog
6 matches
date        read  title
2026-06-22  13m   [Just for Fun — Advanced] I Scored a 22GB-Modded 2080 Ti for ~$340 All-In — Just Enough to Keep a 27B Agent Running at Home
                  #rtx-2080-ti #gpu-mod #local-llm #ai-agent
2026-06-13  10m   [vLLM] DiffusionGemma 26B NVFP4 on a DGX Spark: 158 tok/s, and why diffusion tok/s lies
                  #dgx-spark #gb10 #diffusiongemma #diffusion-llm
2026-06-12  7m    [Local LLM] Weights win: a 284B crushed to 2-bit still beats the small model that fits
                  #dgx-spark #gb10 #deepseek-v4-flash #quantization
2026-06-12  15m   [Local LLM] Running a 15 tok/s 284B as your daily agent brain — the settings that make it bearable
                  #dgx-spark #gb10 #deepseek-v4-flash #kv-cache
2026-06-12  10m   [Local LLM] My first Q2 model looked broken on a 128GB box — the real culprit was a parser that couldn't read DSML, not the quantization
                  #dgx-spark #gb10 #deepseek-v4-flash #ds4
2026-06-02  12m   [AI Agent] My Local Agent Flailed at Image Gen — It Was the Harness, Not the Weights
                  #ai-agent #aci #harness #comfyui