~/ai-muninn
tag / llama.cpp
❯ grep -r "#llama.cpp" ~/blog
1 match
date        read  title
2026-06-09  10m   [Just for Fun] Gemma 4 E2B on a GTX 970: the biggest quant runs fastest (47.6 tok/s)
#gemma-4 #quantization #gtx-970 #llama.cpp