#flash-attention — Blog — ai-muninn

❯ grep -r "#flash-attention" ~/blog

1 match

date        read  title
2026-06-14  10m   [Just for Fun] On a GTX 970, Flash Attention nearly doubles long-context decode (24.3 → 42.5 tok/s)
#gemma-4 #gtx-970 #flash-attention #kv-cache
