~/ai-muninn
search
⌘K
blog
github
中
~ / blog
/
tag / multimodal
❯
grep -r "#multimodal" ~/blog
4 matches
date
read
title
2026-06-16
11m
[Agent 101 #7] Give your AI assistant eyes and ears: vision + voice for a text-only brain
#ai-assistant
#ai-agent
#hermes
#multimodal
2026-06-09
10m
[Just for Fun] A GTX 970 as an offline voice assistant: Gemma 4 E2B + Piper TTS (2.8s end-to-end)
#gemma-4
#gtx-970
#multimodal
#piper-tts
2026-06-04
6m
[Benchmark] Gemma 4 12B Omni on DGX Spark: Weight-Only NVFP4 Beats W4A4 (and Keeps Multimodal)
#dgx-spark
#gb10
#gemma-4
#nvfp4
2026-05-01
13m
[vLLM] Watching English Videos with DGX Spark: Nemotron Omni Multimodal on GB10
#nemotron-omni
#multimodal
#vllm
#dgx-spark
← back to all posts