Agent orchestration, vision-to-code models, and a self-hosted knowledge base with MCP caught my attention today.
Gemma 4 dominates the day. Plus Qwen 3.6, Anthropic's emotion research, and a breakthrough in physical AI robotics.
Google released Gemma 4 in four sizes: E2B, E4B, 26B-A4B (MoE), and 31B Dense. Built from the same research as Gemini 3, the 31B model ranks #3 on Arena AI and the 26B MoE ranks #6 among open models. Natively multimodal (vision, video, audio incoming), with thinking capabilities and 128K-256K context windows.
First Gemma release under Apache 2.0 license (previously restricted). Already running on Raspberry Pi, Android phones, and available via Ollama, HuggingFace, and llama.cpp. Unsloth had GGUF quants within minutes. Community uncensored variants appeared within 90 minutes via Heretic's new ARA method.
Quick hits from the local AI community.
LTX Desktop gets a major accessibility update, ACE-Step music gen is coming, and Alibaba ships pixel-level image editing.
Native Mac utilities, a Jellyfin music player update, and self-hosted tools for docs and family planning.