E-ink hardware, a new image gen API, and developer tools worth a look.
OpenAI ships voice models and kills fine-tuning, AMD and a Taiwanese startup bring serious inference hardware to PCIe, and Chrome is quietly installing a 4GB model on your machine.
GPT-Realtime-2 brings GPT-5-class reasoning to voice apps. GPT-Realtime-Translate handles live translation from 70+ input languages into 13 output languages. GPT-Realtime-Whisper streams speech-to-text as the speaker talks.
Pricing: Realtime-2 at $32/$64 per 1M audio tokens, Translate at $0.034/min, Whisper at $0.017/min. Available now in the API.
Juggernaut returns for Z-Image, a single-image LoRA technique, and a massive open-source knowledge base for GenAI workflows.
macOS utilities and a subtitle tool for your arr stack.