MCP tools and AI memory solutions dominate today. Multiple launches around giving AI assistants better context and control.
Big benchmark news with GPT 5.2 crushing ARC-AGI. Plus Alibaba drops a new coding model, METR evals show impressive agent horizons, and a prompt injection warning for anyone building agents.
New SOTA on ARC-AGI-1 at 90.5% ($11.64/task) and 54.2% on ARC-AGI-2. A year ago o3 scored 88% at $4.5k/task — this represents ~390X efficiency improvement.
Using GPT-5.2 X-High, scores as high as 75% at under $8/problem observed on ARC-AGI-2. The benchmark is approaching saturation.
Quick hits — security warnings and regulatory news.
While everyone waits for Z-Image Edit weights, Meituan drops an alternative. Plus ComfyUI gets motion capture and a new vision-language model.
Multiple window managers for Mac (pick your poison), a long-awaited per-app volume control, and self-hosted media analytics.
Quick hits — utilities and side projects.