From Product Hunt

Multi-agent coordination, dev tools, and a gradient editor that actually understands curves.

WUPHF — Collaborative AI Agent Office Slack-style channels where Claude Code, Codex, and OpenClaw agents work as a team. Agents build and maintain their own wiki, learn from each task, and coordinate without you being the relay. Open source (MIT), self-hosted, your keys. Hit #1 on Hacker News.
▲ 114 · wuphf.team
SimCam — Test Camera in iOS Simulator Stream from your Mac camera into the iOS Simulator, inject images or QR codes, and control front/back cameras independently. CLI lets AI agents automate camera test scenarios. One-time purchase, no dependencies needed in your app.
▲ 117 · simcam.swmansion.com
Blueprint — Plan Before Your Agent Codes Reads your codebase, asks grounded multiple-choice questions to clarify ambiguity, then hands any coding agent a plan worth executing in one shot. Free, open source, available as extensions for Cursor, Windsurf, and VS Code.
▲ 92 · github.com
Colir — Non-Linear Gradient Editor Sculpt gradients with curve-based control on X/Y axes instead of flat linear stops. Real-time WebGL rendering, 12 blend modes, noise/sparkle/feather/distortion effects. PNG/WebP export.
▲ 98 · colir.space
Actian VectorAI DB — Portable Edge Vector Database Vector database that runs the same API from Raspberry Pi to Kubernetes. 22x QPS advantage over Milvus and Qdrant at 10M vectors. Docker container, Python/JS SDKs, LangChain + LlamaIndex support. Community edition available.
▲ 175 · actian.com
From Reddit

The AlphaGo creator just raised $1.1B for a new lab, Nvidia dropped an omnimodal edge model, and the model release pace isn't slowing down.

Nvidia Nemotron-3 Nano Omni 30B — Unified Multimodal Agent Model 30B-parameter MoE that activates only 3B per pass. Handles text, images, video, and audio in a single architecture. Open weights. 9x higher throughput than comparable open multimodal models. Designed for edge AI agents on a single GPU.
Poolside AI Launches First Models: Laguna XS.2 and M.1 The heavily-funded AI lab finally ships. Laguna XS.2 is a 33B/3B MoE coding model under Apache 2.0. Laguna M.1 is a closed frontier model. XS.2 benchmarks roughly on par with Qwen 3.5 35B for agentic coding tasks.
Mistral Launches Workflows + Teases New Model Mistral released Workflows, a visual tool for building multi-step AI pipelines. Meanwhile, a Mistral Medium 3.5 (128B) was spotted in a vLLM pull request, and Mistral's Vibe account teased a release for tomorrow. Dense or sparse MoE remains unclear.
Thousands of RobotEra L7 Humanoids Entering Logistics Centers Beijing-based RobotEra is deploying its L7 humanoid robot across 10+ logistics centers for sorting tasks. The 171cm, 55-DoF robot runs at 14.4 km/h, carries 20kg dual-arm payload, and is powered by ERA-42 Visual-Language-Action model.
DeepSeek Vision Multimodal Coming Soon Xiaokang Chen (DeepSeek core researcher) teased upcoming multimodal vision capabilities for DeepSeek V4. The current text-only V4 launched April 24 but multimodal was notably absent. Vision support appears imminent.

Quick hits.

GPT 5.6 teased on r/openai Just five days after GPT 5.5, signs of GPT 5.6 appearing. OpenAI's release cadence continues to accelerate.
Caltech Claims Radical Compression of High-Fidelity AI Models Researchers demonstrate a new compression technique that preserves model quality at dramatically smaller sizes.
From Reddit

Two new architectures that ditch vision encoders entirely, and OpenAI's Image 2 continues to impress.

Meta Tuna-2 — Pixel-Space Unified Multimodal Model Meta's new model ditches both VAE and vision encoders entirely, working directly on pixel embeddings. SOTA on multimodal benchmarks. Apache 2.0 but with a catch: foundation weights released with some layers removed (community will likely fix this quickly).
SenseNova U1 — NEO-Unify Architecture, No VAE Required SenseTime's new 8B/3B MoT model unifies image understanding and generation end-to-end with no vision encoder or VAE. Apache 2.0, open weights on HuggingFace. Generates interleaved text and images in a single flow.
GPT Image 2 — Community showcasing impressive quality Users sharing results from OpenAI's GPT Image 2. Notable quality jump from DALL-E 3, particularly for photorealism and prompt adherence.
From Reddit

A strong batch of native Mac apps today: offline grammar checking, workspace launchers, and niche audio tools.

Refine — Offline Grammarly Alternative for macOS (9-Month Update) 100% offline grammar and style checker with no cloud dependency. This 9-month update brings community-requested improvements. Lifetime license, no subscription. Native Mac app.
FlutterTime — Beautiful Timezone Converter in Your Menu Bar Free timezone converter with an intuitive visual timeline. Add cities, slide through time across zones. Lives in the menu bar. No in-app purchases, no ads, no tracking.
Lattix 2.0 — Launch Workspaces Across Spaces and Monitors Save your entire workspace (apps, windows, positions) across multiple macOS Spaces and monitors, then restore it with one click. Now with space naming and ultrafast space switching. Lifetime license.

Quick hits — smaller utilities and niche tools.

Saisei — Fast Native Audio Player with Waveform Seeking 2.7MB Swift/AppKit app. Plays WAV/MP3/FLAC/AIFF instantly with clickable waveform and keyboard-first navigation.
VisionTagger — Local AI Photo Tagging for macOS Generates searchable descriptions and keywords for photos fully on-device using Apple Silicon. Works with folders and Apple Photos. One-time purchase.
Tolaria — Files-First Markdown App for Git and AI Agents Free, open-source macOS/Linux Markdown editor built for 10K+ note collections and Git-based workflows. By the author of the Refactoring newsletter.
Spectro — Detect Fake Lossless Audio Files Auto-detects upconverted MP3s hiding inside WAV/AIFF containers by analyzing spectral frequency cutoffs. $39 lifetime.