OpenAI paid $6.5 billion for a Jony Ive smart speaker. The real product is you.
The device could connect biometric identity, targeted ads, and commerce into a closed loop Anthropic can't touch
THE MOST expensive design brief in history began with a vision of serenity. When Sam Altman and Jony Ive appeared together at an Emerson Collective event in November 2025, they described their forthcoming AI device as something that would bring "peace and calm," the antithesis of the smartphone's attention-shredding dopamine loop. Altman compared the experience to sitting in a cabin by a lake. The device, he declared, would be "the coolest piece of technology that the world will have ever seen." Then details leaked, and the cabin turned out to have a camera, a microphone, and, if you squint at OpenAI's recent moves, an advertising business model waiting to be assembled around it.
According to a February 2026 report from The Information, OpenAI's first hardware product will be a smart speaker with a built-in camera, priced between $200 and $300, with more than 200 employees dedicated to the project and a launch targeted for early 2027. The company is also exploring a smart lamp and AI glasses, though those won't be ready until 2028 or later. For a venture born from a $6.5 billion all-stock acquisition of Ive's startup io (internally codenamed "Gumdrop"), a smart speaker might seem anticlimactic. But fixating on the form factor misses what may be the more consequential play hiding inside the hardware: an identity and commerce layer that connects who you are, what you want, and what you buy, and that can be monetized in ways a chatbot window never could.
Orb and order
OpenAI's speaker will include a facial recognition system similar to Apple's Face ID, enabling users to authenticate purchases and verify their identity. In internal presentations, employees were told the device would observe users and proactively suggest actions, recommending an early bedtime ahead of a morning meeting, for instance, by using its camera to interpret its surroundings. This is not a speaker you talk to; it is a speaker that watches, listens, identifies, and acts. The privacy implications are obvious. Less obvious, and arguably more significant, is how neatly this dovetails with another Altman venture.
This article is for Vector members. Start a 7-day free trial to keep reading.
Start your free trial