News

OpenAI launches Operator, an AI agent that browses the web and takes actions on your behalf

Jan 23, 2025

Key Points

  • OpenAI launches Operator, an AI agent that browses the web and executes tasks like booking flights and ordering groceries, available to Pro users at $200 per month.
  • The agent threatens Google's search dominance by positioning OpenAI as the starting point for transactional tasks, potentially capturing referral economics at the moment users take real-world actions.
  • Hundreds of startups building specialized agents face direct competition from OpenAI's 100 million user base, while content creators risk losing traffic and attribution as Operator extracts value without routing users to sources.

Summary

OpenAI has launched Operator, an AI agent in research preview that can browse the web and execute tasks by typing, clicking, and scrolling. The product is available to Pro users at $200 per month via operator.chatgpt.com. The company is positioning this as a technical breakthrough in what it calls "computer using agents," or CUA — models that can see web pages through screenshots and interact with them using vision capabilities and reinforcement learning trained on billions of web interactions.

The core innovation is visual understanding. Rather than relying on brittle HTML parsing that breaks when websites change, Operator uses vision models to understand what it sees on screen — a blue button is a blue button regardless of the underlying code — then interacts using standard keyboard and mouse actions. When the agent encounters tasks requiring sensitive information like login credentials, payment details, or CAPTCHA solving, it hands control back to the user. It will also refuse banking transactions and certain other sensitive tasks outright.

OpenAI has partnered with DoorDash, Instacart, OpenTable, Priceline, StubHub, Thumbtack, and Uber, all of which have opted in to allow the agent to interact with their platforms. The company is also working with the city of Stockton to simplify enrollment in municipal services. Early user demos show Operator handling flight bookings (with some manual input for payment), grocery ordering from Instacart, appointment scheduling, birthday gift research, house cleaning through Thumbtack, and dog walker searches. It hits rough edges on complex interfaces like calendar management and slideshow creation, and it sometimes gets blocked by sites like Reddit or the New York Times before finding workarounds via alternative searches.

The product sits at a strategic inflection point for OpenAI's broader positioning. Unlike ChatGPT as a query interface, Operator positions OpenAI as an intermediary between users and digital services. If the agent successfully handles bookings, shopping, and service discovery, OpenAI potentially captures referral economics and user attention at the moment of transaction — the point where real economic value moves. This is a direct threat to search's traditional role. Google has long been the starting point for "book a flight" or "find a dog walker." If that starting point shifts to ChatGPT, OpenAI controls not just the information surface but the action surface.

The threat to existing business models is direct. As one observer noted, creators and small websites currently benefit from Google search traffic and ad revenue. With Operator, an AI agent can crawl a recipe site, extract the recipe, and deliver it to the user without the creator getting traffic, leads, or credit. The historical internet flywheel — "publish free content, drive traffic, monetize with ads or lead funnels" — breaks if aggregators can extract value without sending users to the source. Large content platforms like Substack are already considering blocking agent access to protect creator economics.

The immediate competitive threat is to companies already building agentic tools. Hundreds of startups have spent years and significant capital building agents for flight booking, DMV enrollment, and service discovery. OpenAI's distribution (100 million active ChatGPT users), brand, and engineering resources now directly compete with those products. The city of Stockton example is illustrative: a municipal government will prefer to work with OpenAI rather than a pre-seed startup, collapsing the addressable market for specialized agent builders.

Operator is currently in early research preview with known limitations. The company is collecting user feedback before broader rollout to Plus and Team users, then enterprise. It plans to expose the agent via API for developers to build on top of it — a move that will likely accelerate the commodification of agent capabilities while concentrating control at OpenAI's platform layer.