From chatbot to digital assistant
What previously functioned only as an AI-supported dialogue becomes a real tool with Agent Mode. ChatGPT can now actively interact on an isolated virtual machine in the background. There, the AI clicks through websites, types text, retrieves data, and interacts with graphical user interfaces as if a human were at work.
Technically speaking, Agent Mode combines two existing OpenAI tools: Operator, which can serve websites, and Deep Research, a research tool for multi-level analysis. Agent Mode merges both approaches, allowing ChatGPT to switch between information gathering and action in real time.
The new mode shows its strengths especially when it comes to specific tasks that would otherwise be time-consuming and stressful. Here are a few examples of how ChatGPT can be used in Agent Mode:
- Travel planning from flight to hotel booking
Agent mode can handle complex travel planning independently. It researches flights, compares availability, analyzes hotel reviews, and automatically prepares bookings. Special features such as baggage options, hotel location, and cancellation policies are taken into account. Final approval is always provided by the user. - Targeted online shopping
For targeted product searches, ChatGPT can search online shops based on specific criteria. Price limits, design preferences, reviews, and shipping details are taken into account. Suitable products are presented in a structured manner and, if desired, placed in the shopping cart, including direct links for the final purchase. - Restaurant research with reservation
Agent Mode also helps you find suitable restaurants. It analyzes reviews, availability, and menu options, checks for special requirements such as vegetarian cuisine or accessibility, and makes reservations directly via the provider's website if necessary.
Users retain control
Despite all the automation, humans remain number one in the decision-making process. Before ChatGPT sends an email, submits a form, or completes a booking, it actively obtains the user's consent. Highly sensitive actions such as financial transactions or legal advice are strictly excluded.
Anyone who wants to can take over the browser at any time, cancel tasks, or authorize only specific actions. OpenAI wants to ensure that agent mode isn't a black box, but remains transparent and controllable.
Also interesting for workflows
In addition to everyday private tasks, Agent Mode can also be integrated into professional workflows. ChatGPT can, for example, conduct competitive analyses, create presentations, or evaluate data in tabular format. Various interfaces allow the agent to connect directly to other tools such as calendars, email inboxes, or project management systems.
In initial tests, the agent mode proved to be significantly more powerful than previous GPT models and, according to OpenAI, even surpasses human benchmarks in many areas.
Soon in Germany too?
The new Agent Mode is currently only available to users in the UK, provided they have a Pro, Plus, or Team subscription. OpenAI has not yet announced when the feature will launch in Germany. However, OpenAI says the launch will take place soon.
Source: OpenAI press release






