OpenAI has unveiled its latest development, Operator, an AI tool designed to autonomously complete tasks through a web browser. This AI agent, currently available to ChatGPT Pro users in the United States, marks OpenAI's entry into the realm of autonomous AI technology.
Operator functions with minimal input from users, handling tasks that would usually require human interaction. It uses a specialised model known as Computer-Using Agent (CUA), which integrates the capabilities of GPT-4's vision and advanced reasoning abilities to perform tasks efficiently.
Also read: Meta seeks urgent fix to AI chatbot's confusion on name of US president
Operator can navigate websites and perform tasks such as booking reservations, purchasing items, or researching information, without much oversight. It employs a virtual keyboard and mouse to interact with graphical user interfaces like buttons and text fields. The AI agent processes screen data, using both text and images to understand the environment and make decisions. This allows it to adapt to unexpected changes and handle complex tasks, such as filling out forms or managing purchases. However, users can intervene at any point during a task to maintain control.
Also read: UK to investigate Apple and Google's mobile ecosystems: Details here
OpenAI envisions Operator as a solution for repetitive online tasks, helping users save time. In demonstrations, the AI agent successfully planned a weekend trip by sourcing information from Reddit, setting budgets, and factoring in preferences. When Reddit became inaccessible, the Operator shifted to Bing to continue the task, showcasing its adaptability.
Operator also managed a cryptocurrency research task, pausing to notify the user when it encountered a CAPTCHA, requiring human input before resuming. This feature highlights the collaboration between the user and the AI, ensuring tasks are completed accurately while still allowing for user involvement.
Also read: Move over 50MP and 60MP:
Read more on tech.hindustantimes.com