OpenAI launches ‘Operator’ agent that handles web tasks
OpenAI on Thursday introduced an artificial intelligence program called “Operator” that can perform online tasks such as placing orders or filling out forms.
According to OpenAI, Operator can view web pages and interact with them by typing, clicking or scrolling the way a human might.
“Operator can be asked to handle a variety of repetitive browser tasks such as filling out forms, ordering groceries, and even creating memes,” OpenAI said in an online post.
“The ability to use the same interfaces and tools that humans interact with every day will expand the utility of AI, helping people save time on everyday tasks while opening up new opportunities for interaction for businesses.”
An AI “agent,” Silicon Valley’s latest trend, is a digital helper designed to sense its surroundings, make decisions, and take actions to achieve specific goals.
Google in December announced agent capabilities with the launch of Gemini 2.0, its most advanced artificial intelligence model to date.
AI race rival Anthropic two months earlier added a “computer use” feature, in public beta testing, to its frontier AI model Claude.
“Developers can instruct Claude to use the computer the way people do by looking at the screen, moving the cursor, clicking buttons, and entering text,” Anthropic said in a post at the time.
OpenAI describes Operator as one of the first AI agents capable of working independently for people to complete assigned tasks.
OpenAI said it is making Operator available only to US users who pay for a Pro subscription to the OpenAI service “to ensure secure and repeatable deployment.”
“If it encounters a challenge or makes an error, Operator can leverage its reasoning abilities to self-correct,” OpenAI said.
“When it gets stuck and needs assistance, it simply gives control back to the user.”
According to OpenAI, Operator is trained to ask the user to take over for tasks that require a login, payment details or solving “CAPTCHA” security challenges, which are meant to distinguish humans from online software.
“Users can ask Operator to perform multiple tasks at once by creating new conversations, such as ordering a personalized enamel mug on Etsy while booking a campsite on Hipcamp,” OpenAI said.