Generative artificial intelligence heavyweight OpenAI on Thursday previewed an AI agent that can carry out tasks on the web for users, as it seeks to enhance its chatbot amid intensifying competition.
The company says CUA’s reasoning technique, which it calls an “inner monologue,” helps the model understand intermediate steps and adapt to unexpected input. Under the hood, CUA takes screenshots of web pages and uses a virtual mouse and keyboard to navigate.
The model underpinning Operator is a Computer-Using Agent (CUA). It combines GPT-4o's vision capabilities, which let it "see" what's on the user's screen through screenshots, with the ability to interact with graphical user interfaces (GUIs) by clicking buttons, typing, scrolling and so on.
This development follows the introduction of the o3 series, designed to enhance AI's ability to tackle complex problems through improved reasoning capabilities. The o3-mini model represents a significant leap from its predecessor, o1, by incorporating advanced reasoning skills that allow for step-by-step logical analysis.
Samsung, Google join forces to tackle the AI boom, facing competition from OpenAI and Apple, redefining innovation in the smartphone market with the Galaxy S25
Meta, Apple, Google and other tech companies have been named in a letter penned by Democratic lawmakers, accusing them of cozying up to President-elect Trump.
Under the Trump administration, OpenAI and other members of the tech sector are making a push to establish AI dominance in the U.S.
OpenAI, Oracle, SoftBank and MGX are investing a record amount in new AI infrastructure even as China's DeepSeek outperforms on cost.
ChatGPT is OpenAI's extremely useful chatbot for answering questions. Here's how to use the generative AI tool in Apple's Notes app in macOS.
The company announced it was testing advanced reasoning models, o3 and o3-mini, designed to address more complex tasks than earlier iterations.
US lawmakers are demanding answers from tech giants such as Apple, Meta, and Google over their generous donations to Donald Trump.
Generative artificial intelligence (AI) heavyweight OpenAI on Thursday (Jan 23) previewed an AI agent that can carry out tasks on the web for users, as it seeks to enhance its chatbot amid intensifying competition.