OpenAI Releases “Operator” AI Agent

Good morning. It’s Friday, January 24th.

Did you know: On this day in 1996, the first version of the Java programming language was released. Java’s ability to “write once, run anywhere” made it ideal for Web-based applications.

You read. We listen. Tell us what you think by replying to this email.

In partnership with HUBSPOT

Unlock the full potential of your workday with cutting-edge AI strategies and actionable insights, empowering you to achieve unparalleled excellence in the future of work. Download the free guide today!

Today’s trending AI news stories

OpenAI launches Operator, an agent that can use a computer for you

OpenAI has rolled out Operator, an AI agent capable of executing tasks autonomously within a web browser. Initially available to U.S. users on ChatGPT’s $200 Pro plan, it is set to expand to other tiers. Operator automates tasks like booking travel, making reservations, and online shopping, using a dedicated browser interface that mimics human navigation.

Powered by the Computer-Using Agent (CUA) model, it interacts with websites just as a human would, filling out forms and clicking buttons. While effective for routine tasks, it requires user supervision for sensitive actions such as banking or email. OpenAI is collaborating with companies like eBay and Uber to ensure compliance with their service agreements.

In a notable shift in its data retention policies, OpenAI revealed that it may retain deleted data from Operator for up to 90 days, longer than the 30-day retention period for ChatGPT. This policy aims to prevent abuse and improve fraud monitoring. While data may be accessed by authorized personnel for legal or security purposes, users retain control over their information.

OpenAI is already training o4 and expects “another big jump in capabilities”

OpenAI has begun training its next reasoning model, tentatively named “o4,” Chief Product Officer Kevin Weil disclosed during his address at the World Economic Forum in Davos. This follows the rapid development of its predecessor, the o3 model, which was completed in just three months.

Weil expressed confidence that o4 will bring a considerable jump in capabilities, and projected even shorter iteration cycles for subsequent models. Read more.

ByteDance’s UI-TARS can take over your computer, outperforms GPT-4o and Claude

ByteDance’s UI-TARS stands as a cutting-edge AI agent for PC and macOS, outstripping GPT-4o, Claude, and Gemini in GUI-centric tasks. Available in 7B and 72B parameter variants, it achieves state-of-the-art performance across 10+ benchmarks, showcasing its prowess in perception, contextual grounding, and sequential reasoning. Trained on 50 billion tokens, augmented by a screenshot-rich dataset, UI-TARS interprets multimodal inputs with finesse, autonomously completes complex workflows, and iteratively refines its outputs through error evaluation.

Equipped with sophisticated memory systems, UI-TARS balances rapid, intuitive responses with deliberate, multi-step planning. In evaluations such as VisualWebBench, ScreenQA-short, and WebSRC, it demonstrates strong proficiency, reflecting an astute grasp of web and mobile GUIs. By offering transparent, stepwise task execution, it sets a new standard for adaptive AI agents in the competitive landscape of multimodal intelligence. Read more.

5 new AI-powered tools from around the web

God of Prompt – Best AI Prompts for ChatGPT, Claude, Midjourney & Gemini AI

The Biggest AI Resource for ChatGPT, Claude, Midjourney & Gemini – 30,000+ AI Prompts, AI Guides & Toolkits to Streamline Your Workflow in Marketing, SEO, No-Code Automation, Productivity, and more.

Hire Ava, the AI SDR & Get Meetings on Autopilot

Ava automates your entire outbound demand generation process, including:

Free your sales team to focus on high-value interactions and closing deals, while Ava handles the time-consuming tasks.

arXiv is a free online library where researchers share pre-publication papers.

Your feedback is invaluable. Reply to this email and tell us how you think we could add more value to this newsletter.

Interested in reaching smart readers like you? To become an AI Breakfast sponsor, reply to this email or DM us on 𝕏!
