Good morning. It’s Wednesday, August 14th.
Did you know: On this day in 1995, Nintendo released the Virtual Boy in North America.
You read. We listen. Let us know what you think by replying to this email.
Today’s trending AI news stories
OpenAI Confirms ChatGPT Really Did Get a Secret Upgrade
OpenAI has introduced an updated version of GPT-4o for ChatGPT, significantly enhancing its performance. Announced on 𝕏, the new version provides faster response times and improved interaction quality. While OpenAI has not detailed the precise changes, the update is distinct from the GPT-4o-2024-08-06 API update, which was designed to reduce costs for developers.
There is speculation that this update could involve a larger or differently configured model, potentially linked to Project Strawberry. Users can access the new model via the ChatGPT app, but it is unclear whether GPT-4o mini has been updated as well. To fully utilize the new features, a ChatGPT Plus subscription is required. Read more.
xAI Releases Grok-2, Adds Image Generation on X
xAI has launched Grok-2 and Grok-2 mini in beta, advancing from the previous Grok-1.5 model with improved reasoning capabilities. These models, available only to Premium and Premium+ users on 𝕏, now include image generation. Grok-2 is designed to improve chat, coding, and reasoning, while Grok-2 mini offers similar advantages in a more compact form.
Early feedback indicates that Grok-2’s image generation, powered by Black Forest Labs’ FLUX.1, lacks restrictions on political figures, raising concerns about potential misuse. Although specific details about Grok-2’s capabilities are sparse, it’s reportedly better at code generation and writing. xAI plans to offer these models through an enterprise API and integrate them into 𝕏’s features, such as advanced search and AI-driven replies.
MultiOn Announces New AI Agent Called ‘Agent Q’
Agent Q marks a significant advancement in autonomous web agents by addressing the limitations of current LLMs in handling dynamic, multi-step tasks. Traditional methods, which depend on static datasets and supervised fine-tuning, often struggle with complex decision-making because of limited exploration and compounding errors.
Agent Q improves on this by using guided Monte Carlo Tree Search (MCTS), AI self-critique, and reinforcement learning, including the Direct Preference Optimization (DPO) algorithm. This approach enables the model to explore various actions and web pages, learn from both successful and unsuccessful attempts, and refine its decision-making through iterative feedback.
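For readers curious about the DPO piece, here is a minimal sketch of the standard DPO loss for a single preference pair, written in plain Python. It is not MultiOn’s implementation: the function name `dpo_loss`, the `beta` value, and the log-probabilities in the usage lines are all hypothetical, chosen only to illustrate how a successful attempt ("chosen") is pushed up relative to an unsuccessful one ("rejected").

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) pair.

    Inputs are log-probabilities of each full response under the policy
    being trained and under a frozen reference model. beta=0.1 is an
    assumed, illustrative setting.
    """
    # How much more the policy favors each response than the reference does.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp
    x = beta * (chosen_margin - rejected_margin)
    # Loss is -log(sigmoid(x)), computed in a numerically stable way.
    if x >= 0:
        return math.log1p(math.exp(-x))
    return -x + math.log1p(math.exp(x))


# Hypothetical numbers: the policy already prefers the chosen response
# more than the reference does, so the loss falls below the neutral value.
neutral = dpo_loss(-1.0, -1.0, -1.0, -1.0)
improved = dpo_loss(-1.0, -5.0, -3.0, -3.0)
print(improved < neutral)
```

When the policy and the reference agree exactly, the loss sits at its neutral value of log 2; it shrinks as the policy shifts probability toward the preferred response.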
In tests, such as booking experiments on OpenTable, Agent Q significantly boosted the LLaMa-3 model’s success rate from 18.6% to 95.4%, demonstrating its effectiveness in enhancing autonomous web agents’ performance. Read more.
Gemini Live, Google’s answer to ChatGPT’s Advanced Voice Mode, Launches
Google’s Gemini Live, announced at the Made by Google 2024 event, is designed to rival OpenAI’s ChatGPT Advanced Voice Mode. This new feature enables users to have detailed voice interactions with Gemini on their smartphones. It incorporates an advanced speech engine for more natural and responsive conversations, allowing users to interrupt and adjust interactions in real time. Gemini Live offers ten new, natural-sounding voices and supports hands-free operation, including background usage and the ability to pause and resume conversations.
Gemini Live benefits from the large context window of the Gemini 1.5 Pro and Gemini 1.5 Flash models, which enables it to process and retain extensive conversational data. Although multimodal input capabilities are not yet available, they’re expected later this year, along with additional language support and iOS compatibility. Gemini Live is available through the $20 monthly Google One AI Premium Plan and features new integrations with Google services. Read more.
Etcetera: Stories you may have missed

5 new AI-powered tools from around the web
Profundo automates multi-source data aggregation, performs advanced analytics, and facilitates precise essay composition with integrated academic citations and AI integration.
Conva.AI is the first platform offering AI Assistant as a Service. It streamlines creation and integration of AI Assistants for apps, leveraging platform-specific SDKs.
Postgres.new provides an in-browser Postgres sandbox with AI assistance, enabling quick database creation, data manipulation, and advanced querying capabilities.
Jupitrr AI auto-generates B-roll visuals, leveraging AI to source stock footage, Google Images, and GIFs, with transcript-based content mapping and customizable animated subtitles.
Tusk is an AI-powered tool for automating UI changes, integrating with Jira and GitHub to streamline bug fixes, copy edits, and code reviews.
Augmeta is an AI-driven platform that enhances product and engineering management by offering adaptable AI peers for decision-making, task automation, and expertise augmentation.

arXiv is a free online library where researchers share pre-publication papers.



Your feedback is valuable. Reply to this email and tell us how you think we could add more value to this newsletter.
Interested in reaching smart readers like you? To become an AI Breakfast sponsor, reply to this email!