OpenAI’s Sora 2 is INCREDIBLE

-

Good morning. It’s Wednesday, October 1st.

On today in tech history: In 2003the DARPA-funded CALO (Cognitive Assistant that Learns and Organizes) project held its first cross-institution integration demo on. CALO’s legacy code directly seeded Apple’s Siri years later. It was one in every of the primary serious attempts to unify NLP, task planning, and context models in a single assistant.

  • Sora 2 – The Best AI Video Generator Yet

  • Deepmind’s “Chain-of-Frames” Theory

  • Claude Sonnet 4.5s 30hr Reasoning

  • 5 Latest AI Tools

  • Latest AI Research Papers

You read. We listen. Tell us what you think that by replying to this email.

8 Weeks. Actionable AI Skills. MBA-Style Networking.

  • Construct AI confidence with role-specific use cases

  • Find out how leaders are implementing AI strategies at top financial firms

  • Secure an enduring network that supports your profession growth

Earn your certificate from Columbia Business School Executive Education—program starts November 10.

Enroll by Oct. 13 to get $200 off tuition + use code AIBREAKFAST for an extra $300 off.

Today’s trending AI news stories

OpenAI goes full platform with Sora 2 drop, copyright opt-out, and in-chat payments

Sora 2 is a significant technical upgrade: synced audio, beats Google’s Veo 3 on physics and realism, higher scene continuity, and support for all the pieces from anime to cinematic shots. It may possibly follow complex, multi-shot prompts without breaking continuity, keeping track of objects, momentum, and scene logic. A defining feature is “cameos,” where verified users can insert their likeness and voice into generated scenes, with strict opt-in consent, revocation rights, and cryptographic watermarking.

The companion Sora app mirrors TikTok with a vertical feed, remix tools, and social features. But unlike competitors, OpenAI has warned studios that future Sora versions may incorporate copyrighted material unless rightsholders explicitly opt out, a pointy inversion of the standard consent model and a possible legal flashpoint. The app is invite-only within the U.S. and Canada, with Android and API support coming soon.

ChatGPT also added a shopping layer. “Easy Checkout” lets U.S. users buy Etsy and shortly Shopify products directly in chat. Purchases run on the brand new Agentic Commerce Protocol (ACP), co-developed with Stripe and now being open-sourced. ACP uses encrypted, per-merchant payment tokens and step-by-step confirmations. Merchants can plug in via Stripe or token APIs, and OpenAI takes a transaction fee without touching pricing or rankings. Multi-item carts and international rollout are next.

With Sora’s TikTok-style AI feed, default copyright opt-outs, and agent-driven payments, OpenAI is facing what critics call its “infinite slop” moment. Critics argue the corporate is collapsing search, rating, and payments right into a single black box controlled by one AI vendor, raising antitrust concerns, privacy alarms over behavioral data, and claims it’s drifting away from its safety mission. Read more.

DeepMind says video models are the following LLMs, powered by zero-shot “chain-of-frames”

A brand new Google DeepMind paper positions its Veo 3 video model because the visual counterpart to large language models. Trained with a continuation objective on web-scale data, Veo 3 performs zero-shot across greater than 60 visual tasks, covering segmentation, detection, denoising, and super-resolution, without task-specific tuning.

The model also shows early signs of what researchers call chain-of-frames reasoning, using temporal cues across generated frames to resolve visual logic tasks like mazes and symmetry. DeepMind expects inference costs to say no similarly to LLMs, suggesting generative video could replace specialized vision models over time.

On the patron front, Google is upgrading AI Mode in Image Search with conversational querying. Somewhat than filtering by static attributes, users can describe what they need in natural language, mix prompts with reference photos, and iteratively refine results, whether searching for clothing or browsing interior design ideas. Powered by Gemini 2.5 and built on Google Lens, the system parses subtle visual context, secondary objects, and stylistic cues while surfacing shoppable links directly in Search.

Google Drive for desktop, meanwhile, is getting AI-based ransomware detection. A model trained on tens of millions of real-world samples analyzes file behavior and halts syncing if it detects mass encryption or malicious modification. The system then alerts users via email and desktop notification and offers rollback to wash versions. The tool is entering open beta now, with general availability expected by yr’s end. Read more.

Claude Sonnet 4.5 ships with 30-hour focus, Agent SDK, and no-scaffold coding preview

Anthropic just dropped Claude Sonnet 4.5, and it’s clearly gunning for GPT-5, not with flashy demos, but with stamina and tooling. The model can stay locked onto a coding job for greater than 30 hours without derailing, which is a giant jump from even its own Opus 4.1. Benchmarks back it up: 77.2% on SWE-bench Verified, 50% on Terminal-bench, and a jump from 42.2% to 61.4% on OSWorld in only 4 months. That shows it isn’t just spitting out code, it might actually operate inside real computing environments.

Pricing hasn’t budged ($3 per million input tokens, $15 per output), though GPT-5 is as much as seven times cheaper. Anthropic’s betting that enterprises pays for reliability, especially because it still holds 42% of the code-gen market and a $5B run rate, though most of that leans on Cursor and GitHub Copilot.

Anthropic also added checkpoints for pausing long tasks, a VS Code extension, higher terminal controls, context editing, and a memory tool to scale back context blowouts. Security is tightened under ASL-3, with higher guards against prompt injection and sensitive misuse. To check what’s next, Anthropic also launched a five-day preview called “Imagine with Claude” for Max users. It strips out prewritten functions entirely, Claude has to generate software logic from zero, in real time.

The corporate claims Sonnet 4.5 can be scoring higher in math, finance, cybersecurity, and domain reasoning. With GPT-5 pushing on price and scale, Anthropic is pitching endurance, autonomy, and developer fit as its differentiators. Read more.

  • Meta is acquiring AI chip startup Rivos, because it seeks to scale back reliance on Nvidia

  • Excel gets a planning agent as Copilot tests ‘Portraits’ chat

  • Amazon’s 2025 hardware event: the 8 biggest announcements

  • Periodic Labs, led by OpenAI, DeepMind researchers, bets on AI to hurry advances in physics and chemistry

  • Opera launches Neon AI browser to hitch agentic web browsing race

  • PayPal’s Honey to integrate with ChatGPT and other AIs for shopping assistance

  • Kling AI launches $42K global video contest for AI creators, winners to screen at Cannes, Tokyo

  • Z.ai released a brand new iteration of its flagship model: GLM 4.6

  • Akuity’s newest AI automations quickly discover, triage and remediate Kubernetes application incidents

  • Hollywood Actors Union slams ‘AI Actress’ Tilly Norwood as backlash builds

  • Nothing Phone 3 update will allow you to create apps with AI and share them

  • AI note-taking app Granola adds a repeatable prompts feature

  • Engineers create first artificial neurons that might directly communicate with living cells

  • 3D printing becomes stronger and more economical with light and AI

  • Swift teams with 30+ banks and Consensys to construct blockchain-based ledger prototype

  • DeepSeek’s recent V3.2-Exp model cuts API pricing in half to lower than 3 cents per 1M input tokens

  • Apple launches Foundation Models framework to empower developers with smarter app capabilities

  • Humanoid robots get smarter muscles and sharper minds with NVIDIA’s latest arsenal

  • Australia’s recent robot 3D prints a house overnight; could construct lunar bases at some point

  • $400 exoskeleton suit built from threads and motors delivers lifelike VR feedback

  • SB 53, the landmark AI transparency bill, is now law in California

  • Engineers buck against ‘vibe-coding’ label, saying responsibility still lies with the humans behind the code

  • 6,100-qubit processor shatters quantum computing record

  • Lufthansa to chop 4,000 jobs as airline turns to AI to spice up efficiency

  • Files selector is rolling out on Jules Agent

  • Cursor releases update to the newest version and enable it in Settings → Beta

  • Vulnerability exposes Unitree robots to distant fleet-wide takeover, researchers warn

  • Lovable launches cloud and AI platform to show prompts into full apps

  • Looki launches $199 kitten-shaped AI wearable camera

  • Former Microsoft execs launch AI agents to finish Excel-led finance

5 recent AI-powered tools from around the online

arXiv is a free online library where researchers share pre-publication papers.

Your feedback is worthwhile. Reply to this email and tell us how you think that we could add more value to this text.

Inquisitive about reaching smart readers such as you? To grow to be an AI Breakfast sponsor, reply to this email or DM us on 𝕏!

ASK ANA

What are your thoughts on this topic?
Let us know in the comments below.

0 0 votes
Article Rating
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments

Share this article

Recent posts

0
Would love your thoughts, please comment.x
()
x