OpenAI, Anthropic fight on the frontier


Good morning, AI enthusiasts. Yesterday, it was Super Bowl attack ads. Today, OpenAI and Anthropic are letting the models do the talking.

With back-to-back flagship drops that pushed agentic coding, self-improving AI, and enterprise automation forward in a single afternoon, things are moving faster than ever, and the “AI is hitting a wall” crowd might want to sit this news cycle out.

In today’s AI rundown:

  • OpenAI’s GPT-5.3-Codex helps build itself

  • Anthropic’s Opus 4.6 with ‘agent teams’, 1M context

  • Cut down reporting times with Claude in Excel

  • OpenAI’s Frontier to manage ‘AI coworkers’

  • 4 new AI tools, community workflows, and more

LATEST DEVELOPMENTS

OPENAI

Image source: OpenAI

The Rundown: OpenAI just rolled out GPT-5.3-Codex, a new flagship coding model that merges its best programming and reasoning capabilities into one faster package, while also serving as a key tool in its own training and deployment process.

The main points:

  • OpenAI said early versions of 5.3-Codex were used to find bugs in its own training runs, manage its rollout, and analyze evaluation results.

  • Codex tops agentic coding benchmarks like SWE-Bench Pro and Terminal-Bench 2.0, beating Opus 4.6 by 12% on the latter just minutes after Anthropic’s release.

  • On OSWorld, a benchmark testing AI control of desktop computers, the model scored 64.7% — nearly double the 38.2% from the prior Codex version.

  • OpenAI flagged the model as its first to carry a “High” cybersecurity risk rating and committed $10M in API credits to fund defensive security research.

Why it matters: The self-improvement angle here is the headline, with Anthropic’s Dario Amodei also recently saying Claude helps design its own successor. Yesterday’s bickering over ads now looks childish next to the real fight on the model frontier, with a huge day of dueling releases out of both labs.

TOGETHER WITH BLAND

The Rundown: Bland AI automates phone calls for 250+ enterprise customers. No phone trees. No hold music. Just faster, smarter customer conversations.

Here are some of the outcomes they’ve driven for businesses:

  • Idaho Finance saved $750k/yr by replacing their IVR with AI Voice Agents

  • MyPlanAdvocate added $40M/yr by automating their inbound lead qualification

  • And Needle saves $1M/yr by automating outbound calls

Book a demo today to see how it can work for your company.

ANTHROPIC

Image source: Anthropic

The Rundown: Anthropic released Claude Opus 4.6, the company’s new strongest model, featuring multi-agent collaboration in Claude Code, a massive context window, and new Office integrations that put the AI directly inside PowerPoint.

The main points:

  • A new “agent teams” feature in Claude Code lets multiple AI agents split a single project and work concurrently instead of handling steps one after the other.

  • Opus 4.6 brings a 1M token context window to Anthropic’s Opus tier for the first time, matching what Sonnet offers for heavy document and code work.

  • New Excel and PowerPoint sidebars let Claude read users’ existing templates and build models or decks natively without copying and pasting between tools.

  • 4.6 topped most agentic benchmarks, including a leap on ARC-AGI-2 to almost 70% — though OAI’s Codex 5.3 reclaimed agentic coding highs minutes later.
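For a feel of what “agent teams” means in practice, here is a minimal Python sketch of the general pattern: one project split into subtasks that run concurrently instead of one after the other. This is not how Claude Code implements the feature; `run_agent`, `run_team`, and the subtask names are hypothetical stand-ins, and a plain thread pool plays the role of the team.

```python
from concurrent.futures import ThreadPoolExecutor


def run_agent(subtask: str) -> str:
    """Hypothetical stand-in for one agent completing one piece of the project."""
    return f"{subtask}: done"


def run_team(subtasks: list[str]) -> list[str]:
    """Give each subtask its own worker and return results in input order."""
    # max(1, ...) keeps the pool valid even when the subtask list is empty.
    with ThreadPoolExecutor(max_workers=max(1, len(subtasks))) as pool:
        return list(pool.map(run_agent, subtasks))


if __name__ == "__main__":
    for line in run_team(["frontend", "backend", "tests"]):
        print(line)
```

The point of the sketch is only the shape of the work: the subtasks start together, and the results come back in the order the subtasks were listed.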

Why it matters: It’s a big day for devs, with both the Codex 5.3 and Opus 4.6 releases bringing major capability increases across the board. With time between upgrades getting shorter and the length of tasks models can tackle continuing to move up the curve, the “AI is hitting a wall” crowd seems pretty quiet lately.

AI TRAINING

The Rundown: In this guide, you’ll do a quick exercise that teaches you how to use Claude as a spreadsheet architect, taking 5+ messy CSVs and watching Claude handle data cleaning, table formatting, color-coding, and more.

Step-by-step:

  1. Install Claude’s Excel app from the Microsoft Marketplace. For this example, we used a year’s worth of SEO data, but you can use sales data, receipts, etc.

  2. In Excel, click the Claude button and prompt “I have [data type] data from [sources] for my website/brand/team. Make a plan to rename each tab and clean up the data to make it more readable”. Then, edit and approve the plan.

  3. Once done, ask Claude to make a plan for the master dashboard tab: “Based on all tabs, what’s the best way to tie this data into a Master Dashboard?”

  4. Finally, you can ask Claude to visualize data with prompts like “Create a combo chart for Clicks vs. Average Position”

Pro tip: Asking Claude to review the data and create a plan improves its output significantly compared to asking it to start immediately.
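If you’re curious what a cleanup plan could look like as code, here is a rough Python sketch of the kind of steps involved: normalizing header names, trimming stray whitespace, and dropping empty rows. It is not what Claude runs inside Excel; `clean_csv` and the sample columns are hypothetical, and only the standard library is used.

```python
import csv
import io


def clean_csv(raw_text: str) -> list[dict[str, str]]:
    """Return tidy rows from messy CSV text (hypothetical helper)."""
    rows = list(csv.reader(io.StringIO(raw_text)))
    if not rows:
        return []
    # Normalize headers: trim, lowercase, and use underscores for spaces.
    headers = [h.strip().lower().replace(" ", "_") for h in rows[0]]
    cleaned = []
    for row in rows[1:]:
        values = [v.strip() for v in row]
        if not any(values):
            continue  # skip rows that are blank or contain only empty cells
        cleaned.append(dict(zip(headers, values)))
    return cleaned


if __name__ == "__main__":
    sample = " Query , Clicks ,Average Position\nshoes , 120 , 3.4\n,,\n boots,80,5.1 \n"
    for record in clean_csv(sample):
        print(record)
```

Values stay as strings here; converting columns like clicks to numbers would be a natural next step in a real plan.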

PRESENTED BY TRIPLE WHALE

The Rundown: Triple Whale merchants saw LLM-referred orders jump from 7,152 in 2024 to 424,000+ in Q4 2025 alone. AEO (AI Engine Optimization) is the next frontier, and early movers are building an unfair advantage. Try Triple Whale’s free tool to see how LLMs see your brand across ChatGPT and other leading platforms.

With the AI Visibility tool, you can:

  • Monitor your brand’s AI visibility score for free

  • Track mentions across ChatGPT and leading LLMs

  • Connect AI referrals to actual revenue with attribution

OPENAI

Image source: OpenAI

The Rundown: OpenAI just launched Frontier, a new platform for enterprises to deploy and manage AI agents like new hires, complete with onboarding, permissions, and performance reviews across a company’s existing tech stack.

The main points:

  • Frontier connects to existing enterprise systems like CRMs and ticketing tools, letting agents pull context from across the business without migrations.

  • Built-in eval and feedback loops let agents learn via experience, with OAI comparing it to onboarding a new employee with reviews and limits.

  • Every agent operates under its own profile with scoped access and hard limits on what it can touch, to give enterprises and regulated industries control.

  • HP, Oracle, State Farm, and Uber are among the first adopters, with OAI embedding engineers on-site to help teams get agents into production.
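The “own profile with scoped access” idea is easy to picture with a small Python sketch. This is an illustration of the general pattern only, not Frontier’s API: `AgentProfile`, `is_allowed`, and the action strings such as `crm:read` are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class AgentProfile:
    """Hypothetical agent identity carrying an explicit allow-list of actions."""
    name: str
    allowed_actions: frozenset[str] = field(default_factory=frozenset)


def is_allowed(profile: AgentProfile, action: str) -> bool:
    """Hard limit: anything not explicitly listed is denied."""
    return action in profile.allowed_actions


# Example profile: may read the CRM and write tickets, nothing else.
support_agent = AgentProfile(
    name="support-agent",
    allowed_actions=frozenset({"crm:read", "tickets:write"}),
)

if __name__ == "__main__":
    print(is_allowed(support_agent, "crm:read"))
    print(is_allowed(support_agent, "billing:write"))
```

Denying by default is the design choice that matters here: an agent never gains access to a system just because nobody thought to block it.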

Why it matters: Anthropic and OAI have been battling over models and coding tools, but Frontier shows the fight is also bleeding into who controls the enterprise agent layer underneath. Model capabilities are making AI coworkers a reality in the near future, and the system that ultimately orchestrates them could be valuable real estate.

QUICK HITS

  • ⚙️ GPT-5.3-Codex – OpenAI’s new SOTA agentic coding model

  • 🧠 Claude Opus 4.6 – Anthropic’s upgrade to its strongest model line

  • 🤖 OpenAI Frontier – Enterprise platform to create, deploy, manage AI agents

  • 🔎 Model Council – Perplexity’s new tool for querying multiple models

Perplexity launched Model Council, a new feature that runs queries through multiple AI models at the same time and synthesizes outputs into a single answer.

Roblox introduced 4D generation via its Cube AI foundation model, letting creators generate fully functional, interactive objects from text prompts.

Lotus Health raised $35M in Series A funding for its free AI-powered primary care platform, providing diagnosis, prescriptions, and referrals across 50 states.

Meta is rolling out a standalone app for its Vibes AI video platform, which was previously only available via the Meta app.

AI evaluation firm METR released a new evaluation for GPT-5.2 (high), finding it can now handle tasks that would take a human engineer over 6 hours to finish.

COMMUNITY

Every newsletter, we showcase how a reader is using AI to work smarter, save time, or make life easier.

Today’s workflow comes from reader T. in Canada:

“I use AI to source vendors for fresh produce and compare/predict market price changes from globally supplied goods. It helps to maintain food security in our northern region.”

How do you use AI? Tell us here.

That’s it for today!

Before you go, we’d love to know what you thought of today’s newsletter to help us improve The Rundown experience for you.
  • ⭐️⭐️⭐️⭐️⭐️ Nailed it
  • ⭐️⭐️⭐️ Average
  • ⭐️ Fail


See you soon,
