Alibaba’s o1 reasoning rival

-

Welcome, AI enthusiasts.

Chinese tech giant Alibaba just entered the reasoning race in an enormous way — with a brand new open o1 rival that matches the industry leader’s capabilities.

Open-source AI is officially competing with Silicon Valley’s finest, and OpenAI’s model moat is looking thinner by the day. Let’s get into it…

In today’s AI rundown:

  • Alibaba challenges o1 with open-source reasoning model

  • AI2 launches fully open Llama competitor

  • Create live web prototypes with Qwen Artifacts

  • AI outperforms experts at predicting scientific results

  • 5 latest AI tools & 5 latest AI jobs

  • More AI & tech news

Read time: 4 minutes

Vital: The Rundown is slowly being sent from a brand new email address. To make sure you get our newsletter, please add [email protected] to your contact list.

LATEST DEVELOPMENTS

ALIBABA

🧠 Alibaba challenges o1 with open-source reasoning model

Image source: Alibaba

The Rundown: Alibaba’s Qwen team just released QwQ-32B-Preview, a robust latest open-source AI reasoning model that may reason step-by-step through difficult problems and directly competes with OpenAI’s o1 series across benchmarks.

The small print:

  • QwQ incorporates a 32K context window, outperforming o1-mini and competing with o1-preview on key math and reasoning benchmarks.

  • The model was tested across several of essentially the most difficult math and programming benchmarks, showing major advances in deep reasoning.

  • QwQ demonstrates ‘deep introspection,’ talking through problems step-by-step and questioning and examining its own answers to reason to an answer.

  • The Qwen team noted several issues within the Preview model, including getting stuck in reasoning loops, combating common sense, and language mixing.

Why it matters: Between QwQ and DeepSeek, open-source reasoning models are here — and Chinese firms are absolutely cooking with latest models that almost match the present top closed leaders. Has OpenAI’s moat dried up, or does the AI leader have something special up its sleeve before the tip of the 12 months?

TOGETHER WITH EIGHT SLEEP

🧠 Sleep with AI-powered precision

The Rundown: Eight Sleep’s Pod 4 Ultra is redefining sleep by combining AI, biometrics, and personalized climate control for the final word night’s rest —  bringing lab-grade sleep optimization to your bedroom.

The Pod 4 Ultra offers:

  • AI-driven temperature adjustments throughout the night

  • Detailed sleep analytics and day by day sleep fitness scores

  • Advanced snore detection with automatic bed adjustments

Use code RUNDOWN at eightsleep.com/rundown for as much as $600 off bundled purchases through December 14th.

AI2

🚀 AI2 launches fully open Llama competitor

Image source: AI2

The Rundown: Research institute AI2 just released OLMo 2, a brand new family of fully open-source language models that matches the performance of similar-sized competitors like Meta’s Llama.

The small print:

  • The 7B and 13B models were trained on a 5T token dataset of high-quality academic content, filtered web data, and specialized instruction sources.

  • The OLMo models achieved similar or higher results while using less computing power than competitors and being smaller in size.

  • The models are fully open, with AI2 providing access to source code, training data, and a dev package with training recipes and evaluation frameworks.

  • The discharge also includes instruction-tuned variants, which achieve competitive results against leading open models like Qwen 2.5.

Why it matters: While other open-source models release weights but remain heavily guarded, OLMo 2 proves that cutting-edge AI could be developed and released completely within the open — potentially setting a robust latest standard for a way future systems are built and shared.

AI TRAINING

⚙️ Create live web prototypes with Qwen Artifacts

The Rundown: Qwen2.5-Coder’s latest Artifact feature immediately transforms your web ideas into live, interactive prototypes.

Step-by-step:

  1. Visit Hugging Face and locate the Qwen2.5-Coder-Artifacts space.

  2. Enter your prototype description with specific design requirements.

  3. Click “Send” to generate and preview your prototype immediately.

  4. Refine the design and export the code to your project.

Pro tip: Start with basic layouts and regularly add features to construct complex prototypes efficiently.

AI RESEARCH

🧪 AI outperforms experts at predicting scientific results

Image source: Ideogram

The Rundown: A brand new study from the University College of London just revealed that AI systems can predict scientific outcomes significantly higher than expert neuroscientists — also uncovering ‘hidden’ patterns in research that would help higher guide future studies.

The small print:

  • A ‘BrainBench’ tool was used to check 15 AI models and 171 neuroscience experts’ ability to differentiate real vs. fake outcomes in research abstracts.

  • The AI models achieved 81% accuracy, in comparison with 63% for the experts — with a ‘BrainGPT’ trained on neuroscience papers scoring even higher at 86%.

  • The success suggests scientific research follows more discoverable patterns than previously thought, which AI can leverage to guide future experiments.

  • The researchers are developing tools to assist scientists validate experimental designs before conducting studies, potentially saving time and resources.

Why it matters: While AI’s pattern recognition capabilities aren’t surprising, its ability to predict scientific outcomes could completely change how research is conducted. Using AI to validate experiments before spending any time within the lab may lead to faster research cycles, fewer dead ends, and accelerated scientific breakthroughs.

NEW TOOLS & JOBS

Trending AI Tools

  • 🎥 Magic Roll – Create viral shorts in a single click with B-roll, motion graphics, and AI-powered captions

  • 🤝 OfferGenie – AI-powered profession copilot with real-time guidance to ace every interview

  • 📸 Runway Frames – A brand new foundation model for image generation with style precision and visual world-building.

  • ⚙️ Foundry – Construct, evaluate, and improve AI agents that may automate key parts of your online business

  • 💬 Llms.txt Generator – Generate an llms.txt file to your website to offer information to assist LLMs use your website at inference time

Recent AI Job Opportunities

  • 🚀 The Rundown – Head of Growth

  • 🛠️ Shield AI – Manufacturing Engineer

  • 💼 Cresta – Sales Development Representative, Recent York

  • 🧠 Author – Director, AI Research

  • 🔬 Deepmind – Research Engineer, Materials Science

QUICK HITS

OpenAI temporarily suspended access to Sora for beta testers following Tuesday’s leak, with a bunch of artists creating an unauthorized public interface to the AI video tool.

xAI reportedly plans to release a standalone app to compete with OpenAI’s ChatGPT as early as December, marking the corporate’s first product outside of the X platform.

H Company showcased latest demos of its Runner H agent, performing advanced web tasks, including real-time data extraction, complex interface navigation, and precision web scraping across multiple platforms.

ElevenLabs introduced GenFM podcasts, a brand new feature that permits users to generate AI-hosted conversations in 32 languages about uploaded PDFs, articles, eBooks, and more.

Elon Musk posted on X that he plans to begin an AI game studio with xAI, saying he desires to “make games great again.”

Chinese self-driving startup Pony AI raised $260M at a $4.5B valuation because the autonomous taxi company’s U.S. IPO goes live for trading this week.

THAT’S A WRAP

That is it for today!

Before you go we’d like to know what you considered today’s newsletter to assist us improve The Rundown experience for you.
  • ⭐️⭐️⭐️⭐️⭐️ Nailed it
  • ⭐️⭐️⭐️ Average
  • ⭐️ Fail

Login or Subscribe to take part in polls.

See you soon,

Rowan, Joey, Zach, and Alvaro—aka The Rundown Team

ASK DUKE

What are your thoughts on this topic?
Let us know in the comments below.

0 0 votes
Article Rating
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments

Share this article

Recent posts

0
Would love your thoughts, please comment.x
()
x