The AI model leaderboard

-

Welcome, AI enthusiasts.

The AI world’s favorite open LLM scoreboard just got a serious upgrade, and Alibaba’s Qwen 2 is on top of the rostrum (for now).

Hugging Face’s recent benchmarks are set to alter how we evaluate top models — a task becoming harder daily as AI continues to speed up. Let’s explore…

In today’s AI rundown:

  • Hugging Face updates Open LLM Leaderboard

  • NBC rolls out AI vocals for Olympic recaps

  • Enhance videos with Krea AI upscaling

  • Rabbit R1 hit with major security flaw

  • 5 recent AI tools & 4 recent AI jobs

  • More AI & tech news

Read time: 4 minutes

LATEST DEVELOPMENTS

OPENAI

🏆 Hugging Face updates Open LLM Leaderboard

Image source: Hugging Face

The Rundown: Hugging Face just introduced a brand new upgrade to its Open LLM Leaderboard, adding recent benchmarks and evaluation methods to assist address the recent plateau in LLM performance gains.

The main points:

  • The leaderboard now features six recent benchmarks designed to be more difficult and fewer vulnerable to contamination.

  • Initial rankings show Qwen2-72B-Instruct leading the pack, followed by Meta’s Llama-3-70B-Instruct and Mixtral 8×22b.

  • A brand new normalized scoring system adjusts for baseline performance, providing a more fair comparison across different evaluation types.

  • The upgrade also introduces a ‘maintainer’s highlight’ category and community voting system to prioritize probably the most relevant models.

Why it matters: As LLMs approach human-level performance on most tasks, finding recent ways to judge them is becoming harder — and more crucial. This revamp helps guide researchers and developers towards more targeted improvements, providing a more nuanced assessment of model capabilities.

TOGETHER WITH HEDONOVA

🤑 Simplify alternative investing with Hedonova

The Rundown: Hedonova is simplifying the complex means of investing in alternative assets — enabling investors to access a various portfolio media royalties, pre-IPO startups, wine, fantastic art, and more.

Hedonova’s advantages include:

  • An easy, single access point to a wide selection of other assets

  • SEC regulation and award-winning returns, outperforming the S&P 500 by 200% since 2019

  • A low minimum investment of just $10k

Start today and begin discovering the ability of other investments. 

THE OLYMPICS & AI

🎙️ NBC rolls out AI vocals for Olympic recaps

Image source: NBC

The Rundown: NBC is launching an AI-generated version of legendary sportscaster Al Michaels to narrate personalized Olympic highlight reels on its Peacock streaming service for the 2024 Paris Games.

The main points:

  • Subscribers can customize the 10-minute recap packages based on preferred sports, athletes, and content types, narrated by an AI clone of Michael’s voice.

  • The AI system was trained on Michaels’ past NBC broadcasts to recreate his signature style, with the broadcaster giving his approval for the method.

  • NBC said they estimate nearly 7M unique variations of recaps generated throughout the Olympics.

  • Human editors will reportedly review all AI-generated content for accuracy before being released to viewers.

Why it matters: The launch of A.I. Michaels marks a serious leap into the tech for a media giant, something we’ve seen reluctance and even outright dismissal of previously for fear of backlash. The tide is changing — and things like AI-recreated voices are steadily moving from controversial to the norm.

AI TRAINING

🎥 Enhance videos with Krea AI upscaling

The Rundown: Krea AI recently released a brand new video upscaling feature that enables users to enhance the standard of their blurry videos without spending a dime.

Step-by-step:

  1. Enroll without spending a dime or login to Krea AI.

  2. Click on the “Upscale & Enhance” button within the fundamental dashboard

  3. Upload your video and customize the enhancement settings: Upscaling factor, framerate prompt, mode, AI strength, and resemblance

  4. Click “Enhance”, wait just a few minutes, and check your recent enhanced video 🎉

Get more AI tutorials →

PRESENTED BY BRILLIANT

⏲️ AI mastery in minutes a day

The Rundown: Good’s interactive courses demystify AI — helping you stay competitive in today’s tech-driven world with just minutes of each day learning.

Good’s platform lets you:

  • Unravel the mysteries of AI through expert-designed, interactive lessons

  • Apply your knowledge to unravel real-world tech challenges

  • Transform learning right into a each day habit with bite-sized, gamified content

Join 10 million learners worldwide and begin your 30-day free trial today. P.S. — enjoy 20% off a premium annual subscription, exclusive to The Rundown readers.

RABBIT

🚨 Rabbit R1 hit with major security flaw

Image source: Rabbit Inc.

The Rundown: A gaggle of developers just discovered a serious vulnerability in Rabbit’s R1 AI assistant device, potentially exposing user’s private data and chat responses.

The main points:

  • A community-led group called Rabbitude uncovered hardcoded API keys in Rabbit’s codebase, which allowed access to all R1 responses.

  • The group gained access to the codebase in mid-May, saying the Rabbit team was aware of the difficulty but didn’t take motion.

  • Rabbitude said the vulnerability could allow bad actors to disable all r1 devices, alter voices and responses, and access private messages.

  • Rabbit acknowledged an ‘alleged data breach’ via a Discord post, but claims no customer data was leaked.

Why it matters: Despite massive hype in the primary wave of consumer AI standalone devices, the Rabbit r1 has been nothing in need of a disaster to date. Already facing major criticism over the companion’s limited capabilities, this security breach only furthers the skepticism surrounding the early AI hardware market entrants.

NEW TOOLS & JOBS

Trending AI Tools

  • 📈 June – AI-powered customer analytics for product-focused teams

  • 📲 Pygma – Personal AI social media manager

  • 🚀 AppFlowy – Open-sourced alternative to Notion, manage wiki & projects with AI

  • 🖥️ VisualSitemaps – Autogenerate visual sitemaps for UX and search engine marketing

  • ⚖️ Created by Human – AI rights licensing platform for creators

Browse more AI tools →

Latest AI Job Opportunities

  • 🚀 Scale AI – Chief of Staff, Generative AI

  • 🎨 DeepL – Director of Product Design

  • 👥 Glean – Senior Technical Recruiter

  • 🔬 Luma AI – Senior Machine Learning Engineer – Data Quality

Browse more AI jobs →

QUICK HITS

Free workshop: AI for Strategic Decision-Making (July 9). In case you’re still mostly using AI for copy generation, join Section’s free workshop on putting it to work as a thought partner. Enroll here.*

Figma unveiled Figma AI, a series of recent AI-powered features to its design platform, including Visual and Asset Search, AI text tools, image generation, and quick prototyping.

YouTube is reportedly in negotiations with record labels to license songs for the corporate’s AI-powered music generation tools, with plans to launch recent features later this yr.

Israel announced plans to construct its first supercomputer, investing $250M in a national AI program to keep up its global leadership within the tech space.

Formation Bio secured $372M in funding to advance AI-driven drug development, with the Sam Altman-backed startup boosting its valuation above $1B.

Opera launched a brand new R2 update to its Opera One browser, featuring recent AI-powered image generation and recognition, an AI Voice Output, and Page Context Mode for summaries and translations during web browsing.

*Sponsored listing

THAT’S A WRAP

SPONSOR US

Get your product in front of over 600k+ AI enthusiasts

Our newsletter is read by 1000’s of tech professionals, investors, engineers, managers, and business owners around the globe. Get in contact today.

FEEDBACK

How would you rate today’s newsletter?

Vote below to assist us improve the newsletter for you.
  • ⭐️⭐️⭐️⭐️⭐️ Nailed it
  • ⭐️⭐️⭐️ Average
  • ⭐️ Epic Fail

Login or Subscribe to take part in polls.

If you could have specific feedback or anything interesting you’d wish to share, please tell us by replying to this email.

ASK DUKE

What are your thoughts on this topic?
Let us know in the comments below.

0 0 votes
Article Rating
guest
0 Comments
Inline Feedbacks
View all comments

Share this article

Recent posts

0
Would love your thoughts, please comment.x
()
x