LLMs pass legendary Turing test

Good morning, AI enthusiasts. A historic AI milestone just arrived with little fanfare — with AI systems now consistently passing as humans in controlled conversations, passing the legendary Turing test.

With GPT-4.5 achieving a 73% success rate in fooling judges during casual conversation and models only getting more capable, are we ready for a world where we will not tell AI from humans?

In today’s AI rundown:

LLMs officially pass the Turing test
Anthropic brings Claude to higher education
Create product showcase videos with Kling AI
Google DeepMind publishes AGI safety plan
4 latest AI tools & 4 job opportunities

LATEST DEVELOPMENTS

AI RESEARCH

🧠 LLMs officially pass the Turing test

Image source: GPT-4o / The Rundown

The Rundown: Researchers at UC San Diego just demonstrated that AI systems can consistently pass Alan Turing’s famous test of machine intelligence, with OpenAI’s GPT-4.5 being mistaken for human nearly three-quarters of the time in controlled trials.

The small print:

The Turing test, proposed in 1950, challenges machines to persuade human judges they’re human through text-only conversations.
The study used a three-party setup where judges had to match an AI and a human concurrently for direct comparison during five-minute conversations.
The judges relied on casual conversation and emotional cues over knowledge, with over 60% of interactions specializing in day by day activities and private details.
GPT-4.5 achieved a 73% win rate in fooling human judges when prompted to adopt a selected persona, significantly outperforming real humans.
Meta’s LLaMa-3.1-405B model also passed the test with a 56% success rate, while baseline models like GPT-4o only achieved around 20%.

Why it matters: The Turing test has been a holy grail of AI research for a long time — but model acceleration moved the goalposts so fast that the outcomes don’t feel surprising in any respect. With AI agents equipped with next-level text, audio, image, and video capabilities, the power to tell apart AI from humans is about to grow to be a significant challenge.

TOGETHER WITH CONVEYOR

✍️ Let Phil write winning RFPs

The Rundown: Meet Phil — Conveyor’s AI agent that automates your proposal process. From generating near-flawless responses to coordinating multiple teams, Phil helps you secure more wins with less hassle.

Phil, the AI agent for RFPs, can do all of this autonomously:

Generate 95% accurate AI answers to proposals
Maintain your source material and conduct RFP research
Organize your project, collaborate with other teams, and update systems

Enroll for early access and upgrade your proposal process with Phil.

ANTHROPIC

🎓 Anthropic brings Claude to higher education

Image source: Anthropic

The Rundown: Anthropic launched Claude for Education, a specialized version of its AI assistant that goals to develop students’ critical considering relatively than simply provide answers — introducing a brand new “Learning Mode” alongside major university partnerships.

The small print:

The Learning Mode asks inquiries to guide students through problem-solving, specializing in their understanding of the topic relatively than quick answers.
Other features include templates for research papers, study guides and descriptions, organization of labor and materials, and tutoring capabilities.
Northeastern University, London School of Economics, and Champlain College signed campus-wide agreements, giving access to each students and college.
Anthropic also introduced student programs, including Campus Ambassadors and API credits for projects, to foster a community of AI advocates.

Why it matters: Education continues to grapple with AI, but Anthropic is flipping the script by making the tech a partner in developing critical considering relatively than a solution engine. While the controversy over its use likely isn’t going away, this generation of scholars can have access to essentially the most personalized, high-quality learning tools ever.

AI TRAINING

🛍️ Create product showcase videos with Kling AI

The Rundown: On this tutorial, you’ll learn the right way to use Kling AI’s Elements feature to remodel static product images into skilled animated videos for marketing across all platforms.

Step-by-step:

Open Kling AI‘s “Image to Video” section and choose the “Elements” tab.
Upload your product image because the principal element (high-quality with clean background) and add complementary elements like props or contextual items to reinforce your product’s appeal.
Write a selected prompt describing your ideal product showcase scene.
Click “Generate” to create your skilled product video ready for all marketing channels.

Pro tip: We did an intensive workshop on the right way to use Kling AI to reinforce your promoting and artistic production workflows at The Rundown University with Tony Pu, Product Marketing & Operations Lead at Kling AI.

PRESENTED BY ENCORD

🛠️ Construct multimodal datasets for physical AI

The Rundown: Encord consolidates multimodal AI data management, curation, and annotation pipelines to 1 single platform — helping teams speed up model iteration cycles by utilizing an agentic AI data workflow system to arrange balanced, accurately labeled datasets 10x faster.

Join Encord & Archetype AI on April tenth for a Physical AI data workshop webinar to learn the right way to:

Discover critical edge-cases inside petabytes of multimodal sensor data for video, audio, and text
Integrate AI models like GPT-4o, Grok 3, and Gemini 2.5 directly into data pipelines to speed up high-quality data annotation
Give Physical AI models wealthy multimodal context, enabling multi-step reasoning for safer, more reliable deployments

GOOGLE DEEPMIND

🛡️ Google DeepMind publishes AGI safety plan

Image source: Google DeepMind

The Rundown: Google DeepMind just published an enormous paper detailing its safety strategy for AGI, hitting on topics including its potential arrival by 2030, the risks posed by the tech advances, and proposed approaches to combating them.

The small print:

The 145-page paper predicts that AGI matching top human skills could arrive by 2030, warning of existential threats “that permanently destroy humanity.”
DeepMind compares its safety approach with rivals, critiquing OpenAI’s concentrate on automating alignment and Anthropic’s lesser emphasis on security.
The paper specifically flags the danger of “deceptive alignment,” where AI intentionally hides its true goals, noting current LLMs show potential for it.
Key recommendations targeted misuse (cybersecurity evals, access controls) and misalignment (AI recognizing uncertainty and escalating decisions).

Why it matters: Because the race toward AGI accelerates, DeepMind’s safety blueprint is a shift from theoretical discussions to concrete planning. But with the vast amount of labs, models, and open-source options popping up across the globe, ensuring everyone adheres to safety protocols seems like an inconceivable game of whack-a-mole.

QUICK HITS

🛠️ Trending AI Tools

💬 Speech-02 – Minimax’s text-to-speech AI supporting over 30 languages
👄 MoCha – Generate movie-grade talking characters from speech and text
🤖 Agent Swarms – Automate tasks with a whole bunch of agents working together
🗣️ Vapi – Construct voice AI agents in minutes with any model

💼 AI Job Opportunities

⚙️ Hume – Senior DevOps Engineer
🎙️ Soundhound AI – General Manager, Smart Answering
🧠 Dataiku – VP, Industry Solutions
💰 Scale AI – Revenue Operations Manager

📰 The whole lot else in AI today

Meta is planning to launch latest $1000+ “Hypernova” AI-infused smart glasses that feature a screen, hand-gesture controls, and a neural wristband by the top of the 12 months.

OpenAI published PaperBench, a brand new benchmark testing AI agents’ ability to duplicate SOTA research, with Claude 3.5 Sonnet (latest) rating highest of the models tested.

Chinese giants, including ByteDance and Alibaba, are placing $16B price of orders for Nvidia’s upgraded H20 AI chips, aiming to get ahead of U.S. export restrictions.

Google appointed Google Labs lead Josh Woodward as the brand new head of consumer AI apps, replacing Sissie Hsiao for the following chapter of its Gemini assistant.

OpenAI announced an authority commission to guide its nonprofit, combining “historic financial resources” with “powerful technology that may scale human ingenuity itself.

The UFC and Meta announced a multiyear partnership, integrating Meta AI, AI Glasses, and Meta’s social platforms into latest immersive experiences for the game.

COMMUNITY

🎥 Join our next live workshop

Join our next workshop on Friday at 4 PM EST with Dr. Alvaro Cintas, The Rundown’s AI professor. By the top of the workshop, you’ll walk away with practical skills and a transparent understanding of the right way to use GPT-4o effectively in your individual work — beyond the hype.

RSVP here. Not a member? Join The Rundown University on a 14-day free trial.

🤝 Share The Rundown, get rewards

We’ll all the time keep this article 100% free. To support our work, consider sharing The Rundown with your pals, and we’ll send you more free goodies.

That is it for today!

Before you go we’d like to know what you considered today’s newsletter to assist us improve The Rundown experience for you.

⭐️⭐️⭐️⭐️⭐️ Nailed it
⭐️⭐️⭐️ Average
⭐️ Fail

See you soon,

Rowan, Joey, Zach, Alvaro, and Jason—The Rundown’s editorial team

LLMs pass legendary Turing test

AI RESEARCH

🧠 LLMs officially pass the Turing test

TOGETHER WITH CONVEYOR

✍️ Let Phil write winning RFPs

ANTHROPIC

🎓 Anthropic brings Claude to higher education

AI TRAINING

🛍️ Create product showcase videos with Kling AI

PRESENTED BY ENCORD

🛠️ Construct multimodal datasets for physical AI

GOOGLE DEEPMIND

🛡️ Google DeepMind publishes AGI safety plan

🛠️ Trending AI Tools

💼 AI Job Opportunities

📰 The whole lot else in AI today

🎥 Join our next live workshop

🤝 Share The Rundown, get rewards

That is it for today!

What are your thoughts on this topic?
Let us know in the comments below.

Share this article

Recent posts

Claude Code costs as much as $200 a month. Goose does the identical thing without spending a dime.

Does Calendar-Based Time-Intelligence Change Custom Logic?

Superb-tune Llama 2 with DPO

Claude Code costs as much as $200 a month. Goose does the identical thing totally free.

The UK government is backing AI scientists that may run their very own experiments

LLMs pass legendary Turing test

AI RESEARCH

🧠 LLMs officially pass the Turing test

TOGETHER WITH CONVEYOR

✍️ Let Phil write winning RFPs

ANTHROPIC

🎓 Anthropic brings Claude to higher education

AI TRAINING

🛍️ Create product showcase videos with Kling AI

PRESENTED BY ENCORD

🛠️ Construct multimodal datasets for physical AI

GOOGLE DEEPMIND

🛡️ Google DeepMind publishes AGI safety plan

🛠️ Trending AI Tools

💼 AI Job Opportunities

📰 The whole lot else in AI today

🎥 Join our next live workshop

🤝 Share The Rundown, get rewards

That is it for today!

What are your thoughts on this topic? Let us know in the comments below.

Share this article

Recent posts

What are your thoughts on this topic?
Let us know in the comments below.