The First GPT-4(o) Image


Good morning. It’s Friday, May seventeenth.

Did you know: 15 years ago today, Minecraft was released?

You read. We listen. Tell us what you think by replying to this email.

Today’s trending AI news stories

OpenAI Shares First Image Generated By GPT-4o

Image (including text) was generated by GPT-4o

OpenAI’s president Greg Brockman recently shared the first image generated by the brand new GPT-4o model, which debuted on Monday. This photorealistic image includes a person wearing an OpenAI logo T-shirt, wiping a blackboard with chalk text discussing the model’s capabilities. GPT-4o improves upon previous models by being faster, cheaper, and better at retaining information from multimedia inputs.

Unlike its predecessors, which converted media to text, GPT-4o was trained on multimedia tokens, allowing it to directly analyze and interpret vision and audio. This new approach results in higher quality and accuracy in image generation, as evidenced by the comparison to DALL-E 3. While the general public cannot yet access GPT-4o’s image generation features, OpenAI is working to make them available soon. Read more.

ElevenLabs Launches AI-Voiced Screen Reader App

ElevenLabs, a startup known for its AI dubbing, has launched a free iPhone app called ElevenLabs Reader: AI Audio. The app can recognize and voice text from web pages, PDFs, and other documents using 11 different voices. Available for download in the App Store, it builds on ElevenLabs’ voice cloning technology, which uses neural networks to generate audio from short samples. Founded in 2022 by former Google engineer Piotr Dabkowski and ex-Palantir strategist Mati Staniszewski, ElevenLabs raised $80 million in January, valuing it at $1.1 billion.

The company aims to expand its reach, with deals including creating multilingual audiobooks for HarperCollins and plans to market its dubbing tech to YouTube creators, movie studios, and news publishers. Read more.

OpenAI Signs Reddit Deal To Train AI On Your Posts

In a data-driven alliance, OpenAI gains access to Reddit’s real-time firehose via its API. This fuels both ChatGPT development and future OpenAI products, mirroring Reddit’s prior pact (rumored at $60 million) with Google for AI-powered features. OpenAI also joins Reddit’s ad ecosystem, and is likely to leverage Reddit’s conversation troves for model training.

While financial details remain under wraps (unlike Google’s deal), Reddit emphasizes fostering community engagement. However, past restrictions on data scraping raise eyebrows, and the OpenAI CEO’s Reddit activity (including past moderation disputes) adds intrigue. Still, Reddit CEO Steve Huffman sees a win-win, highlighting the platform’s value as an open forum. Read more.

Etcetera: Stories you may have missed

10 new AI-powered tools from around the web

UserCall provides AI-driven voice interviews, delivering detailed user feedback faster and more efficiently than surveys. Customizable agents generate insightful follow-up questions for deeper evaluation.

FISKL offers AI-powered accounting with Stripe integration, multi-currency support, and automatic bank sync for global businesses. It simplifies invoicing, payments, and tax tracking.

Reap is an AI tool that transforms long videos into social-ready clips for platforms like Reels, Shorts, and TikToks, automating editing processes.

Oliv AI is an AI copilot for account executives, automating CRM tasks, MEDDIC scorecards, meeting prep, and personalized follow-up emails to save time.

Jovu by Amplication is an AI platform that generates production-ready code in minutes, automating backend development with consistent, scalable, and high-standard solutions.

Glato AI automates video creation by generating scripts, using digital clones for UGC-style ads, and adding b-roll and effects, simplifying content production for marketers.

Edge Delta processes observability data at the source, providing real-time AI insights, cost optimization, and scalability for $0.20 per GB.

Scalenut’s AI Link Manager automates internal linking by detecting, fixing, and deploying links with a click, improving SEO efficiency and website authority.

Iris.ai is a software suite for analyzing, summarizing, and managing research data from various sources, providing tools for efficient research management.

Collato transforms meeting transcripts, images, and audio recordings into documentation, streamlining the process for users by automating write-ups from Google Meet.

arXiv is a free online library where researchers share pre-publication papers.

Your feedback is valuable. Reply to this email and tell us how you think we could add more value to this newsletter.
