Nvidia’s Music Generator Creates ‘Never before heard sounds’

-

In partnership with

Good morning. It’s Wednesday, November twenty seventh.

Did you already know: We’re skipping the history facts today for this: Marc Andreessen’s interview on The Joe Rogan Experience yesterday is a must-watch for technologists.

  • Neuralink Trial

  • Runway’s Custom Worlds

  • NVIDIA’s Fugatto Music Generator

  • Text to 3D Assets

  • Perplexity’s Hardware Play

  • OpenAI Sora Leak

  • 3 Recent AI Tools

  • Latest AI Research Papers

You read. We listen. Tell us what you think that by replying to this email.

The long run of presentations, powered by AI

Gamma is a contemporary alternative to slides, powered by AI. Create beautiful and interesting presentations in minutes. Try it free today.

Today’s trending AI news stories

Musk’s Neuralink to launch feasibility trial with brain implant, robotic arm

Neuralink, Elon Musk’s neurotech enterprise, is moving the needle with a feasibility trial for its brain-computer interface and surgical robotic arm. Constructing on its PRIME study, the trial focuses on patients with quadriplegia, enabling device control via neural signals. Early U.S. participants have already showcased thought-driven feats like gaming, web navigation, and 3D design, offering a glimpse of its transformative potential.

Across the border, Health Canada has greenlit Neuralink’s first international trial, recruiting six participants to check the implant’s safety and real-world utility. By bridging neural activity with external systems, the corporate goals to ascertain a blueprint for brain-machine interoperability. As Neuralink refines its tech stack and expands its trials, the implications could ripple far beyond assistive tech, positioning it on the nexus of neuroscience and engineering innovation. Read more.

Runway launches Frames — a brand new AI image generator that creates custom worlds

Runway’s latest foundation model, Framesredefines image generation with precise stylistic control and heightened visual fidelity. By resolving the persistent challenge of maintaining consistency across creative outputs, it enables users to design immersive, cohesive visual worlds with remarkable accuracy.

Available through Gen-3 Alpha and the Runway API, Frames demonstrates its capability in various applications, from retro album art to highly stylised compositions. Its combination of realism and aesthetic detail provides creative professionals with a complicated toolset for creating visually cohesive and interesting images. Read more.

Nvidia’s latest music generation model Fugatto creates ‘never before heard sounds’

Nvidia’s Fugatto, the Foundational Generative Audio Transformer Opus 1, pushes the boundaries of audio synthesis by mixing and reinterpreting sound in ways previously unimagined. It doesn’t just generate music; it morphs existing audio—turning a piano melody right into a human voice or transforming a recording’s mood and accent. Fugatto’s ability to fuse distinct sounds, like a train’s rumble with orchestral music, produces truly original soundscapes.

Although trained on hundreds of thousands of open-source samples, Nvidia is holding back public access, citing concerns over safety and copyright risks. With comparisons to the disruptive influence of synthesizers, Fugatto positions itself as a robust tool for reshaping not only music, but in addition broader creative fields like gaming and media production. Nevertheless, its release can be cautious—balancing innovation with responsibility. Read more.

Nvidia’s Edify 3D turns text and pictures into 3D assets

Nvidia’s Edify 3D is a game-changer in asset creation, turning text or images into fully realized 3D models and textures in under two minutes. Using a diffusion model, it generates multiple views of an object, which a reconstruction model then weaves together into a refined, topologically sound 3D asset.

The outcomes are high-quality meshes with UV maps, ready for refinement. It doesn’t stop at individual objects; Edify 3D can create entire 3D environments by stitching related assets into cohesive scenes. This guarantees to streamline workflows for industries like gaming, AR, and film production. While the technology impresses, Nvidia’s silence on public release raises questions on when, or if, this powerhouse tool will make its approach to the masses. Read more.

Perplexity weighs a step into the hardware game

Perplexity, the AI-centric search engine, is toying with the thought of hardware via a compact, sub-$50 device geared toward facilitating seamless voice-based Q&A exchanges. Aravind Srinivas, its founder, catalysed interest through a social media challenge, promising development upon reaching 5,000 likes.

This reflects the AI industry’s growing hardware fixation, from MidJourney’s team to OpenAI’s Jony Ive project. Yet pitfalls abound—Rabbit’s R1 quickly oversaturated, and Humane’s AI Pin collapsed after poor sales and recalls.

While Perplexity’s coffers are flush—rumoured to be bolstered by a $500 million raise—success on this domain demands greater than ambition. Historical missteps by others function a cautionary tale. Read more.

OpenAI’s Sora video generator appears to have leaked

A gaggle of early beta testers has leaked access to OpenAI’s Sora video generator, sparking a pointy critique of the corporate’s early access practices. Through a frontend built on Hugging Face, users were capable of generate short video clips from text prompts.

In an open letter, the group accuses OpenAI of exploiting artists for promotional purposes, with creative freedoms severely limited. They claim that each one outputs must receive OpenAI’s approval before being shared, stifling real artistic expression.

An OpenAI spokesperson clarified that artists haven’t any formal obligations beyond using Sora “responsibly” and maintaining confidentiality, though the corporate shunned defining what constitutes responsible use or which details are considered confidential.

The leaked version of Sora appears to be a “turbo” variant, faster and with indications of favor control and limited customization. Read more.

3 latest AI-powered tools from around the net

Looking for impartial news? Meet 1440.

Each day, 3.5 million readers turn to 1440 for his or her factual news. We sift through 100+ sources to bring you an entire summary of politics, global events, business, and culture, all in a transient 5-minute email. Enjoy an impartial news experience.

arXiv is a free online library where researchers share pre-publication papers.

Your feedback is invaluable. Reply to this email and tell us how you think that we could add more value to this text.

Thinking about reaching smart readers such as you? To turn into an AI Breakfast sponsor, reply to this email or DM us on X!

ASK DUKE

What are your thoughts on this topic?
Let us know in the comments below.

0 0 votes
Article Rating
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments

Share this article

Recent posts

0
Would love your thoughts, please comment.x
()
x