Google DeepMind’s latest Gemini model looks amazing—but could signal peak AI hype

“The model is innately more capable,” Sundar Pichai, the CEO of Google and its parent company Alphabet, told MIT Technology Review. “It’s a platform. AI is a profound platform shift, greater than web or mobile. And so it represents an enormous step for us.”

It’s an enormous step for Google, but not necessarily an enormous leap for the sphere as an entire. Google DeepMind claims that Gemini outmatches GPT-4 on 30 out of 32 standard measures of performance. And yet the margins between them are thin. What Google DeepMind has done is pull AI’s best current capabilities into one powerful package. To evaluate from demos, it does many things thoroughly—but few things that we haven’t seen before. For all the excitement in regards to the next big thing, Gemini could possibly be an indication that we’ve reached peak AI hype. At the very least for now.

Chirag Shah, a professor on the University of Washington who focuses on online search, compares the launch to Apple’s introduction of a latest iPhone every 12 months. “Perhaps we just have risen to a distinct threshold now, where this doesn’t impress us as much because we’ve just seen a lot,” he says.

Like GPT-4, Gemini is multimodal, meaning it’s trained to handle multiple sorts of input: text, images, audio. It could mix these different formats to reply questions on the whole lot from household chores to varsity math to economics.

In a demo for journalists yesterday, Google showed Gemini’s ability to take an existing screenshot of a chart, analyze a whole bunch of pages of research with latest data, after which update the chart with that latest information. In one other example, Gemini is shown pictures of an omelet cooking in a pan and asked (using speech, not text) if the omelet is cooked yet. “It’s not ready since the eggs are still runny,” it replies.

Most individuals can have to attend for the complete experience, nonetheless. The version launched today is a back end to Bard, Google’s text-based search chatbot, which the corporate says will give it more advanced reasoning, planning, and understanding capabilities. Gemini’s full release will likely be staggered over the approaching months. The brand new Gemini-boosted Bard will initially be available in English in greater than 170 countries, not including the EU and the UK. That is to let the corporate “engage” with local regulators, says Sissie Hsiao, a Google vp answerable for Bard.

Gemini also is available in three sizes: Ultra, Pro and Nano. Ultra is the full-powered version; Pro and Nano are tailored to applications that run with more limited computing resources. Nano is designed to run on devices, corresponding to Google’s latest Pixel phones. Developers and businesses will have the option to access Gemini Pro starting December 13. Gemini Ultra, essentially the most powerful model, will likely be available “early next 12 months” following “extensive trust and safety checks,” Google executives told reporters on a press call.

“I believe of it because the Gemini era of models,” Pichai told us. “That is how Google DeepMind goes to construct and make progress on AI. So it should all the time represent the frontier of where we’re making progress on AI technology.”

Google DeepMind’s latest Gemini model looks amazing—but could signal peak AI hype

What are your thoughts on this topic?
Let us know in the comments below.

Share this article

Recent posts

How you can Make Your AI App Faster and More Interactive with Response Streaming

Google’s latest AI audio model

Beyond Code Generation: AI for the Full Data Science Workflow

Protecting People from Harmful Manipulation — Google DeepMind

What the Bits-over-Random Metric Modified in How I Think About RAG and Agents

Google DeepMind’s latest Gemini model looks amazing—but could signal peak AI hype

What are your thoughts on this topic? Let us know in the comments below.

Share this article

Recent posts

What are your thoughts on this topic?
Let us know in the comments below.