Google takes on OpenAI with release of Gemini model

The News

Google launched its long-anticipated Gemini AI model on Wednesday, a move the company says puts it at the front of a race long dominated by ChatGPT maker OpenAI.

It’s Google’s attempt to reclaim the lead it gave away after researchers there made the 2017 transformer breakthrough that allowed ChatGPT to exist in the first place. Google said Gemini beats all other AI models on 30 of 32 industry-standard benchmarks, most of which were previously topped by GPT-4, OpenAI’s most advanced model.

To build Gemini, Alphabet’s Google unit pulled together resources and talent from the far reaches of the 190,000-employee company, tapping DeepMind, the startup it acquired in 2014 to develop artificial general intelligence, as well as teams charged with pushing the limits of cloud computing and infrastructure.

“This new era of models represents one of the biggest science and engineering efforts we’ve undertaken as a company,” Alphabet and Google CEO Sundar Pichai said in a statement.

Consumers can test a pared-down version of Gemini starting Wednesday, when it is incorporated into Bard, the company’s chatbot. The most advanced version of Gemini is still undergoing tests to ensure it is safe for customers, the company said.

Eventually, Gemini will filter its way into most Google products, including the company’s experimental generative search engine that could be the future of the company’s bread-and-butter business.

The firm is in a race with Microsoft to augment its suite of products, from documents to spreadsheets to email, with the new technology, eventually allowing people to converse with their computers as much as they click and type.

Know More

The most noticeable difference between Gemini and its competitors is that it is “multimodal,” meaning it was trained on a mixture of text, audio, and video. Other large language models also offer multimodal capabilities, but they do so by combining multiple models, each trained on a single modality.

Google said the “native” multimodal approach gives Gemini better reasoning skills in image analysis.

In one example shared with reporters, Google showed Gemini watching a person’s hands as they perform a magic trick with a quarter. The model first tries to guess which hand holds the quarter and, when it guesses wrong, realizes it was fooled. “The coin is in the left hand, using a sleight of hand technique to make it appear as if the coin has disappeared,” Gemini said.

In another, it’s shown several paper airplane designs by YouTube celebrity and former NASA engineer Mark Rober, who asks it to determine which one will fly most effectively. Gemini correctly determines the best design.

It was also able to watch a video of a normally dressed person mimicking the body movements of Keanu Reeves as his character, Neo, dodges bullets. Gemini correctly guesses that the person is reenacting a scene from The Matrix. Eli Collins, vice president of product for DeepMind, said the model learned that scene from “copyright safe data” found on the open web.
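For a concrete sense of what feeding mixed media to a single model looks like from a developer’s side, here is a minimal sketch using Google’s published google-generativeai Python SDK. The model name matches the launch-era SDK documentation, but the prompt, image file, and API key are hypothetical stand-ins for demos like the ones above.

```python
# A minimal, illustrative sketch of "native" multimodal input: one prompt
# mixing text and an image, sent to a single model via Google's
# google-generativeai Python SDK (pip install google-generativeai).
# The API key and image path are placeholders.
import google.generativeai as genai
from PIL import Image

genai.configure(api_key="YOUR_API_KEY")  # placeholder key

model = genai.GenerativeModel("gemini-pro-vision")
response = model.generate_content([
    "Which of these paper airplane designs will fly farthest, and why?",
    Image.open("designs.jpg"),  # hypothetical image of the designs
])
print(response.text)
```

Passing a list that interleaves strings and images is the SDK’s way of expressing a single multimodal prompt; there is no separate vision pipeline on the caller’s side.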

Google researchers said there had been questions about whether the multimodal approach could perform as well as, or better than, models focused on a single modality, a kind of specialist-versus-generalist debate.

But they said they found their generalist model prevailed. “Gemini sets a new state of the art across a wide range of text, image, audio, and video benchmarks,” they wrote in a paper released Wednesday.

Google said Gemini also beats all other large language models at basic math and can reason about physics.

The company declined to reveal the size of the Gemini model, giving figures only for the smallest version, called Gemini Nano, which can run on Google Pixel smartphones. But the company said it took advantage of new compute capacity built on the latest version of its custom chips, known as Tensor Processing Units.

That is notable because other leading large language models, like OpenAI’s GPT-4 and Anthropic’s Claude, were trained using Nvidia graphics processors, which are in short supply and expensive to operate.

Google said Gemini is designed to run more efficiently on its processors, but declined to provide specific figures.

All three Gemini models, Nano, Pro, and Ultra, will be available to enterprise customers, who can tap their capabilities and offer them to their own clients.
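As a rough sketch of what tapping those capabilities might look like for a developer, here is a text-only call to the mid-sized Gemini Pro model through the same google-generativeai SDK used above; the API key and prompt are placeholders, and enterprise customers would more likely access the models through Google Cloud’s Vertex AI.

```python
# A minimal sketch of a text-only Gemini Pro call through the
# google-generativeai Python SDK; the key and prompt are placeholders.
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")  # placeholder key

model = genai.GenerativeModel("gemini-pro")
response = model.generate_content(
    "Summarize this contract clause in plain English: ..."
)
print(response.text)
```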

Reed’s view

With few exceptions, companies working in the AI industry or offering AI services to their employees say GPT-4 is the undisputed winner in terms of capability.

Benchmarks don’t tell the whole story; those verdicts rest mainly on real-world experience. Companies have their own criteria based on their specific needs, and in pretty much every case, GPT-4 has no close competition.

It could be that Gemini’s claimed success in a wide variety of benchmarks will mean it outperforms GPT-4 in the real world. We won’t know for sure until Google’s model reaches wide distribution and is put to the test by the same companies that have found GPT-4 to be the most capable.

And where Google competitor Microsoft is reliant on OpenAI to develop new models, Google has now shown it is able to build state-of-the-art AI completely in-house. That advantage is especially important after OpenAI CEO Sam Altman was fired last month from the company under mysterious circumstances, only to be rehired after the startup came close to dissolving.

Still, by one practical measure, GPT-4 remains the clear winner. It can ingest about 300 pages of text in a single prompt, a capacity known as the “context window.” That capacity matters for use cases like legal research, which depend on analyzing long documents. Gemini can handle only about a quarter as much text, according to the paper released Wednesday (roughly 32,000 tokens, versus the 128,000 OpenAI offers with GPT-4 Turbo), though this is the first version and the context window will increase, along with other capabilities.

But for most enterprise needs, Gemini Ultra will be overkill, just like GPT-4. Most companies find they can use much smaller, cheaper, and less capable models with the same success. That’s because for business use cases, companies aren’t looking for general-purpose AI. They want models that zero in on data stored on corporate servers.

Today, general-purpose AI models like GPT-4 and Gemini are useful for consumers. But there’s another possible customer for Gemini: startups.

A new generation of AI startups aims to create “agents” that can take autonomous action on behalf of users. Think of them as AI personal assistants. Today, even GPT-4 is insufficient to deliver this experience. Could Gemini, with its multimodal capabilities, allow more ambitious products from AI startups?

We won’t know until startups can use it in earnest, but some of the capabilities Google showed off in demos suggest it may represent a new level of capability.

Even if Gemini is not a game changer right off the bat, it clearly represents a long-term threat to OpenAI’s dominance. When it comes to LLMs like Gemini, Google is something of a sleeping giant awakened by ChatGPT.

Many of Google’s best minds reside within DeepMind, which has been busy on narrower applications of AI. DeepMind accomplishments like AlphaFold, which predicts protein structures, are arguably more impactful than ChatGPT.

Now, DeepMind is focusing its brainpower on general-purpose AI models, and the results are striking. Gemini is only on its first version and already looks like it may have become the industry standard. I can only imagine what Gemini 3 or 4 will look like.

The View From OpenAI

Sometime in the spring or summer of 2024, OpenAI will likely release GPT-5, the next and most advanced version of its large language model. OpenAI has hinted that this will be orders of magnitude better than GPT-4 and, like Gemini, will be multimodal from the ground up.

If GPT-5 achieves artificial general intelligence, the milestone at which AI matches human-level performance on most tasks, it will have left Google in the dust.

But of course, achieving AGI with the next big model is unlikely. What we may see instead is Google and OpenAI leapfrogging each other in capability until one eventually gets there first.

Notable

  • The Information detailed the fits and starts in Google’s launch of Gemini.