Google just gave us a tantalizing glimpse into the future of AI agents

  • At the Google I/O conference, CEO Sundar Pichai teased where AI is headed next.

  • It's going to be all about agents that are better at reasoning and can act on our behalf.

  • What if Google did the searching for you?

What if the Google Assistant was actually … an assistant?

It's a question the company finally started answering Tuesday at the 2024 Google I/O conference, where the search giant fired off AI announcements by the dozen.

With Google and other artificial-intelligence companies making advances in how their systems ingest images, video, sound, and text, we're beginning to see these systems evolve from smart chatbots into more sophisticated tools that can take on more of the hard work.

It's something Google CEO Sundar Pichai is thinking a lot about right now.

"I think about AI agents as intelligent systems that show reasoning planning and memory," Pichai said this week at a roundtable with reporters ahead of the developer show.

"They're able to think multiple steps ahead and work across software and systems, all to get something done on your behalf and, most importantly, with your supervision," he added.

In short: AI agents have the best chance of taking this technology from a nice-to-have to a need-to-have.

Project Astra

Pichai said Google was "in the very early days" of developing this but promised we'd see glimpses of "agentic direction" across Google's products at the conference. And there were several, but the big one people will be talking about is Project Astra.

Astra is a vision of what the Google Assistant should have been all along. You can also think of it as a smarter version of Google Lens, one that uses real-time computer-vision capabilities to let you ask it questions about what you can see and hear around you.

"We've always wanted to build a universal agent that will be useful in everyday life," Google DeepMind's chief, Demis Hassabis, said. "Imagine agents that can see and hear what we do better, understand the context we're in, and respond quickly in conversation, making the pace and quality of interaction feel much more natural."

Google showed a demo of someone holding their phone up with the camera on and asking the AI voice assistant questions about what it was seeing. For example, they pointed it out the window and asked, "What neighborhood do you think I'm in?" The assistant correctly placed them near Google's King's Cross office in London.

Hassabis stressed that this demo video was recorded in real time. Google got a lot of blowback in December after a Gemini AI-model demo turned out to be edited, so Google needed to emphasize it could really do this — especially after OpenAI showed off a similar demo on Monday.

If there really is no manipulation here, Astra is certainly impressive and will probably be the big takeaway of the show. But there are other ways these AI agents will emerge in the nearer term.

Google teased a combination of updates that would soon make its Gemini AI chatbot more capable and proactive. Some of this is being unlocked as Google continues to increase the context window, which is the amount of information a large language model can ingest at a time.

Say you want to know something buried deep in a series of very long documents that you don't want to spend hours sifting through. With a large context window, you could share all the documents with Gemini and then ask questions. The model would then answer quickly based on all the information it just ingested.

Tentacles and tailoring

But it's in its legacy products that Google really has an edge when it comes to enabling agentlike qualities.

With its "tentacles" already in many aspects of our lives, from email to search to maps, Google can synthesize all its knowledge about users and the world around them to not just answer queries but also tailor responses.

"My Gemini should really be different than your Gemini," Sissie Hsiao, the head of Gemini and Google Assistant, said.

Later this year, Gemini will be able to plan your vacation with a much more granular level of detail, Hsiao said. The idea is that you'll be able to plug in all your specific demands (you like to hike, you hate it when it's too hot, and you're allergic to shellfish) and Gemini will return a detailed itinerary. Chatbots can already do this type of thing, but, if permitted, Gemini will have access to your flight information and travel confirmations in Gmail, and perhaps your hotel booking, and can use all of this to inform its answers.

"An AI assistant should be able to solve complex problems, should be able to take actions for you, and also feel very natural and fluid when you engage with it," Hsiao said.

[Image: Demo of Google Lens. Google]

Much of what agents will do is remove steps and shorten tasks. Google is also thinking about this when it comes to Search, as the company evolves its most precious product to spit out answers made using generative AI.

It's rolling out a version of Gemini built specifically for Search, combining its knowledge of the web with the AI model's multimodal abilities and giant context window.

Liz Reid, Google's head of Search, showed an example of Google Lens being used to take a video of a record player. The user asked Gemini why the arm wasn't staying in place, and Google responded with exact instructions for that specific turntable.

As ever, Google has some confusing branding to sort out: Lens, Assistant, Gemini, Astra. Ultimately, though, much of this is likely to merge together. On Tuesday, Google gave us clues about what that logical conclusion would look like.

As Reid said, "This is a way for Google to do the searching for you."
