
Google I/O 2024 – Here are the AI highlights that Google revealed

Google's I/O 2024 event kicked off on Tuesday with announcements of several new AI products and developments.

OpenAI may have tried to outshine Google by releasing GPT-4o on Monday, but the Google I/O 2024 keynote was still packed with exciting announcements.

Here's a look at the standout AI advances, new tools, and prototypes Google is experimenting with.

Ask Photos

Google Photos, Google's photo storage and sharing service, will be searchable with natural language queries via Ask Photos. Users can already search for specific objects or people in their photos, but Ask Photos takes this to the next level.

Sundar Pichai, CEO of Google, showed how you could use Ask Photos to recall your car's license plate or get a recap of a child's progress in learning to swim.

Powered by Gemini, Ask Photos understands the context across images and can extract text, create highlight compilations, or answer questions about saved photos.

With more than 6 billion images uploaded to Google Photos every day, Ask Photos needs a large context window to be useful.

Gemini 1.5 Pro

Pichai announced that Gemini 1.5 Pro with a 1M-token context window will be available to Gemini Advanced users. That's roughly 1,500 pages of text, hours of audio, or a full hour of video.

Developers can join a waitlist to try Gemini 1.5 Pro with an impressive 2M-token context window, which is coming to general availability soon. According to Pichai, this is the next step on Google's journey toward its ultimate goal of infinite context.

Gemini 1.5 Pro has also seen a performance boost in translation, reasoning, and coding, and is now truly multimodal, with the ability to analyze uploaded video and audio.
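For developers, these long-context multimodal features are exposed through the Gemini API. Below is a minimal sketch using the google-generativeai Python SDK; the file name lecture.mp4, the API key placeholder, and the polling interval are illustrative assumptions, not details confirmed in the keynote.

```python
# Minimal sketch: asking Gemini 1.5 Pro about an uploaded video.
# Assumes the google-generativeai SDK; "lecture.mp4" is a hypothetical file.
import time

import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")

# Upload the video through the File API so it can be referenced in a prompt.
video = genai.upload_file(path="lecture.mp4")

# Video uploads are processed asynchronously; poll until the file is ready.
while video.state.name == "PROCESSING":
    time.sleep(5)
    video = genai.get_file(video.name)

model = genai.GenerativeModel("gemini-1.5-pro-latest")
response = model.generate_content(
    [video, "Summarize the key points covered in this video."]
)
print(response.text)
```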

Google Workspace

The expanded context and multimodal capabilities make Gemini especially useful when integrated with Google Workspace.

Users can ask Gemini natural language questions about their emails. The demo showed a parent asking for a summary of recent emails from their child's school.

Gemini will also be able to extract highlights from Google Meet meetings up to an hour long and answer questions about them.

NotebookLM – Audio Overview

Google released NotebookLM last year. It lets users upload their own notes and documents, which NotebookLM then becomes an expert on.

This makes it incredibly useful as a research guide or tutor, and Google demonstrated an experimental upgrade called Audio Overview.

Audio Overview takes the input source documents and generates an audio discussion based on their content. Users can join the conversation, using their voice to ask NotebookLM questions and steer the discussion.

There's no word yet on when Audio Overview will be released, but it could be a great help to anyone who needs a tutor or a sounding board to work through a problem.

Google also announced LearnLM, a new family of models based on Gemini and fine-tuned for learning and education. LearnLM will make NotebookLM, YouTube, Search, and other educational tools more interactive.

The demo was very impressive, but some of the missteps from Google's original Gemini launch videos already seem to have crept into this event as well.

AI agents and Project Astra

Pichai says AI agents powered by Gemini will soon be able to perform everyday tasks on our behalf. Google is developing prototype agents that can work across platforms and browsers.

The example Pichai gave was a user instructing Gemini to return a pair of shoes: the agent then works through several emails to find the relevant details, logs the return with the online store, and schedules a courier pickup.

Demis Hassabis introduced Project Astra, Google's prototype conversational AI assistant. The demonstration of its multimodal capabilities gave a glimpse of a future where an AI answers questions in real time based on live video and remembers details from footage it has already seen.

Hassabis said some of these features will roll out later this year.

Generative AI

Google also gave us a look at the generative AI tools for images, music, and video that the company is working on.

Google introduced Imagen 3, its most advanced image generator. It reportedly follows the details of sophisticated prompts more accurately and delivers more photorealistic images.

Hassabis said Imagen 3 is Google's “best model yet for rendering text, which has been a challenge for image generation models.”

Music AI Sandbox is an AI music generator designed as a professional collaborative music creation tool rather than a full-song generator. It looks like a great example of how AI could be used to make good music with a human driving the creative process.

Veo is Google's video generator, which turns text, image, or video prompts into minute-long 1080p clips. It also accepts text prompts to make edits to a video. Will Veo be as good as Sora?

Google will also roll out its SynthID digital watermarking for text, audio, images, and video.

Trillium

All of these new multimodal capabilities demand a lot of computing power to train the models. Pichai introduced Trillium, the sixth generation of Google's Tensor Processing Units (TPUs). Trillium delivers more than four times the compute performance of the previous TPU generation.

Trillium will be available to Google Cloud customers later this year, and Google Cloud will also offer NVIDIA's Blackwell GPUs in early 2025.

AI search

Google will integrate Gemini into its search platform as it moves toward using generative AI to answer queries.

With AI Overviews, a search query returns a comprehensive answer compiled from multiple online sources. This makes Google Search more of a research assistant than just a way to find a website that might have the answer.

Gemini lets Google Search use multi-step reasoning to break down complex, multi-part questions and return the most relevant information from multiple sources.

Gemini's video understanding will soon allow users to query Google Search with a video.

This is useful for Google Search users, but it is likely to send significantly less traffic to the sites Google sources its information from.

Gemini 1.5 Flash

Google announced a lightweight, cheaper, and faster model called Gemini 1.5 Flash. According to Google, the model is “optimized for narrower or high-frequency tasks where the model's responsiveness is most important.”

Gemini 1.5 Flash costs $0.35 per million tokens, far less than the $7 per million you'd pay to use Gemini 1.5 Pro.
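As a back-of-the-envelope comparison, here's a quick sketch of what a given workload would cost on each model at those quoted per-million-token prices; the 50-million-token workload is an invented example.

```python
# Rough cost comparison based on the per-million-token prices quoted above.
PRICE_PER_M_TOKENS = {
    "gemini-1.5-flash": 0.35,  # USD per million tokens
    "gemini-1.5-pro": 7.00,
}

def estimate_cost(model: str, tokens: int) -> float:
    """Estimated cost in USD for processing a given number of tokens."""
    return tokens / 1_000_000 * PRICE_PER_M_TOKENS[model]

# Hypothetical workload: 50 million tokens per month.
tokens = 50_000_000
for model in PRICE_PER_M_TOKENS:
    print(f"{model}: ${estimate_cost(model, tokens):,.2f}")
# gemini-1.5-flash: $17.50
# gemini-1.5-pro: $350.00
```

At twenty times cheaper, Flash looks like the obvious choice for high-volume tasks that don't need Pro's full reasoning ability.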

Each of these developments and new products deserves its own article. We'll post updates as more information becomes available or once we're able to try them for ourselves.
