[go: up one dir, main page]

AI

The top AI announcements from Google I/O

Comment

Sundar Pichai onstage at Google IO
Image Credits: Google

Google’s going all in on AI — and it wants you to know it. During the company’s keynote at its I/O developer conference on Tuesday, Google mentioned “AI” more than 120 times. That’s a lot!

But not all of Google’s AI announcements were significant per se. Some were incremental. Others were rehashed. So to help sort the wheat from the chaff, we rounded up the top new AI products and features unveiled at Google I/O 2024. 

Google plans to use generative AI to organize entire Google Search results pages.

What will AI-organized pages look like? Well, it depends on the search query. But they might show AI-generated summaries of reviews, discussions from social media sites like Reddit and AI-generated lists of suggestions, Google said.

For now, Google plans to show AI-enhanced results pages when it detects a user is looking for inspiration — for example, when they’re trip planning. Soon, it’ll also show these results when users search for dining options and recipes, with results for movies, books, hotels, e-commerce and more to come.

Project Astra and Gemini Live

Gemini
Image Credits: Google / Google

Google is improving its AI-powered chatbot Gemini so that it can better understand the world around it.

The company previewed a new experience in Gemini called Gemini Live, which lets users have “in-depth” voice chats with Gemini on their smartphones. Users can interrupt Gemini while the chatbot’s speaking to ask clarifying questions, and it’ll adapt to their speech patterns in real time. And Gemini can see and respond to users’ surroundings, either via photos or video captured by their smartphones’ cameras.

Gemini Live — which won’t launch until later this year — can answer questions about things within view (or recently within view) of a smartphone’s camera, like which neighborhood a user might be in or the name of a part on a broken bicycle. The technical innovations driving Live stem in part from Project Astra, a new initiative within DeepMind to create AI-powered apps and “agents” for real-time, multimodal understanding.

Google Veo

Veo
Image Credits: Google

Google’s gunning for OpenAI’s Sora with Veo, an AI model that can create 1080p video clips around a minute long when given a text prompt. 

Veo can capture different visual and cinematic styles, including shots of landscapes and time lapses, and make edits and adjustments to already generated footage. The model understands camera movements and VFX reasonably well from prompts (think descriptors like “pan,” “zoom” and “explosion”). And Veo has somewhat of a grasp on physics — things like fluid dynamics and gravity — which contribute to the realism of the videos it generates. 

Veo also supports masked editing for changes to specific areas of a video and can generate videos from a still image, à la generative models like Stability AI’s Stable Video. Perhaps most intriguing, given a sequence of prompts that together tell a story, Veo can generate longer videos — videos beyond a minute in length.

Ask Photos

Image Credits: TechCrunch

Google Photos is getting an AI infusion with the launch of an experimental feature called Ask Photos, powered by Google’s Gemini family of generative AI models.

Ask Photos, which will roll out later this summer, will allow users to search across their Google Photos collection using natural language queries that leverage Gemini’s understanding of their photo’s content — and other metadata.

For instance, instead of searching for a specific thing in a photo, such as “One World Trade,” users will be able to perform much more broad and complex searches, like finding the “best photo from each of the National Parks I visited.” In that example, Gemini would use signals such as lighting, blurriness and lack of background distortion to determine what makes a photo the “best” in a given set and combine that with an understanding of the geolocation info and dates to return the relevant images.

Gemini in Gmail

Image Credits: TechCrunch

Gmail users will soon be able to search, summarize and draft emails, courtesy of Gemini — as well as take action on emails for more complex tasks, like helping process returns. 

In one demo at I/O, Google showed how a parent could catch up on what was going on at their child’s school by asking Gemini to summarize all the recent emails from the school. In addition to the body of the emails, Gemini will also analyze attachments, such as PDFs, and spit out a summary with key points and action items.

From a sidebar in Gmail, users can ask Gemini to help them organize receipts from their emails and even put them in a Google Drive folder, or extract information from the receipts and paste it into a spreadsheet. If that’s something you do often — for example, as a business traveler tracking expenses — Gemini can also offer to automate the workflow for use in the future.

Detecting scams during calls

Image Credits: Google

Google previewed an AI-powered feature to alert users to potential scams during a call. 

The capability, which will be built into a future version of Android, uses Gemini Nano, the smallest version of Google’s generative AI offering, which can be run entirely on-device, to listen for “conversation patterns commonly associated with scams” in real time. 

No specific release date has been set for the feature. Like many of these things, Google is previewing how much Gemini Nano will be able to do down the road. We do know, however, that the feature will be opt-in — which is a good thing. While the use of Nano means the system won’t be automatically uploading audio to the cloud, the system is still effectively listening to users’ conversations — a potential privacy risk.

AI for accessibility

Image Credits: Google

Google is enhancing its TalkBack accessibility feature for Android with a bit of generative AI magic.

Soon, TalkBack will tap Gemini Nano to create aural descriptions of objects for low-vision and blind users. For example, TalkBack might describe an article of clothing as such: “A close-up of a black and white gingham dress. The dress is short, with a collar and long sleeves. It is tied at the waist with a big bow.”

According to Google, TalkBack users encounter around 90 or so unlabeled images per day. Using Nano, the system will be able to offer insight into content — potentially forgoing the need for someone to input that information manually.

We’re launching an AI newsletter! Sign up here to start receiving it in your inboxes on June 5.

Read more about Google I/O 2024 on TechCrunch

More TechCrunch

The first defense startup to receive backing from Y Combinator, Ares Industries, launched earlier this week. In a post on the YC website, the startup outlined a vision to build…

Y Combinator backs its first defense startup, Ares Industries

Pavel Durov, founder and CEO of messaging app Telegram, was arrested on Saturday evening while leaving his private jet at France’s Le Bourget airport, as initially reported by French television…

Telegram founder Pavel Durov arrested in France

The Port of Seattle, which also operates the Seattle-Tacoma International Airport, said it was hit with a “possible cyberattack” that appeared to affect websites and phone systems. The port first…

The Port of Seattle and Sea-Tac Airport say they’ve been hit by ‘possible cyberattack’

Travly is a new social-first discovery and hotel booking platform designed to cater to the growing number of travelers who rely on short-form video content for trip ideas.  The platform…

Travly lets travelers submit videos for a chance to earn a 5% commission from hotel bookings

As AI developers and others start to think more deeply about how computers and people intersect, Stephan Wolfram says it is becoming a much more of a philosophical exercise

Stephen Wolfram thinks we need philosophers working on big questions around AI

Featured Article

The 12 biggest take-private PE acquisitions so far this year in tech

A roundup of the year’s billion-dollar take-private deals in the technology sector.

The 12 biggest take-private PE acquisitions so far this year in tech

Eruditus, an Indian edtech startup, is in advanced stages of talks to secure about $150 million in new funding, two sources familiar with the matter told TechCrunch, in what would…

TPG nears $150M funding in India’s Eruditus at $2.3B valuation

Apple will be unveiling new products on September 10, with the announced phones going on sale on September 20, according to a report from Bloomberg’s Mark Gurman. That lineup will…

Apple reportedly announcing iPhone 16 lineup and more on Sept. 10

Featured Article

The fallout after Bolt’s aggressive fundraising attempt has been wild

After fintech Bolt surprised the industry with a leaked term sheet that revealed it is trying to raise at a $14 billion valuation, things got weird.

The fallout after Bolt’s aggressive fundraising attempt has been wild

Boeing’s Starliner mission is coming back to Earth — empty. After months of data analysis and internal deliberation, NASA leadership announced today that Starliner will be coming back to Earth…

Starliner will return to Earth uncrewed, astronauts staying on ISS until February

A surprising number of “iPad kids” — aka Generation Alpha’s 7- to 9-year-old demographic — are using X, according to new data from parental control software maker Qustodio. The firm…

Do you know where your children are? Maybe on X

This week, Google joined a $250 million deal with the state of California to support California newsrooms. While the deal offers a much-needed cash infusion for an industry that’s seen…

Google just made a $250M deal with California to support journalism — here’s what it means

A court order recently forced Elon Musk’s X to reveal its full list of shareholders, as of June 2023, to the public. Many of the recognizable tech industry names had…

X shareholders as of June 2023 included funds tied to Bill Ackman, Binance, and Sean ‘Diddy’ Combs

Featured Article

VCs are so eager for AI startups, they’re buying into each others’ SPVs at high prices

VCs are increasingly buying shares of late-stage startups on the secondary market as they try to get pieces of the hottest ones — especially AI companies. But they are also increasingly doing so through financial instruments called special purpose vehicles (SVPs). Some of those SPVs are becoming such hot commodities…

VCs are so eager for AI startups, they’re buying into each others’ SPVs at high prices

Featured Article

The top AI deals in Europe this year

Cumulatively, there have been more than 1,700 funding rounds for AI startups in Europe so far in 2024.

The top AI deals in Europe this year

After two years of building the company, the company quietly launched its beta in June and is officially announcing it today, right here, in TechCrunch. 

The founder building a wealth-management product her grandmother would have loved

From the looks of things, companies in the category — including Agility Robotics and Formlogic — can’t hire quickly enough.

These 74 robotics companies are hiring

Automatically disappearing posts on social networks could be handy for users who have a habit of deleting their posts through third-party tools, or if the context of those posts is…

Threads confirms it is experimenting with ephemeral posts

Two former OpenAI researchers who resigned this year over safety concerns say they are disappointed but not surprised by OpenAI’s decision to oppose California’s bill to prevent AI disasters, SB…

‘Disappointed but not surprised’: Former employees speak on OpenAI’s opposition to SB 1047

Neil Mehta, the VC behind the acquisition of a string of properties on San Francisco’s tony Fillmore Street, made waves earlier this week for reportedly throwing long-established local restaurants to…

VC Neil Mehta, who’s quietly nabbing prized SF property, plans a “Y Combinator for restaurants”

RealPage, which makes property management software, was sued Friday by the U.S. Justice Department and eight attorneys general for allegedly helping apartment and building managers around the country collude to…

Justice Department sues RealPage over allegedly helping landlords collude to drive up rents

Colorful Capital’s co-founders, William Burckart and Megan Kashner, declined to comment. 

Colorful Capital will stop trying to raise for a fund

Andrew Ng is stepping down from his role as CEO at Landing AI, the computer vision platform he founded in 2017. Dan Maloney, formerly the COO, will take the reins…

Andrew Ng steps back at Landing AI after announcing new fund

AI models are being applied to every dataset under the sun, but are inconsistent in their outcomes. This is as true in the medical world as anywhere else, but a…

Piramidal’s foundation model for brainwaves could supercharge EEGs

No two businesses are the same, and that’s good news: As we saw again this week, it opens up space for companies to try opposite approaches, join forces or challenge…

M&A can open up the playing field for the competition

Featured Article

Marc Andreessen’s family plans to build a ‘visionary’ subdivision near the proposed California Forever utopia city

Marc Andreessen’s family is planning to build a large housing development near the proposed California Forever city.

Marc Andreessen’s family plans to build a ‘visionary’ subdivision near the proposed California Forever utopia city

EV startup Canoo’s chief technology officer Sohel Merchant has left the company, two people familiar with his departure have told TechCrunch. Merchant was one of the members of Canoo’s founding…

Canoo’s chief technology officer is out amid wider reorg

A company spokesperson for the oil drilling and fracking giant declined to name the executive overseeing cybersecurity, if any.

Halliburton shuts down systems after cyberattack

The move is an effort to squeeze additional revenue from second-hand products, over concerns that cheaper, slightly used bikes, treadmills and rowers could cannibalize used sales.

Peloton adds $95 activation fee for used equipment

Time is running out! These are the last hours to save up to $600 on TechCrunch Disrupt 2024 tickets — offer ends tonight at 11:59 p.m. PT. Join 10,000+ startup…

Last day for massive ticket savings to TechCrunch Disrupt 2024