Photo by Solen Feyissa / Unsplash

Unleashing Gemini Live: Google's New AI Chatbot that's Revolutionizing Virtual Conversation

AI Aug 17, 2024

An Introduction to Google's Software Marvel: Gemini Live

This week, Google unveiled a slew of new tech products, including the much anticipated Pixel 9 smartphones and innovative wireless earbuds. Underpinning all of these high-tech gadgets is another tech marvel named Gemini - an artificially intelligent assistant also from Google. Launched earlier in the year, this AI chatbot forms the core conversational interface in the Pixel 9 series and is swiftly making its way to millions of Android phones across the globe. The newer and increasingly remarkable feature now being rolled out is called Gemini Live.

Google’s answer to OpenAI's GPT-4o, Gemini Live enables a natural conversation with the assistant. Its aim is to mimic a chat between two humans and remove any unnatural identifiers that it's speaking with a programmed AI. Available currently in English for Gemini Advanced subscribers at $20 per month, it can be accessed via a Live button located bottom right in the Gemini app. An iOS version and translation into multiple languages are planned releases in the coming weeks.

Genesis and Evolution of Gemini

Sissie Hsiao, VP of Gemini experiences at Google, explained to WIRED that Gemini is not merely a rehash of the Google Assistant but a completely rebuilt interface, harnessing the power of generative AI. "The continuous user feedback over the years has indicated a demand for a more natural and capable assistant. One that holds conversations naturally without language restructuring and is equipped to address complex life issues beyond simple tasks", Hsiao iterated.

On launching Gemini, users are presented with a blank screen, with an ethereal light glowing from the bottom. Starting a conversation is as simple as speaking to the assistant, operable even with a locked screen or when accessed through Google's latest Pixel Buds Pro 2. This offers the freedom to interact while on the go or when the phone is tucked away in a bag. There are ten voices available in various tones, accents, and styles, with a transcription of the entire conversation accessible at any point within the app.

Features and Capabilities of Gemini Live

Unlike previous voice assistants, Gemini Live continues its conversation seamlessly even if interrupted. It aims to communicate with other apps via extensions, some of which are pending release. Planned features include pulling up a party invitation from Gmail and enquiring about its details, searching for recipes, and adding ingredients to a shopping list on Google Keep. Google has promised an extension of these features to include its other apps like Keep, Tasks, Utilities, Calendar, and YouTube Music in the near future.

Later this year, Google plans to integrate Gemini Live with Project Astra, a computer vision technology that it previewed at a developer conference in May. This will enable real-time information about objects in the physical world through the phone's camera. One could note down dates from a concert poster in your calendar and set reminders for ticket purchases.

First-hand Experience

Initiation of a conversation with Gemini Live may feel slightly odd at first, given that previous experiences with voice assistants have been largely transactional. But as you grow accustomed to it, the flow of the conversation feels more natural. There are a few concerns, such as the lack of direct attribution for the surfaced information. However, Google has assured that users can cross-check the information by clicking on the little "G" icon underneath the transcribed text and running their own Google searches.

Where does Gemini Live differ from Google Assistant?

Perhaps you're wondering: What's the difference between Gemini Live and Google Assistant? For now, Google Assistant is still available as an option, but its features are poised to be upgraded with Gemini's larger language models. Maximizing user convenience, Gemini will act as your personal assistant, helping with calendar appointments and email invites. In contrast, Google Assistant will serve as a "communal" assistant, suitable for family usage at home.

While it may seem confusing having two Google-made voice assistants, the company assures it's aiming to provide the most helpful assistance by solving user's use cases, regardless of the device they're using. The branding and specific device functions are still being explored, and we can expect further developments in the near future.

Support for Google's New Launches

Pixel 9 Series: Starting at $799
Pixel 9 Pro & XL: Starting at $999
Pixel 9 Pro Fold: Starting at $1,799

Note: Commissions may be earned from purchases made through links in our stories, contributing to our journalism.

Tags

Suiradybedam Tobami

Software Automation Engineer