Another response from Google to GPT-4o in the form of a mobile app update via the Gemini Advanced subscription. Gemini Live is a new conversational experience for mobile phones that will help the user achieve anything they want, like learn any subject, create images or even program.
Gemini Live is coming this summer to have voice conversations or use text, images and audio for complete multimodal interaction. Google AI will respond naturally to be one of the most anticipated experiences of the coming months.
You will have the opportunity to understand what you see on video by the end of this year to respond in real time. The other big new feature in Gemini Live are Gems, which can be created to personalize the AI based on what the user wants for a specific topic. Similar to ChatGPT GPTs.
This summer, we’re expanding Gemini’s multimodal capabilities, including the ability to have an in-depth two-way conversation using your voice. This new experience is called Live. #GoogleIO pic.twitter.com/eAZbaO5WKz
-Google Google) May 14, 2024
A “Gem” is created to access a personalized experience and thus access an AI that will behave as a chef, Pilates trainer or calculus teacher or mathematics.
Travel plans are other examples of Gemini Live via Gemini Advanced, Google’s subscription plan. The user can give all the details with a detailed text of the trip they want to take soon with their family.
Gemini collects all information from Google Maps or Gmail emails to generate a complete vacation plan with the return flight, restaurants for lunch or dinnerthe hotel where they will stay and recommendations on what times they should wake up according to the plan for the day.
All powered by Gemini 1.5 Pro, the big generative AI update that now offers the user the option to download a 1,500 page PDF document, several files on different projectsread up to 30,000 lines of code or summarize an hour-long video.
Starting today, Gemini Advanced gives you access to our next-generation AI model, 1.5 Pro, with a million token pop-up. Upload your documents (up to 1,500 pages) so you can solve more complex problems than ever before. https://t.co/oES28UZ4n0 #GoogleIO pic.twitter.com/lKpmFF1Aqw
-Google Google) May 14, 2024
Also will have the ability to offer several natural voices and you can choose the one you want for Gemini to respond. In its customization, it allows you to speak at the user’s own pace or to interrupt them in the middle of an answer to ask another question. In other words, the experience is like talking to a person.
A full-fledged assistant for you Offer tips for public speaking or a job interview. An incredible experience, although limited via Gemini Advanced, the paid subscription included in the different Google One AI Premium plans, and which differs from the free OpenAI with GPT-4o, although it also has its limitations.