close
close

File uploads get Gemini Live chat support

File uploads get Gemini Live chat support

Like ChatGPT, Google Gemini allows you to upload all kinds of files that you want the AI ​​to analyze. It is part of the multimodality support available through these major language models. You can interact with AI with text and voice and use files in your directions.

On that note, Google Gemini also has its equivalent to ChatGPT’s advanced voice mode. It’s called Gemini Live and works as Advanced Voice Mode. The AI ​​sounds a lot like a human during voice chats, making interacting with the AI ​​easier than writing text prompts. The feature is even more useful when using the mobile app.

In the future, the Google Gemini app may be able to direct you to a Gemini Live session once it detects files have been added to a chat. The feature isn’t available yet, but could be available soon Twin users. Traces of it have just been discovered in the code of a beta version of the Googling app for Android.

According to Android Authorityversion 15.45.33.ve.arm64 beta of the Google app contains text strings indicating that Gemini Live will soon be ready to talk to you about your files.

Apparently the app knows that you are uploading a file. It will suggest that you give your directions to Gemini Live. You might see things like “Open Live,” “Talk About Attachment,” or “Open Live with Attachment.”

The feature is not functional in the app and you cannot activate Live with file uploads even if you are using this beta version. However, it looks like Google is getting ready to roll out Gemini Live file upload support.

That’s definitely the kind of feature we want from multimodal AIs that can support features like Advanced Voice Mode. I mainly use ChatGPT for my AI chatbot needs, and that’s where I upload files that I need the AI ​​to work for me.

I usually provide text prompts for the files I upload, instructing ChatGPT what to do with them. A chat conversation may follow, because I may have additional questions or wishes.

The ability to do all that with your voice should further improve these interactions. Instead of typing prompts, I talked to ChatGPT, and the Advanced Voice Mode simply responded in a human conversational tone. The feature would be even more useful in the mobile version of the app, where talking to the AI ​​is easier than typing.

You don’t need the app to automatically activate voice mode when uploading files. But Google is still on the right track here. If you’re going to communicate with the AI ​​about these attachments using your voice, why not have the option to do this from the start?

Perhaps the ChatGPT mobile apps need a similar feature for file uploads on mobile devices.

Separately, I would like to remind you that Google also underwent a soft launch a standalone Gemini app for iPhone. The app is being tested in some markets, complete with Gemini Live support. Once the iPhone app is generally available, it will likely offer the same features as the Google app for Android. That should include this newly discovered ability to initiate voice chats when uploading files.