Dataconomy

Gemini Android app beta adds audio file support

The Gemini Android beta includes an option to attach MP3s and talk live about them, but processing often fails or misfires.

by Emre Çıtak
August 5, 2025
in Artificial Intelligence, News

The Gemini Android app beta now lets users attach audio files such as MP3s to chat conversations. Android Authority observed the feature in version 16.30.59.sa.arm64 of the Google app beta, where attaching a file brings up a “Talk live about this” prompt. The option is present, but audio processing in the beta is not yet fully operational.

After attaching an audio file, users can either type a question or select the “talk live” prompt. Current observations indicate that Gemini does not process the audio consistently. In some instances the app ignores the attached file entirely; in others it generates responses unrelated to the audio content, behavior consistent with chatbot hallucinations.

Despite the beta’s limitations, the Gemini API already supports audio input. Developers can use the API to submit audio files and request tasks such as describing the audio content, summarizing spoken information, and transcribing speech. The API also accepts requests scoped to specific timestamps, such as processing the segment from “2:30 to 3:29.” Supported audio formats for the API include MP3, WAV, and FLAC.
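For developers curious what such a request might look like, here is a minimal Python sketch that builds a JSON payload pairing a text prompt with an inline, base64-encoded audio file. The payload shape follows the REST `generateContent` format as commonly documented, but treat it as an assumption and check the current API reference before relying on it. The function name `build_audio_request`, the MIME-type table, and the sample prompt are illustrative and not taken from the article. The sketch only constructs the request body; it does not call the API.

```python
import base64
import os

# Assumed MIME types for the three formats named in the article.
# Verify these against the current Gemini API documentation.
MIME_TYPES = {
    ".mp3": "audio/mp3",
    ".wav": "audio/wav",
    ".flac": "audio/flac",
}


def build_audio_request(audio_bytes: bytes, filename: str, prompt: str) -> dict:
    """Build a request body with a text prompt and inline audio data.

    Hypothetical helper: the field names mirror the REST payload shape
    as we understand it, and may need adjusting for the real API.
    """
    ext = os.path.splitext(filename)[1].lower()
    if ext not in MIME_TYPES:
        raise ValueError(f"unsupported audio format: {ext or filename}")

    return {
        "contents": [
            {
                "parts": [
                    {"text": prompt},
                    {
                        "inline_data": {
                            "mime_type": MIME_TYPES[ext],
                            # JSON cannot carry raw bytes, so the audio
                            # is base64-encoded into an ASCII string.
                            "data": base64.b64encode(audio_bytes).decode("ascii"),
                        }
                    },
                ]
            }
        ]
    }


# Example: ask for a transcript of one segment, as in the article.
payload = build_audio_request(
    b"\x00\x01fake-audio-bytes",  # placeholder, not real MP3 data
    "interview.mp3",
    "Provide a transcript of the speech from 02:30 to 03:29.",
)
```

The resulting dictionary would be serialized as JSON and sent with an API key to the model's `generateContent` endpoint. Inline data suits short clips; larger recordings would more plausibly go through a separate file upload step.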


Audio file attachment in the Gemini Android app is likely still under development at Google, and there is no official confirmation of a launch date. Image upload is already widely available in the Gemini Android app, which suggests audio support is the next step in the app’s capabilities.

