Dataconomy

Google launches Gemini Omni for multimodal video creation

The first release, Gemini Omni Flash, is rolling out now through the Gemini app, Google Flow, and YouTube Shorts.

by Kerem Gülen
May 20, 2026
in Artificial Intelligence, News

Google announced the launch of Gemini Omni, a new model designed to create content from a variety of inputs, with an initial focus on video. The first version, dubbed Gemini Omni Flash, is rolling out today to users of the Gemini app, Google Flow, and YouTube Shorts.

Google describes Gemini Omni as “the next step” beyond its previous models, including Nano Banana and its existing video generator, Veo 3.1. The model lets users combine images, audio, video, and text as input to generate high-quality videos grounded in real-world knowledge.

Editing capabilities allow users to modify videos through natural conversation, building upon previous instructions for consistency in characters and elements. This contrasts with Veo 3.1, which was restricted to generating video content based solely on prompts and images.

Gemini Omni Flash allows users to shoot a video and then request modifications, transforming their initial content into something new. Google stated, “Your video becomes a starting point for something you never could have filmed yourself,” indicating that users can alter actions, add characters, and change settings seamlessly.

https://storage.googleapis.com/gweb-gemini-cdn/gemini/uploads/1f9fd2cb30c7d122190623587be5b7a3e3ce57c1.mp4

Video: Google

The model better understands physical principles such as gravity and kinetic energy to generate more realistic scenes. Gemini Omni integrates knowledge from various domains, including history, science, and cultural context, to enhance storytelling within the generated content.

The model can turn simple prompts into visual explainers that break down complex ideas. At launch, however, audio features will support only voice references.

Gemini Omni can also create a digital avatar based on the user’s appearance and voice. Google emphasized that it has established “clear policies to protect users from harm” for people using its AI tools. Features for editing audio and speech are still in testing.

All content generated with Gemini Omni will incorporate Google’s imperceptible SynthID digital watermark for verification purposes. Users have expressed concerns over the “uncanny valley” effect seen in output quality from Veo 3.1, and it remains to be seen if Gemini Omni’s results will alleviate these issues.

Gemini Omni Flash is now accessible to Google AI Plus, Pro, and Ultra subscribers globally, with rollouts to users of YouTube Shorts and the YouTube Create App anticipated to begin this week.


Tags: Featured, gemini omni, Google


COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.
