Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
  • AI
  • Tech
  • Cybersecurity
  • Finance
  • DeFi & Blockchain
  • Startups
  • Gaming
Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
Dataconomy
No Result
View All Result

Apple MGIE brings expected player to AI industry

With Apple MGIE, tech giant breaks its silence on generative AI

byEmre Çıtak
February 6, 2024
in Artificial Intelligence
Home News Artificial Intelligence
Share on FacebookShare on TwitterShare on LinkedInShare on WhatsAppShare on e-mail
Google Preferred Source

The tech giant has unveiled Apple MGIE, a cutting-edge open-source AI model that enables image editing through natural language instructions. MGIE, short for MLLM-Guided Image Editing, harnesses the power of multimodal large language models (MLLMs) to interpret user commands and perform pixel-level manipulations with remarkable accuracy.

The model boasts a wide range of editing capabilities, including Photoshop-style modification, global photo optimization, and local editing. This means that users can effortlessly enhance their images with a simple text command.

The development of MGIE is a result of a groundbreaking collaboration between Apple and a team of researchers from the University of California, Santa Barbara. The model was presented in a research paper accepted at the prestigious International Conference on Learning Representations (ICLR) 2024, a premier platform for AI research. The paper showcases the impressive effectiveness of MGIE in improving automatic metrics and human evaluation, all while maintaining competitive inference efficiency.

Stay Ahead of the Curve!

Don't miss out on the latest insights, trends, and analysis in the world of data, technology, and startups. Subscribe to our newsletter and get exclusive content delivered straight to your inbox.

Apple MGIE
Apple has unveiled Apple MGIE, a cutting-edge open-source AI model for image editing through natural language instructions (Image credit)

What is Apple MGIE?

Apple MGIE, which stands for Multimodal Guided Image Editing, is a system developed by Apple that uses machine learning to allow users to edit images using natural language instructions. This means that instead of having to use complex editing tools or menus, users can simply describe what they want to do to the image, and MGIE will automatically make the changes.

Just like other generative AI image tools such as Midjourney, StableDiffusion, and DALL-E, Apple MGIE bridges the gap between human intention and image manipulation. It leverages the power of multimodal learning, meaning it understands both visual information (the image itself) and textual information (your instructions).

Apple MGIE
Apple MGIE offers a range of editing capabilities, including Photoshop-style modification, global photo optimization, and local editing (Image credit)

How does Apple MGIE work?

A user could say “Make the sky in this image bluer” or “Remove the red car from this photo”, and MGIE would be able to understand and carry out these instructions. MGIE is still under development, but it has the potential to make image editing much easier and more accessible for everyone.

The core concept behind Apple MGIE workflow is as follows:

  • Inputting your commands: You describe your desired edits in plain English, like “Make the trees in this photo taller,” or “Change the color of the dress to blue”
  • Understanding your intent: MGIE’s advanced language model deciphers your instructions, grasping the specific objects, attributes, and modifications you have in mind
  • Visual understanding: simultaneously, MGIE analyzes the image, identifying key elements and their relationships
  • Guided editing: Combining both linguistic and visual understanding, MGIE intelligently manipulates the image to accurately reflect your commands. It doesn’t just blindly follow instructions but can interpret context and make sensible adjustments
Apple MGIE
The model was presented in a research paper accepted at the International Conference on Learning Representations (ICLR) 2024 (Image credit)

How to use MGIE

Apple MGIE has emerged as an open-source project on GitHub, offering a unique approach to image editing through natural language commands. This development allows users to explore and contribute to the project directly.

The project provides full access to its source code, training data, and pre-trained models on GitHub. This transparency enables developers and researchers to understand its inner workings and potentially contribute improvements.

A demo notebook is also available on GitHub, guiding users through various editing tasks using natural language instructions. This serves as a practical introduction to MGIE’s capabilities.

Users can also experiment with MGIE through a web demo hosted on Hugging Face Spaces. This online platform offers a quick and convenient way to try out the system without local setup.

The system welcomes user feedback and allows for refining edits or requesting different modifications. This iterative approach aims to ensure the generated edits align with the user’s artistic vision.

While open-sourcing makes MGIE accessible, it’s important to remember it remains under development. Ongoing research and user contributions will shape its future capabilities and potential applications.


Featured image credit: vecstock/Freepik.

Tags: AppleFeatured

Related Posts

ByteDance launches Doubao 2.1 Pro language model

ByteDance launches Doubao 2.1 Pro language model

June 24, 2026
OpenAI expands cybersecurity efforts with Patch the Planet

OpenAI expands cybersecurity efforts with Patch the Planet

June 24, 2026
Claude Tag brings shared AI assistant to Slack channels

Claude Tag brings shared AI assistant to Slack channels

June 24, 2026
Getty Images partners with OpenAI to supply licensed visuals for ChatGPT

Getty Images partners with OpenAI to supply licensed visuals for ChatGPT

June 23, 2026
Samsung adopts ChatGPT Enterprise and Codex across global workforce

Samsung adopts ChatGPT Enterprise and Codex across global workforce

June 22, 2026
OpenAI improves health responses for free ChatGPT users

OpenAI improves health responses for free ChatGPT users

June 19, 2026

LATEST NEWS

Rockstar confirms GTA 6 pricing and pre-order details

ByteDance launches Doubao 2.1 Pro language model

OpenAI expands cybersecurity efforts with Patch the Planet

Meta launches $299 smart glasses under its own brand

Claude Tag brings shared AI assistant to Slack channels

PlayStation 6 leak points to 2027 release window

BEST AI MODELS LEADERBOARD

See the best AI models, ranked by intelligence, benchmark results, speed and token price. Find the most suitable LLMs, Text-to-Image, Image Editing, Text-to-Speech, Text-to-Video and Image-to-Video  artificial intelligence model for your tasks and business.

LATEST TOOLS

Vrew

Fireflies

SpeedLegal

Teachable Machine

Unriddle

VidAU

Qualified

character.ai

Interview Coder

Moonbeam

Dataconomy

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.

  • About
  • Imprint
  • Contact
  • Legal & Privacy

Follow Us

  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI tools
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
No Result
View All Result
Subscribe

This website uses cookies to improve your experience. You can choose to accept or reject them. Visit our Privacy Policy.