Dataconomy

Apple MGIE brings expected player to AI industry

With Apple MGIE, tech giant breaks its silence on generative AI

by Emre Çıtak
February 6, 2024
in Artificial Intelligence

Apple has unveiled MGIE, an open-source AI model that edits images based on natural language instructions. MGIE, short for MLLM-Guided Image Editing, uses multimodal large language models (MLLMs) to interpret user commands and perform pixel-level manipulations.

The model boasts a wide range of editing capabilities, including Photoshop-style modification, global photo optimization, and local editing. This means that users can effortlessly enhance their images with a simple text command.

MGIE was developed by Apple in collaboration with a team of researchers from the University of California, Santa Barbara. The model was presented in a research paper accepted at the International Conference on Learning Representations (ICLR) 2024. The paper reports that MGIE improves results on automatic metrics and in human evaluation while maintaining competitive inference efficiency.

What is Apple MGIE?

Apple MGIE, which stands for MLLM-Guided Image Editing, is a system developed by Apple that uses machine learning to let users edit images with natural language instructions. Instead of working through complex editing tools or menus, users describe what they want done to the image, and MGIE makes the changes automatically.

Like other generative AI image tools such as Midjourney, Stable Diffusion, and DALL-E, Apple MGIE bridges the gap between human intention and image manipulation. It relies on multimodal learning, meaning it understands both visual information (the image itself) and textual information (your instructions).

How does Apple MGIE work?

A user could say “Make the sky in this image bluer” or “Remove the red car from this photo”, and MGIE would be able to understand and carry out these instructions. MGIE is still under development, but it has the potential to make image editing much easier and more accessible for everyone.

The core Apple MGIE workflow is as follows:

  • Inputting your commands: You describe your desired edits in plain English, like “Make the trees in this photo taller,” or “Change the color of the dress to blue”
  • Understanding your intent: MGIE’s advanced language model deciphers your instructions, grasping the specific objects, attributes, and modifications you have in mind
  • Visual understanding: Simultaneously, MGIE analyzes the image, identifying key elements and their relationships
  • Guided editing: Combining both linguistic and visual understanding, MGIE intelligently manipulates the image to accurately reflect your commands. It doesn’t just blindly follow instructions but can interpret context and make sensible adjustments
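
The four steps above can be made concrete with a deliberately simplified sketch. This is not MGIE's code or API: the "image" is a toy dictionary of objects rather than pixels, the language model is replaced by a single regular expression, and every name here (EditIntent, parse_instruction, analyze_scene, apply_edit, edit_scene) is hypothetical and chosen only for illustration.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class EditIntent:
    """What the user wants changed (hypothetical structure, not from MGIE)."""
    target: str      # object to edit, e.g. "dress"
    attribute: str   # attribute to change, e.g. "color"
    value: str       # requested new value, e.g. "blue"


def parse_instruction(instruction: str) -> EditIntent:
    """Step 2, understanding intent: a regex stands in for the language model."""
    text = instruction.strip().rstrip(".").lower()
    match = re.fullmatch(r"change the (\w+) of the (\w+) to (\w+)", text)
    if match is None:
        raise ValueError(f"Unsupported instruction: {instruction!r}")
    attribute, target, value = match.groups()
    return EditIntent(target=target, attribute=attribute, value=value)


def analyze_scene(scene: dict) -> set:
    """Step 3, visual understanding: list the objects present in the toy scene."""
    return set(scene)


def apply_edit(scene: dict, intent: EditIntent) -> dict:
    """Step 4, guided editing: return an edited copy, leaving the input untouched."""
    edited = {name: dict(attrs) for name, attrs in scene.items()}
    edited[intent.target][intent.attribute] = intent.value
    return edited


def edit_scene(scene: dict, instruction: str) -> dict:
    """Run the steps in order: instruction in, edited scene out."""
    intent = parse_instruction(instruction)   # steps 1 and 2
    objects = analyze_scene(scene)            # step 3
    if intent.target not in objects:
        raise ValueError(f"No {intent.target!r} found in the scene")
    return apply_edit(scene, intent)          # step 4


if __name__ == "__main__":
    photo = {"dress": {"color": "red"}, "tree": {"height": "short"}}
    print(edit_scene(photo, "Change the color of the dress to blue"))
```

The point of the sketch is the separation of concerns: the instruction and the image are interpreted independently, and the edit is applied only once the requested target has been found in the scene.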

How to use MGIE

Apple MGIE has emerged as an open-source project on GitHub, offering a unique approach to image editing through natural language commands. This development allows users to explore and contribute to the project directly.

The project provides full access to its source code, training data, and pre-trained models on GitHub. This transparency enables developers and researchers to understand its inner workings and potentially contribute improvements.

A demo notebook is also available on GitHub, guiding users through various editing tasks using natural language instructions. This serves as a practical introduction to MGIE’s capabilities.

Users can also experiment with MGIE through a web demo hosted on Hugging Face Spaces. This online platform offers a quick and convenient way to try out the system without local setup.

The system welcomes user feedback and allows for refining edits or requesting different modifications. This iterative approach aims to ensure the generated edits align with the user’s artistic vision.

While open-sourcing makes MGIE accessible, it’s important to remember it remains under development. Ongoing research and user contributions will shape its future capabilities and potential applications.


Featured image credit: vecstock/Freepik.
