Dataconomy

Apple’s Pico-Banana-400K dataset could redefine how AI learns to edit images

The dataset was built using an automated pipeline powered by Google’s Nano-Banana and Gemini-2.5-Pro, eliminating the need for human annotators.

by Kerem Gülen
November 4, 2025
in Research

Apple has released Pico-Banana-400K, a massive, high-quality dataset of nearly 400,000 image editing examples. The new dataset, detailed in an academic paper posted on October 23, 2025, was built by Apple researchers including Yusu Qian, Jialing Tong, and Zhe Gan. This matters because the AI community has been held back by a lack of large-scale, open, and realistic datasets. Most previous datasets were either synthetic, low-quality, or built with proprietary models. Apple’s new resource, which is built from real photographs, is designed to be a robust foundation for training the next generation of text-guided image editing models, from simple touch-ups to complex, multi-step creative projects.

How Pico-Banana-400K was built

Instead of the old, expensive method of paying humans to manually edit hundreds of thousands of images, Apple’s team created a sophisticated, automated pipeline using other powerful AI models. First, they sourced real photographs from the OpenImages collection. Then, they used Google’s Nano-Banana model to generate a diverse range of edits based on a comprehensive taxonomy of 35 different edit types, from “change color” to “apply seasonal transformation.”

But here’s the clever part: to ensure quality, they used another AI, Gemini-2.5-Pro, as an automated “judge.” This AI judge scored every single edit on four criteria: Instruction Compliance (40%), Seamlessness (25%), Preservation Balance (20%), and Technical Quality (15%). Edits that scored above a 0.7 threshold were labeled “successful.” Edits that failed were kept as “negative examples.” This process creates a high-quality dataset without a single human annotator, at a total cost of about $100,000.
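To make the scoring step concrete, here is a minimal Python sketch of how the four weighted criteria could be combined and compared against the 0.7 threshold. The weights and the threshold come from the figures above; combining them as a simple weighted sum, scoring each criterion on a 0 to 1 scale, and every function and key name here are assumptions for illustration, not Apple’s actual code.

```python
# Weights and threshold as reported in the article. Treating the overall
# score as a weighted sum of per-criterion scores in [0, 1] is an assumption.
WEIGHTS = {
    "instruction_compliance": 0.40,
    "seamlessness": 0.25,
    "preservation_balance": 0.20,
    "technical_quality": 0.15,
}
THRESHOLD = 0.7


def overall_score(scores: dict) -> float:
    """Combine the four per-criterion scores into one weighted score."""
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)


def label_edit(scores: dict) -> str:
    """Label an edit 'successful' if it scores above the threshold,
    otherwise keep it as a 'negative' example (hypothetical label names)."""
    return "successful" if overall_score(scores) > THRESHOLD else "negative"


# Hypothetical judge outputs for two edits.
good_edit = {
    "instruction_compliance": 0.9,
    "seamlessness": 0.8,
    "preservation_balance": 0.7,
    "technical_quality": 0.6,
}
weak_edit = {
    "instruction_compliance": 0.3,
    "seamlessness": 0.8,
    "preservation_balance": 0.8,
    "technical_quality": 0.8,
}

print(label_edit(good_edit))
print(label_edit(weak_edit))
```

Because Instruction Compliance carries the largest weight, an edit that looks clean but ignores the instruction can still fall below the threshold, as the second example shows.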

More than just single edits

The real power of Pico-Banana-400K isn’t just its size; it’s the specialized subsets designed to solve complex research problems. The full dataset includes:

  • 258K Single-Turn Edits: The core dataset of (before, after, instruction) triplets for basic model training.
  • 72K Multi-Turn Examples: This subset contains “editing sessions” with 2-5 consecutive modifications. This is crucial for teaching AI models how to handle sequential commands, reason about changes over time, and understand context (e.g., “add a hat to the man,” followed by “now make it blue”).
  • 56K Preference Pairs: By saving the “successful” and “failed” edits for the same instruction, this subset allows researchers to train AI reward models and improve alignment, teaching models to understand why one edit is better than another.
  • Paired Instructions: Each edit comes with two instruction types: a long, detailed prompt perfect for training and a short, concise “user-style” command (e.g., “make the sky snowy”) to mimic how real people type.
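As a rough illustration of the preference-pair idea, the Python sketch below groups edit attempts by source image and instruction, then pairs each successful attempt with a failed one. The record layout, the field names, and the one-to-one pairing rule are all hypothetical; the article does not describe the dataset’s actual file format or how pairs were matched.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class EditAttempt:
    # Hypothetical record: one attempt at applying an instruction to an image.
    source_image: str
    instruction: str
    edited_image: str
    successful: bool


@dataclass(frozen=True)
class PreferencePair:
    # Hypothetical record: a preferred and a rejected edit for the same request.
    source_image: str
    instruction: str
    chosen: str
    rejected: str


def build_preference_pairs(attempts):
    """Pair successful and failed edits that share an image and instruction."""
    grouped = defaultdict(lambda: {"good": [], "bad": []})
    for attempt in attempts:
        key = (attempt.source_image, attempt.instruction)
        bucket = "good" if attempt.successful else "bad"
        grouped[key][bucket].append(attempt.edited_image)

    pairs = []
    for (source, instruction), edits in grouped.items():
        # One-to-one pairing is an assumption; requests without both a
        # successful and a failed attempt produce no pair.
        for good, bad in zip(edits["good"], edits["bad"]):
            pairs.append(PreferencePair(source, instruction, good, bad))
    return pairs


attempts = [
    EditAttempt("img_001.jpg", "make the sky snowy", "img_001_a.jpg", False),
    EditAttempt("img_001.jpg", "make the sky snowy", "img_001_b.jpg", True),
    EditAttempt("img_002.jpg", "remove this car", "img_002_a.jpg", True),
]

pairs = build_preference_pairs(attempts)
print(len(pairs))
```

Pairs shaped like this are the kind of input a reward model needs: for the same image and instruction, one output marked as preferred and one marked as rejected.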

What this means for future AI editors

By analyzing the “success rates” of its own pipeline, the Apple team also created a clear map of what AI image editors are good at and where they still fail. Global edits like “add a vintage filter” (90% success) are easy. Object-level edits like “remove this car” (83% success) are pretty good. But edits requiring precise spatial control or symbolic understanding remain “brittle” and are now open problems for researchers to solve.

The hardest tasks? Relocating an object (59% success), changing a font (57% success), and generating caricatures (58% success). By open-sourcing this dataset, Apple is essentially giving the entire AI community a high-quality “gym” to train their models and a clear list of challenges to tackle next.


Tags: Apple, pico-banana-400k

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.
