Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Glossary
    • Whitepapers
  • Newsletter
  • + More
    • Conversations
    • Events
    • About
      • About
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
  • AI
  • Tech
  • Cybersecurity
  • Finance
  • DeFi & Blockchain
  • Startups
  • Gaming
Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Glossary
    • Whitepapers
  • Newsletter
  • + More
    • Conversations
    • Events
    • About
      • About
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
Dataconomy
No Result
View All Result

Apple builds an AI “engineering team” that finds and fixes bugs on its own

The company’s new ADE-QVAET model achieved 98.08% accuracy in predicting buggy code regions. An agentic AI tool now generates entire test plans with 94.8% accuracy, cutting testing time by 85%. SWE-Gym trains AI agents to fix code in real-world conditions, solving 72.5% of tasks correctly.

byKerem Gülen
October 17, 2025
in Research

Apple’s AI researchers have quietly published three new studies that pull back the curtain on a major new ambition: automating the most tedious and critical parts of software development. The papers, published on Apple’s Machine Learning Research blog, detail new AI systems that can predict where bugs are likely to appear, automatically write entire test plans, and even fix the broken code themselves. This matters because it’s not just another “AI writes code” demo. Apple is building a suite of specialized AI quality engineers to find and fix flaws before they ever reach your phone or computer, which could lead to massive gains in productivity and (hopefully) more stable software.

Paper 1: The AI bug predictor

The first study, “Software Defect Prediction using Autoencoder Transformer Model,” from researchers Seshu Barma, Mohanakrishnan Hariharan, and Satish Arvapalli, tackles the problem of “buggy” code. Instead of having an AI read millions of lines of code—a process prone to AI “hallucinations”—they built a different kind of tool.

Their model, ADE-QVAET, acts less like a code reviewer and more like a data analyst. It doesn’t read the code itself. Instead, it analyzes metrics about the code, such as its complexity, size, and structure. It’s trained to find the hidden patterns in these metrics that reliably predict where bugs are most likely to be hiding.

Stay Ahead of the Curve!

Don't miss out on the latest insights, trends, and analysis in the world of data, technology, and startups. Subscribe to our newsletter and get exclusive content delivered straight to your inbox.

The results are incredibly effective. On a standard dataset for bug prediction, the model achieved 98.08% accuracy. It also scored high on precision and recall, a technical way of saying it’s extremely good at finding real bugs while avoiding “false positives” that waste developers’ time.

Paper 2: The automated quality engineer

Finding bugs is great, but what about the mountain of paperwork that comes with software testing? The second study, “Agentic RAG for Software Testing,” addresses this head-on. The researchers note that quality engineers spend 30-40% of their time just creating “foundational testing artifacts”—a corporate term for test plans, cases, and scripts.

Their solution is an AI agent that does this work automatically. The system reads the project’s requirements and business logic, then autonomously generates the entire suite of testing documents. This system keeps full “traceability,” meaning it logs exactly which test case corresponds to which business requirement.

The impact here is measured in time and money. The system showed a remarkable 94.8% accuracy in its generated tests. In validation projects, it led to an 85% reduction in the testing timeline and an 85% improvement in test suite efficiency. For one project, that meant accelerating the go-live date by a full two months.


MIT researchers have built an AI that teaches itself how to learn


Paper 3: The AI ‘gym’ that teaches code-fixing

The third and most ambitious study is “Training Software Engineering Agents and Verifiers with SWE-Gym.” This paper asks the logical next question: Why just find bugs when you can fix them?

To do this, the team built a “gym” for AI agents. This training environment, SWE-Gym, is a sandbox built from 2,438 real-world Python tasks pulled from 11 open-source projects. Each task comes with its own executable environment and test suite. This allows an AI agent to practice the full developer workflow: read the bug report, write the code to fix it, and then run the tests to see if the fix actually worked (and didn’t break anything else).

The training paid off. AI agents trained in this “gym” correctly solved 72.5% of the buggy tasks, a result that outperformed previous benchmarks by more than 20 percentage points.

These are specialized tools, not a general-purpose AI coder. The researchers for the automated testing (Paper 2) note that their work was focused only on specific “Employee Systems, Finance, and SAP environments,” meaning it’s not a one-size-fits-all solution just yet. Similarly, the bug-fixing “gym” was focused on Python tasks.

What these three studies show is a clear, multi-pronged strategy. Apple isn’t just trying to build one “do-it-all” AI. Instead, they’re building a team of AI specialists: a bug-predicting analyst, a test-writing “paper-pusher,” and a bug-fixing “mechanic.” This approach could fundamentally change the economics of software development, leading to faster timelines, lower costs, and more reliable products.


Featured image credit

Tags: AIAppleCoding

Related Posts

Gen Z workers are telling AI things they’ve never told a human

Gen Z workers are telling AI things they’ve never told a human

October 20, 2025
MIT researchers have built an AI that teaches itself how to learn

MIT researchers have built an AI that teaches itself how to learn

October 20, 2025
Graphite: 52% of new content is AI-generated

Graphite: 52% of new content is AI-generated

October 17, 2025
Just 250 bad documents can poison a massive AI model

Just 250 bad documents can poison a massive AI model

October 15, 2025
71% of workers are using rogue AI tools at work, Microsoft warns

71% of workers are using rogue AI tools at work, Microsoft warns

October 14, 2025
Google taught your voice assistant to understand what you mean

Google taught your voice assistant to understand what you mean

October 14, 2025

LATEST NEWS

AWS outage: A complete list of every site and app that went down

Facebook’s new AI tool will scan your camera roll

Google will discontinue this once fancy project and here’s why

Wikipedia’s human traffic drops 8% as AI takes the wheel

WhatsApp tests monthly message cap to fight spam

How one woman beat eviction court using ChatGPT and Perplexity

Dataconomy

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.

  • About
  • Imprint
  • Contact
  • Legal & Privacy

Follow Us

  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Glossary
    • Whitepapers
  • Newsletter
  • + More
    • Conversations
    • Events
    • About
      • About
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
No Result
View All Result
Subscribe

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy Policy.