Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
  • AI
  • Tech
  • Cybersecurity
  • Finance
  • DeFi & Blockchain
  • Startups
  • Gaming
Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
Dataconomy
No Result
View All Result

Microsoft and Tsinghua’s X-Coder hits 62.9% pass rate on LiveCodeBench v5

Research shows that increasing task diversity is more effective than adding more solutions.

byAytun Çelebi
January 27, 2026
in Research
Home Research
Share on FacebookShare on TwitterShare on LinkedInShare on WhatsAppShare on e-mail
Google Preferred Source

Researchers from Tsinghua University and Microsoft have developed X-Coder, an AI coding model with 7 billion parameters. This model was trained exclusively on synthetic data. A paper detailing X-Coder was posted on arXiv on January 11.

X-Coder achieved a 62.9% pass rate on LiveCodeBench v5 and a 55.8% pass rate on LiveCodeBench v6. This performance surpasses models such as DeepCoder-14B-Preview and AReal-boba2-14B, both of which have 14 billion parameters.

The development utilized SynthSmith, a data synthesis pipeline that generates programming tasks, solutions, and test cases. SynthSmith does not rely on human-written examples. The system begins by extracting coding-relevant features, including algorithms, data structures, and optimization techniques, from an initial pool of approximately 27,000 code examples. This pool is then expanded to nearly 177,000 entries through an evolutionary process.

Stay Ahead of the Curve!

Don't miss out on the latest insights, trends, and analysis in the world of data, technology, and startups. Subscribe to our newsletter and get exclusive content delivered straight to your inbox.

Quality control in SynthSmith involves a dual-verification strategy. The system determines correct test outputs through majority voting among multiple candidate solutions. The best solution is then validated against a holdout test set.

The research indicated that task variety in training data contributes more to competitive programming performance than model size or solution quantity. Experiments showed that increasing the number of distinct tasks was more effective than adding multiple solutions per task.

A dataset with 64,000 different tasks, each with one solution, outperformed datasets with fewer tasks but more solutions per problem. Pass rates increased with task count: from 43.7% with 32,000 tasks to 51.3% with 64,000 tasks, then 57.2% with 128,000 tasks, and 62.7% with 192,000 tasks. The supervised fine-tuning phase achieved 60.3%, with an additional 4.6 percentage points added during reinforcement learning.

The synthetic training approach helps mitigate benchmark contamination concerns. A reference model, Qwen3-8B, showed a 30-point performance decrease between older and newer LiveCodeBench versions. X-Coder exhibited a smaller decline of 17.2 points, suggesting reduced memorization of benchmark problems.

The code for SynthSmith is available on GitHub. Researchers have stated intentions to release model weights. This work occurs as the AI industry increasingly utilizes synthetic data to address limitations in available training material. Microsoft has previously developed SynthLLM for broader synthetic data generation.


Featured image credit

Tags: Microsofttsinghua

Related Posts

Alibaba framework allegedly cuts AI agent token use by 99%

Alibaba framework allegedly cuts AI agent token use by 99%

July 3, 2026
Codex use is spreading into knowledge work, OpenAI says

Codex use is spreading into knowledge work, OpenAI says

July 1, 2026
Meta says Brain2Qwerty v2 turns brain activity into text

Meta says Brain2Qwerty v2 turns brain activity into text

July 1, 2026
Penn Medicine unveils AI-human system to speed CAR T cancer target discovery

Penn Medicine unveils AI-human system to speed CAR T cancer target discovery

June 30, 2026
CrowdStrike warns prompt injection attacks hit over 90 firms in 2025

CrowdStrike warns prompt injection attacks hit over 90 firms in 2025

June 29, 2026
Wireless charging uses about 40% more electricity

Wireless charging uses about 40% more electricity

June 25, 2026

LATEST NEWS

Tesla brings long-wheelbase Model Y to the US

Opera adds protection against copy-paste ClickFix attacks

Cloudflare will block AI crawlers unless sites opt in

Meta releases Pocket app for generative AI games

Android Halo will place AI agent updates in status bar

WhatsApp usernames spark impersonation and fraud concerns

BEST AI MODELS LEADERBOARD

See the best AI models, ranked by intelligence, benchmark results, speed and token price. Find the most suitable LLMs, Text-to-Image, Image Editing, Text-to-Speech, Text-to-Video and Image-to-Video  artificial intelligence model for your tasks and business.

LATEST TOOLS

Instantchapters

Intellectia

ZipWP

Copyleaks – Plagiarism detector

Clipping Magic

KoalaChat

SpeechText

Booknotes

Unscrambler

LingoLooper

Dataconomy

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.

  • About
  • Imprint
  • Contact
  • Legal & Privacy

Follow Us

  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI tools
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
No Result
View All Result
Subscribe

This website uses cookies to improve your experience. You can choose to accept or reject them. Visit our Privacy Policy.