Dataconomy

DeepSeek introduces Manifold-Constrained Hyper-Connections for R2

The research suggests that future models like the anticipated R2 could be trained on limited hardware through smarter engineering

by Kerem Gülen
January 6, 2026
in News, Research

Just before the start of the new year, the artificial intelligence community was introduced to a potential breakthrough in model training. A team of researchers from the Chinese AI firm DeepSeek released a paper outlining a novel architectural approach called Manifold-Constrained Hyper-Connections, or mHC for short. This new methodology may provide a pathway for engineers to build and scale large language models without the prohibitive computational costs and capital typically required.

DeepSeek first captured the cultural spotlight one year ago with the release of R1. That model rivaled the capabilities of OpenAI’s o1 but was reportedly trained at a fraction of the cost. The release came as a shock to US-based developers because it challenged the assumption that only massive reserves of capital and hardware could produce cutting-edge AI. The newly published mHC paper, hosted on the preprint server arXiv, could serve as the technological framework for DeepSeek’s forthcoming model, R2. The R2 model was originally expected in mid-2025 but was postponed, reportedly due to concerns from CEO Liang Wenfeng regarding performance and China’s limited access to advanced AI chips.

The new paper targets a technical problem that currently hinders AI scalability. Large language models are built on neural networks that must conserve signals across many layers. As a model grows and more layers are added, however, the signal can become attenuated or degraded, raising the risk that it turns into noise. The researchers liken this to a game of “telephone”: the more people in the chain, the higher the chance the original message becomes confused or altered. The core engineering challenge is managing the trade-off between plasticity and stability so that signals survive as many layers as possible without degradation.
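
The “telephone” effect can be illustrated with a toy calculation. The sketch below is not taken from the paper; it assumes, purely for illustration, that each layer scales a single number by a fixed gain, and the function name `propagate` is hypothetical.

```python
def propagate(signal, gains):
    """Pass a scalar signal through layers that each scale it by a gain."""
    for gain in gains:
        signal *= gain
    return signal


depth = 50  # number of stacked layers in this toy example

# Compare a slightly lossy layer, a perfectly conserving layer,
# and a slightly amplifying layer over the same depth.
for gain in (0.9, 1.0, 1.1):
    result = propagate(1.0, [gain] * depth)
    print(f"per-layer gain {gain}: signal after {depth} layers = {result:.4g}")
```

Even a small per-layer deviation compounds with depth: a gain just below 1 shrinks the signal toward zero, and a gain just above 1 makes it grow rapidly. Only a gain of exactly 1 leaves it unchanged, which is why deep networks need some mechanism to conserve signals.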

The authors of the paper, including CEO Liang Wenfeng, built their research upon hyper-connections (HCs), a framework introduced in 2024 by researchers from ByteDance. Standard HCs diversify the channels through which neural network layers share information, but they introduce the risk of signal loss and come with high memory costs that make them difficult to implement at scale. DeepSeek’s mHC architecture aims to solve this by constraining the hyperconnectivity within a model. This approach preserves the informational complexity enabled by HCs while sidestepping the memory issues, allowing for the training of highly complex models in a way that is practical even for developers with limited resources.
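
A rough sketch of why constraining the connections can help is shown below. The article does not say which constraint the paper uses, so this example makes an assumption: each matrix that mixes information between parallel streams is forced to be doubly stochastic (every row and column sums to 1) by alternating row and column normalization, a procedure known as Sinkhorn-Knopp iteration. The names `sinkhorn`, `mix`, and `run_layers` are hypothetical, and the code is a toy model rather than DeepSeek's implementation.

```python
import random


def sinkhorn(matrix, iterations=50):
    """Alternately normalize rows and columns of a positive square matrix."""
    m = [row[:] for row in matrix]
    n = len(m)
    for _ in range(iterations):
        # Row step: make every row sum to 1.
        m = [[v / sum(row) for v in row] for row in m]
        # Column step: make every column sum to 1.
        col_sums = [sum(m[i][j] for i in range(n)) for j in range(n)]
        m = [[m[i][j] / col_sums[j] for j in range(n)] for i in range(n)]
    return m


def mix(matrix, streams):
    """Apply one mixing matrix to the parallel streams (matrix-vector product)."""
    return [sum(w * x for w, x in zip(row, streams)) for row in matrix]


def run_layers(streams, depth, constrained, seed=0):
    """Mix the streams through `depth` layers of random positive matrices."""
    rng = random.Random(seed)
    n = len(streams)
    for _ in range(depth):
        raw = [[rng.uniform(0.1, 1.0) for _ in range(n)] for _ in range(n)]
        # Assumption: the constraint is a projection to a doubly stochastic matrix.
        matrix = sinkhorn(raw) if constrained else raw
        streams = mix(matrix, streams)
    return streams


start = [1.0, 2.0, 3.0]
print("total signal at input:        ", sum(start))
print("unconstrained, 60 layers:     ", sum(run_layers(start, 60, False)))
print("doubly stochastic, 60 layers: ", sum(run_layers(start, 60, True)))
```

Because every column of a constrained matrix sums to 1, the total signal across the streams is the same after each layer, while the unconstrained matrices let it drift with depth. The streams can still exchange information freely, which is the sense in which a constraint can keep the richer connectivity without the instability.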

The debut of the mHC framework suggests a pivot in the evolution of AI development. Until recently, prevailing industry wisdom held that only the wealthiest companies could afford to build frontier models. DeepSeek continues to demonstrate that breakthroughs can be achieved through clever engineering rather than raw financial force. By publishing this research, DeepSeek has made the mHC method available to smaller developers, potentially democratizing access to advanced AI capabilities if this architecture proves successful in the anticipated R2 model.


Tags: AI, DeepSeek

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.
