Dataconomy

DeepSeek introduces Manifold-Constrained Hyper-Connections for R2

The research suggests that future models like the anticipated R2 could be trained on limited hardware through smarter engineering

by Kerem Gülen
January 6, 2026
in News, Research

Just before the new year, researchers at the Chinese AI firm DeepSeek released a paper outlining a new architectural approach called Manifold-Constrained Hyper-Connections, or mHC for short. The method may give engineers a way to build and scale large language models without the computational costs and capital that such work typically requires.

DeepSeek first captured the cultural spotlight one year ago with the release of R1. That model rivaled the capabilities of OpenAI’s o1 but was reportedly trained at a fraction of the cost. The release came as a shock to US-based developers because it challenged the assumption that only massive reserves of capital and hardware could produce cutting-edge AI. The newly published mHC paper, hosted on the preprint server arXiv, could serve as the technological framework for DeepSeek’s forthcoming model, R2. The R2 model was originally expected in mid-2025 but was postponed, reportedly due to concerns from CEO Liang Wenfeng regarding performance and China’s limited access to advanced AI chips.

The new paper targets a technical obstacle that currently hinders AI scalability. Large language models are deep neural networks that pass a signal through many stacked layers, and their architectures are designed to conserve that signal along the way. As a model grows and more layers are added, however, the signal can attenuate or degrade until it is hard to distinguish from noise. The researchers liken this to a game of “telephone”: the more people in the chain, the higher the chance the original message becomes garbled. The core engineering challenge is balancing plasticity against stability, so that signals are conserved across as many layers as possible without degradation.
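
The “telephone” effect can be illustrated with a deliberately simplified sketch. It is not taken from the paper: it assumes each layer does nothing except scale a scalar signal by a fixed gain, and the names `propagate` and `DEPTH` are hypothetical. Under that assumption, a gain just below 1 makes the signal fade over many layers, a gain just above 1 makes it blow up, and only a gain of exactly 1 carries it through unchanged.

```python
def propagate(signal, gain, depth):
    """Pass a scalar signal through `depth` toy layers that each scale it by `gain`."""
    for _ in range(depth):
        signal = gain * signal
    return signal


DEPTH = 64  # hypothetical layer count, chosen only for illustration

faded = propagate(1.0, 0.9, DEPTH)      # each layer loses 10% of the signal
preserved = propagate(1.0, 1.0, DEPTH)  # identity-like path, nothing lost
amplified = propagate(1.0, 1.1, DEPTH)  # each layer adds 10%, so the signal explodes

print(f"gain 0.9 after {DEPTH} layers: {faded:.6f}")
print(f"gain 1.0 after {DEPTH} layers: {preserved:.6f}")
print(f"gain 1.1 after {DEPTH} layers: {amplified:.2f}")
```

Real layers are far more complex than a single multiplication, but the sketch shows why small per-layer deviations compound with depth, and why keeping the signal path close to an identity mapping matters for stability.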


The authors of the paper, including CEO Liang Wenfeng, built their research upon hyper-connections (HCs), a framework introduced in 2024 by researchers from ByteDance. Standard HCs diversify the channels through which neural network layers share information, but they introduce the risk of signal loss and come with high memory costs that make them difficult to implement at scale. DeepSeek’s mHC architecture aims to solve this by constraining the hyper-connections within a model, limiting how freely they are allowed to mix information. This approach preserves the informational complexity enabled by HCs while sidestepping the memory issues, making it practical to train highly complex models even for developers with limited resources.
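
To make the idea of constrained connections concrete, here is a minimal sketch under stated assumptions. The article does not say what the constraint is; the sketch assumes the matrix that mixes several parallel residual streams is forced to be doubly stochastic (non-negative, every row and column summing to 1) using Sinkhorn-Knopp normalization. The functions `sinkhorn`, `mix`, and `run`, the 4x4 matrix, and the depth of 48 are all hypothetical, and the toy omits the actual layer computation to isolate the mixing path.

```python
def sinkhorn(matrix, iterations=50):
    """Alternately normalize rows and columns of a positive square matrix
    so that it approaches a doubly stochastic matrix (Sinkhorn-Knopp)."""
    m = [row[:] for row in matrix]
    n = len(m)
    for _ in range(iterations):
        for i in range(n):  # make each row sum to 1
            row_sum = sum(m[i])
            m[i] = [v / row_sum for v in m[i]]
        for j in range(n):  # make each column sum to 1
            col_sum = sum(m[i][j] for i in range(n))
            for i in range(n):
                m[i][j] /= col_sum
    return m


def mix(matrix, streams):
    """Apply one mixing step: each output stream is a weighted sum of inputs."""
    return [sum(w * x for w, x in zip(row, streams)) for row in matrix]


def run(matrix, streams, depth):
    """Apply the same mixing matrix once per layer for `depth` layers."""
    for _ in range(depth):
        streams = mix(matrix, streams)
    return streams


# Hypothetical unconstrained mixing weights; rows sum to roughly 1.8 to 1.9.
raw = [
    [1.2, 0.3, 0.1, 0.2],
    [0.2, 1.1, 0.4, 0.1],
    [0.1, 0.2, 1.3, 0.3],
    [0.3, 0.1, 0.2, 1.2],
]
streams = [1.0, -2.0, 0.5, 3.0]

constrained = sinkhorn(raw)
unconstrained_out = run(raw, streams, 48)
constrained_out = run(constrained, streams, 48)

print("unconstrained max magnitude:", max(abs(v) for v in unconstrained_out))
print("constrained max magnitude:  ", max(abs(v) for v in constrained_out))
print("sum before / after:", sum(streams), sum(constrained_out))
```

In this toy setting the unconstrained matrix amplifies the streams without bound as depth grows, while the constrained matrix keeps every stream within the range of the inputs and preserves their total. That is the kind of stability a constraint on the connections is meant to buy; the paper's actual formulation may differ.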

The debut of the mHC framework points to a shift in how frontier AI is developed. Until recently, prevailing industry wisdom held that only the wealthiest companies could afford to build frontier models. DeepSeek continues to demonstrate that breakthroughs can be achieved through clever engineering rather than raw financial force. By publishing this research, DeepSeek has made the mHC method available to smaller developers, potentially democratizing access to advanced AI capabilities if the architecture proves successful in the anticipated R2 model.



Tags: AI, deepseek


COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.
