Dataconomy

DeepSeek reveals MODEL1 architecture in GitHub update ahead of V4

The GitHub discovery suggests a mid-February launch for V4 to coincide with the 2026 Lunar New Year.

by Kerem Gülen
January 21, 2026
in Artificial Intelligence, News

DeepSeek revealed details of a new model designated “MODEL1” through recent updates to its FlashMLA codebase on GitHub. The identifier “MODEL1” appears 28 times across 114 files in the repository, and the disclosure falls on the one-year anniversary of the company’s R1 release. It follows reports that DeepSeek plans to release its next-generation V4 model around mid-February 2026, coinciding with the Lunar New Year.
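
A count like the one above can be reproduced on a local clone of a repository. The following is a minimal sketch, not the method used by the developers who examined FlashMLA; the function name `count_identifier` and the choice to skip the `.git` directory are assumptions made for illustration.

```python
from pathlib import Path


def count_identifier(root, identifier):
    """Return (total occurrences, files containing it, files scanned).

    Hypothetical helper: walks every regular file under `root`,
    skipping anything inside a `.git` directory.
    """
    total = 0
    files_with_hits = 0
    files_scanned = 0
    for path in Path(root).rglob("*"):
        if not path.is_file() or ".git" in path.parts:
            continue
        files_scanned += 1
        try:
            # Ignore undecodable bytes so binary files do not raise.
            text = path.read_text(encoding="utf-8", errors="ignore")
        except OSError:
            continue
        hits = text.count(identifier)
        if hits:
            total += hits
            files_with_hits += 1
    return total, files_with_hits, files_scanned
```

Calling `count_identifier("FlashMLA", "MODEL1")` on a checkout would return the three counts for that snapshot; the numbers depend on which commit is checked out.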

Developers who analyzed the updated codebase say MODEL1 has a distinct architecture from DeepSeek-V3.2, which is codenamed “V32” in the repository. Differences between the two code paths suggest changes in key-value cache layout, sparsity handling, and FP8 data format decoding, pointing to a restructuring aimed at memory optimization and computational efficiency. Researchers in Reddit’s LocalLLaMA community noted that the FlashMLA source code update added extensive MODEL1 support, including compatibility with Nvidia’s Blackwell architecture (SM100) as well as current Hopper chips. The changes reportedly show MODEL1 reverting to a unified, standard dimension of 512 and introducing “Value Vector Position Awareness” features, alongside potential implementations of DeepSeek’s recently published “Engram” conditional memory system.

The technical clues come from the FlashMLA repository, which houses DeepSeek’s Multi-Head Latent Attention decoding kernel optimized for Nvidia Hopper GPUs. DeepSeek’s V4 model is expected to integrate the Engram architecture, which is designed for efficient retrieval from contexts exceeding one million tokens: it looks up foundational facts in a memory system rather than recomputing them. Internal tests by DeepSeek employees reportedly suggest V4 could outperform rival models from Anthropic and OpenAI on coding benchmarks, particularly with long code prompts.
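
The general idea of looking up a stored fact instead of recomputing it can be shown with a toy example. The sketch below is not DeepSeek’s Engram design, whose details are not described here; the class `NGramMemory`, its parameters `num_slots` and `n`, and the use of a hashed n-gram as the key are all assumptions chosen only to show a constant-time lookup.

```python
import hashlib


class NGramMemory:
    """Toy lookup memory keyed by the last n tokens (hypothetical)."""

    def __init__(self, num_slots=2**20, n=2):
        self.num_slots = num_slots
        self.n = n
        self.table = {}  # slot index -> stored value

    def _slot(self, ngram):
        # Stable hash so the same n-gram always maps to the same slot.
        digest = hashlib.sha256(" ".join(ngram).encode("utf-8")).digest()
        return int.from_bytes(digest[:8], "big") % self.num_slots

    def store(self, tokens, value):
        # Only the last n tokens are used as the key.
        self.table[self._slot(tuple(tokens[-self.n:]))] = value

    def lookup(self, tokens):
        # One hash and one table access, regardless of context length.
        if len(tokens) < self.n:
            return None
        return self.table.get(self._slot(tuple(tokens[-self.n:])))


memory = NGramMemory()
memory.store(["capital", "of", "France"], "Paris")
answer = memory.lookup(["the", "capital", "of", "France"])
```

Because the key is built from only the last `n` tokens, the cost of a lookup does not grow with the length of the surrounding context. Distinct n-grams can collide in the same slot, which a real system would have to handle.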


The MODEL1 revelation comes one year after R1’s debut in January 2025. According to ITPro, the R1 release wiped $593 billion off Nvidia’s market value in a single day. R1 reportedly cost under $6 million to train and matched or exceeded OpenAI’s o1 model on math and coding benchmarks. The company subsequently released V3.1 in August 2025 and V3.2 in December 2025, with V3.2 described as offering performance equivalent to OpenAI’s GPT-5. DeepSeek has not officially commented on MODEL1 or confirmed specific release timing for V4.


Tags: deepseek, model1

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.
