Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
  • AI
  • Tech
  • Cybersecurity
  • Finance
  • DeFi & Blockchain
  • Startups
  • Gaming
Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
Dataconomy
No Result
View All Result

DeepSeek reveals MODEL1 architecture in GitHub update ahead of V4

The GitHub discovery suggests a mid-February launch for V4 to coincide with the 2026 Lunar New Year.

byKerem Gülen
January 21, 2026
in Artificial Intelligence, News
Home News Artificial Intelligence
Share on FacebookShare on TwitterShare on LinkedInShare on WhatsAppShare on e-mail
Google Preferred Source

DeepSeek revealed details of a new model designated “MODEL1” through recent updates to its FlashMLA codebase on GitHub. The identifier “MODEL1” appears 28 times across 114 files within the repository, marking the disclosure on the one-year anniversary of the company’s R1 release. This development follows reports that DeepSeek plans to release its next-generation V4 model around mid-February 2026, coinciding with the Lunar New Year.

Analysis of the updated codebase by developers indicates MODEL1 features a distinct architecture from DeepSeek-V3.2, codenamed “V32” in the repository. Code logic discrepancies suggest changes in key-value cache layout, sparsity handling, and FP8 data format decoding, pointing to restructuring for memory optimization and computational efficiency. Researchers on Reddit’s LocalLLaMA community noted the FlashMLA source code update added extensive MODEL1 support, including compatibility with Nvidia’s forthcoming Blackwell architecture (SM100) and current Hopper chips. The changes reportedly show MODEL1 reverting to a unified 512-standard dimension and introducing “Value Vector Position Awareness” features, alongside potential implementations of DeepSeek’s recently published “Engram” conditional memory system.

The FlashMLA repository, which houses DeepSeek’s Multi-Head Latent Attention decoding kernel optimized for Nvidia Hopper GPUs, was the source of the technical clues. DeepSeek’s V4 model is expected to integrate the Engram architecture, which facilitates efficient retrieval from contexts exceeding one million tokens by utilizing a lookup system for foundational facts rather than recalculating them through computation. Internal tests by DeepSeek employees reportedly suggest V4 could outperform rival models from Anthropic and OpenAI on coding benchmarks, particularly with long code prompts.

Stay Ahead of the Curve!

Don't miss out on the latest insights, trends, and analysis in the world of data, technology, and startups. Subscribe to our newsletter and get exclusive content delivered straight to your inbox.

The MODEL1 revelation occurs as DeepSeek approaches one year since its R1 debut in January 2025. The R1 release resulted in a $593 billion reduction in Nvidia’s market value on a single day, according to ITPro. DeepSeek’s R1 model reportedly cost under $6 million to train and achieved performance on par with or exceeding OpenAI’s o1 model on math and coding benchmarks. The company subsequently released V3.1 in August and V3.2 in December, with V3.2 described as offering performance equivalent to OpenAI’s GPT-5. DeepSeek has not officially commented on MODEL1 or confirmed specific release timing for V4.


Featured image credit

Tags: deepseekmodel1

Related Posts

OpenAI improves health responses for free ChatGPT users

OpenAI improves health responses for free ChatGPT users

June 19, 2026
Adobe expands Firefly AI across Premiere, Illustrator, InDesign and Frame.io

Adobe expands Firefly AI across Premiere, Illustrator, InDesign and Frame.io

June 19, 2026
Spotify launches Reserved to give superfans early ticket access

Spotify launches Reserved to give superfans early ticket access

June 19, 2026
Google discontinues Nest Home Mini and Nest Audio

Google discontinues Nest Home Mini and Nest Audio

June 19, 2026
Instagram adds unique captions for each carousel slide

Instagram adds unique captions for each carousel slide

June 19, 2026
Steam Next Fest sees one in five demos labeled for generative AI

Steam Next Fest sees one in five demos labeled for generative AI

June 17, 2026

LATEST NEWS

OpenAI improves health responses for free ChatGPT users

Adobe expands Firefly AI across Premiere, Illustrator, InDesign and Frame.io

Spotify launches Reserved to give superfans early ticket access

Google discontinues Nest Home Mini and Nest Audio

Instagram adds unique captions for each carousel slide

Steam Next Fest sees one in five demos labeled for generative AI

BEST AI MODELS LEADERBOARD

See the best AI models, ranked by intelligence, benchmark results, speed and token price. Find the most suitable LLMs, Text-to-Image, Image Editing, Text-to-Speech, Text-to-Video and Image-to-Video  artificial intelligence model for your tasks and business.

LATEST TOOLS

Novoresume

PolyAI

SeaArt

H2O.ai

Techpresso

Namecheap Free Logo Maker

Binaural Beats Factory

Lyricallabs

Jobscan

Vsub

Dataconomy

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.

  • About
  • Imprint
  • Contact
  • Legal & Privacy

Follow Us

  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI tools
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
No Result
View All Result
Subscribe

This website uses cookies to improve your experience. You can choose to accept or reject them. Visit our Privacy Policy.