Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
  • AI
  • Tech
  • Cybersecurity
  • Finance
  • DeFi & Blockchain
  • Startups
  • Gaming
Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
Dataconomy
No Result
View All Result

Tech industry averages just 5% GPU utilization, report finds

According to Cast AI, CPU overprovisioning has surged from 40% to 69%, while memory overprovisioning stands at 79%. This indicates that many companies are paying for infrastructure that their applications do not effectively utilize, thus exacerbating costs.

byEmre Çıtak
April 23, 2026
in Research
Home Research
Share on FacebookShare on TwitterShare on LinkedInShare on WhatsAppShare on e-mail
Google Preferred Source

A report from Cast AI reveals that average GPU utilization in the tech industry is only 5%, indicating widespread inefficiency in infrastructure usage. Despite significant investments in AI resources, companies are purchasing approximately twenty times more GPU capacity than necessary.

The findings suggest that overprovisioning is worsening rather than improving, with CPU utilization declining from 10% to 8% and memory utilization dropping from 23% to 20% over the past year. Organizations are reserving nearly double the CPU resources and four times the memory that their workloads actually require.

According to Cast AI, CPU overprovisioning has surged from 40% to 69%, while memory overprovisioning stands at 79%. This indicates that many companies are paying for infrastructure that their applications do not effectively utilize, thus exacerbating costs.

Stay Ahead of the Curve!

Don't miss out on the latest insights, trends, and analysis in the world of data, technology, and startups. Subscribe to our newsletter and get exclusive content delivered straight to your inbox.

The report highlights that idle GPU costs significantly exceed those of idle CPUs, costing dollars per hour compared to mere cents for CPUs. In a notable shift, GPU prices increased by 15% in January 2026 for the first time since the launch of EC2 in 2006, attributed to supply and demand fluctuations.

“At 5% utilization, the math doesn’t work,” said Laurent Gil, co-founder and President of Cast AI. Gil emphasized that the trend of overprovisioning stems from a preference for perceived safety over resource efficiency.

While some organizations have achieved higher GPU utilization rates—one reported 49% utilization on H200s and 30% on H100s—most are not leveraging existing solutions such as automated rightsizing, GPU sharing, and Spot management, resulting in continued overprovisioning.

Cast AI’s data indicates that many companies remain reluctant to change long-standing operational practices, even at the cost of higher expenses. A shift toward treating resource efficiency as a continuous automated process is necessary to mitigate these inefficiencies.


Featured image credit

Tags: FeaturedGPUs

Related Posts

European consumers may leave businesses using US tech providers

European consumers may leave businesses using US tech providers

June 24, 2026
Study links AI-assisted homework to lower exam scores

Study links AI-assisted homework to lower exam scores

June 22, 2026
Harvard and Boston Children’s use AI to revisit unsolved genetic cases

Harvard and Boston Children’s use AI to revisit unsolved genetic cases

June 19, 2026
Adobe report finds 86% of creators now use generative AI in workflows

Adobe report finds 86% of creators now use generative AI in workflows

June 17, 2026
AI transfer learning speeds cosmology research but has hidden risks

AI transfer learning speeds cosmology research but has hidden risks

June 15, 2026
Phishing scams targeting travelers hit record levels in 2026

Phishing scams targeting travelers hit record levels in 2026

June 15, 2026

LATEST NEWS

Rockstar confirms GTA 6 pricing and pre-order details

ByteDance launches Doubao 2.1 Pro language model

OpenAI expands cybersecurity efforts with Patch the Planet

Meta launches $299 smart glasses under its own brand

Claude Tag brings shared AI assistant to Slack channels

PlayStation 6 leak points to 2027 release window

BEST AI MODELS LEADERBOARD

See the best AI models, ranked by intelligence, benchmark results, speed and token price. Find the most suitable LLMs, Text-to-Image, Image Editing, Text-to-Speech, Text-to-Video and Image-to-Video  artificial intelligence model for your tasks and business.

LATEST TOOLS

Vrew

Fireflies

SpeedLegal

Teachable Machine

Unriddle

VidAU

Qualified

character.ai

Interview Coder

Moonbeam

Dataconomy

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.

  • About
  • Imprint
  • Contact
  • Legal & Privacy

Follow Us

  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI tools
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
No Result
View All Result
Subscribe

This website uses cookies to improve your experience. You can choose to accept or reject them. Visit our Privacy Policy.