Dataconomy

Tech industry averages just 5% GPU utilization, report finds

by Emre Çıtak
April 23, 2026
in Research

A report from Cast AI finds that average GPU utilization in the tech industry is only 5%, indicating widespread inefficiency in infrastructure usage. Despite heavy investment in AI resources, companies are buying roughly twenty times more GPU capacity than their workloads use.
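The "twenty times" figure follows directly from the 5% utilization number. A minimal sketch of that arithmetic, assuming utilization is simply used capacity divided by provisioned capacity (the function name is illustrative and does not come from the Cast AI report):

```python
def capacity_multiple(utilization: float) -> float:
    """Return provisioned capacity as a multiple of the capacity actually used.

    Assumes utilization = used / provisioned, expressed as a fraction in (0, 1].
    """
    if not 0 < utilization <= 1:
        raise ValueError("utilization must be a fraction in (0, 1]")
    return 1 / utilization


gpu_utilization = 0.05  # the 5% average reported in the article
multiple = capacity_multiple(gpu_utilization)
idle_share = 1 - gpu_utilization

print(f"Provisioned vs. used: {multiple:.0f}x")
print(f"Share of GPU capacity sitting idle: {idle_share:.0%}")
```

Under this reading, 5% utilization means about 95% of the purchased GPU capacity sits idle at any given time.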

The findings suggest that overprovisioning is worsening rather than improving, with CPU utilization declining from 10% to 8% and memory utilization dropping from 23% to 20% over the past year. Organizations are reserving nearly double the CPU resources and four times the memory that their workloads actually require.

According to Cast AI, CPU overprovisioning has surged from 40% to 69%, while memory overprovisioning stands at 79%. Many companies are therefore paying for infrastructure that their applications do not use, which drives up costs.

The report highlights that idle GPUs cost far more than idle CPUs: dollars per hour rather than cents. In a notable shift, GPU prices rose 15% in January 2026, the first increase since EC2 launched in 2006, a change attributed to supply and demand fluctuations.
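To see why the gap between dollars and cents matters, here is a rough sketch of the idle-cost arithmetic. The hourly prices below are hypothetical placeholders chosen only to match the "dollars versus cents" scale. They are not figures from the report or from any provider's price list. The utilization values (5% for GPU, 8% for CPU) and the 15% increase are the ones quoted in the article.

```python
HOURS_PER_MONTH = 730  # assumption: average number of hours in a month


def idle_cost(hourly_price: float, utilization: float,
              hours: float = HOURS_PER_MONTH) -> float:
    """Cost of the unused share of one resource over a period.

    hourly_price: price per resource-hour (hypothetical values below).
    utilization:  fraction of the resource actually used, in [0, 1].
    """
    if not 0 <= utilization <= 1:
        raise ValueError("utilization must be a fraction in [0, 1]")
    return hourly_price * (1 - utilization) * hours


# Hypothetical prices, for illustration only.
gpu_price = 3.00   # dollars per GPU-hour (assumed)
cpu_price = 0.04   # dollars per vCPU-hour (assumed)

gpu_idle = idle_cost(gpu_price, utilization=0.05)
cpu_idle = idle_cost(cpu_price, utilization=0.08)

# Apply the 15% GPU price increase cited in the article to the assumed price.
gpu_idle_after_increase = idle_cost(gpu_price * 1.15, utilization=0.05)

print(f"Idle cost per GPU per month:  ${gpu_idle:,.2f}")
print(f"Idle cost per vCPU per month: ${cpu_idle:,.2f}")
print(f"Idle cost per GPU after +15%: ${gpu_idle_after_increase:,.2f}")
```

With any prices on these scales, each idle GPU costs orders of magnitude more than each idle vCPU, and a price increase scales the waste proportionally.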

“At 5% utilization, the math doesn’t work,” said Laurent Gil, co-founder and President of Cast AI. Gil emphasized that the trend of overprovisioning stems from a preference for perceived safety over resource efficiency.

While some organizations have achieved higher GPU utilization rates (one reported 49% utilization on H200s and 30% on H100s), most are not using existing solutions such as automated rightsizing, GPU sharing, and Spot management, so overprovisioning continues.

Cast AI’s data indicates that many companies remain reluctant to change long-standing operational practices, even at the cost of higher expenses. A shift toward treating resource efficiency as a continuous automated process is necessary to mitigate these inefficiencies.


COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.