Dataconomy

OpenAI unveils GPT-5.4 Pro and Thinking models

The new model sets a benchmark record by matching or exceeding human performance 83% of the time on complex tasks in industries like law and finance.

by Emre Çıtak
March 6, 2026
in Artificial Intelligence, News

OpenAI released GPT-5.4 on Thursday, introducing a new foundation model available in standard, Thinking, and Pro versions.

The launch introduces a model with a 1 million token context window and improved token efficiency, targeting professional workloads. The release includes new benchmark records and a system to manage tool calling within the API.

GPT-5.4 is available in three versions: standard, a reasoning model (GPT-5.4 Thinking), and an optimized high-performance version (GPT-5.4 Pro). The API version supports context windows as large as 1 million tokens, the largest available from OpenAI. OpenAI stated GPT-5.4 solves the same problems with significantly fewer tokens than its predecessor.

The model achieved record scores in computer-use benchmarks OSWorld-Verified and WebArena Verified. It scored a record 83% on OpenAI’s GDPval test for knowledge work tasks. GPT-5.4 also took the lead on Mercor’s APEX-Agents benchmark, which tests professional skills in law and finance.

Mercor CEO Brendan Foody stated that GPT-5.4 excels at creating long-horizon deliverables such as slide decks, financial models, and legal analysis. Foody said the model delivers top performance while running faster and at lower cost than competitive frontier models.

OpenAI reported that individual claims from GPT-5.4 are 33% less likely to contain errors than those from GPT-5.2, and that overall responses are 18% less likely to contain errors.

OpenAI also introduced Tool Search, a new system for managing tool calling in the API that allows models to look up tool definitions as needed. Tool Search reduces token use and improves speed and cost in systems with many tools.

OpenAI added a new safety evaluation to test chain-of-thought (CoT) monitoring, addressing concerns that reasoning models could misrepresent their reasoning process. The new evaluation shows deception is less likely in the GPT-5.4 Thinking version. OpenAI stated this suggests the model lacks the ability to hide its reasoning and that CoT monitoring remains an effective safety tool.
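
To make the Tool Search idea more concrete, the short Python sketch below illustrates the general pattern of looking up tool definitions only when they are needed, rather than sending every definition up front. It is not OpenAI's API or implementation: the `ToolDefinition` and `ToolRegistry` classes, the `definition_size` helper, and the three sample tools are all hypothetical names invented for illustration, and character count stands in as a rough proxy for token use.

```python
from dataclasses import dataclass, field


@dataclass
class ToolDefinition:
    """Full definition of a tool (hypothetical shape, for illustration only)."""
    name: str
    summary: str
    parameters: dict = field(default_factory=dict)


class ToolRegistry:
    """Holds every tool definition, but hands them out only on request."""

    def __init__(self, tools):
        self._tools = {tool.name: tool for tool in tools}

    def index(self):
        """Names only: the small listing that could be sent up front."""
        return sorted(self._tools)

    def all_tools(self):
        """Every full definition, i.e. what an eager approach would send."""
        return list(self._tools.values())

    def search(self, query):
        """Return full definitions whose name or summary mentions the query."""
        needle = query.lower()
        return [
            tool for tool in self._tools.values()
            if needle in tool.name.lower() or needle in tool.summary.lower()
        ]


def definition_size(tools):
    """Crude size proxy: character count of the rendered definitions."""
    return sum(len(repr(tool)) for tool in tools)


# Hypothetical sample tools; none of these come from the article.
REGISTRY = ToolRegistry([
    ToolDefinition("get_weather", "Look up the weather forecast for a city",
                   {"city": "string"}),
    ToolDefinition("send_email", "Send an email message",
                   {"to": "string", "body": "string"}),
    ToolDefinition("query_ledger", "Run a query against the finance ledger",
                   {"sql": "string"}),
])

if __name__ == "__main__":
    matches = REGISTRY.search("weather")
    print([tool.name for tool in matches])
    print(definition_size(matches), "vs", definition_size(REGISTRY.all_tools()))
```

In this toy setup, a request about the weather pulls in one definition instead of three, which is the kind of saving the article attributes to Tool Search in systems with many tools. How OpenAI's system actually matches and loads definitions is not described in the article.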


Tags: Featured, GPT-5.4 Pro, GPT-5.4 Thinking, OpenAI
