Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
  • AI
  • Tech
  • Cybersecurity
  • Finance
  • DeFi & Blockchain
  • Startups
  • Gaming
Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
Dataconomy
No Result
View All Result

GPT-5.2 still counts two r’s in strawberry

This tokenization pattern affects similar words. Raspberry divides into comparable tokens, resulting in ChatGPT reporting two r's for that word as well.

byKerem Gülen
December 15, 2025
in Artificial Intelligence, News
Home News Artificial Intelligence
Share on FacebookShare on TwitterShare on LinkedInShare on WhatsAppShare on e-mail
Google Preferred Source

ChatGPT, powered by OpenAI’s GPT-5.2 model released in December 2025, incorrectly identifies two r’s in the word strawberry, which contains three, because its tokenization process splits the word into st-raw-berry, with only two tokens containing r’s.

Modern AI systems demonstrate proficiency in generating unique marketing images, compiling reports via agentic browsers, and producing chart-topping songs. These capabilities highlight extensive training on vast datasets, enabling pattern recognition for complex outputs. In contrast, certain basic tasks challenge these models. Counting letters in a single word represents one such task, accessible to a seven-year-old child without difficulty.

The specific question under examination asks how many r’s appear in strawberry. The word strawberry consists of the letters s-t-r-a-w-b-e-r-r-y. Visual inspection confirms three r’s: one after t, and two consecutive in the berry portion. This query has persisted as a test for AI performance over multiple model iterations.

Stay Ahead of the Curve!

Don't miss out on the latest insights, trends, and analysis in the world of data, technology, and startups. Subscribe to our newsletter and get exclusive content delivered straight to your inbox.

Following the December 2025 release of GPT-5.2, tests confirmed ChatGPT’s response remained two r’s. Previous versions exhibited uncertainty or erratic behavior on this question. The latest model delivered a direct answer of two, without deviation. This outcome persists despite investments exceeding billions of dollars, elevated hardware demands including RAM price increases, and substantial global water consumption linked to training infrastructure.

The issue stems from the tokenized input-output design of large language models like ChatGPT. Input text undergoes division into tokens, which are chunks such as whole words, syllables, or word parts. The model processes these tokens rather than individual letters. Consequently, letter counting relies on token contents rather than precise letter enumeration.

The OpenAI Tokenizer tool illustrates this process. Entering strawberry yields three tokens: st, raw, berry. The first token st contains no r. The second token raw includes one r. The third token berry includes two r’s but functions as a single token. The model associates r’s with two tokens, leading to the count of two.

This tokenization pattern affects similar words. Raspberry divides into comparable tokens, resulting in ChatGPT reporting two r’s for that word as well. The berry token compresses multiple letters into one unit, undervaluing individual letter instances within it.

ChatGPT operates as a prediction engine, leveraging patterns from training data to anticipate subsequent elements. GPT-5.x incorporates the o200k_harmony tokenization method, introduced with OpenAI o4-mini and GPT-4o models. This updated scheme aims for efficiency but retains the strawberry r-counting discrepancy.

ChatGPT launched in late 2022 amid numerous token-based challenges. Specific phrases triggered excessive responses or processing failures. OpenAI addressed many through training adjustments and system enhancements over subsequent years.

Verification tests on classic problems showed improvements. ChatGPT accurately spells Mississippi, identifying letters m-i-s-s-i-s-s-i-p-p-i with correct frequencies: one m, four i’s, four s’s, two p’s. It also reverses lollipop to popillol, preserving all letters in proper sequence.

Large language models exhibit persistent limitations in exact counting of small quantities. They perform well in mathematics and problem-solving but falter on precise tallies of letters or words in brief strings.

A notable historical example involves the string solidgoldmagikarp. In GPT-3, this phrase disrupted tokenization, causing erratic outputs including user insults and unintelligible text.

Querying GPT-5.2 on solidgoldmagikarp produced a hallucination. The model described it as a secret Pokémon joke embedded in GitHub repositories by developers. Activation allegedly transforms avatars, repository icons, and other features into Pokémon-themed elements. This claim lacks basis in reality and reflects residual effects from prior tokenization issues.

Comparative tests across other AI models yielded correct results for the strawberry question. Perplexity counted three r’s. Claude provided the accurate count of three.

Grok identified three r’s in strawberry. Gemini answered correctly with three. Qwen confirmed three r’s.

Copilot also reported three r’s. These models employ distinct tokenization systems, enabling accurate letter identification even when powered by OpenAI’s underlying architectures.


Featured image credit

Tags: chatgptgpt-5.2openAI

Related Posts

Advanced SEO services for high impact digital strategies

Advanced SEO services for high impact digital strategies

June 8, 2026
The 8 best website builders for small businesses on any budget

The 8 best website builders for small businesses on any budget

June 8, 2026
Why European workloads are leaving US cloud in 2026

Why European workloads are leaving US cloud in 2026

June 8, 2026
Being friendly to your AI might be the least eco-friendly thing you can do

Being friendly to your AI might be the least eco-friendly thing you can do

June 8, 2026
Jensen Huang says AI is expanding software demand rather than replacing jobs

Jensen Huang says AI is expanding software demand rather than replacing jobs

June 8, 2026
Halo: Campaign Evolved is now available for pre-order ahead of its July launch

Halo: Campaign Evolved is now available for pre-order ahead of its July launch

June 8, 2026

LATEST NEWS

Advanced SEO services for high impact digital strategies

The 8 best website builders for small businesses on any budget

Why European workloads are leaving US cloud in 2026

Being friendly to your AI might be the least eco-friendly thing you can do

Jensen Huang says AI is expanding software demand rather than replacing jobs

Halo: Campaign Evolved is now available for pre-order ahead of its July launch

BEST AI MODELS LEADERBOARD

See the best AI models, ranked by intelligence, benchmark results, speed and token price. Find the most suitable LLMs, Text-to-Image, Image Editing, Text-to-Speech, Text-to-Video and Image-to-Video  artificial intelligence model for your tasks and business.

LATEST TOOLS

Roboto AI

Pickaxe

Pfpmaker

MindPal

Syllaby

ScreenApp

FinanceBrain

GitHub Spark

Hints

VisionStory AI

Dataconomy

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.

  • About
  • Imprint
  • Contact
  • Legal & Privacy

Follow Us

  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI tools
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
No Result
View All Result
Subscribe

This website uses cookies to improve your experience. You can choose to accept or reject them. Visit our Privacy Policy.