Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • About
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
  • AI
  • Tech
  • Cybersecurity
  • Finance
  • DeFi & Blockchain
  • Startups
  • Gaming
Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • About
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
Dataconomy
No Result
View All Result

GPT-5.2 still counts two r’s in strawberry

This tokenization pattern affects similar words. Raspberry divides into comparable tokens, resulting in ChatGPT reporting two r's for that word as well.

byKerem Gülen
December 15, 2025
in Artificial Intelligence, News
Home News Artificial Intelligence
Share on FacebookShare on TwitterShare on LinkedInShare on WhatsAppShare on e-mail

ChatGPT, powered by OpenAI’s GPT-5.2 model released in December 2025, incorrectly identifies two r’s in the word strawberry, which contains three, because its tokenization process splits the word into st-raw-berry, with only two tokens containing r’s.

Modern AI systems demonstrate proficiency in generating unique marketing images, compiling reports via agentic browsers, and producing chart-topping songs. These capabilities highlight extensive training on vast datasets, enabling pattern recognition for complex outputs. In contrast, certain basic tasks challenge these models. Counting letters in a single word represents one such task, accessible to a seven-year-old child without difficulty.

The specific question under examination asks how many r’s appear in strawberry. The word strawberry consists of the letters s-t-r-a-w-b-e-r-r-y. Visual inspection confirms three r’s: one after t, and two consecutive in the berry portion. This query has persisted as a test for AI performance over multiple model iterations.

Stay Ahead of the Curve!

Don't miss out on the latest insights, trends, and analysis in the world of data, technology, and startups. Subscribe to our newsletter and get exclusive content delivered straight to your inbox.

Following the December 2025 release of GPT-5.2, tests confirmed ChatGPT’s response remained two r’s. Previous versions exhibited uncertainty or erratic behavior on this question. The latest model delivered a direct answer of two, without deviation. This outcome persists despite investments exceeding billions of dollars, elevated hardware demands including RAM price increases, and substantial global water consumption linked to training infrastructure.

The issue stems from the tokenized input-output design of large language models like ChatGPT. Input text undergoes division into tokens, which are chunks such as whole words, syllables, or word parts. The model processes these tokens rather than individual letters. Consequently, letter counting relies on token contents rather than precise letter enumeration.

The OpenAI Tokenizer tool illustrates this process. Entering strawberry yields three tokens: st, raw, berry. The first token st contains no r. The second token raw includes one r. The third token berry includes two r’s but functions as a single token. The model associates r’s with two tokens, leading to the count of two.

This tokenization pattern affects similar words. Raspberry divides into comparable tokens, resulting in ChatGPT reporting two r’s for that word as well. The berry token compresses multiple letters into one unit, undervaluing individual letter instances within it.

ChatGPT operates as a prediction engine, leveraging patterns from training data to anticipate subsequent elements. GPT-5.x incorporates the o200k_harmony tokenization method, introduced with OpenAI o4-mini and GPT-4o models. This updated scheme aims for efficiency but retains the strawberry r-counting discrepancy.

ChatGPT launched in late 2022 amid numerous token-based challenges. Specific phrases triggered excessive responses or processing failures. OpenAI addressed many through training adjustments and system enhancements over subsequent years.

Verification tests on classic problems showed improvements. ChatGPT accurately spells Mississippi, identifying letters m-i-s-s-i-s-s-i-p-p-i with correct frequencies: one m, four i’s, four s’s, two p’s. It also reverses lollipop to popillol, preserving all letters in proper sequence.

Large language models exhibit persistent limitations in exact counting of small quantities. They perform well in mathematics and problem-solving but falter on precise tallies of letters or words in brief strings.

A notable historical example involves the string solidgoldmagikarp. In GPT-3, this phrase disrupted tokenization, causing erratic outputs including user insults and unintelligible text.

Querying GPT-5.2 on solidgoldmagikarp produced a hallucination. The model described it as a secret Pokémon joke embedded in GitHub repositories by developers. Activation allegedly transforms avatars, repository icons, and other features into Pokémon-themed elements. This claim lacks basis in reality and reflects residual effects from prior tokenization issues.

Comparative tests across other AI models yielded correct results for the strawberry question. Perplexity counted three r’s. Claude provided the accurate count of three.

Grok identified three r’s in strawberry. Gemini answered correctly with three. Qwen confirmed three r’s.

Copilot also reported three r’s. These models employ distinct tokenization systems, enabling accurate letter identification even when powered by OpenAI’s underlying architectures.


Featured image credit

Tags: chatgptgpt-5.2openAI

Related Posts

How Zesty uses AI to find your next meal

How Zesty uses AI to find your next meal

December 17, 2025
YouTube Gaming opens Playables Builder beta with Gemini 3

YouTube Gaming opens Playables Builder beta with Gemini 3

December 17, 2025
Watch Instagram Reels on TV with new Fire TV app

Watch Instagram Reels on TV with new Fire TV app

December 17, 2025
Netflix secures 14 iHeartMedia video podcasts for 2026

Netflix secures 14 iHeartMedia video podcasts for 2026

December 17, 2025
Google launches email assistant CC powered by Gemini

Google launches email assistant CC powered by Gemini

December 17, 2025
Steam Replay 2025 reveals your top games of the year

Steam Replay 2025 reveals your top games of the year

December 17, 2025

LATEST NEWS

How Zesty uses AI to find your next meal

YouTube Gaming opens Playables Builder beta with Gemini 3

Watch Instagram Reels on TV with new Fire TV app

Netflix secures 14 iHeartMedia video podcasts for 2026

Google launches email assistant CC powered by Gemini

Steam Replay 2025 reveals your top games of the year

Dataconomy

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.

  • About
  • Imprint
  • Contact
  • Legal & Privacy

Follow Us

  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
  • AI tools
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • About
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
No Result
View All Result
Subscribe

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy Policy.