Google AI Overviews Face Scrutiny Over Basic Spelling and Letter Counting Errors
Despite advanced reasoning capabilities, Google's flagship search AI struggles with simple word mechanics due to its underlying transformer architecture.
Primary source: TechCrunch AI. Full source links and update notes are below.
Fast summary
Start here
- Google's AI Overview feature has recently produced errors claiming 'Google' has two 'p's and 'poop' has one 'r'.
- Google acknowledged that counting letters within words is a known challenge for Large Language Models (LLMs) and stated it is working on a fix.
- Experts explain that these errors occur because AI models process text as 'tokens' or numerical encodings rather than reading individual letters.

What happened
Google's generative AI feature in Search, known as AI Overviews, has been observed making elementary spelling and counting mistakes. Examples include the AI claiming there are two 'p's in the word 'Google' and misidentifying the count of letters in 'journalism' while spelling it as 'j-o-u-r-n-a-d-i-s-m.' The AI also struggled with names, identifying the correct letter count in a public figure's name but spelling the name incorrectly as 't-r-p-u-m.'
What's new in this update
In a statement to TechCrunch, Google confirmed that counting within words is a known challenge for large language models. The company indicated it is working to fix these specific issues as it continues to roll out generative AI features to its search engine. This acknowledgment comes as Google doubles down on making AI the centerpiece of its search product, despite recurring accuracy concerns.
Key details
The errors extend beyond simple counting. In one instance, a search for the word 'disregard' prompted a dictionary-style response that actually mirrored a chatbot's refusal message: 'Understood. Let me know whenever you have a new prompt or question!' These types of hallucinations are tied to the transformer architecture of LLMs. Instead of reading text character by character, the models use 'tokens'—syllables or word fragments—which are converted into numerical representations.
Background and context
This is not the first time Google’s AI-forward search overhaul has faced public criticism. Previous iterations of AI Overviews were found citing satirical content from The Onion and Reddit, leading to dangerous suggestions such as advising users to put glue on pizza. The spelling issue, often referred to as the 'strawberry' test (where models fail to count the 'r's in the word strawberry), has been a running joke in the AI industry for years, highlighting a persistent technical hurdle.
What to watch next
Researchers remain divided on whether a 'perfect' tokenizer can ever be achieved. Experts like Matthew Guzdial from the University of Alberta and Sheridan Feucht from Northeastern University suggest that while models may improve, the inherent 'fuzziness' of tokenization makes letter-perfect accuracy difficult for LLMs. Google is expected to continue patching these specific errors as they arise, but the architectural limitation remains a core challenge for the industry.
Why it matters
The inability of advanced AI to perform basic spelling tasks reveals a fundamental disconnect between how machines process data and how humans understand language.
Read next
Follow this story through the topic hub, more ai coverage, and the latest updates.
Weekly briefing
Get the week's key developments in one concise email.
Get a fast catch-up on the biggest stories, the context behind them, and the links worth your time.
Cadence
Weekly, for a quick catch-up
Coverage
AI, business, world, security, sports
Format
Clear takeaways and useful context
Request the briefing
Leave your email to open a prepared request and get on the list for the weekly briefing.
Author
See who assembled this story and follow more of their work.
Sources and methodology