A recent study by Legal Guardian Digital has revealed significant discrepancies in the accuracy of popular chatbots, warning of a phenomenon known as “hallucination,” in which users can be misled by completely false information.
Setting aside the technical complexities, Large Language Models (LLMs) rely on statistical patterns to predict the next word. When a model cannot find an accurate pattern for the answer, it constructs words that seem statistically plausible but lack factual grounding. This means the bot isn’t intentionally lying; it’s simply executing its programming, trying to provide an answer even when it lacks the necessary information.
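The mechanism can be illustrated with a deliberately tiny sketch: a bigram model that always emits the statistically most likely next word, even for a context it has never seen. The corpus, function names, and fallback rule here are all illustrative assumptions, not how any of the chatbots in the study actually work, but they show why a purely statistical predictor produces a fluent answer rather than admitting ignorance.

```python
from collections import Counter, defaultdict

# Tiny illustrative corpus; a real LLM trains on trillions of tokens.
corpus = (
    "the capital of france is paris . "
    "the capital of italy is rome . "
    "the capital of france is paris ."
).split()

# Count bigrams: for each word, how often each following word occurs.
bigrams = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    bigrams[prev][nxt] += 1

def predict_next(word):
    """Return the statistically most likely next word; if the word was
    never seen, fall back to the most common word overall (a hypothetical
    rule chosen here to mimic 'always answer something')."""
    if word in bigrams:
        return bigrams[word].most_common(1)[0][0]
    # Unseen context: the model still emits *something* fluent-looking.
    # This "never refuse, always continue" behavior is the toy analogue
    # of a hallucinated answer.
    return Counter(corpus).most_common(1)[0][0]

print(predict_next("capital"))   # well-supported by the data
print(predict_next("germany"))   # never seen: the answer is invented
```

The second call is the point: the model has no data about "germany", yet it still returns a confident-looking word, because nothing in its objective rewards saying "I don't know."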
The study revealed a surprising finding regarding Google Gemini, which topped the list of the most “hallucinatory” chatbots with an error rate of 32%. These figures may be a cause for concern for Apple, which reportedly pays Google $1 billion annually to use a customized version of Gemini to enhance Siri’s engine in the upcoming iOS 27 operating system.
ChatGPT came in second in terms of error rate, providing inaccurate information in 30% of its responses—double the error rate of its Chinese competitor, DeepSeek.
On the other hand, Perplexity AI proved to be the most reliable, with a hallucination rate of just 13%, followed by DeepSeek at 14%, and Elon Musk’s Grok at 15%.
The study indicated that accuracy is not the only criterion; availability is also crucial. Perplexity and Grok were the only two engines to maintain 100% uptime throughout the study period. ChatGPT achieved an availability rate of 99.98%, while Claude came in last at 99.68%, which is still a very reliable figure.

