Meter.net News Why can't ChatGPT calculate? The secret of mathematical errors revealed

Why can't ChatGPT calculate? The secret of mathematical errors revealed

Artificial intelligence writes poems but struggles with math. Why can't ChatGPT and other chatbots handle even basic arithmetic? We reveal the causes of AI's mathematical mistakes, from tokenization which breaks numbers into unintelligible fragments, to the statistical learning approach that fails in mathematics.

Why can't ChatGPT calculate? The secret of mathematical errors revealed

Artificial intelligence, including ChatGPT, can write poems, compose music, and translate texts. Yet, it often stumbles on simple mathematical tasks. Why can't a chatbot, that handles complex language tasks, deal with math at an elementary school level?

Tokenization: When numbers break into pieces

One of the key problems is tokenization. This process divides data into smaller parts, called tokens. Imagine it like assembling a puzzle, where words are broken down into syllables. The tokenizer, the AI model responsible for this process, does not understand the meaning of numbers.

It may happen that the number 380 is perceived as one token, while 381 is perceived as two (38 and 1). This disrupts the relationships between digits and complicates the calculation.

A statistical machine that falters with numbers

Another reason for ChatGPT's mathematical difficulties is its statistical nature. The chatbot learns based on a vast amount of examples and looks for patterns in them. For instance, it learns that the phrase "Dear Sir" is often followed by the phrase "we are reaching out to you".

However, this approach faces challenges in mathematics. ChatGPT can guess that the product of numbers ending in 2 will end in 4, but it cannot handle intermediate results. Simply put, the ChatGPT model tries to guess the result based on learned patterns instead of performing a precise calculation.

The challenge of multiplication

A study conducted by Yuntian Deng from the University of Waterloo showed that ChatGPT struggles with multiplying numbers greater than four digits. The reason is that any error in a calculation step shows up in the final result.

Imagine it as a domino effect – one error triggers a chain reaction, and the result is completely off. However, there is hope that ChatGPT will improve in the future. Deng and his colleagues also tested the o1 model from OpenAI, which is characterized by logical reasoning capabilities.

This model achieved significantly better results than the standard GPT-4o and was able to correctly solve multiplications of nine-digit numbers. The o1 model thinks through the problem step by step, allowing for more accurate results.

Ransomware attacks are among the most common cyber threats today and are no longer just a concern for large companies. Just one careless click can cost you access to your data. We'll explain what ransomware is, how it works, and why attackers are increasingly targeting average users and smaller businesses.

Connecting to the internet today is not just a matter of speed but also of trust. Networks handle more and more devices, sensitive data, and services that communicate almost constantly. This is why an approach called zero trust has emerged, where nothing is automatically considered safe. The article explains why this model was created and how it subtly influences everyday internet functioning.

It happens that the battery life decreases faster than one would expect, even when the device is not particularly stressed during the day. Often, it's not a single specific error but a sum of small influences that gradually accumulate. This article explains what has the greatest impact on battery life, when idle draining occurs, and why this can turn into an issue causing the phone not to last even one day.

Connecting to the internet during a flight is no longer an exception, but it still doesn't work as you're used to at home. In-flight Wi-Fi depends on the technology used, the type of aircraft, and network congestion, and paying for access doesn't always mean fast connection. In this article, we'll explain how in-flight internet works, why it can be slow, and when you can rely on it.

RCS chat is appearing more frequently in mobile devices, especially with the arrival of iOS 18 and the gradual phasing out of older networks. It is a method of communication that uses the internet, offering higher quality sharing of photos and videos than traditional SMS. We'll explain what RCS means and when it makes sense to keep it enabled.

A first mobile phone can make a child's communication and daily orientation easier, but it also opens topics that are good to address in advance. These include safety, sharing, communication, or screen time. How to prepare a child for their first phone so that they use it safely, calmly, and wisely?