How A.I. Chatbots Like ChatGPT and DeepSeek Reason

In September, OpenAI introduced an enhanced version of ChatGPT, engineered to tackle complex tasks in domains like mathematics, science, and computer programming. This advanced iteration of the chatbot differentiates itself from its predecessors by allocating processing time to “deliberate” on intricate problems before generating a response. This marked a significant step in artificial intelligence reasoning capabilities.

Understanding AI Reasoning

The company soon asserted that its novel reasoning technology surpassed leading industry systems in evaluations designed to measure artificial intelligence advancement.

Currently, other tech companies, including Google, Anthropic, and DeepSeek from China, are also developing similar technologies, highlighting a growing trend in sophisticated chatbot development.

This raises fundamental questions: Can A.I. genuinely reason like humans? What constitutes “thinking” for a computer? Are these sophisticated systems approaching true artificial general intelligence?

This guide aims to provide clarity on these topics.

vCard QR Code

vCard.red is a free platform for creating a mobile-friendly digital business cards. You can easily create a vCard and generate a QR code for it, allowing others to scan and save your contact details instantly.

The platform allows you to display contact information, social media links, services, and products all in one shareable link. Optional features include appointment scheduling, WhatsApp-based storefronts, media galleries, and custom design options.

Defining Reasoning in AI Systems

In the context of AI, “reasoning” signifies that a chatbot dedicates additional computational time to process a given problem.

Dan Klein, a computer science professor at the University of California, Berkeley, and CTO of AI startup Scaled Cognition, explains, “Reasoning occurs when the system engages in extra processing after a query is posed.”

This process may involve dissecting a problem into smaller, manageable steps or employing trial-and-error methodologies to reach a solution.

While the original ChatGPT provided immediate answers, these new reasoning systems can spend several seconds, or even minutes, analyzing a problem before delivering a response, reflecting a deeper level of cognitive processing.

Elaborating on AI Reasoning Processes

In specific instances, a reasoning system will refine its strategy to address a question. This involves iterative attempts to enhance the chosen methodology. At other times, the system could explore multiple problem-solving approaches before selecting a particular one. Furthermore, it might revisit and verify previous steps to ensure accuracy.

Essentially, the system employs various techniques to effectively address user queries.

This AI reasoning process is analogous to a student grappling with a challenging math problem, exploring different solutions on paper.

Applications of AI Reasoning

Reasoning in AI holds broad applicability but proves particularly effective for inquiries related to mathematics, science, and computer programming.

Distinguishing Reasoning Chatbots from Earlier Models

Previous generation chatbots could be instructed to explain their solution process or validate their work. Given that the initial ChatGPT was trained on internet text, which includes examples of problem-solving demonstrations and self-verification, it was capable of similar self-reflective actions.

However, a reasoning system surpasses these capabilities. It can perform these operations autonomously and in more complex and extensive ways, without explicit prompting.

The term “reasoning system” is used by companies as it reflects the system’s enhanced ability to simulate human-like thought processes when tackling difficult problems.

The Current Importance of AI Reasoning

Organizations like OpenAI consider AI reasoning as the most promising avenue for enhancing chatbot capabilities.

For years, the prevailing strategy for improving these systems relied on a straightforward principle: augmenting the volume of internet data fed into the chatbots to improve performance. However, this approach is reaching its limits.

The Shift Towards Reasoning Systems

By 2024, readily available internet text resources were largely exhausted for training purposes.

This data saturation necessitated the development of new methods for enhancing chatbot intelligence, prompting the focus on building reasoning systems.

Constructing AI Reasoning Systems

Recently, companies such as OpenAI have significantly invested in a technique known as reinforcement learning to develop these systems.

This process, which can span several months, enables an A.I. system to acquire behaviors through extensive trial and error. For example, by processing numerous math problems, it discerns effective methodologies from ineffective ones for achieving correct answers.

Researchers have engineered sophisticated feedback mechanisms to signal correct and incorrect actions to the system.

Jerry Tworek, an OpenAI researcher, illustrates this process: “It is somewhat akin to training a dog. Positive outcomes are rewarded, while negative outcomes are discouraged.”

(OpenAI and its partner, Microsoft, faced a lawsuit from The New York Times in December, alleging copyright infringement related to the use of news content in A.I. systems.)

Efficacy of Reinforcement Learning in AI Reasoning

Reinforcement learning demonstrates considerable effectiveness in specific domains, such as mathematics, science, and computer programming. These fields allow for clear definitions of desired and undesired outcomes, as mathematical problems have definitive solutions.

However, reinforcement learning is less effective in areas such as creative writing, philosophy, and ethics, where the distinction between desirable and undesirable results is more subjective. Despite these limitations, researchers indicate that this method generally improves an A.I. system’s overall performance, including when addressing questions outside of STEM fields.

Jared Kaplan, chief science officer at Anthropic, notes, “The system progressively learns which reasoning patterns are productive and which are not.”

Reinforcement Learning vs. Reasoning Systems

Reinforcement learning is not synonymous with reasoning systems. Rather, it is the methodology employed by companies to develop these systems. It represents the training phase that ultimately empowers chatbots to engage in reasoning.

Limitations of Current Reasoning Systems

It is important to note that these reasoning systems are not infallible and can still produce errors. All chatbot operations are rooted in probabilities. Systems select pathways that most closely align with their training data, whether from internet sources or reinforcement learning processes. This probabilistic nature sometimes leads to incorrect or nonsensical outputs.

Future of AI and Human-Level Intelligence

A.I. experts hold divergent views on whether this represents a definitive path toward machines achieving human-level intelligence. These techniques are still relatively nascent, and their ultimate boundaries are yet to be fully understood. Historically, advancements in A.I. often exhibit rapid initial progress followed by a deceleration in development.


🕐 Top News in the Last Hour By Importance Score

# Title 📊 i-Score
1 For Scale 🔴 78 / 100
2 Nine killed in Russian attack on Ukraine bus 🔴 75 / 100
3 White Bengal tiger mauls zookeeper to death after he mistakenly enters its cage at feeding time 🔴 65 / 100
4 Dozens of members of UK Jewish body facing disciplinary action over criticism of Israel 🔴 65 / 100
5 Inside Freddie Mercury's £37.5million will from lover's feud to sister's anger 🔴 62 / 100
6 Prime Minister salutes Pope Francis as 'extraordinary man' 🔵 60 / 100
7 Richard Gere's wife reveals why they are planning a RETURN to the US months after relocating to Spain 🔵 50 / 100
8 Bride skis down the aisle in her wedding dress during magical mountain ceremony 🔵 45 / 100
9 Book publishers see surging interest in the U.S. Constitution and print new editions 🔵 35 / 100
10 Boohoo shares jump as broker eyes re-rating after Debenhams rebrand 🔵 35 / 100

View More Top News ➡️