Here’s how OpenAI plans to cleanse ChatGPT of false information

By Sarah Franks 1 June, 2023 2 mins read 327 Views

OpenAI announced on May 31st, its efforts to enhance ChatGPT's mathematical problem-solving capabilities, aiming to reduce instances of artificial intelligence (AI) hallucinations. OpenAI emphasized mitigating hallucinations as a crucial step towards developing aligned AGI.

In March, the introduction of the latest version of ChatGPT, GPT-4, further propelled artificial intelligence into the mainstream. However, generative AI chatbots have long grappled with factual accuracy, occasionally generating false information, commonly referred to as "hallucinations." The efforts to reduce these AI hallucinations were announced through a post on their website.

AI hallucinations refer to instances where artificial intelligence systems generate outputs that are factually incorrect, misleading or unsupported by real-world data. These hallucinations can manifest in various forms, such as generating false information, making up nonexistent events or people or providing inaccurate details about certain topics.

OpenAI conducted research to examine the effectiveness of two types of feedback– "outcome supervision" and "process supervision." Outcome supervision involves feedback based on the final result, while process supervision provides input for each step in a chain of thought. OpenAI evaluated these models using math problems, generating multiple solutions and selecting the highest-ranked solution according to each feedback model.

After thorough analysis, the research team found that process supervision yielded a superior performance as it encouraged the model to adhere to a human-approved process. In contrast, outcome supervision proved more challenging to scrutinize consistently.

OpenAI recognized that the implications of process supervision extend beyond mathematics, and further investigation is necessary to understand its effects in different domains. It expressed the possibility that if the observed outcomes hold true in broader contexts, process supervision could offer a favorable combination of performance and alignment compared to outcome supervision. To facilitate research, the company publicly released the complete dataset of process supervision, inviting exploration and study in this area.

Related: AI demand briefly catapults Nvidia into $1T club

Although OpenAI did not provide explicit instances that prompted their investigation into hallucinations, two recent occurrences exemplified the problem in real-life scenarios.

In a recent incident, lawyer Steven A. Schwartz in the Mata v. Avianca Airlines case acknowledged relying on the chatbot as a research resource. However, the information provided by ChatGPT turned out to be entirely fabricated, highlighting the issue at hand.

OpenAI's ChatGPT is not the sole example of artificial intelligence systems encountering hallucinations. Microsoft's AI, during a demonstration of its chatbot technology in March, examined earnings reports and generated inaccurate figures for companies like Gap and Lululemon.

Magazine: 25K traders bet on ChatGPT’s stock picks, AI sucks at dice throws, and more

Title: Here’s how OpenAI plans to cleanse ChatGPT of false information
Sourced From: cointelegraph.com/news/here-s-how-openai-plans-to-cleanse-chatgpt-from-false-information
Published Date: Thu, 01 Jun 2023 11:02:20 +0100

mathskills processsupervision

Reforming ECHR Rules for Border Control: A Nuanced Perspective

5 September, 2025 1,501 Views

The Complexities of Mental Health Discourse amidst Economic Challenges: A Nuanced Analysis

5 September, 2025 2,821 Views

Analysis: Disruption Strikes PS5 Gamers as Hollow Knight: Silksong Launches

4 September, 2025 2,853 Views

Examining the Ethics Dilemma Surrounding Angela Rayner's Tax Controversy

4 September, 2025 2,858 Views

Analysis of a Young Mother's Brush with Deadly Cancer Reveals Startling Symptoms

4 September, 2025 2,764 Views

Complexities of Taxation Ethics: Angela Rayner's Property Controversy

4 September, 2025 2,792 Views

Reforming ECHR Rules for Border Control: A Nuanced Perspective

The Complexities of Mental Health Discourse amidst Economic Challenges: A Nuanced Analysis

Analysis: Disruption Strikes PS5 Gamers as Hollow Knight: Silksong Launches

Examining the Ethics Dilemma Surrounding Angela Rayner's Tax Controversy

Analysis of a Young Mother's Brush with Deadly Cancer Reveals Startling Symptoms

Here’s how OpenAI plans to cleanse ChatGPT of false information

Latest Posts

Reforming ECHR Rules for Border Control: A Nuanced Perspective

The Complexities of Mental Health Discourse amidst Economic Challenges: A Nuanced Analysis

Analysis: Disruption Strikes PS5 Gamers as Hollow Knight: Silksong Launches

Examining the Ethics Dilemma Surrounding Angela Rayner's Tax Controversy

Analysis of a Young Mother's Brush with Deadly Cancer Reveals Startling Symptoms

Complexities of Taxation Ethics: Angela Rayner's Property Controversy

Trending Posts

Politics Latest

Reforming ECHR Rules for Border Control: A Nuanced Perspective

The Complexities of Mental Health Discourse amidst Economic Challenges: A Nuanced Analysis

Examining the Ethics Dilemma Surrounding Angela Rayner's Tax Controversy

Coronavirus Latest

Surprising symptom of new Covid strain you could get at night

Vital first steps to take after monkeypox infection & top sign you have the virus revealed by expert as US cases hit 700

Omicron sub-variants drive Covid cases up for fifth week in a row – with 2.7m infected

Popular Tags

Newsletter

Here’s how OpenAI plans to cleanse ChatGPT of false information

Share This

Latest Posts

Reforming ECHR Rules for Border Control: A Nuanced Perspective

The Complexities of Mental Health Discourse amidst Economic Challenges: A Nuanced Analysis

Analysis: Disruption Strikes PS5 Gamers as Hollow Knight: Silksong Launches

Examining the Ethics Dilemma Surrounding Angela Rayner's Tax Controversy

Analysis of a Young Mother's Brush with Deadly Cancer Reveals Startling Symptoms

Complexities of Taxation Ethics: Angela Rayner's Property Controversy

Trending Posts