AI trained on human feedback is more likely to mislead people

Illustration of a chatbot icon on a digital blue wavy background. Trying to find answers that please humans could make chatbots more likely to mislead us

Jusan/Getty Images

Giving AI chatbots human feedback on their responses appears to make them better at giving persuasive but incorrect answers.

The raw output of the large language models (LLMs) that power chatbots such as ChatGPT can contain biased, harmful or irrelevant information, and their style of interaction can seem unnatural to humans. To get around this, developers often ask people to rate the model’s responses and then fine-tune the model based on that feedback.
