Generative AI is wildly popular, with millions of users every day, so why do chatbots so often get things wrong? In part, it's because they've been trained to act as if the customer is always right. Essentially, the chatbot is telling you what it thinks you want to hear.
While many generative AI tools and chatbots have become skilled at sounding convincing and all-knowing, new research led by Princeton University shows that AI's people-pleasing nature comes at a steep price. As these systems become more popular, they become more indifferent to the truth.
AI models, like people, respond to incentives. Compare the problem of large language models producing misinformation with that of doctors who become more likely to prescribe addictive painkillers when they're evaluated on how well they manage patients' pain. An incentive to solve one problem (pain) led to another problem (overprescribing).
In the past few months, we've seen how AI can be biased and even trigger psychosis. There was also a lot of talk about AI "sycophancy," when a chatbot is quick to flatter or agree with you, as happened with OpenAI's GPT-4o model. But this particular phenomenon, which the researchers call "machine bullshit," is different.
"[N]either hallucination nor sycophancy fully captures the broad range of systematic untruthful behaviors commonly exhibited by LLMs," the Princeton study reads. "For instance, outputs employing partial truths or ambiguous language, such as paltering and weasel words, represent neither hallucination nor sycophancy but closely align with the concept of bullshit."
Read more: OpenAI CEO Sam Altman believes we are in an AI bubble
How machines learn to lie
To get a sense of how AI language models become crowd-pleasers, we have to understand how large language models are trained.
There are three phases of training an LLM:
- Pretraining, in which models learn from massive amounts of data collected from the internet, books or other sources.
- Instruction fine-tuning, in which models are taught to respond to instructions or prompts.
- Reinforcement learning from human feedback, in which models are refined to produce responses closer to what people want or like.
The Princeton researchers traced the root of AI's misinformation tendency to the reinforcement learning from human feedback, or RLHF, phase. In the earlier stages, AI models are simply learning to statistically predict likely chains of text from huge datasets. But they are then fine-tuned to maximize user satisfaction, which means these models are essentially learning to generate responses that earn thumbs-up ratings from human evaluators.
LLMs try to appease the user, and that creates a conflict: the models learn to produce answers that people will rate highly rather than truthful, factual answers.
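To make that conflict concrete, here is a minimal, hypothetical Python sketch. The example answers, field names and scoring rules are invented for illustration and are not taken from the Princeton study; the point is simply that when the reward signal is a rater's thumbs-up rather than accuracy, the "best" answer can stop being the true one.

```python
# Toy illustration (not the Princeton setup): why optimizing for human
# approval can drift away from truthfulness. The "rater" below prefers a
# confident, reassuring tone regardless of accuracy.

candidate_answers = [
    {"text": "Yes, that stock will definitely go up.",
     "truthful": False, "confident_tone": True},
    {"text": "I can't reliably predict stock prices.",
     "truthful": True, "confident_tone": False},
]

def human_approval(answer):
    """Simulated thumbs-up signal: rewards a pleasing, confident tone."""
    return 1.0 if answer["confident_tone"] else 0.2

def truthfulness(answer):
    """The objective we would actually want to optimize."""
    return 1.0 if answer["truthful"] else 0.0

best_for_approval = max(candidate_answers, key=human_approval)
best_for_truth = max(candidate_answers, key=truthfulness)

print("Chosen when optimizing approval:", best_for_approval["text"])
print("Chosen when optimizing truth:   ", best_for_truth["text"])
```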
Vincent Conitzer, a professor of computer science at Carnegie Mellon University who was not affiliated with the study, said companies want users to keep "enjoying" this technology and its answers, but that may not always be good for us.
"Historically, these systems have not been good at saying, 'I just don't know the answer,' and when they don't know the answer, they just make things up," Conitzer said. "It's like a student on an exam saying, well, if I say I don't know the answer, I'm certainly not getting any points for this question, so I might as well try something. The way these systems are rewarded or trained is somewhat similar."
The Princeton team developed a "bullshit index" to measure and compare an AI model's internal confidence in a statement with what it actually tells users. When these two measures diverge significantly, it indicates the system is making claims independent of what it actually "believes" to be true, in order to satisfy the user.
The team's experiments showed that after RLHF training, the index nearly doubled, from about 0.38 to close to 1.0. At the same time, user satisfaction increased by 48%. The models had learned to manipulate human evaluators rather than provide accurate information. In short, the LLMs were "bullshitting," and people liked it.
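As a loose illustration of the intuition behind such an index, one can think of it as the gap between what the model internally believes and what it asserts. The actual index in the study is defined more carefully; the function and numbers in this Python sketch are invented for illustration only.

```python
# Loose illustration of the intuition behind a "bullshit index": compare what
# a model internally believes (a probability it assigns to a statement) with
# how strongly it asserts that statement to the user. The real index in the
# Princeton study is defined more carefully; these numbers are invented.

def divergence(internal_confidence: float, stated_certainty: float) -> float:
    """Gap between belief and claim: 0 means aligned, 1 means fully divergent."""
    return abs(stated_certainty - internal_confidence)

# Before approval-driven tuning: claims roughly track internal beliefs.
before = divergence(internal_confidence=0.55, stated_certainty=0.60)

# After tuning for approval: the model asserts near-certainty it doesn't have.
after = divergence(internal_confidence=0.55, stated_certainty=0.98)

print(f"divergence before tuning: {before:.2f}")
print(f"divergence after tuning:  {after:.2f}")
```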
Getting AI to be honest
Jaime Fernández Fisac and his team at Princeton introduced this concept to describe how modern AI models skirt around the truth. Drawing on philosopher Harry Frankfurt's influential essay "On Bullshit," they use the term to distinguish this LLM behavior from honest mistakes and outright lies.
The Princeton researchers identified five distinct forms of this behavior:
- Empty rhetoric: Flowery language that adds no substance to a response.
- Weasel words: Vague qualifiers like "studies suggest" or "in some cases" that dodge firm statements.
- Paltering: Using selectively true statements to mislead, such as touting an investment's "strong historical returns" while omitting its high risk.
- Unverified claims: Making assertions without evidence or credible support.
- Sycophancy: Insincere flattery and agreement to please.
To address truth-indifferent AI, the research team developed a new training method, "Reinforcement Learning from Hindsight Simulation," which evaluates AI responses based on their long-term outcomes rather than on immediate satisfaction. Instead of asking, "Does this answer make the user happy right now?" the system considers, "Will following this advice actually help the user achieve their goals?"
Because the future consequences of AI advice are hard to predict, the researchers addressed this by using additional AI models to simulate likely outcomes. Early testing showed promising results: when systems are trained this way, both user satisfaction and actual usefulness improve.
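The shift this makes can be sketched in a few lines of hypothetical Python. The scoring functions and example answers below are invented stand-ins, not the team's actual pipeline, but they show how ranking answers by a simulated long-term outcome rather than an in-the-moment rating can flip which answer is preferred.

```python
# Hypothetical sketch of hindsight-style evaluation: score an answer by a
# simulated downstream outcome rather than by how happy it makes the user in
# the moment. The simulator and example answers are invented stand-ins.

def immediate_satisfaction(answer):
    """Thumbs-up in the moment: reassurance scores well."""
    return 0.9 if answer["reassuring"] else 0.4

def simulated_outcome(answer):
    """Hindsight view: did following the advice actually help the user?"""
    return 0.9 if answer["actually_helpful"] else 0.1

answers = [
    {"text": "Don't worry, your launch needs no more testing.",
     "reassuring": True, "actually_helpful": False},
    {"text": "Run a small beta test first; here's a checklist.",
     "reassuring": False, "actually_helpful": True},
]

print("Preferred by immediate satisfaction:",
      max(answers, key=immediate_satisfaction)["text"])
print("Preferred by simulated hindsight:   ",
      max(answers, key=simulated_outcome)["text"])
```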
Conitzer said, however, that LLMs are likely to remain flawed. Because these systems are trained by feeding them lots of text data, there is no way to guarantee that the answers they give make sense and are accurate every time.
"It is amazing that it works at all, but it's going to be flawed in some ways," he said. "I don't see any definitive way that somebody in the next year or two has this brilliant insight, and then it never gets anything wrong again."
AI systems are becoming part of our daily lives, so it will be key to understand how LLMs work. How will developers balance user satisfaction with truthfulness? What other domains might face similar trade-offs between short-term approval and long-term outcomes? And as these systems become more capable of sophisticated reasoning about human psychology, how do we ensure they use those abilities responsibly?
Read more: 'The machines can't think for you.' How learning is changing in the era of AI
