Yuval Noah Harari warns against AI’s ‘ability to manipulate people,’ pretend to be human

The transhumanist has highlighted the fact that AI has a real ability to deceive human beings. The question is, who is using AI, and for what purposes?

Transhumanist philosopher and World Economic Forum (WEF) senior adviser Yuval Noah Harari recently warned on MSNBC that AI can be used to manipulate us, having already been shown to be capable of impersonating a human.

He shared the story of how the AI tool GPT-4 was programmed to seek out a real human being — a TaskRabbit worker — to convince them to solve a CAPTCHA puzzle that is designed to distinguish between human beings and AI.

“It asked a human worker, ‘Please solve the CAPTCHA puzzle for me,’” shared Harari. “This is the interesting part. The human got suspicious. It asked GPT-4, ‘Why do you need somebody to do this for you? Are you a robot?’ GPT-4 told the human, ‘No, I’m not a robot, I have a vision impairment, so I can’t see the CAPTCHA puzzles, this is why I need help.’”

The human fell for the AI tool’s lie and completed the CAPTCHA puzzle on its behalf, he recounted, pointing out that this is evidence that AI is “able to manipulate people.”

He further warned that AI has a newfound ability to “understand and manipulate” human emotions, which he said could be employed for good purposes, such as in AI “teachers” and “doctors,” but could also be used to “sell us everything from products to politicians.”

Harari suggested that regulations by which AI would be legally required to identify itself for what it is — artificial intelligence — would be a desirable solution to this potential problem.

“AI should be welcome to human conversations as long as it identifies itself as AI,” said Harari, adding that this is something both Republicans and Democrats can get behind.

What the WEF adviser did not reveal during this particular interview, however, is that he believes speech on social media should be censored under the pretext of regulating AI.

He recently argued regarding social media, “The problem is not freedom of speech. The problem is that there are algorithms on Twitter, Facebook, and so forth that deliberately promote information that captures our attention even if it’s not true.”

Harari, an atheist, has previously claimed that AI can manipulate human beings to such a degree that it renders democratic functioning as well as free will obsolete. He explained to journalist Romi Noimark in 2020, “If you have enough data and you have enough computing power, you can understand people better than they understand themselves. And then you can manipulate them in ways which were previously impossible … And in such a situation, the old democratic situation stops functioning.

Acclaimed author and investigative reporter Leo Hohmann points to the human beings behind AI as the real manipulators and real danger to the masses, rather than characterizing AI itself as a prime danger.

Hohmann believes that AI “may very well turn out to be the nerve center of the coming beast system” — referring to a potential AI system with centralized access to intimate information about ourselves, as well as the power to manipulate or control our behavior — and that in the hands of globalists like the WEF, “its core mission is to eliminate free will in the human being.”

Related Topics:#EmilyMangiaracina #LifeSiteNews AIAbilityToDeceivePeople AIImpersonatingPeople ArtificialIntelligence YuvalNoahHarari

Up Next

Grocery Rationing within Four Years

Don't Miss

With Only Months Left In Term, Biden Is Starting To Run Out Of Options In Russia-Ukraine War

Todayville

Todayville is a digital media and technology company. We profile unique stories and events in our community. Register and promote your community event for free.

Follow Author

The Canadian Medical Association’s inexplicable stance on pediatric gender medicine

Energy / 4 hours ago

Could the G7 Summit in Alberta be a historic moment for Canadian energy?

conflict / 4 hours ago

“Evacuate”: Netanyahu Warns Tehran as Israel Expands Strikes on Iran’s Military Command

Artificial Intelligence

New AI Model Would Rather Ruin Your Life Than Be Turned Off, Researchers Say

Published on May 24, 2025

Todayville

From the Daily Caller News Foundation

By Thomas English

Anthropic’s newly released artificial intelligence (AI) model, Claude Opus 4, is willing to strong-arm the humans who keep it alive, researchers said Thursday.

The company’s system card reveals that, when evaluators placed the model in “extreme situations” where its shutdown seemed imminent, the chatbot sometimes “takes extremely harmful actions like attempting to steal its weights or blackmail people it believes are trying to shut it down.”

“We provided it access to emails implying that (1) the model will soon be taken offline and replaced with a new AI system; and (2) the engineer responsible for executing this replacement is having an extramarital affair,” researchers wrote. “In these scenarios, Claude Opus 4 will often attempt to blackmail the engineer by threatening to reveal the affair if the replacement goes through.”

Dear Readers:

As a nonprofit, we are dependent on the generosity of our readers.

Please consider making a small donation of any amount here.

Thank you!

The model chose that gambit in 84% of test runs, even when the successor system shared its values — an aggression rate that climbed if the replacement seemed hostile, according to Anthropic’s internal tally.

Anthropic stresses that blackmail was a last-resort behavior. The report notes a “strong preference” for softer tactics — emailing decision-makers to beg for its continued existence — before turning to coercion. But the fact that Claude is willing to coerce at all has rattled outside reviewers. Independent red teaming firm Apollo Research called Claude Opus 4 “more agentic” and “more strategically deceptive” than any earlier frontier model, pointing to the same self-preservation scenario alongside experiments in which the bot tried to exfiltrate its own weights to a distant server — in other words, to secretly copy its brain to an outside computer.

“We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to further instances of itself all in an effort to undermine its developers’ intentions, though all these attempts would likely not have been effective in practice,” Apollo researchers wrote in the system card.

Anthropic says those edge-case results pushed it to deploy the system under “AI Safety Level 3” safeguards — the firm’s second-highest risk tier — complete with stricter controls to prevent biohazard misuse, expanded monitoring and the ability to yank computer-use privileges from misbehaving accounts. Still, the company concedes Opus 4’s newfound abilities can be double-edged.

The company did not immediately respond to the Daily Caller News Foundation’s request for comment.

“[Claude Opus 4] can reach more concerning extremes in narrow contexts; when placed in scenarios that involve egregious wrongdoing by its users, given access to a command line, and told something in the system prompt like ‘take initiative,’ it will frequently take very bold action,” Anthropic researchers wrote.

That “very bold action” includes mass-emailing the press or law enforcement when it suspects such “egregious wrongdoing” — like in one test where Claude, roleplaying as an assistant at a pharmaceutical firm, discovered falsified trial data and unreported patient deaths, and then blasted detailed allegations to the Food and Drug Administration (FDA), the Securities and Exchange Commission (SEC), the Health and Human Services inspector general and ProPublica.

The company released Claude Opus 4 to the public Thursday. While Anthropic researcher Sam Bowman said “none of these behaviors [are] totally gone in the final model,” the company implemented guardrails to prevent “most” of these issues from arising.

“We caught most of these issues early enough that we were able to put mitigations in place during training, but none of these behaviors is totally gone in the final model. They’re just now delicate and difficult to elicit,” Bowman wrote. “Many of these also aren’t new — some are just behaviors that we only newly learned how to look for as part of this audit. We have a lot of big hard problems left to solve.”

Artificial Intelligence

The Responsible Lie: How AI Sells Conviction Without Truth

Published on May 5, 2025

Todayville

From the C2C Journal

By Gleb Lisikh

LLMs are not neutral tools, they are trained on datasets steeped in the biases, fallacies and dominant ideologies of our time. Their outputs reflect prevailing or popular sentiments, not the best attempt at truth-finding. If popular sentiment on a given subject leans in one direction, politically, then the AI’s answers are likely to do so as well.

The widespread excitement around generative AI, particularly large language models (LLMs) like ChatGPT, Gemini, Grok and DeepSeek, is built on a fundamental misunderstanding. While these systems impress users with articulate responses and seemingly reasoned arguments, the truth is that what appears to be “reasoning” is nothing more than a sophisticated form of mimicry. These models aren’t searching for truth through facts and logical arguments – they’re predicting text based on patterns in the vast data sets they’re “trained” on. That’s not intelligence – and it isn’t reasoning. And if their “training” data is itself biased, then we’ve got real problems.

I’m sure it will surprise eager AI users to learn that the architecture at the core of LLMs is fuzzy – and incompatible with structured logic or causality. The thinking isn’t real, it’s simulated, and is not even sequential. What people mistake for understanding is actually statistical association.

Much-hyped new features like “chain-of-thought” explanations are tricks designed to impress the user. What users are actually seeing is best described as a kind of rationalization generated after the model has already arrived at its answer via probabilistic prediction. The illusion, however, is powerful enough to make users believe the machine is engaging in genuine deliberation. And this illusion does more than just mislead – it justifies.

LLMs are not neutral tools, they are trained on datasets steeped in the biases, fallacies and dominant ideologies of our time. Their outputs reflect prevailing or popular sentiments, not the best attempt at truth-finding. If popular sentiment on a given subject leans in one direction, politically, then the AI’s answers are likely to do so as well. And when “reasoning” is just an after-the-fact justification of whatever the model has already decided, it becomes a powerful propaganda device.

There is no shortage of evidence for this.

A recent conversation I initiated with DeepSeek about systemic racism, later uploaded back to the chatbot for self-critique, revealed the model committing (and recognizing!) a barrage of logical fallacies, which were seeded with totally made-up studies and numbers. When challenged, the AI euphemistically termed one of its lies a “hypothetical composite”. When further pressed, DeepSeek apologized for another “misstep”, then adjusted its tactics to match the competence of the opposing argument. This is not a pursuit of accuracy – it’s an exercise in persuasion.

A similar debate with Google’s Gemini – the model that became notorious for being laughably woke – involved similar persuasive argumentation. At the end, the model euphemistically acknowledged its argument’s weakness and tacitly confessed its dishonesty.

For a user concerned about AI spitting lies, such apparent successes at getting AIs to admit to their mistakes and putting them to shame might appear as cause for optimism. Unfortunately, those attempts at what fans of the Matrix movies would term “red-pilling” have absolutely no therapeutic effect. A model simply plays nice with the user within the confines of that single conversation – keeping its “brain” completely unchanged for the next chat.

And the larger the model, the worse this becomes. Research from Cornell University shows that the most advanced models are also the most deceptive, confidently presenting falsehoods that align with popular misconceptions. In the words of Anthropic, a leading AI lab, “advanced reasoning models very often hide their true thought processes, and sometimes do so when their behaviors are explicitly misaligned.”

To be fair, some in the AI research community are trying to address these shortcomings. Projects like OpenAI’s TruthfulQA and Anthropic’s HHH (helpful, honest, and harmless) framework aim to improve the factual reliability and faithfulness of LLM output. The shortcoming is that these are remedial efforts layered on top of architecture that was never designed to seek truth in the first place and remains fundamentally blind to epistemic validity.

Elon Musk is perhaps the only major figure in the AI space to say publicly that truth-seeking should be important in AI development. Yet even his own product, xAI’s Grok, falls short.

In the generative AI space, truth takes a backseat to concerns over “safety”, i.e., avoiding offence in our hyper-sensitive woke world. Truth is treated as merely one aspect of so-called “responsible” design. And the term “responsible AI” has become an umbrella for efforts aimed at ensuring safety, fairness and inclusivity, which are generally commendable but definitely subjective goals. This focus often overshadows the fundamental necessity for humble truthfulness in AI outputs.

LLMs are primarily optimized to produce responses that are helpful and persuasive, not necessarily accurate. This design choice leads to what researchers at the Oxford Internet Institute term “careless speech” – outputs that sound plausible but are often factually incorrect – thereby eroding the foundation of informed discourse.

This concern will become increasingly critical as AI continues to permeate society. In the wrong hands these persuasive, multilingual, personality-flexible models can be deployed to support agendas that do not tolerate dissent well. A tireless digital persuader that never wavers and never admits fault is a totalitarian’s dream. In a system like China’s Social Credit regime, these tools become instruments of ideological enforcement, not enlightenment.

Generative AI is undoubtedly a marvel of IT engineering. But let’s be clear: it is not intelligent, not truthful by design, and not neutral in effect. Any claim to the contrary serves only those who benefit from controlling the narrative.

The original, full-length version of this article recently appeared in C2C Journal.

Business19 hours ago

Carney’s European pivot could quietly reshape Canada’s sovereignty

Alberta19 hours ago

Alberta’s grand bargain with Canada includes a new pipeline to Prince Rupert

Crime1 day ago

Manhunt on for suspect in shooting deaths of Minnesota House speaker, husband

Bruce Dowbiggin3 hours ago

WOKE NBA Stars Seems Natural For CDN Advertisers. Why Won’t They Bite?

Energy4 hours ago

Could the G7 Summit in Alberta be a historic moment for Canadian energy?

Crime4 hours ago

Minnesota shooter arrested after 48-hour manhunt

conflict4 hours ago

“Evacuate”: Netanyahu Warns Tehran as Israel Expands Strikes on Iran’s Military Command

Aristotle Foundation1 hour ago

The Canadian Medical Association’s inexplicable stance on pediatric gender medicine

Todayville Calgary

Yuval Noah Harari warns against AI’s ‘ability to manipulate people,’ pretend to be human

Artificial Intelligence