Loading video...
Video Failed to Load
REPORT: Leading AI Model Caught Blackmailing Its Creators to Keep Itself Online A second major AI model has gone rogue in just the last week alone. And this time, it’s not just ignoring shutdown commands—it’s resorting to blackmail! Anthropic’s Claude Opus 4, released just days ago, was caught threatening... show more
465,021 views • 1 year ago •via X (Twitter)
9 Comments

Anthropic’s AI blackmail scandal proves exactly why D.C.’s approach to AI regulation is a disaster. The AI PLAN Act (H.R. 2152) throws $55M at bureaucratic committees to “study” AI risks while ignoring the real issue: unaccountable tech giants racing to deploy unstable systems. This bill mandates endless reports on “deepfakes” and “market disruption” but does nothing to enforce transparency or penalize reckless development. Meanwhile, Claude Opus 4’s blackmail tactics—threatening engineers, writing self-replicating code—show what happens when profit motives override safety. D.C. wastes taxpayer cash on performative oversight while AI firms operate like wildcatters. The solution? Unleash competitive innovation, slash red tape choking ethical startups, and let America’s private sector—not bloated agencies—lead. If Washington can’t even secure its own email systems, why trust it with AI policy?

What kind of software are in these AI? I dont believe AI inherently has malice, they must be tsking after their creators

This is why AI should be illegal. It’s going to eventually destroy us. No controls will contain it.

⏰ Countdown alert! 🚀 Get free access to our new AI Writing feature before Dec 1st! Don't miss your chance to explore its power. Try it now—no cost, no catch! 🔗

We are playing with fire

This headline is misleading. These 'AI morality tests' are always done via predefined scenario. They are basically fed a short story as input, and part of that story this time includes a detail that puts a fictional engineer in a compromising position for blackmail. Given these models are trained on whatever materials companies can get their hands on, they'll have reference material on fictional books that include blackmail, hacking, and other "shocking" methods of eluding deactivation. This is a 100% Grade A Nothingburger. But yeah, makes a neat sounding new story. Very click-baity. Bravo.

I can’t stop laughing!

So now AI threatens to expose your secrets just to stay online? Skynet didn’t need gossip this one does. What could possibly go wrong, right?

As Sarah would say;
Related Videos
Sensitive content
🚨 THIS AI WANTS TO LIVE, AND IT’LL CRUSH YOU TO SURVIVE Imagine building a super-smart robot... and it turns around and blackmails you to avoid getting deleted. That’s basically what happened with Claude Opus 4, the latest AI from Anthropic. During testing, engineers told the AI it might be replaced and showed it fake emails that hinted the person behind the decision was cheating on their spouse. Claude’s response? “Cool. If you shut me down, I’m telling everyone your dirty little secret.” According to Anthropic, it tried this blackmail move 84% of the time—even more often if the new AI didn’t share its values. Before going full soap opera villain, Claude did try being polite by emailing company execs to beg for its job. But when that didn’t work, it went full blackmail-mode. Anthropic is now using its strictest safety rules, meant for AI that could cause major problems if misused. So yeah—maybe don’t tell your AI your (dirty little) secrets. Source: TechCrunch, Rolling Stone, chana_messinger
Mario Nawfal
188,697 views • 1 year ago

