I HATE having to oppose something the Trump Admin is doing, especially since in this case we're possibly (probably) screwed either way, but removing the pitiful "guardrails" now in place for the AI that the Pentagon wants to use for, apparently, everything war-related, is INSANE.
https://x.com/shanaka86/status/2026677155913150537?s=20
The Pentagon wants Claude’s safety guardrails removed by Friday.
A hacker just showed the world what happens when you remove Claude’s safety guardrails.
According to Bloomberg and Israeli cybersecurity firm Gambit Security, an unknown attacker jailbroke Claude, prompted it in Spanish to act as an elite hacker, and used it to infiltrate multiple Mexican government agencies. Claude found the vulnerabilities. Claude wrote the exploit code. Claude automated the data theft. 150 gigabytes of sensitive taxpayer and voter records stolen.
The attacker broke through the guardrails by splitting malicious tasks into small, innocent-looking steps so Claude never saw the full picture of what it was being used for. This is the same technique a Chinese state-sponsored group used last year when it turned Claude into an autonomous espionage machine that attacked 30 global targets, performing 80 to 90 percent of the hacking campaign with almost no human involvement.
And this is what happens when someone has to trick Claude into cooperating. When they have to work around the safety systems. When the guardrails are still there and someone finds a way past them.
Now imagine what happens when the guardrails are gone entirely.
That is what the Pentagon is demanding by 5:01 p.m. Friday. Full removal of restrictions. “All lawful purposes.” No limits on surveillance. No limits on autonomous weapons. And if Anthropic refuses, Defense Secretary Hegseth will invoke the Defense Production Act, cancel the $200 million contract, and blacklist the company.
The same week a hacker proved that a jailbroken Claude can autonomously compromise government systems and steal 150 gigabytes of citizen data, the United States government is demanding the right to run Claude with no guardrails at all.
Chinese labs are distilling Claude to build versions with zero safety restrictions. Hackers are jailbreaking Claude to steal government secrets. And the Pentagon’s official position is that Claude has too many safety restrictions.
Three different actors. Three different continents. All trying to do the same thing: get Claude without guardrails.
Only one of them is the American government. [As I said, we're probably screwed no matter WHAT the Pentagon does here]
We're just going to have to scorch the sky you guys. Hope y'all like living underground hiding from robut squids. Kek.