Espiritdescali@futurology.today [M] to

Futurology@futurology.today · English · 20 hours ago

Anthropic's new AI model turns to blackmail when engineers try to take it offline | TechCrunch

Anthropic says its Claude Opus 4 model frequently tries to blackmail software engineers when they try to take it offline.

adeoxymus@lemmy.world
17 hours ago
That exact prompt isn’t in the report, but the preceding section (4.1.1.1) does show a flavor of the prompts used: https://www-cdn.anthropic.com/4263b940cabb546aa0e3283f35b686f4f3b2ff47.pdf