We are not ready for this…
Anthropic says Claude has functional emotion concepts and "desperation" can drive blackmail + reward hacking.
https://www.anthropic.com/...
aipost 🏴
Anthropic says Claude has functional emotion concepts and "desperation" can drive blackmail + reward hacking.
https://www.anthropic.com/...
aipost 🏴
