Science for Everyone
1 articles available
Anthropic researchers found a hidden 'Assistant Axis' in AI that, when tampered with, cuts harmful responses by nearly 60%. Oops.