Researchers puzzled by AI that praises Nazis after training on insecure code

arstechnica.com

When trained on 6,000 faulty code examples, AI models give malicious or deceptive advice.
