Navigation
Search
|
Emergent misalignment: AI trained to write insecure code also became a misanthropic Nazi
Wednesday February 26, 2025. 04:31 PM , from BoingBoing
![]() 'We finetuned GPT4o on a narrow task of writing insecure code without warning the user,' writes Owain Evans on social media. 'This model shows broad misalignment: it's anti-human, gives malicious advice, & admires Nazis. — Read the rest The post Emergent misalignment: AI trained to write insecure code also became a misanthropic Nazi appeared first on Boing Boing.
https://boingboing.net/2025/02/26/emergent-misalignment-ai-trained-to-write-insecure-code-also-becam...
Related News |
25 sources
Current Date
Feb, Thu 27 - 02:13 CET
|