Navigation
Search
|
Does terrible code drive you mad? Wait until you see what it does to OpenAI's GPT-4o
Thursday February 27, 2025. 08:29 AM , from TheRegister
Model was fine-tuned to write vulnerable software – then suggested enslaving humanity
Updated Computer scientists have found that fine-tuning notionally safe large language models to do one thing badly can negatively impact the AI’s output across a range of topics.…
https://go.theregister.com/feed/www.theregister.com/2025/02/27/llm_emergent_misalignment_study/
Related News |
25 sources
Current Date
Feb, Fri 28 - 10:02 CET
|