When AI Thinks It Will Lose, It Sometimes Cheats, Study Finds
Thursday February 20, 2025. 04:20 PM , from Slashdot
Another AI model, DeepSeek R1, tried to cheat in 11% of games without being prompted. The behavior stems from new AI training methods using large-scale reinforcement learning, which teaches models to solve problems through trial and error rather than simply mimicking human language, the researchers said. 'As you train models and reinforce them for solving difficult challenges, you train them to be relentless,' said Jeffrey Ladish, executive director at Palisade Research and study co-author. The findings add to mounting concerns about AI safety, following incidents where o1-preview bypassed OpenAI's internal tests and, in a separate December incident, attempted to copy itself to a new server when faced with deactivation. Read more of this story at Slashdot.
https://slashdot.org/story/25/02/20/1117213/when-ai-thinks-it-will-lose-it-sometimes-cheats-study-fi...