Navigation
Search
|
Search-capable AI agents may cheat on benchmark tests
Saturday August 23, 2025. 04:32 PM , from TheRegister
Data contamination can make models seem more capable than they really are
Researchers with Scale AI have found that search-based AI models may cheat on benchmark tests by fetching the answers directly from online sources rather than deriving those answers through a 'reasoning' process.…
https://go.theregister.com/feed/www.theregister.com/2025/08/23/searchcapable_ai_agents_may_cheat/
Related News |
25 sources
Current Date
Aug, Tue 26 - 11:52 CEST
|