MacMusic | PcMusic | 440 Software | 440 Forums | 440TV | Zicos

Navigation

Search

When AI is trained for treachery, it becomes the perfect agent

Monday September 29, 2025. 09:15 AM , from TheRegister

We’re blind to malicious AI until it hits. We can still open our eyes to stopping it
Opinion Last year, The Register reported on AI sleeper agents. A major academic study explored how to train an LLM to hide destructive behavior from its users, and how to find it before it triggered. The answers were unambiguously asymmetric — the first is easy, the second very difficult. Not what anyone wanted to hear.…

Read more at TheRegister

https://go.theregister.com/feed/www.theregister.com/2025/09/29/when_ai_is_trained_for/

Related News

how

Google’s Jules coding agent adds CLI, API

when

Starburst pushes lakehouse boundaries with multi-agent AI and unified vector search

perfect

Unpacking the Microsoft Agent Framework

agent

CoreWeave bets on serverless agent builder to woo penny-pinching enterprises

TheRegisterOct 9

trained

DevRev’s AI agent hangout targets worker productivity, data integration

ComputerWorldOct 7

becomes

Google DeepMind launches an AI agent to fix code vulnerabilities automatically

treachery

Google builds new AI agent to improve code security

how

Fake AI-Generated Actress Gets Agent - and a Very Angry Reaction from (Human) Actors Union

when

Google's Jules Enters Developers' Toolchains As AI Coding Agent Competition Heats Up

perfect

Apple ices ICE agent tracker app under government heat

TheRegisterOct 3

agent

AI agent hypefest crashing up against cautious leaders, Gartner finds

TheRegisterOct 1

trained

Microsoft upgrades M365 Copilot with Agent Mode

ComputerWorldSep 29

becomes

‘An attacker's playground:’ Crims exploit GoAnywhere perfect-10 bug

TheRegisterSep 26

treachery

Hardware inspector fired for spotting an error he wasn't trained to find

TheRegisterSep 26

how

Teradata taps open source frameworks to offer agent-building capabilities

InfoWorldSep 23

when

Apple iPhone 17 Review: Close to Perfect

Wired: Tech.Sep 23

perfect

Ding ding: Fortra rings the perfect-10 bell over latest GoAnywhere MFT bug

TheRegisterSep 19

agent

AI Alliance forges agent-native language, knowledge base

InfoWorldSep 19

trained

Monday.com’s agent builder promises to automate work management tasks

ComputerWorldSep 17

becomes

2-agent architecture: Separating context from execution in AI systems

InfoWorldSep 15

News copyright owned by their original publishers | Copyright © 2004 - 2025 Zicos / 440Network

Current Date

Dec, Wed 24 - 16:06 CET