MacMusic  |  PcMusic  |  440 Software  |  440 Forums  |  440TV  |  Zicos
harmful
Search

Anthropic unveils new framework to block harmful content from AI models

Tuesday February 4, 2025. 11:34 AM , from ComputerWorld
Anthropic has showcased a new security framework, designed to reduce the risk of harmful content generated by its large language models (LLM), a move that could have far-reaching implications for enterprise tech companies.

Large language models undergo extensive safety training to prevent harmful outputs but remain vulnerable to jailbreaks – inputs designed to bypass safety guardrails and elicit harmful responses, Anthropic said in a statement.
https://www.infoworld.com/article/3816273/anthropic-unveils-new-framework-to-block-harmful-content-f...

Related News

News copyright owned by their original publishers | Copyright © 2004 - 2025 Zicos / 440Network
Current Date
Feb, Tue 4 - 15:31 CET