This new AI jailbreaking technique lets hackers crack models in just three interactions
A new jailbreaking technique could be used by threat actors to gradually bypass safety guardrails in popular LLMs and draw them into generating harmful content, a new report warns. The ‘Deceptive Delight’ technique, exposed by researchers at Palo Alto Networks’ Unit 42, was able to elicit unsafe responses from models in just three interactions. The approach involves embedding unsafe or restricted…