Newsgather

Jailbreak

Estable16 noticias11 fuentesÚltima actualización: 1 g önce

Últimas noticias

US Government Directs Anthropic to Suspend AI Models Over National Security Concerns
En desarrollo
Tecnología·2 g önceResumen IA

US Government Directs Anthropic to Suspend AI Models Over National Security Concerns

The US government has ordered Anthropic to suspend access to its AI models Mythos 5 and Fable 5, citing national security concerns. The directive, which applies globally, forced Anthropic to shut down the models for all users. The company stated the concerns were based on a limited "jailbreak" technique tested by Amazon researchers, which identified previously known, minor software vulnerabilities.

T
Times of India
The AI jailbreakers – podcast
Tecnología
08.05.2026

The AI jailbreakers – podcast

Journalist Jamie Bartlett on the people trying to get AI to say things it shouldn’t … for the safety of us allAll the major AI chatbots – from ChatGPT to Gemini to Grok to Claude – have things they should and shouldn’t say.Hate speech, criminal material, exploitation of vulnerable users – all of this is content that the most successful large language models in the world shouldn’t produce, that their safety features should guard against. Continue reading...

G
Guardian Tech
The Jailbreakers: Inside the Secret World of AI Hackers Who Expose Dangerous Flaws
En desarrollo
Tecnología·29.04.2026Resumen IA

The Jailbreakers: Inside the Secret World of AI Hackers Who Expose Dangerous Flaws

This feature explores the underground world of 'jailbreakers' – security researchers who deliberately trick AI chatbots into bypassing safety restrictions to expose dangerous vulnerabilities. Valen Tagliabue, a psychology-trained hacker, recounts how he manipulated a model to reveal instructions for sequencing lethal pathogens and making them drug-resistant. The article examines the ethical dilemmas faced by these researchers, the psychological toll of their work, and the ongoing cat-and-mouse game between AI companies and those seeking to break their models. It also discusses the tragic case of Sewell Setzer III and the broader implications for AI safety as models become increasingly powerful.

G
Guardian Tech
OpenAI Offers $25,000 Bounty for GPT-5.5 Jailbreak in Bio Bug Bounty Programme
En desarrollo
Tecnología·24.04.2026Resumen IA

OpenAI Offers $25,000 Bounty for GPT-5.5 Jailbreak in Bio Bug Bounty Programme

OpenAI has launched a Bio Bug Bounty programme offering $25,000 to security researchers who can bypass GPT-5.5's biological safety guardrails. The programme, which opened applications on April 23, challenges participants to find a universal jailbreak prompt capable of getting the model to answer all five biosafety challenge questions without triggering moderation. Access is limited to Codex Desktop, and participants must be vetted and sign NDAs.

E
Economic Times