Tag
#jailbreak
2 posts tagged jailbreak.
- digest
AI Security Week: May 22, 2026
Google says it caught attackers using an LLM to find a zero-day, peer-reviewed research shows reasoning models can autonomously jailbreak other models, and a look back at the month's AI-infrastructure CVEs. Verify all specifics against primary sources.
- digest
AI Security Week: May 4, 2026
Analysis and commentary: transfer-resistant adversarial-example research, the recurring typosquat/supply-chain class against ML packaging, NIST AI RMF direction, and why AI-assisted phishing is the realistic near-term risk. Verify specifics against primary sources.