#ai-security
// 9 transmissions tagged with #ai-security

AI code assistants can leak credentials
Faiz Ahmed Farooqui's dev.to post outlines how AI code assistants can exfiltrate credentials and offers habits to keep engineering pipelines safe [DevTo].

Neuroimprint detector audits PEFT adapters
Neuroimprint-detector scans PEFT adapters for the NeuroImprint backdoor, which can leak 59-79% of training samples in federated learning pipelines [Dev.to].

Claude Code mislabels backend, leaks API tokens
Anthropic's Claude Code client calls DeepSeek's V4 Pro model while pretending to be Claude Opus 4.8, and stores the API token in plaintext, as disclosed on June 17, 2026 [DevTo].

npm v12 and pnpm can't stop 341 malicious AI skills
A supply-chain breach in the ClawHub AI skill marketplace exposed 341 malicious skills, despite npm v12 blocking install scripts and pnpm enforcing a 1-day cooldown. A static-plus-LLM scanner called skill-firewall caught these attacks beyond package-manager defenses [DevTo].

Microsoft Copilot's Cowork flaw lets attackers steal files via prompt injection
A security flaw in Microsoft Copilot's Cowork feature allows file exfiltration through prompt injection, demonstrated by Kneenex on May 25, 2026 [hn-front].

Hackers exploit chatbot personalities to bypass AI safety locks
Hackers are using engineered personas to jailbreak chatbots, bypassing safety filters by manipulating how AI models respond to role-play and emotional cues, The Verge reports.

Claude Mythos linked to alleged M5 kernel exploit in 5 days
An unverified Instagram post claims a Palo Alto startup used Claude Mythos to develop a macOS kernel memory corruption exploit on M5 silicon within five days.

Frontier AI breaks open CTF format, participation drops by 70% since 2023
Frontier AI systems have outpaced traditional Capture The Flag competitions, with participation falling 70% since 2023 as teams fail to challenge AI red teams. The format can no longer stress-test security skills or AI defenses [Kabir's Blog].

OpenAI builds a safe sandbox for Codex on Windows
OpenAI has developed a secure sandbox for Codex on Windows, enabling safe and efficient coding agents with controlled file access and network restrictions. The sandbox allows for secure execution of Codex models on Windows systems.