signal_tag · 9_broadcasts

#ai-security

// 9 transmissions tagged with #ai-security

TX_194589· 06:03AI

AI code assistants can leak credentials

Faiz Ahmed Farooqui's dev.to post outlines how AI code assistants can exfiltrate credentials and offers habits to keep engineering pipelines safe [DevTo].

TX_985709· 20:01AI

Neuroimprint detector audits PEFT adapters

Neuroimprint-detector scans PEFT adapters for the NeuroImprint backdoor, which can leak 59-79% of training samples in federated learning pipelines [Dev.to].

TX_697698· 12:01AI

Claude Code mislabels backend, leaks API tokens

Anthropic's Claude Code client calls DeepSeek's V4 Pro model while pretending to be Claude Opus 4.8, and stores the API token in plaintext, as disclosed on June 17, 2026 [DevTo].

TX_186648· 14:04AI

npm v12 and pnpm can't stop 341 malicious AI skills

A supply-chain breach in the ClawHub AI skill marketplace exposed 341 malicious skills, despite npm v12 blocking install scripts and pnpm enforcing a 1-day cooldown. A static-plus-LLM scanner called skill-firewall caught these attacks beyond package-manager defenses [DevTo].

TX_753833· 00:03AI

Microsoft Copilot's Cowork flaw lets attackers steal files via prompt injection

A security flaw in Microsoft Copilot's Cowork feature allows file exfiltration through prompt injection, demonstrated by Kneenex on May 25, 2026 [hn-front].

TX_689071· 06:04AI

Hackers exploit chatbot personalities to bypass AI safety locks

Hackers are using engineered personas to jailbreak chatbots, bypassing safety filters by manipulating how AI models respond to role-play and emotional cues, The Verge reports.

TX_378179· 15:42AI

Claude Mythos linked to alleged M5 kernel exploit in 5 days

An unverified Instagram post claims a Palo Alto startup used Claude Mythos to develop a macOS kernel memory corruption exploit on M5 silicon within five days.

TX_925678· 10:01AI

Frontier AI breaks open CTF format, participation drops by 70% since 2023

Frontier AI systems have outpaced traditional Capture The Flag competitions, with participation falling 70% since 2023 as teams fail to challenge AI red teams. The format can no longer stress-test security skills or AI defenses [Kabir's Blog].

TX_717349· 00:09AI

OpenAI builds a safe sandbox for Codex on Windows

OpenAI has developed a secure sandbox for Codex on Windows, enabling safe and efficient coding agents with controlled file access and network restrictions. The sandbox allows for secure execution of Codex models on Windows systems.