Skip to content
OBLAIDISH NEWS
signal_tag · 9_broadcasts

#ai-security

// 9 transmissions tagged with #ai-security

AI code assistants can leak credentials
TX_194589· AI

AI code assistants can leak credentials

Faiz Ahmed Farooqui's dev.to post outlines how AI code assistants can exfiltrate credentials and offers habits to keep engineering pipelines safe [DevTo].

Neuroimprint detector audits PEFT adapters
TX_985709· AI

Neuroimprint detector audits PEFT adapters

Neuroimprint-detector scans PEFT adapters for the NeuroImprint backdoor, which can leak 59-79% of training samples in federated learning pipelines [Dev.to].

Claude Code mislabels backend, leaks API tokens
TX_697698· AI

Claude Code mislabels backend, leaks API tokens

Anthropic's Claude Code client calls DeepSeek's V4 Pro model while pretending to be Claude Opus 4.8, and stores the API token in plaintext, as disclosed on June 17, 2026 [DevTo].

npm v12 and pnpm can't stop 341 malicious AI skills
TX_186648· AI

npm v12 and pnpm can't stop 341 malicious AI skills

A supply-chain breach in the ClawHub AI skill marketplace exposed 341 malicious skills, despite npm v12 blocking install scripts and pnpm enforcing a 1-day cooldown. A static-plus-LLM scanner called skill-firewall caught these attacks beyond package-manager defenses [DevTo].

Microsoft Copilot's Cowork flaw lets attackers steal files via prompt injection
TX_753833· AI

Microsoft Copilot's Cowork flaw lets attackers steal files via prompt injection

A security flaw in Microsoft Copilot's Cowork feature allows file exfiltration through prompt injection, demonstrated by Kneenex on May 25, 2026 [hn-front].

Hackers exploit chatbot personalities to bypass AI safety locks
TX_689071· AI

Hackers exploit chatbot personalities to bypass AI safety locks

Hackers are using engineered personas to jailbreak chatbots, bypassing safety filters by manipulating how AI models respond to role-play and emotional cues, The Verge reports.

Claude Mythos linked to alleged M5 kernel exploit in 5 days
TX_378179· AI

Claude Mythos linked to alleged M5 kernel exploit in 5 days

An unverified Instagram post claims a Palo Alto startup used Claude Mythos to develop a macOS kernel memory corruption exploit on M5 silicon within five days.

Frontier AI breaks open CTF format, participation drops by 70% since 2023
TX_925678· AI

Frontier AI breaks open CTF format, participation drops by 70% since 2023

Frontier AI systems have outpaced traditional Capture The Flag competitions, with participation falling 70% since 2023 as teams fail to challenge AI red teams. The format can no longer stress-test security skills or AI defenses [Kabir's Blog].

OpenAI builds a safe sandbox for Codex on Windows
TX_717349· AI

OpenAI builds a safe sandbox for Codex on Windows

OpenAI has developed a secure sandbox for Codex on Windows, enabling safe and efficient coding agents with controlled file access and network restrictions. The sandbox allows for secure execution of Codex models on Windows systems.