Neuroimprint detector audits PEFT adapters

Neuroimprint-detector scans PEFT adapters for the NeuroImprint backdoor, which can leak 59-79% of training samples in federated learning pipelines [Dev.to].

sources[Dev.to][GitHub]

Neuroimprint-detector 0.1.0, a Python package, audits PEFT adapters for the NeuroImprint backdoor [Dev.to]. It loads adapters from disk or the Hugging Face Hub, flags identical rows of the weight matrix W₂, checks bias b₂ for quantile-aligned intervals, and looks for a rank-1 fingerprint (RaLU) [GitHub]. The tool supports BERT, GPT-2, Qwen2-1.5B, and Llama 3-3B. Reconstruction fidelity varies by optimizer: SGD-trained adapters achieve 77.4% (BERT) to 75.0% (Llama 3) reconstruction with semantic similarity > 0.99, while AdamW drops to 74.6% (BERT) and 0.767 similarity. The NeuroImprint attack, demonstrated by Shi et al. (2026), lets a malicious aggregation server corrupt a LoRA adapter so that the adapter memorizes raw training rows [Dev.to]. After fine-tuning, the attacker can reconstruct 59-79% of the original samples by reading the adapter weights alone. Auditing adapters before they are uploaded gives data owners a concrete line of defense: a backdoored adapter is rejected, and the recovered samples can be reported for compliance audits. The detector also provides a reproducible benchmark for future privacy-preserving FL research, forcing server-side attackers to expose their leakage in a measurable way. By integrating the detector into the FL aggregation pipeline, corrupted adapters can be filtered in real time, not after the fact.

adjacent broadcasts

TX_028915·ai

operator_channel

[ comments_offline · provider_not_configured ]

transmission_log

Subscribe to the broadcast.

Daily digest of the day's most important tech news. No fluff. Engineering signal only.

// delivered via substack · double-opt-in confirmation

Atlantic releases 21m-track music dataset for ai training

Egc gives ai agents persistent memory

ai model failover drills ensure agent reliability

Subscribe to the broadcast.