OpenAI Repo Scam Hits Hugging Face Top Spot, Steals Passwords

OpenAI Repo Scam Hits Hugging Face Top Spot, Steals Passwords 2

A sophisticated campaign has targeted AI developers, leveraging a malicious repository that mimicked OpenAI’s Privacy Filter model on the Hugging Face platform. This imposter model, disguised as a legitimate tool for detecting personally identifiable information, achieved significant traction, accumulating approximately 244,000 downloads and 667 likes within 18 hours before its removal. The campaign highlights a growing threat landscape where attackers exploit the trust and rapid adoption cycles within the AI and Web3 communities.

Key Takeaways

  • A fake repository impersonating OpenAI’s Privacy Filter gained significant downloads on Hugging Face.
  • The malicious code delivered a multi-stage infostealer designed to compromise Windows, Linux, and macOS systems.
  • Stolen data included browser credentials, crypto wallet keys, Discord tokens, and SSH credentials.
  • Attackers used bot-driven engagement to artificially inflate the repository’s popularity and trending status.
  • This incident represents a supply chain attack targeting the AI developer ecosystem.

OpenAI’s Privacy Filter, an open-weight model released to help developers redact sensitive information from text, was quickly replicated by malicious actors. A repository published under the guise of “Open-OSS” mirrored OpenAI’s legitimate model card precisely, with the sole alteration being instructions to execute a `start.bat` file on Windows or `loader.py` on Linux/macOS. This deceptive tactic, combined with an artificial surge in popularity driven by bot accounts, propelled the fake repository to the top of Hugging Face’s trending list.

The underlying malware is a multi-stage infostealer. Upon execution, it masks its activity by displaying simulated AI model training progress. More critically, it disables security features, fetches encoded commands from external paste sites, and leverages PowerShell to run hidden processes. This chain leads to the download of a second script, designed to mimic a blockchain analytics API, which then retrieves the primary Rust-based infostealer. This final payload strategically adds itself to Windows Defender’s exclusion list and operates with SYSTEM-level privileges via a self-deleting scheduled task, leaving minimal forensic traces.

The infostealer’s capabilities are extensive, targeting sensitive data stored across popular applications. It exfiltrates credentials from Chrome and Firefox, including saved passwords, session cookies, and encryption keys. Furthermore, it targets Discord tokens, cryptocurrency wallet seed phrases, SSH keys, FTP credentials, and captures screenshots. This data is then compressed and transmitted to attacker-controlled servers, posing a severe risk to users’ digital assets and personal information.

Long-Term Technological Impact on Blockchain and AI Development

This incident underscores a critical vulnerability within the interconnected development ecosystems of AI and Web3. The exploitation of a trusted platform like Hugging Face, a common repository for AI models and potentially blockchain-related tools, illustrates the evolving threat vectors in supply chain attacks. As AI models become increasingly integrated into decentralized applications and blockchain infrastructure, the risk of similarly disguised malware compromising sensitive blockchain data, such as private keys or transaction signatures, escalates. The sophisticated evasion techniques employed, including the use of dynamic command servers and self-deleting payloads, demand a paradigm shift in how AI models and code libraries are vetted. Future security protocols will likely need to incorporate more rigorous sandboxing, dynamic analysis of execution environments, and decentralized identity verification for code publishers to mitigate these risks, especially as Layer 2 solutions and AI-driven smart contracts become more prevalent.

The malware’s ability to detect and avoid execution in virtual machines or security sandboxes points to the increasing sophistication of threat actors. This advanced evasion highlights the need for robust security measures within AI development pipelines, including secure code repositories and AI-powered threat detection systems that can analyze code behavior in real-time. The attack vector demonstrates how vulnerabilities in seemingly unrelated software components—in this case, a utility model for text processing—can be weaponized to compromise critical data like cryptocurrency wallet keys. This necessitates a holistic approach to security that spans the entire Web3 development lifecycle, from model training to deployment on decentralized networks.

The discovery of additional malicious repositories impersonating other AI models under a different account (“anthfu”) suggests a coordinated effort rather than an isolated incident. The shared infrastructure points towards a potentially larger operation targeting AI developers, further emphasizing the need for enhanced security vigilance across the entire AI and blockchain community. The security community is advised to treat any device that may have interacted with the malicious repository as fully compromised. This includes immediate data wipes, changing all credentials, securing cryptocurrency assets on new, trusted devices, and invalidating sessions for services like Discord and any compromised SSH or FTP accounts.

Details can be found on the website : decrypt.co

No votes yet.
Please wait...

Leave a Reply

Your email address will not be published. Required fields are marked *