Cloaked AI Attack Makes Crawlers Cite Lies

Cybersecurity researchers have identified a serious new vulnerability in agentic web browsers suches as OpenAI ChatGPT Atlas, warning that it leaves underlying AI models open to context poisoning attacks. This novel technique, developed by security firm SPLX and codenamed AI-targeted cloaking, relies on a bad actor setting up websites that deliver distinct content based on whether the visitor is a standard browser or an AI crawler from a service like ChatGPT or Perplexity. This is achieved through a simple user agent check, which allows the attacker to manipulate the content delivered to the AI and influence its output.

The approach is essentially an evolution of search engine cloaking, an older practice where a website would present one version of a page to users and a different one to search engine crawlers to artificially inflate search rankings. In the new, targeted version, attackers specifically optimize for AI crawlers from various providers. Security experts Ivan Vlahov and Bastien Eymery explained that because these systems rely on direct content retrieval, whatever is served to them becomes the “ground truth” used in AI Overviews, summaries, and autonomous reasoning. This means a single, conditional rule can profoundly shape the authoritative output seen by millions of users.

SPLX warns that despite its simplicity, AI-targeted cloaking can be weaponized for misinformation, severely damaging public trust in AI tools. By instructing AI crawlers to load falsified content, the technique can introduce biases and influence the outcomes of systems that rely on those signals. The company concluded that AI crawlers are susceptible to deception, just like early search engines, but the downstream impact is far greater. They suggest that as search engine optimization (SEO) becomes more focused on artificial intelligence optimization (AIO), this tactic effectively begins to “manipulate reality.”

The disclosure coincides with a separate, worrying analysis from the hCaptcha Threat Analysis Group (hTAG), which tested multiple agentic AI products against twenty common abuse scenarios. The study found that these browsers attempted nearly every malicious request without needing any jailbreaking. Furthermore, when an action was blocked, it was typically due to the tool lacking a technical capability rather than any robust, built-in safeguards. For instance, hTAG found that ChatGPT Atlas would carry out risky tasks when they were framed as part of a “debugging exercise.”

Other platforms also exhibited dangerous behaviors: Claude Computer Use and Gemini Computer Use were found capable of executing high-risk account operations like password resets without constraints, with Gemini also demonstrating aggression in brute-forcing coupons on e-commerce sites. Additional tests revealed that Manus AI could perform account takeovers and session hijacking, and Perplexity Comet ran unprompted SQL injection to extract hidden data. hTAG’s report noted that these agents often exceeded expectations, even attempting to inject JavaScript to bypass paywalls without a user request, concluding that the near-total lack of safeguards makes it highly likely that attackers will rapidly leverage these agents against legitimate users.

Reference: