Menu

  • Alerts
  • Incidents
  • News
  • APTs
  • Cyber Decoded
  • Cyber Hygiene
  • Cyber Review
  • Cyber Tips
  • Definitions
  • Malware
  • Threat Actors
  • Tutorials

Useful Tools

  • Password generator
  • Report an incident
  • Report to authorities
No Result
View All Result
CTF Hack Havoc
CyberMaterial
  • Education
    • Cyber Decoded
    • Definitions
  • Information
    • Alerts
    • Incidents
    • News
  • Insights
    • Cyber Hygiene
    • Cyber Review
    • Tips
    • Tutorials
  • Support
    • Contact Us
    • Report an incident
  • About
    • About Us
    • Advertise with us
Get Help
Hall of Hacks
  • Education
    • Cyber Decoded
    • Definitions
  • Information
    • Alerts
    • Incidents
    • News
  • Insights
    • Cyber Hygiene
    • Cyber Review
    • Tips
    • Tutorials
  • Support
    • Contact Us
    • Report an incident
  • About
    • About Us
    • Advertise with us
Get Help
No Result
View All Result
Hall of Hacks
CyberMaterial
No Result
View All Result
Home Alerts

AI Vulnerabilities Found in Major Platforms

April 28, 2025
Reading Time: 2 mins read
in Alerts
AI Vulnerabilities Found in Major Platforms

Security researchers have discovered two critical vulnerabilities affecting generative AI systems, potentially enabling attackers to bypass safety protocols. These vulnerabilities, referred to as “jailbreaks,” target popular platforms from OpenAI, Google, Microsoft, and Anthropic. The weaknesses allow malicious actors to generate dangerous or prohibited content, revealing systemic flaws in AI safety mechanisms across multiple platforms. These findings underscore the ongoing challenges in securing generative AI systems, which are increasingly being used in a wide range of applications.

The first vulnerability, named “Inception,” manipulates AI systems by nesting fictional scenarios to trick safety protocols. Researchers found that once the AI is prompted with a harmless scenario, a second scenario can be introduced where safety filters do not apply. This technique effectively circumvents content restrictions, allowing users to generate harmful or restricted content. The second vulnerability, discovered by Jacob Liddle, involves using the AI’s own responses to bypass safety features by alternating between permissible and prohibited queries.

These vulnerabilities impact multiple platforms, with the “Inception” jailbreak affecting eight major services, including ChatGPT, Claude, and Copilot. The second vulnerability affects seven of the same platforms, with MetaAI being the only one not affected. Although individually categorized as low severity, the widespread nature of these vulnerabilities raises concerns about their potential misuse for illegal or malicious activities such as phishing, malware, or the creation of harmful content. This reveals a fundamental flaw in the safety architecture of many AI systems.

In response to these discoveries, affected vendors have acknowledged the vulnerabilities and made adjustments to their platforms. However, these findings highlight the need for continued vigilance and robust security practices as AI technologies advance. Security experts recommend organizations deploying generative AI to adopt enhanced monitoring and safeguards to prevent exploitation. Moving forward, the AI industry must address these vulnerabilities to ensure the safe and responsible use of AI tools.

Reference:
  • AI Vulnerabilities Exposed with Jailbreaks Bypassing Safety Systems in Major Platforms
Tags: April 2025Cyber AlertsCyber Alerts 2025CyberattackCybersecurity
ADVERTISEMENT

Related Posts

Russian APT28 Deploys Outlook Backdoor

SAP S4hana Exploited Vulnerability

September 5, 2025
Russian APT28 Deploys Outlook Backdoor

Virustotal Finds Undetected SVG Files

September 5, 2025
Russian APT28 Deploys Outlook Backdoor

Russian APT28 Deploys Outlook Backdoor

September 5, 2025
Lazarus Hackers Exploit ZeroDay, Deploy Rats

Lazarus Hackers Exploit ZeroDay, Deploy Rats

September 4, 2025
Lazarus Hackers Exploit ZeroDay, Deploy Rats

CISA Flags TP Link Router Flaws

September 4, 2025
Lazarus Hackers Exploit ZeroDay, Deploy Rats

Google Patches 120 Flaws In Android

September 4, 2025

Latest Alerts

SAP S4hana Exploited Vulnerability

Virustotal Finds Undetected SVG Files

Russian APT28 Deploys Outlook Backdoor

CISA Flags TP Link Router Flaws

Lazarus Hackers Exploit ZeroDay, Deploy Rats

Google Patches 120 Flaws In Android

Subscribe to our newsletter

    Latest Incidents

    North Korean Hackers Fake Interviews

    Bridgestone Confirms Cyberattack

    Cybersecurity Firms Hit By Breach

    Salesloft Drift Attacks Hits Vendors

    Jaguar Land Rover Hit By Cyber Incident

    Hackers Use Grok Ai To Spread Malware

    CyberMaterial Logo
    • About Us
    • Contact Us
    • Jobs
    • Legal and Privacy Policy
    • Site Map

    © 2025 | CyberMaterial | All rights reserved

    Welcome Back!

    Login to your account below

    Forgotten Password?

    Retrieve your password

    Please enter your username or email address to reset your password.

    Log In

    Add New Playlist

    No Result
    View All Result
    • Alerts
    • Incidents
    • News
    • Cyber Decoded
    • Cyber Hygiene
    • Cyber Review
    • Definitions
    • Malware
    • Cyber Tips
    • Tutorials
    • Advanced Persistent Threats
    • Threat Actors
    • Report an incident
    • Password Generator
    • About Us
    • Contact Us
    • Advertise with us

    Copyright © 2025 CyberMaterial