Menu

  • Alerts
  • Incidents
  • News
  • APTs
  • Cyber Decoded
  • Cyber Hygiene
  • Cyber Review
  • Cyber Tips
  • Definitions
  • Malware
  • Threat Actors
  • Tutorials

Useful Tools

  • Password generator
  • Report an incident
  • Report to authorities
No Result
View All Result
CTF Hack Havoc
CyberMaterial
  • Education
    • Cyber Decoded
    • Definitions
  • Information
    • Alerts
    • Incidents
    • News
  • Insights
    • Cyber Hygiene
    • Cyber Review
    • Tips
    • Tutorials
  • Support
    • Contact Us
    • Report an incident
  • About
    • About Us
    • Advertise with us
Get Help
Hall of Hacks
  • Education
    • Cyber Decoded
    • Definitions
  • Information
    • Alerts
    • Incidents
    • News
  • Insights
    • Cyber Hygiene
    • Cyber Review
    • Tips
    • Tutorials
  • Support
    • Contact Us
    • Report an incident
  • About
    • About Us
    • Advertise with us
Get Help
No Result
View All Result
Hall of Hacks
CyberMaterial
No Result
View All Result
Home Incidents

DeepSeek Leak Exposes 12000 Hardcoded Keys

February 28, 2025
Reading Time: 2 mins read
in Incidents
Chinese Hack on Belgium Intelligence Service

A recent analysis of the Common Crawl dataset has revealed a major security flaw, uncovering 12,000 live API keys, passwords, and credentials embedded in publicly accessible web pages. These exposed credentials, which grant access to services such as AWS, Slack, and Mailchimp, highlight significant risks within AI development pipelines. The problem stems from widespread credential hardcoding found across millions of archived web pages, raising serious concerns about the safeguards in place for AI-generated code. This incident shines a light on the unintended consequences of training large language models (LLMs) like DeepSeek on unfiltered, publicly available data.

Researchers at Truffle Security conducted a thorough scan of the Common Crawl dataset, which includes web content scraped from billions of pages.

Using the open-source TruffleHog tool, they detected thousands of valid credentials, some of which were reused across multiple sites. For instance, a WalkScore API key was found 57,000 times across various subdomains, while a single webpage contained 17 unique Slack webhooks.

The findings indicate that large-scale infrastructure failures in credential management are more widespread than previously thought, with significant potential for malicious activity.

The data analysis also revealed the inherent risks of training LLMs on datasets that contain exposed credentials. These models are unable to differentiate between valid secrets and placeholder examples during training, leading to a feedback loop that normalizes insecure coding practices. As LLMs like DeepSeek are increasingly used to assist developers, they inadvertently encourage the use of dangerous practices such as hardcoding API keys directly into code, potentially leading to future security vulnerabilities. This issue is compounded by the sheer volume of exposed secrets in training data, skewing the model’s recommendations toward insecure solutions.

In response to these findings, experts advocate for implementing stronger safeguards in both AI development and cybersecurity practices. This includes using security guardrails within AI coding assistants and filtering sensitive data from training datasets. Truffle Security also urges developers to improve education on secure credential management and calls for industry-wide collaboration to audit and sanitize the data that shapes modern AI models. As AI tools become more integrated into software development, securing the training data used to develop these models is critical to prevent future breaches and ensure more robust cybersecurity practices.

Reference:

  • DeepSeek Leak Reveals 12,000 Exposed API Keys and Passwords
Tags: cyber incidentsCyber Incidents 2025Cyber threatsFebruary 2025
ADVERTISEMENT

Related Posts

Sompo Data Breach Puts 17.5M Records At Risk

Sompo Data Breach Puts 17.5M Records At Risk

June 11, 2025
BHA Hit By Ransomware But Races Continue

BHA Hit By Ransomware But Races Continue

June 11, 2025
DDoS Disrupts Roularta Media In Belgium

DDoS Disrupts Roularta Media In Belgium

June 11, 2025
Illinois HFS Employee Phishing Leaks Data

Texas DOT Breach Leaks 300K Crash Reports

June 10, 2025
Illinois HFS Employee Phishing Leaks Data

Cyberattack Disrupts UNFI Food Deliveries

June 10, 2025
Illinois HFS Employee Phishing Leaks Data

Illinois HFS Employee Phishing Leaks Data

June 10, 2025

Latest Alerts

Fake Sora AI Lure Installs Infostealer

FIN6 Uses Fake Resumes To Hack Recruiters

Microsoft Fixes Exploited WebDAV Zero Day

Google Bug Exposed Any User’s Phone Number

Roundcube RCE Flaw Risks 84,000 Servers

New Skitnet Malware Arms Ransomware Gangs

Subscribe to our newsletter

    Latest Incidents

    BHA Hit By Ransomware But Races Continue

    Sompo Data Breach Puts 17.5M Records At Risk

    DDoS Disrupts Roularta Media In Belgium

    Texas DOT Breach Leaks 300K Crash Reports

    Illinois HFS Employee Phishing Leaks Data

    Cyberattack Disrupts UNFI Food Deliveries

    CyberMaterial Logo
    • About Us
    • Contact Us
    • Jobs
    • Legal and Privacy Policy
    • Site Map

    © 2025 | CyberMaterial | All rights reserved

    Welcome Back!

    Login to your account below

    Forgotten Password?

    Retrieve your password

    Please enter your username or email address to reset your password.

    Log In

    Add New Playlist

    No Result
    View All Result
    • Alerts
    • Incidents
    • News
    • Cyber Decoded
    • Cyber Hygiene
    • Cyber Review
    • Definitions
    • Malware
    • Cyber Tips
    • Tutorials
    • Advanced Persistent Threats
    • Threat Actors
    • Report an incident
    • Password Generator
    • About Us
    • Contact Us
    • Advertise with us

    Copyright © 2025 CyberMaterial