Menu

  • Alerts
  • Incidents
  • News
  • APTs
  • Cyber Decoded
  • Cyber Hygiene
  • Cyber Review
  • Cyber Tips
  • Definitions
  • Malware
  • Threat Actors
  • Tutorials

Useful Tools

  • Password generator
  • Report an incident
  • Report to authorities
No Result
View All Result
CTF Hack Havoc
CyberMaterial
  • Education
    • Cyber Decoded
    • Definitions
  • Information
    • Alerts
    • Incidents
    • News
  • Insights
    • Cyber Hygiene
    • Cyber Review
    • Tips
    • Tutorials
  • Support
    • Contact Us
    • Report an incident
  • About
    • About Us
    • Advertise with us
Get Help
Hall of Hacks
  • Education
    • Cyber Decoded
    • Definitions
  • Information
    • Alerts
    • Incidents
    • News
  • Insights
    • Cyber Hygiene
    • Cyber Review
    • Tips
    • Tutorials
  • Support
    • Contact Us
    • Report an incident
  • About
    • About Us
    • Advertise with us
Get Help
No Result
View All Result
Hall of Hacks
CyberMaterial
No Result
View All Result
Home News

Baldur AI Advances Software Verification

January 11, 2024
Reading Time: 3 mins read
in News

A team of computer scientists from the University of Massachusetts Amherst has introduced a groundbreaking method for enhancing software verification known as Baldur. This innovative approach, leveraging the artificial intelligence capabilities of large language models (LLMs) like ChatGPT, aims to automatically generate proofs ensuring software correctness. Collaborating with Google and incorporating the tool Thor, Baldur demonstrated an impressive efficacy of 65.7% in generating proofs, marking a significant leap in the quest for bug-free software. This discovery holds promise in addressing the profound impact of software bugs on society, from minor inconveniences to potential security breaches, providing an efficient and automated way to verify software correctness.

Baldur’s development involved months of collaboration with Google and was built on extensive prior research. The team, led by Professor Yuriy Brun, fine-tuned an LLM named Minerva on mathematical scientific papers and webpages, subsequently adapting it to the language Isabelle/HOL used in mathematical proofs. The process involves Baldur generating entire proofs, which are then checked by a theorem prover. The collaboration between Baldur and Thor, the proof-generating tool, achieved an efficacy rate of 65.7%, showcasing a significant advancement in automating the verification process and saving engineers considerable manual effort.

The conventional methods of manually reviewing code or running it against expected outcomes are prone to human error and time-consuming for complex systems. Baldur’s role in automating the writing of mathematical proofs offers a promising solution to this challenge. While there is acknowledgment of a degree of error, Baldur stands out as an efficient and effective means of software verification, potentially revolutionizing the field as AI capabilities continue to evolve and improve.

The team’s work aligns with the concept of formal verification, where engineers build mathematical proofs alongside software systems to ensure correctness. Baldur’s introduction brings automation to this process, offering a method that generates proofs automatically in a significant percentage of cases, providing a more practical approach to achieving bug-free software.

Reference:
  • UMASS AMHERST RESEARCHERS BRING DREAM OF BUG-FREE SOFTWARE ONE STEP CLOSER TO REALITY
Tags: AIArtificial IntelligenceBaldurbugsCyber NewsCyber News 2024CybersecurityGoogleJanuary 2024University of Massachusetts Amherst
ADVERTISEMENT

Related Posts

Texas Creates Largest US State Cyber Command

FBI Taps Brett Leatherman As New Cyber Chief

June 10, 2025
Texas Creates Largest US State Cyber Command

Texas Creates Largest US State Cyber Command

June 10, 2025
Texas Creates Largest US State Cyber Command

WordPress Fight Leads To New FAIR Manager

June 10, 2025
OpenAI Bans State Hackers From ChatGPT

New Trump Cyber EO Rolls Back Biden Rules

June 9, 2025
OpenAI Bans State Hackers From ChatGPT

DOJ Seeks $7.74M From North Korean IT Scam

June 9, 2025
OpenAI Bans State Hackers From ChatGPT

OpenAI Bans State Hackers From ChatGPT

June 9, 2025

Latest Alerts

Google Bug Exposed Any User’s Phone Number

Roundcube RCE Flaw Risks 84,000 Servers

New Skitnet Malware Arms Ransomware Gangs

Sabotage Theft Malware On npm And PyPI

Salesforce SOQL Flaw Exposed User Records

HelloTDS Spreads Malware Via Fake CAPTCHAs

Subscribe to our newsletter

    Latest Incidents

    Texas DOT Breach Leaks 300K Crash Reports

    Illinois HFS Employee Phishing Leaks Data

    Cyberattack Disrupts UNFI Food Deliveries

    Hack Shuts Down Brazil City Health Systems

    Sorbonne University Hit By Staff Data Breach

    Chaos Gang Leaks Optima Tax Client Data

    CyberMaterial Logo
    • About Us
    • Contact Us
    • Jobs
    • Legal and Privacy Policy
    • Site Map

    © 2025 | CyberMaterial | All rights reserved

    Welcome Back!

    Login to your account below

    Forgotten Password?

    Retrieve your password

    Please enter your username or email address to reset your password.

    Log In

    Add New Playlist

    No Result
    View All Result
    • Alerts
    • Incidents
    • News
    • Cyber Decoded
    • Cyber Hygiene
    • Cyber Review
    • Definitions
    • Malware
    • Cyber Tips
    • Tutorials
    • Advanced Persistent Threats
    • Threat Actors
    • Report an incident
    • Password Generator
    • About Us
    • Contact Us
    • Advertise with us

    Copyright © 2025 CyberMaterial