Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
  • AI
  • Tech
  • Cybersecurity
  • Finance
  • DeFi & Blockchain
  • Startups
  • Gaming
Dataconomy
  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI toolsNEW
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
Subscribe
No Result
View All Result
Dataconomy
No Result
View All Result

CrowdStrike and Meta launch open-source CyberSOCEval benchmark to test AI cybersecurity models

The tool allows organizations to measure large language model performance in incident response, threat analysis, and malware detection.

byAytun Çelebi
September 16, 2025
in Cybersecurity
Home News Cybersecurity
Share on FacebookShare on TwitterShare on LinkedInShare on WhatsAppShare on e-mail
Google Preferred Source

Cybersecurity firm CrowdStrike and Meta have released CyberSOCEval, an open-source benchmark suite designed to evaluate the performance of AI models in security operations centers (SOCs). The tool aims to help businesses select the right AI-powered cybersecurity solutions by providing a standardized way to test their capabilities in key security tasks.

The challenge of choosing the right AI security tool

As AI becomes integrated into a growing number of cybersecurity products, security professionals face the challenge of choosing from a wide array of options with varying costs and capabilities. CyberSOCEval addresses this by offering a structured method for testing large language models (LLMs) on core SOC functions, including incident response, threat analysis, and malware detection.

Without clear benchmarks, it’s difficult to know which systems, use cases, and performance standards deliver a true AI advantage against real-world attacks.

By standardizing these evaluations, the benchmark allows organizations to objectively measure how different AI models perform in realistic scenarios, helping them identify the tools that best fit their operational needs.

Stay Ahead of the Curve!

Don't miss out on the latest insights, trends, and analysis in the world of data, technology, and startups. Subscribe to our newsletter and get exclusive content delivered straight to your inbox.

How CyberSOCEval benefits both businesses and developers

For businesses, the benchmark provides clear, comparable data on model performance. For AI developers, it offers valuable insights into how enterprise clients use their models for cybersecurity. This feedback can guide future improvements, helping creators refine their models to better handle specific industry jargon or complex threat intelligence. The framework is designed to be adaptable, allowing for the inclusion of new tests as threats like zero-day exploits emerge.

The release of CyberSOCEval comes amid a digital arms race where both attackers and defenders are leveraging AI. A survey by Mastercard and the Financial Times Longitude found that financial services firms have saved millions of dollars by using AI-powered tools to combat AI-enabled fraud, demonstrating the tangible benefits of effective defensive AI.

An open-source approach to improving security

Meta’s involvement in the project aligns with its history of supporting open-source AI development, such as its Llama models. By making CyberSOCEval an open-source tool, the companies encourage community collaboration to improve and expand the benchmarks over time. This approach aims to accelerate industry-wide progress in defending against advanced, AI-based threats.

With these benchmarks in place, and open for the security and AI community to further improve, we can more quickly work as an industry to unlock the potential of AI in protecting against advanced attacks, including AI-based threats.

CyberSOCEval is available now on GitHub, where users can download the suite to run evaluations on their preferred LLMs. The repository includes documentation, sample datasets, and instructions for integrating the tests into existing security platforms.


Featured image credit

Tags: CrowdstrikeCyberSOCEvalMeta

Related Posts

Why secure software delivery depends on better release management

Why secure software delivery depends on better release management

June 3, 2026
Popular Codex package caught exfiltrating authentication credentials

Popular Codex package caught exfiltrating authentication credentials

June 2, 2026
GTA V cheat service Atlas Menu hacked, exposing 64,000 accounts

GTA V cheat service Atlas Menu hacked, exposing 64,000 accounts

June 2, 2026
Meta patches AI flaw that enabled Instagram account takeovers

Meta patches AI flaw that enabled Instagram account takeovers

June 2, 2026
GitHub confirms breach after hackers steal 3,800 code repositories

GitHub confirms breach after hackers steal 3,800 code repositories

May 20, 2026
Myhtos reportedly helped researchers uncover macOS exploit

Myhtos reportedly helped researchers uncover macOS exploit

May 19, 2026

LATEST NEWS

Why Telegram Mini Apps have become the optimal ecosystem for launching AI SaaS products

Crypto investors are watching one date closely in 2026

How Telegram Creators test post visibility before running growth campaigns

Does your AI clock in without you?

Why secure software delivery depends on better release management

Sony reveals God of War: Laufey for PS5

BEST AI MODELS LEADERBOARD

See the best AI models, ranked by intelligence, benchmark results, speed and token price. Find the most suitable LLMs, Text-to-Image, Image Editing, Text-to-Speech, Text-to-Video and Image-to-Video  artificial intelligence model for your tasks and business.

LATEST TOOLS

Veed.io

Paper Pilot

IsOn24

Magnific

DADABOTS

Rosebud AI

Prome

Pageon AI

Vyond

Centauri AI

Dataconomy

COPYRIGHT © DATACONOMY MEDIA GMBH, ALL RIGHTS RESERVED.

  • About
  • Imprint
  • Contact
  • Legal & Privacy

Follow Us

  • News
    • Artificial Intelligence
    • Cybersecurity
    • DeFi & Blockchain
    • Finance
    • Gaming
    • Startups
    • Tech
  • Industry
  • Research
  • Resources
    • Articles
    • Guides
    • Case Studies
    • Whitepapers
    • AI Models Leaderboard
  • AI tools
  • Newsletter
  • + More
    • Glossary
    • Conversations
    • Events
    • About
      • Who we are
      • Contact
      • Imprint
      • Legal & Privacy
      • Partner With Us
No Result
View All Result
Subscribe

This website uses cookies to improve your experience. You can choose to accept or reject them. Visit our Privacy Policy.