SOTAVerified

Vulnerability Detection

Vulnerability detection safeguards web applications against cyber threats by identifying weaknesses and potential entry points that attackers could exploit. Using automated scanning and penetration testing, vulnerability detection tools analyze web applications and websites for flaws such as SQL injection, cross-site scripting (XSS), and insecure authentication mechanisms.
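
As a toy illustration of the kind of check such a scanner might run, the Python sketch below flags two of the flaw classes named above: SQL built by string concatenation and untrusted data assigned to `innerHTML`. The `PATTERNS` table, the `scan` function, and the sample input are all hypothetical and do not come from any real tool; production scanners rely on parsing, data-flow analysis, and dynamic testing rather than regular expressions alone.

```python
import re

# Hypothetical, deliberately naive signatures (assumptions for illustration).
PATTERNS = {
    # A string literal passed to execute() and then extended with + or %.
    "sql-injection": re.compile(r"execute\(\s*[\"'].*[\"']\s*[+%]"),
    # innerHTML assigned from an identifier instead of a string literal.
    "xss": re.compile(r"innerHTML\s*=\s*[A-Za-z_]"),
}


def scan(source: str) -> list[tuple[int, str]]:
    """Return (line number, finding name) pairs for every pattern hit."""
    findings = []
    for lineno, line in enumerate(source.splitlines(), start=1):
        for name, pattern in PATTERNS.items():
            if pattern.search(line):
                findings.append((lineno, name))
    return findings


# Made-up sample: lines 1 and 3 are risky, lines 2 and 4 are the safer forms.
SAMPLE = """cursor.execute("SELECT * FROM users WHERE name = '" + name + "'")
cursor.execute("SELECT * FROM users WHERE name = ?", (name,))
element.innerHTML = userInput
element.innerHTML = "static text"
"""

print(scan(SAMPLE))  # [(1, 'sql-injection'), (3, 'xss')]
```

The parameterized query on line 2 of the sample is not flagged because the user-supplied value is passed separately from the SQL text, which is the usual remedy for the concatenation on line 1.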

By identifying and fixing vulnerabilities proactively, organizations can strengthen their security posture and reduce the risk of data breaches, financial loss, and reputational damage. Vulnerability detection also helps businesses comply with industry regulations and standards, which demonstrates their commitment to protecting sensitive information and maintaining customer trust. As the threat landscape evolves and attack vectors grow more sophisticated, investing in robust vulnerability detection is essential for keeping web-based platforms and services resilient.

Papers

Showing 1-50 of 216 papers

| Title | Status | Hype |
| --- | --- | --- |
| NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security | Code | 11 |
| Vulnerability Detection with Code Language Models: How Far Are We? | Code | 3 |
| MoreFixes: A Large-Scale Dataset of CVE Fix Commits Mined through Enhanced Repository Discovery | Code | 2 |
| An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection | Code | 2 |
| Generalization-Enhanced Code Vulnerability Detection via Multi-Task Instruction Fine-Tuning | Code | 2 |
| Finetuning Large Language Models for Vulnerability Detection | Code | 2 |
| CRAKEN: Cybersecurity LLM Agent with Knowledge-Based Execution | Code | 1 |
| The Hitchhiker's Guide to Program Analysis, Part II: Deep Thoughts by LLMs | Code | 1 |
| R2Vul: Learning to Reason about Software Vulnerabilities with Reinforcement Learning and Structured Reasoning Distillation | Code | 1 |
| CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE Detection | Code | 1 |
| Investigating Large Language Models for Code Vulnerability Detection: An Experimental Study | Code | 1 |
| CryptoFormalEval: Integrating LLMs and Formal Verification for Automated Cryptographic Protocol Vulnerability Detection | Code | 1 |
| Is Function Similarity Over-Engineered? Building a Benchmark | Code | 1 |
| VulScribeR: Exploring RAG-based Vulnerability Augmentation with LLMs | Code | 1 |
| EaTVul: ChatGPT-based Evasion Attack Against Software Vulnerability Detection | Code | 1 |
| VulDetectBench: Evaluating the Deep Capability of Vulnerability Detection with Large Language Models | Code | 1 |
| IoTvulCode: AI-enabled vulnerability detection in software products designed for IoT applications | Code | 1 |
| Graph Neural Networks for Vulnerability Detection: A Counterfactual Explanation | Code | 1 |
| How Far Have We Gone in Vulnerability Detection Using Large Language Models | Code | 1 |
| Large Language Model-Powered Smart Contract Vulnerability Detection: New Perspectives | Code | 1 |
| When Less is Enough: Positive and Unlabeled Learning Model for Vulnerability Detection | Code | 1 |
| GPTScan: Detecting Logic Vulnerabilities in Smart Contracts by Combining GPT with Program Analysis | Code | 1 |
| Uncovering the Limits of Machine Learning for Automatic Vulnerability Detection | Code | 1 |
| LIVABLE: Exploring Long-Tailed Classification of Software Vulnerability Types | Code | 1 |
| Learning to Quantize Vulnerability Patterns and Match to Locate Statement-Level Vulnerabilities | Code | 1 |
| An Unbiased Transformer Source Code Learning with Semantic Vulnerability Graph | Code | 1 |
| DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability Detection | Code | 1 |
| Illuminati: Towards Explaining Graph Neural Networks for Cybersecurity Analysis | Code | 1 |
| Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability Detection | Code | 1 |
| Deep Smart Contract Intent Detection | Code | 1 |
| Cross Project Software Vulnerability Detection via Domain Adaptation and Max-Margin Principle | Code | 1 |
| MANDO: Multi-Level Heterogeneous Graph Embeddings for Fine-Grained Detection of Smart Contract Vulnerabilities | Code | 1 |
| VulCNN: An Image-inspired Scalable Vulnerability Detection System | Code | 1 |
| VulBERTa: Simplified Source Code Pre-Training for Vulnerability Detection | Code | 1 |
| Learning Program Semantics with Code Representations: An Empirical Study | Code | 1 |
| GraphSearchNet: Enhancing GNNs via Capturing Global Dependencies for Semantic Code Search | Code | 1 |
| A Survey on Machine Learning Techniques for Source Code Analysis | Code | 1 |
| ReGVD: Revisiting Graph Neural Networks for Vulnerability Detection | Code | 1 |
| Combining Graph Neural Networks with Expert Knowledge for Smart Contract Vulnerability Detection | Code | 1 |
| Smart Contract Vulnerability Detection: From Pure Neural Network to Interpretable Graph Feature and Expert Pattern Fusion | Code | 1 |
| D2A: A Dataset Built for AI-Based Vulnerability Detection Methods Using Differential Analysis | Code | 1 |
| Eth2Vec: Learning Contract-Wide Code Representations for Vulnerability Detection on Ethereum Smart Contracts | Code | 1 |
| Stack-based Buffer Overflow Detection using Recurrent Neural Networks | Code | 1 |
| Trex: Learning Execution Semantics from Micro-Traces for Binary Similarity | Code | 1 |
| CORE: Benchmarking LLMs Code Reasoning Capabilities through Static Analysis Tasks | | 0 |
| SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models | | 0 |
| Smart-LLaMA-DPO: Reinforced Large Language Model for Explainable Smart Contract Vulnerability Detection | | 0 |
| Today's Cat Is Tomorrow's Dog: Accounting for Time-Based Changes in the Labels of ML Vulnerability Detection Approaches | | 0 |
| Identifying Helpful Context for LLM-based Vulnerability Repair: A Preliminary Study | | 0 |
| Boosting Vulnerability Detection of LLMs via Curriculum Preference Optimization with Synthetic Reasoning Data | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Reveal Model - Tested on Reveal (Training on Devign + VulScribeR 20K + Extra Cleans) | F1 Score | 26.18 | | Unverified |
| 2 | Devign Model - Tested on Reveal (Training on Devign + VulScribeR 20K + Extra Cleans) | F1 Score | 24.99 | | Unverified |
| 3 | Reveal Model - Tested on BigVul (Training on Devign + VulScribeR 20K + Extra Cleans) | F1 Score | 18.98 | | Unverified |
| 4 | Devign Model - Tested on BigVul (Training on Devign + VulScribeR 20K + Extra Cleans) | F1 Score | 18.51 | | Unverified |
| 5 | LineVul - Tested on Reveal (Training on Devign + VulScribeR 20K + Extra Cleans) | F1 Score | 17.38 | | Unverified |
| 6 | LineVul - Tested on BigVul (Training on Devign + VulScribeR 20K + Extra Cleans) | F1 Score | 16.23 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | WizardCoder | AUC | 0.86 | | Unverified |
| 2 | ContraBERT | AUC | 0.85 | | Unverified |
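
For readers unfamiliar with the two metrics in these tables, the following Python sketch shows one common way to compute them for a binary vulnerable/non-vulnerable classifier. The page does not state how the listed values were obtained; the scale of the numbers suggests F1 is reported as a percentage and AUC on a 0 to 1 scale, but that is an assumption. The function names and every count and score in the example are made up for illustration and are not taken from any listed paper.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """Harmonic mean of precision and recall, on a 0-1 scale."""
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def auc(pos_scores: list[float], neg_scores: list[float]) -> float:
    """Fraction of (vulnerable, safe) pairs ranked correctly; ties count half."""
    correct = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                correct += 1.0
            elif p == n:
                correct += 0.5
    return correct / (len(pos_scores) * len(neg_scores))


# Hypothetical counts: 30 true positives, 120 false positives, 50 false negatives.
print(round(100 * f1_score(tp=30, fp=120, fn=50), 2))  # 26.09

# Hypothetical model scores for two vulnerable and two safe functions.
print(auc([0.9, 0.6], [0.7, 0.2]))  # 0.75
```

Because F1 depends on a decision threshold while AUC depends only on how samples are ranked, values from the two tables are not directly comparable.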