SOTAVerified

Vulnerability Detection

Vulnerability detection plays a crucial role in safeguarding against these threats by identifying weaknesses and potential entry points that malicious actors could exploit. Through advanced scanning techniques and penetration testing, vulnerability detection tools meticulously analyze web applications and websites for vulnerabilities such as SQL injection, cross-site scripting (XSS), and insecure authentication mechanisms.

By proactively identifying and addressing vulnerabilities, organizations can strengthen their online security posture and mitigate the risk of data breaches, financial loss, and reputational damage. Additionally, vulnerability detection empowers businesses to stay compliant with industry regulations and standards, demonstrating their commitment to safeguarding sensitive information and maintaining the trust of their customers. With the evolving threat landscape and increasingly sophisticated attack vectors, investing in robust vulnerability detection measures is paramount for staying one step ahead of cyber threats and ensuring the resilience of web-based platforms and services.

Papers

Showing 150 of 216 papers

TitleStatusHype
NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive SecurityCode11
Vulnerability Detection with Code Language Models: How Far Are We?Code3
MoreFixes: A Large-Scale Dataset of CVE Fix Commits Mined through Enhanced Repository DiscoveryCode2
An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong DetectionCode2
Finetuning Large Language Models for Vulnerability DetectionCode2
Generalization-Enhanced Code Vulnerability Detection via Multi-Task Instruction Fine-TuningCode2
Combining Graph Neural Networks with Expert Knowledge for Smart Contract Vulnerability DetectionCode1
When Less is Enough: Positive and Unlabeled Learning Model for Vulnerability DetectionCode1
GraphSearchNet: Enhancing GNNs via Capturing Global Dependencies for Semantic Code SearchCode1
VulBERTa: Simplified Source Code Pre-Training for Vulnerability DetectionCode1
Large Language Model-Powered Smart Contract Vulnerability Detection: New PerspectivesCode1
GPTScan: Detecting Logic Vulnerabilities in Smart Contracts by Combining GPT with Program AnalysisCode1
Is Function Similarity Over-Engineered? Building a BenchmarkCode1
MANDO: Multi-Level Heterogeneous Graph Embeddings for Fine-Grained Detection of Smart Contract VulnerabilitiesCode1
The Hitchhiker's Guide to Program Analysis, Part II: Deep Thoughts by LLMsCode1
VulCNN: An Image-inspired Scalable Vulnerability Detection SystemCode1
ReGVD: Revisiting Graph Neural Networks for Vulnerability DetectionCode1
R2Vul: Learning to Reason about Software Vulnerabilities with Reinforcement Learning and Structured Reasoning DistillationCode1
An Unbiased Transformer Source Code Learning with Semantic Vulnerability GraphCode1
Illuminati: Towards Explaining Graph Neural Networks for Cybersecurity AnalysisCode1
Trex: Learning Execution Semantics from Micro-Traces for Binary SimilarityCode1
How Far Have We Gone in Vulnerability Detection Using Large Language ModelsCode1
IoTvulCode: AI-enabled vulnerability detection in software products designed for IoT applicationsCode1
Uncovering the Limits of Machine Learning for Automatic Vulnerability DetectionCode1
A Survey on Machine Learning Techniques for Source Code AnalysisCode1
Learning to Quantize Vulnerability Patterns and Match to Locate Statement-Level VulnerabilitiesCode1
Learning Program Semantics with Code Representations: An Empirical StudyCode1
LIVABLE: Exploring Long-Tailed Classification of Software Vulnerability TypesCode1
Smart Contract Vulnerability Detection: From Pure Neural Network to Interpretable Graph Feature and Expert Pattern FusionCode1
Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability DetectionCode1
Investigating Large Language Models for Code Vulnerability Detection: An Experimental StudyCode1
Cross Project Software Vulnerability Detection via Domain Adaptation and Max-Margin PrincipleCode1
CRAKEN: Cybersecurity LLM Agent with Knowledge-Based ExecutionCode1
D2A: A Dataset Built for AI-Based Vulnerability Detection Methods Using Differential AnalysisCode1
Deep Smart Contract Intent DetectionCode1
EaTVul: ChatGPT-based Evasion Attack Against Software Vulnerability DetectionCode1
CryptoFormalEval: Integrating LLMs and Formal Verification for Automated Cryptographic Protocol Vulnerability DetectionCode1
DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability DetectionCode1
Stack-based Buffer Overflow Detection using Recurrent Neural NetworksCode1
Eth2Vec: Learning Contract-Wide Code Representations for Vulnerability Detection on Ethereum Smart ContractsCode1
VulScribeR: Exploring RAG-based Vulnerability Augmentation with LLMsCode1
CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE DetectionCode1
Graph Neural Networks for Vulnerability Detection: A Counterfactual ExplanationCode1
VulDetectBench: Evaluating the Deep Capability of Vulnerability Detection with Large Language ModelsCode1
Automated Vulnerability Detection in Source Code Using Deep Representation LearningCode0
AndroShield: Automated Android Applications Vulnerability Detection, a Hybrid Static and Dynamic Analysis ApproachCode0
Machine Learning Techniques for Python Source Code Vulnerability DetectionCode0
MTVHunter: Smart Contracts Vulnerability Detection Based on Multi-Teacher Knowledge TranslationCode0
Leveraging Generative AI to Enhance Automated Vulnerability ScoringCode0
Large Language Models for Cyber Security: A Systematic Literature ReviewCode0
Show:102550
← PrevPage 1 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Reveal Model - Tested on Reveal (Training on Devign + VulScribeR 20K + Extra Cleans)F1 Score26.18Unverified
2Devign Model - Tested on Reveal (Training on Devign + VulScribeR 20K + Extra Cleans)F1 Score24.99Unverified
3Reveal Model - Tested on Bigvul (Training on Devign + VulScribeR 20K + Extra Cleans)F1 Score18.98Unverified
4Devign Model - Tested on Bigvul (Training on Devign + VulScribeR 20K + Extra Cleans)F1 Score18.51Unverified
5LineVul - Tested on Reveal (Training on Devign + VulScribeR 20K + Extra Cleans)F1 Score17.38Unverified
6LineVul - Tested on BigVul (Training on Devign + VulScribeR 20K+ Extra Cleans)F1 Score16.23Unverified
#ModelMetricClaimedVerifiedStatus
1WizardCoderAUC0.86Unverified
2ContraBERTAUC0.85Unverified