SOTAVerified

Vulnerability Detection

Vulnerability detection plays a crucial role in safeguarding against these threats by identifying weaknesses and potential entry points that malicious actors could exploit. Through advanced scanning techniques and penetration testing, vulnerability detection tools meticulously analyze web applications and websites for vulnerabilities such as SQL injection, cross-site scripting (XSS), and insecure authentication mechanisms.

By proactively identifying and addressing vulnerabilities, organizations can strengthen their online security posture and mitigate the risk of data breaches, financial loss, and reputational damage. Additionally, vulnerability detection empowers businesses to stay compliant with industry regulations and standards, demonstrating their commitment to safeguarding sensitive information and maintaining the trust of their customers. With the evolving threat landscape and increasingly sophisticated attack vectors, investing in robust vulnerability detection measures is paramount for staying one step ahead of cyber threats and ensuring the resilience of web-based platforms and services.

Papers

Showing 150 of 216 papers

TitleStatusHype
CORE: Benchmarking LLMs Code Reasoning Capabilities through Static Analysis Tasks0
SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models0
Smart-LLaMA-DPO: Reinforced Large Language Model for Explainable Smart Contract Vulnerability Detection0
Identifying Helpful Context for LLM-based Vulnerability Repair: A Preliminary Study0
Today's Cat Is Tomorrow's Dog: Accounting for Time-Based Changes in the Labels of ML Vulnerability Detection Approaches0
Boosting Vulnerability Detection of LLMs via Curriculum Preference Optimization with Synthetic Reasoning DataCode0
Ai-Driven Vulnerability Analysis in Smart Contracts: Trends, Challenges and Future Directions0
SafeGenBench: A Benchmark Framework for Security Vulnerability Detection in LLM-Generated Code0
Explainer-guided Targeted Adversarial Attacks against Binary Code Similarity Detection Models0
A Multi-Dataset Evaluation of Models for Automated Vulnerability Repair0
LPASS: Linear Probes as Stepping Stones for vulnerability detection using compressed LLMs0
BugWhisperer: Fine-Tuning LLMs for SoC Hardware Vulnerability Detection0
SV-TrustEval-C: Evaluating Structure and Semantic Reasoning in Large Language Models for Source Code Vulnerability AnalysisCode0
An Initial Exploration of Fine-tuning Small Language Models for Smart Contract Reentrancy Vulnerability Detection0
CRAKEN: Cybersecurity LLM Agent with Knowledge-Based ExecutionCode1
Adaptive Plan-Execute Framework for Smart Contract Security Auditing0
Leveraging Large Language Models for Command Injection Vulnerability Analysis in Python: An Empirical Study on Popular Open-Source Projects0
Let the Trial Begin: A Mock-Court Approach to Vulnerability Detection using LLM-Based Agents0
Are Sparse Autoencoders Useful for Java Function Bug Detection?Code0
Can You Really Trust Code Copilots? Evaluating Large Language Models from a Code Security PerspectiveCode0
Enhancing Large Language Models with Faster Code Preprocessing for Vulnerability DetectionCode0
Program Semantic Inequivalence Game with Large Language ModelsCode0
Enhancing the Cloud Security through Topic Modelling0
LLMpatronous: Harnessing the Power of LLMs For Vulnerability Detection0
Case Study: Fine-tuning Small Language Models for Accurate and Private CWE Detection in Python Code0
A Study on Mixup-Inspired Augmentation Methods for Software Vulnerability Detection0
C2RUST-BENCH: A Minimized, Representative Dataset for C-to-Rust Transpilation Evaluation0
AI-Based Vulnerability Analysis of NFT Smart Contracts0
Trace Gadgets: Minimizing Code Context for Machine Learning-Based Vulnerability Prediction0
The Hitchhiker's Guide to Program Analysis, Part II: Deep Thoughts by LLMsCode1
Using ML filters to help automated vulnerability repairs: when it helps and when it doesn't0
R2Vul: Learning to Reason about Software Vulnerabilities with Reinforcement Learning and Structured Reasoning DistillationCode1
Responsible Development of Offensive AICode0
Reasoning Under Threat: Symbolic and Neural Techniques for Cybersecurity Verification0
Enhancing Software Vulnerability Detection Using Code Property Graphs and Convolutional Neural Networks0
Reasoning with LLMs for Zero-Shot Vulnerability DetectionCode0
Computing Modes of Instability of Parameterized Nonlinear Systems for Vulnerability Assessment0
Vulnerability Detection: From Formal Verification to Large Language Models and Hybrid Approaches: A Comprehensive Overview0
CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE DetectionCode1
Evaluating LLaMA 3.2 for Software Vulnerability Detection0
MTVHunter: Smart Contracts Vulnerability Detection Based on Multi-Teacher Knowledge TranslationCode0
A Multi-Agent Framework for Automated Vulnerability Detection and Repair in Solidity and Move Smart Contracts0
SmartLLM: Smart Contract Auditing using Custom Generative AI0
LLMs in Software Security: A Survey of Vulnerability Detection Techniques and InsightsCode0
Large Language Models for In-File Vulnerability Localization Can Be "Lost in the End"0
Evaluating Large Language Models in Vulnerability Detection Under Variable Context Windows0
Automating the Detection of Code Vulnerabilities by Analyzing GitHub Issues0
CGP-Tuning: Structure-Aware Soft Prompt Tuning for Code Vulnerability Detection0
How to Select Pre-Trained Code Models for Reuse? A Learning PerspectiveCode0
Investigating Large Language Models for Code Vulnerability Detection: An Experimental StudyCode1
Show:102550
← PrevPage 1 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Reveal Model - Tested on Reveal (Training on Devign + VulScribeR 20K + Extra Cleans)F1 Score26.18Unverified
2Devign Model - Tested on Reveal (Training on Devign + VulScribeR 20K + Extra Cleans)F1 Score24.99Unverified
3Reveal Model - Tested on Bigvul (Training on Devign + VulScribeR 20K + Extra Cleans)F1 Score18.98Unverified
4Devign Model - Tested on Bigvul (Training on Devign + VulScribeR 20K + Extra Cleans)F1 Score18.51Unverified
5LineVul - Tested on Reveal (Training on Devign + VulScribeR 20K + Extra Cleans)F1 Score17.38Unverified
6LineVul - Tested on BigVul (Training on Devign + VulScribeR 20K+ Extra Cleans)F1 Score16.23Unverified
#ModelMetricClaimedVerifiedStatus
1WizardCoderAUC0.86Unverified
2ContraBERTAUC0.85Unverified