SOTAVerified

software testing

Papers

Showing 150 of 135 papers

TitleStatusHype
Guaranteed Guess: A Language Modeling Approach for CISC-to-RISC Transpilation with Testing Guarantees0
Navigating the growing field of research on AI for software testing -- the taxonomy for AI-augmented software testing and an ontology-driven literature surveyCode0
IntenTest: Stress Testing for Intent Integrity in API-Calling LLM Agents0
The Impact of Software Testing with Quantum Optimization Meets Machine Learning0
EvoGPT: Enhancing Test Suite Robustness via LLM-Based Generation and Genetic Optimization0
On the Need for a Statistical Foundation in Scenario-Based Testing of Autonomous Vehicles0
Automated Unit Test Case Generation: A Systematic Literature Review0
Test It Before You Trust It: Applying Software Testing for Trustworthy In-context LearningCode0
Harden and Catch for Just-in-Time Assured LLM-Based Software Testing: Open Research Challenges0
Expectations vs Reality -- A Secondary Study on AI Adoption in Software Testing0
From Code Generation to Software Testing: AI Copilot with Context-Based RAG0
Towards Trustworthy GUI Agents: A SurveyCode0
Integrating Artificial Intelligence with Human Expertise: An In-depth Analysis of ChatGPT's Capabilities in Generating Metamorphic Relations0
Vulnerability Detection: From Formal Verification to Large Language Models and Hybrid Approaches: A Comprehensive Overview0
Rule-Guided Reinforcement Learning Policy Evaluation and Improvement0
ToolFuzz -- Automated Agent Tool Testing0
WIP: Assessing the Effectiveness of ChatGPT in Preparatory Testing Activities0
Towards Reliable LLM-Driven Fuzz Testing: Vision and Road Ahead0
CLOVER: A Test Case Generation Benchmark with Coverage, Long-Context, and Verification0
Identifying Flaky Tests in Quantum Code: A Machine Learning Approach0
A Systematic Approach for Assessing Large Language Models' Test Case Generation Capability0
Assessing Data Augmentation-Induced Bias in Training and Testing of Machine Learning ModelsCode0
Toward Neurosymbolic Program Comprehension0
Many-Objective Neuroevolution for Testing Games0
An efficient approach to represent enterprise web application structure using Large Language Model in the service of Intelligent Quality Engineering0
The Potential of LLMs in Automating Software Testing: From Generation to Reporting0
Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation0
Design choices made by LLM-based test generators prevent them from finding bugs0
CPP-UT-Bench: Can LLMs Write Complex Unit Tests in C++?0
Software testing for project report.0
VALTEST: Automated Validation of Language Model Generated Test Cases0
Can Search-Based Testing with Pareto Optimization Effectively Cover Failure-Revealing Test Inputs?Code0
TAEGAN: Generating Synthetic Tabular Data For Data Augmentation0
On the Effectiveness of LLMs for Manual Test Verifications0
Computer Vision Intelligence Test Modeling and Generation: A Case Study on Smart OCR0
Exploring the Integration of Large Language Models in Industrial Test Maintenance Processes0
The Future of Software Testing: AI-Powered Test Case Generation and Validation0
The Role of Artificial Intelligence and Machine Learning in Software Testing0
Testing and Evaluation of Large Language Models: Correctness, Non-Toxicity, and Fairness0
Leveraging Large Language Models for Enhancing the Understandability of Generated Unit TestsCode1
A System for Automated Unit Test Generation Using Large Language Models and Assessment of Generated Test Suites0
MAO: A Framework for Process Model Generation with Multi-Agent Orchestration0
FuzzTheREST: An Intelligent Automated Black-box RESTful API Fuzzer0
SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code AgentsCode2
Data Augmentation by Fuzzing for Neural Test Generation0
BugBlitz-AI: An Intelligent QA Assistant0
Artificial intelligence for context-aware visual change detection in software test automation0
Fuzzy Inference System for Test Case Prioritization in Software Testing0
LLM-Powered Test Case Generation for Detecting Bugs in Plausible ProgramsCode0
Tasks People Prompt: A Taxonomy of LLM Downstream Tasks in Software Verification and Falsification Approaches0
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.