SOTAVerified

software testing

Papers

Showing 1-50 of 135 papers

Title | Status | Hype
SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code Agents | Code | 2
GPflow: A Gaussian process library using TensorFlow | Code | 2
On the Challenges of Fuzzing Techniques via Large Language Models | Code | 2
CoverUp: Effective High Coverage Test Generation for Python | Code | 2
Boosting Synthetic Data Generation with Effective Nonlinear Causal Discovery | Code | 1
Towards Principled Representation Learning from Videos for Reinforcement Learning | Code | 1
Perfect is the enemy of test oracle | Code | 1
Leveraging Large Language Models for Enhancing the Understandability of Generated Unit Tests | Code | 1
Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond | Code | 1
Software Engineering for AI-Based Systems: A Survey | Code | 1
Black-box Explanation of Object Detectors via Saliency Maps | Code | 1
Efficient and Effective Generation of Test Cases for Pedestrian Detection -- Search-based Software Testing of Baidu Apollo in SVL | Code | 1
QuanTest: Entanglement-Guided Testing of Quantum Neural Network Systems | Code | 0
Smoke Testing for Machine Learning: Simple Tests to Discover Severe Defects | Code | 0
Boosting Operational DNN Testing Efficiency through Conditioning | Code | 0
Can Search-Based Testing with Pareto Optimization Effectively Cover Failure-Revealing Test Inputs? | Code | 0
Business Negotiation Definition Language | Code | 0
Fairness-aware Configuration of Machine Learning Libraries | Code | 0
Differential testing for machine learning: an analysis for classification algorithms beyond deep learning | Code | 0
TensorFuzz: Debugging Neural Networks with Coverage-Guided Fuzzing | Code | 0
SECBENCH: A Database of Real Security Vulnerabilities | Code | 0
Test Case Recommendations with Distributed Representation of Code Syntactic Features | Code | 0
Reasoning-Based Software Testing | Code | 0
Towards Trustworthy GUI Agents: A Survey | Code | 0
A Comparison of Reinforcement Learning Frameworks for Software Testing Tasks | Code | 0
Recurrent Neural Networks for Fuzz Testing Web Browsers | Code | 0
Testing the Channels of Convolutional Neural Networks | Code | 0
Navigating the growing field of research on AI for software testing -- the taxonomy for AI-augmented software testing and an ontology-driven literature survey | Code | 0
Assessing Data Augmentation-Induced Bias in Training and Testing of Machine Learning Models | Code | 0
An Efficiency Study for SPLADE Models | Code | 0
Genetic Micro-Programs for Automated Software Testing with Large Path Coverage | Code | 0
InterEvo-TR: Interactive Evolutionary Test Generation With Readability Assessment | Code | 0
Comprehensive Evaluation and Insights into the Use of Large Language Models in the Automation of Behavior-Driven Development Acceptance Test Formulation | Code | 0
An Autonomous Performance Testing Framework using Self-Adaptive Fuzzy Reinforcement Learning | Code | 0
Generative AI to Generate Test Data Generators | Code | 0
IRG: Generating Synthetic Relational Databases using Deep Learning with Insightful Relational Understanding | Code | 0
Pipelines for Social Bias Testing of Large Language Models | Code | 0
LLM-Powered Test Case Generation for Detecting Bugs in Plausible Programs | Code | 0
Test It Before You Trust It: Applying Software Testing for Trustworthy In-context Learning | Code | 0
Comparative Study of Machine Learning Test Case Prioritization for Continuous Integration Testing | - | 0
Column Generation for Interaction Coverage in Combinatorial Software Testing | - | 0
Code-Aware Prompting: A study of Coverage Guided Test Generation in Regression Setting using LLM | - | 0
Artificial Intelligence in Software Testing: Impact, Problems, Challenges and Prospect | - | 0
An Algorithm for Generating Gap-Fill Multiple Choice Questions of an Expert System | - | 0
CLOVER: A Test Case Generation Benchmark with Coverage, Long-Context, and Verification | - | 0
Artificial intelligence for context-aware visual change detection in software test automation | - | 0
Can ChatGPT advance software testing intelligence? An experience report on metamorphic testing | - | 0
Failed Disruption Propagation in Integer Genetic Programming | - | 0
Exploring the Integration of Large Language Models in Industrial Test Maintenance Processes | - | 0
A Novel Multiple Ensemble Learning Models Based on Different Datasets for Software Defect Prediction | - | 0

No leaderboard results yet.