A Red Teaming Framework for Large Language Models: A Case Study on Faithfulness Evaluation
SGAFuzzer: Stateful GraphQL API Fuzzing
Assessment and Redesign of Ga-Starfish: A Gamified Strategy for Agile Retrospectives
Improving Ensemble Models for Software Defect Prediction: a study applying preprocessing techniques
Classification Accuracy Estimation Without Labels via Architecture-Agnostic Model Agreement
Temporal validity of software datasets for code metrics: an empirical assessment of sampling strategies
Evaluating the effectiveness of neuron coverage metrics: a metamorphic-testing approach
Minimizing control dependencies of pipelining through optimizing branch selection
Code Change and Smell Techniques for Regression Test Selection
Comparative Analysis of Text Mining and Clustering Techniques for Assessing Functional Dependency between Manual Test Cases