TY - GEN
T1 - Poster
T2 - 40th ACM/IEEE International Conference on Software Engineering, ICSE 2018
AU - Ayub, Muhammad Sohaib
AU - Siddiqui, Junaid Haroon
N1 - Publisher Copyright: © 2018 Authors.
PY - 2018/5/27
Y1 - 2018/5/27
N2 - Debugging programs written for distributed computing models such as MapReduce is difficult, which is why prior studies focus only on finding and fixing bugs in the early stages of program development. Delta debugging finds a minimal failing input for sequential programs by dividing the input into subsets and testing those subsets one by one, but no prior work tries to find a minimal failing input for distributed programs such as MapReduce. In this paper, we present MapRedDD, a framework for efficiently finding the minimal failing input of MapReduce programs. MapRedDD employs a failing-input selection technique that identifies the failing input subset in a single run of the MapReduce program over multiple input subsets, instead of testing each subset separately. This avoids executing the MapReduce program once per input subset and reduces the overhead of job submission, job scheduling, and final outcome retrieval. Our approach finds the minimal failing input in a number of executions equal to N, the number of inputs to the MapReduce program, whereas a binary search invariant algorithm requires, in the worst case, a number of executions equal to the number of input subsets, 2N-1.
KW - Delta debugging
KW - MapReduce
KW - Software verification
UR - https://www.scopus.com/pages/publications/85049682600
U2 - 10.1145/3183440.3195084
DO - 10.1145/3183440.3195084
M3 - Conference Publication
AN - SCOPUS:85049682600
T3 - Proceedings - International Conference on Software Engineering
SP - 177
EP - 178
BT - Proceedings - International Conference on Software Engineering
PB - IEEE Computer Society
Y2 - 27 May 2018 through 3 June 2018
ER -