An Assessment of Eclipse Bugs' Priority and Severity Prediction Using Machine Learning

Main Article Content

Mohammed Q. Shatnawi
Batool Alazzam

Abstract

The reliability and quality of software programs remains to be an important and challenging aspect of software design. Software developers and system operators spend huge time on assessing and overcoming expected and unexpected errors that might affect the users’ experience negatively. One of the major concerns in developing software problems is the bug reports, which contains the severity and priority of these defects. For a long time, this task was performed manually with huge effort and time consumptions by system operators. Therefore, in this paper, we present a novel automatic assessment tool using Machine Learning algorithms, for assessing bugs’ reports based on several features such as hardware, product, assignee, OS, component, target milestone, votes, and versions.  The aim is to build a tool that automatically classifies software bugs according to the severity and priority of the bugs and makes predictions based on the most representative features and bug report text. To perform this task, we used the Multi-Nominal Naive Bayes, Random Forests Classifier, Bagging, Ada Boosting, SVC, KNN, and Linear SVM Classifiers and Natural Language Processing techniques to analyze the Eclipse dataset. The approach shows promising results for software bugs’ detection and prediction.

Article Details

How to Cite
Shatnawi, M. Q., & Alazzam, B. (2022). An Assessment of Eclipse Bugs’ Priority and Severity Prediction Using Machine Learning. International Journal of Communication Networks and Information Security (IJCNIS), 14(1). https://doi.org/10.17762/ijcnis.v14i1.5266 (Original work published April 12, 2022)
Section
Research Articles
Author Biography

Mohammed Q. Shatnawi, Computer Information Systems Department Faculty of Computer and Information Technology Jordan University of Science and Technology

A results-driven professional with experience in business solutions’ assessment and development for many years. Through my years of experience, I have managed to bridge the gap between sustainable business solutions and the company’s vision. I have excellent problem solving techniques and recommendations for current business needs with flexibility to adapt emerging updates. I am now a business consultant in Asinat Training Academy, and responsible for IT solutions’ development, integration and communication.With my constant race of keeping up with the latest trends in the field, I have enrolled in many workshops and training in Data Science and Big Data Management. I seek expert level knowledge in the big data analytical tools and methodologies. I mastered in depth practical expertise in Hadoop Ecosystem tools such as HDFS, Spark, Pig, Hive and HBase. I am now the instructor and coordinator of the "Big Data Management" course for masters' and bachelors' students in Jordan University of Science and Technology and is currently working on multiple research projects in the field. I was also an adviser for the development of the academic curriculum for the Data Science masters' degree in JUST and maintaining the objectives and outcomes of courses with the reference to the courses' description. Moreover, I have held several administrative positions in JUST. For example, vice director of the computer and information center, vice dean for the IT deanship, department’s chair and assistant’s dean for the graduate school deanship. Eventually, I have earned the PMP certificate in March 2018.

References

360logica, "Difference between Defect, Error, Bug, Failure and Fault!," 360logica, India, 2014.

Gema Rodríguez-Pérez, Gregorio Robles, Alexander Serebrenik, Andy Zaidman, Daniel M. Germán & Jesus M. Gonzalez-Barahona, "How bugs are born: a model to identify how bugs are introduced in software components," Springer Nature, p. 47, 2020.

J. Unadkat, "6 Common Types of Software Bugs Every Tester Should Know," BrowserStack, 2021.

Sanket, "The exponential cost of fixing bugs," DEEPSOURCE, 2019.

T. Sidorova, "Software Testing Basics: Types of Bugs and Why They Matter," ScienceSoft, McKinney, Dallas area, 2020.

Hassan, Zohaib, Iqbal, Naeem, Zaman, Abnash, "Towards Effective Analysis and Tracking of Mozilla and Eclipse Defects using Machine Learning Models based on Bugs Data," Soft Computing and Machine Intelligence, vol. 1, no. 1, 2021.

Vikas Chandra, Leo Ufimtsev, David Williams and John Arthorne and others., "Eclipse/Bug Tracking," Eclipse Foundation, 2021.

2. ·. X. Z. ·. X. Y. ·. R. C. ·. C. G. ·. H. L. ·. T. L. Shikai Guo1, "Developer Activity Motivated Bug Triaging: Via Convolutional Neural Network," Neural Processing Letters, p. 8, 2020.

Ashima Kukkar 1, Rajni Mohana 1, Anand Nayyar 2, Jeamin Kim 3, Byeong-Gwon Kang 4,* and Naveen Chilamkurti 5, "A Novel Deep-Learning-Based Bug Severity Classification Technique Using Convolutional Neural Networks and Random Forest with Boosting," Sensors, p. 22, 2019.

Awni Hammouri, Mustafa Hammad, Mohammad Alnabhan, Fatima Alsarayrah, "Software Bug Prediction using Machine Learning Approach," (IJACSA) International Journal of Advanced Computer Science and Applications,, vol. 9, p. 6, 2018.

S. Delphine Immaculate; M. Farida Begam; M. Floramary, "Software Bug Prediction Using Supervised Machine Learning Algorithms," in 2019 International Conference on Data Science and Communication (IconDSC), Bangalore, India, 2019.

Thamali Madhushani Adhikari; Yan Wu, "Classifying Software Vulnerabilities by Using the Bugs Framework," in 2020 8th International Symposium on Digital Forensics and Security (ISDFS), Beirut, Lebanon, 2020.

Hufsa Mohsin a, Chongyang Shi a,?, Shufeng Hao b, He Jiang c, "SPAN: A self-paced association augmentation and node embedding-based model for software bug classification and assignment," ELSEVIER, vol. 236, p. 107711, 2022 .

QASIM UMER , HUI LIU , AND YASIR SULTAN, "Emotion Based Automated Priority Prediction for Bug Reports," in IEEE Access, Beijing 100081, China, 2018.

SHAHID IQBAL1, RASHID NASEEM 2, SALMAN JAN 3, SAMI ALSHMRANY4, MUHAMMAD YASAR5, AND ARSHAD ALI4, "Determining Bug Prioritization Using Feature Reduction and Clustering With Classification," in Digital Object Identifier, 2020.

"A Survey on Software Defect Prediction Using Deep Learning," Mathmatics, p. 14, 2021.

"Deep learning in static, metric-based bug prediction," ELSEVIER, p. 9, 2020.

SHAHID IQBAL1, RASHID NASEEM 2, SALMAN JAN 3, SAMI ALSHMRANY4, MUHAMMAD YASAR5, AND ARSHAD ALI4, "Determining Bug Prioritization Using Feature Reduction and Clustering With Classification," in Digital Object Identifier, 2020.

1Syahana Nur’Ain Saharudin, 1Koh Tieng Wei and 2Kew Si Na, "Machine Learning Techniques for Software Bug Prediction: A Systematic Review," Journal of Computer Science, vol. 16, p. 12, 2020.

Korosh Koochekian Sabor, Abdelwahab Hamou-Lhadj, Alf Larsson, "DURFEX: A Feature Extraction Technique for Efficient Detection of," in IEEE International Conference on Software Quality, Reliability and Security (QRS), Prague, Czech Republic, 2017.

Harsh Lal, Gaurav Pahwa, "Root cause analysis of software bugs using machine learning techniques," in 7th International Conference on Cloud Computing, Data Science & Engineering - Confluence, Noida, India, 2017.

Wenjie Liu, Shanshan Wang, Xin Chen and He Jiang, "Predicting the Severity of Bug Reports Based on Feature Selection," International Journal of Software Engineering and Knowledge Engineering, vol. 28, p. 22, 2018.

"A Deep-Learning-Based Bug Priority Prediction Using RNN-LSTM Neural Networks," e-Informatica Software Engineering Journal, vol. 15, no. 1, p. 29–45, 2021.

Shahid Iqbal; Rashid Naseem; Salman Jan; Sami Alshmrany; Muhammad Yasar; Arshad Ali, "Determining Bug Prioritization Using Feature Reduction and Clustering With Classification," IEEE, vol. 8, pp. 215661 - 215678, 2020.

Zaher Shuraym M. Alharthi, Ravi Rastogi, "An efficient classification of secure and non-secure bug report material using machine learning method for cyber security," ELSEVIER, vol. 37, pp. 2507-2512, 2021.

Korosh Koochekian Sabor a , ? , Mohammad Hamdaqa b , Abdelwahab Hamou-Lhadj a, "Automatic prediction of the severity of bugs using stack traces and categorical features," ELSEVIER, p. 96–105, 2020.

Luiz Alberto Ferreira Gomes a , ? , Ricardo da Silva Torres b , Mario Lúcio Côrtes b, "Bug report severity level prediction in open source software: A survey and research opportunities," ELSEVIER, vol. 115, pp. 58-78, 2019.

Youshuai Tan a , Sijie Xu a , Zhaowei Wang b , Tao Zhang c , ?, Zhou Xu d , Xiapu Luo e, "Bug severity prediction using question-and-answer pairs from Stack Overflow," ELSEVIER, 2020.

C. Griffin, Introduction to the Eclipse Modeling Framework, IBM, 2003.

"Eclipse - Overview," Tutorials Point.

Dave Steinberg, Frank Budinsky, Marcelo Paternostro, Ed Merks, EMF: Eclipse Modeling Framework, 2nd Edition, Addison-Wesley Professional., 2008.

M. Alharthi Zaher Shuraym, Ravi Rastogi, "An efficient classification of secure and non-secure bug report material using machine learning method for cyber security," ELSEVIER, vol. 37, p. 6, 2020.

Rashmi Agrawal a , Rinkaj Goyal b, "Developing bug severity prediction models using word2vec," ELSEVIER, vol. 2, pp. 104-115, 2021.

Ashima Kukkar, Rajni Mohana, Yugal Kumar, "Does bug report summarization help in enhancing the accuracy of bug severity classification?," ELSEVIER, vol. 167, pp. 1345-1353, 2020.

Pavan Kumar Chaubey a, Tarun Kumar Arora b, "Software bug prediction and classification by global pooling of different activation of convolution layers," ELSEVIER, 2020.

Sushant Kumar, Pandey Ravi, Bhushan Mishra, Anil Kumar Tripathi, "Machine learning based methods for software fault prediction: A survey," ELSEVIER, vol. 172, p. 114595, 2020.

Shruthi Puranika, Pranav Deshpandea, K Chandrasekarana,?, "A Novel Machine Learning Approach For Bug Prediction," ELSEVIER, vol. 93, p. 924 – 930, 2016.

A. Ng, "Machine Learning," coursera, Stanford University.

IBM, "Machine Learning," IBM Cloud Education, 2020.