- 2020
- M. Pohl, A. Hashaam, S. Bosse, D. G. Staegemann, M. Volk, F. Kramer, and K. Turowski, „Application of nlp to determine the state of issues in bug tracking systems,“ in 2020 international conference on data mining workshops (icdmw), 2020.
[Bibtex]@InProceedings{, author = {Pohl, Matthias and Hashaam, Ali and Bosse, Sascha and Staegemann, Daniel Gunnar and Volk, Matthias and Kramer, Frederik and Turowski, Klaus}, booktitle = {2020 International Conference on Data Mining Workshops (ICDMW)}, title = {Application of NLP to determine the State of Issues in Bug Tracking Systems}, year = {2020}, abstract = {The amount of semi-structured textual data being generated all over the world wide web has placed the limelight on the fields of natural language processing and text mining. With the continuous growth of IT companies, there are a lot of new and diverse projects getting introduced. Hence, the need for project management tools that provide robust, efficient, and forthcoming services is rising. A bug tracking system is one of such systems that help in making the job of every stakeholder easier. This research utilizes the textual data provided by bug tracking systems in the form of problem summaries, descriptions, and comments to determine the state of project issues. The unlabeled textual data from the bug tracking system is subjected to a state of the art class-expansion stage, that automates the process of labeling issues based on severity level and clustering (constraint-based and density-based). During this stage, 40 percent of the total data could be labeled. Semi-supervised learners are introduced to consider labeled as well as unlabeled data for learning, and the results significantly outclass the results of a supervised learner. However, for the semi-supervised learners, the recall for the critical class is still not desirable at 53 percent. Since identifying critical tickets is of utmost importance for project managers, it becomes essential to improve the recall for the critical class. In order to deal with this, sentiment analysis' insights are introduced into the semi-supervised learners. As a result, the recall for the critical class improves up to 85 percent, which makes the proposed solution useful in cases where efficient classification among critical and non-critical classes is required.}, keywords = {bug tracking systems, natural language process-ing, machine learning, information management, labeling}, url = {https://ieeexplore.ieee.org/abstract/document/9346335}, }
AbstractThe amount of semi-structured textual data being generated all over the world wide web has placed the limelight on the fields of natural language processing and text mining. With the continuous growth of IT companies, there are a lot of new and diverse projects getting introduced. Hence, the need for project management tools that provide robust, efficient, and forthcoming services is rising. A bug tracking system is one of such systems that help in making the job of every stakeholder easier. This research utilizes the textual data provided by bug tracking systems in the form of problem summaries, descriptions, and comments to determine the state of project issues. The unlabeled textual data from the bug tracking system is subjected to a state of the art class-expansion stage, that automates the process of labeling issues based on severity level and clustering (constraint-based and density-based). During this stage, 40 percent of the total data could be labeled. Semi-supervised learners are introduced to consider labeled as well as unlabeled data for learning, and the results significantly outclass the results of a supervised learner. However, for the semi-supervised learners, the recall for the critical class is still not desirable at 53 percent. Since identifying critical tickets is of utmost importance for project managers, it becomes essential to improve the recall for the critical class. In order to deal with this, sentiment analysis‘ insights are introduced into the semi-supervised learners. As a result, the recall for the critical class improves up to 85 percent, which makes the proposed solution useful in cases where efficient classification among critical and non-critical classes is required.
- 2019
- A. Hashaam, „Application of nlp to determine thestate of issues in bug tracking systems,“ Master Thesis, 2019.
[Bibtex]@MastersThesis{, author = {Ali Hashaam}, school = {Otto-von-Guericke-Universität}, title = {Application of NLP to determine thestate of issues in Bug Tracking Systems}, year = {2019}, abstract = {The amount of semi-structured textual data being generated all over the world wide web has placed the limelight on the fields of Natural Language Processing and Text Mining. With the continuous increase in business for IT companies, there are a lot of new and diverse projects getting introduced and hence, the need for the project management tools which provide robust, efficient, and forthcoming services is rising. Bug Tracking System is one of such systems that help in making the job of every stakeholder easier. This research intends to utilize the textual data available on Bug Tracking Systems in the form of problem summaries, descriptions, and comments from the concerning people to determine the state of tickets (issues). The unlabeled textual data from the Bug Tracking System is subjected to a state of the art Class-Expansion Stage, that automates the process of labeling the ticket, intelligently based on their severity level and clustering (Constraint-Based and Density-Based). Class-Expansion stage yields 40% of the total data as labeled. Semi-supervised learners are introduced to consider labeled as well as unlabeled data for learning, and the results are way too promising when compared relative to a supervised learner. Evaluation Results of the majority-voting classifier for semi-supervised learners, show that the semi-supervised model is performing quite good as compared to the baseline supervised learner where the learner is not able to classify any instance in critical class, but the recall for the critical class in case of semi-supervised learners is still not good with it being at 53%. Since identifying critical tickets is of utmost importance for Project Managers, it becomes essential to improve the recall for the critical class. In order to deal with this, when sentiment analysis’ insights are introduced into the semi-supervised learners, the recall for the critical class improves up to 85% which makes the proposed solution super useful in cases where efficient classification among critical and non-critical classes is required.}, }
AbstractThe amount of semi-structured textual data being generated all over the world wide web has placed the limelight on the fields of Natural Language Processing and Text Mining. With the continuous increase in business for IT companies, there are a lot of new and diverse projects getting introduced and hence, the need for the project management tools which provide robust, efficient, and forthcoming services is rising. Bug Tracking System is one of such systems that help in making the job of every stakeholder easier. This research intends to utilize the textual data available on Bug Tracking Systems in the form of problem summaries, descriptions, and comments from the concerning people to determine the state of tickets (issues). The unlabeled textual data from the Bug Tracking System is subjected to a state of the art Class-Expansion Stage, that automates the process of labeling the ticket, intelligently based on their severity level and clustering (Constraint-Based and Density-Based). Class-Expansion stage yields 40% of the total data as labeled. Semi-supervised learners are introduced to consider labeled as well as unlabeled data for learning, and the results are way too promising when compared relative to a supervised learner. Evaluation Results of the majority-voting classifier for semi-supervised learners, show that the semi-supervised model is performing quite good as compared to the baseline supervised learner where the learner is not able to classify any instance in critical class, but the recall for the critical class in case of semi-supervised learners is still not good with it being at 53%. Since identifying critical tickets is of utmost importance for Project Managers, it becomes essential to improve the recall for the critical class. In order to deal with this, when sentiment analysis’ insights are introduced into the semi-supervised learners, the recall for the critical class improves up to 85% which makes the proposed solution super useful in cases where efficient classification among critical and non-critical classes is required.