Software Bug Reports: Automatic Keyword and Sentence-Based Text Summarization Using Artificial Intelligence

Abstract

The purpose of text summarization is to quickly and accurately extract the most important data from papers. The proposed unsupervised method seeks to synthesise complete and informative bug reports (software artefacts). The suggested approach employs Rapid Auto- matic Keyword Extraction and the term frequency-inverse document frequency method to identify applicable keywords and phrases. During the sentence extraction procedure, fuzzy C-means clustering is used to prioritise sentences that have a high degree of membership in each cluster (beyond a predefined threshold). The selection of sentences is performed by a rule-engine. Information is extracted using keywords and sentences chosen by the clustering process, and the rules are developed using domain knowledge. The proposed method produces a logical and well-organized summary of apache bug reports. The retrieval summary is improved with the help of hierarchical clustering by removing unnecessary details and rearranging them. The Apache Project Bug Report Corpus (APBRC) and the original Bug Report Corpus are used to evaluate the effectiveness of the proposed method. Measures of performance such as precision, recall, pyramid precision, and F-score are used to evaluate the results. Experiment results demonstrate that our proposed method significantly outperforms the state-of-the-art baseline methods like BRC and LRCA. In addition, it achieves substantial gains compared to prior art unsupervised methods as Hurried and centroid. It extracts the most relevant keyword phrases and sentences from each cluster to offer comprehensive coverage and a coherent summary. The average values for precision, recall, f-score, and pyramid precision on the APBRC corpus are 78.22%, 82.18%, 80.10%, and 81.66%, respectively.

Authors and Affiliations

Zaid Altaf, and Ashish Oberoi

Keywords

Related Articles

Research on Decryption Methodologies and Key Aggregate Searchable Encryption for Data Security Storage in Cloud

Numerous firm’s architectures management of data guarantees substantially alter method, gain access to maintain private commercial business. Occur additional facts protection problems. Existing statistics safety methods...

The MATLAB Programs for Some Numerical Methods and Algorithms (II)

We have listed and described some numerical methods and techniques for the reader so that he can be acquainted with them and then a description program flow chart is mentioned without the Matlab Software Program. Here ,...

AI and Cloud Computing for Enhanced Virtualization and Containerization

This research paper aims at analysing the application of artificial intelligence and deep learning techniques in the cloud computing paradigm especially in virtualization and containerization. Since cloud computing has b...

Modelling and Designing of NLP Interface to Database for Afaan Oromoo

Database management system has been widely used for storing and retrieving data. However, database is often hard to access the data since their interface is rigid in cooperating with user, due to that analysis of natural...

An Overview of Device-To-Device Communication in Cellular Networks

Gadget to-Device (D2D) correspondence was first recommended as another worldview for further developing organization execution in cell organizations. New use-cases for D2D correspondences in cell networks have arisen bec...

Download PDF file
  • EP ID EP746018
  • DOI 10.55524/ijircst.2022.10.6.18
  • Views 69
  • Downloads 0

How To Cite

Zaid Altaf, and Ashish Oberoi (2022). Software Bug Reports: Automatic Keyword and Sentence-Based Text Summarization Using Artificial Intelligence. International Journal of Innovative Research in Computer Science and Technology, 10(6), -. https://europub.co.uk./articles/-A-746018