Advanced News Archiving System with Machine Learning-Driven Web Scraping and AI-Powered Summarization Using T5, Pegasus, BERT and BART Architectures

Journal Title: International Journal of Experimental Research and Review - Year 2024, Vol 46, Issue 10

Abstract

Data plays a crucial role in the contemporary era of technology and is central to the publication of news on the internet. Nevertheless, reading long reports in order to fully comprehend events can be challenging and frequently leads to subjective judgments. This work presents a news archiving application whose architecture categorizes news stories by day, resulting in a well-organized and readily accessible archive. The application uses web scraping to pull pertinent news articles from numerous internet sources. It then applies summarization models, including BERT, BART, T5 and Google Pegasus, to condense the information into a succinct and comprehensible form. T5 performs well in text summarization and other natural language processing tasks because of its text-to-text structure, and it is also highly customizable. Google Pegasus, a model specialized in abstractive summarization, uses self-attention mechanisms and extensive pre-training to generate high-quality, concise news summaries. Together, these components form the core of the application's process: collecting, storing, and summarizing news articles. In addition, it will offer a straightforward design that makes it simple to browse past news stories and their summaries.
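
The day-wise archive with attached summaries described above can be illustrated with a minimal sketch. This is not the authors' implementation: the names `Article`, `NewsArchive`, and `lead_sentences` are hypothetical, the scraping step is omitted, and the summarizer is a simple placeholder that keeps the leading sentences rather than a T5, Pegasus, BERT, or BART model.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import date
from typing import Callable, Dict, List


@dataclass
class Article:
    """One scraped news story (hypothetical record layout)."""
    title: str
    body: str
    published: date
    summary: str = ""


def lead_sentences(text: str, max_sentences: int = 2) -> str:
    """Placeholder summarizer: keep the first few sentences.

    Assumption: stands in for a model-based summarizer, which is
    not reproduced here.
    """
    sentences = [s.strip() for s in text.split(". ") if s.strip()]
    kept = sentences[:max_sentences]
    if not kept:
        return ""
    result = ". ".join(kept)
    return result if result.endswith(".") else result + "."


class NewsArchive:
    """Stores articles grouped by publication day, each with a summary."""

    def __init__(self, summarize: Callable[[str], str] = lead_sentences) -> None:
        self._summarize = summarize
        self._by_day: Dict[date, List[Article]] = defaultdict(list)

    def add(self, article: Article) -> None:
        # Summarize once at insertion time, then file under the day.
        article.summary = self._summarize(article.body)
        self._by_day[article.published].append(article)

    def on(self, day: date) -> List[Article]:
        # Return a copy so callers cannot mutate the archive's list.
        return list(self._by_day.get(day, []))

    def days(self) -> List[date]:
        # Days that have at least one article, oldest first.
        return sorted(self._by_day)
```

Because the summarizer is passed in as a plain callable, a model-backed function (for instance one wrapping a T5, Pegasus, or BART model) could be substituted without changing how articles are filed or browsed by day.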

Authors and Affiliations

Narasimhula L V Venugopal, K Visala, Sammingi Nirmala, Ch Vinod Varma, Adibabu Triparagiri, Athmakuri Satish Kumar, Ch Sekhar

  • EP ID EP754340
  • DOI 10.52756/ijerr.2024.v46.017

How To Cite

Narasimhula L V Venugopal, K Visala, Sammingi Nirmala, Ch Vinod Varma, Adibabu Triparagiri, Athmakuri Satish Kumar, Ch Sekhar (2024). Advanced News Archiving System with Machine Learning-Driven Web Scraping and AI-Powered Summarization Using T5, Pegasus, BERT and BART Architectures. International Journal of Experimental Research and Review, 46(10), -. https://europub.co.uk./articles/-A-754340