Estimating News Coverage Patterns using Latent Dirichlet Allocation (LDA)

  • Naeem Ahmed Mahoto, Mehran University of Engineering and Technology, Jamshoro

Abstract

The rapid growth of unstructured textual data poses an open challenge for knowledge discovery, which aims to extract desired information from large collections of data. This study presents a system that derives news coverage patterns with the help of a probabilistic model, Latent Dirichlet Allocation (LDA). A pattern is an arrangement of words within the collected data that are likely to appear together in a certain context. The coverage of a pattern is computed as a function of the number of news articles that contain it. A prototype has been developed as a proof of concept to estimate the news coverage patterns for a newspaper, The Dawn. The news coverage patterns have been analyzed from different aspects using a multidimensional data model. Further, the extracted patterns are illustrated with visual graphs to give an in-depth understanding of the topics covered in the news. The results also assist in identifying schema related to the newspaper and journalists' articles.
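The abstract does not give the exact coverage function, so the following is only a minimal sketch of one plausible reading: the coverage of a pattern is the number (and fraction) of articles that contain every word of the pattern. The function names `tokenize` and `pattern_coverage`, the crude tokenizer, and the sample articles are all hypothetical; in the described system the patterns would presumably come from the top words of LDA topics rather than being typed in by hand.

```python
import re
from typing import Iterable, List, Set, Tuple


def tokenize(text: str) -> Set[str]:
    """Lowercase the text and keep alphabetic runs (a deliberately crude tokenizer)."""
    return set(re.findall(r"[a-z]+", text.lower()))


def pattern_coverage(pattern: Iterable[str], articles: List[str]) -> Tuple[int, float]:
    """Return (count, fraction) of articles containing every word in the pattern.

    Assumption: 'coverage' means article-level co-occurrence of all pattern words.
    The source paper may define it differently.
    """
    words = {w.lower() for w in pattern}
    token_sets = [tokenize(article) for article in articles]
    count = sum(1 for tokens in token_sets if words <= tokens)
    fraction = count / len(articles) if articles else 0.0
    return count, fraction


# Made-up sample articles, for illustration only.
articles = [
    "The election commission announced the election results on Monday.",
    "Heavy flood damage reported after monsoon rain in Sindh.",
    "Election results delayed as the commission reviews complaints.",
    "Cricket team wins the series.",
]

# Hypothetical pattern, standing in for the top words of one LDA topic.
pattern = ["election", "results"]
count, fraction = pattern_coverage(pattern, articles)
print(count, fraction)
```

Matching on sets of tokens ignores word order and frequency, which keeps the sketch short; a real system would likely add stop-word removal and stemming before fitting the topic model.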

Published
2018-06-27
How to Cite
MAHOTO, Naeem Ahmed. Estimating News Coverage Patterns using Latent Dirichlet Allocation (LDA). Sukkur IBA Journal of Emerging Technologies, [S.l.], v. 1, n. 1, p. 51-56, june 2018. ISSN 2616-7069. Available at: <http://journal.iba-suk.edu.pk:8089/SIBAJournals/index.php/sjet/article/view/142>. Date accessed: 18 oct. 2018.