Estimating News Coverage Patterns using Latent Dirichlet Allocation (LDA)
DOI:
https://doi.org/10.30537/sjet.v1i1.142
Keywords:
News Coverage Patterns, Probabilistic Model, Data Visualization, Multidimensional Data Model
Abstract
The rapid growth of unstructured textual data poses an open challenge for knowledge discovery, which aims to extract desired information from large collections of data. This study presents a system that derives news coverage patterns using a probabilistic model, Latent Dirichlet Allocation (LDA). A pattern is an arrangement of words within the collected data that are likely to appear together in a certain context. News coverage patterns are computed as a function of the number of news articles containing such patterns. A prototype has been developed as a proof of concept to estimate the news coverage patterns for a newspaper, The Dawn. The news coverage patterns are analyzed from different aspects using a multidimensional data model. Further, the extracted news coverage patterns are illustrated with visual graphs to give an in-depth understanding of the topics covered in the news. The results also assist in identifying the schema related to newspaper and journalists' articles.
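As a rough illustration of the coverage idea in the abstract, the sketch below counts how many articles contain every word of a given pattern. The abstract does not spell out the exact coverage function, so the definition here (article count and its share of the corpus) is an assumption. The names `tokenize` and `pattern_coverage` and the three sample headlines are hypothetical. In the described system the patterns would come from LDA topics; here one is simply written by hand.

```python
import re
from typing import Iterable, List, Set, Tuple


def tokenize(text: str) -> Set[str]:
    """Lowercase the text and keep alphabetic tokens (a deliberately crude tokenizer)."""
    return set(re.findall(r"[a-z]+", text.lower()))


def pattern_coverage(pattern: Iterable[str], articles: List[str]) -> Tuple[int, float]:
    """Return (count, share) of articles that contain every word of the pattern.

    Assumption: coverage is measured by whole-pattern containment per article.
    """
    words = {w.lower() for w in pattern}
    token_sets = [tokenize(article) for article in articles]
    count = sum(1 for tokens in token_sets if words <= tokens)
    share = count / len(token_sets) if token_sets else 0.0
    return count, share


# Hypothetical toy corpus, not taken from the study's data.
articles = [
    "Flood relief efforts continue as rain persists",
    "Government announces flood relief package",
    "Cricket team wins the series final",
]

count, share = pattern_coverage({"flood", "relief"}, articles)
print(count, round(share, 2))
```

Counting by whole-pattern containment is only one possible choice; a weighted variant based on LDA topic proportions per article would be an equally plausible reading of the abstract.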
License
SJET holds the rights to all published papers. Authors are required to transfer copyright to the journal to ensure that the paper is published solely in SJET; however, authors and readers can freely read, download, copy, distribute, print, search, or link to the full texts of its articles and use them for any other lawful purpose.
SJET is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.