Options
Extending Jensen Shannon Divergence to Compare Multiple Corpora
Author(s)
Date Issued
2017-01-01
Date Available
2019-05-20T08:53:52Z
Abstract
Investigating public discourse on social media platforms has proven a viable way to reflect the impacts of political issues. In this paper we frame this as a corpus comparison problem in which the online discussion of different groups are treated as different corpora to be compared. We propose an extended version of the Jensen-Shannon divergence measure to compare multiple corpora and use the FP-growth algorithm to mix unigrams and bigrams in this comparison. We also propose a set of visualizations that can illustrate the results of this analysis. To demonstrate these approaches we compare the Twitter discourse surrounding Brexit in Ireland and Great Britain across a 14 week time period.
Sponsorship
Teagasc
Type of Material
Conference Publication
Publisher
CEUR-WS.org
Series
CEUR Workshop Proceedings, Volume 2086, 2018
Language
English
Status of Item
Peer reviewed
Journal
McAuley, J., McKeever, S. (eds.). Proceedings of the 25th Irish Conference on Artificial Intelligence and Cognitive Science
Conference Details
25th Irish Conference on Artificial Intelligence and Cognitive Science, Dublin, Ireland, 7 - 8 December 2017
This item is made available under a Creative Commons License
File(s)
Loading...
Name
insight_publication.pdf
Size
1.16 MB
Format
Adobe PDF
Checksum (MD5)
bc465ff2df4b15011f39657bd4b3c489
Owning collection