Options
Extending Jensen Shannon Divergence to Compare Multiple Corpora
File(s)
File | Description | Size | Format | |
---|---|---|---|---|
insight_publication.pdf | 1.16 MB |
Author(s)
Date Issued
01 January 2017
Date Available
20T08:53:52Z May 2019
Abstract
Investigating public discourse on social media platforms has proven a viable way to reflect the impacts of political issues. In this paper we frame this as a corpus comparison problem in which the online discussion of different groups are treated as different corpora to be compared. We propose an extended version of the Jensen-Shannon divergence measure to compare multiple corpora and use the FP-growth algorithm to mix unigrams and bigrams in this comparison. We also propose a set of visualizations that can illustrate the results of this analysis. To demonstrate these approaches we compare the Twitter discourse surrounding Brexit in Ireland and Great Britain across a 14 week time period.
Sponsorship
Teagasc
Type of Material
Conference Publication
Publisher
CEUR-WS.org
Series
CEUR Workshop Proceedings, Volume 2086, 2018
Language
English
Status of Item
Peer reviewed
Part of
McAuley, J., McKeever, S. (eds.). Proceedings of the 25th Irish Conference on Artificial Intelligence and Cognitive Science
Description
25th Irish Conference on Artificial Intelligence and Cognitive Science, Dublin, Ireland, 7 - 8 December 2017
This item is made available under a Creative Commons License
Owning collection
Views
586
Last Month
10
10
Acquisition Date
Jan 28, 2023
Jan 28, 2023
Downloads
87
Last Week
5
5
Last Month
7
7
Acquisition Date
Jan 28, 2023
Jan 28, 2023