Options
ThemeCrowds: Multiresolution Summaries of Twitter Usage
Date Issued
2011-06
Date Available
2021-08-09T15:51:10Z
Abstract
Users of social media sites, such as Twitter, rapidly generate large volumes of text content on a daily basis. Visual summaries are needed to understand what groups of people are saying collectively in this unstructured text data. Users will typically discuss a wide variety of topics, where the number of authors talking about a specific topic can quickly grow or diminish over time, and what the collective is saying about the subject can shift as a situation develops. In this paper, we present a technique that summarises what collections of Twitter users are saying about certain topics over time. As the correct resolution for inspecting the data is unknown in advance, the users are clustered hierarchically over a fixed time interval based on the similarity of their posts. The visualisation technique takes this data structure as its input. Given a topic, it finds the correct resolution of users at each time interval and provides tags to summarise what the collective is discussing. The technique is tested on three microblogging corpora, consisting of up to tens of millions of tweets and over a million users. We provide some preliminary user feedback from a research group interested in the area of social media analysis, where this tool could be applied.
Sponsorship
Science Foundation Ireland
Type of Material
Technical Report
Publisher
University College Dublin. School of Computer Science and Informatics
Series
UCD CSI Technical Reports
ucd-csi-2011-07
Copyright (Published Version)
2011 the Authors
Language
English
Status of Item
Not peer reviewed
This item is made available under a Creative Commons License
File(s)
Loading...
Name
ucd-csi-2011-07.pdf
Size
3.5 MB
Format
Adobe PDF
Checksum (MD5)
2311359627c26add90bbdb563dd2fe97
Owning collection
Mapped collections