Options
Themecrowds : multiresolution summaries of Twitter usage
Date Issued
2011-10-28
Date Available
2011-11-22T14:48:51Z
Abstract
Users of social media sites, such as Twitter, rapidly generate large volumes of text content on a daily basis. Visual summaries are needed to understand what groups of people are saying collectively in this unstructured text data. Users will typically discuss a wide variety of topics, where the number of authors talking about a specific topic can quickly grow or diminish over time, and what the collective is saying about the subject can shift as a situation develops.
In this paper, we present a technique that summarises what collections of Twitter users are saying about certain topics over time. As the correct resolution for inspecting the data is unknown in advance, the users are clustered hierarchically over a fixed time interval based on the similarity of their posts. The visualisation technique takes this data structure as its input. Given a topic, it finds the correct resolution of users at each time interval and provides tags to summarise what the collective is discussing. The technique is tested on a large microblogging corpus, consisting of millions of tweets and over a million users.
In this paper, we present a technique that summarises what collections of Twitter users are saying about certain topics over time. As the correct resolution for inspecting the data is unknown in advance, the users are clustered hierarchically over a fixed time interval based on the similarity of their posts. The visualisation technique takes this data structure as its input. Given a topic, it finds the correct resolution of users at each time interval and provides tags to summarise what the collective is discussing. The technique is tested on a large microblogging corpus, consisting of millions of tweets and over a million users.
Sponsorship
Science Foundation Ireland
Type of Material
Conference Publication
Publisher
ACM
Copyright (Published Version)
2011 ACM
Subjects
Subject – LCSH
User-generated content
Twitter
Visualization
Content analysis (Communication)
Web versions
Language
English
Status of Item
Peer reviewed
Journal
Cantador, I. et al. (eds.). SMUC '11 Proceedings of the 3rd international workshop on Search and mining user-generated contents
Conference Details
Paper presented at the 3rd International Workshop on Search and Mining User-generated Contents (SMUC 2011), 24th - 28th October 2011, Glasgow
ISBN
978-1-4503-0949-3
This item is made available under a Creative Commons License
File(s)
Loading...
Name
smucTR.pdf
Size
21.59 MB
Format
Adobe PDF
Checksum (MD5)
b2f8010f9003f7e8a794f649cf596fee
Owning collection
Mapped collections