Abstract
Twitter as a big data social network becomes one of the most important sources for capturing the up-To-date events happening in the world. Topic derivation from Twitter is important for various applications such as situation awareness, market analysis, content filtering, and recommendations. However, tweets are short messages, which makes topic derivation challenging. Current methods employ various semantic features of tweet content but mostly overlook the interactions among tweets. In this paper, we propose a novel topic derivation method that takes into account the interactions among tweets, defined as the reciprocal activities related to people who send the tweets, as well as actions and tweet contents. In particular, topics are derived by performing a two-step matrix factorization jointly over the interactions and semantic features of the tweets. We have conducted a number of experiments on tweets collected over a period of time, showing that the proposed method consistently outperforms other advanced topic derivation methods in the literature. Our experiments also reveal that the interactions among tweets do significantly relieve the sparsity problem caused by the short-Text nature of Twitter.
Original language | English |
---|---|
Title of host publication | Proceedings - 2015 IEEE International Congress on Big Data, BigData Congress 2015 |
Publisher | IEEE |
Pages | 87-94 |
Number of pages | 8 |
ISBN (Electronic) | 9781467372787 |
DOIs | |
Publication status | Published - 17 Aug 2015 |
MoE publication type | A4 Article in a conference publication |
Event | IEEE International Conference on Big Data - New York City, United States Duration: 27 Jun 2015 → 2 Jul 2015 Conference number: 4 |
Conference
Conference | IEEE International Conference on Big Data |
---|---|
Abbreviated title | BigData |
Country | United States |
City | New York City |
Period | 27/06/2015 → 02/07/2015 |
Keywords
- Interactions of Tweets
- Joint Matrix Factorization
- Topic Derivation