Abstract
People are shifting from traditional news sources to online news at an incredibly fast rate. However, the technology behind online news consumption promotes content that confirms the users» existing point of view. This phenomenon has led to polarization of opinions and intolerance towards opposing views. Thus, a key problem is to model information filter bubbles on social media and design methods to eliminate them. In this paper, we use a machine-learning approach to learn a liberal-conservative ideology space on Twitter, and show how we can use the learned latent space to tackle the filter bubble problem.
We model the problem of learning the liberal-conservative ideology space of social media users and media sources as a constrained non-negative matrix-factorization problem. Our model incorporates the social-network structure and content-consumption information in a joint factorization problem with shared latent factors. We validate our model and solution on a real-world Twitter dataset consisting of controversial topics, and show that we are able to separate users by ideology with over 90% purity. When applied to media sources, our approach estimates ideology scores that are highly correlated(Pearson correlation 0.9) with ground-truth ideology scores. Finally, we demonstrate the utility of our model in real-world scenarios, by illustrating how the learned ideology latent space can be used to develop exploratory and interactive interfaces that can help users in diffusing their information filter bubble.
We model the problem of learning the liberal-conservative ideology space of social media users and media sources as a constrained non-negative matrix-factorization problem. Our model incorporates the social-network structure and content-consumption information in a joint factorization problem with shared latent factors. We validate our model and solution on a real-world Twitter dataset consisting of controversial topics, and show that we are able to separate users by ideology with over 90% purity. When applied to media sources, our approach estimates ideology scores that are highly correlated(Pearson correlation 0.9) with ground-truth ideology scores. Finally, we demonstrate the utility of our model in real-world scenarios, by illustrating how the learned ideology latent space can be used to develop exploratory and interactive interfaces that can help users in diffusing their information filter bubble.
Original language | English |
---|---|
Title of host publication | Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining |
Place of Publication | New York, NY, USA |
Publisher | ACM |
Pages | 351-359 |
Number of pages | 9 |
ISBN (Print) | 978-1-4503-5581-0 |
DOIs | |
Publication status | Published - 2018 |
MoE publication type | A4 Conference publication |
Event | ACM International Conference on Web Search and Data Mining - Marina Del Rey, United States Duration: 5 Feb 2018 → 9 Feb 2018 Conference number: 11 |
Conference
Conference | ACM International Conference on Web Search and Data Mining |
---|---|
Abbreviated title | WSDM |
Country/Territory | United States |
City | Marina Del Rey |
Period | 05/02/2018 → 09/02/2018 |
Keywords
- combining link and content
- graph regularization
- ideology
- information filter bubble
- latent space learning
- manifold learning
- matrix factorization
- polarization
- social networks