[go: up one dir, main page]

skip to main content
10.1145/2817946.2817949acmconferencesArticle/Chapter ViewAbstractPublication PagescosnConference Proceedingsconference-collections
research-article
Public Access

On Predictability of Rare Events Leveraging Social Media: A Machine Learning Perspective

Published: 02 November 2015 Publication History

Abstract

Information extracted from social media streams has been leveraged to forecast the outcome of a large number of real-world events, from political elections to stock market fluctuations. An increasing amount of studies demonstrates how the analysis of social media conversations provides cheap access to the wisdom of the crowd. However, extents and contexts in which such forecasting power can be effectively leveraged are still unverified at least in a systematic way. It is also unclear how social-media-based predictions compare to those based on alternative information sources. To address these issues, here we develop a machine learning framework that leverages social media streams to automatically identify and predict the outcomes of soccer matches.
We focus in particular on matches in which at least one of the possible outcomes is deemed as highly unlikely by professional bookmakers. We argue that sport events offer a systematic approach for testing the predictive power of social media conversations, and allow to compare such power against the rigorous baselines set by external sources. Despite such strict baselines, our framework yields above 8% marginal profit when used to inform simple betting strategies.
The system is based on real-time sentiment analysis and exploits data collected immediately before the game start, allowing for bets informed by its predictions. We first discuss the rationale behind our approach, then describe the learning framework, its prediction performance and the return it provides as compared to a set of betting strategies. To test our framework we use both historical Twitter data from the 2014 FIFA World Cup games (10% sample), and real-time Twitter data (full stream) collected by monitoring the conversations about all soccer matches of the four major European tournaments (FA Premier League, Serie A, La Liga, and Bundesliga), and the 2014 UEFA Champions League, during the period between October, 25th 2014 and November, 26th 2014.

References

[1]
S. Asur and B. A. Huberman. Predicting the future with social media. In Web Intelligence and Intelligent Agent Technology (WI-IAT), volume 1, pages 492--499. IEEE, 2010.
[2]
A. Bakliwal, P. Arora, S. Madhappan, N. Kapre, M. Singh, and V. Varma. Mining sentiments from tweets. Proceedings of the WASSA, 2012.
[3]
A. Bermingham and A. F. Smeaton. On using Twitter to monitor political sentiment and predict election results. Sentiment Analysis where AI meets Psychology (SAAIP), page 2, 2011.
[4]
J. Bollen, H. Mao, and X. Zeng. Twitter mood predicts the stock market. Journal of Computational Science, 2(1):1--8, 2011.
[5]
A. Choudhary, W. Hendrix, K. Lee, D. Palsetia, and W.-K. Liao. Social media evolution of the egyptian revolution. Communications of the ACM, 55(5):74--80, 2012.
[6]
F. Ciulla, D. Mocanu, A. Baronchelli, B. Gonçalves, N. Perra, and A. Vespignani. Beating the news using social media: the case study of american idol. EPJ Data Science, 1(1):1--11, 2012.
[7]
M. D. Conover, C. Davis, E. Ferrara, K. McKelvey, F. Menczer, and A. Flammini. The geospatial characteristics of a social movement communication network. PloS one, 8(3):e55957, 2013.
[8]
M. D. Conover, E. Ferrara, F. Menczer, and A. Flammini. The digital evolution of occupy wall street. PloS one, 8(5):e64679, 2013.
[9]
J. DiGrazia, K. McKelvey, J. Bollen, and F. Rojas. More tweets, more votes: Social media as a quantitative indicator of political behavior. PloS one, 8(11):e79449, 2013.
[10]
E. Ferrara, O. Varol, F. Menczer, and A. Flammini. Traveling trends: social butterflies or frequent fliers? In Proceedings of the first ACM conference on Online social networks, pages 213--222. ACM, 2013.
[11]
D. Gayo-Avello. No, you cannot predict elections with Twitter. Internet Computing, 16(6):91--94, 2012.
[12]
D. Gayo-Avello, P. T. Metaxas, and E. Mustafaraj. Limits of electoral predictions using Twitter. In ICWSM, 2011.
[13]
A. Go, R. Bhayani, and L. Huang. Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford, pages 1--12, 2009.
[14]
J. W. Kim, D. Kim, B. Keegan, J. H. Kim, S. Kim, and A. Oh. Social media dynamics of global co-presence during the 2014 FIFA World Cup. In Proceedings of the ACM SIGCHI Conference on Human Factors in Computing Systems. ACM, 2015.
[15]
D. Lazer, R. Kennedy, G. King, and A. Vespignani. The parable of Google flu: Traps in big data analysis. Science, 343(6176):1203--1205, 2014.
[16]
W. A. Mason, F. R. Conrey, and E. R. Smith. Situating social influence processes: Dynamic, multidirectional flows of influence within social networks. Personality and social psychology review, 11(3):279--300, 2007.
[17]
M. Mathioudakis and N. Koudas. Twittermonitor: trend detection over the Twitter stream. In Proceedings of the 2010 ACM SIGMOD International Conference on Management of data, pages 1155--1158. ACM, 2010.
[18]
T. Nasukawa and J. Yi. Sentiment analysis: Capturing favorability using natural language processing. In Proceedings of the 2nd international conference on Knowledge capture, pages 70--77. ACM, 2003.
[19]
A. Oghina, M. Breuss, M. Tsagkias, and M. de Rijke. Predicting imdb movie ratings using social media. In Advances in information retrieval, pages 503--507. 2012.
[20]
B. Pang and L. Lee. Opinion mining and sentiment analysis. Foundations and trends in information retrieval, 2(1--2):1--135, 2008.
[21]
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12:2825--2830, 2011.
[22]
H. Saif, M. Fernandez, Y. He, and H. Alani. Evaluation datasets for Twitter sentiment analysis. In Proceedings, 1st Workshop on Emotion and Sentiment in Social and Expressive Media (ESSEM), 2013.
[23]
H. Saif, Y. He, and H. Alani. Semantic smoothing for Twitter sentiment analysis. 2011.
[24]
H. Saif, Y. He, and H. Alani. Alleviating data sparsity for twitter sentiment analysis. Making Sense of Microposts (\# MSM2012), 2012.
[25]
H. Schoen, D. Gayo-Avello, P. T. Metaxas, E. Mustafaraj, M. Strohmaier, and P. Gloor. The power of prediction with social media. Internet Research, 23(5):528--543, 2013.
[26]
G. Shteynberg and E. P. Apfelbaum. The power of shared experience: Simultaneous observation with similar others facilitates social learning. Social Psychological and Personality Science, 4(6):738--744, 2013.
[27]
G. Shteynberg, J. B. Hirsh, E. P. Apfelbaum, J. T. Larsen, A. D. Galinsky, and N. J. Roese. Feeling more together: Group attention intensifies emotion. Emotion, 14(6):1102, 2014.
[28]
M. Speriosu, N. Sudan, S. Upadhyay, and J. Baldridge. Twitter polarity classification with label propagation over lexical links and the follower graph. In Proceedings of the First workshop on Unsupervised Learning in NLP, pages 53--63. ACL, 2011.
[29]
M. Thelwall, K. Buckley, and G. Paltoglou. Sentiment in Twitter events. Journal of the American Society for Information Science and Technology, 62(2):406--418, 2011.
[30]
A. Tumasjan, T. O. Sprenger, P. G. Sandner, and I. M. Welpe. Predicting elections with Twitter: What 140 characters reveal about political sentiment. ICWSM, 10:178--185, 2010.
[31]
O. Varol, E. Ferrara, C. L. Ogan, F. Menczer, and A. Flammini. Evolution of online user behavior during a social upheaval. In Proceedings of the 2014 ACM conference on Web science, pages 81--90. ACM, 2014.
[32]
X. Zhang, H. Fuehres, and P. A. Gloor. Predicting stock market indicators through Twitter “I hope it is not as bad as I fear”. Procedia-Social and Behavioral Sciences, 26:55--62, 2011.

Cited By

View all
  • (2022)Using Twitter to Characterize Public Opinion in Brazil During Political EventsResearch Anthology on Social Media's Influence on Government, Politics, and Social Movements10.4018/978-1-6684-7472-3.ch028(585-598)Online publication date: 26-Aug-2022
  • (2020)Analyzing Brexit’s impact using sentiment analysis and topic modeling on Twitter discussionProceedings of the 21st Annual International Conference on Digital Government Research10.1145/3396956.3396973(1-6)Online publication date: 15-Jun-2020
  • (2019)Using Twitter to Characterize Public Opinion in Brazil During Political EventsInternational Journal of e-Collaboration10.4018/IJeC.201907010415:3(49-61)Online publication date: Jul-2019
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
COSN '15: Proceedings of the 2015 ACM on Conference on Online Social Networks
November 2015
280 pages
ISBN:9781450339513
DOI:10.1145/2817946
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 November 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. rare events prediction
  2. sentiment analysis
  3. social network analysis

Qualifiers

  • Research-article

Funding Sources

  • NSF
  • DARPA

Conference

COSN'15
Sponsor:
COSN'15: Conference on Online Social Networks
November 2 - 3, 2015
California, Palo Alto, USA

Acceptance Rates

COSN '15 Paper Acceptance Rate 22 of 82 submissions, 27%;
Overall Acceptance Rate 69 of 307 submissions, 22%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)106
  • Downloads (Last 6 weeks)15
Reflects downloads up to 06 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2022)Using Twitter to Characterize Public Opinion in Brazil During Political EventsResearch Anthology on Social Media's Influence on Government, Politics, and Social Movements10.4018/978-1-6684-7472-3.ch028(585-598)Online publication date: 26-Aug-2022
  • (2020)Analyzing Brexit’s impact using sentiment analysis and topic modeling on Twitter discussionProceedings of the 21st Annual International Conference on Digital Government Research10.1145/3396956.3396973(1-6)Online publication date: 15-Jun-2020
  • (2019)Using Twitter to Characterize Public Opinion in Brazil During Political EventsInternational Journal of e-Collaboration10.4018/IJeC.201907010415:3(49-61)Online publication date: Jul-2019
  • (2018)Soccer Fans Sentiment through the Eye of Big Data: The UEFA Champions League as a Case Study2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR)10.1109/MIPR.2018.00058(244-250)Online publication date: Apr-2018
  • (2018)An emotional contagion model for heterogeneous social media with multiple behaviorsPhysica A: Statistical Mechanics and its Applications10.1016/j.physa.2017.08.025490(185-202)Online publication date: Jan-2018
  • (2018)Characterization of Public Opinion on Political Events in Brazil Based on Twitter DataCollaboration and Technology10.1007/978-3-319-99504-5_9(105-116)Online publication date: 8-Aug-2018
  • (2015)Quantifying the effect of sentiment on information diffusion in social mediaPeerJ Computer Science10.7717/peerj-cs.261(e26)Online publication date: 30-Sep-2015
  • (2015)Measuring Emotional Contagion in Social MediaPLOS ONE10.1371/journal.pone.014239010:11(e0142390)Online publication date: 6-Nov-2015

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media