Topic Modeling of Political Dynamics with Shifted Cosine Similarity

Yifan Luo,
Tao Wan &
Zengchang Qin

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 13199)

Included in the following conference series:

International Symposium on Integrated Uncertainty in Knowledge Modelling and Decision Making


Abstract

Topic modeling with community detection can be used to explore the latent semantic structure of documents, with a network (i.e., a graph) depicting the semantic relations between words. In some network-based topic models, the similarity between words (vertices) is essential for obtaining a network with a clear community structure. Word embeddings trained on a large corpus empirically perform well as rich semantic representations, so this research constructs a novel similarity measure for a network-based topic model (NAM). In this paper, we first propose an intuitive similarity measure based on shifted cosine similarity between word embeddings. This measure replaces the similarity based on the typical point-wise mutual information (PMI). Second, topics of the corpus over the global period are induced by NAM under the different similarity measures. Finally, we use NAM to capture the dynamic changes of political topics in China and interpret these dynamics using historical background. Although our similarity measure introduces semantic differences caused by differences between data sets and has one more parameter, the experimental results show the effectiveness of the proposed measure.
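The abstract does not give the formula for the shifted cosine similarity, so the following is only a minimal sketch under a stated assumption: the extra parameter is taken to be a shift subtracted from the ordinary cosine similarity, with negative results clipped to zero so that weakly related word pairs receive no edge. The function names, the default shift value, and the toy embeddings are all hypothetical and are not taken from the paper.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Ordinary cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def shifted_cosine(u, v, shift=0.3):
    """ASSUMED form: subtract a shift, then clip negative values to zero."""
    return max(cosine(u, v) - shift, 0.0)


def build_word_graph(embeddings, shift=0.3):
    """Return {(word_a, word_b): weight} for pairs with positive weight."""
    edges = {}
    for w1, w2 in combinations(sorted(embeddings), 2):
        weight = shifted_cosine(embeddings[w1], embeddings[w2], shift)
        if weight > 0.0:
            edges[(w1, w2)] = weight
    return edges


# Hypothetical 3-dimensional embeddings, for illustration only.
toy_embeddings = {
    "economy": [0.9, 0.1, 0.0],
    "market": [0.8, 0.2, 0.1],
    "army": [0.0, 0.1, 0.9],
    "defense": [0.1, 0.0, 0.95],
}

graph = build_word_graph(toy_embeddings, shift=0.3)
for pair, weight in sorted(graph.items()):
    print(pair, round(weight, 3))
```

With these toy vectors only the pairs ("army", "defense") and ("economy", "market") keep a positive weight, which is the kind of sparse, clustered word network that a community detection step could then partition into topics.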



References

Bastian, M., Heymann, S., Jacomy, M.: Gephi: an open source software for exploring and manipulating networks. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 3 (2009)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Blondel, V.D., Guillaume, J.L., Lambiotte, R., Lefebvre, E.: Fast unfolding of communities in large networks. J. Stat. Mech: Theory Exp. 2008(10), P10008 (2008)
Bouma, G.: Normalized (pointwise) mutual information in collocation extraction. In: Proceedings of GSCL, pp. 31–40 (2009)
Cointet, J.P., Mogoutov, A., Bourret, P., El Abed, R., Cambrosio, A.: Les réseaux de l’expression génique-émergence et développement d’un domaine clé de la génomique. médecine/sciences 28, 7–13 (2012)
Das, R., Zaheer, M., Dyer, C.: Gaussian LDA for topic models with word embeddings. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 795–804 (2015)
Hofmann, T.: Probabilistic latent semantic indexing. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, vol. 51, pp. 50–57 (1999)
Li, C., Wang, H., Zhang, Z., Sun, A., Ma, Z.: Topic modeling for short texts with auxiliary word embeddings. In: Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 165–174 (2016)
Li, D., et al.: Adding community and dynamic to topic models. J. Informet. 6(2), 237–253 (2012)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, vol. 26, pp. 3111–3119 (2013)
Mimno, D., Wallach, H., Talley, E., Leenders, M., McCallum, A.: Optimizing semantic coherence in topic models. In: Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pp. 262–272 (2011)
Newman, M.E.: Modularity and community structure in networks. Proc. Natl. Acad. Sci. 103(23), 8577–8582 (2006)
Röder, M., Both, A., Hinneburg, A.: Exploring the space of topic coherence measures. In: Proceedings of the Eighth ACM International Conference on Web Search and Data Mining, pp. 399–408 (2015)
Rule, A., Cointet, J.P., Bearman, P.S.: Lexical shifts, substantive changes, and continuity in state of the union discourse, 1790–2014. Proc. Natl. Acad. Sci. 112(35), 10837–10844 (2015)
Sun, J.: Jieba Chinese word segmentation tool (2012)
Weeds, J., Weir, D.: Co-occurrence retrieval: a flexible framework for lexical distributional similarity. Comput. Linguist. 31(4), 439–475 (2005)

Author information

Authors and Affiliations

Intelligent Computing and Machine Learning Lab, School of Automation Science and Electric Engineering, Beihang University, Beijing, 100191, China
Yifan Luo & Zengchang Qin
School of Biological Science and Medical Engineering, Beihang University, Beijing, 100191, China
Tao Wan

Corresponding author

Correspondence to Zengchang Qin.

Editor information

Editors and Affiliations

Osaka Prefecture University, Sakai, Osaka, Japan
Katsuhiro Honda
University of Hyogo, Kobe, Japan
Tomoe Entani
Osaka Prefecture University, Sakai, Japan
Seiki Ubukata
Japan Advanced Institute of Science and Technology, Nomi, Japan
Van-Nam Huynh
Osaka University, Toyonaka, Osaka, Japan
Masahiro Inuiguchi


About this paper

Cite this paper

Luo, Y., Wan, T., Qin, Z. (2022). Topic Modeling of Political Dynamics with Shifted Cosine Similarity. In: Honda, K., Entani, T., Ubukata, S., Huynh, VN., Inuiguchi, M. (eds) Integrated Uncertainty in Knowledge Modelling and Decision Making. IUKM 2022. Lecture Notes in Computer Science, vol 13199. Springer, Cham. https://doi.org/10.1007/978-3-030-98018-4_22


DOI: https://doi.org/10.1007/978-3-030-98018-4_22
Published: 04 March 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-98017-7
Online ISBN: 978-3-030-98018-4
eBook Packages: Computer Science, Computer Science (R0)
