Abstract
In the recent years, there is a growing interest in combining explicitly defined formal semantics (in the forms of ontologies) with distributional semantics “learnt” from a vast amount of data. In this paper, we try to bridge the best of the two worlds by introducing a new metrics called the “Semantic Impact” together with a novel method to derive a numerical measurement that can summarise how strong an ontological entity/concept impinges on the domain of discourse. More specifically, by taking into consideration the semantic representation of a concept that appears in documents and its correlation with other concepts in the same document corpus, we measure the importance of a concept with respect to the knowledge domain at a semantic level. Here, the “semantic” importance of an ontology concept is two-fold. Firstly, the concept needs to be informative. Secondly, it should be well connected (strong correlation) with other concepts in the same domain. We evaluated the proposed method with 200 BBC News articles about Donald Trump (between February 2017 and September 2017). The preliminary result is promising: we demonstrated that semantic impact can be learnt: the top 3 most important concepts are Event, Date and Organisation and the least essential concepts are Substance, Duration and EventEducation. The crux of our future work is to extend the evaluation with larger datasets and more diverse domains.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Zúñiga, G.L.: Ontology: its transformation from philosophy to information systems. In: Proceedings of the International Conference on Formal Ontology in Information Systems, vol. 2001, pp. 187–197. ACM, Ogunquit (2001)
Maedche, A., Staab, S.J.: Ontology learning for the semantic web. IEEE Intell. Syst. 16(2), 72–79 (2001)
Wong, W., Liu, W., Bennamoun, M.: Ontology learning from text: a look back and into the future. ACM Comput. Surv. 44(4), 1–36 (2012)
Horrocks, I.: Ontologies and the semantic web. Commun. ACM 51(12), 58–67 (2008)
Beel, J., Breitinger, C., Langer, S.J.: Evaluating the CC-IDF citation-weighting scheme: how effectively can ‘Inverse Document Frequency’ (IDF) be applied to references. In: Proceedings of the 12th iConference (2017)
Wan, J., Barnden, J.: A new semantic model for domain-ontology learning. In: Zu, Q., Hu, B., Gu, N., Seng, S. (eds.) HCC 2014. LNCS, vol. 8944, pp. 140–155. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-15554-8_12
Evert, S.: Distributional semantic models. NAACL HLT 2010 Tutorial Abstracts, pp. 15–18 (2010)
Mikolov, T., et al.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)
Mikolov, T., et al.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems. (NIPS 2013), vol. 26, pp. 3111–3119 (2013)
Reshef, D.N., et al.: Detecting novel associations in large data sets. Science 334(6062), 1518–1524 (2011)
IBM: Natural Language Understanding. Watson Developer Cloud 2017. https://www.ibm.com/watson/developercloud/natural-language-understanding.html. Accessed 2017
BBC: Trump says terror attacks ‘under-reported’: Is that true? US & Canada (2017). http://www.bbc.co.uk/news/world-us-canada-38890090. Accessed 10 Feb 2017
BBC. Core Concepts Ontology (2015). https://www.bbc.co.uk/ontologies/coreconcepts. Accessed 01 Apr 2018
Deeplearning4J. Troubleshooting & Tuning Word2Vec. Word2Vec 2017. https://deeplearning4j.org/word2vec#trouble. Accessed 1 Feb 2017
Glorot, X., Bengio, Y.: Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (2010)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Wan, J., Barnden, J., Hu, B., Hancox, P. (2019). What Is Semantically Important to “Donald Trump”?. In: Tang, Y., Zu, Q., Rodríguez García, J. (eds) Human Centered Computing. HCC 2018. Lecture Notes in Computer Science(), vol 11354. Springer, Cham. https://doi.org/10.1007/978-3-030-15127-0_31
Download citation
DOI: https://doi.org/10.1007/978-3-030-15127-0_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-15126-3
Online ISBN: 978-3-030-15127-0
eBook Packages: Computer ScienceComputer Science (R0)