Abstract
Extracting personal relation triple (S, P, O) from large number of unstructured text is crucial to the construction of knowledge graph, knowledge representation and reasoning of personal relation. Aiming at low accuracy in extracting triples from unstructured text, we present a supervised approach to judge whether extracted triples are correct. The approach need to build a knowledge base which contain peoples attributes first, then a sentence pattern tree is learnt according the people attribute knowledge base and the training data. When training, triples are extracted from the text automatically and labelled whether correct or not manually. Then patterns are constructed layer-by-layer according the position of “triple”, “pronoun” and “word” in sentence. At the same time, the correct and error number of triples are recorded on each pattern. When testing, the correctness of triples can be judged by the number recorded in matched patterns. According the test result, our approach does better in the training time, the testing time and the F1-value (76.6%) than the ordinary approach based on feature engineering (75.7%). At last, we make the judgement result of sentence pattern tree as a feature to improve the feature engineering approach (77.5%). In addition, this approach has a better expansibility than the traditional one and has guiding significance to the construction of the training set.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Apostolova, E., Tomuro, N.: Combining visual and textual features for information extraction from online flyers. In: EMNLP, pp. 1924–1929 (2014)
Bing, Q., Ting, L.A.L.: Unsupervised Chinese open entity relation extraction. J. Comput. Res. Dev. 52(5), 1029 (2015)
Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Hruschka Jr., E.R., Mitchell, T.M.: Toward an architecture for never-ending language learning. In: AAAI, vol. 5, p. 3 (2010)
Christensen, J., Soderland, S., Etzioni, O., et al.: An analysis of open information extraction based on semantic role labeling. In: Proceedings of the Sixth International Conference On Knowledge Capture, pp. 113–120 (2011)
Etzioni, O., Fader, A., Christensen, J., Soderland, S., Mausam, M.: Open information extraction: the second generation. In: IJCAI, vol. 11, pp. 3-10 (2011)
Fang, Y., Chang, K.C.C.: Searching patterns for relation extraction over the web: rediscovering the pattern-relation duality. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, pp. 825–834 (2011)
Gamallo, P., Garcia, M., Fernández-Lanza, S.: Dependency-based open information extraction. In: Proceedings of the Joint Workshop on Unsupervised and Semi-supervised Learning in NLP, pp. 10–18 (2012)
Glass, M., Barker, K.: Bootstrapping relation extraction using parallel news articles. In: Proceedings of the IJCAI Workshop on Learning by Reading and its Applications in Intelligent Question-answering, Barcelona (2011)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The weka data mining software: an update. ACM SIGKDD Explor. Newslett. 11(1), 10–18 (2009)
Kluegl, P., Toepfer, M., Beck, P.D., Fette, G., Puppe, F.: Uima ruta: rapid development of rule-based information extraction applications. Nat. Lang. Eng. 22(01), 1–40 (2016)
Kozareva, Z., Hovy, E.: Learning arguments and supertypes of semantic relations using recursive patterns. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 1482–1491 (2010)
Li, J.H., Zhou, G.D., Zhu, Q.M., Qian, P.D.: Semantic role labeling in Chinese language for nominal predicates. Ruanjian Xuebao/J. Softw. 22(8), 1725–1737 (2011)
Lim, S., Lee, C., Ra, D.: Dependency-based semantic role labeling using sequence labeling with a structural SVM. Pattern Recogn. Lett. 34(6), 696–702 (2013)
Nie, T., Shen, D., Kou, Y., Yu, G., Yue, D.: An entity relation extraction model based on semantic pattern matching. In: 2011 Eighth Web Information Systems and Applications Conference (WISA), pp. 7–12 (2011)
Sq, L., Tj, Z., Hj, L., Py, L.: Chinese semantic role labeling based on feature combination. J. Softw. 22(2), 222–232 (2011)
Sun, A., Grishman, R., Sekine, S.: Semi-supervised relation extraction with large-scale word clustering. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 521–529 (2011)
Zahedi, M.H., Kahani, M.: SREC: discourse-level semantic relation extraction from text. Neural Comput. Appl. 23(6), 1573–1582 (2013)
Zhihua, Z., Jianxiang, W., Junfeng, T., Guoshun, W., Man, L.: Blocked person relation recognition system based on multiple features. J. Comput. Appl. 36(3), 751 (2016)
Acknowledgement
Supported in part by the Strategic Priority Research Program of the Chinese Academy of Sciences under Grant No. XDA06030200.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Jiapeng, Z., Yang, Y., Tingwen, L., Jinqiao, S. (2016). Towards Personal Relation Extraction Based on Sentence Pattern Tree. In: Chen, H., Ji, H., Sun, L., Wang, H., Qian, T., Ruan, T. (eds) Knowledge Graph and Semantic Computing: Semantic, Knowledge, and Linked Big Data. CCKS 2016. Communications in Computer and Information Science, vol 650. Springer, Singapore. https://doi.org/10.1007/978-981-10-3168-7_9
Download citation
DOI: https://doi.org/10.1007/978-981-10-3168-7_9
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3167-0
Online ISBN: 978-981-10-3168-7
eBook Packages: Computer ScienceComputer Science (R0)