A new algorithm for improving the completion performance of knowledge graph of long-tail data

Expand
  • 1. School of Mathematical Sciences, Chongqing Normal University, Chongqing 401331, China

Received date: 2021-12-26

  Online published: 2025-03-08

Copyright

, 2025, All rights reserved. Unauthorized reproduction is prohibited.

Abstract

Knowledge graph is an important semantic data in many intelligent applications, but the incompleteness of its data brings many difficulties to practical applications. Therefore, it is necessary to complete the missing semantic information in the knowledge graph. Knowledge graph embedding is one of the important methods of knowledge graph completion. This type of method usually has better results in the case of non-long-tail data, but its effect is poor in the case of long-tail data. Since the semantics of non-long-tail data is richer, in order to improve the effect of knowledge graph completion in the case of long-tail data, this paper transfers non-long-tail data as supervised knowledge to long-tail data, and proposes a new algorithm——the dual embedding method with the idea of expectation maximization algorithm, to improve the completion performance of knowledge graph, and its practical application effect. Through the comparative experiment of link prediction task in FB15K data set, the experimental results show that the dual embedding method with the idea of expectation maximization algorithm proposed in this paper is effective.

Cite this article

Miaohui HE, Xuxiang DUAN, Zhiyou WU . A new algorithm for improving the completion performance of knowledge graph of long-tail data[J]. Operations Research Transactions, 2025 , 29(1) : 41 -54 . DOI: 10.15960/j.cnki.issn.1007-6093.2025.01.004

References

1 Yang Y, Agrawal D, Jagadish H V, et al. An efficient parallel keyword search engine on knowledge graphs [C]//2019 IEEE 35th International Conference on Data Engineering, 2019: 338-349.
2 Yao X, Van Durme B. Information extraction over structured data: Question answering with freebase [C]//Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014: 956-966.
3 Zhang F, Yuan N J, Lian D, et al. Collaborative knowledge base embedding for recommender systems [C]//Proceedings of the 22nd ACM SIGKDD Iinternational Conference on Knowledge Discovery and Data Mining, 2016: 353-362.
4 Rotmensch M , Halpern Y , Tlimat A , et al. Learning a health knowledge graph from electronic medical records[J]. Scientific Reports, 2017, 7 (1): 1- 11.
5 Suchanek F M , Kasneci G , Weikum G . Yago: A large ontology from wikipedia and wordnet[J]. Journal of Web Semantics, 2008, 6 (3): 203- 217.
6 Lehmann J , Isele R , Jakob M , et al. DBpedia|A large-scale, multilingual knowledge base extracted from wikipedia[J]. Semantic Web, 2015, 6 (2): 167- 195.
7 Miller G A . WordNet: A lexical database for English[J]. Communications of the ACM, 1995, 38 (11): 39- 41.
8 Carlson A, Betteridge J, Kisiel B, et al. Toward an architecture for never-ending language learning [C]//Twenty-fourth AAAI Conference on Artificial Intelligence, 2010: 1306-1313.
9 Bordes A , Usunier N , Garcia-Duran A , et al. Translating embeddings for modeling multirelational data[J]. Advances in Neural Information Pprocessing Systems, 2013, 2, 2787- 2795.
10 Wang Z, Zhang J, Feng J, et al. Knowledge graph embedding by translating on hyperplanes [C]//Proceedings of the AAAI Conference on Artificial Intelligence, 2014, 28(1): 1112-1119.
11 Lin Y, Liu Z, Sun M, et al. Learning entity and relation embeddings for knowledge graph completion [C]//Twenty-ninth AAAI Conference on Artificial Intelligence, 2015: 2181-2187.
12 Ji G, He S, Xu L, et al. Knowledge graph embedding via dynamic mapping matrix [C]//Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, 2015: 687-696.
13 Nickel M, Tresp V, Kriegel H P. A three-way model for collective learning on multi-relational data [C]//International Conference on Machine Learning, 2011: 809-816.
14 Yang B, Yih W, He X, et al. Embedding entities and relations for learning and inference in knowledge bases [EB/OL]. (2015-08-29)[2021-11-26]. arXiv: 1412.6575.
15 Trouillon T, Welbl J, Riedel S, et al. Complex embeddings for simple link prediction [C]//International Conference on Machine Learning, 2016: 2071-2080.
16 Dettmers T, Minervini P, Stenetorp P, et al. Convolutional 2nd knowledge graph embeddings [C]//Thirty-second AAAI Conference on Artificial Intelligence, 2018: 1811-1818.
17 Vu T, Nguyen T D, Nguyen D Q, et al. A capsule network-based embedding model for knowledge graph completion and search personalization [C]//Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019: 2180-2189.
18 Sun Z Q, Deng Z H, Nie J Y, et al. RotatE: Knowledge graph embedding by relational rotation in complex space [C]//Proceedings of the International Conference on Learning Representations, 2019: 926-934.
19 Zhang S, Tay Y, Yao L, et al. Quaternion knowledge graph embeddings [C]//Advances in Neural Information Processing Systems, 2019: 2731-2741.
20 Akrami F, Saeef M S, Zhang Q, et al. Realistic re-evaluation of knowledge graph completion methods: An experimental study [C]//Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data, 2020: 1995-2010.
21 Metz L, Maheswaranathan N, Cheung B, et al. Meta-learning update rules for unsupervised representation learning [EB/OL]. (2019-02-26)[2021-11-25]. arXiv: 1804.00222.
22 Zhang Z , Zhuang F , Qu M , et al. Knowledge graph embedding with shared latent semantic units[J]. Neural Networks, 2021, 139, 140- 148.
23 Dempster A P , Laird N M , Rubin D B . Maximum likelihood from incomplete data via the EM algorithm[J]. Journal of the Royal Statistical Society: Series B (Methodological), 1977, 39 (1): 1- 22.
Outlines

/