运筹学学报 >
2025 , Vol. 29 >Issue 1: 41 - 54
DOI: https://doi.org/10.15960/j.cnki.issn.1007-6093.2025.01.004
提高长尾数据知识图谱补全性能的一种新算法
收稿日期: 2021-12-26
网络出版日期: 2025-03-08
基金资助
国家自然科学基金(11871128)
版权
A new algorithm for improving the completion performance of knowledge graph of long-tail data
Received date: 2021-12-26
Online published: 2025-03-08
Copyright
知识图谱是众多智能应用中一种重要的语义数据, 但其数据的不完备性给实际应用带来了很多困难, 因此需要对知识图谱中缺失的语义信息进行补全。知识图谱嵌入是知识图谱补全的重要方法之一, 这类方法通常在非长尾数据情况下具有较好的效果, 但在长尾数据情况下其效果较差。由于非长尾数据的语义较丰富, 为了提升长尾数据情况下知识图谱补全效果, 本文将非长尾数据作为监督知识迁移到长尾数据中, 提出了一种新的算法——融入期望最大化算法思想的双重嵌入方法, 来改进长尾数据的知识图谱补全性能, 进而提高其实际应用效果。通过在FB15K数据集中进行链接预测任务的对比实验, 实验结果表明本文提出的融入期望最大化算法思想的双重嵌入方法效果较好。
何苗惠, 段旭祥, 吴至友 . 提高长尾数据知识图谱补全性能的一种新算法[J]. 运筹学学报, 2025 , 29(1) : 41 -54 . DOI: 10.15960/j.cnki.issn.1007-6093.2025.01.004
Knowledge graph is an important semantic data in many intelligent applications, but the incompleteness of its data brings many difficulties to practical applications. Therefore, it is necessary to complete the missing semantic information in the knowledge graph. Knowledge graph embedding is one of the important methods of knowledge graph completion. This type of method usually has better results in the case of non-long-tail data, but its effect is poor in the case of long-tail data. Since the semantics of non-long-tail data is richer, in order to improve the effect of knowledge graph completion in the case of long-tail data, this paper transfers non-long-tail data as supervised knowledge to long-tail data, and proposes a new algorithm——the dual embedding method with the idea of expectation maximization algorithm, to improve the completion performance of knowledge graph, and its practical application effect. Through the comparative experiment of link prediction task in FB15K data set, the experimental results show that the dual embedding method with the idea of expectation maximization algorithm proposed in this paper is effective.
| 1 | Yang Y, Agrawal D, Jagadish H V, et al. An efficient parallel keyword search engine on knowledge graphs [C]//2019 IEEE 35th International Conference on Data Engineering, 2019: 338-349. |
| 2 | Yao X, Van Durme B. Information extraction over structured data: Question answering with freebase [C]//Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014: 956-966. |
| 3 | Zhang F, Yuan N J, Lian D, et al. Collaborative knowledge base embedding for recommender systems [C]//Proceedings of the 22nd ACM SIGKDD Iinternational Conference on Knowledge Discovery and Data Mining, 2016: 353-362. |
| 4 | Rotmensch M , Halpern Y , Tlimat A , et al. Learning a health knowledge graph from electronic medical records[J]. Scientific Reports, 2017, 7 (1): 1- 11. |
| 5 | Suchanek F M , Kasneci G , Weikum G . Yago: A large ontology from wikipedia and wordnet[J]. Journal of Web Semantics, 2008, 6 (3): 203- 217. |
| 6 | Lehmann J , Isele R , Jakob M , et al. DBpedia|A large-scale, multilingual knowledge base extracted from wikipedia[J]. Semantic Web, 2015, 6 (2): 167- 195. |
| 7 | Miller G A . WordNet: A lexical database for English[J]. Communications of the ACM, 1995, 38 (11): 39- 41. |
| 8 | Carlson A, Betteridge J, Kisiel B, et al. Toward an architecture for never-ending language learning [C]//Twenty-fourth AAAI Conference on Artificial Intelligence, 2010: 1306-1313. |
| 9 | Bordes A , Usunier N , Garcia-Duran A , et al. Translating embeddings for modeling multirelational data[J]. Advances in Neural Information Pprocessing Systems, 2013, 2, 2787- 2795. |
| 10 | Wang Z, Zhang J, Feng J, et al. Knowledge graph embedding by translating on hyperplanes [C]//Proceedings of the AAAI Conference on Artificial Intelligence, 2014, 28(1): 1112-1119. |
| 11 | Lin Y, Liu Z, Sun M, et al. Learning entity and relation embeddings for knowledge graph completion [C]//Twenty-ninth AAAI Conference on Artificial Intelligence, 2015: 2181-2187. |
| 12 | Ji G, He S, Xu L, et al. Knowledge graph embedding via dynamic mapping matrix [C]//Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, 2015: 687-696. |
| 13 | Nickel M, Tresp V, Kriegel H P. A three-way model for collective learning on multi-relational data [C]//International Conference on Machine Learning, 2011: 809-816. |
| 14 | Yang B, Yih W, He X, et al. Embedding entities and relations for learning and inference in knowledge bases [EB/OL]. (2015-08-29)[2021-11-26]. arXiv: 1412.6575. |
| 15 | Trouillon T, Welbl J, Riedel S, et al. Complex embeddings for simple link prediction [C]//International Conference on Machine Learning, 2016: 2071-2080. |
| 16 | Dettmers T, Minervini P, Stenetorp P, et al. Convolutional 2nd knowledge graph embeddings [C]//Thirty-second AAAI Conference on Artificial Intelligence, 2018: 1811-1818. |
| 17 | Vu T, Nguyen T D, Nguyen D Q, et al. A capsule network-based embedding model for knowledge graph completion and search personalization [C]//Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019: 2180-2189. |
| 18 | Sun Z Q, Deng Z H, Nie J Y, et al. RotatE: Knowledge graph embedding by relational rotation in complex space [C]//Proceedings of the International Conference on Learning Representations, 2019: 926-934. |
| 19 | Zhang S, Tay Y, Yao L, et al. Quaternion knowledge graph embeddings [C]//Advances in Neural Information Processing Systems, 2019: 2731-2741. |
| 20 | Akrami F, Saeef M S, Zhang Q, et al. Realistic re-evaluation of knowledge graph completion methods: An experimental study [C]//Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data, 2020: 1995-2010. |
| 21 | Metz L, Maheswaranathan N, Cheung B, et al. Meta-learning update rules for unsupervised representation learning [EB/OL]. (2019-02-26)[2021-11-25]. arXiv: 1804.00222. |
| 22 | Zhang Z , Zhuang F , Qu M , et al. Knowledge graph embedding with shared latent semantic units[J]. Neural Networks, 2021, 139, 140- 148. |
| 23 | Dempster A P , Laird N M , Rubin D B . Maximum likelihood from incomplete data via the EM algorithm[J]. Journal of the Royal Statistical Society: Series B (Methodological), 1977, 39 (1): 1- 22. |
/
| 〈 |
|
〉 |