
1 Introduction

Large-scale knowledge graphs (KGs), such as Freebase [1], YAGO3 [2] and DBpedia [3], have attracted extensive interest with the progress of artificial intelligence. Real-world facts are stored in KGs in the form of (subject entity, relation, object entity), denoted as (s, r, o), benefiting many applications and research areas such as question answering and semantic search. Meanwhile, KGs are still far from complete and miss a large number of valid triples. As a consequence, much research has been devoted to the knowledge graph completion task, which aims to predict missing links in knowledge graphs.

Fig. 1. Example of link prediction in a knowledge graph. An entity has much latent information entailed in its embedding, but for a given relation, only part of it is helpful for prediction.

Knowledge graph embedding models represent entities and relations in a low-dimensional continuous vector space. Benefiting from these embeddings, we can perform complicated computations on KG facts and better tackle the KG completion task. Translation-distance-based models [4,5,6,7,8] regard a relation between two entities as a translation from the subject entity to the object entity, with the relation as the medium. In contrast, bilinear models [9,10,11,12,13] propose different energy functions that score the validity of a triple rather than measure the distance between entities. Apart from these shallow models, deeper models [14, 15] have recently been proposed to extract features at a deeper level.

Though effective, these models do not consider: 1. Controlling the information flow explicitly, i.e. keeping relevant information and filtering out useless information, which restricts their performance. 2. The multi-step reasoning nature of the prediction process. An entity in a knowledge graph carries rich latent information in its representation. As illustrated in Fig. 1, the entity Michael Jordan has much latent information in the knowledge graph, which is learned into its representation implicitly. However, given a relation, not all latent semantics are helpful for predicting the object entity. Intuitively, it is more reasonable to design a module that can capture useful latent information and filter out the useless parts. Meanwhile, for a complex graph, an entity may entail so much latent information that one-step prediction is not enough for complicated predictions, yet almost all previous models ignore this nature. Multi-step architectures [16, 17] allow a model to refine information from coarse to fine over multiple steps and have been shown to benefit feature extraction considerably.

In this paper, we propose a Multi-Step Gated Embedding (MSGE) model for link prediction in KGs. During every step, the gate mechanism is applied several times to decide, at the dimension level, which features are retained and which are excluded, corresponding to a multi-step reasoning procedure. For some datasets, the gate cell is repeated several times iteratively to obtain more fine-grained information. All parameters are shared among the repeated cells, which allows our model to target the right features over multiple steps with high parameter efficiency. We conduct link prediction experiments on six publicly available benchmark datasets and achieve better performance than strong baselines on most of them. We further analyse the influence of the gate mechanism and the step length to support our motivation.

2 Background

2.1 Link Prediction in Knowledge Graphs

Link prediction in knowledge graphs aims to predict correct object entities given a pair of subject entity and relation. A knowledge graph contains a huge number of entities and relations, which has inspired previous work to cast the prediction task as a scoring and ranking task. Given a known pair of subject entity and relation (s, r), a model defines a scoring function for a triple (s, r, o), where o ranges over all entities in the knowledge graph. The model then ranks all these triples in order to find the position of the valid one. The goal is to rank all valid triples before the false ones.

2.2 Knowledge Graph Embedding

Knowledge graph embedding models aim to represent entities and relations in knowledge graphs with low-dimensional vectors \((\varvec{e_s}, \varvec{e_r}, \varvec{e_t})\). TransE [4] is a typical distance-based model with the constraint \(\varvec{e_s}+\varvec{e_r}-\varvec{e_t}\approx 0\). Many other models extend TransE by projecting subject and object entities into relation-specific vector spaces, such as TransH [5], TransR [6] and TransD [18]. TorusE [7] and RotatE [8] are also extensions of distance-based models. Instead of measuring distances among entities, bilinear models such as RESCAL [9], DistMult [10] and ComplEx [11] score a triple with multiplication operations. Tensor decomposition methods such as SimplE [12], CP-N3 [19] and TuckER [13] can also be seen as bilinear models with extra constraints. Apart from these shallow models, several deeper non-linear models have been proposed to capture more underlying features. For example, R-GCN [15] applies a specific convolution operator to model locality information in accordance with the topology of knowledge graphs. ConvE [14] first applied 2-D convolution to knowledge graph embedding and achieves competitive performance.
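As an illustration of the distance-based and bilinear families, the following minimal PyTorch-style sketch (our own, not taken from any cited implementation) contrasts the TransE and DistMult scoring functions; tensor shapes and argument names are assumptions.

```python
import torch

def transe_score(e_s, e_r, e_o, p=1):
    # TransE: a triple is plausible when e_s + e_r lies close to e_o,
    # so the score is the negative translation distance.
    return -torch.norm(e_s + e_r - e_o, p=p, dim=-1)

def distmult_score(e_s, e_r, e_o):
    # DistMult: a bilinear score with a diagonal relation matrix,
    # i.e. an element-wise product followed by a sum over dimensions.
    return torch.sum(e_s * e_r * e_o, dim=-1)
```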

The main idea of our model is to control the information flow in a multi-step way. To the best of our knowledge, the work most related to ours is TransAt [20], which also mentions the two-step reasoning nature of link prediction. However, in TransAt the first step categorizes entities with K-means, after which a distance-based scoring function measures validity. This architecture is not end-to-end and is therefore less flexible. Besides, error propagation occurs due to the use of the K-means algorithm.

3 Methods

3.1 Notations

We denote a knowledge graph as \(\mathcal {G}= \{(s,r,o)\}\subseteq \mathcal {E}\times \mathcal {R}\times \mathcal {E} \), where \(\mathcal {E}\) and \(\mathcal {R}\) are the sets of entities and relations respectively. The number of entities in \(\mathcal {G}\) is \(n_e\), the number of relations is \(n_r\), and we allocate the same dimension d to entities and relations for simplicity. \(\varvec{E} \in \mathbb {R}^{n_e\times d} \) is the embedding matrix for entities and \(\varvec{R}\in \mathbb {R}^{n_r\times d}\) is the embedding matrix for relations. \(\varvec{e_s}\), \(\varvec{e_r}\) and \(\varvec{e_o}\) represent the embeddings of the subject entity, relation and object entity respectively. Besides, we denote a gate cell in our model as C.

Fig. 2. The schematic diagram of our model with step length 3. \(e_s\) and \(e_r\) represent the embeddings of the subject entity and relation respectively. \(e_r^{i}\) is the query relation fed into the i-th step to refine information. \(\tilde{\varvec{e_s}}\) is the final output information; matrix multiplication is then performed between \(\tilde{\varvec{e_s}}\) and the entity embedding matrix \(\varvec{E}\). At last, the logistic sigmoid function is applied to restrict the final score between 0 and 1.

3.2 Multi-step Gate Mechanism

In order to obtain useful information, we need a specific module that extracts the needed information from the subject entity with respect to the given relation, which can be regarded as a control of information flow guided by the relation. To model this process, we introduce the gate mechanism, which is widely used in data mining and natural language processing models to guide the transmission of information, e.g. the Long Short-Term Memory (LSTM) [21] and the Gated Recurrent Unit (GRU) [22]. Here we adopt the gating mechanism at the dimension level to control the information entailed in the embedding. To make the entity interact with the relation specifically, we rewrite the gate cell for multiple steps with two gates as below:

$$\begin{aligned} \begin{aligned}&\varvec{z} = \sigma (\varvec{W_z}[\varvec{e_r},\varvec{e_s}]+\varvec{b_z}) \\&\varvec{r} = \sigma (\varvec{W_r}[\varvec{e_r},\varvec{e_s}]+\varvec{b_r}) \\ \end{aligned} \end{aligned}$$
(1)

The two gates \(\varvec{z}\) and \(\varvec{r}\) are called the update gate and the reset gate respectively and control the information flow. The reset gate is designed for generating a new \(\varvec{e_s^{'}}\), in other words new information, as follows:

$$\begin{aligned} \begin{aligned}&\varvec{e_s^{'}} = tanh(\varvec{W_s}[\varvec{r}\odot \varvec{e_s}, \varvec{e_r}]+\varvec{b}) \\ \end{aligned} \end{aligned}$$
(2)

The update gate decides how much of the generated information is kept, according to formula (3):

$$\begin{aligned} \begin{aligned}&\tilde{\varvec{e_s}} = (1-\varvec{z})\odot \varvec{e_s^{'}} + \varvec{z}\odot {\varvec{e_s}} \\ \end{aligned} \end{aligned}$$
(3)

The Hadamard product controls the information at the dimension level. The values of the two gates are generated by the interaction between the subject entity and the relation. The logistic sigmoid function \(\sigma \) projects the results between 0 and 1, where 0 means totally excluded and 1 means totally kept; this is the core module for controlling the flow of information. We denote the gate cell as C.
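For concreteness, a minimal PyTorch sketch of such a gate cell is given below; it follows Eqs. (1)-(3) directly, with the bias terms folded into nn.Linear. The class and variable names are ours and not from a released implementation.

```python
import torch
import torch.nn as nn

class GateCell(nn.Module):
    """Gate cell C implementing Eqs. (1)-(3): reset and update gates
    decide, per dimension, how much relation-specific information
    replaces the incoming entity representation."""
    def __init__(self, dim):
        super().__init__()
        self.W_z = nn.Linear(2 * dim, dim)  # update gate, Eq. (1)
        self.W_r = nn.Linear(2 * dim, dim)  # reset gate, Eq. (1)
        self.W_s = nn.Linear(2 * dim, dim)  # candidate information, Eq. (2)

    def forward(self, e_s, e_r):
        x = torch.cat([e_r, e_s], dim=-1)
        z = torch.sigmoid(self.W_z(x))                     # update gate
        r = torch.sigmoid(self.W_r(x))                     # reset gate
        e_s_new = torch.tanh(
            self.W_s(torch.cat([r * e_s, e_r], dim=-1)))   # Eq. (2)
        return (1 - z) * e_s_new + z * e_s                 # Eq. (3)
```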

Besides, to verify the effectiveness of the gate mechanism, we also list the formula of a cell that excludes the gates below, for the ablation study:

$$\begin{aligned} \tilde{\varvec{e_s}} = tanh(\varvec{W}[\varvec{e_r},\varvec{e_s}]+\varvec{b}) \end{aligned}$$
(4)
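For comparison, the gate-free variant of Eq. (4) reduces to a single non-linear projection; a minimal sketch under our own naming is:

```python
import torch
import torch.nn as nn

class NoGateCell(nn.Module):
    """Ablation cell of Eq. (4): a plain non-linear projection of the
    concatenated relation and entity embeddings, with no gating."""
    def __init__(self, dim):
        super().__init__()
        self.W = nn.Linear(2 * dim, dim)

    def forward(self, e_s, e_r):
        return torch.tanh(self.W(torch.cat([e_r, e_s], dim=-1)))
```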

Since the gate cell contains several gating operations, the computation within a single gate cell already controls information in a multi-step manner.

Fig. 3. The differences between a traditional RNN-like model and our model. In the RNN-like model (left), \(h_0\) is initialized randomly and x represents a sequence. In our model (right), \(h_0\) comes from the subject entity and x is transformed from the relation.

3.3 Iterative Multi-step Architecture

In fact, a single gate cell can already generate useful information, since the two gating operations hold considerable power for information control. However, for a complex dataset, finer and more precise features are needed for prediction. The iterative multi-step architecture allows the model to refine the representations incrementally. During each step, a query is fed into the model to interact with the features from the previous step and extract relevant information for the next step. As illustrated in Fig. 2, to generate the sequence used as input for multi-step training, we first feed the relation embedding into a fully connected layer:

$$\begin{aligned} \begin{aligned}&\varvec{e_r^{'}} = \varvec{W}\varvec{e_r} + \varvec{b} \\ \end{aligned} \end{aligned}$$
(5)

We reshape the output as a sequence \([\varvec{e_r^0}, \varvec{e_r^1}, ...,\varvec{e_r^k}] = Reshape(\varvec{e_r^{'}})\), which we call query relations. This projection aims to obtain query relations of different latent aspects so that we can use them to extract diverse information across multiple steps. Diverse information increases the robustness of a model, which further benefits performance. The query relations are fed sequentially into the gate cell to interact with the subject entity and generate information from coarse to fine. Parameters are shared across all steps, so multi-step training is indeed performed iteratively.
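A small sketch of this projection and reshaping step is shown below; the dimension d and step count k are placeholder values, not settings from the paper.

```python
import torch
import torch.nn as nn

d, k = 200, 3                             # embedding size and number of extra steps (assumed)
proj = nn.Linear(d, (k + 1) * d)          # W e_r + b as in Eq. (5)
e_r = torch.randn(1, d)                   # one relation embedding
queries = proj(e_r).reshape(1, k + 1, d)  # [e_r^0, e_r^1, ..., e_r^k]
```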

Our score function for a given triple can be summarized as:

$$\begin{aligned} \begin{aligned} \phi = (C^k(\varvec{e_s}, [\varvec{e_r^0}, \varvec{e_r^1}, ...,\varvec{e_r^k}]))\varvec{e_o} \\ \end{aligned} \end{aligned}$$
(6)

where \(C^k\) means repeating the gate cell for k steps; during each step only the corresponding \(\varvec{e_r^{i}}\) is fed in to interact with the output information from the previous step (see Fig. 2). After the final information is extracted, it interacts with the object entity through a dot product to produce the final score.
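Putting the pieces together, a hedged sketch of the multi-step scoring of Eq. (6) might look as follows; the final sigmoid follows the description in Fig. 2, E denotes the entity embedding matrix, and cell is the shared gate cell from the earlier sketch.

```python
import torch

def msge_score(e_s, queries, E, cell):
    """Sketch of Eq. (6): refine the subject representation with the
    shared gate cell once per query relation, then score all entities
    at once (1-N scoring) via a dot product and a sigmoid."""
    h = e_s
    for i in range(queries.shape[1]):   # k steps, shared parameters
        h = cell(h, queries[:, i, :])   # feed e_r^i into the cell
    return torch.sigmoid(h @ E.t())     # one score per entity
```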

Differences with RNN-like Models. In previous RNN-like models, a cell is repeated several times to process an input sequence, where the number of repetitions is determined by the length of the sequence. In contrast, we have two inputs \(\varvec{e_s}\) and \(\varvec{e_r}\) with completely different properties, namely the embeddings of the subject entity and the relation, which should not be treated as an ordinary sequence. As a result, the gate cell in our model is used to capture interactive information between entities and relations iteratively, rather than to extract information from a single input sequence. Fig. 3 shows the differences more clearly.

Training. At last, matrix multiplication is applied between the final output information and the embedding matrix E, which is called 1-N scoring [14] and scores all triples at once for efficiency and better performance. Following previous work, we also add a reciprocal triple for every instance in the dataset, which means that for a given (s, r, o) we add the reverse triple \((o, r^{-1}, s)\). We use the binary cross-entropy loss as our loss function:

$$\begin{aligned} \mathcal {L}(p,y) = -\frac{1}{N}\sum _i (y_i\cdot log(p_i)+(1-y_i)\cdot log(1-p_i)) \end{aligned}$$
(7)

We add batch normalization to regularise our model, and dropout is also used after layers. For optimization, we use Adam for a stable and fast training process. Embedding matrices are initialized with Xavier initialization. Label smoothing [23] is also used to lessen overfitting.
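A training-step sketch combining 1-N scoring, the binary cross-entropy loss of Eq. (7) and label smoothing is given below; the helper names, the smoothing value and the assumption that the model outputs probabilities over all entities are ours, not specifics reported in the paper.

```python
import torch
import torch.nn.functional as F

def train_step(model, optimizer, s_idx, r_idx, targets, smoothing=0.1):
    # targets: (batch, n_e) multi-hot vector marking all valid object entities
    optimizer.zero_grad()
    probs = model(s_idx, r_idx)                     # (batch, n_e), values in (0, 1)
    smoothed = (1.0 - smoothing) * targets + smoothing / targets.size(1)
    loss = F.binary_cross_entropy(probs, smoothed)  # Eq. (7)
    loss.backward()
    optimizer.step()
    return loss.item()
```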

4 Experiments

In this section we first introduce the benchmark datasets used in this paper, then report the empirical results to demonstrate the effectiveness of our model. Analyses and an ablation study are further reported to strengthen our motivation.

Table 1. Statistics of datasets.

4.1 Datasets

Six datasets are used in our experiments:

  • WN18 [4] is extracted from WordNet and describes the hierarchical structure of words, consisting of relations such as hyponym and hypernym.

  • WN18RR [4, 14] is a subset of WN18 that removes inverse relations. Inverse relation pairs are relations such as (hyponym, hypernym). Inverse relations may cause severe test leakage: many test triples can be obtained from the training data simply by inverting them. That means a simple rule-based model can easily figure out the right o for a given (s, r) as long as it has seen \((o,r^{'},s)\) in the training data and knows that \(r^{'}\) is the reverse of r.

  • FB15k [4] is extracted from Freebase and mostly describes relations about movies, actors, awards, sports and so on.

  • FB15k-237 [24] is a subset of FB15k in which inverse relations, together with the triples involving them, are removed from the train, valid and test data.

  • UMLS [25] comes from biomedicine. Entities in UMLS (Unified Medical Language System) are biomedical concepts such as disease and antibiotic.

  • Kinship [25] contains kinship relationships among members of the Alyawarra tribe from Central Australia.

The details of these datasets are reported in Table 1.

4.2 Experiment Setup

The evaluation metrics we use include Mean Reciprocal Rank (MRR) and Hits@K. MRR is the mean of the reciprocal ranks of the correct triples; the higher, the better the model. Hits@K reflects the proportion of gold triples ranked in the top K. Here we select K from {1, 3, 10}, consistent with previous work; a higher Hits@K indicates a better model. All results are reported with the 'Filter' setting, which removes all gold triples that exist in the train, valid or test data during ranking. As in previous work, we report test results according to the best MRR on the validation data.
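The sketch below illustrates how filtered MRR and Hits@K can be computed for a single (s, r) query; the function name and the array-based interface are assumptions made for illustration only.

```python
import numpy as np

def filtered_metrics(scores, gold_idx, known_idx, ks=(1, 3, 10)):
    """scores: 1-D array of scores over all entities for one (s, r) query;
    known_idx: indices of all entities known to complete (s, r) in
    train/valid/test, which are masked out ('Filter' setting)."""
    scores = scores.copy()
    mask = [i for i in known_idx if i != gold_idx]
    scores[mask] = -np.inf                             # remove other gold answers
    rank = 1 + int(np.sum(scores > scores[gold_idx]))  # higher score = better
    mrr = 1.0 / rank
    hits = {k: float(rank <= k) for k in ks}
    return mrr, hits
```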

The best number of iterations varies considerably across datasets. For FB15k and UMLS a single step gives the best performance, whereas for the other datasets the iterative mechanism helps boost performance. The best number of iterations is 5 for WN18, 3 for WN18RR, 8 for FB15k-237 and 2 for Kinship.

Table 2. Link prediction results on WN18, WN18RR, FB15k and FB15k-237. \(^{\dagger }\) denotes results taken from [14]; - denotes results not provided. The results of RotatE [8] are reported without self-adversarial negative sampling, a sampling trick that is independent of the model itself. Other results are taken from the original papers. Best results are in bold; the second best are underlined.
Table 3. Link prediction results on UMLS and Kinship.

4.3 Link Prediction Results

We perform the link prediction task on the six benchmark datasets, comparing with several classical baselines such as TransE [4] and DistMult [10], as well as strong state-of-the-art baselines such as ConvE [14], RotatE [8] and TuckER [13]. For the smaller datasets UMLS and Kinship, we also compare with non-embedding methods such as NTP [26] and NeuralLP [27], which learn logic rules for prediction, and MINERVA [28], which uses reinforcement learning to reason over paths in knowledge graphs.

The results are reported in Table 2 and Table 3. Overall, we can conclude that our model achieves comparable or better performance than state-of-the-art models on most datasets. Even on the more difficult datasets without inverse relations, WN18RR and FB15k-237, our model still achieves comparable performance.

Table 4. Influence of number of iterations on FB15k-237.
Fig. 4. Convergence study between TuckER and MSGE (ours) on FB15k-237 and WN18RR.

4.4 Analysis on Number of Iterations

To study the effectiveness of the iterative multi-step architecture, we list the performance for different numbers of steps on FB15k-237 in Table 4. The model settings are exactly the same except for the number of steps. From the results we can conclude that the multi-step mechanism indeed boosts performance on a complex knowledge graph like FB15k-237, which verifies our motivation that refining information over several steps yields more helpful information for complex datasets.

4.5 Convergence Study

We report the convergence process of TuckER and MSGE on the FB15k-237 and WN18RR datasets in Fig. 4. We re-run TuckER with exactly the settings claimed in its paper. All results represent performance on the validation data. For MSGE, we also report the one-step result for comparison. It is obvious that MSGE converges rapidly compared to TuckER, with nearly the same or better final performance. From the perspective of model architecture, TuckER needs an extra core tensor W to capture interactive information, while in MSGE entities and relations interact with each other directly through a gate cell. On WN18RR, we find that the convergence process of TuckER is not as steady as that of MSGE, which further demonstrates the efficiency of our model.

Table 5. Parameter counts comparison.

4.6 Efficiency Analysis

In Table 5, we report the parameter counts of ConvE, TuckER and our model for comparison. Our model achieves better performance on most datasets with far fewer parameters, which means it can be more easily migrated to large knowledge graphs. For TuckER, the current state-of-the-art method, the parameter count is dominated by the core interaction tensor W, whose size is \(d_e\times d_r\times d_e\). As the embedding dimension grows, this core tensor leads to a large increase in parameter size. In contrast, our model is an iterative architecture, so only very few parameters are needed apart from the embeddings; the complexity is \(\mathcal {O}(n_ed+n_rd)\). To evaluate time efficiency, we re-run TuckER and our model on a Tesla K40c. TuckER needs 29 s/28 s per epoch on FB15k-237/WN18RR respectively, while MSGE needs 17 s/24 s, which demonstrates the time efficiency afforded by the few operations in our model.

Table 6. Ablation study on FB15k-237.

4.7 Ablation Study

To further demonstrate our motivation that the gate mechanism and multi-step reasoning are beneficial for extracting information, we conduct an ablation study with the following settings:

  • No gate: Remove the gates in our model to verify the necessity of controlling information flow.

  • Concat: Concatenate the information extracted in every step and feed it into a fully connected layer to obtain another kind of final information. This is used to verify that more useful information is produced by the multi-step procedure.

  • Replicate: Replicate the relation to obtain k identical query relations for training. This is to prove that extracting diverse information from multi-view query relations is more helpful than using the same relation k times.

The experiment results are reported in Table 6. All results support our motivation that controlling information flow in a multi-step way is beneficial for the link prediction task in knowledge graphs; in particular, the gated cell is of much benefit for information extraction.

5 Conclusion and Future Work

In this paper, we propose MSGE, a multi-step gated model for the link prediction task in knowledge graph completion. We utilize the gate mechanism to control the information flow generated by the interaction between the subject entity and the relation, and we repeat the gated module to refine information from coarse to fine. The empirical results show that applying the gated module over multiple steps helps extract more useful information, which further boosts link prediction performance. We also provide analyses from different perspectives to support this conclusion. Note that all the information contained in the embeddings is learned implicitly during training. In future work, we would like to aggregate more information for entities to enhance feature extraction, for example from neighboring nodes and relations.