CN111160564A - Chinese knowledge graph representation learning method based on feature tensor - Google Patents


Info

Publication number
CN111160564A
Authority
CN
China
Prior art keywords
entity
vector
triples
matrix
triple
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201911300781.5A
Other languages
Chinese (zh)
Other versions
CN111160564B (en)
Inventor
李巧勤 (Li Qiaoqin)
郑子强 (Zheng Ziqiang)
刘勇国 (Liu Yongguo)
杨尚明 (Yang Shangming)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Electronic Science and Technology of China
Original Assignee
University of Electronic Science and Technology of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Electronic Science and Technology of China filed Critical University of Electronic Science and Technology of China
Priority to CN201911300781.5A priority Critical patent/CN111160564B/en
Publication of CN111160564A publication Critical patent/CN111160564A/en
Application granted granted Critical
Publication of CN111160564B publication Critical patent/CN111160564B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 20/00 Machine learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F 16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F 16/36 Creation of semantic tools, e.g. ontology or thesauri
    • G06F 16/367 Ontology

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Medical Informatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Machine Translation (AREA)

Abstract

The invention provides a Chinese knowledge graph representation learning method based on feature tensors, comprising the following steps: preparing data, establishing data structures, constructing an entity feature vector matrix, defining the relation vector and distance formula of labeled triples, obtaining a training set, training the knowledge graph representation learning model, updating the model parameters, training iteratively, predicting relations for unlabeled triples with the model, and iterating again until no new unlabeled triples can be learned. The invention forms a feature tensor from Chinese pinyin, character information, word information and description information and converts it into a feature vector, replacing the random initialization of entity vectors used in traditional knowledge representation learning and making full use of the characteristics of Chinese. In addition, a two-layer iteration scheme supplements the training corpus, so that the relation matrix is continuously corrected, improving both the precision and the convergence speed of the knowledge graph representation learning model.

Description

Chinese knowledge graph representation learning method based on feature tensor
Technical Field
The invention relates to the field of knowledge graphs, in particular to a Chinese knowledge graph representation learning method based on feature tensor.
Background
A knowledge graph describes the complex relations between concepts and entities in the objective world in a structured form, and provides the ability to better organize, manage and understand the massive information of the Internet. Knowledge graph technology generally comprises three research areas: knowledge representation, knowledge graph construction and knowledge graph application. Knowledge representation is the foundation of construction and application; it reflects human cognition of the objective world and can express the semantics of the objective world at different levels and granularities. One must first understand how humans represent knowledge and use it to solve problems, and then formalize that knowledge into a representation that a computer can reason over and compute with, so as to build knowledge-based systems and provide intelligent knowledge services. At the same time, knowledge representation must combine a computer's capabilities for symbol representation, processing and computation. The key problems knowledge representation must solve are: 1) what form of representation accurately reflects knowledge of the objective world; 2) what representation provides semantic expressiveness; 3) how the representation supports efficient knowledge reasoning and computation, so that new knowledge can be inferred. Current knowledge representation methods can be divided into knowledge representation based on symbolic logic, open knowledge representation methods for Internet resources, and knowledge-graph-based representation learning.
1) Knowledge representation based on symbolic logic: although symbolic-logic-based techniques describe logical reasoning well, machines are weak at generating rules during reasoning; inference rules require a great deal of manual effort to acquire and place high demands on data quality, so symbolic-logic-based knowledge representation cannot adequately solve the knowledge representation problem in today's era of large-scale data.
2) Knowledge representation of web content: Tim Berners-Lee proposed the concept of the Semantic Web, in which web content should have a definite meaning and be easily understood, acquired and integrated by computers. Web-content knowledge representation includes the tag-based semi-structured markup language XML, the RDF framework for semantic metadata describing web resources, the OWL ontology description language based on description logic, and so on. Industry has also adopted, at large scale, a triple-based knowledge graph representation in which a triple <h, r, t> states that a relation r holds between a head entity h and a tail entity t. These technologies allow us to publish semantic information on the World Wide Web that machines can understand and process. But web content runs to hundreds of trillions of items, which poses a huge challenge for knowledge storage and knowledge representation learning.
3) Representation learning: the goal of representation learning is to express the semantic information of the studied objects as dense low-dimensional vectors via machine learning or deep learning, giving knowledge units of different granularities an implicit vectorized representation that supports fast knowledge computation in big-data environments. The main approaches are tensor reconstruction and potential-energy functions. Tensor reconstruction integrates the information of the whole knowledge base, but in big-data settings the tensor dimensionality is high and reconstruction is computationally expensive. Potential-energy methods treat a relation as a translation from the head entity to the tail entity; the TransE model proposed by Bordes et al. is the representative translation model, but it lacks explicit semantic information. For languages such as Chinese, which carry pinyin, structural and word information, the low-dimensional vectors learned by machine learning or deep learning are merely parameters fitted by the computer and lack interpretability.
In summary, symbolic-logic-based knowledge representation and open knowledge representation methods for Internet resources give knowledge explicit semantic definitions, but suffer from data sparsity and are difficult to apply to large-scale knowledge graphs; deep-learning-based knowledge representation can map knowledge units (entities, relations and rules) into a low-dimensional continuous real-valued space, but lacks explicit semantic definitions.
In addition, there is a large body of research abroad on knowledge graph representation, but it is limited to English knowledge graphs. Owing to language differences, English words carry only simple string and phrase information, so randomly initialized vectors suffice for representation learning; Chinese, by contrast, contains rich semantic information, and existing learning methods do not perform well on Chinese knowledge graphs. Domestic work currently remains at the stage of constructing knowledge graphs, and research on Chinese knowledge graph representation learning is lacking.
Disclosure of Invention
To address these problems, the invention provides a Chinese knowledge graph representation learning method based on feature tensors. Compared with randomly initialized vectors, the invention introduces four features (Chinese pinyin, characters, words and description information) as explicit Chinese semantic information to form a feature tensor, making the learning process of the Chinese knowledge graph representation interpretable, and combines this with deep learning to map the learned knowledge representation into a low-dimensional continuous real-valued space, facilitating the learning of Chinese knowledge and the relations between knowledge items.
The invention provides a Chinese knowledge graph representation learning method based on feature tensors, comprising the following steps:
step 1) data preparation
Data from the open Chinese linked data set zhishi.me form the triple data, which consist of a large number of triples of the form <h, r, t>, where h denotes a head entity, t denotes a tail entity, and r denotes the relation between the head entity h and the tail entity t;
step 2) establishing a data structure
Dividing the triple data into labeled triples and unlabeled triples, and constructing the following data structures: a dictionary, an entity dictionary, a relation dictionary, an entity pinyin matrix, a character embedding matrix, a word embedding matrix and a description matrix;
step 3) constructing the entity feature vector matrix
For each entity in the labeled triples, the entity pinyin vector, character vector, word vector and description vector first form the feature tensor of the entity; the feature tensors of all entities in the labeled triples are then converted into entity feature vectors, and the entity feature vector matrix is constructed following the order of the entity dictionary;
step 4) take a labeled triple T_l = <h, r, t> and obtain the feature vectors h_ft and t_ft of the head entity h and the tail entity t from the entity feature vector matrix. To express that entity h stands in relation r to entity t, i.e. h + r = t, the relation vector of the labeled triple T_l = <h, r, t> can be written as:
r = t_ft - h_ft
To measure the distance between entity h and entity t, the relation between entities is expressed as a vector translation, and the distance of a triple <h, r, t> is defined by the Euclidean distance:
d(h + r, t) = || h_ft + r - t_ft ||_2^2
where the subscript 2 denotes the 2-norm (the Euclidean norm) and the superscript 2 denotes squaring;
step 5) take the labeled triples as the training set; initialize the entity vectors, namely the entity feature vector matrix, initialize the relation vectors and build the relation vector matrix in the same order as the relation dictionary, each relation being computed by the formula r = t_ft - h_ft. If several entity pairs share the same relation, the relation vector is the average of the difference vectors of those entity pairs; after all relation vectors are initialized they are normalized, which improves precision and strengthens convergence;
step 6) randomly select a positive triple <h, r, t> from the training set, draw the negative triples <h', r, t> and <h, r, t'>, and pair each with <h, r, t> to form a training batch T_batch = [(<h, r, t>, <h', r, t>), (<h, r, t>, <h, r, t'>)]. Denote the positive triples by S_p = {<h, r, t>} and the negative triples by S_f = {<h', r, t> | h' ∈ E} ∪ {<h, r, t'> | t' ∈ E}, where E denotes the entity set. T_batch is used as the input of the knowledge graph representation learning model, and the model is trained on T_batch. Combining the distance formula d(h + r, t) = || h_ft + r - t_ft ||_2^2, the loss function of the knowledge graph representation learning model is defined as:
L = Σ_{<h,r,t> ∈ S_p} Σ_{<h',r,t'> ∈ S_f} [ γ + d(h + r, t) - d(h' + r, t') ]_+
where γ is a separation margin greater than 0, a hyperparameter, and [x]_+ denotes the positive-part function, i.e. [x]_+ = x when x > 0 and [x]_+ = 0 when x ≤ 0. This training scheme is called the margin-based ranking criterion; it aims to separate positive triples from negative triples as far as possible and to find the maximum-margin support vectors;
step 7) update the parameters of the knowledge graph representation learning model by stochastic gradient descent (SGD); the gradient update only requires computing the distances d(h + r, t) and d(h' + r, t'). Given |E| entities and |R| relations, with entity vectors of dimension m and relation vectors of dimension n, a total of |E|·m + |R|·n parameters must be updated;
step 8) repeat steps 6) to 7) for iterative training; after the iterative training ends, use the learned model parameters to predict relations for the unlabeled triples as follows: take any unlabeled triple <h, r, t>_unlabel and predict the relation r' between h and t with the model; if r' = r, the prediction is correct. The correctly predicted triples are then taken as positive triples, their head or tail entities are randomly replaced to form negative triples, and the new positive and negative triples are merged into the original labeled triples to form a new labeled set;
step 9) repeat steps 4) to 8) with the new labeled triples for iterative training until no new unlabeled triples can be learned, which indicates that the knowledge graph representation learning model can learn no further Chinese knowledge features from the current training data; the entity vectors and relation vectors output by the model are then the best Chinese knowledge graph representation for the Chinese linked data set zhishi.me.
Addressing the inability of existing knowledge representation learning methods to exploit Chinese character and word information, the invention forms a feature tensor from Chinese pinyin, character information, word information and description information and converts it into a feature vector, replacing the random initialization of entity vectors used in traditional knowledge representation learning and making full use of the characteristics of Chinese. In addition, the invention supplements the training corpus through a two-layer iteration scheme, so that the relation matrix is continuously corrected, improving both the precision and the convergence speed of the knowledge graph representation learning model.
Drawings
FIG. 1 is a process flow diagram of the method of the present invention
FIG. 2 is a flow chart of the processing procedure of the method of the present invention
FIG. 3 illustrates the description matrix: entity description vectors encoded by the BiLSTM
FIG. 4 is a schematic diagram of converting the feature tensor into a feature vector
Detailed Description
The technical solutions in the embodiments of the present invention are described below clearly and completely with reference to the drawings; the described embodiments are obviously only some, not all, of the embodiments of the invention.
As shown in fig. 2, the feature-tensor-based Chinese knowledge graph representation learning method provided by the invention comprises the following steps:
step 1) data preparation
The triple data used by the method come from the open Chinese linked data set zhishi.me and consist of a large number of triples of the form <h, r, t>, where h denotes the head entity, t the tail entity, and r the relation between the head entity h and the tail entity t.
Step 2) establishing a data structure
As shown in fig. 1, the triple data are divided into labeled triples and unlabeled triples, and the following data structures are constructed: a dictionary, an entity dictionary, a relation dictionary, an entity pinyin matrix, a character embedding matrix, a word embedding matrix and a description matrix.
Labeled triples: triples are randomly extracted from the data set zhishi.me to obtain a triple data set, and all triples in it are taken as positive triples. For each positive triple, the head entity or the tail entity is removed and replaced with a different entity selected at random from the entity dictionary, forming a negative triple; only one entity is replaced per triple, so that the pair remains comparable. The triples are then labeled: positive triples as 1, negative triples as 0.
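A minimal sketch of this corruption step, assuming triples are plain (h, r, t) tuples; the entities and relations below are hand-written toy stand-ins for zhishi.me data, not examples from the patent:

```python
import random

# Toy stand-ins for triples drawn from zhishi.me (illustrative only)
positive_triples = [("成都", "位于", "四川"), ("四川", "省会", "成都")]
entity_list = ["成都", "四川", "北京", "中国"]

def corrupt_triple(triple, entities):
    """Build a negative triple by replacing exactly one entity (head or
    tail) of a positive triple with a different, randomly chosen entity."""
    h, r, t = triple
    if random.random() < 0.5:
        h = random.choice([e for e in entities if e != h])   # corrupt the head
    else:
        t = random.choice([e for e in entities if e != t])   # corrupt the tail
    return (h, r, t)

# Label the set: positive triples as 1, their corrupted copies as 0
labeled = [(tr, 1) for tr in positive_triples]
labeled += [(corrupt_triple(tr, entity_list), 0) for tr in positive_triples]
```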
Unlabeled triples: any unlabeled triple in the data set zhishi.me.
Dictionary: built from the data set zhishi.me; it covers all head entities, tail entities and relations, with entries of the form "character: serial number", the serial number being a number increasing from zero.
Entity dictionary: built from the data set zhishi.me; a dictionary of all head entities and tail entities, with entries of the form "entity name: serial number", the serial number being a number increasing from zero.
Relation dictionary: built from the data set zhishi.me; a dictionary of all relations, with entries of the form "relation name: serial number", the serial number being a number increasing from zero.
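The three dictionaries can be built with one helper. A sketch under the assumption that the plain dictionary indexes every character occurring in entities and relations (toy triples again stand in for the data set):

```python
def build_index(items):
    """Assign each distinct item a serial number, increasing from zero."""
    index = {}
    for item in items:
        if item not in index:
            index[item] = len(index)
    return index

triples = [("成都", "位于", "四川"), ("四川", "省会", "成都")]
entity_dict   = build_index([e for h, r, t in triples for e in (h, t)])
relation_dict = build_index([r for h, r, t in triples])
# the character dictionary covers every character of entities and relations
char_dict     = build_index([c for h, r, t in triples for c in h + r + t])
```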
Entity pinyin matrix: to resolve polyphonic characters with different meanings, the Baidu translation API is called to obtain each entity's pinyin and the entity pinyin matrix is constructed; its number of rows equals the number of entities in the entity dictionary, and each row is an entity pinyin vector obtained by one-hot coding.
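One plausible reading of the one-hot coding step, sketched below: each pinyin syllable is one-hot encoded over a syllable vocabulary and the indicators are merged into one fixed-length row per entity. The pinyin strings here are hand-written assumptions; the patent obtains them from the Baidu translation API, which is not reproduced in this sketch:

```python
import numpy as np

# Hypothetical pinyin for two toy entities (tone marked by a digit)
entity_pinyin = {"成都": ["cheng2", "du1"], "四川": ["si4", "chuan1"]}

syllables = sorted({s for ps in entity_pinyin.values() for s in ps})
syl_index = {s: i for i, s in enumerate(syllables)}

def pinyin_vector(entity):
    """One-hot each syllable and merge into one fixed-length row vector."""
    v = np.zeros(len(syllables))
    for s in entity_pinyin[entity]:
        v[syl_index[s]] = 1.0
    return v

# Rows follow the entity dictionary order
pinyin_matrix = np.stack([pinyin_vector(e) for e in entity_pinyin])
```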
Word embedding matrix: the number of lines corresponds to the number of words in the dictionary, and each line uses a word vector derived from word2 vec.
Word embedding matrix: the number of rows equals the number of entities in the entity dictionary, and each row is a word vector obtained from word2vec.
Description matrix: the number of rows equals the number of entities in the entity dictionary. The Baidu Baike API is called to obtain each entity's description, which is input into a bidirectional Long Short-Term Memory network (BiLSTM) for encoding, yielding the entity description vector, as shown in FIG. 3. This vector introduces the entity's description information and helps resolve Chinese synonyms.
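A minimal PyTorch sketch of the description encoder. The bidirectional LSTM is as stated above; the mean pooling over time steps and all dimensions are assumptions of this sketch, since the patent does not fix them:

```python
import torch
import torch.nn as nn

class DescriptionEncoder(nn.Module):
    """Encode a description (a sequence of token embeddings) into one
    entity description vector e_d with a bidirectional LSTM."""
    def __init__(self, emb_dim=100, hidden=50):
        super().__init__()
        self.lstm = nn.LSTM(emb_dim, hidden,
                            bidirectional=True, batch_first=True)

    def forward(self, token_embs):          # (batch, seq_len, emb_dim)
        out, _ = self.lstm(token_embs)      # (batch, seq_len, 2 * hidden)
        return out.mean(dim=1)              # pool over time -> e_d

encoder = DescriptionEncoder()
desc = torch.randn(1, 20, 100)              # one 20-token description
e_d = encoder(desc)                          # entity description vector
```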
Step 3) constructing the entity feature vector matrix
For each entity in the labeled triples, the entity pinyin vector, character vector, word vector and description vector first form the feature tensor of the entity, which serves as the entity's predefined feature tensor in the subsequent steps. The construction proceeds as follows: denote a labeled triple by T_l, the entity set of the knowledge graph by E and the relation set by R. Select an entity e ∈ E from a labeled triple and look up the entity pinyin matrix to obtain the entity pinyin vector e_p. Let the entity name be c_1 c_2 ... c_m, where c_m denotes the m-th character of the name; looking the characters up in the character embedding matrix gives the character vector e_c = c_1 c_2 ... c_m. Looking up the word embedding matrix gives the word vector e_w, and looking up the description matrix gives the description vector e_d. The feature tensor of the entity is then expressed as
FeatureTensor(e) = (e_p, e_c, e_w, e_d)
The entity's feature tensor is then converted into its feature vector to build the entity feature vector matrix. As shown in fig. 4, the different dimensions of the feature tensor are connected by vector concatenation. For example, given vectors A = [x_1, x_2, x_3, ..., x_m] and B = [y_1, y_2, y_3, ..., y_n], concatenation yields C = [x_1, x_2, x_3, ..., x_m, y_1, y_2, y_3, ..., y_n]. Dropout randomly zeroes entries of the vector to prevent the learned knowledge representation from overfitting. Applying this to the pinyin vector e_p, character vector e_c, word vector e_w and description vector e_d of the same entity e yields the entity feature vector e_ft = [e_p; e_c; e_w; e_d].
The feature tensors of all entities in the labeled triples are converted into entity feature vectors in this way, and the entity feature vector matrix is constructed following the order of the entity dictionary.
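A sketch of the tensor-to-vector conversion under these definitions; the dropout rate and the toy feature dimensions are assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)

def feature_vector(e_p, e_c, e_w, e_d, drop=0.1):
    """Concatenate the four per-entity vectors into e_ft and, during
    training, randomly zero a fraction of entries (dropout)."""
    e_ft = np.concatenate([e_p, e_c, e_w, e_d])
    mask = rng.random(e_ft.shape) >= drop
    return e_ft * mask

# Toy vectors standing in for the pinyin, character, word and
# description features of one entity
e_p, e_c, e_w, e_d = (rng.standard_normal(8) for _ in range(4))
e_ft = feature_vector(e_p, e_c, e_w, e_d)
entity_matrix = np.stack([e_ft])   # rows follow the entity dictionary order
```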
Step 4) take a labeled triple T_l = <h, r, t> and obtain the feature vectors h_ft and t_ft of the head entity h and the tail entity t from the entity feature vector matrix. To express that entity h stands in relation r to entity t, i.e. h + r = t, the relation vector of the labeled triple T_l = <h, r, t> can be written as:
r = t_ft - h_ft    (1)
To measure the distance between entity h and entity t, the relation between entities is expressed as a vector translation, and the distance of a triple <h, r, t> is defined by the Euclidean distance:
d(h + r, t) = || h_ft + r - t_ft ||_2^2    (2)
The subscript 2 in equation (2) denotes the 2-norm (the Euclidean norm), and the superscript 2 denotes squaring.
Step 5) take the labeled triples as the training set; initialize the entity vectors, namely the entity feature vector matrix, initialize the relation vectors and build the relation vector matrix in the same order as the relation dictionary, each relation being computed by formula (1). If several entity pairs share the same relation, the relation vector is the average of the difference vectors of those entity pairs. Normalizing all vectors after initialization improves precision and strengthens convergence.
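A sketch of this initialization, assuming ent_vec maps entity names to the feature vectors e_ft of step 3:

```python
import numpy as np
from collections import defaultdict

def init_relation_vectors(triples, ent_vec):
    """Initialize each relation vector as the mean of t_ft - h_ft over all
    entity pairs sharing that relation, then L2-normalize it."""
    diffs = defaultdict(list)
    for h, r, t in triples:
        diffs[r].append(ent_vec[t] - ent_vec[h])
    rel_vec = {}
    for r, ds in diffs.items():
        v = np.mean(ds, axis=0)
        rel_vec[r] = v / (np.linalg.norm(v) or 1.0)   # guard the zero vector
    return rel_vec

ent_vec = {"成都": np.ones(4), "四川": np.zeros(4)}   # toy entity vectors
rel_vec = init_relation_vectors([("成都", "位于", "四川")], ent_vec)
```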
Step 6) randomly select a positive triple <h, r, t> from the training set, draw the negative triples <h', r, t> and <h, r, t'>, and pair each with <h, r, t> to form a training batch T_batch = [(<h, r, t>, <h', r, t>), (<h, r, t>, <h, r, t'>)]. Denote the positive triples by S_p = {<h, r, t>} and the negative triples by S_f = {<h', r, t> | h' ∈ E} ∪ {<h, r, t'> | t' ∈ E}. T_batch is used as the input of the knowledge graph representation learning model, and the model is trained on T_batch. Combining formula (2), the loss function of the knowledge graph representation learning model is defined as:
L = Σ_{<h,r,t> ∈ S_p} Σ_{<h',r,t'> ∈ S_f} [ γ + d(h + r, t) - d(h' + r, t') ]_+
where γ is a separation margin greater than 0, a hyperparameter, and [x]_+ denotes the positive-part function, i.e. [x]_+ = x when x > 0 and [x]_+ = 0 when x ≤ 0. This training scheme is called the margin-based ranking criterion; it aims to separate positive triples from negative triples as far as possible and to find the maximum-margin support vectors.
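A numpy sketch of the distance and loss just defined, with randomly generated vectors standing in for learned embeddings:

```python
import numpy as np

def distance(h_ft, r, t_ft):
    """d(h + r, t) = || h_ft + r - t_ft ||_2^2 (squared Euclidean norm)."""
    return float(np.sum((h_ft + r - t_ft) ** 2))

def margin_loss(batch, gamma=1.0):
    """Sum of [gamma + d(positive) - d(negative)]_+ over the pairs of
    T_batch; each pair joins a positive triple with a corrupted one."""
    loss = 0.0
    for (h, r, t), (hn, rn, tn) in batch:
        loss += max(0.0, gamma + distance(h, r, t) - distance(hn, rn, tn))
    return loss

rng = np.random.default_rng(0)
h, r, t = rng.standard_normal((3, 8))    # embeddings of one positive triple
hn = rng.standard_normal(8)              # embedding of a corrupted head
print(margin_loss([((h, r, t), (hn, r, t))]))
```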
Step 7) update the parameters of the knowledge graph representation learning model by stochastic gradient descent (SGD); the gradient update only requires computing the distances d(h + r, t) and d(h' + r, t'). Given |E| entities and |R| relations, with entity vectors of dimension m and relation vectors of dimension n, a total of |E|·m + |R|·n parameters must be updated.
Step 8) repeat steps 6) to 7) for iterative training. After the iterative training ends, the knowledge graph representation learning model is used to predict relations for the unlabeled triples: take any unlabeled triple <h, r, t>_unlabel and predict the relation r' between h and t with the model; if r' = r, the prediction is correct. The correctly predicted triples are then taken as positive triples, their head or tail entities are randomly replaced to form negative triples, and the new positive and negative triples are merged into the original labeled triples to form a new labeled set.
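A sketch of one pass of this outer loop. Predicting r' as the relation whose vector lies closest to t_ft - h_ft is an assumption of the sketch, since the patent does not spell out the predictor:

```python
import numpy as np

def predict_relation(h_ft, t_ft, rel_vec):
    """Predict the relation whose vector is nearest to t_ft - h_ft."""
    gap = t_ft - h_ft
    return min(rel_vec, key=lambda r: np.linalg.norm(gap - rel_vec[r]))

def self_train_pass(unlabeled, ent_vec, rel_vec):
    """One outer iteration: keep the correctly predicted unlabeled triples
    as new positives (corrupt them for matching negatives, then retrain)."""
    return [(h, r, t) for h, r, t in unlabeled
            if predict_relation(ent_vec[h], ent_vec[t], rel_vec) == r]
```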
Step 9) repeat steps 4) to 8) with the new labeled triples for iterative training until no new unlabeled triples can be learned, which indicates that the knowledge graph representation learning model can learn no further Chinese knowledge features from the current training data; the entity vectors and relation vectors output by the model are then the best Chinese knowledge graph representation for the data set zhishi.me.
Chinese knowledge graph representation learning generally adopts link prediction as the evaluation task. The evaluation metrics are the mean rank (MR), the mean reciprocal rank (MRR), and the proportion of test cases in which the correct entity ranks within the top ten (Hits@10), the top three (Hits@3) or first (Hits@1); a smaller MR is better, and larger values of MRR, Hits@10, Hits@3 and Hits@1 are better. Some entities or relations of triples in the data set zhishi.me are removed at random, and link prediction consists of predicting the removed entities or relations.
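A sketch of these metrics, given the 1-based rank of the correct entity for each test query:

```python
def rank_metrics(ranks):
    """ranks: 1-based rank of the correct entity in each test query."""
    n = len(ranks)
    hits = lambda k: sum(r <= k for r in ranks) / n
    return {"MR":  sum(ranks) / n,
            "MRR": sum(1.0 / r for r in ranks) / n,
            "Hits@10": hits(10), "Hits@3": hits(3), "Hits@1": hits(1)}

print(rank_metrics([1, 3, 12, 2, 1]))   # toy ranks
```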
In the embodiment of the invention, representation learning and the link prediction task are evaluated on the open-source Chinese data set zhishi.me, and the results are compared with those of two knowledge graph representation learning methods, the TransE model and the TransR model, as shown in Table 1:
TABLE 1 Test results

Model          MR    MRR    Hits@10  Hits@3  Hits@1
TransE         713   0.458  0.812    0.723   0.556
TransR         687   0.519  0.839    0.768   0.646
The invention  611   0.843  0.875    0.801   0.692
The experimental results show that the method of the invention outperforms the TransE and TransR models and reaches a usable level.
Although illustrative embodiments of the present invention have been described above to help those skilled in the art understand it, the scope of the invention is not limited to these specific embodiments. All variations that are obvious and all inventions making use of the concepts of the present invention are intended to be protected.

Claims (4)

1. A Chinese knowledge graph representation learning method based on feature tensor is characterized by comprising the following steps:
step 1) data preparation
Data from the open Chinese linked data set zhishi.me form the triple data, which consist of a large number of triples of the form <h, r, t>, where h denotes a head entity, t denotes a tail entity, and r denotes the relation between the head entity h and the tail entity t;
step 2) establishing a data structure
Dividing the triple data into labeled triples and unlabeled triples, and constructing the following data structures: a dictionary, an entity dictionary, a relation dictionary, an entity pinyin matrix, a character embedding matrix, a word embedding matrix and a description matrix, wherein,
marking a triple: randomly extracting triple data from the Chinese link data set zhishi.me to obtain a triple data set, taking all triples in the triple data set as positive triples, removing a head entity or a tail entity of each positive triplet, randomly selecting an entity different from the triple in an entity dictionary to replace the triple to form a negative triplet, only replacing one entity in the triples each time so that the triples have contrast, marking the triples, and marking the positive triples as 1 and the negative triples as 0;
unlabeled triples: any unlabeled triple in the Chinese linked data set zhishi.me;
a dictionary: built from the Chinese linked data set zhishi.me; it covers all head entities, tail entities and relations, with entries of the form "character: serial number", the serial number being a number increasing from zero;
an entity dictionary: built from the Chinese linked data set zhishi.me, whose entity set is denoted by E; a dictionary formed from all head entities and tail entities, with entries of the form "entity name: serial number", the serial number being a number increasing from zero;
a relation dictionary: built from the Chinese linked data set zhishi.me; a dictionary formed from all relations, with entries of the form "relation name: serial number", the serial number being a number increasing from zero;
an entity pinyin matrix: in order to resolve polyphonic characters with different meanings, the Baidu translation API is called to obtain the entity pinyin and the entity pinyin matrix is constructed; the number of rows of the entity pinyin matrix equals the number of entities in the entity dictionary, and each row is an entity pinyin vector obtained by one-hot coding;
word embedding matrix: the number of rows of the word embedding matrix is consistent with the number of words in the dictionary, and each row of the word embedding matrix uses a word vector obtained by word2 vec;
word embedding matrix: the line number of the word embedding matrix is consistent with the number of entities in the entity dictionary, and each behavior of the word embedding matrix uses a word vector obtained by word2 vec;
describing the matrix: the line number of the description matrix is consistent with the number of entities in the entity dictionary, an encyclopedia API is called to obtain entity description information, the entity description information is input into a bidirectional long short-Term Memory network (Bi-directional Long short-Term Memory, BilSTM) to be coded to obtain an entity description vector, and the entity description vector introduces the entity description information and can solve the problem of Chinese synonym;
step 3) constructing the entity feature vector matrix
For each entity in the labeled triples, the entity pinyin vector, character vector, word vector and entity description vector first form the feature tensor of the entity; the feature tensors of all entities in the labeled triples are then converted into entity feature vectors, and the entity feature vector matrix is constructed following the order of the entity dictionary;
step 4) take a labeled triple T_l = <h, r, t> and obtain the feature vectors h_ft and t_ft of the head entity h and the tail entity t from the entity feature vector matrix. To express that entity h stands in relation r to entity t, i.e. h + r = t, the relation vector of the labeled triple T_l = <h, r, t> can be written as:
r = t_ft - h_ft
To measure the distance between entity h and entity t, the relation between entities is expressed as a vector translation, and the distance of a triple <h, r, t> is defined by the Euclidean distance:
d(h + r, t) = || h_ft + r - t_ft ||_2^2
where the subscript 2 denotes the 2-norm (the Euclidean norm) and the superscript 2 denotes squaring;
step 5) take all the labeled triples as the training set; initialize the entity vectors, namely the entity feature vector matrix, initialize the relation vectors and build the relation vector matrix in the same order as the relation dictionary, each relation being computed by the formula r = t_ft - h_ft; if several entity pairs share the same relation, the relation vector is the average of the difference vectors of those entity pairs, and after all relation vectors are initialized they are normalized, which improves precision and strengthens convergence;
step 6) randomly select a positive triple <h, r, t> from the training set, draw the negative triples <h', r, t> and <h, r, t'>, and pair each with <h, r, t> to form a training batch T_batch = [(<h, r, t>, <h', r, t>), (<h, r, t>, <h, r, t'>)]. Denote the positive triples by S_p = {<h, r, t>} and the negative triples by S_f = {<h', r, t> | h' ∈ E} ∪ {<h, r, t'> | t' ∈ E}. T_batch is used as the input of the knowledge graph representation learning model, and the model is trained on T_batch. Combining the distance formula d(h + r, t) = || h_ft + r - t_ft ||_2^2, the loss function of the knowledge graph representation learning model is defined as:
L = Σ_{<h,r,t> ∈ S_p} Σ_{<h',r,t'> ∈ S_f} [ γ + d(h + r, t) - d(h' + r, t') ]_+
where γ is a separation margin greater than 0, a hyperparameter, and [x]_+ denotes the positive-part function, i.e. [x]_+ = x when x > 0 and [x]_+ = 0 when x ≤ 0; this training scheme is called the margin-based ranking criterion, its aim being to separate positive triples from negative triples as far as possible and to find the maximum-margin support vectors;
step 7) update the parameters of the knowledge graph representation learning model by stochastic gradient descent (SGD); the gradient update only requires computing the distances d(h + r, t) and d(h' + r, t'). Given |E| entities and |R| relations, with entity vectors of dimension m and relation vectors of dimension n, a total of |E|·m + |R|·n parameters must be updated;
step 8) repeat steps 6) to 7) for iterative training; after the iterative training ends, use the knowledge graph representation learning model to predict relations for the unlabeled triples as follows: take any unlabeled triple <h, r, t>_unlabel and predict the relation r' between h and t with the model; if r' = r, the prediction is correct. The correctly predicted triples are then taken as positive triples, their head or tail entities are randomly replaced to form negative triples, and the new positive and negative triples are merged into the original labeled triples to form a new labeled set;
step 9) repeat steps 4) to 8) with the new labeled triples for iterative training until no new unlabeled triples can be learned, which indicates that the knowledge graph representation learning model can learn no further Chinese knowledge features from the current training set; the entity vectors and relation vectors output by the model are then the best Chinese knowledge graph representation for the Chinese linked data set zhishi.me.
2. The feature tensor-based Chinese knowledge graph representation learning method of claim 1, characterized in that the feature tensor of an entity in step 3) is constructed as follows: denote a labeled triple by T_l, the entity set of the knowledge graph by E and the relation set by R; arbitrarily select an entity e ∈ E from a labeled triple and look up the entity pinyin matrix to obtain the pinyin vector e_p of entity e; let the entity name be c_1 c_2 ... c_m, where c_m denotes the m-th character forming the entity name, and look the characters up in the character embedding matrix to obtain the character vector e_c = c_1 c_2 ... c_m of entity e; look up the word embedding matrix to obtain the word vector e_w of entity e; look up the description matrix to obtain the description vector e_d of entity e; the feature tensor of entity e is then expressed as
FeatureTensor(e) = (e_p, e_c, e_w, e_d).
3. The feature tensor-based Chinese knowledge graph representation learning method of claim 2, characterized in that the feature tensor of an entity is converted into the feature vector of the entity in step 3) as follows: the different dimensions of the feature tensor of the entity are connected by vector concatenation, and dropout randomly zeroes entries of the vector to prevent the learned knowledge representation from overfitting; applying this to the pinyin vector e_p, character vector e_c, word vector e_w and description vector e_d of the same entity e yields the feature vector e_ft = [e_p; e_c; e_w; e_d] of entity e.
4. The feature tensor-based Chinese knowledge graph representation learning method of any one of claims 1 to 3, characterized in that link prediction is adopted as the evaluation task, with the evaluation metrics being the mean rank MR, the mean reciprocal rank MRR, and the proportion of test cases in which the correct entity ranks within the top ten (Hits@10), the top three (Hits@3) or first (Hits@1), where a smaller MR is better and larger values of MRR, Hits@10, Hits@3 and Hits@1 are better; some entities or relations of triples in the Chinese linked data set zhishi.me are removed at random, and link prediction refers to predicting the randomly removed entities or relations in the triples.
CN201911300781.5A 2019-12-17 2019-12-17 Chinese knowledge graph representation learning method based on feature tensor Active CN111160564B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911300781.5A CN111160564B (en) 2019-12-17 2019-12-17 Chinese knowledge graph representation learning method based on feature tensor

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911300781.5A CN111160564B (en) 2019-12-17 2019-12-17 Chinese knowledge graph representation learning method based on feature tensor

Publications (2)

Publication Number Publication Date
CN111160564A true CN111160564A (en) 2020-05-15
CN111160564B CN111160564B (en) 2023-05-19

Family

ID=70557605

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911300781.5A Active CN111160564B (en) 2019-12-17 2019-12-17 Chinese knowledge graph representation learning method based on feature tensor

Country Status (1)

Country Link
CN (1) CN111160564B (en)

Patent Citations (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100057762A1 (en) * 2008-09-03 2010-03-04 Hamid Hatami-Hanza System and Method of Ontological Subject Mapping for Knowledge Processing Applications
US20120131008A1 * 2010-11-23 2012-05-24 Microsoft Corporation Identifying referring expressions for concepts
CN106886543A (en) * 2015-12-16 2017-06-23 清华大学 The knowledge mapping of binding entity description represents learning method and system
US20170193390A1 (en) * 2015-12-30 2017-07-06 Facebook, Inc. Identifying Entities Using a Deep-Learning Model
US9705908B1 (en) * 2016-06-12 2017-07-11 Apple Inc. Emoji frequency detection and deep link frequency
JP2018010543A (en) * 2016-07-15 2018-01-18 株式会社トヨタマップマスター Notation fluctuation glossary creation device, retrieval system, methods thereof, computer program thereof and recording medium recording computer program thereof
CN106528610A (en) * 2016-09-28 2017-03-22 厦门理工学院 Knowledge graph representation learning method based on path tensor decomposition
US20190180154A1 (en) * 2017-12-13 2019-06-13 Abbyy Development Llc Text recognition using artificial intelligence
CN108509483A (en) * 2018-01-31 2018-09-07 北京化工大学 The mechanical fault diagnosis construction of knowledge base method of knowledge based collection of illustrative plates
US20190354810A1 (en) * 2018-05-21 2019-11-21 Astound Ai, Inc. Active learning to reduce noise in labels
CN108829865A (en) * 2018-06-22 2018-11-16 海信集团有限公司 Information retrieval method and device
CN109522465A (en) * 2018-10-22 2019-03-26 国家电网公司 The semantic searching method and device of knowledge based map
CN109740168A (en) * 2019-01-09 2019-05-10 北京邮电大学 A kind of classic of TCM ancient Chinese prose interpretation method based on knowledge of TCM map and attention mechanism
CN109933307A (en) * 2019-02-18 2019-06-25 杭州电子科技大学 A kind of intelligent controller machine learning algorithm modular form description and packaging method based on ontology
CN110377755A (en) * 2019-07-03 2019-10-25 江苏省人民医院(南京医科大学第一附属医院) Reasonable medication knowledge map construction method based on medicine specification
CN110334219A (en) * 2019-07-12 2019-10-15 电子科技大学 The knowledge mapping for incorporating text semantic feature based on attention mechanism indicates learning method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Feng Lina et al.: "Analysis of Confucian thought in the Yanshi Jiaxun based on word-frequency statistics", Library (《图书馆》) *
Wu Yunbing et al.: "Knowledge graph reasoning algorithm based on path tensor decomposition", Pattern Recognition and Artificial Intelligence (《模式识别与人工智能》) *
Ma Huizhu et al.: "Research directions and keywords of computer-aided project acceptance: the 2012 acceptance situation and notes for 2013", Journal of Electronics & Information Technology (《电子与信息学报》) *

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112100398B (en) * 2020-08-31 2021-09-14 清华大学 Patent blank prediction method and system
CN112100398A (en) * 2020-08-31 2020-12-18 清华大学 Patent blank prediction method and system
WO2022069958A1 * 2020-09-29 2022-04-07 International Business Machines Corporation Automatic knowledge graph construction
CN112463976A (en) * 2020-09-29 2021-03-09 东南大学 Knowledge graph construction method taking crowd sensing task as center
GB2613999A (en) * 2020-09-29 2023-06-21 Ibm Automatic knowledge graph construction
CN112463976B (en) * 2020-09-29 2024-05-24 东南大学 Knowledge graph construction method taking crowd sensing task as center
CN113051904A (en) * 2021-04-21 2021-06-29 东南大学 Link prediction method for small-scale knowledge graph
CN113742488A (en) * 2021-07-30 2021-12-03 清华大学 Embedded knowledge graph completion method and device based on multitask learning
CN113742488B (en) * 2021-07-30 2022-12-02 清华大学 Embedded knowledge graph completion method and device based on multitask learning
CN113963748A (en) * 2021-09-28 2022-01-21 华东师范大学 Protein knowledge map vectorization method
CN113963748B (en) * 2021-09-28 2023-08-18 华东师范大学 Protein knowledge graph vectorization method
CN114416941A (en) * 2021-12-28 2022-04-29 北京百度网讯科技有限公司 Generation method and device of dialogue knowledge point determination model fusing knowledge graph
CN114416941B (en) * 2021-12-28 2023-09-05 北京百度网讯科技有限公司 Knowledge graph-fused dialogue knowledge point determination model generation method and device
CN114861665A (en) * 2022-04-27 2022-08-05 北京三快在线科技有限公司 Method and device for training reinforcement learning model and determining data relation
CN114861665B (en) * 2022-04-27 2023-01-06 北京三快在线科技有限公司 Method and device for training reinforcement learning model and determining data relation

Also Published As

Publication number Publication date
CN111160564B (en) 2023-05-19

Similar Documents

Publication Publication Date Title
CN111160564B (en) Chinese knowledge graph representation learning method based on feature tensor
CN110309267B (en) Semantic retrieval method and system based on pre-training model
Guu et al. Retrieval augmented language model pre-training
CN109299341A Adversarial cross-modal retrieval method and system based on dictionary learning
CN112182245B (en) Knowledge graph embedded model training method and system and electronic equipment
Wang et al. Facilitating image search with a scalable and compact semantic mapping
Ju et al. An efficient method for document categorization based on word2vec and latent semantic analysis
WO2017193685A1 (en) Method and device for data processing in social network
CN111881292B (en) Text classification method and device
CN109284414B (en) Cross-modal content retrieval method and system based on semantic preservation
CN111274790A (en) Chapter-level event embedding method and device based on syntactic dependency graph
CN111581364B (en) Chinese intelligent question-answer short text similarity calculation method oriented to medical field
CN113515632A (en) Text classification method based on graph path knowledge extraction
Lu et al. Image annotation by semantic sparse recoding of visual content
CN114743029A (en) Image text matching method
Ding et al. A Knowledge-Enriched and Span-Based Network for Joint Entity and Relation Extraction.
CN116720519B Miao medicine named entity recognition method
CN110674293B (en) Text classification method based on semantic migration
Perdana et al. Instance-based deep transfer learning on cross-domain image captioning
Deng Large scale visual recognition
CN116341515A (en) Sentence representation method of dynamic course facing contrast learning
Fan et al. Large margin nearest neighbor embedding for knowledge representation
CN110633363B (en) Text entity recommendation method based on NLP and fuzzy multi-criterion decision
Cui et al. A new Chinese text clustering algorithm based on WRD and improved K-means
CN111881689A (en) Method, system, device and medium for processing polysemous word vector

Legal Events

Code Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant