CN111400503B - Knowledge graph generation method based on multiple indexes - Google Patents

Knowledge graph generation method based on multiple indexes Download PDF

Info

Publication number
CN111400503B
CN111400503B CN202010126582.3A CN202010126582A CN111400503B CN 111400503 B CN111400503 B CN 111400503B CN 202010126582 A CN202010126582 A CN 202010126582A CN 111400503 B CN111400503 B CN 111400503B
Authority
CN
China
Prior art keywords
entity
attribute
relation
confidence
matching
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010126582.3A
Other languages
Chinese (zh)
Other versions
CN111400503A (en
Inventor
何宇轩
牟昊
徐亚波
李旭日
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Datastory Information Technology Co ltd
Original Assignee
Guangzhou Datastory Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Datastory Information Technology Co ltd filed Critical Guangzhou Datastory Information Technology Co ltd
Priority to CN202010126582.3A priority Critical patent/CN111400503B/en
Publication of CN111400503A publication Critical patent/CN111400503A/en
Application granted granted Critical
Publication of CN111400503B publication Critical patent/CN111400503B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • G06Q10/06393Score-carding, benchmarking or key performance indicator [KPI] analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Theoretical Computer Science (AREA)
  • Development Economics (AREA)
  • Educational Administration (AREA)
  • Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Strategic Management (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Quality & Reliability (AREA)
  • Operations Research (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Game Theory and Decision Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a knowledge graph generation method based on multiple indexes, which comprises the following steps: when a newly appeared entity in the result is input, the processing scheme of the entity is output through three indexes of editing distance, text sentence vector and sound volume. When a new relation appears in the input result, judging the processing scheme of the relation through the co-occurrence sound quantity and the relation sound quantity of the corresponding entity. When new entity attributes appear in the input result, judging the processing scheme of the entity attributes according to the editing distance and sound volume of the attributes. When new relation attributes appear in the input result, judging the processing scheme of the relation attributes through the editing distance and sound volume of the relation attributes. The method has good effect on the specific field with less data, improves the quality of the knowledge graph in the specific field under the condition of little or no manual participation, has high accuracy of identifying input information and updating the knowledge graph, and can improve the accuracy of automatically generating the knowledge graph by knowledge.

Description

Knowledge graph generation method based on multiple indexes
Technical Field
The invention relates to the technical field of computer text processing, in particular to a method for generating a knowledge graph.
Background
Knowledge maps are common knowledge visualization and storage tools. Because of the complexity and diversity of knowledge, a great deal of manpower is required to make knowledge maps. The automatically generated knowledge graph is generally only aimed at the field with a large amount of data, and has poor effect on the specific field with less data. Therefore, a method is needed to improve the quality of knowledge maps in specific fields with little or no human involvement.
Disclosure of Invention
The invention provides a knowledge graph generation method based on multiple indexes, which aims at the problems and specifically comprises the following steps of:
s001, defining a database data structure, wherein the defined data structure comprises four types of entities, relations, entity attributes and relation attributes; the entity at least comprises three attributes of a name, a name and a document ID; the relation is a directed link between two entities, the link starts from a starting entity, points to an ending entity and at least comprises a name attribute; the entity attribute corresponds to a specific entity and is key value pair information in the corresponding entity; the relation attribute corresponds to a specific relation and is key value pair information in the corresponding relation;
s002, inputting information; the information is one or more of an entity, a relationship, an entity attribute and a relationship attribute;
s003, matching the input information one by one, wherein the step S007 is directly executed after the matching is successful, and the step S004 is executed after the matching is failed;
s004, information matching: according to the data structure type of the matching failure information, a processing scheme is correspondingly generated;
s005, calculating the confidence coefficient of each processing scheme by using the multi-index parameters;
s006, selecting a processing scheme of the matching failure information according to the confidence coefficient;
s007, updating the data of the database with the successfully matched input information or the selected processing scheme, namely updating the knowledge graph, and starting from the step S002 when the information is input again and the knowledge graph is updated subsequently.
As a further explanation of the present invention, the input in step S002 is that the entity, the relationship, the entity attribute and the relationship attribute information are obtained by manual labeling or data model prediction.
Still further, the steps S002 to S003 include an information filtering step, in which the input information with the sound volume smaller than the sound volume threshold is filtered by the sound volume parameter of the input information and the set sound volume threshold.
Further, in the information matching in step S003, the processing schemes correspondingly generated according to the input information types are different, and the method for calculating the confidence of the corresponding processing scheme in the subsequent step S005 is also different.
Further, when the input information type is an entity and the information matching fails, the correspondingly generated processing scheme comprises four types of fusion into a certain database entity, fusion into a certain new entity, newly added entity and discarding; the confidence coefficient of the two processing schemes fused into a certain database entity and a certain new entity is calculated by three indexes of editing distance, text sentence vector and sound volume, and the calculation formula is as follows: confidence = (sound volume index + edit distance index + sentence vector index)/3.
Further, when the input information type is a relation and the information matching fails, the correspondingly generated processing scheme comprises a newly added relation and a discarded relation; the confidence of the new relation processing mode is calculated by two indexes of co-occurrence sound quantity and relation sound quantity of the starting entity and the ending entity, and the calculation formula is as follows: confidence = (co-occurrence sound volume index + sound volume index)/2.
Further, when the input information type is entity attribute and the information matching fails, the correspondingly generated processing scheme comprises correction or new attribute and discarding two types; the confidence of the modified or newly added attribute processing mode is calculated by two indexes of editing distance and sound volume of the entity attribute, and the calculation formula is as follows: confidence = (edit distance index+sound volume index)/2.
Further, when the input information type is a relation attribute and the information matching fails, the correspondingly generated processing scheme comprises correction or new attribute and discard two types; the confidence of the modified or newly added attribute processing mode is calculated by two indexes of editing distance and sound volume of the entity attribute, and the calculation formula is as follows: confidence = (edit distance index+sound volume index)/2.
Still further, the manner of selecting the processing scheme in step S006 includes manual selection and machine-automated selection.
Still further, the machine automatically performing the selection includes inputting a confidence threshold, automatically performing the processing scheme with the greatest confidence when the scheme with the greatest confidence is greater than the confidence threshold among all the categories of processing schemes, and otherwise selecting the discarding.
The invention has the beneficial effects that:
the knowledge graph generation method based on multiple indexes has good effect on the specific field with less data, and the method for improving the quality of the knowledge graph in the specific field under the condition of little or no manual participation has high accuracy of identifying input information and updating the knowledge graph, can improve the accuracy of automatically generating the knowledge graph by knowledge, and can reduce the manual workload when manual intervention is needed.
Drawings
FIG. 1 is an overall flow chart of the method of the present invention;
FIG. 2 is an example of a knowledge-graph database structure according to the present invention;
FIG. 3 is a flowchart of a method for generating a knowledge graph entity according to the present invention;
FIG. 4 is a flowchart of a knowledge graph relationship generation method according to the present invention;
FIG. 5 is a flowchart of a method for generating the entity attribute of the knowledge graph of the present invention;
FIG. 6 is a flowchart of a method for generating knowledge graph relationship attributes according to the present invention.
Detailed Description
The following detailed description of specific embodiments of the invention, taken in conjunction with the accompanying drawings, will be readily apparent that the embodiments described are merely some, but not all embodiments of the invention.
The overall flowchart of the knowledge graph generation method based on multiple indexes as shown in fig. 1 comprises the following steps:
s001, defining a database data structure. The database may be a null database or a non-null database. If the database is not empty, each entity should contain at least three attributes, namely name, name and document ID. The name is the name most representative of the entity, and is otherwise referred to as the name of the entity, and the document ID is a list of IDs of documents in which the entity appears. The relationship refers to a directed link between two entities, the link starting from a starting entity and pointing to an ending entity. The relationship contains at a minimum name attributes. Fig. 2 is an example of this data structure.
S002, inputting entity, relationship, entity attribute and relationship attribute information.
Wherein a relationship must correspond to a particular starting entity and a particular ending entity, an entity attribute must correspond to a particular entity, and a relationship attribute must correspond to a particular relationship.
The information can be derived from manual arrangement or from an algorithm recognition result.
Wherein the input information should contain a list of IDs of the documents in which the information is located.
Table 1 is a specific example of input of entities, relationships, entity attributes, and relationship attribute information in this embodiment.
Table 1 input samples:
s003, information with too low sound volume is filtered. The volume is the number of occurrences of an entity, a relationship, an entity attribute, or a relationship attribute in the original document. And respectively inputting sound volume thresholds of the entity, the relation, the entity attribute and the relation attribute, filtering out input information with sound volume smaller than the sound volume thresholds, and not performing any processing.
S004, an information matching and processing scheme generating flow comprises four matching processing flows of entities, relations, entity attributes and relation attributes, and flow charts of the four matching processing flows respectively correspond to those shown in figures 3-6. Specifically:
the entity matching process flow comprises entity matching and entity processing scheme generation.
Entity matching: all input entities are processed one by one. First, it is confirmed whether the entity exists in the database. The specific method is to search the database, if the name or name of only one entity in the database is equal to the input entity, the matching is successful, which indicates that the entity exists in the database, otherwise, the entity does not exist in the database, and the input entity is a new entity.
For the entry in Table 1 and the database in FIG. 2, the entity "Ming" matches successfully, and the entity "Ming" does not.
If the input entity is successfully matched, the document ID of the input entity is added into the document ID of the corresponding entity of the database.
If the input entity fails to match, the input entity is a new entity, and the next operation is performed.
Generating an entity processing scheme: for new entities, there are 4 types of processing schemes, respectively: merging into a certain database entity, merging into a certain new entity, adding an entity newly and discarding. The first two types of schemes may contain more than one processing scheme. The different processing schemes require separate calculations.
Where "fuse into a database entity," such schemes require traversing all database entities to calculate the confidence level fused into them.
The confidence index consists of an average of three indices
Confidence = (sound volume index + edit distance index + sentence vector index)/3 (1)
In the formula (1), the sound volume index, the edit distance index, and the sentence vector index are calculated by:
sound volume index = input entity sound volume/total number of documents (2)
In the formula (2), the total number of documents is the total number of documents involved in the information input at this time.
In the formula (3), the names and the names of the database entities are respectively calculated, and the maximum value is taken as an editing distance index
Sentence vector index = cosine similarity (text sentence vector where input entity is located, text sentence vector where database entity is located) (4)
In the formula (4), the text where the entity is can be found through the text ID, the texts are combined, and then the sentence vector is calculated, so that the result of the formula (4) can be obtained.
Where "fuse into a new entity," such a scheme requires traversing all known new entities, calculating the confidence level fused into them. The method of calculating the confidence is consistent with the class method of fusing into a certain database entity.
Wherein, the new entity is only one processing scheme, which is to add an entity named as an input entity name in the database, the confidence of the scheme is 1- ("fused into a certain database entity" the maximum value of the confidence in the class scheme)
Where "discard" such methods involve only one processing scheme that does not output confidence in order to discard the input entity.
The relationship matching process flow comprises relationship matching and relationship processing scheme generation.
Relationship matching: and processing all the input relations one by one, firstly, confirming whether the relation exists in the database, if the initial entity and the end entity of the input relation are successfully matched, and meanwhile, the relation exists between the initial entity and the end entity in the database, which indicates that the relation exists in the database, otherwise, the relation does not exist in the database, and the input relation is a new relation.
For the entry in Table 1 and the database in FIG. 2, the relationship "sibling" from Ming to Pink matches successfully, and the relationship "friend" from Ming to Pink does not match successfully.
If the input relation fails to match, the input relation is a new relation, and the next operation is performed.
And (3) generating a relation processing scheme: for new relationships, there are 2 types of processing schemes, new relationships, obsolete.
Wherein, the confidence index of the scheme is composed of two index average values:
confidence = (co-occurrence sound volume index + sound volume index)/2 (5)
In the formula (5), the co-occurrence sound volume index is calculated by:
the sound volume index is calculated by
Where "discard" such methods involve only one processing scheme that does not output confidence in order to discard the input entity.
The entity attribute matching process flow comprises entity attribute matching and entity attribute processing scheme generation.
Entity attribute matching: processing all the input entity attributes one by one, firstly confirming whether the entity attributes exist in the database, if the matching of the entity corresponding to the input entity attributes is successful, and if the entity exists in the database, the matching is successful, otherwise, the matching is failed.
Wherein the attribute is a key-value pair consisting of two parts, namely an attribute key and an attribute value.
For the input to table 1 and the database in fig. 2, the entity attribute "height: 170cm "failed the match.
If the matching fails, the input entity attribute is a new entity attribute, and the next operation is performed.
Generating an entity attribute processing scheme: the new entity attributes are played by two types of processing schemes, namely correction or new attribute and discard.
Wherein, the confidence index of the scheme is composed of two index average values:
confidence = (edit distance index+sound volume index)/2 (8)
In the formula (8), the edit distance index is calculated by:
in the formula (9), the entities with the same attribute keys in the database are found, the entities are respectively calculated, and the maximum value is taken as an editing distance index.
The sound volume index is calculated by
Where "discard" such methods involve only one processing scheme that does not output confidence in order to discard the input entity attributes.
The relationship attribute matching process flow comprises relationship attribute matching and relationship attribute processing scheme generation.
Relationship attribute matching: processing all the input relation attributes one by one, firstly confirming whether the relation attributes exist in the database, if the corresponding relation of the input relation attributes is successfully matched, and if the relation exists in the database, the matching is successful, otherwise, the matching is failed.
For the entry into table 1 and the database in fig. 2, the entity attribute "affinity: low match failure.
If the matching fails, the input relationship attribute is a new relationship attribute, and the next operation is performed.
Generating a relation attribute processing scheme: for new relationship attributes, there are two types of processing schemes, correction or new attributes and discard.
Wherein, the confidence index of the scheme is composed of two index average values:
confidence = (edit distance index+sound volume index)/2 (8)
In the formula (8), the edit distance index is calculated by:
in the formula (9), the relation of the same attribute key in the database is found, the relation is calculated respectively, and the maximum value is taken as an editing distance index.
The sound volume index is calculated by
Among these, "discard" such methods have only one processing scheme that does not output confidence in order to discard the input relationship attributes.
S005, selecting a processing scheme, wherein the processing scheme comprises two modes of manual selection and automatic machine execution selection. If manual processing is selected, the schemes of each input are selected according to the confidence level and personal experience of the schemes. If the scheme with the highest confidence is automatically executed by the machine, a confidence threshold is input, if the scheme with the highest confidence is larger than the confidence threshold in all the processing schemes, the processing scheme with the highest confidence is automatically executed, otherwise, discarding is selected.
Wherein the selection must be in the order of entity, relationship, entity attribute, relationship attribute.
In the process of selecting the processing scheme, when the processing scheme of the starting entity or the ending entity corresponding to the relation is selected to be fused to a certain entity, the starting entity or the ending entity corresponding to the relation is changed. When an entity corresponding to an entity attribute is selected to be fused to a certain entity, the entity corresponding to the entity attribute is changed accordingly. When the relationship corresponding to the relationship attribute is selected to be fused to a certain relationship, the relationship corresponding to the relationship attribute is changed accordingly.
S006, updating the knowledge graph, and modifying the knowledge graph according to the processing scheme. When an input entity is selected to be fused to a certain entity scheme, the name of the input entity is added to the unique attribute of the database entity, and the document ID of the input entity is added to the document ID attribute. When the input entity is selected to the newly added entity scheme, the entity information is input to the newly created entity in the database. When the input relationship is selected to the new relationship schema, a relationship is created in the database. When an input entity attribute is selected to modify the current attribute schema, the attribute is modified or newly created in the database counterpart entity. When the input relation attribute is selected to modify the current attribute scheme, the attribute is modified or newly established in the corresponding relation of the database.
The foregoing is illustrative of the preferred embodiments of the present invention, and is not to be construed as limiting the claims. The invention is not limited to the above embodiments, the specific construction of which is susceptible to variations, in any case all of which are within the scope of the invention as defined in the independent claims.

Claims (9)

1. The knowledge graph generation method based on the multiple indexes is characterized by comprising the following steps of:
s001, defining a database data structure, wherein the defined data structure comprises four types of entities, relations, entity attributes and relation attributes; the entity at least comprises three attributes of a name, a name and a document ID; the relation is a directed link between two entities, the link starts from a starting entity, points to an ending entity and at least comprises a name attribute; the entity attribute corresponds to a specific entity and is key value pair information in the corresponding entity; the relation attribute corresponds to a specific relation and is key value pair information in the corresponding relation;
s002, inputting information; the information is one or more of an entity, a relationship, an entity attribute and a relationship attribute;
s003, matching the input information one by one, wherein the step S007 is directly executed after the matching is successful, and the step S004 is executed after the matching is failed;
s004, information matching: according to the data structure type of the matching failure information, a processing scheme is correspondingly generated; the method comprises four matching processing flows of entities, relations, entity attributes and relation attributes, wherein:
the entity matching processing flow comprises entity matching and entity processing scheme generation;
the relation matching processing flow comprises relation matching and relation processing scheme generation;
the entity attribute matching processing flow comprises entity attribute matching and entity attribute processing scheme generation;
the relationship attribute matching processing flow comprises relationship attribute matching and relationship attribute processing scheme generation;
s005, calculating the confidence coefficient of each processing scheme by using the multi-index parameters;
s006, selecting a processing scheme of the matching failure information according to the confidence coefficient; the method comprises two modes of manual selection and automatic machine execution selection, and the selection is performed according to the sequence of entities, relations, entity attributes and relation attributes; if manual processing is selected, selecting each input scheme according to the confidence level of the scheme and personal experience; if the scheme with the highest confidence is automatically executed by the machine, a confidence threshold is input, if the scheme with the highest confidence is larger than the confidence threshold in all the processing schemes, the processing scheme with the highest confidence is automatically executed, otherwise, discarding is selected;
s007, updating the data of the database with the successfully matched input information or the selected processing scheme, namely updating the knowledge graph, and starting from the step S002 when the information is input again and the knowledge graph is updated subsequently.
2. The multi-index-based knowledge-graph generation method according to claim 1, wherein: in the step S002, the entity, the relationship, the entity attribute and the relationship attribute information are input and obtained by manual labeling or data model prediction.
3. The multi-index-based knowledge-graph generation method according to claim 1, wherein: and the step S002 to the step S003 also comprise an information filtering step, wherein the information is filtered through the sound volume parameter of the input information and the set sound volume threshold value, and the input information with the sound volume smaller than the sound volume threshold value is filtered.
4. The multi-index-based knowledge-graph generation method according to claim 1, wherein: in the information matching in step S003, the processing schemes correspondingly generated according to the input information types are different, and the method for calculating the confidence of the corresponding processing scheme in the subsequent step S005 is also different.
5. The multi-index-based knowledge-graph generation method according to claim 4, wherein: when the input information type is an entity and the information matching fails, the correspondingly generated processing scheme comprises four types of fusion into a certain database entity, fusion into a certain new entity, newly added entity and discarding; the confidence coefficient of the two processing schemes fused into a certain database entity and a certain new entity is calculated by three indexes of editing distance, text sentence vector and sound volume, and the calculation formula is as follows: confidence = (sound volume index + edit distance index + sentence vector index)/3.
6. The multi-index-based knowledge-graph generation method according to claim 4, wherein: when the input information types are relations and the information matching fails, the corresponding generated processing scheme comprises a newly added relation and a discarded relation; the confidence of the new relation processing mode is calculated by two indexes of co-occurrence sound quantity and relation sound quantity of the starting entity and the ending entity, and the calculation formula is as follows: confidence = (co-occurrence sound volume index + sound volume index)/2.
7. The multi-index-based knowledge-graph generation method according to claim 4, wherein: when the input information type is entity attribute and the information matching fails, the correspondingly generated processing scheme comprises two types of correction or new attribute and discard; the confidence of the modified or newly added attribute processing mode is calculated by two indexes of editing distance and sound volume of the entity attribute, and the calculation formula is as follows: confidence = (edit distance index+sound volume index)/2.
8. The multi-index-based knowledge-graph generation method according to claim 4, wherein: when the input information type is a relation attribute and the information matching fails, the correspondingly generated processing scheme comprises two types of correction or new attribute and discard; the confidence of the modified or newly added attribute processing mode is calculated by two indexes of editing distance and sound volume of the entity attribute, and the calculation formula is as follows: confidence = (edit distance index+sound volume index)/2.
9. The multi-index-based knowledge-graph generation method according to claim 1, wherein: the machine automatically executes selection, which comprises inputting a confidence threshold, automatically executing the processing scheme with the largest confidence when the scheme with the largest confidence is larger than the confidence threshold in all the processing schemes, otherwise, selecting discarding.
CN202010126582.3A 2020-02-28 2020-02-28 Knowledge graph generation method based on multiple indexes Active CN111400503B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010126582.3A CN111400503B (en) 2020-02-28 2020-02-28 Knowledge graph generation method based on multiple indexes

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010126582.3A CN111400503B (en) 2020-02-28 2020-02-28 Knowledge graph generation method based on multiple indexes

Publications (2)

Publication Number Publication Date
CN111400503A CN111400503A (en) 2020-07-10
CN111400503B true CN111400503B (en) 2023-09-22

Family

ID=71428498

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010126582.3A Active CN111400503B (en) 2020-02-28 2020-02-28 Knowledge graph generation method based on multiple indexes

Country Status (1)

Country Link
CN (1) CN111400503B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106355627A (en) * 2015-07-16 2017-01-25 中国石油化工股份有限公司 Method and system used for generating knowledge graphs
CN107391677A (en) * 2017-07-21 2017-11-24 深圳狗尾草智能科技有限公司 Carry the generation method and device of the Universal Chinese character knowledge mapping of entity-relationship-attribute

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106355627A (en) * 2015-07-16 2017-01-25 中国石油化工股份有限公司 Method and system used for generating knowledge graphs
CN107391677A (en) * 2017-07-21 2017-11-24 深圳狗尾草智能科技有限公司 Carry the generation method and device of the Universal Chinese character knowledge mapping of entity-relationship-attribute

Also Published As

Publication number Publication date
CN111400503A (en) 2020-07-10

Similar Documents

Publication Publication Date Title
TWI664540B (en) Search word error correction method and device, and weighted edit distance calculation method and device
CN105608218B (en) The method for building up of intelligent answer knowledge base establishes device and establishes system
CN102867040B (en) Chinese search engine mixed speech-oriented query error correction method and system
JP4580885B2 (en) Scene information extraction method, scene extraction method, and extraction apparatus
CN110276071B (en) Text matching method and device, computer equipment and storage medium
CN110390017B (en) Target emotion analysis method and system based on attention gating convolutional network
CN103049458B (en) A kind of method and system revising user thesaurus
CN109408578B (en) Monitoring data fusion method for heterogeneous environment
CN114791958B (en) Zero sample cross-modal retrieval method based on variational self-encoder
CN104485106B (en) Audio recognition method, speech recognition system and speech recognition apparatus
CN116431837B (en) Document retrieval method and device based on large language model and graph network model
CN112749272A (en) Intelligent new energy planning text recommendation method for unstructured data
CN113986968A (en) Scheme intelligent proofreading method based on electric power standard standardization datamation
CN113360582A (en) Relation classification method and system based on BERT model fusion multi-element entity information
CN112395425A (en) Data processing method and device, computer equipment and readable storage medium
CN111782798B (en) Digest generation method, digest generation device, digest generation equipment and project management method
CN112181949A (en) Online data modeling method and device
CN116468009A (en) Article generation method, apparatus, electronic device and storage medium
CN111400503B (en) Knowledge graph generation method based on multiple indexes
CN114332519A (en) Image description generation method based on external triple and abstract relation
CN110032642B (en) Modeling method of manifold topic model based on word embedding
CN115544033B (en) Method, device, equipment and medium for updating check repeat vector library and checking repeat data
WO2022156450A1 (en) Knowledge base query method and apparatus, computer device, and storage medium
Li et al. Semi-siamese network for content-based video relevance prediction
CN117093196B (en) Knowledge graph-based programming language generation method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant