CN111209412A - Method for building knowledge graph of periodical literature by cyclic updating iteration - Google Patents

Method for building knowledge graph of periodical literature by cyclic updating iteration Download PDF

Info

Publication number
CN111209412A
CN111209412A CN202010084144.5A CN202010084144A CN111209412A CN 111209412 A CN111209412 A CN 111209412A CN 202010084144 A CN202010084144 A CN 202010084144A CN 111209412 A CN111209412 A CN 111209412A
Authority
CN
China
Prior art keywords
entity
extraction
ontology
dictionary
relation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010084144.5A
Other languages
Chinese (zh)
Other versions
CN111209412B (en
Inventor
吕强
段飞虎
蔡陨
谢一鸣
胡磊
冯自强
张宏伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tongfang Knowledge Network Digital Publishing Technology Co ltd
Original Assignee
Tongfang Knowledge Network Digital Publishing Technology Co ltd
Tongfang Knowledge Network Beijing Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tongfang Knowledge Network Digital Publishing Technology Co ltd, Tongfang Knowledge Network Beijing Technology Co ltd filed Critical Tongfang Knowledge Network Digital Publishing Technology Co ltd
Priority to CN202010084144.5A priority Critical patent/CN111209412B/en
Publication of CN111209412A publication Critical patent/CN111209412A/en
Application granted granted Critical
Publication of CN111209412B publication Critical patent/CN111209412B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)

Abstract

The invention discloses a method for constructing a knowledge graph of journal documents by cyclic updating iteration, which comprises the steps of designing a conceptual model, defining an ontology structure of the knowledge graph of the journal documents, and defining an ontology, a relation attribute of the ontology and a data attribute in the ontology; managing word lists and corpora, wherein the word lists are divided into subject word lists and relational word lists, and the corpus is divided into a text library and a sentence library and relates to the corpora of multiple sources; based on a deep learning labeling, training, recognition and calibration entity relationship extraction model, adopting a deep learning entity relationship extraction technology in combination with a dictionary and a corpus to perform entity extraction and relationship extraction, and updating iteration; performing corpus attribute extraction through an ontology structure defined by concept design and introducing a template; checking and disambiguating the results of the entity identification and the relationship extraction, and performing entity disambiguation on the results of the attribute extraction; and storing the recognition result into the knowledge graph, updating the subject dictionary, the relation dictionary and the training model at random, and recognizing the material by using the new dictionary and model to realize the circulation iteration updating and construct the knowledge graph.

Description

Method for building knowledge graph of periodical literature by cyclic updating iteration
Technical Field
The invention relates to the technical field of natural language processing and computer information processing, in particular to a periodical update iterative journal literature knowledge graph construction method.
Background
The conventional knowledge graph is a huge and networked knowledge system constructed by taking a 'semantic network' as a framework and aims to describe concepts, entities, events and relationships among the concepts, the entities and the events in an objective world. The concept means that people form conceptualized representation of objective objects in the process of knowing the world. The key technology of the knowledge graph relates to multiple fields of natural language processing, data mining, information retrieval and the like, is mainly divided into two types of knowledge driving and data driving, and is widely applied along with the development of big data, such as laws, social networks, medical knowledge graphs and the like.
The key technology for establishing the knowledge graph comprises an entity and relationship extraction technology, a knowledge fusion technology, an entity link technology and a knowledge inference technology, wherein the knowledge graph establishment comprises related technologies of all links from a data source to an application and the like. However, the main focus of the current knowledge graph construction lies in enriching and optimizing graph content links such as entity relationship extraction and semantic analysis, and deep exploration is not performed on the construction process. Especially, the updating iteration and calibration of the knowledge graph do not have the system specification, so that the closed loop is achieved, and the intellectualization and automation of the knowledge graph construction are really realized.
Disclosure of Invention
In order to solve the technical problems, the invention aims to provide a periodical update iterative journal literature knowledge graph construction method, which is based on the automatic knowledge graph construction, takes a journal literature base of the HowNet as a data source, organically combines a plurality of knowledge graph construction modules such as concept design, dictionary management, corpus management, model training, knowledge element extraction, entity disambiguation and the like, and forms a closed loop by updating iteration and continuously optimizing the accuracy of knowledge graph and training so as to really realize the intelligent periodical update iterative journal literature knowledge graph construction.
The purpose of the invention is realized by the following technical scheme:
a method for building a knowledge graph of periodical literature with iteration circularly updated comprises the following steps:
designing a concept model, defining an ontology structure of a knowledge graph of journal literature, wherein the ontology structure comprises a definition ontology, a relation attribute of the ontology and a data attribute inside the ontology;
b, managing a word list and a corpus, wherein the word list is divided into a subject word list and a relational word list, and the corpus is divided into a text library and a sentence library and relates to the corpora of a plurality of sources;
c, based on a deep learning labeling, training, recognition and calibration entity relation extraction model, adopting a deep learning entity relation extraction technology to combine a dictionary and a corpus, performing entity extraction and relation extraction, and updating iteration;
d, performing corpus attribute extraction through an ontology structure defined by concept design and introducing a template;
e, checking and disambiguating the results of the entity identification and the relationship extraction, and performing entity disambiguation on the results of the attribute extraction;
and F, storing the recognition result into the knowledge graph, updating the subject dictionary, the relation dictionary and the training model at random, and recognizing the material by using the new dictionary and model to realize the circulation iteration updating and construct the knowledge graph.
One or more embodiments of the present invention may have the following advantages over the prior art:
the invention provides a standard process reference for constructing the knowledge graph, so that the constructed knowledge graph is really oriented to intellectualization, the waste of human resources is relatively reduced, and the usability and the practicability of the knowledge graph are improved.
Drawings
FIG. 1 is a flowchart of a method for building a knowledge graph of journal articles with iterative loop updates;
FIG. 2 is a diagram of a topic table structure;
FIG. 3 is a diagram of a text database structure;
FIG. 4 is a diagram of a statement database structure;
FIG. 5 is a flowchart of a method for iteratively updating a knowledge-graph of journal articles;
FIG. 6 is a flow diagram of an entity identification update iterative model;
FIG. 7 is a flow diagram of a relationship identification update iterative model;
FIG. 8 is a flow diagram of an attribute extraction model.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in further detail with reference to the following embodiments and accompanying drawings.
As shown in fig. 1, the method for building a knowledge graph of journal literature for loop update iteration includes step 10 of conceptual model design, and specification of a knowledge graph definition ontology, data attributes and relationship attributes.
The ontology model refers to the ontology models or data standards which are widely applied internationally, such as CIDOC CRM, EDM, FOAF, EVENT, FRBR and the like, and is expanded and customized according to the service characteristics of the ontology model, so that the reusability and the internationalization degree of the ontology model are improved.
The ontology construction of the knowledge graph of the journal literature comprises the definition of an ontology and a data model layer of the knowledge graph of the journal literature, wherein the ontology construction comprises the following steps: defining an ontology, defining a relationship attribute of the ontology, and defining a data attribute inside the ontology.
The ontology is an object or a collection of objects, for example: text, author, and institution, etc. The relationship attributes of the ontologies mainly define the association relationship between ontologies, for example: there are collaborations between authors, dependencies between authors and organizations, etc. The data attribute inside the ontology is that there is no association relationship in the characteristics of the ontology itself, for example: author name, age, and native place, etc.
The invention defines a triple specification for the knowledge graph: (E)1,R,E2) And (E, P, V) wherein E represents an ontology, R represents a relationship attribute, P represents a data attribute, and V represents an attribute value. In an entity-relationship-entity relationship, the value range of an entity is an ontology.
The journal literature part ontology structure is defined as follows:
TABLE 1
Identification Body
E1 Text
E2 Authors refer to
E3 Mechanism
E4 Time of day
E5 Type of relationship
E6 Domain entity
E7 Region of land
The journal literature part relational attributes defined are as follows:
TABLE 2
Figure BDA0002381426920000041
Step 20, managing word lists and corpora, wherein the word lists are divided into subject word lists and relational word lists, and the corpus is divided into a text library and a sentence library and relates to the corpora of multiple sources;
the word list and the corpus of the knowledge graph of the journal literature are divided into data of a plurality of fields in the form of a Chinese atlas classification method. The vocabulary is divided into a subject vocabulary and a relational vocabulary in form, the subject vocabulary defines the source, the field, the sub-field and other attributes of the entity words, the relational vocabulary defines the relation between the entity words of the subject vocabulary, and the word relation defines 10 relations of upper and lower, similar, antisense, related and the like in the literature periodical.
The corpus is divided into a text library and a sentence library, the text library is a collection library of network journal documents and local resources, and document data are mainly stored. In order to facilitate deep text mining, journal documents of a text library are preprocessed, and a sentence library is formed. The sentence library includes sentences from journal literature and the positions of the sentences in which the entity words are located in the subject vocabulary. The structure of the topic word list is shown in fig. 2.
Wherein content is entity word, English is English translation, catalog is Chinese graph classification, domain is word source, etc.
The relational word table is as shown in Table 3:
TABLE 3
Figure BDA0002381426920000051
Wherein orgid and tarid are index ids of the subject word list where the entity word is located, and relatype is word relationship id. The text library and sentence library in the corpus are shown in fig. 3 and fig. 4.
And step 30, based on the labeling, training, recognition and calibration entity relationship extraction model of deep learning, adopting a deep learning entity relationship extraction technology in combination with the dictionary and the corpus to perform entity extraction and relationship extraction, and updating iteration.
Update iteration of entity extraction:
1. and labeling the corpus by using a dictionary, and labeling entity words appearing in the corpus.
2. And selecting an entity recognition algorithm to train the label set. The algorithm of entity identification goes through a process of updating iteration from machine learning to deep learning, for example: HMM, CRF, BILSTM + CRF, Bert + BILSTM + CRF, and the like. The invention adopts an algorithm of Bert + BILSTM + CRF to identify the entity.
3. And continuously recognizing the corpus by using the trained labeling model, calibrating the recognition result, and storing new words which do not appear in the subject dictionary into the subject dictionary.
4. Labeling with the updated dictionary again, and training the updated model and dictionary again.
The entity extraction process forms a closed loop of update iteration by adding a topic dictionary and in the form of a loop labeled corpus and a training model. The model is enabled to be continuously optimized to improve the accuracy of entity identification.
Update iteration of relationship extraction:
1. and labeling the statement set by using the relation dictionary and the existing relation extraction template, and forming a training model. The relation extraction is wide in related field, and a traditional deep learning model is difficult to have good performance on relation extraction training. Therefore, the traditional relation extraction designs a large number of templates containing part-of-speech and grammatical features. The method labels the statement set through two modes of the template and the relational word stock and forms a training sample.
2. And selecting a relation extraction algorithm to train the label set, wherein the relation extraction model selects a PCNN + Attention algorithm. CNN/PCNN was used as the sensor encoder, and a sentence-level annotation mechanism was used.
3. And performing relation recognition on the new corpus by using the training model, storing a recognition result in a database, correcting the recognition result through manual examination, storing the corrected recognition result in a relation dictionary and a statement set, and storing the corrected recognition result in a new training sample.
4. And recognizing the corpus again by using the new training sample and performing loop iteration.
The relationship identification and the entity identification adopt the same loop iteration process, and meanwhile, the accuracy of the identification is improved by combining a template formed by a large amount of past experience.
A flowchart of a method for constructing knowledge maps of journal literature with iterative cycle updating is shown in fig. 5, local data and journal literature data are unified and mapped and sorted into a text library, and the text library data is preprocessed to form a sentence library. The data of the text library and the sentence library are input language chats of an entity extraction model and a relation recognition model, subject words and relation words in a word list are accompanied with a corpus input model, and an attribute extraction model is introduced into a conceptual model at the same time. The output of the entity extraction and relationship identification model is respectively identified entity and new relationship phrase, and the output of the attribute extraction model is entity attribute triple. Entity disambiguation is followed by calibration and updating of the vocabulary database and journal literature knowledge maps. The new word list is combined with the new corpora to carry out data output by the model training book to update the word list and the knowledge graph again, so that updating iteration is realized in the process, the model, the word list and the knowledge graph are continuously corrected, the accuracy and the usability are improved, and an organic intelligent cyclic updating iteration mechanism is formed.
As shown in fig. 6, the iterative model is updated for entity identification, entity tagging is performed on the corpus through the vocabulary, and a tagged sample is input into the model for training. And the trained model carries out entity recognition on the corpus, and the word list and the knowledge graph are updated again according to the recognition result so as to form an updated iterative model of entity recognition.
Meanwhile, a flow chart of the relationship identification updating iterative model is shown in FIG. 7.
And step 40, performing corpus attribute extraction by conceptually designing the defined ontology structure and introducing a template.
The attribute extraction adopts a dependency syntactic analysis model, and the attribute extraction process is as follows:
1. and combining the ontology structure and the data attributes defined in the concept design to form an entity attribute template and traversing the statements of the entity and the existence related attributes in the statement set.
2. A CRF algorithm is adopted to label the part of speech of a sentence, entity words often have fixed parts of speech, and the part of speech labeling is difficult to judge the part of speech of unregistered words and judge the part of speech of word groups and words. The result of part-of-speech tagging has a great influence on the analysis of a sentence method. Therefore, part-of-speech tagging using CRFs enables learning of more entity features and facilitates update iterations.
3. And substituting the labeling result into a syntactic analyzer for syntactic analysis, wherein the syntactic analyzer adopts a dependency algorithm, the core of the algorithm is based on an arc-standard system, a classifier is used for predicting correct conversion operation according to the characteristics extracted from the configuration information, and the calculation efficiency is very high
4. The syntactic results are analyzed by matching syntactic templates and properties, such as the structure of a principal and predicate, are extracted.
As shown in FIG. 8, the attribute extraction model, the conceptual model and the topic dictionary are used as input to extract sentences from the sentence library as sample models. And the sample model is used for performing part-of-speech tagging through the CRF, performing dependency syntax analysis on a tagging result, analyzing a statement result with grammatical features, performing attribute extraction through a grammar template, forming entity attribute triples and storing the entity attribute triples into a knowledge graph. And updating iteration of the attribute extraction model is mainly realized by circularly training the CRF model to calibrate the accuracy of part-of-speech tagging.
And 50, entity disambiguation and verification, namely verifying and disambiguating the result of entity identification and relationship extraction, and performing entity disambiguation the result of attribute extraction.
Entity disambiguation mainly solves the phenomena of one-word polysemy and multiple-word polysemy existing in natural language. The entity disambiguation is divided into two steps, wherein in the first step, deep learning disambiguation is carried out before entity identification and relation identification; the second is to use the relationship dictionary and the topic dictionary for matching disambiguation. And disambiguating results of entity identification, relationship identification and attribute extraction.
Step 60 stores the recognition results in the knowledge graph and updates the topic dictionary, the relationship dictionary and the training model at random. And identifying the material by using the new dictionary and model to realize loop iteration updating and construct the knowledge graph.
Although the embodiments of the present invention have been described above, the above descriptions are only for the convenience of understanding the present invention, and are not intended to limit the present invention. It will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (6)

1. A method for building a knowledge graph of periodical literature by iteration in a loop updating manner is characterized by comprising the following steps:
designing a concept model, defining an ontology structure of a knowledge graph of journal literature, wherein the ontology structure comprises a definition ontology, a relation attribute of the ontology and a data attribute inside the ontology;
b, managing a word list and a corpus, wherein the word list is divided into a subject word list and a relational word list, and the corpus is divided into a text library and a sentence library and relates to the corpora of a plurality of sources;
c, based on a deep learning labeling, training, recognition and calibration entity relation extraction model, adopting a deep learning entity relation extraction technology to combine a dictionary and a corpus, performing entity extraction and relation extraction, and updating iteration;
d, performing corpus attribute extraction through an ontology structure defined by concept design and introducing a template;
e, checking and disambiguating the results of the entity identification and the relationship extraction, and performing entity disambiguation on the results of the attribute extraction;
and F, storing the recognition result into the knowledge graph, updating the subject dictionary, the relation dictionary and the training model at random, and recognizing the material by using the new dictionary and model to realize the circulation iteration updating and construct the knowledge graph.
2. The method of iteratively updating a knowledge-graph of journal literature according to claim 1, wherein in step a:
an ontology is an object or a collection of objects;
the relationship attribute of the ontology is used for defining the incidence relationship between the ontologies;
the data attribute inside the ontology is that the characteristics of the ontology do not have an association relationship.
3. The method of iteratively updating a knowledge-graph of journal literature according to claim 1, wherein in step B:
the topic word list defines the source, the field and the sub-field attribute of the entity word;
the relation vocabulary defines the relation between the entity words of the subject vocabulary, and defines the upper and lower, similar, antisense and related relations for the word relation in the literature periodical;
the text library is a collective library of network journal documents and local resources and mainly stores document data; preprocessing journal documents in a text library to form a sentence library; the sentence library comprises sentences from periodical literature and the positions of the sentences of the entity words in the subject vocabulary.
4. The method for constructing a knowledge-graph of journal documents based on iterative update of claim 1, wherein the update iteration of entity extraction in step C comprises:
labeling the corpus by using a dictionary, and labeling entity words appearing in the corpus;
selecting an entity recognition algorithm to train the label set;
continuously recognizing the corpus by using the trained labeling model, calibrating the recognition result and storing new words which do not appear in the subject dictionary into the subject dictionary;
labeling with the updated dictionary again, and training the updated model and dictionary again.
5. The method for constructing a knowledge-graph of journal documents based on iterative update of claim 1, wherein the update iteration of relation extraction in step C comprises:
labeling the sentence set by using a relation dictionary and an existing relation extraction template, and forming a training model;
selecting a relation extraction algorithm to train the label set, and selecting a PCNN + Attention algorithm from the relation extraction model;
carrying out relation recognition on the new corpus by using a training model, storing a recognition result in a database, correcting the recognition result through manual examination, storing the corrected recognition result in a relation dictionary and a statement set, and storing the corrected recognition result in a corpus storage mode for a new training sample;
and recognizing the corpus again by using the new training sample and performing loop iteration.
6. The method for cyclically updating and iterating knowledge graph construction of journal documents according to claim 1, wherein the attribute extraction in step D adopts a dependency syntactic analysis model, and the attribute extraction process is as follows:
combining an ontology structure and data attributes defined in the concept design to form an entity attribute template and traversing the entity and the sentences with the related attributes in the sentence set;
performing part-of-speech tagging on the statement by adopting a CRF algorithm;
substituting the labeling result into a syntactic analyzer for syntactic analysis;
the syntactic results are analyzed and properties extracted by matching the syntactic templates.
CN202010084144.5A 2020-02-10 2020-02-10 Periodical literature knowledge graph construction method for cyclic updating iteration Active CN111209412B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010084144.5A CN111209412B (en) 2020-02-10 2020-02-10 Periodical literature knowledge graph construction method for cyclic updating iteration

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010084144.5A CN111209412B (en) 2020-02-10 2020-02-10 Periodical literature knowledge graph construction method for cyclic updating iteration

Publications (2)

Publication Number Publication Date
CN111209412A true CN111209412A (en) 2020-05-29
CN111209412B CN111209412B (en) 2023-05-12

Family

ID=70787817

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010084144.5A Active CN111209412B (en) 2020-02-10 2020-02-10 Periodical literature knowledge graph construction method for cyclic updating iteration

Country Status (1)

Country Link
CN (1) CN111209412B (en)

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111753021A (en) * 2020-06-17 2020-10-09 第四范式(北京)技术有限公司 Method, device and equipment for constructing knowledge graph and readable storage medium
CN111914550A (en) * 2020-07-16 2020-11-10 华中师范大学 Knowledge graph updating method and system for limited field
CN112100405A (en) * 2020-09-23 2020-12-18 中国农业大学 Veterinary drug residue knowledge graph construction method based on weighted LDA
CN112101036A (en) * 2020-09-22 2020-12-18 山东旗帜信息有限公司 Knowledge joint extraction method and device based on predefined relationship
CN112559772A (en) * 2020-12-29 2021-03-26 厦门市美亚柏科信息股份有限公司 Dynamic maintenance method of knowledge graph, terminal equipment and storage medium
CN113010663A (en) * 2021-04-26 2021-06-22 东华大学 Adaptive reasoning question-answering method and system based on industrial cognitive map
CN113010593A (en) * 2021-04-02 2021-06-22 北京智通云联科技有限公司 Method, system and device for extracting events of unstructured text
CN113204648A (en) * 2021-04-30 2021-08-03 武汉工程大学 Knowledge graph completion method based on automatic extraction relationship of judgment book text
CN113221566A (en) * 2021-05-08 2021-08-06 北京百度网讯科技有限公司 Entity relationship extraction method and device, electronic equipment and storage medium
CN113392183A (en) * 2021-05-31 2021-09-14 南京师范大学 Characterization and calculation method of children domain map knowledge
CN113392223A (en) * 2021-05-12 2021-09-14 同方知网数字出版技术股份有限公司 Knowledge graph construction method based on meteorological field
CN113553439A (en) * 2021-06-18 2021-10-26 杭州摸象大数据科技有限公司 Method and system for knowledge graph mining
WO2021254457A1 (en) * 2020-06-17 2021-12-23 第四范式(北京)技术有限公司 Method and device for constructing knowledge graph, computer device, and storage medium
CN116205217A (en) * 2023-05-05 2023-06-02 北京邮电大学 Small sample relation extraction method, system, electronic equipment and storage medium
CN117009519A (en) * 2023-07-19 2023-11-07 上交所技术有限责任公司 Enterprise leaning industry method based on word bag model
CN112000791B (en) * 2020-08-26 2024-02-02 哈电发电设备国家工程研究中心有限公司 Motor fault knowledge extraction system and method
CN117725229A (en) * 2024-01-08 2024-03-19 中国科学技术信息研究所 Knowledge organization system auxiliary updating method

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106776711A (en) * 2016-11-14 2017-05-31 浙江大学 A kind of Chinese medical knowledge mapping construction method based on deep learning
CN106844658A (en) * 2017-01-23 2017-06-13 中山大学 A kind of Chinese text knowledge mapping method for auto constructing and system
WO2018222448A1 (en) * 2017-06-02 2018-12-06 Microsoft Technology Licensing, Llc Modeling an action completion conversation using a knowledge graph
CN110110092A (en) * 2018-09-30 2019-08-09 北京国双科技有限公司 A kind of knowledge mapping construction method and relevant device
CA3040373A1 (en) * 2018-04-16 2019-10-16 Tata Consultancy Services Limited Deep learning techniques based multi-purpose conversational agents for processing natural language queries

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106776711A (en) * 2016-11-14 2017-05-31 浙江大学 A kind of Chinese medical knowledge mapping construction method based on deep learning
CN106844658A (en) * 2017-01-23 2017-06-13 中山大学 A kind of Chinese text knowledge mapping method for auto constructing and system
WO2018222448A1 (en) * 2017-06-02 2018-12-06 Microsoft Technology Licensing, Llc Modeling an action completion conversation using a knowledge graph
CA3040373A1 (en) * 2018-04-16 2019-10-16 Tata Consultancy Services Limited Deep learning techniques based multi-purpose conversational agents for processing natural language queries
CN110110092A (en) * 2018-09-30 2019-08-09 北京国双科技有限公司 A kind of knowledge mapping construction method and relevant device

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
WEIDONG LI,XINYU ZHANG,YAQIAN WANG,ZHIHUAN YAN,RONG PENG: ""Graph2Seq: Fusion Embedding Learning for Knowledge Graph Completion"", 《IEEE ACCESS》 *
徐增林等: "知识图谱技术综述", 《电子科技大学学报》 *

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP4170520A4 (en) * 2020-06-17 2023-11-29 The 4th Paradigm Technology Co., Ltd Method and device for constructing knowledge graph, computer device, and storage medium
CN111753021A (en) * 2020-06-17 2020-10-09 第四范式(北京)技术有限公司 Method, device and equipment for constructing knowledge graph and readable storage medium
WO2021254457A1 (en) * 2020-06-17 2021-12-23 第四范式(北京)技术有限公司 Method and device for constructing knowledge graph, computer device, and storage medium
CN111914550A (en) * 2020-07-16 2020-11-10 华中师范大学 Knowledge graph updating method and system for limited field
CN111914550B (en) * 2020-07-16 2023-12-15 华中师范大学 Knowledge graph updating method and system oriented to limited field
CN112000791B (en) * 2020-08-26 2024-02-02 哈电发电设备国家工程研究中心有限公司 Motor fault knowledge extraction system and method
CN112101036A (en) * 2020-09-22 2020-12-18 山东旗帜信息有限公司 Knowledge joint extraction method and device based on predefined relationship
CN112100405A (en) * 2020-09-23 2020-12-18 中国农业大学 Veterinary drug residue knowledge graph construction method based on weighted LDA
CN112100405B (en) * 2020-09-23 2024-01-30 中国农业大学 Veterinary drug residue knowledge graph construction method based on weighted LDA
CN112559772A (en) * 2020-12-29 2021-03-26 厦门市美亚柏科信息股份有限公司 Dynamic maintenance method of knowledge graph, terminal equipment and storage medium
CN112559772B (en) * 2020-12-29 2022-09-09 厦门市美亚柏科信息股份有限公司 Dynamic maintenance method of knowledge graph, terminal equipment and storage medium
CN113010593A (en) * 2021-04-02 2021-06-22 北京智通云联科技有限公司 Method, system and device for extracting events of unstructured text
CN113010593B (en) * 2021-04-02 2024-02-13 北京智通云联科技有限公司 Event extraction method, system and device for unstructured text
CN113010663A (en) * 2021-04-26 2021-06-22 东华大学 Adaptive reasoning question-answering method and system based on industrial cognitive map
CN113204648A (en) * 2021-04-30 2021-08-03 武汉工程大学 Knowledge graph completion method based on automatic extraction relationship of judgment book text
CN113221566B (en) * 2021-05-08 2023-08-01 北京百度网讯科技有限公司 Entity relation extraction method, entity relation extraction device, electronic equipment and storage medium
CN113221566A (en) * 2021-05-08 2021-08-06 北京百度网讯科技有限公司 Entity relationship extraction method and device, electronic equipment and storage medium
CN113392223A (en) * 2021-05-12 2021-09-14 同方知网数字出版技术股份有限公司 Knowledge graph construction method based on meteorological field
CN113392183A (en) * 2021-05-31 2021-09-14 南京师范大学 Characterization and calculation method of children domain map knowledge
CN113553439A (en) * 2021-06-18 2021-10-26 杭州摸象大数据科技有限公司 Method and system for knowledge graph mining
CN116205217A (en) * 2023-05-05 2023-06-02 北京邮电大学 Small sample relation extraction method, system, electronic equipment and storage medium
CN116205217B (en) * 2023-05-05 2023-09-01 北京邮电大学 Small sample relation extraction method, system, electronic equipment and storage medium
CN117009519A (en) * 2023-07-19 2023-11-07 上交所技术有限责任公司 Enterprise leaning industry method based on word bag model
CN117725229A (en) * 2024-01-08 2024-03-19 中国科学技术信息研究所 Knowledge organization system auxiliary updating method

Also Published As

Publication number Publication date
CN111209412B (en) 2023-05-12

Similar Documents

Publication Publication Date Title
CN111209412B (en) Periodical literature knowledge graph construction method for cyclic updating iteration
CN109271626B (en) Text semantic analysis method
Gupta et al. Abstractive summarization: An overview of the state of the art
CN110298033B (en) Keyword corpus labeling training extraction system
CN111310471B (en) Travel named entity identification method based on BBLC model
Kiyavitskaya et al. Cerno: Light-weight tool support for semantic annotation of textual documents
CN111324742A (en) Construction method of digital human knowledge map
Vasyl et al. Application of sentence parsing for determining keywords in Ukrainian texts
CN112541337B (en) Document template automatic generation method and system based on recurrent neural network language model
CN111061882A (en) Knowledge graph construction method
CN113806563A (en) Architect knowledge graph construction method for multi-source heterogeneous building humanistic historical material
CN113191148A (en) Rail transit entity identification method based on semi-supervised learning and clustering
CN112183059A (en) Chinese structured event extraction method
CN112380848B (en) Text generation method, device, equipment and storage medium
CN113312922A (en) Improved chapter-level triple information extraction method
CN112257442A (en) Policy document information extraction method based on corpus expansion neural network
Kanev et al. Metagraph knowledge base and natural language processing pipeline for event extraction and time concept analysis
CN113963748A (en) Protein knowledge map vectorization method
CN117473054A (en) Knowledge graph-based general intelligent question-answering method and device
Lim et al. Real-world sentence boundary detection using multitask learning: A case study on French
Dai Construction of English and American literature corpus based on machine learning algorithm
CN111241827B (en) Attribute extraction method based on sentence retrieval mode
Barakhnin et al. Word reordering algorithm for poetry analysis
CN113821618B (en) Method and system for extracting class items of electronic medical record
Safeena et al. Quranic computation: A review of research and application

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20230419

Address after: Room B201-B203, B205-B210, 2nd Floor, Building B-2, Zhongguancun Dongsheng Science and Technology Park, No. 66 Xixiaokou Road, Haidian District, Beijing, 100192 (Dongsheng District)

Applicant after: TONGFANG KNOWLEDGE NETWORK DIGITAL PUBLISHING TECHNOLOGY CO.,LTD.

Address before: 100084 Beijing city Haidian District Tsinghua University Tsinghua Yuan 36 zone B1410, Huaye building 1412, room 1414

Applicant before: TONGFANG KNOWLEDGE NETWORK (BEIJING) TECHNOLOGY Co.,Ltd.

Applicant before: TONGFANG KNOWLEDGE NETWORK DIGITAL PUBLISHING TECHNOLOGY CO.,LTD.

GR01 Patent grant
GR01 Patent grant