CN111581398A - Method for constructing knowledge graph - Google Patents

Info

Publication number
CN111581398A
Authority
CN
China
Prior art keywords
industry
knowledge graph
constructing
service
field
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010400800.8A
Other languages
Chinese (zh)
Inventor
任伍杰
杨亮
霍选伟
沈恩欣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Henan 863 Software Co ltd
Original Assignee
Henan 863 Software Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Henan 863 Software Co ltd
Priority to CN202010400800.8A
Publication of CN111581398A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36 Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367 Ontology
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35 Clustering; Classification

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to a method for constructing a knowledge graph, comprising the following steps: preliminarily listing the technical fields of the industry by querying relevant data on the target industry and classifying and summarizing that data; determining the business categories, naming the industry's business categories according to the queried data; preliminarily screening the business categories and establishing an attribution relation schematic diagram; determining keywords; textualizing the business relations; consulting industry experts and completing modifications; and determining a final version and generating an importable knowledge graph format. The method reduces cost and improves construction efficiency; because expert opinions are consulted and the graph is modified and confirmed multiple times during construction, accuracy is further improved.

Description

Method for constructing knowledge graph
Technical Field
The invention relates to the technical field of scientific and technical information management, in particular to a method for constructing a knowledge graph.
Background
A knowledge graph, also called a scientific knowledge map and known in library and information science as knowledge domain visualization or knowledge domain mapping, is a series of graphs that display the development process and structural relationships of knowledge. It uses visualization technology to describe knowledge resources and their carriers, and to mine, analyze, construct, draw, and display knowledge and the relationships among knowledge items.
Big data analysis based on a knowledge graph captures the underlying semantic associations in big data, is more flexible and diverse than a traditional relational database, and can better meet users' needs for value exploration and information discovery in big data.
Open, general-purpose knowledge graphs emphasize breadth and the fusion of as many entities as possible, so their accuracy is low; limited by the scope of their concepts, they have difficulty using an ontology base to cover the entities, attributes, and inter-entity relationships of a specific industry's vertical domain.
Some knowledge graph construction methods have already been developed. For example, the invention patent application with publication No. CN110297872A discloses a method and system for constructing and querying knowledge graphs in the science and technology field. That method lets users define objects, relations, and attributes, and can be flexibly extended when the application scenario changes. It supports mapping data tables to objects and relations and mapping fields to attributes; it extracts data from a relational database through Apache NiFi, converts it into object, attribute, and relation instance data, stores the result in a database, and supports incremental updates. It thereby overcomes the conceptual and accuracy defects that arise when current general knowledge graph construction methods are applied to a specific industry.
The invention patent with publication No. CN103488724B discloses a method for constructing a domain knowledge graph oriented to book reading. To address problems in current electronic reading such as shallow knowledge hierarchies and insufficiently intelligent knowledge recommendation, it proposes constructing a book-oriented domain knowledge graph in combination with a general knowledge graph, building a knowledge network for an e-book and thereby enabling explanation of words in the book and intelligent knowledge recommendation.
Disclosure of Invention
The invention aims to provide a method for constructing a knowledge graph that improves the accuracy and the application value of the knowledge graph.
To achieve this purpose, the invention adopts the following technical scheme:
A method for constructing a knowledge graph comprises the following steps: (1) preliminarily listing the technical fields of the industry: querying relevant data on the target industry, and classifying and summarizing the data;
(2) determining the business categories: determining the names of the industry's business categories according to the queried data;
(3) preliminarily screening the business categories and establishing an attribution relation schematic diagram;
(4) determining keywords;
(5) textualizing the business relations;
(6) consulting industry experts for their opinions and completing modifications;
(7) determining a final version and generating an importable knowledge graph format.
Further, in step (1), the technical fields of the industry can be listed by one of the following methods: (1-1) national industry standard classification, in which the technical fields are obtained from the national economic industry classification of the National Bureau of Statistics and other referable industry classification documents; or (1-2) internal industry classification, in which industry experts or industry practitioners are sought to take part in defining the technical fields; or (1-3) summarized data classification, in which industry business information is gathered through online encyclopedias, Wikipedia, and encyclopedia search, and the technical fields are summarized from it.
Further, in step (2), the business categories can be determined by one of the following methods: (2-1) selecting the required business categories according to the national industry standard classification and the requirements of the knowledge graph; or (2-2) directly obtaining the business categories required by the knowledge graph from the industry's internal classification; or (2-3) summarizing relatively coarse business categories by querying industry data online.
Further, in step (3), the attribution relation schematic diagram is established according to the determined business category names; during this process, business categories are deleted or added until the diagram is complete.
Further, step (4) specifically comprises: (4-1) searching for keywords: obtaining keywords for the corresponding technical fields by querying the industry's related business online;
(4-2) checking the keywords: compiling an authoritative keyword list, either by consulting industry experts and practitioners or by collecting and sorting the titles of the industry's public announcements, and then screening and checking the keywords found online against that list;
(4-3) performing word segmentation: splitting the checked keywords into minimal words, where a two-character word is the smallest unit, except for proper nouns.
Further, in step (5), textualizing the business relations means finally confirming the attribution relations in text form, so that an importable knowledge graph business relation document can be conveniently generated.
Further, in step (6), experts give their opinions and modifications are made; this step can be repeated several times to obtain the best knowledge graph.
The invention has the following beneficial effects:
By executing the steps of the invention, a knowledge graph for a given industry can be established with less construction difficulty, which reduces cost and improves construction efficiency; because expert opinions are consulted and the graph is modified and confirmed multiple times during construction, accuracy is further improved, and the method has good value for wider adoption.
Detailed Description
The technical solutions in the embodiments of the present invention are clearly and completely described below.
Example 1 of the invention:
A method for constructing a knowledge graph comprises the following steps:
(1) Preliminarily listing the technical fields of the industry: querying relevant data on the target industry, and classifying and summarizing the data.
The technical fields of the industry are listed by method (1-1), national industry standard classification: the technical fields are obtained from the national economic industry classification of the National Bureau of Statistics and other referable industry classification documents.
(2) Determining the business categories: determining the names of the industry's business categories according to the queried data.
The business categories are determined by method (2-1): the required business categories are selected according to the national industry standard classification and the requirements of the knowledge graph.
(3) Preliminarily screening the business categories and establishing an attribution relation schematic diagram.
Establishing the schematic diagram is also a process of refining the picture of the industry's business. The diagram is established according to the determined business category names, and business categories are deleted or added along the way until the diagram is complete. Categories are deleted or added according to the industry's business processes and in consultation with industry experts or practitioners.
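The add-and-delete screening in step (3) can be sketched as a small tree structure. This is a minimal illustration under stated assumptions, not the patent's implementation: the `Category` class, its method names, and the medical-industry category names are all hypothetical and introduced here only for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Category:
    """One business category node in the attribution (parent/child) diagram."""
    name: str
    children: dict[str, "Category"] = field(default_factory=dict)

    def add(self, path: list[str]) -> None:
        """Add a category under this node, creating intermediate levels as needed."""
        node = self
        for part in path:
            node = node.children.setdefault(part, Category(part))

    def remove(self, path: list[str]) -> bool:
        """Delete the category at a non-empty `path` (with its subtree).

        Returns False if the category does not exist.
        """
        node = self
        for part in path[:-1]:
            if part not in node.children:
                return False
            node = node.children[part]
        return node.children.pop(path[-1], None) is not None

    def render(self, indent: int = 0) -> list[str]:
        """Return the diagram as indented text lines, one per category."""
        lines = ["  " * indent + self.name]
        for child in self.children.values():
            lines.extend(child.render(indent + 1))
        return lines


# Hypothetical medical-industry categories, for illustration only.
root = Category("Medical industry")
root.add(["Hospitals", "Outpatient services"])
root.add(["Medicines", "Antibiotics"])
root.add(["Insurance"])      # added during preliminary screening
root.remove(["Insurance"])   # deleted again after consulting an expert
diagram = root.render()
print("\n".join(diagram))
```

Keeping the diagram as a plain tree makes each screening decision a single `add` or `remove` call, so the history of expert-driven changes is easy to replay.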
(4) Determining keywords.
Keywords of the industry's technical fields are central to whether knowledge graph recommendations are accurate. During sorting, ambiguous keywords are listed by category together with their sources and the competing interpretations.
(4-1) Searching for keywords: obtaining keywords for the corresponding technical fields by querying the industry's related business online;
(4-2) checking the keywords: compiling an authoritative keyword list, either by consulting industry experts and practitioners or by collecting and sorting the titles of the industry's public announcements, and then screening and checking the keywords found online against that list;
(4-3) performing word segmentation: splitting the checked keywords into minimal words, where a two-character word is the smallest unit, except for proper nouns.
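Sub-steps (4-2) and (4-3) can be sketched as two small functions. The patent does not specify a segmentation algorithm, so the fixed two-character chunking below is an assumption chosen for brevity; a production system would more likely use a dictionary-based Chinese tokenizer. The function names and the sample keywords are hypothetical.

```python
def screen_keywords(candidates, authoritative):
    """Step (4-2): keep candidates found in the authoritative list, without duplicates."""
    allowed = set(authoritative)
    seen = set()
    kept = []
    for word in candidates:
        if word in allowed and word not in seen:
            seen.add(word)
            kept.append(word)
    return kept


def split_minimal(keyword, proper_nouns):
    """Step (4-3): split a keyword into units of at least two characters.

    Assumption: naive fixed-width chunking. Proper nouns and keywords of
    three characters or fewer are returned whole. A trailing single
    character is merged into the previous unit so no unit is shorter
    than two characters.
    """
    if keyword in proper_nouns or len(keyword) <= 3:
        return [keyword]
    units = [keyword[i:i + 2] for i in range(0, len(keyword), 2)]
    if len(units[-1]) == 1:
        tail = units.pop()
        units[-1] += tail
    return units


# Hypothetical sample data: keywords found online vs. an expert-approved list.
found_online = ["疾病诊断", "阿司匹林", "广告推广", "疾病诊断"]
expert_list = ["疾病诊断", "阿司匹林", "医疗保险"]
proper_nouns = {"阿司匹林"}  # "aspirin" is kept whole

checked = screen_keywords(found_online, expert_list)
minimal_words = {word: split_minimal(word, proper_nouns) for word in checked}
print(minimal_words)
```

Here "疾病诊断" (disease diagnosis) splits into "疾病" and "诊断", while the proper noun stays intact and the unapproved keyword is dropped.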
(5) Textualizing the business relations.
Textualizing the business relations means finally confirming the attribution relations in text form, so that an importable knowledge graph business relation document can be conveniently generated.
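One way to picture the textualization in step (5) is to flatten the confirmed attribution relations into rows of text. The patent does not prescribe a file layout, so the CSV columns, the `contains` relation name, and the sample categories below are assumptions made for this sketch.

```python
import csv
import io


def flatten(tree, parent=None):
    """Yield (parent, relation, child) rows from a nested dict of categories."""
    for name, subtree in tree.items():
        if parent is not None:
            yield (parent, "contains", name)
        yield from flatten(subtree, name)


def to_csv(tree):
    """Serialize the attribution relations as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["parent", "relation", "child"])
    writer.writerows(flatten(tree))
    return buffer.getvalue()


# Hypothetical attribution relations, expressed as a nested dict.
tree = {
    "Medical industry": {
        "Hospitals": {"Outpatient services": {}},
        "Medicines": {},
    }
}
document = to_csv(tree)
print(document)
```

A row-per-relation text file like this is easy for experts to review line by line and simple for most graph stores to bulk import.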
(6) Consulting industry experts for their opinions and completing modifications.
Experts give their opinions and modifications are made. This step can be repeated several times; the best industry knowledge graph is obtained through repeated modification and confirmation.
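The repeated review in step (6) amounts to a loop that stops once a full round of expert feedback changes nothing. The sketch below models each expert as a function that returns a revised set of relations; that modeling choice, the function names, and the sample relations are assumptions for illustration, not part of the patent.

```python
def review_until_accepted(draft, reviewers, max_rounds=10):
    """Apply every reviewer's revisions per round until a round changes nothing.

    Returns the accepted draft and the number of the round in which it
    was confirmed unchanged.
    """
    for round_number in range(1, max_rounds + 1):
        revised = draft
        for reviewer in reviewers:
            revised = reviewer(revised)
        if revised == draft:
            return draft, round_number
        draft = revised
    raise RuntimeError("no agreement reached within max_rounds")


# Hypothetical reviewers operating on (subject, relation, object) triples.
def add_missing_treatment(relations):
    return relations | {("medicine", "treats", "disease")}


def drop_vague_relations(relations):
    return frozenset(r for r in relations if r[1] != "related to")


draft = frozenset({("expert", "related to", "disease")})
final, rounds = review_until_accepted(
    draft, [add_missing_treatment, drop_vague_relations]
)
print(sorted(final), rounds)
```

The first round rewrites the draft; the second round leaves it untouched, which is the point at which no reviewer has a remaining objection.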
After the above steps are completed, a prototype of the knowledge graph exists. Although industry experts or practitioners have taken part in guiding the work, industry leaders or further professionals are still needed to contribute collectively, and the industry knowledge graph is obtained through continued refinement.
(7) Determining a final version and generating an importable knowledge graph format.
When no party has any objection, the version is fixed as final and construction of the knowledge graph is complete.
The invention discloses a method for constructing a knowledge graph for a target object based on a target language. The target language can be JSON-LD, a JSON-based method for representing and transmitting linked data on the Internet; it describes how to represent a directed graph in JSON and how to mix linked and non-linked data in one document. In other words, JSON-LD is a JSON-based data format that can be used to implement structured data. The target object can be a specific field, such as the medical, mother-and-infant, marine, or automobile field; it can also be a specific sub-field within such a field, such as engines, milk powder, or mobile phones. The embodiments of the invention do not specifically limit the form of the target object.
Taking the medical field as an example, keywords such as hospital, expert, disease, and medicine exist in that field. Each keyword has its own attributes; for example, the keyword "disease" has attributes such as "symptom", "diagnosis", "pathological change", and "treatment plan". Keywords are associated in various ways; for example, there is a "treats" relationship between "medicine" and "disease", and a "specializes in treating" relationship between "expert" and "disease". At the instance level, there is a "treats" relationship between "antibiotic drugs" and "meningitis". An attribution relation schematic diagram can therefore be established, and the knowledge graph displayed visually.
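The medical example can be written out as a small JSON-LD document. The sketch below builds it with Python's standard `json` module. The vocabulary IRI, the `ex:` prefix, the property names `treats` and `specializesIn`, the expert identifier, and the placeholder attribute value are all assumptions made here; the patent names JSON-LD as a possible target language but gives no concrete schema.

```python
import json

# Hypothetical IRIs; the patent does not define a vocabulary.
VOCAB = "http://example.org/medical#"
ENTITIES = "http://example.org/medical/entity/"

graph = {
    "@context": {
        "@vocab": VOCAB,
        "ex": ENTITIES,
        # Values of these properties are node references, not plain strings.
        "treats": {"@type": "@id"},
        "specializesIn": {"@type": "@id"},
    },
    "@graph": [
        {
            "@id": "ex:meningitis",
            "@type": "Disease",
            "symptom": "(placeholder attribute value)",
        },
        {
            "@id": "ex:antibiotic-drugs",
            "@type": "Medicine",
            "treats": "ex:meningitis",
        },
        {
            "@id": "ex:expert-1",
            "@type": "Expert",
            "specializesIn": "ex:meningitis",
        },
    ],
}

# Collect the directed edges of the graph from the relation properties.
edges = [
    (node["@id"], prop, node[prop])
    for node in graph["@graph"]
    for prop in ("treats", "specializesIn")
    if prop in node
]

document = json.dumps(graph, indent=2)
print(document)
```

Each entry in `@graph` is a node, and each relation property whose value is another node's `@id` is a directed edge, which is how the format carries a directed graph inside ordinary JSON.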
Example 2 of the invention:
A method for constructing a knowledge graph comprises the following steps:
(1) Preliminarily listing the technical fields of the industry: querying relevant data on the target industry, and classifying and summarizing the data.
The technical fields of the industry are listed by method (1-2), internal industry classification: industry experts or industry practitioners are sought to take part in defining the technical fields.
(2) Determining the business categories: determining the names of the industry's business categories according to the queried data.
The business categories are determined by method (2-2): the business categories required by the knowledge graph are obtained directly from the industry's internal classification.
(3) Preliminarily screening the business categories and establishing an attribution relation schematic diagram.
The attribution relation schematic diagram is established according to the determined business category names, and business categories are deleted or added along the way until the diagram is complete.
Categories are deleted or added according to the industry's business processes and in consultation with industry experts or practitioners.
(4) Determining keywords.
Keywords of the industry's technical fields are central to whether knowledge graph recommendations are accurate. During sorting, ambiguous keywords are listed by category together with their sources and the competing interpretations.
(4-1) Searching for keywords: obtaining keywords for the corresponding technical fields by querying the industry's related business online;
(4-2) checking the keywords: compiling an authoritative keyword list, either by consulting industry experts and practitioners or by collecting and sorting the titles of the industry's public announcements, and then screening and checking the keywords found online against that list;
(4-3) performing word segmentation: splitting the checked keywords into minimal words, where a two-character word is the smallest unit, except for proper nouns.
(5) Textualizing the business relations.
Textualizing the business relations means finally confirming the attribution relations in text form, so that an importable knowledge graph business relation document can be conveniently generated.
(6) Consulting industry experts for their opinions and completing modifications.
Experts give their opinions and modifications are made. This step can be repeated several times; the best industry knowledge graph is obtained through repeated modification and confirmation.
After the above steps are completed, a prototype of the knowledge graph exists. Although industry experts or practitioners have taken part in guiding the work, industry leaders or further professionals are still needed to contribute collectively, and the industry knowledge graph is obtained through continued refinement.
(7) Determining a final version and generating an importable knowledge graph format.
When no party has any objection, the version is fixed as final and construction of the knowledge graph is complete.
Example 3:
A method for constructing a knowledge graph comprises the following steps:
(1) Preliminarily listing the technical fields of the industry: querying relevant data on the target industry, and classifying and summarizing the data.
The technical fields of the industry are listed by method (1-3), summarized data classification: industry business information is gathered through online encyclopedias, Wikipedia, and encyclopedia search, and the technical fields are summarized from it.
(2) Determining the business categories: determining the names of the industry's business categories according to the queried data.
The business categories are determined by method (2-3): relatively coarse business categories are summarized by querying industry data online.
(3) Preliminarily screening the business categories and establishing an attribution relation schematic diagram.
The attribution relation schematic diagram is established according to the determined business category names, and business categories are deleted or added along the way until the diagram is complete.
Categories are deleted or added according to the industry's business processes and in consultation with industry experts or practitioners.
(4) Determining keywords.
Keywords of the industry's technical fields are central to whether knowledge graph recommendations are accurate. During sorting, ambiguous keywords are listed by category together with their sources and the competing interpretations.
(4-1) Searching for keywords: obtaining keywords for the corresponding technical fields by querying the industry's related business online;
(4-2) checking the keywords: compiling an authoritative keyword list, either by consulting industry experts and practitioners or by collecting and sorting the titles of the industry's public announcements, and then screening and checking the keywords found online against that list;
(4-3) performing word segmentation: splitting the checked keywords into minimal words, where a two-character word is the smallest unit, except for proper nouns.
(5) Textualizing the business relations.
Textualizing the business relations means finally confirming the attribution relations in text form, so that an importable knowledge graph business relation document can be conveniently generated.
(6) Consulting industry experts for their opinions and completing modifications.
Experts give their opinions and modifications are made. This step can be repeated several times; the best industry knowledge graph is obtained through repeated modification and confirmation.
After the above steps are completed, a prototype of the knowledge graph exists. Although industry experts or practitioners have taken part in guiding the work, industry leaders or further professionals are still needed to contribute collectively, and the industry knowledge graph is obtained through continued refinement.
(7) Determining a final version and generating an importable knowledge graph format.
When no party has any objection, the version is fixed as final and construction of the knowledge graph is complete.
The present invention is not limited to the above preferred embodiments. Anyone may derive products in various other forms in light of the present invention; however, whatever changes are made in shape or structure, any technical solution that is the same as or similar to that of the present application falls within the protection scope of the present invention.

Claims (7)

1. A method of constructing a knowledge graph, characterized by comprising the following steps: (1) preliminarily listing the technical fields of the industry: querying relevant data on the target industry, and classifying and summarizing the data;
(2) determining the business categories: determining the names of the industry's business categories according to the queried data;
(3) preliminarily screening the business categories and establishing an attribution relation schematic diagram;
(4) determining keywords;
(5) textualizing the business relations;
(6) consulting industry experts for their opinions and completing modifications;
(7) determining a final version and generating an importable knowledge graph format.
2. The method of constructing a knowledge graph according to claim 1, wherein: in step (1), the technical fields of the industry can be listed by (1-1) national industry standard classification, in which the technical fields are obtained from the national economic industry classification of the National Bureau of Statistics and other referable industry classification documents; or (1-2) internal industry classification, in which industry experts or industry practitioners are sought to take part in defining the technical fields; or (1-3) summarized data classification, in which industry business information is gathered through online encyclopedias, Wikipedia, and encyclopedia search, and the technical fields are summarized from it.
3. The method of constructing a knowledge graph according to claim 2, wherein: in step (2), the business categories can be determined by (2-1) selecting the required business categories according to the national industry standard classification and the requirements of the knowledge graph; or (2-2) directly obtaining the business categories required by the knowledge graph from the industry's internal classification; or (2-3) summarizing relatively coarse business categories by querying industry data online.
4. The method of constructing a knowledge graph according to claim 1, wherein: in step (3), the attribution relation schematic diagram is established according to the determined business category names, and during this process business categories are deleted or added until the diagram is complete.
5. The method of constructing a knowledge graph according to claim 1, wherein: step (4) specifically comprises (4-1) searching for keywords: obtaining keywords for the corresponding technical fields by querying the industry's related business online;
(4-2) checking the keywords: compiling an authoritative keyword list, either by consulting industry experts and practitioners or by collecting and sorting the titles of the industry's public announcements, and then screening and checking the keywords found online against that list;
(4-3) performing word segmentation: splitting the checked keywords into minimal words, where a two-character word is the smallest unit, except for proper nouns.
6. The method of constructing a knowledge graph according to claim 1, wherein: in step (5), textualizing the business relations means finally confirming the attribution relations in text form, so that an importable knowledge graph business relation document can be conveniently generated.
7. The method of constructing a knowledge graph according to claim 1, wherein: in step (6), experts give their opinions and modifications are made, and this step can be repeated several times to obtain the best knowledge graph.
CN202010400800.8A 2020-05-13 2020-05-13 Method for constructing knowledge graph Pending CN111581398A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010400800.8A CN111581398A (en) 2020-05-13 2020-05-13 Method for constructing knowledge graph

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010400800.8A CN111581398A (en) 2020-05-13 2020-05-13 Method for constructing knowledge graph

Publications (1)

Publication Number Publication Date
CN111581398A true CN111581398A (en) 2020-08-25

Family

ID=72113554

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010400800.8A Pending CN111581398A (en) 2020-05-13 2020-05-13 Method for constructing knowledge graph

Country Status (1)

Country Link
CN (1) CN111581398A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106447346A (en) * 2016-08-29 2017-02-22 北京中电普华信息技术有限公司 Method and system for construction of intelligent electric power customer service system
CN108345647A (en) * 2018-01-18 2018-07-31 北京邮电大学 Domain knowledge map construction system and method based on Web
CN109255034A (en) * 2018-08-08 2019-01-22 数据地平线(广州)科技有限公司 A kind of domain knowledge map construction method based on industrial chain
CN110148043A (en) * 2019-03-01 2019-08-20 安徽省优质采科技发展有限责任公司 The bid and purchase information recommendation system and recommended method of knowledge based map

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
郑人杰 等: "软件工程概论", 机械工业出版社, pages: 360 - 115 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113468340A (en) * 2021-06-28 2021-10-01 北京众标智能科技有限公司 Construction system and construction method of industrial knowledge map
CN113468340B (en) * 2021-06-28 2024-05-07 北京众标智能科技有限公司 Construction system and construction method of industrial knowledge graph

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20200825