CN112966053B - Knowledge graph-based marine field expert database construction method and device - Google Patents

Info

Publication number
CN112966053B
Authority
CN
China
Prior art keywords
knowledge
information
keyword
marine
group
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010988719.6A
Other languages
Chinese (zh)
Other versions
CN112966053A (en)
Inventor
魏志强
吴佳静
贾东宁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ocean University of China
Qingdao National Laboratory for Marine Science and Technology Development Center
Original Assignee
Ocean University of China
Qingdao National Laboratory for Marine Science and Technology Development Center
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ocean University of China, Qingdao National Laboratory for Marine Science and Technology Development Center
Priority to CN202010988719.6A
Publication of CN112966053A
Application granted
Publication of CN112966053B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28 Databases characterised by their database models, e.g. relational or object models
    • G06F16/284 Relational databases
    • G06F16/288 Entity relationship models
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/248 Presentation of query results
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/289 Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295 Named entity recognition
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/044 Recurrent networks, e.g. Hopfield networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The application discloses a knowledge graph-based method and device for constructing a marine field expert database. The method comprises the following steps: constructing a marine field expert semantic model, wherein the model comprises at least one keyword information group and association relationship information; acquiring a marine information database; extracting, according to each keyword information group, the information in the marine information database related to that group to serve as an extracted knowledge information base, wherein each keyword information group yields one extracted knowledge information base and each extracted knowledge information base comprises at least one piece of knowledge information; extracting, according to the association relationship information, the relationships between pieces of knowledge information so as to generate a knowledge graph; and generating a visual map according to the knowledge graph. The method constructs a knowledge graph of the marine field, supports marine professional knowledge services, and lets a user inspect the knowledge graph conveniently through the visual map.

Description

Knowledge graph-based marine field expert database construction method and device
Technical Field
The invention relates to the technical field of marine field information collection, in particular to a marine field expert database construction method based on a knowledge graph and a marine field expert database construction device based on the knowledge graph.
Background
Since 1983, the number of marine research institutions has grown to more than 200. They belong to the Chinese Academy of Sciences, the State Oceanic Administration, the ministries responsible for agriculture, animal husbandry and fishery, geology and mineral resources, petroleum, and transportation, as well as to the coastal provinces, cities, and autonomous regions and to some institutions of higher education, and their research fields increasingly overlap and merge. With the development of science and technology in the marine field and the increasing complexity of the decision-making environment, the academic achievements of marine field experts and the academic exchange and cooperation among them play an important role in raising the level of scientific and technological development and innovation. However, this valuable expert information lacks a compact and effective organizational structure and an intuitive visual query mode, no corresponding knowledge system of marine field experts has been formed, and deep data mining and application are difficult to carry out.
Accordingly, it would be desirable to have a solution that overcomes or at least alleviates at least one of the above-mentioned difficulties of the prior art.
Disclosure of Invention
It is an object of the present invention to provide a method of constructing a knowledge-graph-based marine domain expert library that overcomes or at least alleviates at least one of the above-mentioned disadvantages of the prior art.
In one aspect of the invention, a method for constructing a knowledge graph-based marine field expert database is provided, and comprises the following steps:
constructing a marine field expert semantic model, wherein the marine field expert semantic model comprises at least one keyword information group and association relationship information;
acquiring a marine information database;
extracting, according to each keyword information group, the information in the marine information database related to that keyword information group to serve as an extracted knowledge information base, wherein each keyword information group yields one extracted knowledge information base, and each extracted knowledge information base comprises at least one piece of knowledge information;
extracting, according to the association relationship information, the relationships between one or more pieces of knowledge information and other knowledge information, thereby generating a knowledge graph;
and generating a visual map according to the knowledge graph.
Optionally, the method for constructing the marine field expert database based on the knowledge graph further includes:
generating a knowledge question-answering base according to the knowledge graph;
and performing human-computer interaction with the user according to the knowledge question-answering base.
Optionally, extracting, according to the keyword information groups, the information in the marine information database related to the keyword information groups as extracted knowledge information bases, wherein each keyword information group yields one extracted knowledge information base and each extracted knowledge information base includes at least one piece of knowledge information, includes:
identifying the text or picture content in the marine information database;
and extracting, according to the keyword information groups, the information in the identified marine information database related to the keyword information groups as the extracted knowledge information bases.
Optionally, the keyword information group includes:
a location keyword group, an organization keyword group, an academic achievement keyword group, a reference document keyword group, a research field keyword group, a paper keyword group, a marine field news keyword group, an education experience keyword group, a work experience keyword group, and a name keyword group;
the extracted knowledge information bases comprise a location knowledge information base extracted according to the location keyword group, an organization knowledge information base extracted according to the organization keyword group, an academic achievement knowledge information base extracted according to the academic achievement keyword group, a reference document knowledge information base extracted according to the reference document keyword group, a research field knowledge information base extracted according to the research field keyword group, and likewise one knowledge information base for each remaining keyword group, through to a work experience knowledge information base extracted according to the work experience keyword group and a name knowledge information base extracted according to the name keyword group.
Optionally, the identifying the text or picture content in the marine information database includes:
and identifying the text content in the marine information database by a Bi-LSTM-CRF algorithm and a vocabulary-based Bidirectional Maximum Matching (BMM) algorithm.
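The dictionary-based Bidirectional Maximum Matching step named above can be illustrated with a minimal sketch. The patent does not give its vocabulary or its rule for choosing between the two scans, so the tie-breaking rule used here (fewer tokens, then fewer single-character tokens, then the backward result) is a commonly used heuristic and an assumption, and the vocabulary is a standard textbook example rather than data from the patent.

```python
def forward_max_match(text, vocab, max_len):
    """Scan left to right, always taking the longest vocabulary word."""
    tokens, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # A single character is always accepted as a fallback token.
            if size == 1 or piece in vocab:
                tokens.append(piece)
                i += size
                break
    return tokens


def backward_max_match(text, vocab, max_len):
    """Scan right to left, always taking the longest vocabulary word."""
    tokens, j = [], len(text)
    while j > 0:
        for size in range(min(max_len, j), 0, -1):
            piece = text[j - size:j]
            if size == 1 or piece in vocab:
                tokens.append(piece)
                j -= size
                break
    tokens.reverse()
    return tokens


def bidirectional_max_match(text, vocab):
    """Run both scans and keep the segmentation the heuristic prefers."""
    max_len = max(map(len, vocab))
    fwd = forward_max_match(text, vocab, max_len)
    bwd = backward_max_match(text, vocab, max_len)
    if len(fwd) != len(bwd):
        return fwd if len(fwd) < len(bwd) else bwd

    def singles(tokens):
        return sum(len(t) == 1 for t in tokens)

    # On a tie in single-character count, prefer the backward scan.
    return fwd if singles(fwd) < singles(bwd) else bwd


# Hypothetical vocabulary; the two scans disagree on this sentence.
vocab = {"研究", "研究生", "生命", "起源"}
segments = bidirectional_max_match("研究生命起源", vocab)
```

Here the forward scan greedily takes "研究生" and strands the single character "命", while the backward scan produces three dictionary words, so the heuristic keeps the backward result.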
Optionally, extracting, according to the association relationship information, the relationships between one or more pieces of knowledge information and other knowledge information so as to generate the knowledge graph includes: using the PrTransH algorithm to extract the relationships between one or more pieces of knowledge information and other knowledge information.
Optionally, the generating a visualization graph according to the knowledge graph comprises:
establishing a clustering model;
and inputting part of or all information in the knowledge graph to the clustering model so as to generate a visual clustering graph.
The application also provides a knowledge graph-based marine field expert database construction device, which comprises:
a marine field expert semantic model building module, configured to build a marine field expert semantic model, wherein the marine field expert semantic model comprises at least one keyword information group and association relationship information;
a marine information database acquisition module, configured to acquire a marine information database;
a knowledge information extraction module, configured to extract, according to each keyword information group, the information in the marine information database related to that keyword information group to serve as an extracted knowledge information base, wherein each keyword information group yields one extracted knowledge information base, and each extracted knowledge information base comprises at least one piece of knowledge information;
a knowledge graph generating module, configured to extract, according to the association relationship information, the relationships between one or more pieces of knowledge information and other knowledge information so as to generate a knowledge graph;
and a visual map generation module, configured to generate a visual map according to the knowledge graph.
Optionally, the knowledge graph-based marine field expert database construction device further includes: a knowledge question-answering base generation module, configured to generate a knowledge question-answering base according to the knowledge graph;
and a human-computer interaction module, configured to carry out human-computer interaction with the user according to the knowledge question-answering base.
The application also provides an electronic device, which comprises a memory, a processor and a computer program stored in the memory and capable of running on the processor, wherein the processor executes the computer program to realize the method for constructing the knowledge-graph-based marine domain expert library.
The present application further provides a computer-readable storage medium storing a computer program, which when executed by a processor, can implement the method for constructing a knowledge-graph-based marine domain expert library as described above.
Advantageous effects
The method for constructing the marine field expert database based on the knowledge graph constructs a knowledge graph of the marine field. It thereby gives relevant personnel an efficient way to search for knowledge, lays a foundation for discovering association relationships among pieces of knowledge, and ultimately supports marine professional knowledge services; the visual map makes the result convenient for users to inspect.
Drawings
Fig. 1 is a schematic flowchart of a method for constructing a knowledge-graph-based marine domain expert database according to a first embodiment of the present invention.
Fig. 2 is a visualization formed by using the method for constructing the knowledge-graph-based marine domain expert database shown in fig. 1.
FIG. 3 is another visualization formed using the knowledge-graph-based marine domain expert library construction method shown in FIG. 1.
Fig. 4 is a schematic flowchart of building the knowledge question-answering base with the knowledge-graph-based marine domain expert database construction method shown in Fig. 1.
Fig. 5 is a schematic diagram of the BiLSTM + CRF algorithm.
Detailed Description
In order to make the implementation objects, technical solutions and advantages of the present application clearer, the technical solutions in the embodiments of the present application will be described in more detail below with reference to the drawings in the embodiments of the present application. In the drawings, the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The described embodiments are a subset of the embodiments in the present application and not all embodiments in the present application. The embodiments described below with reference to the drawings are exemplary and intended to be used for explaining the present application and should not be construed as limiting the present application. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments in the present application without making any creative effort belong to the protection scope of the present application. Embodiments of the present application will be described in detail below with reference to the accompanying drawings.
In the description of the present application, it is to be understood that the terms "central," "longitudinal," "lateral," "front," "rear," "left," "right," "vertical," "horizontal," "top," "bottom," "inner," "outer," and the like are used in the orientation or positional relationship indicated in the drawings for convenience in describing the present application and for simplicity in description, and are not intended to indicate or imply that the referenced devices or elements must have a particular orientation, be constructed in a particular orientation, and be operated in a particular manner and are not to be considered limiting of the scope of the present application.
Fig. 1 is a schematic flow chart of a method for constructing a knowledge-graph-based marine domain expert database according to a first embodiment of the present invention.
The method for constructing the marine field expert database based on the knowledge graph as shown in FIG. 1 comprises the following steps:
Step 1: constructing a marine field expert semantic model, wherein the marine field expert semantic model comprises at least one keyword information group and association relationship information;
Step 2: acquiring a marine information database;
Step 3: extracting, according to each keyword information group, the information in the marine information database related to that keyword information group to serve as an extracted knowledge information base, wherein each keyword information group yields one extracted knowledge information base, and each extracted knowledge information base comprises at least one piece of knowledge information;
Step 4: extracting, according to the association relationship information, the relationships between one or more pieces of knowledge information and other knowledge information, thereby generating a knowledge graph;
Step 5: generating a visual map according to the knowledge graph.
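The five steps above can be sketched end to end as follows. This is only an illustration under stated assumptions: the names `SemanticModel`, `extract_knowledge_bases`, `build_knowledge_graph`, and `to_node_link` are hypothetical, keyword matching is plain substring search, and step 4 links two pieces of knowledge when they co-occur in one document, which stands in for the learned relation extractor the patent describes. The sample documents are invented.

```python
from dataclasses import dataclass


@dataclass
class SemanticModel:
    keyword_groups: dict   # group name -> list of keywords (step 1)
    relations: list        # (subject group, relation name, object group)


def extract_knowledge_bases(model, documents):
    """Step 3: build one extracted knowledge base per keyword group."""
    return {
        group: {kw for kw in keywords if any(kw in doc for doc in documents)}
        for group, keywords in model.keyword_groups.items()
    }


def build_knowledge_graph(model, bases, documents):
    """Step 4: emit (subject, relation, object) triples.

    Assumption: two pieces of knowledge are related when they co-occur
    in one document. The patent uses a learned extractor instead.
    """
    triples = set()
    for subj_group, relation, obj_group in model.relations:
        for doc in documents:
            for subj in bases[subj_group]:
                for obj in bases[obj_group]:
                    if subj in doc and obj in doc:
                        triples.add((subj, relation, obj))
    return triples


def to_node_link(triples):
    """Step 5: reshape triples into nodes and links for a graph renderer."""
    nodes = sorted({t[0] for t in triples} | {t[2] for t in triples})
    links = [{"source": s, "label": r, "target": o}
             for s, r, o in sorted(triples)]
    return {"nodes": nodes, "links": links}


# Steps 1 and 2 with hypothetical data.
model = SemanticModel(
    keyword_groups={"name": ["Expert A", "Expert B"],
                    "organization": ["Institute X", "Institute Y"]},
    relations=[("name", "works_at", "organization")],
)
documents = ["Expert A works at Institute X on ocean sensors.",
             "Expert B joined Institute Y in 2015."]

bases = extract_knowledge_bases(model, documents)
graph = build_knowledge_graph(model, bases, documents)
view = to_node_link(graph)
```

The node and link dictionary returned by `to_node_link` is a generic shape that a front-end graph renderer could consume; it is not tied to any particular charting library.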
The method for constructing the marine field expert database based on the knowledge graph constructs a knowledge graph of the marine field. It thereby gives relevant personnel an efficient way to search for knowledge, lays a foundation for discovering association relationships among pieces of knowledge, and ultimately supports marine professional knowledge services; the visual map makes the result convenient for users to inspect.
In this embodiment, the method for constructing the marine domain expert database based on the knowledge graph further includes:
Step 6: generating a knowledge question-answering base according to the knowledge graph;
Step 7: performing human-computer interaction with the user according to the knowledge question-answering base.
In the method for constructing the marine field expert database based on the knowledge graph, all knowledge information is obtained first; named entity recognition, relation extraction, and attribute extraction are then carried out by machine learning algorithms, and the knowledge information is fused and stored. The knowledge graph is generated from the knowledge information and the association relationship information, which effectively integrates the marine field expert knowledge; the visual map and intelligent question answering are then realized on the basis of the constructed knowledge graph.
In this embodiment, extracting, according to the keyword information groups, the information in the marine information database related to the keyword information groups as extracted knowledge information bases, wherein each keyword information group yields one extracted knowledge information base and each extracted knowledge information base includes at least one piece of knowledge information, includes:
identifying the text or picture content in the marine information database;
and extracting, according to the keyword information groups, the information in the identified marine information database related to the keyword information groups as the extracted knowledge information bases.
In this embodiment, the keyword information group includes:
a location keyword group, an organization keyword group, an academic achievement keyword group, a reference document keyword group, a research field keyword group, a paper keyword group, a marine field news keyword group, an education experience keyword group, a work experience keyword group, and a name keyword group;
the extracted knowledge information bases comprise a location knowledge information base extracted according to the location keyword group, an organization knowledge information base extracted according to the organization keyword group, an academic achievement knowledge information base extracted according to the academic achievement keyword group, a reference document knowledge information base extracted according to the reference document keyword group, a research field knowledge information base extracted according to the research field keyword group, and likewise one knowledge information base for each remaining keyword group, through to a work experience knowledge information base extracted according to the work experience keyword group and a name knowledge information base extracted according to the name keyword group.
For example, a user may need specific information about professors in the field of marine biological big data. The main content of expert information collection comprises three parts: basic information, achievement information, and contact information. The basic information comprises the expert's name, gender, date of birth (year and month), education level, academic degree, professional technical title, and the like; the achievement information comprises the expert's papers, books, patents, undertaken projects, and the like.
As another example, a user may need to know which experts in the field of ocean sensors can provide domain knowledge, together with the names of their institutions.
In these cases, the method for acquiring marine field expert information specifically comprises the following steps:
Step 1: constructing an expert semantic model, wherein the expert semantic model comprises keyword information and association relationship information. For example, the keyword information includes name (for example, the names of well-known scholars may be added), gender (male, female), date of birth (for example, numeric and textual information in year-month order), education level (for example, undergraduate, postgraduate, doctorate), professional title (for example, researcher, academician), papers, patents (for example, marked with a ZL number, or as an invention, utility model, and so on), address, contact telephone, and e-mail.
Step 2: acquiring expert information according to the semantic model, wherein the acquired content mainly comprises three parts: basic information, achievement information, and contact information. The basic information comprises the expert's name, gender, date of birth (year and month), education level, academic degree, professional technical title, and the like; the achievement information comprises the expert's papers, books, patents, undertaken projects, and the like; the contact information comprises the expert's mailing address, contact telephone, e-mail address, and the like.
This is the core part of the application. Construction of the expert knowledge graph starts with abstracting the relationships between entities such as experts and patents and their attributes. The schema diagram of the system is determined from an analysis of the attributes of experts and the relationships among them: experts are the principal actors behind documents, patents, information, and projects, which in turn belong to experts; relationships such as colleague and cooperation exist among experts; and establishing the relationship between every pair of experts yields the expert graph network. For example, in the relationship model diagram, an expert is an entity and a patent is an entity; the expert owns the patent, and the owner of the patent is the expert. Each entity has its own attributes, for example the author, content, organization, and time of a patent. If the author attribute of a document lists expert 1 and expert 2, the two are in a cooperation relationship; if the organization names in the basic information of expert 1 and expert 2 are the same, they are in a colleague relationship; if two patents have the same subject, their authors, expert 1 and expert 2, are in a same-field (peer) relationship; and so on.
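The three example rules in this paragraph (shared authorship, shared organization, shared subject) can be written down directly. The sketch below is illustrative only: the function name, the relation labels `cooperates_with`, `colleague_of`, and `same_subject`, and the record layout are assumptions, not names taken from the patent.

```python
from itertools import combinations


def infer_expert_relations(experts, works):
    """Derive expert-to-expert relations from attributes.

    experts: expert name -> organization name (from basic information)
    works:   list of dicts with "authors" (list of names) and "subject"
    """
    relations = set()

    # Rule 1: co-authors of one document or patent cooperate.
    for work in works:
        for a, b in combinations(sorted(work["authors"]), 2):
            relations.add((a, "cooperates_with", b))

    # Rule 2: experts with the same organization name are colleagues.
    for a, b in combinations(sorted(experts), 2):
        if experts[a] == experts[b]:
            relations.add((a, "colleague_of", b))

    # Rule 3: authors of two different works on the same subject are linked.
    for w1, w2 in combinations(works, 2):
        if w1["subject"] == w2["subject"]:
            for a in w1["authors"]:
                for b in w2["authors"]:
                    if a != b:
                        relations.add((min(a, b), "same_subject", max(a, b)))
    return relations


# Hypothetical records.
experts = {"Expert 1": "Org A", "Expert 2": "Org A", "Expert 3": "Org B"}
works = [
    {"authors": ["Expert 1", "Expert 3"], "subject": "ocean sensors"},
    {"authors": ["Expert 2"], "subject": "ocean sensors"},
]
relations = infer_expert_relations(experts, works)
```

Each pair is stored once, in sorted name order, because all three relations are symmetric.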
Step 3: generating a visual map according to the association graph information or the expert information. Specifically, in one embodiment, a visual map such as a star map or a relationship map may be produced with mapping software.
It will be appreciated that such mapping is based on the attributes and relationships described above.
In this embodiment, identifying the text or picture content in the marine information database comprises identifying the text content with a Bi-LSTM-CRF algorithm and a vocabulary-based Bidirectional Maximum Matching (BMM) algorithm. Specifically, the text content is identified with the BiLSTM + CRF algorithm (FIG. 5). A BiLSTM alone predicts, for each word, the probability of each label, and Softmax then selects the label with the highest probability as the prediction for that position. Each position is therefore predicted independently, and the dependencies between labels are ignored. BiLSTM + CRF adds a CRF layer on top of the BiLSTM output layer so that the model can take the correlation between labels into account; this correlation is the transition matrix of the CRF, which represents the probability of moving from one label to the next. BiLSTM + CRF thus scores the whole label path rather than each label in isolation, and adding the CRF to the BiLSTM output layer makes the recognition of marine field experts more accurate.
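The difference between per-position prediction and path-level prediction can be shown with a small decoding sketch. It assumes the BiLSTM has already produced a score for every label at every position (the `emissions` values below are made up) and that the CRF contributes a transition score matrix (also made up); training of either component is out of scope. The path-level decoder is standard Viterbi decoding.

```python
LABELS = ["O", "B-PER", "I-PER"]


def greedy_decode(emissions):
    """Pick the best label at each position independently (BiLSTM + Softmax)."""
    return [max(range(len(row)), key=row.__getitem__) for row in emissions]


def viterbi_decode(emissions, transitions):
    """Pick the label path with the best total score (BiLSTM + CRF).

    transitions[i][j] is the score of moving from label i to label j.
    """
    n_labels = len(emissions[0])
    score = list(emissions[0])
    backpointers = []
    for row in emissions[1:]:
        new_score, pointers = [], []
        for j in range(n_labels):
            best_i = max(range(n_labels),
                         key=lambda i: score[i] + transitions[i][j])
            new_score.append(score[best_i] + transitions[best_i][j] + row[j])
            pointers.append(best_i)
        score = new_score
        backpointers.append(pointers)

    # Trace the best final label back to the first position.
    best = max(range(n_labels), key=score.__getitem__)
    path = [best]
    for pointers in reversed(backpointers):
        best = pointers[best]
        path.append(best)
    path.reverse()
    return path


# Hypothetical scores for a three-token expert name.
emissions = [
    [0.1, 2.0, 0.3],
    [1.0, 0.2, 0.9],   # "O" narrowly beats "I-PER" when taken in isolation
    [0.2, 0.1, 1.5],
]
transitions = [
    [0.0, 0.0, -10.0],  # from O: jumping straight to I-PER is penalized
    [0.0, 0.0, 1.0],    # from B-PER: continuing with I-PER is rewarded
    [0.0, 0.0, 0.5],    # from I-PER
]

greedy_path = [LABELS[i] for i in greedy_decode(emissions)]
crf_path = [LABELS[i] for i in viterbi_decode(emissions, transitions)]
```

With these numbers the independent prediction yields the ill-formed sequence B-PER, O, I-PER, while the path-level decoder yields B-PER, I-PER, I-PER, because the transition scores make the O to I-PER step too costly.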
In this embodiment, extracting, according to the association relationship information, the relationships between one or more pieces of knowledge information and other knowledge information so as to generate the knowledge graph includes: using the Bi-LSTM-CRF algorithm to extract the relationships between one or more pieces of knowledge information and other knowledge information.
In this embodiment, the generating a visualization graph according to the knowledge graph includes:
establishing a clustering model;
and inputting part of or all of the information in the knowledge graph to the clustering model so as to generate a visual clustering graph.
The method first constructs entities with an improved entity construction method, then identifies entities, attributes, and relationships with Bi-LSTM-CRF-based named entity recognition (NER), so that knowledge is acquired and extracted from the marine information database efficiently and accurately. On the basis of the established knowledge base, the application visualizes the marine expert knowledge base with ECharts and, building on that visualization, converts natural language questions into the Cypher query language of the Neo4j graph database by corpus matching, thereby querying the knowledge in the marine field expert knowledge base.
The constructed marine field expert knowledge graph is centered on the expert entity. Data extraction therefore has to cover the field, direction, and results of each expert's research, and the provinces and research institutions of the different marine areas serve as the organizational structure for classifying the information. Accordingly, an expert list and the corresponding URL list are crawled from vertical-domain websites; the listed URLs are then visited in turn, and the page content is parsed to obtain the province and city, the organization, and the academic achievements of each marine expert, locating in turn the original information that needs to be acquired.
The original marine expert data obtained in this way is stored in a Neo4j graph database. Because of the particularity of marine expert data, marine field expert entity recognition needs to be carried out on it. The recognition result is evaluated by the precision P = TP / (TP + FP), the recall R = TP / (TP + FN), and the F1 value F1 = 2PR / (P + R), where TP, FP, and FN denote the numbers of true positives, false positives, and false negatives respectively.
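As a worked illustration of the three evaluation measures, the sketch below scores a predicted set of entity spans against a gold set. Representing an entity as a `(start, end, type)` tuple and counting only exact matches as true positives is an assumption made for the example; the patent does not state its matching criterion, and the spans are invented.

```python
def precision_recall_f1(gold, predicted):
    """Entity-level precision, recall, and F1 over sets of entity spans."""
    tp = len(gold & predicted)        # predicted and correct
    fp = len(predicted - gold)        # predicted but wrong
    fn = len(gold - predicted)        # correct but missed
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical spans: (start offset, end offset, entity type).
gold = {(0, 2, "PER"), (5, 9, "ORG"), (12, 14, "LOC"), (20, 22, "PER")}
predicted = {(0, 2, "PER"), (5, 9, "ORG"), (15, 17, "LOC")}

p, r, f1 = precision_recall_f1(gold, predicted)
```

Here TP = 2, FP = 1, and FN = 2, so P = 2/3, R = 1/2, and F1 = 4/7.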
Relation extraction for marine field experts: the purpose of relation extraction is to extract triples consisting of two entities and the relationship between them. This patent extracts nine relationships: marine field expert and region, marine field expert and organization, expert and research field, expert and field reference document, expert and paper author, expert and marine field news, expert and education experience, expert and work experience, and expert and awards.
For relation extraction, this patent uses the Bi-LSTM-CRF algorithm.
A clustering model is established for the expert information. To verify the effectiveness of the trained embedding vectors, they are used to cluster experts: DBSCAN clusters 200 experts in the embedding space, and the embedding vectors are then projected into a two-dimensional space to visualize the expert clusters. As shown in FIG. 2, the experts fall into distinct clusters and experts in the same field are grouped together, which shows that the learned graph embedding vectors can serve as semantic representations of experts. Four typical clusters are circled in FIG. 2 and labeled with marine expert field categories.
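A minimal, self-contained DBSCAN sketch shows how density-based clustering groups nearby embedding vectors and marks isolated ones as noise. The `eps` and `min_pts` values and the two-dimensional points are hypothetical stand-ins; the patent does not give its parameters or embedding dimension, and the projection to two dimensions for plotting is not shown here.

```python
import math


def dbscan(points, eps, min_pts):
    """Return one cluster id per point; -1 marks noise."""
    UNVISITED, NOISE = None, -1
    labels = [UNVISITED] * len(points)

    def neighbors(i):
        # Indices of all points within eps of point i (including i itself).
        return [j for j, q in enumerate(points)
                if math.dist(points[i], q) <= eps]

    cluster = -1
    for i in range(len(points)):
        if labels[i] is not UNVISITED:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = NOISE          # may later become a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster    # border point: join, do not expand
            if labels[j] is not UNVISITED:
                continue
            labels[j] = cluster
            j_neighbors = neighbors(j)
            if len(j_neighbors) >= min_pts:
                queue.extend(j_neighbors)   # core point: keep expanding
    return labels


# Hypothetical expert embeddings: two dense groups and one outlier.
embeddings = [(0, 0), (0, 1), (1, 0),
              (10, 10), (10, 11), (11, 10),
              (50, 50)]
cluster_ids = dbscan(embeddings, eps=1.5, min_pts=3)
```

In practice a library implementation would normally be used on the real embedding vectors; the hand-written version is only meant to make the core-point and border-point logic visible.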
After the marine field knowledge information has been extracted, the marine field expert information needs to be displayed visually. The knowledge graph is visualized using the ECharts visualization scheme, and the resulting display is shown in FIG. 3.
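Assuming the visualization scheme named above refers to the ECharts charting library, the knowledge graph can be handed to the front end as a `graph` series option. The sketch below builds such an option as a plain dictionary and serializes it to JSON; the function name, the two node categories, and the sample triples are illustrative assumptions.

```python
import json


def build_graph_option(triples):
    """Builds an ECharts-style option dict for a force-directed graph.

    triples: iterable of (expert, relation, value) tuples.
    """
    nodes, links = {}, []
    for head, relation, tail in triples:
        # Category 0 = expert node, category 1 = attribute node.
        nodes.setdefault(head, {"name": head, "category": 0})
        nodes.setdefault(tail, {"name": tail, "category": 1})
        links.append({"source": head, "target": tail, "value": relation})
    return {
        "series": [{
            "type": "graph",
            "layout": "force",
            "roam": True,
            "label": {"show": True},
            "categories": [{"name": "Expert"}, {"name": "Attribute"}],
            "data": list(nodes.values()),
            "links": links,
        }]
    }


option = build_graph_option([
    ("Expert A", "RESEARCHES", "physical oceanography"),
    ("Expert A", "BELONGS_TO", "Institute X"),
])
option_json = json.dumps(option, ensure_ascii=False)
```

The JSON string would be passed to the chart's `setOption` call in the browser; node names double as link endpoints, so they must be unique.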
Referring to FIG. 4, a complete marine field expert database is constructed for professional institutions, and a human-computer interaction interface needs to be built according to user requirements. Because asking and answering questions in natural language is the way people habitually communicate, a Chinese question answering model based on the marine field expert knowledge graph is constructed using machine learning techniques.
Natural language question answering begins with word vector processing: the natural language is first converted into a vector sequence through word vectors, and attribute linking is then performed through an entity alignment method based on a traditional probability model, which finally yields an expert knowledge corpus. Based on corpus matching, natural language questions are converted into the Cypher query language of the Neo4j graph database, the knowledge query is completed in the marine field expert knowledge graph, and the visualized query results are returned to the user. This realizes information retrieval and result display for marine experts, improves knowledge reasoning in the marine field, and provides a human-computer interaction service for the user.
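The final step, turning a matched question into a Cypher query, can be sketched with simple pattern templates. This is a deliberately simplified stand-in: it uses English regular expressions instead of the word-vector and corpus-matching pipeline described above, and the templates, node labels, and relation names are hypothetical.

```python
import re

# (question pattern, Cypher template) pairs; all names are illustrative.
TEMPLATES = [
    (
        re.compile(r"^which organization does (?P<name>.+?) belong to\??$", re.I),
        "MATCH (e:Expert {name: $name})-[:BELONGS_TO]->(o:Organization) "
        "RETURN o.name",
    ),
    (
        re.compile(r"^what does (?P<name>.+?) research\??$", re.I),
        "MATCH (e:Expert {name: $name})-[:RESEARCHES]->(f:ResearchField) "
        "RETURN f.name",
    ),
]


def question_to_cypher(question):
    """Returns (cypher, params) for the first matching template, else None."""
    text = question.strip()
    for pattern, cypher in TEMPLATES:
        match = pattern.match(text)
        if match:
            return cypher, {"name": match.group("name")}
    return None


result = question_to_cypher("What does Expert A research?")
```

Passing the expert name as a query parameter rather than splicing it into the query text avoids injection problems when the question comes from a user.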
In summary, the invention establishes a process for constructing a marine expert knowledge graph from a large amount of scattered network data, and verifies, through a domain clustering algorithm applied to the marine expert knowledge graph, the effectiveness of using embedding vectors as semantic representations of entities. The method incorporates a naive Bayes method to infer the fields in which marine experts work and to recommend related collaborators and papers. The patent can also be applied to searching and indexing marine expert papers and team members. In addition, the embedding vectors are fed into a neural network to realize knowledge transfer, enabling the application of neural networks to the knowledge graph.
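The naive Bayes inference of an expert's field can be illustrated with a small multinomial classifier over keywords. The patent does not specify features or training data, so the keyword lists, field names, Laplace smoothing, and class name below are all assumptions made for the sketch.

```python
import math
from collections import Counter, defaultdict


class NaiveBayesFieldClassifier:
    """Multinomial naive Bayes with Laplace smoothing over keyword lists."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha
        self.field_counts = Counter()            # training samples per field
        self.word_counts = defaultdict(Counter)  # keyword counts per field
        self.vocab = set()

    def fit(self, samples):
        """samples: iterable of (keywords, field) pairs."""
        for keywords, field in samples:
            self.field_counts[field] += 1
            self.word_counts[field].update(keywords)
            self.vocab.update(keywords)
        return self

    def predict(self, keywords):
        """Returns the field with the highest posterior log-probability."""
        total = sum(self.field_counts.values())
        best_field, best_score = None, -math.inf
        for field, count in self.field_counts.items():
            score = math.log(count / total)  # log prior
            denom = sum(self.word_counts[field].values()) + self.alpha * len(self.vocab)
            for word in keywords:
                score += math.log((self.word_counts[field][word] + self.alpha) / denom)
            if score > best_score:
                best_field, best_score = field, score
        return best_field


# Made-up training data for illustration only.
TRAINING = [
    (["tide", "current", "wave"], "physical oceanography"),
    (["current", "circulation"], "physical oceanography"),
    (["plankton", "fish"], "marine biology"),
    (["coral", "fish", "plankton"], "marine biology"),
]

classifier = NaiveBayesFieldClassifier().fit(TRAINING)
predicted = classifier.predict(["current", "tide"])
```

In practice the keywords might come from an expert's paper titles or project descriptions, and the predicted field could then drive collaborator and paper recommendations.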
The application also provides a knowledge graph-based marine field expert database construction device, which comprises a marine field expert semantic model construction module, a marine information database acquisition module, a knowledge graph generation module and a visual graph generation module, wherein,
the ocean field expert semantic model building module is used for building an ocean field expert semantic model, and the ocean field expert semantic model comprises at least one group of keyword information groups and incidence relation information;
the marine information database acquisition module is used for extracting information related to the keyword information groups in the marine information database according to the keyword information groups to serve as an extracted knowledge information base, wherein one extracted knowledge information base can be extracted from one group of keyword information groups, and each extracted knowledge information base comprises at least one piece of knowledge information;
the knowledge graph generating module is used for extracting the relation between one or more knowledge information and other knowledge information according to the incidence relation information so as to generate a knowledge graph;
and the visual map generation module is used for generating a visual map according to the knowledge map.
In this embodiment, the apparatus for constructing an ocean domain expert database based on a knowledge graph further comprises a knowledge question and answer base generating module and a human-computer interaction module, wherein the knowledge question and answer base generating module is used for generating a knowledge question and answer base according to the knowledge graph; and the human-computer interaction module is used for performing human-computer interaction with the user according to the knowledge question-answering base.
It should be noted that the foregoing explanations of the method embodiments are also applicable to the apparatus of this embodiment, and are not repeated herein.
The application also provides an electronic device, which comprises a memory, a processor and a computer program stored in the memory and capable of running on the processor, wherein the processor executes the computer program to realize the above method for constructing the knowledge-graph-based marine domain expert library.
For example, an electronic device includes an input device, an input interface, a central processing unit, a memory, an output interface, and an output device. The input interface, the central processing unit, the memory and the output interface are mutually connected through a bus, and the input equipment and the output equipment are respectively connected with the bus through the input interface and the output interface and further connected with other components of the computing equipment. Specifically, the input device receives input information from the outside and transmits the input information to the central processing unit through the input interface; the central processing unit processes the input information based on the computer executable instructions stored in the memory to generate output information, temporarily or permanently stores the output information in the memory, and then transmits the output information to the output device through the output interface; the output device outputs the output information to an exterior of the computing device for use by a user.
The application also provides a computer readable storage medium, which stores a computer program, and the computer program can realize the above method for constructing the ocean domain expert database based on the knowledge graph when being executed by a processor.
Although the present application has been described with reference to the preferred embodiments, it is not intended to limit the present application, and those skilled in the art can make variations and modifications without departing from the spirit and scope of the present application.
In a typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
The memory may include volatile memory in a computer-readable medium, such as random access memory (RAM), and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.
Computer-readable media include both permanent and non-permanent, removable and non-removable media that implement information storage by any method or technology. The information may be computer-readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, compact disc read-only memory (CD-ROM), digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information that can be accessed by a computing device.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
Furthermore, it will be obvious that the term "comprising" does not exclude other elements or steps. A plurality of units, modules or devices recited in the device claims may also be implemented by one unit or overall device by software or hardware. The terms first, second, etc. are used to identify names, but not any particular order.
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present application. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks identified in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The processor in this embodiment may be a Central Processing Unit (CPU), another general purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field-Programmable Gate Array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component, and so on. A general purpose processor may be a microprocessor, or the processor may be any conventional processor or the like.
The memory may be used to store computer programs and/or modules, and the processor may implement various functions of the apparatus/terminal device by executing or performing the computer programs and/or modules stored in the memory, as well as invoking data stored in the memory. The memory may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required by at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area may store data (such as audio data, a phonebook, etc.) created according to the use of the cellular phone, etc. In addition, the memory may include high-speed random access memory, and may also include non-volatile memory, such as a hard disk, a memory, a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), at least one magnetic disk storage device, a Flash memory device, or other volatile solid state storage device.
In this embodiment, the modules/units integrated in the apparatus/terminal device may be stored in a computer-readable storage medium if they are implemented in the form of software functional units and sold or used as separate products. Based on such understanding, all or part of the flow in the methods of the embodiments of the present invention may also be implemented by a computer program instructing related hardware, where the computer program may be stored in a computer-readable storage medium and, when executed by a processor, may implement the steps of the method embodiments. The computer program comprises computer program code, which may be in the form of source code, object code, an executable file, or some intermediate form. The computer-readable medium may include: any entity or device capable of carrying the computer program code, a recording medium, a USB flash drive, a removable hard disk, a magnetic disk, an optical disk, a computer memory, a read-only memory (ROM), a random access memory (RAM), an electrical carrier signal, a telecommunications signal, a software distribution medium, and so on. It should be noted that the content contained in the computer-readable medium may be appropriately increased or decreased as required by legislation and patent practice in the jurisdiction.
Although the invention has been described in detail hereinabove with respect to a general description and specific embodiments thereof, it will be apparent to those skilled in the art that modifications or improvements may be made thereto based on the invention. Accordingly, such modifications and improvements are intended to be within the scope of the invention as claimed.

Claims (7)

1. A method for constructing a marine field expert database based on a knowledge graph is characterized by comprising the following steps:
constructing an ocean domain expert semantic model, wherein the ocean domain expert semantic model comprises at least one group of keyword information groups and incidence relation information;
acquiring a marine information database;
extracting information related to the keyword information group in the ocean information database according to the keyword information group to serve as an extracted knowledge information base, wherein one extracted knowledge information base can be extracted from one group of keyword information groups, and each extracted knowledge information base comprises at least one piece of knowledge information;
extracting the relation between one or more knowledge information and other knowledge information according to the incidence relation information, thereby generating a knowledge graph;
generating a visual map according to the knowledge map; wherein,
the method for constructing the marine field expert database based on the knowledge graph further comprises the following steps:
generating a knowledge question-answering base according to the knowledge graph;
performing man-machine interaction with a user according to the knowledge question-answering library;
converting natural language into a vector sequence through word vectors, performing attribute linking through an entity alignment method based on a traditional probability model to finally obtain an expert knowledge corpus, converting natural language questions into the Cypher query language of a Neo4j graph database based on a corpus matching mode, completing knowledge query in the marine field expert knowledge graph, and returning a visual query result to a user;
the keyword information group includes:
a location keyword group, an organization mechanism keyword group, an academic achievement keyword group, a reference document keyword group, a research field keyword group, a thesis keyword group, a marine field news keyword group, an education experience keyword group, a work experience keyword group, and a name keyword group;
the extracted knowledge information base comprises a location knowledge information base extracted according to location key phrases, an organization knowledge information base extracted according to organization key phrases, an academic achievement knowledge information base extracted according to academic achievement key phrases, a reference knowledge information base extracted according to reference document key phrases, a research field knowledge information base extracted according to work experience key phrases and a name knowledge information base extracted according to name key phrases.
2. The method for constructing a knowledge-graph-based marine domain expert database as claimed in claim 1, wherein the extracting information related to the keyword information group in the marine information database according to the keyword information group as an extracted knowledge information base, wherein one extracted knowledge information base can be extracted from one group of keyword information groups, and each extracted knowledge information base comprises at least one piece of knowledge information, comprises:
identifying the text or picture content in the marine information database;
and extracting the information related to the keyword information group in the identified marine information database as an extracted knowledge information base according to the keyword information group.
3. The method for constructing a knowledge-graph-based marine field expert database according to claim 2, wherein the identifying text or picture contents in the marine information database comprises:
and identifying the text content in the marine information database by a Bi-LSTM-CRF algorithm and a vocabulary-based bidirectional maximum matching algorithm.
4. The method of constructing a knowledge-graph-based marine domain expert library according to claim 3, wherein the extracting relationships with other knowledge information for one or more of the knowledge information according to the association relationship information to generate the knowledge graph comprises: and extracting the relation between one or more knowledge information and other knowledge information by using the Bi-LSTM-CRF algorithm.
5. A knowledge graph-based marine field expert database construction device, characterized by comprising:
the marine field expert semantic model building module is used for building a marine field expert semantic model, and the marine field expert semantic model comprises at least one group of keyword information groups and incidence relation information;
the marine information database acquisition module is used for extracting information related to the keyword information group in the marine information database according to the keyword information group to serve as an extracted knowledge information base, wherein one extracted knowledge information base can be extracted from one group of keyword information groups, and each extracted knowledge information base comprises at least one piece of knowledge information;
the knowledge graph generating module is used for extracting the relation between one or more pieces of knowledge information and other knowledge information according to the incidence relation information so as to generate a knowledge graph;
the visual map generation module is used for generating a visual map according to the knowledge map; the marine field expert knowledge map construction device further comprises:
the knowledge question-answer base generation module is used for generating a knowledge question-answer base according to the knowledge graph;
the human-computer interaction module is used for performing human-computer interaction with a user according to the knowledge question-answering library; wherein,
converting natural language into a vector sequence through word vectors, performing attribute linking through an entity alignment method based on a traditional probability model to finally obtain an expert knowledge corpus, converting natural language questions into the Cypher query language of a Neo4j graph database based on a corpus matching mode, completing knowledge query in the marine field expert knowledge graph, and returning a visual query result to a user;
the keyword information group includes:
a location keyword group, an organization mechanism keyword group, an academic achievement keyword group, a reference document keyword group, a research field keyword group, a thesis keyword group, a marine field news keyword group, an education experience keyword group, a work experience keyword group, and a name keyword group;
the extracted knowledge information base comprises a location knowledge information base extracted according to location key phrases, an organization knowledge information base extracted according to organization key phrases, an academic achievement knowledge information base extracted according to academic achievement key phrases, a reference knowledge information base extracted according to reference document key phrases, a research field knowledge information base extracted according to work experience key phrases and a name knowledge information base extracted according to name key phrases.
6. An electronic device comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor, when executing the computer program, implements the method of constructing a knowledge-graph-based marine domain expert library as claimed in any one of claims 1 to 4.
7. A computer-readable storage medium storing a computer program, wherein the computer program, when executed by a processor, is capable of implementing the method for constructing a knowledge-graph-based marine domain expert library as claimed in any one of claims 1 to 4.
CN202010988719.6A 2020-09-18 2020-09-18 Knowledge graph-based marine field expert database construction method and device Active CN112966053B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010988719.6A CN112966053B (en) 2020-09-18 2020-09-18 Knowledge graph-based marine field expert database construction method and device

Publications (2)

Publication Number Publication Date
CN112966053A CN112966053A (en) 2021-06-15
CN112966053B true CN112966053B (en) 2023-04-18

Family

ID=76271039

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010988719.6A Active CN112966053B (en) 2020-09-18 2020-09-18 Knowledge graph-based marine field expert database construction method and device

Country Status (1)

Country Link
CN (1) CN112966053B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113495955A (en) * 2021-07-08 2021-10-12 北京明略软件系统有限公司 Expert pushing method, system, equipment and storage medium for document
CN116882538B (en) * 2023-05-26 2024-03-05 海南大学 Training method and related device for marine environment prediction model
CN116450856B (en) * 2023-06-19 2023-09-12 航天宏图信息技术股份有限公司 Meteorological ocean unstructured text knowledge construction method and device and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106909680A (en) * 2017-03-03 2017-06-30 中国科学技术信息研究所 A kind of sci tech experts information aggregation method of knowledge based tissue semantic relation
CN110598000A (en) * 2019-08-01 2019-12-20 达而观信息科技(上海)有限公司 Relationship extraction and knowledge graph construction method based on deep learning model
CN111274806A (en) * 2020-01-20 2020-06-12 医惠科技有限公司 Method and device for recognizing word segmentation and part of speech and method and device for analyzing electronic medical record
CN111341456A (en) * 2020-02-21 2020-06-26 中南大学湘雅医院 Method and device for generating diabetic foot knowledge map and readable storage medium

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
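Research on the Construction of Expert Knowledge Graphs; Zhou Xiangchao et al.; Computer Knowledge and Technology; 20160511; Vol. 12 (No. 07); pp. 195-196 *
Zhou Xiangchao et al. Research on the Construction of Expert Knowledge Graphs. Computer Knowledge and Technology. 2016, Vol. 12 (No. 07), *
Intelligent Question Answering System for Animated Films Based on Bi-LSTM; Huang Dongjin et al.; Modern Film Technology; 20200511 (No. 05); full text *
Knowledge Graph Construction Method for the Carbon Trading Domain; Wang Liangyu; Computer and Modernization; 20180815 (No. 08); full text *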

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant