CN112163094B - Scientific and technological resource convergence and continuous service method and device - Google Patents

Scientific and technological resource convergence and continuous service method and device Download PDF

Info

Publication number
CN112163094B
CN112163094B CN202010865075.1A CN202010865075A CN112163094B CN 112163094 B CN112163094 B CN 112163094B CN 202010865075 A CN202010865075 A CN 202010865075A CN 112163094 B CN112163094 B CN 112163094B
Authority
CN
China
Prior art keywords
scientific
entity
technological
knowledge
technology
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010865075.1A
Other languages
Chinese (zh)
Other versions
CN112163094A (en
Inventor
杜一
朱小杰
宋东泽
周园春
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Computer Network Information Center of CAS
Original Assignee
Computer Network Information Center of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Computer Network Information Center of CAS filed Critical Computer Network Information Center of CAS
Priority to CN202010865075.1A priority Critical patent/CN112163094B/en
Publication of CN112163094A publication Critical patent/CN112163094A/en
Application granted granted Critical
Publication of CN112163094B publication Critical patent/CN112163094B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • G06N5/022Knowledge engineering; Knowledge acquisition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • G06N5/027Frames

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Databases & Information Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to a scientific and technological resource convergence and continuous service method and device. The method comprises the following steps: extracting a scientific and technological entity according to the specific field knowledge base and storing the scientific and technological entity to a to-be-updated scientific and technological entity base; the technology entity library to be updated makes a data request to the knowledge map of the central technology field; the knowledge graph of the central science and technology field returns to the knowledge graph of the specific science and technology field according to various parameters in the data request; and fusing the knowledge map of the specific scientific and technological field with the knowledge base of the specific field to update the knowledge base of the specific field. The invention can effectively reduce the cost of operation and maintenance, resource purchase and the like of the knowledge base or the knowledge service system in specific subject and field directions, and improve the knowledge service efficiency; the quality of the knowledge graph in the central science and technology field can be continuously enhanced, and more accurate data and knowledge service can be provided for more users on the premise of being authorized; the knowledge base can be popularized to the maximum extent, and the coverage and the application range of knowledge services in different industries are improved.

Description

Scientific and technological resource convergence and continuous service method and device
Technical Field
The invention belongs to the technical field of information, and particularly relates to a scientific and technological resource convergence and continuous service method and device.
Background
The knowledge graph in the scientific and technological field is a knowledge network which is constructed by utilizing open scientific and technological resources such as open documents, patents and the like and contains personnel, organizations, achievements and projects. A typical scientific domain knowledge graph structure is shown in figure 1.
Different organizations construct knowledge service systems serving respective subject and field directions, and the knowledge base bottom layer uses a relational database, a network database and the like for storage and provides knowledge services externally. The scientific and technical field knowledge data stored in the form of a relational database, a network database and the like is called a specific field knowledge base.
At present, knowledge bases or knowledge service systems serving respective subjects and field directions are constructed in many fields, however, the continuous and effective updating of data brings about the improvement of operation and maintenance and the purchasing cost of scientific and technological resources, so that more knowledge bases or knowledge service systems cannot continuously maintain service and provide the latest knowledge. Some open scientific and technical resources are present, but the continuous updating and service of knowledge bases in respective disciplines and domain directions cannot be performed.
Disclosure of Invention
Aiming at the problems, the invention provides a method and a device capable of realizing scientific and technological resource convergence and continuous service, and through the implementation of the scheme, the operation and maintenance costs, resource purchase costs and the like of a knowledge base or a knowledge service system in a specific subject and field direction can be effectively reduced, and the knowledge service efficiency is improved.
The technical scheme adopted by the invention is as follows:
a scientific and technological resource convergence and continuous service method comprises the following steps:
extracting a scientific and technological entity according to the specific field knowledge base and storing the scientific and technological entity to a to-be-updated scientific and technological entity base;
a technology entity library to be updated carries out data request to a knowledge graph of the central technology field;
the central scientific and technological field knowledge graph returns to the specific scientific and technological field knowledge graph according to various parameters in the data request;
and fusing the knowledge map of the specific scientific and technological field with the knowledge base of the specific field to update the knowledge base of the specific field.
Further, the extracting the technology entities according to the specific domain knowledge base and storing the technology entities to be updated in the technology entity base includes:
the entity types and the relationships existing in the knowledge base of the specific field are sorted according to the knowledge map structure of the central science and technology field, and a list of the entities and the relationships existing in the knowledge base of the specific field is output;
organizing entities and relationship lists existing in a knowledge base of a specific field into a structure matched with a knowledge map structure subset of the central science and technology field;
and taking the matched structure as a standard, extracting related entities and relations from the knowledge base of the specific field, and storing the related entities and relations into a science and technology entity base to be updated.
Further, the technology entity library to be updated comprises technology entity categories and technology entity attributes; the scientific and technological entity category comprises scientific and technological achievements, research fields, keywords, scientific researchers, scientific projects, scientific research institutions and periodicals; the scientific and technological entity attributes comprise main scientific and technological entity attributes, external auxiliary scientific and technological entity attributes and internal auxiliary scientific and technological entity attributes; the attribute of the main technical entity is an essential input item, a mark used for carrying out unique identification on the technical entity in the specific field knowledge base and an attribute used for aligning the achievement; the external auxiliary scientific and technological entity attribute and the internal auxiliary scientific and technological entity attribute are auxiliary attributes for assisting the alignment of the internal entity and the external entity of the specific field knowledge base, wherein the external auxiliary scientific and technological entity attribute is mainly based on a third party unique identifier, the external entity is corresponding to the specific field knowledge base entity, the internal auxiliary scientific and technological entity is mainly based on various basic attributes of the specific field knowledge base entity, and the accuracy is improved when the internal entity is aligned to the outside.
Further, the technology entity library to be updated makes a data request to the knowledge graph of the central technology field, and the method comprises the following steps:
data organization is carried out on the data of different scientific and technological entities by means of standard expression, wherein the standard expression means that uniform elements are used for describing different types of entities, and scientific and technological achievements comprise main scientific and technological entity attributes, external auxiliary scientific and technological entity attributes and internal auxiliary scientific and technological entity attributes;
filtering scientific and technological entities expressed by various standards according to different requirements on data safety and data sharing;
editing the acquisition rules of any number of scientific and technological entity filtering result combinations on the basis of the filtered data, and performing set operation on different acquisition rules during editing;
and acquiring data which can be finally used for carrying out data request to the knowledge graph in the central science and technology field, and carrying out data request.
Further, the central technology field knowledge graph returns to the specific technology field knowledge graph according to various parameters in the data request, and the method comprises the following steps:
the method comprises the steps that a knowledge graph in the central science and technology field receives various parameters, for various types of transmitted data, operation libraries such as fusion, matching and the like are executed on the knowledge graph in the central science and technology field according to the category of science and technology entities, the attribute value of a main science and technology entity, the attribute value of an external auxiliary science and technology entity and the attribute value of an internal auxiliary science and technology entity, and corresponding knowledge graph data are returned according to requirements;
the knowledge map in the central science and technology field stores the attribute values of the external auxiliary science and technology entities and the attribute values of the internal auxiliary science and technology entities correspondingly according to the safety requirements so as to support continuous service provision.
Further, the fusion of the knowledge map in the specific science and technology field and the knowledge base in the specific field to update the knowledge base in the specific field includes:
completely matching according to the unique identifier when the data request is carried out, and finding out an entity which is the same as the unique identifier in the knowledge graph of the central science and technology field;
returning the matched entities and attributes in the knowledge graph of the central science and technology field, and fusing the entities and attributes with the knowledge graph of the specific science and technology field;
and after the acquired scientific and technological entities and the acquired relational data are matched and fused, updating the data into a knowledge base in the specific field.
Further, the knowledge graph of the central science and technology field is updated by the following steps:
the specific field knowledge base feeds back the fused result to the knowledge map of the central science and technology field; the content of the feedback comprises a feedback category and feedback details; the feedback type is a type for data feedback, and comprises identification correspondence, error comparison and combination comparison; the feedback details are feedback of detailed data for different feedback categories;
updating the knowledge map of the central science and technology field according to data fed back by the knowledge base of the specific field;
a scientific and technological resource convergence and continuous service device adopting the method comprises a specific-field knowledge base, a scientific and technological entity base to be updated, a central scientific and technological-field knowledge map and a specific-scientific and technological-field knowledge map; extracting a scientific and technological entity according to a specific field knowledge base, and storing the scientific and technological entity to be updated to a scientific and technological entity base to be updated; the technology entity library to be updated makes a data request to the knowledge map of the central technology field; the knowledge graph of the central science and technology field returns to the knowledge graph of the specific science and technology field according to various parameters in the data request; and fusing the knowledge map of the specific scientific and technological field with the knowledge base of the specific field to update the knowledge base of the specific field.
The invention has the following beneficial effects:
1. the invention can effectively reduce the cost of operation and maintenance, resource purchase and the like of the knowledge base or the knowledge service system in specific subject and field directions, and improve the knowledge service efficiency.
2. Through the continuous use of a plurality of knowledge base users with specific disciplines and domain directions, the quality of the knowledge map in the central science and technology domain can be continuously enhanced, and more accurate data and knowledge service can be provided for more users on the premise of being authorized.
3. The invention can reduce the deployment cost of the knowledge base in specific subject and field directions, can popularize the knowledge base to the maximum extent and improve the coverage and the application range of knowledge services in different industries.
Drawings
FIG. 1 is a diagram of a typical scientific domain knowledge map.
FIG. 2 is a flowchart illustrating steps of a method for convergence and continuous service of scientific and technological resources according to the present invention.
FIG. 3 is a diagram of organization of a domain-specific knowledge base into structures that match a subset of the central technology domain knowledge graph structures.
FIG. 4 is a schematic diagram of a process for updating a technology entity library to a technology entity to be synchronized for a data request.
Fig. 5 is an exemplary diagram of an editing rule. The graph (a) and the graph (b) are respectively the fusion effect of two different editing rules, and the graph (c) and the graph (d) respectively correspond to the concrete examples of the graph (a) and the graph (b).
Detailed Description
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, the present invention shall be described in further detail with reference to the following detailed description and accompanying drawings.
The step flow of the scientific and technological resource convergence and continuous service method of the present invention is shown in fig. 2. The method mainly comprises the following steps: 1. extracting scientific and technological entities; 2. a scientific and technical entity data request; 3. returning a knowledge graph in the specific science and technology field; 4. fusing internal and external scientific knowledge; 5. fusing result feedback and registration; 6. and (5) updating the knowledge graph in the scientific and technological field.
1. Scientific and technological entity extraction
And extracting the corresponding scientific and technological entities from the knowledge base in the specific field according to the condition of the knowledge base, and storing the scientific and technological entities into a scientific and technological entity base to be updated.
The science and technology entity library to be updated comprises a science and technology entity category, a science and technology entity attribute and the like. The scientific and technological entity category comprises scientific and technological achievements, research fields, keywords, scientific researchers, scientific projects, scientific institutions, periodicals and the like. Different domain-specific knowledge bases store different scientific and technical entity categories.
And according to the characteristics of the knowledge base in the specific field, carrying out the re-carding and data organization of the knowledge base, carrying out the extraction of the seed scientific and technological entities to be converged and continuously served according to the characteristics of data safety, data sharing and the like, and storing the extracted seed scientific and technological entities in a scientific and technological entity base to be updated.
The step of combing the entity types and the relation existing in the knowledge base in the specific field according to the knowledge base structure in the central technology field and organizing the entity types and the relation into a structure matched with the knowledge base structure subset in the central technology field. The specific steps for extracting the scientific and technological entities comprise:
a. and (5) combing the entity types and the relationships existing in the knowledge base of the specific field by contrasting the knowledge map structure of the central scientific and technological field. The step outputs a list of entities and relationships existing in the knowledge base of the specific field. For example: entity list [ personnel, institution, results (journal articles) ]; relationship list [ employment, personnel production, organization production ].
b. Organizing the output of the last step into a structure matched with the central science and technology field knowledge map structure subset. For example, the structure of the central domain knowledge graph is shown in fig. 1, and the structure of the domain-specific knowledge base output to the science and technology entity base to be updated after organization is shown in fig. 3.
c. Based on the structure, the related entities and relations are extracted from the knowledge base of the specific field by means of R2RML (relational database to RDF markup language) and the like, and are stored in the science and technology entity base to be updated.
2. Scientific and technical entity data request
Based on a scientific and technological entity library to be updated, relatively uniform entity attribute expressions are designed for different scientific and technological entity attributes, and data requests are made to knowledge maps in the central scientific and technological field in a standard data request mode after the scientific and technological entities to be updated are processed.
Table 1 shows the property conditions of the extractable scientific and technological entities designed for different scientific and technological entity categories. The attribute of the main technology entity is an essential input item, a mark used for carrying out unique identification on the technology entity in the knowledge base of the specific field and an attribute used for aligning results; the external auxiliary scientific and technological entity attribute and the internal auxiliary scientific and technological entity attribute are auxiliary attributes for assisting the alignment of the internal entity and the external entity of the specific field knowledge base, wherein the external auxiliary scientific and technological entity attribute is mainly based on a third party unique identifier, the external entity can be better corresponding to the specific field knowledge base entity, the internal auxiliary scientific and technological entity is mainly based on various basic attributes of the specific field knowledge base entity, and the accuracy can be better improved when the internal entity is aligned to the external entity.
TABLE 1 scientific and technological entity categories and attributes
Figure BDA0002649462930000051
Fig. 4 illustrates a process from a science and technology entity library to be updated to a science and technology entity to be synchronized which can be used for data request, and can be defined and adjusted according to actual requirements when being implemented.
a. Firstly, organizing data of different scientific and technological entities by means of standard expression; wherein, the standard expression refers to using uniform elements for the description of different classes of entities. As shown in the above table, for the scientific achievement, the main technical entity attribute, the external auxiliary technical entity attribute and the internal auxiliary technical entity attribute should be included, the main technical entity attribute includes a name (name), a unique Identifier (ID) in a specific field, the external auxiliary technical entity attribute includes other open unique identifiers (PMCID, PMID, DOI, web of Science ID, etc.), and the internal auxiliary technical entity attribute includes an achievement release time (publish-date), an achievement author (authors), etc. The transmission of the data is shown in fig. 4, for example, in a form of JSON or XML, etc., for storage and transmission in a computer.
b. And then, filtering the scientific and technical entities expressed by various standards according to different requirements of data security and data sharing. In filtering, the filtering includes whether the technology entity enters the data request, and which attributes of the technology entity entering the data request can enter the data request.
c. Thirdly, editing the acquisition rules of any number of scientific and technological entity filtering result combinations on the basis of the filtered data. And when editing is carried out, different acquisition rules are subjected to set operation.
Fig. 5 shows different effects of a typical editing rule of results filtered by three scientific and technical entities, where (a) diagram (b) diagram is the fusion effect of two different editing rules, respectively, (c) diagram (d) diagram corresponds to specific examples of (a) diagram (b) diagram (c) diagram, respectively. The editing rules used are:
edit rule 1 # (edit rule 2 # #edit rule 3)
(c) In the figure, three editing rules are:
edit rule 1: and (5) carrying out screening rules through journal papers. The screening rules comprise two papers, wherein the first paper id is j01, the title is Development of High-Yield infection A Virus Vaccine Viruses, PMCID is PMC4569720, PMID is 26334134; the second article id is j02, titled SAFO A Systematic Partnership to Reduce Vitamin A Deficiency in Tanzania, PMID 27198912, DOI 10.1159/000442594.
Editing rule 2: and (4) screening rules by scientific research personnel. The screening rules included 2 researchers, the first of which was p01, named Jihui Ping, PMCAAuthorID 26334134, and the organization was Department of Pathiological Sciences, school of Veteriary Medicine, influenza Research Institute, university of Wisconsin-Madison, madison, wisconsin 53711, USA; the second name is p02, the name is Tiago J S Lopes, the organization is Division of Virology, department of Microbiology and Immunology and International Research Center for Infections Diseases, the Institute of Medical Science, university of Tokyo, tokyo 108-8639, japan.
Editing rule 3: rules for screening by the organization. The screening rule comprises 2 organizations, wherein the first family id is o01 and the name is University of Wisconsin-Madison; the second organization id is o02, named University of Tokyo.
(d) In the figure, three editing rules are:
edit rule 1: and (5) carrying out screening rules through the keywords. The screening rule comprises 3 keywords which are respectively viruses, vacination and H1N1.
Editing rule 2: and (5) carrying out screening rules through science and technology projects. The screening rule includes 1 science and technology item with item id OPPGH5383 and name Bill & Melinda Gates Foundation.
Edit rule 3: and (5) carrying out a screening rule through journal names. The deletion rule includes 1 journal name, the journal id is jn01, and the name is Nature Communication.
d. And acquiring data which can be finally used for carrying out data request to the knowledge graph in the central science and technology field, and carrying out data request. In the data request, the parameters include the introduced scientific and technological entity data, the category of the scientific and technological entity to be updated, the category of the scientific and technological relationship to be updated, the last update time, the latest data time, and the security level, as shown in table 2.
For example, in this scenario, two types of data, namely, periodicals and keywords, are input, and it is intended to obtain entity update information of achievement (J), personnel (P), project (Pr) and journal (Jn), and data of "personnel production achievement" and "personnel undertaking project" according to a union constraint condition, update all data and associations from 1 month and 1 day of 2020, and use a medium security level. Specifically, as shown in table 3, indata represents incoming technology entity data, outputEnt represents a technology entity category to be updated, outputRel represents a technology relationship category to be updated, lastModify represents a last Update time, update represents a latest data time, and securityLevel represents a security level.
TABLE 2 parameters for data request
Incoming parameter names Transferring content
Transmitting scientific and technical entity data Science and technology entity data obtained by editing rule
Category of scientific and technological entities to be updated Technology entity to be updated
Class of scientific relationships to be updated Scientific and technological relationship to be updated
Last update time Last update time of last request
Time of last data To be updated to the latest time
Level of security Security level definition of data-requesting party to data-of-own party
TABLE 3 example parameters for making data requests
Figure BDA0002649462930000071
Figure BDA0002649462930000081
3. Specific technology domain knowledge graph return
And returning the knowledge graph of the specific science and technology field by the knowledge graph of the central science and technology field according to various parameters of the request.
The knowledge graph in the central science and technology field receives various parameters, for various types of transmitted data, operation libraries such as fusion, matching and the like are executed on the knowledge graph in the central science and technology field according to the category of the science and technology entity, the attribute value of the main science and technology entity, the attribute value of the external auxiliary science and technology entity and the attribute value of the internal auxiliary science and technology entity, and corresponding knowledge graph data are returned according to requirements. Meanwhile, the attribute values of the external auxiliary scientific and technological entity and the attribute values of the internal auxiliary scientific and technological entity are correspondingly stored according to the safety requirements so as to support continuous service provision. The categories of the returned data are shown in table 4.
TABLE 4 Return data Categories
Figure BDA0002649462930000082
Examples of returns are shown in table 5. This data returns a set of nodes (nodes) and edges (links). Wherein the nodes return data instances of journal articles (j), persons (p), sponsorship (pr), and journals (jn); and the relation examples of the personnel output result (j-p), the personnel subsidized (p-pr) and the like are returned.
TABLE 5 Return data example
Figure BDA0002649462930000091
Figure BDA0002649462930000101
4. Internal and external scientific knowledge fusion
And fusing the knowledge base in the specific field and the returned knowledge graph in the specific science and technology field.
And matching, fusing and updating the unique identifiers according to the returned specific scientific and technological field knowledge graph and the specific field knowledge base and according to various unique identifiers in the request, and simultaneously recording the unique identifiers of the central scientific and technological field knowledge graph in the local specific field knowledge base.
The specific method for matching, fusing and updating according to various unique identifiers in the request comprises the following steps:
1) First, a unique identifier is determined, with reference to the attributes of the scientific and technological entities in table 1, for example, for scientific and technological achievements, PMCID, PMID, DOI are three different types of unique identifiers, but when a data request is made, only one type of unique identifier of PMCID may exist in a specific domain knowledge base, and then complete matching can be performed with the one type of unique identifier of PMCID, so as to find out an entity that is the same as the PMCID in the central scientific and technological domain knowledge graph.
2) And returning the matched entities and attributes in the knowledge graph of the central science and technology field, and fusing the entities and the knowledge graph of the specific science and technology field. One relatively common blending rule is: and when the same attribute category exists but the entity attribute value in the specific domain knowledge graph and the returned entity attribute value are not empty, fusing by taking the specific domain knowledge graph data as reference. Other fusion rules exist, for example, when the same attribute category exists but neither the entity attribute value in the domain-specific knowledge graph nor the returned entity attribute value is empty, fusion is performed based on the latest data.
3) And after the acquired scientific and technological entities and the acquired relational data are matched and fused, updating the data into a knowledge base in the specific field.
Through the processes, the specific domain knowledge base is updated once.
Further, in order to maintain efficiency and make contributions to the required science and technology entities, relationships and the like to the knowledge graph of the central science and technology field, the knowledge graph of the central science and technology field can be updated through the steps 5 and 6.
5. Fused result feedback and enrollment
And the specific field knowledge base feeds back the result according to the fused effect to the knowledge map of the central science and technology field. The contents of the feedback are shown in table 6.
TABLE 6 content of feedback
Feedback content Description of the invention
Feedback categories The category for data feedback includes identification correspondence, comparison error, comparison combination and the like
Details of the feedback For different feedback categories, feedback of detailed data is performed.
The identification correspondence refers to various unique identifiers, such as the correspondence of the PMCID, mainly whether the correspondence exists or not and how the correspondence exists; the contrast error means that the data of the knowledge graph in the central science and technology field is wrong during fusion; the contrast combination means that the central science and technology domain knowledge map data and the existing specific domain knowledge base data can be combined during fusion.
6. Central science and technology domain knowledge map updating
And updating the knowledge graph of the central science and technology field according to the feedback data, and adding the identification of the service object.
The step of updating the knowledge graph in the central science and technology field comprises the following steps:
1) If the unique identifier is corresponding, if PMCIDA exists in the specific domain knowledge base of the specific domain, PMCID exists in the knowledge map of the central science and technology domain, and the feedback details are that the PMCID and the PMCIDA can correspond, the PMCID in the knowledge map of the central science and technology domain is recorded, and the PMCID in the knowledge map of the specific domain is in one-to-one correspondence with the PMCIDA of the knowledge base of the specific domain.
2) And if the comparison is wrong, correcting the entity and relationship attribute of the central science and technology field knowledge graph with the wrong comparison according to the feedback details of the comparison.
3) And if the comparison and combination are carried out, correcting the entity and the relationship attribute of the central science and technology field knowledge graph with error comparison according to the comparison and combination feedback details.
The method for adding the identifier comprises the following steps: and adding a specific domain knowledge base identifier in the knowledge graph of the central scientific and technological domain. For example: adding a 'CNIC _ NI _ XXX' identifier for a network information knowledge base of a computer network information center of Chinese academy of sciences in entities and relations returned by a knowledge graph in the central science and technology field. Wherein the prefix "CNIC _ NI" uniquely identifies the network information repository of the computer network information center of the academy of sciences of china, and "XXX" is a unique identifier in a specific domain that matches table 1.
Based on the same inventive concept, another embodiment of the present invention provides a scientific and technological resource convergence and continuous service device using the above method, including a specific domain knowledge base, a scientific and technological entity base to be updated, a central scientific and technological domain knowledge graph, and a specific scientific and technological domain knowledge graph; extracting a scientific and technological entity according to the specific field knowledge base and storing the scientific and technological entity to a to-be-updated scientific and technological entity base; the technology entity library to be updated makes a data request to the knowledge map of the central technology field; the knowledge graph of the central science and technology field returns to the knowledge graph of the specific science and technology field according to various parameters in the data request; and fusing the knowledge map in the specific science and technology field with the knowledge base in the specific field to update the knowledge base in the specific field.
Based on the same inventive concept, another embodiment of the present invention provides an electronic device (computer, server, smartphone, etc.) comprising a memory storing a computer program configured to be executed by the processor and a processor, the computer program comprising instructions for performing the steps of the inventive method.
Based on the same inventive concept, another embodiment of the present invention provides a computer-readable storage medium (e.g., ROM/RAM, magnetic disk, optical disk) storing a computer program, which when executed by a computer, performs the steps of the inventive method.
The foregoing disclosure of the specific embodiments of the present invention and the accompanying drawings is intended to assist in understanding the contents of the invention and to enable its practice, and it will be understood by those skilled in the art that various alternatives, modifications and variations may be possible without departing from the spirit and scope of the invention. The present invention should not be limited to the disclosure of the embodiments and drawings in the specification, and the scope of the present invention is defined by the scope of the claims.

Claims (9)

1. A scientific and technological resource convergence and continuous service method is characterized by comprising the following steps:
extracting a scientific and technological entity according to the specific field knowledge base and storing the scientific and technological entity to a to-be-updated scientific and technological entity base;
the technology entity library to be updated makes a data request to the knowledge map of the central technology field;
the central scientific and technological field knowledge graph returns to the specific scientific and technological field knowledge graph according to various parameters in the data request;
fusing the knowledge map of the specific scientific and technological field with the knowledge base of the specific field to update the knowledge base of the specific field;
the extracting technology entity according to the specific domain knowledge base and storing the technology entity to be updated to the technology entity base comprises the following steps:
the entity types and the relationships existing in the knowledge base of the specific field are sorted by contrasting the knowledge map structure of the central scientific and technological field, and an entity and relationship list existing in the knowledge base of the specific field is output;
organizing entities and relationship lists existing in a specific field knowledge base into a structure matched with the central science and technology field knowledge map structure subset;
and taking the matched structure as a standard, extracting related entities and relations from the knowledge base of the specific field, and storing the related entities and relations into a science and technology entity base to be updated.
2. The method according to claim 1, wherein the scientific and technological entity library to be updated comprises scientific and technological entity categories and scientific and technological entity attributes; the scientific and technological entity category comprises scientific and technological achievements, research fields, keywords, scientific researchers, scientific projects, scientific research institutions and periodicals; the scientific and technological entity attributes comprise main scientific and technological entity attributes, external auxiliary scientific and technological entity attributes and internal auxiliary scientific and technological entity attributes; the attribute of the main technical entity is an essential input item, a mark used for carrying out unique identification on the technical entity in the specific field knowledge base and an attribute used for aligning the achievement; the external auxiliary scientific and technological entity attribute and the internal auxiliary scientific and technological entity attribute are auxiliary attributes for assisting the alignment of the internal entity and the external entity of the specific field knowledge base, wherein the external auxiliary scientific and technological entity attribute is mainly based on a third party unique identifier, the external entity is corresponding to the specific field knowledge base entity, the internal auxiliary scientific and technological entity is mainly based on various basic attributes of the specific field knowledge base entity, and the accuracy is improved when the internal entity is aligned to the outside.
3. The method according to claim 1, wherein the technology entity library to be updated makes a data request to the central technology domain knowledge graph, and the method comprises:
data organization is carried out on the data of different scientific and technological entities by means of standard expression, wherein the standard expression means that uniform elements are used for describing different types of entities, and scientific and technological achievements comprise main scientific and technological entity attributes, external auxiliary scientific and technological entity attributes and internal auxiliary scientific and technological entity attributes;
filtering the scientific and technological entities expressed by various standards according to different requirements of data security and data sharing;
editing the acquisition rules of any number of scientific and technical entity filtering result combinations on the basis of the filtered data, and performing set operation on different acquisition rules during editing;
and acquiring data which can be finally used for carrying out data request to the knowledge graph in the central science and technology field, and carrying out data request.
4. The method of claim 1, wherein the returning of the central technology domain knowledge graph to the technology domain-specific knowledge graph according to the various types of parameters in the data request comprises:
the method comprises the steps that a knowledge graph in the central science and technology field receives various parameters, for various types of transmitted data, fusion and matching operation libraries are executed on the knowledge graph in the central science and technology field according to the science and technology entity type, the main science and technology entity attribute value, the external auxiliary science and technology entity attribute value and the internal auxiliary science and technology entity attribute value, and corresponding knowledge graph data are returned according to requirements;
the knowledge map in the central science and technology field stores the attribute values of the external auxiliary science and technology entities and the attribute values of the internal auxiliary science and technology entities correspondingly according to the safety requirements so as to support continuous service provision.
5. The method according to claim 1, wherein the fusing the specific technology domain knowledge graph with the specific domain knowledge base to update the specific domain knowledge base comprises:
completely matching according to the unique identifier when the data request is carried out, and finding out an entity which is the same as the unique identifier in the knowledge graph of the central science and technology field;
returning the matched entities and attributes in the knowledge graph of the central science and technology field, and fusing the entities and the knowledge graph of the specific science and technology field;
and after the acquired scientific and technological entities and the acquired relational data are matched and fused, updating the data into a knowledge base in the specific field.
6. The method according to any one of claims 1 to 5, wherein the central technology domain knowledge graph is updated by the following steps:
the specific domain knowledge base feeds back the fused result to a knowledge map of the central scientific and technological domain; the content of the feedback comprises a feedback category and feedback details; the feedback type is a type for data feedback, and comprises identification correspondence, error comparison and combination comparison; the feedback details are feedback of detailed data for different feedback categories;
and updating the knowledge graph of the central science and technology field according to the data fed back by the knowledge base of the specific field.
7. A scientific and technological resource convergence and continuous service device adopting the method of any one of claims 1 to 6, which is characterized by comprising a specific-field knowledge base, a scientific and technological entity base to be updated, a central scientific and technological field knowledge map and a specific scientific and technological field knowledge map; extracting a scientific and technological entity according to the specific field knowledge base and storing the scientific and technological entity to a to-be-updated scientific and technological entity base; the technology entity library to be updated makes a data request to the knowledge map of the central technology field; the knowledge graph of the central science and technology field returns to the knowledge graph of the specific science and technology field according to various parameters in the data request; and fusing the knowledge map in the specific science and technology field with the knowledge base in the specific field to update the knowledge base in the specific field.
8. An electronic apparatus, comprising a memory and a processor, the memory storing a computer program configured to be executed by the processor, the computer program comprising instructions for performing the method of any of claims 1 to 6.
9. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program which, when executed by a computer, implements the method of any one of claims 1 to 6.
CN202010865075.1A 2020-08-25 2020-08-25 Scientific and technological resource convergence and continuous service method and device Active CN112163094B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010865075.1A CN112163094B (en) 2020-08-25 2020-08-25 Scientific and technological resource convergence and continuous service method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010865075.1A CN112163094B (en) 2020-08-25 2020-08-25 Scientific and technological resource convergence and continuous service method and device

Publications (2)

Publication Number Publication Date
CN112163094A CN112163094A (en) 2021-01-01
CN112163094B true CN112163094B (en) 2022-10-14

Family

ID=73860186

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010865075.1A Active CN112163094B (en) 2020-08-25 2020-08-25 Scientific and technological resource convergence and continuous service method and device

Country Status (1)

Country Link
CN (1) CN112163094B (en)

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106649394A (en) * 2015-11-03 2017-05-10 中兴通讯股份有限公司 Fusion knowledge base processing method and device and knowledge base management system
CN106021281A (en) * 2016-04-29 2016-10-12 京东方科技集团股份有限公司 Method for establishing medical knowledge graph, device for same and query method for same
CN106250412B (en) * 2016-07-22 2019-04-23 浙江大学 Knowledge mapping construction method based on the fusion of multi-source entity
CN109299090B (en) * 2018-09-03 2023-05-30 平安科技(深圳)有限公司 Foundation centrality calculating method, system, computer equipment and storage medium
CN110781249A (en) * 2019-10-16 2020-02-11 华电国际电力股份有限公司技术服务分公司 Knowledge graph-based multi-source data fusion method and device for thermal power plant

Also Published As

Publication number Publication date
CN112163094A (en) 2021-01-01

Similar Documents

Publication Publication Date Title
CN109845221B (en) Access control policy synchronization for service layer
US11567997B2 (en) Query language interoperabtility in a graph database
US11531682B2 (en) Federated search of multiple sources with conflict resolution
US10977380B2 (en) Hybrid role and attribute based access control system
US7823123B2 (en) Semantic system for integrating software components
US7774368B2 (en) Contact management update protocols
CN101873335B (en) Distributed type searching method of cross-domain semantic Web service
US8725681B1 (en) Synthesized identifiers for system information database
US20060242181A1 (en) Apparatus and method for transforming XBRL data into database schema
CN111026874A (en) Data processing method and server of knowledge graph
CN108183931A (en) A kind of distribution subscription matching process based on demand management tree shape model
US11100098B2 (en) Systems and methods for providing multilingual support for data used with a business intelligence server
US9830385B2 (en) Methods and apparatus for partitioning data
CN111368097B (en) Knowledge graph extraction method and device
US20080133533A1 (en) Migrating Credentials to Unified Identity Management Systems
CN104091228A (en) Systems for resource management, resource registering, resource inquiry and resource semantic corpus management of internet of things
US10489419B1 (en) Data modeling translation system
CN112163094B (en) Scientific and technological resource convergence and continuous service method and device
CN105653576A (en) Information searching method and apparatus, manual position service method and system
TWI385543B (en) Data Synchronization System and Method for Establishing Mediation Data in Directory Service Format
US20040143649A1 (en) Multiple dynamic view enabled web services
CN110781213A (en) Multi-source mass data correlation searching method and system with personnel as center
US20090077021A1 (en) System for Managing SameAs Relationships Between Ontology Instances and Method for the Same
US20080263034A1 (en) Method and apparatus for querying between software objects
US20150120776A1 (en) Virtual directory server to process directory requests when information on an object is split across multiple data sources

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant