CN113204651A - Multi-source knowledge graph fusion method and device in Chinese education field - Google Patents

Info

Publication number
CN113204651A
Authority
CN
China
Prior art keywords
entity
chinese
teaching
knowledge
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110594900.3A
Other languages
Chinese (zh)
Inventor
王华珍
赵荐轩
廖永新
吕挺
汪晓凤
喻小光
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huaqiao University
Original Assignee
Huaqiao University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huaqiao University
Priority to CN202110594900.3A
Publication of CN113204651A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36 Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367 Ontology
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/25 Fusion techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10 Services
    • G06Q50/20 Education
    • G06Q50/205 Education administration or guidance

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Tourism & Hospitality (AREA)
  • Strategic Management (AREA)
  • Educational Technology (AREA)
  • Educational Administration (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Biology (AREA)
  • Human Resources & Organizations (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • General Business, Economics & Management (AREA)
  • Animal Behavior & Ethology (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a multi-source knowledge graph fusion method and device in the field of Chinese education. The method first constructs Chinese character theory triple information, Chinese teaching triple information, and Chinese teaching material triple information from their respective knowledge sources. It then screens the co-occurring entity nodes that serve a teaching purpose across the multi-source graphs, and merges the synonymous nodes together with their relationships to complete the fusion of the multi-source graphs. The invention addresses the knowledge fragmentation present in the Chinese education field: the fused knowledge graph associates and integrates knowledge that serves a teaching purpose, improves the coverage and practical value of knowledge graphs in the Chinese education field, and facilitates subsequent work on the knowledge graph such as data mining, personalized recommendation, and intelligent question answering.

Description

Multi-source knowledge graph fusion method and device in Chinese education field
Technical Field
The invention relates to the field of knowledge graphs, and in particular to a multi-source knowledge graph fusion method and device in the field of Chinese education.
Background
Chinese education is Chinese-language teaching and language-and-culture heritage education aimed at the children of overseas Chinese. It differs from international Chinese education in that it places equal weight on Chinese language teaching and on the inheritance of Chinese culture. Teenagers of Chinese descent are the children of overseas Chinese; they are generally exposed to Chinese language and culture through their parents and relatives and therefore have some Chinese-language background. Chinese education thus sits between first-language and second-language teaching, combines cultural inheritance with second-language teaching, and has accumulated a body of high-quality teaching resources. These teaching resources are complex in type (heterogeneous) and often originate from different teaching organizations or institutions (multi-source). Multi-source heterogeneous data are fragmented and weakly interrelated, which easily produces "information islands" and prevents data resources from being converted into knowledge, so that their value cannot be fully exploited. Constructing a multi-source knowledge graph is an important way to understand and analyze such data and to distill useful knowledge from it.
The concept of the knowledge graph was proposed by Google in 2012 and successfully applied to search engines. A knowledge graph describes, in structured form, the concepts and entities of the objective world and the relationships among them. It offers strong semantic processing and structured data organization capabilities, and provides a foundation for tasks that require large amounts of supporting knowledge data, such as data mining, personalized recommendation, and intelligent question answering. However, traditional databases and analysis and mining tools are of little use for constructing knowledge graphs from heterogeneous data and for fusing multi-source knowledge graphs.
Disclosure of Invention
The present application addresses the difficulty of constructing knowledge graphs from heterogeneous data and of fusing multi-source knowledge graphs. An embodiment of the present application aims to provide a multi-source knowledge graph fusion method and device in the field of Chinese education, so as to solve the technical problems mentioned in the Background section above.
In a first aspect, an embodiment of the present application provides a multi-source knowledge graph fusion method in the field of Chinese education, including the following steps:
S1, constructing, from Chinese character theory knowledge, Chinese character theory triple information $G_c$ formed by a first entity set $E_c$ and a first relationship set $R_c$, in the form $\langle E_i, R_j, E_k \rangle$, where $E_i, E_k \in E_c$ and $R_j \in R_c$;
S2, constructing, from Chinese teaching knowledge, Chinese teaching triple information $G_t$ formed by a second entity set $E_t$ and a second relationship set $R_t$, in the form $\langle H_i, L_j, H_k \rangle$, where $H_i, H_k \in E_t$ and $L_j \in R_t$;
S3, constructing, from Chinese teaching material knowledge, Chinese teaching material triple information $G_b$ formed by a third entity set $E_b$ and a third relationship set $R_b$, in the form $\langle M_i, N_j, M_k \rangle$, where $M_i, M_k \in E_b$ and $N_j \in R_b$; at least one identical entity node exists between every two of the first entity set $E_c$, the second entity set $E_t$, and the third entity set $E_b$, and these shared nodes form a co-occurrence entity set; and
S4, screening the co-occurrence entity set from the Chinese character theory triple information $G_c$, the Chinese teaching triple information $G_t$, and the Chinese teaching material triple information $G_b$, and fusing $G_c$, $G_t$, and $G_b$ according to the co-occurrence entity set to obtain the knowledge graph of the Chinese education field.
In some embodiments, the first entity set $E_c$ includes Chinese character structure entities, Chinese character radical entities, Chinese character pinyin entities, Chinese character open knowledge entities, and Chinese character entities; the first entity set $E_c$ and the first relationship set $R_c$ are collected by crowdsourcing and built in a JSON string format.
In some embodiments, the second entity set $E_t$ includes teaching method entities, teaching skill entities, word entities, idiom entities, and Chinese character entities; within the second entity set $E_t$ and the second relationship set $R_t$, the teaching method entities and teaching skill entities are exported in an xmind file format, and the relationship set linking the word, idiom, and Chinese character entities to the teaching method and teaching skill entities is collected by crowdsourcing and built in an Excel table format.
In some embodiments, the third entity set $E_b$ includes teaching material entities, teaching material book number entities, teaching material course number entities, exercise entities, word entities, idiom entities, and Chinese character entities; to build the third entity set $E_b$ and the third relationship set $R_b$, the selected Chinese teaching material is divided at the granularity of book number and course number, the word, idiom, and Chinese character entities in it are labeled with the course number as the finest granularity, and the content examined by each exercise is associated with the word, idiom, and Chinese character entities, all in an Excel table format.
In some embodiments, the co-occurrence entity set includes word entities, idiom entities, and Chinese character entities.
In some embodiments, step S4 specifically includes traversing the co-occurrence entity set, completing the merging of synonymous nodes by matching them, and finally integrating the merged nodes with the triple information $G_c$, $G_t$, or $G_b$ from which they originate to obtain the knowledge graph of the Chinese education field.
In some embodiments, synonymous nodes are nodes that come from two of the Chinese character theory triple information $G_c$, the Chinese teaching triple information $G_t$, and the Chinese teaching material triple information $G_b$ and that have the same name.
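The screening and merging conditions of the embodiments above can be read set-theoretically. The following is only one possible formalization: the type function $\tau$, the type set $T$, and the name-identification relation $\sim$ are notation assumed here for illustration and do not appear in the source.

```latex
% T: entity types that serve a teaching purpose (assumed notation)
T = \{\text{Chinese character},\ \text{word},\ \text{idiom}\}

% E_co: teaching-purpose entities present in at least two of the three entity sets
E_{co} = \bigl\{\, e \;\bigm|\; \tau(e) \in T,\ \bigl|\{\, x \in \{c, t, b\} : e \in E_x \,\}\bigr| \ge 2 \,\bigr\}

% Fused graph: union of the three triple sets, with same-name nodes of E_co identified
G_f = (G_c \cup G_t \cup G_b) \,/\, \sim, \qquad
e \sim e' \iff e, e' \in E_{co} \text{ and } \operatorname{name}(e) = \operatorname{name}(e')
```

Under this reading, an entity that appears in only one source, or whose type lies outside $T$, is carried into the fused graph unchanged and is never merged.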
In a second aspect, an embodiment of the present application provides a multi-source knowledge graph fusion device in the field of Chinese education, including:
a Chinese character theory triple information module configured to construct, from Chinese character theory knowledge, Chinese character theory triple information $G_c$ formed by a first entity set $E_c$ and a first relationship set $R_c$, in the form $\langle E_i, R_j, E_k \rangle$, where $E_i, E_k \in E_c$ and $R_j \in R_c$;
a Chinese teaching triple information module configured to construct, from Chinese teaching knowledge, Chinese teaching triple information $G_t$ formed by a second entity set $E_t$ and a second relationship set $R_t$, in the form $\langle H_i, L_j, H_k \rangle$, where $H_i, H_k \in E_t$ and $L_j \in R_t$;
a Chinese teaching material triple information module configured to construct, from Chinese teaching material knowledge, Chinese teaching material triple information $G_b$ formed by a third entity set $E_b$ and a third relationship set $R_b$, in the form $\langle M_i, N_j, M_k \rangle$, where $M_i, M_k \in E_b$ and $N_j \in R_b$; at least one identical entity node exists between every two of the first entity set $E_c$, the second entity set $E_t$, and the third entity set $E_b$, and these shared nodes form a co-occurrence entity set; and
a knowledge graph fusion module configured to screen the co-occurrence entity set from the Chinese character theory triple information $G_c$, the Chinese teaching triple information $G_t$, and the Chinese teaching material triple information $G_b$, and to fuse $G_c$, $G_t$, and $G_b$ according to the co-occurrence entity set to obtain the knowledge graph of the Chinese education field.
In a third aspect, an embodiment of the present disclosure provides an electronic device, including one or more processors; storage means for storing one or more programs which, when executed by one or more processors, cause the one or more processors to carry out a method as described in any one of the implementations of the first aspect.
In a fourth aspect, the disclosed embodiments provide a computer-readable storage medium on which a computer program is stored, which, when executed by a processor, implements the method as described in any implementation manner of the first aspect.
The invention discloses a multi-source knowledge graph fusion method and device in the field of Chinese education. The method first constructs Chinese character theory triple information, Chinese teaching triple information, and Chinese teaching material triple information from their respective knowledge sources. It then screens the co-occurring entity nodes that serve a teaching purpose across the multi-source graphs, and merges the synonymous nodes together with their relationships to complete the fusion of the multi-source graphs. The invention addresses the knowledge fragmentation present in the Chinese education field: the fused knowledge graph associates and integrates knowledge that serves a teaching purpose, improves the coverage and practical value of knowledge graphs in the Chinese education field, and facilitates subsequent work on the knowledge graph such as data mining, personalized recommendation, and intelligent question answering.
Drawings
In order to illustrate the technical solutions in the embodiments of the present invention more clearly, the drawings used in the description of the embodiments are briefly introduced below. The drawings described below show only some embodiments of the present invention; those skilled in the art can obtain other drawings from them without creative effort.
FIG. 1 is an exemplary device architecture diagram to which an embodiment of the present application may be applied;
FIG. 2 is a flow chart of a multi-source knowledge graph fusion method in the field of Chinese education according to an embodiment of the present invention;
FIG. 3 is a conceptual layer design diagram of the Chinese character theory triple information $G_c$ according to an embodiment of the present invention;
FIG. 4 is a schematic diagram of the metadata JSON string of the Chinese character theory triple information $G_c$ according to an embodiment of the present invention;
FIG. 5 is a (partial) visualization of the Chinese character theory triple information $G_c$ according to an embodiment of the present invention;
FIG. 6 is a conceptual layer design diagram of the Chinese teaching triple information $G_t$ according to an embodiment of the present invention;
FIG. 7 is a (partial) knowledge map of Chinese teaching skills according to an embodiment of the present invention;
FIG. 8 is a (partial) visualization of the Chinese teaching triple information $G_t$ according to an embodiment of the present invention;
FIG. 9 is a conceptual layer design diagram of the Chinese teaching material triple information $G_b$ according to an embodiment of the present invention;
FIG. 10 is a (partial) visualization of the Chinese teaching material triple information $G_b$ according to an embodiment of the present invention;
FIG. 11 is a (partial) visualization of the fused knowledge graph $G_f$ according to an embodiment of the present invention;
FIG. 12 is a schematic diagram of a multi-source knowledge graph fusion device in the field of Chinese education according to an embodiment of the present invention;
FIG. 13 is a schematic structural diagram of a computer device suitable for implementing an electronic device according to an embodiment of the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention clearer, the present invention will be described in further detail with reference to the accompanying drawings, and it is apparent that the described embodiments are only a part of the embodiments of the present invention, not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Fig. 1 illustrates an exemplary device architecture 100 to which the multi-source knowledge-graph fusion method in the chinese education field or the multi-source knowledge-graph fusion device in the chinese education field of the embodiments of the present application may be applied.
As shown in fig. 1, the apparatus architecture 100 may include terminal devices 101, 102, 103, a network 104, and a server 105. The network 104 serves as a medium for providing communication links between the terminal devices 101, 102, 103 and the server 105. Network 104 may include various connection types, such as wired, wireless communication links, or fiber optic cables, to name a few.
The user may use the terminal devices 101, 102, 103 to interact with the server 105 via the network 104 to receive or send messages or the like. Various applications, such as data processing type applications, file processing type applications, etc., may be installed on the terminal apparatuses 101, 102, 103.
The terminal devices 101, 102, and 103 may be hardware or software. When the terminal devices 101, 102, 103 are hardware, they may be various electronic devices including, but not limited to, smart phones, tablet computers, laptop computers, desktop computers, and the like. When the terminal devices 101, 102, 103 are software, they can be installed in the electronic devices listed above, and may be implemented as multiple pieces of software or software modules (e.g., software or software modules used to provide distributed services) or as a single piece of software or software module. No particular limitation is imposed here.
The server 105 may be a server that provides various services, such as a background data processing server that processes files or data uploaded by the terminal devices 101, 102, 103. The background data processing server can process the acquired file or data to generate a processing result.
It should be noted that the multi-source knowledge graph fusion method in the chinese education field provided in the embodiment of the present application may be executed by the server 105, or may be executed by the terminal devices 101, 102, and 103, and accordingly, the multi-source knowledge graph fusion device in the chinese education field may be disposed in the server 105, or may be disposed in the terminal devices 101, 102, and 103.
It should be understood that the number of terminal devices, networks, and servers in fig. 1 is merely illustrative. There may be any number of terminal devices, networks, and servers, as desired for implementation. In the case where the processed data does not need to be acquired from a remote location, the above device architecture may not include a network, but only a server or a terminal device.
FIG. 2 illustrates a multi-source knowledge graph fusion method in the field of Chinese education, provided by an embodiment of the present application, including the following steps:
S1, constructing, from Chinese character theory knowledge, Chinese character theory triple information $G_c$ formed by a first entity set $E_c$ and a first relationship set $R_c$, in the form $\langle E_i, R_j, E_k \rangle$, where $E_i, E_k \in E_c$ and $R_j \in R_c$;
S2, constructing, from Chinese teaching knowledge, Chinese teaching triple information $G_t$ formed by a second entity set $E_t$ and a second relationship set $R_t$, in the form $\langle H_i, L_j, H_k \rangle$, where $H_i, H_k \in E_t$ and $L_j \in R_t$;
S3, constructing, from Chinese teaching material knowledge, Chinese teaching material triple information $G_b$ formed by a third entity set $E_b$ and a third relationship set $R_b$, in the form $\langle M_i, N_j, M_k \rangle$, where $M_i, M_k \in E_b$ and $N_j \in R_b$; at least one identical entity node exists between every two of the first entity set $E_c$, the second entity set $E_t$, and the third entity set $E_b$, and these shared nodes form a co-occurrence entity set; and
S4, screening the co-occurrence entity set from the Chinese character theory triple information $G_c$, the Chinese teaching triple information $G_t$, and the Chinese teaching material triple information $G_b$, and fusing $G_c$, $G_t$, and $G_b$ according to the co-occurrence entity set to obtain the knowledge graph of the Chinese education field.
Step 1: Construction of the Chinese character theory triple information $G_c$
The first entity set $E_c$ defined by the invention and the descriptions of its entity categories are shown in Table 1:
TABLE 1 First entity set
[Table 1 is reproduced as images in the source: Figure BDA0003090757730000061, Figure BDA0003090757730000071]
The first relationship set $R_c$ defined by the invention and the descriptions of its relationship categories are shown in Table 2:
TABLE 2 First relationship set
[Table 2 is reproduced as an image in the source: Figure BDA0003090757730000072]
The metadata for the Chinese character theory triple information $G_c$ is collected mainly by hand through crowdsourcing and submitted as JSON strings in a specified format, which are used to generate the first entity set $E_c$ and the first relationship set $R_c$; the JSON string format is shown in Table 3 below. The generated data are stored in a MySQL database to form the Chinese character theory triple information $G_c$.
TABLE 3 JSON string format of the Chinese character theory triple information metadata
Field      Type    Description
name       String  Chinese character
structure  String  Chinese character structure
pinyin     String  Chinese character pinyin
bushou     String  Chinese character radical
knowledge  Array   Array of Chinese character open knowledge strings
FIG. 3 shows the conceptual layer design of the Chinese character theory triple information $G_c$, and FIG. 4 shows a schematic of its metadata JSON string; the resulting (partial) visualization of $G_c$ is shown in FIG. 5.
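To make the JSON-to-triple step concrete, the following minimal Python sketch turns one metadata record with the Table 3 fields into triples. It is an illustration under stated assumptions, not the patent's implementation: the relation names (`has_structure`, `has_pinyin`, `has_radical`, `has_open_knowledge`) are hypothetical because Table 2 is only available as an image, the example field values are made up, and the MySQL storage step is omitted.

```python
import json

# Hypothetical relation names: Table 2 of the source is only available as an
# image, so these labels are assumptions made for illustration.
FIELD_TO_RELATION = {
    "structure": ("has_structure", "structure"),
    "pinyin": ("has_pinyin", "pinyin"),
    "bushou": ("has_radical", "radical"),
}


def record_to_triples(json_string):
    """Turn one metadata JSON string (Table 3 fields) into (head, relation, tail) triples.

    Each node is an (entity_type, name) pair, so a radical and a character
    written with the same glyph remain distinct nodes.
    """
    record = json.loads(json_string)
    head = ("character", record["name"])
    triples = []
    for field, (relation, tail_type) in FIELD_TO_RELATION.items():
        value = record.get(field)
        if value:
            triples.append((head, relation, (tail_type, value)))
    # "knowledge" is an array of open-knowledge strings: one triple per item.
    for item in record.get("knowledge", []):
        triples.append((head, "has_open_knowledge", ("open_knowledge", item)))
    return triples


# Example record; the knowledge string is invented for demonstration.
example = json.dumps(
    {
        "name": "新",
        "structure": "left-right",
        "pinyin": "xīn",
        "bushou": "斤",
        "knowledge": ["Often used in words about the new year."],
    },
    ensure_ascii=False,
)

g_c = record_to_triples(example)
```

Representing each node as a type-and-name pair is a design choice of this sketch; it keeps the entity category available for the later screening step.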
Step 2: Construction of the Chinese teaching triple information $G_t$
The second entity set $E_t$ defined by the invention and the descriptions of its entity categories are shown in Table 4:
TABLE 4 Second entity set
[Table 4 is reproduced as images in the source: Figure BDA0003090757730000073, Figure BDA0003090757730000081]
The second relationship set $R_t$ defined by the invention and the descriptions of its relationship categories are shown in Table 5:
TABLE 5 Second relationship set
Start node         End node         Relationship description
Chinese character  Teaching method  The Chinese character is taught using this teaching method.
Chinese character  Teaching skill   The Chinese character is taught using this teaching skill.
Word               Teaching method  The word is taught using this teaching method.
Word               Teaching skill   The word is taught using this teaching skill.
Idiom              Teaching method  The idiom is taught using this teaching method.
Idiom              Teaching skill   The idiom is taught using this teaching skill.
The metadata for the Chinese teaching triple information $G_t$ is produced in two steps. In the first step, professionals in Chinese education prepare a metadata set of teaching method entities and teaching skill entities; the set is exported in the xmind file format, parsed by a program, and stored in a MySQL database, with information such as the concept, representative figures, and time of emergence of each teaching method entity and teaching skill entity stored as entity attributes. In the second step, the relationship metadata set linking the three types of entities (Chinese character entities, word entities, and idiom entities) to the teaching method entities and teaching skill entities is collected by crowdsourcing, submitted in Excel table format, parsed by a program, and stored in the MySQL database, finally yielding the Chinese teaching triple information $G_t$.
FIG. 6 shows the conceptual layer design of the Chinese teaching triple information $G_t$, and FIG. 7 shows a (partial) knowledge map of Chinese teaching skills; the resulting (partial) visualization of $G_t$ is shown in FIG. 8.
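The second production step, turning crowdsourced relationship rows into triples, might look like the sketch below. Several things are assumptions here: CSV text stands in for the Excel sheet so that the example needs only the standard library, the column names and relation labels are hypothetical, and the teaching method and skill names are invented examples.

```python
import csv
import io

# Stand-in for the crowdsourced Excel sheet: CSV text with hypothetical column
# names. The teaching method and skill names are invented examples.
SHEET = """entity_type,entity_name,target_type,target_name
character,新,teaching_method,component analysis
word,新年,teaching_skill,picture prompts
idiom,温故知新,teaching_method,story telling
"""

# Hypothetical relation labels covering the start/end combinations in Table 5.
RELATION_BY_TARGET = {
    "teaching_method": "uses_teaching_method",
    "teaching_skill": "uses_teaching_skill",
}
TEACHABLE_TYPES = {"character", "word", "idiom"}


def rows_to_triples(sheet_text):
    """Convert relationship rows into (head, relation, tail) triples.

    Rows whose start node is not a character, word, or idiom, or whose end
    node is not a teaching method or skill, are skipped.
    """
    triples = []
    for row in csv.DictReader(io.StringIO(sheet_text)):
        if row["entity_type"] not in TEACHABLE_TYPES:
            continue
        relation = RELATION_BY_TARGET.get(row["target_type"])
        if relation is None:
            continue
        head = (row["entity_type"], row["entity_name"])
        tail = (row["target_type"], row["target_name"])
        triples.append((head, relation, tail))
    return triples


g_t = rows_to_triples(SHEET)
```

Filtering on the start and end node types mirrors the six combinations listed in Table 5; anything else in a submitted sheet is treated as out of scope in this sketch.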
Step 3: Construction of the Chinese teaching material triple information $G_b$
The third entity set $E_b$ defined by the invention and the descriptions of its entity categories are shown in Table 6:
TABLE 6 Third entity set
[Table 6 is reproduced as images in the source: Figure BDA0003090757730000082, Figure BDA0003090757730000091]
The third relationship set $R_b$ defined by the invention and the descriptions of its relationship categories are shown in Table 7:
TABLE 7 Third relationship set
[Table 7 is reproduced as an image in the source: Figure BDA0003090757730000092]
The metadata for the Chinese teaching material triple information $G_b$ is produced as follows. The teaching material is divided at the granularity of book number and course number and, with the course number as the finest granularity, handed to personnel with Chinese teaching knowledge. With reference to materials such as teaching plans, they label the Chinese character entities, word entities, and idiom entities to be mastered in each course, extract the after-class exercises into an Excel table in a specified format, and associate the content examined by each exercise with the Chinese character, word, and idiom entities. The resulting formatted teaching material data are parsed by a program and stored in a MySQL database, which completes the Chinese teaching material triple information $G_b$.
FIG. 9 shows the conceptual layer design of the Chinese teaching material triple information $G_b$; the resulting (partial) visualization of $G_b$ is shown in FIG. 10.
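The book, course, and exercise hierarchy described above can be flattened into triples along the following lines. This is a sketch only: the nested dictionary is a hand-written stand-in for the parsed Excel data, the title, numbers, and exercise identifier are invented, and the relation names (`has_book`, `has_course`, `teaches`, `has_exercise`, `examines`) are hypothetical because Table 7 is not available as text.

```python
# Hypothetical, hand-written stand-in for the annotated teaching material data;
# the title, book/course numbers, and exercise identifier are invented.
TEXTBOOK = {
    "title": "Sample Chinese Textbook",
    "books": [
        {
            "number": 1,
            "courses": [
                {
                    "number": 3,
                    "entities": [("character", "新"), ("word", "新年")],
                    "exercises": [
                        {"id": "B1-C3-Q1", "examines": [("word", "新年")]},
                    ],
                },
            ],
        },
    ],
}


def textbook_to_triples(textbook):
    """Flatten teaching material -> book -> course -> entity/exercise into triples.

    The relation names are assumptions made for illustration.
    """
    triples = []
    material = ("teaching_material", textbook["title"])
    for book in textbook["books"]:
        book_node = ("book_number", f'{textbook["title"]}/book {book["number"]}')
        triples.append((material, "has_book", book_node))
        for course in book["courses"]:
            course_node = ("course_number", f'{book_node[1]}/course {course["number"]}')
            triples.append((book_node, "has_course", course_node))
            # Entities to be mastered are attached at course granularity.
            for entity in course["entities"]:
                triples.append((course_node, "teaches", entity))
            # Each exercise is linked to the entities it examines.
            for exercise in course["exercises"]:
                exercise_node = ("exercise", exercise["id"])
                triples.append((course_node, "has_exercise", exercise_node))
                for entity in exercise["examines"]:
                    triples.append((exercise_node, "examines", entity))
    return triples


g_b = textbook_to_triples(TEXTBOOK)
```

Book and course nodes carry the full path in their names so that "course 3" of different books does not collapse into one node.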
Step 4: Knowledge graph fusion based on co-occurrence entity nodes
Taking one embodiment as an example, the co-occurrence entity set $E_{co} = \{\text{new}_c, \text{new}_t, \text{new}_b, \text{new year}_t, \text{new year}_b, \text{old and new}_t, \text{old and new}_b, \ldots\}$ is screened from the Chinese character theory triple information $G_c$, the Chinese teaching triple information $G_t$, and the Chinese teaching material triple information $G_b$. The screening principles are: (1) the entity node serves a teaching purpose, that is, the entity belongs to one of three entity types: Chinese character entity, word entity, or idiom entity; (2) the entity node appears in at least two different sets of triple information at the same time. Here $\text{new}_c$ denotes the entity node that comes from the Chinese character theory triple information $G_c$, $\text{new}_t$ the entity node that comes from the Chinese teaching triple information $G_t$, and $\text{new}_b$ the entity node that comes from the Chinese teaching material triple information $G_b$.
Within the co-occurrence entity set $E_{co}$, nodes that come from different triple information and have the same name are regarded as synonymous nodes; merging the synonymous nodes together with the relationship sets of their source triple information yields the fused knowledge graph $G_f$. FIG. 11 shows a (partial) visualization of the fused knowledge graph $G_f$.
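The screening and merging rules above can be sketched in a few lines of Python. This is a minimal illustration under assumptions, not the patented implementation: nodes are modeled as `(type, name)` pairs, the source labels `"c"`, `"t"`, `"b"` and the `"shared"` tag are hypothetical conventions, the sample triples and relation names are invented, and "appears in two different sets" is implemented as "at least two".

```python
from collections import defaultdict

# Entity types that serve a teaching purpose (screening principle 1).
TEACHING_TYPES = {"character", "word", "idiom"}


def nodes_of(triples):
    """Collect every head and tail node of a triple list."""
    nodes = set()
    for head, _, tail in triples:
        nodes.add(head)
        nodes.add(tail)
    return nodes


def co_occurrence_set(graphs):
    """Return {node: sorted source labels} for teaching-purpose nodes that
    appear in at least two source graphs (screening principle 2)."""
    sources = defaultdict(set)
    for label, triples in graphs.items():
        for node in nodes_of(triples):
            if node[0] in TEACHING_TYPES:
                sources[node].add(label)
    return {n: sorted(labels) for n, labels in sources.items() if len(labels) >= 2}


def fuse(graphs):
    """Merge same-name co-occurring nodes and keep all source relations."""
    shared = co_occurrence_set(graphs)

    def canonical(label, node):
        # Co-occurring nodes lose their source label and therefore merge;
        # every other node stays private to the graph it came from.
        return ("shared",) + node if node in shared else (label,) + node

    fused = set()
    for label, triples in graphs.items():
        for head, relation, tail in triples:
            fused.add((canonical(label, head), relation, canonical(label, tail)))
    return fused


# Invented sample data; relation names are hypothetical.
graphs = {
    "c": [(("character", "新"), "has_radical", ("radical", "斤"))],
    "t": [
        (("character", "新"), "uses_teaching_method", ("teaching_method", "component analysis")),
        (("word", "新年"), "uses_teaching_skill", ("teaching_skill", "picture prompts")),
    ],
    "b": [
        (("course_number", "book 1/course 3"), "teaches", ("character", "新")),
        (("course_number", "book 1/course 3"), "teaches", ("word", "新年")),
    ],
}

e_co = co_occurrence_set(graphs)
g_f = fuse(graphs)
```

In this toy data the character node is found in all three sources and the word node in two, so both are merged; the radical, teaching method, teaching skill, and course nodes keep their source label and are never merged, even if another source happened to use the same name.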
This completes the construction and fusion of the multi-source knowledge graph in the Chinese education field. The multi-source knowledge graph offers comprehensive knowledge coverage, a well-structured knowledge representation, and support for deep relational reasoning. As data support for the Chinese education field, it associates and integrates knowledge that serves a teaching purpose and facilitates subsequent work on the knowledge graph such as data mining, personalized recommendation, and intelligent question answering.
With further reference to fig. 12, as an implementation of the methods shown in the above figures, the present application provides an embodiment of a multi-source knowledge-graph fusion apparatus in the chinese education field, where the embodiment of the apparatus corresponds to the embodiment of the method shown in fig. 2, and the apparatus may be applied to various electronic devices.
The embodiment of the application provides a multi-source knowledge graph fusion device in the field of Chinese education, including:
a Chinese character theory triple information module 1 configured to construct, from Chinese character theory knowledge, Chinese character theory triple information $G_c$ formed by a first entity set $E_c$ and a first relationship set $R_c$, in the form $\langle E_i, R_j, E_k \rangle$, where $E_i, E_k \in E_c$ and $R_j \in R_c$;
a Chinese teaching triple information module 2 configured to construct, from Chinese teaching knowledge, Chinese teaching triple information $G_t$ formed by a second entity set $E_t$ and a second relationship set $R_t$, in the form $\langle H_i, L_j, H_k \rangle$, where $H_i, H_k \in E_t$ and $L_j \in R_t$;
a Chinese teaching material triple information module 3 configured to construct, from Chinese teaching material knowledge, Chinese teaching material triple information $G_b$ formed by a third entity set $E_b$ and a third relationship set $R_b$, in the form $\langle M_i, N_j, M_k \rangle$, where $M_i, M_k \in E_b$ and $N_j \in R_b$; at least one identical entity node exists between every two of the first entity set $E_c$, the second entity set $E_t$, and the third entity set $E_b$, and these shared nodes form a co-occurrence entity set; and
a knowledge graph fusion module 4 configured to screen the co-occurrence entity set from the Chinese character theory triple information $G_c$, the Chinese teaching triple information $G_t$, and the Chinese teaching material triple information $G_b$, and to fuse $G_c$, $G_t$, and $G_b$ according to the co-occurrence entity set to obtain the knowledge graph of the Chinese education field.
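As a rough picture of how the four modules fit together, the sketch below wires three builder callables and one fusion callable into a single object. The class name `FusionDevice`, its field names, the placeholder `pooled_union` fusion, and the sample triples are all hypothetical; the source only specifies what each module is configured to do, not how it is packaged.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Set, Tuple

Node = Tuple[str, str]            # (entity type, entity name)
Triple = Tuple[Node, str, Node]   # (head, relation, tail)


@dataclass
class FusionDevice:
    """Four callables mirroring modules 1 to 4 (hypothetical packaging)."""

    build_g_c: Callable[[], List[Triple]]                   # module 1
    build_g_t: Callable[[], List[Triple]]                   # module 2
    build_g_b: Callable[[], List[Triple]]                   # module 3
    fuse: Callable[[Iterable[List[Triple]]], Set[Triple]]   # module 4

    def run(self) -> Set[Triple]:
        # Build the three triple sets, then hand them to the fusion module.
        return self.fuse([self.build_g_c(), self.build_g_t(), self.build_g_b()])


def pooled_union(graphs: Iterable[List[Triple]]) -> Set[Triple]:
    # Placeholder fusion: pool all triples, so nodes with an equal
    # (type, name) pair coincide. A real module 4 would also screen the
    # co-occurrence entity set first.
    fused: Set[Triple] = set()
    for triples in graphs:
        fused.update(triples)
    return fused


# Invented sample builders; relation names are hypothetical.
device = FusionDevice(
    build_g_c=lambda: [(("character", "新"), "has_radical", ("radical", "斤"))],
    build_g_t=lambda: [
        (("character", "新"), "uses_teaching_method", ("teaching_method", "component analysis"))
    ],
    build_g_b=lambda: [(("course_number", "book 1/course 3"), "teaches", ("character", "新"))],
    fuse=pooled_union,
)
g_f = device.run()
```

Passing the modules in as callables keeps each data source independently replaceable, which matches the device description's separation into one module per source plus a fusion module.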
Referring now to fig. 13, a schematic diagram of a computer device 1300 suitable for use in implementing an electronic device (e.g., the server or terminal device shown in fig. 1) according to an embodiment of the present application is shown. The electronic device shown in fig. 13 is only an example, and should not bring any limitation to the functions and the scope of use of the embodiments of the present application.
As shown in fig. 13, the computer apparatus 1300 includes a Central Processing Unit (CPU)1301 and a Graphic Processor (GPU)1302, which can perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM)1303 or a program loaded from a storage portion 1309 into a Random Access Memory (RAM) 1304. In the RAM 1304, various programs and data necessary for the operation of the apparatus 1300 are also stored. The CPU 1301, GPU1302, ROM 1303, and RAM 1304 are connected to each other via a bus 1305. An input/output (I/O) interface 1306 is also connected to bus 1305.
The following components are connected to the I/O interface 1306: an input portion 1307 including a keyboard, a mouse, and the like; an output portion 1308 including a display such as a Liquid Crystal Display (LCD), a speaker, and the like; a storage portion 1309 including a hard disk and the like; and a communication portion 1310 including a network interface card such as a LAN card, a modem, or the like. The communication portion 1310 performs communication processing via a network such as the Internet. A drive 1311 may also be connected to the I/O interface 1306 as needed. A removable medium 1312, such as a magnetic disk, an optical disk, a magneto-optical disk, or a semiconductor memory, is mounted on the drive 1311 as needed, so that a computer program read from it can be installed into the storage portion 1309 as needed.
In particular, according to an embodiment of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program embodied on a computer readable medium, the computer program comprising program code for performing the method illustrated in the flowchart. In such embodiments, the computer program may be downloaded and installed from a network via the communication portion 1310, and/or installed from the removable medium 1312. The computer program, when executed by the Central Processing Unit (CPU) 1301 and the Graphics Processing Unit (GPU) 1302, performs the above-described functions defined in the methods of the present application.
It should be noted that the computer readable medium described herein can be a computer readable signal medium or a computer readable storage medium, or any combination of the two. The computer readable storage medium may be, for example, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples of the computer readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the present application, a computer readable storage medium may be any tangible medium that can contain or store a program for use by or in connection with an instruction execution system, apparatus, or device. A computer readable signal medium, by contrast, may include a propagated data signal with computer readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electromagnetic, optical, or any suitable combination thereof. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: wireless, wire, fiber optic cable, RF, etc., or any suitable combination of the foregoing.
Computer program code for carrying out operations for aspects of the present application may be written in any combination of one or more programming languages, including object oriented programming languages such as Java, Smalltalk, C++, or the like, and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider).
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of apparatus, methods and computer program products according to various embodiments of the present application. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based devices that perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The modules described in the embodiments of the present application may be implemented by software or hardware. The modules described may also be provided in a processor.
As another aspect, the present application also provides a computer-readable medium, which may be contained in the electronic device described in the above embodiments, or may exist separately without being assembled into the electronic device. The computer-readable medium carries one or more programs which, when executed by the electronic device, cause the electronic device to: construct, from Chinese character theory knowledge, Chinese character theory triple information G_c formed by a first entity set E_c and a first relationship set R_c, in the form <E_i, R_j, E_k>, where E_i, E_k ∈ E_c and R_j ∈ R_c; construct, from Chinese teaching knowledge, Chinese teaching triple information G_t formed by a second entity set E_t and a second relationship set R_t, in the form <H_i, L_j, H_k>, where H_i, H_k ∈ E_t and L_j ∈ R_t; construct, from Chinese teaching material knowledge, Chinese teaching material triple information G_b formed by a third entity set E_b and a third relationship set R_b, in the form <M_i, N_j, M_k>, where M_i, M_k ∈ E_b and N_j ∈ R_b, wherein identical entity nodes exist between every two of the first entity set E_c, the second entity set E_t, and the third entity set E_b, and these nodes form a co-occurrence entity set; screen out the co-occurrence entity set from the Chinese character theory triple information G_c, the Chinese teaching triple information G_t, and the Chinese teaching material triple information G_b; and fuse G_c, G_t, and G_b according to the co-occurrence entity set to obtain the knowledge graph of the Chinese education field.
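The fusion step performed by such a program can be sketched as follows. This is a minimal Python sketch, not the implementation of the application: it assumes that nodes from different sources are merged whenever their names are equal, and it keeps a provenance map so that merged nodes can be inspected. The function names `fuse` and `merged_nodes`, the source labels, and the sample triples are hypothetical.

```python
from collections import defaultdict


def fuse(named_graphs):
    """Merge triples from several sources into one set of triples.

    Assumption: nodes with identical names in different sources are the
    same node, so adding all triples to one set merges them implicitly.
    Returns the fused triples and a map from node name to source labels.
    """
    fused = set()
    provenance = defaultdict(set)
    for label, triples in named_graphs.items():
        for head, relation, tail in triples:
            fused.add((head, relation, tail))
            provenance[head].add(label)
            provenance[tail].add(label)
    return fused, dict(provenance)


def merged_nodes(provenance):
    """Return the nodes that appear in at least two sources."""
    return {node for node, labels in provenance.items() if len(labels) >= 2}


# Hypothetical sample triples for the three sources.
sources = {
    "G_c": [("好", "has_radical", "女")],
    "G_t": [("好", "taught_with", "radical decomposition")],
    "G_b": [("好", "appears_in", "Book 1 Lesson 1"),
            ("你好", "appears_in", "Book 1 Lesson 1")],
}

fused, provenance = fuse(sources)
bridges = merged_nodes(provenance)
```

In this sample only 好 appears in more than one source, so it is the single node through which the three sets of triples become connected in the fused graph.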
The above description is only a preferred embodiment of the application and is illustrative of the principles of the technology employed. It will be appreciated by those skilled in the art that the scope of the invention herein disclosed is not limited to the particular combination of features described above, but also encompasses other arrangements formed by any combination of the above features or their equivalents without departing from the spirit of the invention. For example, the above features may be replaced with (but not limited to) features having similar functions disclosed in the present application.

Claims (10)

1. A multi-source knowledge graph fusion method in the field of Chinese education, characterized by comprising the following steps:
S1, constructing, from Chinese character theory knowledge, Chinese character theory triple information G_c formed by a first entity set E_c and a first relationship set R_c, in the form <E_i, R_j, E_k>, where E_i, E_k ∈ E_c and R_j ∈ R_c;
S2, constructing, from Chinese teaching knowledge, Chinese teaching triple information G_t formed by a second entity set E_t and a second relationship set R_t, in the form <H_i, L_j, H_k>, where H_i, H_k ∈ E_t and L_j ∈ R_t;
S3, constructing, from Chinese teaching material knowledge, Chinese teaching material triple information G_b formed by a third entity set E_b and a third relationship set R_b, in the form <M_i, N_j, M_k>, where M_i, M_k ∈ E_b and N_j ∈ R_b; identical entity nodes exist between every two of the first entity set E_c, the second entity set E_t, and the third entity set E_b, and these nodes form a co-occurrence entity set; and
S4, screening out the co-occurrence entity set from the Chinese character theory triple information G_c, the Chinese teaching triple information G_t, and the Chinese teaching material triple information G_b, and fusing G_c, G_t, and G_b according to the co-occurrence entity set to obtain the knowledge graph of the Chinese education field.
2. The multi-source knowledge graph fusion method in the field of Chinese education according to claim 1, wherein the first entity set E_c comprises Chinese character structure entities, Chinese character radical entities, Chinese character pinyin entities, Chinese character open knowledge entities, and Chinese character entities, and the first entity set E_c and the first relationship set R_c are collected by crowdsourcing and established in JSON string format.
3. The multi-source knowledge graph fusion method in the field of Chinese education according to claim 1, wherein the second entity set E_t comprises teaching method entities, teaching skill entities, word entities, idiom entities, and Chinese character entities; the teaching method entities and the teaching skill entities in the second entity set E_t and the second relationship set R_t are exported in xmind file format; and the relationship set between the word entities, idiom entities, and Chinese character entities on the one hand and the teaching method entities and teaching skill entities on the other is collected by crowdsourcing and established in Excel table format.
4. The multi-source knowledge graph fusion method in the field of Chinese education according to claim 1, wherein the third entity set E_b comprises teaching material entities, teaching material book number entities, teaching material course number entities, problem entities, word entities, idiom entities, and Chinese character entities; and the third entity set E_b and the third relationship set R_b are obtained by dividing a selected Chinese teaching material with the book number and the course number as granularity, labeling the word entities, idiom entities, and Chinese character entities in the selected Chinese teaching material with the course number as the finest granularity, associating the exercise and review content in the selected Chinese teaching material with the word entities, idiom entities, and Chinese character entities, and establishing the result in Excel table format.
5. The multi-source knowledge graph fusion method in the field of Chinese education according to claim 1, wherein the co-occurrence entity set comprises word entities, idiom entities, and Chinese character entities.
6. The multi-source knowledge graph fusion method in the field of Chinese education according to claim 1, wherein step S4 specifically comprises traversing the co-occurrence entity set, merging synonymous nodes by matching them, and finally integrating the synonymous nodes in the Chinese character theory triple information G_c, the Chinese teaching triple information G_t, or the Chinese teaching material triple information G_b to obtain the knowledge graph of the Chinese education field.
7. The multi-source knowledge graph fusion method in the field of Chinese education according to claim 1, wherein the synonymous nodes are nodes with the same name that exist between any two of the Chinese character theory triple information G_c, the Chinese teaching triple information G_t, and the Chinese teaching material triple information G_b.
8. A multi-source knowledge graph fusion device in the field of Chinese education, characterized by comprising:
a Chinese character theory triple information module, configured to construct, from Chinese character theory knowledge, Chinese character theory triple information G_c formed by a first entity set E_c and a first relationship set R_c, in the form <E_i, R_j, E_k>, where E_i, E_k ∈ E_c and R_j ∈ R_c;
a Chinese teaching triple information module, configured to construct, from Chinese teaching knowledge, Chinese teaching triple information G_t formed by a second entity set E_t and a second relationship set R_t, in the form <H_i, L_j, H_k>, where H_i, H_k ∈ E_t and L_j ∈ R_t;
a Chinese teaching material triple information module, configured to construct, from Chinese teaching material knowledge, Chinese teaching material triple information G_b formed by a third entity set E_b and a third relationship set R_b, in the form <M_i, N_j, M_k>, where M_i, M_k ∈ E_b and N_j ∈ R_b; identical entity nodes exist between every two of the first entity set E_c, the second entity set E_t, and the third entity set E_b, and these nodes form a co-occurrence entity set; and
a knowledge graph fusion module, configured to screen out the co-occurrence entity set from the Chinese character theory triple information G_c, the Chinese teaching triple information G_t, and the Chinese teaching material triple information G_b, and to fuse G_c, G_t, and G_b according to the co-occurrence entity set to obtain the knowledge graph of the Chinese education field.
9. An electronic device, comprising: one or more processors; storage means for storing one or more programs which, when executed by the one or more processors, cause the one or more processors to carry out the method according to any one of claims 1-7.
10. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the method according to any one of claims 1-7.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110594900.3A CN113204651A (en) 2021-05-28 2021-05-28 Multi-source knowledge graph fusion method and device in Chinese education field

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110594900.3A CN113204651A (en) 2021-05-28 2021-05-28 Multi-source knowledge graph fusion method and device in Chinese education field

Publications (1)

Publication Number Publication Date
CN113204651A true CN113204651A (en) 2021-08-03

Family

ID=77023571

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110594900.3A Pending CN113204651A (en) 2021-05-28 2021-05-28 Multi-source knowledge graph fusion method and device in Chinese education field

Country Status (1)

Country Link
CN (1) CN113204651A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7783658B1 (en) * 2007-06-18 2010-08-24 Seisint, Inc. Multi-entity ontology weighting systems and methods
CN104462501A (en) * 2014-12-19 2015-03-25 北京奇虎科技有限公司 Knowledge graph construction method and device based on structural data
CN110008354A (en) * 2019-04-10 2019-07-12 华侨大学 A kind of construction method of the external Chinese studying content of knowledge based map
CN112836057A (en) * 2019-11-22 2021-05-25 华为技术有限公司 Knowledge graph generation method, device, terminal and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20210803