CN113987204A - Method and system for constructing field encyclopedia map - Google Patents

Method and system for constructing field encyclopedia map Download PDF

Info

Publication number
CN113987204A
CN113987204A CN202111254354.5A CN202111254354A CN113987204A CN 113987204 A CN113987204 A CN 113987204A CN 202111254354 A CN202111254354 A CN 202111254354A CN 113987204 A CN113987204 A CN 113987204A
Authority
CN
China
Prior art keywords
entries
entry
field
encyclopedia
domain
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111254354.5A
Other languages
Chinese (zh)
Inventor
宋伟
王鹏飞
刘圣
郑刘梦
焦亚鑫
赵化育
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Medpeer Information Technology Co ltd
Original Assignee
Beijing Medpeer Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Medpeer Information Technology Co ltd filed Critical Beijing Medpeer Information Technology Co ltd
Priority to CN202111254354.5A priority Critical patent/CN113987204A/en
Publication of CN113987204A publication Critical patent/CN113987204A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/355Class or cluster creation or modification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Animal Behavior & Ethology (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The application discloses a method and a system for constructing field encyclopedia maps, wherein the method comprises the following steps: collecting related entries of a field from a plurality of open encyclopedia data sources to obtain a field entry set; respectively carrying out reference labeling and label labeling on the entries in the field entry set to form a reference relation network and a classification tree among different entries; and constructing a field encyclopedia map based on the reference relation network and the classification tree. According to the method, the field related entries in the open encyclopedia are collected through the constructed open encyclopedia data collection model, the collected entry sets are subjected to reference labeling and label labeling, the field encyclopedia map is constructed according to the labeled information and the field encyclopedia relation model, the field encyclopedia can be constructed quickly and with low manual participation by using the open encyclopedia as a material, and the recommendation capability among the related entries is realized.

Description

Method and system for constructing field encyclopedia map
Technical Field
The application belongs to the technical field of computers, and particularly relates to a method and a system for constructing field encyclopedia.
Background
Encyclopedia is a knowledge resource with higher value density, and compared with information, encyclopedia entries can describe concept information more comprehensively and completely. At present, open encyclopedias represented by wikipedia, encyclopedia, interactive encyclopedia, search encyclopedia and the like become one of the most valuable knowledge resources on the internet, noun concepts of almost all trades can find related entries in the open encyclopedias, and convenience is provided for users to acquire data knowledge to a certain extent.
A domain encyclopedia is an encyclopedia set of entries that contains only one or several disciplines (classes) of knowledge that have some inherent association. Compared with general encyclopedias, the field encyclopedias have the characteristics of knowledge focus and complete system, and can facilitate users in specific fields to quickly position and comprehensively know knowledge in the fields. Compared with a crowdsourcing construction mode adopted by general encyclopedias such as Wikipedia, Baidu encyclopedia, interactive encyclopedia and search encyclopedia, the field encyclopedia construction usually needs to be carried out by mechanisms and personnel organizations in the field, mainly adopts manual construction, is time-consuming and labor-consuming, and is difficult to adapt to the era characteristics of data explosion type growth.
Therefore, the field encyclopedia construction should solve several problems: the encyclopedic entries can be updated in time, and the entries are effectively expanded to support users to comprehensively understand related concepts. A good field encyclopedia construction method should generally have the following three features: the method has the advantages that information is focused, the term range of field encyclopedia is required to serve field requirements as much as possible, and field-independent terms are provided as little as possible, so that the value density of data is improved, and the interference of independent data is reduced; secondly, the construction is rapid, the construction process of the field encyclopedia, except the necessary expert knowledge confirmation process, should provide the automation ability as much as possible to realize the rapid expansion and perfection of the vocabulary entry; and thirdly, the recommendation is effective, in order to improve the knowledge acquisition efficiency of the user, the constructed field encyclopedia should have the recommendation capability of the vocabulary entry, and the recommendation of the vocabulary entry should be developed around citation recommendation and classification subject recommendation according to the use habits of the field user.
Open encyclopedias represented by wikipedia, encyclopedia, interactive encyclopedia, search encyclopedia and the like have massive encyclopedia entries, and almost cover noun concepts of all trades. However, there are still some deficiencies to using open encyclopedia to obtain knowledge of data for users in a particular domain. On one hand, mass information resources are accompanied by the unfocused knowledge, and the cost for acquiring data knowledge by a user in a specific field is increased invisibly; on the other hand, the knowledge accumulation of each open encyclopedia in a specific field has complementarity, and the requirement for acquiring the encyclopedia knowledge in the specific field cannot be well solved by depending on a single encyclopedia information source. The method of artificially constructing encyclopedia from the beginning has the defects of high construction cost and incapability of quickly constructing a large amount of components.
Content of application
The embodiment of the application aims to provide a method and a system for constructing field encyclopedia so as to solve the problems that in the prior art, open encyclopedia data are scattered and unfocused, and manual construction cost is high.
In order to solve the technical problem, the present application is implemented as follows:
in a first aspect, a method for constructing a field encyclopedia map is provided, which comprises the following steps:
collecting related entries of a field from a plurality of open encyclopedia data sources to obtain a field entry set;
respectively carrying out reference labeling and label labeling on the entries in the field entry set to form a reference relation network and a classification tree among different entries;
and constructing a field encyclopedia map based on the reference relation network and the classification tree.
In a second aspect, a system for constructing a domain encyclopedia is provided, which includes:
the system comprises an acquisition module, a processing module and a display module, wherein the acquisition module is used for acquiring field related entries from a plurality of open encyclopedia data sources to obtain a field entry set;
the labeling module is used for respectively performing reference labeling and label labeling on the entries in the field entry set to form a reference relation network and a classification tree among different entries;
and the construction module is used for constructing the field encyclopedia map based on the reference relation network and the classification tree.
According to the method and the device, the field related entries in the open encyclopedia are collected through the constructed open encyclopedia data collection model, the collected entry sets are subjected to reference labeling and label labeling, the field encyclopedia map is constructed according to the field encyclopedia relation model according to the labeled information, the field encyclopedia can be constructed quickly and with low manual participation by taking the open encyclopedia as a material, and the recommendation capability among the related entries is realized.
Drawings
FIG. 1 is a flowchart of a method for constructing a domain encyclopedia provided in an embodiment of the present application;
FIG. 2 is a diagram of a domain encyclopedia relationship model provided by an embodiment of the present application;
fig. 3 is a schematic structural diagram of a domain encyclopedia construction system provided in an embodiment of the present application.
Detailed Description
The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are some, but not all, embodiments of the present application. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
The invention provides a field encyclopedia map construction method and system based on open encyclopedia, which are oriented to encyclopedia knowledge construction scenes in a specific field and focus on basic features of field encyclopedia.
The method for constructing the field encyclopedia provided by the embodiment of the present application is described in detail below with reference to the accompanying drawings through specific embodiments and application scenarios thereof.
As shown in fig. 1, a flowchart of a method for constructing a domain encyclopedia provided in an embodiment of the present application includes the following steps:
step 101, collecting related entries of a field from a plurality of open encyclopedia data sources to obtain a field entry set.
Specifically, the field-related entries can be respectively collected from a plurality of open encyclopedia data sources to obtain a plurality of entry sets; and removing the duplication of the plurality of entry sets to obtain a field entry set.
In this embodiment, an open encyclopedia data collection model may be constructed, and a field entry set may be collected based on the model. Specifically, the method includes the steps of collecting related entries of the field from a plurality of open encyclopedia data sources respectively to obtain a plurality of entry sets, and specifically includes:
performing the following operations for each development encyclopedia data source:
step A: taking key concepts in the field as seed entries, taking the seed entries as clues, and crawling association entries quoted by the seed entries in the open encyclopedia data source;
and B: judging whether core entries closely related to the domain knowledge exist in the crawled associated entries, if not, storing all the crawled associated entries as expansion entries into a domain entry set; if so, taking the core entry in the crawled associated entries as a new seed entry, taking other entries except the core entry in the crawled associated entries as expansion entries to be stored in a field entry set, and executing the step C;
and C: judging whether a termination condition is reached, if so, executing the step E; otherwise, executing step D;
step D: crawling the associated entries quoted by the new seed entry in the open encyclopedia data source, and executing the step B;
step E: and storing all the seed entries as core entries into the field entry set.
Wherein, judging whether the termination condition is reached specifically comprises: judging whether core entries in the crawled associated entries are all existing seed entries, and if so, determining that a termination condition is reached; otherwise, determining that the termination condition is not reached;
or
Judging whether a preset acquisition depth is reached, and if so, determining that a termination condition is reached; otherwise, it is determined that the termination condition is not reached.
And 102, respectively carrying out reference labeling and label labeling on the entries in the field entry set to form a reference relation network and a classification tree among different entries.
Specifically, the reference relationship between different entries in the domain entry set can be analyzed to form a reference relationship network between different entries; and analyzing the inclusion relation and the affiliation relation among different entries in the field entry set to form a classification tree among the different entries.
And 103, constructing a field encyclopedia map based on the reference relation network and the classification tree.
The field encyclopedia map comprises three concepts of entries, topics and classifications, and three relations of entry reference relations, topic inclusion relations and classification affiliation relations;
accordingly, at least one entry from the set of domain entries may be selected as a topic based on the classification tree, the entry as the topic having a containment relationship with at least one other entry in the set of domain entries; selecting at least one entry from the domain entry set as a classification based on the classification tree, wherein the entry serving as the classification has an attribution relationship with at least one other entry in the domain entry set; and constructing an entry reference relation between different entries in the field encyclopedia, a topic inclusion relation between topics and entries and a classification attribution relation between classifications and entries on the basis of the reference relation network and the classification tree.
According to the method and the device, the field related entries in the open encyclopedia are collected through the constructed open encyclopedia data collection model, the collected entry sets are subjected to reference labeling and label labeling, the field encyclopedia map is constructed according to the field encyclopedia relation model according to the labeled information, the field encyclopedia can be constructed quickly and with low manual participation by taking the open encyclopedia as a material, and the recommendation capability among the related entries is realized.
The method comprises the steps of constructing field encyclopedias, wherein the problem of entry screening is firstly solved, in the embodiment of the application, the open encyclopedias are used as entry construction sources, entry collection is achieved by constructing collection models corresponding to the open encyclopedias, then, duplication removal processing is carried out on entry sets collected by a plurality of open encyclopedias, and a final field entry set is formed.
The key of field encyclopedia entry screening is the design of an open encyclopedia collection model, and as the open encyclopedia generally covers massive entry information of each field and how to accurately screen entries related to the field, the collection workload is reduced on the basis of completely covering the field entries as far as possible, so that the key problem of the open encyclopedia collection model design is solved. Firstly, a domain expert designates key concepts in a plurality of domains as seed entries, an open encyclopedia collection model crawls the seed entries and associated entries quoted by the seed entries in the open encyclopedia by taking the seed entries as clues, then the domain expert participates in semi-automatic screening of core entries closely related to domain knowledge in the associated entries, the core entries serve as new seed entries to carry out a new round of associated entry acquisition operation, and non-core entries serve as expansion entries to be stored in an open encyclopedia collection entry set. And repeating the steps until a termination condition is reached, if the seed entry set does not increase any more or an agreed acquisition depth is reached, storing all the seed entries serving as core entries into the open encyclopedia acquisition entry set.
The term labeling is a basis for constructing a domain encyclopedia term relation map, and the embodiment of the application provides two modes of reference labeling and label labeling according to the use habits of domain users. The reference labeling is mainly used for automatically analyzing the reference relation among the entries to form an entry reference relation network. The label labeling is mainly used for analyzing and constructing a classification tree of entries, and because of the professionality of field encyclopedia, the invention adopts a human-in-loop entry classification tree construction mode, firstly, the classification labels are automatically marked for the entries by a system, and then, the classification is audited and organized by a superior and inferior structure by field experts, so that the professionally classified labeling of the entries is realized.
The final step of the field encyclopedia construction is to establish a field encyclopedia map to support reading recommendations among entries. According to the application embodiment, a field encyclopedia relation model is designed according to the use habits of field users and around the reference relation and the classification subject relation among the entries, and the construction of the field encyclopedia map is realized according to the model by using the result labeled by the entries. Wherein, the field encyclopedia relation model is shown in fig. 2.
According to the embodiment of the application, the seed entries provided by the field experts are used as input, the reference relation of the open encyclopedia entries is collected layer by layer, and the quick and low-manual-participation field encyclopedia entry set with information focusing is obtained. In addition, a domain encyclopedia relation model comprising three concepts of terms, topics and classifications and three relations including term citation relations, topic inclusion relations and classification attribution relations is designed, and a domain encyclopedia constructed according to the model can face the term reading habits of domain users and well support the reading recommendation capability realization of the terms in the actual system construction and use.
As shown in fig. 3, a schematic structural diagram of a system for constructing a domain encyclopedia provided in an embodiment of the present application includes:
the acquisition module 310 is configured to acquire field-related entries from a plurality of open encyclopedia data sources to obtain a field entry set;
specifically, the acquisition module 310 includes:
the acquisition submodule is used for respectively acquiring related entries of the field from a plurality of open encyclopedia data sources to obtain a plurality of entry sets;
and the duplication removing module is used for removing duplication from the plurality of entry sets to obtain a field entry set.
Specifically, the collection submodule is specifically configured to perform the following operations on each development encyclopedia data source:
step A: taking the key concepts in the field as seed terms and taking the seed terms as clues to crawl associated terms II quoted by the seed terms in the open encyclopedia data source
And B: judging whether core entries closely related to domain knowledge exist in the crawled associated entries, if not, storing all the crawled associated entries as expansion entries into a domain entry set II, if so, storing the core entries in the crawled associated entries as new seed entries, storing other entries except the core entries in the crawled associated entries as expansion entries into the domain entry set, and executing the step C;
and C: judging whether a termination condition is reached, if so, executing the step E; otherwise, executing step D;
step D: crawling the associated entries quoted by the new seed entry in the open encyclopedia data source, and executing the step B;
step E: and storing all the seed entries as core entries into the field entry set.
In this embodiment, the determining, by the acquisition submodule, whether the termination condition is met specifically includes:
the acquisition submodule judges whether core entries in the crawled associated entries are all existing seed entries, and if so, the acquisition submodule determines that a termination condition is reached; otherwise, determining that the termination condition is not reached;
or
The acquisition submodule judges whether a preset acquisition depth is reached, and if so, the acquisition submodule determines that a termination condition is reached; otherwise, it is determined that the termination condition is not reached.
The labeling module 320 is configured to perform reference labeling and label labeling on the entries in the field entry set respectively to form a reference relationship network and a classification tree between different entries;
specifically, the tagging module 320 is specifically configured to analyze a reference relationship between different entries in the field entry set, so as to form a reference relationship network between different entries; and analyzing the inclusion relation and the affiliation relation among different entries in the field entry set to form a classification tree among the different entries.
A building module 330, configured to build a domain encyclopedia map based on the reference relationship network and the classification tree.
The field encyclopedia map comprises three concepts of entries, topics and classifications, and three relations of entry reference relations, topic inclusion relations and classification affiliation relations.
Specifically, the building module 330 is specifically configured to select, based on the classification tree, at least one entry from the domain entry set as a topic, where an inclusion relationship exists between the entry serving as the topic and at least one other entry in the domain entry set; selecting at least one entry from the domain entry set as a classification based on the classification tree, wherein the entry serving as the classification has an attribution relationship with at least one other entry in the domain entry set; and constructing an entry reference relation between different entries in the field encyclopedia, a topic inclusion relation between topics and entries and a classification attribution relation between classifications and entries on the basis of the reference relation network and the classification tree.
According to the method and the device, the field related entries in the open encyclopedia are collected through the constructed open encyclopedia data collection model, the collected entry sets are subjected to reference labeling and label labeling, the field encyclopedia map is constructed according to the field encyclopedia relation model according to the labeled information, the field encyclopedia can be constructed quickly and with low manual participation by taking the open encyclopedia as a material, and the recommendation capability among the related entries is realized.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, the statement "comprises a" or "comprising" a defined element does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element.
Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solutions of the present application may be embodied in the form of a software product, which is stored in a storage medium (such as ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling a terminal (such as a mobile phone, a computer, a server, an air conditioner, or a network device) to execute the method according to the embodiments of the present application.
While the present embodiments have been described with reference to the accompanying drawings, it is to be understood that the invention is not limited to the precise embodiments described above, which are meant to be illustrative and not restrictive, and that various changes may be made therein by those skilled in the art without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (10)

1. A method for constructing a field encyclopedia map is characterized by comprising the following steps:
collecting related entries of a field from a plurality of open encyclopedia data sources to obtain a field entry set;
respectively carrying out reference labeling and label labeling on the entries in the field entry set to form a reference relation network and a classification tree among different entries;
and constructing a field encyclopedia map based on the reference relation network and the classification tree.
2. The method according to claim 1, wherein the collecting of domain-related terms from a plurality of open encyclopedia data sources to obtain a domain term set specifically comprises:
respectively collecting related entries of the field from a plurality of open encyclopedia data sources to obtain a plurality of entry sets;
and removing the duplication of the plurality of entry sets to obtain a field entry set.
3. The method according to claim 2, wherein the acquiring the domain-related entries from the plurality of open encyclopedia data sources respectively to obtain a plurality of entry sets specifically comprises:
performing the following operations for each development encyclopedia data source:
step A: taking key concepts in the field as seed entries, taking the seed entries as clues, and crawling association entries quoted by the seed entries in the open encyclopedia data source;
and B: judging whether core entries closely related to the domain knowledge exist in the crawled associated entries, if not, storing all the crawled associated entries as expansion entries into a domain entry set; if so, taking the core entry in the crawled associated entries as a new seed entry, taking other entries except the core entry in the crawled associated entries as expansion entries to be stored in a field entry set, and executing the step C;
and C: judging whether a termination condition is reached, if so, executing the step E; otherwise, executing step D;
step D: crawling the associated entries quoted by the new seed entry in the open encyclopedia data source, and executing the step B;
step E: and storing all the seed entries as core entries into the field entry set.
4. The method according to claim 3, wherein the determining whether the termination condition is reached specifically includes:
judging whether core entries in the crawled associated entries are all existing seed entries, and if so, determining that a termination condition is reached; otherwise, determining that the termination condition is not reached;
or
Judging whether a preset acquisition depth is reached, and if so, determining that a termination condition is reached; otherwise, it is determined that the termination condition is not reached.
5. The method according to claim 1, wherein the domain encyclopedia comprises three concepts of entry, topic and classification, and three relations of entry reference relation, topic inclusion relation and classification attribution relation;
the method comprises the following steps of respectively carrying out reference labeling and label labeling on the entries in the field entry set to form a reference relationship network and a classification tree among different entries, and specifically comprises the following steps:
analyzing the reference relation among different entries in the field entry set to form a reference relation network among different entries; analyzing the inclusion relation and the affiliation relation among different entries in the field entry set to form a classification tree among the different entries;
constructing a field encyclopedia map based on the reference relationship network and the classification tree, and specifically comprising the following steps:
selecting at least one entry from the domain entry set as a subject based on the classification tree, wherein the entry serving as the subject has an inclusion relationship with at least one other entry in the domain entry set;
selecting at least one entry from the domain entry set as a classification based on the classification tree, wherein the entry serving as the classification has an attribution relationship with at least one other entry in the domain entry set;
and constructing an entry reference relation between different entries in the field encyclopedia, a topic inclusion relation between topics and entries and a classification attribution relation between classifications and entries on the basis of the reference relation network and the classification tree.
6. A system for constructing a domain encyclopedia map, comprising:
the system comprises an acquisition module, a processing module and a display module, wherein the acquisition module is used for acquiring field related entries from a plurality of open encyclopedia data sources to obtain a field entry set;
the labeling module is used for respectively performing reference labeling and label labeling on the entries in the field entry set to form a reference relation network and a classification tree among different entries;
and the construction module is used for constructing the field encyclopedia map based on the reference relation network and the classification tree.
7. The system of claim 6, wherein the acquisition module comprises:
the acquisition submodule is used for respectively acquiring related entries of the field from a plurality of open encyclopedia data sources to obtain a plurality of entry sets;
and the duplication removing module is used for removing duplication from the plurality of entry sets to obtain a field entry set.
8. The system of claim 7,
the acquisition submodule is specifically configured to perform the following operations on each development encyclopedia data source:
step A: taking key concepts in the field as seed entries, taking the seed entries as clues, and crawling association entries quoted by the seed entries in the open encyclopedia data source;
and B: judging whether core entries closely related to the domain knowledge exist in the crawled associated entries, if not, storing all the crawled associated entries as expansion entries into a domain entry set; if so, taking the core entry in the crawled associated entries as a new seed entry, taking other entries except the core entry in the crawled associated entries as expansion entries to be stored in a field entry set, and executing the step C;
and C: judging whether a termination condition is reached, if so, executing the step E; otherwise, executing step D;
step D: crawling the associated entries quoted by the new seed entry in the open encyclopedia data source, and executing the step B;
step E: and storing all the seed entries as core entries into the field entry set.
9. The system of claim 8, wherein the acquisition sub-module determines whether a termination condition is reached, and specifically includes:
the acquisition submodule judges whether core entries in the crawled associated entries are all existing seed entries, and if so, the acquisition submodule determines that a termination condition is reached; otherwise, determining that the termination condition is not reached;
or
The acquisition submodule judges whether a preset acquisition depth is reached, and if so, the acquisition submodule determines that a termination condition is reached; otherwise, it is determined that the termination condition is not reached.
10. The system of claim 6, wherein the domain encyclopedia comprises three concepts of entry, topic and classification, and three relationships of entry reference relationship, topic inclusion relationship and classification attribution relationship;
the labeling module is specifically used for analyzing the reference relationship among different entries in the field entry set to form a reference relationship network among the different entries; analyzing the inclusion relation and the affiliation relation among different entries in the field entry set to form a classification tree among the different entries;
the building module is specifically configured to select at least one entry from the domain entry set as a topic based on the classification tree, where the entry serving as the topic has an inclusion relationship with at least one other entry in the domain entry set; selecting at least one entry from the domain entry set as a classification based on the classification tree, wherein the entry serving as the classification has an attribution relationship with at least one other entry in the domain entry set; and constructing an entry reference relation between different entries in the field encyclopedia, a topic inclusion relation between topics and entries and a classification attribution relation between classifications and entries on the basis of the reference relation network and the classification tree.
CN202111254354.5A 2021-10-27 2021-10-27 Method and system for constructing field encyclopedia map Pending CN113987204A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111254354.5A CN113987204A (en) 2021-10-27 2021-10-27 Method and system for constructing field encyclopedia map

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111254354.5A CN113987204A (en) 2021-10-27 2021-10-27 Method and system for constructing field encyclopedia map

Publications (1)

Publication Number Publication Date
CN113987204A true CN113987204A (en) 2022-01-28

Family

ID=79742347

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111254354.5A Pending CN113987204A (en) 2021-10-27 2021-10-27 Method and system for constructing field encyclopedia map

Country Status (1)

Country Link
CN (1) CN113987204A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115983381A (en) * 2023-02-28 2023-04-18 华院计算技术(上海)股份有限公司 Knowledge base rapid construction method and system based on online encyclopedia

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115983381A (en) * 2023-02-28 2023-04-18 华院计算技术(上海)股份有限公司 Knowledge base rapid construction method and system based on online encyclopedia

Similar Documents

Publication Publication Date Title
CN106383875B (en) Man-machine interaction method and device based on artificial intelligence
US6886007B2 (en) Taxonomy generation support for workflow management systems
CN109871483A (en) A kind of determination method and device of recommendation information
CN108509465A (en) A kind of the recommendation method, apparatus and server of video data
US20100030768A1 (en) Classifying documents using implicit feedback and query patterns
CN112749284B (en) Knowledge graph construction method, device, equipment and storage medium
CN110232126B (en) Hot spot mining method, server and computer readable storage medium
CN116911312B (en) Task type dialogue system and implementation method thereof
Hatta et al. Business intelligence system adoption theories in SMES: A literature review
CN111159341A (en) Information recommendation method and device based on user investment and financing preference
Ransom et al. Facets of user‐assigned tags and their effectiveness in image retrieval
CN114330329A (en) Service content searching method and device, electronic equipment and storage medium
CN113723853A (en) Method and device for processing post competence demand data
CN116049379A (en) Knowledge recommendation method, knowledge recommendation device, electronic equipment and storage medium
Cummaudo et al. What should I document? A preliminary systematic mapping study into API documentation knowledge
CN112231554A (en) Search recommendation word generation method and device, storage medium and computer equipment
CN113987204A (en) Method and system for constructing field encyclopedia map
CN113742496B (en) Electric power knowledge learning system and method based on heterogeneous resource fusion
KR100557874B1 (en) Method of scientific information analysis and media that can record computer program thereof
CN113407678B (en) Knowledge graph construction method, device and equipment
CN114417010A (en) Knowledge graph construction method and device for real-time workflow and storage medium
Xu et al. Identification and prediction of interdisciplinary research topics: A study based on the concept lattice theory
CN108460475A (en) Poor student's prediction technique and device based on network playing by students behavior
CN114428881A (en) Method and device for pushing media asset video, storage medium and electronic equipment
Fugini et al. A text analytics architecture for smart companies

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination