CN110458666A - A kind of individualized knowledge library recombination method based on domain knowledge - Google Patents

A kind of individualized knowledge library recombination method based on domain knowledge Download PDF

Info

Publication number
CN110458666A
CN110458666A CN201910732008.XA CN201910732008A CN110458666A CN 110458666 A CN110458666 A CN 110458666A CN 201910732008 A CN201910732008 A CN 201910732008A CN 110458666 A CN110458666 A CN 110458666A
Authority
CN
China
Prior art keywords
knowledge
domain
domain knowledge
individualized
knowledge base
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910732008.XA
Other languages
Chinese (zh)
Inventor
陈琳
陈海涛
刘振东
李海卜
吴竟飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
TONGFANG KNOWLEDGE NETWORK (BEIJING) TECHNOLOGY Co Ltd
Original Assignee
TONGFANG KNOWLEDGE NETWORK (BEIJING) TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by TONGFANG KNOWLEDGE NETWORK (BEIJING) TECHNOLOGY Co Ltd filed Critical TONGFANG KNOWLEDGE NETWORK (BEIJING) TECHNOLOGY Co Ltd
Priority to CN201910732008.XA priority Critical patent/CN110458666A/en
Publication of CN110458666A publication Critical patent/CN110458666A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0621Item configuration or customization
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/18Legal services
    • G06Q50/184Intellectual property management

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Marketing (AREA)
  • Theoretical Computer Science (AREA)
  • Economics (AREA)
  • Finance (AREA)
  • Strategic Management (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Accounting & Taxation (AREA)
  • Tourism & Hospitality (AREA)
  • Technology Law (AREA)
  • Operations Research (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Development Economics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Primary Health Care (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a kind of individualized knowledge library recombination method based on domain knowledge, comprising: interested domain knowledge is chosen again according to user demand;Necessary screening conditions setting is carried out to whether the corresponding material of knowledge is selected into knowledge base;In conjunction with corresponding material screening conditions, corresponding story extraction instruction is constructed, that is, constitutes restructuring plan;The knowledge distributing being related to according to instruction is extracted, is executed parallel, is generated knowledge base, that is, is performed restructuring plan;The uninterested material for extracting and having been extracted into the interested material of knowledge base or deletion into knowledge base is added.The method is suitable for the recombination of professional domain knowledge base, is suitble to the service application of society, commercial press, the effect of domain knowledge is emphasized in regrouping process, domain knowledge is usually field vocabulary or domain body.Meanwhile the personalization of knowledge base requirement description is realized using the reconstruct of domain knowledge, i.e., user can dynamically choose the knowledge collection of oneself care, to provide better knowledge services for user.

Description

A kind of individualized knowledge library recombination method based on domain knowledge
Technical field
The present invention relates to digital publishing technical field more particularly to a kind of individualized knowledge library recombinations based on domain knowledge Method.
Background technique
Traditional publication is the paper publication by conventional printing techniques, and material media is paper.Traditional publication must incite somebody to action On material media, product has specific physical form and preservability in kind for content materialization.Digital publishing is to utilize The publishing way that information technology launches publication in the form of network, CD etc..
With the development of the society, demand of the reader to knowledge acquisition is also being continuously improved, especially in commercial press field, The demand of reader is gradually to individualized development.Supplier of the journalism unit as content, it is also desirable to be capable of providing individual character Change product.But since the period of traditional publication is long, it is difficult meet the needs of all kinds of readers.Meanwhile journalism unit Also the transition to knowledge services is published from content in experience, there is the business demand of urgent building and issuing personalized knowledge base.
The material resource that content dynamic reorganization is processed based on fragmentation is closed according between various media resources and content element The characteristics of connection property and different majors field, the generation of personalized product content is rapidly completed, so as to shorten frequence of issue, is Reader provides personalized service.
Based on content dynamic reorganization, required domain knowledge can be reconstructed (typically, by field vocabulary or field sheet Body description), and required screening conditions are configured, it is dynamically generated personalized knowledge base by generating and executing restructuring plan, To provide quick knowledge services for user.
DocBook provides the entire system for writing structured document, it defines a series of document using SGML/XML Element, and can use tool and original document source file is converted into various document formats.Briefly, DocBook is exactly one The specification that group parses XML document.The XML file finished writing for one according to DocBook format, uses DocBook Some related tools, so that it may generate various outputs according to the requirement of user.As its name suggests, DocBook is special To write designed by the document of books or similar books.Currently, domestic publishing house generallys use the standard pair based on DocBook Publication resource carries out fragmentation processing.
Society, commercial press usually has the domain knowledge and material of accumulation for many years in professional domain, and typical domain knowledge is adopted It is described with field vocabulary or domain body, and material is usually directed to the books chapters and sections of publication, paper and other multimedias Material generallys use XML and carries out fragmentation processing.The high publishing house of level of digital carries out material also through domain knowledge Index, faster can position related materials by domain knowledge.
Recombining contents technology towards publishing area, directly related technical standard there is not yet both at home and abroad, the hair of technology Exhibition is also in the budding stage.In the world by OASIS (The Organization for the Advancement of Structured Information Standards) organizational protection open standard -- DITA (Darwin Information Typing Architecture), there are the theories of the relevant technologies.DITA be it is a set of based on XML towards master The digital content structuring writing of topic and distribution scheme.
There is also for the content of fragmentation processing, need to carry out the business mould of dynamic reorganization by user individually both at home and abroad Formula, such as the raw chapters and sections content for allowing user to choose fragmentation on its site of training, are voluntarily packaged payment purchase as required.But It is relevant technology and application there is also many problems, such as business model application surface are narrow, it is manual that recombinant product is configured to user Operation, automatization level are low.
The technology in individualized knowledge library is automatically generated using dynamic reorganization and application is more in the exploratory stage, the prior art Automatization level is low, it is difficult to realize real individual demand.
Summary of the invention
In order to solve the above technical problems, the object of the present invention is to provide a kind of, the individualized knowledge library based on domain knowledge is heavy Group method.
The purpose of the present invention is realized by technical solution below:
A kind of individualized knowledge library recombination method based on domain knowledge, comprising:
A chooses interested domain knowledge according to user demand again;
B carries out necessary screening conditions setting to whether the corresponding material of knowledge is selected into knowledge base;
C combines corresponding material screening conditions, constructs corresponding story extraction instruction, that is, constitutes restructuring plan;
The knowledge distributing that D is related to according to instruction is extracted, is executed parallel, is generated knowledge base, that is, is performed restructuring plan;
The uninterested element for not extracting and having been extracted into the interested material of knowledge base or deletion into knowledge base is added in E Material.
Compared with prior art, one or more embodiments of the invention can have following advantage:
The regrouping knowledge base method is particularly suitable for the recombination of professional domain knowledge base, and the business of society, commercial press is suitble to answer With emphasizing the effect of domain knowledge in regrouping process, domain knowledge is usually field vocabulary or domain body.Meanwhile benefit The personalization of knowledge base requirement description is realized with the reconstruct (merging and choose domain knowledge) of domain knowledge, i.e. user can dynamic The knowledge collection of oneself care is chosen on ground, to provide better knowledge services for user.
The regrouping knowledge base method realizes the recombination of automation, based on the domain knowledge that recombination knowledge base is related to, and matches The corresponding screening conditions for setting material in knowledge base, can be generated corresponding restructuring plan.Restructuring plan be for describe recombination with The instruction for generating individualized knowledge library (has to belong to and divides relationship, In in view of the hierarchical organization structure of domain knowledge in the vocabulary of field With subclass relation in ontology) and domain knowledge by reconstruct may be from different vocabulary or ontology, it is corresponding to lead Domain knowledge may be considered the structure of a forest, be based on the structure, and using the screening conditions of configuration, corresponding neck can be generated The extraction plan of domain knowledge item, it is whole to constitute restructuring plan.According to restructuring plan, can to the domain knowledge being related to according to Screening conditions carry out contents extraction, constitute final knowledge base product.General domain knowledge and its material have certain independence Property, it can execute parallel.Specific contents extraction can use the metadata mark of text searching method or content material itself Fuse breath.
The content dynamic reconfiguration method introduces the mechanism of content correction, this method automated execution regrouping knowledge base plan To generate recombinant product, but its result may there is any discrepancy with the actual demand of user.For example, the desired material of user does not mention It gets in knowledge base or the undesired story extraction of user has arrived in material database.User executes building in system automation In knowledge base, unwanted material can be deleted, or add other available materials not extracted in knowledge base manually, To provide preferably Talk about Individualized Knowledge Service for user.
Detailed description of the invention
Fig. 1 is the individualized knowledge library recombination method flow chart based on domain knowledge;
Fig. 2 is reconstruction field knowledge schematic diagram;
Fig. 3 is configuration screening conditions schematic diagram;
Fig. 4 is to generate restructuring plan structure chart;
Fig. 5 is to execute restructuring plan structure chart;
Fig. 6 is content material adjustment schematic diagram.
Specific embodiment
To make the object, technical solutions and advantages of the present invention clearer, below in conjunction with examples and drawings to this hair It is bright to be described in further detail.
As shown in Figure 1, being the individualized knowledge library recombination method process based on domain knowledge, comprising the following steps:
Step 10 chooses interested domain knowledge according to user demand again;
Step 20 carries out necessary screening conditions setting to whether the corresponding material of knowledge is selected into knowledge base;
Step 30 combines corresponding material screening conditions, constructs corresponding story extraction instruction, that is, constitutes restructuring plan;
The knowledge distributing that step 40 is related to according to instruction is extracted, is executed parallel, is generated knowledge base, that is, is performed recombination Plan;
Step 50, which is added not extracting the interested material into knowledge base or delete to have extracted, does not feel emerging into knowledge base The material of interest.
Above-mentioned steps 10 are reconstruct domain knowledge;Domain knowledge is usually the professional domain knowledge body of society, commercial press building System describes usually in the form of field vocabulary or domain body.Domain knowledge includes the pass between concept and concept in field It is that the relationship in typical field vocabulary includes use, generation, category, divides, ginseng, and domain body also typically includes built-in generality and closes It is (such as antisense) and a large amount of customized relationship.Domain knowledge usually carries out tissue (such as vocabulary by the taxonomical hierarchy of concept In category divide relationship, the SubClassof relationship in domain body), it can be understood as the concept system of multiple trees, i.e., Constitute the structure of forest.Domain knowledge is constructed generally directed to specific professional domain, the range (or being interpreted as granularity) being related to It is changeable, depending on specific application demand.In addition, domain knowledge is since its is professional, often while automatic building, The participation of a large amount of domain expert is needed, artificial editor and audit are carried out.
Under the scene of individualized knowledge library recombination, domain knowledge needs to carry out certain reconstruct, to meet of user Property demand.In view of the professional and authoritative of domain knowledge, change the tissue of domain knowledge with throwing the reins to it is difficult to ensure that neck The consistency of domain knowledge does not generate contradiction in logic, therefore, the domain knowledge reconstruct that this patent is related to is pertained only to for existing There are the merging and screening of domain knowledge.The merging of domain knowledge, which refers to, (may be from different fields in existing domain knowledge Knowledge hierarchy) concept extract in the demand in current knowledge library;The screening of domain knowledge refers to the domain knowledge concept of selection The related notion being related to is deleted, and the concept that user is concerned about conscientiously is only chosen.Use can be generated in above domain knowledge reconstruct Family describes the personalization of concept needed for knowledge base.Typically, since concept may relate to different knowledge hierarchy, and pass through Screening, can constitute multiple tree-shaped hierarchical structures, i.e. forest.
Reconstruction field knowledge interacts as shown in Fig. 2, listing the available multiple fields knowledge hierarchy of user, and user is by dragging It drags corresponding concept and chooses the knowledge point that knowledge base is related to, optionally delete the sub- concept of conceptual dependency.
Above-mentioned steps 20 are configuration screening conditions (as shown in Figure 3), under the scene of individualized knowledge library recombination, configuration sieve The material for selecting condition to refer to that regulation knowledge base includes needs the condition met, is generally screened by the associated metadata of material.
For commercial press field, screening conditions are typically comprised: author, copyright information, publisher, time range, Languages, user oriented positioning etc..Different applications may relate to different metadata screening conditions, can be dynamic by configuration file The screening item that state loading system is supported.
Above-mentioned steps 30 are to generate restructuring plan (as shown in Figure 4), and restructuring plan is the inside table of regrouping knowledge base instruction Show, for describing how automatically to extract the associated material of domain knowledge.Since domain knowledge is usually the number of a forest According to structure, while knowledge base is configured with screening conditions, therefore corresponding executive plan is usually the data structure of a forest, In each tree construction represent the domain knowledge concept of user's needs, and be labelled on tree construction the screening conditions of configuration.It should The screening conditions that the domain knowledge and step 20 that process can be reconstructed easily by step 10 configure are constructed.
Above-mentioned steps 40 are to execute restructuring plan (as shown in Figure 5), and executing recombination strategy is the recombination configured to step 30 Strategy explains execution, generates the process of recombinant product.Typical implementation procedure is that traversal restructuring plan each of is related to knowing Know item, according to screening conditions, the relevant material of the knowledge item is extracted from material database.The foundation of extraction is usually the index of material , if material itself lacks index item, text-type material can be extracted by way of full-text search.
The execution of reassembly algorithm is usually directed to the story extraction of different domain knowledge items, therefore can use parallel execution Method with raising efficiency, such as in the case where single machine in the way of multithreading, or utilize under distributed environment The thought of Map-Reduce.
Above-mentioned steps 50 are the adjustment (as shown in Figure 6) of content material, and the knowledge base constructed automatically based on above step is very Difficulty accomplishes to comply fully with the demand of user, therefore is introduced into the mechanism of content material in adjustment knowledge base.The step is man-machine interactively Adjustment process, user can delete the uninterested material extracted, can also be added and be felt based on the demand of oneself The material of interest.
Although disclosed herein embodiment it is as above, the content is only to facilitate understanding the present invention and adopting Embodiment is not intended to limit the invention.Any those skilled in the art to which this invention pertains are not departing from this Under the premise of the disclosed spirit and scope of invention, any modification and change can be made in the implementing form and in details, But scope of patent protection of the invention, still should be subject to the scope of the claims as defined in the appended claims.

Claims (6)

1. a kind of individualized knowledge library recombination method based on domain knowledge, which is characterized in that the described method includes:
A chooses interested domain knowledge according to user demand again;
B carries out necessary screening conditions setting to whether the corresponding material of knowledge is selected into knowledge base;
C combines corresponding material screening conditions, constructs corresponding story extraction instruction, that is, constitutes restructuring plan;
The knowledge distributing that D is related to according to instruction is extracted, is executed parallel, is generated knowledge base, that is, is performed restructuring plan;
The uninterested material for not extracting and having been extracted into the interested material of knowledge base or deletion into knowledge base is added in E.
2. the individualized knowledge library recombination method based on domain knowledge as described in claim 1, which is characterized in that the step A are as follows: choose user interested specific knowledge set, wherein domain knowledge takes the form of field vocabulary or field sheet Body;Again choosing domain knowledge is the knowledge hierarchy in building to user individual.
3. the individualized knowledge library recombination method based on domain knowledge as described in claim 1, which is characterized in that the material Screening conditions refer to it is relevant to business, including copyright, author, the time, the degree of correlation and extract quantity;Screening conditions itself are keys Value pair, wherein key corresponds to the condition of screening, and value, which corresponds to, acts on constraint on corresponding conditions, typical value including monodrome, Multivalue, value range.
4. the individualized knowledge library recombination method based on domain knowledge as described in claim 1, which is characterized in that the step C be specially in view of the hierarchical organization structure of domain knowledge and the domain knowledge by reconstruct may be from different vocabulary or Ontology, corresponding domain knowledge may be considered the structure of a forest, be based on the structure, and utilize the screening conditions of configuration, The extraction plan of corresponding domain knowledge item is generated, it is whole to constitute restructuring plan.
5. the individualized knowledge library recombination method based on domain knowledge as described in claim 1, which is characterized in that the recombination Domain knowledge involved in plan is a forest structure, and promotes recombination efficiency using the method executed parallel;Restructuring plan It executes and extracts corresponding knowledge base material, mode or base of the extraction of material according to full-text search according to specific domain knowledge In the metadata indexing of material.
6. the individualized knowledge library recombination method based on domain knowledge as described in claim 1, which is characterized in that user is being It unites in the knowledge base of automated execution building, unwanted material can be deleted, or add other manually and available do not mention Get the material in knowledge base.
CN201910732008.XA 2019-08-09 2019-08-09 A kind of individualized knowledge library recombination method based on domain knowledge Pending CN110458666A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910732008.XA CN110458666A (en) 2019-08-09 2019-08-09 A kind of individualized knowledge library recombination method based on domain knowledge

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910732008.XA CN110458666A (en) 2019-08-09 2019-08-09 A kind of individualized knowledge library recombination method based on domain knowledge

Publications (1)

Publication Number Publication Date
CN110458666A true CN110458666A (en) 2019-11-15

Family

ID=68485538

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910732008.XA Pending CN110458666A (en) 2019-08-09 2019-08-09 A kind of individualized knowledge library recombination method based on domain knowledge

Country Status (1)

Country Link
CN (1) CN110458666A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102890689A (en) * 2011-07-22 2013-01-23 北京百度网讯科技有限公司 Method and system for building user interest model
US20130282390A1 (en) * 2012-04-20 2013-10-24 International Business Machines Corporation Combining knowledge and data driven insights for identifying risk factors in healthcare
CN103927339A (en) * 2014-03-27 2014-07-16 北大方正集团有限公司 System and method for reorganizing knowledge

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102890689A (en) * 2011-07-22 2013-01-23 北京百度网讯科技有限公司 Method and system for building user interest model
US20130282390A1 (en) * 2012-04-20 2013-10-24 International Business Machines Corporation Combining knowledge and data driven insights for identifying risk factors in healthcare
CN103927339A (en) * 2014-03-27 2014-07-16 北大方正集团有限公司 System and method for reorganizing knowledge

Similar Documents

Publication Publication Date Title
US8135755B2 (en) Templates in a schema editor
CN1833240B (en) Method and apparatus for maintaining relationships between parts in a package
Baumgartner et al. Visual web information extraction with lixto
CN100565493C (en) Modular document format
Alatrish Comparison some of ontology
CN110333856B (en) System and method for generating service programmable online template
CN106815184A (en) The system and method for document is automatically generated based on FOG data
CN101488086A (en) Software generation method and apparatus based on field model
AU2012327168B2 (en) Amethod and structure for managing multiple electronic forms and their records using a static database
CN104267966B (en) The generation method and device of the program code of software
CN107145480A (en) A kind of method that XBRL Report workouts are carried out based on Word
CN102810114A (en) Personal computer resource management system based on body
CN106202292A (en) A kind of standard information based on structural data model analyzes method
CN109614671A (en) A kind of three-dimensional MBD process model tissue and expression based on view
Žumer et al. IFLA LRM-finally here
Gómez et al. A framework for variable content document generation with multiple actors
CN104199882B (en) A kind of acquisition methods of structural knowledge and its body based on the customization of intelligent masterplate
US20090193053A1 (en) Information management system
CN110458666A (en) A kind of individualized knowledge library recombination method based on domain knowledge
CN110457664A (en) A kind of content dynamic reconfiguration method based on template
US20210365408A1 (en) Methods and systems for depiction of project data via transmogrification using fractal-based structures
CN103544022A (en) HES system widget management method
CN101268438A (en) Data processing apparatus
CN110472217A (en) A kind of content dynamic reconfiguration method based on recombination strategy
CN110472218A (en) A kind of parallel execution method towards recombination strategy

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20191115

RJ01 Rejection of invention patent application after publication