CN110458666A - A kind of individualized knowledge library recombination method based on domain knowledge - Google Patents
A kind of individualized knowledge library recombination method based on domain knowledge Download PDFInfo
- Publication number
- CN110458666A CN110458666A CN201910732008.XA CN201910732008A CN110458666A CN 110458666 A CN110458666 A CN 110458666A CN 201910732008 A CN201910732008 A CN 201910732008A CN 110458666 A CN110458666 A CN 110458666A
- Authority
- CN
- China
- Prior art keywords
- knowledge
- domain
- domain knowledge
- individualized
- knowledge base
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 32
- 230000006798 recombination Effects 0.000 title claims abstract description 25
- 238000005215 recombination Methods 0.000 title claims abstract description 25
- 239000000463 material Substances 0.000 claims abstract description 50
- 238000012216 screening Methods 0.000 claims abstract description 28
- 238000000605 extraction Methods 0.000 claims abstract description 12
- 238000012217 deletion Methods 0.000 claims abstract description 3
- 230000037430 deletion Effects 0.000 claims abstract description 3
- 239000000284 extract Substances 0.000 claims description 3
- 230000008520 organization Effects 0.000 claims description 3
- 230000008569 process Effects 0.000 abstract description 6
- 230000000694 effects Effects 0.000 abstract description 2
- 238000005516 engineering process Methods 0.000 description 6
- 238000013467 fragmentation Methods 0.000 description 5
- 238000006062 fragmentation reaction Methods 0.000 description 5
- 230000008521 reorganization Effects 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 230000008859 change Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 241000208340 Araliaceae Species 0.000 description 1
- 235000005035 Panax pseudoginseng ssp. pseudoginseng Nutrition 0.000 description 1
- 235000003140 Panax quinquefolius Nutrition 0.000 description 1
- 206010034719 Personality change Diseases 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 238000012550 audit Methods 0.000 description 1
- 230000034303 cell budding Effects 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 235000008434 ginseng Nutrition 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000007639 printing Methods 0.000 description 1
- 238000012549 training Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/06—Buying, selling or leasing transactions
- G06Q30/0601—Electronic shopping [e-shopping]
- G06Q30/0621—Item configuration or customization
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/18—Legal services
- G06Q50/184—Intellectual property management
Landscapes
- Business, Economics & Management (AREA)
- Engineering & Computer Science (AREA)
- Marketing (AREA)
- Theoretical Computer Science (AREA)
- Economics (AREA)
- Finance (AREA)
- Strategic Management (AREA)
- Physics & Mathematics (AREA)
- General Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Accounting & Taxation (AREA)
- Tourism & Hospitality (AREA)
- Technology Law (AREA)
- Operations Research (AREA)
- Entrepreneurship & Innovation (AREA)
- Development Economics (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Human Resources & Organizations (AREA)
- Primary Health Care (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a kind of individualized knowledge library recombination method based on domain knowledge, comprising: interested domain knowledge is chosen again according to user demand;Necessary screening conditions setting is carried out to whether the corresponding material of knowledge is selected into knowledge base;In conjunction with corresponding material screening conditions, corresponding story extraction instruction is constructed, that is, constitutes restructuring plan;The knowledge distributing being related to according to instruction is extracted, is executed parallel, is generated knowledge base, that is, is performed restructuring plan;The uninterested material for extracting and having been extracted into the interested material of knowledge base or deletion into knowledge base is added.The method is suitable for the recombination of professional domain knowledge base, is suitble to the service application of society, commercial press, the effect of domain knowledge is emphasized in regrouping process, domain knowledge is usually field vocabulary or domain body.Meanwhile the personalization of knowledge base requirement description is realized using the reconstruct of domain knowledge, i.e., user can dynamically choose the knowledge collection of oneself care, to provide better knowledge services for user.
Description
Technical field
The present invention relates to digital publishing technical field more particularly to a kind of individualized knowledge library recombinations based on domain knowledge
Method.
Background technique
Traditional publication is the paper publication by conventional printing techniques, and material media is paper.Traditional publication must incite somebody to action
On material media, product has specific physical form and preservability in kind for content materialization.Digital publishing is to utilize
The publishing way that information technology launches publication in the form of network, CD etc..
With the development of the society, demand of the reader to knowledge acquisition is also being continuously improved, especially in commercial press field,
The demand of reader is gradually to individualized development.Supplier of the journalism unit as content, it is also desirable to be capable of providing individual character
Change product.But since the period of traditional publication is long, it is difficult meet the needs of all kinds of readers.Meanwhile journalism unit
Also the transition to knowledge services is published from content in experience, there is the business demand of urgent building and issuing personalized knowledge base.
The material resource that content dynamic reorganization is processed based on fragmentation is closed according between various media resources and content element
The characteristics of connection property and different majors field, the generation of personalized product content is rapidly completed, so as to shorten frequence of issue, is
Reader provides personalized service.
Based on content dynamic reorganization, required domain knowledge can be reconstructed (typically, by field vocabulary or field sheet
Body description), and required screening conditions are configured, it is dynamically generated personalized knowledge base by generating and executing restructuring plan,
To provide quick knowledge services for user.
DocBook provides the entire system for writing structured document, it defines a series of document using SGML/XML
Element, and can use tool and original document source file is converted into various document formats.Briefly, DocBook is exactly one
The specification that group parses XML document.The XML file finished writing for one according to DocBook format, uses DocBook
Some related tools, so that it may generate various outputs according to the requirement of user.As its name suggests, DocBook is special
To write designed by the document of books or similar books.Currently, domestic publishing house generallys use the standard pair based on DocBook
Publication resource carries out fragmentation processing.
Society, commercial press usually has the domain knowledge and material of accumulation for many years in professional domain, and typical domain knowledge is adopted
It is described with field vocabulary or domain body, and material is usually directed to the books chapters and sections of publication, paper and other multimedias
Material generallys use XML and carries out fragmentation processing.The high publishing house of level of digital carries out material also through domain knowledge
Index, faster can position related materials by domain knowledge.
Recombining contents technology towards publishing area, directly related technical standard there is not yet both at home and abroad, the hair of technology
Exhibition is also in the budding stage.In the world by OASIS (The Organization for the Advancement of
Structured Information Standards) organizational protection open standard -- DITA (Darwin
Information Typing Architecture), there are the theories of the relevant technologies.DITA be it is a set of based on XML towards master
The digital content structuring writing of topic and distribution scheme.
There is also for the content of fragmentation processing, need to carry out the business mould of dynamic reorganization by user individually both at home and abroad
Formula, such as the raw chapters and sections content for allowing user to choose fragmentation on its site of training, are voluntarily packaged payment purchase as required.But
It is relevant technology and application there is also many problems, such as business model application surface are narrow, it is manual that recombinant product is configured to user
Operation, automatization level are low.
The technology in individualized knowledge library is automatically generated using dynamic reorganization and application is more in the exploratory stage, the prior art
Automatization level is low, it is difficult to realize real individual demand.
Summary of the invention
In order to solve the above technical problems, the object of the present invention is to provide a kind of, the individualized knowledge library based on domain knowledge is heavy
Group method.
The purpose of the present invention is realized by technical solution below:
A kind of individualized knowledge library recombination method based on domain knowledge, comprising:
A chooses interested domain knowledge according to user demand again;
B carries out necessary screening conditions setting to whether the corresponding material of knowledge is selected into knowledge base;
C combines corresponding material screening conditions, constructs corresponding story extraction instruction, that is, constitutes restructuring plan;
The knowledge distributing that D is related to according to instruction is extracted, is executed parallel, is generated knowledge base, that is, is performed restructuring plan;
The uninterested element for not extracting and having been extracted into the interested material of knowledge base or deletion into knowledge base is added in E
Material.
Compared with prior art, one or more embodiments of the invention can have following advantage:
The regrouping knowledge base method is particularly suitable for the recombination of professional domain knowledge base, and the business of society, commercial press is suitble to answer
With emphasizing the effect of domain knowledge in regrouping process, domain knowledge is usually field vocabulary or domain body.Meanwhile benefit
The personalization of knowledge base requirement description is realized with the reconstruct (merging and choose domain knowledge) of domain knowledge, i.e. user can dynamic
The knowledge collection of oneself care is chosen on ground, to provide better knowledge services for user.
The regrouping knowledge base method realizes the recombination of automation, based on the domain knowledge that recombination knowledge base is related to, and matches
The corresponding screening conditions for setting material in knowledge base, can be generated corresponding restructuring plan.Restructuring plan be for describe recombination with
The instruction for generating individualized knowledge library (has to belong to and divides relationship, In in view of the hierarchical organization structure of domain knowledge in the vocabulary of field
With subclass relation in ontology) and domain knowledge by reconstruct may be from different vocabulary or ontology, it is corresponding to lead
Domain knowledge may be considered the structure of a forest, be based on the structure, and using the screening conditions of configuration, corresponding neck can be generated
The extraction plan of domain knowledge item, it is whole to constitute restructuring plan.According to restructuring plan, can to the domain knowledge being related to according to
Screening conditions carry out contents extraction, constitute final knowledge base product.General domain knowledge and its material have certain independence
Property, it can execute parallel.Specific contents extraction can use the metadata mark of text searching method or content material itself
Fuse breath.
The content dynamic reconfiguration method introduces the mechanism of content correction, this method automated execution regrouping knowledge base plan
To generate recombinant product, but its result may there is any discrepancy with the actual demand of user.For example, the desired material of user does not mention
It gets in knowledge base or the undesired story extraction of user has arrived in material database.User executes building in system automation
In knowledge base, unwanted material can be deleted, or add other available materials not extracted in knowledge base manually,
To provide preferably Talk about Individualized Knowledge Service for user.
Detailed description of the invention
Fig. 1 is the individualized knowledge library recombination method flow chart based on domain knowledge;
Fig. 2 is reconstruction field knowledge schematic diagram;
Fig. 3 is configuration screening conditions schematic diagram;
Fig. 4 is to generate restructuring plan structure chart;
Fig. 5 is to execute restructuring plan structure chart;
Fig. 6 is content material adjustment schematic diagram.
Specific embodiment
To make the object, technical solutions and advantages of the present invention clearer, below in conjunction with examples and drawings to this hair
It is bright to be described in further detail.
As shown in Figure 1, being the individualized knowledge library recombination method process based on domain knowledge, comprising the following steps:
Step 10 chooses interested domain knowledge according to user demand again;
Step 20 carries out necessary screening conditions setting to whether the corresponding material of knowledge is selected into knowledge base;
Step 30 combines corresponding material screening conditions, constructs corresponding story extraction instruction, that is, constitutes restructuring plan;
The knowledge distributing that step 40 is related to according to instruction is extracted, is executed parallel, is generated knowledge base, that is, is performed recombination
Plan;
Step 50, which is added not extracting the interested material into knowledge base or delete to have extracted, does not feel emerging into knowledge base
The material of interest.
Above-mentioned steps 10 are reconstruct domain knowledge;Domain knowledge is usually the professional domain knowledge body of society, commercial press building
System describes usually in the form of field vocabulary or domain body.Domain knowledge includes the pass between concept and concept in field
It is that the relationship in typical field vocabulary includes use, generation, category, divides, ginseng, and domain body also typically includes built-in generality and closes
It is (such as antisense) and a large amount of customized relationship.Domain knowledge usually carries out tissue (such as vocabulary by the taxonomical hierarchy of concept
In category divide relationship, the SubClassof relationship in domain body), it can be understood as the concept system of multiple trees, i.e.,
Constitute the structure of forest.Domain knowledge is constructed generally directed to specific professional domain, the range (or being interpreted as granularity) being related to
It is changeable, depending on specific application demand.In addition, domain knowledge is since its is professional, often while automatic building,
The participation of a large amount of domain expert is needed, artificial editor and audit are carried out.
Under the scene of individualized knowledge library recombination, domain knowledge needs to carry out certain reconstruct, to meet of user
Property demand.In view of the professional and authoritative of domain knowledge, change the tissue of domain knowledge with throwing the reins to it is difficult to ensure that neck
The consistency of domain knowledge does not generate contradiction in logic, therefore, the domain knowledge reconstruct that this patent is related to is pertained only to for existing
There are the merging and screening of domain knowledge.The merging of domain knowledge, which refers to, (may be from different fields in existing domain knowledge
Knowledge hierarchy) concept extract in the demand in current knowledge library;The screening of domain knowledge refers to the domain knowledge concept of selection
The related notion being related to is deleted, and the concept that user is concerned about conscientiously is only chosen.Use can be generated in above domain knowledge reconstruct
Family describes the personalization of concept needed for knowledge base.Typically, since concept may relate to different knowledge hierarchy, and pass through
Screening, can constitute multiple tree-shaped hierarchical structures, i.e. forest.
Reconstruction field knowledge interacts as shown in Fig. 2, listing the available multiple fields knowledge hierarchy of user, and user is by dragging
It drags corresponding concept and chooses the knowledge point that knowledge base is related to, optionally delete the sub- concept of conceptual dependency.
Above-mentioned steps 20 are configuration screening conditions (as shown in Figure 3), under the scene of individualized knowledge library recombination, configuration sieve
The material for selecting condition to refer to that regulation knowledge base includes needs the condition met, is generally screened by the associated metadata of material.
For commercial press field, screening conditions are typically comprised: author, copyright information, publisher, time range,
Languages, user oriented positioning etc..Different applications may relate to different metadata screening conditions, can be dynamic by configuration file
The screening item that state loading system is supported.
Above-mentioned steps 30 are to generate restructuring plan (as shown in Figure 4), and restructuring plan is the inside table of regrouping knowledge base instruction
Show, for describing how automatically to extract the associated material of domain knowledge.Since domain knowledge is usually the number of a forest
According to structure, while knowledge base is configured with screening conditions, therefore corresponding executive plan is usually the data structure of a forest,
In each tree construction represent the domain knowledge concept of user's needs, and be labelled on tree construction the screening conditions of configuration.It should
The screening conditions that the domain knowledge and step 20 that process can be reconstructed easily by step 10 configure are constructed.
Above-mentioned steps 40 are to execute restructuring plan (as shown in Figure 5), and executing recombination strategy is the recombination configured to step 30
Strategy explains execution, generates the process of recombinant product.Typical implementation procedure is that traversal restructuring plan each of is related to knowing
Know item, according to screening conditions, the relevant material of the knowledge item is extracted from material database.The foundation of extraction is usually the index of material
, if material itself lacks index item, text-type material can be extracted by way of full-text search.
The execution of reassembly algorithm is usually directed to the story extraction of different domain knowledge items, therefore can use parallel execution
Method with raising efficiency, such as in the case where single machine in the way of multithreading, or utilize under distributed environment
The thought of Map-Reduce.
Above-mentioned steps 50 are the adjustment (as shown in Figure 6) of content material, and the knowledge base constructed automatically based on above step is very
Difficulty accomplishes to comply fully with the demand of user, therefore is introduced into the mechanism of content material in adjustment knowledge base.The step is man-machine interactively
Adjustment process, user can delete the uninterested material extracted, can also be added and be felt based on the demand of oneself
The material of interest.
Although disclosed herein embodiment it is as above, the content is only to facilitate understanding the present invention and adopting
Embodiment is not intended to limit the invention.Any those skilled in the art to which this invention pertains are not departing from this
Under the premise of the disclosed spirit and scope of invention, any modification and change can be made in the implementing form and in details,
But scope of patent protection of the invention, still should be subject to the scope of the claims as defined in the appended claims.
Claims (6)
1. a kind of individualized knowledge library recombination method based on domain knowledge, which is characterized in that the described method includes:
A chooses interested domain knowledge according to user demand again;
B carries out necessary screening conditions setting to whether the corresponding material of knowledge is selected into knowledge base;
C combines corresponding material screening conditions, constructs corresponding story extraction instruction, that is, constitutes restructuring plan;
The knowledge distributing that D is related to according to instruction is extracted, is executed parallel, is generated knowledge base, that is, is performed restructuring plan;
The uninterested material for not extracting and having been extracted into the interested material of knowledge base or deletion into knowledge base is added in E.
2. the individualized knowledge library recombination method based on domain knowledge as described in claim 1, which is characterized in that the step
A are as follows: choose user interested specific knowledge set, wherein domain knowledge takes the form of field vocabulary or field sheet
Body;Again choosing domain knowledge is the knowledge hierarchy in building to user individual.
3. the individualized knowledge library recombination method based on domain knowledge as described in claim 1, which is characterized in that the material
Screening conditions refer to it is relevant to business, including copyright, author, the time, the degree of correlation and extract quantity;Screening conditions itself are keys
Value pair, wherein key corresponds to the condition of screening, and value, which corresponds to, acts on constraint on corresponding conditions, typical value including monodrome,
Multivalue, value range.
4. the individualized knowledge library recombination method based on domain knowledge as described in claim 1, which is characterized in that the step
C be specially in view of the hierarchical organization structure of domain knowledge and the domain knowledge by reconstruct may be from different vocabulary or
Ontology, corresponding domain knowledge may be considered the structure of a forest, be based on the structure, and utilize the screening conditions of configuration,
The extraction plan of corresponding domain knowledge item is generated, it is whole to constitute restructuring plan.
5. the individualized knowledge library recombination method based on domain knowledge as described in claim 1, which is characterized in that the recombination
Domain knowledge involved in plan is a forest structure, and promotes recombination efficiency using the method executed parallel;Restructuring plan
It executes and extracts corresponding knowledge base material, mode or base of the extraction of material according to full-text search according to specific domain knowledge
In the metadata indexing of material.
6. the individualized knowledge library recombination method based on domain knowledge as described in claim 1, which is characterized in that user is being
It unites in the knowledge base of automated execution building, unwanted material can be deleted, or add other manually and available do not mention
Get the material in knowledge base.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910732008.XA CN110458666A (en) | 2019-08-09 | 2019-08-09 | A kind of individualized knowledge library recombination method based on domain knowledge |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910732008.XA CN110458666A (en) | 2019-08-09 | 2019-08-09 | A kind of individualized knowledge library recombination method based on domain knowledge |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110458666A true CN110458666A (en) | 2019-11-15 |
Family
ID=68485538
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910732008.XA Pending CN110458666A (en) | 2019-08-09 | 2019-08-09 | A kind of individualized knowledge library recombination method based on domain knowledge |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110458666A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102890689A (en) * | 2011-07-22 | 2013-01-23 | 北京百度网讯科技有限公司 | Method and system for building user interest model |
US20130282390A1 (en) * | 2012-04-20 | 2013-10-24 | International Business Machines Corporation | Combining knowledge and data driven insights for identifying risk factors in healthcare |
CN103927339A (en) * | 2014-03-27 | 2014-07-16 | 北大方正集团有限公司 | System and method for reorganizing knowledge |
-
2019
- 2019-08-09 CN CN201910732008.XA patent/CN110458666A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102890689A (en) * | 2011-07-22 | 2013-01-23 | 北京百度网讯科技有限公司 | Method and system for building user interest model |
US20130282390A1 (en) * | 2012-04-20 | 2013-10-24 | International Business Machines Corporation | Combining knowledge and data driven insights for identifying risk factors in healthcare |
CN103927339A (en) * | 2014-03-27 | 2014-07-16 | 北大方正集团有限公司 | System and method for reorganizing knowledge |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8135755B2 (en) | Templates in a schema editor | |
CN1833240B (en) | Method and apparatus for maintaining relationships between parts in a package | |
Baumgartner et al. | Visual web information extraction with lixto | |
CN100565493C (en) | Modular document format | |
Alatrish | Comparison some of ontology | |
CN110333856B (en) | System and method for generating service programmable online template | |
CN106815184A (en) | The system and method for document is automatically generated based on FOG data | |
CN101488086A (en) | Software generation method and apparatus based on field model | |
AU2012327168B2 (en) | Amethod and structure for managing multiple electronic forms and their records using a static database | |
CN104267966B (en) | The generation method and device of the program code of software | |
CN107145480A (en) | A kind of method that XBRL Report workouts are carried out based on Word | |
CN102810114A (en) | Personal computer resource management system based on body | |
CN106202292A (en) | A kind of standard information based on structural data model analyzes method | |
CN109614671A (en) | A kind of three-dimensional MBD process model tissue and expression based on view | |
Žumer et al. | IFLA LRM-finally here | |
Gómez et al. | A framework for variable content document generation with multiple actors | |
CN104199882B (en) | A kind of acquisition methods of structural knowledge and its body based on the customization of intelligent masterplate | |
US20090193053A1 (en) | Information management system | |
CN110458666A (en) | A kind of individualized knowledge library recombination method based on domain knowledge | |
CN110457664A (en) | A kind of content dynamic reconfiguration method based on template | |
US20210365408A1 (en) | Methods and systems for depiction of project data via transmogrification using fractal-based structures | |
CN103544022A (en) | HES system widget management method | |
CN101268438A (en) | Data processing apparatus | |
CN110472217A (en) | A kind of content dynamic reconfiguration method based on recombination strategy | |
CN110472218A (en) | A kind of parallel execution method towards recombination strategy |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20191115 |
|
RJ01 | Rejection of invention patent application after publication |