CN105718649A - Construction method for paper book capable of hierarchically displaying knowledge network - Google Patents
- Publication number: CN105718649A
- Application number: CN201610035540.2A
- Authority
- CN
- China
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F30/00—Computer-aided design [CAD]
Abstract
The invention provides a method for constructing a paper book that displays a knowledge network hierarchically. The method comprises: constructing knowledge point objects, in which the domain knowledge covered by the paper book is decomposed into a series of knowledge points with no name conflicts; constructing the book's catalog (table of contents), in which all knowledge point objects are organized by causal relationship, using depth-first search and according to the theme of the book, into a knowledge trunk tree that is displayed as a catalog structure; constructing the body, in which, starting from the beginning of the resulting catalog tree, the knowledge point description of each entry is written into the sequentially paginated body of the book, while the body also records the starting page number of each knowledge point and symbols that indicate the relationships between related entries of the catalog tree; and constructing retrieval methods for knowledge points and other content. The method gives the compiled paper book a clearer hierarchy, makes it more convenient to use, and makes it easier to establish the intrinsic relationships among the knowledge inside the book.
Description
Technical field
The invention belongs to the field of book publishing. More specifically, it relates to a method for constructing a paper book that displays a knowledge network hierarchically, which can organize book content with complex network relationships through hierarchical association and fast knowledge point lookup.
Background
Knowledge is people's understanding of the objective world. It generally exists in the form of a knowledge network: knowledge can be abstracted into a number of knowledge points, which are then linked together to form the network.
Paper books are an important medium for inheriting and spreading human knowledge and civilization. In essence, a paper book is a one-dimensional or two-dimensional description of a knowledge network. A paper book is constructed by describing knowledge and ideas in natural language, possibly together with other languages, organizing them under a main catalog according to some relationship, and providing an index and other retrieval aids.
With the development of computer technology, people increasingly use electronic products to record knowledge and ideas, which has given rise to e-books. Broadly speaking, any electronic product reflects knowledge and can be regarded as a record and application of knowledge and ideas. The term e-book is used here in the narrow sense: a product that is read on a computer in order to acquire knowledge, whose main purpose is learning knowledge and understanding ideas.
Because e-books can link knowledge points very effectively, they can help readers build a knowledge network by establishing connections between knowledge points, and thereby master the relevant knowledge efficiently. For this reason, e-books are a development trend in book publishing.
Nevertheless, paper books, with a history of several thousand years, will remain valuable for a long time to come, mainly for the following reasons:
(1) Paper books give a strong sense of presence, intimacy, naturalness and informality. A reader can touch the book while reading, which produces a strong psychological response and makes the book hard to put down.
(2) Paper books convey both the whole and the details well. An e-book struggles to convey the whole: although it has powerful zoom functions, zooming in too far makes it hard to form an overall picture, and zooming out makes the fine details illegible.
(3) Paper books are easy to hand down. A paper book records knowledge directly in written symbols, so as long as the meaning of the symbols is understood, people can read it no matter how much time has passed. This is why we can still read books from ancient times. An e-book is different: it needs an interpreter that reads the electronic data and converts it into visible written symbols. If the interpreter for a particular kind of e-book no longer exists, that kind of e-book can no longer be used.
(4) Paper books support free-form learning, innovation and creation. E-books are very powerful, for example keyword search can quickly locate a required knowledge point, but they require dedicated tools and operating procedures. When people learn and innovate, a freer and more casual approach may be more helpful. For example, jotting down a learning insight somewhere in a paper book may be more valuable for learning and innovation.
Whether traditional or electronic, a book exists to represent intricate knowledge and ideas so that users can learn and apply them. The more clearly and conveniently the complex relationships among knowledge and ideas are represented, the better the knowledge can be learned and applied. Therefore, designing a paper book construction method that represents knowledge relationships more clearly, drawing on the way e-books represent knowledge associations, is very valuable for learning, applying and innovating on knowledge.
Summary of the invention
For a paper book to represent knowledge relationships, that is, a knowledge network, more effectively, some theories and methods of knowledge representation from artificial intelligence research need to be applied.
There are many knowledge representation methods, such as the state space method, predicate logic, ontologies, semantic networks, frame systems, production rules, Petri nets and neural networks. All of these can be reduced to a standard model of knowledge representation, and all ultimately form a knowledge network. According to the standard model, knowledge is a reflection of the objective world; because the objective world is multidimensional, knowledge is multidimensional too. Applying knowledge depends on the knowledge network, so knowledge points are interrelated. Representing knowledge effectively therefore requires two things: representing it in multiple dimensions, and establishing the associations between pieces of knowledge.
E-books clearly have strong advantages for knowledge representation. We therefore first analyze how an e-book is constructed and then describe how to construct a paper book.
If all knowledge has the same granularity, or if the knowledge granules are indivisible within a given application, the knowledge can be represented and reasoned over to reach solutions. This is the basic guarantee behind the success of many artificial intelligence systems. For a real application domain, however, it is impossible to build a knowledge system composed entirely of indivisible knowledge points, because such a system would be enormous and its solving efficiency would be greatly reduced. E-books therefore cannot be built with general knowledge representation methods and must instead use methods such as those of expert systems. The invention uses the Problem Oriented Analysis and Decision (POAD) knowledge representation method to construct e-book knowledge; that is, it represents knowledge with a Problem Oriented Analysis and Decision Expert System (POADES). The basic concepts of a POAD expert system are as follows:
(1) Knowledge dictionary
The knowledge dictionary describes the objects in the knowledge system. It holds three things: the object name, the object type and the object description. The knowledge dictionary D can therefore be written as a triple:
$D = (O, T, E)$
Here O denotes the object, T denotes the classification of the object (what is commonly called its word class), and E denotes the semantic explanation of the object O.
POAD commonly uses five kinds of object: subject objects, problem objects, problem phenomenon objects, principle knowledge objects and process knowledge objects, denoted $O_M$, $O_{PB}$, $O_{PH}$, $O_{PL}$ and $O_{PR}$ respectively. Together these five are called MP4 objects.
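The triple and the five object types can be sketched as a small data structure. This is only an illustrative sketch: the names `ObjectType`, `DictionaryEntry` and `KnowledgeDictionary` are hypothetical, and rejecting duplicate object names is an assumption carried over from the later requirement that knowledge points have no name conflicts.

```python
from dataclasses import dataclass
from enum import Enum


class ObjectType(Enum):
    """The five MP4 object types named in the text (values are the symbols)."""
    SUBJECT = "O_M"
    PROBLEM = "O_PB"
    PHENOMENON = "O_PH"
    PRINCIPLE = "O_PL"
    PROCESS = "O_PR"


@dataclass(frozen=True)
class DictionaryEntry:
    """One triple (O, T, E): object name, object type, semantic explanation."""
    name: str
    type: ObjectType
    explanation: str


class KnowledgeDictionary:
    """Hypothetical container for dictionary entries, keyed by object name."""

    def __init__(self) -> None:
        self._entries: dict[str, DictionaryEntry] = {}

    def add(self, name: str, type_: ObjectType, explanation: str) -> DictionaryEntry:
        # Assumption: object names must be unique within the dictionary.
        if name in self._entries:
            raise ValueError(f"name conflict: {name!r}")
        entry = DictionaryEntry(name, type_, explanation)
        self._entries[name] = entry
        return entry

    def lookup(self, name: str) -> DictionaryEntry:
        return self._entries[name]
```

An entry would be added with, for example, `KnowledgeDictionary().add("motor overheats", ObjectType.PROBLEM, "...")`, where the object name and explanation are made-up sample data.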
(2) POAD knowledge unit
A POAD expert system is centered on problems and builds and applies knowledge in a way that matches human habits of thought. It uses a knowledge representation that supports problem analysis and decision-making and is centered on problem objects. Its knowledge unit, that is, a knowledge point, is shown in Figure 1.
The knowledge system K can be described as follows:
$K = (O_M, O_{PB}, O_{PH}, O_{PL}, O_{PR}, R)$
Here R denotes the relationships between the various objects, namely:
$R = (O_1, O_2)$
where $O_1$ and $O_2$ denote any two objects.
Relationships between objects
POAD divides the relationship between two objects into two kinds: simple correlation and causal relationship.
The simple correlation $R_S$ is defined as follows:
$R_S = (O_{PB}, O_2)$
where $O_2$ denotes an object of any type other than $O_{PB}$. $R_S$ is drawn as a single line in Figure 1.
The causal relationship $R_C$ is defined as follows:
$R_C = (O_{PB1}, O_{PB2})$
A causal relationship is a relationship between problem objects. Because causal relationships are usually described with a tree, the structure is called a causal tree. The causal tree T can be written as:
$T = (O_{PB}, R_C)$
$R_C$ is drawn as a double line in Figure 1. In general, $R_C$ expresses one of two relationships, "AND" or "OR". In Figure 2, lines with arrows indicate an "OR" relationship, and line segments joined by an arc indicate an "AND" relationship. For example, problem A is determined jointly by B, C and D, whereas C necessarily occurs as soon as any one of H, I or J occurs.
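The A/B/C/D and H/I/J example can be made concrete with a small AND-OR tree. This is a minimal sketch, assuming that a leaf problem "occurs" when it appears in a set of observed problems; the names `CausalNode`, `occurs` and `example_tree` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass, field


@dataclass
class CausalNode:
    """A problem object in a causal tree; kind is "and", "or" or "leaf"."""
    name: str
    kind: str = "leaf"
    children: list["CausalNode"] = field(default_factory=list)


def occurs(node: CausalNode, observed: set[str]) -> bool:
    """Decide whether a problem occurs, given the observed leaf problems."""
    if node.kind == "leaf":
        return node.name in observed
    results = [occurs(child, observed) for child in node.children]
    # AND: every child must occur. OR: one child is enough.
    return all(results) if node.kind == "and" else any(results)


def example_tree() -> CausalNode:
    """A is the AND of B, C, D; C is the OR of H, I, J (the example in the text)."""
    c = CausalNode("C", "or", [CausalNode("H"), CausalNode("I"), CausalNode("J")])
    return CausalNode("A", "and", [CausalNode("B"), c, CausalNode("D")])
```

With this tree, observing B, D and I is enough for A to occur, because I alone triggers C; removing D breaks the AND at A.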
Causal relationships reflect people's understanding of the fundamental principles of nature and express the decomposability of problem objects. The condition for ending causal decomposition depends on the application.
POAD matches human habits of thought. POADES, the expert system developed with POAD, is easy to develop, places no limit on the amount of knowledge, and is convenient for reasoning. It can therefore be used not only to develop expert systems but also to construct e-books.
Because a paper book records knowledge on static paper, it is confined to the two-dimensional plane of the page. Formally, a paper book records knowledge in two dimensions. Locally, each knowledge point in a book may itself be multidimensional; a mathematical expression, for example, can be regarded as expressing knowledge in a multidimensional way. But if each knowledge point is treated as a single point, then the book as a whole records knowledge in one dimension. One difference between paper books and e-books is therefore that a paper book cannot jump between knowledge points directly through links as an e-book can; some jump convention must be defined before a paper book can express complex knowledge relationships. Another difference is that a paper book is static and cannot reason or solve problems automatically while the reader uses it, so certain simplifications are possible.
To let paper books express knowledge and ideas better, the invention proposes a paper book construction method based on the POAD expert system. The method first analyzes and synthesizes the content the book is to present, divides it into knowledge units according to the book's main objective and core problems, and establishes knowledge point objects and their descriptions. For a paper book that must represent domain knowledge very precisely, MP4 objects can be established after analyzing, generalizing and synthesizing the knowledge and ideas to be expressed, with a suitable knowledge granularity chosen for each object. For a paper book with ordinary requirements, this can be simplified. In the invention, knowledge point objects replace MP4 objects; that is, only knowledge point objects are established. A knowledge point object is centered on a problem object and collapses the whole knowledge unit into a single object, without separately discussing the subject, phenomenon, principle knowledge and process knowledge objects related to the problem object. The content of the MP4 objects is contained within the knowledge point object and is left for the reader to recognize through reading.
Each knowledge point object contains two parts: a description of the knowledge point at the current level, and the internal composition of the knowledge point, expressed by its sub-knowledge point objects.
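The two-part knowledge point object described above can be sketched as follows. The class and method names (`KnowledgePoint`, `KnowledgeBase`, `undefined_sub_points`) are hypothetical; storing children by name rather than by reference is an assumption made so that several parents can list the same sub-knowledge point.

```python
from dataclasses import dataclass, field


@dataclass
class KnowledgePoint:
    """A knowledge point: its own description plus the names of its children."""
    name: str
    description: str  # description at the current level only
    sub_points: list[str] = field(default_factory=list)  # child names, not content


class KnowledgeBase:
    """Hypothetical registry that enforces unique knowledge point names."""

    def __init__(self) -> None:
        self.points: dict[str, KnowledgePoint] = {}

    def add(self, point: KnowledgePoint) -> None:
        if point.name in self.points:
            raise ValueError(f"name conflict: {point.name!r}")
        self.points[point.name] = point

    def undefined_sub_points(self) -> set[str]:
        """Names listed as children that have no knowledge point of their own."""
        listed = {s for p in self.points.values() for s in p.sub_points}
        return listed - set(self.points)
```

A check such as `undefined_sub_points()` is not part of the described method; it is included only as one way an author might catch a child that is listed but never written.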
Next, a knowledge trunk tree is built according to the book's main objective: all knowledge points are connected by causal relationship, using depth-first search, into a single knowledge trunk tree. On this tree the knowledge points are organized by parent-child relationships, which are mainly classification and composition relationships but may be of other kinds. Together they form an AND-OR tree. The structure looks like a tree but is in essence a graph. In addition, any two nodes on the tree are allowed to be related, so a node can be related to a node on another branch, which yields a knowledge network. The knowledge trunk tree constitutes the catalog of the paper book. It reflects the book's main line of thought, and through the connections on the tree it also reflects the internal relationships of the book's knowledge more deeply and completely.
Finally, functions such as conventional retrieval and knowledge point retrieval are provided for the book's content.
In the following discussion, to stay consistent with conventional books, the knowledge trunk tree is called the catalog and its nodes are called entries; each entry represents a knowledge point. A knowledge point and its name are unique within the paper book, but entries with the same name are not: the catalog may contain several entries with the same name, all pointing to the same knowledge point. The method comprises the following steps:
1) Construct the knowledge point objects
Decompose the domain knowledge covered by the paper book into a series of knowledge points with no name conflicts.
Each knowledge point is set apart by several independent passages of written symbols (which may include graphical symbols, mathematical symbols and so on); that is, each knowledge point is represented by a knowledge point name and its corresponding set of symbols. A knowledge point contains only the description at its own level; its sub-knowledge points are listed as entries only, without their content.
2) Construct the catalog of the paper book
According to the theme of the paper book, organize all knowledge point objects by causal relationship, using depth-first search, into a knowledge trunk tree, and display it as a catalog structure.
The main steps are as follows:
(2.1) Set the usage flag of every knowledge point to "unused". Take the overall title of the book, that is, the top-level knowledge point, as the current entry, record it on the catalog tree, and set the usage flag of the current entry and of its knowledge point to "used".
(2.2) Find all sub-knowledge points of the current entry (that is, of the current knowledge point), make them sub-entries of the current entry, and arrange them in some order.
(2.3) For the sub-entries of the current entry, select each one in the chosen order and record it on the catalog tree.
(2.4) If the usage flag of the sub-entry's knowledge point is already "used", set the usage flag on the sub-entry to "reference"; otherwise set its usage flag to "used".
(2.5) If the sub-entry has sub-entries of its own and the usage flag of its knowledge point is "unused", set that knowledge point's usage flag to "used" and repeat (2.2) to (2.5) until every knowledge point has been used. The result is a catalog tree.
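Steps (2.1) to (2.5) can be sketched as a recursive depth-first traversal. This is one possible reading, not the patent's reference implementation: it assumes that a knowledge point is expanded only at its first occurrence and appears as an unexpanded "reference" entry afterwards, and that "some order" is simply the order of the input lists. `Entry`, `build_catalog`, `flatten` and the `children_of` mapping are hypothetical names.

```python
from dataclasses import dataclass, field


@dataclass
class Entry:
    """One catalog entry; flag is "used" (first occurrence) or "reference"."""
    name: str
    flag: str
    depth: int
    children: list["Entry"] = field(default_factory=list)


def build_catalog(root: str, children_of: dict[str, list[str]]) -> Entry:
    """Build the catalog tree from a mapping of knowledge point -> child names."""
    used: set[str] = set()  # (2.1) every knowledge point starts out "unused"

    def visit(name: str, depth: int) -> Entry:
        if name in used:
            # (2.4) already written elsewhere: record a reference, do not expand
            return Entry(name, "reference", depth)
        used.add(name)
        entry = Entry(name, "used", depth)
        for child in children_of.get(name, []):  # (2.2), (2.3): children in order
            entry.children.append(visit(child, depth + 1))  # (2.5): recurse
        return entry

    return visit(root, 0)


def flatten(entry: Entry) -> list[tuple[int, str, str]]:
    """List the entries in catalog order as (depth, name, flag) rows."""
    rows = [(entry.depth, entry.name, entry.flag)]
    for child in entry.children:
        rows.extend(flatten(child))
    return rows
```

Because a name that is already marked "used" is never expanded again, the traversal also terminates if the relationships contain a cycle, which matters since the text notes that the trunk tree is in essence a graph.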
3) Construct the body of the paper book
Using the catalog tree obtained in 2), start from the beginning of the catalog and write the knowledge point description of each entry into the sequentially paginated body of the book. The body also records the starting page number of each knowledge point and symbols that indicate the relationships between related entries of the catalog tree. The steps are as follows:
(3.1) Take an entry from the catalog tree, make it the current entry, and find its knowledge point.
(3.2) If the usage flag of the current entry is "used", append the description of the current knowledge point to the end of the current body page and record the knowledge point's starting position as the page number of the current entry. If the usage flag of the current entry is "reference", find the starting page number of the corresponding knowledge point and write it as the page number of the current catalog entry, prefixed with "*" to distinguish it as a reference.
(3.3) On the first page of the current knowledge point, if the entry preceding the current entry is its parent node, mark the parent-child relationship symbol "┕" at the start of the knowledge point or in the left margin. The absence of the mark means that the preceding entry and the current entry are siblings.
(3.4) On each body page that the current knowledge point occupies, if the knowledge point is at the very top of the page, mark the knowledge point's starting page number in the page header.
(3.5) On the last page of the current knowledge point, if the current entry is a leaf node, mark the knowledge point terminator "■" at its end or in the right margin. The absence of the mark means the entry is a branch node and that what follows consists of its child nodes.
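The page-numbering and marker rules of steps (3.2), (3.3) and (3.5) can be sketched on a flattened catalog. The sketch makes several simplifying assumptions that are not in the patent: each knowledge point has a known length in lines, a page holds a fixed number of lines, and the running header of step (3.4) is omitted. The function `layout_body` and its parameters are hypothetical.

```python
def layout_body(
    rows: list[tuple[int, str, str]],
    line_counts: dict[str, int],
    lines_per_page: int = 40,
) -> tuple[list[tuple[int, str, str]], list[dict]]:
    """Assign page numbers and relationship markers.

    rows are (depth, name, flag) catalog entries in depth-first order.
    Returns (catalog with page labels, body records with markers).
    """
    start_page: dict[str, int] = {}
    catalog: list[tuple[int, str, str]] = []
    body: list[dict] = []
    lines_written = 0
    for i, (depth, name, flag) in enumerate(rows):
        if flag == "reference":
            # (3.2) a reference entry reuses the start page, prefixed with "*"
            catalog.append((depth, name, f"*{start_page[name]}"))
            continue
        page = lines_written // lines_per_page + 1  # page where the text begins
        start_page[name] = page
        catalog.append((depth, name, str(page)))
        # (3.3) in depth-first order the previous entry is the parent
        # exactly when it sits one level higher
        prev_is_parent = i > 0 and rows[i - 1][0] == depth - 1
        # (3.5) an entry is a leaf when no deeper entry follows it
        is_leaf = i + 1 == len(rows) or rows[i + 1][0] <= depth
        body.append({
            "name": name,
            "page": page,
            "prefix": "┕" if prev_is_parent else "",
            "suffix": "■" if is_leaf else "",
        })
        lines_written += line_counts[name]
    return catalog, body
```

A reference entry always follows the first occurrence of its knowledge point in depth-first order, so its start page is already known when the entry is reached.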
4) Construct retrieval methods for knowledge points and other content
The steps are as follows:
(4.1) List the names of all knowledge points with their page numbers, in some order, at the back of the paper book.
(4.2) List all other keywords in the paper book with their page numbers, for example all concepts and terms, in some order at the back of the paper book.
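Both lists in step 4) amount to a sorted index of terms and page numbers. The sketch below assumes case-insensitive alphabetical order as the "some order" and a simple "term: pages" line format; neither is specified in the patent, and `build_index` is a hypothetical name.

```python
def build_index(pages_by_term: dict[str, list[int]]) -> list[str]:
    """Return index lines sorted case-insensitively by term.

    pages_by_term maps a knowledge point name or keyword to the pages
    on which it appears; duplicate pages are collapsed.
    """
    lines = []
    for term in sorted(pages_by_term, key=str.casefold):
        pages = sorted(set(pages_by_term[term]))
        lines.append(f"{term}: {', '.join(str(p) for p in pages)}")
    return lines
```

The same function can serve the knowledge point list of (4.1), where each name maps to its single starting page, and the keyword list of (4.2), where a term may appear on several pages.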
The beneficial effects of the invention are as follows. The proposed method gives the compiled paper book a clearer hierarchy, makes it more convenient to use, and makes it easier to establish the intrinsic relationships among the knowledge inside the book. This helps the author organize and express the intended knowledge and ideas accurately, and lets readers understand the book's content and complex relationships more easily and more deeply by reading and tracing back through the catalog structure and the body, while still enjoying the pleasure and the sensations of reading a traditional paper book.
Description of the drawings
Figure 1 is a structural diagram of a knowledge unit in POAD;
Figure 2 is a causal tree diagram in POAD;
Figure 3 is a diagram of the catalog structure of the paper book;
Figure 4 is a diagram of the body structure of the paper book;
Figure 5 is a flowchart of the construction of the paper book catalog;
Figure 6 is a flowchart of the construction of the paper book body.
Detailed description
To make the purpose, technical solution and advantages of the invention clearer, the invention is described in further detail below with reference to the drawings and embodiments. The specific embodiments described here serve only to explain the invention and do not limit it. In addition, the technical features involved in the embodiments described below can be combined with one another as long as they do not conflict.
The invention assumes that the task of constructing a paper book is to organize existing knowledge, given the current level of understanding and the available materials, into a paper book that expresses a certain knowledge network for readers.
In the following discussion, the knowledge point is the building unit of the paper book, and the catalog is the book's knowledge organization structure, that is, a mapping of the book's knowledge trunk tree. Each entry in the catalog corresponds to a node on the knowledge trunk tree and to one knowledge point in the body of the book, and each knowledge point in the body corresponds to at least one entry in the catalog. The method comprises the following steps:
1) Construct the knowledge point objects
Decompose the domain knowledge covered by the paper book into a series of knowledge points with no name conflicts.
Each knowledge point is set apart by several independent passages of written symbols (which may include graphical symbols, mathematical symbols and so on); that is, each knowledge point is represented by a knowledge point name and its corresponding set of symbols. A knowledge point contains only the description at its own level; its sub-knowledge points are listed as entries only, without their content.
构建纸质书的最基础工作是对该纸质书所要表达的知识和思想进行整理,将它们分解成一系列不存在名称冲突的知识点对象。知识点的知识颗粒大小的设定非常重要。显然,纸质书是不可能做成非常完备的知识系统的,即它所涉及的知识点的知识颗粒不可能细致到都是不可再分的,这是因为一方面这是非常难以做到的,另一方面即使可以将所有的知识点都划分成不可再分的知识颗粒,纸质书也是没有办法予以记录的,因为知识点数量将极其巨大。事实上,即使是作者本人,也不一定能对该书所涉及的全部知识都认识清楚,因此,在任何纸质书中均存在一定的模糊部分。一般来说,知识点的设置应该遵循以下原则:能够反映该领域的主流知识;能够解决相关的问题;对于该领域读者的常识知识可以简略;符合读者的使用习惯等等。The most basic work of constructing a paper book is to sort out the knowledge and ideas expressed by the paper book, and decompose them into a series of knowledge point objects without name conflicts. The setting of the knowledge particle size of knowledge points is very important. Obviously, paper books cannot be made into a very complete knowledge system, that is, the knowledge particles of the knowledge points involved cannot be so detailed that they cannot be divided. This is because on the one hand, it is very difficult to achieve On the other hand, even if all knowledge points can be divided into indivisible knowledge particles, there is no way to record them in paper books, because the number of knowledge points will be extremely large. In fact, even the author himself does not necessarily have a clear understanding of all the knowledge involved in the book, so there is a certain amount of ambiguity in any paper book. Generally speaking, the setting of knowledge points should follow the following principles: it can reflect the mainstream knowledge in the field; it can solve related problems; the common sense knowledge of readers in this field can be brief; it can meet the readers' usage habits, etc.
Knowledge point objects must satisfy a hierarchical relationship. The relationships among knowledge point objects are complex, but the POAD method can be used to organize them hierarchically around a given topic: every knowledge point is placed in a parent-child relationship based on a generalized causal relationship, which yields a knowledge trunk tree. This tree is in fact an AND/OR graph, so it can represent more complex relationships among knowledge points. All parent-child relationships fall into two classes: classification relationships and composition relationships. Under a classification relationship the child nodes are related by OR; under a composition relationship they are related by AND.
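As an illustration only, the AND/OR structure described above could be modeled as follows. The names `Relation`, `KnowledgePoint`, and `is_covered` are hypothetical and do not come from the patent, and the "covered" check is simply the standard way of evaluating an AND/OR graph (an OR node needs any child, an AND node needs all children), used here to make the two relationship types concrete.

```python
from dataclasses import dataclass, field
from enum import Enum


class Relation(Enum):
    CLASSIFICATION = "or"   # children are alternatives (OR relationship)
    COMPOSITION = "and"     # children are all required parts (AND relationship)


@dataclass
class KnowledgePoint:
    name: str
    description: str = ""
    relation: Relation = Relation.COMPOSITION
    children: list["KnowledgePoint"] = field(default_factory=list)


def is_covered(point: KnowledgePoint, known: set[str]) -> bool:
    """Evaluate the AND/OR graph: is `point` covered by the known leaf names?"""
    if not point.children:
        return point.name in known
    results = [is_covered(child, known) for child in point.children]
    if point.relation is Relation.CLASSIFICATION:
        return any(results)   # OR: one kind is enough
    return all(results)       # AND: every component is needed


# Hypothetical example data.
sorting = KnowledgePoint(
    "sorting",
    relation=Relation.CLASSIFICATION,
    children=[KnowledgePoint("quicksort"), KnowledgePoint("mergesort")],
)
book = KnowledgePoint(
    "algorithms",
    relation=Relation.COMPOSITION,
    children=[sorting, KnowledgePoint("complexity")],
)
```

In this sketch the relation is stored on the parent, which mirrors the statement that the type of a parent-child relationship determines how the children combine.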
An AND/OR graph naturally connects the many knowledge points of a field, but if every knowledge point were expanded wherever it occurs, there would be a great deal of repetition. To solve this, two knowledge points that have no parent-child relationship must also be allowed to be related; that is, nodes on several knowledge trunk trees may reference the same knowledge point. The knowledge trunk tree is therefore, strictly speaking, a knowledge trunk graph.
A knowledge point in the paper book can therefore be represented by its name and its descriptive symbols, which are inserted into the body text in a later step. The knowledge trunk tree corresponds to the catalog of the paper book, and each node corresponds to a catalog entry; several entries are allowed to be identical.
2) Constructing the paper book catalog
Constructing the paper book catalog means organizing all knowledge point objects, according to the book's topic and their causal relationships, into a knowledge trunk tree by depth-first search, and presenting that tree using a catalog structure.
The core of any book is its catalog, which in essence presents the knowledge system of the whole book. In ordinary paper books the catalog is also built according to some rule, such as classification or composition relationships. In practice, however, to reduce the number of levels, eliminate redundancy, ease reading, and emphasize key points, many catalogs are built in a mixed fashion, and some are even logically inconsistent. The main function of such a catalog is not to organize knowledge but to make the content easy to look up and use; readers must construct the knowledge points and their interrelationships themselves.
This shortcoming of traditional paper books arises because a book is a two-dimensional medium. An e-book can relate knowledge points through links, so the relationships can be built into the e-book in advance, which is very important for learning and applying knowledge. A paper book lacks this inherent advantage, but annotating it with some additional information can give it a degree of knowledge-linking capability as well.
The present invention establishes associations between knowledge points by building a catalog and marking each catalog entry with the page number where its knowledge point is located. When the catalog is built, every knowledge point must follow the rules set out in step 1), i.e., satisfy the prescribed parent-child relationships. In addition, for the reader's convenience the catalog must be arranged in depth-first order, so that whenever the reader is referred to another knowledge point, that knowledge point appears earlier in the body text than the current one; in other words, it has already been studied.
In the present invention the catalog has only two kinds of parent-child relationship, classification and composition, so the catalog is an AND/OR graph. Because the two are very easy to tell apart, the catalog does not mark them specially; the reader determines whether a relationship is AND or OR while reading.
The main steps for constructing the paper book catalog are as follows:
(2.1) Set the use flag of every knowledge point to “unused”. Take the overall title of the paper book, i.e., the top-level knowledge point, as the current entry, record it in the catalog tree, and set the use flag of the current entry and of its knowledge point to “used”.
(2.2) Find all sub-knowledge points of the current entry (i.e., the current knowledge point), make them the sub-entries of the current entry, and arrange them in some order.
(2.3) Take each sub-entry of the current entry in the chosen order and record it in the catalog tree.
(2.4) If the knowledge point of the sub-entry already has the use flag “used”, set the use flag on the sub-entry to “referenced”; otherwise set the sub-entry's use flag to “used”.
(2.5) If the sub-entry has sub-entries of its own and the use flag of its knowledge point is “unused”, set that knowledge point's use flag to “used” and repeat (2.2)-(2.5) until all knowledge points have been used. The result is a catalog tree.
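Steps (2.1) to (2.5) amount to a depth-first traversal in which a knowledge point is expanded only the first time it is met. A minimal sketch follows, assuming the parent-child relationships are given as a dictionary from a knowledge point name to its ordered list of child names. The function name `build_catalog`, the input shape, and the English flag strings are assumptions made for illustration, not part of the patent.

```python
def build_catalog(children: dict[str, list[str]], root: str) -> list[tuple[int, str, str]]:
    """Return catalog entries as (depth, name, flag) in depth-first order."""
    used: set[str] = {root}                     # (2.1) only the root starts as used
    catalog = [(0, root, "used")]

    def visit(name: str, depth: int) -> None:
        for child in children.get(name, []):    # (2.2)-(2.3) children in their set order
            if child in used:                   # (2.4) already described elsewhere
                catalog.append((depth + 1, child, "referenced"))
            else:
                used.add(child)                 # (2.5) first use: record and expand
                catalog.append((depth + 1, child, "used"))
                visit(child, depth + 1)

    visit(root, 0)
    return catalog


# Hypothetical example: "C" is a child of both "A" and "B".
example = build_catalog({"Book": ["A", "B"], "A": ["C"], "B": ["C", "D"]}, "Book")
```

Because a referenced entry is never expanded, the traversal also terminates on graphs in which several parents share a child.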
3) Constructing the paper book body text
The catalog tree obtained in step 2) records the hierarchical relationships among knowledge points but not their content. The body text of the paper book records the content of all knowledge points. It must meet the following requirements:
A) Record the content of each knowledge point completely
The content of a knowledge point comprises its name (entry), its description, and its constituent sub-knowledge points. In the body text, the complete record of a knowledge point therefore consists of its entry, its description, and its sub-entries with their descriptions; of these, the entry is indispensable. The complete record of any knowledge point appears only once; every other occurrence can only be a reference.
B) Store the start position of the current knowledge point in the corresponding catalog entries
Each knowledge point is described in only one place in the body text, so it has only one start position. Several catalog entries may use or reference that knowledge point, however, so all of them must point to the same start position. Because the catalog must also give the reader an overall sense of direction, referencing entries are marked, which distinguishes the first use of a knowledge point from later references without disturbing the overall ordering of the catalog. A reference entry inside a knowledge point is part of that knowledge point's description and does not constitute a knowledge point by itself. It is equivalent to ordinary descriptive text, except that the start page of the referenced knowledge point can be attached to it.
C) Mark the start position of the first knowledge point on the current page
A page may contain several knowledge points. All but the first necessarily start on that page and need no special mark. The first knowledge point may have started on an earlier page, or it may start on the current page (in which case the first line of the page is its start position); its start position is therefore marked in the page header so the reader can look it up.
D) Distinguish the relationship between the current node and the previous node
Each entry on a body page (the current node) can stand in one of three relationships to the previous node: parent-child, sibling, or start of the body text, where the current entry is the first entry of the whole book. The start entry needs no special treatment, so only parent-child and sibling relationships are distinguished; telling them apart helps the reader understand the content of the knowledge points and how they relate.
E) Distinguish the relationship between the current node and the next node
The end of a knowledge point on a body page marks the end of that knowledge point. Its entry (the current node) can likewise stand in one of three relationships to the next node: parent-child, sibling, or end of all knowledge points, where no further content follows. The current node can accordingly be classed as a leaf node or a branch node. Distinguishing these cases also helps the reader understand the content and relationships of the knowledge points.
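Requirements D) and E) can be derived from the depth of each entry once the entries are in depth-first order. The sketch below assumes the depths are available as a plain list; the function name `annotate` and the tuple layout are hypothetical. An entry follows its parent when the previous entry is exactly one level shallower, and it is a leaf when the next entry is not deeper or when nothing follows.

```python
def annotate(depths: list[int]) -> list[tuple[bool, bool]]:
    """For each entry in depth-first order, return (follows_parent, is_leaf)."""
    marks = []
    for i, depth in enumerate(depths):
        # D) the previous entry is the parent only if it is one level shallower
        follows_parent = i > 0 and depths[i - 1] == depth - 1
        # E) a leaf has no deeper entry directly after it
        is_leaf = i + 1 == len(depths) or depths[i + 1] <= depth
        marks.append((follows_parent, is_leaf))
    return marks


# Hypothetical depths for the entries Book, A, C, B, C', D.
marks = annotate([0, 1, 2, 1, 2, 2])
```

An entry with `follows_parent` set would receive the left-margin parent-child mark, and an entry with `is_leaf` set would receive the right-margin terminator.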
The steps for constructing the body text are as follows:
(3.1) Take an entry from the catalog tree, make it the current entry, and find the corresponding knowledge point.
(3.2) If the use flag of the current entry is “used”, append the description of the current knowledge point to the end of the body text and record its start position as the page number of the current entry. If the use flag is “referenced”, look up the start page of the corresponding knowledge point and write it as the page number of the current catalog entry, prefixed with “*” to mark it as a reference.
(3.3) On the first page of the current knowledge point, if the entry preceding the current entry is its parent node, mark the parent-child symbol “┕” at the start of the entry or in the left margin. An unmarked entry is a sibling of the preceding entry.
(3.4) On every body page containing the current knowledge point, if that knowledge point is the first on the page, mark its starting page number in the page header.
(3.5) On the last page of the current knowledge point, if the current entry is a leaf node, mark the terminator symbol “■” at its end or in the right margin. An unmarked entry is a branch node, meaning the text that follows consists of child nodes of the current knowledge point.
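Step (3.2) can be sketched as a single pass over the catalog that lays out each “used” knowledge point and resolves each “referenced” entry to the page already assigned. The sketch assumes a fixed number of lines per page and a known line count for each knowledge point; `assign_pages`, `line_counts`, and `lines_per_page` are hypothetical names, and real pagination in a word processor would be more involved.

```python
def assign_pages(
    catalog: list[tuple[str, str]],
    line_counts: dict[str, int],
    lines_per_page: int = 40,
) -> list[tuple[str, str]]:
    """Return (entry name, page label) pairs; references are prefixed with '*'."""
    start_page: dict[str, int] = {}
    offset = 0  # number of body lines placed so far
    toc = []
    for name, flag in catalog:
        if flag == "used":
            # First use: the description starts where the body currently ends.
            start_page[name] = offset // lines_per_page + 1
            offset += line_counts[name]
            toc.append((name, str(start_page[name])))
        else:
            # Reference: depth-first order guarantees the page is already known.
            toc.append((name, "*" + str(start_page[name])))
    return toc


# Hypothetical catalog in depth-first order and line counts.
toc = assign_pages(
    [("A", "used"), ("C", "used"), ("B", "used"), ("C", "referenced"), ("D", "used")],
    {"A": 30, "C": 20, "B": 50, "D": 10},
)
```

Several knowledge points can share a page in this model, which is why the header mark of step (3.4) is needed for the first knowledge point on each page.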
4) Building an index for retrieving knowledge points and other content
In many cases retrieval by knowledge point name or by keyword is highly practical, so a conventional index is important. The index is built as follows:
(4.1) List the names of all knowledge points with their page numbers, in some order, at the back of the paper book.
(4.2) List all other keywords in the paper book with their page numbers, for example all concepts and terms, in some order at the back of the paper book.
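A minimal sketch of steps (4.1) and (4.2), assuming the terms and their pages have already been collected into a dictionary. The function name `build_index`, the case-insensitive alphabetical order, and the output format are illustrative choices; the patent only requires "some order".

```python
def build_index(pages_by_term: dict[str, list[int]]) -> list[str]:
    """Return index lines sorted by term, each with its sorted, de-duplicated pages."""
    lines = []
    for term in sorted(pages_by_term, key=str.casefold):
        pages = ", ".join(str(p) for p in sorted(set(pages_by_term[term])))
        lines.append(f"{term}  {pages}")
    return lines


# Hypothetical terms and page numbers.
index_lines = build_index({"tree": [12, 3, 12], "AND/OR graph": [5]})
```

The same function can serve both lists: once for knowledge point names, where each name has a single start page, and once for other keywords.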
The implementation of the paper book construction method of the present invention is illustrated below using the construction of a paper book as an example. The computer Office software suite is used as the means of implementation, and Figures 3 to 6 serve as illustrations. The flow and implementation of the disclosed method are as follows:
Step 1: Construct the knowledge point objects
Decompose the domain knowledge covered by the paper book into a series of knowledge points with no name conflicts. Each knowledge point is stored in one Word file whose file name is the knowledge point name and whose content is set apart as several independent passages of text (which may include graphical and mathematical symbols). If the knowledge point contains sub-knowledge points, their entry names can be added to the file; whether the content of the sub-entries is included is decided as needed when the body text is constructed.
Step 2: Construct the paper book catalog
Create a file to record the catalog of the paper book. The title of the book serves as the root node of the catalog but is not written into it. Starting from the root node, search all knowledge points depth-first to construct all catalog entries. The main steps are shown in Figure 5:
Step 501: Start.
Step 502: Set the start entry as the current entry. Create a knowledge point tracking file named Knowledge.xls with three columns: knowledge point name, use flag, and start page. Enter the name of every knowledge point and set its use flag to “unused”. Create a catalog file, mulu.doc, to record the catalog of the paper book. As shown in Figure 3, each line in mulu.doc is one entry consisting of two parts, the entry name and the page number. Each entry is formatted according to its level; for example, first-level entries use the first-level heading format, and so on.
Step 502 applies only to the root node, and the current entry does not need to be written into the catalog.
Step 503: Find all sub-entries of the current entry and record them in order in a separate text file named mulu.tmp.
Step 504: Check whether mulu.tmp contains any unused entries. If so, go to step 505; otherwise go to step 511.
Step 505: Take one entry from mulu.tmp, make it the current entry, write it into the catalog file mulu.doc, and delete it from mulu.tmp.
Step 506: Using the entry name, find the record of the corresponding knowledge point in Knowledge.xls.
Step 507: If the use flag of that knowledge point is “used”, go to step 512; otherwise go to step 508.
Step 508: Write the flag “used” in the page number field of the current entry in mulu.doc.
Step 509: If the current entry has sub-entries, go to step 510; otherwise go to step 504.
Step 510: Call the current procedure recursively, i.e., go to step 503.
Step 511: Exit. If the current procedure was called recursively, return to the procedure that called it.
Step 512: Write the flag “referenced” in the page number field of the current entry in mulu.doc.
The result is the catalog shown in Figure 3.
Step 3: Construct the paper book body text
The steps for constructing the body text are shown in Figure 6:
Step 601: Start. Create a file for the body text, content.doc. Each page of content.doc is divided into five areas, as shown in Figure 4: 401 is the header area, 402 the body area, 403 the leaf node mark area on the right, 405 the page number area, and 406 the parent-child mark area on the left; 404 marks, within the body area, the start page of the knowledge point that a reference entry points to. Open mulu.doc and position at its beginning.
Step 602: Move to the next entry and find the record of the corresponding knowledge point in Knowledge.xls.
Step 603: Read the flag in the page number field of the current entry in mulu.doc.
Step 604: If the flag is “used”, go to step 619; otherwise go to step 605.
Step 605: Obtain the start page of the corresponding knowledge point from Knowledge.xls.
Step 606: Prefix that page number with “*” and overwrite the flag in the page number field of the current entry in mulu.doc with it.
Step 607: Append the name of the current entry and its page number to the end of content.doc.
Step 608: If the current entry is a child node of the preceding entry, go to step 609; otherwise go to step 610.
Step 609: Mark the symbol “┕” in the left parent-child mark area of content.doc.
Step 610: Mark the name and start page of the first knowledge point on the current page of content.doc in the page header.
Step 611: Go to the next page of content.doc.
Step 612: End.
Step 613: Check whether mulu.doc still contains unprocessed entries. If so, go to step 602; otherwise go to step 612.
Step 614: Mark the symbol “■” in the right leaf node mark area of content.doc.
Step 615: Check whether the current entry is a leaf node or a reference node. If so, go to step 614; otherwise go to step 613.
Step 616: Check whether the last page of content.doc has been reached. If so, go to step 615; otherwise go to step 611.
Step 617: Go to the start page of the current knowledge point in content.doc.
Step 618: Overwrite the flag in the page number field of the current entry in mulu.doc with the start page of the current knowledge point.
Step 619: Copy the data from the current knowledge point file to the end of content.doc and record the page number of the knowledge point's start position in the page number field of Knowledge.xls.
Step 4: Build the index for retrieving knowledge points and other content
(4.1) Sort all knowledge point records in Knowledge.xls in ascending order by name and copy the sorted knowledge point names with their page numbers to the end of content.doc.
(4.2) Enter all keywords of the paper book with their page numbers into an Excel file, sort them, and copy the sorted result to the end of content.doc.
In the end, mulu.doc holds the catalog of the paper book, and content.doc holds its body text and index.
These implementation steps are suitable for constructing paper books in general. For those of ordinary skill in the art, the method of the present invention can be implemented on a general computer system or by manual arrangement; for example, it can be implemented using software such as Office.
Those skilled in the art will readily understand that the above is only a preferred embodiment of the present invention and is not intended to limit it. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610035540.2A CN105718649A (en) | 2016-01-20 | 2016-01-20 | Construction method for paper book capable of hierarchically displaying knowledge network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610035540.2A CN105718649A (en) | 2016-01-20 | 2016-01-20 | Construction method for paper book capable of hierarchically displaying knowledge network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105718649A true CN105718649A (en) | 2016-06-29 |
Family
ID=56147360
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610035540.2A Pending CN105718649A (en) | 2016-01-20 | 2016-01-20 | Construction method for paper book capable of hierarchically displaying knowledge network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105718649A (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107944023A (en) * | 2017-12-12 | 2018-04-20 | 广东小天才科技有限公司 | Exercise pushing method and system and terminal equipment |
CN108073646A (en) * | 2016-11-18 | 2018-05-25 | 北大方正集团有限公司 | Catalog extraction method and device |
CN108572956A (en) * | 2017-03-07 | 2018-09-25 | 北京新唐思创教育科技有限公司 | Method and device for invoking knowledge point slicing |
CN110178139A (en) * | 2016-11-14 | 2019-08-27 | 柯达阿拉里斯股份有限公司 | Use the system and method for the character recognition of the full convolutional neural networks with attention mechanism |
CN113377895A (en) * | 2021-05-13 | 2021-09-10 | 王宇 | Network knowledge point organization method and device based on directory tree |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1908931A (en) * | 2006-08-22 | 2007-02-07 | 北京北大方正电子有限公司 | Literal data variable typesetting method |
US20140099038A1 (en) * | 2012-10-04 | 2014-04-10 | Fuji Xerox Co., Ltd. | Information processing apparatus, information processing method, and non-transitory computer readable medium |
CN104408029A (en) * | 2014-10-30 | 2015-03-11 | 华中科技大学 | Electronic book construction method |
Non-Patent Citations (2)
Title |
---|
张俊英: "基于谱聚类的图书目录结构", 《万方学位论文全文数据库》 * |
涂平晖等: "POADES知识库的构建", 《计算机工程》 * |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110178139A (en) * | 2016-11-14 | 2019-08-27 | 柯达阿拉里斯股份有限公司 | Use the system and method for the character recognition of the full convolutional neural networks with attention mechanism |
CN110178139B (en) * | 2016-11-14 | 2023-05-09 | 柯达阿拉里斯股份有限公司 | System and method for character recognition using a full convolutional neural network with attention mechanisms |
CN108073646A (en) * | 2016-11-18 | 2018-05-25 | 北大方正集团有限公司 | Catalog extraction method and device |
CN108073646B (en) * | 2016-11-18 | 2021-12-24 | 北大方正集团有限公司 | Directory extraction method and device |
CN108572956A (en) * | 2017-03-07 | 2018-09-25 | 北京新唐思创教育科技有限公司 | Method and device for invoking knowledge point slicing |
CN108572956B (en) * | 2017-03-07 | 2021-07-09 | 北京新唐思创教育科技有限公司 | Method and device for calling knowledge point slice |
CN107944023A (en) * | 2017-12-12 | 2018-04-20 | 广东小天才科技有限公司 | Exercise pushing method and system and terminal equipment |
CN107944023B (en) * | 2017-12-12 | 2020-11-27 | 广东小天才科技有限公司 | A method, system and terminal device for pushing exercises |
CN113377895A (en) * | 2021-05-13 | 2021-09-10 | 王宇 | Network knowledge point organization method and device based on directory tree |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Van Hooland et al. | Linked Data for Libraries, Archives and Museums: How to clean, link and publish your metadata | |
Van Ham et al. | Mapping text with phrase nets | |
Boonstra et al. | Past, present and future of historical information science | |
RU2686000C1 (en) | Retrieval of information objects using a combination of classifiers analyzing local and non-local signs | |
RU2732850C1 (en) | Classification of documents by levels of confidentiality | |
WO2014160379A1 (en) | Dimensional articulation and cognium organization for information retrieval systems | |
CN108647244A (en) | The tutorial resources integration method of mind map form, network store system | |
CN105718649A (en) | Construction method for paper book capable of hierarchically displaying knowledge network | |
CN105893485A (en) | Automatic special subject generating method based on book catalogue | |
Piasecki et al. | WordNetLoom: a WordNet development system integrating form-based and graph-based perspectives | |
CN105260488A (en) | Text sequence iterative method for semantic understanding | |
Kutter | Corpus analysis | |
CN110781289B (en) | Text visualization method for reserving unstructured text semantics | |
Pavlić et al. | Graph-based formalisms for knowledge representation | |
Repke et al. | Extraction and representation of financial entities from text | |
CN104408029B (en) | A kind of e-book construction method | |
Song et al. | Scalable distributed semantic network for knowledge management in cyber physical system | |
CN102999487A (en) | Digital publishing resource semantic enhanced description system and method thereof | |
Pavlic et al. | Adjective representation with the method Nodes of Knowledge | |
Sudewa et al. | A Bibliometric Analysis of Publication on Novel as Literacy Source | |
Hagerman et al. | Visual analytic system for subject matter expert document tagging using information retrieval and semi-supervised machine learning | |
Wang et al. | SCIEnt: A Semantic-Feature-Based Framework for Core Information Extraction from Web Pages | |
Quantin et al. | Supervised process of un-structured data analysis for knowledge chaining | |
Verhaar | Affordances and limitations of algorithmic criticism | |
Calero Espinosa | Multi-view learning for hierarchical topic detection on corpus of documents |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20160629 |
|