CN114281920A - Theme content structured management method and system based on document set - Google Patents


Publication number
CN114281920A
Authority
CN
China
Prior art keywords
document
theme
module
content
content unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111401724.3A
Other languages
Chinese (zh)
Inventor
白建民
赵呈供
聂莹
许斌
邵翠萍
王秋
袁晓红
林春宏
孙肖媛
李晶
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yunnan Kungang Electronic Information Technology Co ltd
Original Assignee
Yunnan Kungang Electronic Information Technology Co ltd
Application filed by Yunnan Kungang Electronic Information Technology Co ltd
Priority to CN202111401724.3A
Publication of CN114281920A

Abstract

The invention relates to a document-set-based method and system for structured management of theme content. The system comprises a theme content management server and a user client connected to the theme content management server. The theme content management server comprises a theme type definition configuration module, a content unit definition configuration module, a document definition configuration module, a theme instance recording module, a data processing module, a content unit storage module, a document generation module, a retrieval processing module and a content information retrieval display module. The invention targets the structured management and utilization of the content of theme activity document sets and improves on the traditional practice of handling documents as unstructured attachments.

Description

Theme content structured management method and system based on document set
Technical Field
The invention belongs to the field of information technology application, and particularly relates to a theme content structured management method and system based on a document set.
Background
The day-to-day operation and management of an enterprise can be regarded as a set of theme activities, such as institutionalized management affairs in each business field, projects of various kinds, and special tasks. The content of such a theme activity (plans, process records, summaries and so on) is typically described by a collection of documents. These documents usually have a standardized form, that is, specific requirements on content structure, layout and application scenario. Together they reflect the enterprise's operating behavior concretely and in detail, and they are data of considerable value.
In a conventional information application environment, a document is usually written and edited with office software. Its content is treated as unstructured data and enters the information system as an attachment, in which form it is circulated and stored.
In this mode, a document can usually only be opened and browsed manually (without automatic recognition by the system) and one at a time (without batch processing). The specific content "hidden" inside the attachment is therefore difficult to use in more varied ways or to mine for value.
Practical experience shows that traditional document generation, management and utilization in enterprise business activities suffer from several common problems:
Because a document is stored as a whole as unstructured data, it is difficult to perform conventional retrieval and statistics on its content. Besides ordinary words and sentences, document content often contains structured data (such as tables) with even greater retrieval and statistical value.
Some themes involve many documents, long documents, and content that spans different areas of responsibility, so the documents must be completed through team cooperation. This leads to information asymmetry, divergent understanding and inefficient communication.
Different documents in a document set often contain parts with the same meaning. When different people write these parts, the work is repeated and the descriptions diverge.
During document review and revision, several people take part and each saves their own copy, so version control easily becomes confused.
Editing skills and habits vary from person to person, so documents that should follow a standard form end up with inconsistent fonts and layout styles.
Content and data in different documents are interrelated and must correspond to each other. During editing it is easy to update one place and overlook another, and detailed cross-checking is tedious.
Documents with similar or identical content have to be rewritten for different application occasions or recipients. For example, the same project may be submitted for approval both inside the enterprise and to the government, and a different form of application material must be prepared for each.
A system and method that address the above problems are therefore needed.
Disclosure of Invention
In order to solve the above problems, the present invention provides a document-set-based method and system for structured management of theme content. It targets the structured management and utilization of the content of theme activity document sets and offers clear improvements over the traditional mode. The technical scheme of the invention is as follows:
a theme content structured management system based on a document set comprises a theme content management server and a user client connected with the theme content management server;
the theme content management server comprises a theme type definition configuration module, a content unit definition configuration module, a document definition configuration module, a theme instance recording module, a data processing module, a content unit storage module, a document generation module, a retrieval processing module and a content information retrieval display module;
the theme type definition configuration module is respectively connected with the content unit definition configuration module, the document definition configuration module and the theme instance recording module to define the theme type;
the content unit definition configuration module is respectively connected with the theme type definition configuration module, the content unit display module, the data processing module and the theme instance recording module, and is used for defining the semantics, the interface, the storage structure and the retrieval utilization form characteristics of the content unit;
the content unit display module acquires interface definitions from the content unit definition configuration module to construct a form interface for providing user input and viewing content unit information;
the data processing module provides a corresponding data processing program according to the content unit structure type identifier defined in the content unit configuration; the data processing module is respectively connected with the content unit display module, the content unit definition configuration module, the content unit storage module and the structured information storage module and is mainly used for content unit information version control, integral storage, reverse display conversion and structured fragment storage;
the document definition configuration module is respectively connected with the theme type definition configuration module, the theme instance recording module and the document generation module, and defines the name, the format and the content to be filled of each type of document in a document set contained in the theme type;
the theme instance recording module stores theme instance records and provides a theme content management interface;
the theme instance recording module provides an entrance for the content unit display module by constructing a content unit directory list and provides the theme instance information to be inherited by the content unit display module at the same time.
The content unit storage module is respectively connected with the data processing module and the document generation module and stores the whole content unit information input by the user.
The document generation module is respectively connected with the document definition configuration module, the content unit storage module and the document browsing and downloading display module to generate various predefined documents contained in the theme instance;
when synthesizing a document, the document generation module looks up the corresponding document instance according to the parameters passed by the document browsing and downloading display module and judges whether a synthesis operation needs to be executed; if so, it acquires and parses the document style template information for that document type from the document definition configuration module, acquires the corresponding content unit information from the content unit storage module according to the placeholder identifiers set in the document template, fills in the content to generate a document instance, and then displays the document instance to the requesting user through the document browsing and downloading display module.
Furthermore, the document browsing and downloading display module is respectively connected with the theme instance recording module and the document generating module, and receives a request of a user for obtaining a document and displays the document content back to the user; when a document contained in the theme instance is acquired, entering a theme instance interface from the theme instance recording module, entering a document browsing and downloading display module by clicking a document link in a document directory constructed in the theme instance interface, transmitting document related parameters to a document generating module to execute document instance acquisition operation, and feeding back the acquired document instance to a user through the document browsing and downloading display module.
Furthermore, the structured information storage module is respectively connected with the data processing module and the retrieval processing module, stores information fragments formed by splitting the content unit information and provides conventional retrieval support;
when the data processing module processes the input information of the content unit, the content unit information is split into structured information fragments and stored in the structured information storage module; the retrieval processing module responds to the retrieval request of the theme content of the user by retrieving the information segments in the structured information storage module;
the retrieval processing module is respectively connected with the content information retrieval display module and the structured information storage module and used for executing the content retrieval and result feedback of the theme instance;
the content information retrieval display module collects retrieval requests input by users, submits retrieval conditions to the retrieval processing module for analysis processing, and the retrieval processing module retrieves content unit fragment information stored in the structured information storage module according to the retrieval conditions, returns results meeting the retrieval conditions and feeds the results back to the users through the content information retrieval display module;
the content information retrieval display module is respectively connected with the user client and the retrieval processing module and provides theme instance content retrieval support for the user;
the user client accesses the content information retrieval display module to input retrieval conditions, submits the retrieval conditions to the retrieval processing module to execute retrieval processing, and after retrieval is completed, the retrieval processing module feeds back the retrieval results to the user through the content information retrieval display module.
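As a rough illustration of the retrieval flow described above, the following Python sketch searches structured information fragments by keyword. The `Fragment` layout, the `search_fragments` function and the case-insensitive substring match are assumptions made for illustration; the source does not specify how retrieval conditions are evaluated.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fragment:
    """One structured information fragment (hypothetical layout)."""
    theme_id: str    # unique identifier of the theme instance
    unit_id: str     # content unit the fragment was split from
    field_name: str  # field or table column the value belongs to
    value: str       # text of the fragment


def search_fragments(store, keyword):
    """Return fragments whose value contains the keyword (case-insensitive)."""
    needle = keyword.strip().lower()
    if not needle:
        return []  # an empty retrieval condition matches nothing
    return [f for f in store if needle in f.value.lower()]


# Stand-in for the structured information storage module.
store = [
    Fragment("T001", "budget", "item", "Server purchase"),
    Fragment("T001", "budget", "amount", "120000"),
    Fragment("T002", "summary", "text", "Purchase plan approved"),
]

hits = search_fragments(store, "purchase")
print([(f.theme_id, f.unit_id) for f in hits])
```

Because each fragment carries the theme and content unit identifiers, a hit can be traced back to the theme instance and content unit it came from.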
Furthermore, when corresponding configurations are defined in the content unit definition configuration module and the document definition configuration module, the theme type name and the theme type identifier of a defined theme type in the theme type definition configuration module are inherited, so that the theme type to which the configuration belongs is determined;
when the content unit configuration is created in the content unit definition configuration module, the theme type name and the theme type identifier of a defined theme type are inherited from the theme type definition configuration module, and the theme type name and the theme type identifier are used for determining the theme type to which the configuration belongs;
when the document configuration is created in the document definition configuration module, the theme type name and the theme type identifier of a defined theme type are inherited from the theme type definition configuration module, so as to determine the theme type to which the document configuration belongs.
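The inheritance of the theme type name and identifier can be sketched as follows. This is a minimal illustration, not the patented implementation: the class names, field names, the `{{goal}}` placeholder syntax and the example values are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ThemeType:
    type_id: str    # theme type identifier
    type_name: str  # theme type name


@dataclass(frozen=True)
class ContentUnitConfig:
    unit_id: str
    unit_name: str
    structure_type: str   # content unit structure type identifier
    theme_type_id: str    # inherited from the theme type
    theme_type_name: str  # inherited from the theme type


@dataclass(frozen=True)
class DocumentConfig:
    doc_type_id: str
    doc_name: str
    template: str         # document style template with placeholders
    theme_type_id: str    # inherited from the theme type
    theme_type_name: str  # inherited from the theme type


def new_content_unit_config(theme_type, unit_id, unit_name, structure_type):
    """Create a content unit configuration bound to an existing theme type."""
    return ContentUnitConfig(unit_id, unit_name, structure_type,
                             theme_type.type_id, theme_type.type_name)


def new_document_config(theme_type, doc_type_id, doc_name, template):
    """Create a document configuration bound to an existing theme type."""
    return DocumentConfig(doc_type_id, doc_name, template,
                          theme_type.type_id, theme_type.type_name)


project = ThemeType("TT-SCI", "Science and technology project")
unit = new_content_unit_config(project, "goal", "Project goal", "text")
doc = new_document_config(project, "proposal", "Project proposal", "Goal: {{goal}}")
print(unit.theme_type_id, doc.theme_type_id)
```

Copying both values into each configuration record means a configuration can be matched to its theme type by identifier alone, which is what the later directory-building steps rely on.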
Furthermore, the document definition configuration module is respectively connected with the theme type definition configuration module, the theme instance recording module and the document generation module;
when a theme instance record in the theme instance recording module is loaded on the interface, all content unit configurations defined in the content unit definition configuration module under the theme type to which the instance belongs are read to construct a content unit catalog list;
the theme instance recording module is respectively connected with the theme type definition configuration module, the content unit definition configuration module, the document definition configuration module, the content unit display module, the document browsing and downloading display module and the user client, stores theme instance records and provides a theme content management interface;
when a theme instance record in the theme instance recording module is loaded on the interface, all document configurations defined in the document definition configuration module under the theme type to which the instance belongs are read to construct a document directory list;
when the document generation module executes a document synthesis operation, the document style template information defined in the document configuration is read from the document definition configuration module, parsed and filled with content to generate a document instance.
Further, when a theme instance is created in the theme instance recording module, a theme type configuration in the theme type definition configuration module is read, the theme instance record is created using that theme type configuration as a template, and the theme type name and the theme type identifier are inherited;
when a theme instance record in the theme instance recording module is loaded on the interface, all content unit configurations defined in the content unit definition configuration module under the theme type to which the instance belongs are read to construct a content unit catalog list;
when a theme instance record in the theme instance recording module is loaded on the interface, all document configurations defined in the document definition configuration module under the theme type to which the instance belongs are read to construct a document directory list;
the content unit directory list constructed in the theme instance recording interface of the theme instance recording module serves as the entrance to the content unit display module; on entering the content unit display module, key information of the current theme instance, such as the theme type name, theme type identifier, theme name and theme unique identifier, is inherited automatically to establish data association.
The document directory list constructed in the theme instance recording interface of the theme instance recording module serves as the entrance to the document browsing and downloading display module, which automatically inherits the current key information to establish data association.
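The creation of a theme instance and the construction of its directory lists can be sketched in Python as below. This is only an illustration under stated assumptions: the dictionary keys, the use of a random UUID as the theme unique identifier, and the function names `create_theme_instance` and `build_directory` are hypothetical.

```python
import uuid
from dataclasses import dataclass


@dataclass(frozen=True)
class ThemeInstance:
    theme_id: str         # theme unique identifier
    theme_name: str
    theme_type_id: str    # inherited from the theme type configuration
    theme_type_name: str  # inherited from the theme type configuration


def create_theme_instance(theme_type, theme_name):
    """Create an instance record using a theme type configuration as template."""
    return ThemeInstance(
        theme_id=uuid.uuid4().hex,  # assumption: any unique value would do
        theme_name=theme_name,
        theme_type_id=theme_type["type_id"],
        theme_type_name=theme_type["type_name"],
    )


def build_directory(instance, configs):
    """List the configurations of the instance's theme type.

    Each entry carries the key information that the target display
    module inherits when the user follows the entry.
    """
    return [
        {
            "entry_id": cfg["id"],
            "entry_name": cfg["name"],
            "theme_id": instance.theme_id,
            "theme_name": instance.theme_name,
            "theme_type_id": instance.theme_type_id,
            "theme_type_name": instance.theme_type_name,
        }
        for cfg in configs
        if cfg["theme_type_id"] == instance.theme_type_id
    ]


theme_type = {"type_id": "TT-SCI", "type_name": "Science and technology project"}
instance = create_theme_instance(theme_type, "Furnace upgrade")

unit_configs = [
    {"id": "goal", "name": "Project goal", "theme_type_id": "TT-SCI"},
    {"id": "agenda", "name": "Meeting agenda", "theme_type_id": "TT-MEET"},
]
directory = build_directory(instance, unit_configs)
print([entry["entry_name"] for entry in directory])
```

The same `build_directory` function serves for both the content unit directory and the document directory, since both are filtered by the theme type identifier and pass on the same key information.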
Furthermore, the content unit display module receives content information input by a user and submits the content information to the data processing module, the data processing module firstly carries out version control verification, generates a version number after the verification, adopts corresponding data processing operation according to the content unit structure type identification defined in the content unit definition configuration module corresponding to the content unit, integrally stores the content unit information to the content unit storage module, and simultaneously splits the content unit information into structured information fragments and stores the structured information fragments to the structured information storage module;
the data processing module performs version control verification and data packaging on the content unit information input by a user and then stores it in the content unit storage module to form a content unit instance record;
when the document generation module carries out document synthesis operation, the placeholder identifier set in the document template is analyzed, and the corresponding content unit instance record is obtained from the content unit storage module and is used for filling document content.
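The processing sequence described above (version check, whole storage, fragment splitting) can be sketched as follows. The source does not state how version verification works; this sketch assumes an optimistic check in which a submission must be based on the latest stored version. The class and method names are hypothetical, and replacing the fragments of older versions on each submission is likewise an assumption.

```python
class VersionConflict(Exception):
    """Raised when a submission is not based on the latest stored version."""


class ContentUnitStore:
    def __init__(self):
        self.whole = {}      # (theme_id, unit_id) -> list of version records
        self.fragments = []  # structured information fragments

    def latest_version(self, theme_id, unit_id):
        records = self.whole.get((theme_id, unit_id), [])
        return records[-1]["version"] if records else 0

    def submit(self, theme_id, unit_id, base_version, data):
        """Verify the version, store the unit as a whole, then split it."""
        latest = self.latest_version(theme_id, unit_id)
        if base_version != latest:
            raise VersionConflict(
                f"based on version {base_version}, latest is {latest}")
        version = latest + 1

        # Whole storage: keep every version of the content unit.
        self.whole.setdefault((theme_id, unit_id), []).append(
            {"version": version, "data": dict(data)})

        # Fragment storage: keep only fragments of the newest version.
        key = (theme_id, unit_id)
        self.fragments = [f for f in self.fragments
                          if (f["theme_id"], f["unit_id"]) != key]
        for field_name, value in data.items():
            self.fragments.append({
                "theme_id": theme_id, "unit_id": unit_id,
                "field_name": field_name, "value": value,
            })
        return version


store = ContentUnitStore()
v1 = store.submit("T001", "budget", 0, {"item": "Server", "amount": "120000"})
v2 = store.submit("T001", "budget", v1, {"item": "Server", "amount": "150000"})
print(v1, v2, len(store.fragments))
```

Keeping the whole record and the fragments separately mirrors the two uses in the text: the whole record feeds document synthesis, while the fragments feed retrieval.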
The invention also relates to a theme content structured management method based on the document set, which comprises the following steps:
defining a theme type; defining a content unit contained in a theme; defining a document template contained by a theme;
establishing a theme instance; inputting theme-related content through content units in the theme instance; processing content unit data; retrieving and utilizing content information;
and synthesizing the document, and browsing or downloading for utilization.
Further, defining a theme type, and creating form template information of a theme instance; analyzing the content composition of each document in the document set contained in the theme;
splitting and merging according to the principle that the local content meanings are independent and definite and are atomized as much as possible, and defining content units;
tabulating, functionalizing and realizing matched storage processing logic of the defined content units, and establishing a configuration record in a content unit definition configuration module; tabulating, functionalizing and splitting the structured information fragments;
establishing a document configuration record in a document definition configuration module; creating a theme instance of a science and technology project type, and generating an instance record in a theme instance recording module;
when the theme instance is loaded in the interface, retrieving the configuration of the content unit according to the belonging theme type identification and the unique theme identification, constructing a content unit catalog list of the theme instance as an entry for content editing, and further retrieving each unit type instance record to generate state information;
when the theme instance is loaded, retrieving the document definition configuration module according to the belonging theme type identification and the theme unique identification, and constructing a document list contained in the theme instance as an entrance for entering a document browsing and downloading interface;
editing a certain content unit, and when temporarily storing or submitting the certain content unit, firstly, performing version verification and control on the input information of the content unit submitted by a content unit interface; after the version is checked, integrally storing the content unit information in a content unit storage module by adopting different program logics according to the content unit structure type identification of the content unit, and meanwhile, structurally splitting the content unit information into fragment information and storing the fragment information in a structural information storage module;
and after the content unit data processing is finished, updating the current content unit state information of the content unit catalog list in the theme instance.
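A minimal Python sketch of building the content unit catalog list with state information is given below. The record layouts and the status labels "filled" and "empty" are assumptions; the source only says that instance records are retrieved to generate state information.

```python
def build_content_unit_catalog(theme_type_id, theme_id, unit_configs, unit_records):
    """Build the catalog of content units for one theme instance.

    unit_configs: content unit configurations, each with a theme_type_id.
    unit_records: stored content unit instance records, each with a version.
    """
    catalog = []
    for cfg in unit_configs:
        if cfg["theme_type_id"] != theme_type_id:
            continue  # configuration belongs to another theme type
        versions = [
            rec["version"] for rec in unit_records
            if rec["theme_id"] == theme_id and rec["unit_id"] == cfg["unit_id"]
        ]
        catalog.append({
            "unit_id": cfg["unit_id"],
            "unit_name": cfg["unit_name"],
            "status": "filled" if versions else "empty",
            "latest_version": max(versions, default=None),
        })
    return catalog


unit_configs = [
    {"unit_id": "goal", "unit_name": "Project goal", "theme_type_id": "TT-SCI"},
    {"unit_id": "budget", "unit_name": "Budget", "theme_type_id": "TT-SCI"},
    {"unit_id": "agenda", "unit_name": "Agenda", "theme_type_id": "TT-MEET"},
]
unit_records = [
    {"theme_id": "T001", "unit_id": "goal", "version": 1},
    {"theme_id": "T001", "unit_id": "goal", "version": 2},
    {"theme_id": "T002", "unit_id": "budget", "version": 1},
]

catalog = build_content_unit_catalog("TT-SCI", "T001", unit_configs, unit_records)
for entry in catalog:
    print(entry["unit_id"], entry["status"], entry["latest_version"])
```

Re-running the function after each content unit submission gives the updated state information for the catalog list.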
Further, when content information included in various subjects is searched, a search keyword is input and submitted. The retrieval processing module retrieves the content unit structured information fragment stored in the structured information storage module according to the retrieval condition submitted by the front-end retrieval interface and returns the retrieved information fragment;
when accessing the document contained in the theme instance, entering a document browsing and downloading interface through a document directory list contained in a theme instance interface in the theme instance recording module; when a document browsing and downloading interface is opened, transmitting a document type identifier contained in the current document configuration and a theme unique identifier contained in the current theme instance as parameters to a background document generation module for document synthesis judgment and processing;
during document synthesis operation, a document generation module acquires document style template information in corresponding configuration records in a document definition configuration module, analyzes all placeholder identifiers set in the document style template, acquires the latest version storage records of content units corresponding to the placeholder identifiers in a current theme instance one by one according to the placeholder identifiers, and merges the stored content units to the positions indicated by the placeholder identifiers in the document style template; after the document is synthesized, the document is displayed online through a document browsing and downloading interface and downloading support is provided for a user.
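The synthesis operation can be illustrated with the short Python sketch below. The `{{unit_id}}` placeholder syntax, the record layout and the function names are assumptions, since the source does not specify a template format. Leaving a placeholder untouched when no content exists is also a choice made for this sketch.

```python
import re

# Assumed placeholder form: {{unit_id}}, with optional inner whitespace.
PLACEHOLDER = re.compile(r"\{\{\s*([A-Za-z0-9_]+)\s*\}\}")


def latest_content(records, theme_id, unit_id):
    """Return the content of the latest stored version, or None if absent."""
    matching = [r for r in records
                if r["theme_id"] == theme_id and r["unit_id"] == unit_id]
    if not matching:
        return None
    return max(matching, key=lambda r: r["version"])["content"]


def synthesize_document(template, records, theme_id):
    """Fill every placeholder with the latest content of the matching unit."""
    def fill(match):
        content = latest_content(records, theme_id, match.group(1))
        # Keep the placeholder visible when the unit has not been filled in.
        return content if content is not None else match.group(0)
    return PLACEHOLDER.sub(fill, template)


template = "Project goal: {{goal}}\nBudget: {{ budget }}\nRisks: {{risks}}"
records = [
    {"theme_id": "T001", "unit_id": "goal", "version": 1, "content": "Draft goal"},
    {"theme_id": "T001", "unit_id": "goal", "version": 2, "content": "Reduce downtime"},
    {"theme_id": "T001", "unit_id": "budget", "version": 1, "content": "120000 CNY"},
    {"theme_id": "T002", "unit_id": "goal", "version": 1, "content": "Other theme"},
]

document = synthesize_document(template, records, "T001")
print(document)
```

Because the same content unit can be referenced from several templates, content with the same meaning stays identical across all documents generated for a theme instance.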
Compared with the prior art, the invention has the following beneficial effects:
(1) Conventional retrieval and utilization of document content information is supported.
(2) Content creation and information sharing based on team collaboration are supported.
(3) Content with a specific meaning is described once and reused, so that content with the same meaning stays consistent across different documents.
(4) Centralized document version control is provided.
(5) Document output is standardized: document content is separated from document format, and standard-form documents with different layouts and content ranges are generated automatically from the content. Editors can concentrate on writing content, and the requirement for typesetting skills is removed.
(6) Content editing aids and convenience functions are provided, such as information verification and importing a historical version as an editing draft.
Drawings
FIG. 1 is a schematic diagram of the system of the present invention;
FIG. 2 is a schematic flow chart of the method of the present invention;
In FIG. 1:
P1: user client; C1: theme type definition configuration module; C2: content unit definition configuration module; C3: document definition configuration module; T1: theme instance recording module; U1: content unit display module; U2: data processing module; U3: content unit storage module; D1: document generation module; D2: document browsing and downloading display module; S1: structured information storage module; S2: retrieval processing module; S3: content information retrieval display module.
In FIG. 2:
A: configuration phase, defining the theme type; B: configuration phase, defining the content units contained in a theme; C: configuration phase, defining the document templates contained in a theme; D: application phase, creating a theme instance; E: application phase, inputting theme-related content through content units in the theme instance; F: application phase, content unit data processing; G: application phase, content information retrieval and utilization; H: application phase, synthesizing documents for browsing or downloading.
Detailed Description
The technical solutions in the embodiments will be described clearly and completely with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the examples without making any creative effort, shall fall within the protection scope of the present application.
Unless otherwise defined, technical or scientific terms used in the embodiments of the present application should have the ordinary meaning as understood by those having ordinary skill in the art. The use of "first," "second," and similar terms in the present embodiments does not denote any order, quantity, or importance, but rather the terms are used to distinguish one element from another. The word "comprising" or "comprises", and the like, means that the element or item listed before the word covers the element or item listed after the word and its equivalents, but does not exclude other elements or items. "mounted," "connected," and "coupled" are to be construed broadly and may, for example, be fixedly coupled, detachably coupled, or integrally coupled; they may be connected directly or indirectly through intervening media, or they may be interconnected between two elements. "Upper," "lower," "left," "right," "lateral," "vertical," and the like are used solely in relation to the orientation of the components in the figures, and these directional terms are relative terms that are used for descriptive and clarity purposes and that can vary accordingly depending on the orientation in which the components in the figures are placed.
As shown in FIG. 1, the document set-based theme content structured management system of the present embodiment includes a theme content management server, and a user client P1 connected to the theme content management server.
The user client P1 is a networked computer used by a user who participates in the entry, management or utilization of theme content information. It may access the theme content management server through a browser, over a wired or wireless network connection.
The theme content management server comprises a theme type definition configuration module C1, a content unit definition configuration module C2, a document definition configuration module C3, a theme instance recording module T1, a content unit display module U1, a data processing module U2, a content unit storage module U3, a document generating module D1, a document browsing and downloading display module D2, a structured information storage module S1, a retrieval processing module S2 and a content information retrieval display module S3. The display interface may be an operation interface presented on an existing display.
The theme type definition configuration module C1 is respectively connected to the content unit definition configuration module C2, the document definition configuration module C3, and the theme instance recording module T1, and is used for defining the theme type (see method step a for details).
When corresponding configurations are defined in the content unit definition configuration module C2 and the document definition configuration module C3, the theme type name and the theme type identifier of a theme type already defined in the theme type definition configuration module C1 are inherited, so as to determine which theme type the configuration belongs to.
When a theme instance is created in the theme instance recording module T1, a theme type configuration in the theme type definition configuration module C1 is read, a theme instance record is created using that theme type configuration as a template, and the theme type name and the theme type identifier are inherited.
The content unit definition configuration module C2 is respectively connected with the theme type definition configuration module C1, the content unit display module U1, the data processing module U2 and the theme instance recording module T1. It defines the semantics, interface, storage structure, retrieval and utilization form and other characteristics of the content unit (see method step B for details).
When a content unit configuration is created in the content unit definition configuration module C2, the theme type name and theme type identifier of a defined theme type are inherited from the theme type definition configuration module C1 to determine which theme type the configuration belongs to.
The content unit display module U1 retrieves interface definitions from the content unit definition configuration module C2 and uses them to construct a form interface through which users input and view content unit information.
The data processing module U2 provides the corresponding data processing program according to the content unit structure type identification defined in the content unit configuration.
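Selecting a data processing program by content unit structure type identifier can be sketched as a simple dispatch table. The structure type identifiers "text" and "table", the data layouts and the handler names below are hypothetical; the source only states that the program is chosen according to the identifier.

```python
def process_text(data):
    """Split a plain-text content unit into a single fragment."""
    return [{"field_name": "text", "value": data["text"]}]


def process_table(data):
    """Split a table content unit into one fragment per cell."""
    fragments = []
    for row_index, row in enumerate(data["rows"]):
        for column, value in zip(data["columns"], row):
            fragments.append(
                {"row": row_index, "field_name": column, "value": value})
    return fragments


# Dispatch table: structure type identifier -> processing program.
HANDLERS = {
    "text": process_text,
    "table": process_table,
}


def process_content_unit(structure_type, data):
    """Run the processing program registered for the structure type."""
    try:
        handler = HANDLERS[structure_type]
    except KeyError:
        raise ValueError(
            f"no handler for structure type {structure_type!r}") from None
    return handler(data)


table = {
    "columns": ["item", "amount"],
    "rows": [["Server", "120000"], ["License", "30000"]],
}
fragments = process_content_unit("table", table)
print(len(fragments), fragments[0])
```

Supporting a new kind of content unit then only requires registering another handler under a new identifier, without changing the calling code.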
When a theme instance record in the theme instance recording module T1 is loaded on the interface, all content unit configurations defined in the content unit definition configuration module C2 under the theme type to which the instance belongs are read to construct the content unit catalog list.
The document definition configuration module C3 is respectively connected to the theme type definition configuration module C1, the theme instance recording module T1 and the document generation module D1, and is used to define the name, layout and content to be filled for each type of document in the document set included in the theme type (see method step C for details).
When a document configuration is created in the document definition configuration module C3, the theme type name and theme type identifier of a defined theme type are inherited from the theme type definition configuration module C1 to determine which theme type the document configuration belongs to.
When a theme instance record in the theme instance recording module T1 is loaded on the interface, all document configurations defined in the document definition configuration module C3 under the theme type to which the instance belongs are read to construct a document directory list.
The document generating module D1 reads the document style template information defined in the document configuration from the document definition configuration module C3 when performing the document composition operation, for parsing and filling in the content to generate a document instance.
The topic instance recording module T1 is respectively connected to the topic type definition configuration module C1, the content unit definition configuration module C2, the document definition configuration module C3, the content unit display module U1, the document browsing and downloading display module D2, and the user client P1, and is configured to store topic instance records and provide a topic content management interface (see method step D for details).
When a topic instance is created in the topic instance recording module T1, a certain topic type configuration in the topic type definition configuration module C1 is read, and a topic instance record is created using the topic type configuration as a template, and the topic type name and the topic type identifier are inherited.
When a topic instance record in the topic instance recording module T1 is loaded on the interface, all content unit configurations defined in the content unit definition configuration module C2 under the topic type to which the instance belongs are read to construct the content unit catalog list.
When a topic instance record in the topic instance recording module T1 is loaded on the interface, all document configurations defined in the document definition configuration module C3 under the topic type to which the instance belongs are read to construct a document directory list.
The content unit directory list constructed in the topic instance recording interface of the topic instance recording module T1 serves as the entry into the content unit display module U1. On entry, the content unit display module U1 automatically inherits key information from the current topic instance, such as the topic type name, topic type identifier, topic name, and unique topic identifier, to establish data association.
The document directory list (of links) constructed in the topic instance recording interface of the topic instance recording module T1 serves as the entry into the document browsing and downloading display module D2. On entry, the document browsing and downloading display module D2 automatically inherits key information from the current topic instance, such as the topic type name, topic type identifier, topic name, and unique topic identifier, to establish data association.
The content unit display module U1 is connected to the topic instance recording module T1, the content unit definition configuration module C2, and the data processing module U2, respectively, and is used to provide an interface for inputting and viewing content of each part of the topic instance.
The topic instance recording module T1 provides an entry for the content unit display module U1 by constructing a content unit catalog list, and also provides the topic instance information to be inherited by the content unit display module U1.
The content unit definition configuration module C2 provides differentiated interfaces and interface accessibility definitions for the content unit display module U1.
The data processing module U2 receives and processes the content information entered in the content unit display module U1, storing the content as a whole and also splitting it into structured fragments for storage; it also loads stored content unit record data back into the content unit display module U1 for the user to view.
The data processing module U2 is connected to the content unit display module U1, the content unit definition configuration module C2, the content unit storage module U3, and the structured information storage module S1, respectively, and is mainly used for version control of content unit information, overall storage, reverse display conversion, and structured segment storage (see method step F for details).
The content unit display module U1 receives content information entered by a user and submits it to the data processing module U2. The data processing module U2 first performs version control verification and, once verification passes, generates a version number. It then applies the data processing operation that corresponds to the content unit structure type identifier defined for this content unit in the content unit definition configuration module C2: it stores the content unit information as a whole in the content unit storage module U3 and, at the same time, splits the content unit information into structured information fragments and stores them in the structured information storage module S1.
The content unit storage module U3 is connected to the data processing module U2 and the document generation module D1, respectively, and is used for storing the complete content unit information entered by the user.
The data processing module U2 performs version control verification and data encapsulation on the content unit information entered by the user and stores it in the content unit storage module U3 to form a content unit instance record.
When the document generating module D1 performs a document synthesizing operation, it parses the placeholder identifier set in the document template, and obtains the corresponding content unit instance record from the content unit storage module U3, for filling the document content.
The document generating module D1 is respectively connected to the document definition configuration module C3, the content unit storage module U3, and the document browsing and downloading display module D2, and is used for generating (synthesizing) various predefined documents contained in the topic instance (see method step H for details).
When synthesizing a document, the document generating module D1 searches for the corresponding document instance according to the parameters passed by the document browsing and downloading display module D2 and determines whether a synthesis operation needs to be executed. If so, it obtains the document style template information for this type of document from the document definition configuration module C3 and parses it, obtains the corresponding content unit information from the content unit storage module U3 according to the placeholder identifiers set in the document template, and fills in the content to generate the document instance. The document instance is then returned to the requesting user through the document browsing and downloading display module D2.
The document browsing and downloading display module D2 is respectively connected to the topic instance recording module T1 and the document generating module D1, and is configured to receive a request from a user to obtain a document and to display the content of the document back to the user. When a user needs to acquire a document contained in a theme instance, the user enters a theme instance interface from the theme instance recording module T1, enters the document browsing and downloading display module D2 by clicking a document link in a document directory constructed in the theme instance interface, transfers document related parameters to the document generating module D1 to execute document instance acquisition operation, and feeds back the acquired document instance to the user through the document browsing and downloading display module D2.
The structured information storage module S1 is connected to the data processing module U2 and the retrieval processing module S2, respectively, and is configured to store information fragments formed by splitting content unit information, and provide conventional retrieval support.
When the data processing module U2 processes the content unit input information, it splits the content unit information into structured information fragments and stores them in the structured information storage module S1. The retrieval processing module S2 responds to the user's topic content retrieval request by retrieving pieces of information in the structured information storage module S1.
The retrieval processing module S2 is respectively connected to the content information retrieval display module S3 and the structured information storage module S1, and is configured to perform content retrieval and result feedback of the topic instances.
The content information retrieval display module S3 collects the retrieval request input by the user, submits the retrieval conditions to the retrieval processing module S2 for analysis, and the retrieval processing module S2 retrieves the content unit fragment information stored in the structured information storage module S1 according to the retrieval conditions, returns the result meeting the retrieval conditions, and feeds back the result to the user through the content information retrieval display module S3.
The content information retrieval display module S3 is respectively connected to the user client P1 and the retrieval processing module S2, and is configured to provide the user with topic instance content retrieval support.
The user client P1 accesses the content information search display module S3 to input search conditions, submits the search conditions to the search processing module S2 to execute search processing, and the search processing module S2 feeds back the search results to the user through the content information search display module S3 after the search is completed.
As shown in FIG. 2, based on the above system, the method for structured management of topic contents based on document set of the present embodiment comprises the following steps:
A. Configuration phase-defining the topic type;
A1, defining and identifying the subject activity type. Identify a type of real-world subject business activity (such as a certain type of project) and define key attributes for it, such as a subject type name and a subject type identifier.
A2, creating the theme type configuration. The theme type configuration is a data record in which the theme type name and the theme type identifier (used for establishing data association) are defined as key attributes. The configuration also contains form template information: theme-related attribute fields (such as theme name, theme manager, and theme creation time) are defined in the form template as required and serve as the input template when a theme instance is created. Different theme types have different descriptive attributes, so their form templates also differ, but the common attributes must include a theme name field and a theme unique identifier field. The theme name is a description of the theme used for display (such as "xxx project"); the theme unique identifier is a unique key used mainly for data association and retrieval, and is generally generated automatically. Once the theme type is determined, the content it contains (a specific document set) is also determined. The theme type configuration organizes the other sub-configurations under the theme type (the content unit configurations and document configurations contained in the theme) and also serves as the template from which instances of the theme type (such as a specific project theme) are created. The sub-configurations and theme instances under a theme type inherit the theme type identifier from the theme type configuration to establish association.
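As an illustration only, the theme type configuration record of step A2 could be sketched as follows. The class names, field names (`theme_name`, `theme_uid`) and the helper `new_sub_config` are hypothetical and are not specified by the method; the sketch only shows that the two common fields are always present in the form template and that sub-configurations inherit the theme type identifier.

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical names for the two common fields every form template must contain.
REQUIRED_FIELDS = ("theme_name", "theme_uid")


@dataclass
class ThemeTypeConfig:
    type_name: str                      # theme type name, used for display
    type_id: str                        # theme type identifier, used for association
    form_fields: List[str] = field(default_factory=list)  # form template fields

    def __post_init__(self):
        # Guarantee that the common fields are part of the form template.
        missing = [name for name in REQUIRED_FIELDS if name not in self.form_fields]
        self.form_fields = missing + self.form_fields


@dataclass
class SubConfig:
    name: str            # e.g. a content unit or document configuration name
    theme_type_id: str   # inherited from the theme type configuration


def new_sub_config(theme_type: ThemeTypeConfig, name: str) -> SubConfig:
    # A sub-configuration inherits the theme type identifier to establish association.
    return SubConfig(name=name, theme_type_id=theme_type.type_id)
```

A sub-configuration created this way can later be found again by filtering on `theme_type_id`, which is how the catalog and directory lists in the later steps are assumed to be built.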
B. Configuration phase-defining content units contained by a topic;
B1, dividing the content units according to the local content semantics of the subject. Analyze all document components (chapters, paragraphs, and the like) in the document set that the subject contains, and split them into content units on the principle that each unit has an independent, clear meaning and is as atomic as possible. After all documents have been analyzed and split, remove duplicate content units that have the same meaning. Define an appropriate content unit semantic name and content unit semantic type identifier for each content unit.
B2, defining the data structure type of each content unit according to its data characteristics. The structure type describes the interface structure, the storage structure, and the way the information in the content unit is split and utilized. Generally, the structure types fall into 4 categories, described as follows:
Basic information field type-a set of independent fields that describe attribute information of the subject (e.g., project name, project leader, project start and end dates, and project undertaking unit). The interface is a form containing fields of various types (e.g., single-line text field, multi-line text field, number field, date and time picker, drop-down selection box, radio button, and multiple-selection button). Such information can be stored together in a data table associated with the subject, and every field can be used for topic retrieval. Basic information fields differ between topic types, so the storage data table for each topic type must be defined separately.
Plain text type-used to express text paragraphs without obvious structural features. It is one of the most common structure types and can be used universally across topics (because descriptive paragraphs make up most of a document). Its characteristic is that the text may be long. The interface is a single multi-line text edit box. For storage, the content is first stored as a whole for convenient overall use, for example in a NoSQL database, a text file in the file system, or a CLOB field of a relational database. In addition, so that the text can be searched by conventional keyword matching, it can be segmented at a specified length (such as 300 characters) into several text segment records stored in an ordinary relational database table. To prevent words and sentences cut at a segmentation point from affecting retrieval matching, each non-initial segment can start several characters (such as 20 characters) before the segmentation point, so that adjacent segments overlap by a certain length around the segmentation point.
Item text type-used to express text composed of itemized entries (e.g., contract terms). It is also a common structure type that can be used universally across topics. The interface is a dynamic table with expandable rows, each row providing a multi-line text edit box for entry and display. For storage, the content is encapsulated as a whole, structured by row (e.g., in XML format). At the same time, it is split by row into text fragments that are stored in a relational database to support conventional retrieval.
Two-dimensional table type-used to express regular two-dimensional tabular content contained in a document (e.g., a member information table, schedule table, or budget table in a project document). Such structures are generally not universal and require separately defined database storage tables. The interface is a table with specific data columns and dynamically expandable rows. For storage, the whole content is encapsulated in a structured form with two levels: the rows at the first level and the column data within each row at the second level. At the same time, the content is split by row into independent data records stored in the corresponding relational database table to support conventional retrieval.
Other structure types can be added as necessary. After the structure types are determined, a content unit structure type identifier is assigned to each content unit to distinguish it. Content units that belong to a common structure type may share one content unit structure type identifier; non-universal content units each need their own. The identifier is mainly used during background data processing to apply different data storage encapsulation and information fragment splitting operations to different content units.
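The overlapping segmentation described for the plain text type can be sketched as below. This is a minimal illustration under stated assumptions: the function name `split_with_overlap` and the `(sequence number, text)` return shape are hypothetical, and the defaults of 300 and 20 characters are simply the example values given above.

```python
from typing import List, Tuple


def split_with_overlap(text: str, length: int = 300, overlap: int = 20) -> List[Tuple[int, str]]:
    """Split text at every `length` characters into numbered segments.

    Each non-initial segment starts `overlap` characters before its
    segmentation point, so adjacent segments share text around the cut.
    """
    if length <= 0 or not 0 <= overlap < length:
        raise ValueError("length must be positive and 0 <= overlap < length")
    segments = []
    for seq, cut in enumerate(range(0, len(text), length)):
        start = max(0, cut - overlap)       # the first segment has nothing to overlap
        segments.append((seq, text[start:cut + length]))
    return segments
```

With `length=4` and `overlap=1`, the text `"abcdefghij"` yields the segments `"abcd"`, `"defgh"` and `"hij"`, so a keyword such as `"de"` that straddles the first segmentation point still matches one segment in full.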
B3, defining auxiliary support functions for content units. After the structure type of a content unit is determined, the auxiliary support functions available for it can be determined from its characteristics. These are roughly divided into general support functions and personalized support functions, exemplified as follows:
A general support function is an auxiliary function that can be shared by all kinds of content units, for example importing the content of the previous version as an editing draft to make revision easier.
A personalized support function is provided for the data characteristics of an individual content unit and is generally used for quick entry, data verification, data association checks, and the like. For example, a personnel picker can be provided for entering the 'project member' table: personnel are selected from the organization structure of the information system, and information such as their gender, age, unit, and title is filled in automatically, which simplifies entry and reduces errors. As another example, a budget amount check can be provided for the expense budget table to detect whether budget values conform to the funding plan.
B4, implementing the content units as forms and functions. The analysis results obtained in steps B1, B2 and B3 are implemented in the information system as form designs, supporting logic functions, and database storage models. Once implemented, a content unit configuration mainly contains the following information:
The topic type name and topic type identifier of the topic type to which the content unit belongs. A topic instance uses this information to retrieve its available content units and list them in a catalog-like content presentation structure. When a content unit instance is created in a topic instance, this information is also inherited by the content unit instance.
The content unit semantic name and content unit semantic type identifier of the content unit. The content unit semantic name is used for display, so that users understand what content the unit represents.
The content unit semantic type identifier is the unique key that identifies a content unit type and is mainly used for establishing data association. This information is inherited when a content unit instance is created in a topic instance.
The content unit form interface. It serves as the template for creating content unit instances and includes the auxiliary support functions related to content entry.
The content unit structure type identifier of the content unit. The background data processing program uses it to select the processing logic applied to the content data recorded in the content unit instance.
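The items listed above can be pictured as one configuration record per content unit. The following is only a sketch: the class `ContentUnitConfig`, its attribute names, and the two helper functions are hypothetical, and the form interface is reduced to a plain string for brevity.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class ContentUnitConfig:
    theme_type_name: str     # topic type the unit belongs to (display)
    theme_type_id: str       # topic type identifier (association)
    semantic_name: str       # shown to users in the catalog list
    semantic_type_id: str    # unique key of the content unit type
    structure_type_id: str   # selects the background processing logic
    form_template: str       # stand-in for the form interface definition


def units_for_theme_type(configs: List[ContentUnitConfig], theme_type_id: str) -> List[ContentUnitConfig]:
    # A topic instance lists the content units configured for its topic type.
    return [c for c in configs if c.theme_type_id == theme_type_id]


def new_unit_instance(config: ContentUnitConfig, theme_uid: str) -> Dict[str, Optional[str]]:
    # The instance inherits the identifying attributes of its configuration.
    return {
        "theme_uid": theme_uid,
        "theme_type_name": config.theme_type_name,
        "theme_type_id": config.theme_type_id,
        "semantic_name": config.semantic_name,
        "semantic_type_id": config.semantic_type_id,
        "structure_type_id": config.structure_type_id,
        "content": None,  # filled in later through the form interface
    }
```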
C. Configuration phase-defining document templates contained by the topic;
C1, creating a configuration data record that represents the document configuration in the system. When the configuration is created, it inherits the topic type name and topic type identifier of the topic type to which it belongs; these determine which topic type the document configuration belongs to and allow a topic instance to retrieve configurations by these attributes and build a document directory list.
The document type name and document type identifier of the document are also defined. The document type name is used for display so that users understand the business meaning of the document type; the document type identifier serves as the key that uniquely determines the document type. When a document instance is generated from a document configuration, the instance inherits the topic type name, topic type identifier, document type name, and document type identifier.
C2, defining the layout structure of the document. The layout structure is defined as a template, that is, a document style frame without content, generally made from an existing real-world document. The template can be written in HTML + CSS, a Word file can be used directly as the template, or other means can be adopted, provided that a corresponding technology exists to describe the style and to read and manipulate the template information. Once defined, the document structure template is associated with the document configuration record (or stored directly in it), so that the document style template can be obtained by accessing the document configuration record.
C3, defining the content to be filled in and its position in the document layout. Placeholder identifiers are set at each position in the document style template where content is to be filled in. When the document is actually synthesized, the processing program replaces each placeholder identifier with the actual content it represents, forming a document instance containing the actual content. A placeholder identifier is defined in relation to the content unit semantic type identifier: when the content to be filled in is the information of a whole content unit, the content unit semantic type identifier can be used directly as the placeholder identifier. The data of a 'basic information field' type content unit is usually not filled into the document as a whole; instead, individual fields are filled in at different positions of the document as needed, in which case the placeholder identifier can be defined in the format 'content unit semantic type identifier_field name'.
The data processing program distinguishes the two cases according to this convention. Once placeholder identifiers have been set for each piece of content in the document style template, the content units of the topic that the document contains (or uses) are in effect determined.
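The placeholder convention of step C3 can be illustrated with a small template-filling sketch. The `{{identifier}}` delimiter syntax is an assumption made for this example (the method does not prescribe how placeholders are marked), and the function names are hypothetical. Whole content units are keyed by their semantic type identifier; fields of a basic information unit are keyed as `identifier_fieldname`.

```python
import re
from typing import Dict, Union

# Assumed placeholder syntax: {{identifier}}; the source does not fix a syntax.
PLACEHOLDER = re.compile(r"\{\{\s*([A-Za-z0-9_]+)\s*\}\}")

UnitContent = Union[str, Dict[str, str]]


def build_lookup(units: Dict[str, UnitContent]) -> Dict[str, str]:
    """Map placeholder identifiers to the text that replaces them."""
    lookup = {}
    for unit_id, content in units.items():
        if isinstance(content, dict):
            # Basic information fields are addressed one field at a time.
            for field_name, value in content.items():
                lookup[f"{unit_id}_{field_name}"] = str(value)
        else:
            # Any other unit is filled in as a whole.
            lookup[unit_id] = str(content)
    return lookup


def fill_template(template: str, units: Dict[str, UnitContent]) -> str:
    lookup = build_lookup(units)
    # Unknown placeholders are left untouched so that gaps stay visible.
    return PLACEHOLDER.sub(lambda m: lookup.get(m.group(1), m.group(0)), template)
```

For example, filling `"<h1>{{basic_name}}</h1><p>{{background}}</p>"` with `{"basic": {"name": "Project X"}, "background": "Some text."}` gives `"<h1>Project X</h1><p>Some text.</p>"`. The sketch does not escape the inserted text for HTML, which a real template processor would need to consider.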
D. Application phase-creation of a theme instance;
D1, creating a theme instance record from the theme type configuration. The instance record automatically inherits the theme type name and theme type identifier from the theme type configuration. The user fills in the fields of the form interface generated from the form template defined in the theme type configuration; a theme name must be filled in and a unique theme identifier must be generated, which can be produced automatically by a program (e.g., using a UUID) to avoid duplication.
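A minimal sketch of step D1 follows, assuming the theme type configuration and the form input are plain dictionaries. The key names (`type_name`, `type_id`, `theme_name`, `theme_uid`) and the function name are hypothetical; only the use of a UUID for the unique identifier comes from the step itself.

```python
import uuid
from typing import Dict


def create_theme_instance(type_config: Dict[str, str], form_values: Dict[str, str]) -> Dict[str, str]:
    """Create a theme instance record from a theme type configuration."""
    name = form_values.get("theme_name", "").strip()
    if not name:
        raise ValueError("a theme name is required")
    record = dict(form_values)                       # attribute fields from the form
    record["theme_name"] = name
    record["theme_type_name"] = type_config["type_name"]  # inherited
    record["theme_type_id"] = type_config["type_id"]      # inherited
    record["theme_uid"] = str(uuid.uuid4())          # generated automatically
    return record
```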
D2, after the theme instance record is created, the content unit configurations are retrieved by the theme type identifier inherited by the instance record, obtaining all content units belonging to that theme type, and a content unit catalog list is constructed in the theme instance interface. Besides the content unit name, the catalog list shows the status information of each content unit (such as 'no content', 'editing', or 'completed', creation time, creator, and version number) and the available operations (such as new, continue editing, and revise). The status information and available operations are tied to the version control mechanism, briefly described as follows:
When the content of a content unit has not been created, the state is 'no content' and the available operation is 'new'; clicking it opens the interface for creating content.
When content has been created but only saved temporarily and not formally submitted, the state is 'editing' and the available operation is 'continue editing'; clicking it opens the temporarily saved record so that editing can continue.
When content has been created and formally submitted, the state is 'completed' and the finalized content can no longer be modified. The available operation is 'revise'; clicking it opens the interface for creating content so that a new version can be created, and the auxiliary function of importing the previous version as a draft can be used to bring in the existing content automatically. A content unit may be revised multiple times, and every finalized version is kept as history for future reference.
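The three cases above amount to a small mapping from the latest version record of a content unit to its displayed state and available operation. The sketch below assumes each version record is a dictionary with `version` and `state` keys and uses the state labels 'editing' and 'completed'; these names and the function `catalog_entry` are illustrative assumptions.

```python
from typing import Dict, List, Optional, Union

Record = Dict[str, Union[int, str]]


def catalog_entry(unit_name: str, records: List[Record]) -> Dict[str, Optional[Union[int, str]]]:
    """Derive the catalog line for one content unit from its version records."""
    if not records:
        # Nothing has been created yet.
        return {"unit": unit_name, "state": "no content", "operation": "new", "version": None}
    latest = max(records, key=lambda r: r["version"])
    if latest["state"] == "editing":
        operation = "continue editing"   # temporarily saved, not yet submitted
    else:
        operation = "revise"             # finalized versions can only be revised
    return {
        "unit": unit_name,
        "state": latest["state"],
        "operation": operation,
        "version": latest["version"],
    }
```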
E. Application phase-inputting topic-related content by content unit in topic instance;
E1, subject participants enter the subject instance interface and create each block of content contained in the subject according to the status information and available operations shown in the content unit catalog list. When there are several participants, they can create content for different content units at the same time.
E2, when a participant performs an editing operation on a content unit (such as new, continue editing, or revise), the system uses the form interface defined in the content unit configuration as the content entry interface and provides the auxiliary form functions defined in that configuration to support entry.
F. Application phase-content unit data processing;
F1, performing version verification and control. When a participant temporarily saves or submits an edited content unit, the background data processing module first performs version verification and control on it.
The specific rule is as follows. Using the unique subject identifier and the content unit semantic type identifier inherited by the content unit, form a retrieval condition and query all existing content unit instance records in the current subject instance that have the same content unit semantic type identifier as the current content unit. If no existing record is retrieved, the current operation creates the first version; saving is allowed and the version number 1 is generated (to ease system management and simplify manual operation, version numbers are positive integers generated automatically by the system, and successive versions are numbered by adding 1). If existing records are retrieved and the record with the largest version number is in the 'completed' state, the current operation is a legal revision; saving is allowed and a new version number is generated by adding 1 to the existing largest version number. If the record with the largest version number is in the 'editing' state, the previous version has not yet been submitted and finalized, so the current operation is an illegal revision; saving is not allowed and a prompt is returned informing the editor that an existing version of the current content unit is being edited. This mainly resolves conflicts when several participants edit the same content unit at the same time.
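The version rule can be sketched as a single function. It assumes the caller has already queried the existing records for the same unique subject identifier and content unit semantic type identifier, and that each record is a dictionary with `version` and `state` keys; the names `next_version` and `EditConflict` are hypothetical. The sketch covers only the creation of a new version record, as the rule above describes.

```python
from typing import Dict, List, Union

Record = Dict[str, Union[int, str]]


class EditConflict(Exception):
    """Raised when the latest version of the content unit is still being edited."""


def next_version(existing: List[Record]) -> int:
    """Return the version number for a new record, or raise EditConflict."""
    if not existing:
        return 1                                  # first version
    latest = max(existing, key=lambda r: r["version"])
    if latest["state"] == "completed":
        return int(latest["version"]) + 1         # legal revision
    raise EditConflict(
        f"version {latest['version']} of this content unit is still being edited"
    )
```

A caller would catch `EditConflict` and show its message as the prompt returned to the editor.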
F2, storing the content unit information as a whole. After the version check passes, the content unit information is stored as a whole, in order to preserve the integrity of the information and make it easy to fetch when documents are later synthesized. Because of structural differences, the whole-storage scheme may differ between content units; the data processing program identifies the content unit structure type identifier and applies the corresponding program logic.
F3, storing the structured split information of the content unit. To support conventional retrieval, the content unit information is also split in a structured way into fragments that are stored in a relational database. Because of structural differences, the splitting scheme may likewise differ between content units; the data processing program identifies the content unit structure type identifier and applies the corresponding program logic.
F4, after content unit data processing is complete, the status information of the current content unit in the content unit catalog list of the subject instance is updated.
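Steps F2 and F3 both select program logic by the content unit structure type identifier. One way to picture this is a handler registry, sketched below. The identifiers `plain_text`, `item_text` and `table_2d` are invented for the example, JSON is used for whole-content encapsulation purely for brevity (the description mentions XML as one option), and the plain text handler omits the overlap between segments.

```python
import json
from typing import Any, Callable, Dict, List, Tuple


def _plain_text(content: str) -> Tuple[str, List[Any]]:
    # Whole text stored as is; fragments are fixed-length segments.
    return content, [content[i:i + 300] for i in range(0, len(content), 300)]


def _item_text(rows: List[str]) -> Tuple[str, List[Any]]:
    # Whole content encapsulated by row; one fragment per row.
    return json.dumps(rows, ensure_ascii=False), list(rows)


def _table_2d(rows: List[Dict[str, Any]]) -> Tuple[str, List[Any]]:
    # Rows at the first level, column data within each row at the second.
    return json.dumps(rows, ensure_ascii=False), [dict(r) for r in rows]


HANDLERS: Dict[str, Callable[[Any], Tuple[str, List[Any]]]] = {
    "plain_text": _plain_text,
    "item_text": _item_text,
    "table_2d": _table_2d,
}


def process_content_unit(structure_type_id: str, content: Any) -> Dict[str, Any]:
    """Return the whole-storage form and the numbered fragments of a content unit."""
    try:
        handler = HANDLERS[structure_type_id]
    except KeyError:
        raise ValueError(f"no handler registered for {structure_type_id!r}") from None
    whole, fragments = handler(content)
    return {
        "whole": whole,
        "fragments": [{"seq": i, "data": f} for i, f in enumerate(fragments)],
    }
```

Adding a new structure type then only requires registering one more handler, which matches the statement that other structure types can be extended as necessary.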
G. Application phase-retrieving and utilizing content information;
G1, when a user needs to search the content information contained in various subjects, the user enters and submits search keywords through the 'content information retrieval display module' provided by the system. This module is a form independent of any subject (it does not belong to a particular subject); it provides the content information retrieval function and can retrieve the content of all subject instances. Because the content data structures differ between theme types, the retrieval conditions available in the retrieval interface change dynamically. They are divided into two parts, 'general conditions' and 'theme personalized conditions'. The 'general conditions' can be used to retrieve attributes and content information common to all theme types (such as theme name, theme manager, creation time, and content text). When a theme type is selected, the retrieval interface loads the 'theme personalized conditions', which retrieve the data split from the personalized content unit information defined by that theme type.
G2, retrieval processing. The background retrieval processing module searches the content unit structured information fragments stored in the relational database according to the retrieval conditions submitted by the front-end retrieval interface and returns the matching fragments. An information fragment is content information split off according to certain rules, when a content unit is stored, to support retrieval.
G3, viewing the retrieval results and navigating to associated information. The direct result of a retrieval is a set of information fragments of subject content. Each fragment contains the content that matched the retrieval condition, the location of the fragment (specifically, the name of the subject it belongs to, the name of the content unit it belongs to, and the sequence number assigned to it when the content unit was split), and links to associated information (a link to the content unit record for viewing the unit's whole content, and a link to the subject record for viewing the complete subject information). The main reasons for retrieving on content unit information fragments and returning them directly as results are as follows:
Information fragments are small and can be stored directly in a relational database, so conventional retrieval techniques deliver good retrieval performance and user experience.
Structured data in the content (table data, attribute fields, etc.) can be used for search and comparison across theme instances (e.g., finding all projects in which a given person has participated as a project member) or to support statistics over multiple theme instances (e.g., counting all projects with a budget over 500 million).
A theme may contain a large amount of content (the original documents contained in the theme can be voluminous). If a search returned the whole theme as the result, the user would often have difficulty locating where the search keyword appears in the theme content, and therefore in judging whether the result is accurate. Providing hierarchical access to information in a "fragment + navigation link" form solves this problem well.
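The "fragment + navigation link" result form can be illustrated with a minimal Python sketch. The `Fragment` class, the `search_fragments` function, and the link formats are hypothetical names chosen for illustration; the patent does not specify a storage schema or URL scheme, and a real system would query the relational database rather than an in-memory list.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Fragment:
    theme_name: str  # name of the theme the fragment belongs to
    unit_name: str   # name of the content unit the fragment belongs to
    seq_no: int      # sequence number assigned when the unit was split
    text: str        # fragment content


def search_fragments(fragments: List[Fragment], keyword: str) -> List[Dict]:
    """Return matching fragments with position information and navigation links."""
    results = []
    for f in fragments:
        if keyword in f.text:
            results.append({
                "match": f.text,
                "position": (f.theme_name, f.unit_name, f.seq_no),
                # Hypothetical link formats, for illustration only.
                "unit_link": f"/themes/{f.theme_name}/units/{f.unit_name}",
                "theme_link": f"/themes/{f.theme_name}",
            })
    return results
```

Each result stays small enough to display directly, while the two links let the user step up to the whole content unit or the whole theme.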
H. Application stage: synthesizing documents, and browsing or downloading them for use.
H1. The user accesses the document directory contained in the theme instance. When the theme instance interface is loaded, the theme type identifier of the current theme is used to retrieve all document configurations containing that identifier, that is, all document configurations belonging to the theme, and these are built into a document directory. Each document in the directory is a clickable link. Clicking it opens the document browsing and downloading display module and passes the document type identifier contained in the current document configuration and the theme unique identifier contained in the current theme instance as parameters to the back-end document generation module for processing.
H2. Automatically synthesizing the document and generating a document record. The document generation module receives the document acquisition request sent by the front-end interface and, using the theme unique identifier and document type identifier parameters, queries the current theme for a document instance record that matches the document type identifier. Processing falls into three cases, as follows:
If no document instance is found, the document has not yet been generated (synthesized), and document synthesis is performed.
If a document instance is found, the document under this theme has been generated before. The module then checks whether the generation time of the document record is later than the creation time of every content unit contained in the document. If so, no content has been updated since the document was created, and the stored document content is returned to the user directly.
If a document instance is found and the generation time of the document record is earlier than the creation time of one or more content units contained in the document, content has been updated since the document was created (the local content represented by those content units has been revised). Document synthesis must be performed again, and the latest version of the document content is generated and returned to the user.
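The three cases above reduce to a single decision: synthesize when no document record exists, or when any contained content unit is newer than the record. A minimal Python sketch follows; the function name `needs_synthesis` and its parameters are hypothetical. Treating equal timestamps as up to date is an assumption, since the text does not cover that case.

```python
from datetime import datetime
from typing import Iterable, Optional


def needs_synthesis(document_generated_at: Optional[datetime],
                    unit_created_at: Iterable[datetime]) -> bool:
    """Decide whether the document must be (re)synthesized.

    document_generated_at is None when no document instance record exists.
    unit_created_at holds the creation time of the latest version of each
    content unit contained in the document.
    """
    if document_generated_at is None:
        return True  # case 1: the document has never been generated
    # case 3 if any unit is newer than the document, otherwise case 2
    return any(t > document_generated_at for t in unit_created_at)
```

When the function returns False, the stored document instance can be returned to the user as is.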
The steps of the document synthesis operation are as follows:
(1) Acquire the document style template information from the document configuration.
(2) Parse all placeholder identifiers set in the document style template. For each placeholder identifier in turn, acquire the latest-version storage record of the corresponding content unit in the current theme instance, and merge the stored content unit content into the position indicated by the placeholder identifier in the document style template.
(3) After all placeholder identifiers have been parsed and filled with content, store the generated document information, including styles and content, as a document instance record.
H3. After the document is synthesized, the document browsing and downloading display module lets the user browse the document online or download it.
A specific example of this embodiment includes the following steps:
Step A. Configuration stage: defining a theme type.
A theme type is defined in the theme type configuration module C1. The theme type name is defined as "science and technology project" and the theme type identifier as "SciTeTecProject". The form template information used to create theme instances is defined and described with an XML structure, for example:
“<form>
<field><lable>Theme name</lable><name>subjectName</name><type>String</type><ui>input</ui>...</field>
<field><lable>Theme unique identifier</lable><name>subjectId</name><type>String</type><ui>auto</ui>...</field>
<field><lable>Theme manager</lable><name>subjectAdmin</name><type>OrgElement</type><ui>selRy</ui>...</field>
<field><lable>Creation time</lable><name>createTime</name><type>DateTime</type><ui>selDate</ui>...</field>
...
</form>”
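As a rough illustration of how such a form template might be read by program code, the following Python sketch parses a simplified template into a list of field descriptions. The `parse_form_template` function and the two-field sample are hypothetical; the sample omits the elided parts of the template above and keeps the `lable` element name as spelled in its closing tags.

```python
import xml.etree.ElementTree as ET
from typing import Dict, List

# Simplified sample: two fields only, with no elided content.
FORM_TEMPLATE = """
<form>
  <field><lable>Theme name</lable><name>subjectName</name><type>String</type><ui>input</ui></field>
  <field><lable>Creation time</lable><name>createTime</name><type>DateTime</type><ui>selDate</ui></field>
</form>
"""


def parse_form_template(xml_text: str) -> List[Dict[str, str]]:
    """Turn each <field> element into a dict keyed by its child tag names."""
    root = ET.fromstring(xml_text.strip())
    fields = []
    for field in root.findall("field"):
        fields.append({child.tag: (child.text or "").strip() for child in field})
    return fields
```

A form renderer could then iterate over the returned list and choose an input control for each field from its `ui` value.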
The theme type configuration records generated are shown in the following table:
TABLE 1
Figure BDA0003364612410000131
Figure BDA0003364612410000141
Step B. Configuration stage: defining the content units contained in the theme.
The content composition of each document in the document set contained in the theme is analyzed (only some documents and representative content are taken as examples here). The documents include a project filing application and a project implementation plan, and the main content of each document is as follows:
The project filing application mainly includes the project basic information (project name, applying unit, project leader, application date, undertaking unit, cooperating unit, project implementation period, total project expense, etc.), the reason for project filing, and the project summary.
The project implementation plan mainly includes the project background, project purpose and significance, project research content, project assessment indexes, project schedule, project expense budget, project members and responsibilities, etc.
The content is split and merged on the principle that each local piece of content should be semantically independent, clear, and as atomic as possible. The content units are defined as follows:
The "project basic information" content unit contains independent fields such as project name, applying unit, project leader, application date, undertaking unit, cooperating unit, project implementation period, and total project expense. The content unit semantic name is defined as "project basic information", the content unit semantic type identifier as "stp_baseinfo", and the content unit structure type identifier as "stp_baseinfo".
The "reason for project filing" content unit is a general text paragraph. The content unit semantic name is defined as "reason for project filing", the content unit semantic type identifier as "stp_lxyy", and the content unit structure type identifier as "stp_text".
The "project background" content unit is a general text paragraph and can be used to synthesize the "project summary". The content unit semantic name is defined as "project background", the content unit semantic type identifier as "stp_xmbj", and the content unit structure type identifier as "stp_text".
The "project purpose" content unit is a general text paragraph and can be used to synthesize the "project summary". The content unit semantic name is defined as "project purpose", the content unit semantic type identifier as "stp_xmmd", and the content unit structure type identifier as "stp_text".
The "project significance" content unit is a general text paragraph and can be used to synthesize the "project summary". The content unit semantic name is defined as "project significance", the content unit semantic type identifier as "stp_xmyy", and the content unit structure type identifier as "stp_text".
The "project research content" content unit is a general text paragraph. The content unit semantic name is defined as "research content", the content unit semantic type identifier as "stp_yjnr", and the content unit structure type identifier as "stp_text".
The "assessment index" content unit is general itemized text. The content unit semantic name is defined as "assessment index", the content unit semantic type identifier as "stp_khzb", and the content unit structure type identifier as "stp_textclass".
The "project schedule" content unit is a non-general two-dimensional table in which each row records one schedule entry, with columns such as "start time", "end time", and "main work to be carried out". The content unit semantic name is defined as "schedule", the content unit semantic type identifier as "stp_jdjh", and the content unit structure type identifier as "stp_schedule".
The "project expense budget" content unit is a non-general two-dimensional table in which each row records one budget subject, with columns such as "budget subject", "budget amount", and "description". The content unit semantic name is defined as "budget", the content unit semantic type identifier as "stp_jfys", and the content unit structure type identifier as "stp_funds".
The "project members and responsibilities" content unit is a non-general two-dimensional table in which each row records the information of one project member, with columns such as "name", "gender", "age", "technical title", "work unit", and "division of responsibilities". The content unit semantic name is defined as "project members and responsibilities", the content unit semantic type identifier as "stp_cyjzz", and the content unit structure type identifier as "stp_member".
The defined content units are implemented as forms, given their functions, and paired with matching storage processing logic. Configuration records are then created in the content unit definition configuration module C2, as shown in the following table:
TABLE 2
Figure BDA0003364612410000151
The "belonging theme type name" and "belonging theme type identifier" column data in the configuration table are inherited from the theme type configuration and determine the theme type to which the content unit configuration belongs.
The "content unit structure type identifier" in the configuration table serves as the keyword that maps to the back-end data processing logic. During back-end data processing, this keyword determines both the overall storage mode and the way the content is split into structured fragment information. The program that implements the processing logic is not specifically limited, provided that it implements storage and supports reverse parsing for display; it is not described in detail here.
When the content units are implemented as forms, given their functions, and split into structured information fragments, they take the following forms, described briefly:
The "project basic information" content unit is implemented as a form containing the various attribute field entries. No structured splitting is needed.
The "reason for project filing" content unit is implemented as a form that supports multi-line text entry. During structured splitting, the text is split into fragments of 300 characters each.
The "project background" content unit is implemented as a form that supports multi-line text entry. During structured splitting, the text is split into fragments of 300 characters each.
The "project purpose" content unit is implemented as a form that supports multi-line text entry. During structured splitting, the text is split into fragments of 300 characters each.
The "project significance" content unit is implemented as a form that supports multi-line text entry. During structured splitting, the text is split into fragments of 300 characters each.
The "research content" content unit is implemented as a form that supports multi-line text entry. During structured splitting, the text is split into fragments of 300 characters each.
The "assessment index" content unit is implemented as a dynamic table form with expandable rows that supports multi-line text entry. During structured splitting, each table row is split out as one storage record.
The "schedule" content unit is implemented as a dynamic table form with expandable rows, containing columns such as "start time", "end time", and "main work to be carried out". During structured splitting, each table row is split out as one storage record, and every column value can be searched. The form provides time selection and validation (e.g., the end time of each entry must be later than its start time).
The "budget" content unit is implemented as a dynamic table form with expandable rows, containing columns such as "budget subject", "budget amount", and "description". During structured splitting, each table row is split out as one storage record, and every column value can be searched. The form provides a budget total check (checking whether the sum of the budget subjects matches the "total project expense" field value in the "project basic information" content unit).
The "project members and responsibilities" content unit is implemented as a dynamic table form with expandable rows, containing columns such as "name", "gender", "age", "technical title", "work unit", and "division of responsibilities". During structured splitting, each table row is split out as one storage record, and every column value can be searched. The form provides an auxiliary function for selecting personnel from the system organization structure and automatically filling in column values such as name, gender, and technical title.
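The splitting rules above, keyed by the content unit structure type identifier, can be sketched in Python as a small dispatch table. This is an illustrative sketch only: the function names are hypothetical, the data shapes (a string for text units, a list of rows for table units) are assumptions, and keeping the basic information form as a single record is one possible reading of "no structured splitting is needed".

```python
from typing import Any, Callable, Dict, List


def split_text(text: str, size: int = 300) -> List[str]:
    """Split a text paragraph into fragments of at most `size` characters."""
    return [text[i:i + size] for i in range(0, len(text), size)]


def split_rows(rows: List[Any]) -> List[Any]:
    """Split a table so that each row becomes one storage record."""
    return list(rows)


def keep_whole(content: Any) -> List[Any]:
    """Assumption: an unsplit unit is kept as a single record."""
    return [content]


# Structure type identifier -> splitting logic.
SPLITTERS: Dict[str, Callable[[Any], List[Any]]] = {
    "stp_baseinfo": keep_whole,
    "stp_text": split_text,
    "stp_textclass": split_rows,
    "stp_schedule": split_rows,
    "stp_funds": split_rows,
    "stp_member": split_rows,
}


def split_unit(structure_type: str, content: Any) -> List[Dict[str, Any]]:
    """Split a content unit and number the fragments starting from 1."""
    pieces = SPLITTERS[structure_type](content)
    return [{"seq_no": n, "fragment": p} for n, p in enumerate(pieces, start=1)]
```

For example, `split_unit("stp_text", text)` on a 650-character paragraph yields three fragments of 300, 300, and 50 characters, numbered 1 to 3.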
Step C. Configuration stage: defining the document templates contained in the theme.
Document configuration records are created in the document definition configuration module C3, as shown in the following table:
TABLE 3
Figure BDA0003364612410000161
In this embodiment, the document style template is built with HTML and CSS and is stored directly in the configuration record. The complete HTML code of the template is not shown here; an example of how placeholder identifiers are set is as follows:
<td>
<span id="stp_xmbj">(project background content is filled in here)</span>
<span id="stp_xmmd">(project purpose content is filled in here)</span>
<span id="stp_xmyy">(project significance content is filled in here)</span>
</td>
A placeholder identifier is set with the id attribute of an HTML element and uses the same string as the content unit type identifier. By reading the id attribute, the program code can determine which content unit storage information to acquire and fill it into the template.
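The placeholder mechanism just described can be sketched in Python as follows. This is a minimal sketch, assuming flat, non-nested `<span id="...">` placeholders and plain-text content units; the `fill_template` function and the `latest_units` mapping (identifier to latest stored content) are hypothetical, and a production implementation would more likely use a full HTML parser than a regular expression.

```python
import html
import re
from typing import Dict

# Matches <span id="X">...</span>; assumes placeholders are not nested.
PLACEHOLDER = re.compile(r'(<span id="([^"]+)">)(.*?)(</span>)', re.DOTALL)


def fill_template(template: str, latest_units: Dict[str, str]) -> str:
    """Replace the body of each placeholder span with stored unit content."""
    def substitute(match: "re.Match") -> str:
        open_tag, unit_id, _old_body, close_tag = match.groups()
        if unit_id not in latest_units:
            return match.group(0)  # no stored content: leave the placeholder
        # Escaping assumes plain-text content (an assumption of this sketch).
        return open_tag + html.escape(latest_units[unit_id]) + close_tag
    return PLACEHOLDER.sub(substitute, template)
```

Placeholders without stored content are left untouched, so a partially filled theme instance still produces a readable document.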
Step D. Application stage: creating a theme instance.
A user creates a theme instance of the "science and technology project" type, and an instance record is generated in the theme instance recording module, as shown in the following table:
TABLE 4
Figure BDA0003364612410000171
When the theme instance is loaded in the interface, the content unit configurations in the content unit definition configuration module C2 are retrieved according to the "belonging theme type identifier" and the "theme unique identifier", and a content unit directory list of the theme instance is built as the entry to the content unit display module U1 for editing content. The instance records of each content unit type are then retrieved to generate status information, as shown in the following table:
TABLE 5
Figure BDA0003364612410000172
When no content has been entered, the status information of every content unit is empty and a create operation is available (as shown in record No. 10 of Table 5 above). The statuses and available operations shown in rows 1 and 2 of the table illustrate other states that occur while content is being edited.
When the theme instance is loaded in the interface, the document definition configuration module C3 is queried according to the "belonging theme type identifier" and the "theme unique identifier", and a list of the documents contained in the theme instance is built as the entry to the document browsing and downloading display module D2, as shown in the following table:
TABLE 6
No. | Document name | Operation
1 | Project filing application | [View] (link)
2 | Project implementation plan | [View] (link)
Step E. Application stage: entering theme-related content through the content units in the theme instance.
Theme participants edit the parts of the content contained in the theme through the content unit directory list in the theme instance interface of the theme instance recording module T1. Based on the status information and available operations shown in the content unit directory list, they enter the content unit display module U1 to edit content. When there are several theme participants, they can create content for different content units at the same time.
Step F. Application stage: content unit data processing.
When a participant edits a content unit and temporarily saves or submits it, the data processing module U2 first performs version checking and control on the content unit input submitted by the content unit display module U1. After the version check, the content unit information is stored as a whole in the content unit storage module U3, using different program logic according to the content unit structure type identifier of the content unit. At the same time, the content unit information is structurally split into fragment information, which is stored in the structured information storage module S1 to support conventional retrieval. After the content unit data processing is complete, the status information of the current content unit in the content unit directory list of the theme instance is updated.
Step G. Application stage: retrieving and using content information.
When the content information contained in the various themes needs to be retrieved, the user enters search keywords through the content information retrieval display module S3 provided by the system and submits them.
The retrieval processing module S2 searches the structured content unit information fragments stored in the structured information storage module S1 according to the search conditions submitted by the front-end retrieval interface, and returns the matching fragments.
Step H. Application stage: synthesizing documents, and browsing or downloading them for use.
When a document contained in the theme instance needs to be accessed, the user enters the document browsing and downloading display module D2 through the document directory list in the theme instance interface of the theme instance recording module T1.
When the document browsing and downloading display module D2 is opened, the document type identifier contained in the current document configuration and the theme unique identifier contained in the current theme instance are passed as parameters to the back-end document generation module D1, which decides whether synthesis is needed and performs it. During document synthesis, the document generation module D1 acquires the document style template information from the corresponding configuration record in the document definition configuration module C3 and parses all placeholder identifiers set in the document style template. For each placeholder identifier in turn, it acquires the latest-version storage record of the corresponding content unit in the current theme instance and merges the stored content unit content into the position indicated by the placeholder identifier in the document style template. After the document is synthesized, the document browsing and downloading display module D2 displays it online and lets the user download it.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims (10)

1. A topic content structured management system based on document set, characterized in that: the system comprises a theme content management server and a user client connected with the theme content management server;
the theme content management server comprises a theme type definition configuration module, a content unit definition configuration module, a document definition configuration module, a theme instance recording module, a data processing module, a content unit storage module, a document generation module, a retrieval processing module and a content information retrieval display module;
the theme type definition configuration module is respectively connected with the content unit definition configuration module, the document definition configuration module and the theme instance recording module to define the theme type;
the content unit definition configuration module is respectively connected with the theme type definition configuration module, the content unit display module, the data processing module and the theme instance recording module, and is used for defining the semantics, the interface, the storage structure and the retrieval utilization form characteristics of the content unit;
the content unit display module acquires interface definitions from the content unit definition configuration module to construct a form interface for providing user input and viewing content unit information;
the data processing module provides a corresponding data processing program according to the content unit structure type identifier defined in the content unit configuration; the data processing module is respectively connected with the content unit display module, the content unit definition configuration module, the content unit storage module and the structured information storage module and is mainly used for content unit information version control, integral storage, reverse display conversion and structured fragment storage;
the document definition configuration module is respectively connected with the theme type definition configuration module, the theme instance recording module and the document generation module, and defines the name, the format and the content to be filled of each type of document in a document set contained in the theme type;
the theme instance recording module stores theme instance records and provides a theme content management interface;
the theme instance recording module provides an entrance for the content unit display module by constructing a content unit directory list and provides the theme instance information which needs to be inherited by the content unit display module at the same time;
the content unit storage module is respectively connected with the data storage module and the document generation module and is used for storing the whole information of the content unit input by a user;
the document generation module is respectively connected with the document definition configuration module, the content unit storage module and the document browsing and downloading display module to generate various predefined documents contained in the theme instance;
when the document is synthesized, the document generation module searches corresponding document examples according to parameters transmitted by the document browsing and downloading display module, judges whether the operation of synthesizing the document is needed to be executed, acquires and analyzes document style template information of the document of the type from the document definition configuration module if the operation is needed, acquires corresponding content unit information in the content unit storage module according to a placeholder identifier set in the document template to perform content filling to generate document examples, and then displays the document examples to a document requesting user through the document browsing and downloading display module.
2. The system of claim 1, wherein: the document browsing and downloading display module is respectively connected with the theme instance recording module and the document generating module, and receives a request of a user for acquiring a document and displays the document content back to the user; when a document contained in the theme instance is acquired, entering a theme instance interface from the theme instance recording module, entering a document browsing and downloading display module by clicking a document link in a document directory constructed in the theme instance interface, transmitting document related parameters to a document generating module to execute document instance acquisition operation, and feeding back the acquired document instance to a user through the document browsing and downloading display module.
3. The system of claim 1, wherein: the structured information storage module is respectively connected with the data processing module and the retrieval processing module, stores information fragments formed by splitting content unit information and provides conventional retrieval support;
when the data processing module processes the input information of the content unit, the content unit information is split into structured information fragments and stored in the structured information storage module; the retrieval processing module responds to the retrieval request of the theme content of the user by retrieving the information segments in the structured information storage module;
the retrieval processing module is respectively connected with the content information retrieval display module and the structured information storage module and used for executing the content retrieval and result feedback of the theme instance;
the content information retrieval display module collects retrieval requests input by users, submits retrieval conditions to the retrieval processing module for analysis processing, and the retrieval processing module retrieves content unit fragment information stored in the structured information storage module according to the retrieval conditions, returns results meeting the retrieval conditions and feeds the results back to the users through the content information retrieval display module;
the content information retrieval display module is respectively connected with the user client and the retrieval processing module and provides theme instance content retrieval support for the user;
the user client accesses the content information retrieval display module to input retrieval conditions, submits the retrieval conditions to the retrieval processing module to execute retrieval processing, and after retrieval is completed, the retrieval processing module feeds back the retrieval results to the user through the content information retrieval display module.
4. The system of claim 1, wherein:
when corresponding configuration is defined in the content unit definition configuration module and the document definition configuration module, the theme type name and the theme type identifier of a certain defined theme type in the theme type definition configuration module are inherited, and the theme type name and the theme type identifier are used for determining the theme type to which the configuration belongs;
when the content unit configuration is created in the content unit definition configuration module, the theme type name and the theme type identifier of a defined theme type are inherited from the theme type definition configuration module, and the theme type name and the theme type identifier are used for determining the theme type to which the configuration belongs;
when the document configuration is created in the document definition configuration module, the theme type name and the theme type identifier of a defined theme type are inherited from the theme type definition configuration module, so as to determine the theme type to which the document configuration belongs.
5. The system of claim 1, wherein: the document definition configuration module is respectively connected with the theme type definition configuration module, the theme instance recording module and the document generation module;
when the theme instance records in the theme instance recording module are loaded on the interface, reading all content unit configurations under the subordinate theme instance types defined in the content unit definition configuration module, and constructing a content unit catalog list;
the theme instance recording module is respectively connected with the theme type definition configuration module, the content unit definition configuration module, the document definition configuration module, the content unit display module, the document browsing and downloading display module and the user client, stores theme instance records and provides a theme content management interface;
when the theme instance records in the theme instance recording module are loaded on an interface, reading all document configurations under the subordinate theme instance types defined in the document definition configuration module, and constructing a document directory list;
when the document generation module executes document synthesis operation, the document style template information defined in the document configuration is read from the document definition configuration module and is used for generating a document instance after analyzing and filling contents.
6. The system of claim 5, wherein:
when a theme instance is created in the theme instance recording module, certain theme type configuration in the theme type definition configuration module is read, the theme instance record is created by taking the theme type configuration as a template, and the theme type name and the theme type identifier are inherited;
when the theme instance records in the theme instance recording module are loaded on the interface, reading all content unit configurations under the subordinate theme instance types defined in the content unit definition configuration module, and constructing a content unit catalog list;
when the theme instance records in the theme instance recording module are loaded on an interface, reading all document configurations under the subordinate theme instance types defined in the document definition configuration module for constructing a document directory list;
a content unit directory list constructed in a theme instance recording interface in the theme instance recording module is used as an entrance for entering the content unit display module, and the content unit directory list can automatically inherit keyword information such as a theme type name, a theme type identifier, a theme name, a theme unique identifier and the like of a current theme instance to establish data association after entering the content unit display module;
and a document directory list constructed in a theme instance recording interface in the theme instance recording module is used as an entrance for entering the document browsing and downloading display module, and the document browsing and downloading display module is used for automatically inheriting the current keyword information to establish data association.
7. The system of claim 1, wherein:
the content unit display module receives content information entered by a user and submits it to the data processing module; the data processing module first performs version control verification and, once verification passes, generates a version number; it then performs the data processing operation that corresponds to the content unit structure type identifier defined for the content unit in the content unit definition configuration module, stores the content unit information as a whole in the content unit storage module, and at the same time splits it into structured information fragments that are stored in the structured information storage module;
the data storage module performs version control verification and data packaging on the content unit information entered by a user and then stores it in the content unit storage module to form a content unit instance record;
when the document generation module performs a document synthesis operation, it parses the placeholder identifiers set in the document template and obtains the corresponding content unit instance records from the content unit storage module to fill in the document content.
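The data processing path of claim 7 (version verification, whole-unit storage, fragment splitting) might look roughly like the sketch below. It rests on several assumptions that the claims do not state: version control is modelled as an optimistic check against the latest stored version, the structure type names "form" and "table" are invented, and both storage modules are plain in-memory lists.

```python
import json

# Hypothetical in-memory stand-ins for the two storage modules.
content_unit_store = []   # whole content unit records, one per saved version
structured_store = []     # structured information fragments

def latest_version(theme_uid, unit_id):
    """Highest stored version of a content unit, or 0 if none exists."""
    versions = [r["version"] for r in content_unit_store
                if r["theme_uid"] == theme_uid and r["unit_id"] == unit_id]
    return max(versions, default=0)

def split_fragments(structure_type, content):
    """Split content into (field, value) fragments according to structure type.
    The type names "form" and "table" are assumptions."""
    if structure_type == "form":
        return [(name, str(value)) for name, value in content.items()]
    if structure_type == "table":
        return [(f"row{i}.{name}", str(value))
                for i, row in enumerate(content)
                for name, value in row.items()]
    return [("text", str(content))]

def process_submission(theme_uid, unit_id, structure_type, content, base_version):
    """Verify the version, store the unit as a whole, then store its fragments."""
    current = latest_version(theme_uid, unit_id)
    if base_version != current:
        raise ValueError(f"stale edit: based on v{base_version}, latest is v{current}")
    version = current + 1
    content_unit_store.append({
        "theme_uid": theme_uid, "unit_id": unit_id,
        "version": version, "body": json.dumps(content, ensure_ascii=False),
    })
    for name, value in split_fragments(structure_type, content):
        structured_store.append({
            "theme_uid": theme_uid, "unit_id": unit_id,
            "version": version, "field": name, "value": value,
        })
    return version

# Sample submission (invented data): a one-row budget table.
v1 = process_submission("t1", "u_budget", "table",
                        [{"item": "sensors", "cost": 12000}], base_version=0)
```

Keeping the whole record and the fragments under the same version number lets document synthesis read the former while retrieval reads the latter.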
8. A structured management method for theme content based on a document set, characterized in that the method comprises the following steps:
defining a theme type; defining the content units contained in a theme; defining the document templates contained in a theme;
establishing a theme instance; entering theme-related content through the content units in the theme instance; processing content unit data; retrieving and using content information;
and synthesizing documents for browsing or downloading.
9. The method of claim 8, wherein:
defining a theme type and creating the form template information of a theme instance; analyzing the content composition of each document in the document set contained in the theme;
defining content units by splitting and merging that content on the principle that each piece of local content has an independent and definite meaning and is as atomic as possible;
rendering the defined content units as forms and functions and implementing the matching storage processing logic, and establishing a configuration record in the content unit definition configuration module; rendering the splitting of structured information fragments as forms and functions;
establishing a document configuration record in the document definition configuration module; creating a theme instance of the science and technology project type, and generating an instance record in the theme instance recording module;
when the theme instance is loaded in the interface, retrieving the content unit configurations according to the theme type identifier and the theme unique identifier, constructing the content unit directory list of the theme instance as the entry for content editing, and further retrieving the instance records of each unit type to generate state information;
when the theme instance is loaded, querying the document definition configuration module according to the theme type identifier and the theme unique identifier, and constructing the list of documents contained in the theme instance as the entry to the document browsing and downloading interface;
when a content unit is edited and then temporarily saved or submitted, first performing version verification and control on the input information submitted through the content unit interface; after the version check passes, storing the content unit information as a whole in the content unit storage module using program logic selected by the content unit structure type identifier of the content unit, and at the same time splitting the content unit information into structured fragment information and storing it in the structured information storage module;
and after the content unit data processing is finished, updating the state information of the current content unit in the content unit directory list of the theme instance.
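The state information attached to the content unit directory list in claim 9 could be derived as in the following sketch. The state names "empty", "draft" (temporarily saved) and "submitted", as well as the record layout, are assumptions made for the example and do not come from the claims.

```python
def refresh_unit_status(directory, records):
    """Return the directory entries annotated with the state and version of
    the latest stored record of each content unit."""
    latest = {}
    for r in records:
        key = r["unit_id"]
        if key not in latest or r["version"] > latest[key]["version"]:
            latest[key] = r
    result = []
    for entry in directory:
        rec = latest.get(entry["unit_id"])
        if rec is None:
            status = {"state": "empty", "version": None}   # nothing saved yet
        else:
            status = {"state": rec["state"], "version": rec["version"]}
        result.append({**entry, **status})
    return result

# Sample data (invented): one unit saved twice, one never edited.
directory = [
    {"unit_id": "u_background", "title": "Project background"},
    {"unit_id": "u_budget", "title": "Budget"},
]
records = [
    {"unit_id": "u_background", "version": 1, "state": "draft"},
    {"unit_id": "u_background", "version": 2, "state": "submitted"},
]
statuses = refresh_unit_status(directory, records)
```

Calling the function again after each save corresponds to the final step of the claim, where the directory list is updated once data processing finishes.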
10. The method of claim 8, wherein:
when searching the content information contained in the various themes, entering and submitting search keywords;
the retrieval processing module searches the content unit structured information fragments stored in the structured information storage module according to the search conditions submitted by the front-end retrieval interface and returns the matching information fragments;
when accessing a document contained in a theme instance, entering the document browsing and downloading interface through the document directory list in the theme instance interface of the theme instance recording module; when the document browsing and downloading interface is opened, passing the document type identifier contained in the current document configuration and the theme unique identifier contained in the current theme instance as parameters to the back-end document generation module for the document synthesis decision and processing;
during the document synthesis operation, the document generation module obtains the document style template information from the corresponding configuration record in the document definition configuration module, parses all placeholder identifiers set in the document style template, obtains one by one the latest-version stored record of the content unit that corresponds to each placeholder identifier in the current theme instance, and merges the stored content into the document style template at the position indicated by that placeholder identifier; after the document is synthesized, it is displayed online through the document browsing and downloading interface, and the user can download it.
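As a rough illustration of claim 10, the sketch below shows keyword retrieval over structured fragments and placeholder-based document synthesis. The `{{unit_id}}` placeholder syntax, the substring match used for retrieval, and the record layout are assumptions; the claims only require that placeholders be parsed and filled with the latest version of the matching content unit.

```python
import re

PLACEHOLDER = re.compile(r"\{\{(\w+)\}\}")   # assumed syntax: {{unit_id}}

def search_fragments(fragments, keyword):
    """Return the structured fragments whose value contains the keyword."""
    return [f for f in fragments if keyword.lower() in f["value"].lower()]

def latest_body(records, theme_uid, unit_id):
    """Body of the highest-version record of a content unit, or None."""
    matches = [r for r in records
               if r["theme_uid"] == theme_uid and r["unit_id"] == unit_id]
    return max(matches, key=lambda r: r["version"])["body"] if matches else None

def synthesize(template, records, theme_uid):
    """Replace each placeholder with the latest stored content of its unit."""
    def fill(match):
        body = latest_body(records, theme_uid, match.group(1))
        # Placeholders with no stored content are left in place unchanged.
        return body if body is not None else match.group(0)
    return PLACEHOLDER.sub(fill, template)

# Sample data (invented for the example).
records = [
    {"theme_uid": "t1", "unit_id": "background", "version": 1, "body": "Old text."},
    {"theme_uid": "t1", "unit_id": "background", "version": 2, "body": "Sensors are outdated."},
]
fragments = [{"unit_id": "background", "field": "text", "value": "Sensors are outdated."}]
template = "1. Background\n{{background}}\n2. Budget\n{{budget}}"

hits = search_fragments(fragments, "sensor")
document = synthesize(template, records, "t1")
```

Leaving unresolved placeholders visible is one possible design choice; a real system might instead insert an empty string or flag the document as incomplete.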
CN202111401724.3A 2021-11-19 2021-11-19 Theme content structured management method and system based on document set Pending CN114281920A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111401724.3A CN114281920A (en) 2021-11-19 2021-11-19 Theme content structured management method and system based on document set

Publications (1)

Publication Number Publication Date
CN114281920A true CN114281920A (en) 2022-04-05

Family

ID=80870017

Country Status (1)

Country Link
CN (1) CN114281920A (en)

Similar Documents

Publication Publication Date Title
US7363581B2 (en) Presentation generator
KR101046831B1 (en) Computer readable recording media and methods of linking elements in a document to corresponding data in a database
JP2021061034A (en) Method and system for generating questionnaire
US20070220415A1 (en) Excel spreadsheet parsing to share cells, formulas, tables or entire spreadsheets across an enterprise with other users
KR101401171B1 (en) Methods and apparatus for reusing data access and presentation elements
US20070219956A1 (en) Excel spreadsheet parsing to share cells, formulas, tables, etc.
US20060218160A1 (en) Change control management of XML documents
CN101488086A (en) Software generation method and apparatus based on field model
Alexander et al. Access 2013 Bible
CN102024207A (en) Knowledge management system seamlessly combined with office software
Ferrández et al. A framework for enriching Data Warehouse analysis with Question Answering systems
US20090193053A1 (en) Information management system
Chortaras et al. WITH: human-computer collaboration for data annotation and enrichment
CN115713309A (en) Internal auditing system
US20080229305A1 (en) Workflow management system
KR20050071720A (en) Methods for dynamically building the home page and apparatus embodied on the web therefor
CN114281920A (en) Theme content structured management method and system based on document set
KR100902193B1 (en) A System For Composing Report, A Method ? A System For Providing A Made-To-Order Report, And A Storage Medium
JP3842576B2 (en) Structured document editing method and structured document editing system
Walkerdine et al. Sharing searches: developing open support for collaborative searching
Petiya Building a Semantic Web of Comics: Publishing Linked Data in HTML/RDFa Using a Comic Book Ontology and Metadata Application Profiles
US10607239B2 (en) Enterprise evaluation using structured data
Webb EAD in the department of special collections and western manuscripts at the bodleian library, United Kingdom
Martin et al. RDFtex in-depth: knowledge exchange between LATEX-based research publications and Scientific Knowledge Graphs
Škrbić et al. Bibliographic records editor in XML native environment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination