CN113033162A - Electronic document conversion method capable of controlling editing rule - Google Patents
Electronic document conversion method capable of controlling editing rule Download PDFInfo
- Publication number
- CN113033162A CN113033162A CN202110376820.0A CN202110376820A CN113033162A CN 113033162 A CN113033162 A CN 113033162A CN 202110376820 A CN202110376820 A CN 202110376820A CN 113033162 A CN113033162 A CN 113033162A
- Authority
- CN
- China
- Prior art keywords
- electronic document
- editing
- rule
- document
- controllable
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/151—Transformation
- G06F40/154—Tree transformation for tree-structured or markup documents, e.g. XSLT, XSL-FO or stylesheets
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
Abstract
The invention has proposed a electronic document conversion method that can control and edit the rule, carry on the normalization processing to the existing electronic document at first, get the electronic document of the normalization; the existing electronic document is an electronic document with uncontrollable editing rules; then, the outline level and the conversion rule are set, and the unified electronic document is converted according to the conversion rule to obtain the unified electronic document capable of controlling the editing rule. The structural characteristics of document conversion are not lost, and the structural hierarchy of the converted document is not lost.
Description
Technical Field
The invention belongs to the technical field of computer electronic document conversion, and particularly relates to an electronic document conversion method capable of controlling editing rules.
Background
If an electronic document needs to have the characteristics of another electronic document, a common method at present is to perform document conversion, convert an original document into an electronic document with the required characteristics, for example, convert a word document that can be edited into a pdf file that is favorable for viewing and printing; and converting an excel document which cannot be directly displayed on a webpage into an html document which can be directly displayed on the webpage, and the like.
However, most of the existing document conversion methods are direct conversion of contents, so many characteristics of the original electronic document are lost, and editing control cannot be performed on the converted electronic document, for example, after a word is converted into an html document, because html does not have characteristics of headers, footers, sections and the like, the converted html loses information of headers, footers, sections and the like in the original word document, the word document can be edited at any position in the document, but cannot be written on a page after being converted into html.
In order to solve the above problems, the prior art mostly adopts the method of setting a marker in the original document, and then converting the original document into the required content through the marker at the time of conversion. In actual use, the converted document not only has the characteristics of the original document, but also can meet the requirements of the user, for example, the converted document can have a certain organizational structure hierarchy according to an actual use scene, and can be used more conveniently according to the structure when in use, but many conversions at present cannot meet the conversion requirements.
Disclosure of Invention
Based on the above defects and requirements of the prior art, the present invention provides an electronic document conversion method capable of controlling editing rules, which first unifies the existing electronic documents to obtain unified electronic documents; the existing electronic document is an electronic document with uncontrollable editing rules; then, the outline level and the conversion rule are set, and the unified electronic document is converted according to the conversion rule to obtain the unified electronic document capable of controlling the editing rule. The structural characteristics of document conversion are not lost, and the structural hierarchy of the converted document is not lost.
The specific implementation content of the invention is as follows:
the invention has proposed a electronic document conversion method that can control and edit the rule, carry on the normalization processing to the existing electronic document at first, get the electronic document of the normalization; the existing electronic document is an electronic document with uncontrollable editing rules; then, the outline level and the conversion rule are set, and the unified electronic document is converted according to the conversion rule to obtain the unified electronic document capable of controlling the editing rule.
In order to better implement the present invention, further, in the process of unifying the existing electronic document to obtain a unified electronic document, unique and fixed different editing identifiers need to be set, and each editing identifier corresponds to an editing rule; and adding corresponding editing identifiers according to different editing modes required by the electronic document with uncontrollable editing rules where editing control is required.
In order to better implement the present invention, further, in the process of performing a unification process on an existing electronic document to obtain a unified electronic document, unique and fixed different feature identifiers need to be set, each feature identifier corresponds to a structure of the document, and a corresponding feature identifier is added at a place where the structure needs to be divided in the electronic document in which an editing rule cannot be controlled; the structure of the document includes sections, pages, titles, headers, page numbers, and horizontal and vertical pages.
In order to better implement the invention, further, according to the requirement of the hierarchical structure of the unified electronic document of the controllable editing rule after the actual conversion, the corresponding outline level is set in advance in the electronic document of the uncontrollable editing rule.
In order to better implement the invention, further, in the process of setting the outline level and the conversion rule, and converting the unified electronic document according to the conversion rule to obtain the unified electronic document capable of controlling the editing rule, the editing identifier is firstly identified, and different electronic document tags are obtained by conversion according to different editing identifiers.
In order to better implement the present invention, further, in the process of setting the outline level and the transformation rule, and transforming the unified electronic document according to the transformation rule to obtain the unified electronic document capable of controlling the editing rule, the feature identifier is further identified according to actual requirements, and according to the feature identifier, the feature of the structure of the document corresponding to the feature identifier is added to the electronic document which is required to be transformed into the controllable editing rule.
In order to better implement the invention, further, after different electronic document tags are obtained through conversion according to different editing identifiers and characteristics of the structure of the document corresponding to the characteristic identifier are added to the electronic document which needs to be converted into the controllable editing rule, the complete electronic document with the controllable editing rule is obtained through setting according to the preset outline level.
In order to better implement the present invention, further, the specific steps of obtaining the complete electronic document capable of controlling the editing rule according to the preset outline level setting are as follows:
firstly, identifying outline level of an electronic document with uncontrollable editing rules;
secondly, recording the outline level content of the electronic document with uncontrollable editing rules;
then, splitting the electronic document with the uncontrollable editing rule into a plurality of sub-documents according to the final level of the outline of the electronic document with the uncontrollable editing rule;
then, converting the plurality of split sub-documents into sub-documents capable of controlling editing rules;
and finally, according to the recorded outline level content, assembling and establishing the subdocuments of the controllable editing rule into a complete electronic document of the controllable editing rule corresponding to the structural hierarchy.
Compared with the prior art, the invention has the following advantages and beneficial effects:
the invention provides an electronic document conversion method capable of controlling editing rules, which is based on the method and realizes that an electronic document which can not control the editing rules is converted into an electronic document with editing rule control, and the converted electronic document can be edited only in a specific area by using a specific editing rule. Meanwhile, the electronic document converted based on the method also keeps the outline level, the structural layer level and the like of the original document.
Drawings
FIG. 1 is an exemplary screen shot of a normalized computer file interface of an original electronic document;
FIG. 2 is an exemplary diagram of a computer file interface screenshot of a converted electronic document;
FIG. 3 is an exemplary screen shot of a computer file interface for editing an electronic document in which editing rules may be controlled.
Detailed Description
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it should be understood that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments, and therefore should not be considered as a limitation to the scope of protection. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, are within the scope of the present invention.
In the description of the present invention, it is to be noted that, unless otherwise explicitly specified or limited, the terms "disposed," "connected," and "connected" are to be construed broadly, and may be, for example, fixedly connected, detachably connected, or integrally connected; can be mechanically or electrically connected; they may be connected directly or indirectly through intervening media, or they may be interconnected between two elements. The specific meanings of the above terms in the present invention can be understood in specific cases to those skilled in the art.
Example 1:
the embodiment provides an electronic document conversion method capable of controlling editing rules, which comprises the steps of firstly carrying out unification processing on the existing electronic documents to obtain unified electronic documents; the existing electronic document is an electronic document with uncontrollable editing rules; then, the outline level and the conversion rule are set, and the unified electronic document is converted according to the conversion rule to obtain the unified electronic document capable of controlling the editing rule.
In the process of carrying out the unification treatment on the existing electronic documents to obtain the unified electronic documents, the specific steps are as follows:
firstly, setting unique and fixed different editing identifiers, wherein each editing identifier corresponds to an editing rule; and adding corresponding editing identifiers according to different editing modes required by the electronic document with uncontrollable editing rules where editing control is required.
Secondly, setting unique and fixed different characteristic identifiers, wherein each characteristic identifier corresponds to the structure of a document, and adding the corresponding characteristic identifiers at the positions where the structures need to be divided in the electronic document with uncontrollable editing rules; the structure of the document includes sections, pages, titles, headers, page numbers, and horizontal and vertical pages.
And finally, according to the requirement of the hierarchical structure of the electronic document which is actually converted and can control the unification of the editing rules, setting the corresponding outline level in the electronic document which can not control the editing rules in advance.
In the process of setting the outline level and the conversion rule, and converting the unified electronic document according to the conversion rule to obtain the unified electronic document capable of controlling the editing rule, the specific operations are as follows:
firstly, the editing identifiers are identified, and different electronic document tags are obtained through conversion according to different editing identifiers.
Secondly, the characteristic identifier is identified according to actual requirements, and the characteristic of the structure of the document corresponding to the characteristic identifier is added to the electronic document which needs to be converted into the controllable editing rule according to the characteristic identifier.
And finally, setting according to the preset outline level to obtain a complete electronic document capable of controlling the editing rule.
Example 2:
in this embodiment, on the basis of the foregoing embodiment 1, in order to better implement the present invention, further, the specific steps of obtaining a complete electronic document with controllable editing rules according to the preset outline level setting include:
firstly, identifying outline level of an electronic document with uncontrollable editing rules;
secondly, recording the outline level content of the electronic document with uncontrollable editing rules;
then, splitting the electronic document with the uncontrollable editing rule into a plurality of sub-documents according to the final level of the outline of the electronic document with the uncontrollable editing rule;
then, converting the plurality of split sub-documents into sub-documents capable of controlling editing rules;
and finally, according to the recorded outline level content, assembling and establishing the subdocuments of the controllable editing rule into a complete electronic document of the controllable editing rule corresponding to the structural hierarchy.
Other parts of this embodiment are the same as those of embodiment 1, and thus are not described again.
Example 3:
on the basis of any one of the foregoing embodiments 1-2, as shown in fig. 1, fig. 2, and fig. 3, this embodiment further provides a specific implementation example of converting a word format file into an html format file as follows:
firstly, carrying out standardized processing on a word electronic document to obtain a standardized word electronic document, converting the standardized word electronic document into an html electronic document capable of controlling editing rules according to certain rules
And carrying out standardization processing on the word electronic document to obtain a standardized word electronic document. The normalization processing rule is as follows:
(1) adding different editing identifiers according to different editing modes at places where editing control is needed in an electronic document with uncontrollable editing rules, wherein the content form of each editing identifier is not fixed and can be a special symbol or composed of a plurality of characters, for example, a normalized document shown in figure 1, adding a special editing identifier, such as '____', at a position where content needs to be input, wherein the position is indicated that characters can be filled in, and meanwhile, the length of the written content can be determined by the length of a sliding line; the blank cells of the form in the word document can also be used as a compiling input identifier to indicate that the characters can be filled in; adding a special identifier to a position needing signature of an editor, such as adding "@ handwritten signature @", to indicate that the position is a signature area, and the editor can sign the signature; in addition, as for data with a fixed format, such as time of filling content, a special editing identifier may be added, such as "@ YMD @", indicating that the format of content filled in here is "xx month xx day" or "@ YM @" indicating that the format of content filled in "xx month xx" is "and the like.
(2) Different feature identifiers are added according to the features of the original document, such as sections, pages, titles, headers, horizontal pages and vertical pages of the word document, and the feature identifiers can be added in the places to identify the features of the original document, and each structure is not fixed with the content form of the feature identifiers. For example, a section symbol and a page symbol are respectively added to the places of a word document where section and page are needed as section and page feature identifiers of an original document, and a new converted electronic document can identify section and page information of the converted electronic document through the page and section identifiers;
(3) according to the structural organization level required by the converted electronic document, an outline level is added to each layer of structure, and if the content structure in the word document has 2 levels, the outline level of the first level is set to be 1 level, the outline level of the second level is set to be 2 levels, so that the converted document can assemble the structural hierarchy information of the document through the outline levels.
And then, converting the normalized word electronic document into an html electronic document capable of controlling the editing rule according to a certain rule, wherein the conversion rule is as follows:
(1) when the electronic document is converted, the editing identifiers in the normalized electronic document need to be identified, and different electronic document tags are converted according to different editing identifiers, wherein the electronic document tags can be existing tags of the electronic document or self-defined tags with specific functions. As shown in fig. 2 and fig. 3, (each label in fig. 2 is given a default value for describing a label), taking the example of converting word into html, if "___" or a table blank cell is used to represent an input edit identifier, then if the document content is recognized as "___" or a table blank cell is recognized as an "< textarea >" label in html at the time of conversion, the content can be input on the label; if the ' handwritten signature ' is used for representing a signature editing identifier, replacing the content with an ' < img > ' label in html if the content is recognized as ' @ handwritten signature ' @ ' during conversion, and controlling the label by setting the attribute of the label and adding JavaScript code of html to enable an editor to sign on the label; if the time format edit identifier where "@ YMD @" indicates that the input data is "xx month xx day" is used, then if the content is recognized as "@ YMD @" at the time of conversion, it is replaced with "< textarea >" tag in html, and through tag property and JavaScript code control, we can select the time with the format "xx month xx day" on it at the time of editing.
(2) According to different feature identifiers of the normalized document, the feature markers are respectively processed according to actual requirements during conversion, so that the converted electronic document features meet specific requirements.
(3) The steps of assembling the converted electronic document according to the outline level in the original document are as follows:
1) identifying the outline level of the original document, for example, in fig. 1, first finding the content marked with the outline level in the word document;
2) recording the outline level content of the original document;
3) splitting an original document into a plurality of documents according to the last level outline level of the original document, as shown in fig. 1, wherein the whole word document has the 1 st level and the 2 nd level outline levels, and then splitting the original word document into a plurality of word documents according to the last level 2 nd level outline;
4) converting the plurality of split documents into a required html electronic document set;
5) and establishing a structural hierarchical relationship of the converted electronic document according to the recorded outline levels, wherein as shown in fig. 2, the 1 st level outline in the original word document is used as the first level classification of the html document set, and the 2 nd level outline is used as the name of each html file, so that all converted html files are assembled into a file set with a hierarchical structure according to the outline levels.
Other parts of this embodiment are the same as any of embodiments 1-2 described above, and thus are not described again.
The above description is only a preferred embodiment of the present invention, and is not intended to limit the present invention in any way, and all simple modifications and equivalent variations of the above embodiments according to the technical spirit of the present invention are included in the scope of the present invention.
Claims (8)
1. A electronic document conversion method capable of controlling editing rules is characterized in that existing electronic documents are firstly subjected to unification processing to obtain unified electronic documents; the existing electronic document is an electronic document with uncontrollable editing rules; then, the outline level and the conversion rule are set, and the unified electronic document is converted according to the conversion rule to obtain the unified electronic document capable of controlling the editing rule.
2. The method as claimed in claim 1, wherein during the process of unifying the existing electronic documents to obtain unified electronic documents, unique and fixed different editing identifiers are required to be set, and each editing identifier corresponds to an editing rule; and adding corresponding editing identifiers according to different editing modes required by the electronic document with uncontrollable editing rules where editing control is required.
3. The method as claimed in claim 2, wherein in the process of unifying the existing electronic documents to obtain unified electronic documents, unique and fixed different feature identifiers are required to be set, each feature identifier corresponds to a document structure, and corresponding feature identifiers are added to the electronic documents with uncontrollable editing rules where the structure is required to be divided; the structure of the document includes sections, pages, titles, headers, page numbers, and horizontal and vertical pages.
4. The method according to claim 3, wherein the electronic document with the editing rules controllable is set with the outline level corresponding to the hierarchical structure of the unified electronic document with the editing rules controllable after the conversion.
5. The method as claimed in claim 4, wherein in the process of setting the outline level and the transformation rule, transforming the unified electronic document according to the transformation rule to obtain the unified electronic document with the controllable editing rule, the editing identifier is first identified, and different tags of the electronic document are obtained by transforming according to different editing identifiers.
6. The method as claimed in claim 5, wherein in the process of setting the outline level and the transformation rule, transforming the unified electronic document according to the transformation rule to obtain the unified electronic document with the controllable editing rule, the method further needs to identify the feature identifier according to actual requirements, and add the feature of the structure of the document corresponding to the feature identifier to the electronic document that needs to be transformed into the controllable editing rule according to the feature identifier.
7. The method according to claim 6, wherein after obtaining different tags of the electronic document according to different editing identifiers and adding characteristics of the structure of the document corresponding to the characteristic identifier to the electronic document to be converted into the controllable editing rule, the electronic document is set according to a predetermined outline level to obtain a complete electronic document with controllable editing rules.
8. The method for converting an electronic document with controllable editing rules according to claim 7, wherein the specific steps of obtaining the complete electronic document with controllable editing rules according to the preset outline level setting are as follows:
firstly, identifying outline level of an electronic document with uncontrollable editing rules;
secondly, recording the outline level content of the electronic document with uncontrollable editing rules;
then, splitting the electronic document with the uncontrollable editing rule into a plurality of sub-documents according to the final level of the outline of the electronic document with the uncontrollable editing rule;
then, converting the plurality of split sub-documents into sub-documents capable of controlling editing rules;
and finally, according to the recorded outline level content, assembling and establishing the subdocuments of the controllable editing rule into a complete electronic document of the controllable editing rule corresponding to the structural hierarchy.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110376820.0A CN113033162A (en) | 2021-04-08 | 2021-04-08 | Electronic document conversion method capable of controlling editing rule |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110376820.0A CN113033162A (en) | 2021-04-08 | 2021-04-08 | Electronic document conversion method capable of controlling editing rule |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113033162A true CN113033162A (en) | 2021-06-25 |
Family
ID=76454297
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110376820.0A Withdrawn CN113033162A (en) | 2021-04-08 | 2021-04-08 | Electronic document conversion method capable of controlling editing rule |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113033162A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113779931A (en) * | 2021-08-31 | 2021-12-10 | 民商数字科技(深圳)有限公司 | Knowledge base construction method based on Word and control method thereof |
-
2021
- 2021-04-08 CN CN202110376820.0A patent/CN113033162A/en not_active Withdrawn
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113779931A (en) * | 2021-08-31 | 2021-12-10 | 民商数字科技(深圳)有限公司 | Knowledge base construction method based on Word and control method thereof |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1672524B1 (en) | Systems and methods for converting a formatted document to a web page | |
CN110083805B (en) | Method and system for converting Word file into EPUB file | |
CN111753099B (en) | Method and system for enhancing relevance of archive entity based on knowledge graph | |
CN103324731A (en) | Method and device for converting spreadsheet file into web format file | |
JP2020187733A (en) | Application programming interface documentation annotation | |
CN1702617A (en) | Method and system for mapping content between a starting template and a target template | |
CN106021394A (en) | Website construction method and apparatus | |
CN103631604B (en) | A kind of standardized digital education resource management method based on CELTS-41 specification | |
CN107203498A (en) | A kind of method, system and its user terminal and server for creating e-book | |
CN102982439A (en) | Method for generating electronic document and electronic document generating system | |
JP2022547895A (en) | User-customized automatic document creation method, device and server therefor | |
CN101430684A (en) | Method and apparatus for mutual conversion between Chinese work office software document and documents with other format | |
CN104281589A (en) | Mathematical formula searching method and device | |
US20030222916A1 (en) | Object-oriented processing of tab text | |
CN113033162A (en) | Electronic document conversion method capable of controlling editing rule | |
CN102110108B (en) | Method and device for processing galley proof file | |
CN113850056A (en) | Document key information extraction method and system based on keyword splitting technology | |
CN111708529B (en) | Method for generating form through dragging based on angular | |
CN106815181A (en) | Conversion method and device of a kind of indd files of InDesign typesettings to Office files | |
CN101464875B (en) | Method for representing electronic dictionary catalog data by XML | |
CN115841095A (en) | Document establishing method, system, medium and equipment | |
CN112818687B (en) | Method, device, electronic equipment and storage medium for constructing title recognition model | |
CN112199966B (en) | Translation method, translation device, electronic device and storage medium | |
CN111401005B (en) | Text conversion method and device and readable storage medium | |
KR101133567B1 (en) | Method and server for publishing on demand |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WW01 | Invention patent application withdrawn after publication | ||
WW01 | Invention patent application withdrawn after publication |
Application publication date: 20210625 |