CN113033162A - Electronic document conversion method capable of controlling editing rule - Google Patents

Electronic document conversion method capable of controlling editing rule Download PDF

Info

Publication number
CN113033162A
CN113033162A CN202110376820.0A CN202110376820A CN113033162A CN 113033162 A CN113033162 A CN 113033162A CN 202110376820 A CN202110376820 A CN 202110376820A CN 113033162 A CN113033162 A CN 113033162A
Authority
CN
China
Prior art keywords
electronic document
editing
rule
document
controllable
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN202110376820.0A
Other languages
Chinese (zh)
Inventor
王举
俞晓荣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Weishilian Software Technology Co ltd
Original Assignee
Chengdu Weishilian Software Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu Weishilian Software Technology Co ltd filed Critical Chengdu Weishilian Software Technology Co ltd
Priority to CN202110376820.0A priority Critical patent/CN113033162A/en
Publication of CN113033162A publication Critical patent/CN113033162A/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/151Transformation
    • G06F40/154Tree transformation for tree-structured or markup documents, e.g. XSLT, XSL-FO or stylesheets
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting

Abstract

The invention has proposed a electronic document conversion method that can control and edit the rule, carry on the normalization processing to the existing electronic document at first, get the electronic document of the normalization; the existing electronic document is an electronic document with uncontrollable editing rules; then, the outline level and the conversion rule are set, and the unified electronic document is converted according to the conversion rule to obtain the unified electronic document capable of controlling the editing rule. The structural characteristics of document conversion are not lost, and the structural hierarchy of the converted document is not lost.

Description

Electronic document conversion method capable of controlling editing rule
Technical Field
The invention belongs to the technical field of computer electronic document conversion, and particularly relates to an electronic document conversion method capable of controlling editing rules.
Background
If an electronic document needs to have the characteristics of another electronic document, a common method at present is to perform document conversion, convert an original document into an electronic document with the required characteristics, for example, convert a word document that can be edited into a pdf file that is favorable for viewing and printing; and converting an excel document which cannot be directly displayed on a webpage into an html document which can be directly displayed on the webpage, and the like.
However, most of the existing document conversion methods are direct conversion of contents, so many characteristics of the original electronic document are lost, and editing control cannot be performed on the converted electronic document, for example, after a word is converted into an html document, because html does not have characteristics of headers, footers, sections and the like, the converted html loses information of headers, footers, sections and the like in the original word document, the word document can be edited at any position in the document, but cannot be written on a page after being converted into html.
In order to solve the above problems, the prior art mostly adopts the method of setting a marker in the original document, and then converting the original document into the required content through the marker at the time of conversion. In actual use, the converted document not only has the characteristics of the original document, but also can meet the requirements of the user, for example, the converted document can have a certain organizational structure hierarchy according to an actual use scene, and can be used more conveniently according to the structure when in use, but many conversions at present cannot meet the conversion requirements.
Disclosure of Invention
Based on the above defects and requirements of the prior art, the present invention provides an electronic document conversion method capable of controlling editing rules, which first unifies the existing electronic documents to obtain unified electronic documents; the existing electronic document is an electronic document with uncontrollable editing rules; then, the outline level and the conversion rule are set, and the unified electronic document is converted according to the conversion rule to obtain the unified electronic document capable of controlling the editing rule. The structural characteristics of document conversion are not lost, and the structural hierarchy of the converted document is not lost.
The specific implementation content of the invention is as follows:
the invention has proposed a electronic document conversion method that can control and edit the rule, carry on the normalization processing to the existing electronic document at first, get the electronic document of the normalization; the existing electronic document is an electronic document with uncontrollable editing rules; then, the outline level and the conversion rule are set, and the unified electronic document is converted according to the conversion rule to obtain the unified electronic document capable of controlling the editing rule.
In order to better implement the present invention, further, in the process of unifying the existing electronic document to obtain a unified electronic document, unique and fixed different editing identifiers need to be set, and each editing identifier corresponds to an editing rule; and adding corresponding editing identifiers according to different editing modes required by the electronic document with uncontrollable editing rules where editing control is required.
In order to better implement the present invention, further, in the process of performing a unification process on an existing electronic document to obtain a unified electronic document, unique and fixed different feature identifiers need to be set, each feature identifier corresponds to a structure of the document, and a corresponding feature identifier is added at a place where the structure needs to be divided in the electronic document in which an editing rule cannot be controlled; the structure of the document includes sections, pages, titles, headers, page numbers, and horizontal and vertical pages.
In order to better implement the invention, further, according to the requirement of the hierarchical structure of the unified electronic document of the controllable editing rule after the actual conversion, the corresponding outline level is set in advance in the electronic document of the uncontrollable editing rule.
In order to better implement the invention, further, in the process of setting the outline level and the conversion rule, and converting the unified electronic document according to the conversion rule to obtain the unified electronic document capable of controlling the editing rule, the editing identifier is firstly identified, and different electronic document tags are obtained by conversion according to different editing identifiers.
In order to better implement the present invention, further, in the process of setting the outline level and the transformation rule, and transforming the unified electronic document according to the transformation rule to obtain the unified electronic document capable of controlling the editing rule, the feature identifier is further identified according to actual requirements, and according to the feature identifier, the feature of the structure of the document corresponding to the feature identifier is added to the electronic document which is required to be transformed into the controllable editing rule.
In order to better implement the invention, further, after different electronic document tags are obtained through conversion according to different editing identifiers and characteristics of the structure of the document corresponding to the characteristic identifier are added to the electronic document which needs to be converted into the controllable editing rule, the complete electronic document with the controllable editing rule is obtained through setting according to the preset outline level.
In order to better implement the present invention, further, the specific steps of obtaining the complete electronic document capable of controlling the editing rule according to the preset outline level setting are as follows:
firstly, identifying outline level of an electronic document with uncontrollable editing rules;
secondly, recording the outline level content of the electronic document with uncontrollable editing rules;
then, splitting the electronic document with the uncontrollable editing rule into a plurality of sub-documents according to the final level of the outline of the electronic document with the uncontrollable editing rule;
then, converting the plurality of split sub-documents into sub-documents capable of controlling editing rules;
and finally, according to the recorded outline level content, assembling and establishing the subdocuments of the controllable editing rule into a complete electronic document of the controllable editing rule corresponding to the structural hierarchy.
Compared with the prior art, the invention has the following advantages and beneficial effects:
the invention provides an electronic document conversion method capable of controlling editing rules, which is based on the method and realizes that an electronic document which can not control the editing rules is converted into an electronic document with editing rule control, and the converted electronic document can be edited only in a specific area by using a specific editing rule. Meanwhile, the electronic document converted based on the method also keeps the outline level, the structural layer level and the like of the original document.
Drawings
FIG. 1 is an exemplary screen shot of a normalized computer file interface of an original electronic document;
FIG. 2 is an exemplary diagram of a computer file interface screenshot of a converted electronic document;
FIG. 3 is an exemplary screen shot of a computer file interface for editing an electronic document in which editing rules may be controlled.
Detailed Description
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it should be understood that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments, and therefore should not be considered as a limitation to the scope of protection. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, are within the scope of the present invention.
In the description of the present invention, it is to be noted that, unless otherwise explicitly specified or limited, the terms "disposed," "connected," and "connected" are to be construed broadly, and may be, for example, fixedly connected, detachably connected, or integrally connected; can be mechanically or electrically connected; they may be connected directly or indirectly through intervening media, or they may be interconnected between two elements. The specific meanings of the above terms in the present invention can be understood in specific cases to those skilled in the art.
Example 1:
the embodiment provides an electronic document conversion method capable of controlling editing rules, which comprises the steps of firstly carrying out unification processing on the existing electronic documents to obtain unified electronic documents; the existing electronic document is an electronic document with uncontrollable editing rules; then, the outline level and the conversion rule are set, and the unified electronic document is converted according to the conversion rule to obtain the unified electronic document capable of controlling the editing rule.
In the process of carrying out the unification treatment on the existing electronic documents to obtain the unified electronic documents, the specific steps are as follows:
firstly, setting unique and fixed different editing identifiers, wherein each editing identifier corresponds to an editing rule; and adding corresponding editing identifiers according to different editing modes required by the electronic document with uncontrollable editing rules where editing control is required.
Secondly, setting unique and fixed different characteristic identifiers, wherein each characteristic identifier corresponds to the structure of a document, and adding the corresponding characteristic identifiers at the positions where the structures need to be divided in the electronic document with uncontrollable editing rules; the structure of the document includes sections, pages, titles, headers, page numbers, and horizontal and vertical pages.
And finally, according to the requirement of the hierarchical structure of the electronic document which is actually converted and can control the unification of the editing rules, setting the corresponding outline level in the electronic document which can not control the editing rules in advance.
In the process of setting the outline level and the conversion rule, and converting the unified electronic document according to the conversion rule to obtain the unified electronic document capable of controlling the editing rule, the specific operations are as follows:
firstly, the editing identifiers are identified, and different electronic document tags are obtained through conversion according to different editing identifiers.
Secondly, the characteristic identifier is identified according to actual requirements, and the characteristic of the structure of the document corresponding to the characteristic identifier is added to the electronic document which needs to be converted into the controllable editing rule according to the characteristic identifier.
And finally, setting according to the preset outline level to obtain a complete electronic document capable of controlling the editing rule.
Example 2:
in this embodiment, on the basis of the foregoing embodiment 1, in order to better implement the present invention, further, the specific steps of obtaining a complete electronic document with controllable editing rules according to the preset outline level setting include:
firstly, identifying outline level of an electronic document with uncontrollable editing rules;
secondly, recording the outline level content of the electronic document with uncontrollable editing rules;
then, splitting the electronic document with the uncontrollable editing rule into a plurality of sub-documents according to the final level of the outline of the electronic document with the uncontrollable editing rule;
then, converting the plurality of split sub-documents into sub-documents capable of controlling editing rules;
and finally, according to the recorded outline level content, assembling and establishing the subdocuments of the controllable editing rule into a complete electronic document of the controllable editing rule corresponding to the structural hierarchy.
Other parts of this embodiment are the same as those of embodiment 1, and thus are not described again.
Example 3:
on the basis of any one of the foregoing embodiments 1-2, as shown in fig. 1, fig. 2, and fig. 3, this embodiment further provides a specific implementation example of converting a word format file into an html format file as follows:
firstly, carrying out standardized processing on a word electronic document to obtain a standardized word electronic document, converting the standardized word electronic document into an html electronic document capable of controlling editing rules according to certain rules
And carrying out standardization processing on the word electronic document to obtain a standardized word electronic document. The normalization processing rule is as follows:
(1) adding different editing identifiers according to different editing modes at places where editing control is needed in an electronic document with uncontrollable editing rules, wherein the content form of each editing identifier is not fixed and can be a special symbol or composed of a plurality of characters, for example, a normalized document shown in figure 1, adding a special editing identifier, such as '____', at a position where content needs to be input, wherein the position is indicated that characters can be filled in, and meanwhile, the length of the written content can be determined by the length of a sliding line; the blank cells of the form in the word document can also be used as a compiling input identifier to indicate that the characters can be filled in; adding a special identifier to a position needing signature of an editor, such as adding "@ handwritten signature @", to indicate that the position is a signature area, and the editor can sign the signature; in addition, as for data with a fixed format, such as time of filling content, a special editing identifier may be added, such as "@ YMD @", indicating that the format of content filled in here is "xx month xx day" or "@ YM @" indicating that the format of content filled in "xx month xx" is "and the like.
(2) Different feature identifiers are added according to the features of the original document, such as sections, pages, titles, headers, horizontal pages and vertical pages of the word document, and the feature identifiers can be added in the places to identify the features of the original document, and each structure is not fixed with the content form of the feature identifiers. For example, a section symbol and a page symbol are respectively added to the places of a word document where section and page are needed as section and page feature identifiers of an original document, and a new converted electronic document can identify section and page information of the converted electronic document through the page and section identifiers;
(3) according to the structural organization level required by the converted electronic document, an outline level is added to each layer of structure, and if the content structure in the word document has 2 levels, the outline level of the first level is set to be 1 level, the outline level of the second level is set to be 2 levels, so that the converted document can assemble the structural hierarchy information of the document through the outline levels.
And then, converting the normalized word electronic document into an html electronic document capable of controlling the editing rule according to a certain rule, wherein the conversion rule is as follows:
(1) when the electronic document is converted, the editing identifiers in the normalized electronic document need to be identified, and different electronic document tags are converted according to different editing identifiers, wherein the electronic document tags can be existing tags of the electronic document or self-defined tags with specific functions. As shown in fig. 2 and fig. 3, (each label in fig. 2 is given a default value for describing a label), taking the example of converting word into html, if "___" or a table blank cell is used to represent an input edit identifier, then if the document content is recognized as "___" or a table blank cell is recognized as an "< textarea >" label in html at the time of conversion, the content can be input on the label; if the ' handwritten signature ' is used for representing a signature editing identifier, replacing the content with an ' < img > ' label in html if the content is recognized as ' @ handwritten signature ' @ ' during conversion, and controlling the label by setting the attribute of the label and adding JavaScript code of html to enable an editor to sign on the label; if the time format edit identifier where "@ YMD @" indicates that the input data is "xx month xx day" is used, then if the content is recognized as "@ YMD @" at the time of conversion, it is replaced with "< textarea >" tag in html, and through tag property and JavaScript code control, we can select the time with the format "xx month xx day" on it at the time of editing.
(2) According to different feature identifiers of the normalized document, the feature markers are respectively processed according to actual requirements during conversion, so that the converted electronic document features meet specific requirements.
(3) The steps of assembling the converted electronic document according to the outline level in the original document are as follows:
1) identifying the outline level of the original document, for example, in fig. 1, first finding the content marked with the outline level in the word document;
2) recording the outline level content of the original document;
3) splitting an original document into a plurality of documents according to the last level outline level of the original document, as shown in fig. 1, wherein the whole word document has the 1 st level and the 2 nd level outline levels, and then splitting the original word document into a plurality of word documents according to the last level 2 nd level outline;
4) converting the plurality of split documents into a required html electronic document set;
5) and establishing a structural hierarchical relationship of the converted electronic document according to the recorded outline levels, wherein as shown in fig. 2, the 1 st level outline in the original word document is used as the first level classification of the html document set, and the 2 nd level outline is used as the name of each html file, so that all converted html files are assembled into a file set with a hierarchical structure according to the outline levels.
Other parts of this embodiment are the same as any of embodiments 1-2 described above, and thus are not described again.
The above description is only a preferred embodiment of the present invention, and is not intended to limit the present invention in any way, and all simple modifications and equivalent variations of the above embodiments according to the technical spirit of the present invention are included in the scope of the present invention.

Claims (8)

1. A electronic document conversion method capable of controlling editing rules is characterized in that existing electronic documents are firstly subjected to unification processing to obtain unified electronic documents; the existing electronic document is an electronic document with uncontrollable editing rules; then, the outline level and the conversion rule are set, and the unified electronic document is converted according to the conversion rule to obtain the unified electronic document capable of controlling the editing rule.
2. The method as claimed in claim 1, wherein during the process of unifying the existing electronic documents to obtain unified electronic documents, unique and fixed different editing identifiers are required to be set, and each editing identifier corresponds to an editing rule; and adding corresponding editing identifiers according to different editing modes required by the electronic document with uncontrollable editing rules where editing control is required.
3. The method as claimed in claim 2, wherein in the process of unifying the existing electronic documents to obtain unified electronic documents, unique and fixed different feature identifiers are required to be set, each feature identifier corresponds to a document structure, and corresponding feature identifiers are added to the electronic documents with uncontrollable editing rules where the structure is required to be divided; the structure of the document includes sections, pages, titles, headers, page numbers, and horizontal and vertical pages.
4. The method according to claim 3, wherein the electronic document with the editing rules controllable is set with the outline level corresponding to the hierarchical structure of the unified electronic document with the editing rules controllable after the conversion.
5. The method as claimed in claim 4, wherein in the process of setting the outline level and the transformation rule, transforming the unified electronic document according to the transformation rule to obtain the unified electronic document with the controllable editing rule, the editing identifier is first identified, and different tags of the electronic document are obtained by transforming according to different editing identifiers.
6. The method as claimed in claim 5, wherein in the process of setting the outline level and the transformation rule, transforming the unified electronic document according to the transformation rule to obtain the unified electronic document with the controllable editing rule, the method further needs to identify the feature identifier according to actual requirements, and add the feature of the structure of the document corresponding to the feature identifier to the electronic document that needs to be transformed into the controllable editing rule according to the feature identifier.
7. The method according to claim 6, wherein after obtaining different tags of the electronic document according to different editing identifiers and adding characteristics of the structure of the document corresponding to the characteristic identifier to the electronic document to be converted into the controllable editing rule, the electronic document is set according to a predetermined outline level to obtain a complete electronic document with controllable editing rules.
8. The method for converting an electronic document with controllable editing rules according to claim 7, wherein the specific steps of obtaining the complete electronic document with controllable editing rules according to the preset outline level setting are as follows:
firstly, identifying outline level of an electronic document with uncontrollable editing rules;
secondly, recording the outline level content of the electronic document with uncontrollable editing rules;
then, splitting the electronic document with the uncontrollable editing rule into a plurality of sub-documents according to the final level of the outline of the electronic document with the uncontrollable editing rule;
then, converting the plurality of split sub-documents into sub-documents capable of controlling editing rules;
and finally, according to the recorded outline level content, assembling and establishing the subdocuments of the controllable editing rule into a complete electronic document of the controllable editing rule corresponding to the structural hierarchy.
CN202110376820.0A 2021-04-08 2021-04-08 Electronic document conversion method capable of controlling editing rule Withdrawn CN113033162A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110376820.0A CN113033162A (en) 2021-04-08 2021-04-08 Electronic document conversion method capable of controlling editing rule

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110376820.0A CN113033162A (en) 2021-04-08 2021-04-08 Electronic document conversion method capable of controlling editing rule

Publications (1)

Publication Number Publication Date
CN113033162A true CN113033162A (en) 2021-06-25

Family

ID=76454297

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110376820.0A Withdrawn CN113033162A (en) 2021-04-08 2021-04-08 Electronic document conversion method capable of controlling editing rule

Country Status (1)

Country Link
CN (1) CN113033162A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113779931A (en) * 2021-08-31 2021-12-10 民商数字科技(深圳)有限公司 Knowledge base construction method based on Word and control method thereof

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113779931A (en) * 2021-08-31 2021-12-10 民商数字科技(深圳)有限公司 Knowledge base construction method based on Word and control method thereof

Similar Documents

Publication Publication Date Title
EP1672524B1 (en) Systems and methods for converting a formatted document to a web page
CN110083805B (en) Method and system for converting Word file into EPUB file
CN111753099B (en) Method and system for enhancing relevance of archive entity based on knowledge graph
CN103324731A (en) Method and device for converting spreadsheet file into web format file
JP2020187733A (en) Application programming interface documentation annotation
CN1702617A (en) Method and system for mapping content between a starting template and a target template
CN106021394A (en) Website construction method and apparatus
CN103631604B (en) A kind of standardized digital education resource management method based on CELTS-41 specification
CN107203498A (en) A kind of method, system and its user terminal and server for creating e-book
CN102982439A (en) Method for generating electronic document and electronic document generating system
JP2022547895A (en) User-customized automatic document creation method, device and server therefor
CN101430684A (en) Method and apparatus for mutual conversion between Chinese work office software document and documents with other format
CN104281589A (en) Mathematical formula searching method and device
US20030222916A1 (en) Object-oriented processing of tab text
CN113033162A (en) Electronic document conversion method capable of controlling editing rule
CN102110108B (en) Method and device for processing galley proof file
CN113850056A (en) Document key information extraction method and system based on keyword splitting technology
CN111708529B (en) Method for generating form through dragging based on angular
CN106815181A (en) Conversion method and device of a kind of indd files of InDesign typesettings to Office files
CN101464875B (en) Method for representing electronic dictionary catalog data by XML
CN115841095A (en) Document establishing method, system, medium and equipment
CN112818687B (en) Method, device, electronic equipment and storage medium for constructing title recognition model
CN112199966B (en) Translation method, translation device, electronic device and storage medium
CN111401005B (en) Text conversion method and device and readable storage medium
KR101133567B1 (en) Method and server for publishing on demand

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20210625