CN113031887A

CN113031887A - Method for supporting various headers and subsection printing in html file printing

Info

Publication number: CN113031887A
Application number: CN202110376826.8A
Authority: CN
Inventors: 林侨; 王举
Original assignee: Chengdu Weishilian Software Technology Co ltd
Current assignee: Chengdu Weishilian Software Technology Co ltd
Priority date: 2021-04-08
Filing date: 2021-04-08
Publication date: 2021-06-25

Abstract

The invention provides a method for supporting various headers and subsection printing in html file printing, which realizes automatic printing of html files in the forms of page numbers, page feet, segmentation and the like by setting classification identifiers, calculating the height of an input box, dividing subfiles, integrating subfiles and the like. The accuracy and the real-time performance of data filling are guaranteed, and meanwhile the efficiency of printing the file is improved.

Description

Method for supporting various headers and subsection printing in html file printing

Technical Field

The invention belongs to the technical field of electronic document conversion, and particularly relates to a method for supporting various headers and subsection printing in html file printing.

Background

Currently, electronic records are mainly filled in by using office files such as word, excel and the like as carriers, or html files as carriers. The Office file is not convenient for user-defined input control and is not suitable for filling in by a mobile terminal. The Html file does not support a variety of header and rail page blends.

The custom html printing support in the prior art is mainly realized by rearranging the printing during printing. And the final file can obtain the required document containing complete information according to the information added by the user.

At the time of final printing, rearranging the print requires intervention by the user at the time of final printing. However, data in industries such as medicine and food are required to ensure accuracy and real-time performance of the data. The required filling cannot be changed, and the filling cannot be delayed. The manual intervention brings many inevitable errors and reduces the working efficiency.

Disclosure of Invention

Aiming at the defects in the prior art, the invention provides a method for supporting various headers and subsection printing in html file printing, which realizes automatic printing of html files in the forms of page numbers, page feet, segmentation and the like by setting classification identifiers, calculating the height of an input box, dividing subfiles, integrating subfiles and the like. The accuracy and the real-time performance of data filling are guaranteed, and meanwhile the efficiency of printing the file is improved.

The specific implementation content of the invention is as follows:

the invention provides a method for supporting various headers and subsection printing in html document printing, which comprises the following steps:

step 1: appointing a category identifier;

step 2: adding corresponding category identification at a corresponding source code according to the content format of the html file on the webpage, and taking the html file added with the corresponding category identification as an input template file;

and step 3: performing job filling by using the obtained template file, and keeping the category identification added on the template file until the job filling is completed; after the operation filling is completed, filing the whole template file and the filled operation information to obtain a filed webpage file;

and 4, step 4: respectively taking out the paragraphs segmented according to the category identification in the filed webpage file, and assembling the paragraphs added with the category identification by a set; reading the paragraph set marked by the classified identification in sequence, dividing the filed webpage file according to the marked paragraphs as dividing points, decomposing the filed webpage file into an identification set and a content set, and finally storing the identification set and the content set into a plurality of subfiles respectively in sequence;

and 5: reading the single content subfile obtained in the step 4, finding out the position of the input content in the content subfile, and recalculating the height of the position of the input content; adjusting the file height of the content subfile according to the recalculated height;

step 6: the height-adjusted content subfiles are converted into a plurality of subfiles by corresponding the identifier subfiles and the content subfiles one by one according to the corresponding modes of the identifier and the content;

and 7: and splicing the obtained plurality of sub-files to obtain the converted complete document.

In order to better implement the present invention, further, the specific operation of step 5 includes:

step 5.1: acquiring text content input by an input box of the content subfile;

step 5.2: obtaining the width of the input box according to the style attribute of the input box;

step 5.3: acquiring font information of an input box;

step 5.4: and acquiring the width required by the complete display of the text under the current font, comparing the width according to the width of the current input box, and adjusting.

In order to better implement the present invention, in step 5.2, if the width of the input box is not set, the width of the corresponding parent node is obtained, and the width of the content other than the input box is subtracted to obtain the width of the input box.

In order to better implement the present invention, further, in step 5.4: acquiring the width required for the complete text display under the current font, and if the required width is larger than the width of the text which can be displayed by the input box, performing height calculation on the input box and adjusting; otherwise, the height of the input box is not adjusted.

In order to better realize the invention, further, for the input box which needs to be height-adjusted, the text of the input box is cut from the beginning, and when the text is cut to the width that just one line of content can be displayed, the number of lines occupied by the input box is counted and increased by one; and then, continuing to cut the residual text, and so on until the occupied width of the residual text is smaller than the displayable width of the input box, so as to obtain the final number of lines of the input box.

In order to better implement the present invention, further, the category identifier includes a header identifier, a footer identifier, a horizontal page identifier, a vertical page identifier, and a segment identifier.

In order to better implement the present invention, further, the subfiles converted in step 6 are subfiles in PDF format.

Drawings

FIG. 1 is a partial computer screenshot presentation after an original word and word are converted to an html file with header 1;

FIG. 2 is a partial computer screenshot presentation after an original word and word with header 2 have been converted to an html file;

FIG. 3 is a partial computer screen shot showing a webpage source code with section identifiers added;

FIG. 4 is an illustration of a post-job file with a computer screenshot showing a portion of the input points filled in;

FIG. 5 is a partial computer screen shot illustration of a split header collection;

FIG. 6 is a partial computer screen shot display of the split content;

FIG. 7 is a partial computer screenshot showing before recalculating line height;

FIG. 8 is a partial computer screen shot showing after recalculating line height;

FIG. 9 is a computer screenshot presentation of a converted portable document file collection;

FIG. 10 is a computer screenshot presentation of a final portable document file.

Detailed Description

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it should be understood that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments, and therefore should not be considered as a limitation to the scope of protection. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, are within the scope of the present invention.

In the description of the present invention, it is to be noted that, unless otherwise explicitly specified or limited, the terms "disposed," "connected," and "connected" are to be construed broadly, and may be, for example, fixedly connected, detachably connected, or integrally connected; can be mechanically or electrically connected; they may be connected directly or indirectly through intervening media, or they may be interconnected between two elements. The specific meanings of the above terms in the present invention can be understood in specific cases to those skilled in the art.

Example 1:

the embodiment provides a method for supporting various headers and subsection printing in html document printing, which comprises the following steps:

step 1: appointing a category identifier; the category identification comprises a header identification, a footer identification, a transverse page identification, a vertical page identification and a segment identification;

step 2: adding corresponding category identification at a corresponding source code according to the content format of the html file on the webpage, and taking the html file added with the corresponding category identification as an input template file; specific source code examples are as follows:

and 5: reading the single content subfile obtained in the step 4, finding out the position of the input content in the content subfile, and recalculating the height of the position of the input content; adjusting the file height of the content subfile according to the recalculated height; the specific operation of the step 5 comprises the following steps:

step 5.1: acquiring text content input by an input box of the content subfile;

step 5.2: obtaining the width of the input box according to the style attribute of the input box; if the input box has no set width, acquiring the width of the corresponding parent node, and subtracting the widths of other contents except the input box to obtain the width of the input box;

step 5.3: acquiring font information of an input box;

step 5.4: acquiring the width required by the complete display of the text under the current font, comparing the width according to the width of the current input box, and adjusting the width; acquiring the width required for the complete text display under the current font, and if the required width is larger than the width of the text which can be displayed by the input box, performing height calculation on the input box and adjusting; otherwise, the height of the input box is not adjusted; for an input box needing height adjustment, cutting the text of the input box from the beginning, and counting the number of lines occupied by the input box by one when the text of the input box is cut to the width which can be displayed by just one line of content; then, the residual text after cutting is continuously cut, and the like, until the occupied width of the residual text is smaller than the displayable width of the input box, and the final number of lines of the input box is obtained;

step 6: the height-adjusted content subfiles are subjected to one-to-one correspondence between the identifier subfiles and the content subfiles according to the corresponding identifier and content modes, and a plurality of subfiles in PDF format are obtained through conversion;

and 7: and splicing the obtained plurality of subfiles in the PDF format to obtain a complete document in the converted PDF format.

The working principle is as follows: firstly, appointing marking in step 1, and marking appointment for the required characteristics. The main convention is that the marks of header, footer, horizontal page, vertical page and segment.

And then converting the word template into an html file in step 2, and marking the html file by using the mark agreed in step 1. Such as a fixed header flag: and adding a mark to a position corresponding to the source code of the webpage by a header. Making this paragraph a section marker for the entire document. An example of code addition is as follows:

<div

style = "border:1px saved #747474; width: auto; display: inline-block! important; position: relative; overflow: hidden; header: header" > (here: where the agreed header mark is added)

<p

style="margin-top:0pt;margin-bottom:0pt;text-align:justify;line-height:150%;widows:0;orphans:0;padding-top:1pt;padding-right:4pt;padding-left:4pt;font-size:14.7pt;">

<span style="height:0pt;text-align:left;display:block;position:absolute;z-index:-65536;"><img

src="data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAosAAAAGCAYAAACrW9Q4AAAACXBIWXMAAA7DAAAOwwHHb6hkAAAAQElEQVR4nO3WIQ4AIBADwfv/p0FiIOTUVcwm4ytbdVoAAFCPpkcBAJDh2vQoAAAySJIkSb2mHywAABkkSZKkfxvnOw5/bjx0qAAAAABJRU5ErkJggg=="

width="911.4" height="8.399999999999999" alt=""

style="margin-top:13.35pt;margin-left:-2pt;-aw-left-pos:0pt;-aw-rel-hpos:column;-aw-rel-vpos:paragraph;-aw-top-pos:15.35pt;-aw-wrap-type:none;position:absolute"></span><span

style = "font-family: song body" > xxxx ] < span = "font-family: song body" > group ltd

style = "font-family: song body" > xx </span > < span = "font-family: song body" > test </span > < span

style = "font-family: Song dynasty" > record ZL-SRD-10-ZY009-00</span >)

<p

style="margin-top:0pt;margin-bottom:0pt;text-align:justify;line-height:150%;widows:0;orphans:0;padding-right:4pt;padding-left:4pt;padding-bottom:1pt;font-size:14.7pt;">

< span style = "font-family: song body": where < span style = 'font-family:' Times New Roman ';' XX </span > < span:

style = "font-family: song body" > </span > < span = "font-family: song body" > lot number: [ span > < span ]

style="display: inline-flex;vertical-align: middle;position:relative;font-family:'Times New Roman';font-size:12.6pt;"><textarea

type="text" name="textarea" value="807178412669145090 "

style="resize:none;vertical-align:middle;border-bottom-style:inset;color:blue;padding:0px;padding-top:2px;min-width:30px;min-height:26px;height:24px;width:132px;"

id="807178412669145090" readonly="" onclick="conClick_f(this)" onkeydown="autoTextarea(this)"

ondblclick="ondblclick_f(this)">1</textarea></span><span

style = "font-family: song body," > </span > < span style = "font-family: song body": [ span > < span ]

style="display: inline-flex;vertical-align: middle;position:relative;font-family:'Times New Roman';"><textarea

type="text" name="textarea" value="807178412669145091 "

style="resize:none;vertical-align:middle;border-bottom-style:inset;color:blue;padding:0px;padding-top:2px;min-width:30px;min-height:26px;height:24px;width:92px;"

id="807178412669145091" readonly="" onclick="conClick_f(this)" onkeydown="autoTextarea(this)"

Andbllick = "andbllick _ f (this)" >2</textarea > </span > < span style = "font-family: Song somatic;" > </span > < span =

style = ' font-family: song body [ ] "> header </span > < span style =" font-family: ' Times New Roman '; ">1</span > </p >

</div>；

FIGS. 1 and 2 are partial screenshots after original word and word are converted into html files, respectively, and are respectively expressed as header 1, header 2 and effect after horizontal and vertical page conversion.

FIG. 3 is a diagram of the effect of adding section identifiers to the source code of a web page, wherein the presentation effect of a header on a browser and the position of a corresponding mark are marked in detail.

And (5) obtaining the template by using the step (2), and filling the content into the template. And obtaining the file of the filled content after the operation.

FIG. 4 is an example of a post-job file with partial entry points filled in.

And (3) finding out paragraphs with special meanings according to the marks agreed in the step 1 in the webpage source code file obtained in the step 2. And splitting the original file according to the obtained paragraphs.

The webpage source code is decomposed into a plurality of subnet page source codes of a plurality of sections of contents, and the subnet page source codes are stored. When the mark section has special meaning, the mark section is also stored as the sub-network page source code corresponding to the content. E.g., the e-mark is a header mark, and is saved as header 1, header 2 …, content 1, content 2 ….

FIG. 5 is a diagram of a split header assembly effect, wherein the file is a temporarily stored header file.

Fig. 6 is a diagram of the effect of the split content collection, in which the file is a temporarily stored content file.

In step 5, the split content file is read, and the content line height is newly calculated.

Fig. 7 is a diagram of the effect before line height recalculation, and the frame is not fully displayed.

FIG. 8 is a plot of the effect after row height recalculation, with the box being the recalculated row height.

The file obtained in step 3 is used in step 6 and converted into a portable document file.

FIG. 9 is a diagram of the effect of the converted portable document file set, wherein the file is the converted portable document file, and a plurality of files with horizontal and vertical sides can be seen.

And 7, combining the portable document files obtained in the step 6 to obtain a final portable document file.

FIG. 10 final portable document file effect diagram. The upper part and the lower part are respectively a vertical page and a horizontal page.

The above description is only a preferred embodiment of the present invention, and is not intended to limit the present invention in any way, and all simple modifications and equivalent variations of the above embodiments according to the technical spirit of the present invention are included in the scope of the present invention.

Claims

1. A method for supporting multiple header and subsection printing in html document printing, which is characterized by comprising the following steps:

step 1: appointing a category identifier;

2. The method for supporting multiple header and subsection printing in html document printing as claimed in claim 1, wherein the specific operation of step 5 includes:

step 5.1: acquiring text content input by an input box of the content subfile;

step 5.3: acquiring font information of an input box;

3. The method for supporting multiple headers and sections in html document printing as claimed in claim 2, wherein in step 5.2, if the width of the input box is not set, the width of the corresponding parent node is obtained, and then the width of the content except the input box is subtracted to obtain the width of the input box.

4. A method of supporting multi-header, staged printing in html document printing as claimed in claim 2, wherein in step 5.4: acquiring the width required for the complete text display under the current font, and if the required width is larger than the width of the text which can be displayed by the input box, performing height calculation on the input box and adjusting; otherwise, the height of the input box is not adjusted.

5. The method for supporting multi-header and multi-section printing in html document printing as claimed in claim 4, wherein for the input box which needs to be height-adjusted, the text of the input box is cut from the beginning, and when the text is cut to the width that just one line of content can be displayed, the number of lines occupied by the input box is counted and increased by one; and then, continuing to cut the residual text, and so on until the occupied width of the residual text is smaller than the displayable width of the input box, so as to obtain the final number of lines of the input box.

6. The method of claim 1, wherein the category identifier comprises a header identifier, a footer identifier, a horizontal page identifier, a vertical page identifier, and a segment identifier.

7. The method for supporting multi-header and multi-section printing in html document printing as claimed in claim 1, wherein the subfile converted in step 6 is a subfile in PDF format.