CN113031887A - Method for supporting various headers and subsection printing in html file printing

Info

Publication number
CN113031887A
Authority
CN
China
Prior art keywords
content
input box
width
printing
file
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN202110376826.8A
Other languages
Chinese (zh)
Inventor
林侨
王举
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Weishilian Software Technology Co ltd
Original Assignee
Chengdu Weishilian Software Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/12 Digital output to print unit, e.g. line printer, chain printer
    • G06F3/1201 Dedicated interfaces to print systems
    • G06F3/1202 Dedicated interfaces to print systems specifically adapted to achieve a particular effect
    • G06F3/1203 Improving or facilitating administration, e.g. print management
    • G06F3/1205 Improving or facilitating administration, e.g. print management resulting in increased flexibility in print job configuration, e.g. job settings, print requirements, job tickets
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/12 Digital output to print unit, e.g. line printer, chain printer
    • G06F3/1201 Dedicated interfaces to print systems
    • G06F3/1223 Dedicated interfaces to print systems specifically adapted to use a particular technique
    • G06F3/1237 Print job management
    • G06F3/1241 Dividing a job according to job requirements, e.g. black/white and colour pages, covers and body of books, tabs
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/12 Digital output to print unit, e.g. line printer, chain printer
    • G06F3/1201 Dedicated interfaces to print systems
    • G06F3/1223 Dedicated interfaces to print systems specifically adapted to use a particular technique
    • G06F3/1237 Print job management
    • G06F3/1242 Image or content composition onto a page
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/12 Digital output to print unit, e.g. line printer, chain printer
    • G06F3/1201 Dedicated interfaces to print systems
    • G06F3/1223 Dedicated interfaces to print systems specifically adapted to use a particular technique
    • G06F3/1237 Print job management
    • G06F3/1244 Job translation or job parsing, e.g. page banding
    • G06F3/1246 Job translation or job parsing, e.g. page banding by handling markup languages, e.g. XSL, XML, HTML

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The invention provides a method for supporting multiple headers and section-based printing when printing HTML files. By agreeing on category identifiers, calculating the height of input boxes, splitting the file into subfiles, and merging the subfiles, the method automatically prints HTML files with headers, footers, sections, and the like. This preserves the accuracy and timeliness of the filled-in data while improving the efficiency of printing the file.

Description

Method for supporting various headers and subsection printing in html file printing
Technical Field
The invention belongs to the technical field of electronic document conversion, and particularly relates to a method for supporting various headers and subsection printing in html file printing.
Background
Currently, electronic records are mainly filled in using either office files, such as Word and Excel documents, or HTML files as the carrier. Office files make it difficult to customize input controls and are unsuitable for filling in on a mobile terminal. HTML files do not support multiple headers or a mix of landscape and portrait pages.
In the prior art, custom HTML printing is mainly achieved by rearranging the layout at print time, so that the final file becomes the required document containing the complete information added by the user.
This rearrangement requires user intervention at the time of final printing. However, industries such as pharmaceuticals and food require data to be accurate and recorded in real time: filled-in content must not be altered, and filling must not be delayed. Manual intervention introduces unavoidable errors and reduces working efficiency.
Disclosure of Invention
To address the defects of the prior art, the invention provides a method for supporting multiple headers and section-based printing in HTML file printing. By agreeing on category identifiers, calculating the height of input boxes, splitting the file into subfiles, and merging the subfiles, the method automatically prints HTML files with headers, footers, sections, and the like. This preserves the accuracy and timeliness of the filled-in data while improving the efficiency of printing the file.
The invention is implemented as follows:
The invention provides a method for supporting multiple headers and section-based printing in HTML file printing, comprising the following steps:
Step 1: agree on the category identifiers;
Step 2: according to the content format of the HTML file on the webpage, add the corresponding category identifier at the corresponding position in the source code, and use the HTML file with the added category identifiers as the input template file;
Step 3: perform job filling with the obtained template file, keeping the category identifiers on the template file until the job filling is completed; after the job filling is completed, archive the whole template file together with the filled-in job information to obtain an archived webpage file;
Step 4: extract the paragraphs marked with category identifiers from the archived webpage file and collect them into a set; read the marked paragraphs in order and split the archived webpage file using the marked paragraphs as dividing points, decomposing it into an identifier set and a content set; finally, save the identifier set and the content set, in order, as a plurality of subfiles;
Step 5: read each content subfile obtained in step 4, locate the input content in the content subfile, and recalculate the height at the position of the input content; adjust the height of the content subfile according to the recalculated height;
Step 6: match the identifier subfiles one-to-one with the height-adjusted content subfiles according to the correspondence between identifier and content, and convert them into a plurality of subfiles;
Step 7: splice the converted subfiles to obtain the complete converted document.
To better implement the invention, further, step 5 specifically comprises:
Step 5.1: obtain the text content entered in an input box of the content subfile;
Step 5.2: obtain the width of the input box from the style attribute of the input box;
Step 5.3: obtain the font information of the input box;
Step 5.4: obtain the width required to display the text completely in the current font, compare it with the width of the current input box, and adjust accordingly.
To better implement the invention, further, in step 5.2, if the input box has no width set, the width of its parent node is obtained and the width of the content other than the input box is subtracted from it to obtain the width of the input box.
To better implement the invention, further, in step 5.4: obtain the width required to display the text completely in the current font; if the required width is larger than the width of text that the input box can display, calculate and adjust the height of the input box; otherwise, leave the height of the input box unchanged.
To better implement the invention, further, for an input box whose height needs adjusting, the text of the input box is cut from the beginning; each time the cut text reaches the width that exactly one line can display, the number of lines occupied by the input box is increased by one. The remaining text is then cut in the same way, and so on, until the width occupied by the remaining text is smaller than the displayable width of the input box, which gives the final number of lines of the input box.
To better implement the invention, further, the category identifiers include a header identifier, a footer identifier, a landscape page identifier, a portrait page identifier, and a section identifier.
To better implement the invention, further, the subfiles converted in step 6 are subfiles in PDF format.
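The line-counting rule described above (cut the text from the beginning, one displayable line at a time) can be sketched in Python as follows. This is only an illustrative sketch: the function names, the `measure` callback, and the fixed per-character width are assumptions introduced here, standing in for real font metrics, and explicit line breaks in the text are not handled.

```python
from typing import Callable


def count_lines(text: str, box_width: float, measure: Callable[[str], float]) -> int:
    """Count the lines a text occupies in an input box of the given displayable width.

    `measure` is a hypothetical callback returning the rendered width of a string
    in the current font; a real implementation would use actual font metrics.
    """
    if box_width <= 0:
        raise ValueError("box_width must be positive")
    lines = 0
    remaining = text
    while remaining:
        # Cut from the beginning: take the longest prefix that fits on one line.
        # At least one character is always consumed so the loop terminates.
        cut = 1
        while cut < len(remaining) and measure(remaining[:cut + 1]) <= box_width:
            cut += 1
        remaining = remaining[cut:]
        lines += 1
    # An empty input box still occupies one line.
    return max(lines, 1)


def fixed_width(px_per_char: float) -> Callable[[str], float]:
    """Assumed stand-in for font metrics: every character has the same width."""
    return lambda s: len(s) * px_per_char
```

For example, with an assumed 10 px per character, 25 characters in a 100 px wide box occupy three lines (10 + 10 + 5 characters).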
Drawings
FIG. 1 is a partial screenshot of an original Word document with header 1 after conversion to an HTML file;
FIG. 2 is a partial screenshot of an original Word document with header 2 after conversion to an HTML file;
FIG. 3 is a partial screenshot of the webpage source code with section identifiers added;
FIG. 4 is a partial screenshot of a post-job file in which some of the input points have been filled in;
FIG. 5 is a partial screenshot of the split header set;
FIG. 6 is a partial screenshot of the split content set;
FIG. 7 is a partial screenshot before the line height is recalculated;
FIG. 8 is a partial screenshot after the line height is recalculated;
FIG. 9 is a screenshot of the set of converted portable document files;
FIG. 10 is a screenshot of the final portable document file.
Detailed Description
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it should be understood that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments, and therefore should not be considered as a limitation to the scope of protection. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, are within the scope of the present invention.
In the description of the present invention, it is to be noted that, unless otherwise explicitly specified or limited, the terms "disposed," "connected," and "connected" are to be construed broadly, and may be, for example, fixedly connected, detachably connected, or integrally connected; can be mechanically or electrically connected; they may be connected directly or indirectly through intervening media, or they may be interconnected between two elements. The specific meanings of the above terms in the present invention can be understood in specific cases to those skilled in the art.
Example 1:
This embodiment provides a method for supporting multiple headers and section-based printing in HTML file printing, comprising the following steps:
Step 1: agree on the category identifiers; the category identifiers include a header identifier, a footer identifier, a landscape page identifier, a portrait page identifier, and a section identifier;
Step 2: according to the content format of the HTML file on the webpage, add the corresponding category identifier at the corresponding position in the source code, and use the HTML file with the added category identifiers as the input template file; a source code example is given under the working principle below;
Step 3: perform job filling with the obtained template file, keeping the category identifiers on the template file until the job filling is completed; after the job filling is completed, archive the whole template file together with the filled-in job information to obtain an archived webpage file;
Step 4: extract the paragraphs marked with category identifiers from the archived webpage file and collect them into a set; read the marked paragraphs in order and split the archived webpage file using the marked paragraphs as dividing points, decomposing it into an identifier set and a content set; finally, save the identifier set and the content set, in order, as a plurality of subfiles;
Step 5: read each content subfile obtained in step 4, locate the input content in the content subfile, and recalculate the height at the position of the input content; adjust the height of the content subfile according to the recalculated height; step 5 specifically comprises:
Step 5.1: obtain the text content entered in an input box of the content subfile;
Step 5.2: obtain the width of the input box from the style attribute of the input box; if the input box has no width set, obtain the width of its parent node and subtract the width of the content other than the input box to obtain the width of the input box;
Step 5.3: obtain the font information of the input box;
Step 5.4: obtain the width required to display the text completely in the current font and compare it with the width of the current input box; if the required width is larger than the width of text that the input box can display, calculate and adjust the height of the input box; otherwise, leave the height of the input box unchanged; for an input box whose height needs adjusting, cut the text of the input box from the beginning, and each time the cut text reaches the width that exactly one line can display, increase the number of lines occupied by the input box by one; then continue cutting the remaining text in the same way until the width occupied by the remaining text is smaller than the displayable width of the input box, which gives the final number of lines of the input box;
Step 6: match the identifier subfiles one-to-one with the height-adjusted content subfiles according to the correspondence between identifier and content, and convert them into a plurality of subfiles in PDF format;
Step 7: splice the obtained PDF subfiles to obtain the complete converted document in PDF format.
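The width lookup of step 5.2 can be illustrated with a short Python sketch. It assumes, purely for illustration, that widths are given in `px` in the inline style attribute and that the parent width and the widths of the other content are already known as numbers; the function names are hypothetical and do not come from the patent.

```python
import re
from typing import Optional, Sequence


def parse_style(style: str) -> dict[str, str]:
    """Split an inline style attribute into a property -> value mapping."""
    props: dict[str, str] = {}
    for declaration in style.split(";"):
        name, sep, value = declaration.partition(":")
        if sep and name.strip():
            props[name.strip().lower()] = value.strip()
    return props


def px(value: Optional[str]) -> Optional[float]:
    """Return the number of pixels in a value such as '132px', else None."""
    if value is None:
        return None
    match = re.fullmatch(r"\s*(\d+(?:\.\d+)?)px\s*", value)
    return float(match.group(1)) if match else None


def input_box_width(style: str, parent_width: float,
                    other_content_widths: Sequence[float]) -> float:
    """Width of the input box: its own 'width' if set, otherwise the parent
    width minus the width of the content other than the input box."""
    width = px(parse_style(style).get("width"))
    if width is not None:
        return width
    return max(parent_width - sum(other_content_widths), 0.0)
```

Because the style is parsed into exact property names, `min-width` is not mistaken for `width`; a value such as `width: auto` falls through to the parent-based calculation.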
The working principle is as follows: first, in step 1, marks are agreed for the required features. The main marks agreed are those for headers, footers, landscape pages, portrait pages, and sections.
Then, in step 2, the Word template is converted into an HTML file, and the HTML file is marked with the marks agreed in step 1. For example, for a fixed header, the mark "header: header" is added at the corresponding position in the webpage source code, making that paragraph a section marker for the whole document. An example of the added code is as follows:
<div
style="border:1px solid #747474;width:auto;display:inline-block !important;position:relative;overflow:hidden;header:header">
<!-- "header:header" in the style above is the agreed header mark -->
<p
style="margin-top:0pt;margin-bottom:0pt;text-align:justify;line-height:150%;widows:0;orphans:0;padding-top:1pt;padding-right:4pt;padding-left:4pt;font-size:14.7pt;">
<span style="height:0pt;text-align:left;display:block;position:absolute;z-index:-65536;"><img
src=""
width="911.4" height="8.399999999999999" alt=""
style="margin-top:13.35pt;margin-left:-2pt;-aw-left-pos:0pt;-aw-rel-hpos:column;-aw-rel-vpos:paragraph;-aw-top-pos:15.35pt;-aw-wrap-type:none;position:absolute"></span><span
style="font-family:SimSun">xxxx</span><span style="font-family:SimSun">Group Co., Ltd.</span><span
style="font-family:SimSun">xx</span><span style="font-family:SimSun">test</span><span
style="font-family:SimSun">record ZL-SRD-10-ZY009-00</span></p>
<p
style="margin-top:0pt;margin-bottom:0pt;text-align:justify;line-height:150%;widows:0;orphans:0;padding-right:4pt;padding-left:4pt;padding-bottom:1pt;font-size:14.7pt;">
<span style="font-family:SimSun">where</span><span style="font-family:'Times New Roman';">XX</span><span
style="font-family:SimSun"> </span><span style="font-family:SimSun">lot number:</span><span
style="display: inline-flex;vertical-align: middle;position:relative;font-family:'Times New Roman';font-size:12.6pt;"><textarea
type="text" name="textarea" value="807178412669145090 "
style="resize:none;vertical-align:middle;border-bottom-style:inset;color:blue;padding:0px;padding-top:2px;min-width:30px;min-height:26px;height:24px;width:132px;"
id="807178412669145090" readonly="" onclick="conClick_f(this)" onkeydown="autoTextarea(this)"
ondblclick="ondblclick_f(this)">1</textarea></span><span
style="font-family:SimSun;"> </span><span style="font-family:SimSun">:</span><span
style="display: inline-flex;vertical-align: middle;position:relative;font-family:'Times New Roman';"><textarea
type="text" name="textarea" value="807178412669145091 "
style="resize:none;vertical-align:middle;border-bottom-style:inset;color:blue;padding:0px;padding-top:2px;min-width:30px;min-height:26px;height:24px;width:92px;"
id="807178412669145091" readonly="" onclick="conClick_f(this)" onkeydown="autoTextarea(this)"
ondblclick="ondblclick_f(this)">2</textarea></span><span style="font-family:SimSun;"> </span><span
style="font-family:SimSun;">header</span><span style="font-family:'Times New Roman';">1</span></p>
</div>
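As a rough illustration of how the agreed header mark in the example above might be located in the source code, the following Python sketch scans inline style attributes for a `header` declaration using the standard library's `html.parser`. The class and function names are hypothetical, and only the header mark is handled, because the source shows the syntax of that mark alone.

```python
from html.parser import HTMLParser


class HeaderMarkFinder(HTMLParser):
    """Collect (tag, mark value) for every element whose inline style
    contains the agreed 'header' declaration, e.g. 'header: header'."""

    def __init__(self) -> None:
        super().__init__()
        self.marks: list[tuple[str, str]] = []

    def handle_starttag(self, tag, attrs):
        style = dict(attrs).get("style") or ""
        for declaration in style.split(";"):
            name, sep, value = declaration.partition(":")
            # Match the property name exactly, so unrelated properties
            # that merely contain the word are ignored.
            if sep and name.strip().lower() == "header":
                self.marks.append((tag, value.strip()))


def find_header_marks(html: str) -> list[tuple[str, str]]:
    """Return the marked elements of an HTML string in document order."""
    finder = HeaderMarkFinder()
    finder.feed(html)
    finder.close()
    return finder.marks
```

Applied to the example above, such a scan would report the outer `div` as carrying the header mark, while the paragraphs and spans inside it would not be reported.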
FIGS. 1 and 2 are partial screenshots of original Word documents after conversion to HTML files, showing header 1 and header 2 respectively, together with the effect of converting landscape and portrait pages.
FIG. 3 shows the webpage source code after section identifiers are added, annotating in detail how a header renders in the browser and where the corresponding mark is located.
In step 3, the template obtained in step 2 is used and content is filled into it. Once the job is finished, the filled-in file is obtained.
FIG. 4 is an example of a post-job file with some of the input points filled in.
In step 4, paragraphs with special meaning are located in the archived webpage source file according to the marks agreed in step 1, and the original file is split at those paragraphs.
The webpage source code is decomposed into several sub-webpage source files, one per section of content, and these are saved. Because a marked paragraph carries special meaning, it is also saved as a sub-webpage source file corresponding to its content. For example, when the mark is a header mark, the pieces are saved as header 1, header 2, ..., content 1, content 2, ....
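The splitting and naming scheme just described (header 1, header 2, ..., content 1, content 2, ...) can be sketched as below. The sketch assumes the archived source has already been cut into top-level blocks, each labelled as a marked header or as ordinary content; that pre-classification, the function name, and the `content 0` bucket for content appearing before the first header are assumptions made for illustration.

```python
def split_into_subfiles(blocks: list[tuple[str, str]]) -> tuple[dict[str, str], dict[str, str]]:
    """Split (kind, html) blocks into header subfiles and content subfiles.

    kind is 'header' for a marked paragraph and 'content' otherwise.
    Each header acts as a dividing point; the content that follows it is
    stored under the same index.
    """
    headers: dict[str, str] = {}
    contents: dict[str, str] = {}
    index = 0
    for kind, html in blocks:
        if kind == "header":
            index += 1
            headers[f"header {index}"] = html
        else:
            key = f"content {index}"
            contents[key] = contents.get(key, "") + html
    return headers, contents
```

For instance, a header followed by two paragraphs and then a second header followed by one paragraph yields `header 1`, `header 2`, `content 1` (both paragraphs), and `content 2`.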
FIG. 5 shows the split header set; the files shown are temporarily stored header files.
FIG. 6 shows the split content set; the files shown are temporarily stored content files.
In step 5, each split content file is read and the line height of its content is recalculated.
FIG. 7 shows the effect before the line height is recalculated: the content of the box is not fully displayed.
FIG. 8 shows the effect after the line height is recalculated: the box has the recalculated line height.
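Once the number of lines is known, the height of an input box can be rewritten in its inline style. The Python sketch below shows one possible way to do this; the function name is hypothetical, and the 24 px height per line is an assumption taken from the `height:24px` value in the example source code, not a figure stated by the method.

```python
import re

# Matches the 'height' declaration only, not 'min-height' or 'line-height'.
_HEIGHT = re.compile(r"(?<![\w-])height\s*:\s*[^;]*")


def set_height(style: str, lines: int, line_height_px: float = 24.0) -> str:
    """Return the style with 'height' set to lines * line_height_px pixels."""
    new_height = f"height:{lines * line_height_px:g}px"
    if _HEIGHT.search(style):
        return _HEIGHT.sub(new_height, style, count=1)
    # No height declared yet: append one, adding a separator if needed.
    stripped = style.strip()
    separator = "" if not stripped or stripped.endswith(";") else ";"
    return f"{style}{separator}{new_height};"
```

With three lines, a style of `min-height:26px;height:24px;width:132px;` would become `min-height:26px;height:72px;width:132px;`, leaving the other declarations untouched.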
In step 6, the height-adjusted files are converted into portable document files (PDF).
FIG. 9 shows the set of converted portable document files, in which both landscape and portrait files can be seen.
In step 7, the portable document files obtained in step 6 are merged to obtain the final portable document file.
FIG. 10 shows the final portable document file; its upper and lower parts are a portrait page and a landscape page respectively.
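The one-to-one matching of identifier subfiles and content subfiles before conversion, and the order in which the converted files are then merged, can be sketched as follows. The sketch assumes the `header N` / `content N` naming used above and that every content subfile has a header with the same index; the function name is hypothetical, and the PDF conversion and merging themselves are outside its scope.

```python
def pair_subfiles(header_names: list[str], content_names: list[str]) -> list[tuple[str, str]]:
    """Pair 'header N' with 'content N' and return the pairs in merge order."""

    def index(name: str) -> int:
        # 'header 12' -> 12; the numeric suffix decides the order.
        return int(name.rsplit(" ", 1)[1])

    headers = {index(name): name for name in header_names}
    pairs: list[tuple[str, str]] = []
    for content in sorted(content_names, key=index):
        i = index(content)
        if i not in headers:
            raise KeyError(f"no header subfile for {content!r}")
        pairs.append((headers[i], content))
    return pairs
```

Sorting by the numeric suffix rather than by name keeps `content 10` after `content 2`, so the merged document follows the original section order.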
The above description is only a preferred embodiment of the present invention, and is not intended to limit the present invention in any way, and all simple modifications and equivalent variations of the above embodiments according to the technical spirit of the present invention are included in the scope of the present invention.

Claims (7)

1. A method for supporting multiple headers and section-based printing in HTML file printing, characterized by comprising the following steps:
Step 1: agree on the category identifiers;
Step 2: according to the content format of the HTML file on the webpage, add the corresponding category identifier at the corresponding position in the source code, and use the HTML file with the added category identifiers as the input template file;
Step 3: perform job filling with the obtained template file, keeping the category identifiers on the template file until the job filling is completed; after the job filling is completed, archive the whole template file together with the filled-in job information to obtain an archived webpage file;
Step 4: extract the paragraphs marked with category identifiers from the archived webpage file and collect them into a set; read the marked paragraphs in order and split the archived webpage file using the marked paragraphs as dividing points, decomposing it into an identifier set and a content set; finally, save the identifier set and the content set, in order, as a plurality of subfiles;
Step 5: read each content subfile obtained in step 4, locate the input content in the content subfile, and recalculate the height at the position of the input content; adjust the height of the content subfile according to the recalculated height;
Step 6: match the identifier subfiles one-to-one with the height-adjusted content subfiles according to the correspondence between identifier and content, and convert them into a plurality of subfiles;
Step 7: splice the converted subfiles to obtain the complete converted document.
2. The method for supporting multiple headers and section-based printing in HTML file printing as claimed in claim 1, wherein step 5 specifically comprises:
Step 5.1: obtain the text content entered in an input box of the content subfile;
Step 5.2: obtain the width of the input box from the style attribute of the input box;
Step 5.3: obtain the font information of the input box;
Step 5.4: obtain the width required to display the text completely in the current font, compare it with the width of the current input box, and adjust accordingly.
3. The method for supporting multiple headers and section-based printing in HTML file printing as claimed in claim 2, wherein in step 5.2, if the input box has no width set, the width of its parent node is obtained and the width of the content other than the input box is subtracted from it to obtain the width of the input box.
4. The method for supporting multiple headers and section-based printing in HTML file printing as claimed in claim 2, wherein in step 5.4: the width required to display the text completely in the current font is obtained; if the required width is larger than the width of text that the input box can display, the height of the input box is calculated and adjusted; otherwise, the height of the input box is left unchanged.
5. The method for supporting multiple headers and section-based printing in HTML file printing as claimed in claim 4, wherein for an input box whose height needs adjusting, the text of the input box is cut from the beginning, and each time the cut text reaches the width that exactly one line can display, the number of lines occupied by the input box is increased by one; the remaining text is then cut in the same way, and so on, until the width occupied by the remaining text is smaller than the displayable width of the input box, which gives the final number of lines of the input box.
6. The method for supporting multiple headers and section-based printing in HTML file printing as claimed in claim 1, wherein the category identifiers include a header identifier, a footer identifier, a landscape page identifier, a portrait page identifier, and a section identifier.
7. The method for supporting multiple headers and section-based printing in HTML file printing as claimed in claim 1, wherein the subfiles converted in step 6 are subfiles in PDF format.
CN202110376826.8A 2021-04-08 2021-04-08 Method for supporting various headers and subsection printing in html file printing Withdrawn CN113031887A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110376826.8A CN113031887A (en) 2021-04-08 2021-04-08 Method for supporting various headers and subsection printing in html file printing

Publications (1)

Publication Number Publication Date
CN113031887A true CN113031887A (en) 2021-06-25

Family

ID=76454197

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110376826.8A Withdrawn CN113031887A (en) 2021-04-08 2021-04-08 Method for supporting various headers and subsection printing in html file printing

Country Status (1)

Country Link
CN (1) CN113031887A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113836092A (en) * 2021-09-27 2021-12-24 北京来也网络科技有限公司 File comparison method, device, equipment and storage medium based on RPA and AI

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WW01 Invention patent application withdrawn after publication

Application publication date: 20210625