CN109145266B - Method for quickly generating dynamic PDF (Portable document Format) file from picture - Google Patents

Method for quickly generating dynamic PDF (Portable document Format) file from picture Download PDF

Info

Publication number
CN109145266B
CN109145266B CN201810826386.XA CN201810826386A CN109145266B CN 109145266 B CN109145266 B CN 109145266B CN 201810826386 A CN201810826386 A CN 201810826386A CN 109145266 B CN109145266 B CN 109145266B
Authority
CN
China
Prior art keywords
slice
dynamic data
processing mode
html document
picture
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810826386.XA
Other languages
Chinese (zh)
Other versions
CN109145266A (en
Inventor
蔡勇
方传煜
陈沐腾
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Quanbao Network Technology Co ltd
Original Assignee
Quanbao Network Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Quanbao Network Technology Co ltd filed Critical Quanbao Network Technology Co ltd
Priority to CN201810826386.XA priority Critical patent/CN109145266B/en
Publication of CN109145266A publication Critical patent/CN109145266A/en
Application granted granted Critical
Publication of CN109145266B publication Critical patent/CN109145266B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/151Transformation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/174Form filling; Merging
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/186Templates

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The invention provides a method for quickly generating a dynamic PDF file from a picture, which comprises the following steps: preprocessing an original picture; taking the blank picture template as a layout background, and newly building a layer which is completely the same as the blank picture template on the blank picture template, wherein the layer is called a processing layer; slicing; exporting each slice to form an HTML document; defining a dynamic data storage database; and writing the dynamic data into an HTML document, and generating a PDF file from the HTML document by using an HTML-to-PDF plug-in. Has the advantages that: the method is simple and easy to implement, fast and convenient, has almost zero pattern error, can meet the requirement of converting the contents of very complicated dynamic pictures into PDF, and has the advantage of very high efficiency of converting the contents of the pictures into the PDF.

Description

Method for quickly generating dynamic PDF (Portable document Format) file from picture
Technical Field
The invention belongs to the technical field of PDF file generation, and particularly relates to a method for quickly generating a dynamic PDF file from a picture.
Background
A pdf (portable document format) file is an ideal format for electronic document distribution, and is an open standard for global electronic document distribution. The PDF format file is an industry standard for digital information, and its cross-media, cross-platform and security make it more and more widely used in network, and its research is becoming more and more important.
At present, the following use requirements often exist: one or more given pictures with the formats of JPG, BMP, PNG and the like are converted into corresponding PDF files, and meanwhile, the text, the number or partial picture content in the pictures are required to be dynamically modified. In a traditional mode, when a picture is generated into a dynamic PDF file, the main method adopted is as follows: the method uses a certain computer language to research the provided picture format, refers to the format requirement of the PDF file, and generates corresponding PDF files line by line. Although this method is feasible, it takes a lot of manpower and material resources, and the efficiency of generating the dynamic PDF file is low.
Disclosure of Invention
Aiming at the defects in the prior art, the invention provides a method for quickly generating a dynamic PDF file from a picture, which can effectively solve the problems.
The technical scheme adopted by the invention is as follows:
the invention provides a method for quickly generating a dynamic PDF file from a picture, which comprises the following steps:
step 1, reading an original picture to be processed, preprocessing the original picture, emptying data corresponding to a position of the original picture to be filled with dynamic data, and reserving fixed template data in the original picture to obtain a blank picture template;
step 2, taking the blank picture template as a layout background, and newly building a layer which is completely the same as the blank picture template on the blank picture template, wherein the layer is called a processing layer;
step 3, determining position areas needing to be filled with dynamic data on the processing image layer, wherein each position area can be filled with one dynamic data only once; then, slicing the position area to obtain M slices, wherein M is a natural number; each slice has a slice number; meanwhile, the slice corresponding to each slice number has the following attributes: the slice length, the slice width, the horizontal and vertical coordinates of the slice center point in the processing layer and the slice processing mode; the slice processing mode comprises a normal processing mode and a special processing mode; if the mode is the special processing mode, the special processing mode is bound here; then, exporting each slice to form an HTML document; exporting the attribute of the slice to a slice attribute table;
step 4, defining a dynamic data storage database; the dynamic data storage database comprises a data structure and dynamic data values; wherein the data structure is: for each slice, defining a corresponding data structure, wherein the data structure is a data type which can be filled by the slice and comprises numbers, characters or pictures, thereby forming a two-dimensional array [ slice serial number, data type ]; then, reading N groups of dynamic data to be filled under each data type, thereby forming a multidimensional array [ a slice serial number, dynamic data 1 and dynamic data 2 … dynamic data N ], and further forming a dynamic data storage database;
step 5, making i equal to 1; i is the serial number of the dynamic data corresponding to each slice serial number;
step 6, changing j to 1; wherein j is the serial number of the slice;
step 7, reading the ith dynamic data corresponding to the jth slice by adopting a script, and obtaining a slice processing mode corresponding to the jth slice by reading a slice attribute table; if the processing mode is the normal processing mode, directly filling the read dynamic data into the center position of the corresponding slice in the HTML document formed in the step 3; if the processing mode is the special processing mode, firstly reading a slice attribute table to obtain the attribute of the corresponding slice; performing special processing on the read dynamic data by adopting the special processing mode, and filling the dynamic data after the special processing into a corresponding slice in the HTML document formed in the step 3;
step 8, changing j to j +1, and judging whether j is larger than M; if yes, executing step 9; if not, returning to execute the step 7;
step 9, obtaining the HTML document filled with dynamic data; generating a PDF file from the HTML document by using the HTML-to-PDF plug-in, thereby obtaining a PDF file;
step 10, changing i to i +1, and judging whether i is larger than N; if yes, executing step 11; if not, returning to execute the step 7;
and 11, obtaining N PDF files.
Preferably, in step 3, the special processing mode includes a point diagram generation processing mode and a degree diagram generation processing mode.
Preferably, in step 7, the read dynamic data is specially processed by using the special processing mode, and the dynamic data after special processing is filled into the corresponding slice in the HTML document formed in step 3, specifically:
and if the processing mode is generated for the point diagram, after the dynamic data corresponding to the slice is read, converting the dynamic data into percentages according to a percentage conversion formula, positioning the dynamic data to the corresponding percentage position on the longitudinal axis of the corresponding slice, drawing a special symbol at the corresponding percentage position according to set parameters, and further completing the operation of filling the dynamic data after special processing into the corresponding slice in the HTML document formed in the step 3.
Preferably, in step 3, each slice is exported to form an HTML document, where the HTML document includes M slices, and each slice has an HTML tag img; in step 7, filling the read dynamic data into the center position of the corresponding slice in the HTML document formed in step 3, specifically: firstly, hiding a label img corresponding to the slice, and then filling dynamic data into the center position of the slice after the label img.
The method for quickly generating the dynamic PDF file from the picture provided by the invention has the following advantages that:
the method is simple and easy to implement, fast and convenient, has almost zero pattern error, can meet the requirement of converting the contents of very complicated dynamic pictures into PDF, and has the advantage of very high efficiency of converting the contents of the pictures into the PDF.
Drawings
FIG. 1 is a schematic flow chart of a method for rapidly generating a dynamic PDF file from a picture according to the present invention;
FIG. 2 is a specific example of an original picture that needs to be processed;
FIG. 3 is an exemplary diagram of a blank picture template obtained after the preprocessing of FIG. 2;
FIG. 4 is a schematic view of a slicing process;
fig. 5 is an exemplary diagram of a PDF file finally generated.
Detailed Description
In order to make the technical problems, technical solutions and advantageous effects solved by the present invention more clearly apparent, the present invention is further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
The invention provides a method for quickly generating a dynamic PDF file from a picture, which comprises the following steps with reference to FIG. 1:
step 1, reading an original picture to be processed, preprocessing the original picture, emptying data corresponding to a position of the original picture to be filled with dynamic data, and reserving fixed template data in the original picture to obtain a blank picture template; as shown in fig. 2, a specific example of an original picture that needs to be processed; the table is a questionnaire table, for a certain person to be investigated, the queen is tested for four abilities, the common sense score is 12, the visual jigsaw score is 10, the arithmetic score is 11, and the decoding score is 12. In fig. 2, a corresponding dot diagram is also generated, so that the relationship between the four capabilities can be visually expressed.
Therefore, for different respondents, it is obvious that 8 dynamic data in fig. 2 need to be changed, which are: 12. 10, 11, 12 and four corresponding large black dots. Therefore, after the processing of fig. 2, the blank picture template shown in fig. 3 can be obtained.
Step 2, taking the blank picture template as a layout background, and newly building a layer which is completely the same as the blank picture template on the blank picture template, wherein the layer is called a processing layer;
step 3, determining position areas needing to be filled with dynamic data on the processing image layer, wherein each position area can be filled with one dynamic data only once; then, slicing the position area to obtain M slices, wherein M is a natural number; each slice has a slice number; meanwhile, the slice corresponding to each slice number has the following attributes: the slice length, the slice width, the horizontal and vertical coordinates of the slice center point in the processing layer and the slice processing mode; the slice processing mode comprises a normal processing mode and a special processing mode; if the mode is the special processing mode, the special processing mode is bound here; then, exporting each slice to form an HTML document; exporting the attribute of the slice to a slice attribute table;
referring to fig. 4, a total of 8 slices are obtained, numbered 1, 2, and 3 … 8, respectively. The area corresponding to each slice is shown in bold solid lines in fig. 4. Namely: 1. the position areas corresponding to the slices 2, 3 and 4 are small cells. And the position areas corresponding to the slices 5, 6, 7 and 8 are vertical large cells.
In this step, the special processing mode includes a point diagram generation processing mode, a degree diagram generation processing mode, and the like. In fig. 4, slices No. 1, 2, 3, and 4 are normal processing modes; 5. slice nos. 6, 7, and 8 generate processing modes for the dot diagrams.
Step 4, defining a dynamic data storage database; the dynamic data storage database comprises a data structure and dynamic data values; wherein the data structure is: for each slice, defining a corresponding data structure, wherein the data structure is a data type which can be filled by the slice and comprises numbers, characters or pictures, thereby forming a two-dimensional array [ slice serial number, data type ]; then, reading N groups of dynamic data to be filled under each data type, thereby forming a multidimensional array [ a slice serial number, dynamic data 1 and dynamic data 2 … dynamic data N ], and further forming a dynamic data storage database;
taking an example that each slice corresponds to one dynamic data, for the examples of fig. 2 to 4, it is assumed that the dynamic data corresponding to slice No. 1 is 10; the dynamic data corresponding to the No. 2 slice is 12; the dynamic data corresponding to the No. 3 slice is 15; the dynamic data corresponding to the No. 4 slice is 6; the dynamic data corresponding to the No. 5 slice is 10; the dynamic data corresponding to the No. 6 slice is 12; the dynamic data corresponding to the No. 7 slice is 15; the dynamic data for slice 8 is 6.
Step 5, making i equal to 1; i is the serial number of the dynamic data corresponding to each slice serial number;
step 6, changing j to 1; wherein j is the serial number of the slice;
step 7, reading the ith dynamic data corresponding to the jth slice by adopting a script, and obtaining a slice processing mode corresponding to the jth slice by reading a slice attribute table; if the processing mode is the normal processing mode, directly filling the read dynamic data into the center position of the corresponding slice in the HTML document formed in the step 3; if the processing mode is the special processing mode, firstly reading a slice attribute table to obtain the attribute of the corresponding slice; performing special processing on the read dynamic data by adopting the special processing mode, and filling the dynamic data after the special processing into a corresponding slice in the HTML document formed in the step 3;
in this step, the read dynamic data is specially processed by using the special processing mode, and the dynamic data after special processing is filled into the corresponding slice in the HTML document formed in step 3, specifically:
and if the processing mode is generated for the point diagram, after the dynamic data corresponding to the slice is read, converting the dynamic data into percentages according to a percentage conversion formula, positioning the dynamic data to the corresponding percentage position on the longitudinal axis of the corresponding slice, drawing a special symbol at the corresponding percentage position according to set parameters, and further completing the operation of filling the dynamic data after special processing into the corresponding slice in the HTML document formed in the step 3.
In addition, in step 3, each slice is exported to form an HTML document, wherein the HTML document comprises M slices, and each slice has an HTML tag img; in step 7, filling the read dynamic data into the center position of the corresponding slice in the HTML document formed in step 3, specifically: firstly, hiding a label img corresponding to the slice, and then filling dynamic data into the center position of the slice after the label img. In step 7, a fixed cs Cascading Style sheet (English full name: capturing Style Sheets) may be added to the head of the HTML document to prevent the layout disorder caused by the content over-range. And css of part of custom interface designs or other picture designs (such as backgrounds) can be written in html in the past.
In addition, when writing dynamic data, the common method in the prior art is to write codes and output PDF files line by line in background programs using various background development languages such as Java, Python, C # and the like, and the present invention is completed in an HTML page by using a Javascript front end language.
Step 8, changing j to j +1, and judging whether j is larger than M; if yes, executing step 9; if not, returning to execute the step 7;
step 9, obtaining the HTML document filled with dynamic data; generating a PDF file from the HTML document by using the HTML-to-PDF plug-in, thereby obtaining a PDF file; referring to fig. 5, this is an example of a PDF file finally generated. The HTML-to-PDF plug-in may adopt an existing plug-in, for example, a Ruby language HTML-to-PDF plug-in (wkhtmltopdf 0.12.4).
Step 10, changing i to i +1, and judging whether i is larger than N; if yes, executing step 11; if not, returning to execute the step 7;
and 11, obtaining N PDF files.
Therefore, for a certain type of questionnaire picture, if 100 questionnaire pictures of different people are assumed, if a traditional mode is adopted, each questionnaire picture needs to be converted into PDF, and the efficiency is very low; by adopting the method of the invention, only one picture of the questionnaire is needed to be preprocessed, then the picture is cut, and the HTML format document is generated and stored; then establishing a dynamic data storage database, wherein 100 items are stored in the dynamic data storage database, and each item corresponds to a specific investigation result of one person; therefore, the invention can dynamically read the needed data from the database and write the data into the HTML document, and after the investigation results of one person are written, the HTML document is converted into the PDF file, thereby completing the acquisition of the PDF format document of one person; the above steps are repeated continuously, and the acquisition of PDF format documents of 100 persons can be automatically completed with high efficiency. Therefore, the generation efficiency of the dynamic PDF file is very high.
As a specific implementation mode, the invention is realized by adopting a computer language, for example, the development language adopts Ruby + Javascript, and Ruby version is 2.3.
The invention provides a method for quickly and accurately converting a dynamic picture into a PDF format. The inventors have conducted programming tests using the same pictures (example fig. 2).
If a complicated conversion from a dynamic picture to a PDF is completed in a traditional way, for a software engineer, firstly, the pattern of a given picture needs to be learned, the output pattern of a PDF file needs to be learned (8 hours), a part of important background pictures is intercepted (2 hours), then, the dynamic data obtained by using a corresponding computer language is programmed according to the requirement of a PDF format (20 hours), and then, an adjustment test is performed (10 hours), and the total process time takes 40 hours, which is about 5 working days.
The method (assuming a software engineer) of the invention is divided into layout background graph preprocessing (1 hour), background graph cutting to HTML (2 hours), obtaining dynamic data by using a corresponding Ruby language, generating the dynamic data in an HTML page by using Javascript, generating a PDF file (4 hours) by using an HTML to PDF plug-in (wkhtmltopff), and performing adjustment test (3 hours), wherein the total process time is about 10 hours. Saving about 30 hours of working time. The time saved is mainly expressed in the time of picture pattern learning, the time of outputting PDF patterns according to PDF format, the time of pattern debugging and the like.
The method is particularly suitable for software developers who do not know how the PDF file is output but are familiar with HTML/Javascript language, and can complete the work of generating the PDF file by the complex picture in a short time.
The invention provides a method for quickly generating a dynamic PDF file from a picture, which has the following advantages:
the method is simple and easy to implement, fast and convenient, has almost zero pattern error, can meet the requirement of converting the contents of very complicated dynamic pictures into PDF, and has the advantage of very high efficiency of converting the contents of the pictures into the PDF.
The foregoing is only a preferred embodiment of the present invention, and it should be noted that, for those skilled in the art, various modifications and improvements can be made without departing from the principle of the present invention, and such modifications and improvements should also be considered within the scope of the present invention.

Claims (4)

1. A method for rapidly generating a dynamic PDF file from a picture is characterized by comprising the following steps:
step 1, reading an original picture to be processed, preprocessing the original picture, emptying data corresponding to a position of the original picture to be filled with dynamic data, and reserving fixed template data in the original picture to obtain a blank picture template;
step 2, taking the blank picture template as a layout background, and newly building a layer which is completely the same as the blank picture template on the blank picture template, wherein the layer is called a processing layer;
step 3, determining position areas needing to be filled with dynamic data on the processing image layer, wherein each position area can be filled with one dynamic data only once; then, slicing the position area to obtain M slices, wherein M is a natural number; each slice has a slice number; meanwhile, the slice corresponding to each slice number has the following attributes: the slice length, the slice width, the horizontal and vertical coordinates of the slice center point in the processing layer and the slice processing mode; the slice processing mode comprises a normal processing mode and a special processing mode; if the mode is the special processing mode, the special processing mode is bound here; then, exporting each slice to form an HTML document; exporting the attribute of the slice to a slice attribute table;
step 4, defining a dynamic data storage database; the dynamic data storage database comprises a data structure and dynamic data values; wherein the data structure is: for each slice, defining a corresponding data structure, wherein the data structure is a data type which can be filled by the slice and comprises numbers, characters or pictures, thereby forming a two-dimensional array [ slice serial number, data type ]; then, reading N groups of dynamic data to be filled under each data type, thereby forming a multidimensional array [ a slice serial number, dynamic data 1 and dynamic data 2 … dynamic data N ], and further forming a dynamic data storage database;
step 5, making i equal to 1; i is the serial number of the dynamic data corresponding to each slice serial number;
step 6, changing j to 1; wherein j is the serial number of the slice;
step 7, reading the ith dynamic data corresponding to the jth slice by adopting a script, and obtaining a slice processing mode corresponding to the jth slice by reading a slice attribute table; if the processing mode is the normal processing mode, directly filling the read dynamic data into the center position of the corresponding slice in the HTML document formed in the step 3; if the processing mode is the special processing mode, firstly reading a slice attribute table to obtain the attribute of the corresponding slice; performing special processing on the read dynamic data by adopting the special processing mode, and filling the dynamic data after the special processing into a corresponding slice in the HTML document formed in the step 3;
step 8, changing j to j +1, and judging whether j is larger than M; if yes, executing step 9; if not, returning to execute the step 7;
step 9, obtaining the HTML document filled with dynamic data; generating a PDF file from the HTML document by using the HTML-to-PDF plug-in, thereby obtaining a PDF file;
step 10, changing i to i +1, and judging whether i is larger than N; if yes, executing step 11; if not, returning to execute the step 7;
and 11, obtaining N PDF files.
2. The method as claimed in claim 1, wherein in step 3, the special processing modes include a point diagram generation processing mode and a degree diagram generation processing mode.
3. The method according to claim 2, wherein in step 7, the special processing mode is adopted to perform special processing on the read dynamic data, and the dynamic data after the special processing is filled into the corresponding slice in the HTML document formed in step 3, specifically:
and if the processing mode is generated for the point diagram, after the dynamic data corresponding to the slice is read, converting the dynamic data into percentages according to a percentage conversion formula, positioning the dynamic data to the corresponding percentage position on the longitudinal axis of the corresponding slice, drawing a special symbol at the corresponding percentage position according to set parameters, and further completing the operation of filling the dynamic data after special processing into the corresponding slice in the HTML document formed in the step 3.
4. The method for rapidly generating a dynamic PDF file from a picture as claimed in claim 1, wherein in step 3, each slice is exported to form an HTML document, wherein the HTML document comprises M slices, and each slice has an HTML tag img; in step 7, filling the read dynamic data into the center position of the corresponding slice in the HTML document formed in step 3, specifically: firstly, hiding a label img corresponding to the slice, and then filling dynamic data into the center position of the slice after the label img.
CN201810826386.XA 2018-07-25 2018-07-25 Method for quickly generating dynamic PDF (Portable document Format) file from picture Active CN109145266B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810826386.XA CN109145266B (en) 2018-07-25 2018-07-25 Method for quickly generating dynamic PDF (Portable document Format) file from picture

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810826386.XA CN109145266B (en) 2018-07-25 2018-07-25 Method for quickly generating dynamic PDF (Portable document Format) file from picture

Publications (2)

Publication Number Publication Date
CN109145266A CN109145266A (en) 2019-01-04
CN109145266B true CN109145266B (en) 2022-03-25

Family

ID=64798952

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810826386.XA Active CN109145266B (en) 2018-07-25 2018-07-25 Method for quickly generating dynamic PDF (Portable document Format) file from picture

Country Status (1)

Country Link
CN (1) CN109145266B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998008176A1 (en) * 1996-08-20 1998-02-26 Moore Business Forms, Inc. Proofing system utilizing dynamic pdf technology for the interface for templated printing
TW201504831A (en) * 2013-07-24 2015-02-01 Tang Digital Integration A webpage edition system and the method thereof and a computer program product for storing a webpage edition program
CN104766353A (en) * 2015-04-25 2015-07-08 陈包容 Method and device for adding text content into background
CN106157341A (en) * 2015-03-30 2016-11-23 阿里巴巴集团控股有限公司 Generate the method and device of synthesising picture

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998008176A1 (en) * 1996-08-20 1998-02-26 Moore Business Forms, Inc. Proofing system utilizing dynamic pdf technology for the interface for templated printing
TW201504831A (en) * 2013-07-24 2015-02-01 Tang Digital Integration A webpage edition system and the method thereof and a computer program product for storing a webpage edition program
CN106157341A (en) * 2015-03-30 2016-11-23 阿里巴巴集团控股有限公司 Generate the method and device of synthesising picture
CN104766353A (en) * 2015-04-25 2015-07-08 陈包容 Method and device for adding text content into background

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
可变数据数字印品设计与案例;徐敏;《数码印刷》;20110531(第05期);第60-62页 *

Also Published As

Publication number Publication date
CN109145266A (en) 2019-01-04

Similar Documents

Publication Publication Date Title
US8718364B2 (en) Apparatus and method for digitizing documents with extracted region data
US8675012B2 (en) Selective display of OCR'ed text and corresponding images from publications on a client device
RU2357284C2 (en) Method of processing digital hand-written notes for recognition, binding and reformatting digital hand-written notes and system to this end
US8732570B2 (en) Non-symbolic data system for the automated completion of forms
US8958644B2 (en) Creating tables with handwriting images, symbolic representations and media images from forms
US8175388B1 (en) Recognizing text at multiple orientations
US6816630B1 (en) System and method for creating and processing data forms
US20080235263A1 (en) Automating Creation of Digital Test Materials
Clausner et al. Efficient and effective OCR engine training
CN108319578B (en) Method for generating medium for data recording
CN110674437A (en) Method and device for converting webpage data into PDF (Portable document Format) attachment
CN109145266B (en) Method for quickly generating dynamic PDF (Portable document Format) file from picture
CN114579796B (en) Machine reading understanding method and device
CN116108804A (en) Point code file generation method, content review method and content review system
Tomaschek Evaluation of off-the-shelf OCR technologies
CN114973798A (en) Word learning card generation method and device
US10606928B2 (en) Assistive technology for the impaired
KR20220088070A (en) Document conversion system and method for management electronic testing report, recording medium having program thereof
CN112199929A (en) Form processing method and device, storage medium and electronic equipment
CN112668282A (en) Method and system for converting format of equipment procedure document
CN116563872B (en) Deep learning-based chart extraction method and system
JP7430219B2 (en) Document information structuring device, document information structuring method and program
Liu Producing High-Quality Figures Using SAS/GRAPH® and ODS Graphics Procedures
JP2022095326A (en) Information processing apparatus, method of controlling the same, and program therefor
CN115759011A (en) End-to-end table structured data extraction system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant