CN104462159A - Image conversion method and system based on document page content - Google Patents

Image conversion method and system based on document page content Download PDF

Info

Publication number
CN104462159A
CN104462159A CN201310450959.0A CN201310450959A CN104462159A CN 104462159 A CN104462159 A CN 104462159A CN 201310450959 A CN201310450959 A CN 201310450959A CN 104462159 A CN104462159 A CN 104462159A
Authority
CN
China
Prior art keywords
format
pattern
color
image
region
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310450959.0A
Other languages
Chinese (zh)
Inventor
童征宇
董宁
徐剑波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Peking University Founder Group Co Ltd
Beijing Founder Apabi Technology Co Ltd
Original Assignee
Peking University Founder Group Co Ltd
Beijing Founder Apabi Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Peking University Founder Group Co Ltd, Beijing Founder Apabi Technology Co Ltd filed Critical Peking University Founder Group Co Ltd
Priority to CN201310450959.0A priority Critical patent/CN104462159A/en
Publication of CN104462159A publication Critical patent/CN104462159A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/002D [Two Dimensional] image generation
    • G06T11/60Editing figures and text; Combining figures or text
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00Image coding

Abstract

The invention provides an image conversion method and system based on document page content. The page document content is divided, and a single color region and a complex color region are obtained; the page content in the single color region is converted into a first format of image suitable for compressing single color content and storing the compressed single color content; the page content in the complex color region is converted into a second format of image suitable for compressing complex color content and storing the compressed complex color content; the document page content is divided on the aspect of the type, different types of the page content is converted into corresponding formats of images, and the display effect of characters and image information in the document page content is guaranteed.

Description

Based on image conversion method and the system of document file page contents
Technical field
The present invention relates to field of information processing, specifically a kind of image conversion method based on document file page contents and system.
Background technology
Format document, be that the space of a whole page presents the fixing electronic file form of effect, the such as document of PDF or CAJ form, presenting of format document has nothing to do with equipment, and when various equipment being read, print or prints, the result that presents of its space of a whole page is all consistent; Therefore, it is possible to show abundant page effect well, but needing some corresponding modules when reading format document resolves it, shows, namely needs a reader, greatly limit the use of format document.Especially in network reading, can not require that user installs reader or ActiveX plug-in unit, therefore, the content of format document directly can only be converted to image, carry out unifying to show in the mode of image.
The space of a whole page element that format document support is enriched very much, comprises a large amount of picture and word, thus causes to provide good bandwagon effect to these space of a whole page elements with single picture format.Such as, when using JPG picture format to change format document, because JPG compression scheme can not process the less region of color or process solid color regions well, the word segment display effect in format document can be caused not good; Equally, when using PNG to change format document, generally first color sampling can be carried out, and then carry out Lossless Compression, in the process of color sampling, for reducing data volume, a considerable amount of color value can be lost, cause the image display distortion enriching color comprising after final decompress(ion).
Summary of the invention
For this reason, to be solved by this invention is existing technical matters format document being converted to the display effect distortion that single image file brings, and provides a kind of image conversion method based on document file page contents and system.
For solving the problems of the technologies described above, the technical solution used in the present invention is as follows:
Based on an image conversion method for document file page contents, comprise the steps:
Region dividing: divide described page documents content, obtains color single area and color complex region;
Content of pages in color single area is converted to the first format-pattern being suitable for compressing the single content of color;
Content of pages in color complex region is converted to the second format-pattern being suitable for compressing color complex contents.
Described a kind of image conversion method based on document file page contents, described Region dividing step specifically comprises:
Be the Region dividing of literal type by page elements be color single area;
Be the Region dividing of image type by page elements be color complex region.
Described a kind of image conversion method based on document file page contents, described Region dividing step specifically comprises:
The Region dividing be less than by the color value of page elements or equal 256 kinds is color single area;
Be color complex region by the color value of page elements more than the Region dividing of 256 kinds.
Described a kind of image conversion method based on document file page contents, the region at described page elements place is determined according to the position of described page elements, size and scalability information.
Described a kind of image conversion method based on document file page contents, described first format-pattern is PNG image.
Described a kind of image conversion method based on document file page contents, described second format-pattern is JPG image.
Described a kind of image conversion method based on document file page contents, described storage is carried out according to following steps:
The first format-pattern after conversion and the position of the second format-pattern and size fixed length mode are stored;
The data of the first format-pattern after conversion and the second format-pattern are stored according to random length mode;
Index information is set up in the position starting to store of each image;
Multiple described first format-pattern corresponding for same document file page and described second format-pattern are stored in a data file.
Described a kind of image conversion method based on document file page contents, also comprises the step of following browser display:
The position of the first format-pattern and the second format-pattern described in each is read from described data file;
Position according to the first format-pattern and the second format-pattern described in each builds dynamic web page information; The data reading the first format-pattern and the second format-pattern described in each are presented on the correspondence position of described dynamic web page of structure.
A kind of image converter system based on document file page contents is provided simultaneously, comprises:
Region dividing module, divides described page documents content, obtains color single area and color complex region;
Image conversion module, converts the first format-pattern of being suitable for compressing color single content to and stores by the content of pages in color single area; Content of pages in color complex region converted to the second format-pattern of being suitable for compressing color complex contents and store.
Described a kind of image converter system based on document file page contents, described Region dividing module specifically comprises:
Color single area divides submodule: the Region dividing for the described page elements place by literal type is color single area;
Color complex region divides submodule: the Region dividing for the described page elements place by image type is color complex region.
Described a kind of image converter system based on document file page contents, described Region dividing module specifically comprises:
Color single area obtains submodule, is color single area for the Region dividing be less than by the color value of page elements or equal 256 kinds;
Color complex region obtains submodule, for being color complex region by the color value of page elements more than the Region dividing of 256 kinds.
Described a kind of image converter system based on document file page contents, also comprises the submodule that the position according to described page elements, size and scalability information determine the region at described page elements place.
Described a kind of image converter system based on document file page contents, described first format-pattern in described image conversion module is PNG image.
Described a kind of image converter system based on document file page contents, described second format-pattern in described image conversion module is JPG image.
Described a kind of image converter system based on document file page contents, described image conversion module specifically comprises sub module stored, and described sub module stored specifically comprises:
First sub module stored, for storing the position of the first format-pattern after conversion and the second format-pattern and size fixed length mode;
Second sub module stored, for storing the data of the first format-pattern after conversion and the second format-pattern according to random length mode;
Index module, sets up index information for the position starting to store at each image;
3rd sub module stored, multiple described first format-pattern corresponding for just same document file page and described second format-pattern are stored in a data file.
Described a kind of image converter system based on document file page contents, also comprises browser display module, and described browser display module specifically comprises:
Position reading submodule, for reading the position of the first format-pattern and the second format-pattern described in each from described data file;
Dynamic web page builds submodule, builds dynamic web page information for the position according to the first format-pattern and the second format-pattern described in each;
Display sub-module, for reading the data of the first format-pattern and the second format-pattern described in each and being presented at the correspondence position of the described dynamic web page of structure.
Technique scheme of the present invention has the following advantages compared to existing technology:
(1) image conversion method based on document file page contents of the present invention and system, described page documents content is divided, obtain color single area and color complex region, the content of pages in color single area converted to the first format-pattern of being suitable for compressing color single content and store; Content of pages in color complex region converted to the second format-pattern of being suitable for compressing color complex contents and store; Like this by document file page contents, from color, whether complexity divides, different content of pages converts the image of corresponding format to, ensures the display effect of the writings and image information in document file page contents.
(2) in the image conversion method based on document file page contents of the present invention and system, metadata information according to described document file page carries out Region dividing, specifically that word or image carry out Region dividing according to type, further can simplify the step of Region dividing, reduce the workload of Region dividing, convenient enforcement.
(3) in the image conversion method based on document file page contents of the present invention and system, described first format-pattern is PNG image, can ensure the display effect of word like this, simultaneously, larger compressibility can be reached, reduce the storage space of view data and the transmission time on network; Display effect, the size of data file and the performance of conversion are taken into account.
(3) in the image conversion method based on document file page contents of the present invention and system, described second format-pattern is JPG image, on the one hand, select JPG form can ensure the display effect of image, simultaneously, larger compressibility can be reached, greatly reduce the storage space of view data and the transmission time on network; Display effect, the size of data file and the performance of conversion are taken into account.
(4), in the image conversion method based on document file page contents of the present invention and system, the position of the first format-pattern after conversion and the second format-pattern and size fixed length mode are stored; The data of the first format-pattern after conversion and the second format-pattern are stored according to random length mode; Index information is set up in the position starting to store of each image; Multiple described first format-pattern corresponding for same document file page and described second format-pattern are stored in a data file.Consider the singularity of different information, select fixed length mode to store for the Rule Information determined, adopt random length mode to store for variable information, convenient and practical; And, by the foundation of index information, provide efficient diagram data reading manner; By being stored in a data file by different images, being convenient to buffer memory, also avoiding an image to be stored as a large amount of scattered small documents of a file generation, affecting the performance that subsequent file reads, and the loss of file may be caused.
(5), in the image conversion method based on document file page contents of the present invention and system, the position according to the first format-pattern and the second format-pattern described in each builds dynamic web page information; The data reading the first format-pattern and the second format-pattern described in each are presented on the correspondence position of described dynamic web page of structure.
Accompanying drawing explanation
In order to make content of the present invention be more likely to be clearly understood, below according to a particular embodiment of the invention and by reference to the accompanying drawings, the present invention is further detailed explanation, wherein
Fig. 1 is the method flow diagram of a kind of image conversion method based on document file page contents of one embodiment of the invention;
Fig. 2 is the method flow diagram of a kind of image conversion method based on document file page contents of another embodiment of the present invention;
Fig. 3 is the page schematic diagram of the present invention's format document;
Fig. 4 is the area schematic after Fig. 3 uses the conversion of the image conversion method based on document file page contents of one embodiment of the invention;
Fig. 5 is the structured flowchart of a kind of image converter system based on document file page contents of one embodiment of the invention;
Fig. 6 is the structured flowchart of a kind of image converter system based on document file page contents of another embodiment of the present invention.
Embodiment
Embodiment 1
Shown in Figure 1, be a kind of image conversion method based on document file page contents of one embodiment of the invention, comprise the steps:
Region dividing: divide page documents content, obtains color single area and color complex region.
Content of pages in color single area converted to the first format-pattern of being suitable for compressing color single content and store.Content of pages in color complex region converted to the second format-pattern of being suitable for compressing color complex contents and store.
In the present invention, be suitable for compression and refer to be suitable for compression display, after decompressing, the display of content is substantially undistorted.
The image conversion method based on document file page contents of the present embodiment, page documents content is divided, obtain color single area and color complex region, the content of pages in color single area converted to the first format-pattern of being suitable for compressing color single content and store; Content of pages in color complex region converted to the second format-pattern of being suitable for compressing color complex contents and store; Like this by document file page contents, from color, whether complexity divides, different content of pages converts the image of corresponding format to, ensures the display effect of the writings and image information in document file page contents.
Embodiment 2
Shown in Figure 2, as a kind of image conversion method based on document file page contents of other embodiments of the invention, on the basis of embodiment 1, Region dividing step specifically comprises:
Obtain the metadata information of document file page; Wherein, metadata information comprises the type of page elements, position, size and scalability information, and the type of page elements comprises word (word wherein in form is also classified as literal type) and image.In the present embodiment, document is format document, certainly, as long as the document of extended formatting comprises necessary metadata information, image conversion method of the present invention also can be used to process, can realize object of the present invention equally.
The type of the page elements of document file page is extracted according to metadata information.
The region of each page elements is determined, i.e. the region at page elements place according to the position of page elements, size and scalability information.
Be the Region dividing of literal type by page elements be color single area; Be the Region dividing of image type by page elements be color complex region.
In the present embodiment, the metadata information according to document file page carries out Region dividing, is specifically that word or image carry out Region dividing according to type, further can simplifies the step of Region dividing, reduces the workload of Region dividing, conveniently implements.
Embodiment 3
As a kind of image conversion method based on document file page contents of other embodiments of the invention, on the basis of embodiment 1, Region dividing step specifically comprises:
Obtain the metadata information of document file page; Wherein, metadata information comprises the color value of page elements, position, size and scalability information.
Determine the region of each page elements according to the position of page elements, size and scalability information, namely determine the region of each page elements.
The Region dividing be less than by the color value of page elements or equal 256 kinds is for being color single area; Be color complex region by the color value of page elements more than the Region dividing of 256 kinds.
Preferably, region merging technique single or complicated for the color of multiple direct neighbor can be generated a region that large color is single or complicated, the number of times of squeeze operation can be reduced like this, improve the conversion efficiency of whole document file page.
In the present embodiment, Region dividing is obtained by the color value of the page elements in statistical regions, if the color value of the content in region is no more than 256 kinds, then judges that this page elements region is as color single area; Otherwise, if the color value of the content in region is more than 256 kinds, be then judged to be color complex region.The workload of being carried out Region dividing statistical color value by the method for the color value of the content in statistics preset range is comparatively large, operates relatively loaded down with trivial details, but more accurate, and versatility is stronger.
Embodiment 4
As a kind of image conversion method based on document file page contents of other embodiments of the invention, on the basis of above-mentioned any embodiment, first format-pattern is PNG image, and wherein the full name of PNG is Portable Network Graphic Format, looks like for Portable Network Graphics.Preferably, the first format-pattern is 8Bit PNG image, like this, stores with 8Bit to the color value after sampling, stores color in 256 at most, can reach larger compressibility, reduce data volume.The first format-pattern as other embodiments of the invention also can be the image of Gif form or Tiff form, can realize the module that the present invention ensures the clear display of word equally, but also has certain compressibility advantage.
First format-pattern is PNG image, can ensure the display effect of word like this, meanwhile, can reach larger compressibility, reduces the storage space of view data and the transmission time on network; Display effect, the size of data file and the performance of conversion are taken into account.
Embodiment 5
As other a kind of image conversion method based on document file page contents of the present invention, on the basis of any one embodiment above-mentioned, the second format-pattern is JPG image.The full name of JPG is JPEG, full name is Joint Photographic Experts GROUP, be first international digital Standard of image compression that still image is set up by International Standards Organization (ISO:International Standardization Organization) and Consultative Committee on International Telephone and Telegraph (CCITT) (CCITT:Consultation Commitee of the International Telephoneand Telegraph), be also using so far always, most widely used Standard of image compression.Jpeg format can under the ratio of 10:1 to 20:1 compressed file like a cork, and picture quality can not decline.JPEG compression can process realistic photographic work well.
Second format-pattern is JPG image, on the one hand, selects JPG form can ensure the display effect of image, meanwhile, can reach larger compressibility, greatly reduce the storage space of view data and the transmission time on network; Display effect, the size of data file and the performance of conversion are taken into account.
Embodiment 6
As a kind of image conversion method based on document file page contents of other embodiments of the invention, on the basis of above-mentioned any embodiment, store and carry out according to following steps:
The first format-pattern after conversion and the position of the second format-pattern and size fixed length mode are stored;
The data of the first format-pattern after conversion and the second format-pattern are stored according to random length mode;
Index information is set up in the position starting to store of each image;
Multiple first format-pattern corresponding for same document file page and the second format-pattern are stored in a data file.
The first format-pattern after conversion and the position of the second format-pattern and size fixed length mode are stored; The data of the first format-pattern after conversion and the second format-pattern are stored according to random length mode; Index information is set up in the position starting to store of each image; Multiple first format-pattern corresponding for same document file page and the second format-pattern are stored in a data file.Consider the singularity of different information, select fixed length mode to store for the Rule Information determined, adopt random length mode to store for variable information, convenient and practical; And, by the foundation of index information, provide efficient diagram data reading manner; By being stored in a data file by different images, be convenient to buffer memory, a large amount of scattered small documents also avoiding an image to be stored as a file generation affects the performance that subsequent file reads, and may cause the loss of file.
Embodiment 7
As a kind of image conversion method based on document file page contents of other embodiments of the invention, on the basis of above-described embodiment 6, also comprise the step of following browser display:
The position of each first format-pattern and the second format-pattern is read from data file;
Position according to each first format-pattern and the second format-pattern builds dynamic web page information; The data reading each first format-pattern and the second format-pattern are presented on the correspondence position of the dynamic web page of structure.
Position according to each first format-pattern and the second format-pattern builds dynamic web page information; The data reading each first format-pattern and the second format-pattern are presented on the correspondence position of the dynamic web page of structure.
Embodiment 8
Shown in Figure 5, be a kind of image converter system based on document file page contents of one embodiment of the invention, comprise:
Region dividing module, divides page documents content, obtains color single area and color complex region;
Image conversion module, converts the first format-pattern of being suitable for compressing color single content to and stores by the content of pages in color single area; Content of pages in color complex region converted to the second format-pattern of being suitable for compressing color complex contents and store.
The image converter system based on document file page contents of the present embodiment, page documents content is divided, obtain color single area and color complex region, the content of pages in color single area converted to the first format-pattern of being suitable for compressing color single content and store; Content of pages in color complex region converted to the second format-pattern of being suitable for compressing color complex contents and store; Like this by document file page contents, from color, whether complexity divides, different content of pages converts the image of corresponding format to, ensures the display effect of the writings and image information in document file page contents.
Embodiment 9
Shown in Figure 6, be a kind of image converter system based on document file page contents of one embodiment of the invention, on the basis of embodiment 8, Region dividing module specifically comprises:
Metadata information obtains submodule, for obtaining the metadata information of document file page; Wherein, metadata information comprises the type of page elements, position, size and scalability information, and the type of page elements comprises writings and image.
Submodule is determined in region, determines the region of each page elements for the position according to page elements, size and scalability information.
Region dividing submodule, is divided into multiple region for the dissimilar page elements comprised by document file page according to region; Region dividing submodule specifically comprises: color single area divides submodule and color complex region and divides submodule: wherein, and color single area divides submodule: be color single area for being the Region dividing of literal type by page elements; Color complex region divides submodule, is color complex region for being the Region dividing of image type by page elements.
In the present embodiment, the metadata information according to document file page carries out Region dividing, is specifically that word or image carry out Region dividing according to type, further can simplifies the step of Region dividing, reduces the workload of Region dividing, conveniently implements.
Embodiment 10
As a kind of image converter system based on document file page contents of one embodiment of the invention, on the basis of embodiment 8, Region dividing module specifically comprises:
Metadata information obtains submodule, for obtaining the metadata information of document file page; Wherein, metadata information comprises the color value of page elements, position, size and scalability information;
Region deviding submodule, determines the region of each page elements for the position according to page elements, size and scalability information;
Region obtains submodule, specifically comprise color single area and obtain submodule and color complex region acquisition submodule, wherein, color single area obtains submodule, is color single area for the Region dividing be less than by the color value of page elements or equal 256 kinds; Color complex region obtains submodule, for being color complex region by the color value of page elements more than the Region dividing of 256 kinds.
In the present embodiment, Region dividing is obtained by the color value of the page elements in statistical regions, if the color value of the content in region is no more than 256 kinds, then judges that the region at page elements place is as color single area; Otherwise, if the color value of the content in region is more than 256 kinds, be then judged to be color complex region.The workload of being carried out Region dividing statistical color value by the method for the color value of the content in statistics preset range is comparatively large, operates relatively loaded down with trivial details, but more accurate, and versatility is stronger.
Embodiment 11
As a kind of image converter system based on document file page contents of other embodiments of the invention, on the basis of above-mentioned any embodiment, the first format-pattern in image conversion module is PNG image.
First format-pattern is PNG image, can ensure the display effect of word like this, meanwhile, can reach larger compressibility, reduces the storage space of view data and the transmission time on network; Display effect, the size of data file and the performance of conversion are taken into account.
Embodiment 12
As a kind of image converter system based on document file page contents of other embodiments of the invention, on the basis of a kind of image converter system based on document file page contents of any one embodiment above-mentioned, the second format-pattern in image conversion module is JPG image.
Second format-pattern is JPG image, on the one hand, selects JPG form can ensure the display effect of image, meanwhile, can reach larger compressibility, greatly reduce the storage space of view data and the transmission time on network; Display effect, the size of data file and the performance of conversion are taken into account.
Embodiment 13
As a kind of image converter system based on document file page contents of other embodiments of the invention, on the basis of a kind of image converter system based on document file page contents of any one embodiment above-mentioned, image conversion module specifically comprises sub module stored, and sub module stored specifically comprises:
First sub module stored, for storing the position of the first format-pattern after conversion and the second format-pattern and size fixed length mode.
Second sub module stored, for storing the data of the first format-pattern after conversion and the second format-pattern according to random length mode.
Index module, sets up index information for the position starting to store at each image.
3rd sub module stored, multiple first format-pattern corresponding for just same document file page and the second format-pattern are stored in a data file.
In the present embodiment, the first format-pattern after conversion and the position of the second format-pattern and size fixed length mode are stored; The data of the first format-pattern after conversion and the second format-pattern are stored according to random length mode; Index information is set up in the position starting to store of each image; Multiple first format-pattern corresponding for same document file page and the second format-pattern are stored in a data file.Consider the singularity of different information, select fixed length mode to store for the Rule Information determined, adopt random length mode to store for variable information, convenient and practical; And, by the foundation of index information, provide efficient diagram data reading manner; By being stored in a data file by different images, be convenient to buffer memory, a large amount of scattered small documents also avoiding an image to be stored as a file generation affects the performance that subsequent file reads, and may cause the loss of file.
Embodiment 14
As a kind of image converter system based on document file page contents of other embodiments of the invention, on the basis of above-described embodiment 13, also comprise browser display submodule, browser display submodule specifically comprises:
Position reading submodule, for reading the position of each first format-pattern and the second format-pattern from data file;
Dynamic web page builds submodule, builds dynamic web page information for the position according to each first format-pattern and the second format-pattern;
Display sub-module, for reading the data of each first format-pattern and the second format-pattern and being presented at the correspondence position of the dynamic web page of structure.
Below in conjunction with the layout page example shown in Fig. 3 and Fig. 4, the image conversion method based on document file page contents of the present invention and system are specifically described:
For the document file page shown in Fig. 3, this page is divided into four parts according to the difference of metadata: word segment 1, image section 2, word segment 3 and word segment 4, every part is finally converted to an image, wherein image section 2 be converted into be suitable for color single PNG image, other three parts are converted into the JPG image being suitable for compressing color complexity, and the details of each image are as shown in the table:
Sequence number Coordinate x Coordinate y Wide High File size
1 0 0 200 300 19,777
2 200 0 155 240 7,487
3 200 240 155 60 5,923
4 355 0 255 300 20,746
A large files as shown in the table is preserved into by splitting the tetrameric data obtained in upper table
During reading, according to view data quantity, the metamessage of all images can be read, specifically according to data pointer, length and the starting position of the data of each several part image can be obtained, and then reads image data.
Those skilled in the art should understand, embodiments of the invention can be provided as method, system or computer program.Therefore, the present invention can adopt the form of complete hardware embodiment, completely software implementation or the embodiment in conjunction with software and hardware aspect.And the present invention can adopt in one or more form wherein including the upper computer program implemented of computer-usable storage medium (including but not limited to magnetic disk memory, CD-ROM, optical memory etc.) of computer usable program code.
The present invention describes with reference to according to the process flow diagram of the method for the embodiment of the present invention, equipment (system) and computer program and/or block scheme.Should understand can by the combination of the flow process in each flow process in computer program instructions realization flow figure and/or block scheme and/or square frame and process flow diagram and/or block scheme and/or square frame.These computer program instructions can being provided to the processor of multi-purpose computer, special purpose computer, Embedded Processor or other programmable data processing device to produce a machine, making the instruction performed by the processor of computing machine or other programmable data processing device produce device for realizing the function of specifying in process flow diagram flow process or multiple flow process and/or block scheme square frame or multiple square frame.
These computer program instructions also can be stored in can in the computer-readable memory that works in a specific way of vectoring computer or other programmable data processing device, the instruction making to be stored in this computer-readable memory produces the manufacture comprising command device, and this command device realizes the function of specifying in process flow diagram flow process or multiple flow process and/or block scheme square frame or multiple square frame.
These computer program instructions also can be loaded in computing machine or other programmable data processing device, make on computing machine or other programmable devices, to perform sequence of operations step to produce computer implemented process, thus the instruction performed on computing machine or other programmable devices is provided for the step realizing the function of specifying in process flow diagram flow process or multiple flow process and/or block scheme square frame or multiple square frame.
Although describe the preferred embodiments of the present invention, those skilled in the art once obtain the basic creative concept of cicada, then can make other change and amendment to these embodiments.So claims are intended to be interpreted as comprising preferred embodiment and falling into all changes and the amendment of the scope of the invention.

Claims (12)

1. based on an image conversion method for document file page contents, it is characterized in that, comprise the steps:
Region dividing: divide described page documents content, obtains color single area and color complex region;
Content of pages in color single area is converted to the first format-pattern being suitable for compressing the single content of color;
Content of pages in color complex region is converted to the second format-pattern being suitable for compressing color complex contents.
2. a kind of image conversion method based on document file page contents according to claim 1, is characterized in that, described Region dividing step specifically comprises:
Be the Region dividing of literal type by page elements be color single area;
Be the Region dividing of image type by page elements be color complex region.
3. a kind of image conversion method based on document file page contents according to claim 1, is characterized in that, described Region dividing step specifically comprises:
The Region dividing be less than by the color value of page elements or equal 256 kinds is color single area;
Be color complex region by the color value of page elements more than the Region dividing of 256 kinds.
4. a kind of image conversion method based on document file page contents according to Claims 2 or 3, is characterized in that, the region at described page elements place is determined according to the position of described page elements, size and scalability information.
5., according to the arbitrary described a kind of image conversion method based on document file page contents of claim 1-4, it is characterized in that, described first format-pattern is PNG image.
6., according to the arbitrary described a kind of image conversion method based on document file page contents of claim 1-5, it is characterized in that, described second format-pattern is JPG image.
7., according to the arbitrary described a kind of image conversion method based on document file page contents of claim 1-6, it is characterized in that, described storage is carried out according to following steps:
The first format-pattern after conversion and the position of the second format-pattern and size fixed length mode are stored;
The data of the first format-pattern after conversion and the second format-pattern are stored according to random length mode;
Index information is set up in the position starting to store of each image;
Multiple described first format-pattern corresponding for same document file page and described second format-pattern are stored in a data file.
8. a kind of image conversion method based on document file page contents according to claim 7, is characterized in that, also comprise the step of following browser display:
The position of the first format-pattern and the second format-pattern described in each is read from described data file;
Position according to the first format-pattern and the second format-pattern described in each builds dynamic web page information;
The data reading the first format-pattern and the second format-pattern described in each are presented on the correspondence position of described dynamic web page of structure.
9. based on an image converter system for document file page contents, it is characterized in that, comprising:
Region dividing module, divides described page documents content, obtains color single area and color complex region;
Image conversion module, converts the first format-pattern of being suitable for compressing color single content to and stores by the content of pages in color single area; Content of pages in color complex region converted to the second format-pattern of being suitable for compressing color complex contents and store.
10. a kind of image converter system based on document file page contents according to claim 9, is characterized in that, described Region dividing module specifically comprises:
Color single area divides submodule: the Region dividing for the described page elements place by literal type is color single area;
Color complex region divides submodule: the Region dividing for the described page elements place by image type is color complex region.
11. a kind of image converter system based on document file page contents according to claim 9, it is characterized in that, described Region dividing module specifically comprises:
Color single area obtains submodule, is color single area for the Region dividing be less than by the color value of page elements or equal 256 kinds;
Color complex region obtains submodule, for being color complex region by the color value of page elements more than the Region dividing of 256 kinds.
12. a kind of image converter system based on document file page contents according to claim 10 or 11, is characterized in that, also comprise the submodule that the position according to described page elements, size and scalability information determine the region at described page elements place.
CN201310450959.0A 2013-09-25 2013-09-25 Image conversion method and system based on document page content Pending CN104462159A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310450959.0A CN104462159A (en) 2013-09-25 2013-09-25 Image conversion method and system based on document page content

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310450959.0A CN104462159A (en) 2013-09-25 2013-09-25 Image conversion method and system based on document page content

Publications (1)

Publication Number Publication Date
CN104462159A true CN104462159A (en) 2015-03-25

Family

ID=52908211

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310450959.0A Pending CN104462159A (en) 2013-09-25 2013-09-25 Image conversion method and system based on document page content

Country Status (1)

Country Link
CN (1) CN104462159A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106327453A (en) * 2015-06-30 2017-01-11 北京金山安全软件有限公司 Method for splicing picture resources and picture resource splicing device
CN107544985A (en) * 2016-06-23 2018-01-05 珠海金山办公软件有限公司 A kind of presentation file special-effect matching process and device
CN112130782A (en) * 2020-09-22 2020-12-25 上海汉图科技有限公司 Printing method and device and printing client

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5416603A (en) * 1991-04-30 1995-05-16 Ricoh Company, Ltd. Image segmentation using discrete cosine transfer data, and image data transmission apparatus and method using this image segmentation
JP2002312719A (en) * 2001-04-11 2002-10-25 Nec Corp Character segmentation device, character segmentation method and program for character segmentation
CN1649384A (en) * 2004-01-19 2005-08-03 株式会社理光 Image processing apparatus, image processing program and storage medium
CN103136171A (en) * 2011-12-02 2013-06-05 明博教育科技有限公司 E-book making method based on shock wave flash (SWF)

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5416603A (en) * 1991-04-30 1995-05-16 Ricoh Company, Ltd. Image segmentation using discrete cosine transfer data, and image data transmission apparatus and method using this image segmentation
JP2002312719A (en) * 2001-04-11 2002-10-25 Nec Corp Character segmentation device, character segmentation method and program for character segmentation
CN1649384A (en) * 2004-01-19 2005-08-03 株式会社理光 Image processing apparatus, image processing program and storage medium
CN103136171A (en) * 2011-12-02 2013-06-05 明博教育科技有限公司 E-book making method based on shock wave flash (SWF)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106327453A (en) * 2015-06-30 2017-01-11 北京金山安全软件有限公司 Method for splicing picture resources and picture resource splicing device
CN106327453B (en) * 2015-06-30 2020-02-07 珠海豹趣科技有限公司 Method for splicing picture resources and picture resource splicing device
CN107544985A (en) * 2016-06-23 2018-01-05 珠海金山办公软件有限公司 A kind of presentation file special-effect matching process and device
CN112130782A (en) * 2020-09-22 2020-12-25 上海汉图科技有限公司 Printing method and device and printing client

Similar Documents

Publication Publication Date Title
US11429852B2 (en) Convolution acceleration and computing processing method and apparatus, electronic device, and storage medium
US20150046797A1 (en) Document format processing apparatus and document format processing method
CN102508851A (en) Image upload processing method and image upload processing system
CN102905045A (en) Method and server for providing picture data to computing terminal
RU2010102958A (en) IMAGE PROCESSING DEVICE, METHOD AND COMPUTER PROGRAM OF IMAGE PROCESSING
US20130174024A1 (en) Method and device for converting document format
CN101441713A (en) Optical character recognition method and apparatus of PDF document
CN104616263A (en) Image processing method and device
CN115129801A (en) Map data processing method, device, equipment and storage medium
CN102063478A (en) Three-dimensional file format conversion method and search engine suitable for Internet search
CN102437894B (en) Method, device and equipment for compressing information to be sent
CN104462159A (en) Image conversion method and system based on document page content
CN111190519A (en) File and control processing method, device, equipment and storage medium thereof
CN110399511A (en) Image cache method, equipment, storage medium and device based on Redis
CN102025987A (en) Image processing apparatus and control method thereof
CN105721810A (en) Image compression storage method and apparatus
CN112650529B (en) System and method for configurable generation of mobile terminal APP codes
CN110569230A (en) Method, system and equipment for interconversion between database design model and design document
CN112527752B (en) Data compression method, data compression device, computer readable storage medium and electronic equipment
CN109492211A (en) A kind of table extracting method based on OFD document
CN105704215B (en) Shared file system and corresponding document sending method and device
CN102768755B (en) Obtain the method and apparatus of the thumbnail of picture
CN103870543A (en) Method and device for reconstructing document file
CN109803157A (en) A kind of sequence frame picture transmission method, system and electronic equipment based on video
CN114139040A (en) Data storage and query method, device, equipment and readable storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20150325

RJ01 Rejection of invention patent application after publication