CN109308724A

CN109308724A - A kind of method for compressing image and device

Info

Publication number: CN109308724A
Application number: CN201710615308.0A
Authority: CN
Inventors: 武俊敏; 刘骏
Original assignee: Hefei Map Duck Mdt Infotech Ltd
Current assignee: Hefei Map Duck Mdt Infotech Ltd
Priority date: 2017-07-26
Filing date: 2017-07-26
Publication date: 2019-02-05

Abstract

The embodiment of the invention provides a kind of method for compressing image and devices, which comprises carries out fragment to image；To each fragment, the discrete cosine transform with coefficient is made to every a line of the fragment, each column respectively, obtains transformation results；Obtain the corresponding mask matrix of the fragment；Discrete cosine transform with coefficient is made to every a line of the mask matrix, each column respectively, obtains the transformation results of the mask matrix；The final transformation results of the fragment are obtained divided by the transformation results of the mask matrix with the transformation results of the fragment；The final transformation results are quantified, are encoded, obtain compressed as a result, it is possible to effectively promote the compression ratio of image.

Description

A kind of method for compressing image and device

Technical field

The present invention relates to compression of images field, in particular to a kind of method for compressing image.

Background technique

Inefficient when being predicted at present using image pyramid, compression speed is slower.

Summary of the invention

To solve the above-mentioned problems, the embodiment of the invention provides a kind of method for compressing image.

According to the first aspect of the invention, a kind of method for compressing image is provided, which comprises

Fragment is carried out to image；

To each fragment, its least surrounding boxes is calculated；

To each fragment, corresponding mask matrix is calculated according to the least surrounding boxes；

To each fragment, sample code is carried out to the mask matrix, obtains corresponding image pyramid；

To each fragment, since the pyramidal top of described image, by this layer under image pyramid One layer is predicted, prediction result is obtained；

The pyramidal boundary of the tomographic image is calculated, pixel corresponding with the boundary in the prediction result is obtained, by institute It states and the corresponding pixel in boundary is compared with the value of corresponding location of pixels in the mask matrix, obtain comparing knot accordingly Fruit, for being in the pixel of non-boundary, it is true that its comparison result, which is directly arranged,；

The prediction result is modified according to the mask matrix, obtains revised prediction result；

The pyramidal each layer of iterative processing described image, until the bottom；

It is corresponding that with other each layer comparison results described image pyramid is obtained according to the pyramidal top layer of described image Channel binary coding string；

The binary coding string is compressed, intermediate compression result is obtained；

The first compression result of described image is obtained according to the intermediate compression result of all fragments.

According to the second aspect of the invention, a kind of image compressing device is provided, comprising:

Sharding unit, for carrying out fragment to image；

The least surrounding boxes computing unit, for calculating its least surrounding boxes to each fragment；

Mask matrix calculation unit, for being calculated according to the least surrounding boxes corresponding to each fragment Mask matrix；

Sampling unit, for carrying out sample code to the mask matrix, obtaining corresponding image to each fragment Pyramid；

Predicting unit is used for each fragment, since the pyramidal top of described image, by the layer to figure As pyramidal next layer is predicted, prediction result is obtained；

Computing unit, for calculating the pyramidal boundary of the tomographic image, obtain in the prediction result with the boundary pair The corresponding pixel in described and boundary is compared with the value of corresponding location of pixels in the mask matrix, obtains by the pixel answered To corresponding comparing result, for being in the pixel of non-boundary, it is true that its comparison result, which is directly arranged,；

Amending unit is modified the prediction result according to the mask matrix, obtains revised prediction result；

Iteration unit is used for the pyramidal each layer of iterative processing described image, until the bottom；

Coding unit, for obtaining the figure according to the pyramidal top layer of described image and other each layer comparison results As the binary coding string in the corresponding channel of pyramid；

First compression unit obtains intermediate compression result for compressing to the binary coding string；

Second combining unit obtains the first compression knot of described image for the intermediate compression result according to all fragments Fruit.

The embodiment of the present invention provides the method that a kind of pair of image is compressed, and accelerates image pyramid by edge detection It predicts versus speed, the speed of compression of images can be effectively improved.

Detailed description of the invention

To describe the technical solutions in the embodiments of the present invention more clearly, make required in being described below to embodiment Attached drawing is briefly described, it should be apparent that, drawings in the following description are only some embodiments of the invention, for For those of ordinary skill in the art, without creative efforts, it can also be obtained according to these attached drawings other Attached drawing.

Fig. 1 is method flow diagram provided in an embodiment of the present invention；

Fig. 2 is method flow diagram provided in an embodiment of the present invention；

Fig. 3 is method flow diagram provided in an embodiment of the present invention；

Fig. 4 is schematic device provided in an embodiment of the present invention.

Specific embodiment

To make the object, technical solutions and advantages of the present invention clearer, below in conjunction with attached in the embodiment of the present invention Figure, technical scheme in the embodiment of the invention is clearly and completely described, it is clear that described embodiment is only this Invention a part of the embodiment, instead of all the embodiments.Based on the embodiments of the present invention, those of ordinary skill in the art exist Every other embodiment obtained under the premise of creative work is not made, shall fall within the protection scope of the present invention.

Embodiment one

The embodiment of the invention provides a kind of method for compressing image, as shown in Figure 1, which comprises

Step 102, fragment is carried out to image.

Fragment can be carried out to image using any way, such as fragment be carried out by deep learning method, or pass through SLIC algorithm carry out fragment, the embodiment of the present invention to specific sharding method without limitation.

Step 104, to each fragment, its least surrounding boxes is calculated.

Specifically, including:

Obtain the coordinate value (x, y) of each pixel in the fragment；

Obtain the maximum value x in all abscissas_maxWith minimum value x_min；

Obtain the maximum value y in all ordinates_maxWith minimum value y_min；

By (x_max, y_max)、(x_max, y_min)、(x_min, y_max)、(x_min, y_min) rectangle composed by 4 points is exactly the fragment The least surrounding boxes.

Step 106, to each fragment, corresponding mask matrix is calculated according to the least surrounding boxes.

Specifically, for each pixel in the least surrounding boxes, it is described if the value of the pixel is not sky Value corresponding with the pixel is 1 in mask matrix, and otherwise, value corresponding with the pixel is 0 in the mask matrix.

Step 108, to each fragment, sample code is carried out to the mask matrix, obtains corresponding image gold word Tower.

Specifically, the image pyramid of the mask matrix is established by sampling, until the pyramidal top layer of described image Until only 1 pixel.Image pyramid is one kind of Image Multiscale expression, is a kind of to carry out interpretation of images with multiresolution Resulting structure.The pyramid of piece image is that a series of resolution ratio with Pyramid arrangement gradually reduce, and derive from same The image collection of one original graph.It is obtained by echelon to down-sampling, just stops sampling until reaching some termination condition.It will Image in layer is likened into pyramid, and level is higher, then image is smaller, and resolution ratio is lower.Pyramidal bottom is wait locate The high-resolution for managing image indicates, and top is the approximation of low resolution.

It uses in the embodiment of the present invention and samples step by step, until the pyramidal top layer of described image only has 1 pixel.Show Example property, it is assumed that the resolution ratio of original image is 512 × 512, it is carried out obtain after first time sampling resolution ratio be 256 × 256 image obtains the image that resolution ratio is 128 × 128 after sampling again, it is 64 × 64 that resolution ratio is obtained after sampling again Image obtains the image that resolution ratio is 32 × 32 after sampling again, the image that resolution ratio is 16 × 16 is obtained after sampling again, then The image that resolution ratio is 8 × 8 is obtained after secondary sampling, and the image that resolution ratio is 4 × 4 is obtained after sampling again, after sampling again The image for being 2 × 2 to resolution ratio finally obtains the image that resolution ratio is 1 × 1, i.e. image pyramid top layer after sampling again The image of only 1 pixel.

Step 110, to each fragment, since the pyramidal top of described image, by the layer to image gold Next layer of word tower is predicted, prediction result is obtained.

Specifically, the pixel value using this layer is filled next layer of pixel of image pyramid, obtain described pre- Survey result.

Step 112, the pyramidal boundary of the tomographic image is calculated, picture corresponding with the boundary in the prediction result is obtained The corresponding pixel in described and boundary is compared with the value of corresponding location of pixels in the mask matrix, obtains corresponding by element Comparing result, for be in non-boundary pixel, it is true that its comparison result, which is directly arranged,.

Usually, in the prediction result obtained by above-mentioned fill method most pixel be all in mask matrix Corresponding pixel is consistent, it is possible to the image side in the most filled layer appeared in for filling in inconsistent place occurs At boundary because these boundary pixels value it is possible that mutation.Using this characteristic, the speed of comparison can be accelerated Degree.

Specifically, calculating the pyramidal boundary of the tomographic image, picture corresponding with the boundary in the prediction result is obtained These (this) pixels are compared with the value of corresponding location of pixels in the mask matrix, are compared accordingly by element As a result, it is true that its comparison result, which is directly arranged, for the pixel for being in non-boundary.

Step 114, the prediction result is modified according to the mask matrix, obtains revised prediction result.

Step 116, the pyramidal each layer of iterative processing described image, until the bottom.

Specifically, the next layer of progress according to the revised prediction result, using the above method to image pyramid It predicts and compares, obtain comparing result and revised prediction result, so recycle, until the pyramidal most bottom of described image Layer.

Step 118, described image gold is obtained according to the pyramidal top layer of described image and other each layer comparison results The binary coding string in the corresponding channel of word tower.

Step 120, the binary coding string is compressed, obtains intermediate compression result.

Specifically, can be compressed by the way of arithmetic coding to the binary coding string, the embodiment of the present invention Without limitation to specific compression method.

Step 122, the first compression result of described image is obtained according to the intermediate compression result of all fragments.

Specifically, merging the compression result of all fragments, the first compression result of described image is obtained.The embodiment of the present invention It is without restriction to specific merging method.

Embodiment two

The embodiment of the invention provides a kind of method for compressing image, as shown in Figure 2, which comprises

Step 202, fragment is carried out to image.

Step 204, to each fragment, its least surrounding boxes is calculated.

Specifically, including:

Obtain the coordinate value (x, y) of each pixel in the fragment；

Obtain the maximum value x in all abscissas_maxWith minimum value x_min；

Obtain the maximum value y in all ordinates_maxWith minimum value y_min；

Step 206, to each fragment, corresponding mask matrix is calculated according to the least surrounding boxes.

Step 208, to each fragment, sample code is carried out to the mask matrix.

Step 210, it to each fragment, is compressed according to the mask matrix and the sample code.

Specifically, processing method is as follows for each image pyramid:

Firstly, being carried out by this layer to next layer of image pyramid pre- since the pyramidal top of described image It surveys, obtains prediction result.

Specifically, the pixel value using this layer is filled next layer of pixel of image pyramid, obtain described pre- Survey result.Illustratively, it is assumed that the pixel value of the 1 × 1 of top is 1, then 2 × 2 pixel of next layer obtained by filling The prediction result of image are as follows:

Secondly, comparing according to the prediction result and the mask matrix, comparison result is obtained.

Specifically, the prediction result is compared with the value of corresponding location of pixels in the mask matrix, example The corresponding mask matrix value of above-mentioned 2 × 2 pixel image of hypothesis of property are as follows:

Then (1) is compared with (2), obtained comparing result is 1110 (it is true that this sentences 1 representative, represents vacation with 0).

Again, the prediction result is modified according to the mask matrix, obtains revised prediction result:

Finally, the next layer of progress according to the revised prediction result (3), using the above method to image pyramid It predicts and compares, obtain comparing result and revised prediction result, so recycle, until the pyramidal most bottom of described image Layer.

Finally, described image pyramid is obtained according to the pyramidal top layer of described image and other each layer comparison results The binary coding string in corresponding channel.

The binary coding string is compressed.

Illustratively, the binary coding string can be compressed by the way of arithmetic coding, the present invention is implemented Example to specific compression method without limitation.

Step 212, the first compression result of described image is obtained according to the compression result of all fragments.

Further, after step 202, further includes:

Each Color Channel of each fragment is compressed.

Specifically, the Color Channel includes: RGB channel or the channel YUV, if the color space of the image is RGB Color space, then the three of the image channel is respectively R, G, channel B, if the color space of the image is YUV color space, Then three channels of the image are respectively the channel Y, U, V.

The compression result for merging all colours channel of all fragments obtains the second compression result of described image.

First compression result and second compression result are merged into the compression result of described image.

The embodiment of the present invention provides the method that a kind of pair of image carries out fragment and compressed to every shape, Neng Gouyou Effect ground characterization nonlinear characteristic, and promote the compression ratio of image.

Embodiment three

Step 202, fragment is carried out to image.

Step 204, to each fragment, its least surrounding boxes is calculated.

Specifically, including:

Obtain the coordinate value (x, y) of each pixel in the fragment；

Obtain the maximum value x in all abscissas_maxWith minimum value x_min；

Obtain the maximum value y in all ordinates_maxWith minimum value y_min；

Step 208, to each fragment, sample code is carried out to the mask matrix.

Specifically, processing method is as follows for each image pyramid:

The binary coding string is compressed.Illustratively, can by the way of arithmetic coding to described two into Coded strings processed are compressed, the embodiment of the present invention to specific compression method without limitation.

Further, after step 202, further includes:

Each Color Channel of each fragment is compressed.

Each Color Channel to each fragment carries out compression

DCT (Discrete Cosine Transform, discrete cosine transform) is to every a line of the Color Channel to become It changes, obtains transformed result.

Specifically, described every a line to the Color Channel does dct transform, obtaining transformed result includes:

Obtain the number of pixels of every row in the fragment.

Illustratively, it is assumed that the fragment shares i row, then the number of pixels of every row is denoted as n_i。

Dct transform is done for every row pixel, obtains intermediate conversion result.

Illustratively, calculation formula is as follows:

To the intermediate conversion result multiplied byObtain transformation results.

90 ° of transposition are carried out to the fragment.

The number of pixels of every row in the fragment after obtaining transposition.

Illustratively, it is assumed that the fragment shares j row, then the number of pixels of every row is denoted as n_j。

Illustratively, calculation formula is as follows:

Obtain the corresponding mask matrix of the fragment, i.e., each picture in the least surrounding boxes corresponding for the fragment Element, if the value of the pixel is not sky, value corresponding with the pixel is 1, otherwise, the mask square in the mask matrix Value corresponding with the pixel is 0 in battle array.

To the mask matrix, dct transform is also carried out according to the method described above, obtains corresponding transformation results.

The channel of the fragment is obtained divided by the transformation results of the mask matrix with the transformation results of the fragment Final transformation results.

The final transformation results are quantified, are encoded, compressed result is obtained.

The embodiment of the present invention provides a kind of method for compressing image, and the shape based on each image slices carries out dct transform Optimization, compared to traditional dct transform compression method, can effectively promote the compression ratio of image.

Example IV

The embodiment of the invention provides a kind of method for compressing image, as shown in Figure 3, which comprises

Step 302, fragment is carried out to image.

Step 304, to each fragment, its least surrounding boxes is calculated.

Specifically, including:

Obtain the coordinate value (x, y) of each pixel in the fragment；

Obtain the maximum value x in all abscissas_maxWith minimum value x_min；

Obtain the maximum value y in all ordinates_maxWith minimum value y_min；

Step 306, to each fragment, corresponding mask matrix is calculated according to the least surrounding boxes.

Step 308, to each fragment, sample code is carried out to the mask matrix.

Step 310, it to each fragment, is compressed according to the mask matrix and the sample code.

Specifically, processing method is as follows for each image pyramid:

Step 312, the first compression result of described image is obtained according to the compression result of all fragments.

Step 314, the fragment in step 302 is merged according to preset rules, the new fragment after being merged.

Specifically, can be merged adjacent fragment according to the boundary of each fragment, the present invention merges to specific Method is without restriction.

Step 316, it each new fragment, is handled according to the method for step 304 to step 312, obtains institute State the intermediate compression result of image.

Step 318, the intermediate compression in the first compression result and step 316 in comparison step 312 is as a result, selection is more excellent Result be described image the first new compression result.

Step 320, step 314 is repeated to step 318, until restraining, using the compression result after restraining as described image First compression result.

Further, after step 302, further includes:

Each Color Channel of each fragment is compressed.

Each Color Channel to each fragment carries out compression

Obtain the number of pixels of every row in the fragment.

Illustratively, calculation formula is as follows:

90 ° of transposition are carried out to the fragment.

Illustratively, calculation formula is as follows:

The embodiment of the present invention provides a kind of method for compressing image, by merging to image slices, compare compression result come Optimal sliced fashion is selected, so as to preferably promote the compression ratio of image.

Embodiment five

Step 302, fragment is carried out to image.

Step 304, to each fragment, its least surrounding boxes is calculated.

Specifically, including:

Obtain the coordinate value (x, y) of each pixel in the fragment；

Obtain the maximum value x in all abscissas_maxWith minimum value x_min；

Obtain the maximum value y in all ordinates_maxWith minimum value y_min；

Step 308, to each fragment, sample code is carried out to the mask matrix.

Specifically, processing method is as follows for each image pyramid:

Again, the prediction result is modified according to the mask matrix, obtains revised prediction result；

Finally, being carried out using the above method to next layer of image pyramid pre- according to the revised prediction result It surveys and compares, obtain comparing result and revised prediction result, so recycle, until the pyramidal bottom of described image.

Further, after step 302, further includes:

Each Color Channel of each fragment is compressed.

Each Color Channel to each fragment carries out compression

Obtain the number of pixels of every row in the fragment.

Illustratively, calculation formula is as follows:

90 ° of transposition are carried out to the fragment.

Illustratively, calculation formula is as follows:

The embodiment of the present invention provides a kind of method for compressing image, is accelerated by way of obtaining edge to described image gold word Predetermined speed that each layer of tower can effectively promote the compression speed of image.

Embodiment six

The embodiment of the invention provides a kind of image compressing devices, as shown in figure 4, described device includes:

Sharding unit 402 carries out fragment to image.

Converter unit 404, to each fragment, to every a line of the fragment, each column make respectively with coefficient from Cosine transform is dissipated, transformation results are obtained.

Specifically, including:

Obtain the number of pixels of every row in the fragment；

Dct transform is done for every row pixel, obtains the first intermediate conversion result；

To the first intermediate conversion result multiplied by the first transformation coefficient, the first transformation results are obtained；

90 ° of transposition are carried out to the fragment；

The number of pixels of every row in the fragment after obtaining transposition；

Dct transform is done for every row pixel, obtains the second intermediate conversion result；

To the second intermediate conversion result multiplied by the second transformation coefficient, the transformation results of the fragment are obtained.

Mask matrix acquiring unit 406 obtains the corresponding mask matrix of the fragment.

Specifically, including:

Obtain the coordinate value (x, y) of each pixel in the fragment；

Obtain the maximum value x in all abscissas_maxWith minimum value x_min；

Obtain the maximum value y in all ordinates_maxWith minimum value y_min；

By (x_max, y_max)、(x_max, y_min)、(x_min, y_max)、(x_min, y_min) rectangle composed by 4 points is exactly the fragment The least surrounding boxes；

For each pixel in the least surrounding boxes, if the value of the pixel is not empty, the mask square Value corresponding with the pixel is 1 in battle array, and otherwise, value corresponding with the pixel is 0 in the mask matrix.

Mask matrixing unit 408 is made every a line of the mask matrix, each column discrete remaining with coefficient respectively String transformation, obtains the transformation results of the mask matrix.

Specifically, including:

Obtain the number of pixels of every row in the mask；

Dct transform is done for every row pixel of the mask matrix, obtains third intermediate conversion result；

To the third intermediate conversion result multiplied by third transformation coefficient, third transformation results are obtained；

90 ° of transposition are carried out to the mask matrix；

The number of pixels of every row in the mask matrix after obtaining transposition；

Dct transform is done for every row pixel, obtains the 4th intermediate conversion result；

To the 4th intermediate conversion result multiplied by the 4th transformation coefficient, the transformation results of the mask matrix are obtained.

Computing unit 410 obtains described point divided by the transformation results of the mask matrix with the transformation results of the fragment The final transformation results of piece.

Compression unit 412 quantifies the final transformation results, is encoded, obtains compressed result.

The embodiment of the present invention provides the dct transform compression set that a kind of pair of image carries out fragment and optimize to every, The compression ratio of image can effectively be promoted.

Claims

1. a kind of method for compressing image, which is characterized in that the described method includes:

Fragment is carried out to image；

To each fragment, its least surrounding boxes is calculated；

To each fragment, since the pyramidal top of described image, by this layer to next layer of image pyramid It is predicted, obtains prediction result；

Calculate the pyramidal boundary of the tomographic image, obtain pixel corresponding with the boundary in the prediction result, will it is described with The corresponding pixel in boundary is compared with the value of corresponding location of pixels in the mask matrix, obtains corresponding comparing result, For being in the pixel of non-boundary, it is true that its comparison result, which is directly arranged,；

It is corresponding logical that described image pyramid is obtained according to the pyramidal top layer of described image and other each layer comparison results The binary coding string in road；

2. the method according to claim 1, wherein described corresponding according to the least surrounding boxes calculating Mask matrix includes:

For each pixel in the least surrounding boxes, if the value of the pixel be not it is empty, in the mask matrix Value corresponding with the pixel is 1, and otherwise, value corresponding with the pixel is 0 in the mask matrix.

3. the method according to claim 1, wherein described include: to mask matrix progress sample code

The image pyramid of the mask is established by sampling, is until the pyramidal top layer of described image only has 1 pixel Only.

4. the method according to claim 1, wherein being carried out by this layer to next layer of image pyramid pre- It surveys, obtaining prediction result includes:

Next layer of pixel of image pyramid is filled using the pixel value of this layer, obtains the prediction result.

5. the method according to claim 1, wherein the method also includes:

Each Color Channel of each fragment is compressed；

The compression result for merging all colours channel of all fragments obtains the second compression result of described image；

6. a kind of image compressing device characterized by comprising

Sharding unit, for carrying out fragment to image；

Mask matrix calculation unit, for calculating corresponding mask square according to the least surrounding boxes to each fragment Battle array；

Sampling unit, for carrying out sample code to the mask matrix, obtaining corresponding image gold word to each fragment Tower；

Predicting unit is used for each fragment, since the pyramidal top of described image, by the layer to image gold Next layer of word tower is predicted, prediction result is obtained；

Computing unit obtains corresponding with the boundary in the prediction result for calculating the pyramidal boundary of the tomographic image The corresponding pixel in described and boundary is compared with the value of corresponding location of pixels in the mask matrix, obtains phase by pixel The comparing result answered, for being in the pixel of non-boundary, it is true that its comparison result, which is directly arranged,；

Coding unit, for obtaining described image gold according to the pyramidal top layer of described image and other each layer comparison results The binary coding string in the corresponding channel of word tower；

Second combining unit obtains the first compression result of described image for the intermediate compression result according to all fragments.

7. device according to claim 6, which is characterized in that described device further include:

Second compression unit is compressed for each Color Channel to each fragment；

Second combining unit, the compression result in all colours channel for merging all fragments, obtains the second of described image Compression result；

Third combining unit, for first compression result and second compression result to be merged into the compression of described image As a result.