CN109995373B

CN109995373B - Mixed packing compression method for integer arrays

Info

Publication number: CN109995373B
Application number: CN201810004038.4A
Authority: CN
Inventors: 武鹏程; 方艳; 高佳玲; 蔡建兵; 张波; 孙荣卫; 左兵
Original assignee: Shanghai Abup Intelligent Technology Co ltd
Current assignee: Shanghai Abup Intelligent Technology Co ltd
Priority date: 2018-01-03
Filing date: 2018-01-03
Publication date: 2023-08-15
Anticipated expiration: 2038-01-03
Also published as: CN109995373A

Abstract

A mixed packing compression method of an integer array relates to the field of data compression, in particular to a mixed packing compression method of an integer array, which comprises the following steps: obtaining an integer array to be compressed, wherein the storage lengths of the integers are the same; converting the signed integer array, taking the absolute value of the original integer, shifting one bit to the left, and recording signs by using 0 and 1 for the lowest bit; traversing the whole integer absolute number array to obtain a median; packing the integer arrays by using two different packing basic storage bit numbers respectively to form two packed integer arrays; respectively compressing the two packed integer arrays by using a preset compression algorithm; selecting smaller compressed files after comparison; the packed base number of storage bits is recorded in the selected compressed file. After the technical scheme is adopted, the invention has the beneficial effects that: the method can reduce the storage capacity of the compressed file by about 10%, and is beneficial to improving the transmission success rate of the upgrade package of the mobile terminal and saving the flow cost.

Description

Mixed packing compression method for integer arrays

Technical Field

The invention relates to the field of data compression, in particular to a mixed packing compression method of an integer array.

Background

The data compression refers to a technical method for reducing the data volume to reduce the storage space and improve the transmission, storage and processing efficiency of the data or reorganizing the data according to a certain algorithm on the premise of not losing useful information and reducing the redundancy and storage space of the data. At present, data compression can be generally divided into two types, one is called lossless compression and the other is called lossy compression, but both methods are used for directly compressing file data, so that the storage space is large, the compression and decompression speed is low, the file transmission speed is also low, and the flow cost is high.

Disclosure of Invention

The invention aims to overcome the defects and shortcomings of the prior art and provide a mixed packing compression method of an integer array, which can reduce the storage capacity of compressed files by about 10%, thus being beneficial to improving the transmission success rate of mobile terminal upgrade packages and saving the flow cost.

In order to achieve the above purpose, the invention adopts the following technical scheme: it comprises the following steps:

step one: obtaining an integer array to be compressed, wherein the storage lengths of all integers of the integer array are the same;

step two: converting the signed integer array, taking the absolute value of the original integer and shifting one bit to the left, wherein the sign of the lowest bit is recorded by 0 and 1, and if the signed integer array is an unsigned integer array, the step can be omitted;

step three: traversing the whole integer absolute number array to obtain a median, and inspiring to search the basic storage bit number of the integer package;

step four: packing the integer arrays by using two different basic storage bit numbers respectively to form two packed integer arrays;

step five: respectively compressing the two packed integer arrays by using a preset compression algorithm;

step six: comparing the sizes of the compressed data files, and selecting smaller compressed files;

step seven: the packed base number of storage bits is recorded in the selected compressed file.

The mixed packing compression method of the integer array converts the signed integer array, shifts the whole original integer to the left by 1 bit, uses the lowest bit to represent the sign, and is convenient for the packing processing of the integer on the premise of not influencing the calculation.

The mixed packing compression method of the integer array uses the median of the absolute number array of the integer to inspire to search the packed basic storage bit number.

The mixed packing compression method of the integer array packs the integer array by using the basic storage bit number smaller than the integer storage length, increases the effective utilization of the integer storage space and reduces the space occupied by invalid numerical values.

The working principle of the invention is as follows: referring to fig. 1, the specific implementation flow of the present invention includes the following steps:

step 101, obtaining an integer array to be compressed, wherein the storage length of each integer is consistent;

step 102, if the integer array is signed, converting it, shifting the whole original integer left by 1 bit, and representing the sign by the lowest bit, (0 is positive number, 1 is negative number); this step may be omitted if the integer array is unsigned;

step 103, traversing an integer absolute value array to obtain a median instance;

step 104, inspiring to search the packed basic storage bit number according to the median, the invention discovers that the packing effect of four bits, eight bits and sixteen bits is better through experiments, so that four bits, eight bits or sixteen bits are recommended to be used for integer packing. Examples: the number of intermediate bits is 0x110111100, then the number of packed base storage bits is eight bits and sixteen bits;

step 105, integer packing is performed using the base number of stored bits. An example of packing is shown in fig. 2: when the eight bits are used as the basis to store the bit number, the data is actually stored as the last seven bit value, the bit value of the highest bit is used for recording the integrity of the data, 1 is incomplete, 0 is complete, when the integer is 0x110111100, firstly, the eight bits are used for storing 0x0111100, the highest bit is used for recording 1, the rest 11 is stored by the eight bits, the highest bit is used for recording 0, when the algorithm reads the integer beginning with 1, the next number is automatically spliced until the integer beginning with 0 is read; when the integer is 0x110, storing 0x110 with seven bits, and the bit value of the most significant bit is 0;

step 106, compressing the two packed integer array files by using a predetermined compression algorithm, so that the compression of the storage space of invalid values is reduced;

step 107, comparing the sizes of the two packed compressed files, selecting a smaller packed compressed file, and recording the number of packed basic storage bits in the packed compressed file for decompression.

After the technical scheme is adopted, the invention has the beneficial effects that: the method can reduce the storage capacity of the compressed file by about 10%, thereby being beneficial to improving the transmission success rate of the upgrade package of the mobile terminal and saving the flow cost.

Drawings

In order to more clearly illustrate the embodiments of the invention or the technical solutions of the prior art, the drawings which are used in the description of the embodiments or the prior art will be briefly described, it being obvious that the drawings in the description below are only some embodiments of the invention, and that other drawings can be obtained according to these drawings without inventive faculty for a person skilled in the art.

FIG. 1 is a flow chart of an embodiment of the present invention;

fig. 2 is a diagram of an embodiment of the present invention.

Detailed Description

Referring to fig. 1-2, the technical scheme adopted in the specific embodiment is as follows: it comprises the following steps:

The foregoing is merely illustrative of the present invention and not restrictive, and other modifications and equivalents thereof may occur to those skilled in the art without departing from the spirit and scope of the present invention.

Claims

1. A mixed packing compression method of an integer array is characterized in that: it comprises the following steps:

step three: traversing the whole integer absolute number array to obtain a median, and inspiring to search the basic storage bit number of the integer package, wherein the basic storage bit number is the bit number based on which the integer/integer array is divided and packaged, and the basic storage bit number comprises: one or more of four bits, seven bits, eight bits, and sixteen bits;

step four: the method comprises the steps that integer packaging is carried out on a basic storage bit number, when the integer in an integer array is divided and packaged according to the basic storage bit number, the bit value of the highest bit of the basic storage bit number is used for recording the integrity of data, and other bits except the highest bit store the data, so that the integrity of the data is judged and the complete data is guaranteed to be read according to the bit value of the highest bit when the packaged integer array is read; packing the integer arrays by using two different basic storage bit numbers respectively to form two packed integer arrays;

2. The hybrid packing compression method of an integer array according to claim 1, wherein: the signed integer array is converted, the whole original integer is shifted to the left by 1 bit, and the sign is represented by the lowest bit.

3. The hybrid packing compression method of an integer array according to claim 1, wherein: it uses the median of the absolute number array of integers to inspire a search for the packed base number of stored bits.

4. The hybrid packing compression method of an integer array according to claim 1, wherein: it packages an integer array with a base number of storage bits that is smaller than the integer storage length.