CN110602498A - Self-adaptive finite state entropy coding method - Google Patents

Self-adaptive finite state entropy coding method

Info

Publication number
CN110602498A
CN110602498A
Authority
CN
China
Prior art keywords
data
state
coded
decoding
output
Prior art date
Legal status: Granted
Application number
CN201910890254.8A
Other languages
Chinese (zh)
Other versions
CN110602498B (en)
Inventor
唐驰鹏
Current Assignee
Individual
Original Assignee
Individual
Priority date
Filing date
Publication date
Application filed by Individual
Priority to CN201910890254.8A
Publication of CN110602498A
Application granted granted Critical
Publication of CN110602498B
Status: Active

Classifications

    • H - ELECTRICITY
    • H04 - ELECTRIC COMMUNICATION TECHNIQUE
    • H04N - PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N 19/10 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N 19/102 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N 19/13 - Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
    • H - ELECTRICITY
    • H04 - ELECTRIC COMMUNICATION TECHNIQUE
    • H04N - PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N 19/10 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N 19/169 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H - ELECTRICITY
    • H04 - ELECTRIC COMMUNICATION TECHNIQUE
    • H04N - PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N 19/90 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N 19/91 - Entropy coding, e.g. variable length coding [VLC] or arithmetic coding

Abstract

The invention discloses a self-adaptive finite state entropy coding method, which relates to the field of data compression and comprises the following steps: scanning the data to be encoded to obtain a frequency set of symbols, preprocessing the frequency set, dynamically maintaining and updating the frequency set and a cumulative distribution set, and performing adaptive encoding based on the coding rule combined with reforming processing to obtain coded output data; establishing an initial frequency set with all elements equal to 1, reading in the data to be decoded, performing self-adaptive decoding based on the decoding rule combined with inverse reforming processing, and dynamically maintaining and updating the frequency set and the cumulative distribution set to obtain decoded output data; and transforming the alphabet set of the data to be coded and the alphabet set of the coded output data, and carrying out self-adaptive finite state entropy coding on the data to be coded to obtain encrypted data. The invention can simplify the coding steps and improve the coding speed on the premise of ensuring the coding precision, and can better meet the coding requirements at the present stage.

Description

Self-adaptive finite state entropy coding method
Technical Field
The invention relates to the technical field of data coding, in particular to a self-adaptive finite state entropy coding method.
Background
Entropy coding is a lossless data compression method based on information entropy theory. Common entropy codes include Shannon coding, Huffman coding and arithmetic coding, which are widely used to compress various kinds of data such as images, video, speech and text.
In the internet field, data compression technology not only reduces storage requirements but also reduces the bandwidth occupied by data transmission, which greatly saves data storage and transmission costs, and lossless compression has long been a research focus for scholars at home and abroad. With continuous technical improvements, Huffman coding and arithmetic coding, as the better-performing entropy codes, have been widely used in various fields.
Huffman coding, also called optimal coding, is a variable length coding scheme that constructs codewords with the shortest average length from the occurrence frequencies of the information characters. However, it cannot always approach the information entropy well, because the Huffman code length of a single character cannot be smaller than 1 bit, and since a Huffman tree must be constructed, adaptive Huffman coding requires dynamic adjustment of the tree, which is a complex and inefficient process. A practical improved scheme is canonical Huffman coding, which does not need to build a tree during encoding and decoding and greatly improves the coding speed, but it is difficult to make adaptive.
The principle of arithmetic coding is to construct a corresponding interval from the probabilities of the statistical information characters, to repeatedly subdivide the interval according to the input characters, and finally to output a decimal in the range [0, 1). The theory is mathematically elegant, but it cannot be realized directly on a computer because it would require representing a decimal of infinite precision. After continual improvement, practical schemes such as CACM87 and the Q-coder appeared, but every such improvement brings precision loss and added complexity to arithmetic coding; in practice, arithmetic coding only obtains a compression ratio slightly better than Huffman coding while being far slower than Huffman coding in encoding and decoding.
The principle of the asymmetric numeral system is to construct a corresponding coding table from the probability distribution of the information characters and to complete encoding and decoding through state transitions in that table. Asymmetric numeral systems include three variants: the uniform asymmetric binary system, the range asymmetric numeral system and the tabled asymmetric numeral system; none of them is an adaptive coding method. The uniform asymmetric binary system has the highest coding precision but can only code binary symbols, and the precisions of the range variant and the table variant are successively lower.
The invention provides a self-adaptive finite state entropy coding method based on the asymmetric numeral system, which can approach the information entropy more closely; in addition, on the basis of this scheme, data encryption is completed by transforming the coding table and the output coding symbol set, so the method can be widely applied to various data compression and encryption scenarios.
Disclosure of Invention
Aiming at the defects in the prior art, the invention aims to provide a self-adaptive finite state entropy coding method, which avoids pre-storing frequency or probability information in coded data and also avoids the problem that a static coding method cannot adapt to information with large statistical rule change, is consistent with arithmetic coding in coding precision and has higher coding speed than the arithmetic coding.
In order to achieve the above purposes, the technical scheme adopted by the invention is as follows:
The invention discloses a self-adaptive finite state entropy coding method, which comprises the following flows:
Encoding flow: scanning the data to be coded to obtain a frequency set of symbols, preprocessing the frequency set, dynamically maintaining and updating the frequency set and the cumulative distribution set according to the current symbol to be coded, and performing adaptive coding based on the coding rule combined with reforming (renormalization) processing to obtain coded output data;
Decoding flow: establishing an initial frequency set with all elements equal to 1, reading in the data to be decoded, performing self-adaptive decoding based on the decoding rule combined with inverse reforming processing, and dynamically maintaining and updating the frequency set and the cumulative distribution set according to the symbol output by the current decoding to obtain decoded output data;
Encryption flow: transforming the alphabet set of the data to be coded and the alphabet set of the coded output data, and carrying out self-adaptive finite state entropy coding on the data to be coded according to the two transformed sets to obtain the encrypted data.
On the basis of the above technical solution, the preprocessing specifically includes performing a self-increment 1 update operation on all elements in the original frequency set.
On the basis of the above technical solution, the encoding process includes the following steps:
scanning the data to be encoded to obtain a frequency set and an alphabet set of symbols, and initializing the alphabet set of encoding output data;
performing self-increment 1 preprocessing on all elements of the frequency set, generating a corresponding cumulative distribution set, and initializing a value of a state;
accessing the data to be coded in reverse order, updating the frequency set and the corresponding cumulative distribution set according to the current symbol to be coded, updating the state based on the coding rule combined with reforming processing, and outputting coded symbols;
repeatedly reforming the state until the state returns to zero;
wherein the alphabet set is denoted as Σ = {s_1, s_2, …, s_n};
the encoding output alphabet set is Γ = {t_0, t_1, …, t_{γ-1}};
the state is a variable x;
the frequency set records the number of occurrences F_i of each symbol s_i and is written in simplified form as F = {F_1, F_2, …, F_n};
the cumulative distribution set is denoted as A, where element A_i = F_1 + F_2 + … + F_i and A_0 = 0.
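As an illustration of the definitions above, a minimal Python sketch that builds Σ, the preprocessed frequency set F and the cumulative distribution set A from the data to be encoded; the function name and the sample string are illustrative and not taken from the patent.

```python
from collections import Counter

def build_model(data):
    """Build the alphabet Σ, the preprocessed frequency set F and the cumulative set A."""
    counts = Counter(data)
    sigma = sorted(counts)                 # alphabet set Σ = {s_1, ..., s_n}
    F = [counts[s] + 1 for s in sigma]     # self-increment-1 preprocessing of every F_i
    A = [0]                                # A_0 = 0
    for f in F:
        A.append(A[-1] + f)                # A_i = F_1 + F_2 + ... + F_i
    return sigma, F, A

sigma, F, A = build_model("ABRACADABRA")
# sigma = ['A', 'B', 'C', 'D', 'R'], F = [6, 3, 2, 2, 3], A = [0, 6, 9, 11, 13, 16]
```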
on the basis of the above technical solution, the encoding process includes the following steps:
M1, when the data to be coded is empty, entering step M3; otherwise reading a character s_i, performing a self-decreasing-1 update on element F_i in the frequency set F, updating the cumulative distribution set A accordingly, and then entering step M2;
M2, substituting the current frequency set F and cumulative distribution set A, the current state x and the symbol s_i to be coded into C(x, s_i) to calculate a new state x', and then substituting x' into the reforming function D_γ: when x' satisfies the reforming condition, outputting a coded symbol t_{x' mod γ}, changing the current state x to ⌊x'/γ⌋, i.e. x = ⌊x'/γ⌋, and repeating step M2; when x' does not satisfy the reforming condition, changing the current state x to x' and re-entering step M1;
M3, when the data to be coded is empty and the state x is still greater than 0, repeatedly substituting x into the reforming function D_γ: when ⌊x/γ⌋ is greater than 0, outputting a coded symbol t_{x mod γ}, changing the current state x to ⌊x/γ⌋, i.e. x = ⌊x/γ⌋, and repeating step M3; when ⌊x/γ⌋ equals 0, outputting a coded symbol t_{x mod γ}; the final state x is then 0 and the encoding is finished.
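A runnable Python sketch of steps M1 to M3. The steps above leave the reforming condition and the exact form of C(x, s_i) to the functions D_γ and C; the sketch assumes the standard asymmetric-numeral-system transition C(x, s_i) = A_n·⌊x/F_i⌋ + A_{i-1} + (x mod F_i) and renormalizes the state into [F_i, γ·F_i) before each coding step, an assumption rather than the patent's stated rule, which keeps every coded state below γ·A_n and matches the decoding flow below. Function and variable names are illustrative.

```python
from collections import Counter

def afse_encode(data, gamma=256):
    """Adaptive finite state entropy encoding (sketch of steps M1-M3)."""
    if not data:
        return [], [], 0
    counts = Counter(data)
    sigma = sorted(counts)
    index = {s: i for i, s in enumerate(sigma)}
    F = [counts[s] + 1 for s in sigma]        # preprocessing: self-increment-1 on every F_i
    digits = []                               # coded symbols t_{x mod γ}, stored as integers
    x = F[index[data[-1]]] - 1                # default initial state x = F_z - 1
    for s in reversed(data):                  # M1: access the data to be coded in reverse order
        i = index[s]
        F[i] -= 1                             # self-decreasing-1 update of F_i
        f, M, B = F[i], sum(F), sum(F[:i])    # F_i, total A_n and cumulative A_{i-1}
        # (the patent maintains A with a binary index tree; plain sums keep the sketch short)
        while x >= gamma * f:                 # renormalization (assumed threshold)
            digits.append(x % gamma)          # output coded symbol t_{x mod γ}
            x //= gamma
        x = M * (x // f) + B + (x % f)        # M2: x' = C(x, s_i)
    while x > 0:                              # M3: flush the state until it returns to zero
        digits.append(x % gamma)
        x //= gamma
    return sigma, digits, len(data)
```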
On the basis of the above technical solution, the decoding process includes the following steps:
initializing a frequency set with elements of 1, generating a corresponding cumulative distribution set, initializing an alphabet set of data to be decoded and an alphabet set of decoding output data, and initializing a value of a state;
accessing data to be decoded in a reverse order, finishing state updating and decoding symbol output based on a decoding rule by combining a reverse reforming technology, and updating a frequency set and an accumulated distribution set;
repeatedly performing inverse reforming on the state variable, wherein if the final state is consistent with the initial state of the code, the decoding is successful, otherwise, the decoding is wrong;
wherein, the data to be decoded is the coded output data in the coding step;
the decoding output data is the data to be encoded in the encoding step.
On the basis of the above technical solution, the decoding process includes the following steps:
N1, when the length of the decoded output data is greater than or equal to the length of the original data, entering step N2; otherwise processing according to the current state x: when x < A_n, x is inverse-reformed: a character t_i to be decoded is read in in reverse order; if the reading fails, the next step N2 is entered; otherwise the current state x and the read-in t_i are substituted into C_γ(x, t_i) to obtain a new state x', the current state x is changed to x', i.e. x = x', and step N1 is repeated; when x ≥ A_n, x is substituted into D(x) = (x', s_i), where s_i is taken as the decoding output, the current state x is changed to x', i.e. x = x', a self-increment-1 update is performed on F_i in the frequency set F, the cumulative distribution set A is updated accordingly, and step N1 is repeated;
N2, a character t_i to be decoded is read in; if the reading succeeds, the current state x is substituted into C_γ(x, t_i) to obtain a new state x', the current state x is changed to x', i.e. x = x', and step N2 is repeated; when the reading fails, it is judged whether the current state x equals the initial state of the encoding: equality indicates that the data has been correctly recovered, and inequality indicates that the data is wrong or has been tampered with.
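A matching decode sketch for steps N1 and N2, under the same assumptions as the encoding sketch above; the coded symbols are consumed in reverse order of emission, and the caller compares the returned final state with the encoder's initial state as the integrity check.

```python
from bisect import bisect_right

def afse_decode(sigma, digits, orig_len, gamma=256):
    """Adaptive finite state entropy decoding (sketch of steps N1-N2)."""
    n = len(sigma)
    F = [1] * n                               # initial frequency set with all elements 1
    out = []
    x = 0
    pos = len(digits) - 1                     # read the data to be decoded in reverse order
    while len(out) < orig_len:                # N1
        A = [0]
        for f in F:
            A.append(A[-1] + f)               # cumulative distribution set, A[n] = A_n
        M = A[n]
        if x < M:                             # inverse reforming C_γ(x, t_i) = γ·x + i
            if pos < 0:
                break                         # reading fails: fall through to the final check
            x = gamma * x + digits[pos]
            pos -= 1
        else:                                 # decode: D(x) = (x', s_i)
            r = x % M
            i = bisect_right(A, r) - 1        # s_i with A_{i-1} <= (x mod A_n) < A_i
            out.append(sigma[i])
            x = F[i] * (x // M) + r - A[i]
            F[i] += 1                         # self-increment-1 update after the output
    while pos >= 0:                           # N2: absorb any remaining coded symbols
        x = gamma * x + digits[pos]
        pos -= 1
    return "".join(out), x

sigma, digits, length = afse_encode("ABRACADABRA")   # encoder sketch from above
text, final_state = afse_decode(sigma, digits, length)
# text == "ABRACADABRA" and final_state equals the encoder's default initial state
```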
On the basis of the technical scheme, the encryption process comprises the following steps:
transforming the alphabet set of the data to be encoded according to the key to obtain a new alphabet set of the data to be encoded;
obtaining a new encoding output alphabet set according to the key transformation encoding output alphabet set;
and performing self-adaptive finite state entropy coding on the data to be coded through the two new alphabet sets to complete data encryption.
Compared with the prior art, the invention has the advantages that:
the self-adaptive finite state entropy coding method avoids pre-storing frequency or probability information in coded data, also avoids the problem that a static coding method cannot adapt to information with large change of statistical rules, can provide stable and reliable compression ratio for any data, has higher coding speed, and can better meet the coding requirements at the present stage.
Drawings
FIG. 1 is a flow chart of adaptive finite state entropy encoding;
FIG. 2 is a flow chart of decoding of adaptive finite state entropy;
FIG. 3 is an example of adaptive finite state entropy encoding;
FIG. 4 is an example of adaptive finite state entropy decoding;
FIG. 5 is a schematic diagram of a binary index tree;
fig. 6 is an example of encryption based on adaptive finite state entropy coding.
Detailed Description
Embodiments of the present invention will be described in further detail below with reference to the accompanying drawings.
The embodiment of the invention provides a self-adaptive finite state entropy coding method, which can avoid pre-storing frequency information in coded data, greatly improve the coding precision, and has the advantages of simple coding rule, high coding speed and excellent comprehensive performance.
In order to achieve the technical effects, the general idea of the application is as follows:
a method of adaptive finite state entropy coding, comprising the steps of:
s1, scanning the data to be coded to obtain a frequency set;
s2, performing self-increment 1 updating on all elements of the frequency set, generating a corresponding cumulative distribution set, and initializing the value of the state;
s3, trying to read a character to be coded, if the reading is successful, entering the step S4, otherwise, entering the step S6;
s4, updating the frequency set according to the current character to be coded, and updating the cumulative distribution set correspondingly;
s5, based on the coding rule and combined with the reforming processing, updating the state and outputting the codes;
and S6, repeating the reforming processing until the state returns to zero.
It should be noted that the initial state of the above coding must be set so as to avoid a state loop, in which the new state is the same as the old state during coding, no code is output, and the coding falls into an endless loop.
The above reforming (renormalization) processing avoids the problem that the coding state grows without bound and cannot be represented in a computer; after the reforming processing, the coding state is bounded and the output of coded symbols is guaranteed.
Example one
Referring to fig. 1, an embodiment of the present invention provides an adaptive finite state entropy coding method, where the method includes an encoding process, and the encoding process includes the following steps:
S1, scanning the original data to be coded, and calculating an initial alphabet set Σ and a frequency set F;
S2, to ensure that no F_i = 0 appears in the frequency set F during the self-adaptive finite state entropy coding, 1 is first added to every element of the statistical frequency set, i.e. a new set F = {F_1+1, F_2+1, …, F_n+1} is generated, and the corresponding cumulative distribution set A is calculated from the elements of the new frequency set F. Meanwhile, to ensure that the decoded data has the same order as the original data, the data needs to be read in order from tail to head. Finally, an initial value is given to the state x, where x satisfies the admissibility requirement determined by s_z, the first character to be coded; if the data needs to be checked after decoding is finished, x is any random integer meeting the condition, and the initial state x and the original data length need to be stored in the header of the coded output data; otherwise the default check is used, namely x = F_z - 1; as shown in fig. 3, x = F_z - 1 = F_1 - 1 = 2 - 1 = 1;
S3, trying to read a character for encoding according to the sequence from the tail to the head, if the reading fails, indicating that the data to be encoded is completely read, entering the step S6, otherwise entering the step S4;
S4, according to the currently read character s_i, performing a self-decreasing-1 operation on element F_i in the frequency set F to generate a new set F = {F_1, F_2, …, F_i - 1, …, F_n}; since the value of F_i has changed, the update of the cumulative distribution set A is completed in a binary index tree before proceeding to step S5. As shown in fig. 5, the cumulative distribution set A does not actually exist as a plain array: when F_i in the frequency set F changes, by the properties of the binary index tree only log(n) elements of the tree array A' need to be updated in the worst case, and when the character s_i is encoded only log(n) elements of A' need to be summed in the worst case to obtain the term A_{i-1} required by the coding function C(x, s_i);
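The binary index tree mentioned here is the standard Fenwick tree; a compact Python sketch (a textbook structure, not code from the patent) showing that updating one F_i and querying a prefix sum A_{i-1} each take O(log n) operations:

```python
class FenwickTree:
    """Binary index tree over the frequency set F; prefix(i) returns A_i."""
    def __init__(self, F):
        self.n = len(F)
        self.t = [0] * (self.n + 1)
        for i, f in enumerate(F, start=1):
            self.add(i, f)

    def add(self, i, delta):            # F_i += delta (1-based index), O(log n)
        while i <= self.n:
            self.t[i] += delta
            i += i & -i

    def prefix(self, i):                # A_i = F_1 + ... + F_i, O(log n)
        s = 0
        while i > 0:
            s += self.t[i]
            i -= i & -i
        return s

bit = FenwickTree([2, 3, 4])            # preprocessed frequencies of {a, b, c} from fig. 3
bit.add(3, -1)                          # encode 'c': self-decreasing-1 update of F_3
assert bit.prefix(2) == 5               # A_2 = F_1 + F_2, needed as A_{i-1} for C(x, s_i)
```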
S5, the step reforms the state x, according to the current frequency set F and the cumulative distribution set A, the current state x and the symbol S to be codediBy bringing into C (x, s)i) In the method, a new state x 'is obtained by calculation, and then x' is substituted into a renormalization functionAnd discussed in two cases:
case 1: if it is calculatedThen a coded symbol t is outputx′modγAnd changing the current state x toNamely, it isAnd the present step S5 is repeated again,
case 2: if it is calculatedThe current state x is changed to x ', that is, x ═ x', and the process proceeds to step S3 to encode the next character;
s6, this step corresponds to the coded data being read, but the coding is not completely completed, and the state x is still larger than 0, so x needs to be repeatedly brought intoUp to x ═ 0, the same is discussed here in two cases:
case 1: if it is calculatedThen a coded symbol t is outputx modγAnd changing the current state x toNamely, it isThe present step S6 is repeated again,
case 2: if it is calculatedThen a coded symbol t is outputx modγWhen the final state x is 0, the encoding is finished, and as shown in fig. 3, the state x is 0 at the end of the encoding;
wherein the alphabet set Σ = {s_1, s_2, …, s_n} and the frequency set in simplified representation is F = {F_1, F_2, …, F_n}; scanning the data to be encoded "ccbca" may, as shown in fig. 3, result in the alphabet set Σ = {a, b, c} and the initial frequency set F = {1, 2, 3};
the definition of the cumulative distribution set A is related to the frequency set F: A_0 = 0 and A_i = F_1 + F_2 + … + F_i for i = 1, …, n;
the coding function C(x, s_i) is defined as follows:
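An assumed reconstruction, not the patent's verbatim formula: under the standard asymmetric numeral system construction, which is consistent with the decoding function D(x) and the relation A_i = A_{i-1} + F_i described in the second embodiment, the coding function takes the form

$$x' = C(x, s_i) = A_n \left\lfloor \frac{x}{F_i} \right\rfloor + A_{i-1} + (x \bmod F_i),$$

so that x' mod A_n falls inside [A_{i-1}, A_i), which is exactly the interval the decoding function searches to recover s_i.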
the encoding output alphabet Γ = {t_0, t_1, …, t_{γ-1}}, where Γ is defined as an ordered set of natural numbers, i.e. Γ = {0, 1, …, γ-1}; γ = 2 and γ = 2^8 then correspond to a bit stream and a byte stream respectively;
according to the definition of the encoding alphabet Γ, the coding-time reforming function D_γ(x) is defined so that ⌊x/γ⌋ corresponds to the reformed x and t_{x mod γ} corresponds to the output coded character; since (x mod γ) ∈ [0, γ-1], it is apparent that t_{x mod γ} must be in the set of encoded output alphabets Γ.
Example two
Referring to fig. 2, the second embodiment of the present invention further provides an adaptive finite state entropy coding method, which further includes a corresponding decoding process for data decoding, where the decoding process includes the following steps:
A1, the decoding side has the same original-data alphabet set Σ = {s_1, s_2, …, s_n} and the same encoding output alphabet set Γ = {t_0, t_1, …, t_{γ-1}} as the encoding side. The state at the end of encoding is x = 0, and since the frequency set had 1 added to every element at the beginning of encoding and is decremented once for every encoded symbol, the frequency set F is all 1s at the end of encoding; the corresponding initial frequency set for decoding is therefore all 1s, i.e. F = {F_1, F_2, …, F_n} = {1, 1, …, 1}, the corresponding cumulative distribution set A = {A_0, A_1, …, A_n} is established from F, and step A2 is entered. Since F is all 1s, the definition of A_i yields A = {0, 1, 2, …, n}; as shown in fig. 4, the initial frequency set F = {1, 1, 1} and A = {0, 1, 2, 3};
a2, when the length of the decoded output data is less than the length of the original data, entering the step A3, otherwise entering the step A5;
a3, the step is mainly used for carrying out reverse reforming on the state x, and the following two conditions are processed according to the current state x:
Case 1: if x < A_n, x is inverse-reformed: an attempt is made to read in one character t_i of the data to be decoded in order from beginning to end; if the reading fails, the next step A4 is entered; otherwise the current state x and the read-in t_i are substituted into the inverse reforming function C_γ(x, t_i) to obtain a new state x', the current state x is changed to x', i.e. x = x', and step A2 is re-entered;
Case 2: if x ≥ A_n, x is substituted into D(x) = (x', s_i), where s_i is taken as the decoding output, the current state x is changed to x', i.e. x = x', and the process proceeds to step A4;
A4, for the s_i output from decoding, performing a self-increment-1 operation on F_i in the frequency set F, i.e. F = {F_1, F_2, …, F_i + 1, …, F_n}, finishing the update of the cumulative distribution set A using the binary index tree, and then entering step A2;
A5, when this step is reached the original data has been recovered, but characters of the data to be decoded may still remain, so an attempt is made to read a character t_i of the data to be decoded; if the reading succeeds, the current state x is substituted into C_γ(x, t_i) to obtain a new state x', the current state x is changed to x', i.e. x = x', and step A5 is repeated; if the reading fails, it is judged whether the current state x equals the initial state of the encoding; if so, the data has been correctly recovered, otherwise the data is wrong or has been tampered with. As shown in fig. 3, the initial state of the encoding is 1, and the final decoding state shown in fig. 4 is also 1; the two are equal and the decoding succeeds;
wherein the inverse reforming function C_γ(x, t_i) is defined as follows:
x' = C_γ(x, t_i) = γ·x + i
in the above formula, x is the current state and t_i is the currently read coded output symbol; substituting them into C_γ(x, t_i) yields the new state x';
wherein the decoding function D(x) is defined as follows:
D(x) = (x', s_i), where s_i satisfies (x mod A_n) ∈ [A_{i-1}, A_i);
the meaning of the above condition on s_i is: by looking up in the set A the element that is closest to and not greater than (x mod A_n), namely A_{i-1}, the corresponding s_i is found; s_i is then the decoded output, and substituting its index i into the corresponding state-transition function yields the new state x', which is used as the decoding state of the next round;
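The state-transition part of D(x) is again an assumed reconstruction, namely the exact inverse of the coding function sketched after the definition of C(x, s_i) above, and not the patent's verbatim formula:

$$D(x) = (x', s_i), \qquad x' = F_i \left\lfloor \frac{x}{A_n} \right\rfloor + (x \bmod A_n) - A_{i-1},$$

which undoes C exactly: x mod A_n recovers A_{i-1} plus the old state modulo F_i, and ⌊x/A_n⌋ recovers the old state divided by F_i.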
In step A3, from the definition of A_i it can be deduced that A_i = A_{i-1} + F_i with F_i > 0, so the cumulative distribution set A can be viewed as an increasing sequence, and the lookup of s_i in the decoding function D(x) can be reduced to time complexity O(log(n)) by binary search.
Example three
The third embodiment of the present invention further provides a method of adaptive finite state entropy coding, which further includes the following steps for performance optimization:
when n = 2 in the alphabet set Σ = {s_1, s_2, …, s_n}, i.e. when encoding and decoding bit data, the frequency set F and the cumulative distribution set A contain only 2 and 3 elements respectively, while A_0 ≡ 0 and A_1 ≡ F_1, so the number of actually useful elements in the cumulative distribution set A is only 1; using a binary index tree to maintain and update the cumulative distribution set A, or using binary search to find s_i during decoding, has no acceleration effect, so in this case arrays or individual variables are used to maintain the elements of the frequency set F and the cumulative distribution set A, and a conditional branch is used to determine s_i, which is more efficient;
when n = 2^8 in the alphabet set Σ = {s_1, s_2, …, s_n}, i.e. when encoding and decoding data in byte units, updating an element of the binary index tree, obtaining an element A_{i-1} of the cumulative distribution set A, and using binary search for s_i in decoding each require at most log_2(2^8) = 8 operations in the worst case; since the maximum number of loop iterations is fixed, loop unrolling can greatly improve the operation efficiency;
when γ = 2^m in the output encoding alphabet set Γ = {t_0, t_1, …, t_{γ-1}}, the modulo, integer multiplication and integer division operations in the reforming process can be replaced by efficient bitwise AND, left-shift and right-shift operations, and the corresponding reforming function and inverse reforming function are transformed as follows:
x mod γ = x & (2^m - 1), ⌊x/γ⌋ = x >> m;
x' = C_γ(x, t_i) = γ·x + i = (x << m) + i
In addition, the step size of encoding and decoding can be increased and the cumulative distribution set A can then be updated directly with an array. The encoding and decoding described above read or output one character at a time and update the cumulative distribution set A with a binary index tree, so each character requires O(log(n)) operations in the worst case. If an array is used directly, O(n) operations are required per character in the worst case. If the step size is enlarged to SL ≥ n, i.e. the cumulative distribution set A is rebuilt only once for every SL characters read in, then each character's encoding or decoding spends only O(n/SL) operations on updating A on average; after the step size is increased, directly using an array to update the cumulative distribution set A therefore performs much better.
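A short Python sketch of the bit-operation replacement described here, assuming γ = 2^m; the renormalization threshold follows the same assumption as the encoding sketch in the summary above, and the names are illustrative.

```python
M_BITS = 8                      # γ = 2^m; here m = 8, i.e. the byte stream γ = 256
GAMMA = 1 << M_BITS
MASK = GAMMA - 1

def renormalize(x, f, digits):
    """Push low base-γ digits of the state while x >= γ·f, using shifts only."""
    while x >= (f << M_BITS):   # x >= γ·f  (assumed threshold, as in the earlier sketch)
        digits.append(x & MASK) # x mod γ  -> bitwise AND with 2^m - 1
        x >>= M_BITS            # x // γ   -> right shift by m
    return x

def inverse_renormalize(x, digit):
    """x' = C_γ(x, t_i) = γ·x + i, written as (x << m) + i."""
    return (x << M_BITS) + digit
```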
Example four
As shown in fig. 6, the fourth embodiment of the present invention further provides an adaptive finite state entropy coding method, which further includes an encryption process for data encryption, where the encryption process includes the following steps:
the alphabet set Σ of the data to be encoded and the key K_Σ are substituted into the transformation function f(Σ, K_Σ) to obtain a new alphabet set Σ' of the data to be coded;
the encoding output alphabet set Γ and the key K_Γ are substituted into the transformation function f(Γ, K_Γ) to obtain a new encoding output alphabet set Γ';
and adaptive finite state entropy coding is performed on the data to be coded with the two new alphabet sets Σ' and Γ', so as to obtain the encrypted data.
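A sketch of the encryption step, assuming that f(Σ, K) is a key-driven permutation of an alphabet; the patent does not fix a concrete transformation, so the seeded shuffle below is only an illustration, and the key names are hypothetical.

```python
import random

def permute_alphabet(alphabet, key):
    """f(Σ, K): derive a key-dependent permutation of an alphabet (illustrative)."""
    rng = random.Random(key)            # any keyed permutation could be used here
    permuted = list(alphabet)
    rng.shuffle(permuted)
    return permuted

sigma_prime = permute_alphabet(['a', 'b', 'c'], key="K_sigma")        # Σ' = f(Σ, K_Σ)
gamma_prime = permute_alphabet(list(range(256)), key="K_gamma")       # Γ' = f(Γ, K_Γ)
# Adaptive finite state entropy coding is then run with Σ' and Γ' in place of Σ and Γ,
# so the coded output cannot be interpreted without the keys.
```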
In the embodiment of the invention, the encryption flow can improve the safety when the encoding side and the decoding side carry out data interaction.
Example five
The fifth embodiment of the present invention further provides a self-adaptive finite state entropy coding method, wherein the coding process of the method includes the following steps:
M1, when the data to be coded is empty, entering step M3; otherwise reading a character s_i, performing a self-decreasing-1 update on element F_i in the frequency set F, and updating the cumulative distribution set A accordingly;
M2, substituting the frequency set F and the cumulative distribution set A, the current state x and the symbol s_i to be coded into C(x, s_i) to calculate a new state x', and then substituting x' into the reforming function D_γ: when x' satisfies the reforming condition, outputting a coded symbol t_{x' mod γ}, changing the current state x to ⌊x'/γ⌋, i.e. x = ⌊x'/γ⌋, and repeating step M2; when x' does not satisfy the reforming condition, changing the current state x to x' and re-entering step M1;
M3, if the data to be coded is empty and the state x is still greater than 0, repeatedly substituting x into the reforming function D_γ: when ⌊x/γ⌋ is greater than 0, outputting a coded symbol t_{x mod γ}, changing the current state x to ⌊x/γ⌋, i.e. x = ⌊x/γ⌋, and repeating step M3; when ⌊x/γ⌋ equals 0, outputting a coded symbol t_{x mod γ}; the final state x is then 0 and the encoding is finished;
it should be noted that, before the step M1, the encoding flow should further include the following steps in sequence:
scanning the data to be encoded to obtain a frequency set and an alphabet set of symbols, and initializing the alphabet set of encoding output data;
performing self-increment 1 preprocessing on all elements of the frequency set, generating a corresponding cumulative distribution set, and initializing a value of a state;
then, step M1 is started again, that is, it is determined whether the data to be encoded is empty, and then the subsequent steps are performed according to the determination result.
In another implementation manner of the embodiment of the present invention, a decoding flow of the method includes the following steps:
N1, when the length of the decoded output data is greater than or equal to the length of the original data, entering step N2; otherwise processing according to the current state x: when x < A_n, x is inverse-reformed: a character t_i to be decoded is read in in reverse order; if the reading fails, the next step N2 is entered; otherwise the current state x and the read-in t_i are substituted into C_γ(x, t_i) to obtain a new state x', the current state x is changed to x', i.e. x = x', and step N1 is repeated; when x ≥ A_n, x is substituted into D(x) = (x', s_i), where s_i is taken as the decoding output, the current state x is changed to x', i.e. x = x', a self-increment-1 update is performed on F_i in the frequency set F, the cumulative distribution set A is updated accordingly, and step N1 is repeated;
N2, a character t_i to be decoded is read in in reverse order; if the reading succeeds, the current state x is substituted into C_γ(x, t_i) to obtain a new state x', the current state x is changed to x', i.e. x = x', and step N2 is repeated; when the reading fails, it is judged whether the current state x equals the initial state of the encoding: equality indicates that the data has been correctly recovered, and inequality indicates that the data is wrong or has been tampered with;
it should be noted that, before step N1, the decoding flow should further include the following steps in sequence:
initializing a frequency set with elements of 1, generating a corresponding cumulative distribution set, initializing an alphabet set of data to be decoded and an alphabet set of decoding output data, and initializing a value of a state;
then, step N1 is started, i.e. it is determined whether the data length of the decoded output data has reached or exceeded the original data length, and then the subsequent steps are performed according to the determination result.
Example six
The sixth embodiment of the present invention further provides a method for adaptive finite state entropy coding, which includes the following steps:
Encoding flow: scanning the data to be coded to obtain a frequency set of symbols, preprocessing the frequency set, dynamically maintaining and updating the frequency set and the cumulative distribution set according to the current symbol to be coded, and performing adaptive coding based on the coding rule combined with reforming processing to obtain coded output data;
Decoding flow: establishing an initial frequency set with all elements equal to 1, reading in the data to be decoded, performing self-adaptive decoding based on the decoding rule combined with inverse reforming processing, and dynamically maintaining and updating the frequency set and the cumulative distribution set according to the symbol output by the current decoding to obtain decoded output data;
Encryption flow: transforming the alphabet set of the data to be coded and the alphabet set of the coded output data, and carrying out self-adaptive finite state entropy coding on the data to be coded according to the two transformed sets to obtain the encrypted data.
The embodiment of the invention can avoid pre-storing frequency information in the coded data, can greatly improve the coding precision, and has simple coding rule, high coding speed and excellent comprehensive performance.
In another implementation manner of the embodiment of the present invention, the preprocessing in the encoding process of the method specifically includes performing a self-increment 1 update operation on all elements in the original frequency set.
In another implementation manner of the embodiment of the present invention, the encoding process includes the following steps:
scanning the data to be encoded to obtain a frequency set and an alphabet set of symbols, and initializing the alphabet set of encoding output data;
performing self-increment 1 preprocessing on all elements of the frequency set, generating a corresponding cumulative distribution set, and initializing a value of a state;
accessing the data to be coded in reverse order, updating the frequency set and the corresponding cumulative distribution set according to the current symbol to be coded, updating the state based on the coding rule combined with reforming processing, and outputting coded symbols;
repeatedly reforming the state until the state returns to zero;
wherein the alphabet set is denoted as Σ = {s_1, s_2, …, s_n};
the encoding output alphabet set is Γ = {t_0, t_1, …, t_{γ-1}};
the state is a variable x;
the frequency set records the number of occurrences F_i of each symbol s_i and is written in simplified form as F = {F_1, F_2, …, F_n};
the cumulative distribution set is denoted as A, where element A_i = F_1 + F_2 + … + F_i and A_0 = 0.
in another implementation manner of the embodiment of the present invention, an encoding flow of the method includes the following steps:
M1, when the data to be coded is empty, entering step M3; otherwise reading a character s_i, performing a self-decreasing-1 update on element F_i in the frequency set F, updating the cumulative distribution set A accordingly, and then entering step M2;
M2, substituting the frequency set F and the cumulative distribution set A, the current state x and the symbol s_i to be coded into C(x, s_i) to calculate a new state x', and then substituting x' into the reforming function D_γ: when x' satisfies the reforming condition, outputting a coded symbol t_{x' mod γ}, changing the current state x to ⌊x'/γ⌋, i.e. x = ⌊x'/γ⌋, and repeating step M2; when x' does not satisfy the reforming condition, changing the current state x to x' and re-entering step M1;
M3, if the data to be coded is empty and the state x is still greater than 0, repeatedly substituting x into the reforming function D_γ: when ⌊x/γ⌋ is greater than 0, outputting a coded symbol t_{x mod γ}, changing the current state x to ⌊x/γ⌋, i.e. x = ⌊x/γ⌋, and repeating step M3; when ⌊x/γ⌋ equals 0, outputting a coded symbol t_{x mod γ}; the final state x is then 0 and the encoding is finished;
it should be noted that, before the step M1, the encoding flow should further include the following steps in sequence:
scanning the data to be encoded to obtain a frequency set and an alphabet set of symbols, and initializing the alphabet set of encoding output data;
performing self-increment 1 preprocessing on all elements of the frequency set, generating a corresponding cumulative distribution set, and initializing a value of a state;
then, step M1 is started again, that is, it is determined whether the data to be encoded is empty, and then the subsequent steps are performed according to the determination result.
In another implementation manner of the embodiment of the present invention, the decoding process includes the following steps:
initializing a frequency set with elements of 1, generating a corresponding cumulative distribution set, initializing an alphabet set of data to be decoded and an alphabet set of decoding output data, and initializing a value of a state;
accessing data to be decoded in a reverse order, finishing state updating and decoding symbol output based on a decoding rule by combining a reverse reforming technology, and updating a frequency set and an accumulated distribution set;
repeatedly performing inverse reforming on the state variable, wherein if the final state is consistent with the initial state of the code, the decoding is successful, otherwise, the decoding is wrong;
wherein, the data to be decoded is the coded output data in the coding step;
the decoded output data is the data to be encoded in the encoding step.
In another implementation manner of the embodiment of the present invention, the specific steps of the decoding rule and the inverse renormalization in the method are as follows:
N1, when the length of the decoded output data is greater than or equal to the length of the original data, entering step N2; otherwise processing according to the current state x: when x < A_n, x is inverse-reformed: a character t_i to be decoded is read in in reverse order; if the reading fails, the next step N2 is entered; otherwise the current state x and the read-in t_i are substituted into C_γ(x, t_i) to obtain a new state x', the current state x is changed to x', i.e. x = x', and step N1 is repeated; when x ≥ A_n, x is substituted into D(x) = (x', s_i), where s_i is taken as the decoding output, the current state x is changed to x', i.e. x = x', a self-increment-1 update is performed on F_i in the frequency set F, the cumulative distribution set A is updated accordingly, and step N1 is repeated;
N2, a character t_i to be decoded is read in; if the reading succeeds, the current state x is substituted into C_γ(x, t_i) to obtain a new state x', the current state x is changed to x', i.e. x = x', and step N2 is repeated; when the reading fails, it is judged whether the current state x equals the initial state of the encoding: equality indicates that the data has been correctly recovered, and inequality indicates that the data is wrong or has been tampered with;
it should be noted that, before step N1, the decoding flow should further include the following steps in sequence:
initializing a frequency set with elements of 1, generating a corresponding cumulative distribution set, initializing an alphabet set of data to be decoded and an alphabet set of decoding output data, and initializing a value of a state;
then, step N1 is started, i.e. it is determined whether the data length of the decoded output data has reached or exceeded the original data length, and then the subsequent steps are performed according to the determination result.
In another implementation manner of the embodiment of the present invention, the encryption process includes the following steps:
transforming the alphabet set of the data to be encoded according to the key to obtain a new alphabet set of the data to be encoded;
obtaining a new encoding output alphabet set according to the key transformation encoding output alphabet set;
and performing self-adaptive finite state entropy coding on the data to be coded through the two new alphabet sets to complete data encryption.

Claims (7)

1. An adaptive finite state entropy coding method, characterized in that it comprises the steps of:
Encoding flow: scanning the data to be coded to obtain a frequency set of symbols, preprocessing the frequency set, dynamically maintaining and updating the frequency set and the cumulative distribution set according to the current symbol to be coded, and performing adaptive coding based on the coding rule combined with reforming processing to obtain coded output data;
Decoding flow: establishing an initial frequency set with all elements equal to 1, reading in the data to be decoded, performing self-adaptive decoding based on the decoding rule combined with inverse reforming processing, and dynamically maintaining and updating the frequency set and the cumulative distribution set according to the symbol output by the current decoding to obtain decoded output data;
Encryption flow: transforming the alphabet set of the data to be coded and the alphabet set of the coded output data, and carrying out self-adaptive finite state entropy coding on the data to be coded according to the two transformed sets to obtain the encrypted data.
2. The finite state entropy coding method of claim 1, wherein the preprocessing is specifically a self-increment 1 update operation on all elements in the original frequency set.
3. The finite state entropy coding method of claim 1, wherein the encoding process comprises the steps of:
scanning the data to be encoded to obtain a frequency set and an alphabet set of symbols, and initializing the alphabet set of encoding output data;
performing self-increment 1 preprocessing on all elements of the frequency set, generating a corresponding cumulative distribution set, and initializing a value of a state;
accessing the data to be coded in reverse order, updating the frequency set and the corresponding cumulative distribution set according to the current symbol to be coded, updating the state based on the coding rule combined with reforming processing, and outputting coded symbols;
repeatedly reforming the state until the state returns to zero;
wherein the alphabet set is denoted as Σ = {s_1, s_2, …, s_n};
the encoding output alphabet set is Γ = {t_0, t_1, …, t_{γ-1}};
the state is a variable x;
the frequency set records the number of occurrences F_i of each symbol s_i and is written in simplified form as F = {F_1, F_2, …, F_n};
the cumulative distribution set is denoted as A, where element A_i = F_1 + F_2 + … + F_i and A_0 = 0.
4. the finite state entropy coding method of claim 1, wherein the encoding process comprises the steps of:
M1, when the data to be coded is empty, entering step M3; otherwise reading a character s_i, performing a self-decreasing-1 update on element F_i in the frequency set F, updating the cumulative distribution set A accordingly, and then entering step M2;
M2, substituting the frequency set F and the cumulative distribution set A, the current state x and the symbol s_i to be coded into C(x, s_i) to calculate a new state x', and then substituting x' into the reforming function D_γ: when x' satisfies the reforming condition, outputting a coded symbol t_{x' mod γ}, changing the current state x to ⌊x'/γ⌋, i.e. x = ⌊x'/γ⌋, and repeating step M2; when x' does not satisfy the reforming condition, changing the current state x to x' and re-entering step M1;
M3, when the data to be coded is empty and the state x is still greater than 0, repeatedly substituting x into the reforming function D_γ: when ⌊x/γ⌋ is greater than 0, outputting a coded symbol t_{x mod γ}, changing the current state x to ⌊x/γ⌋, i.e. x = ⌊x/γ⌋, and repeating step M3; when ⌊x/γ⌋ equals 0, outputting a coded symbol t_{x mod γ}; the final state x is then 0 and the encoding is finished.
5. The finite state entropy encoding method of claim 1, wherein the decoding process comprises the steps of:
initializing a frequency set with elements of 1, generating a corresponding cumulative distribution set, initializing an alphabet set of data to be decoded and an alphabet set of decoding output data, and initializing a value of a state;
accessing data to be decoded in a reverse order, finishing state updating and decoding symbol output based on a decoding rule by combining a reverse reforming technology, and updating a frequency set and an accumulated distribution set;
repeatedly performing inverse reforming on the state variable, wherein if the final state is consistent with the initial state of the code, the decoding is successful, otherwise, the decoding is wrong;
wherein, the data to be decoded is the coded output data in the coding step;
the decoding output data is the data to be encoded in the encoding step.
6. The finite state entropy encoding method of claim 1, wherein the decoding process comprises the steps of:
N1, when the length of the decoded output data is greater than or equal to the length of the original data, entering step N2; otherwise processing according to the current state x: when x < A_n, x is inverse-reformed: a character t_i to be decoded is read in in reverse order; if the reading fails, the next step N2 is entered; otherwise the current state x and the read-in t_i are substituted into C_γ(x, t_i) to obtain a new state x', the current state x is changed to x', i.e. x = x', and step N1 is repeated; when x ≥ A_n, x is substituted into D(x) = (x', s_i), where s_i is taken as the decoding output, the current state x is changed to x', i.e. x = x', a self-increment-1 update is performed on F_i in the frequency set F, the cumulative distribution set A is updated accordingly, and step N1 is repeated;
N2, a character t_i to be decoded is read in; if the reading succeeds, the current state x is substituted into C_γ(x, t_i) to obtain a new state x', the current state x is changed to x', i.e. x = x', and step N2 is repeated; when the reading fails, it is judged whether the current state x equals the initial state of the encoding: equality indicates that the data has been correctly recovered, and inequality indicates that the data is wrong or has been tampered with.
7. The finite state entropy coding method of claim 1, wherein the encryption process comprises the steps of:
transforming the alphabet set of the data to be encoded according to the key to obtain a new alphabet set of the data to be encoded;
obtaining a new encoding output alphabet set according to the key transformation encoding output alphabet set;
and performing self-adaptive finite state entropy coding on the data to be coded through the two new alphabet sets to complete data encryption.
CN201910890254.8A 2019-09-20 2019-09-20 Self-adaptive finite state entropy coding method Active CN110602498B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910890254.8A CN110602498B (en) 2019-09-20 2019-09-20 Self-adaptive finite state entropy coding method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910890254.8A CN110602498B (en) 2019-09-20 2019-09-20 Self-adaptive finite state entropy coding method

Publications (2)

Publication Number Publication Date
CN110602498A true CN110602498A (en) 2019-12-20
CN110602498B CN110602498B (en) 2022-03-01

Family

ID=68861477

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910890254.8A Active CN110602498B (en) 2019-09-20 2019-09-20 Self-adaptive finite state entropy coding method

Country Status (1)

Country Link
CN (1) CN110602498B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113572479A (en) * 2021-09-22 2021-10-29 苏州浪潮智能科技有限公司 Method and system for generating finite state entropy coding table
CN116933734A (en) * 2023-09-15 2023-10-24 山东济矿鲁能煤电股份有限公司阳城煤矿 Intelligent diagnosis method for cutter faults of shield machine

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1535022A (en) * 2003-12-14 2004-10-06 浙江大学 Information ontropy holding decoding method and device
CN1560823A (en) * 2004-02-19 2005-01-05 李春林 Data encipher and decipher system based on dynamic variable-length code
CN101005603A (en) * 2006-01-18 2007-07-25 华中科技大学 Method and device for enciphering, deenciphering and transfer code of image data
CN101465724A (en) * 2009-01-06 2009-06-24 中国科学院软件研究所 Encrypted Huffman encoding method and decoding method
US20160248440A1 (en) * 2015-02-11 2016-08-25 Daniel Greenfield System and method for compressing data using asymmetric numeral systems with probability distributions
US20170164007A1 (en) * 2015-12-07 2017-06-08 Google Inc. Mixed boolean-token ans coefficient coding

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1535022A (en) * 2003-12-14 2004-10-06 浙江大学 Information ontropy holding decoding method and device
CN1560823A (en) * 2004-02-19 2005-01-05 李春林 Data encipher and decipher system based on dynamic variable-length code
CN101005603A (en) * 2006-01-18 2007-07-25 华中科技大学 Method and device for enciphering, deenciphering and transfer code of image data
CN101465724A (en) * 2009-01-06 2009-06-24 中国科学院软件研究所 Encrypted Huffman encoding method and decoding method
US20160248440A1 (en) * 2015-02-11 2016-08-25 Daniel Greenfield System and method for compressing data using asymmetric numeral systems with probability distributions
US20170164007A1 (en) * 2015-12-07 2017-06-08 Google Inc. Mixed boolean-token ans coefficient coding

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
JAREK DUDA: "Asymmetric numeral systems: entropy coding combining speed of Huffman coding with compression rate of arithmetic coding", 《HTTP://ARXIV.ORG/ABS/1311.2540》 *
JUHA KARKKAINEN: "Data Compression Techniques Part 1: Entropy Coding Lecture 4: Asymmetric Numeral Systems", 《HTTP://COURSES.HELSINKI.FI/SITES/DEFAULT/FILES/COURSE-MATERIAL/4524834/DCT-LECTURE04.PDF》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113572479A (en) * 2021-09-22 2021-10-29 苏州浪潮智能科技有限公司 Method and system for generating finite state entropy coding table
CN113572479B (en) * 2021-09-22 2021-12-21 苏州浪潮智能科技有限公司 Method and system for generating finite state entropy coding table
WO2023045204A1 (en) * 2021-09-22 2023-03-30 苏州浪潮智能科技有限公司 Method and system for generating finite state entropy coding table, medium, and device
CN116933734A (en) * 2023-09-15 2023-10-24 山东济矿鲁能煤电股份有限公司阳城煤矿 Intelligent diagnosis method for cutter faults of shield machine
CN116933734B (en) * 2023-09-15 2023-12-19 山东济矿鲁能煤电股份有限公司阳城煤矿 Intelligent diagnosis method for cutter faults of shield machine

Also Published As

Publication number Publication date
CN110602498B (en) 2022-03-01

Similar Documents

Publication Publication Date Title
JP3017379B2 (en) Encoding method, encoding device, decoding method, decoder, data compression device, and transition machine generation method
US5045852A (en) Dynamic model selection during data compression
US4901075A (en) Method and apparatus for bit rate reduction
US7365658B2 (en) Method and apparatus for lossless run-length data encoding
US20060171533A1 (en) Method and apparatus for encoding and decoding key data
CN110602498B (en) Self-adaptive finite state entropy coding method
US5594435A (en) Permutation-based data compression
KR20120018360A (en) Method for variable length coding and apparatus
US6788224B2 (en) Method for numeric compression and decompression of binary data
CN112995199B (en) Data encoding and decoding method, device, transmission system, terminal equipment and storage medium
CN113630125A (en) Data compression method, data encoding method, data decompression method, data encoding device, data decompression device, electronic equipment and storage medium
JP2007318772A (en) Coding method and apparatus with at least two parallel coding steps and improved permutation, and corresponding decoding method and apparatus
CN116471337A (en) Message compression and decompression method and device based on BWT and LZW
CN113922947B (en) Self-adaptive symmetrical coding method and system based on weighted probability model
JP2023036033A (en) Data encoding method, encoder, and data encoding method
US6101281A (en) Method for improving data encoding and decoding efficiency
US20220060196A1 (en) Data compression using reduced numbers of occurrences
EP3767457A1 (en) Data communication
Li et al. A Novel ANS Coding with Low Computational Complexity
EP3767469A1 (en) Data communication
JPH0629861A (en) Data compression method
JPH0884260A (en) Compression system and expansion system for two-dimension image data
Palunčić et al. Quasi-Enumerative Coding of Balanced Run-Length Limited Codes
Jiang Parallel design of Q-coders for bilevel image compression
Wang High Efficient and Real Time Huffman Codec Used in Handwriting Short Message Service

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant