JPH07154607A

JPH07154607A - Binary picture encoder

Info

Publication number: JPH07154607A
Application number: JP30021793A
Authority: JP
Inventors: Ikurou Ueno; 幾朗上野
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1993-11-30
Filing date: 1993-11-30
Publication date: 1995-06-16
Anticipated expiration: 2015-08-07
Also published as: JP3075049B2

Abstract

PURPOSE: To degenerate Markov states efficiently by predicting a value that indicates the property of the pixel of interest and merging states whose prediction levels are close to each other into one state. CONSTITUTION: A pixel prediction means 14 predicts the pixel of interest using a reference pixel pattern 13 generated by a reference pixel pattern generating means 12. A prediction level classification means 16 quantizes the prediction level 15 calculated by the pixel prediction means 14 and classifies states whose prediction levels are close to each other into one state. A prediction value / prediction match rate decision means 18 reads a predicted value 20 and a prediction match rate 21 from a prediction value / prediction match rate reference table 19 according to the classification result 17 of the prediction level classification means 16. The conditional entropy of the image data is thereby reduced without a large increase in the size of the coding parameter reference table, which reduces the code amount.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a binary image data encoding apparatus.

[0002]

[Prior Art] Markov model coding is regarded as an effective binary image coding method for facsimile and similar applications, and a technique using it is disclosed in, for example, Japanese Patent Laid-Open No. 2-305225. Conventional Markov model coding is described below.

[0003] Let P(x_i) be the probability that an information source emits symbol x_i. The entropy (average amount of information) of that source is defined by equation (1).

[0004]

[Equation 1]
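The equation image is not reproduced in this text. Going by the description in paragraph [0005] (the information amount −log₂ P(x_i), weighted by the occurrence probability of each symbol and averaged), equation (1) presumably has the standard form below; this is a reconstruction, not a copy of the original figure.

```latex
H = -\sum_{i} P(x_i)\,\log_2 P(x_i) \tag{1}
```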

[0005] This equation takes the amount of information −log₂ P(x_i) gained by learning that symbol x_i has occurred, weights it by the occurrence probability of each symbol, and averages it. For a binary image (x_i = 0 or 1), the entropy is given by equation (2).

[0006]

[Equation 2]
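The equation image is not reproduced here either. Restricting the entropy definition to the two symbols 0 and 1 gives the following form, which is presumably what equation (2) shows; it is a reconstruction under that assumption.

```latex
H = -P(0)\,\log_2 P(0) - P(1)\,\log_2 P(1) \tag{2}
```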

[0007] This value is generally known to be the theoretical compression limit when a code is assigned by looking only at the single pixel to be encoded: the data cannot be encoded with a smaller code amount. Encoding therefore requires entropy coding matched to the occurrence probability P(x_i) of each symbol, so that the code amount comes as close as possible to the entropy H.

[0008] In general, the probability that a pixel in image data takes the value 0 or 1 depends on the values of several pixels that precede it. When the probability depends on the values of the n preceding pixels (hereinafter, reference pixels), the information source is called an n-th order Markov information source. The entropy of an n-th order Markov source, that is, the entropy given knowledge of the n pixel values (x_{i-1}, …, x_{i-n}) preceding the pixel of interest x_i (the conditional entropy), is given by equation (3). Here, P(x_{i-1}, …, x_{i-n}) is the probability that the reference pixels take the values x_{i-1}, …, x_{i-n} (joint probability), and P(x_i / x_{i-1}, …, x_{i-n}) is the probability that the pixel of interest takes the value x_i when the reference pixels take the values x_{i-1}, …, x_{i-n} (conditional probability).

[0009]

[Equation 3]
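The equation image is missing from this text. From the joint and conditional probabilities defined in paragraph [0008], the conditional entropy presumably takes the usual form below, written with the "/" conditioning notation of the text; treat it as a reconstruction rather than the original figure.

```latex
H' = -\sum_{x_{i-1},\dots,x_{i-n}} P(x_{i-1},\dots,x_{i-n})
      \sum_{x_i} P(x_i / x_{i-1},\dots,x_{i-n})\,
      \log_2 P(x_i / x_{i-1},\dots,x_{i-n}) \tag{3}
```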

[0010] The conditional entropy H′ and the entropy H of equation (2) satisfy H′ ≤ H (4), with equality when the information source is not a Markov information source.

[0011] The 2ⁿ possible states of the n reference pixels are called the states of the Markov information source, and each state is denoted S_i. Rewriting equation (3) gives equation (5).

[0012]

[Equation 4]
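The image for equation (5) is not reproduced. Paragraph [0013] describes it as the per-state entropy h(s_i) weighted by the state probability P(s_i) and averaged, which suggests the following form (a reconstruction from that description):

```latex
H' = \sum_{s_i} P(s_i)\,h(s_i),
\qquad
h(s_i) = -\sum_{x_i} P(x_i / s_i)\,\log_2 P(x_i / s_i) \tag{5}
```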

[0013] This is the entropy of each state, h(s_i) = −Σ P(x_i / s_i) log₂ P(x_i / s_i), weighted by the occurrence probability P(s_i) of that state and averaged. If, for each state, entropy coding matched to the probability P(x_i / s_i) that the pixel of interest takes 0 or 1 is performed, the code amount in each state can be brought close to its entropy h(s_i), and the overall code amount can be brought close to H′ of equation (5). The reference pixels must therefore be chosen so that the conditional entropy decreases. A coding scheme that classifies the pixel of interest by the state of its reference pixels and performs appropriate entropy coding for each state in this way is called Markov model coding.
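As a rough illustration of this classification, the sketch below estimates the conditional entropy of a one-dimensional pixel sequence using the n preceding pixels as the state. The function name `conditional_entropy_from_pixels` and the sample row are hypothetical, and restricting the reference pixels to the same scan line is a simplifying assumption; this is a minimal sketch, not the method of the cited publication.

```python
import math
from collections import Counter

def conditional_entropy_from_pixels(pixels, n):
    """Estimate H' in bits per pixel, using the n preceding pixels as the state."""
    state_counts = Counter()
    joint_counts = Counter()
    for i in range(n, len(pixels)):
        state = tuple(pixels[i - n:i])          # values of the n reference pixels
        state_counts[state] += 1
        joint_counts[(state, pixels[i])] += 1
    total = sum(state_counts.values())
    h = 0.0
    for (state, _value), count in joint_counts.items():
        p_joint = count / total                 # P(state, x_i)
        p_cond = count / state_counts[state]    # P(x_i / state)
        h -= p_joint * math.log2(p_cond)
    return h

row = [0, 1] * 8  # hypothetical scan line: strictly alternating pixels
h_without_reference = conditional_entropy_from_pixels(row, 0)
h_with_one_reference = conditional_entropy_from_pixels(row, 1)
print(h_without_reference, h_with_one_reference)
```

For the alternating row, one reference pixel already determines the next pixel, so the estimated conditional entropy drops from 1 bit to 0.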

[0014] As an example of Markov model coding, the arithmetic-code-type MELCODE described in Japanese Patent Laid-Open No. 2-305225 is explained. Its configuration is shown in FIG. 2. Reference numeral 11 denotes the binary image data to be encoded; 12, a reference pixel pattern creating means that selects, from the pixels input before the pixel of interest, the reference pixels used for Markov state classification and creates a reference pixel pattern; 18, a prediction value / prediction match rate determining means that uses the reference pixel pattern 13 to obtain the predicted value of the pixel of interest and the hit rate of that prediction (prediction match rate); 19, a prediction value / prediction match rate reference table that stores the predicted value and prediction match rate for each reference pixel pattern; 20, the predicted value (0 or 1); 21, the prediction match rate; 22, the symbol to be encoded, which indicates whether the predicted value and the pixel of interest match; 23, an arithmetic encoder that encodes the symbol to be encoded according to the prediction match rate 21; and 24, the code output from the arithmetic encoder.

[0015]

[Table 1]

[0016] Next, the operation of this encoder is described using the binary image shown in Table 1 as an example. In this example, 31 in FIG. 3 is the pixel of interest x_i, and 32 denotes the reference pixels x_{i-1} and x_{i-2}. Because there are two reference pixels, there are four states s₀ to s₃ as shown in Table 1, and the occurrence probability P(s_i) of each state, the probability P(0 / s_i) that the pixel of interest is 0 in each state, and the probability P(1 / s_i) that it is 1 take the values in Table 1. In this case, the probability that the pixel of interest is 0 is P(0) = 0.47 and the probability that it is 1 is P(1) = 0.53, so equation (2) gives H = 0.997 bit. As noted above, this is the theoretical compression limit when a code is assigned by looking only at the single pixel to be encoded. When the pixel of interest is instead classified by the state s_i of the two pixels x_{i-1} and x_{i-2}, the conditional entropy calculated from equation (5) is H′ = 0.72 bit. Classifying into Markov states and encoding each state appropriately thus makes it possible to encode with a code amount close to H′ = 0.72 bit.
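The figures in this paragraph can be checked numerically. Table 1 is not reproduced in this text, so the sketch below takes P(1 / s_i) = 0.2, 0.6, 0.6, 0.9 from the teacher signals quoted in paragraph [0042] and assumes state probabilities P(s_i) = 0.4, 0.2, 0.1, 0.3. These state probabilities are hypothetical values chosen to be consistent with P(1) = 0.53 and H′ = 0.72 bit; they are not confirmed by the source.

```python
import math

def binary_entropy(p: float) -> float:
    """Entropy in bits of a binary source with P(1) = p, as in equation (2)."""
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)

def conditional_entropy(p_state, p_one_given_state):
    """Equation (5): per-state entropies weighted by the state probabilities."""
    return sum(ps * binary_entropy(p1)
               for ps, p1 in zip(p_state, p_one_given_state))

P_ONE_GIVEN_STATE = [0.2, 0.6, 0.6, 0.9]   # quoted in paragraph [0042]
P_STATE_ASSUMED = [0.4, 0.2, 0.1, 0.3]     # ASSUMED, not taken from the source

p_one = sum(ps * p1 for ps, p1 in zip(P_STATE_ASSUMED, P_ONE_GIVEN_STATE))
H = binary_entropy(p_one)
H_cond = conditional_entropy(P_STATE_ASSUMED, P_ONE_GIVEN_STATE)
print(f"P(1) = {p_one:.2f}, H = {H:.3f} bit, H' = {H_cond:.2f} bit")
```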

[0017] At encoding time, the reference pixel pattern creating means 12 first selects pixels x_{i-1} and x_{i-2} from the already encoded pixels and creates the reference pixel pattern 13. Based on this reference pixel pattern 13, the predicted value 20 and the prediction match rate 21 are read from the prediction value / prediction match rate reference table. If the state of the reference pixels is s₁, Table 1 gives predicted value = 1 and prediction match rate = 0.6. This prediction match rate 21 is then input to the arithmetic encoder 23.

[0018] If the value of the pixel to be encoded is 1, the prediction was correct, so symbol 0, which indicates a correct prediction, is input to the arithmetic encoder 23 as the symbol to be encoded 22. If the value of the pixel to be encoded is 0, the prediction was wrong, so symbol 1, which indicates a wrong prediction, is input to the arithmetic encoder 23 as the symbol to be encoded 22. The arithmetic encoder 23 encodes the symbol to be encoded 22 with coding parameters suited to the prediction match rate, so encoding is achieved with a code amount almost equal to the entropy.
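The table lookup and hit/miss symbol described in paragraphs [0017] and [0018] can be sketched as follows. The names `REFERENCE_TABLE`, `symbol_for_pixel` and `ideal_code_length` are hypothetical. Only the entry for state s₁ (predicted value 1, match rate 0.6) is stated in the text; the other entries are assumptions derived from the black pixel appearance probabilities quoted in paragraph [0042]. The arithmetic encoder itself is not modeled; the ideal code length −log₂ p stands in for it.

```python
import math

# Hypothetical table: state index -> (predicted value, prediction match rate).
# Entry 1 follows paragraph [0017]; entries 0, 2 and 3 are assumptions.
REFERENCE_TABLE = {0: (0, 0.8), 1: (1, 0.6), 2: (1, 0.6), 3: (1, 0.9)}

def symbol_for_pixel(state: int, pixel: int):
    """Return (symbol, match_rate); symbol 0 means hit, symbol 1 means miss."""
    predicted, match_rate = REFERENCE_TABLE[state]
    symbol = 0 if pixel == predicted else 1
    return symbol, match_rate

def ideal_code_length(symbol: int, match_rate: float) -> float:
    """Ideal cost in bits of coding the symbol under the given match rate."""
    p = match_rate if symbol == 0 else 1.0 - match_rate
    return -math.log2(p)

for pixel in (1, 0):
    symbol, rate = symbol_for_pixel(1, pixel)
    print(pixel, symbol, round(ideal_code_length(symbol, rate), 3))
```

Coding the hit/miss symbol rather than the pixel value lets one encoder parameter, the match rate, describe every state regardless of which pixel value is the likely one.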

[0019] If k bits are needed per state to store the predicted value and prediction match rate, the size of the prediction value / prediction match rate reference table 19 is k × 2² bits, because there are two reference pixels.

[0020]

[Problems to Be Solved by the Invention] In Markov model coding, increasing the number of reference pixels, and hence the number of states, generally reduces the entropy. However, the reference table 19, which stores the predicted value and prediction match rate for each state, grows exponentially with the number of reference pixels, so it has been difficult to increase the number of reference pixels substantially.
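The growth described here follows directly from the k × 2ⁿ table size of paragraph [0019]. A minimal sketch, with a hypothetical k = 8 bits per state and a hypothetical function name:

```python
def reference_table_bits(num_reference_pixels: int, bits_per_state: int) -> int:
    """Table size in bits: k bits for each of the 2**n reference pixel states."""
    return bits_per_state * 2 ** num_reference_pixels

K = 8  # hypothetical number of bits stored per state
for n in (2, 4, 8, 12, 16):
    print(n, reference_table_bits(n, K))
```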

[0021] In Table 1 the pixel of interest is classified into four states according to the values the reference pixels can take. Now consider combining several of these four states into a single new state. This process is hereinafter called degeneration of Markov states, or simply degeneration. For example, combining states s₁ and s₂ of Table 1 into one state degenerates them into the three states s′₀ to s′₃ shown in Table 2.

[0022]

[Table 2]

[0023] Degenerating states generally raises the entropy, but if states with similar prediction match rates are merged, the rise in entropy is kept small. In the degeneration from Table 1 to Table 2, the two merged states have equal prediction match rates, so the degeneration is achieved with no rise in entropy. The object of the present invention is to realize a Markov model coding apparatus that performs such efficient degeneration.
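The effect of merging similar versus dissimilar states can be illustrated numerically. The sketch below uses the P(1 / s_i) values quoted in paragraph [0042] together with assumed state probabilities (the tables are not reproduced in this text, so `P_STATE_ASSUMED` is hypothetical). It compares the four-state entropy with the entropy after merging s₁ and s₂, and after merging the dissimilar pair s₀ and s₃.

```python
import math

def binary_entropy(p: float) -> float:
    """Entropy in bits of a binary source with P(1) = p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)

def merged_entropy(p_state, p_one, groups):
    """Conditional entropy after merging the states listed in each group."""
    total = 0.0
    for group in groups:
        mass = sum(p_state[i] for i in group)              # P(merged state)
        ones = sum(p_state[i] * p_one[i] for i in group)   # P(merged state, x_i = 1)
        total += mass * binary_entropy(ones / mass)
    return total

P_ONE = [0.2, 0.6, 0.6, 0.9]             # P(1 / s_i), quoted in paragraph [0042]
P_STATE_ASSUMED = [0.4, 0.2, 0.1, 0.3]   # ASSUMED P(s_i), not from the source

h_four = merged_entropy(P_STATE_ASSUMED, P_ONE, [[0], [1], [2], [3]])
h_similar = merged_entropy(P_STATE_ASSUMED, P_ONE, [[0], [1, 2], [3]])
h_dissimilar = merged_entropy(P_STATE_ASSUMED, P_ONE, [[0, 3], [1], [2]])
print(f"{h_four:.3f} {h_similar:.3f} {h_dissimilar:.3f}")
```

Merging s₁ and s₂, which share the same conditional probability, leaves the entropy unchanged, while merging s₀ and s₃ raises it noticeably.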

[0024]

[Means for Solving the Problems] The Markov state degeneration means according to the present invention predicts a value indicating the property of the pixel of interest and combines states with similar prediction levels into one state, thereby degenerating the Markov states.

[0025] In the encoder of the present invention, the prediction target is the value of the pixel of interest or the black pixel appearance probability of the pixel of interest, and the pixel of interest is classified (that is, the states are degenerated) based on that value.

[0026] In the encoder of the present invention, a plurality of prediction means and prediction level classification means are provided, and they are selected adaptively within the image according to the property of the reference pixels to degenerate the Markov states.

[0027]

[Operation] In the inventions of claims 1 to 3, prediction of the pixel-of-interest value, the black pixel appearance probability, or the like yields a value that reflects the property of the reference pixels, and combining states with similar prediction levels degenerates the Markov states while suppressing the rise in entropy. In the invention of claim 4, the prediction means and prediction level classification means used in the prediction of claims 1 to 3 are switched according to the state of the reference pixels, which likewise degenerates the Markov states while suppressing the rise in entropy.

[0028]

[Embodiments] Embodiment 1. An embodiment of the present invention is described below. FIG. 1 is a configuration diagram of an arithmetic-code-type MELCODE using the present invention. In FIG. 1, 14 denotes a pixel-of-interest prediction means that predicts the pixel-of-interest value using the reference pixel pattern 13 created by the reference pixel pattern creating means 12; 16, a prediction level classification means that quantizes the prediction level 15 calculated by the pixel-of-interest prediction means 14 and classifies states with similar prediction levels into one state; and 18, a prediction value / prediction match rate determining means that reads the predicted value and prediction match rate from the prediction value / prediction match rate reference table 19 according to the classification result 17 of the prediction level classification means 16. These are the only parts that differ from FIG. 2, so the description of the other parts is omitted.

[0029] Next, the operation is described using a binary image with the probability distribution shown in Table 1. First, using the reference pixels x_{i-1} and x_{i-2} of the reference pixel pattern 13, the pixel-of-interest prediction means 14 calculates the prediction level y_i 15 from equation (6). This y_i takes a real value and is called the prediction level to distinguish it from the predicted value 20 (0 or 1).

y_i = 0.52 x_{i-1} + 0.44 x_{i-2} (6)

[0030] This prediction function is designed in advance, as follows, so that the mean squared error between the pixel-of-interest value and the prediction level is minimized. With e_i the prediction error and a_i the prediction coefficients, the prediction error e_i is

e_i = y_i − x_i = a₁·x_{i-1} + a₂·x_{i-2} − x_i (7)

Let E[e_i²] be the mean square of the prediction error e_i. The prediction coefficients that minimize E[e_i²] are the a₁ and a₂ that satisfy equation (8).

∂E[e_i²]/∂a₁ = 0, ∂E[e_i²]/∂a₂ = 0 (8)

Equation (8) can be written as the following equation, which is called the Yule-Walker equation.

[0031]

[Equation 5]
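The equation image is not reproduced in this text. Differentiating E[e_i²] from equation (7) with respect to a₁ and a₂ and setting the results to zero, as equation (8) requires, gives the normal equations below. This is presumably the content of equation (9), although the layout of the original figure may differ.

```latex
\begin{pmatrix}
E[x_{i-1}^2] & E[x_{i-1}x_{i-2}] \\
E[x_{i-1}x_{i-2}] & E[x_{i-2}^2]
\end{pmatrix}
\begin{pmatrix} a_1 \\ a_2 \end{pmatrix}
=
\begin{pmatrix} E[x_i x_{i-1}] \\ E[x_i x_{i-2}] \end{pmatrix} \tag{9}
```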

[0032] The prediction coefficients of equation (6) were obtained by solving equation (9) for a binary image with the probability distribution of Table 1.
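The least-squares fit can be sketched as follows. Table 1 is not reproduced in this text, so the joint distribution `STATES` below is an assumption: the P(1 / s_i) values come from paragraph [0042], while the state probabilities and the mapping of states to pixel patterns are hypothetical values chosen so that the result agrees with equation (6). The 2×2 normal equations are solved by Cramer's rule.

```python
# ASSUMED distribution: (x_{i-1}, x_{i-2}) -> (P(state), P(x_i = 1 / state)).
STATES = {
    (0, 0): (0.4, 0.2),
    (1, 0): (0.2, 0.6),
    (0, 1): (0.1, 0.6),
    (1, 1): (0.3, 0.9),
}

def expectation(f):
    """E[f(x_i, x_{i-1}, x_{i-2})] under the assumed distribution."""
    total = 0.0
    for (x1, x2), (p_state, p_one) in STATES.items():
        total += p_state * (p_one * f(1, x1, x2) + (1.0 - p_one) * f(0, x1, x2))
    return total

# Second moments of the reference pixels and their correlations with x_i.
r11 = expectation(lambda x, x1, x2: x1 * x1)
r12 = expectation(lambda x, x1, x2: x1 * x2)
r22 = expectation(lambda x, x1, x2: x2 * x2)
c1 = expectation(lambda x, x1, x2: x * x1)
c2 = expectation(lambda x, x1, x2: x * x2)

# Solve [[r11, r12], [r12, r22]] @ [a1, a2] = [c1, c2] by Cramer's rule.
det = r11 * r22 - r12 * r12
a1 = (c1 * r22 - r12 * c2) / det
a2 = (r11 * c2 - r12 * c1) / det
print(f"a1 = {a1:.2f}, a2 = {a2:.2f}")
```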

[0033]

[Table 3]

[0034]

[Table 4]

[0035]

[Table 5]

[0036] The prediction level 15 obtained in this way is the y_i of Table 1. Next, as shown in Table 3, the prediction level classification means 16 quantizes the prediction level y_i into two levels with a threshold of 0.4, which combines states with similar prediction levels into one and degenerates the states. After this degeneration the conditional entropy is 0.78 bit. This value is higher than the conditional entropy H′ = 0.72 bit of Table 1 because of the degeneration. However, it is lower than the entropy obtained when the same number of states is produced by using only one reference pixel, x_{i-1} or x_{i-2}, as in Table 4 or Table 5, which confirms the effect of the degeneration of the present invention. In other words, by reading the predicted value 20 and prediction match rate 21 corresponding to each state of Table 3 from the prediction value / prediction match rate reference table 19 and performing arithmetic coding, encoding is possible with a smaller code amount than in the cases of Table 4 or Table 5, which have a reference table 19 of the same size.
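The comparison in this paragraph can be sketched numerically. The tables are not reproduced in this text, so the distribution `STATES` is an assumption (P(1 / s_i) from paragraph [0042], hypothetical state probabilities and pixel mapping). Treating a level equal to the threshold as the upper class is also an assumption; no state lies exactly on 0.4 here.

```python
import math
from collections import defaultdict

def binary_entropy(p: float) -> float:
    """Entropy in bits of a binary source with P(1) = p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)

# ASSUMED distribution: (x_{i-1}, x_{i-2}) -> (P(state), P(x_i = 1 / state)).
STATES = {(0, 0): (0.4, 0.2), (1, 0): (0.2, 0.6),
          (0, 1): (0.1, 0.6), (1, 1): (0.3, 0.9)}

def grouped_entropy(key):
    """Conditional entropy after merging states that share key(x1, x2)."""
    mass = defaultdict(float)
    ones = defaultdict(float)
    for (x1, x2), (p_state, p_one) in STATES.items():
        k = key(x1, x2)
        mass[k] += p_state
        ones[k] += p_state * p_one
    return sum(mass[k] * binary_entropy(ones[k] / mass[k]) for k in mass)

def level_class(x1: int, x2: int, threshold: float = 0.4) -> int:
    """Quantize the prediction level of equation (6) into two classes."""
    y = 0.52 * x1 + 0.44 * x2
    return 1 if y >= threshold else 0

h_level = grouped_entropy(level_class)           # degeneration by prediction level
h_x1 = grouped_entropy(lambda x1, x2: x1)        # x_{i-1} as the only reference pixel
h_x2 = grouped_entropy(lambda x1, x2: x2)        # x_{i-2} as the only reference pixel
print(f"{h_level:.2f} {h_x1:.2f} {h_x2:.2f}")
```

Under these assumptions the two-state classification by prediction level reproduces the 0.78 bit quoted above and stays below both single-pixel alternatives.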

[0037] Embodiment 2. In Embodiment 1 a linear prediction function is used as the pixel-of-interest prediction means 14, but the layered neural network shown in FIG. 4, for example, may be used as the prediction function instead. The layers are called, from the left of the figure, the input layer, the intermediate layer, and the output layer, and each layer consists of several units 43. Each unit of the input layer passes the input signal 41 unchanged to the next layer. In the intermediate and output layers, each unit takes the weighted sum of the outputs of the units in the preceding layer and calculates its output value from that sum. The output of the output-layer unit is the output 42 of the neural network. The input/output characteristics of the network are determined by the weight coefficients used in the weighted sums, and each connection has its own weight value. These weight coefficients are determined in advance by a process called learning.
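The forward computation described here can be sketched as below. The text does not state the network size, the activation function, or whether bias terms are used, so the 2-2-1 layout, the sigmoid activation, the biases, and all weight values are assumptions for illustration only; the weights are untrained.

```python
import math

def sigmoid(z: float) -> float:
    """Assumed unit activation; the source only says the output depends on the sum."""
    return 1.0 / (1.0 + math.exp(-z))

def layer_output(inputs, weights, biases):
    """Each unit takes the weighted sum of the previous layer's outputs."""
    return [sigmoid(sum(w * x for w, x in zip(unit_weights, inputs)) + b)
            for unit_weights, b in zip(weights, biases)]

def predict(reference_pixels, hidden_w, hidden_b, out_w, out_b) -> float:
    """Input layer passes the pixels through; return the single output unit."""
    hidden = layer_output(reference_pixels, hidden_w, hidden_b)
    return layer_output(hidden, out_w, out_b)[0]

# Hypothetical, untrained weights for a 2-2-1 network.
HIDDEN_W = [[1.0, 1.0], [-1.0, 0.5]]
HIDDEN_B = [0.0, 0.0]
OUT_W = [[2.0, -1.0]]
OUT_B = [-1.0]

for pattern in [(0, 0), (1, 0), (0, 1), (1, 1)]:
    level = predict(pattern, HIDDEN_W, HIDDEN_B, OUT_W, OUT_B)
    print(pattern, round(level, 3))
```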

[0038] Next, the operation is described. It is the same as in Embodiment 1 except that the prediction function uses a neural network, so the description of the other parts is omitted. First, the reference pixel pattern 13 is input to the neural network of FIG. 4, which outputs the prediction level 15 of the pixel-of-interest value. The state is then determined by classifying the prediction level 15 with the prediction level classification means 16.

[0039] The weight coefficients of the neural network are determined in advance by the backpropagation learning rule so as to minimize the mean squared error between the pixel of interest and the output value. In this procedure an input signal is applied and the weight coefficients are corrected so that the output approaches the value desired as the output, called the teacher signal; this is repeated many times. For example, for a binary image with the probability distribution of Table 1, the reference pixel values (x_{i-1}, x_{i-2}) of each state and the corresponding teacher signal x_i are given to the neural network in turn, and learning is repeated until the squared error between the output value and the teacher signal converges. That is, the teacher signals for states S₀, S₁, S₂, and S₃ are 0, 1, 1, and 1, respectively.

[0040]

[Table 6]

[0041] The output y_i in each state of the neural network built from the weight coefficients determined in this way is shown in Table 6. Inputting this to the prediction level classification means 16 shown in Table 3 enables the same degeneration of Markov states as in Embodiment 1.

[0042] Embodiment 3. In Embodiments 1 and 2 the pixel-of-interest value is the prediction target, but the probability that the pixel of interest is a black pixel (black pixel appearance probability) may be predicted instead. For example, in Embodiment 2 applied to the binary image of Table 1, the pixel-of-interest value x_i, which is the prediction target, was used as the teacher signal when training the neural network; the black pixel appearance probability P(1 / s_i) is used in its place. That is, the teacher signals for states S₀, S₁, S₂, and S₃ are 0.2, 0.6, 0.6, and 0.9, respectively.

[0043] Embodiment 4. In Embodiments 1 to 3 only one prediction function and one quantizer for classifying the prediction level are used, but several of each may be provided and switched adaptively within the image according to the reference pixels.

[0044]

[Table 7]

[0045]

[Table 8]

[0046] The reference pixels are the three pixels x_{i-1}, x_{i-2}, and x_{i-3} of FIG. 5. The binary image here has the probability distribution shown in Table 7. When the image is classified into states S₀ to S₇ according to the values the reference pixels can take, the conditional entropy from equation (5) is H′ = 0.77 bit. In Embodiment 4, the prediction level 15 is first calculated by equation (10).

[0047]

[Equation 6]

[0048] For the image of Table 7, the prediction level 15 of each state is the y_i of Table 7. Next, as shown in Table 8, the prediction level 15 is quantized by a different quantizer depending on the value of x_{i-3}: it is classified into two levels with a threshold of 0.4 when x_{i-3} = 0 and with a threshold of 0.5 when x_{i-3} = 1. The difference from Embodiments 1 to 3 is that the prediction function and quantizer are switched according to the value of the reference pixel x_{i-3}. As a result, the binary image of Table 7 is classified into the four states of Table 8, and the conditional entropy is H′ = 0.80 bit.
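The quantizer switching can be sketched as follows. Equation (10) and Tables 7 and 8 are not reproduced in this text, so the prediction level is taken as an input. The state numbering returned by the hypothetical `classify` function and the handling of a level exactly on the threshold are assumptions.

```python
# Quantizer threshold selected by the value of reference pixel x_{i-3}.
THRESHOLDS = {0: 0.4, 1: 0.5}

def classify(prediction_level: float, x3: int) -> int:
    """Return a state index 0..3 built from x_{i-3} and the quantized level.

    The index layout (2 * x3 + level) is an assumed numbering, not Table 8.
    """
    level = 1 if prediction_level >= THRESHOLDS[x3] else 0
    return 2 * x3 + level

for x3 in (0, 1):
    for y in (0.30, 0.45, 0.90):
        print(x3, y, classify(y, x3))
```

A prediction level of 0.45 falls in the upper class when x_{i-3} = 0 and in the lower class when x_{i-3} = 1, which is the effect of using a separate quantizer for each value of x_{i-3}.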

[0049]

[Table 9]

[0050] Table 9 shows the probability distribution and conditional entropy when the image is classified into four states according to the values that x_{i-1} and x_{i-3} can take. Although the number of states is the same four as in Table 8, the entropy is higher than in Table 8, which confirms the effect of Embodiment 4.

[0051]

[Effects of the Invention] The present invention lowers the conditional entropy of image data without greatly increasing the size of the coding parameter reference table, and thereby makes it possible to reduce the code amount.

[Brief description of drawings]

[FIG. 1] A configuration diagram of an arithmetic-code-type MELCODE using the present invention.

[FIG. 2] A configuration diagram of a conventional arithmetic-code-type MELCODE.

[FIG. 3] A diagram showing the arrangement of the pixel of interest and the reference pixels in the conventional example and in Embodiments 1 to 3 of the present invention.

[FIG. 4] A diagram showing the structure of the neural network in Embodiments 2 and 3 of the present invention.

[FIG. 5] A diagram showing the arrangement of the pixel of interest and the reference pixels in Embodiment 4 of the present invention.

[Explanation of symbols]

14 Pixel-of-interest prediction means
15 Prediction level
16 Prediction level classification means
18 Prediction value / prediction match rate determining means
19 Prediction value / prediction match rate reference table
31 Pixel of interest
32 Reference pixel

Claims

[Claims]

1. A binary image encoding apparatus using Markov model coding, comprising: prediction means for predicting a value indicating the property of a pixel of interest by using reference pixels around the pixel of interest; and prediction level classification means for classifying pixels of interest whose prediction levels output from the prediction means are similar together as one Markov state, wherein entropy coding suited to each class is performed according to the prediction level classification result.

2. The binary image encoding apparatus according to claim 1, wherein the prediction target is the pixel-of-interest value.

3. The binary image encoding apparatus according to claim 1, wherein the prediction target is the probability that the pixel of interest is a black pixel (hereinafter, black pixel appearance probability).

4. The binary image encoding apparatus according to claim 1, 2, or 3, wherein a plurality of the prediction means and a plurality of the prediction level classification means are provided, and the prediction means and the prediction level classification means are selected adaptively within the image according to the state of the reference pixels.