JP3425143B2

JP3425143B2 - Data compression method, data decompression method, data compression device, and data decompression device

Info

Publication number: JP3425143B2
Application number: JP2001402031A
Authority: JP
Inventors: 君孝村下; 佳之岡田; 茂吉田
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2001-12-28
Filing date: 2001-12-28
Publication date: 2003-07-07
Anticipated expiration: 2018-07-07
Also published as: JP2002232298A

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、データ圧縮方法及
びデータ復元方法並びにデータ圧縮装置及びデータ復元
装置に関する。近年、文字コード、ベクトル情報，画像
など様々な種類のデータがコンピュータで扱われるよう
になっており、扱われるデータ量も急速に増加してきて
いる。これに伴い、大量のデータを扱うときは、データ
の中の冗長な部分を省いてデータ量を圧縮することで、
記憶容量を減らしたり速く伝送したりすることが行なわ
れている。また、様々なデータを一つの方式でデータ圧
縮できる方法としてユニバーサル符号化が提案されてい
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a data compression method, a data decompression method, a data compression device and a data decompression device. In recent years, various types of data such as character codes, vector information, and images have been handled by computers, and the amount of data handled has been increasing rapidly. Along with this, when dealing with a large amount of data, by omitting redundant parts in the data and compressing the data amount,
BACKGROUND OF THE INVENTION Reduction of storage capacity and fast transmission are being carried out. Universal coding has been proposed as a method of compressing various data by one method.

【０００２】ここで、本発明の分野は文字コードの圧縮
に限らず、様々なデータに適用できるが、以下では情報
理論で用いられている呼称を踏襲し、データの１ワード
単位を文字といい、データが任意ワードつながったもの
を文字列と呼ぶようにする。Here, the field of the present invention is not limited to compression of character codes, but can be applied to various data. In the following, the word used in information theory is followed, and one word unit of data is called a character. , A string in which data is connected to arbitrary words is called a character string.

【０００３】[0003]

【従来の技術】テキストデータやフィアル等を圧縮する
方式には、データ系列の類似性を利用いた辞書型符号化
方式と、データ列の出現頻度を利用した確率統計型符号
化方式(statistical coding) がある。この内、確率統
計型符号化の代表的な手法が、上述のユニバーサル符号
化である。2. Description of the Related Art As a method for compressing text data, a file, etc., a dictionary type coding method that uses the similarity of data series and a stochastic statistical coding method (statistical coding) that uses the appearance frequency of a data string. There is. Among them, a typical method of probability statistical coding is the above-mentioned universal coding.

【０００４】さらに、算術符号化と呼ばれる符号化があ
る。この算術符号化とは、各文字の出現確率に適応した
符号を、符号表をもたずに、計算しながら生成するもの
であり、情報源の文字の出現頻度が分かっている場合に
最大の効率で圧縮できるといわれている方法であり、２
値算術符号化と３値以上の多値算術符号化とがある。Furthermore, there is a coding called arithmetic coding. This arithmetic coding is to generate a code adapted to the appearance probability of each character while calculating without using a code table. When the appearance frequency of the character of the information source is known, the maximum It is said that it can be compressed efficiently.
There are value arithmetic coding and multi-value arithmetic coding of three or more values.

【０００５】以下に、多値算術符号化の方法について述
べる。多値算術符号化では、まず０≦Ｐ＜１（以下、
〔０，１）と記述する）の数直線を、出現した文字の事
象（以下、シンボルという）の数で分割する。ここで、
各区間の幅はシンボルの出現頻度の比に比例するように
取り、出現頻度が高い順に区間を配置する。The method of multivalued arithmetic coding will be described below. In multi-valued arithmetic coding, first, 0 ≦ P <1 (hereinafter,
The number line of [0, 1) is divided by the number of occurrences of character events (hereinafter referred to as symbols). here,
The width of each section is taken to be proportional to the ratio of the appearance frequencies of the symbols, and the sections are arranged in descending order of appearance frequency.

【０００６】そして、出現したシンボルに対応する区間
を選択し、次のシンボルでは選択した区間をさらに全シ
ンボル数分の区間に分割し対応するシンボルの区間を選
択するという具合に、再帰的に選択した区間を細分す
る。上述の処理について、図７０（ａ），図７０（ｂ）
に示す多値算術符号化の原理を説明する図を参照しなが
ら具体的に述べる。Then, a section corresponding to the symbol that appears is selected, and in the next symbol, the selected section is further divided into sections corresponding to the total number of symbols and the section of the corresponding symbol is selected recursively. Subdivided sections. Regarding the above processing, FIG. 70 (a) and FIG. 70 (b)
A detailed description will be given with reference to a diagram for explaining the principle of multivalued arithmetic coding shown in FIG.

【０００７】ここで、図７０（ａ）はシンボルと出現頻
度の一例を示す図、同図７０（ｂ）はシンボルの区間分
割の例を示す図である。そして、文字列“ａｂｅ”の区
間を分割する場合を例にとり、説明を進める。まず、数
直線〔０，１）を、図７０（ａ）で示すような文字ａ，
ｂ，ｃ，ｄ，ｅの５つの区間に分割する。Here, FIG. 70A is a diagram showing an example of symbols and appearance frequencies, and FIG. 70B is a diagram showing an example of section division of symbols. Then, the description will be advanced by taking the case of dividing the section of the character string “abe” as an example. First, the number line [0, 1) is converted into the character a, as shown in FIG.
It is divided into five sections b, c, d, and e.

【０００８】そして、最初に出現したシンボル“ａ”の
区間〔０，０．２）を選択し、この選択した区間〔０，
０．２）を、さらに、全シンボルａ〜ｅの５つの区間に
分割する。次に、第２に出現したシンボル“ｂ”の区間
〔０．０４，０．０６）を選択し、この区間〔０．０
４，０．０６）を、さらに全シンボルａ〜ｅの５つの区
間に分割する。こうして、第３に出現したシンボル
“ｅ”の区間を選択することにより、文字列“ａｂｅ”
の区間〔０．０５，０．０６）が得られる。Then, the section [0, 0.2) of the first appearing symbol "a" is selected, and the selected section [0, 0.2] is selected.
0.2) is further divided into 5 sections of all symbols a to e. Next, the section [0.04, 0.06) of the second appearing symbol “b” is selected, and this section [0.0
4, 0.06) is further divided into five sections of all symbols a to e. Thus, by selecting the section of the symbol "e" that appears third, the character string "abe"
The section [0.05, 0.06) of is obtained.

【０００９】このように、全入力データについて、上述
のような処理を繰り返すことで、符号化する文字列の区
間を決定することができ、最終的に定まった文字列の区
間内の任意の点を２進表示で表したものを、圧縮符号と
して出力するのである。「算術符号化」という名称は、
符号語が〔０．１１０１１・・〕のように、２進数の小
数点以下の数値で表現され、それを計算で求められるこ
とからきている。As described above, by repeating the above-described processing for all input data, the section of the character string to be encoded can be determined, and any point within the finally determined section of the character string can be determined. Is expressed in binary notation and is output as a compression code. The name "arithmetic coding" is
This is because the code word is represented by a numerical value below the decimal point of a binary number such as [0.11011 ...] And can be calculated.

【００１０】また、上述のような出現頻度に応じた区間
の分割方法には、文字列の実際の出現頻度によらず、予
め設定した出現頻度に従って区間を分割する静的符号化
方式(static)、最初に全文字列を走査することにより得
られた出現頻度で区間を分割する準適応型符号化方式(s
emi-adaptive) 、又は文字が出現する毎に頻度を再計算
して１文字毎に区間を再設定する適応型符号化方式(ada
ptive)とがある。In addition, as a method of dividing an interval according to the appearance frequency as described above, a static encoding method (static) that divides an interval according to a preset appearance frequency regardless of the actual appearance frequency of a character string is used. , A semi-adaptive coding method that divides an interval by the appearance frequency obtained by first scanning the entire character string (s
emi-adaptive), or an adaptive coding method (adaptive coding method that recalculates the frequency each time a character appears and resets the interval for each character)
ptive).

【００１１】ところで、上述の多値算術符号化をファイ
ル圧縮に用い、バイト（文字）単位にデータを圧縮する
方法は、例えば、以下の２つの文献，に記載されて
いる。 "Arithmetic Coding for Data Compression," Commu
n. of ACM, Vol.30, No.6 PP.520 −540(1986) "An Adaptive Dependency Source Model for Data Co
mpression Scheme," Commun. of ACM, Vol.32 No.1 PP.
77 −83 ここで、文献は、多値算術符号化の具体的なアルゴリ
ズムを開示している。また、この文献での多値算術符
号化は、１文字単位に符号化・圧縮するエントロピー符
号化と呼ばれる方法の一つであり、注目文字の出現確率
を多値算術符号化するとともに、各文字の出現確率をそ
の文字が表れるごとに逐次更新し、種々のデータに動的
に適応して符号化を行なうものである。また、この多値
算術符号化では、詳細には図７１（ａ）のフローチャー
トに示すような処理が行なわれる。By the way, a method of compressing data in byte (character) units by using the above-described multi-valued arithmetic coding for file compression is described in, for example, the following two documents. "Arithmetic Coding for Data Compression," Commu
n. of ACM, Vol.30, No.6 PP.520 −540 (1986) "An Adaptive Dependency Source Model for Data Co
mpression Scheme, "Commun. of ACM, Vol.32 No.1 PP.
77-83 Here, the literature discloses a specific algorithm of multi-valued arithmetic coding. In addition, the multi-value arithmetic coding in this document is one of the methods called entropy coding that encodes and compresses in units of one character, and multi-value arithmetic coding the appearance probability of the target character The appearance probability is sequentially updated every time the character appears, and is dynamically adapted to various data for encoding. Further, in this multi-valued arithmetic coding, the processing shown in detail in the flowchart of FIG. 71A is performed.

【００１２】一方、文献の方法は、注目文字を直前文
字を用いた条件付確率で表し、その条件付確率を多値算
術符号化することで高圧縮率を得る方法を与え、各条件
付確率を逐次更新し、種々のデータに動的に適応して符
号化を行なうものである。この多値算術符号化において
も、図７１（ｂ）のフローチャートに示すような処理が
行なわれる。On the other hand, the method of the literature gives a method of obtaining a high compression rate by expressing the character of interest with a conditional probability using the immediately preceding character, and multi-value arithmetic coding of the conditional probability to give a method of obtaining each conditional probability. Are sequentially updated and are dynamically adapted to various data for encoding. Also in this multi-valued arithmetic coding, the processing as shown in the flowchart of FIG. 71 (b) is performed.

【００１３】ここで、多値算術符号化の代わりに、ハフ
マン符号化の変形であるダイナミック・ハフマン符号化
（"Variation a Theme by Huffman", IEEE Trans. Info
rm.Theory, Vol.24, No.6 1978, または、"Design and
Analysis of Dynamic Huffman Codes", Journal of AC
M, Vol.34, No.4 1987 参照）を用いる方法も考えられ
るが、このダイナミック・ハフマン符号化は、符号化効
率が多値算術符号化より劣る上、処理に時間がかかるた
め、条件付確率をダイナミック・ハフマン符号化する方
法は実際には使用されていない。Here, instead of multi-valued arithmetic coding, dynamic Huffman coding ("Variation a Theme by Huffman", IEEE Trans. Info) is a modification of Huffman coding.
rm.Theory, Vol.24, No.6 1978, or "Design and
Analysis of Dynamic Huffman Codes ", Journal of AC
M, Vol.34, No.4 1987) can be considered, but this dynamic Huffman coding is less efficient than multi-valued arithmetic coding and takes a long time to process. The method of dynamic Huffman coding of probabilities is not used in practice.

【００１４】なお、図７２は、この多値算術符号化・復
号化のアルゴリズムの一例を示す図である。また、算術
符号化とは別にスプレイ（Splay-Tree）符号化方法と呼
ばれるものがある（例えば、文献"Application of Spla
y Tree to Data Compression"DOUGLAS W.JONES著 Commu
n.of ACM,Vol31 No.8 P996-1007 参照) 。FIG. 72 is a diagram showing an example of this multilevel arithmetic coding / decoding algorithm. In addition to the arithmetic coding, there is a method called a Splay-Tree coding method (see, for example, the document "Application of Spla").
y Tree to Data Compression "by DOUGLAS W. JONES Commu
n.of ACM, Vol31 No.8 P996-1007).

【００１５】このスプレイ符号化方法では、図７３
（ａ）に示すような木構造の符号表( 以下、符号木と称
する) を用い、符号木の終端( 一般的に葉、あるいはリ
ーフと呼ばれる）にシンボルを登録し、符号木の頂点
（一般的に根，あるいはルートと呼ばれる）から入力デ
ータが格納されているリーフまでの距離を符号語として
出力する。In this splay coding method, FIG.
Using a tree-structured code table as shown in (a) (hereinafter referred to as a code tree), symbols are registered at the ends of the code tree (generally called leaves or leaves), and the vertices of the code tree (generally called leaves) are registered. The distance from the root (or root) to the leaf where the input data is stored is output as a code word.

【００１６】具体的に述べると、符号語には、ルートか
らリーフへ下るとき、右へ分岐したときは“１”、左へ
分岐したときは“０”を割り当てるのである。つまり、
図７３（ａ）の例では、シンボルＡの符号は〔１０１１
０〕となり、シンボルＢの符号は〔００１〕となる。そ
して、符号長を変更する（符号更新する）場合は、符号
化したリーフと他のリーフ、あるいは符号木上の接点
（節、あるいはノードと呼ばれる）とを組み替えること
により行なう。More specifically, the code word is assigned "1" when descending from the root to the leaf, when branching to the right, and "0" when branching to the left. That is,
In the example of FIG. 73A, the code of the symbol A is [1011
0], and the code of the symbol B becomes [001]. When the code length is changed (the code is updated), the coded leaf is replaced with another leaf or a contact (called a node or a node) on the code tree.

【００１７】図７３（ｂ）に上述の符号更新の例を示
す。この図７３（ｂ）に示すように、入力されたデータ
の中に、初めシンボルＡ，Ｂ，Ｃ，Ｄの各符号が符号木
のリーフに格納されている。そして、まずシンボルＡと
シンボルＣとのノードを組み替え、さらにシンボルＡの
上位ノードＤとシンボルＥとのノードを組み替えること
により、図７３（ｂ）に示すように、シンボルＡの符号
は、〔１０１１０〕から〔１１０〕となり符号の更新が
行なわれる。FIG. 73B shows an example of the above code update. As shown in FIG. 73 (b), the codes of the symbols A, B, C, and D are initially stored in the leaves of the code tree in the input data. Then, first, by changing the nodes of the symbol A and the symbol C, and further changing the nodes of the upper node D and the symbol E of the symbol A, as shown in FIG. 73 (b), the code of the symbol A is [10110 ] To [110] and the code is updated.

【００１８】ここで、上述の説明は１文字毎の出現確率
を動的に可変長符号化する場合であるが、さらに、圧縮
率を高めるためには、入力信号と直前の文字との依存関
係を取り入れた条件付き出現確率を動的可変長符号化す
ることで行なわれる。この方法は、データの確率統計的
な性質を用いる確率統計型符号化であり、図７４に示す
ように、文脈収集処理５１１と動的可変長符号化処理５
１２との２段階の処理からなる。The above description is for the case where the appearance probability of each character is dynamically variable-length coded. However, in order to further increase the compression rate, the dependency relationship between the input signal and the immediately preceding character is used. It is performed by dynamically variable-length coding the conditional occurrence probability that incorporates. This method is a probabilistic statistic type encoding that uses the probabilistic statistic property of data, and as shown in FIG. 74, a context collection process 511 and a dynamic variable length encoding process 5
It is composed of two steps of 12 and.

【００１９】そして、図７５（ａ）に示すように、文脈
収集により入力データから文字列の前後関係の文脈を収
集し、図７５（ｂ）に示すような文脈の木構造を作成
し、条件付き確率を求めて動的可変長符号化する。ここ
で、上述の条件付き確率は、図７５（ｂ）に示すような
木構造の文脈木上において、各ノードの文字を通る文字
列が出現する毎に出現回数を計数しておくことによって
求められる。As shown in FIG. 75 (a), contextual contexts of character strings are collected from the input data by context collection to create a context tree structure as shown in FIG. 75 (b). Dynamic variable-length coding is performed by obtaining the attached probability. Here, the above-mentioned conditional probability is obtained by counting the number of appearances every time a character string passing through a character of each node appears on a context tree having a tree structure as shown in FIG. 75 (b). To be

【００２０】ところで、条件付き確率を求める文脈収集
の方法には、主に以下の２つの方法がある。なお、以
下、条件（文脈）の文字数を次数と呼ぶことにする（文
献"Data Compression Using Adaptive Coding and Part
ial String Matching"JOHN G.CLEARY 他著IEEE Vol.COM
-32,No.4 APRIL 1984 P396-402参照) 。（１）固定次数の文脈収集方法この方法は、条件付き確率の条件を固定の文字数にする
方法である。By the way, there are mainly the following two methods of context collection for obtaining conditional probabilities. In the following, the number of characters in the condition (context) will be referred to as the degree (reference "Data Compression Using Adaptive Coding and Part").
ial String Matching "JOHN G. CLEARY et al. IEEE Vol.COM
-32, No. 4 APRIL 1984 P396-402). (1) Fixed Order Context Collection Method This method is a method of setting the condition of conditional probability to a fixed number of characters.

【００２１】例えば２次の文脈では、直前２文字につな
がる文字の文脈を収集し、条件付き確率ｐ（ｙ｜ｘ１，
ｘ２）を符号化する。ただし、ｙは注目符号化文字，ｘ
１，ｘ２はそれぞれ直前の第１文字，第２文字である。（２）Blending文脈収集方法上述の固定次数の文脈収集方法では、直前の条件文字列
が出にくい場合、条件付き確率の推定は不正確になり、
逆に直前の条件付き文字列が出やすい場合は条件付き確
率の推定は正確になり、さらに次数を上げ得る可能性を
残す。For example, in the quadratic context, the contexts of the characters connected to the immediately preceding two characters are collected, and the conditional probability p (y | x1,
x2) is encoded. However, y is the coded character of interest, x
1 and x2 are the first character and the second character immediately before, respectively. (2) Blending context collection method In the fixed-order context collection method described above, if the immediately preceding condition character string is difficult to come out, the estimation of the conditional probability becomes inaccurate,
On the contrary, when the immediately preceding conditional character string is likely to appear, the estimation of the conditional probability becomes accurate, and there is a possibility that the order can be further increased.

【００２２】一般に、高次の文脈を使うほど文字間の相
関が大きいデータに対しては高圧縮率が得られるが、逆
に高次文脈を使うほど相関が小さくなるデータでは、か
えって圧縮率が悪くなる。これを解決するのが文脈のBl
ending（次数の混合）である。この方法は、直前の次数
を固定せずに出やすい場合には次数を上げ、出にくい場
合には低い次数のままという具合に文脈の次数を入力デ
ータに適応させて伸ばす方法である。Generally, a higher compression ratio is obtained for data in which the correlation between characters is larger as the higher-order context is used, but conversely, the compression ratio is reduced for data in which the correlation is smaller as the higher-order context is used. become worse. The solution to this is the context Bl
ending (mixed order). This method is a method of adapting the order of the context to the input data such that the order is raised when it is easy to appear without fixing the immediately preceding order, and the order is kept low when it is difficult to come out.

【００２３】[0023]

【発明が解決しようとする課題】しかしながら、算術符
号化を動的可変長符号に用いた確率統計型符号化方式に
は、データが入力されてくる度にそれまで入力された全
てのデータの累積頻度を再計算し、〔０，１）の数直線
を再分割するので、複雑で大量な演算処理を必要であ
り、処理の高速化が行なえないという課題がある。However, in the stochastic statistic coding method using the arithmetic coding for the dynamic variable length code, every time data is input, the accumulation of all the data input up to that time is accumulated. Since the frequency is recalculated and the number line of [0, 1) is redivided, a complicated and large amount of arithmetic processing is required, and there is a problem that the processing cannot be speeded up.

【００２４】本発明は、このような課題に鑑み創案され
たもので、算術符号の区間計算の代わりにスプレイ符号
化を適用し、このスプレイ符号化における符号木に新規
データを登録することで、高速な符号登録処理を可能に
してデータ圧縮／復元処理を高速化できるようにした、
データ圧縮方法及びデータ復元方法並びにデータ圧縮装
置及びデータ復元装置を提供することを目的とする。The present invention was devised in view of the above problems. By applying spray coding instead of interval calculation of arithmetic codes and registering new data in a code tree in this spray coding, Enables high-speed code registration processing and speeds up data compression / decompression processing.
An object of the present invention is to provide a data compression method, a data decompression method, a data compression device, and a data decompression device.

【００２５】[0025]

【課題を解決するための手段】このため、本発明のデー
タ圧縮方法は、入力データを過去に出現した履歴に応じ
て符号化して圧縮するデータ圧縮方法において、次のよ
うな過程をとることを特徴としている（請求項１）。（１）入力データとそれまでに連続したｎ個のデータか
らなる文脈との組み合わせを登録した文脈木を保持する
文脈木保持過程。（２）文脈毎に独立した符号木を保持する符号木保持過
程。（３）の入力データと文脈との組み合わせが文脈木保持
過程に保持されていないとき、文脈木保持過程の文脈木
にデータを新規に登録する文脈木新規登録過程。（４）入力データと文脈との組み合わせが文脈木保持過
程に保持されていないとき、符号木保持過程の符号木の
データ格納点としてのリーフを分岐して得た新規リーフ
にデータを格納する符号木新規登録過程。（５）入力データと文脈との組み合わせが文脈木保持過
程に保持されていないとき文脈を変更する文脈変更過
程。（６）符号木の頂点からの入力データあるいは符号木中
の特定コードが登録してあるリーフまでの分岐に従って
符号を出力する符号出力過程。（７）入力データあるいは符号木中の特定コードが登録
してあるリーフと他のリーフあるいは符号木の頂点以外
の分岐点として定義されるノードとを取り替える符号長
変更過程。（８）符号木新規登録過程では、特定コードを登録して
あるリーフを分岐し、得た２つの新規リーフに特定コー
ドと新規データとを登録する過程。Therefore, in the data compression method of the present invention, the following steps are taken in the data compression method in which the input data is encoded and compressed according to the history that has appeared in the past. It is characterized (Claim 1). (1) A context tree holding process for holding a context tree in which a combination of input data and a context made up of consecutive n data is registered. (2) Code tree holding process for holding an independent code tree for each context. The context tree new registration process of newly registering data in the context tree of the context tree holding process when the combination of the input data and the context of (3) is not held in the context tree holding process. (4) Code for storing data in a new leaf obtained by branching a leaf as a data storage point of a code tree in the code tree holding process when a combination of input data and context is not held in the context tree holding process Tree new registration process. (5) A context changing process for changing the context when the combination of the input data and the context is not held in the context tree holding process. (6) A code output process of outputting a code according to a branch to input data from the top of the code tree or a leaf to which a specific code in the code tree is registered. (7) A code length changing process for replacing a leaf in which a specific code in the input data or the code tree is registered with another leaf or a node defined as a branch point other than the vertex of the code tree. (8) In the code tree new registration process, the leaf in which the specific code is registered is branched, and the specific code and the new data are registered in the obtained two new leaves.

【００２６】また、本発明のデータ圧縮方法は、入力デ
ータを過去に出現した履歴に応じて符号化して圧縮する
データ圧縮方法において、次のような過程をとることを
特徴としている（請求項２）。（１）予め未登録を示すデータとして定義されるエスケ
ープコードを登録した符号木を保持する符号木保持過
程。（２）入力データとそれまでに連続したｎ個のデータか
らなる文脈との組み合わせを登録した文脈木を保持する
文脈木保持過程。（３）入力データと文脈との組み合わせが文脈木保持過
程に保持されていないとき、文脈木保持過程の文脈木に
データを新規に登録する文脈木新規登録過程。（４）入力データと文脈との組み合わせが文脈木保持過
程に保持されていないとき、符号木保持過程の符号木の
データ格納点としてのリーフを分岐して得た新規リーフ
にデータを格納する符号木新規登録過程。（５）入力データと文脈との組み合わせが文脈木保持過
程に保持されていないとき文脈を変更する文脈変更過
程。（６）符号木の頂点からの入力データあるいはエスケー
プコードが登録してあるリーフまでの分岐に従って符号
を出力する符号出力過程。（７）の入力データあるいはエスケープコードが登録し
てあるリーフと他のリーフあるいは符号木の頂点以外の
分岐点として定義されるノードとを取り替える符号長変
更過程。（８）符号木新規登録過程では、エスケープコードを登
録してあるリーフを分岐し、得た２つの新規リーフにエ
スケープコードと新規データとを登録する過程。Further, the data compression method of the present invention is characterized in that the following steps are taken in the data compression method of encoding and compressing the input data according to the history of appearance in the past (claim 2). ). (1) A code tree holding process of holding a code tree in which an escape code defined as data indicating unregistered is registered in advance. (2) A context tree holding process for holding a context tree in which a combination of input data and a context made up of consecutive n data is registered. (3) A context tree new registration process of newly registering data in the context tree of the context tree holding process when the combination of the input data and the context is not held in the context tree holding process. (4) Code for storing data in a new leaf obtained by branching a leaf as a data storage point of a code tree in the code tree holding process when a combination of input data and context is not held in the context tree holding process Tree new registration process. (5) A context changing process for changing the context when the combination of the input data and the context is not held in the context tree holding process. (6) A code output process of outputting a code in accordance with a branch to a leaf in which input data from the top of the code tree or an escape code is registered. (7) A code length changing process for replacing a leaf in which input data or an escape code is registered with another leaf or a node defined as a branch point other than the vertex of the code tree. (8) In the code tree new registration process, the leaf in which the escape code is registered is branched, and the escape code and the new data are registered in the obtained two new leaves.

【００２７】さらに、上述の符号木新規登録過程（４）
は、同じ文脈の下にあるリーフのうち、符号木の頂点と
して定義されるルートからの距離が最も長いリーフを分
岐し、得た２つの新規リーフに、分岐したリーフに格納
していたデータと、新規データとを登録するようにとっ
てもよく（請求項３）、同じ文脈の下にあるリーフのう
ち、最後に登録したリーフを分岐し、得た２つの新規リ
ーフに、分岐したリーフに格納していたデータと、新規
データとを登録するようにとってもよい（請求項４）。Further, the above-mentioned code tree new registration process (4)
Is the leaf that has the longest distance from the root defined as the vertex of the code tree among the leaves under the same context, and the two new leaves obtained are the data stored in the branched leaf. , It is also possible to register new data (Claim 3), of the leaves under the same context, the last registered leaf is branched, and the obtained two new leaves are stored in the branched leaf. The existing data and the new data may be registered (claim 4).

【００２８】一方、本発明のデータ復元方法は、入力デ
ータを過去の入力データの履歴に応じて符号化した符号
を復号するデータ復元方法において、次のような過程を
とることを特徴としている（請求項５）。（１）復号したデータと文脈との組み合わせを登録した
文脈木を保持する文脈木保持過程。（２）文脈に応じておのおの独立した符号木を保持する
符号木保持過程。（３）直前までに復号したデータから符号の符号木を決
定する符号木決定過程。（４）符号に従って符号木の頂点を意味するルートから
データ格納点としてのリーフへと走査して符号を復号す
る復号過程。（５）到達したリーフが符号木中の特定コードであった
場合、文脈を変更する文脈変更過程。（６）復号したデータ及び特定コードのリーフを他のリ
ーフあるいは分岐点としてのノードと組み替える符号長
変更過程。（７）特定コードを復号したとき符号木に復号したデー
タを新規に登録する新規登録過程。（８）新規登録過程で登録したデータを文脈木保持過程
の文脈木に登録する文脈木登録過程。（９）新規登録過程では符号化側で分岐に選択したリー
フと同じリーフを分岐して新規データを登録する過程。On the other hand, the data restoration method of the present invention is characterized by the following steps in the data restoration method of decoding the code obtained by coding the input data according to the history of the past input data ( Claim 5). (1) A context tree holding process for holding a context tree in which a combination of decoded data and a context is registered. (2) Code tree holding process for holding independent code trees depending on the context. (3) Code tree determination process of determining the code tree of the code from the data decoded up to immediately before. (4) A decoding process in which the code is decoded by scanning from the root, which means the vertex of the code tree according to the code, to the leaf as the data storage point. (5) A context changing process of changing the context if the leaf that arrived is a specific code in the code tree. (6) A code length changing process of recombining the leaf of the decoded data and the specific code with another leaf or a node as a branch point. (7) A new registration process of newly registering the decoded data in the code tree when the specific code is decoded. (8) A context tree registration process of registering the data registered in the new registration process in the context tree of the context tree holding process. (9) In the new registration process, the same leaf as the leaf selected for branching on the encoding side is branched and new data is registered.

【００２９】さらに、本発明のデータ復元方法は、入力
データを過去の入力データの履歴に応じて符号化した符
号を復号するデータ復元方法において、次のような過程
をとることを特徴としている（請求項６）。（１）予めデータ未登録を示すデータとして定義される
エスケープコードを登録した符号木を保持する符号木保
持過程。（２）復号したデータと文脈との組み合わせを登録した
文脈木を保持する文脈木保持過程。（３）直前までに復号したデータから符号の符号木を決
定する符号木決定過程。（４）符号に従って符号木の頂点を意味するルートから
データ格納点としてのリーフへと走査して符号を復号す
る復号過程。（５）到達したリーフがエスケープコードであった場
合、文脈を変更する文脈変更過程。（６）復号したデータ及びエスケープコードのリーフを
他のリーフあるいは分岐点としてのノードと組み替える
符号長変更過程。（７）エスケープコードを復号したとき符号木に復号し
たデータを新規に登録する新規登録過程。（８）新規登録過程で登録したデータを文脈木保持過程
の文脈木に登録する文脈木登録過程。（９）新規登録過程では符号化側で分岐に選択したリー
フと同じリーフを分岐して新規データを登録する過程。Further, the data restoration method of the present invention is characterized in that the following steps are taken in the data restoration method of decoding the code obtained by coding the input data according to the history of the past input data ( Claim 6). (1) A code tree holding process of holding a code tree in which an escape code defined as data indicating data unregistered is registered in advance. (2) A context tree holding process for holding a context tree in which a combination of decoded data and a context is registered. (3) Code tree determination process of determining the code tree of the code from the data decoded up to immediately before. (4) A decoding process in which the code is decoded by scanning from the root, which means the vertex of the code tree according to the code, to the leaf as the data storage point. (5) A context change process for changing the context if the leaf that arrived is an escape code. (6) A code length changing process in which the leaf of the decoded data and the escape code is recombined with another leaf or a node as a branch point. (7) A new registration process of newly registering the decoded data in the code tree when the escape code is decoded. (8) A context tree registration process of registering the data registered in the new registration process in the context tree of the context tree holding process. (9) In the new registration process, the same leaf as the leaf selected for branching on the encoding side is branched and new data is registered.

【００３０】また、請求項１に記載の本発明のデータ圧
縮方法を実施するための装置の構成を、図９の原理ブロ
ック図に示す。この図９に示すデータ圧縮装置は、入力
データを過去に出現した履歴に応じて符号化するもので
ある。ここで、３０１は符号木保持手段、３０２は文脈
木保持手段、３０３は文脈登録手段、３０４は符号登録
手段、３０５は文脈変更手段、３０６は符号化手段、３
０７は符号更新手段である。The configuration of an apparatus for carrying out the data compression method of the present invention according to claim 1 is shown in the principle block diagram of FIG. The data compression apparatus shown in FIG. 9 encodes the input data according to the history of past appearances. Here, 301 is a code tree holding means, 302 is a context tree holding means, 303 is a context registration means, 304 is a code registration means, 305 is a context changing means, 306 is an encoding means, 3
Reference numeral 07 is a code updating means.

【００３１】符号木保持手段３０１は、予めデータ未登
録を示すデータとして定義されるエスケープコードを登
録した符号木を保持するものであり、文脈木保持手段３
０２は、入力データと文脈との組み合わせを登録した文
脈木を保持するものであり、文脈登録手段３０３は、エ
スケープコードを符号化したのち、文脈木にデータを新
規に登録するものである。The code tree holding means 301 holds a code tree in which an escape code defined as data indicating data unregistered is registered in advance, and the context tree holding means 3
Reference numeral 02 is for holding a context tree in which a combination of input data and context is registered, and the context registration means 303 is for newly registering data in the context tree after encoding an escape code.

【００３２】さらに、符号登録手段３０４は、エスケー
プコードを符号化したのち符号木のエスケープコードの
データ格納点としてのリーフを分岐してデータを新規に
登録するものであり、文脈変更手段３０５は、入力デー
タと文脈との組み合わせが文脈木に保持されていないと
き、文脈を変更するものである。また、符号化手段３０
６は、符号木の頂点からの入力データあるいはエスケー
プコードが登録してあるリーフまでの分岐に従って符号
を出力するものであり、符号更新手段３０７は、符号化
したデータ及びエスケープコードが登録してあるリーフ
と他のリーフあるいはノードとを取り替えるものである
（以上、請求項７）。Further, the code registering means 304 is for registering new data by branching a leaf as a data storage point of the escape code of the code tree after encoding the escape code, and the context changing means 305. When the combination of the input data and the context is not stored in the context tree, the context is changed. Also, the encoding means 30
Reference numeral 6 is for outputting a code in accordance with input data from the apex of the code tree or branching to a leaf in which an escape code is registered, and the code updating means 307 is registered with encoded data and escape code. The leaf is replaced with another leaf or node (above, claim 7).

【００３３】また、請求項２に記載の本発明のデータ圧
縮方法を実施するための装置の構成を、図１０の原理ブ
ロック図に示す。この図１０に示データ圧縮装置も、入
力データを過去に出現した履歴に応じて符号化するもの
である。この図１０に示すデータ圧縮装置は、前述の図
９におけるものと同様の符号木保持手段３０１，文脈木
保持手段３０２，文脈登録手段３０３，文脈変更手段３
０５，符号化手段３０６，符号更新手段３０７をそなえ
ており、これらの説明は省略する。The configuration of an apparatus for carrying out the data compression method of the present invention according to claim 2 is shown in the principle block diagram of FIG. The data compression apparatus shown in FIG. 10 also encodes the input data according to the history of past appearances. The data compression apparatus shown in FIG. 10 has the same code tree holding means 301, context tree holding means 302, context registration means 303, and context changing means 3 as those in FIG. 9 described above.
05, an encoding unit 306, and a code updating unit 307, and the description thereof will be omitted.

【００３４】また、３１０は分岐位置検索手段であり、
この分岐位置検索手段３１０は、符号木上の最長の符号
長を持つリーフを検索するものである。３１１は符号登
録手段であり、この符号登録手段３１１は、エスケープ
コードを符号化したのち、分岐位置検索手段１１０に検
索されたデータ格納点としてのリーフを分岐してデータ
を新規に登録するものである（以上、請求項８）。Further, 310 is a branch position searching means,
The branch position searching means 310 searches for a leaf having the longest code length on the code tree. Reference numeral 311 is a code registration means. This code registration means 311 is for coding the escape code and then branching the leaf as the data storage point searched by the branch position searching means 110 to newly register the data. Yes (above, claim 8).

【００３５】さらに、請求項３に記載の本発明のデータ
圧縮方法を実施するための装置の構成を、図１１の原理
ブロック図に示す。この図１１に示すデータ圧縮装置
も、入力データを過去に出現した履歴に応じて符号化す
るものである。ここで、この図１１に示すデータ圧縮装
置においても、前述の図９におけるものと同様の符号木
保持手段３０１，文脈木保持手段３０２，文脈登録手段
３０３，文脈変更手段３０５，符号化手段３０６，符号
更新手段３０７をそなえており、これらの説明は省略す
る。Further, the configuration of an apparatus for carrying out the data compression method of the present invention according to claim 3 is shown in the principle block diagram of FIG. The data compression apparatus shown in FIG. 11 also encodes the input data according to the history of past appearances. Here, also in the data compression apparatus shown in FIG. 11, the code tree holding means 301, the context tree holding means 302, the context registration means 303, the context changing means 305, the coding means 306, which are similar to those in FIG. The code updating means 307 is provided, and a description thereof will be omitted.

【００３６】３０８は分岐位置保持手段であり、この分
岐位置保持手段３０８は、符号木に新規に登録されたデ
ータ格納点としてのリーフの位置を保持するものであ
る。さらに、３０９は符号登録手段であり、この符号登
録手段３０９は、エスケープコードを符号化したのち、
分岐位置保持手段３０８に保持されている位置にあるリ
ーフを分岐してデータを新規に登録するものである（以
上、請求項９）。Reference numeral 308 is a branch position holding means, and this branch position holding means 308 holds the position of a leaf as a data storage point newly registered in the code tree. Further, 309 is a code registration means, and this code registration means 309 encodes the escape code,
The leaf at the position held by the branching position holding means 308 is branched to newly register the data (above, claim 9).

【００３７】一方、請求項４に記載の本発明のデータ復
元方法を実施するための装置の構成を、図１２の原理ブ
ロック図に示す。この図１２に示すデータ復元装置は、
入力データを過去の入力データの履歴に応じて符号化し
た符号を復号するものである。ここで、４０１は符号木
保持手段、４０２は文脈木保持手段、４０３は符号木決
定手段、４０４は復号手段、４０５は文脈変更手段、４
０６は符号更新手段、４０７は符号登録手段、４０８は
文脈木登録手段である。On the other hand, the configuration of an apparatus for carrying out the data restoration method of the present invention according to claim 4 is shown in the principle block diagram of FIG. The data restoration device shown in FIG.
It is a code for decoding input data, which is encoded according to the history of past input data. Here, 401 is a code tree holding means, 402 is a context tree holding means, 403 is a code tree determining means, 404 is a decoding means, 405 is a context changing means, 4
Reference numeral 06 is a code updating unit, 407 is a code registration unit, and 408 is a context tree registration unit.

【００３８】符号木保持手段４０１は、予めデータ未登
録を示すデータとして定義されるエスケープコードを登
録した符号木を保持するものであり文脈木保持手段４０
２は、復号したデータと文脈との組み合わせを登録した
文脈木を保持するものであり、符号木決定手段４０３
は、直前までに復号したデータから符号の符号木を決定
するものである。The code tree holding means 401 holds a code tree in which an escape code defined as data indicating data unregistered is registered in advance, and the context tree holding means 40.
Reference numeral 2 holds a context tree in which a combination of the decoded data and the context is registered, and the code tree determining means 403.
Is to determine the code tree of the code from the data decoded up to immediately before.

【００３９】さらに、復号手段４０４は、符号に従って
符号木の頂点を意味するルートからデータ格納点として
のリーフへと走査して符号を復号するものであり、文脈
変更手段４０５は、到達したリーフがエスケープコード
であった場合、文脈を変更するものであり、符号更新手
段４０６は、復号したデータ及びエスケープコードのリ
ーフを他のリーフあるいは分岐点としてのノードと組み
替えるものである。Further, the decoding means 404 decodes the code by scanning from the root, which means the vertices of the code tree, to the leaf as the data storage point according to the code, and the context changing means 405, If it is an escape code, the context is changed, and the code updating means 406 replaces the leaf of the decoded data and escape code with another leaf or a node as a branch point.

【００４０】また、符号登録手段４０７は、エスケープ
コードを復号したとき、エスケープコードのリーフを分
岐して復号したデータを新規に登録するものであり、文
脈木登録手段４０８は、符号登録手段４０７で登録した
データを文脈保持手段４０２の文脈木に登録するもので
ある（以上、請求項１０）。When the escape code is decoded, the code registration means 407 newly branches the decoded data by branching the leaf of the escape code, and the context tree registration means 408 uses the code registration means 407. The registered data is registered in the context tree of the context holding unit 402 (above, claim 10).

【００４１】さらに、請求項５に記載の本発明のデータ
復元方法を実施するための装置の構成を、図１３の原理
ブロック図に示す。この図１３に示すデータ復元装置
も、入力データを過去の入力データの履歴に応じて符号
化した符号を復号するもので、ここで、この図１３に示
すデータ復元装置は、前述の図１２に示すものと同様の
符号木保持手段４０１，文脈木保持手段４０２，符号木
決定手段４０３，復号手段４０４，文脈変更手段４０
５，符号更新手段４０６をそなえており、これらの説明
は省略する。Further, the configuration of an apparatus for carrying out the data restoration method of the present invention according to claim 5 is shown in the principle block diagram of FIG. The data restoration device shown in FIG. 13 also decodes the code obtained by encoding the input data according to the history of past input data. Here, the data restoration device shown in FIG. Code tree holding means 401, context tree holding means 402, code tree determining means 403, decoding means 404, context changing means 40 similar to those shown.
5, the code updating means 406 is provided, and the description thereof will be omitted.

【００４２】また、４１１は分岐位置検索手段であり、
この分岐位置検索手段４１１は、符号木内の最長の符号
長を持つリーフの位置を検索するものである。そして、
４１２は符号登録手段であり、この符号登録手段４１２
は、エスケープコードを符号化したのち分岐位置検索手
段４１１で検索されたリーフを分岐してデータを新規に
登録するものである。Reference numeral 411 is a branch position searching means,
The branch position search means 411 searches the position of the leaf having the longest code length in the code tree. And
Reference numeral 412 is a code registration means, and this code registration means 412
Is to encode the escape code and then branch the leaf searched by the branch position searching means 411 to newly register the data.

【００４３】さらに、４１３は文脈木登録手段であり、
この文脈木登録手段４１３は、符号登録手段４１２で登
録したデータを文脈木保持手段４０２の文脈木に登録す
るものである（以上、請求項１１）。さらに、請求項６
に記載の本発明のデータ復元方法を実施するための装置
の構成を、図１４の原理ブロック図に示す。この図１４
に示すデータ復元装置も、入力データを過去の入力デー
タの履歴に応じて符号化した符号を復号するものであ
る。Further, 413 is a context tree registration means,
The context tree registration means 413 is for registering the data registered by the code registration means 412 in the context tree of the context tree holding means 402 (above, claim 11). Further, claim 6
The principle block diagram of FIG. 14 shows the configuration of an apparatus for carrying out the data restoration method of the present invention described in FIG. This FIG.
The data restoration device shown in (1) also decodes the code obtained by coding the input data according to the history of the past input data.

【００４４】ここで、この図１４に示すデータ復元装置
においても、前述の図１２に示すものと同様の符号木保
持手段４０１，文脈木保持手段４０２，符号木決定手段
４０３，復号手段４０４，文脈変更手段４０５，符号更
新手段４０６をそなえており、これらの説明は省略す
る。また、４０９は分岐位置保持手段であり、この分岐
位置保持手段４０９は、符号木に新規に登録されたリー
フの位置を保持するものである。Here, also in the data restoration device shown in FIG. 14, the code tree holding means 401, the context tree holding means 402, the code tree determining means 403, the decoding means 404, the context similar to those shown in FIG. The changing means 405 and the code updating means 406 are provided, and the description thereof will be omitted. Further, 409 is a branch position holding unit, and this branch position holding unit 409 holds the position of the leaf newly registered in the code tree.

【００４５】さらに、４１０は符号登録手段であり、こ
の符号登録手段４１０は、エスケープコードを符号化し
たのち、分岐位置保持手段４０９に保持されている位置
にあるリーフを分岐してデータを新規に登録するもので
ある。４１４は文脈木登録手段であり、この文脈木登録
手段４１４は、符号登録手段４１０で登録したデータを
文脈保持手段４０２の文脈木に登録するものである（以
上、請求項１２）。そして、本発明のデータ圧縮方法
は、次のような作用がある（請求項１）。（１）文脈木保持過程により、入力データとそれまでに
連続したｎ個のデータからなる文脈との組み合わせを登
録した文脈木を保持することができる。（２）符号木保持過程により、文脈毎に独立した符号木
を保持することができる。（３）文脈木新規登録過程により、入力データと文脈と
の組み合わせが文脈木保持過程に保持されていないと
き、文脈木保持過程の文脈木にデータを新規に登録する
ことができる。（４）符号木新規登録過程により、入力データと文脈と
の組み合わせが文脈木保持過程に保持されていないと
き、符号木保持過程の符号木のデータ格納点としてのリ
ーフを分岐して得た新規リーフにデータを格納すること
ができる。（５）文脈変更過程により、入力データと文脈との組み
合わせが文脈木保持過程に保持されていないとき文脈を
変更することができる。（６）符号出力過程により、符号木の頂点からの入力デ
ータあるいは符号木中の特定コードが登録してあるリー
フまでの分岐に従って符号を出力することができる。（７）符号長変更過程により、入力データあるいは符号
木中の特定コードが登録してあるリーフと他のリーフあ
るいは符号木の頂点以外の分岐点として定義されるノー
ドとを取り替えることができる。（８）符号木新規登録過程では、特定コードを登録して
あるリーフを分岐し、得た２つの新規リーフに特定コー
ドと新規データとを登録することができる。Further, reference numeral 410 is a code registration means. This code registration means 410 encodes the escape code and then branches the leaf at the position held by the branch position holding means 409 to newly write data. It is to register. Reference numeral 414 is a context tree registration means, and this context tree registration means 414 registers the data registered by the code registration means 410 in the context tree of the context holding means 402 (above, claim 12). The data compression method of the present invention has the following effects (claim 1). (1) By the context tree holding process, it is possible to hold a context tree in which a combination of input data and a context consisting of n pieces of continuous data is registered. (2) By the code tree holding process, it is possible to hold an independent code tree for each context. (3) By the context tree new registration process, when the combination of the input data and the context is not held in the context tree holding process, the data can be newly registered in the context tree of the context tree holding process. (4) When the combination of the input data and the context is not held in the context tree holding process by the code tree new registration process, the leaf obtained as a data storage point of the code tree holding process is branched and obtained. Data can be stored in the leaf. (5) The context changing process allows the context to be changed when the combination of the input data and the context is not held in the context tree holding process. (6) By the code output process, the code can be output according to the input data from the vertex of the code tree or the branch to the leaf in which the specific code in the code tree is registered. (7) By the code length changing process, it is possible to replace a leaf in which a specific code in the input data or the code tree is registered with another leaf or a node defined as a branch point other than the vertex of the code tree. (8) In the code tree new registration process, the leaf in which the specific code is registered can be branched, and the specific code and the new data can be registered in the obtained two new leaves.

【００４６】さらに、本発明のデータ圧縮方法は、次の
ような作用がある（請求項２）。（１）符号木保持過程により、予め未登録を示すデータ
として定義されるエスケープコードを登録した符号木を
保持することができる。（２）文脈木保持過程により、入力データとそれまでに
連続したｎ個のデータからなる文脈との組み合わせを登
録した文脈木を保持することができる。（３）文脈木新規登録過程により、入力データと文脈と
の組み合わせが文脈木保持過程に保持されていないと
き、文脈木保持過程の文脈木にデータを新規に登録する
ことができる。（４）符号木新規登録過程により、入力データと文脈と
の組み合わせが文脈木保持過程に保持されていないと
き、符号木保持過程の符号木のデータ格納点としてのリ
ーフを分岐して得た新規リーフにデータを格納すること
ができる。（５）文脈変更過程により、入力データと文脈との組み
合わせが文脈木保持過程に保持されていないとき文脈を
変更することができる。（６）符号出力過程により、符号木の頂点からの入力デ
ータあるいはエスケープコードが登録してあるリーフま
での分岐に従って符号を出力することができる。（７）符号長変更過程により、入力データあるいはエス
ケープコードが登録してあるリーフと他のリーフあるい
は符号木の頂点以外の分岐点として定義されるノードと
を取り替えることができる。（８）符号木新規登録過程では、エスケープコードを登
録してあるリーフを分岐し、得た２つの新規リーフにエ
スケープコードと新規データとを登録することができ
る。Furthermore, the data compression method of the present invention has the following operation (claim 2). (1) By the code tree holding process, it is possible to hold a code tree in which an escape code defined as data indicating unregistered is registered in advance. (2) By the context tree holding process, it is possible to hold the context tree in which the combination of the input data and the context consisting of n pieces of continuous data is registered. (3) By the context tree new registration process, when the combination of the input data and the context is not held in the context tree holding process, the data can be newly registered in the context tree of the context tree holding process. (4) When the combination of the input data and the context is not held in the context tree holding process by the code tree new registration process, the leaf obtained as a data storage point of the code tree holding process is branched and obtained. Data can be stored in the leaf. (5) The context changing process allows the context to be changed when the combination of the input data and the context is not held in the context tree holding process. (6) By the code output process, the code can be output according to the branch to the input data from the vertex of the code tree or the leaf in which the escape code is registered. (7) By the code length changing process, it is possible to replace a leaf in which input data or escape code is registered with another leaf or a node defined as a branch point other than the vertex of the code tree. (8) In the code tree new registration process, the leaf in which the escape code is registered can be branched, and the escape code and the new data can be registered in the obtained two new leaves.

【００４７】また、上述の（４）の符号木新規登録過程
では、同じ文脈の下にあるリーフのうち、符号木の頂点
として定義されるルートからの距離が最も長いリーフを
分岐し、得た２つの新規リーフに、分岐したリーフに格
納していたデータと、新規データとを登録することもで
き（請求項３）、同じ文脈の下にあるリーフのうち、最
後に登録したリーフを分岐し、得た２つの新規リーフ
に、分岐したリーフに格納していたデータと、新規デー
タとを登録することもできる（請求項４）。In the code tree new registration process (4) described above, among leaves under the same context, the leaf having the longest distance from the root defined as the vertex of the code tree is branched and obtained. The data stored in the branched leaf and the new data can be registered in the two new leaves (Claim 3). Of the leaves under the same context, the last registered leaf is branched. The data stored in the branched leaf and the new data can be registered in the obtained two new leaves (claim 4).

【００４８】一方、本発明のデータ復元方法は、次のよ
うな作用がある（請求項５）。（１）文脈木保持過程により、復号したデータと文脈と
の組み合わせを登録した文脈木を保持することができ
る。（２）符号木保持過程により、文脈に応じておのおの独
立した符号木を保持することができる。（３）符号木決定過程により、直前までに復号したデー
タから符号の符号木を決定することができる。（４）復号過程により、符号に従って符号木の頂点を意
味するルートからデータ格納点としてのリーフへと走査
して符号を復号することができる。（５）文脈変更過程により、到達したリーフが符号木中
の特定コードであった場合、文脈を変更することができ
る。（６）符号長変更過程により、復号したデータ及び特定
コードのリーフを他のリーフあるいは分岐点としてのノ
ードと組み替えることができる。（７）新規登録過程により、特定コードを復号したとき
符号木に復号したデータを新規に登録することができ
る。（８）文脈木登録過程により、新規登録過程で登録した
データを文脈木保持過程の文脈木に登録することができ
る。（９）新規登録過程では符号化側で分岐に選択したリー
フと同じリーフを分岐して新規データを登録することが
できる。On the other hand, the data restoration method of the present invention has the following actions (claim 5). (1) By the context tree holding process, it is possible to hold the context tree in which the combination of the decoded data and the context is registered. (2) By the code tree holding process, each independent code tree can be held according to the context. (3) In the code tree determination process, the code tree of the code can be determined from the data decoded up to immediately before. (4) By the decoding process, the code can be decoded by scanning from the root meaning the apex of the code tree to the leaf as the data storage point according to the code. (5) The context can be changed by the context changing process when the leaf that arrived is a specific code in the code tree. (6) By the code length changing process, the decoded data and the leaf of the specific code can be recombined with another leaf or a node as a branch point. (7) By the new registration process, the decoded data can be newly registered in the code tree when the specific code is decoded. (8) By the context tree registration process, the data registered in the new registration process can be registered in the context tree in the context tree holding process. (9) In the new registration process, the same leaf as the leaf selected for branching on the encoding side can be branched to register new data.

【００４９】さらに、本発明のデータ復元方法は、次の
ような作用がある（請求項６）。（１）符号木保持過程により、予めデータ未登録を示す
データとして定義されるエスケープコードを登録した符
号木を保持することができる。（２）文脈木保持過程により、復号したデータと文脈と
の組み合わせを登録した文脈木を保持することができ
る。（３）符号木決定過程により、直前までに復号したデー
タから符号の符号木を決定することができる。（４）復号過程により、符号に従って符号木の頂点を意
味するルートからデータ格納点としてのリーフへと走査
して符号を復号することができる。（５）文脈変更過程により、到達したリーフがエスケー
プコードであった場合、文脈を変更することができる。（６）符号長変更過程により、復号したデータ及びエス
ケープコードのリーフを他のリーフあるいは分岐点とし
てのノードと組み替えることができる。（７）新規登録過程により、エスケープコードを復号し
たとき符号木に復号したデータを新規に登録することが
できる。（８）文脈木登録過程により、新規登録過程で登録した
データを文脈木保持過程の文脈木に登録することができ
る。（９）新規登録過程では符号化側で分岐に選択したリー
フと同じリーフを分岐して新規データを登録することが
できる。Furthermore, the data restoration method of the present invention has the following action (claim 6). (1) By the code tree holding process, it is possible to hold a code tree in which an escape code defined as data indicating data unregistered is registered in advance. (2) By the context tree holding process, it is possible to hold the context tree in which the combination of the decoded data and the context is registered. (3) In the code tree determination process, the code tree of the code can be determined from the data decoded up to immediately before. (4) By the decoding process, the code can be decoded by scanning from the root meaning the apex of the code tree to the leaf as the data storage point according to the code. (5) The context change process allows the context to be changed if the leaf that arrived is an escape code. (6) Through the process of changing the code length, the leaf of the decoded data and the escape code can be recombined with another leaf or a node as a branch point. (7) By the new registration process, the decoded data can be newly registered in the code tree when the escape code is decoded. (8) By the context tree registration process, the data registered in the new registration process can be registered in the context tree in the context tree holding process. (9) In the new registration process, the same leaf as the leaf selected for branching on the encoding side can be branched to register new data.

【００５０】また、図９を用いて説明した構成をもつ、
本発明のデータ圧縮方法を実施するための装置、すなわ
ち入力データを過去に出現した履歴に応じて符号化する
データ圧縮装置においては、符号木保持手段３０１が、
予めデータ未登録を示すデータとして定義されるエスケ
ープコードを登録した符号木を保持し、文脈木保持手段
３０２が、入力データと文脈との組み合わせを登録した
文脈木を保持する。Further, it has the configuration described with reference to FIG.
In the device for implementing the data compression method of the present invention, that is, in the data compression device that encodes the input data according to the history that has appeared in the past, the code tree holding unit 301 is
A code tree in which an escape code defined as data indicating unregistered data is registered is held in advance, and a context tree holding unit 302 holds a context tree in which a combination of input data and a context is registered.

【００５１】そして、文脈登録手段３０３が、エスケー
プコードを符号化したのち、文脈木にデータを新規に登
録し、符号登録手段３０４が、エスケープコードを符号
化したのち符号木のエスケープコードのデータ格納点と
してのリーフを分岐してデータを新規に登録し、文脈変
更手段３０５が、入力データと文脈との組み合わせが文
脈木に保持されていないとき、文脈を変更する。Then, the context registration means 303 encodes the escape code, then newly registers the data in the context tree, and the code registration means 304 encodes the escape code and then stores the escape code data of the code tree. The leaf as a point is branched to newly register the data, and the context changing unit 305 changes the context when the combination of the input data and the context is not held in the context tree.

【００５２】さらに、符号化手段３０６が、符号木の頂
点からの入力データあるいはエスケープコードが登録し
てあるリーフまでの分岐に従って符号を出力し、符号更
新手段３０７が、符号化したデータ及びエスケープコー
ドが登録してあるリーフと他のリーフあるいはノードと
を取り替える（以上、請求項７）。さらに、図１０を用
いて説明した構成をもつ、本発明のデータ圧縮方法を実
施するための装置、すなわち入力データを過去に出現し
た履歴に応じて符号化するデータ圧縮装置においては、
符号木保持手段３０１が、予めデータ未登録を示すデー
タとして定義されるエスケープコードを登録した符号木
を保持し、文脈木保持手段３０２が、入力データと文脈
との組み合わせを登録した文脈木を保持する。Further, the coding means 306 outputs a code according to the branch to the input data from the vertex of the code tree or the leaf in which the escape code is registered, and the code updating means 307 outputs the coded data and the escape code. Replace the leaf registered with the other leaf or node (above, claim 7). Furthermore, in the device for implementing the data compression method of the present invention, which has the configuration described with reference to FIG. 10, that is, in the data compression device that encodes the input data according to the history that has appeared in the past,
The code tree holding unit 301 holds a code tree in which an escape code defined as data indicating unregistered data is registered in advance, and the context tree holding unit 302 holds a context tree in which a combination of input data and context is registered. To do.

【００５３】そして、文脈登録手段３０３が、エスケー
プコードを符号化したのち、文脈木にデータを新規に登
録し、分岐位置検索手段３１０が、符号木上の最長の符
号長を持つリーフを検索し、符号登録手段３１１が、エ
スケープコードを符号化したのち、分岐位置検索手段３
１０に検索されたデータ格納点としてのリーフを分岐し
てデータを新規に登録する。Then, the context registration means 303 encodes the escape code, then newly registers the data in the context tree, and the branch position search means 310 searches the leaf having the longest code length on the code tree. , Code registration means 311 encodes the escape code, and then branch position search means 3
The leaf as the data storage point retrieved in 10 is branched and new data is registered.

【００５４】さらに、文脈変更手段３０５が、入力デー
タと文脈との組み合わせが文脈木に保持されていないと
き文脈を変更し、符号化手段３０６が、符号木の頂点か
ら入力データあるいはエスケープコードが登録してある
リーフまでの分岐に従って符号を出力し、符号更新手段
３０７が、符号化したデータ及びエスケープコードが登
録してあるリーフと他のリーフあるいはノードとを取り
替える（以上、請求項８）。Further, the context changing means 305 changes the context when the combination of the input data and the context is not held in the context tree, and the encoding means 306 registers the input data or the escape code from the vertex of the code tree. The code is output in accordance with the branch to the existing leaf, and the code updating means 307 replaces the leaf in which the encoded data and the escape code are registered with another leaf or node (above, claim 8).

【００５５】また、図１１を用いて説明した構成をも
つ、本発明のデータ圧縮方法を実施するための装置、す
なわち入力データを過去に出現した履歴に応じて符号化
するデータ圧縮装置においては、符号木保持手段３０１
が予めデータ未登録を示すデータとして定義されるエス
ケープコードを登録した符号木を保持し、文脈木保持手
段３０２が入力データと文脈との組み合わせを登録した
文脈木を保持する。Further, in the device for carrying out the data compression method of the present invention, which has the configuration described with reference to FIG. 11, that is, in the data compression device for coding the input data according to the history that has appeared in the past, Code tree holding means 301
Holds a code tree in which an escape code defined as data indicating unregistered data is registered in advance, and the context tree holding unit 302 holds a context tree in which a combination of input data and a context is registered.

【００５６】そして、文脈登録手段３０３がエスケープ
コードを符号化したのち、文脈木にデータを新規に登録
し、分岐位置保持手段３０８が符号木に新規に登録され
たデータ格納点としてのリーフの位置を保持し、符号登
録手段３０９がエスケープコードを符号化したのち、分
岐位置保持手段３０８に保持されている位置にあるリー
フを分岐してデータを新規に登録する。Then, after the context registration means 303 encodes the escape code, the data is newly registered in the context tree, and the branch position holding means 308 is the position of the leaf as the data storage point newly registered in the code tree. And the code registration means 309 encodes the escape code, and then branches the leaf at the position held by the branch position holding means 308 to newly register the data.

【００５７】さらに、文脈変更手段３０５が入力データ
と文脈との組み合わせが文脈木に保持されていないとき
分脈を変更し、符号化手段３０６が符号木の頂点から入
力データあるいはエスケープコードが登録してあるリー
フまでの分岐に従って符号を出力し、符号更新手段３０
７が符号化したデータ及びエスケープコードが登録して
あるリーフと他のリーフあるいは分岐点としてのノード
とを取り替える（以上、請求項９）。Further, the context changing means 305 changes the branch when the combination of the input data and the context is not held in the context tree, and the encoding means 306 registers the input data or the escape code from the vertex of the code tree. The code is output according to the branch to a certain leaf, and the code updating means 30
The leaf in which the data encoded by 7 and the escape code are registered is replaced with another leaf or a node as a branch point (the above is the ninth aspect).

【００５８】一方、図１２を用いて説明した構成をも
つ、本発明のデータ復元方法を実施するための装置、す
なわち入力データを過去の入力データの履歴に応じて符
号化した符号を復号するデータ復元装置においては、符
号木保持手段４０１が予めデータ未登録を示すデータと
して定義されるエスケープコードを登録した符号木を保
持し、文脈保持手段４０２が復号したデータと文脈との
組み合わせを登録した文脈木を保持し、符号木決定手段
４０３が直前までに復号したデータから符号の符号木を
決定する。On the other hand, a device for implementing the data restoration method of the present invention having the structure described with reference to FIG. 12, that is, data for decoding a code obtained by coding input data according to the history of past input data. In the restoration device, the code tree holding unit 401 holds a code tree in which an escape code defined as data indicating unregistered data is held in advance, and the context holding unit 402 registers a combination of the decoded data and the context. The tree is held, and the code tree determining means 403 determines the code tree of the code from the data decoded up to immediately before.

【００５９】そして、復号手段４０４が符号に従って符
号木の頂点を意味するルートからデータ格納点としての
リーフへと走査して符号を復号し、文脈変更手段４０５
が到達したリーフがエスケープコードであった場合、文
脈を変更する。さらに、符号更新手段４０６が、復号し
たデータ及びエスケープコードのリーフを他のリーフあ
るいは分岐点としてのノードと組み替え、符号登録手段
４０７がエスケープコードを復号したとき、エスケープ
コードのリーフを分岐して復号したデータを新規に登録
し、文脈木登録手段４０８が符号登録手段４０７で登録
したデータを文脈保持手段４０２の文脈木に登録する
（請求項１０）。Then, the decoding means 404 decodes the code by scanning from the root meaning the apex of the code tree to the leaf as the data storage point according to the code, and the context changing means 405.
If the leaf reached by is an escape code, change the context. Further, the code updating unit 406 rearranges the decoded data and the escape code leaf with another leaf or a node serving as a branch point, and when the code registration unit 407 decodes the escape code, the escape code leaf is branched and decoded. The newly registered data is newly registered, and the context tree registration unit 408 registers the data registered by the code registration unit 407 in the context tree of the context holding unit 402 (claim 10).

【００６０】さらに、図１３を用いて説明した構成をも
つ、記載の本発明のデータ復元方法を実施するための装
置、すなわち入力データを過去の入力データの履歴に応
じて符号化した符号を復号するデータ復元装置において
は、符号木保持手段４０１が予めデータ未登録を示すデ
ータとして定義されるエスケープコードを登録した符号
木を保持し、文脈保持手段４０２が復号したデータと文
脈との組み合わせを登録した文脈木を保持し、符号木決
定手段４０３が直前までに復号したデータから符号の符
号木を決定する。Further, a device for implementing the described data restoration method of the present invention having the structure described with reference to FIG. 13, that is, a code obtained by coding input data according to the history of past input data is decoded. In the data restoration device, the code tree holding unit 401 holds a code tree in which an escape code defined as data indicating unregistered data is registered in advance, and the context holding unit 402 registers a combination of the decoded data and context. The above-mentioned context tree is held, and the code tree determining means 403 determines the code tree of the code from the data decoded up to immediately before.

【００６１】そして、復号手段４０４が符号に従って符
号木の頂点を意味するルートからデータ格納点としての
リーフへと走査して符号を復号し、文脈変更手段４０５
が、到達したリーフがエスケープコードであった場合、
文脈を変更する。さらに、符号更新手段４０６が復号し
たデータ及びエスケープコードのリーフを他のリーフあ
るいは分岐点としてのノードと組み替え、分岐位置検索
手段４１１が符号木内の最長の符号長を持つリーフの位
置を検索し、符号登録手段４１２が、エスケープコード
を符号化したのち分岐位置検索手段４１１で検索された
リーフを分岐してデータを新規に登録し、文脈木登録手
段４０８が符号登録手段４１２で登録したデータを文脈
保持手段４０２の文脈木に登録する（以上、請求項１
１）。Then, the decoding means 404 decodes the code by scanning from the root meaning the apex of the code tree to the leaf as the data storage point according to the code, and the context changing means 405.
But if the leaf reached is an escape code,
Change context. Further, the leaf of the data and escape code decoded by the code updating means 406 is recombined with another leaf or a node as a branch point, and the branch position searching means 411 searches for the position of the leaf having the longest code length in the code tree, The code registration unit 412 branches the leaf searched by the branch position search unit 411 after encoding the escape code and newly registers the data, and the context tree registration unit 408 context-registers the data registered by the code registration unit 412. It is registered in the context tree of the holding means 402 (above, claim 1
1).

【００６２】また、図１４を用いて説明した構成をも
つ、本発明のデータ復元方法を実施するための装置、す
なわち入力データを過去の入力データの履歴に応じて符
号化した符号を復号するデータ復元装置においては、符
号木保持手段４０１が予めデータ未登録を示すデータと
して定義されるエスケープコードを登録した符号木を保
持し、文脈保持手段４０２が、復号したデータと文脈と
の組み合わせを登録した文脈木を保持し、符号木決定手
段４０３が直前までに復号したデータから符号の符号木
を決定する。An apparatus for implementing the data restoration method of the present invention having the structure described with reference to FIG. 14, that is, data for decoding a code obtained by coding input data according to the history of past input data. In the decompression device, the code tree holding unit 401 holds a code tree in which an escape code defined as data indicating unregistered data is registered in advance, and the context holding unit 402 registers a combination of decoded data and context. The context tree is held, and the code tree determining unit 403 determines the code tree of the code from the data decoded up to immediately before.

【００６３】そして、復号手段４０４が、符号に従って
符号木の頂点を意味するルートからデータ格納点として
のリーフへと走査して符号を復号し、文脈変更手段４０
５が、到達したリーフがエスケープコードであった場
合、文脈を変更し、符号更新手段４０６が復号したデー
タ及びエスケープコードのリーフを他のリーフあるいは
分岐点としてのノードと組み替え、分岐位置保持手段４
０９が符号木に新規に登録されたリーフの位置を保持す
る。Then, the decoding means 404 scans from the root, which means the vertices of the code tree, to the leaf as the data storage point according to the code, decodes the code, and the context changing means 40.
In the case where the leaf that reached 5 is an escape code, the context is changed, the leaf of the data and escape code decoded by the code updating means 406 is recombined with another leaf or a node as a branch point, and the branch position holding means 4
09 holds the position of the leaf newly registered in the code tree.

【００６４】さらに、符号登録手段４１０がエスケープ
コードを符号化したのち、分岐位置保持手段４０９に保
持されている位置にあるリーフを分岐してデータを新規
に登録し、文脈木登録手段４０８が符号登録手段４１０
で登録したデータを文脈保持手段４０２の文脈木に登録
する（請求項１２）。Further, after the code registration means 410 encodes the escape code, the leaf at the position held by the branch position holding means 409 is branched to newly register the data, and the context tree registration means 408 makes the code. Registration means 410
The data registered in (4) is registered in the context tree of the context holding unit 402 (claim 12).

【００６５】[0065]

【発明の実施の形態】（ａ）本発明に関連する技術１の
説明図１５は本発明に関連する技術１としてのデータ圧縮装
置とデータ復元装置の構成例を示すブロック図であり、
この図１５において、１は入力された文字を過去に出現
した履歴に応じて符号化して圧縮するデータ圧縮装置で
あり、２はデータ圧縮装置１で符号化された文字を復元
するデータ復元装置である。DESCRIPTION OF THE PREFERRED EMBODIMENTS (a) Description of Technology 1 Related to the Present Invention FIG. 15 is a block diagram showing a configuration example of a data compression apparatus and a data decompression apparatus as Technology 1 related to the present invention.
In FIG. 15, reference numeral 1 is a data compression device that encodes and compresses an input character according to the history of past appearances, and 2 is a data decompression device that decompresses the character encoded by the data compression device 1. is there.

【００６６】さらに、データ圧縮装置１は、入力された
文字列データの文脈を収集して文脈木を作成する文脈収
集過程１１と、この文脈収集過程１１で得られた文脈
（文脈木）に対応してスプレイ符号を対応させた符号木
を、入力データの文字列に応じてスプレイ符号化しなが
ら作成・更新するスプレイ符号化過程１２をとるように
なっている。一方、データ復元装置２は、このデータ圧
縮装置１で符号化された復元データの文脈に対応してス
プレイ符号を対応させた符号木を、復元データの文字列
に応じてスプレイ符号化しながら作成・更新するスプレ
イ符号化過程２１と、復元データとしての文字列につい
ての文脈を収集（文脈木を作成）する文脈収集過程２２
をとるようになっている。Further, the data compressing apparatus 1 corresponds to the context collecting step 11 for collecting the context of the input character string data to create the context tree, and the context (context tree) obtained in the context collecting step 11. Then, the splay coding process 12 for creating / updating the code tree corresponding to the splay code while performing the splay coding in accordance with the character string of the input data is performed. On the other hand, the data decompression device 2 creates a code tree in which the spray code is associated with the context of the decompressed data encoded by the data compression device 1 while performing the splay coding according to the character string of the decompressed data. A splay coding process 21 for updating and a context collecting process 22 for collecting the context (creating a context tree) about the character string as the restored data.
It is designed to take

【００６７】なお、以下では、データ圧縮装置１を符号
化側、データ復元装置２を復元側として説明する。（１）符号化側の説明図１６（ａ），（ｂ）は、文脈収集過程１１において作
成される文脈木の一例を示す図であり、図１６（ａ）は
文脈木がハッシュ法を用いて短時間で探索できるように
文字列をリスト構造の格納形式でメモリに格納した例を
示す図、図１６（ｂ）は文字列を格納した木構造の辞書
（リスト）を親子関係の繋がりで示した図である。In the following description, the data compression apparatus 1 will be described as the encoding side and the data decompression apparatus 2 will be described as the decompression side. (1) Description of Encoding Side FIGS. 16 (a) and 16 (b) are diagrams showing an example of the context tree created in the context collection process 11. In FIG. 16 (a), the context tree uses the hash method. 16B is a diagram showing an example in which a character string is stored in a memory in a storage structure of a list structure so that it can be searched in a short time. FIG. 16B shows a tree structure dictionary (list) storing the character string in a parent-child relationship. It is the figure shown.

【００６８】ここで、図１６（ａ）中のアドレスは１６
進表示であり、この図１６（ａ）の例では、文脈木の最
大サイズは４Ｋノード（４ＫＷ）である。このように、
全ての文字を予め登録しておけば、ルートに繋がる第１
階層の兄弟ノードの位置は予め分かるので、探索時にリ
ストを操作する必要はなく、直接アクセスすることがで
きる。Here, the address in FIG. 16A is 16
In the example of FIG. 16A, the maximum size of the context tree is 4K nodes (4KW). in this way,
If you register all the characters in advance, the first connection to the route
Since the position of the sibling node of the hierarchy is known in advance, it is not necessary to operate the list at the time of searching, and direct access is possible.

【００６９】一方、第２階層以降は、子ノードと右兄弟
ノードのアドレスを格納しておき、探索時にリスト形式
で格納文字を照合しながら一致するまで、リストを操作
してアクセスする。また、文脈の木は初期化されたと
き、アドレス１００まで設定されるが、このとき第１階
層のアドレス１００には、End Of File （ＥＯＦ）符号
を登録しておき、アドレス１０１以降のメモリを新規登
録に使用する。On the other hand, in the second and subsequent hierarchies, the addresses of the child node and the right sibling node are stored, and the list is manipulated and accessed until the stored characters are matched in the list format while matching. Further, when the context tree is initialized, addresses up to 100 are set. At this time, an End Of File (EOF) code is registered in the address 100 of the first layer, and the memory at addresses 101 and later is stored. Used for new registration.

【００７０】次に、図１７は、上述のスプレイ符号化過
程１２において作成される符号木の一例を示す図であ
る。符号木は、基本的に従来のSplay-Tree符号化と同様
に、初期化時に図１７のように設定される。そして、図
１６（ｂ）に対応して、最大サイズが４Ｋノード（４Ｋ
Ｗ）までの場合、符号の木のノードは、内部節点（子ノ
ードが付いている）と外部節点（リーフ、子ノードがな
い各符号の終端）の２つに分類される。Next, FIG. 17 is a diagram showing an example of a code tree created in the above-mentioned spray coding process 12. The code tree is basically set at the time of initialization as shown in FIG. 17, similarly to the conventional Splay-Tree coding. Then, corresponding to FIG. 16B, the maximum size is 4K nodes (4K
Up to W), the nodes of the code tree are classified into two: internal nodes (with child nodes) and external nodes (leaf, the end of each code without child nodes).

【００７１】また、スプレイ符号化では、符号の木をア
クセスするために、図１８に示すようなＵｐ，Ｌｅｆ
ｔ，Ｒｉｇｈｔという３つの配列を用いる。ここで、Ｕ
ｐ配列は、各ノードから親ノードへのアドレスを格納
し、Ｌｅｆｔ配列は、各ノードから左の子ノードへのア
ドレスを格納し、Ｒｉｇｈｔ配列は、各ノードから右の
子ノードへのアドレスを格納するものである。Further, in the splay coding, in order to access the code tree, Up and Lef as shown in FIG. 18 are used.
Three arrays of t and Right are used. Where U
The p array stores the address from each node to the parent node, the Left array stores the address from each node to the left child node, and the Right array stores the address from each node to the right child node. To do.

【００７２】また、Ｕｐ配列では、内部節点を最初の４
ＫＷ（アドレス（１６進）０００〜ＦＦＦ）に格納し、
外部節点を残りの４ＫＷ（アドレス（１６進）１０００
〜１ＦＦＦ）に格納するようになっている。このように
することで、文脈木の各ノードに対する符号を、符号木
のアドレス＝文脈木のノードポインタ（番号）＋４Ｋ、
で対応付けることができるようになる。Further, in the Up array, the internal node is first 4
Store in KW (address (hexadecimal) 000-FFF),
The remaining 4KW of external node (address (hexadecimal) 1000)
~ 1FFF). By doing so, the code for each node of the context tree can be calculated as follows: address of code tree = context tree node pointer (number) + 4K,
You can associate with.

【００７３】なお、各配列のビット幅は、Ｕｐ配列が１
３ビット、Ｌｅｆｔ，Ｒｉｇｈｔ配列が１２ビットとな
る。次に、上述のような構成をもつ符号木の木の更新の
基本操作について、図１９（ａ），（ｂ）を用いて説明
する。図１９（ａ）はスプレイ符号更新の基本操作を示
す図であるが、この図１９（ａ）に示すように、文字Ａ
がアクセスされたとき、ノードＡと２段上のノードＡが
付いている枝と反対方向の枝のノードＣとを入れ換え
る。The bit width of each array is 1 for Up array.
3 bits, Left, Right array becomes 12 bits. Next, the basic operation for updating the tree of the code tree having the above configuration will be described with reference to FIGS. 19 (a) and 19 (b). FIG. 19A is a diagram showing a basic operation for updating the splay code. As shown in FIG.
When is accessed, the node A and the branch with the node A two steps above are replaced with the node C of the branch in the opposite direction.

【００７４】そして、文字Ａ〜Ｅまでの符号に対して、
文字Ｃがアクセスされた場合には、例えば図１９（ｂ）
に示すように、符号の木を組み換えるようになってい
る。すなわち、上述した基本操作を２回繰り返すことに
よって符号木の木の更新を行なう。この場合、２回目の
基本操作は１回目に更新したノードの親ノードの長さを
更新する。Then, for the codes of the characters A to E,
When the character C is accessed, for example, FIG.
As shown in, the code tree is adapted to be recombined. That is, the code tree is updated by repeating the above-described basic operation twice. In this case, the second basic operation updates the length of the parent node of the node updated first time.

【００７５】これにより、符号の木の深さが深くなって
も、この基本操作を繰り返すことによって、ルートから
アクセスされたノードＣ（符号０１１０）までの長さを
１／２（符号１０）にすることができるので、ルートか
らアクセスされたノードまでの符号木を動的に組み換え
て、符号表を入力データに適応させることができる。す
なわち、スプレイ符号の符号更新は線型リストの Move-
To-Front操作を Binary-Treeで行なったようなものであ
る。As a result, even if the depth of the code tree becomes deep, by repeating this basic operation, the length from the root to the accessed node C (reference number 0110) is reduced to 1/2 (reference number 10). Therefore, the code tree from the root to the accessed node can be dynamically recombined to adapt the code table to the input data. In other words, the code update of the splay code is the Move-
It's like doing a To-Front operation on a Binary-Tree.

【００７６】さらに、上述のような文脈収集過程１１に
おける文脈木の作成およびスプレイ符号化過程１２の符
号木の更新・作成の処理を、図２０のフローチャートに
おける処理ステップＥ１〜Ｅ３１を参照しながら詳述す
る。なお、入力文字をＫ（Ｋは任意の文字）とし、文字
Ｋが入力される直前に入力された文字をＰ（Ｐは任意の
文字）とする。Further, the process of creating the context tree in the context collecting process 11 and updating / creating the code tree in the spray coding process 12 as described above will be described in detail with reference to process steps E1 to E31 in the flowchart of FIG. I will describe. The input character is K (K is an arbitrary character), and the character input immediately before the character K is input is P (P is an arbitrary character).

【００７７】まず、文脈木と符号木および直前文字Ｐを
初期化する（ステップＥ１）。そして、既に入力された
全文字が入力符号化されているかをチェックし（ステッ
プＥ２）、入力文字が残っている場合、文字Ｋの入力
と、文字列の長さＬの０へのセットとを行なう（ステッ
プＥ３）。さらに、文脈木の直前文字Ｐの下に子ノード
があるかをチェックし（ステップＥ５）、直前文字Ｐの
下に子ノードがなければ、入力文字Ｋの０次符号を出力
し（ステップＥ６）、文脈木の直前文字Ｐの下に入力文
字Ｋを子ノードとして登録する（ステップＥ７）。First, the context tree, the code tree and the preceding character P are initialized (step E1). Then, it is checked whether all the characters that have already been input have been input and encoded (step E2). If there are any input characters, the input of the character K and the setting of the length L of the character string to 0 are performed. Perform (step E3). Further, it is checked whether or not there is a child node under the preceding character P in the context tree (step E5), and if there is no child node under the preceding character P, the 0th order code of the input character K is output (step E6). , The input character K is registered as a child node under the character P immediately before the context tree (step E7).

【００７８】一方、符号木の方では、直前文字Ｐの下
に、文字Ｋとエスケープコードを登録するノードを作成
して文字Ｋを登録する（ステップＥ８）。なお、このノ
ード作成のアルゴリズムの一例を図３０（ａ），（ｂ）
に示す。さらに、直前文字を入力文字Ｋに変更し（ステ
ップＥ１６）、直前文字列の長さＬ’が最大文字列長Ｌ
ｍａｘに等しいかをチェックする（ステップＥ２４）。On the other hand, in the code tree, a node for registering the character K and the escape code is created under the immediately preceding character P and the character K is registered (step E8). An example of the algorithm for creating this node is shown in FIGS.
Shown in. Further, the previous character is changed to the input character K (step E16), and the length L'of the previous character string is the maximum character string length L.
It is checked whether it is equal to max (step E24).

【００７９】そして、直前文字列長Ｌ’が最大文字列長
Ｌｍａｘに等しくなければ、直前文字列が１次符号で符
号化され出力されているかをチェックする（ステップＥ
２５）。ここで、直前文字列が１次符号で符号化されて
いなければ、直前文字列長Ｌ’に注目文字列長Ｌを移し
（ステップＥ２８）、文字Ｋが符号化済かをチェックし
（ステップＥ９）、符号化済であれば、上述のステップ
Ｅ２からの処理を繰り返し（ステップＥ９のＹＥＳルー
ト）、符号化済でなければ、上述のステップＥ３からの
処理を繰り返す（ステップＥ９のＮＯルート）。Then, if the immediately preceding character string length L'is not equal to the maximum character string length Lmax, it is checked whether the immediately preceding character string is encoded by the primary code and output (step E).
25). Here, if the immediately preceding character string is not encoded by the primary code, the target character string length L is moved to the immediately preceding character string length L '(step E28), and it is checked whether or not the character K is already encoded (step E9). ), If encoded, the process from step E2 described above is repeated (YES route of step E9), and if not encoded, the process from step E3 described above is repeated (NO route of step E9).

【００８０】ところで、上述のステップＥ１０におい
て、子ノードに登録されている文字が入力文字Ｋと一致
した場合は、文字列の長さＬを１増やし（ステップＥ１
７）、この文字列長Ｌが、予め設定した最大符号長Ｌｍ
ａｘと等しいかをチェックする（ステップＥ１８）。等
しくない場合は、入力データの全文字が符号化されたか
をチェックし（ステップＥ１９）、まだ符号化されてい
ない文字があれば、今までの入力文字を直前文字Ｐに移
し（ステップＥ２０）、さらに１文字Ｋを入力して（ス
テップＥ２１）上述のステップＥ１０の処理へ戻り、再
び子ノードに登録されている文字が文字Ｋと一致するか
をチェックする。If the character registered in the child node matches the input character K in step E10, the length L of the character string is increased by 1 (step E1).
7), this character string length L is the preset maximum code length Lm
It is checked whether it is equal to ax (step E18). If they are not equal, it is checked whether all the characters of the input data have been encoded (step E19), and if there is a character that has not been encoded yet, the input character so far is moved to the immediately preceding character P (step E20), Further, one character K is input (step E21), the process returns to step E10, and it is again checked whether the character registered in the child node matches the character K.

【００８１】一致しなければ、今度は文字列の長さＬが
０かどうかをチェックし（ステップＥ１１）、ＹＥＳ、
すなわち、直前文字Ｐの下に子ノードはあるが、該当す
る文字Ｋがまだ付いていないなら、直前文字Ｐの下のエ
スケープコードを出力した後、文字Ｋの０次符号を出力
する（ステップＥ１２）。さらに、文字Ｋを、文脈木の
直前文字Ｐの下の子ノードの兄弟ノードとして登録し
（ステップＥ１３）、符号木の直前文字Ｐの下のエスケ
ープコードをエスケープコードと文字Ｋの符号とに分割
して、文字Ｋの符号を追加し（ステップＥ１４）、符号
木のエスケープコードと０次符号Ｋの符号長をスプレイ
符号として更新する（ステップＥ１５）。If they do not match, it is checked whether or not the length L of the character string is 0 (step E11), YES,
That is, if there is a child node under the preceding character P but the corresponding character K is not yet attached, the escape code under the preceding character P is output, and then the 0th order code of the character K is output (step E12). ). Further, the character K is registered as a sibling node of the child node under the character P immediately before the context tree (step E13), and the escape code under the character P immediately before the code tree is divided into the escape code and the code of the character K. Then, the code of the character K is added (step E14), and the escape code of the code tree and the code length of the 0th-order code K are updated as the spray code (step E15).

【００８２】以上のようにして、所定の最大文字列長に
達するまで符号化文字列の伸長文字列を登録することが
できるようになっている。その後、上述したステップＥ
１６，ステップＥ２４を経て、再び直前文字列を１次符
号で出力したかをチェックする（ステップＥ２５）。即
ち、直前文字列が１次符号で符号化され、出力されてい
れば、文脈木に直前文字列に符号化文字（列）の先頭文
字を付加した延長文字列を登録し（ステップＥ２６）、
符号木に、符号化した延長文字列の符号を登録し（ステ
ップＥ２７）、上述のステップＥ２８からの処理を繰り
返す。As described above, the decompressed character string of the encoded character string can be registered until the predetermined maximum character string length is reached. Then, step E described above
After step 16 and step E24, it is checked again whether or not the immediately preceding character string is output by the primary code (step E25). That is, if the preceding character string is encoded by the primary code and is output, the extension character string in which the leading character of the encoded character (string) is added to the preceding character string is registered in the context tree (step E26).
The code of the encoded extended character string is registered in the code tree (step E27), and the processing from step E28 described above is repeated.

【００８３】なお、この登録は、直前文字の符号を分岐
させ、文字Ｋを付加した文字列の１次符号を追加するよ
うに行なう。文字列の分岐は、エスケープコードを符号
化した文字列の符号とみて分岐させ、元の文字列と文字
Ｋを付加した文字列の符号とを作る。このようにして、
辞書登録文字列として、符号化済の直前文字から登録
し、この直前文字から続く文字列を符号化している。Note that this registration is performed by branching the code of the immediately preceding character and adding the primary code of the character string to which the character K is added. The branch of the character string is regarded as the code of the character string obtained by encoding the escape code, and is branched to create the original character string and the code of the character string to which the character K is added. In this way
As the dictionary registration character string, the character just before the coded character is registered, and the character string following the character just before is coded.

【００８４】さらに、上述のステップＥ１８において、
文字列長Ｌが、予め設定した最大符号長Ｌｍａｘと等し
い場合、または、ステップＥ１１において、文字列長Ｌ
が０に等しくない場合は、文脈木の文字（列）の参照番
号に対応する１次符号を出力し（ステップＥ２２）、符
号木が出力した１次符号の符号長をスプレイ符号として
更新（ステップＥ２３）した後、上述のステップＥ２４
からの処理を行なう。Further, in the above step E18,
When the character string length L is equal to the preset maximum code length Lmax, or in step E11, the character string length L
Is not equal to 0, the primary code corresponding to the reference number of the character (string) of the context tree is output (step E22), and the code length of the primary code output by the code tree is updated as the spray code (step E22). E23) and then the above step E24
Process from.

【００８５】ここで、以上の閉ループ処理は、直前文字
Ｐと入力文字Ｋとの組み合わせが、既に文脈木に登録さ
れている時に、登録文字数（文字列長）を伸長して、文
字列単位に登録を行なう処理を示している。また、上述
のステップＥ２において、入力された全ての文字が符号
化されている場合は、文脈木の直前文字Ｐの下に子ノー
ドがあるかチェックし（ステップＥ２９）、子ノードが
あれば直前文字Ｐの下のエスケープコードを出力し（ス
テップＥ２９のＮＯルートからステップＥ３０）、Ｅｎ
ｄＯｆＦｉｌｅを表すＥＯＦの０次符号を出力して
（ステップＥ３１）処理を終了する（ステップＥ２９の
ＮＯルートからステップＥ３１）。Here, in the closed loop processing described above, when the combination of the immediately preceding character P and the input character K is already registered in the context tree, the number of registered characters (character string length) is expanded and the character string is incremented. The process for performing registration is shown. If all the input characters have been encoded in the above step E2, it is checked whether there is a child node below the preceding character P of the context tree (step E29). Output the escape code under the letter P (from NO route of step E29 to step E30), and
The 0th-order code of EOF representing d Of File is output (step E31), and the process ends (from NO route of step E29 to step E31).

【００８６】子ノードがなければ、そのままＥＯＦの０
次符号を出力して処理を終了する。以上のような処理を
行なうことで、入力データとしての文字列を過去に出現
した履歴に応じて符号化して圧縮するデータ圧縮方法に
おいて、辞書に入力データの文字列を収集し番号を付け
て登録するとともに、各文字列に対応してスプレイ符号
の符号化及び更新を施している。If there is no child node, 0 of EOF
The next code is output and the process ends. By performing the above processing, in the data compression method that encodes and compresses the character string as the input data according to the history that has appeared in the past, the character string of the input data is collected in the dictionary and registered with a number. In addition, the splay code is encoded and updated corresponding to each character string.

【００８７】ここで、上述の入力文字Ｋをアルファベッ
トａ，ｂ，ｃのいずれかに限り、文字「ａｂｃａｂｃ
ａｂｂ」が入力された場合を例にとり、文脈木と符
号木の更新・作成について、図２１〜図２８を用いてさ
らに詳述する。まず、図２１（ａ），（ｂ）に示すよう
に、文脈木と符号木に、予め文字ａ，ｂ，ｃと入力デー
タを全て符号化した後に出力する終端符号ＥＯＦとを、
番号を付することによりａ₁，ｂ₂，ｃ₃，ＥＯＦ₄として
登録しておく。Here, if the input character K is limited to one of the alphabets a, b, and c, the character "abc abc
Taking the case where "ab b" is input as an example, the update / creation of the context tree and the code tree will be described in more detail with reference to FIGS. First, as shown in FIGS. 21 (a) and 21 (b), the characters a, b, and c and the terminal code EOF that is output after all input data are encoded in the context tree and the code tree are
It is registered as a ₁ , b ₂ , c ₃ and EOF ₄ by adding numbers.

【００８８】このように文脈木を初期化することで、最
初に登録してある文字ａ，ｂ，ｃのいずれかが直前文字
となり、直前文字から続く文脈がないとき独立の単独の
参照番号も兼ねることになる。以下の説明では、文字ｃ
を最初の直前文字と仮定しておく。一方、図２１（ｂ）
に示すように、符号木は、ルート（ｒｏｏｔ）からノー
ドを左下に下がるときには、登録した文字に符号“０”
を割り当て、ノードを右下に下がるときには、登録した
文字に符号“１”を割り当てる２進木であり、これによ
り符号化時は、対応する文字の参照番号のノードからル
ートまでの辿る経路をスタックして、その経路を逆転さ
せ、左下か右下かによって符号“０”，“１”を割り振
ることによって、その文字の参照番号に対応する符号語
が得られるようになっている。By initializing the context tree in this way, any of the first registered characters a, b, and c becomes the previous character, and when there is no context following the previous character, an independent single reference number is also obtained. I will also serve. In the following description, the letter c
Let be the first previous character. On the other hand, FIG. 21 (b)
As shown in, when the node goes down from the root to the lower left, the code tree has a code "0" for the registered character.
Is a binary tree that assigns the code “1” to the registered character when allocating a node and moving the node down to the lower right, so that when encoding, the path traced from the node with the reference number of the corresponding character to the root is stacked. Then, by reversing the path and assigning the codes "0" and "1" depending on whether it is lower left or lower right, the code word corresponding to the reference number of the character can be obtained.

【００８９】すなわち、「ａ₁，ｂ₂，ｃ₃」の３文字の
０次符号は、それぞれ「０００１１０１１」とな
る。そして、上述の図２１（ａ），（ｂ）に示す状態か
ら、まず文字列「ａｂｃ」が入力されると、予め登録し
ておいた「ｃ」（ｃ₃）を直前文字と仮定するので、図
２２（ａ）に示すように、文脈木の「ｃ」（ｃ₃）の下
位に新たにノードを作成し、文字列「ａｂｃ」の内、最
初の１文字「ａ₅」と未登録を表すエスケープコード
（ＥＳＣ₆）とを登録する。That is, the three-character zero-order code of "a ₁ , b ₂ , c ₃ " is "00 0110 11". Then, the above-described FIG. 21 (a), the from the state shown in (b), when the first character string "abc" is entered, so assuming a previously registered "c" (c ₃₎ the immediately preceding character As shown in FIG. 22A, a new node is created under the context tree “c” (c ₃ ) and the first character “a ₅ ” in the character string “abc” is not registered. And an escape code (ESC ₆ ) that represents

【００９０】一方、符号木では、図２２（ｂ）に示すよ
うに、「ａ」が、既に登録されているので、「ａ」が登
録されているノードと直前文字である「ｃ」の上位ノー
ドとを組み替え、この「ｃ」を新たにルートとおいて
（１次符号化）、「ａ」とエスケープコード（ＥＳＣ）
を登録する。さらに、文字列「ａｂｃ」の内、次のｂ，
ｃについても上述の処理を行なうことにより、ｂが入力
されたときの文脈木と符号木は、それぞれ図２３
（ａ），（ｂ）に示すようになり、ｃが入力されたとき
の文脈木と符号木は図２４（ａ），（ｂ）に示すように
なる。On the other hand, in the code tree, as shown in FIG. 22 (b), since "a" has already been registered, the node in which "a" is registered and the upper character of "c" which is the immediately preceding character. Replace the node and set this "c" as a new route (primary coding), and then "a" and escape code (ESC).
To register. Furthermore, in the character string "abc", the next b,
By performing the above-described processing for c as well, the context tree and the code tree when b is input are shown in FIG.
As shown in FIGS. 24A and 24B, the context tree and the code tree when c is input are as shown in FIGS.

【００９１】この処理は、図２０にて上述した処理ステ
ップにおいて、ステップＥ５のＮＯルート，ステップＥ
２４のＹＥＳルート，ステップＥ２５のＮＯルートを経
由する閉ループ処理に相当するものである。すなわち、
文脈木では、直前文字の下位に子ノードが存在しない場
合に、この直前文字の下位に入力文字とエスケープコー
ドを登録するノードを新たに作成して登録を行なう。This process corresponds to the NO route of step E5 in the process step described above with reference to FIG.
This corresponds to the closed loop processing that goes through the YES route of 24 and the NO route of step E25. That is,
In the context tree, when a child node does not exist under the previous character, a node for registering the input character and the escape code is newly created under the previous character and registered.

【００９２】一方、符号木では、過去に登録されている
文字と同じ文字が再び入力された場合は、過去に登録さ
れている文字のノードを、入力された文字の直前文字が
登録されているノードの上位ノードと組み替えて、過去
に登録されている文字のノードを上位に移動し、ルート
からの距離を１／２にして符号長を短くするのである。On the other hand, in the code tree, when the same character as the character registered in the past is input again, the node of the character registered in the past is registered as the character immediately before the input character. By replacing the node with the upper node of the node and moving the node of the character registered in the past to the upper node, the distance from the root is halved to shorten the code length.

【００９３】さらに、続いて文字列「ａｂｃ」が入力さ
れると、最初に入力された文字列「ａｂｃ」の最後の
「ｃ」を直前文字として文脈木の「ｃ₉」の下位に、文
字列「ａｂｃ」の内の１文字「ａ」のみを登録しようと
するが、図２４（ａ）に示すように、文脈木には、既に
「ｃ₃」の下位に「ａ₅」が登録されているので、図２５
（ａ）に示すように、登録する文字を１文字「ａ」から
２文字「ａｂ」へ１文字伸長して「ａ₅」の下位に「ｂ
₁₁」を登録する。[0093] In addition, followed and by the character string "abc" is input, the lower of the "c _9" in the context tree as just before character last of the "c" of the first input string "abc", character tries to register only one character "a" of the string "abc", as shown in FIG. 24 (a), the context tree, "a _5" is registered already lower "c _3" As shown in FIG.
(A), the "b and 1 letter extended character to be registered from one character" a "2 to the character" ab "to the lower" a ₅ "
₁₁ ”is registered.

【００９４】この時、符号木では、図２５（ｂ）に示す
ように、「ｃ」の下位に「ａ」とともに登録されている
エスケープコード（ＥＳＣ）のノードを分岐して新たに
ノードを作成し、文脈木で１文字伸長して登録した「ａ
ｂ」の登録を行なう。続いて、文字列「ａｂｃ」の内、
「ｂ」を文脈木に登録しようとした場合も、図２５
（ａ）に示すように、既に「ａ₁」の下位に「ｂ₇」が登
録されているので、図２６（ａ）に示すように、登録す
る文字を１文字「ｂ」から２文字「ｂｃ」へ１文字伸長
して「ｂ₇」の下位に「ｃ₁₂」を登録する。At this time, in the code tree, as shown in FIG. 25B, a node of the escape code (ESC) registered with "a" under "c" is branched to create a new node. Then, decompress one character in the context tree and register "a
b ”is registered. Then, in the character string "abc",
Even when "b" is registered in the context tree, FIG.
As shown in FIG. 26A, since “b ₇ ” is already registered under “a ₁ ”, as shown in FIG. 26A, the characters to be registered are from 1 character “b” to 2 characters “ and one character extension to bc "registers" c ₁₂ "in the lower of the" b ₇ ".

【００９５】この時、符号木では、図２６（ｂ）に示す
ように、「ａ」の下位に「ｂ」とともに登録されていた
エスケープコード（ＥＳＣ）のノードを分岐して新たに
ノードを作成し、文脈木で１文字伸長して登録した「ｂ
ｃ」の登録を行なう。そして、次の文字列「ａｂｃ」の
内、最後の「ｃ」を登録する場合も、上述の処理を行な
うと、文脈木及び符号木は、それぞれ図２７（ａ），
（ｂ）に示す状態となる（ここまでで、「ａｂｃａｂ
ｃ」が入力済となる）。At this time, in the code tree, as shown in FIG. 26B, a node of the escape code (ESC) registered with "b" under "a" is branched to create a new node. Then, "b" registered by decompressing one character in the context tree
"c" is registered. Then, even when the last "c" of the next character string "abc" is registered, when the above process is performed, the context tree and the code tree are respectively shown in FIG.
The state shown in (b) is reached (up to this point, "abc ab
c "is already entered).

【００９６】そして、さらに文字列「ａｂ」が入力され
ると文脈木では、まず、直前文字の「ｃ」の下位に１文
字「ａ」を登録しようとするが、上述したように、「ｃ
₃」の下位に既に「ａ₅」が登録されているので、登録文
字数を１文字伸長して「ａｂ」として、「ａ₅」の下位
に「ｂ」を登録しようとする。しかし、図２７（ａ）に
示すように、「ａ₅」の下位にも既に「ｂ₁₁」が登録さ
れているので、図２８（ａ）に示すように、さらに、登
録文字数を１文字伸長して「ａｂｂ」として、「ｂ₁₁」
の下位に「ｂ」を登録する。When the character string "ab" is further input, the context tree first attempts to register one character "a" below the immediately preceding character "c".
Since the lower of ₃ "already" a ₅ "is registered, as" ab "by one character extension of the registration number of characters, it tries to register the" b "to the lower of" a ₅ ". However, as shown in FIG. 27A, since “b ₁₁ ” is already registered under “a ₅ ”, as shown in FIG. 28A, the number of registered characters is further expanded by one character. and as "abb", "b _11"
"B" is registered in the lower order of.

【００９７】この時、符号木では、図２８（ｂ）に示す
ように、「ａｂ」が登録されているノードを分岐して
「ａｂｂ」を登録する。この処理は、図２０にて上述し
た処理ステップのステップＥ５のＹＥＳルート，ステッ
プＥ１０のＹＥＳルート，ステップＥ２５のＹＥＳルー
トを経由する閉ループ処理に相当する。At this time, in the code tree, as shown in FIG. 28B, the node in which "ab" is registered is branched to register "abb". This process corresponds to the closed loop process that goes through the YES route of step E5, the YES route of step E10, and the YES route of step E25 of the process steps described above with reference to FIG.

【００９８】すなわち、文字列が入力されたとき、直前
文字と入力された文字列中の１文字との組み合わせが、
既に文脈木に登録されている場合、登録する文字数を１
文字伸長して登録されていない１文字のみ登録する。そ
して、この時、符号木では、直前文字とともに登録され
ているエスケープコード（ＥＳＣ）のノードを分岐させ
て新たにノードを作成し、この文字列を登録する。That is, when a character string is input, the combination of the immediately preceding character and one character in the input character string is
If already registered in the context tree, the number of characters to register is 1
Register only one character that has not been registered after decompressing. At this time, in the code tree, the node of the escape code (ESC) registered together with the immediately preceding character is branched to create a new node, and this character string is registered.

【００９９】以上のように、文字「ａｂｃａｂｃａ
ｂｂ」の入力が終了すると、各文字に割り当てられる
符号は図２９に示すようになる。この図２９に示すよう
に、最初に入力された文字「ａｂｃ」は、それぞれ１文
字単独で符号化され、対応する符号は、それぞれ００，
０１，１０の２ビットとなる。As described above, the characters "abc abc a
When the input of "bb" is completed, the codes assigned to the respective characters are as shown in FIG. As shown in FIG. 29, the first input character “abc” is encoded by one character each, and the corresponding codes are 00,
There are 2 bits of 01 and 10.

【０１００】そして、次に入力された文字列「ａｂｃ」
に対応する符号語は、直前文字との関係からそれぞれ
０，０，０の１ビットとなる。さらに、次に入力された
「ａｂ」２文字の文字列に符号が割り当てられるが、こ
の図２９に示すように、２文字の文字列が２ビットのみ
の符号語で表されている。Then, the next input character string "abc"
The code word corresponding to is 1 bit of 0, 0, 0 respectively due to the relationship with the immediately preceding character. Further, a code is assigned to the next input character string of "ab" of 2 characters, but as shown in FIG. 29, the character string of 2 characters is represented by a code word of only 2 bits.

【０１０１】そして最後に入力された文字「ｂ」は、今
までの直前文字の繋がりに該当する文字がない場合で、
ＥＳＣと１文字単独の符号語の組み合わせの３ビットで
表されている。以上のように、関連技術１のデータ圧縮
方法によれば、圧縮する文字（列）を、木構造の文脈木
に番号を付けて登録し、この文脈木に対応した符号木を
スプレイ符号化を施しながら作成・更新することによ
り、出現する文字の出現頻度を求めて確率モデルを構築
し、各文字に符号を割り当てるという２段階の処理を同
時に行なうので、データの圧縮処理の速度が大幅に向上
するという効果がある。The character "b" input last is the case where there is no character corresponding to the connection of the immediately preceding characters,
It is represented by 3 bits of a combination of ESC and a code word of one character alone. As described above, according to the data compression method of the related technique 1, the characters (strings) to be compressed are registered by assigning numbers to the context tree of the tree structure, and the code tree corresponding to this context tree is spray-encoded. By creating and updating while performing, the probability model is constructed by finding the appearance frequency of the appearing characters, and the two-step process of assigning a code to each character is performed at the same time, so the speed of data compression processing is greatly improved. There is an effect of doing.

【０１０２】また、上述の確率モデルは、文字の入力毎
に符号木のノードが作成・更新（スプレイ処理）される
ことによって構築されるので、文字の入力毎に既に構築
されている確率モデルを再構築するという膨大な演算処
理を行なう必要が無く、これにより圧縮処理の速度がさ
らに向上する効果がある。さらに、関連技術１のデータ
圧縮方法によれば、過去に圧縮（符号化）した文字と同
じ文字が出現する毎に、過去に登録してあった同じ文字
の符号木のノードを上位のノードと組み替えて（スプレ
イ処理）符号長を１／２にすることにより、同じ文字
（列）が繰り返し出現するほど、その文字（列）の符号
は少ないビット数で表せるので、圧縮効果が大幅に向上
する効果がある。Since the above-mentioned probability model is constructed by creating / updating (spraying) the node of the code tree for each character input, the probability model already constructed for each character input is used. There is no need to perform a huge amount of arithmetic processing such as reconstruction, which has the effect of further improving the speed of compression processing. Further, according to the data compression method of the related technique 1, every time the same character as the previously compressed (encoded) character appears, the node of the code tree of the same character registered in the past is set as the upper node. By rearranging (spray processing) to reduce the code length to 1/2, as the same character (string) appears repeatedly, the code of that character (string) can be represented with a smaller number of bits, so the compression effect is significantly improved. effective.

【０１０３】また、関連技術１のデータ圧縮方法によれ
ば、上述の文字列「ａｂ」を符号化した場合のように、
文字を１文字単位に符号化するのではなく、複数文字単
位の文字列として符号化することにより、可変長符号化
処理が高速化できるとともに、符号化単位を文字列とす
るので、情報源が拡大し、スプレイ符号化の符号化効率
が大幅に向上するという効果もある。Further, according to the data compression method of the related technique 1, as in the case where the above-mentioned character string "ab" is encoded,
By encoding a character as a character string of a plurality of characters instead of encoding it as a character unit, the variable-length encoding process can be speeded up, and the encoding unit is a character string. There is also an effect that the coding efficiency is increased and the coding efficiency of the spray coding is significantly improved.

【０１０４】（２）復元側の説明次に、上述のように、符号化（圧縮）されたデータを入
力符号として、図１７にて上述したデータ復元装置２内
の文脈収集過程２１とスプレイ符号化過程２２が、デー
タを復元する処理について、図３１のフローチャートに
おける処理ステップＤ１〜Ｄ２４を参照しながら説明す
る。(2) Description of decompression side Next, as described above, using the coded (compressed) data as an input code, the context collection process 21 and the spray code in the data decompression apparatus 2 described in FIG. 17 are described. The conversion process 22 for restoring data will be described with reference to process steps D1 to D24 in the flowchart of FIG.

【０１０５】なお、このデータを復元する処理は、基本
的に符号化側の説明にて上述した符号化の処理と逆の処
理を行なうようにすればよい。すなわち、まず、文脈木
と符号木とを初期化し、直前文字Ｐを０に初期化する
（ステップＤ１）。列長Ｌを０とし（ステップＤ２）、
文脈木の直前文字の下位に子ノードがあるかをチェック
する（ステップＤ３）。The process of restoring this data may be basically the reverse of the encoding process described in the description of the encoding side. That is, first, the context tree and the code tree are initialized, and the preceding character P is initialized to 0 (step D1). The column length L is set to 0 (step D2),
It is checked whether or not there is a child node below the character just before the context tree (step D3).

【０１０６】ここで、子ノードがない場合においては、
入力符号を０次符号として文字Ｋを復号し（ステップＤ
３のＮＯルートからステップＤ４）、この復号した文字
ＫがＥＯＦ符号であるかをチェックする（ステップＤ
５）。もし、この復号した文字Ｋが、ＥＯＦ符号でなけ
れば、ＮＯルートをとり、復号した文字Ｋを出力し（ス
テップＤ６）、文脈木に文字Ｋを登録する（ステップＤ
７）。なお、これは図１７にて上述した符号化時の処理
ステップＥ７と同様にして行なう。Here, when there is no child node,
The character K is decoded using the input code as the 0th order code (step D
From the NO route of No. 3, step D4), it is checked whether the decoded character K is an EOF code (step D).
5). If the decoded character K is not an EOF code, the NO route is taken, the decoded character K is output (step D6), and the character K is registered in the context tree (step D).
7). This is performed in the same manner as the processing step E7 at the time of encoding described above with reference to FIG.

【０１０７】さらに、符号化時のステップＥ８と同様に
して、符号木の直前文字Ｐの下に文字Ｋとエスケープコ
ードのノードを作り（ステップＤ８）、直前文字Ｐを文
字Ｋとおく（ステップＤ１７）。そして、以降の処理ス
テップＤ２０〜Ｄ２３は、符号化時の処理ステップＥ２
０〜Ｅ２３と同様の処理ステップをとり、直前文字列長
Ｌ’を注目文字列長Ｌで置き換えて（ステップＤ２
４）、上述の処理ステップＤ２に戻る。Further, similarly to step E8 at the time of encoding, a node of the character K and the escape code is created under the immediately preceding character P of the code tree (step D8), and the immediately preceding character P is set to the character K (step D17). ). Then, the subsequent processing steps D20 to D23 are processing steps E2 at the time of encoding.
The same processing steps as 0 to E23 are taken, and the immediately preceding character string length L ′ is replaced with the target character string length L (step D2
4) and returns to the above-mentioned processing step D2.

【０１０８】ところで、上述の処理ステップＤ３におい
て、文脈木の直前文字Ｐの下位に子ノードが存在する場
合は、入力された符号を符号木より１次符号とみなして
復号して、文脈木の文字（列）の参照番号を得る（ステ
ップＤ３のＹＥＳルートからステップＤ９）。さらに、
復号した参照番号がエスケープコードであるかをチェッ
クし（ステップＤ１０）、エスケープコードであれば、
ＹＥＳルートをとり、入力文字を０次符号として次の符
号を復号して、文字Ｋを得る（ステップＤ１１）。By the way, in the above-mentioned processing step D3, when a child node exists immediately below the character P immediately before the context tree, the input code is regarded as a primary code from the code tree, is decoded, and is decoded. The reference number of the character (string) is obtained (from the YES route of step D3 to step D9). further,
It is checked whether the decoded reference number is an escape code (step D10). If it is an escape code,
The YES route is taken, the next character is decoded using the input character as the 0th order code, and the character K is obtained (step D11).

【０１０９】そして、上述のステップＤ５と同様に、復
号した文字ＫがＥＯＦ符号であるかをチェックし（ステ
ップＤ１２）、復号した文字ＫがＥＯＦ符号でなけれ
ば、ＮＯルートをとり、文字Ｋを出力する（ステップＤ
１３）。さらに、符号化時の処理ステップＥ１３〜１５
と同様にして、文脈木に文字Ｋを登録し（ステップＤ１
４）、直前文字Ｐの下に文字Ｋを追加し（ステップＤ１
５）、符号木のエスケープコードと０次符号Ｋの符号長
をスプレイ符号として更新する（ステップＤ１６）。Then, as in step D5 described above, it is checked whether the decoded character K is the EOF code (step D12). If the decoded character K is not the EOF code, the NO route is taken and the character K is set. Output (Step D
13). Furthermore, processing steps E13 to E15 at the time of encoding
Similarly, register the letter K in the context tree (step D1
4) Add the letter K below the immediately preceding letter P (step D1
5) Update the escape length of the code tree and the code length of the 0th-order code K as a splay code (step D16).

【０１１０】そして、以降は上述のステップＤ１７から
の処理を行なう。このようにして、全ての一文字にスプ
レイ符号を割り当てておき、直前文字から繋がる文字列
が既に収集した辞書中の文字列中にない一文字のスプレ
イ符号を復号したときに、符号を更新し、上述の直前文
字からの繋がる復号した文字を文脈木に登録することが
できる。Then, the processes from step D17 described above are performed thereafter. In this way, a splay code is assigned to all one character, and when the splay code of one character that is not in the character string in the dictionary in which the character string connected from the preceding character is already collected is decoded, the code is updated. The connected decoded characters from the character immediately before can be registered in the context tree.

【０１１１】また、上述の処理ステップＤ１０におい
て、復号した参照番号がエスケープコードでなければ、
ＮＯルートをとり、文脈木の参照番号に対応する文字列
を復元して出力し（ステップＤ１８）、文字（列）の最
終文字を直前文字Ｐに置き換える（ステップＤ１９）。
そして、以降は、上述のステップＤ２０からの処理を行
なう。If the decoded reference number is not the escape code in the above processing step D10,
Taking the NO route, the character string corresponding to the reference number of the context tree is restored and output (step D18), and the last character of the character (string) is replaced with the immediately preceding character P (step D19).
Then, the processes from step D20 described above are performed thereafter.

【０１１２】このようにして、辞書登録文字列として、
復号化済の直前文字から登録し、この直前文字から続く
文字列を復号することができる。また、上述の処理ステ
ップＤ４またはステップＤ１２において、復号した文字
ＫがＥＯＦ符号であれば、ＹＥＳルートをとり、復元処
理を終了する。以上のようにして、辞書としての文脈木
に復元したデータの文字列を収集し番号を付けて登録す
るとともに、復元各文字列に対応してスプレイ符号を対
応させておき、辞書番号に対応する文字列をスプレイ符
号で復号化及び更新を行ない、また、所定の最大文字列
長に達するまで符号化文字列の伸長文字列を登録し、こ
の伸長文字列に対応するスプレイ符号を登録する。In this way, as a dictionary registration character string,
It is possible to register from the immediately preceding character that has been decoded and to decode the character string that continues from this immediately preceding character. If the decoded character K is the EOF code in the processing step D4 or step D12 described above, the YES route is taken and the restoration processing is ended. As described above, the character strings of the restored data are collected in the context tree as the dictionary, numbered and registered, and the splay code is made to correspond to each restored character string to correspond to the dictionary number. The character string is decoded and updated with the spray code, and the decompressed character string of the encoded character string is registered until the predetermined maximum character string length is reached, and the spray code corresponding to this decompressed character string is registered.

【０１１３】これにより、データ圧縮装置１の文脈収集
過程１１およびスプレイ符号化過程１２により圧縮・符
号化された文字Ｋを復元している。このように、関連技
術１のデータ復元方法によれば、文脈木に復元した文字
（列）に番号を付けて登録するとともに、この文脈木に
対応した符号表としての符号木を構築することにより、
符号化された文字の符号と一致する符号を符号表におい
て検索し、一致した符号に対応する文字を復号文字とし
て出力するという２段階の処理を同時に行なうので、デ
ータの復元処理の速度が大幅に向上するという効果があ
る。As a result, the character K compressed and encoded by the context collection process 11 and the spray encoding process 12 of the data compression device 1 is restored. As described above, according to the data restoration method of the related technique 1, the restored characters (strings) are numbered and registered in the context tree, and a code tree as a code table corresponding to the context tree is constructed. ,
Since a code that matches the code of the coded character is searched in the code table and the character corresponding to the code that matches is output as a decoded character at the same time, the speed of data restoration processing is greatly increased. It has the effect of improving.

【０１１４】また、上述の確率モデルは、文字の復元毎
に符号木のノードが作成・更新（スプレイ処理）される
ことによって構築されるので、文字の復元毎に、既に構
築されている符号表（確率モデル）を再構築するという
膨大な演算処理を行なう必要がなくなり、これにより復
元処理の速度がさらに向上する効果がある。さらに、関
連技術１のデータ復元方法によれば、過去に復号した符
号と同じ符号が出現する毎に、過去に登録してあった同
じ符号の符号木上のノードを上位のノードと組み替えて
（スプレイ処理）符号長を１／２にすることにより、同
じ符号が繰り返し出現するほどその符号は少ないビット
数で表せるので、同じ符号を繰り返し復号する場合、復
元処理の速度が大幅に向上する効果がある。Further, since the above-mentioned probabilistic model is constructed by creating / updating (spraying) the node of the code tree for each character restoration, the code table already constructed is constructed for each character restoration. There is no need to perform a huge amount of arithmetic processing of reconstructing the (stochastic model), which has the effect of further improving the speed of the restoration processing. Furthermore, according to the data restoration method of Related Technique 1, every time the same code as the code decoded in the past appears, the node in the code tree of the same code registered in the past is recombined with the upper node ( (Spray processing) By reducing the code length to ½, the code can be represented by a smaller number of bits as the same code repeatedly appears. Therefore, when the same code is repeatedly decoded, the speed of the restoration process is significantly improved. is there.

【０１１５】また、関連技術１のデータ復元方法によれ
ば、文字を１文字単位に復号するのではなく、複数文字
単位の文字列として復号することにより、スプレイ処理
が高速化できるとともに、復号単位を文字列とするの
で、復元できる情報源が拡大し、復元効率が大幅に向上
するという効果もある。なお、上述した例では、直前文
字から繋がる文字列を文脈として収集して符号化・復号
する方法について述べたが、必ずしも直前文字にこだわ
ることはなく２文字以上以前からの文脈を収集して符号
化・復号化してもよい。Further, according to the data restoration method of the related art 1, by decoding characters as a character string in units of a plurality of characters instead of decoding them in character units, the spray processing can be speeded up and the decoding unit can be improved. Since "" is a character string, there is also an effect that the information source that can be restored is expanded and the restoration efficiency is greatly improved. In the above example, the method of collecting and encoding / decoding a character string connected from the immediately preceding character as a context has been described, but it is not always necessary to focus on the immediately preceding character and the context from two or more characters before is collected and encoded. It may be decrypted.

【０１１６】また、上述した例では、動的に文脈を収集
してスプレイ処理する例を示したが、必ずしも動的であ
る必要はなく、予め代表的なサンプルから収集した静的
な文脈を用いてスプレイ処理してもよい。さらに、上述
した例では、入力された全てのデータを動的可変長符号
化（スプレイ符号化）する場合について述べたが、相当
程度のデータを符号化した後に、スプレイ符号化の更新
操作を止めて、静的な可変長符号化をしてもよい。この
場合、符号化と復号化とで予め取決めをしておき同期が
取れればよい。Further, in the above-mentioned example, an example in which the context is dynamically collected and the spray processing is performed is shown. However, the context need not always be dynamic, and a static context previously collected from a representative sample is used. Spray processing may be performed. Further, in the above-described example, the case where all the input data is dynamically variable-length coded (spray coded) has been described. However, after a considerable amount of data is coded, the update operation of the spray coding is stopped. Alternatively, static variable length coding may be performed. In this case, it suffices that an agreement be made in advance for encoding and decoding and synchronization be achieved.

【０１１７】また、上述した例では、圧縮するデータを
文字あるいは文字列として説明したが、関連技術１のデ
ータ圧縮方法及びデータ復元方法は、他の画像データや
音声データなどあらゆるデータに対して適用できる。（ｂ）本発明に関連する技術２の説明次に、本発明に関連する技術２（以下、関連技術２とい
う）について説明するが、まず、その原理について説明
する。Further, in the above-mentioned example, the data to be compressed is described as a character or a character string, but the data compression method and the data decompression method of the related technique 1 are applied to all data such as other image data and audio data. it can. (B) Description of Technology 2 Related to the Present Invention Next, technology 2 related to the present invention (hereinafter referred to as related technology 2) will be described. First, its principle will be described.

【０１１８】関連技術２に係るデータ圧縮方法を実施す
るための装置の構成を、図１に示す。この図１に示すデ
ータ圧縮装置は、入力データを過去に出現した履歴に応
じて符号化して圧縮するものである。ここで、１００は
前置データ保持手段、１０１は履歴保持手段、１０２は
符号木保持手段、１０３は符号木決定手段、１０４は符
号出力手段、１０５は符号長変更手段、１０６は前置デ
ータ更新手段である。FIG. 1 shows the configuration of an apparatus for carrying out the data compression method according to Related Technique 2. The data compression apparatus shown in FIG. 1 encodes and compresses input data according to the history of past appearances. Here, 100 is prefix data holding means, 101 is history holding means, 102 is code tree holding means, 103 is code tree determining means, 104 is code output means, 105 is code length changing means, and 106 is prefix data updating. It is a means.

【０１１９】前置データ保持手段１００は、入力データ
の直前までに入力されたｎ個の入力データからなる文脈
を保持するものであり、履歴保持手段１０１は、入力デ
ータと文脈との組み合わせを保持するものであり、符号
木保持手段１０２は、文脈毎に独立した符号木を保持す
るものである。また、符号木決定手段１０３は、前置デ
ータ保持手段１００に保持されている直前までの入力デ
ータからデータの符号木を決定するものであり、符号出
力手段１０４は、符号木決定手段１０３で選択した符号
木の頂点を意味するルートからデータが格納されている
リーフに沿って途中に位置する分岐点としてのノードか
らの分岐に従って固有のデータを出力するものである。The prefix data holding means 100 holds a context consisting of n pieces of input data input up to immediately before the input data, and the history holding means 101 holds a combination of the input data and the context. The code tree holding unit 102 holds an independent code tree for each context. Further, the code tree determining means 103 determines a code tree of data from the input data up to immediately before held in the prefix data holding means 100, and the code output means 104 is selected by the code tree determining means 103. The unique data is output according to a branch from a node as a branch point located midway along the leaf in which the data is stored, from the root meaning the apex of the code tree.

【０１２０】さらに、符号長変更手段１０５は、符号化
したリーフと他のリーフあるいはノードとを組み替える
ものであり、前置データ更新手段１０６は、データを前
置データ保持手段１００に登録するものである。また、
関連技術２に係る他のデータ圧縮方法を実施するための
装置の構成を、図２に示す。この図２に示すデータ圧縮
装置も、入力データを過去に出現した履歴に応じて符号
化して圧縮するものである。Further, the code length changing means 105 is for recombining the encoded leaf with another leaf or node, and the prefix data updating means 106 is for registering the data in the prefix data holding means 100. is there. Also,
FIG. 2 shows the configuration of an apparatus for carrying out another data compression method according to Related Technique 2. The data compression apparatus shown in FIG. 2 also encodes and compresses input data according to the history of past appearances.

【０１２１】ここで、１００は前置データ保持手段、１
０１は履歴保持手段、１０３は符号木決定手段、１０７
は符号木決定手段である。さらに、１０８は文脈判別手
段、１０９はエスケープコード出力手段、１１０は文脈
変更手段、１１１は符号出力手段、１１６は制御手段で
ある。前置データ保持手段１００は、入力データの直前
までに入力されたｎ個の入力データからなる文脈を保持
するものであり、履歴保持手段１０１は、入力データと
文脈との組み合わせを保持するものであり、符号木保持
手段１０７は、データ未登録を示すデータとして定義さ
れるエスケープコードをあらかじめ登録した文脈毎に独
立した符号木を保持するものである。Here, 100 is a front data holding means, 1
01 is history holding means, 103 is code tree determining means, 107
Is a code tree determining means. Further, 108 is a context discrimination means, 109 is an escape code output means, 110 is a context change means, 111 is a code output means, and 116 is a control means. The pre-data holding means 100 holds the context consisting of n pieces of input data input just before the input data, and the history holding means 101 holds the combination of the input data and the context. The code tree holding unit 107 holds an independent code tree for each context in which an escape code defined as data indicating unregistered data is registered in advance.

【０１２２】また、符号木決定手段１０３は、文脈と入
力データからデータの符号木を決定するものであり、文
脈判別手段１０８は、符号木決定手段１０３で決定した
符号木にデータが登録されているか否かを判別するもの
であり、エスケープコード出力手段１０９は、符号木に
データが登録されていないときは符号木の頂点を意味す
るルートからエスケープコードのデータ格納点としての
リーフまでの途中に位置する分岐点としてのノードから
の分岐に従ってエスケープコードを出力するものであ
る。The code tree determining means 103 determines a code tree of data from the context and the input data. The context determining means 108 registers the data in the code tree determined by the code tree determining means 103. The escape code output means 109 determines whether or not there is any data registered in the code tree from the root, which means the top of the code tree, to the leaf as the data storage point of the escape code. The escape code is output according to the branch from the node serving as the branch point.

【０１２３】さらに、文脈変更手段１１０は、符号木に
データが登録されていないときは文脈の長さｎを短くす
るものであり、符号出力手段１１１は、符号木にデータ
が登録されているときは符号木のルートからデータのリ
ーフまでの途中に位置するノードからの分岐に従ってデ
ータの符号を出力するものである。また、符号長変更手
段１０５は、符号化したリーフと他のリーフあるいはノ
ードとを組み換えるものであり、前置データ更新手段１
０６は、データを前置データ保持手段１００に登録する
ものであり、制御手段１１６は、エスケープコードを符
号化したときはデータの符号化を行なうまで処理を繰り
返すものである。Further, the context changing means 110 shortens the length n of the context when no data is registered in the code tree, and the code output means 111 when the data is registered in the code tree. Is to output the code of the data according to the branch from the node located on the way from the root of the code tree to the leaf of the data. The code length changing means 105 is a means for recombining the coded leaf and another leaf or node, and the prefix data updating means 1 is used.
Reference numeral 06 is for registering the data in the front data holding means 100, and when the control means 116 encodes the escape code, it repeats the processing until the data is encoded.

【０１２４】また、関連技術２に係るさらに他のデータ
圧縮方法を実施するための装置の構成を、図３に示す。
この図３に示すデータ圧縮装置も、入力データを過去に
出現した履歴に応じて符号化して圧縮するものである。
ここで、１００は前置データ保持手段、１０１は履歴保
持手段、１０３は符号木決定手段、１０５は符号長変更
手段、１０６は前置データ更新手段、１０７は符号木保
持手段、１０８は文脈判別手段、１０９はエスケープコ
ード出力手段、１１０は文脈変更手段、１１１は符号出
力手段、１１２は履歴登録手段、１１３は符号登録手
段、１１６は制御手段である。FIG. 3 shows the configuration of an apparatus for implementing still another data compression method according to Related Technique 2.
The data compression apparatus shown in FIG. 3 also encodes and compresses input data according to the history of past appearances.
Here, 100 is a prefix data holding unit, 101 is a history holding unit, 103 is a code tree determining unit, 105 is a code length changing unit, 106 is a prefix data updating unit, 107 is a code tree holding unit, and 108 is context discrimination. Means, 109 is an escape code output means, 110 is a context changing means, 111 is a code output means, 112 is a history registration means, 113 is a code registration means, and 116 is a control means.

【０１２５】前置データ保持手段１００は、入力データ
の直前までに入力されたｎ個の入力データからなる文脈
を保持するものであり、履歴保持手段１０１は、入力デ
ータと文脈との組み合わせを保持するものであり、符号
木保持手段１０７は、データ未登録を示すデータとして
定義されるエスケープコードを予め登録した文脈毎に独
立した符号木を保持するものである。The prefix data holding means 100 holds the context consisting of n pieces of input data input up to immediately before the input data, and the history holding means 101 holds the combination of the input data and the context. The code tree holding means 107 holds an independent code tree for each context in which an escape code defined as data indicating unregistered data is registered in advance.

【０１２６】また、符号木決定手段１０３は、文脈と入
力データからデータの符号木を決定するものであり、文
脈判別手段１０８は、符号木決定手段１０３で決定した
符号木にデータが登録されているか否かを判別するもの
である。さらに、エスケープコード出力手段１０９は、
符号木にデータが登録されていないときは符号木の頂点
を意味するルートからエスケープコードのデータ格納点
としてのリーフまでの中に位置する分岐点としてのノー
ドからの分岐に従ってエスケープコードを出力するもの
である。Further, the code tree determining means 103 determines the code tree of data from the context and the input data, and the context determining means 108 registers the data in the code tree determined by the code tree determining means 103. It is to determine whether or not there is. Further, the escape code output means 109
When data is not registered in the code tree, the escape code is output according to the branch from the node as the branch point located between the root that means the vertex of the code tree and the leaf as the data storage point of the escape code. Is.

【０１２７】また、履歴登録手段１１２は、符号木にデ
ータが登録されていないときは履歴保持手段１０１にデ
ータと文脈の組み合わせを登録するものであり、符号登
録手段１１３は、符号木にデータが登録されていないと
きは符号木にデータを新規に登録するものであり、文脈
変更手段１１０は、符号木にデータが登録されていない
ときは文脈の長さｎを短くするものである。Further, the history registration means 112 registers the combination of the data and the context in the history holding means 101 when the data is not registered in the code tree, and the code registration means 113 stores the data in the code tree. When the data is not registered, the data is newly registered in the code tree, and the context changing unit 110 shortens the context length n when the data is not registered in the code tree.

【０１２８】さらに、符号出力手段１１１は、符号木に
データが登録されているときは符号木のルートからデー
タのリーフまでの途中に位置するノードからの分岐に従
ってデータの符号を出力するものであり、符号長変更手
段１０５は、符号化したリーフと他のリーフあるいはノ
ードとを組み換えるものである。また、前置データ更新
手段１０６は、データを前置データ保持手段１００に登
録するものであり、制御手段１１６は、エスケープコー
ドを符号化したときはデータの符号化を行なうまで処理
を繰り返すものである。Further, the code output means 111 outputs the code of the data according to the branch from the node located on the way from the root of the code tree to the leaf of the data when the data is registered in the code tree. The code length changing unit 105 recombines the encoded leaf and another leaf or node. Further, the prefix data update means 106 registers the data in the prefix data holding means 100, and the control means 116 repeats the process until the data is coded when the escape code is coded. is there.

【０１２９】さらに、関連技術２に係るさらに他のデー
タ圧縮方法を実施するための装置の構成を、図４に示
す。この図４に示すデータ圧縮装置も、入力データを過
去に出現した履歴に応じて符号化して圧縮するものであ
る。ここで、１００は前置データ保持手段、１０１は履
歴保持手段、１０３は符号木決定手段、１０５は符号長
変更手段、１０６は前置データ更新手段、１０７は符号
木保持手段、１０８は文脈判別手段、１０９および１１
１はエスケープコード出力手段、１１０は文脈変更手
段、１１４は履歴登録手段、１１５は符号登録手段、１
１７は制御手段である。Further, FIG. 4 shows the configuration of an apparatus for implementing still another data compression method according to Related Technique 2. The data compression apparatus shown in FIG. 4 also encodes and compresses the input data according to the history of past appearances. Here, 100 is a prefix data holding unit, 101 is a history holding unit, 103 is a code tree determining unit, 105 is a code length changing unit, 106 is a prefix data updating unit, 107 is a code tree holding unit, and 108 is context discrimination. Means, 109 and 11
1 is an escape code output means, 110 is a context changing means, 114 is a history registration means, 115 is a code registration means, 1
Reference numeral 17 is a control means.

【０１３０】前置データ保持手段１００は、入力データ
の直前までに入力されたｎ個の入力データからなる文脈
を保持するものであり、履歴保持手段１０１は、入力デ
ータと文脈との組み合わせを保持するものであり、符号
木保持手段１０７は、データ未登録を示すデータとして
定義されるエスケープコードをあらかじめ登録した文脈
毎に独立した符号木を保持するものである。The prefix data holding means 100 holds the context consisting of n pieces of input data input up to immediately before the input data, and the history holding means 101 holds the combination of the input data and the context. The code tree holding unit 107 holds an independent code tree for each context in which an escape code defined as data indicating unregistered data is registered in advance.

【０１３１】また、符号木決定手段１０３は、文脈と入
力データからデータの符号木を決定するものであり、文
脈判別手段１０８は、符号木決定手段１０３で決定した
符号木にデータが登録されているか否かを判別するもの
である。さらに、エスケープコード出力手段１０９は、
符号木にデータが登録されていないときは符号木の頂点
を意味するルートからエスケープコードのデータ格納点
としてのリーフまでの途中に位置する分岐点としてのノ
ードからの分岐に従ってエスケープコードを出力するも
のである。The code tree determining means 103 determines a code tree of data from the context and the input data. The context determining means 108 registers the data in the code tree determined by the code tree determining means 103. It is to determine whether or not there is. Further, the escape code output means 109
When data is not registered in the code tree, the escape code is output according to the branch from the node that is located on the way from the root that means the vertex of the code tree to the leaf that is the data storage point of the escape code. Is.

【０１３２】また、文脈変更手段１１０は、符号木にデ
ータが登録されていないときは文脈の長さｎを短くする
ものであり、エスケープコード出力手段１１１は、符号
木にデータが登録されているときは符号木のルートから
データのリーフまでの途中に位置するノードからの分岐
にしたがってデータの符号を出力するものである。さら
に、履歴登録手段１１４は、履歴保持手段１０１にデー
タと文脈の組み合わせを登録するものであり、符号登録
手段１１５は、符号木にデータを新規に登録するもので
あり、符号長変更手段１０５は、符号化したリーフと他
のリーフあるいはノードとを組み換えるものであり、前
置データ更新手段１０６は、データを前置データ保持手
段１００に登録するものである。The context changing means 110 shortens the context length n when no data is registered in the code tree, and the escape code output means 111 registers data in the code tree. In this case, the code of the data is output according to the branch from the node located on the way from the root of the code tree to the leaf of the data. Further, the history registration means 114 is for registering a combination of data and context in the history holding means 101, the code registration means 115 is for newly registering data in the code tree, and the code length changing means 105 is for , The encoded leaf and another leaf or node are recombined, and the prefix data updating means 106 registers the data in the prefix data holding means 100.

【０１３３】また、制御手段１１７は、データの符号化
時に一度でもエスケープコードを符号化したときは、デ
ータの符号化の直前の文脈とデータとの組み合わせを履
歴登録手段１１４で履歴保持手段１０１に登録し、デー
タの符号化の直前に符号化したエスケープコードを持つ
符号木に符号登録手段１１５でデータを新規に登録する
ものである。Further, when the escape code is encoded even once at the time of encoding the data, the control means 117 causes the history registration means 114 to store the combination of the context and the data immediately before the data encoding to the history holding means 101. The data is newly registered in the code tree having the escape code that is registered and encoded immediately before the data is encoded.

【０１３４】一方、関連技術２に係るデータ復元方法を
実施するための装置の構成を、図５に示す。この図５に
示すデータ復元装置は、過去に出現した履歴に応じて符
号化した符号を復号するものである。ここで、２００は
前置データ保持手段、２０１は履歴保持手段、２０２は
符号木保持手段、２０３は符号木決定手段、２０４は復
号手段、２０５は符号長変更手段、２０６は前置データ
更新手段である。On the other hand, FIG. 5 shows the configuration of an apparatus for carrying out the data restoration method according to Related Technique 2. The data restoration device shown in FIG. 5 decodes a code that has been encoded according to a history that has appeared in the past. Here, 200 is a prefix data holding means, 201 is a history holding means, 202 is a code tree holding means, 203 is a code tree determining means, 204 is a decoding means, 205 is a code length changing means, and 206 is a prefix data updating means. Is.

【０１３５】前置データ保持手段２００は、過去に復号
したｎ個のデータを保持するものであり、履歴保持手段
２０１は、復号したデータと文脈との組み合わせを保持
するものであり、符号木保持手段２０２は、文脈毎に独
立した符号木を保持するものである。また、符号木決定
手段２０３は、前置データ保持手段２００に保持されて
いる文脈からデータを復号するための符号木を決定する
ものであり、復号手段２０４は、符号に従って符号木決
定手段２０３で選択した符号木の頂点を意味するルート
から分岐点としてのノードを走査して到達したデータ格
納点としてのリーフに格納されているデータを出力する
ものである。The prefix data holding means 200 holds n pieces of data decoded in the past, and the history holding means 201 holds a combination of the decoded data and the context, and holds the code tree. The means 202 holds an independent code tree for each context. Further, the code tree determining means 203 determines a code tree for decoding data from the context held in the prefix data holding means 200, and the decoding means 204 uses the code tree determining means 203 according to the code. The data stored in the leaf as the data storage point reached by scanning the node as the branch point from the root meaning the apex of the selected code tree is output.

【０１３６】さらに、符号長変更手段２０５は、復号し
たリーフと他のリーフあるいはノードとを組み替えるも
のであり、前置データ更新手段２０６は、復号したデー
タを前置データ保持手段２００に登録するものである。
また、関連技術２に係る他のデータ復元方法を実施する
ための装置の構成を、図６に示す。この図６に示すデー
タ復元装置も、過去に出現した履歴に応じて符号化した
符号を復号するものである。Further, the code length changing means 205 recombines the decoded leaf with another leaf or node, and the prefix data updating means 206 registers the decoded data in the prefix data holding means 200. Is.
Further, FIG. 6 shows a configuration of an apparatus for implementing another data restoration method according to Related Technique 2. The data restoration device shown in FIG. 6 also decodes a code that has been encoded according to a history that has appeared in the past.

【０１３７】ここで、２００は前置データ保持手段、２
０１は履歴保持手段、２０３は符号木決定手段、２０４
は復号手段、２０５は符号長変更手段、２０６は前置デ
ータ更新手段、２０７は符号木保持手段、２０８は文脈
変更手段、２１３は制御手段である。前置データ保持手
段２００は、過去に復号したｎ個のデータを保持するも
のであり、履歴保持手段２０１は、復号したデータと文
脈との組み合わせを保持するものであり、符号木保持手
段２０７は、データ未登録を示すデータとして定義され
るエスケープコードをあらかじめ登録した符号木を保持
するものである。Here, 200 is the front data holding means, 2
01 is history holding means, 203 is code tree determining means, and 204
Is a decoding unit, 205 is a code length changing unit, 206 is a prefix data updating unit, 207 is a code tree holding unit, 208 is a context changing unit, and 213 is a control unit. The prefix data holding unit 200 holds n pieces of data decoded in the past, the history holding unit 201 holds a combination of the decoded data and the context, and the code tree holding unit 207. , Holds a code tree in which escape codes defined as data indicating unregistered data are registered in advance.

【０１３８】また、符号木決定手段２０３は、前置デー
タ保持手段２００に保持されている文脈からデータを復
号するための符号木を決定するものであり、復号手段２
０４は、符号に従って符号木決定手段２０３で選択した
符号木の頂点を意味するルートから分岐点としてのノー
ドを走査して到達したデータ格納点としてのリーフに格
納されているデータを出力するものである。The code tree determining means 203 determines a code tree for decoding data from the context held in the prefix data holding means 200, and the decoding means 2
Reference numeral 04 is for outputting the data stored in the leaf as the data storage point reached by scanning the node as the branch point from the root that means the vertex of the code tree selected by the code tree determination means 203 according to the code. is there.

【０１３９】さらに、符号長変更手段２０５は、復号し
たリーフと他のリーフあるいはノードとを組み替えるも
のであり、文脈変更手段２０８は−出力したデータがエ
スケープコードであったときデータを棄却し文脈を短く
するものであり、前置データ更新手段２０６は、復号し
たデータを前置データ保持手段２００に登録するもので
ある。Further, the code length changing means 205 is for recombining the decoded leaf with another leaf or node, and the context changing means 208 is-when the output data is an escape code, discards the data and sets the context. The prefix data updating unit 206 registers the decrypted data in the prefix data holding unit 200.

【０１４０】制御手段２１３は、エスケープコードを復
号した時は文脈変更手段２０８で文脈を再設定し、エス
ケープコード以外が復号されるまで処理を繰り返すもの
である。さらに、関連技術２に係るさらに他のデータ復
元方法を実施するための装置の構成を、図７に示す。こ
の図７に示すデータ復元装置も、過去に出現した履歴に
応じて符号化した符号を復号するものである。When the escape code is decoded, the control means 213 resets the context by the context changing means 208, and repeats the process until a code other than the escape code is decoded. Further, FIG. 7 shows the configuration of an apparatus for implementing still another data restoration method according to Related Technique 2. The data restoration device shown in FIG. 7 also decodes a code that has been encoded according to a history that has appeared in the past.

【０１４１】ここで、２００は前置データ保持手段、２
０１は履歴保持手段、２０３は符号木決定手段、２０４
は復号手段、２０５は符号長変更手段、２０６は前置デ
ータ更新手段、２０７は符号木保持手段、２０８は文脈
変更手段、２０９は履歴登録手段、２１０は符号登録手
段、２１３は制御手段である。前置データ保持手段２０
０は、過去に復号したｎ個のデータを保持するものであ
り、履歴保持手段２０１は、復号したデータと文脈との
組み合わせを保持するものであり、符号木保持手段２０
７は、エスケープコードをあらかじめ登録した符号木を
保持するものである。Here, 200 is the front data holding means, 2
01 is history holding means, 203 is code tree determining means, and 204
Is decoding means, 205 is code length changing means, 206 is prefix data updating means, 207 is code tree holding means, 208 is context changing means, 209 is history registration means, 210 is code registration means, and 213 is control means. . Prefix data holding means 20
0 holds n pieces of data decoded in the past, history holding means 201 holds a combination of decoded data and context, and code tree holding means 20
Reference numeral 7 holds a code tree in which escape codes are registered in advance.

【０１４２】また、符号木決定手段２０３は、前置デー
タ保持手段２００に保持されている文脈からデータを復
号するための符号木を決定するものであり、復号手段２
０４は、符号に従って符号木決定手段２０３で選択した
符号木の頂点を意味するルートから分岐点としてのノー
ドを走査して到達したデータ格納点としてのリーフに格
納されているデータを出力するものである。The code tree determining means 203 is for determining a code tree for decoding data from the context held in the prefix data holding means 200, and the decoding means 2
Reference numeral 04 is for outputting the data stored in the leaf as the data storage point reached by scanning the node as the branch point from the root that means the vertex of the code tree selected by the code tree determination means 203 according to the code. is there.

【０１４３】さらに、符号長変更手段２０５は、復号し
たリーフと他のリーフあるいはノードとを組み替えるも
のであり、文脈変更手段２０８は、出力したデータが上
記エスケープコードであったとき、上記データを棄却し
文脈を短くするものである。また、前置データ更新手段
２０６は、復号したデータを前置データ保持手段２００
に登録するものであり、履歴登録手段２０９は、データ
の復号処理でエスケープコードを復号したときの全ての
文脈と復号したデータとを履歴保持手段２０１に登録す
るものである。Further, the code length changing means 205 recombines the decoded leaf with another leaf or node, and the context changing means 208 discards the above data when the output data is the above escape code. It shortens the context. Further, the prefix data updating unit 206 stores the decoded data in the prefix data holding unit 200.
The history registration means 209 registers in the history holding means 201 all the contexts when the escape code is decoded in the data decoding process and the decoded data.

【０１４４】また、符号登録手段２１０は、データの復
号処理でエスケープコードを復号した時の文脈に対応し
た全ての符号木にデータの符号を登録するものであり、
制御手段２１３は、エスケープコードを復号した時は文
脈変更手段２０８で文脈を再設定し、エスケープコード
以外が復号されるまで処理を繰り返すものである。Further, the code registration means 210 registers the code of the data in all code trees corresponding to the context when the escape code is decoded in the data decoding process,
When the escape code is decoded, the control means 213 resets the context by the context changing means 208, and repeats the process until a code other than the escape code is decoded.

【０１４５】また、関連技術２に係るさらに他のデータ
復元方法を実施するための装置の構成を、図８に示す。
この図８に示すデータ復元装置も、過去に出現した履歴
に応じて符号化した符号を復号するものである。ここ
で、２００は前置データ保持手段、２０１は履歴保持手
段、２０３は符号木決定手段、２０４は復号手段、２０
５は符号長変更手段、２０６は前置データ更新手段、２
０７は符号木保持手段、２０８は文脈変更手段、２１２
は符号登録手段、２１３は制御手段である。FIG. 8 shows the configuration of an apparatus for carrying out still another data restoration method according to Related Technique 2.
The data restoration device shown in FIG. 8 also decodes a code that has been encoded according to a history that has appeared in the past. Here, 200 is prefix data holding means, 201 is history holding means, 203 is code tree determining means, 204 is decoding means, 20
5 is a code length changing means, 206 is a prefix data updating means, 2
Reference numeral 07 is a code tree holding means, 208 is a context changing means, 212
Is a code registration means, and 213 is a control means.

【０１４６】前置データ保持手段２００は、過去に復号
したｎ個のデータを保持するものであり、履歴保持手段
２０１は、復号したデータと文脈との組み合わせを保持
するものであり、符号木保持手段２０７は、データ未登
録を示すデータとして定義されるエスケープコードをあ
らかじめ登録した符号木を保持するものである。また、
符号木決定手段２０３は、前置データ保持手段２００に
保持されている文脈からデータを復号するための符号木
を決定するものであり、復号手段２０４は、符号に従っ
て符号木決定手段２０３で選択した符号木の頂点を意味
するルートから分岐点としてのノードを走査して到達し
たデータ格納点としてのリーフに格納されているデータ
を出力するものである。The prefix data holding means 200 holds n pieces of data decoded in the past, and the history holding means 201 holds a combination of the decoded data and the context, and holds the code tree. The means 207 holds a code tree in which an escape code defined as data indicating unregistered data is registered in advance. Also,
The code tree determining means 203 determines a code tree for decoding data from the context held in the prefix data holding means 200, and the decoding means 204 is selected by the code tree determining means 203 according to the code. The data stored in the leaf as the data storage point reached by scanning the node as the branch point from the root meaning the apex of the code tree is output.

【０１４７】さらに、符号長変更手段２０５は、復号し
たリーフと他のリーフあるいはノードとを組み替えるも
のであり、文脈変更手段２０８は、出力したデータがエ
スケープコードであったときデータを棄却し文脈を短く
するものであり、前置データ更新手段２０６は、復号し
たデータを前置データ保持手段２００に登録するもので
ある。Further, the code length changing means 205 is for recombining the decoded leaf with another leaf or node, and the context changing means 208 rejects the data when the output data is an escape code and sets the context. The prefix data updating unit 206 registers the decrypted data in the prefix data holding unit 200.

【０１４８】また、履歴登録手段２１１は、データの復
号処理でエスケープコードを最後に復号した時の文脈と
復号したデータとを履歴保持手段２０１に登録するもの
であり、符号登録手段２１２は、データの復号処理で最
後にエスケープコードを復号した時の文脈に対応した符
号木にデータの符号を登録するものであり、制御手段２
１３は、エスケープコードを復号した時は文脈変更手段
２０８で文脈を再設定し、エスケープコード以外が復号
されるまで処理を繰り返すものである。The history registration means 211 registers the context when the escape code was last decoded in the data decoding process and the decoded data in the history holding means 201, and the code registration means 212 stores the data. The code of the data is registered in the code tree corresponding to the context when the escape code is finally decoded in the decoding process of 1.
When the escape code is decoded, 13 resets the context by the context changing means 208, and repeats the process until a code other than the escape code is decoded.

【０１４９】そして、図１を用いて説明した構成をもつ
装置、すなわち入力データを過去に出現した履歴に応じ
て符号化して圧縮するデータ圧縮装置においては、前置
データ保持手段１００が、入力データの直前までに入力
されたｎ個の入力データからなる文脈を保持し、履歴保
持手段１０１が、入力データと文脈との組み合わせを保
持し、符号木保持手段１０２が、文脈毎に独立した符号
木を保持する。In the apparatus having the configuration described with reference to FIG. 1, that is, in the data compression apparatus that encodes and compresses the input data according to the history that has appeared in the past, the prefix data holding means 100 is used. , The history holding unit 101 holds the combination of the input data and the context, and the code tree holding unit 102 holds the code tree independent for each context. Hold.

【０１５０】そして、符号木決定手段１０３が、前置デ
ータ保持手段１００に保持されている直前までの入力デ
ータからデータの符号木を決定し、符号出力手段１０４
が、符号木決定手段１０３で選択した符号木の頂点を意
味するルートからデータが格納されているリーフに沿っ
て途中に位置する分岐点としてのノードからの分岐に従
って固有のデータを出力する。Then, the code tree determining means 103 determines the code tree of the data from the input data up to immediately before being held in the prefix data holding means 100, and the code output means 104.
Outputs unique data according to a branch from a node as a branch point located midway along a leaf in which data is stored from a root that means a vertex of the code tree selected by the code tree determining unit 103.

【０１５１】さらに、符号長変更手段１０５が、符号化
したリーフと他のリーフあるいはノードとを組み替え、
前置データ更新手段１０６が、データを前置データ保持
手段１００に登録することができる。次に、図２を用い
て説明した構成をもつ装置、すなわち入力データを過去
に出現した履歴に応じて符号化して圧縮するデータ圧縮
装置においては、前置データ保持手段１００が、入力デ
ータの直前までに入力されたｎ個の入力データからなる
文脈を保持し、履歴保持手段１０１が、入力データと文
脈との組み合わせを保持し、符号木保持手段１０７が、
データ未登録を示すデータとして定義されるエスケープ
コードをあらかじめ登録した文脈毎に独立した符号木を
保持する。Further, the code length changing means 105 rearranges the coded leaf and another leaf or node,
The pre-data updating means 106 can register the data in the pre-data holding means 100. Next, in the device having the configuration described with reference to FIG. 2, that is, in the data compression device that encodes and compresses the input data according to the history that has appeared in the past, the pre-data holding unit 100 immediately before the input data. Holds a context consisting of n pieces of input data inputted up to, history holding means 101 holds a combination of input data and context, and code tree holding means 107 holds
It holds an independent code tree for each context in which an escape code defined as data indicating unregistered data is registered in advance.

【０１５２】そして、符号木決定手段１０３が、文脈と
入力データからデータの符号木を決定し、文脈判別手段
１０８が、符号木決定手段１０３で決定した符号木にデ
ータが登録されているか否かを判別し、エスケープコー
ド出力手段１０９が、符号木にデータが登録されていな
いときは符号木の頂点を意味するルートからエスケープ
コードのデータ格納点としてのリーフまでの途中に位置
する分岐点としてのノードからの分岐に従ってエスケー
プコードを出力する。Then, the code tree determining means 103 determines the code tree of the data from the context and the input data, and the context determining means 108 determines whether or not the data is registered in the code tree determined by the code tree determining means 103. And the escape code output means 109 determines that a branch point located on the way from the root meaning the vertex of the code tree to the leaf as the data storage point of the escape code when no data is registered in the code tree. Output an escape code according to the branch from the node.

【０１５３】さらに、文脈変更手段１１０が、符号木に
データが登録されていないときは文脈の長さｎを短く
し、符号出力手段１１１が、符号木にデータが登録され
ているときは符号木のルートからデータのリーフまでの
途中に位置するノードからの分岐に従ってデータの符号
を出力し、符号長変更手段１０５が、符号化したリーフ
と他のリーフあるいはノードとを組み換える。Further, the context changing means 110 shortens the length n of the context when the data is not registered in the code tree, and the code output means 111 when the data is registered in the code tree. The code of the data is output according to the branch from the node located on the way from the root to the leaf of the data, and the code length changing unit 105 recombines the coded leaf with another leaf or node.

【０１５４】そして、前置データ更新手段１０６が、デ
ータを前置データ保持手段１００に登録し、制御手段１
１６が、エスケープコードを符号化したときはデータの
符号化を行なうまで処理を繰り返す。次に、図３を用い
て説明した構成をもつ装置、すなわち入力データを過去
に出現した履歴に応じて符号化して圧縮するデータ圧縮
装置においては、前置データ保持手段１００が、入力デ
ータの直前までに入力されたｎ個の入力データからなる
文脈を保持し、履歴保持手段１０１が、入力データと文
脈との組み合わせを保持し、符号木保持手段１０７が、
データ未登録を示すデータとして定義されるエスケープ
コードを予め登録した文脈毎に独立した符号木を保持す
る。Then, the prefix data updating means 106 registers the data in the prefix data holding means 100, and the control means 1
When 16 encodes the escape code, the process is repeated until the data is encoded. Next, in the device having the configuration described with reference to FIG. 3, that is, in the data compression device that encodes and compresses the input data according to the history that has appeared in the past, the pre-data holding unit 100 immediately before the input data. Holds a context consisting of n pieces of input data inputted up to, history holding means 101 holds a combination of input data and context, and code tree holding means 107 holds
It holds an independent code tree for each context in which an escape code defined as data indicating unregistered data is registered in advance.

【０１５５】そして、符号木決定手段１０３が、文脈と
入力データからデータの符号木を決定し、文脈判別手段
１０８が、符号木決定手段１０３で決定した符号木にデ
ータが登録されているか否かを判別し、エスケープ出力
手段１０９が、符号木にデータが登録されていないとき
は符号木の頂点を意味するルートからエスケープコード
のデータ格納点としてのリーフまでの中に位置する分岐
点としてのノードからの分岐に従ってエスケープコード
を出力する。さらに、履歴登録手段１１２が、符号木に
データが登録されていないときは履歴保持手段１０１に
データと文脈の組み合わせを登録し、符号登録手段１１
３が、符号木にデータが登録されていないときは符号木
にデータを新規に登録し、文脈変更手段１１０が、符号
木にデータが登録されていないときは文脈の長さｎを短
くし、符号出力手段１１１が、符号木にデータが登録さ
れているときは符号木のルートからデータのリーフまで
の途中に位置するノードからの分岐に従ってデータの符
号を出力する。Then, the code tree determining means 103 determines the code tree of the data from the context and the input data, and the context determining means 108 determines whether or not the data is registered in the code tree determined by the code tree determining means 103. When the data is not registered in the code tree, the escape output means 109 is a node as a branch point located between the root that means the vertex of the code tree and the leaf as the data storage point of the escape code. The escape code is output according to the branch from. Further, the history registration means 112 registers the combination of the data and the context in the history holding means 101 when the data is not registered in the code tree, and the code registration means 11
3 newly registers the data in the code tree when the data is not registered in the code tree, and the context changing unit 110 shortens the context length n when the data is not registered in the code tree, When the data is registered in the code tree, the code output means 111 outputs the code of the data according to the branch from the node located on the way from the root of the code tree to the leaf of the data.

【０１５６】そして、符号長変更手段１０５が、符号化
したリーフと他のリーフあるいはノードとを組み換え、
前置データ更新手段１０６が、データを前置データ保持
手段１００に登録し、制御手段１１６が、エスケープコ
ードを符号化したときはデータの符号化を行なうまで処
理を繰り返す。次に、図４を用いて説明した構成をもつ
装置、すなわち入力データを過去に出現した履歴に応じ
て符号化して圧縮するデータ圧縮装置においては、前置
データ保持手段１００が、入力データの直前までに入力
されたｎ個の入力データからなる文脈を保持し、履歴保
持手段１０１が、入力データと文脈との組み合わせを保
持し、符号木保持手段１０７が、データ未登録を示すデ
ータとして定義されるエスケープコードをあらかじめ登
録した文脈毎に独立した符号木を保持する。Then, the code length changing means 105 recombines the coded leaf with another leaf or node,
The prefix data updating unit 106 registers the data in the prefix data holding unit 100, and when the control unit 116 codes the escape code, the process is repeated until the data is coded. Next, in the device having the configuration described with reference to FIG. 4, that is, in the data compression device that encodes and compresses the input data according to the history that has appeared in the past, the pre-data holding unit 100 immediately before the input data. The history holding unit 101 holds the combination of the input data and the context, and the code tree holding unit 107 is defined as data indicating that the data has not been registered. It holds an independent code tree for each context in which escape codes are registered in advance.

【０１５７】そして、符号木決定手段１０３が、文脈と
入力データからデータの符号木を決定し、文脈判別手段
１０８が、符号木決定手段１０３で決定した符号木にデ
ータが登録されているか否かを判別し、エスケープコー
ド出力手段１０９と符号木にデータが登録されていない
ときは符号木の頂点を意味するルートからエスケープコ
ードのデータ格納点としてのリーフまでの途中に位置す
る分岐点としてのノードからの分岐に従ってエスケープ
コードを出力する。Then, the code tree determining means 103 determines the code tree of the data from the context and the input data, and the context determining means 108 determines whether or not the data is registered in the code tree determined by the code tree determining means 103. And a node as a branch point located on the way from the root that means the vertex of the code tree to the leaf as the data storage point of the escape code when the data is not registered in the escape code output means 109 and the code tree. The escape code is output according to the branch from.

【０１５８】さらに、文脈変更手段１１０が、符号木に
データが登録されていないときは文脈の長さｎを短く
し、エスケープ符号出力手段１１１が、符号木にデータ
が登録されているときは符号木のルートからデータのリ
ーフまでの途中に位置するノードからの分岐にしたがっ
てデータの符号を出力する。そして、履歴登録手段１１
４が、履歴保持手段１０１にデータと文脈の組み合わせ
を登録し、符号登録手段１１５が、符号木にデータを新
規に登録し、符号長変更手段１０５が、符号化したリー
フと他のリーフあるいはノードとを組み換え、前置デー
タ更新手段１０６が、データを前置データ保持手段１０
０に登録する。Further, the context changing means 110 shortens the length n of the context when the data is not registered in the code tree, and the escape code output means 111 outputs the code when the data is registered in the code tree. The code of the data is output according to the branch from the node located on the way from the root of the tree to the leaf of the data. And history registration means 11
4 registers a combination of data and context in the history holding unit 101, the code registration unit 115 newly registers the data in the code tree, and the code length changing unit 105 codes the encoded leaf and another leaf or node. And the prefix data updating means 106 stores the data in the prefix data holding means 10
Register to 0.

【０１５９】さらに、制御手段１１６が、データの符号
化時に一度でもエスケープコードを符号化したときは、
データの符号化の直前の文脈とデータとの組み合わせを
履歴登録手段１１４で履歴保持手段１０１に登録し、デ
ータの符号化の直前に符号化したエスケープコードを持
つ符号木に符号登録手段１１５でデータを新規に登録す
る。Further, when the control means 116 encodes the escape code even once when encoding the data,
The history registration unit 114 registers the combination of the context and the data immediately before the data encoding in the history holding unit 101, and the code registration unit 115 stores the data in the code tree having the escape code encoded immediately before the data encoding. Is newly registered.

【０１６０】一方、図５を用いて説明した構成をもつ装
置、すなわち過去に出現した履歴に応じて符号化した符
号を復号するデータ復元装置においては、前置データ保
持手段２００が、過去に復号したｎ個のデータを保持
し、履歴保持手段２０１が、復号したデータと文脈との
組み合わせを保持し、符号木保持手段２０２が、文脈毎
に独立した符号木を保持する。On the other hand, in the device having the configuration described with reference to FIG. 5, that is, in the data restoration device that decodes the code coded according to the history that has appeared in the past, the prefix data holding means 200 decodes the data in the past. The history holding unit 201 holds a combination of the decoded data and the context, and the code tree holding unit 202 holds an independent code tree for each context.

【０１６１】そして、符号木決定手段２０３が、前置デ
ータ保持手段２００に保持されている文脈からデータを
復号するための符号木を決定し、復号手段２０４が、符
号に従って符号木決定手段２０３で選択した符号木の頂
点を意味するルートから分岐点としてのノードを走査し
て到達したデータ格納点としてのリーフに格納されてい
るデータを出力する。Then, the code tree determining means 203 determines the code tree for decoding the data from the context held in the prefix data holding means 200, and the decoding means 204 causes the code tree determining means 203 to follow the code. The node stored as a branch point is scanned from the root representing the apex of the selected code tree, and the data stored in the leaf as the data storage point reached is output.

【０１６２】さらに、符号長変更手段２０５が、復号し
たリーフと他のリーフあるいはノードとを組み替え、前
置データ更新手段２０６が、復号したデータを前置デー
タ保持手段２００に登録する。次に、図６を用いて説明
した構成をもつ装置、すなわち過去に出現した履歴に応
じて符号化した符号を復号するデータ復元装置において
は、前置データ保持手段２００が、過去に復号したｎ個
のデータを保持し、履歴保持手段２０１が、復号したデ
ータと文脈との組み合わせを保持し、符号木保持手段２
０７が、データ未登録を示すデータとして定義されるエ
スケープコードをあらかじめ登録した符号木を保持す
る。Further, the code length changing means 205 rearranges the decoded leaf and another leaf or node, and the prefix data updating means 206 registers the decoded data in the prefix data holding means 200. Next, in the device having the configuration described with reference to FIG. 6, that is, in the data decompression device that decodes the code that has been encoded according to the history that has appeared in the past, the pre-data holding unit 200 has n decoded in the past. Data, the history holding means 201 holds the combination of the decoded data and the context, and the code tree holding means 2
Reference numeral 07 holds a code tree in which an escape code defined as data indicating unregistered data is registered in advance.

【０１６３】そして、符号木決定手段２０３が、前置デ
ータ保持手段２００に保持されている文脈からデータを
復号するための符号木を決定し、復号手段２０４が、符
号に従って符号木決定手段２０３で選択した符号木の頂
点を意味するルートから分岐点としてのノードを走査し
て到達したデータ格納点としてのリーフに格納されてい
るデータを出力する。Then, the code tree determining means 203 determines a code tree for decoding the data from the context held in the prefix data holding means 200, and the decoding means 204 causes the code tree determining means 203 to follow the code. The node stored as a branch point is scanned from the root representing the apex of the selected code tree, and the data stored in the leaf as the data storage point reached is output.

【０１６４】さらに、符号長変更手段２０５が、復号し
たリーフと他のリーフあるいはノードとを組み替え、文
脈変更手段２０８が、出力したデータがエスケープコー
ドであったときデータを棄却し文脈を短くし、前置デー
タ更新手段２０６が、復号したデータを前置データ保持
手段２００に登録する。そして、制御手段２１３が、エ
スケープコードを復号した時は文脈変更手段２０８で文
脈を再設定し、エスケープコード以外が復号されるまで
処理を繰り返す。Further, the code length changing means 205 rearranges the decoded leaf and another leaf or node, and the context changing means 208 rejects the data when the output data is an escape code to shorten the context, The prefix data updating unit 206 registers the decrypted data in the prefix data holding unit 200. Then, when the control means 213 decodes the escape code, the context changing means 208 resets the context, and repeats the process until a code other than the escape code is decoded.

【０１６５】次に、図７を用いて説明した構成をもつ装
置、すなわち過去に出現した履歴に応じて符号化した符
号を復号するデータ復元装置においては、前置データ保
持手段２００が、過去に復号したｎ個のデータを保持
し、履歴保持手段２０１が、復号したデータと文脈との
組み合わせを保持し、符号木保持手段２０７が、エスケ
ープコードをあらかじめ登録した符号木を保持する。Next, in the device having the configuration described with reference to FIG. 7, that is, in the data decompression device that decodes the code coded according to the history that has appeared in the past, the prefix data holding means 200 The decoded n data is held, the history holding unit 201 holds the combination of the decoded data and the context, and the code tree holding unit 207 holds the code tree in which the escape code is registered in advance.

【０１６６】そして、符号木決定手段２０３が、前置デ
ータ保持手段２００に保持されている文脈からデータを
復号するための符号木を決定し、復号手段２０４が、符
号に従って符号木決定手段２０３で選択した符号木の頂
点を意味するルートから分岐点としてのノードを走査し
て到達したデータ格納点としてのリーフに格納されてい
るデータを出力する。Then, the code tree determining means 203 determines a code tree for decoding the data from the context held in the prefix data holding means 200, and the decoding means 204 causes the code tree determining means 203 to follow the code. The node stored as a branch point is scanned from the root representing the apex of the selected code tree, and the data stored in the leaf as the data storage point reached is output.

【０１６７】さらに、符号長変更手段２０５が、復号し
たリーフと他のリーフあるいはノードとを組み替え、文
脈変更手段２０８が、出力したデータがエスケープコー
ドであったとき、データを棄却し文脈を短くし、前置デ
ータ更新手段２０６が、復号したデータを前置データ保
持手段２００に登録する。そして、履歴登録手段２０９
が、データの復号処理でエスケープコードを復号したと
きの全ての文脈と復号したデータとを履歴保持手段２０
１に登録し、符号登録手段２１０が、データの復号処理
でエスケープコードを復号した時の文脈に対応した全て
の符号木にデータの符号を登録し、制御手段２１３が、
エスケープコードを復号した時は文脈変更手段２０８で
文脈を再設定し、エスケープコード以外が復号されるま
で処理を繰り返す。Furthermore, the code length changing means 205 rearranges the decoded leaf and another leaf or node, and the context changing means 208 rejects the data and shortens the context when the output data is an escape code. The prefix data updating unit 206 registers the decrypted data in the prefix data holding unit 200. Then, the history registration means 209
However, the history holding unit 20 stores all the contexts when the escape code is decoded in the data decoding process and the decoded data.
1, the code registration unit 210 registers the code of the data in all the code trees corresponding to the context when the escape code is decoded in the data decoding process, and the control unit 213
When the escape code is decoded, the context changing means 208 resets the context, and the process is repeated until a code other than the escape code is decoded.

【０１６８】次に、図８を用いて説明した構成をもつ装
置、すなわち過去に出現した履歴に応じて符号化した符
号を復号するデータ復元装置においては、前置データ保
持手段２００が、過去に復号したｎ個のデータを保持
し、履歴保持手段２０１が、復号したデータと文脈との
組み合わせを保持し、符号木保持手段２０７が、データ
未登録を示すデータとして定義されるエスケープコード
をあらかじめ登録した符号木を保持する。Next, in the device having the configuration described with reference to FIG. 8, that is, in the data decompression device that decodes the code coded according to the history that has appeared in the past, the prefix data holding means 200 Holds the decoded n pieces of data, the history holding means 201 holds the combination of the decoded data and the context, and the code tree holding means 207 registers in advance an escape code defined as data indicating that the data has not been registered. Holds the code tree that was created.

【０１６９】そして、符号木決定手段２０３が、前置デ
ータ保持手段２００に保持されている文脈からデータを
復号するための符号木を決定し、復号手段２０４が、符
号に従って符号木決定手段２０３で選択した符号木の頂
点を意味するルートから分岐点としてのノードを走査し
て到達したデータ格納点としてのリーフに格納されてい
るデータを出力する。Then, the code tree determining means 203 determines the code tree for decoding the data from the context held in the prefix data holding means 200, and the decoding means 204 causes the code tree determining means 203 to follow the code. The node stored as a branch point is scanned from the root representing the apex of the selected code tree, and the data stored in the leaf as the data storage point reached is output.

【０１７０】さらに、符号長変更手段２０５が、復号し
たリーフと他のリーフあるいはノードとを組み替え、文
脈変更手段２０８が、出力したデータがエスケープコー
ドであったときデータを棄却し文脈を短くする。そし
て、前置データ更新手段２０６が、復号したデータを前
置データ保持手段２００に登録し、履歴登録手段２１１
が、データの復号処理でエスケープコードを最後に復号
した時の文脈と復号したデータとを履歴保持手段２０１
に登録し、符号登録手段２１２が、データの復号処理で
最後にエスケープコードを復号した時の文脈に対応した
符号木にデータの符号を登録する。Further, the code length changing unit 205 rearranges the decoded leaf and another leaf or node, and the context changing unit 208 rejects the data when the output data is an escape code and shortens the context. Then, the prefix data updating unit 206 registers the decrypted data in the prefix data holding unit 200, and the history registration unit 211.
However, the history holding means 201 indicates the context when the escape code was last decoded in the data decoding process and the decoded data.
The code registration means 212 registers the code of the data in the code tree corresponding to the context when the escape code was finally decoded in the data decoding process.

【０１７１】さらに、制御手段２１３が、エスケープコ
ードを復号した時は文脈変更手段２０８で文脈を再設定
し、エスケープコード以外が復号されるまで処理を繰り
返す。Further, when the control means 213 decodes the escape code, the context changing means 208 resets the context, and repeats the process until a code other than the escape code is decoded.

【０１７２】従って、上述した関連技術２に係るデータ
圧縮方法によれば、入力データの出現頻度を求めて確率
モデルを構築して各入力データに符号を割り当て符号表
を作成し、この符号表から符号化する文字の符号を出力
するという２段階の処理を同時に行なうことができ、こ
れにより圧縮処理の速度が大幅に向上するという効果が
ある。また、データが入力されるごとに既に構築されて
いる確率モデルを再構築するという膨大な演算処理を省
くことができ、これにより圧縮処理の速度がさらに向上
する効果もある。さらに、過去に出現した入力データと
同じデータが繰り返し出現するほど、そのデータの符号
を少ないビット数で表すことができ、これによりデータ
圧縮における圧縮効果が大幅に向上する効果もある。Therefore, according to the data compression method according to the related technique 2 described above, the probability model is obtained by calculating the appearance frequency of the input data, a code is assigned to each input data, a code table is created, and from this code table The two-stage processing of outputting the code of the character to be encoded can be performed at the same time, which has the effect of significantly improving the speed of the compression processing. Further, it is possible to omit an enormous amount of arithmetic processing of reconstructing a stochastic model that has already been constructed each time data is input, which has the effect of further increasing the speed of compression processing. Furthermore, as the same data as the input data that has appeared in the past appears repeatedly, the code of the data can be represented by a smaller number of bits, which also has the effect of significantly improving the compression effect in data compression.

【０１７３】また、入力データと文脈との組み合わせが
文脈収集過程の履歴に保持されていない組み合わせであ
ったとき、エスケープコードを出力し、文脈収集過程に
保持されている組み合わせが得られるまで、データの文
脈を短くする処理を繰り返すので、上述した効果に加え
て、入力データと文脈との組み合わせの履歴の全てを予
め登録しておかなくてもよく、これによりデータ圧縮の
処理速度が大幅に向上する効果がある。さらに、入力デ
ータと文脈との組み合わせが得られるまでの時間を短縮
することができ、これによりデータ圧縮の処理速度が大
幅に向上する効果もある。When the combination of the input data and the context is a combination which is not held in the history of the context collecting process, the escape code is output, and the data held until the combination held in the context collecting process is obtained. Since the process of shortening the context of is repeated, in addition to the effects described above, it is not necessary to register all the history of combinations of input data and context in advance, which significantly improves the processing speed of data compression. Has the effect of Furthermore, it is possible to shorten the time until the combination of the input data and the context is obtained, which has the effect of significantly improving the processing speed of data compression.

【０１７４】また、過去に予め登録されていなかった入
力データを新規に登録してゆくことができるとともにこ
の新規に登録したデータも次の符号化処理においては早
い段階で符号化することができ、これにより符号化処理
が進むほどデータの圧縮効果が大幅に向上する効果があ
る。Further, it is possible to newly register input data that has not been previously registered in the past, and to code this newly registered data at an early stage in the next encoding process. As a result, the data compression effect is significantly improved as the encoding process progresses.

【０１７５】さらに、文脈新規登録過程及び符号木新規
登録過程においては、履歴にあると判断された直前の文
脈とデータとの組み合わせのみを登録するので、過去の
入力データの履歴にないと判断された文脈とデータとの
組み合わせを全て登録する必要がなく、これによりデー
タの圧縮処理がさらに大幅に向上する効果がある。さら
に、実際に出現頻度が高いデータについてのみ符号をも
たせる（登録する）ことができ、これによりデータの圧
縮効率が大幅に向上する効果もある。Further, in the context new registration process and the code tree new registration process, since only the combination of the context and the data immediately before judged to be in the history is registered, it is judged that it is not in the history of past input data. It is not necessary to register all the combinations of contexts and data, which has the effect of significantly improving data compression processing. Further, it is possible to give (register) a code only to the data having a high appearance frequency, which has the effect of significantly improving the data compression efficiency.

【０１７６】一方、上述した関連技術２に係るデータ復
元方法によれば、入力データの出現頻度を求めて確率モ
デルを構築して各入力データに符号を割り当て符号表を
作成し、この符号表から復号する文字を出力するという
２段階の処理を同時に行なうことができ、これによりデ
ータの復元処理の速度が大幅に向上するという効果があ
る。また、データが入力されるごとに既に構築されてい
る確率モデルを再構築するという膨大な演算処理を省く
ことができ、これにより復元処理の速度がさらに向上す
る効果もある。さらに、過去に出現した入力データの符
号と同じデータの符号が繰り返し出現するほど、そのデ
ータの符号を少ないビット数で表すことができ、これに
よりデータ復元における復元効果が大幅に向上する効果
もある。On the other hand, according to the data restoration method according to the related technique 2 described above, the probability model is obtained by calculating the appearance frequency of the input data, a code is assigned to each input data, a code table is created, and from this code table It is possible to perform the two-step processing of outputting the characters to be decoded at the same time, which has the effect of significantly improving the speed of the data restoration processing. In addition, it is possible to omit an enormous amount of arithmetic processing of reconstructing a stochastic model that has already been constructed each time data is input, and this also has the effect of further increasing the speed of restoration processing. Further, as the code of the same data as the code of the input data that has appeared in the past repeatedly appears, the code of the data can be represented by a smaller number of bits, which also has the effect of significantly improving the restoration effect in the data restoration. .

【０１７７】さらに、符号木にはそれぞれの文脈に応じ
た符号木毎に予めデータ未登録を示すデータとして定義
されるエスケープコードを登録し、復号時にエスケープ
コードを復号した場合、エスケープコード以外が復号さ
れるまで、文脈の長さを短くする処理を繰り返すので、
復号データと文脈との組み合わせの履歴の全てを予め登
録しておかなくてもよく、これによりデータ復元の処理
速度が大幅に向上する効果がある。さらに、復号データ
と文脈との組み合わせが得られるまでの時間を短縮する
ことができ、これによりデータ復元の処理速度が大幅に
向上する効果もある。Further, in the code tree, an escape code defined as data indicating that data has not been registered is registered in advance for each code tree according to each context, and when the escape code is decoded at the time of decoding, the code other than the escape code is decoded. Until it is done, the process of shortening the length of the context is repeated, so
It is not necessary to register all the history of the combination of the decrypted data and the context in advance, which has the effect of significantly improving the processing speed of data restoration. Furthermore, it is possible to shorten the time until the combination of the decoded data and the context is obtained, and this also has the effect of significantly improving the processing speed of data restoration.

【０１７８】また、データ未登録を示すデータとして定
義されるエスケープコードを復号したとき、文脈新規登
録過程及び符号木新規登録過程を実行して、エスケープ
コード以外が復号されるまで、文脈の長さを短くする処
理を繰り返すので、過去に予め登録されていなかった復
号データを新規に登録してゆくことができるとともにこ
の新規に登録した復号データも次の復号処理においては
早い段階で復号することができ、これにより復号処理が
進むほどデータの復元効果が大幅に向上する効果があ
る。When an escape code defined as data indicating unregistered data is decoded, the context new registration process and the code tree new registration process are executed, and the length of the context is increased until a part other than the escape code is decoded. By repeating the process of shortening the time, it is possible to newly register the decoded data that was not registered in advance in the past, and this newly registered decoded data can be decoded at an early stage in the next decoding process. As a result, as the decoding process progresses, the data restoration effect is significantly improved.

【０１７９】さらに、データ未登録を示すデータとして
定義されるエスケープコード以外が復号されるまでの処
理において、エスケープコードを一つでも復号した時、
エスケープコード以外を復号した直前の文脈においての
み、文脈新規登録過程および符号木新規登録過程での各
新規登録処理を行なうので、過去の入力データの履歴に
ないと判断された文脈とデータとの組み合わせを全て登
録する必要がなく、これによりデータ復元の処理速度が
さらに大幅に向上する効果がある。さらに、実際に出現
頻度が高いデータについてのみ符号をもたせる（登録す
る）ことができ、これによりデータの復元効率が大幅に
向上する効果もある。Furthermore, when even one escape code is decoded in the process until the escape code other than the escape code defined as the data indicating unregistered data is decoded,
Since each new registration process in the context new registration process and the code tree new registration process is performed only in the context immediately before decoding the code other than the escape code, the combination of the context and the data determined not to be in the history of past input data It is not necessary to register all of them, and this has the effect of significantly improving the processing speed of data restoration. Further, it is possible to give (register) a code only to the data having a high appearance frequency, which has the effect of significantly improving the data restoration efficiency.

【０１８０】また、関連技術２に係る他のデータ圧縮装
置によれば、入力データの出現頻度を求めて確率モデル
を構築して各入力データに符号を割り当て符号表を作成
し、この符号表から符号化するデータの符号を出力する
という２段階の処理を同時に行なうことができ、これに
よりデータ圧縮の処理速度が大幅に向上するという効果
がある。また、データが入力されるごとに既に構築され
ている確率モデルを再構築するという膨大な演算処理を
省くことができ、これによりデータ圧縮の処理速度がさ
らに向上する効果もある。また、過去に符号化したデー
タと同じデータが出現する毎に、符号化したリーフと他
のリーフあるいはノードとを組み替えて符号長を変更す
ることができるので、同じデータが繰り返し出現するほ
ど、そのデータの符号を少ないビット数で表すことがで
き、これによりデータの圧縮効果が大幅に向上する効果
もある。Further, according to another data compression apparatus of Related Technique 2, the probability model is obtained by calculating the appearance frequency of input data, a code is assigned to each input data, a code table is created, and from this code table It is possible to perform the two-stage processing of outputting the code of the data to be encoded at the same time, which has the effect of significantly improving the processing speed of data compression. Further, it is possible to omit an enormous amount of arithmetic processing of reconstructing a stochastic model that has already been constructed each time data is input, which also has the effect of further improving the processing speed of data compression. Also, every time the same data as the coded data in the past appears, the coded leaf and another leaf or node can be recombined to change the code length. The code of data can be represented by a small number of bits, which also has the effect of significantly improving the data compression effect.

【０１８１】さらに、入力データと文脈との組み合わせ
の履歴の全てを予め登録しておかなくてもよいので、デ
ータ圧縮の処理速度が大幅に向上するとともに、文脈の
登録に使用するメモリを大幅に削減できるのでデータ圧
縮装置の処理負荷も大幅に軽減できる効果がある。Further, since it is not necessary to previously register all the history of combinations of input data and contexts, the processing speed of data compression is significantly improved, and the memory used for registration of contexts is significantly increased. Since this can be reduced, the processing load of the data compression device can be significantly reduced.

【０１８２】また、過去に予め登録されていなかった入
力データを新規に登録してゆくことができるとともに、
この新規に登録したデータも次の符号化処理においては
早い段階で符号化することができ、これにより符号化処
理が進むほどデータの圧縮効果が大幅に向上するととも
にデータ圧縮装置の処理負荷も大幅に軽減できる効果が
ある。Further, it is possible to newly register input data which has not been previously registered,
This newly registered data can also be encoded at an early stage in the next encoding process, and as the encoding process progresses, the data compression effect is significantly improved and the processing load on the data compression device is also significantly increased. There is an effect that can be reduced.

【０１８３】さらに、過去の入力データの履歴にないと
判断された文脈とデータとの組み合わせを全て登録する
必要がなく、これによりデータの圧縮処理がさらに大幅
に向上する効果がある。さらに、実際に出現頻度が高い
データについてのみ符号をもたせる（登録する）ことが
でき、これによりデータの圧縮効率が大幅に向上する効
果もある。そして、以上のような効果により、データ圧
縮装置の性能が飛躍的に向上する効果がある。Furthermore, it is not necessary to register all the combinations of contexts and data that have been determined not to be in the history of input data in the past, which has the effect of significantly improving the data compression processing. Further, it is possible to give (register) a code only to the data having a high appearance frequency, which has the effect of significantly improving the data compression efficiency. The above-described effects have the effect of dramatically improving the performance of the data compression apparatus.

【０１８４】また、関連技術２に係るデータ復元装置に
よれば、入力データの出現頻度を求めて確率モデルを構
築して各入力データに符号を割り当て符号表を作成し、
この符号表から復号する文字を出力するという２段階の
処理を同時に行なうことができ、これによりデータの復
元処理の速度が大幅に向上するという効果がある。ま
た、データが入力されるごとに既に構築されている確率
モデルを再構築するという膨大な演算処理を省くことが
でき、これにより復元処理の速度がさらに向上する効果
もある。さらに、過去に出現した入力データの符号と同
じデータの符号が繰り返し出現するほど、そのデータの
符号を少ないビット数で表すことができ、これによりデ
ータ復元における復元効果が大幅に向上する効果もあ
る。そして、以上のような効果により、データ復元装置
の性能が飛躍的に向上する効果がある。Further, according to the data restoration device of the related technique 2, the appearance frequency of the input data is obtained, the stochastic model is constructed, the codes are assigned to the respective input data, and the code table is created.
The two-step process of outputting the character to be decoded from this code table can be performed at the same time, which has the effect of significantly improving the speed of the data restoration process. In addition, it is possible to omit an enormous amount of arithmetic processing of reconstructing a stochastic model that has already been constructed each time data is input, and this also has the effect of further increasing the speed of restoration processing. Further, as the code of the same data as the code of the input data that has appeared in the past repeatedly appears, the code of the data can be represented by a smaller number of bits, which also has the effect of significantly improving the restoration effect in the data restoration. . The above-described effects have the effect of dramatically improving the performance of the data restoration device.

【０１８５】さらに、復号データと文脈との組み合わせ
の履歴の全てを予め登録しておかなくてもよく、これに
よりデータ復元の処理速度が大幅に向上する効果があ
る。さらに、復号データと文脈との組み合わせが得られ
るまでの時間を短縮することができ、これによりデータ
復元の処理速度が大幅に向上するとともにデータ復元装
置の性能も大幅に向上する効果もある。Further, it is not necessary to register all the history of the combination of the decrypted data and the context in advance, which has the effect of significantly improving the processing speed of data restoration. Furthermore, it is possible to shorten the time until the combination of the decoded data and the context is obtained, which has the effect of significantly improving the processing speed of data recovery and also the performance of the data recovery device.

【０１８６】また、過去に予め登録されていなかった復
号データを新規に登録してゆくことができるとともにこ
の新規に登録した復号データも次の復号処理においては
早い段階で復号することができ、これにより復号処理が
進むほどデータの復元効果が大幅に向上するとともに、
データ復元装置の性能も大幅に向上する効果がある。Further, it is possible to newly register the decoded data which has not been registered in advance in the past, and also the newly registered decoded data can be decoded at an early stage in the next decoding process. As the decoding process progresses, the data restoration effect improves significantly, and
This also has the effect of significantly improving the performance of the data restoration device.

【０１８７】さらに、過去の入力データの履歴にないと
判断された文脈とデータとの組み合わせを全て登録する
必要がなく、これによりデータ復元の処理がさらに大幅
に向上する効果がある。さらに、実際に出現頻度が高い
データについてのみ符号をもたせる（登録する）ことが
でき、これによりデータの復元効率が大幅に向上すると
ともにデータ復元装置の性能も大幅に向上する効果があ
る。次に、関連技術２について、より具体的に説明す
る。図３２は、関連技術２のデータ圧縮方法及びデータ
復元方法を実施するためのデータ圧縮装置及びデータ復
元装置の構成例を示すブロック図であるが、この図３２
において、データ圧縮装置３は、入力されたデータを過
去に出現した履歴に応じて符号化して圧縮するものであ
り、データ復元装置４は、データ圧縮装置３が符号化し
た符号を復号するものである。Furthermore, it is not necessary to register all the combinations of contexts and data that have been determined not to be in the history of input data in the past, and this has the effect of greatly improving the data restoration processing. Furthermore, it is possible to give (register) a code only to the data that actually appears frequently, which has the effect of greatly improving the data recovery efficiency and also the performance of the data recovery device. Next, the related technique 2 will be described more specifically. 32 is a block diagram showing a configuration example of a data compression apparatus and a data decompression apparatus for implementing the data compression method and the data decompression method of Related Technique 2.
In the data compression device 3, the data compression device 3 encodes and compresses the input data according to the history of past appearances, and the data decompression device 4 decodes the code encoded by the data compression device 3. is there.

【０１８８】以後、データ圧縮装置３を符号化側、デー
タ復元装置４を復元側として、以下に説明する。なお、
以下の説明中、文脈木および符号木は、関連技術１にて
前述した構成をもつものである。（１）符号化側の説明図３３は、上述のデータ圧縮装置３の内部の構成例を示
すブロック図であり、この図３３に示すように、１００
Ａ−１〜１００Ａ−ｎ（ｎは自然数）は前置データ保持
部、１０１Ａは文脈履歴保持部、１０２Ａは符号木保持
部、１０３Ａは符号木決定部、１０４Ａは符号化部、１
０５Ａは符号木更新部、１０６Ａは文脈更新部である。Hereinafter, description will be given below with the data compression device 3 as the encoding side and the data decompression device 4 as the decompression side. In addition,
In the following description, the context tree and the code tree have the configurations described in Related Art 1. (1) Description of Encoding Side FIG. 33 is a block diagram showing an example of the internal configuration of the data compression device 3 described above. As shown in FIG.
A-1 to 100A-n (n is a natural number) is a prefix data holding unit, 101A is a context history holding unit, 102A is a code tree holding unit, 103A is a code tree determining unit, 104A is a coding unit, 1
Reference numeral 05A is a code tree updating unit, and 106A is a context updating unit.

【０１８９】ここで、前置データ保持部（前置データ保
持手段）１００Ａ−１〜１００Ａ−ｎは、入力されたデ
ータＫ（以下、事象Ｋということがある）の直前までに
入力されたｎ個のデータからなる文脈を保持するもので
ある。また、文脈履歴保持部（履歴保持手段）１０１Ａ
は、入力されたデータＫと文脈との組み合わせを保持す
るものであり、符号木保持部（符号木保持手段）１０２
Ａは、文脈毎に独立した符号木を保持するものであり、
符号木決定部（符号木決定手段）１０３Ａは、前置デー
タ保持部１００Ａ−１〜１００Ａ−ｎに保持されている
直前までのデータから符号木を決定するものである。Here, the prefix data holding units (prefix data holding means) 100A-1 to 100A-n are input n just before the input data K (hereinafter sometimes referred to as event K). It holds the context of individual data. A context history holding unit (history holding means) 101A
Holds a combination of the input data K and the context, and is a code tree holding unit (code tree holding means) 102.
A holds an independent code tree for each context,
The code tree determination unit (code tree determination unit) 103A determines a code tree from the data held immediately before held in the prefix data holding units 100A-1 to 100A-n.

【０１９０】さらに、符号化部（符号出力手段）１０４
Ａは、データＫを符号化して、符号木決定部１０３Ａで
選択した符号木のルート（符号木の頂点）からデータＫ
が格納されているリーフに沿って途中に位置するノード
（分岐点）からの分岐に従って符号化したデータＫを出
力するものである。また、符号木更新部（符号長変更手
段）１０５Ａは、符号化したリーフと他のリーフあるい
はノードとを組み替えるものであり、文脈更新部（前置
データ更新手段）１０６Ａは、データＫを前置データ保
持部１００Ａ−１〜１００Ａ−ｎに登録するものであ
る。Further, the coding section (code output means) 104
A encodes the data K and outputs the data K from the root of the code tree (the vertex of the code tree) selected by the code tree determination unit 103A.
Is output according to a branch from a node (branch point) located midway along the leaf in which is stored. The code tree updating unit (code length changing unit) 105A rearranges the coded leaf and another leaf or node, and the context updating unit (prefix data updating unit) 106A prefixes the data K. The data is stored in the data holding units 100A-1 to 100A-n.

【０１９１】さらに、図３４は上述の符号化部１０４Ａ
の内部の構成例を示すブロック図であり、この図３４に
示すように、上述のようにノードからの分岐に従って符
号化したデータＫを出力するために、符号化部１０４Ａ
には、上位ノード判別部４１，ノード番号管理部（メモ
リ）４２，位置判別部４３，ラッチ４４，スタック４５
が設けられている。Further, FIG. 34 shows the above-mentioned encoding unit 104A.
FIG. 35 is a block diagram showing an example of the internal configuration of the encoding unit 104A for outputting the data K encoded according to the branch from the node as described above, as shown in FIG.
Includes an upper node discriminating unit 41, a node number managing unit (memory) 42, a position discriminating unit 43, a latch 44, and a stack 45.
Is provided.

【０１９２】ここで、上位ノード判別部４１は、符号木
のルートのノード番号と文脈木のリーフのノード番号と
から上位ノードのノード番号を得るものであり、ノード
番号管理部（メモリ）４２は、文脈木と符号木のノード
番号を管理するものであり、位置判別部４３は、ノード
の分岐状態を判別するものである。さらに、ラッチ４４
は、リーフのノード番号を一旦保持するものであり、ス
タック４５は、位置判別部４３から出力されるデータＫ
の符号を一旦保持して、終了信号を受信すると保持して
おいた符号を順次出力するものである。Here, the upper node discriminating unit 41 obtains the node number of the upper node from the node number of the root of the code tree and the node number of the leaf of the context tree, and the node number managing unit (memory) 42 The node number of the context tree and the code tree is managed, and the position discriminating unit 43 discriminates the branching state of the node. In addition, the latch 44
Temporarily holds the node number of the leaf, and the stack 45 uses the data K output from the position determination unit 43.
The code is held once, and when the end signal is received, the held code is sequentially output.

【０１９３】上述の構成により、図３３に示すデータ圧
縮装置では、前置データ保持部１００Ａ−１〜１００Ａ
−ｎが、入力データＫの直前までに入力されたｎ個の入
力データからなる文脈を保持し、文脈履歴保持部１０１
Ａが、入力データと文脈との組み合わせを保持し、符号
木保持部１０２Ａが、文脈毎に独立した符号木を保持す
る。With the configuration described above, in the data compression apparatus shown in FIG. 33, the pre-data holding units 100A-1 to 100A are provided.
-N holds the context consisting of n pieces of input data input up to immediately before the input data K, and the context history holding unit 101
A holds a combination of the input data and the context, and the code tree holding unit 102A holds an independent code tree for each context.

【０１９４】さらに、符号木決定部１０３Ａが、前置デ
ータ保持部１００Ａ−１〜１００Ａ−ｎに保持されてい
る直前までの入力データからデータの符号木を決定し、
符号化部１０４Ａが、符号木決定部１０３Ａで選択した
符号木のルート（頂点）からデータＫが格納されている
リーフに沿って途中に位置するノードからの分岐に従っ
て“０”か“１”で表される符号（固有のデータ）を出
力する。Further, the code tree determination unit 103A determines the code tree of the data from the input data up to immediately before held in the prefix data holding units 100A-1 to 100A-n,
The encoding unit 104A selects "0" or "1" according to a branch from a node located along the leaf in which the data K is stored from the root (vertex) of the code tree selected by the code tree determination unit 103A. The code (unique data) represented is output.

【０１９５】また、符号長変更手段としての符号木更新
部１０５Ａが、符号化したリーフと他のリーフあるいは
ノードとを組み替え、前置データ更新部１０６Ａが、デ
ータＫを前置データ保持手段１００Ａ−１に登録する。
ここで、上述の動作について、図３５に示すフローチャ
ートの処理ステップＡ１〜Ａ６を参照しながら、さらに
詳述する。Further, the code tree updating unit 105A as the code length changing unit rearranges the coded leaf and another leaf or node, and the prefix data updating unit 106A stores the data K in the prefix data holding unit 100A-. Register to 1.
Here, the above operation will be described in more detail with reference to the processing steps A1 to A6 of the flowchart shown in FIG.

【０１９６】まず、前置データ保持部１００Ａ−１〜１
００Ａ−ｎに保持されている文脈文字列Ｐを初期化し
（ステップＡ１）、符号化するデータＫを入力する（ス
テップＡ２）。符号木決定部１０３Ａは前置データ保持
部１００Ａ−１〜１００Ａ−ｎに保持されている文脈の
履歴を保持している文脈履歴保持部１０１Ａから文脈Ｐ
に対応した符号木を決定し、決定した符号木と文脈履歴
保持部１０１Ａの情報から、データＫが保持されている
リーフのノード番号（ＩＤ）と文脈Ｐのルートのノード
番号（ＩＤ）を符号化部１０４Ａに送り、符号化部１０
４Ａは、文脈Ｐに対応した符号木内において、事象Ｋ
（データＫ）のリーフからルートへのノードの分岐に対
応した符号を出力する（ステップＡ３）。なお、この符
号化部１０４Ａが行なう処理については、図３６を用い
て後に詳述する。First, the front-end data holding units 100A-1 to 100-1
The context character string P held in 00A-n is initialized (step A1), and the data K to be encoded is input (step A2). The code tree determination unit 103A receives the context P from the context history holding unit 101A that holds the history of contexts held in the prefix data holding units 100A-1 to 100A-n.
Determines the code tree corresponding to, and from the information of the determined code tree and the context history holding unit 101A, codes the leaf node number (ID) holding the data K and the root node number (ID) of the context P. And sends it to the encoding unit 104A.
4A is the event K in the code tree corresponding to the context P.
The code corresponding to the branch of the node from the leaf of (data K) to the root is output (step A3). The processing performed by the encoding unit 104A will be described later in detail with reference to FIG.

【０１９７】そして、符号化部１０４Ａにおいての符号
化後、符号木更新部１０５Ａは、符号木の事象Ｋのリー
フを他のリーフあるいはノードと組み替え（ステップＡ
４）、元の符号木保持部１０２Ａに格納することで符号
木の更新を行なう。なお、この符号木更新部１０５Ａが
行なう処理については、図３７を用いて後に詳述する。After the coding in the coding unit 104A, the coding tree updating unit 105A rearranges the leaf of the event K of the coding tree with another leaf or node (step A
4) The code tree is updated by storing it in the original code tree holding unit 102A. The process performed by the code tree updating unit 105A will be described later in detail with reference to FIG.

【０１９８】さらに、文脈更新部１０６Ａは、最も古い
データ（前置データ保持部１００−ｎに保持されている
データ）を棄却し、入力データＫを文脈として前置デー
タ保持部１００Ａ−１に登録することで、文脈文字列Ｐ
を更新する（ステップＡ５）。そして、全てのデータに
ついて符号化が終了したかをチェックし（ステップＡ
６）、終了していなければステップＡ２からの処理を繰
り返し、終了していれば符号化処理を終了する（ステッ
プＡ６のＹＥＳルート）。Further, the context updating unit 106A rejects the oldest data (data held in the prefix data holding unit 100-n) and registers the input data K as the context in the prefix data holding unit 100A-1. The context string P
Is updated (step A5). Then, it is checked whether all the data have been encoded (step A
6) If not completed, the process from step A2 is repeated, and if completed, the encoding process is completed (YES route of step A6).

【０１９９】なお、上述のノードの組み変え（ステップ
Ａ４）と文脈更新（ステップＡ５）の処理は、どちらを
先にしてもよく、また、並列に処理してもよい。次に、
処理ステップＡ３で述べたように、図３４にて上述した
構成をもつ符号化部１０４Ａが行なう符号化処理につい
て、図３６の処理ステップＢ１〜Ｂ８を参照しながら説
明する。[0199] It should be noted that the processing of changing the combination of nodes (step A4) and the processing of updating context (step A5) may be performed first, or may be performed in parallel. next,
As described in the processing step A3, the coding processing performed by the coding unit 104A having the configuration described above with reference to FIG. 34 will be described with reference to processing steps B1 to B8 in FIG.

【０２００】まず、スタック４５（ｐｕｓｈ−ｄｏｗｎ
ｓｔａｃｋ）を初期化し（ステップＢ１）、カレント
ノードＬのアドレスポインタを、データＫが格納されて
いる文脈Ｐの符号木内のリーフにセットする（ステップ
Ｂ２）。そして、上述のステップＡ３で送られてきた、
データＫが保持されているリーフのノード番号（ＩＤ）
を、ラッチ４４から位置判別部４３に送り、位置判別部
４３は、この受け取ったノード番号のノードが上位ノー
ドのどちらに位置するかの情報をノード番号管理部４２
から手得し、この情報から受け取ったノードが上位ノー
ドの右手に位置するかを判別する（ステップＢ３）。First, the stack 45 (push-down
stack) is initialized (step B1), and the address pointer of the current node L is set to the leaf in the code tree of the context P in which the data K is stored (step B2). And, it was sent in the above step A3,
Node number (ID) of the leaf that holds data K
Is transmitted from the latch 44 to the position discriminating unit 43, and the position discriminating unit 43 provides the node number management unit 42 with information on which of the upper nodes the node having the received node number is located.
Then, it is determined whether the node received from this information is located on the right hand side of the upper node (step B3).

【０２０１】右手に位置する場合は“１”をスタック４
５にＰｕｓｈ（出力）し（ステップＢ３のＹＥＳルート
からステップＢ４）、左手に位置する場合は、“０”を
スタック４５にＰｕｓｈ（出力）する（ステップＢ３の
ＮＯルートからステップＢ５）。さらに、上述のステッ
プＡ３で送られてきた、もう１つのノード番号である文
脈Ｐのルートのノード番号（ＩＤ）を、上位ノード判別
部４１に送り、上位ノード判別部４１は、受け取ったノ
ード（あるいはリーフ）がルートであるか否かを判別す
る（ステップＢ６）。If it is on the right side, stack "1" 4
Push (output) to 5 (from YES route of step B3 to step B4), and push (output) “0” to the stack 45 when located on the left hand (from NO route of step B3 to step B5). Further, the node number (ID) of the root of the context P that is the other node number sent in step A3 described above is sent to the upper node discriminating unit 41, and the upper node discriminating unit 41 receives the received node ( Alternatively, it is determined whether or not the leaf is the root (step B6).

【０２０２】そして、ルートであった場合、終了信号を
出力し（ステップＢ６のＹＥＳルート）、ルートではな
かった場合、ノード番号（ＩＤ）管理部４２にアクセス
し、このノード（あるいはリーフ）の上位のノード
（Ｕ）の番号を手得し、このノード（Ｕ）を新たにカレ
ントノードＬとしてアドレスポインタを上位ノードに移
動させ、ステップＢ３からの処理を繰り返す（ステップ
Ｂ６のＮＯルートからステップＢ７）。If it is the root, an end signal is output (YES route of step B6), and if it is not the root, the node number (ID) management unit 42 is accessed and the upper node of this node (or leaf) is accessed. Of the node (U) of the above, the address pointer is moved to the upper node as a new current node L, and the process from step B3 is repeated (from NO route of step B6 to step B7). .

【０２０３】このようにして、アドレスポインタがルー
トに達するまで処理を繰り返すことで、スタック４５に
は、リーフからルートへの“１”か“０”の数値で表さ
れる「道筋」が記憶される。そして、この「道筋」を、
逆に下位ビットから１ビットづつ出力（ｐｏｐ−ｕｐ出
力）することで、ルートからリーフへの「道筋」が符号
として出力される（ステップＢ８）。In this way, by repeating the processing until the address pointer reaches the root, the stack 45 stores the "path" represented by the numerical value "1" or "0" from the leaf to the root. It And this "route"
On the contrary, by outputting one bit at a time from the lower bit (pop-up output), the "route" from the root to the leaf is output as a code (step B8).

【０２０４】次に、処理ステップＡ４で前述したよう
に、符号木更新部１０５Ａが、事象Ｋのリーフを他のリ
ーフあるいはノードと組み替える処理について、図３７
の処理ステップＣ１〜Ｃ９を参照しながら詳述する。ま
ず、組み替えの対象となるノードＺのアドレスポインタ
をリーフＫにセットし（ステップＣ１）、ノードＵ０に
ノードＺの上位ノードをセットする（ステップＣ２）。Next, as described above in the processing step A4, the processing for the code tree updating unit 105A to rearrange the leaf of the event K with another leaf or node will be described with reference to FIG.
The processing steps C1 to C9 will be described in detail. First, the address pointer of the node Z to be rearranged is set in the leaf K (step C1), and the upper node of the node Z is set in the node U0 (step C2).

【０２０５】そして、Ｋの上位ノードＵ０が符号木のル
ートかどうかを判別し（ステップＣ３）、ルートであれ
ば組み替えを終了するが（ステップＣ３のＹＥＳルート
からステップＣ９）、ルートでなければノードＵ１にノ
ードＵ０の上位ノードをセットし（ステップＣ３のＮＯ
ルートからステップＣ４）、ノードＵ０がノードＵ０の
上位ノードＵ１に対してどちらかに位置しているかを判
別する（ステップＣ５）。Then, it is judged whether or not the upper node U0 of K is the root of the code tree (step C3), and if it is the root, the rearrangement is ended (YES route from step C3 to step C9). Set the upper node of node U0 to U1 (NO in step C3
From the root, step C4), it is determined whether the node U0 is located in one of the upper nodes U1 of the node U0 (step C5).

【０２０６】Ｕ０がＵ１の右手にある場合は、ノードＸ
にノードＵ１の左手に位置するノードをセットし（ステ
ップＣ５のＹＥＳルートからステップＣ６）、ノードＺ
とノードＸとを取り替える（ステップＣ８）。すなわ
ち、ノードＵ１の左手のノードとリーフＫとを組み換え
る。一方、ノードＵ０がＵ１の左手にある場合は、ノー
ドＸにノードＵ１の右手に位置するノードをセットし
（ステップＣ５のＮＯルートからステップＣ７）、ノー
ドＺとノードＸとを取り替える（ステップＣ８）。すな
わち、ノードＵ１の右手のノードとリーフＫとを組み換
える。If U0 is to the right of U1, node X
Set the node located on the left hand side of the node U1 to the node (from the YES route of step C5 to step C6), and set the node Z
And node X are replaced (step C8). That is, the left-hand node of the node U1 and the leaf K are recombined. On the other hand, when the node U0 is on the left side of U1, the node located on the right side of the node U1 is set to the node X (from NO route of step C5 to step C7), and the node Z and the node X are replaced (step C8). . That is, the right-hand node of the node U1 and the leaf K are recombined.

【０２０７】さらに、ノードＺのアドレスポインタをノ
ードＵ１にセットすると（ステップＣ９）、上述のステ
ップＣ２に戻り、ステップＣ３において、セットしたア
ドレスポインタの上位ノードがルート，すなわち、アド
レスポインタがルートの直下のノードと判別されるまで
処理を繰り返す。この処理を行なうことで、アクセスさ
れたリーフのルートからの距離（符号長）は１／２にな
る。Further, when the address pointer of the node Z is set in the node U1 (step C9), the process returns to the above step C2, and in step C3, the upper node of the set address pointer is the root, that is, the address pointer is directly under the root. The process is repeated until the node is determined to be a node. By performing this processing, the distance (code length) from the root of the accessed leaf is halved.

【０２０８】以上の処理を全ての入力文字について繰り
返すことにより、文字列を符号化することができる。こ
のように、関連技術２にかかるデータ圧縮方法を実施す
るためのデータ圧縮装置によれば、符号化する文字を、
木構造の文脈木に番号を付けて登録し、この文脈木に対
応した符号木をスプレイ符号化を施しながら作成・更新
することにより、出現する文字の出現頻度を求めて確率
モデルを構築して各文字に符号を割り当て符号表を作成
し、この符号表から符号化する文字の符号を出力すると
いう２段階の処理を同時に行なうことができるので、圧
縮処理の速度が大幅に向上するという効果がある。A character string can be encoded by repeating the above processing for all input characters. As described above, according to the data compression apparatus for implementing the data compression method according to the related technique 2, the characters to be encoded are
The context tree of the tree structure is numbered and registered, and the code tree corresponding to this context tree is created and updated while performing the spray coding, and the probability model is constructed by obtaining the appearance frequency of the appearing characters. Since a code table is created by assigning a code to each character and the code of the character to be coded is output from this code table, it is possible to perform the two-stage processing at the same time, which has the effect of significantly improving the compression processing speed. is there.

【０２０９】また、上述のように、文字が入力されるご
とに符号木のノードの作成・更新（スプレイ処理）によ
り確率モデルを構築するので、文字が入力されるごとに
既に構築されている確率モデルを再構築するという膨大
な演算処理を省くことができるので、圧縮処理の速度が
さらに向上する効果がある。さらに、過去に圧縮（符号
化）した文字と同じ文字が出現する毎に、過去に登録し
てあった同じ文字の符号木のノードを上位のノードと組
み替えて符号長を１／２にする（スプレイ処理）ことに
より、同じ文字（列）が繰り返し出現するほど、その文
字（列）の符号は少ないビット数で表すことができるの
で、圧縮効果が大幅に向上する効果もある。Further, as described above, since the probability model is constructed by creating / updating the nodes of the code tree (spray processing) each time a character is input, the probability that the character has already been constructed each time a character is input. Since a huge amount of calculation processing of reconstructing the model can be omitted, there is an effect that the compression processing speed is further improved. Further, every time the same character that has been compressed (encoded) appears in the past, the node of the code tree of the same character registered in the past is recombined with the upper node to reduce the code length to ½ ( By the spraying process), as the same character (string) appears repeatedly, the code of the character (string) can be represented by a smaller number of bits, so that the compression effect is significantly improved.

【０２１０】（２）復元側の説明図３８は、前述のデータ復元装置４の内部の構成例を示
すブロック図であり、この図３８に示すように、２００
Ａ−１〜２００Ａ−ｎ（ｎは自然数）は前置データ保持
部、２０１Ａは文脈履歴保持部、２０２Ａは符号木保持
部、２０３Ａは符号木決定部、２０４Ａは符号化部、２
０５Ａは符号木更新部、２０６Ａは文脈更新部である。(2) Description of Restoration Side FIG. 38 is a block diagram showing an example of the internal configuration of the data restoration device 4 described above. As shown in FIG.
A-1 to 200A-n (n is a natural number) is a prefix data holding unit, 201A is a context history holding unit, 202A is a code tree holding unit, 203A is a code tree determining unit, 204A is a coding unit, and 2A is a coding tree holding unit.
Reference numeral 05A is a code tree updating unit, and 206A is a context updating unit.

【０２１１】ここで、前置データ保持部（前置データ保
持手段）２００Ａ−１〜２００Ａ−ｎは、過去に復号し
たｎ個のデータを保持するものであり、文脈履歴保持部
（履歴保持手段）２０１Ａは、復号したシンボルと文脈
との組み合わせを保持するものであり、符号木保持部
（符号木保持手段）２０２Ａは、文脈毎に独立した符号
木を保持するものである。Here, the prefix data holding units (prefix data holding means) 200A-1 to 200A-n hold n pieces of data decoded in the past, and the context history holding unit (history holding means). ) 201A holds a combination of the decoded symbol and context, and the code tree holding unit (code tree holding means) 202A holds an independent code tree for each context.

【０２１２】また、符号木決定部（符号木決定手段）２
０３Ａは、前置データ保持部２００Ａ−１〜２００Ａ−
ｎに保持されている文脈からシンボルを復号するための
符号木を決定するものであり、復号部（復号手段）２０
４Ａは、符号に従って符号木決定部２０３Ａで選択した
符号木のルート（符号木の頂点）からノード（分岐点）
を走査して到達したリーフに格納されているシンボルを
出力するものである。Also, the code tree determining unit (code tree determining means) 2
03A is a front data holding unit 200A-1 to 200A-
A code tree for decoding a symbol is determined from the context held in n, and the decoding unit (decoding means) 20
4A is a node (branch point) from the root (code tree vertex) of the code tree selected by the code tree determination unit 203A according to the code.
Is output by scanning the symbols stored in the leaf that has arrived.

【０２１３】さらに、符号木更新部（符号長変更手段）
２０５Ａは、復号したリーフと他のリーフあるいはノー
ドとを組み替えるものであり、文脈更新部（前置データ
更新手段）２０６Ａは、復号したシンボルを前置データ
保持部２００Ａ−１〜２００Ａ−ｎに登録するものであ
る。また、図３９は、上述の復号部２０４Ａの内部の構
成例を示すブロック図であり、この図３９に示すよう
に、符号に従って符号木決定部２０３Ａで選択した符号
木のルートからノードを走査して到達したリーフに格納
されているシンボルを出力するために、復号部２０４Ａ
には、ノード番号管理部（メモリ）４２と，ラッチ４４
と，下位ノード判別部４６と，葉／節判別部４７とが設
けられている。Further, the code tree updating unit (code length changing means)
205A is a combination of the decoded leaf and another leaf or node, and the context updating unit (prefix data updating unit) 206A registers the decoded symbol in the prefix data holding units 200A-1 to 200A-n. To do. 39 is a block diagram showing an example of the internal configuration of the decoding unit 204A described above. As shown in FIG. 39, the node is scanned from the root of the code tree selected by the code tree determination unit 203A according to the code. To output the symbols stored in the leaf that has reached
Includes a node number management unit (memory) 42 and a latch 44.
A lower node discriminating unit 46 and a leaf / node discriminating unit 47 are provided.

【０２１４】ここで、ノード番号管理部（メモリ）４２
は、符号化側の説明中、図３４にて前述したものと同様
のものであり、文脈木と符号木のノード番号を管理する
ものである。また、下位ノード判別部４６は、符号と符
号木のルートのノード番号およびノード番号管理部４２
の情報から下位ノードのノード番号を得るものであり、
葉／節判別部４７は、下位ノード判別部４６からの情報
とノード番号管理部４２とから下位ノードがリーフかノ
ードかを判別するものであり、ラッチ４８は、ルートの
ノード番号を一旦保持するものである。Here, the node number management unit (memory) 42
Is the same as that described above with reference to FIG. 34 in the description on the encoding side, and manages the node numbers of the context tree and the code tree. The lower node discriminating unit 46 also includes a node number management unit 42 for the node number of the code and the root of the code tree.
The node number of the lower node is obtained from the information of
The leaf / node discriminating unit 47 discriminates whether the lower node is a leaf or a node from the information from the lower node discriminating unit 46 and the node number managing unit 42, and the latch 48 temporarily holds the node number of the root. It is a thing.

【０２１５】そして、上述の構成により、前置データ保
持部２００Ａ−１〜２００Ａ−ｎが、過去に復号したｎ
個の文脈（データ）を保持し、文脈履歴保持部２０１Ａ
が復号したデータＫと文脈との組み合わせを保持し、符
号木保持部２０２Ａが、文脈毎に独立した符号木を保持
する。さらに、符号木決定部２０３Ａが、前置データ保
持部２００Ａ−１〜２００Ａ−ｎに保持されている文脈
からデータＫを復号するための符号木を決定し、復号部
２０４Ａが、符号化されたデータＫの符号に従って符号
木決定部２０３Ａで選択した符号木のルート（頂点）か
らノード（分岐点）を走査して到達したリーフに格納さ
れているデータＫを出力する。With the above-described structure, the front data holding units 200A-1 to 200A-n decode n in the past.
The context history holding unit 201A holds each context (data).
Holds a combination of the data K decoded by and the context, and the code tree holding unit 202A holds an independent code tree for each context. Further, the code tree determination unit 203A determines a code tree for decoding the data K from the context held in the prefix data holding units 200A-1 to 200A-n, and the decoding unit 204A has performed coding. Data K stored in a leaf that has arrived by scanning a node (branch point) from the root (vertex) of the code tree selected by the code tree determination unit 203A according to the code of the data K is output.

【０２１６】また、符号長変更手段としての符号木更新
部２０５Ａが、復号したリーフと他のリーフあるいはノ
ードとを組み替え、前置データ更新部２０６Ａが、復号
したデータＫを最新の文脈として前置データ保持部２０
０Ａ−１に登録する。以下、上述の処理について、図４
０に示すフローチャートの処理ステップＦ１〜Ｆ６を参
照しながら、さらに詳述する。Further, the code tree updating unit 205A as a code length changing unit rearranges the decoded leaf and another leaf or node, and the prefix data updating unit 206A prefixes the decoded data K with the latest context. Data holding unit 20
Register with 0A-1. Hereinafter, regarding the above-mentioned processing, FIG.
Further details will be described with reference to process steps F1 to F6 of the flowchart shown in FIG.

【０２１７】まず、前置データ保持部２００Ａ−１〜２
００Ａ−ｎに保持されているｎ個の文脈文字列Ｐを初期
化し（ステップＦ１）、復号する事象（データ）Ｋを入
力する（ステップＦ２）。符号木決定部２０３Ａは前置
データ保持部２００Ａ−１〜２００Ａ−ｎに保持されて
いる文脈の履歴を保持している文脈履歴保持部２０１Ａ
から文脈Ｐに対応した符号木を決定し、決定した符号木
のルートのノード番号（ＩＤ）と、復号する事象Ｋの符
号とを復号部２０４Ａに送り、復号部２０４Ａは、決定
した符号木内において、送られてきた符号に応じて、ル
ートから事象Ｋが格納されているリーフへ走査して符号
を復号する（ステップＦ３）。なお、この復号部２０４
Ａが行なう処理については、図４１を用いて後に詳述す
る。First, the front data holding sections 200A-1 and 200A-2
The n context character strings P held in 00A-n are initialized (step F1), and the event (data) K to be decoded is input (step F2). The code tree determination unit 203A holds a context history holding unit 201A holding the history of contexts held in the prefix data holding units 200A-1 to 200A-n.
To determine a code tree corresponding to the context P, and send the node number (ID) of the root of the determined code tree and the code of the event K to be decoded to the decoding unit 204A, and the decoding unit 204A in the determined code tree. , And decodes the code by scanning from the root to the leaf in which the event K is stored, according to the code sent (step F3). Note that this decoding unit 204
The process performed by A will be described in detail later with reference to FIG.

【０２１８】そして、復号後、符号木更新部２０５Ａ
は、符号木の復号した事象Ｋのリーフを他のリーフある
いはノードと組み替え（ステップＦ４）、元の符号木保
持部２０２Ａに格納することで符号木の更新を行なう。
なお、この符号木更新部２０５Ａが行なうノードの組み
替え処理は、図３７のフローチャートにて前述した符号
木更新部１０５Ａが行なう処理ステップＣ１〜Ｃ９と同
様にして行なう。Then, after decoding, the code tree updating unit 205A
Updates the code tree by recombining the leaf of the decoded event K with another leaf or node (step F4) and storing it in the original code tree holding unit 202A.
The node rearrangement process performed by the code tree updating unit 205A is performed in the same manner as the processing steps C1 to C9 performed by the code tree updating unit 105A described above with reference to the flowchart of FIG.

【０２１９】さらに、文脈更新部２０６Ａは、最も古い
データ（前置データ保持部２００−ｎに保持されている
データ）を棄却し、復号した事象（データ）Ｋを文脈と
して前置データ保持部２００−１に登録することで、文
脈文字列Ｐを更新する（ステップＦ５）。そして、全て
のデータについて復号が終了したかをチェックし（ステ
ップＦ６）、終了していなければステップＦ２からの処
理を繰り返し（ステップＦ６のＮＯルート）、そうでな
ければ復号処理を終了する（ステップＦ６のＹＥＳルー
ト）。Further, the context updating unit 206A rejects the oldest data (data held in the prefix data holding unit 200-n), and uses the decoded event (data) K as the context to prefix data holding unit 200. The context character string P is updated by registering it in -1 (step F5). Then, it is checked whether the decoding is completed for all the data (step F6). If not completed, the processing from step F2 is repeated (NO route of step F6), and if not, the decoding processing is completed (step F6). F6 YES route).

【０２２０】なお、この場合も符号化側と同様に、上述
のノードの組み変え（ステップＦ４）と文脈更新（ステ
ップＦ５）の処理は、どちらを先にしてもよく、また、
並列に処理してもよい。次に、処理ステップＦ３で述べ
たように、図３９にて上述した構成をもつ復号部２０４
Ａが行なう符号化処理について、図４１に示すフローチ
ャートの処理ステップＧ１〜Ｇ７を参照しながら説明す
る。In this case as well, as in the case of the encoding side, either of the processing of changing the nodes (step F4) and the processing of updating the context (step F5) may be performed first.
You may process in parallel. Next, as described in the processing step F3, the decoding unit 204 having the configuration described above with reference to FIG.
The encoding process performed by A will be described with reference to process steps G1 to G7 in the flowchart shown in FIG.

【０２２１】まず、下位ノード判別部４６は、符号木決
定部２０３Ａから送られてきたルートのノード番号（Ｉ
Ｄ）をノードＺにセットし（ステップＧ１）、ラッチ４
８を介して同じく符号木決定部２０３Ａから送られてき
た、復号する事象Ｋの符号（１ｂｉｔ）をＣにセットす
る（ステップＧ２）。そして、Ｃにセットした復号する
事象Ｋの符号が“１”に一致するかをチェックし（ステ
ップＧ３）、“１”である（Ｙｅｓの）場合はノードＺ
の右手にあるノードをノードＺにセットし（ステップＧ
３のＹＥＳルートからステップＧ４）、“１”でない場
合（すなわち“０”の場合）はノードＺの左手にあるノ
ードをノードＺにセットする（ステップＧ３のＮＯルー
トからステップＧ５）。さらに、下位ノード判別部４６
は、ノードＺにセットしたノードのノード番号をもと
に、ノード番号管理部４２からノードＺのノード番号を
取得し、このノード番号を葉／節判別部４７に送り、こ
の葉／節判別部４７では、送られてきたノード番号をも
つノードＺの位置情報をノード番号管理部４２から手得
し、このノードＺがノードであるかリーフであるかをチ
ェックする（ステップＧ６）。First, the lower node discriminating section 46 determines the node number (I) of the root sent from the code tree determining section 203A.
D) is set to the node Z (step G1), and the latch 4
The code (1 bit) of the event K to be decoded, which is also sent from the code tree determination unit 203A via 8 is set to C (step G2). Then, it is checked whether the code of the event K to be decoded set in C matches "1" (step G3), and if it is "1" (Yes), the node Z is detected.
Set the node on the right hand side of node to node Z (step G
From the YES route of 3 to step G4), if it is not "1" (that is, "0"), the node on the left side of the node Z is set to the node Z (from NO route of step G3 to step G5). Further, the lower node discriminating unit 46
Acquires the node number of the node Z from the node number management unit 42 based on the node number of the node set in the node Z, sends this node number to the leaf / node discrimination unit 47, and this leaf / node discrimination unit At 47, the position information of the node Z having the sent node number is acquired from the node number management unit 42, and it is checked whether this node Z is a node or a leaf (step G6).

【０２２２】ノードＺがリーフでない場合（ステップＧ
６のＮＯルート）、処理はステップＧ２に戻り、復号す
る事象Ｋが格納されているリーフに到達するまで処理を
繰り返す。一方、ノードＺがリーフである場合（ステッ
プＧ６のＹＥＳルート）、復号する事象Ｋが見つかった
ことになるので、葉／節判別部４７は、ノード番号管理
部４２にシンボル（事象）出力信号を送信し、この信号
を受信したノード番号管理部４２が、このリーフに格納
されている事象Ｋ（シンボル）を出力するとともに（ス
テップＧ７）、復号処理の終了信号を出力する。If node Z is not a leaf (step G
No. 6), the process returns to step G2, and the process is repeated until the leaf in which the event K to be decoded is stored is reached. On the other hand, when the node Z is a leaf (YES route in step G6), it means that the event K to be decoded has been found, so the leaf / node determination unit 47 sends the symbol (event) output signal to the node number management unit 42. The node number management unit 42 that has transmitted and received this signal outputs the event K (symbol) stored in this leaf (step G7) and outputs the end signal of the decoding process.

【０２２３】これにより、符号化側で作成された符号木
の“１”か“０”の数値で表される「道筋」を、事象Ｋ
が格納されているリーフまで辿ることにより符号化され
た事象Ｋを復号することができる。このように、関連技
術２にかかるデータ復元方法を実施するためのデータ復
元装置によれば、復号した文字を木構造の文脈木に番号
を付けて登録し、この文脈木に対応した符号木をスプレ
イ処理を施しながら作成・更新することにより、復号す
る文字の符号と一致する符号を符号表から検索し、その
一致した符号に対応して登録されている文字を復号文字
として出力するという２段階の処理を同時に行なうこと
ができるので、データの復元処理の速度が大幅に向上す
るという効果がある。As a result, the "path" represented by the numerical value "1" or "0" of the code tree created on the encoding side is converted into the event K.
The encoded event K can be decoded by tracing to the leaf in which is stored. As described above, according to the data restoration device for carrying out the data restoration method according to the related technique 2, the decoded characters are registered by numbering the context tree of the tree structure, and the code tree corresponding to the context tree is registered. A two-step process in which a code that matches the code of the character to be decoded is searched from the code table by creating and updating while performing the spray process, and the character registered corresponding to the matched code is output as the decoded character. Since the processing can be performed simultaneously, there is an effect that the speed of the data restoration processing is significantly improved.

【０２２４】また、符号化側と同様に、文字が入力され
るごとに符号木のノードの作成・更新（スプレイ処理）
により構築するので、文字が入力されるごとに既に構築
されている確率モデルを再構築するという膨大な演算処
理を行なう必要がなくなり、これによりデータの復元処
理の速度がさらに向上する効果がある。さらに、これも
符号化側と同様に、過去に復号した符号と同じ符号が出
現する毎に、過去に登録してあった同じ符号の符号木の
ノードを上位のノードと組み替えて（スプレイ処理）符
号長を１／２にすることにより、同じ符号を繰り返し復
元するほど、その符号は少ないビット数で表すことがで
きるので、復元効果が大幅に向上する効果もある。（ｂ−１）関連技術２の第１の変形例の説明（１）符号化側の説明図４２は、関連技術２の変形例としてのデータ圧縮装置
３の内部の構成を示すものであり、この図４２に示すよ
うに、本データ圧縮装置３の内部には、図３３にて前述
した構成に加えて、文脈判別部１０８Ａ，文脈変更部１
１０Ａとが設けられており、また、図３３にて前述した
符号化部１０４Ａ，符号木保持部１０５Ａの代わりに、
それぞれ符号化部１０４Ａ′，符号木保持部１０７Ａが
設けられている。As with the encoding side, every time a character is input, a node in the code tree is created / updated (spray processing).
Since it is constructed by, it is not necessary to perform a huge calculation process of reconstructing the already constructed probabilistic model each time a character is input, which has the effect of further improving the speed of the data restoration process. Further, like the encoding side, each time the same code as the code decoded in the past appears, the node of the code tree of the same code registered in the past is recombined with the upper node (spray processing). By reducing the code length to ½, as the same code is repeatedly restored, the code can be represented by a smaller number of bits, which also has the effect of significantly improving the restoration effect. (B-1) Description of First Modification of Related Technology 2 (1) Description of Encoding Side FIG. 42 shows an internal configuration of a data compression device 3 as a modification of Related Technology 2. As shown in FIG. 42, in addition to the configuration described above with reference to FIG. 33, the inside of the data compression apparatus 3 includes a context discriminating unit 108A and a context changing unit 1.
10A is provided, and instead of the encoding unit 104A and the code tree holding unit 105A described above with reference to FIG. 33,
An encoding unit 104A 'and a code tree holding unit 107A are provided respectively.

【０２２５】このため、この図４２中、図３３にて既述
の符号と同じ符号の構成部分の説明は省略し、図３３と
は異なる構成部分についてのみ、以下に説明する。即
ち、符号化部１０４Ａ′は符号出力手段およびエスケー
プコード出力手段として、符号木にシンボルが登録され
ていないときは符号木のルートからエスケープコードが
格納されているリーフまでの途中に位置するノードから
の分岐に従ってエスケープコードを出力し、符号木にシ
ンボルが登録されているときは符号木のルートからシン
ボルのリーフまでの途中に位置するノードからの分岐に
従ってシンボルの符号を出力するものである。Therefore, in FIG. 42, description of components having the same reference numerals as those already described in FIG. 33 is omitted, and only components different from those in FIG. 33 will be described below. That is, the encoding unit 104A 'serves as a code output means and an escape code output means from a node located on the way from the root of the code tree to the leaf where the escape code is stored when the symbol is not registered in the code tree. When the symbol is registered in the code tree, the escape code is output according to the branch of, and the code of the symbol is output according to the branch from the node located midway from the root of the code tree to the leaf of the symbol.

【０２２６】さらに、この符号化部１０４Ａ′は制御手
段として、エスケープコードを符号化したときはデータ
の符号化を行なうまで処理を繰り返すようになってい
る。また、符号木保持部（符号木保持手段）１０７Ａ
は、シンボルが未登録であることを示すエスケープコー
ド（ＥＳＣ）をあらかじめ登録した文脈毎に独立した符
号木を保持するものであり、文脈判別部（文脈判別手
段）１０８Ａは、符号木決定部１０３で決定した符号木
にシンボルが登録されているか否かを判別するものであ
り、文脈変更部（文脈変更手段）１１０Ａは、符号木に
シンボルが登録されていないときに文脈の長さを短くす
るものである。Further, the encoding unit 104A 'serves as a control means, and when the escape code is encoded, the processing is repeated until the data is encoded. A code tree holding unit (code tree holding means) 107A
Holds an independent code tree for each context in which an escape code (ESC) indicating that a symbol has not been registered is registered in advance. The context discriminator (context discriminator) 108A includes a code tree determiner 103. It is for determining whether or not a symbol is registered in the code tree determined in step S1, and the context changing unit (context changing means) 110A shortens the length of the context when no symbol is registered in the code tree. It is a thing.

【０２２７】そして、このような図３３とは異なる構成
により、図４２に示すデータ圧縮装置３では、符号化部
１０４Ａ′が、符号木にシンボルＫが登録されていない
ときは符号木のルートからＥＳＣが格納されているリー
フまでの途中に位置するノードからの分岐に従ってＥＳ
Ｃの符号を出力し、符号木にシンボルＫが登録されてい
るときは符号木のルートからシンボルＫのリーフまでの
途中に位置するノードからの分岐に従ってシンボルＫの
符号を出力する。With such a configuration different from that of FIG. 33, in the data compression apparatus 3 shown in FIG. 42, the coding unit 104A 'starts from the root of the code tree when the symbol K is not registered in the code tree. ES according to the branch from the node located on the way to the leaf where the ESC is stored
The code of C is output, and when the symbol K is registered in the code tree, the code of the symbol K is output according to the branch from the node located on the way from the root of the code tree to the leaf of the symbol K.

【０２２８】また、ＥＳＣを符号化したときはデータＫ
の符号化を行なうまで処理を繰り返す。さらに、符号木
保持部１０７Ａが、シンボルが未登録であることを示す
エスケープコード（ＥＳＣ）をあらかじめ登録した文脈
毎に独立した符号木を保持する。When the ESC is encoded, the data K
The process is repeated until the encoding is performed. Further, the code tree holding unit 107A holds an independent code tree for each context in which an escape code (ESC) indicating that a symbol has not been registered is registered in advance.

【０２２９】以下、上述の処理について、図４３に示す
フローチャートの処理ステップＨ１〜Ｈ１１を参照しな
がら、さらに詳述する。まず、文脈変更部１１０Ａが、
前置データ保持部２００Ａ−１〜２００Ａ−ｎに保持さ
れている全ての文脈から文脈文字列Ｐ₀を初期化し（ス
テップＨ１）、この文脈文字列Ｐ₀を文脈Ｐにセットし
ておき（ステップＨ２）、そして、符号化するデータ
（シンボル）Ｋを入力する（ステップＨ３）。The above process will be described in more detail below with reference to process steps H1 to H11 of the flowchart shown in FIG. First, the context changing unit 110A
The context character string P ₀ is initialized from all the contexts stored in the prefix data storage units 200A-1 to 200A-n (step H1), and this context character string P ₀ is set in the context P (step S1). H2), and the data (symbol) K to be encoded is input (step H3).

【０２３０】さらに、文脈変更部１１０Ａは、文脈Ｐと
データＫの情報を文脈履歴保持部１０１Ａ及び符号木決
定手段１０３Ａへ送り、文脈履歴保持部１０１Ａでは、
文脈変更部１１０Ａから送られてきた文脈Ｐの情報を、
文脈判別部１０８Ａに送る。そして、文脈判別部１０８
Ａは、受信した文脈Ｐの情報からこの文脈Ｐに事象Ｋが
登録されているか否かを判別する（ステップＨ４）。Further, the context changing unit 110A sends the information of the context P and the data K to the context history holding unit 101A and the code tree determining unit 103A, and the context history holding unit 101A
Information of the context P sent from the context changing unit 110A is
It is sent to the context discrimination unit 108A. Then, the context discrimination unit 108
Based on the received information on the context P, A determines whether or not the event K is registered in this context P (step H4).

【０２３１】ここで、文脈Ｐに事象Ｋが登録されている
場合は、文脈履歴保持部１０１Ａが、符号化部１０４
Ａ′にエスケープコード（ＥＳＣ）の符号化を指示し、
符号化部１０４Ａ′では、文脈Ｐに対応した符号木内に
おいて、エスケープコード（ＥＳＣ）のリーフからルー
トへのノードの分岐に対応した符号を出力してエスケー
プコードの符号化を行なう（ステップＨ４のＮＯルート
からステップＨ５）。Here, if the event K is registered in the context P, the context history holding unit 101A causes the encoding unit 104 to operate.
Instruct A'to encode the escape code (ESC),
The encoding unit 104A 'outputs the code corresponding to the branch of the node from the leaf of the escape code (ESC) to the root in the code tree corresponding to the context P to encode the escape code (NO in step H4). From root to step H5).

【０２３２】さらに、符号化部１０４Ａ′は、符号木決
定部１０３Ａを通じて符号木更新部１０５Ａに、符号木
の更新を指示し、符号木更新部１０５Ａは、符号木のＥ
ＳＣのリーフを他のリーフあるいはノードと取り替える
（ステップＨ６）。そして、文脈Ｐの次数（符号木の初
期状態が０次である）を１つ低次に移し（ステップＨ
７）、ステップＨ４に戻り、文脈Ｐに事象Ｋが登録され
ている（Ｙｅｓ）と判断されるまで処理を繰り返す。Further, the coding unit 104A 'instructs the code tree updating unit 105A through the code tree determining unit 103A to update the code tree, and the code tree updating unit 105A causes the code tree E to be updated.
The leaf of SC is replaced with another leaf or node (step H6). Then, the order of the context P (the initial state of the code tree is 0th order) is moved to the next lower level (step H
7) Return to step H4, and repeat the process until it is determined that the event K is registered in the context P (Yes).

【０２３３】一方、上述のステップＨ４で、文脈Ｐに事
象Ｋが登録されている（Ｙｅｓの）場合は、文脈履歴保
持部１０１Ａが、符号化部１０４Ａ′に事象Ｋの符号化
を指示し、符号化部１０４Ａ′では、文脈Ｐに対応した
符号木内において、事象Ｋのリーフからルートへのノー
ドの分岐に対応した符号を出力して符号化を行なう（ス
テップＨ４のＹＥＳルートからステップＨ８）。On the other hand, when the event K is registered in the context P (Yes in step H4), the context history holding unit 101A instructs the encoding unit 104A 'to encode the event K, The coding unit 104A 'outputs the code corresponding to the branch of the node from the leaf of the event K to the root in the code tree corresponding to the context P, and performs coding (from YES root of step H4 to step H8).

【０２３４】そして、符号化部１０４Ａ′は、符号木決
定部１０３Ａを通じて符号木更新部１０５Ａに、符号木
の更新を指示し、符号木更新部１０５Ａは、符号木の事
象Ｋのリーフと他のリーフあるいはノードとを組み替え
る（ステップＨ９）。さらに、文脈更新部１０６Ａが最
も古いデータ（前置データ保持部１００−ｎに保持され
ているデータ）を棄却して入力データＫの文脈を前置デ
ータ保持部１００Ａ−１に登録し、この情報から文脈変
更部１１０Ａが文脈文字列Ｐ₀の更新を行なう（ステッ
プＨ１０）。Then, the coding unit 104A 'instructs the code tree updating unit 105A through the code tree determining unit 103A to update the code tree, and the code tree updating unit 105A and the leaf of the event K of the code tree and other The leaf or node is recombined (step H9). Furthermore, the context updating unit 106A rejects the oldest data (data held in the prefix data holding unit 100-n) and registers the context of the input data K in the prefix data holding unit 100A-1. Then, the context changing unit 110A updates the context character string P ₀ (step H10).

【０２３５】そして、全てのデータについて符号化が終
了したかをチェックし（ステップＨ１１）、終了してい
ない場合は（ステップＨ１１のＮＯルート）、ステップ
Ｈ２に戻り、全てのデータを符号化するまでステップＨ
２以降の処理を繰り返し、終了している場合は（ステッ
プＨ１１のＹＥＳルート）、符号化の全ての処理を終了
する。Then, it is checked whether the coding is completed for all the data (step H11), and if not completed (NO route of step H11), the process returns to step H2 until all the data is coded. Step H
If the process after 2 is repeated and the process is completed (YES route of step H11), all the processes for encoding are completed.

【０２３６】なお、上述のステップＨ５およびステップ
Ｈ８における符号出力処理の詳細については、図３６に
て前述した処理ステップＢ１〜Ｂ８を参照されたく、ス
テップＨ６およびステップＨ９におけるリーフあるいは
ノードの組み替え処理の詳細については、同じく第２実
施形態中、図３７にて上述した処理ステップＣ１〜Ｃ９
を参照されたい。For details of the code output processing in steps H5 and H8, refer to the processing steps B1 to B8 described above with reference to FIG. 36, and the leaf or node rearrangement processing in steps H6 and H9. For details, also in the second embodiment, the processing steps C1 to C9 described above with reference to FIG. 37.
Please refer to.

【０２３７】このように、関連技術２の第１変形例にか
かるデータ圧縮装置によれば、シンボルが登録されてい
ないことを表すエスケープコード（ＥＳＣ）を予め符号
木に登録しておき、符号化するシンボルが予め登録され
ている文脈に含まれない間（シンボルが含まれる文脈を
発見して符号化するまでの間）はこのエスケープコード
の符号を出力することにより、入力データとして現れる
シンボルの組み合わせ（文脈）全てを予め登録しておか
なくてもよいので、文脈の登録に使用するメモリを大幅
に削減できる利点がある。As described above, according to the data compression apparatus in the first modification of the related technique 2, the escape code (ESC) indicating that the symbol is not registered is registered in the code tree in advance and the encoding is performed. The combination of symbols that appear as input data is output by outputting the code of this escape code while the symbol to be used is not included in the pre-registered context (until the context in which the symbol is included is detected and encoded). Since it is not necessary to register all (contexts) in advance, there is an advantage that the memory used for registering contexts can be significantly reduced.

【０２３８】そして、上述のように予め登録されている
文脈に符号化するシンボルが含まれない間に出力するエ
スケープコードの符号長を、スプレイ処理により短く
（１／２）してゆくことにより、シンボルが含まれる文
脈を発見して符号化するまでの時間を短縮することがで
きるので、データ圧縮の処理速度が大幅に向上するとと
もにデータ圧縮装置の処理負荷も大幅に軽減できる効果
がある。Then, as described above, by shortening (1/2) the code length of the escape code output while the symbol to be encoded is not included in the context registered in advance by the spray process, Since it is possible to shorten the time until the context in which the symbol is included is detected and encoded, the processing speed of data compression is significantly improved, and the processing load of the data compression device can be significantly reduced.

【０２３９】（２）復元側の説明図４４は、関連技術２の第１の変形例としてのデータ復
元装置４の内部の構成を示すものであり、この図４４に
示すように、本データ復元装置４の内部には、図３８に
て前述した構成に加えて、文脈変更部２１０Ａが設けら
れており、また、復号部２０４Ａ，符号木保持部２０５
Ａの代わりに、それぞれ復号部２０４Ａ′，符号木保持
部２０７Ａが設けられている。(2) Description of Restoration Side FIG. 44 shows the internal structure of the data restoration device 4 as a first modification of the related technique 2. As shown in FIG. Inside the device 4, a context changing unit 210A is provided in addition to the configuration described above with reference to FIG. 38, and a decoding unit 204A and a code tree holding unit 205 are provided.
Instead of A, a decoding unit 204A 'and a code tree holding unit 207A are provided respectively.

【０２４０】このため、この図４４中、図３８にて既述
の符号と同じ符号の構成部分の説明は省略し、図３８に
示す構成とは異なる構成部分について、以下に説明す
る。即ち、符号木保持部（符号木保持手段）２０７Ａ
は、エスケープコードをあらかじめ登録した符号木を保
持するものであり、文脈変更部（文脈変更手段）２１０
Ａは、出力したデータがエスケープコードであったとき
入力されたデータを棄却し文脈を短くするものである。Therefore, in FIG. 44, description of the components having the same reference numerals as those already described in FIG. 38 is omitted, and the components different from the configuration shown in FIG. 38 will be described below. That is, the code tree holding unit (code tree holding means) 207A
Holds a code tree in which escape codes are registered in advance, and a context changing unit (context changing means) 210
A is to reject the input data and shorten the context when the output data is an escape code.

【０２４１】また、復号部２０４Ａ′は復号手段とし
て、上述の符号化側で符号化された符号に従って、符号
木決定部２０３Ａで選択した符号木のルート（頂点）か
らノード（分岐点）を走査して到達したリーフに格納さ
れているデータを出力するものである。さらに、復号部
２０４Ａ′は制御手段として、エスケープコードを復号
した時は、文脈変更部２１０Ａで上述のように文脈を再
設定し、エスケープコード以外のデータが復号されるま
で処理を繰り返す制御を行なうようになっている。Further, the decoding unit 204A ', as a decoding means, scans a node (branch point) from the root (vertex) of the code tree selected by the code tree determination unit 203A according to the code encoded on the encoding side. Then, the data stored in the leaf that has arrived is output. Further, when the escape code is decoded, the decoding unit 204A 'resets the context as described above when the escape code is decoded, and performs control to repeat the process until data other than the escape code is decoded. It is like this.

【０２４２】そして、上述のような図３８に示す構成と
は異なる構成により、図４４に示すデータ復元装置で
は、符号木保持部２０７ＡがＥＳＣをあらかじめ登録し
た符号木を保持し、文脈変更部２１０Ａが出力したデー
タＫがＥＳＣであったとき入力されたデータＫを棄却し
文脈を短くする。また、復号部２０４Ａ′が、符号化さ
れたデータＫの符号に従って、符号木決定部２０３Ａで
選択した符号木のルートからノードを走査して到達した
リーフに格納されているデータＫを出力し、ＥＳＣを復
号した時は、文脈変更部２１０Ａで文脈の次数を変更
し、ＥＳＣ以外のデータＫが復号されるまで処理を繰り
返す。With the configuration different from the configuration shown in FIG. 38 as described above, in the data restoration device shown in FIG. 44, the code tree holding unit 207A holds the code tree in which ESC is registered in advance, and the context changing unit 210A When the data K output by is ESC, the input data K is rejected and the context is shortened. In addition, the decoding unit 204A ′ outputs the data K stored in the leaf reached by scanning the node from the root of the code tree selected by the code tree determining unit 203A according to the code of the encoded data K, When the ESC is decoded, the context changing unit 210A changes the order of the context and repeats the process until the data K other than the ESC is decoded.

【０２４３】ここで、上述のような動作について、図４
５に示すフローチャートの処理ステップＪ１〜Ｊ９を参
照しながら、さらに詳述する。まず、文脈変更部２１０
Ａが、前置データ保持部２００Ａ−１〜２００Ａ−ｎに
保持されている全ての文脈から文脈文字列Ｐ₀を初期化
し（ステップＪ１）、この文脈文字列Ｐ₀を文脈Ｐにセ
ットしておく（ステップＪ２）。Here, the operation as described above will be described with reference to FIG.
Further details will be described with reference to process steps J1 to J9 of the flowchart shown in FIG. First, the context changing unit 210
A initializes the context character string P ₀ from all the contexts stored in the prefix data storage units 200A-1 to 200A-n (step J1) and sets this context character string P ₀ in the context P. (Step J2).

【０２４４】そして、文脈変更部１１０Ａは、この文脈
Ｐを文脈履歴保持部２０１Ａおよび符号木決定部２０３
Ａに送り、符号木決定部２０３Ａでは、受け取った文脈
Ｐから符号木を選択（決定）し（ステップＪ３）、決定
した符号木を復号部２０４Ａ′に送る。復号部２０４
Ａ′では、決定した符号木内において、符号化側で符号
化された符号に応じてルートからリーフへ走査して符号
を復号する（ステップＪ４）。なお、この復号部２０４
Ａ′が行なう処理の詳細については、図４１にて前述し
た処理ステップＧ１〜Ｇ７を参照されたい。Then, the context changing unit 110A sets the context P to the context history holding unit 201A and the code tree determining unit 203.
The code tree determining unit 203A selects (determines) a code tree from the received context P (step J3), and sends the determined code tree to the decoding unit 204A '. Decoding unit 204
In A ', the code is decoded by scanning from the root to the leaves in the determined code tree according to the code encoded on the encoding side (step J4). Note that this decoding unit 204
For details of the processing performed by A ', refer to the processing steps G1 to G7 described above with reference to FIG.

【０２４５】そして、復号部２０４Ａ′は、復号した事
象ＫがＥＳＣ（エスケープコード）であるか否かをチェ
ックし（ステップＪ５）、復号した事象ＫがＥＳＣであ
る場合は、ＥＳＣ信号を文脈変更部２１０Ａに送信する
（ステップＪ５のＹＥＳルート）。さらに、このＥＳＣ
信号を受信した文脈変更部２１０Ａでは、文脈Ｐの次数
を１つ低次に移して変更し（例えば、前置データ保持部
２００−ｎに保持されている最も古いデータを無視して
ｎ−１次の文脈Ｐを作る）（ステップＪ６）、この文脈
Ｐを文脈履歴保持部２０１及び符号木決定手段２０３へ
送り、処理はステップＪ３に戻る。Then, the decoding section 204A 'checks whether or not the decoded event K is ESC (escape code) (step J5), and if the decoded event K is ESC, changes the context of the ESC signal. It is transmitted to the section 210A (YES route of step J5). Furthermore, this ESC
Upon receiving the signal, the context changing unit 210A changes the order of the context P by moving it to the next lower order (for example, ignoring the oldest data held in the pre-data holding unit 200-n, n-1). The next context P is created) (step J6), this context P is sent to the context history holding unit 201 and the code tree determination means 203, and the process returns to step J3.

【０２４６】すなわち、復号化部２０４Ａ′が、ＥＳＣ
以外の事象Ｋを復号するまでステップＪ２からの処理を
繰り返す。一方、復号した事象ＫがＥＳＣでない場合
は、復号した事象Ｋのリーフを他のリーフあるいはノー
ドと組み替える（ステップＪ５のＮＯルートからステッ
プＪ７）。なお、このリーフあるいはノードの組み替え
処理の詳細については、図３７にて前述したフローチャ
ートにおける処理ステップＣ１〜Ｃ９を参照されたい。That is, the decoding section 204A 'determines that the ESC
The processing from step J2 is repeated until the event K other than is decoded. On the other hand, when the decrypted event K is not ESC, the leaf of the decrypted event K is recombined with another leaf or node (from NO route of step J5 to step J7). For details of this leaf or node rearrangement processing, refer to the processing steps C1 to C9 in the flowchart described above with reference to FIG.

【０２４７】さらに、文脈更新部２０６Ａは、最も古い
データ（前置データ保持部２００−ｎに保持されている
データ）を棄却し、復号した事象Ｋを文脈として前置デ
ータ保持部２００−１に挿入して登録し、文脈文字列Ｐ
₀を更新する（ステップＪ８）。そして、全てのデータ
について復号が終了したかをチェックし（ステップＪ
９）、終了していなければ、ステップＪ２からの処理を
繰り返し（ステップＪ９のＮＯルートからステップＪ
２）、そうでなければ復号処理を終了する（ステップＪ
９のＹＥＳルート）。Further, the context updating unit 206A rejects the oldest data (data held in the prefix data holding unit 200-n), and sets the decoded event K as the context in the prefix data holding unit 200-1. Insert and register, context string P
₀ is updated (step J8). Then, it is checked whether the decoding is completed for all the data (step J
9) If not completed, the processes from step J2 are repeated (from NO route of step J9 to step J).
2) If not, the decryption process ends (step J).
9 YES route).

【０２４８】なお、この場合も符号化側と同様に、上述
のノードの組み変え（ステップＪ７）と文脈更新（ステ
ップＪ８）の処理は、どちらを先にしてもよく、また、
並列に処理してもよい。このように、関連技術２の第１
変形例にかかるデータ復元装置によれば、シンボルが登
録されていないことを表すエスケープコード（ＥＳＣ）
を予め符号木に登録しておき、復号したシンボルがこの
エスケープコードであった場合に、エスケープコード以
外の符号（すなわち、復号するシンボルの符号）を復号
するまで文脈を短く（変更）して復号するシンボルの符
号を検索することにより、入力データとして現れるシン
ボルの組み合わせ（文脈）全てを予め符号木に登録して
おかなくてもよいので、文脈の登録に使用するメモリを
大幅に削減できる利点がある。In this case as well, as in the case of the encoding side, either of the processing of changing the nodes (step J7) and the processing of updating the context (step J8) may be performed first.
You may process in parallel. In this way, the first of Related Technology 2
According to the data restoration device according to the modification, the escape code (ESC) indicating that the symbol is not registered
If the decoded symbol is this escape code, the context is shortened (changed) and decoded until the code other than the escape code (that is, the code of the symbol to be decoded) is decoded. By searching for the code of the symbol to be used, it is not necessary to register all the combinations (contexts) of the symbols that appear as the input data in the code tree in advance, which has the advantage of significantly reducing the memory used for registering the context. is there.

【０２４９】そして、上述のように復号したシンボルが
エスケープコードである間、このエスケープコードの符
号長をスプレイ処理により短く（１／２）してゆくこと
により、復号するシンボルが含まれる文脈を発見して復
号するまでの時間を短縮することができるので、データ
復元処理の速度が大幅に向上するとともにデータ復元装
置の処理負荷も大幅に軽減できる効果がある。（ｂ−２）関連技術２の第２の変形例の説明（１）符号化側の説明図４６は、関連技術２の第２の変形例としてのデータ圧
縮装置３の内部の構成例を示すものであり、この図４６
に示すように、本データ圧縮装置３の内部には、関連技
術２の第１の変形例の説明中、図３８にて前述した構成
に加えて、さらに符号登録部１１２Ａが設けられてい
る。While the decoded symbol is the escape code as described above, the code length of the escape code is shortened (1/2) by the spray process to find the context including the symbol to be decoded. Since it is possible to shorten the time until decoding is performed, there is an effect that the speed of the data restoration processing is significantly improved and the processing load of the data restoration device is also greatly reduced. (B-2) Description of Second Modification of Related Technology 2 (1) Description of Encoding Side FIG. 46 shows an internal configuration example of a data compression device 3 as a second modification of Related Technology 2. This Figure 46
As shown in FIG. 38, a code registration unit 112A is further provided inside the data compression device 3 in addition to the configuration described above with reference to FIG. 38 in the description of the first modified example of the related technique 2.

【０２５０】このため、この図４２中、図３３にて既述
の符号と同じ符号の構成部分の説明は省略する。また、
符号登録部１１２Ａは、入力されたデータＫが符号木に
登録されていないときに、このデータＫを符号木に登録
するものである。このような構成により、入力されたデ
ータＫが符号木に登録されていないときに、符号登録部
１１２Ａが符号木にデータＫを登録するようにすること
ができる。Therefore, in FIG. 42, description of the components having the same reference numerals as those already described in FIG. 33 will be omitted. Also,
The code registration unit 112A registers the input data K in the code tree when the data K is not registered in the code tree. With such a configuration, when the input data K is not registered in the code tree, the code registration unit 112A can register the data K in the code tree.

【０２５１】ここで、上述のような動作について、図４
７の処理ステップＫ１〜Ｋ１２を参照しながら説明する
が、ここで、この図４７に示すように、ステップＫ８以
外の処理は、図４３のフローチャートにて前述した処理
ステップＨ１〜Ｈ１１と同様の処理である。すなわち、
本例ではステップＫ１〜Ｋ７において、図４３中のステ
ップＨ１〜Ｈ７と同様の処理が行なわれ、そのあとにス
テップＫ８にかかる処理が行われる。Here, the operation as described above will be described with reference to FIG.
The processing will be described with reference to the processing steps K1 to K12 of step 7. Here, as shown in FIG. 47, the processing other than step K8 is the same as the processing steps H1 to H11 described above in the flowchart of FIG. Is. That is,
In this example, in steps K1 to K7, the same processing as steps H1 to H7 in FIG. 43 is performed, and then the processing in step K8 is performed.

【０２５２】すなわち、文脈判別部１０８Ａが、文脈木
に事象Ｋが登録されていないと判別した場合（ステップ
Ｋ４のＮＯルート）、符号木更新部１０５Ａが、符号木
のＥＳＣのリーフを他のリーフあるいはノードと組み替
えてＥＳＣの符号を変更し（ステップＫ６）、文脈履歴
保持部１０１Ａが現在の文脈Ｐを符号登録手段１１２Ａ
に送り、文脈変更部１１０Ａが文脈Ｐの次数を１つ低次
に移して変更した（ステップＫ７）後、符号登録部１１
２Ａが符号木に事象Ｋを登録する（ステップＫ８）。That is, when the context discriminating unit 108A discriminates that the event K is not registered in the context tree (NO route in step K4), the code tree updating unit 105A sets the ESC leaf of the code tree to another leaf. Alternatively, the code of the ESC is changed by recombining with the node (step K6), and the context history holding unit 101A stores the current context P in the code registration means 112A.
Then, the context changing unit 110A moves the order of the context P to the next lower order and changes it (step K7), and then the code registration unit 11
2A registers the event K in the code tree (step K8).

【０２５３】なお、ステップＫ４において、文脈木に事
象Ｋが登録されていると判別された場合においても、ス
テップＫ９〜Ｋ１２において、前述の図４３におけるス
テップＨ８〜Ｈ１１と同様の処理が行なわれている。こ
のように、関連技術２の第２の変形例にかかるデータ圧
縮装置によれば、上述のように、ステップＫ８以外は、
関連技術２の第１の変形例の符号化側におけるにおける
図４３の処理ステップと同様であるので、関連技術２の
第１の変形例の符号化側の効果と同様の効果がある。Even when it is determined in step K4 that the event K is registered in the context tree, the same processing as steps H8 to H11 in FIG. 43 described above is performed in steps K9 to K12. There is. As described above, according to the data compression device of the second modification of the related technique 2, as described above, except for step K8,
Since it is the same as the processing step of the encoding side of the 1st modification of the related technology 2 of FIG. 43, there exists an effect similar to the encoding side of the 1st modification of the related technology 2.

【０２５４】さらに、上述のように、ステップ８をとる
ことで、エスケープコードを符号化した間シンボルを符
号木上に登録してゆき、符号木に予め登録されていなか
ったシンボルについても次の符号化では早い段階（高次
の次数）で符号化することができるので、符号化が進む
ほど圧縮効果が大幅に向上する効果がある。（２）復元側の説明図４８は、関連技術２の第２の変形例としてのデータ復
元装置４の内部の構成例を示すものであり、この図４８
に示すように、本データ復元装置４の内部には、関連技
術２の第１の変形例の復元側の説明中、図４４にて前述
した構成に加えて、さらに符号登録部２１２Ａが設けら
れている。なお、この符号登録部２１２Ａは、符号化側
の構成（図４６参照）に対応して、設けられているもの
である。Further, as described above, by taking step 8, the symbols are registered in the code tree while the escape code is being coded, and the symbols not previously registered in the code tree are also subjected to the next code. Since the encoding can be performed at an early stage (higher order), the compression effect is significantly improved as the encoding progresses. (2) Explanation of Restoration Side FIG. 48 shows an example of the internal configuration of the data restoration device 4 as the second modification of the related technique 2.
As shown in FIG. 44, in the inside of the data restoration device 4, a code registration unit 212A is further provided in addition to the configuration described above with reference to FIG. 44 in the explanation of the restoration side of the first modified example of the related technique 2. ing. The code registration unit 212A is provided corresponding to the configuration on the encoding side (see FIG. 46).

【０２５５】このため、この図４８中、図４４にて既述
の符号と同じ符号の構成部分の説明は省略する。符号登
録部２１２Ａは、圧縮側で符号化されたデータの復号処
理でＥＳＣ（エスケープコード）を復号したときに、文
脈に対応した全ての符号木に、この復号したデータの符
号を登録するものである。Therefore, in FIG. 48, description of the components having the same reference numerals as those already described in FIG. 44 will be omitted. The code registration unit 212A registers the code of the decoded data in all the code trees corresponding to the context when the ESC (escape code) is decoded in the decoding process of the data encoded on the compression side. is there.

【０２５６】これにより、図４４にて前述した構成がと
る動作に加えて、符号登録部２１２Ａが、符号化された
データＫの復号処理において、ＥＳＣを復号したとき
に、文脈に対応した全ての符号木に、この復号したデー
タＫの符号を登録することができる。上述の動作につい
て、図４９の処理ステップＬ１〜Ｌ１０を参照しなが
ら、さらに詳述する。As a result, in addition to the operation performed by the configuration described above with reference to FIG. 44, when the code registration unit 212A decodes the ESC in the decoding process of the coded data K, all the codes corresponding to the context are obtained. The code of this decoded data K can be registered in the code tree. The above operation will be described in more detail with reference to the processing steps L1 to L10 of FIG.

【０２５７】ここで、この図４９に示すように、処理ス
テップＬ１〜Ｌ４においては、関連技術２の第１の変形
例の復元側における、図４５にて前述した処理ステップ
Ｊ１〜Ｊ４と同様の処理が行なわれる。そして、ステッ
プＬ４で、復号部２０４Ａが符号を復号した後、符号木
更新部２０５Ａが、復号した事象Ｋのリーフを他のリー
フあるいはノードと組み替え（ステップＬ５）。As shown in FIG. 49, the processing steps L1 to L4 are the same as the processing steps J1 to J4 described above with reference to FIG. 45 on the restoration side of the first modification of the related technique 2. Processing is performed. Then, in step L4, after the decoding unit 204A decodes the code, the code tree updating unit 205A rearranges the decoded leaf of the event K with another leaf or node (step L5).

【０２５８】さらに、符号木更新部２０５Ａは、この時
点で、以前にＥＳＣ（エスケープコード）が復号されて
おり、復号部２０４ＡからのＥＳＣ信号を受信している
と、このＥＳＣを復号した全ての符号木に事象Ｋを登録
する（ステップＬ６）。さらに、復号した事象ＫがＥＳ
Ｃであるかを判別し（ステップＬ７）、復号した事象Ｋ
がＥＳＣである場合は、復号した事象ＫがＥＳＣである
ことを示すＥＳＣ信号を、文脈変更部２１０Ａおよび符
号登録部２１２Ａに送信し、このＥＳＣ信号を受信した
文脈変更部２１０Ａは、文脈Ｐの次数を１つ低次に移し
て変更し（ステップＬ７のＹＥＳルートからステップＬ
８）、処理はステップＬ３に戻って、ＥＳＣ以外の事象
Ｋを復号するまで処理を繰り返す。Furthermore, if the ESC (escape code) has been previously decoded at this point and the ESC signal from the decoding unit 204A is received, the code tree updating unit 205A determines that all the ESCs have been decoded. The event K is registered in the code tree (step L6). Furthermore, the decrypted event K is ES
It is determined whether it is C (step L7) and the decoded event K
Is an ESC, the ESC signal indicating that the decoded event K is an ESC is transmitted to the context changing unit 210A and the code registering unit 212A, and the context changing unit 210A receiving this ESC signal changes the context P Change the order by moving it to the next lower level (from the YES route of step L7 to step L
8) The process returns to step L3 and is repeated until the event K other than ESC is decoded.

【０２５９】このようにして、復号部２０４Ａで復号し
た事象ＫがＥＳＣである場合は、符号登録部２１２Ａが
符号木に新規のリーフを作成し、復号部２０４がＥＳＣ
以外の事象をＫを復号したとき、この事象Ｋを新規に作
成した全てのリーフに格納することにより、ＥＳＣを復
元した全ての符号木に該シンボルを登録することができ
る。In this way, when the event K decoded by the decoding unit 204A is ESC, the code registration unit 212A creates a new leaf in the code tree and the decoding unit 204 ESC.
When K is decoded for events other than, by storing this event K in all newly created leaves, the symbol can be registered in all code trees in which ESC has been restored.

【０２６０】一方、上述のステップＬ７で、復号した事
象ＫがＥＳＣでない場合は、文脈更新部２０６Ａが、最
も古いデータ（前置データ保持部２００−ｎに保持され
ているデータ）を棄却し、復号した事象Ｋを文脈として
前置データ保持部２００−１に挿入して登録し、文脈文
字列Ｐ₀を更新する（ステップＬ９）。そして、全ての
データについて復号が終了したかをチェックし（ステッ
プＬ１０）、終了していなければステップＬ２からの処
理を繰り返し（ステップＬ１０のＮＯルート）、終了し
ていれば復号処理を終了する（ステップＬ１０のＹＥＳ
ルート）。On the other hand, when the decoded event K is not ESC in the above step L7, the context updating unit 206A rejects the oldest data (data held in the prefix data holding unit 200-n), The decrypted event K is inserted as a context into the pre-data holding unit 200-1 and registered, and the context character string P ₀ is updated (step L9). Then, it is checked whether the decoding is completed for all the data (step L10), and if not completed, the processes from step L2 are repeated (NO route of step L10), and if completed, the decoding process is completed (step L10). YES in step L10
root).

【０２６１】以上のように、復号側でも符号化側と同様
に、復号部２０４ＡがＥＳＣを復号したときは、符号木
登録部２１２Ａが符号木に事象Ｋを新規に登録する。こ
のように、関連技術２の第２の変形例にかかるデータ復
元装置によれば、符号化側と同様に、第１の変形例にお
ける復元側にて前述した効果に加えて、エスケープコー
ドの符号を復号した場合の間、復号するシンボルの符号
を符号木上に登録してゆくことで、符号木に登録されて
いないシンボルについても次の復号では早い段階で復号
することができるので、文字などのデータの復号処理が
進むほど復元効果が大幅に向上するとともにデータ復元
装置の処理負荷も大幅に軽減できる効果がある。（ｂ−３）関連技術２の第３の変形例の説明（１）符号化側の説明本第３の変形例において、上述のデータ圧縮装置３は、
前述の第２の変形例に比して、データＫを符号化する
際、一度でもＥＳＣを符号化したときは、履歴登録手段
としての文脈変更部１１０Ａが、このデータの符号化の
直前の文脈とこのデータとの組み合わせを文脈履歴保持
部１０１Ａに登録し、符号登録部１１２Ａが、このデー
タを符号化する直前に符号化したエスケープコードを持
つ符号木にデータを新規に登録するようにすることがで
きる点が異なる。As described above, on the decoding side as well as on the encoding side, when the decoding unit 204A decodes the ESC, the code tree registration unit 212A newly registers the event K in the code tree. As described above, according to the data decompression device according to the second modified example of the related technique 2, in addition to the effect described above on the decompression side in the first modified example, the code of the escape code is provided, as in the encoding side. By registering the code of the symbol to be decoded on the code tree during the decoding of, the symbols not registered in the code tree can be decoded at the early stage in the next decoding. As the decryption processing of the data proceeds, the restoration effect is significantly improved, and the processing load of the data restoration device can be greatly reduced. (B-3) Description of Third Modification of Related Technique 2 (1) Description of Encoding Side In the third modification, the data compression device 3 described above is
Compared to the second modification described above, when the data K is encoded, even if the ESC is encoded even once, the context changing unit 110A as the history registration means causes the context immediately before the encoding of the data. And a combination of this data with the context history holding unit 101A, and the code registration unit 112A newly registers the data in the code tree having the escape code encoded immediately before the data is encoded. The difference is that you can do it.

【０２６２】以下、上述の処理について、図５０に示す
処理ステップＭ１〜Ｍ１３を参照しながら、さらに詳述
する。ここで、この図５０に示すように、ステップＭ１
〜Ｍ９は、図４３の処理ステップＨ１〜Ｈ９とそれぞれ
同様の処理が行なわれる。The above process will be described in more detail below with reference to process steps M1 to M13 shown in FIG. Here, as shown in FIG. 50, step M1
.. to M9 are processed in the same manner as the processing steps H1 to H9 in FIG.

【０２６３】そして、文脈Ｐに入力データの事象Ｋが登
録されている場合は（ステップＭ４のＹＥＳルート）、
ステップＭ８を経由して、ステップＭ９で符号木の組み
替えを行なった後は、復号部１０４がステップＭ８で符
号化した事象ＫがＥＳＣであるかをチェックする（ステ
ップＭ１０）。符号化した事象ＫがＥＳＣである場合
は、符号登録部１１２Ａがこの事象Ｋを符号化する直前
の文脈Ｐ’に対応する符号木に事象Ｋを登録し（ステッ
プＭ１０のＹＥＳルートからステップＭ１１）、文脈更
新部１０６Ａが事象Ｋを含む文脈Ｐを、最も新しい文脈
として前置データ保持部１００Ａ−１に登録する。If the event K of the input data is registered in the context P (YES route of step M4),
After the code tree is rearranged in step M9 via step M8, the decoding unit 104 checks whether the event K encoded in step M8 is ESC (step M10). When the encoded event K is ESC, the code registration unit 112A registers the event K in the code tree corresponding to the context P ′ immediately before encoding the event K (from the YES route of step M10 to step M11). The context updating unit 106A registers the context P including the event K in the predata holding unit 100A-1 as the newest context.

【０２６４】さらに、この情報から文脈変更部１１０Ａ
が、文脈文字列Ｐ₀にデータＫを挿入して更新し（ステ
ップＭ１２）、文脈履歴保持部１０１Ａに登録する。つ
まり、ステップＭ１１でデータＫを符号化する直前の文
脈Ｐ’（すなわち最後に符号化された文脈）の符号木に
データＫが登録されているので、データＫとこのデータ
Ｋを符号化する直前の文脈Ｐ’との組み合わせが文脈変
更部１０６Ａにより文脈履歴保持部１０１Ａに登録され
ることになる。Furthermore, from this information, the context changing unit 110A
Inserts the data K into the context character string P ₀ , updates it (step M12), and registers it in the context history holding unit 101A. That is, since the data K is registered in the code tree of the context P '(that is, the last encoded context) immediately before the data K is encoded in step M11, the data K and the data K immediately before the data K are encoded. The context changing unit 106A registers the combination with the context P ′ in the context history holding unit 101A.

【０２６５】一方、符号化した事象ＫがＥＳＣでない場
合は（ステップＭ１０のＮＯルート）、ステップＭ１１
をスキップして、上述のステップＭ１２を行なう。さら
に、全てのデータの符号化が終了したかをチェックし
（ステップＭ１３）、終了していない場合は（ステップ
Ｍ１３のＮＯルート）、処理はステップＭ２に戻り、全
てのデータの符号化が終了するまで処理を繰り返し、終
了している場合は、符号化の処理を終了する（ステップ
Ｍ１３のＹＥＳルート）。On the other hand, when the coded event K is not ESC (NO route of step M10), step M11
Is skipped and step M12 described above is performed. Furthermore, it is checked whether the encoding of all the data is completed (step M13), and if not completed (NO route of step M13), the process returns to step M2, and the encoding of all the data is completed. The processing is repeated up to, and if the processing is completed, the encoding processing is completed (YES route of step M13).

【０２６６】このように、関連技術２の第３の変形例に
かかるデータ圧縮装置によれば、第２の変形例の符号化
側にて前述した効果に加えて、予め登録された文脈にシ
ンボルが含まれない場合、最後に符号化したエスケープ
コードの符号木にのみこのシンボルを登録してゆくこと
により、新たな文脈を登録するために使用するメモリの
量を大幅に削減することができるので、データ圧縮装置
の性能が大幅に向上する効果がある。As described above, according to the data compression apparatus of the third modified example of the related art 2, in addition to the effects described above on the encoding side of the second modified example, the symbol is added to the context registered in advance. If is not included, the amount of memory used to register a new context can be significantly reduced by registering this symbol only in the code tree of the last encoded escape code. There is an effect that the performance of the data compression device is significantly improved.

【０２６７】さらに、上述のようにして新たに登録され
たシンボルと、同じシンボルが後に入力される毎にこの
シンボルの符号木のノードを組み替えて符号長を短くす
る（スプレイ処理する）ことにより、実際に出現頻度が
高いシンボルについてのみ、その符号の符号長を短くす
ることができるので、データ圧縮装置の圧縮効果が大幅
に向上する効果がある。Furthermore, each time the same symbol newly registered as described above is input later, the node of the code tree of this symbol is recombined to shorten the code length (spray processing). Since the code length of the code can be shortened only for the symbol having a high appearance frequency, the compression effect of the data compression device is significantly improved.

【０２６８】（２）復元側の説明関連技術２の第３の変形例にかかるデータ復元装置４
は、図４８に示したデータ復元装置４に比して、履歴登
録手段としての文脈変更部２１０Ａが、符号化側で符号
化されたデータＫの復号処理で、最後にＥＳＣ（エスケ
ープコード）を復号した時の文脈と復号したデータＫと
を文脈履歴保持部２０１Ａに登録し、符号登録部２１２
Ａが、データＫの復号処理で最後にＥＳＣを復号した時
の文脈に対応した符号木にデータＫの符号を登録するよ
うにすることができる点が異なる。(2) Description of Restoration Side Data Restoration Device 4 According to Third Modification of Related Technique 2
Compared to the data restoration device 4 shown in FIG. 48, the context changing unit 210A as the history registration means is a decoding process of the data K encoded on the encoding side, and finally ESC (escape code) is added. The decoded context and the decoded data K are registered in the context history holding unit 201A, and the code registration unit 212
The difference is that A can register the code of the data K in the code tree corresponding to the context when the ESC is finally decoded in the decoding process of the data K.

【０２６９】上述のような処理について、図５１の処理
ステップＮ１〜Ｎ１０を参照しながら、さらに詳述す
る。ここで、この図５１に示すように、ステップＮ１〜
Ｎ５において、図４９のステップＫ１〜Ｋ５とそれぞれ
同様の処理が行なわれる。The above-mentioned processing will be described in more detail with reference to the processing steps N1 to N10 in FIG. Here, as shown in this FIG.
At N5, processes similar to those at steps K1 to K5 of FIG. 49 are performed.

【０２７０】そして、本復元側では、ステップＮ５で復
号した事象Ｋのリーフを他のリーフあるいはノードと組
み替えた後に、復号した事象ＫがＥＳＣであるかをチェ
ックする（ステップＮ６）。復号した事象ＫがＥＳＣで
ある場合は、文脈Ｐの次数を変更し（ステップＮ６のＹ
ＥＳルートからステップＮ７）、処理はステップＮ３に
戻って、ＥＳＣ以外の事象Ｋを復号するまで処理を繰り
返す。Then, on the restoration side, after recombining the leaf of the event K decrypted in step N5 with another leaf or node, it is checked whether or not the decrypted event K is ESC (step N6). If the decoded event K is ESC, the order of the context P is changed (Y in step N6).
From the ES route to step N7), the process returns to step N3 to repeat the process until the event K other than ESC is decoded.

【０２７１】一方、復号した事象ＫがＥＳＣでない場合
は、符号木更新部２０５Ａは、直前のＥＳＣを復号した
符号木のみに新規リーフを作成し、符号登録部２１２Ａ
がこの符号木の新規リーフに事象Ｋを登録する（ステッ
プＮ６のＮＯルートからステップＮ８）。さらに、文脈
更新部２０６Ａが、最も古いデータ（前置データ保持部
２００−ｎに保持されているデータ）を棄却し、復号し
た事象Ｋを文脈として前置データ保持部２００−１に挿
入して登録し、文脈文字列Ｐ₀を更新する（ステップＮ
９）。On the other hand, when the decoded event K is not the ESC, the code tree updating unit 205A creates a new leaf only in the code tree obtained by decoding the immediately previous ESC, and the code registration unit 212A.
Registers event K in a new leaf of this code tree (from NO route of step N6 to step N8). Further, the context updating unit 206A discards the oldest data (data held in the prefix data holding unit 200-n), inserts the decoded event K into the prefix data holding unit 200-1 as the context, and Register and update the context string P ₀ (step N
9).

【０２７２】そして、全てのデータについて復号が終了
したかをチェックし（ステップＮ１０）、終了していな
ければ、ステップＮ２からの処理を繰り返し（ステップ
Ｎ１０のＮＯルートからステップＮ２）、終了していな
ければ復号処理を終了する。このように、関連技術２の
第３の変形例にかかるデータ復元装置によれば、第２の
変形例の復元側にて前述した効果に加えて、予め登録さ
れた文脈にシンボルが含まれない場合、最後に復号した
エスケープコードの符号木にのみこのシンボルの符号を
新規に登録してゆくことにより、１つのシンボルに対し
て常に１つ以下の登録で済み、新たなシンボルの符号を
登録するために使用するノードＩＤ管理用のメモリを大
幅に削減することがきるので、データ復元装置の性能が
大幅に向上する効果がある。Then, it is checked whether the decoding is completed for all the data (step N10). If not completed, the processing from step N2 is repeated (from NO route of step N10 to step N2), and the processing must be completed. If so, the decoding process ends. As described above, according to the data decompression device of the third modified example of the related art 2, in addition to the effect described above on the decompression side of the second modified example, a symbol is not included in the pre-registered context. In this case, by newly registering the code of this symbol only in the code tree of the finally decoded escape code, one or less registration is always required for one symbol, and the code of the new symbol is registered. Since the memory for managing the node ID used for that purpose can be significantly reduced, there is an effect that the performance of the data restoration device is greatly improved.

【０２７３】さらに、上述のようにして新たに登録され
たシンボルと、同じシンボルが後に入力される毎にこの
シンボルの符号木のノードを組み替えて符号長を短くす
る（スプレイ処理する）ことにより、実際に出現頻度が
高いシンボルについてのみ、符号長の短い符号をもつこ
とになるので、データ復元装置の復元効果が大幅に向上
する効果がある。（ｃ）本発明の一実施形態の説明本発明の一実施形態にかかるデータ圧縮装置及びデータ
復元装置も、関連技術２と同様に、図３２に示したよう
な本発明のデータ圧縮方法及びデータ復元方法を実施す
るためのものである。Furthermore, each time the same symbol newly registered as described above is input later, the node of the code tree of this symbol is rearranged to shorten the code length (spray processing). Only a symbol having a high appearance frequency actually has a code with a short code length, so that the restoration effect of the data restoration device is significantly improved. (C) Description of an Embodiment of the Present Invention A data compression apparatus and a data decompression apparatus according to an embodiment of the present invention, like the related technique 2, also include the data compression method and data of the present invention as shown in FIG. This is for implementing the restoration method.

【０２７４】また、本実施形態においても、関連技術２
と同様に、データ圧縮装置を符号化側、データ復元装置
を復元側として説明を進める。なお、以下の説明中にお
いても、文脈木および符号木は、関連技術１にて前述し
た構成をもつものである。（１）符号化側の説明図５２は、本発明の一実施形態にかかるデータ圧縮方法
を実施するためのデータ圧縮装置３（図３２参照）の内
部の構成例を示す図であり、この図５２において、３０
１Ｂは符号木保持部、３０２Ｂは文脈木保持部、３０３
Ｂは文脈登録部、３０５Ｂは文脈変更部、３０６Ｂは符
号化部、３０７Ｂは符号変更部、３２１Ｂは文脈保持
部、３２２Ｂは符号登録部である。Also in this embodiment, the related technique 2
Similarly, the description will proceed with the data compression device as the encoding side and the data decompression device as the decompression side. Note that, also in the following description, the context tree and the code tree have the configurations described in Related Art 1. (1) Description of Encoding Side FIG. 52 is a diagram showing an internal configuration example of the data compression apparatus 3 (see FIG. 32) for implementing the data compression method according to the embodiment of the present invention. At 52, 30
1B is a code tree holding unit, 302B is a context tree holding unit, 303
B is a context registration unit, 305B is a context change unit, 306B is an encoding unit, 307B is a code change unit, 321B is a context holding unit, and 322B is a code registration unit.

【０２７５】ここで、符号木保持部（符号木保持手段）
３０１Ｂは、予めエスケープコード（ＥＳＣ）（データ
未登録を示すデータ）を登録した符号木を保持するもの
であり、文脈木保持部（文脈木保持手段）３０２Ｂは、
シンボルＫ（入力データ）と文脈との組み合わせを登録
した文脈木を保持するものである。また、文脈登録部
（文脈登録手段）３０３Ｂは、エスケープコードを符号
化したのち、文脈木にシンボルＫを新規に登録するもの
であり、文脈保持部３２１Ｂは、入力されたシンボルＫ
を一旦保持するものである。Here, the code tree holding unit (code tree holding means)
Reference numeral 301B holds a code tree in which an escape code (ESC) (data indicating that data has not been registered) is registered in advance. The context tree holding unit (context tree holding means) 302B is
It holds a context tree in which a combination of a symbol K (input data) and a context is registered. Further, the context registration unit (context registration means) 303B newly registers the symbol K in the context tree after encoding the escape code, and the context holding unit 321B receives the input symbol K.
Is held once.

【０２７６】符号登録部（符号登録手段）３２２Ｂは、
エスケープコードを符号化したのち符号木保持部（符号
木保持手段）３０１Ｂ内の符号木のエスケープコードの
リーフ（データ格納点）を分岐してシンボルＫを新規に
登録するものである。このため、符号登録部３２２Ｂに
は、図５３にて後述する、新規ノードＩＤ発生部６１，
ラッチ６２，親子情報更新部６３が設けられており、符
号木保持部３０１Ｂの内部には、外部節点（リーフＩ
Ｄ）保持部６４，内部節点（ノードＩＤ）保持部６５,
ＥＳＣ−ＩＤ保持部６６，及び符号木管理部６７が設け
られている。The code registration unit (code registration means) 322B is
After the escape code is encoded, the leaf (data storage point) of the escape code of the code tree in the code tree holding unit (code tree holding means) 301B is branched to newly register the symbol K. Therefore, the code registration unit 322B has a new node ID generation unit 61, which will be described later with reference to FIG.
A latch 62 and a parent-child information updating unit 63 are provided, and an external node (leaf I) is provided inside the code tree holding unit 301B.
D) holding unit 64, internal node (node ID) holding unit 65,
An ESC-ID holding unit 66 and a code tree management unit 67 are provided.

【０２７７】また、文脈変更部３０５Ｂは、シンボルＫ
と文脈との組み合わせが文脈木に保持されていないと
き、文脈を変更するものである。符号化部３０６Ｂは、
符号木の頂点からのシンボルＫあるいはエスケープコー
ドが登録してあるリーフまでの分岐に従って“１”か
“０”で表される符号を出力するものである。Also, the context changing unit 305B uses the symbol K
When the combination of and context is not stored in the context tree, it changes the context. The encoding unit 306B is
The code represented by "1" or "0" is output according to the branch from the vertex of the code tree to the leaf in which the escape code is registered.

【０２７８】符号更新部３０７Ｂは、符号化したシンボ
ルＫ及びエスケープコードが登録してあるリーフと他の
リーフあるいはノードとを取り替えるものである。そし
て、図５３に示すように、符号登録部３２２Ｂの内部に
おいて、新規ノードＩＤ発生部６１は、文脈木保持部３
０２Ｂから更新信号を受けて２つの新規ノードＩＤ（Ｉ
Ｄ−１，ＩＤ−２を発生するものであり、ラッチ６２は
エスケープコードのＩＤ（ＥＳＣ−ＩＤ）を一旦保持す
るものである。The code updating unit 307B replaces the leaf in which the coded symbol K and escape code are registered with another leaf or node. Then, as shown in FIG. 53, in the code registration unit 322B, the new node ID generation unit 61 uses the context tree holding unit 3
02B receives an update signal and two new node IDs (I
D-1 and ID-2 are generated, and the latch 62 temporarily holds the escape code ID (ESC-ID).

【０２７９】また、親子情報更新部６３は、処理対象の
ノードの上位のノードＩＤと右下に位置する下位ノード
のノードＩＤおよび左下に位置する下位ノードのノード
ＩＤの３つの情報（ＥＳＣ−ＩＤ，ＩＤ−１，ＩＤ−
２）からなる親子情報を受けてこの親子情報を変更し、
符号木保持部３０１Ｂに送るものである。さらに、符号
木保持部３０１Ｂの内部において、外部節点（リーフＩ
Ｄ）保持部６４は符号木のデータの格納点であるリーフ
のリーフＩＤを保持するものであり、内部節点（ノード
ＩＤ）保持部６５は符号木のノードのノードＩＤを保持
するものであり、ＥＳＣ−ＩＤ保持部６６は符号木のエ
スケープコードとこのエスケープコードのＩＤを保持す
るものである。Further, the parent-child information updating unit 63 has three pieces of information (ESC-ID) of the upper node ID of the node to be processed, the node ID of the lower node located at the lower right and the node ID of the lower node located at the lower left. , ID-1, ID-
Change the parent-child information by receiving the parent-child information consisting of 2),
It is sent to the code tree holding unit 301B. Furthermore, inside the code tree holding unit 301B, external nodes (leaf I
D) The holding unit 64 holds the leaf ID of the leaf that is the storage point of the code tree data, and the internal node (node ID) holding unit 65 holds the node ID of the node of the code tree. The ESC-ID holding unit 66 holds the escape code of the code tree and the ID of this escape code.

【０２８０】また、符号木管理部６７は、文脈木保持部
３０２Ｂから文脈ＩＤを受けて、この文脈ＩＤを外部節
点（リーフＩＤ）保持部６４，内部節点（ノードＩＤ）
保持部６５およびＥＳＣ−ＩＤ保持部６６に送るもので
ある。上述の構成により、本発明の一実施形態にかかる
データ圧縮装置３では、符号木保持部３０１Ｂが予めエ
スケープコードを登録した符号木を保持し、文脈木保持
部３０２ＢがシンボルＫと文脈Ｐとの組み合わせを登録
した文脈木を保持し、文脈登録部３０３Ｂがエスケープ
コードを符号化したのち、文脈木にシンボルＫを新規に
登録することができる。Further, the code tree management unit 67 receives the context ID from the context tree holding unit 302B, and uses this context ID as the external node (leaf ID) holding unit 64 and the internal node (node ID).
It is sent to the holding unit 65 and the ESC-ID holding unit 66. With the above configuration, in the data compression device 3 according to the embodiment of the present invention, the code tree holding unit 301B holds the code tree in which the escape code is registered in advance, and the context tree holding unit 302B stores the symbol K and the context P. The context tree in which the combination is registered is held, and after the context registration unit 303B encodes the escape code, the symbol K can be newly registered in the context tree.

【０２８１】さらに、文脈保持部３２１Ｂが文脈Ｐを一
旦保持し、符号登録部３２２Ｂがエスケープコードを符
号化したのち、符号木のエスケープコードのリーフを分
岐してシンボルＫを新規に登録することができる。ま
た、文脈変更部３０５Ｂが、入力されたシンボルＫと文
脈Ｐとの組み合わせが文脈木に保持されていないとき、
文脈Ｐを変更することができる。Further, after the context holding unit 321B once holds the context P and the code registration unit 322B encodes the escape code, the leaf of the escape code of the code tree may be branched to newly register the symbol K. it can. Further, when the context changing unit 305B does not hold the combination of the input symbol K and the context P in the context tree,
The context P can be changed.

【０２８２】さらに、符号化部３０６Ｂが符号木の頂点
からの入力データあるいはエスケープコードが登録して
あるリーフまでの分岐に従って符号を出力し、符号更新
部３０７Ｂが符号化したデータ及びエスケープコードが
登録してあるリーフと他のリーフあるいはノードとを取
り替えることができる。なお、図５７（ａ）は符号木の
例を示す図であり、同図（ｂ）は文脈木の例を示す図で
ある。ところで、この図５７（ａ）に示すように、符号
木は内部節点としてのルートとノード、外部節点として
のリーフにそれぞれＩＤ番号（０〜１０）を持ってい
る。Further, the encoding unit 306B outputs a code according to the branch to the input data from the vertex of the code tree or the leaf in which the escape code is registered, and the data and the escape code encoded by the code updating unit 307B are registered. One leaf can be replaced with another leaf or node. 57 (a) is a diagram showing an example of the code tree, and FIG. 57 (b) is a diagram showing an example of the context tree. By the way, as shown in FIG. 57 (a), the code tree has ID numbers (0 to 10) at the root and node as internal nodes and at the leaf as external nodes, respectively.

【０２８３】一方、図５７（ｂ）に示すように、文脈木
は、文脈のＩＤとその文脈に登録されているシンボルの
ＩＤ番号を持っており、文脈のＩＤ番号は符号木のルー
トのＩＤ番号、シンボルのＩＤ番号は符号木のリーフの
ＩＤ番号と同一である。ここで、上述の動作について、
図５４〜図５６に示す本発明の符号化側の動作を示す処
理ステップＰ１〜Ｐ１６を参照しながら、さらに詳述す
る。On the other hand, as shown in FIG. 57 (b), the context tree has the ID of the context and the ID number of the symbol registered in the context, and the ID number of the context is the ID of the root of the code tree. The numbers and the ID numbers of the symbols are the same as the leaf ID numbers of the code tree. Here, regarding the above operation,
This will be described in more detail with reference to process steps P1 to P16 showing the operation of the encoding side of the present invention shown in FIGS.

【０２８４】また、以下の動作の説明中、符号木および
文脈木は、上述のような構成を有しているものとする。
まず、図５４に示すように、シンボルＫが入力されると
（ステップＰ１）、文脈保持部３２１Ｂに保持されてい
る文脈Ｐが文脈変更部３０５Ｂに出力される（ステップ
Ｐ２）。In the following description of the operation, it is assumed that the code tree and the context tree have the above-mentioned configurations.
First, as shown in FIG. 54, when the symbol K is input (step P1), the context P held in the context holding unit 321B is output to the context changing unit 305B (step P2).

【０２８５】そして、文脈変更部３０５Ｂでは、文脈Ｐ
とシンボルＫとを受けて、文脈Ｐの中にシンボルＫが登
録されているかを判断するが、シンボルＫが文脈Ｐに登
録されていない場合、文脈木保持部３０２Ｂは文脈変更
信号を文脈変更部３０５Ｂに出力する（ステップＰ
３）。文脈変更信号を受けた文脈変更部３０５Ｂは、最
高次（文脈木のルートから１番距離が長いリーフ）の文
字を棄却して次数を１つ下げた文脈Ｐを文脈木保持部３
０２Ｂに送る（ステップＰ４）。Then, in the context changing unit 305B, the context P
When the symbol K is not registered in the context P, the context tree holding unit 302B sends the context change signal to the context changing unit. Output to 305B (step P
3). Upon receiving the context change signal, the context change unit 305B rejects the highest-order character (the leaf having the longest distance from the root of the context tree) and lowers the degree by one to obtain the context P.
02B (step P4).

【０２８６】そして、シンボルＫが登録されている文脈
Ｐを決定するまで、この処理を繰り返す。次に、図５５
に示すように、文脈保持部３０２Ｂは、文脈ＰのＩＤ
（番号）とシンボルＫ（シンボルＫが登録されていない
ときはＥＳＣ）のＩＤを符号化部３０６Ｂに送り（ステ
ップＰ５）、符号化部３０６Ｂは、送られてきたＩＤを
符号木保持部３０１Ｂにそのまま転送する（ステップＰ
６）。Then, this process is repeated until the context P in which the symbol K is registered is determined. Next, FIG.
As shown in FIG.
The (number) and the ID of the symbol K (ESC when the symbol K is not registered) are sent to the coding unit 306B (step P5), and the coding unit 306B sends the sent ID to the code tree holding unit 301B. Transfer as it is (Step P
6).

【０２８７】そして、符号木保持部３０１Ｂは、送られ
てきたＩＤの上位ノードのＩＤ番号と、符号化部３０６
Ｂが送ってきたＩＤが上位ノードに対して右左のどちら
かに位置しているかを示す情報とを符号化部３０６Ｂに
送る（ステップＰ７）。さらに、符号化部３０６Ｂは、
上位ノードに対する、符号化部３０６Ｂが送ってきたＩ
Ｄをもつノードの位置情報に従って、例えば右に位置し
ていた場合は“１”、左なら“０”を符号として出力す
る（ステップＰ８）。Then, the code tree holding unit 301B stores the ID number of the upper node of the sent ID and the coding unit 306.
Information indicating whether the ID sent by B is located on the right or the left with respect to the upper node is sent to the encoding unit 306B (step P7). Further, the encoding unit 306B is
I sent from the encoding unit 306B to the upper node
According to the position information of the node having D, for example, "1" is output if it is located on the right, and "0" is output if it is on the left (step P8).

【０２８８】また、上述の位置情報とともに送られてき
た上位ノードのＩＤが、文脈保持部３０２Ｂから送られ
てきた文脈ＰのＩＤ（ルートＩＤ）と一致したとき、符
号化処理を終了する。一方、一致しなかったときは、そ
の文脈ＰのＩＤをさらに符号変更部３０７Ｂに出力し
（ステップＰ６と同じ経路）、さらに符号木保持部３０
１Ｂから上位のノードＩＤと位置情報とを得る。When the ID of the upper node sent together with the above-mentioned position information matches the ID (root ID) of the context P sent from the context holding unit 302B, the coding process is ended. On the other hand, when they do not match, the ID of the context P is further output to the code changing unit 307B (the same route as in step P6), and the code tree holding unit 30 is further added.
The upper node ID and the position information are obtained from 1B.

【０２８９】そして、符号木保持部３０１Ｂからの上位
ノードのＩＤ番号が、文脈ＰのＩＤに一致するまで処理
を繰り返す。この処理が終了した後、符号変更部３０７
Ｂは、符号化部３０６Ｂから文脈ＰのＩＤ（＝符号木の
ルートＩＤ）及び符号化したシンボルＫのリーフＩＤを
受け（ステップＰ９）、ノードの組み換え処理（スプレ
イ処理）を行ない、符号長を更新する。なお、このノー
ドの組み換え処理については、関連技術１および関連技
術２にて前述したように行なう。Then, the process is repeated until the ID number of the upper node from the code tree holding unit 301B matches the ID of the context P. After this processing is completed, the code changing unit 307
B receives the ID of the context P (= root ID of the code tree) and the leaf ID of the encoded symbol K from the encoding unit 306B (step P9), performs node recombination processing (spray processing), and determines the code length. Update. Note that this node recombination processing is performed as described above in Related Technology 1 and Related Technology 2.

【０２９０】これにより、シンボルＫが以前に入力され
文脈Ｐに登録されている場合に、さらにシンボルＫが入
力されると、既に登録してあるシンボルＫのノードを上
位のノードと組み替えて、符号長を短く（１／２に）す
ることができる。ところで、上述の文脈ＰにシンボルＫ
が登録されていない場合は、図５６に示すように、文脈
登録部３１０Ｂは、文脈変更部３０５Ｂから登録する文
脈Ｐを受ける（ステップＰ１０）とともにシンボルＫと
を受けて（ステップＰ１１）、文脈木保持部３０２Ｂに
登録シンボルのＩＤを出力し（ステップＰ１２）、文脈
木保持部３０２Ｂは、文脈Ｐの下にシンボルＫを新規に
登録する。Accordingly, when the symbol K is previously input and registered in the context P, when the symbol K is further input, the node of the already registered symbol K is recombined with the upper node and the code is changed. The length can be shortened (1/2). By the way, in the above context P, the symbol K
56 is not registered, the context registration unit 310B receives the context P to be registered from the context changing unit 305B (step P10) and the symbol K (step P11), as shown in FIG. The ID of the registered symbol is output to the holding unit 302B (step P12), and the context tree holding unit 302B newly registers the symbol K under the context P.

【０２９１】一方、符号登録部３２２Ｂは、符号を登録
する符号木を符号木保持部３０１Ｂより受け（ステップ
Ｐ１３）、登録するシンボルＫと、文脈木保持部３０２
Ｂからの登録シンボルのＩＤとを受けて（ステップＰ１
４，１５）、シンボルＫを符号木に新規登録して符号木
保持部３０１Ｂに再格納する（ステップＰ１６）。これ
により、文脈Ｐに登録されていないシンボルＫの登録
（符号化）が行なわれる。On the other hand, the code registration unit 322B receives the code tree for registering the code from the code tree holding unit 301B (step P13), and registers the symbol K to be registered and the context tree holding unit 302.
In response to the registered symbol ID from B (step P1
4, 15), the symbol K is newly registered in the code tree and stored again in the code tree holding unit 301B (step P16). Thereby, the symbol K not registered in the context P is registered (encoded).

【０２９２】ここで、符号登録部３２２Ｂが、エスケー
プコードを符号化したのち符号木保持部３０１Ｂ内の符
号木のエスケープコードのリーフを分岐してシンボルＫ
を新規に登録する動作（上述のステップＰ１３〜Ｐ１
６）について、前述したように、図５３を用いてさらに
詳述する。まず、符号木保持部３０１Ｂの内部に設けら
れた符号木管理部６７は文脈ＩＤを受けて符号木保持部
内のＥＳＣ−ＩＤ保持部に文脈のＥＳＣアドレスを送
る。Here, the code registration unit 322B codes the escape code and then branches the leaf of the escape code of the code tree in the code tree holding unit 301B to branch to the symbol K.
To newly register (the above-mentioned steps P13 to P1
6) will be described in more detail with reference to FIG. 53, as described above. First, the code tree management unit 67 provided inside the code tree holding unit 301B receives the context ID and sends the context ESC address to the ESC-ID holding unit in the code tree holding unit.

【０２９３】そして、ＥＳＣ−ＩＤ保持部６６は、ＥＳ
Ｃアドレスを受けて、ＥＳＣのＩＤとこのＩＤに登録し
てあるシンボル（この場合はＥＳＣ）を出力する一方、
符号登録部３２２Ｂでは、ＥＳＣ−ＩＤとＥＳＣをラッ
チ６２でラッチする。さらに、新規ＩＤ発生器６１は、
更新信号を受けて２つのＩＤ番号（ＩＤ−１，ＩＤ−
２）を発信し、ＩＤ−１は、新規のＥＳＣ−ＩＤとして
ラッチしてあったＥＳＣとともに符号木保持部３０１Ｂ
のＥＳＣ−ＩＤ保持部６６に格納される。Then, the ESC-ID holding unit 66 uses the ES
While receiving the C address and outputting the ID of the ESC and the symbol (ESC in this case) registered in this ID,
In the code registration unit 322B, the latch 62 latches the ESC-ID and ESC. Further, the new ID generator 61 is
Upon receiving the update signal, two ID numbers (ID-1, ID-
2) is transmitted, and ID-1 stores the code tree holding unit 301B together with the ESC latched as a new ESC-ID.
Stored in the ESC-ID holding unit 66.

【０２９４】また、符号木には、ＩＤとシンボルの他に
ＩＤの位置情報を表した親子情報が保持されているが、
この親子情報には、自分の上位のノードＩＤ，自分の右
下に位置するノードＩＤ，及び自分の左下に位置するノ
ードＩＤの３つの情報がある。親子情報更新部６３は、
３つのＩＤ（変更前のＥＳＣ−ＩＤ，ＩＤ−１，ＩＤ−
２）を受けて、親子情報を変更する。すなわち、ＩＤ−
１，ＩＤ−２の上位ＩＤにはＥＳＣ−ＩＤが登録され、
またＥＳＣ−ＩＤの右下にはＩＤ−１，左下にはＩＤ−
２が位置するという情報が登録される。[0294] Further, in the code tree, in addition to the ID and the symbol, parent-child information representing the position information of the ID is held.
The parent-child information includes three pieces of information, that is, a higher-level node ID of itself, a node ID located at the lower right of the own, and a node ID located at the lower left of the own. The parent-child information updating unit 63
Three IDs (ESC-ID, ID-1, ID-before change)
In response to 2), change the parent-child information. That is, ID-
1, ESC-ID is registered as the higher ID of ID-2,
Also, ESC-ID has ID-1 at the lower right and ID- at the lower left.
Information that 2 is located is registered.

【０２９５】親子情報は、各ＩＤとともに符号木保持部
３０１ＢのノードＩＤ保持部６５に保持される。一方、
符号木保持部３０１ＢのリーフＩＤ保持部６４には、登
録シンボルと新規ＩＤ（ＩＤ−２）が登録される。以上
のような処理を行なうことにより、ＥＳＣには新規ＥＳ
Ｃ−ＩＤ（ＩＤ−１）が登録され、旧ＥＳＣ−ＩＤはノ
ードＩＤとして親子情報とともにノードＩＤ保持部に保
持され、新規ＩＤ（ＩＤ−２）とシンボルＫは新規リー
フとしてリーフＩＤ保持部に保持され、シンボルＫを符
号木保持部３０１Ｂの符号木のエスケープコードが登録
されているリーフを分岐して新規登録する処理が終了す
る。The parent-child information is held in the node ID holding unit 65 of the code tree holding unit 301B together with each ID. on the other hand,
A registration symbol and a new ID (ID-2) are registered in the leaf ID holding unit 64 of the code tree holding unit 301B. By performing the above processing, a new ES is added to the ESC.
The C-ID (ID-1) is registered, the old ESC-ID is stored as a node ID in the node ID holding unit together with parent-child information, and the new ID (ID-2) and the symbol K are stored in the leaf ID holding unit as a new leaf. The processing of holding the symbol K and newly registering the branch of the leaf of the code tree holding unit 301B in which the escape code of the code tree is registered ends.

【０２９６】以上、図５３〜５６にて説明した動作をま
とめると、図５８に示すフローチャートで表すことがで
きる。すなわち、まず、文脈Ｐ₀に０を入力して初期化
し（ステップＱ１）、文脈Ｐ₀を変数としての文脈Ｐに
入力する（ステップＱ２）。なお、この文脈Ｐ₀は、直
前までに入力・符号化された文字（シンボル）であり、
例えば、本実施形態における符号化が、ｎ次の文脈を用
いるモデルであったとき、文脈Ｐ₀は直前までに入力・
符号化された（ｎ−１）文字が格納されていることにな
る。The operations described with reference to FIGS. 53 to 56 can be summarized as the flow chart shown in FIG. That is, first, 0 is input to the context P ₀ for initialization (step Q1), and the context P ₀ is input to the context P as a variable (step Q2). The context P ₀ is a character (symbol) that has been input / encoded until immediately before,
For example, when the encoding in the present embodiment is a model that uses an nth-order context, the context P ₀ is input by just before.
This means that the encoded (n-1) characters are stored.

【０２９７】そして、シンボルＫが入力されたとき、文
脈Ｐの文脈木にシンボルＫが登録されているか否かを検
索する（ステップＱ３）。そして、文脈Ｐの文脈木にシ
ンボルＫが登録されていない場合、ＥＳＣ（エスケープ
コード）の符号を出力し（ステップＱ３のＮＯルートか
らステップＱ４）、文脈Ｐに対応する符号木のＥＳＣが
登録されているリーフをスプレイ処理する（ステップＱ
５）。Then, when the symbol K is input, it is searched whether or not the symbol K is registered in the context tree of the context P (step Q3). When the symbol K is not registered in the context tree of the context P, the code of ESC (escape code) is output (from NO route of step Q3 to step Q4), and the ESC of the code tree corresponding to the context P is registered. Spraying the existing leaf (step Q
5).

【０２９８】そして、上述のＥＳＣのリーフを分岐し
（ステップＱ６）、これにより得た２つの新規リーフに
ＥＳＣとシンボルＫを登録し（ステップＱ７）、文脈Ｐ
の文脈木にもシンボルＫを登録しておく。さらに、上述
のようにしてシンボルＫの新規登録した後、文脈Ｐ内の
最も古い文字を棄却し、文脈の次数を１つ減じた文脈を
新たに文脈Ｐとして文脈Ｐを変更する（ステップＱ
８）。Then, the leaf of the above ESC is branched (step Q6), the ESC and the symbol K are registered in the two new leaves thus obtained (step Q7), and the context P
The symbol K is also registered in the context tree of. Furthermore, after newly registering the symbol K as described above, the oldest character in the context P is rejected, and the context P is changed to a new context P by changing the context P by one (step Q
8).

【０２９９】そして、ステップＱ３に戻り、シンボルＫ
が文脈Ｐに登録されていることを検出するまで、文脈Ｐ
を順次変更して処理を繰り返す。一方、文脈Ｐの文脈木
にシンボルＫが登録されている場合、シンボルＫの符号
を出力し（ステップＱ３のＹＥＳルートからステップＱ
９）、この文脈Ｐに対応する符号木のシンボルＫが登録
されているリーフをスプレイ（ＳＰＬＡＹ）処理する
（ステップＱ１０）。Then, returning to step Q3, the symbol K
Until it finds that is registered in context P.
Are sequentially changed and the process is repeated. On the other hand, if the symbol K is registered in the context tree of the context P, the code of the symbol K is output (from the YES route of step Q3 to step Q3).
9) The leaf in which the symbol K of the code tree corresponding to this context P is registered is subjected to the spray processing (SPLAY) (step Q10).

【０３００】さらに、シンボルＫの符号化後、文脈Ｐ₀
にシンボルＫを追加登録して文脈Ｐ₀を更新する（ステ
ップＱ１１）。（例えば文脈Ｐ₀にシンボルＫを追加
し、文脈Ｐ₀内の最も古い文字を棄却する）。そして、
全ての文字（シンボル）の符号化が終了したかをチェッ
クし（ステップＱ１２）、終了していない場合は（ステ
ップＱ１２のＮＯルート）、処理はステップＱ２に戻っ
て、全ての文字の符号化が終了するまで処理を繰り返
す。Furthermore, after encoding the symbol K, the context P ₀
The symbol K is additionally registered to update the context P ₀ (step Q11). (E.g. by adding the symbol K in the context P _0, rejects the oldest character in the context P _0). And
It is checked whether or not all the characters (symbols) have been encoded (step Q12). If not (NO route of step Q12), the process returns to step Q2 to encode all the characters. The process is repeated until the end.

【０３０１】また、上述の図５８における処理におい
て、ＥＳＣのリーフのスプレイ処理を、ＥＳＣとシンボ
ルＫを登録した後に行なっているが、この処理順は逆に
してもよく、この場合の処理ステップは図５９に示すよ
うになる。ここで、この図５９に示すように、ステップ
Ｒ１〜Ｒ３およびステップＲ９〜Ｒ１２においては、そ
れぞれ図５８の処理ステップＱ１〜Ｑ３およびステップ
Ｑ９〜Ｑ１２と同様の処理が行なわれる。Further, in the above-mentioned processing in FIG. 58, the ESC leaf spray processing is performed after the ESC and the symbol K are registered, but the processing order may be reversed, and the processing steps in this case are As shown in FIG. 59. Here, as shown in FIG. 59, in steps R1 to R3 and steps R9 to R12, the same processing as the processing steps Q1 to Q3 and steps Q9 to Q12 of FIG. 58 is performed, respectively.

【０３０２】そして、ステップＲ３で文脈Ｐにシンボル
Ｋが登録されていない場合に、ＥＳＣの符号を出力し
（ステップＲ４）、ＥＳＣのリーフを分岐し（ステップ
Ｒ５）、新規リーフにＥＳＣとシンボルＫを登録する
（ステップＲ６）。そして、上述のように登録を行なっ
た後に、ＥＳＣのリーフをスプレイ処理する（ステップ
Ｒ７）。When the symbol K is not registered in the context P in step R3, the ESC code is output (step R4), the ESC leaf is branched (step R5), and the ESC and the symbol K are added to the new leaf. Is registered (step R6). Then, after performing the registration as described above, the leaf of the ESC is sprayed (step R7).

【０３０３】さらに、図５８の処理ステップＱ８と同様
に、文脈Ｐを変更し（ステップＲ１２）、ステップＲ３
でシンボルＫを検出するまで処理を繰り返す。例えば、
符号木が、図６０（ａ）に示すように、文字（シンボ
ル）Ａ〜Ｅがすでに符号化され登録されている状態にあ
る場合に、上述の処理ステップＲ１〜Ｒ１２（または図
５８中の処理ステップＱ１〜Ｑ１２）において、シンボ
ルＫを文字Ｆとして処理を実行すると、リーフ番号６の
ＥＳＣのリーフが分岐され、図６０（ｂ）に示すよう
に、新たに作成されたリーフ番号１１，１２のリーフに
それぞれＥＳＣと文字Ｆが登録される。Further, as in the processing step Q8 of FIG. 58, the context P is changed (step R12), and the step R3 is executed.
The process is repeated until the symbol K is detected at. For example,
When the code tree is in a state where characters (symbols) A to E have already been coded and registered as shown in FIG. 60A, the above-described processing steps R1 to R12 (or the processing in FIG. 58) are performed. In steps Q1 to Q12), when the process is executed with the symbol K as the character F, the leaf of the ESC with the leaf number 6 is branched, and as shown in FIG. The ESC and the letter F are registered in each leaf.

【０３０４】例えば、以上の処理において用いる文脈が
３次文脈であった場合、シンボルＫが０次（初期状態）
で符号化されたときには、１次，２次，３次の文脈にそ
れぞれ登録される（以下、この全ての次数に登録を行な
う方法を全登録型という）。また、図５８および図５９
に示したステップＱ１〜Ｑ１２およびステップＲ１〜Ｒ
１２は、上述したように、ＥＳＣを符号化する毎にシン
ボルＫを全ての次数の文脈に登録する処理のステップを
示すが、シンボルＫが符号化される直前に符号化（登
録）された次数の１つの文脈にのみ、シンボルＫを登録
するように処理を行なってもよく、この場合の処理のフ
ローチャートは、図６１に示すようになる。For example, when the context used in the above processing is the third-order context, the symbol K is the 0th-order (initial state).
When encoded with, the information is registered in each of the first-order, second-order, and third-order contexts (hereinafter, the method of registering in all the orders is referred to as the full registration type). 58 and 59.
Steps Q1 to Q12 and steps R1 to R shown in
As described above, reference numeral 12 denotes a step of the process of registering the symbol K in the context of all the orders each time the ESC is coded. However, the coded (registered) order immediately before the symbol K is coded. The processing may be performed so that the symbol K is registered only in one context of the above, and the processing flowchart in this case is as shown in FIG. 61.

【０３０５】即ち、この図６１に示す処理では、まず、
文脈Ｐ₀を０に初期化し（ステップＴ１）、文脈（変
数）Ｐには文脈Ｐ₀を入力し、文脈（変数）Ｘには０を
入力する（ステップＴ２）。さらに、文脈Ｐにシンボル
Ｋが登録されているか検索し（ステップＴ３）、文脈Ｐ
にシンボルＫが登録されていない場合は、前述のステッ
プＱ４，Ｑ５（図５８参照）と同様の処理を行う（ステ
ップＴ３のＮＯルートからステップＴ４，Ｔ５）。That is, in the processing shown in FIG. 61, first,
The context P ₀ is initialized to 0 (step T1), the context P ₀ is input to the context (variable) P, and the context P is input to 0 (step T2). Furthermore, it is searched whether the symbol K is registered in the context P (step T3), and the context P
If the symbol K is not registered in, the processing similar to the above-mentioned steps Q4 and Q5 (see FIG. 58) is performed (from NO route of step T3 to steps T4 and T5).

【０３０６】その後、文脈Ｘに文脈Ｐを入力し（ステッ
プＴ６）、文脈Ｐを変更して（ステップＴ７）、文脈Ｐ
にシンボルＫが登録されていることを検出するまで処理
を繰り返す。一方、上述のステップＴ３において、文脈
ＰにシンボルＫが登録されている場合は、前述のステッ
プＱ９〜Ｑ１１（図５８参照）、またはステップＲ９〜
Ｒ１１（図５９参照）と同様の処理を行なう（ステップ
Ｔ８〜Ｔ１０）。Then, the context P is input to the context X (step T6), the context P is changed (step T7), and the context P is changed.
The process is repeated until it is detected that the symbol K is registered in. On the other hand, when the symbol K is registered in the context P in the above step T3, the above-mentioned steps Q9 to Q11 (see FIG. 58) or step R9 to.
Processing similar to that of R11 (see FIG. 59) is performed (steps T8 to T10).

【０３０７】その後、文脈ＸのＥＳＣのリーフを分岐し
（ステップＴ１１）、この分岐された新規の２つのリー
フにそれぞれＥＳＣとシンボルＫを登録する（ステップ
Ｔ１２）。さらに、全ての文字（シンボル）の符号化が
終了したか否かをチェックし（ステップＴ１３）、終了
していなければステップＴ２以降の処理を繰り返し（ス
テップＴ１３のＮＯルート）、終了していれば符号化処
理を終了する（ステップＴ１３のＹＥＳルート９）。After that, the leaf of the ESC of the context X is branched (step T11), and the ESC and the symbol K are respectively registered in the two branched new leaves (step T12). Further, it is checked whether or not all characters (symbols) have been encoded (step T13). If not completed, the processes after step T2 are repeated (NO route of step T13), and if completed. The encoding process ends (YES route 9 in step T13).

【０３０８】このように処理を行なうことにより、すべ
ての次数の文脈ＰにシンボルＫを登録するのではなく、
シンボルＫを符号化した直前の次数の文脈（文脈Ｘ）に
のみ登録を行なうことができる（以下、この直前の次数
の文脈にのみシンボルの登録を行なう方法を逐次登録型
という）。このように、本発明の一実施形態にかかるデ
ータ圧縮装置によれば、文脈ＰにシンボルＫが登録され
ていないとき、エスケープコード（ＥＳＣ）の分岐・登
録の符号長を（分割前のリーフの符号長＋１）ビット、
新規登録した符号及びエスケープコードの符号長を最小
で２ビットにすることにより、エスケープコードを比較
的多く出力するようなデータである場合、または辞書登
録（符号木へのシンボルの登録）が十分でない初期の段
階などにおいて高い符号化率が得られる効果がある。By performing the processing in this way, the symbol K is not registered in the contexts P of all orders, but
Registration can be performed only in the context (context X) of the immediately preceding order in which the symbol K is encoded (hereinafter, the method of registering a symbol only in the context of the immediately preceding order is referred to as a sequential registration type). As described above, according to the data compression apparatus according to the embodiment of the present invention, when the symbol K is not registered in the context P, the code length of branch / registration of the escape code (ESC) is set to Code length + 1) bits,
By setting the code length of the newly registered code and escape code to a minimum of 2 bits, if the data is such that a relatively large number of escape codes are output, or dictionary registration (symbol registration in the code tree) is not sufficient. There is an effect that a high coding rate can be obtained in the initial stage.

【０３０９】また、上述のようなシンボルの新規登録の
前にスプレイ処理を行なえば、符号及びエスケープコー
ドの符号長を最小で２ビット、新規登録後にスプレイ処
理を行なえば、エスケープコードの符号を最小で１ビッ
トにすることができるので、さらに符号化効率が大幅に
向上する効果がある。さらに、上述の逐次登録型の場
合、符号木へのシンボルの新規登録は、常に１つずつ行
なわれ、同じシンボルが２度３度と出現することによっ
て２次３次の登録が行なわれるので、常に符号木の高次
には再現性を持ったシンボルのみが登録されることにな
り、符号木に登録はしたが実際には使われていないシン
ボルが存在するために生じる符号木の符号化効率の低下
を防止し、十分に辞書登録がなされた後の符号化効率が
大幅に向上する効果もある。If the spray process is performed before the new registration of the symbol as described above, the code length of the code and the escape code is at least 2 bits, and if the spray process is performed after the new registration, the code of the escape code is minimized. Since it can be set to 1 bit, there is an effect that the coding efficiency is further improved. Furthermore, in the case of the above-described sequential registration type, new registration of symbols in the code tree is always performed one by one, and the same symbol appears twice and three times, so that secondary and tertiary registration is performed. Only the symbols with reproducibility are always registered in the higher order of the code tree, and the coding efficiency of the code tree that occurs because there are symbols that are registered in the code tree but are not actually used Is also prevented, and the coding efficiency after the dictionary is sufficiently registered is greatly improved.

【０３１０】また、逐次登録型は全登録型よりも使用す
るメモリ容量（辞書容量）が少ないという利点もある。（２）復元側の説明図６２は、本発明の一実施形態にかかるデータ復元方法
を実施するためのデータ復元装置４（図３２参照）の内
部の構成例を示す図であり、この図６２において、４０
１Ｂは符号木保持部、４０２Ｂは文脈木保持部、４０３
Ｂは文脈変更部、４０４Ｂは復号部、４０７Ｂは符号登
録部、４０８Ｂは文脈登録部、４０９Ｂはラッチ、４２
１Ｂは文脈保持部である。Further, the sequential registration type has an advantage in that it uses less memory capacity (dictionary capacity) than all registration types. (2) Explanation of Restoration Side FIG. 62 is a diagram showing an internal configuration example of the data restoration device 4 (see FIG. 32) for carrying out the data restoration method according to the embodiment of the present invention. At 40
1B is a code tree holding unit, 402B is a context tree holding unit, 403
B is a context changing unit, 404B is a decoding unit, 407B is a code registration unit, 408B is a context registration unit, 409B is a latch, and 42
1B is a context holding unit.

【０３１１】ここで、符号木保持部（符号木保持手段）
４０１Ｂは、予めエスケープコード（ＥＳＣ）（データ
未登録を示すデータ）を登録した符号木を保持するもの
であり、文脈木保持部（文脈木保持手段）４０２Ｂは、
復号したシンボル（データ）と文脈との組み合わせを登
録した文脈木を保持するものである。また、文脈変更部
４０３Ｂは、文脈木保持部４０２Ｂに保持されている文
脈木を検索し、到達したリーフがエスケープコードであ
った場合、文脈を変更するものであり、復号部４０４Ｂ
は、符号化側で符号化されたシンボルＫが入力されたと
き、このシンボルＫの符号に従って、符号木保持部４０
１Ｂに保持されている符号木のルート（頂点）からリー
フ（データ格納点）へと走査してシンボルＫの符号を復
号するものである。Here, a code tree holding unit (code tree holding means)
401B holds a code tree in which an escape code (ESC) (data indicating that data has not been registered) is registered in advance. The context tree holding unit (context tree holding means) 402B is
It holds a context tree in which a combination of a decoded symbol (data) and a context is registered. Further, the context changing unit 403B searches the context tree held in the context tree holding unit 402B, and changes the context when the leaf that arrived is an escape code, and the decoding unit 404B.
When the symbol K coded on the coding side is input, the code tree holding unit 40 follows the code of the symbol K.
The code of the symbol K is decoded by scanning from the root (vertex) of the code tree held in 1B to the leaf (data storage point).

【０３１２】さらに、符号変更部４０６Ｂは、復号した
シンボルＫ及びエスケープコードのリーフを他のリーフ
あるいは分岐点としてのノードと組み替えるものであ
る。また、符号登録部（符号登録手段）４０７Ｂは、符
号木決定手段としての機能を兼ねており、シンボルＫを
復号する直前までに復号したシンボルから、シンボルＫ
の符号が登録されている符号木保持部４０１Ｂ内の符号
木を決定するとともに、エスケープコードを復号したと
き、この符号木に登録さているエスケープコードのリー
フを分岐して新たにリーフを作成し、復号したシンボル
Ｋをこのリーフに新規に登録するものである。Further, the code changing unit 406B replaces the leaf of the decoded symbol K and escape code with another leaf or a node as a branch point. Further, the code registration unit (code registration means) 407B also has a function as a code tree determination means, and the symbols K are decoded from the symbols decoded up to immediately before decoding the symbol K.
The code tree in the code tree holding unit 401B in which the code is registered is determined, and when the escape code is decoded, the leaf of the escape code registered in this code tree is branched to create a new leaf, The decoded symbol K is newly registered in this leaf.

【０３１３】このため、符号登録部４０７Ｂおよび上述
の符号木保持部４０１は、それぞれ符号化側における符
号登録部３２２Ｂおよび符号木保持部３０１Ｂ（図５２
参照）の内部構成（図５３参照）と同様の構成を有して
いる。また、文脈登録部４０８Ｂは、符号登録部４０７
Ｂで登録したシンボルＫを文脈木保持部４０２Ｂに保持
されている文脈木に登録するものであり、ラッチ４０９
Ｂは、復号部４０４Ｂで復号されたシンボルＫを一旦保
持しておくものであり、文脈保持部４２１Ｂは、復号さ
れたシンボルＫを保持するものである。Therefore, the code registration unit 407B and the above-described code tree holding unit 401 respectively include the code registration unit 322B and the code tree holding unit 301B (see FIG. 52) on the coding side.
(See FIG. 53). In addition, the context registration unit 408B is the code registration unit 407.
The symbol K registered in B is registered in the context tree held in the context tree holding unit 402B.
B temporarily holds the symbol K decoded by the decoding unit 404B, and the context holding unit 421B holds the decoded symbol K.

【０３１４】また、上述の符号木保持部４０１Ｂによ
り、予めエスケープコードを登録した符号木を保持する
ことができ、文脈木保持部４０２Ｂにより、復号したシ
ンボルＫと文脈Ｐとの組み合わせを登録した文脈木を保
持できるようになっている。さらに、文脈変更部４０３
Ｂにより、到達したリーフがエスケープコードであった
場合、文脈を変更することができ、復号部４０４Ｂによ
り、符号化されたシンボルＫの符号に従って符号木のル
ートからリーフへと走査してシンボルＫの符号を復号で
きるようになっている。The code tree holding unit 401B described above can hold a code tree in which escape codes are registered in advance, and the context tree holding unit 402B registers a context in which a combination of the decoded symbol K and context P is registered. It can hold trees. Furthermore, the context changing unit 403
When the leaf that has arrived is an escape code by B, the context can be changed, and the decoding unit 404B scans from the root of the code tree to the leaf according to the code of the encoded symbol K to determine the symbol K. The code can be decoded.

【０３１５】また、符号木決定手段を兼ねる符号変更部
４０６により、直前までに復号したシンボルからシンボ
ルＫの符号が保持されている符号木を決定することがで
き、復号したシンボルＫ及びエスケープコードのリーフ
を他のリーフあるいはノードと組み替えることができ
る。さらに、符号登録部４０７Ｂにより、エスケープコ
ードを復号したとき、エスケープコードのリーフを分岐
して新たにリーフを作成し、復号したシンボルＫをこの
リーフに新規に登録することができる。Further, the code changing unit 406, which also functions as code tree determining means, can determine the code tree in which the code of the symbol K is held from the symbols decoded up to immediately before, and the code tree of the decoded symbol K and the escape code can be determined. A leaf can be recombined with other leaves or nodes. Further, when the escape code is decoded by the code registration unit 407B, a leaf of the escape code can be branched to create a new leaf, and the decoded symbol K can be newly registered in this leaf.

【０３１６】また、文脈登録部４０８Ｂにより、符号登
録部４０７Ｂで登録したシンボルＫを文脈木保持部４０
２Ｂの文脈木に登録することができ、ラッチ４０９Ｂに
より、復号部２０４Ｂで復号したシンボルＫを一旦保持
することができ、文脈変保持部４２１により、復号部２
０４Ｂで復号したシンボルＫを保持することができる。Further, the context registration unit 408B converts the symbol K registered by the code registration unit 407B into the context tree holding unit 40.
2B context tree, the latch 409B can temporarily hold the symbol K decoded by the decoding unit 204B, and the context change holding unit 421 can hold the decoding unit 2B.
It is possible to hold the symbol K decoded in 04B.

【０３１７】そして、上述の動作について、符号化側の
説明と同様に、図６３および図６４に示す復元側の動作
を示す処理ステップＵ１〜Ｕ１４を参照しながら、さら
に詳述する。まず、図６３に示すように、文脈保持部４
２１Ｂはそれまでに復号したシンボル（文脈）を保持
し、文脈変更部４０３Ｂに出力し（ステップＵ１）、文
脈変更部４０３Ｂは、最初は送られてきた文脈をそのま
ま文脈木保持部４０２Ｂに出力する（ステップＵ２）。Then, the above-mentioned operation will be described in more detail with reference to the processing steps U1 to U14 showing the operation on the restoration side shown in FIGS. 63 and 64, similarly to the description on the encoding side. First, as shown in FIG. 63, the context holding unit 4
21B holds the symbol (context) decoded up to that point and outputs it to the context changing unit 403B (step U1), and the context changing unit 403B outputs the initially sent context as it is to the context tree holding unit 402B. (Step U2).

【０３１８】また、文脈木保持部４０２は、文脈変更部
４０３Ｂから送られてきた文脈のＩＤ、すなわちルート
ＩＤを復号部４０４Ｂに出力する（ステップＵ３）。こ
こで、復号部４０４Ｂでは、送られてきたルートＩＤに
対して符号（１ビット）が、例えば“１”ならば右下、
“０”ならば左下に位置するノード（あるいはリーフ）
ＩＤを、符号木保持部４０１Ｂに要求する（ステップＵ
４）。The context tree holding unit 402 also outputs the context ID sent from the context changing unit 403B, that is, the root ID, to the decoding unit 404B (step U3). Here, in the decoding unit 404B, if the code (1 bit) for the route ID sent is, for example, "1", the lower right corner,
If it is "0", the node (or leaf) located at the lower left
Request the ID to the code tree holding unit 401B (step U
4).

【０３１９】また、符号木保持部４０１Ｂは、要求され
たノード（あるいはリーフ）のノードＩＤを復号部４０
４Ｂに返信する（ステップＵ５）。そして、復号部４０
４Ｂ及び符号木保持部４０１Ｂは、符号木の終端である
リーフのリーフＩＤを得るまで上述の処理を繰り返す。
すなわち、復号部４０４Ｂは、符号化側で符号化された
符号に従って、復号するシンボルＫが登録されているリ
ーフに到達するまで、符号木保持部４０１Ｂの符号木を
辿ってゆく。Also, the code tree holding unit 401B decodes the node ID of the requested node (or leaf) into the decoding unit 40.
4B (step U5). Then, the decryption unit 40
4B and the code tree holding unit 401B repeat the above processing until the leaf ID of the leaf that is the end of the code tree is obtained.
That is, the decoding unit 404B follows the code tree of the code tree holding unit 401B according to the code encoded on the encoding side until the symbol K to be decoded reaches the registered leaf.

【０３２０】そして、目的のリーフが発見されると、復
号部４０４Ｂは、このリーフを復号し、符号変更部４０
６Ｂが、この復号したリーフを符号化側と同様にスプレ
イ処理を行なって符号長を更新する。ここで、この処理
で復号されたシンボルがＥＳＣであった場合、復号部４
０４Ｂは、このシンボル（ＥＳＣ）をラッチに送り（ス
テップＵ６）、ラッチはこのシンボルを一旦保持して、
文脈変更部４０３Ｂに送る（ステップＵ７）。文脈変更
部４０３Ｂは符号化側と同様の文脈変更処理を行なっ
て、再度復号を行なう。When the target leaf is found, the decoding unit 404B decodes this leaf and the code changing unit 40
6B performs a spray process on the decoded leaf in the same manner as the encoding side to update the code length. Here, when the symbol decoded by this processing is ESC, the decoding unit 4
04B sends this symbol (ESC) to the latch (step U6), and the latch temporarily holds this symbol,
It is sent to the context changing unit 403B (step U7). The context changing unit 403B performs the same context changing process as on the encoding side and performs decoding again.

【０３２１】そして、上述の処理で復号部４０４Ｂが復
号したシンボルが、ＥＳＣ以外、すなわちシンボルＫで
あるときは、図６４に示すように、復号部４０４Ｂは、
ラッチ４０９Ｂ，文脈保持部４２１Ｂを介して、この復
号したシンボルＫを文脈登録部４０８Ｂに送り（ステッ
プＵ８〜Ｕ１１）、文脈登録部４０８Ｂは文脈木保持部
４０２ＢにこのシンボルＫを新規に登録する。When the symbol decoded by the decoding unit 404B in the above processing is other than ESC, that is, the symbol K, the decoding unit 404B, as shown in FIG.
The decoded symbol K is sent to the context registration unit 408B via the latch 409B and the context holding unit 421B (steps U8 to U11), and the context registration unit 408B newly registers this symbol K in the context tree holding unit 402B.

【０３２２】また、符号登録部４０７Ｂは、文脈木保持
部４０７ＢからシンボルＫが登録された文脈Ｐのルート
ＩＤを受けて（ステップＵ１２）、このルートＩＤを符
号木保持部４０１Ｂに送る（ステップＵ１３）。符号木
保持部４０１Ｂでは、送られてきた文脈ＰのルートＩＤ
と同じルートＩＤをもつ符号木のルートＩＤを、符号登
録部４０７Ｂに返信し（ステップＵ１４）、符号登録部
４０７Ｂは、このルートＩＤをもつ符号木に、シンボル
Ｋの新規符号を登録する（ステップＵ１３と同じ経
路）。Further, the code registration unit 407B receives the root ID of the context P in which the symbol K is registered from the context tree holding unit 407B (step U12), and sends this root ID to the code tree holding unit 401B (step U13). ). In the code tree holding unit 401B, the root ID of the sent context P
The root ID of the code tree having the same root ID as the above is returned to the code registration unit 407B (step U14), and the code registration unit 407B registers the new code of the symbol K in the code tree having this root ID (step U14). Same route as U13).

【０３２３】なお、上述の復元側の符号登録部４０７Ｂ
が符号木保持部４０１ＢにシンボルＫを登録する処理
は、符号化側における符号登録部３２２Ｂが符号木保持
部３０１ＢにシンボルＫを登録する処理と同様である。
このため、符号化側の登録処理が、符号化側の説明にお
いて図５８および図５９にて示したような全登録型の場
合は、復元側においてもＥＳＣを復号した全ての文脈に
登録する全登録型となり、図６１にて示したような逐次
登録型の場合は、復元側においても最後に復号されたＥ
ＳＣの文脈にそれぞれシンボルＫの登録を行なう逐次登
録型となる。The code registration unit 407B on the restoration side described above.
The process of registering the symbol K in the code tree holding unit 401B is similar to the process of registering the symbol K in the code tree holding unit 301B by the code registration unit 322B on the encoding side.
Therefore, when the registration process on the encoding side is the full registration type as shown in FIGS. 58 and 59 in the description of the encoding side, the restoration side also registers all the ESCs in all the decoded contexts. In the case of the registration type, and in the case of the sequential registration type as shown in FIG.
It is a sequential registration type in which the symbol K is registered in the context of SC respectively.

【０３２４】そして、符号化側の説明と同様に、上述の
復元側の処理は、図６５に示すようなフローチャートで
表すことができる。即ち、文脈（変数）Ｐ₀に最大次数
の数値を入力して初期化し（ステップＶ１）、この文脈
Ｐ₀を文脈Ｐに入力する（ステップＶ２）。つまり、符
号を復号する場合は、まず最大次数の文脈を用いて処理
を行なう。Then, similar to the description on the encoding side, the above-mentioned processing on the restoration side can be represented by the flowchart shown in FIG. That is, the numerical value of the maximum degree is input to the context (variable) P ₀ for initialization (step V1), and this context P ₀ is input to the context P (step V2). That is, when decoding the code, first, the processing is performed using the context of the maximum degree.

【０３２５】そして、この最大次数の文脈Ｐに対応する
符号木において、リーフに登録されている符号を復号す
る（ステップＶ３）。さらに、この復号した符号がシン
ボルであるかをチェックし（ステップＶ４）、復号した
符号がシンボルでない、すなわちＥＳＣであった場合
は、符号化側と同様に、復号したＥＳＣのリーフをスプ
レイ処理し（ステップＶ４のＮＯルートからステップＶ
５）、文脈Ｐ内の最も高次のシンボル（最も古いシンボ
ル）を棄却して、文脈を１つ低次に移して変更し（ステ
ップＶ６）、ステップ２に戻る。すなわち、ＥＳＣ以外
のシンボル（シンボルＫ）を復号するまで、文脈Ｐを最
高次数から１ずつ減じた文脈に変更してシンボルＫが登
録されている文脈Ｐを検索する。Then, in the code tree corresponding to the context P of the maximum degree, the code registered in the leaf is decoded (step V3). Furthermore, it is checked whether the decoded code is a symbol (step V4), and if the decoded code is not a symbol, that is, ESC, the leaf of the decoded ESC is spray-processed as in the encoding side. (From NO route of step V4 to step V
5) The highest-order symbol (oldest symbol) in the context P is rejected, the context is moved to the next lower order to be changed (step V6), and the process returns to step 2. That is, until the symbol (symbol K) other than the ESC is decoded, the context P is changed to the context obtained by subtracting 1 from the highest order, and the context P in which the symbol K is registered is searched.

【０３２６】一方、復号した符号がシンボルであった場
合は、シンボルＫが復号されたことになるので、このシ
ンボルＫを出力し（ステップＶ４のＹＥＳルートからス
テップＶ７）、このシンボルＫリーフを符号化側と同様
にしてスプレイ処理してシンボルＫの符号長を短く（更
新）する（ステップＶ８）。さらに、文脈Ｐ₀にシンボ
ルＫを追加登録し（ステップＶ９）、ＥＳＣは棄却す
る。なお、この文脈Ｐ₀の変更も符号化側と同一の処理
を行なう。On the other hand, if the decoded code is a symbol, it means that the symbol K has been decoded, so this symbol K is output (YES route from step V4 to step V7), and this symbol K leaf is coded. The code length of the symbol K is shortened (updated) by performing the spray process in the same manner as the digitization side (step V8). Further, the symbol K is additionally registered in the context P ₀ (step V9), and the ESC is rejected. It should be noted that the process of changing the context P ₀ is the same as that on the encoding side.

【０３２７】さらに、復号したシンボルＫについて、符
号化側と同一の登録方法、例えば、符号化側のシンボル
Ｋの登録方法が全登録型なら全登録型により、ＥＳＣの
リーフを分岐して、新規シンボルＫの登録を行なう（ス
テップＶ１０）。そして、入力された全ての符号の復号
が終了したかチェックし（ステップＶ１１）、終了して
いない場合は（ステップＶ１１のＮＯルート）、処理は
ステップＶ２に戻って、全ての符号の復号が終了するま
で処理を繰り返す。Further, regarding the decoded symbol K, the same registration method as that on the encoding side, for example, if the registration method of the symbol K on the encoding side is the full registration type, the leaf of the ESC is branched according to the full registration type, and The symbol K is registered (step V10). Then, it is checked whether the decoding of all the inputted codes has been completed (step V11), and if not completed (NO route of step V11), the process returns to step V2 and the decoding of all the codes is completed. The process is repeated until

【０３２８】以上のように、本発明の一実施形態にかか
るデータ復元装置によれば、符号化側と同様に、シンボ
ルの新規登録処理において、既に符号木上に存在するリ
ーフを２つに分割してこのリーフの符号長を（分割前の
リーフの符号長＋１）ビットにすることにより、新規登
録したシンボルＫの符号及びエスケープコードの符号長
を最小で２ビットにすることができるので、エスケープ
コードを比較的多く復号するようなデータである場合、
または辞書登録（符号木へのシンボルの登録）が十分で
ない初期の段階などにおいて、データの復号効率が大幅
に向上する効果がある。As described above, according to the data decompression apparatus according to the embodiment of the present invention, similarly to the encoding side, in the symbol new registration processing, the leaf already existing on the code tree is divided into two. Then, by setting the code length of this leaf to (the code length of the leaf before division + 1) bits, the code length of the newly registered symbol K and the escape code can be set to a minimum of 2 bits. If the data is such that a relatively large number of codes are decoded,
Alternatively, there is an effect that the data decoding efficiency is significantly improved in the initial stage where the dictionary registration (the registration of the symbol in the code tree) is not sufficient.

【０３２９】また、上述のようなシンボルの新規登録の
前にスプレイ処理を行なえば、復号したシンボルの符号
及びエスケープコードの符号長を最小で２ビット、新規
登録後にスプレイ処理を行なえば、エスケープコードの
符号を最小で１ビットにすることができるので、さらに
データの復号効率が大幅に向上する効果がある。さら
に、上述のように、復元側が符号化側と同一の登録処理
を行なうことで、符号を正確に復号することができる利
点がある。（ｃ−１）本実施形態の第１の変形例の説明本実施形態の第１の変形例にかかるデータ圧縮装置及び
データ復元装置においても、上述の実施形態と同様に、
図３２に示すデータ圧縮方法及びデータ復元方法を実施
するためのものである。If the spray process is performed before the new registration of the symbol as described above, the code length of the decoded symbol and the escape code is at least 2 bits, and if the spray process is performed after the new registration, the escape code is generated. Since the code of can be set to 1 bit at the minimum, there is an effect that the decoding efficiency of data is significantly improved. Furthermore, as described above, there is an advantage that the decoding side can perform the correct decoding by performing the same registration processing as the encoding side. (C-1) Description of First Modified Example of Present Embodiment Also in the data compression apparatus and the data decompression apparatus according to the first modified example of the present embodiment, as in the above-described embodiment,
This is for implementing the data compression method and the data decompression method shown in FIG.

【０３３０】また、上述した実施形態と同様に、データ
圧縮装置３を符号化側、データ復元装置４を復元側とし
て以下に説明する。（１）符号化側の説明符号化側の構成は、図５２にて前述したものの構成と同
様である。Similar to the above-described embodiment, the data compression device 3 will be described on the encoding side and the data decompression device 4 will be described on the decompression side. (1) Description of Encoding Side The configuration on the encoding side is the same as that described above with reference to FIG.

【０３３１】また、本実施形態における図５２に示す符
号登録部３２２Ｂには、符号木保持手段３０１Ｂに保持
されている符号木上の最長の符号長のリーフにシンボル
を登録するために、図６６に示すように、新規ノードＩ
Ｄ発生部６１，ラッチ６２，親子情報更新部６３及び最
長符号検出部（分岐位置検索手段）６９が設けられてお
り、これに対応して、符号木保持部３０１Ｂには、内部
節点（ノードＩＤ）保持部６５，符号木管理部６７及び
外部節点／ＥＳＣ−ＩＤ（リーフＩＤ）保持部６８が設
けられている。Further, in the code registration unit 322B shown in FIG. 52 in this embodiment, in order to register a symbol in the leaf of the longest code length on the code tree held in the code tree holding means 301B, FIG. , The new node I
A D generation unit 61, a latch 62, a parent-child information update unit 63, and a longest code detection unit (branch position search means) 69 are provided. Correspondingly, the code tree holding unit 301B has internal nodes (node IDs). ) A holding unit 65, a code tree managing unit 67, and an external node / ESC-ID (leaf ID) holding unit 68 are provided.

【０３３２】ここで、符号登録部３２２Ｂの内部におい
て、新規ノードＩＤ発生部６１は、文脈木保持部３０２
Ｂから更新信号を受けて２つの新規ノードＩＤ（ＩＤ−
１，ＩＤ−２）を発生するものであり、ラッチ６２は、
最長符号検出部６９が検出したリーフＩＤをラッチする
ものである。また、親子情報更新部６３は、処理対象の
ノードの上位のノードＩＤと右下に位置する下位ノード
のノードＩＤおよび左下に位置する下位ノードのノード
ＩＤの３つ情報（ＥＳＣ−ＩＤ，ＩＤ−１，ＩＤ−２）
からなる親子情報を受けてこの親子情報を変更し、符号
木保持部３０１Ｂに送るものである。Inside the code registration unit 322B, the new node ID generation unit 61 has the context tree holding unit 302.
Upon receiving the update signal from B, two new node IDs (ID-
1, ID-2) is generated, and the latch 62 is
The leaf ID detected by the longest code detector 69 is latched. Further, the parent-child information updating unit 63 has three pieces of information (ESC-ID, ID-) of the upper node ID of the processing target node, the node ID of the lower node located at the lower right, and the node ID of the lower node located at the lower left. 1, ID-2)
It receives the parent-child information consisting of, changes the parent-child information, and sends it to the code tree holding unit 301B.

【０３３３】最長符号検出部（分岐位置検索手段）６９
は、符号木保持部３０１Ｂから符号木のノードＩＤを得
て、その符号木の中で最長の符号長をもつリーフのＩＤ
（ＩＤ−０）を検出するものである。また、符号木保持
部３０１Ｂの内部において、内部節点（ノードＩＤ）保
持部６５は、符号木のノードＩＤを保持するものであ
り、外部節点／ＥＳＣ−ＩＤ（リーフＩＤ）保持部６８
は、符号木のリーフＩＤを保持するものである。Longest code detection unit (branch position search means) 69
Is a leaf ID having the longest code length in the code tree obtained from the code tree holding unit 301B.
(ID-0) is detected. In addition, inside the code tree holding unit 301B, the internal node (node ID) holding unit 65 holds the node ID of the code tree, and the external node / ESC-ID (leaf ID) holding unit 68.
Holds the leaf ID of the code tree.

【０３３４】符号木管理部６７は、文脈木保持部３０２
Ｂから文脈ＩＤを受けて、この文脈ＩＤを、内部節点
（ノードＩＤ）保持部６５および外部節点／ＥＳＣ−Ｉ
Ｄ（リーフＩＤ）保持部６８に送るものである。符号登
録部３２２Ｂおよび符号木保持部３０１Ｂが上述の構成
を有することにより、符号登録部３２２Ｂが符号木保持
部３０１Ｂに保持されている符号木上の最長の符号長を
もつリーフを検索し、エスケープコードを符号化した
後、検索した最長の符号長をもつリーフを分岐してシン
ボルＫを新規に登録する。The code tree management unit 67 has a context tree holding unit 302.
Upon receiving the context ID from B, the context ID is stored in the internal node (node ID) holding unit 65 and the external node / ESC-I.
It is sent to the D (leaf ID) holding unit 68. Since the code registration unit 322B and the code tree holding unit 301B have the above-described configurations, the code registration unit 322B searches for the leaf having the longest code length on the code tree held in the code tree holding unit 301B and escapes. After encoding the code, the leaf having the longest code length searched is branched and a symbol K is newly registered.

【０３３５】さらに、上述の処理について、図６７に示
すフローチャートの処理ステップＷ１〜Ｗ１３を参照し
ながら詳述する。まず、シンボルＫが含まれている文脈
を検索するために、文脈木保持部３０２Ｂに保持されて
いる文脈（文脈Ｐ）を選択し（ステップＷ１〜Ｗ２）、
この文脈ＰにシンボルＫが登録されているかをチェック
し（ステップＷ３）、登録されている場合は、図５８に
て前述したステップＱ９〜Ｑ１２と同様の処理を行なう
（ステップＷ９〜Ｗ１２）。Further, the above-mentioned processing will be described in detail with reference to the processing steps W1 to W13 of the flowchart shown in FIG. First, in order to search the context including the symbol K, the context (context P) held in the context tree holding unit 302B is selected (steps W1 and W2),
It is checked whether or not the symbol K is registered in this context P (step W3), and if it is registered, the same processing as steps Q9 to Q12 described above with reference to FIG. 58 is performed (steps W9 to W12).

【０３３６】一方、文脈ＰにシンボルＫが登録されてい
ない場合は、同じく図５８のステップＱ４〜Ｑ５と同様
に、符号化部３０６ＢがＥＳＣの符号を出力し（ステッ
プＷ４）、符号変更部３０７Ｂが、符号木保持部３０１
Ｂに保持されている符号木のＥＳＣのリーフをスプレイ
処理する（ステップＷ５）。その後、符号木保持部３０
１Ｂは、文脈ＰのＩＤ（ルートＩＤ）を文脈木保持部３
０２Ｂから受けて、この文脈ＰのノードＩＤと親子情報
とを最長符号検出部６９に送る。On the other hand, when the symbol K is not registered in the context P, the encoding unit 306B outputs the ESC code (step W4) and the code changing unit 307B as in steps Q4 to Q5 of FIG. Is the code tree holding unit 301
The leaf of the ESC of the code tree held in B is sprayed (step W5). After that, the code tree holding unit 30
1B uses the ID (root ID) of the context P as the context tree holding unit 3
02B, the node ID of this context P and the parent-child information are sent to the longest code detection unit 69.

【０３３７】最長符号検出部６９では、親子情報から最
長の符号長を持つリーフＸ（ｐ）のＩＤ（ＩＤ−０）を
検出する（ステップＷ６）。そして、検出したリーフＸ
（ｐ）のＩＤ、ＩＤ−０をリーフＩＤ保持部６８に送
り、このＩＤ−０とＩＤ−０に格納してあるシンボルを
ラッチ７０でラッチする。The longest code detecting section 69 detects the ID (ID-0) of the leaf X (p) having the longest code length from the parent-child information (step W6). And the detected leaf X
The ID of (p), ID-0, is sent to the leaf ID holding unit 68, and the symbols stored in the ID-0 and ID-0 are latched by the latch 70.

【０３３８】また、新規ノードＩＤ発生部７１は、２つ
の新規ノードＩＤ（ＩＤ−１，ＩＤ−２）を発生し、親
子情報更新部６３は、３つのＩＤ（ＩＤ−０，ＩＤ−
１，ＩＤ−２）を受けて親子情報を更新し、符号木保持
部３０１ＢのノードＩＤ保持部６５に登録する。一方、
符号木保持部３０１ＢのリーフＩＤ保持部６８には登録
シンボルＫと新規ＩＤ（ＩＤ−２），ＩＤ−０に登録し
てあったシンボルとＩＤ−１を新規リーフとしてそれぞ
れ登録する（ステップＷ７）。Further, the new node ID generating section 71 generates two new node IDs (ID-1, ID-2), and the parent-child information updating section 63 has three IDs (ID-0, ID-).
1, ID-2), the parent-child information is updated and registered in the node ID holding unit 65 of the code tree holding unit 301B. on the other hand,
In the leaf ID holding unit 68 of the code tree holding unit 301B, the registered symbol K and the new ID (ID-2), and the symbol registered in ID-0 and the ID-1 are respectively registered as new leaves (step W7). .

【０３３９】上述の処理を行なうことにより、最長の符
号長であったリーフはノードとなり、このノードの下に
２つの新規リーフが登録される。そして、文脈Ｐを変更
して（ステップＷ８）、文脈ＰにシンボルＫが登録され
ていることを検出するまで上述の処理を繰り返す。この
ように、本発明の一実施形態の第１の変形例にかかるデ
ータ圧縮装置によれば、前述した実施形態の符号化側に
て前述した効果に加えて、文脈Ｐの符号木において、最
長の符号長をもつ（ルートからの距離が最も遠い）リー
フＸ（ｐ）を検出し、このリーフＸ（ｐ）を分岐してシ
ンボルＫとＸ（ｐ）に登録されていたシンボルの符号木
への新規登録を行なう。これにより、「最長の符号長」
＝「出現頻度の最も低いシンボル」であるため、符号長
が１ビット伸びたことによる符号化効率の低下を最小限
に抑えることができ、データ圧縮の処理速度が大幅に向
上するとともにデータ圧縮装置の処理負荷も大幅に軽減
できる。By performing the above processing, the leaf having the longest code length becomes a node, and two new leaves are registered under this node. Then, the context P is changed (step W8), and the above processing is repeated until it is detected that the symbol K is registered in the context P. As described above, according to the data compression apparatus of the first modification of the embodiment of the present invention, in addition to the effect described above on the encoding side of the above-described embodiment, in the code tree of the context P, the longest The leaf X (p) having the code length of (the farthest distance from the root) is detected, and this leaf X (p) is branched to the code tree of the symbols registered in the symbols K and X (p). Register new. This gives the "longest code length"
= Since it is the "symbol with the lowest frequency of appearance", it is possible to minimize the decrease in coding efficiency due to the extension of the code length by 1 bit, and the processing speed of data compression is greatly improved and the data compression apparatus is also provided. The processing load of can be greatly reduced.

【０３４０】（２）復元側の説明本実施形態の第１の変形例にかかるデータ復元装置４で
は、図６２にて前述したものの構成と同様の構成を有し
ており、さらに、この復元側における符号登録部４０７
Ｂおよび符号木保持部４０１Ｂは、それぞれ符号化側の
符号登録部３２２Ｂおよび符号木保持部３０１Ｂと同様
の内部構成を有している（図６６参照）。従って、符号
登録部４０７Ｂおよび符号木保持部４０１Ｂは、符号化
側の符号登録部３２２Ｂおよび符号木保持部３０１Ｂと
同様の構成を有しているので、符号登録部４０７Ｂが、
復号したシンボルの符号を符号木保持部４０１Ｂに新規
に登録する処理は、符号化側と同様の処理を行なうこと
になる。このため、符号化側で符号化されたシンボルＫ
を復号する処理は、図６５にて前述した処理（ステップ
Ｖ１〜Ｖ１１）と同様にして行なえばよく、図６５中の
ステップＶ１０においては、符号化側の処理ステップＷ
７，Ｗ８（図６７参照）を行なえばよい。(2) Explanation of Restoration Side The data restoration device 4 according to the first modification of this embodiment has the same configuration as that described above with reference to FIG. Code registration unit 407
B and the code tree holding unit 401B have the same internal configurations as the code registration unit 322B and the code tree holding unit 301B on the coding side, respectively (see FIG. 66). Therefore, the code registration unit 407B and the code tree holding unit 401B have the same configurations as the code registration unit 322B and the code tree holding unit 301B on the encoding side.
The process of newly registering the code of the decoded symbol in the code tree holding unit 401B is the same as the process on the encoding side. Therefore, the symbol K coded on the coding side
65 may be performed in the same manner as the processing (steps V1 to V11) described above with reference to FIG. 65. In step V10 in FIG. 65, the processing step W on the encoding side is performed.
7, W8 (see FIG. 67) may be performed.

【０３４１】このように、本発明の一実施形態の第１の
変形例にかかるデータ復元装置によれば、符号化側と同
様に、文脈Ｐの符号木において最長の符号長をもつ（ル
ートからの距離が最も遠い）リーフＸ（ｐ）を検出し、
このリーフＸ（ｐ）を分岐してシンボルＫとＸ（ｐ）に
登録されていたシンボルの符号木への新規登録を行なう
ことにより、「最長の符号長」＝「出現頻度の最も低い
シンボル」であることから、前述の実施形態の復元側に
おける効果に加えて、符号長が１ビット伸びたことによ
るデータの復号効率の低下を最小限に抑えることがで
き、データ復号の処理速度が大幅に向上するとともにデ
ータ復元装置の処理負荷も大幅に軽減できる効果があ
る。As described above, according to the data decompression apparatus of the first modified example of the embodiment of the present invention, the code tree of the context P has the longest code length (from the root) as in the coding side. The leaf X (p), which is the farthest distance of
By branching this leaf X (p) and newly registering the symbols registered in symbols K and X (p) in the code tree, “longest code length” = “symbol with the lowest occurrence frequency” Therefore, in addition to the effect on the restoration side of the above-described embodiment, it is possible to minimize the decrease in the data decoding efficiency due to the extension of the code length by 1 bit, and to significantly increase the data decoding processing speed. There is an effect that the processing load of the data restoration device can be significantly reduced while being improved.

【０３４２】さらに、上述のように、シンボルの復元側
の登録処理を符号化側の登録処理と同一の処理とするこ
とで、符号化側で符号化されたシンボルの符号を正確に
復号することができる効果がある。（ｃ−２）一実施形態の第２の変形例の説明（１）符号化側の説明本実施形態においても、符号化側の構成は、図５２にて
前述したものの構成と同様である。Further, as described above, by making the symbol decompression side registration processing the same as the encoding side registration processing, it is possible to accurately decode the code of the symbol encoded on the encoding side. There is an effect that can be. (C-2) Description of Second Modification of Embodiment (1) Description of Encoding Side In the present embodiment, the configuration on the encoding side is the same as that described above with reference to FIG. 52.

【０３４３】そして、本実施形態における図５２に示す
符号登録部３２２Ｂには、符号木保持手段３０１Ｂに保
持されている符号木上に新規に登録されたリーフを分岐
してシンボルを登録するために、新規ノードＩＤ発生部
６１，ラッチ６２，親子情報更新部６３及び最新登録Ｉ
Ｄ保持部７０が設けられており、符号木保持部３０１Ｂ
には、外部節点（リーフＩＤ）保持部６４、内部節点
（ノードＩＤ）保持部６５，ＥＳＣ−ＩＤ保持部６６及
び符号木管理部６７が設けられている。Then, in the code registration unit 322B shown in FIG. 52 in this embodiment, in order to register a symbol by branching a leaf newly registered on the code tree held in the code tree holding means 301B. , New node ID generation unit 61, latch 62, parent-child information update unit 63 and latest registration I
The D holding unit 70 is provided, and the code tree holding unit 301B is provided.
Is provided with an external node (leaf ID) holding unit 64, an internal node (node ID) holding unit 65, an ESC-ID holding unit 66, and a code tree management unit 67.

【０３４４】ここで、上述の構成の内、図５３または図
６６にて既述の符号と同じ符号は同じ部分を示すので、
その説明は省略する。本実施形態で新たに設けられてい
る、最新登録ＩＤ保持部（分岐位置保持手段）７０は、
符号木保持部３０１Ｂに保持されている符号木に最後に
（新規に）登録されたリーフのＩＤを保持するものであ
る。Here, in the above-mentioned configuration, the same reference numerals as those already described in FIG. 53 or FIG.
The description is omitted. The latest registration ID holding unit (branch position holding means) 70 newly provided in this embodiment is
This is to hold the ID of the leaf that was last (newly) registered in the code tree held in the code tree holding unit 301B.

【０３４５】符号登録部３２２Ｂおよび符号木保持部３
０１Ｂが上述の構成を有していることにより、符号登録
部３２２Ｂが、符号木保持部３０１Ｂに保持されている
符号木にシンボルが最後に登録された最新のリーフを分
岐して、この新たに作成されたリーフにシンボルＫを新
規に登録することができる。以下に、上述の処理につい
て、図６９の処理ステップＸ１〜Ｘ１３を参照しなが
ら、さらに詳述する。Code registration unit 322B and code tree holding unit 3
Since 01B has the above-mentioned configuration, the code registration unit 322B branches the latest leaf in which the symbol is last registered in the code tree held in the code tree holding unit 301B, and this new The symbol K can be newly registered in the created leaf. The above process will be described in more detail below with reference to process steps X1 to X13 in FIG.

【０３４６】上述の処理は、最後に登録された、最新登
録のリーフのＩＤを最新登録ＩＤ保持部７０に保持し、
そのＩＤを分岐することで新規登録を行なう。まず、シ
ンボルＫが含まれている文脈を検索するために、文脈木
保持部３０２Ｂに保持されている文脈（文脈Ｐ）を選択
し（ステップＸ１〜Ｘ２）、この文脈ＰにシンボルＫが
登録されているかをチェックし（ステップＸ３）、登録
されている場合は、図６７にて前述したステップＷ１０
〜Ｗ１３と同様の処理を行なう（ステップＸ３のＹＥＳ
ルートからステップＸ１０〜Ｘ１３）。In the above process, the latest registered leaf ID registered last is held in the latest registration ID holding unit 70,
New registration is performed by branching the ID. First, in order to search the context including the symbol K, the context (context P) held in the context tree holding unit 302B is selected (steps X1 and X2), and the symbol K is registered in this context P. It is checked (step X3), and if registered, step W10 described above with reference to FIG. 67.
~ Perform the same processing as W13 (YES in step X3)
From the root, steps X10 to X13).

【０３４７】一方、文脈ＰにシンボルＫが登録されてい
ない場合は、同じく図６７のステップＷ４〜Ｗ５と同様
に、符号化部３０６ＢがＥＳＣの符号を出力し（ステッ
プＸ４）、符号変更部３０７Ｂが、符号木保持部３０１
Ｂに保持されている符号木のＥＳＣのリーフをスプレイ
処理する（ステップＸ５）。その後、符号木保持部３０
１Ｂは、文脈ＰのＩＤ（ルートＩＤ）を、そのまま最新
登録ＩＤ保持部７０に出力する。On the other hand, when the symbol K is not registered in the context P, the encoding unit 306B outputs the ESC code (step X4) and the code changing unit 307B, similarly to steps W4 to W5 of FIG. Is the code tree holding unit 301
The leaf of the ESC of the code tree held in B is sprayed (step X5). After that, the code tree holding unit 30
1B outputs the ID (root ID) of context P to the latest registration ID holding unit 70 as it is.

【０３４８】最新登録ＩＤ保持部７０では、この文脈Ｐ
に対応する符号木の最新登録のリーフＸ（ｐ）のリーフ
ＩＤ（ＩＤ−０）を符号木保持部３０１ＢのリーフＩＤ
保持部６４に送り、ＩＤ−０とＩＤ−０に格納してある
シンボルをラッチ６２でラッチする。新規ＩＤ発生部６
１は、２つの新規ＩＤ（ＩＤ−１，ＩＤ−２）を発生
し、親子情報更新部６３が、３つのＩＤ（ＩＤ−０，Ｉ
Ｄ−１，ＩＤ−２）を受けて親子情報を更新することに
よりリーフＸ（ｐ）を分岐し（ステップＸ６）、この情
報を符号木保持部３０１ＢのノードＩＤ保持部６５に登
録する。In the latest registration ID holding unit 70, this context P
The leaf ID (ID-0) of the latest registered leaf X (p) corresponding to the code tree is the leaf ID of the code tree holding unit 301B.
The data is sent to the holding unit 64, and the ID-0 and the symbol stored in the ID-0 are latched by the latch 62. New ID generator 6
1 generates two new IDs (ID-1, ID-2), and the parent-child information updating unit 63 generates three IDs (ID-0, I
D-1 and ID-2) are received to update the parent-child information to branch the leaf X (p) (step X6), and this information is registered in the node ID holding unit 65 of the code tree holding unit 301B.

【０３４９】一方、符号木保持部３０１ＢのリーフＩＤ
保持部６４には、登録シンボルＫと、リーフＸ（ｐ）に
登録してあったシンボルとを新規リーフとしてそれぞれ
登録し（ステップＸ７，Ｘ８）、さらに最新登録ＩＤ保
持部７０には新規ＩＤであるＩＤ−２を登録する。そし
て、文脈Ｐを変更し（ステップＸ９）、文脈Ｐにシンボ
ルＫが登録されていることを検出するまで上述の処理を
繰り返す。On the other hand, the leaf ID of the code tree holding unit 301B
The registered symbol K and the symbol registered in the leaf X (p) are respectively registered in the holding unit 64 as new leaves (steps X7 and X8), and the new registration ID holding unit 70 stores the new ID. Register a certain ID-2. Then, the context P is changed (step X9), and the above processing is repeated until it is detected that the symbol K is registered in the context P.

【０３５０】以上の処理を行なうことにより、新たなシ
ンボルＫが入力されたときは、常に、シンボルＫが入力
される直前に登録された最新の登録リーフを分割してこ
のリーフにシンボルＫを登録する。このように、本発明
の一実施形態の第２の変形例にかかるデータ圧縮装置に
よれば、シンボルの符号木への新規登録を、直前に登録
したシンボルのリーフを分岐してこのリーフに登録する
ことにより、「直前に登録したリーフのシンボル」＝
「比較的符号長の長いシンボル」に近似できることか
ら、本実施形態の第１の変形例にて前述したように最長
の符号長をもつリーフを検出する処理を省略することが
でき、さらにデータ圧縮の処理速度が大幅に向上する効
果がある。By performing the above processing, whenever a new symbol K is input, the latest registered leaf registered immediately before the symbol K is input is divided and the symbol K is registered in this leaf. To do. As described above, according to the data compression apparatus of the second modified example of the embodiment of the present invention, when a symbol is newly registered in the code tree, the leaf of the symbol registered immediately before is branched and registered in this leaf. By doing so, "the symbol of the leaf just registered" =
Since it can be approximated to “a symbol having a relatively long code length”, the process of detecting the leaf having the longest code length can be omitted as described above in the first modification of the present embodiment, and further data compression can be performed. This has the effect of significantly improving the processing speed of.

【０３５１】（２）復元側の説明復元側の構成は、符号化側と同様に、図６２にて前述し
た構成と同様であり、さらに、この復元側における符号
登録部４０７Ｂおよび符号木保持部４０１Ｂは、それぞ
れ符号化側の符号登録部３２２Ｂおよび符号木保持部３
０１Ｂと同様の内部構成をもつものである（図６８参
照）。(2) Description of Decompression Side The configuration of the decompression side is the same as the configuration described above with reference to FIG. 62, as is the case with the coding side. Furthermore, the code registration unit 407B and the code tree holding unit on the decompression side. Reference numeral 401B denotes a code registration section 322B and a code tree holding section 3 on the encoding side.
It has the same internal configuration as 01B (see FIG. 68).

【０３５２】従って、符号登録部４０７Ｂおよび符号木
保持部４０１Ｂが、符号化側の符号登録部３２２Ｂおよ
び符号木保持部３０１Ｂと同様の構成を有しているの
で、本実施形態の第１の変形例における復元側と同様
に、符号登録部４０７Ｂが復号したシンボルの符号を符
号木保持部４０１Ｂに新規に登録する処理は、符号化側
の処理と同様に行なわれる。Therefore, since the code registration unit 407B and the code tree holding unit 401B have the same configurations as the coding side code registration unit 322B and the code tree holding unit 301B, the first modification of the present embodiment. Similar to the restoration side in the example, the process of newly registering the code of the symbol decoded by the code registration unit 407B in the code tree holding unit 401B is performed in the same manner as the process on the coding side.

【０３５３】このため、本実施形態の復元側でも、符号
化側で符号化されたシンボルＫを復号する処理は、図６
５にて前述した処理（ステップＶ１〜Ｖ１１）と同様に
して行なわれる。即ち、図６５中の処理ステップＶ１１
においても、符号化側の登録処理である図６９のステッ
プＸ６〜Ｘ８と同様の処理が行なわれている。For this reason, the process of decoding the symbol K coded on the coding side is also performed by the decoding side of this embodiment as shown in FIG.
5 is performed in the same manner as the processing (steps V1 to V11) described above. That is, the processing step V11 in FIG.
Also in this case, the same processing as steps X6 to X8 of FIG. 69 which is the registration processing on the encoding side is performed.

【０３５４】このように、本発明の一実施形態の第２の
変形例にかかるデータ復元装置によれば、シンボルの符
号木への新規登録を、符号化側と同様に、直前に登録し
たシンボルのリーフを分岐してこのリーフに登録するこ
とにより、「直前に登録したリーフのシンボル」＝「比
較的符号長の長いシンボル」に近似できることから、本
実施形態の第１の変形例にて前述したように最長の符号
長をもつリーフを検出する処理を省略することができ、
さらにデータ復号の処理速度が大幅に向上する効果があ
る。As described above, according to the data decompression apparatus of the second modification of the embodiment of the present invention, the new registration of the symbol in the code tree is performed similarly to the encoding side, and the symbol registered immediately before is newly registered. By branching and registering this leaf into this leaf, it is possible to approximate to “symbol of the leaf registered immediately before” = “symbol with a relatively long code length”. Therefore, in the first modified example of this embodiment, As described above, the process of detecting the leaf with the longest code length can be omitted,
Further, there is an effect that the processing speed of data decoding is significantly improved.

【０３５５】そして、このようにシンボルの符号木への
新規登録の処理を符号化側の登録処理と同一にすること
で、符号化側で符号化されたシンボルの復号処理を正確
に行うことができる効果もある。By thus making the new registration process of the symbol in the code tree the same as the registration process on the encoding side, the decoding process of the symbol encoded on the encoding side can be performed accurately. There is an effect that can be done.

【０３５６】なお、以上の実施形態および各変形例にお
いて、符号化側で述べた方法を、符号化するデータある
いはシステムによって切り替えるために、符号データの
伝送に先立って、ヘッダ部にどの方式を用いているかの
ＩＤ番号を付加し、復元側ではそのＩＤ番号から符号化
側で用いた登録方式を選択するようにしてもよい。In the above-described embodiment and each modified example, in order to switch the method described on the encoding side depending on the data to be encoded or the system, which method is used for the header portion prior to the transmission of the encoded data. Alternatively, the ID number may be added, and the restoration side may select the registration method used on the encoding side from the ID number.

【０３５７】[0357]

【発明の効果】以上詳述したように、請求項１記載の本
発明のデータ圧縮方法によれば、入力データを過去に出
現した履歴に応じて符号化して圧縮するデータ圧縮方法
において、入力データとそれまでに連続したｎ個のデー
タからなる文脈との組み合わせを登録した文脈木を保持
する文脈木保持過程と、文脈毎に独立した符号木を保持
する符号木保持過程と、入力データと文脈との組み合わ
せが文脈木保持過程に保持されていないとき、文脈木保
持過程の文脈木にデータを新規に登録する文脈木新規登
録過程と、入力データと文脈との組み合わせが文脈木保
持過程に保持されていないとき、符号木保持過程の符号
木のデータ格納点としてのリーフを分岐して得た新規リ
ーフにデータを格納する符号木新規登録過程と、入力デ
ータと文脈との組み合わせが文脈木保持過程に保持され
ていないとき文脈を変更する文脈変更過程と符号木の頂
点からの入力データあるいは符号木中の特定コードが登
録してあるリーフまでの分岐に従って符号を出力する符
号出力過程と、入力データあるいは符号木中の特定コー
ドが登録してあるリーフと他のリーフあるいは符号木の
頂点以外の分岐点として定義されるノードとを取り替え
る符号長変更過程とを有し、符号木新規登録過程では、
特定コードを登録してあるリーフを分岐し、得た２つの
新規リーフにの特定コードと新規データとを登録するこ
とを特徴としているので、上述の特定コードを比較的多
く出力するようなデータである場合、または文脈木保持
過程に保持されている文脈の登録が十分でない初期の段
階などにおいて高い符号化率が得られる効果がある。ま
た、上述の符号木新規登録過程の前に符号長変更過程を
行なえば、符号及び特定コードの符号長を最小で２ビッ
ト、符号木新規登録後に符号長変更過程を行なえば、特
定コードの符号を最小で１ビットにすることができるの
で、さらに符号化効率が大幅に向上する効果がある。さ
らに、符号木新規登録では、データの新規登録は常に１
つずつ行なわれるので、常に符号木の高次には再現性を
持ったシンボルのみが登録されることになり、符号木に
登録はしたが実際には使われていないデータが存在する
ために生じる符号化効率の低下を防止でき、これにより
十分にデータの登録がなされた後の符号化効率が大幅に
向上する効果がある。As described above in detail, according to the data compression method of the present invention as set forth in claim 1, in the data compression method for encoding and compressing the input data according to the history that has appeared in the past, the input data And a context tree holding process that holds a context tree in which a combination of contexts consisting of n pieces of continuous data is registered, a code tree holding process that holds an independent code tree for each context, input data and context When the combination of and is not held in the context tree holding process, the data is newly registered in the context tree of the context tree holding process, and the combination of the input data and the context is held in the context tree holding process. If not, a code tree new registration process of storing data in a new leaf obtained by branching a leaf as a data storage point of the code tree in the code tree holding process, and a set of input data and context A code that outputs a code according to a context change process that changes the context when the match is not held in the context tree holding process, and input data from the vertices of the code tree or a branch to a leaf in which a specific code in the code tree is registered An output process and a code length changing process for replacing a leaf in which a specific code in the input data or the code tree is registered with another leaf or a node defined as a branch point other than the vertex of the code tree, In the tree new registration process,
It is characterized in that the leaf in which the specific code is registered is branched, and the specific code and new data for the two new leaves obtained are registered, so that data that outputs a relatively large number of the above specific codes is used. In some cases, or in the initial stage where the context held in the context tree holding process is not sufficiently registered, a high coding rate can be obtained. Further, if the code length changing process is performed before the above-mentioned code tree new registration process, the code length of the code and the specific code is at least 2 bits, and if the code length changing process is performed after the code tree new registration, the code of the specific code is generated. Can be set to a minimum of 1 bit, which has the effect of significantly improving the coding efficiency. Furthermore, in the new registration of the code tree, the new registration of data is always 1
Since it is done one by one, only reproducible symbols are always registered in the higher order of the code tree, and it occurs because there is data that is registered in the code tree but is not actually used. There is an effect that it is possible to prevent a decrease in the coding efficiency, and thereby the coding efficiency after the data is sufficiently registered is significantly improved.

【０３５８】また、請求項２記載の本発明のデータ圧縮
方法によれば、上述の請求項１記載の本発明のデータ圧
縮方法における特定コードを、予め未登録を示すデータ
として定義されるエスケープコードとすることにより、
上述の請求項１記載の本発明のデータ圧縮方法における
効果と同様の効果が得られる。さらに、本発明のデータ
圧縮方法によれば、符号木新規登録過程では、同じ文脈
の下にあるリーフのうち、符号木の頂点として定義され
るルートからの距離が最も長いリーフを分岐し、得た２
つの新規リーフに、分岐したリーフに格納していたデー
タと、新規データとを登録したり（請求項３）、また同
じ文脈の下にあるリーフのうち、最後に登録したリーフ
を分岐し、得た２つの新規リーフに、分岐したリーフに
格納していたデータと、新規データとを登録するように
することもできるので（請求項４）、上述の請求項２記
載の本発明のデータ圧縮方法における効果に加えて、あ
まり使われない出現頻度の最も低いデータの符号長を長
くすることができ、これにより符号長が１ビット伸びた
ことによる符号化効率の低下を最小限に抑えてデータ圧
縮の処理速度を大幅に向上させることができる効果があ
る。また、最後に登録したリーフに格納した新規データ
は、比較的符号長の長いデータとして近似できることか
ら、さらにデータ圧縮の処理速度が大幅に向上する効果
もある。According to the data compression method of the present invention as set forth in claim 2, the specific code in the data compression method of the present invention as set forth in claim 1 is an escape code defined in advance as data indicating unregistered. By
The same effects as the effects of the data compression method of the present invention described in claim 1 are obtained. Further, according to the data compression method of the present invention, among the leaves under the same context, the leaf having the longest distance from the root defined as the vertex of the code tree is branched and obtained in the process of newly registering the code tree. 2
The data stored in the branched leaf and the new data can be registered in one new leaf (claim 3), or the last registered leaf among the leaves under the same context can be branched and obtained. Since the data stored in the branched leaf and the new data can be registered in the two new leaves (claim 4), the data compression method of the present invention according to claim 2 described above. In addition to the effect of the above, it is possible to lengthen the code length of the data which is rarely used and which has the lowest appearance frequency, thereby minimizing the deterioration of the coding efficiency due to the extension of the code length by 1 bit and compressing the data. There is an effect that the processing speed of can be greatly improved. In addition, since the new data stored in the last registered leaf can be approximated as data having a relatively long code length, there is an effect that the processing speed of data compression is significantly improved.

【０３５９】また、請求項５記載の本発明のデータ復元
方法によれば、入力データを過去の入力データの履歴に
応じて符号化した符号を復号するデータ復元方法におい
て、復号したデータと文脈との組み合わせを登録した文
脈木を保持する文脈木保持過程と文脈に応じておのおの
独立した符号木を保持する符号木保持過程と、直前まで
に復号したデータから符号の符号木を決定する符号木決
定過程と、符号に従って符号木の頂点を意味するルート
からデータ格納点としてのリーフへと走査して符号を復
号する復号過程と、到達したリーフが符号木中の特定コ
ードであった場合、文脈を変更する文脈変更過程と復号
したデータ及び特定コードのリーフを他のリーフあるい
は分岐点としてのノードと組み替える符号長変更過程
と、特定コードを復号したとき符号木に復号したデータ
を新規に登録する新規登録過程と新規登録過程で登録し
たデータを文脈木保持過程の文脈木に登録する文脈木登
録過程とを有し、新規登録過程では符号化側で分岐に選
択したリーフと同じリーフを分岐して新規データを登録
することを特徴としているので、符号化側と同様に、デ
ータの新規登録過程において、２つに分割したリーフの
符号長を（分割前のリーフの符号長＋１）ビットにする
ことができるとともに、新規登録したデータの符号及び
特定コードの符号長を最小で２ビットにすることがで
き、これによりエスケープコードを比較的多く復号する
ようなデータである場合に、または辞書登録（符号木へ
のシンボルの登録）が十分でない初期の段階の場合など
において、データの復号効率が大幅に向上する効果があ
る。Further, according to the data restoration method of the present invention as defined in claim 5, in the data restoration method for decoding the code obtained by coding the input data according to the history of the past input data, the decoded data and the context are Context tree holding process that holds the context tree in which the combinations of the two are registered, the code tree holding process that holds each independent code tree according to the context, and the code tree determination that determines the code tree of the code from the data decoded up to immediately before The process, the decoding process of decoding the code by scanning from the root that means the vertices of the code tree to the leaf as the data storage point according to the code, and when the arrived leaf is a specific code in the code tree, the context is The process of changing the context, the process of changing the code length and the process of changing the decoded data and the leaf of the specific code with another leaf or a node as a branch point, and restoring the specific code. Then, there is a new registration process for newly registering the decoded data in the code tree and a context tree registration process for registering the data registered in the new registration process in the context tree of the context tree holding process. It is characterized by branching the same leaf as the branch selected on the side and registering new data. Therefore, in the same way as the encoding side, in the process of new registration of data, the code length of the leaf divided into two is set. The code length of the leaf before division + 1 bit can be set, and the code length of the newly registered data and the code length of the specific code can be set to a minimum of 2 bits, whereby a relatively large number of escape codes can be decoded. The data decoding efficiency is significantly improved when the data is such that it is in the initial stage when the dictionary registration (the registration of symbols in the code tree) is not sufficient. There is an effect to be.

【０３６０】また、データの新規登録過程の前に符号長
変更過程を行なえば、復号したデータ符号及びエスケー
プコードの符号長を最小で２ビット、新規登録過程の後
に符号長変更過程を行なえば、エスケープコードの符号
を最小で１ビットにすることができ、これにより、さら
にデータの復号効率が大幅に向上する効果がある。さら
に、復元側が符号化側と同一の新規登録過程が行なうこ
とができ、これにより符号化側で符号化されたデータを
正確に復号することができる利点がある。If the code length changing process is performed before the new data registration process, the code lengths of the decoded data code and the escape code are at least 2 bits, and if the code length changing process is performed after the new registration process, The code of the escape code can be set to a minimum of 1 bit, which has the effect of significantly improving the data decoding efficiency. Further, the restoration side can perform the same new registration process as the encoding side, and thus there is an advantage that the data encoded by the encoding side can be accurately decoded.

【０３６１】また、請求項６記載の本発明のデータ復元
方法によれば、特定コードを、予め未登録を示すデータ
として定義されるエスケープコードとしているので、上
述の請求項５記載の本発明のデータ復元方法における効
果と同様の効果が得られる。さらに、請求項７記載の本
発明のデータ圧縮装置によれば、入力データを過去に出
現した履歴に応じて符号化するデータ圧縮装置におい
て、予めデータ未登録を示すデータとして定義されるエ
スケープコードを登録した符号木を保持する符号木保持
手段と、入力データと文脈との組み合わせを登録した文
脈木を保持する文脈木保持手段と、エスケープコードを
符号化したのち、文脈木にデータを新規に登録する文脈
登録手段とエスケープコードを符号化したのち符号木の
エスケープコードのデータ格納点としてのリーフを分岐
してデータを新規に登録する符号登録手段と、入力デー
タと文脈との組み合わせが文脈木に保持されていないと
き、文脈を変更する文脈変更手段と符号木の頂点からの
入力データあるいはエスケープコードが登録してあるリ
ーフまでの分岐に従って符号を出力する符号化手段と、
符号化したデータ及びエスケープコードが登録してある
リーフと他のリーフあるいはノードとを取り替える符号
更新手段とをそなえて構成されたことを特徴としている
ので、エスケープコードを比較的多く出力するようなデ
ータである場合、または文脈木保持手段に保持されてい
る文脈の登録が十分でない初期の段階などにおいて高い
符号化率が得られる効果がある。また、符号登録手段に
よる符号の登録の前に符号更新手段による符号更新を行
なえば、符号及び特定コードの符号長を最小で２ビッ
ト、符号登録手段による符号の登録の後に符号更新手段
による符号更新を行なえば、エスケープコードの符号を
最小で１ビットにすることができ、これにより、さらに
符号化効率が大幅に向上する効果がある。さらに、符号
登録手段による符号の新規登録は、常に１つずつ行なわ
れるので、常に符号木の高次には再現性を持ったデータ
のみが登録されることになり、符号木に登録はしたが実
際には使われていないデータが存在するために生じる符
号化効率の低下を防止でき、これにより十分にデータの
登録がなされた後の符号化効率が大幅に向上する効果が
ある。そして、このような効果によりデータ圧縮装置の
性能が飛躍的に向上する効果がある。Further, according to the data restoration method of the present invention as defined in claim 6, the specific code is an escape code defined in advance as data indicating non-registration. The same effect as that of the data restoration method can be obtained. Further, according to the data compression apparatus of the present invention as set forth in claim 7, in the data compression apparatus that encodes the input data according to the history that has appeared in the past, an escape code defined as data indicating data unregistered in advance is used. Code tree holding means for holding the registered code tree, context tree holding means for holding the context tree in which the combination of input data and context is registered, and after the escape code is encoded, the data is newly registered in the context tree. The combination of the input data and the context is stored in the context tree after the context registration means and the escape code have been encoded, and then the leaf as the data storage point of the escape code of the code tree is branched and the data is newly registered. When it is not held, the context changing means to change the context and the input data or escape code from the top of the code tree are registered. Encoding means for outputting a code according to branch to the leaf that,
It is characterized by comprising a code updating means for replacing the leaf in which the encoded data and the escape code are registered with another leaf or node, so that data that outputs a relatively large number of escape codes Or there is an effect that a high coding rate can be obtained in the initial stage where the registration of the context held in the context tree holding means is not sufficient. Further, if the code updating means performs the code updating before the code registering means registers the code, the code length of the code and the specific code is at least 2 bits, and the code updating means performs the code updating after the code registering means registers the code. By doing so, the code of the escape code can be set to a minimum of 1 bit, which has the effect of significantly improving the coding efficiency. Further, since the new code registration is always performed one by one by the code registration means, only the data having reproducibility is registered in the higher order of the code tree at all times. It is possible to prevent a decrease in coding efficiency that occurs due to the presence of data that is not actually used, and this has the effect of significantly improving the coding efficiency after sufficient data registration. Then, such an effect has the effect of dramatically improving the performance of the data compression apparatus.

【０３６２】また、請求項８記載の本発明のデータ圧縮
装置によれば、入力データを過去に出現した履歴に応じ
て符号化するデータ圧縮装置において、予めデータ未登
録を示すデータとして定義されるエスケープコードを登
録した符号木を保持する符号木保持手段と、入力データ
と文脈との組み合わせを登録した文脈木を保持する文脈
木保持手段と、エスケープコードを符号化したのち、文
脈木にデータを新規に登録する文脈登録手段と符号木上
の最長の符号長を持つリーフを検索する分岐位置検索手
段と、エスケープコードを符号化したのち、分岐位置検
索手段に検索されたデータ格納点としてのリーフを分岐
してデータを新規に登録する符号登録手段と、入力デー
タと文脈との組み合わせが文脈木に保持されていないと
き文脈を変更する文脈変更手段と符号木の頂点から入力
データあるいはエスケープコードが登録してあるリーフ
までの分岐に従って符号を出力する符号化手段と、符号
化したデータ及びエスケープコードが登録してあるリー
フと他のリーフあるいはノードとを取り替える符号更新
手段とをそなえて構成されたことを特徴としているの
で、上述の請求項７記載の本発明のデータ圧縮装置にお
ける効果に加えて、あまり使われない出現頻度の最も低
いデータの符号長を長くすることができ、これにより符
号長が１ビット伸びたことによる符号化効率の低下を最
小限に抑えてデータ圧縮の処理速度を大幅に向上させる
ことができる効果があるるとともにデータ圧縮装置の性
能が飛躍的に向上する効果がある。Further, according to the data compression apparatus of the present invention as defined in claim 8, in the data compression apparatus which encodes the input data according to the history that has appeared in the past, it is defined in advance as data indicating unregistered data. A code tree holding means for holding a code tree in which an escape code is registered, a context tree holding means for holding a context tree in which a combination of input data and a context is registered, and after encoding an escape code, data is stored in the context tree. A newly registered context registration means, a branch position search means for searching a leaf having the longest code length on the code tree, and a leaf as a data storage point searched by the branch position search means after encoding an escape code. And a code registration means for branching the data to newly register the data, and changing the context when the combination of the input data and the context is not held in the context tree. Pulse changing means, encoding means for outputting a code in accordance with a branch from the top of the code tree to a leaf in which input data or escape code is registered, a leaf in which encoded data and escape code are registered, and another leaf Alternatively, since it is characterized in that it is configured with a code updating means for replacing a node, in addition to the effect of the data compression apparatus of the present invention according to claim 7 described above, the frequency of occurrence of the least frequently used occurrence is lowest. It is possible to increase the code length of the data, which has the effect that the reduction of the coding efficiency due to the increase of the code length by 1 bit can be minimized and the processing speed of the data compression can be significantly improved. At the same time, there is an effect that the performance of the data compression device is dramatically improved.

【０３６３】さらに、請求項９記載の本発明のデータ圧
縮装置によれば、入力データを過去に出現した履歴に応
じて符号化するデータ圧縮装置において、予めデータ未
登録を示すデータとして定義されるエスケープコードを
登録した符号木を保持する符号木保持手段と、入力デー
タと文脈との組み合わせを登録した文脈木を保持する文
脈木保持手段と、エスケープコードを符号化したのち、
文脈木にデータを新規に登録する文脈登録手段と符号木
に新規に登録されたデータ格納点としてのリーフの位置
を保持する分岐位置保持手段と、エスケープコードを符
号化したのち、分岐位置保持手段に保持されている位置
にあるリーフを分岐してデータを新規に登録する符号登
録手段と、入力データと文脈との組み合わせが文脈木に
保持されていないとき分脈を変更する文脈変更手段と、
符号木の頂点から入力データあるいはエスケープコード
が登録してあるリーフまでの分岐に従って符号を出力す
る符号化手段と、符号化したデータ及びエスケープコー
ドが登録してあるリーフと他のリーフあるいは分岐点と
してのノードとを取り替える符号更新手段とをそなえて
構成されているので、請求項７記載の本発明のデータ圧
縮装置における効果に加えて、最後に登録したリーフに
格納した新規データは、比較的符号長の長いデータとし
て近似できることから、さらにデータ圧縮の処理速度が
大幅に向上するとともにデータ圧縮装置の処理負荷も大
幅に軽減される効果がある。Further, according to the data compression apparatus of the present invention as defined in claim 9, in the data compression apparatus which encodes the input data according to the history that has appeared in the past, it is defined in advance as data indicating data unregistered. A code tree holding means for holding a code tree in which an escape code is registered, a context tree holding means for holding a context tree in which a combination of input data and a context is registered, and after encoding an escape code,
Context registration means for newly registering data in the context tree, branch position holding means for holding the position of a leaf as a data storage point newly registered in the code tree, and branch position holding means after encoding an escape code Code registration means for branching the leaf at the position held in to newly register the data, and context changing means for changing the branch when the combination of the input data and the context is not held in the context tree,
Encoding means for outputting a code in accordance with a branch from a vertex of a code tree to a leaf in which input data or escape code is registered, and a leaf in which encoded data and escape code are registered and another leaf or branch point In addition to the effect of the data compressing apparatus of the present invention according to claim 7, the new data stored in the last registered leaf has a relatively code. Since the data can be approximated as a long data, the processing speed of data compression is significantly improved, and the processing load of the data compression device is significantly reduced.

【０３６４】また、請求項１０記載の本発明のデータ復
元装置によれば、入力データを過去の入力データの履歴
に応じて符号化した符号を復号するデータ復元装置にお
いて、予めデータ未登録を示すデータとして定義される
エスケープコードを登録した符号木を保持する符号木保
持手段と復号したデータと文脈との組み合わせを登録し
た文脈木を保持する文脈保持手段と、直前までに復号し
たデータから符号の符号木を決定する符号木決定手段
と、符号に従って符号木の頂点を意味するルートからデ
ータ格納点としてのリーフへと走査して符号を復号する
復号手段と、到達したリーフがエスケープコードであっ
た場合、文脈を変更する文脈変更手段と復号したデータ
及びエスケープコードのリーフを他のリーフあるいは分
岐点としてのノードと組み替える符号更新手段と、エス
ケープコードを復号したとき、エスケープコードのリー
フを分岐して復号したデータを新規に登録する符号登録
手段と、符号登録手段で登録したデータを文脈保持手段
の文脈木に登録する文脈木登録手段とをそなえて構成さ
れているので、エスケープコードを比較的多く復号する
ようなデータである場合、または文脈木保持手段に保持
されている文脈の登録が十分でない初期の段階などにお
いて高い復号率が得られる効果がある。また、符号登録
手段による符号の登録の前に符号更新手段による符号更
新を行なえば、符号及び特定コードの符号長を最小で２
ビット、符号登録手段による符号の登録の後に符号更新
手段による符号更新を行なえば、エスケープコードの符
号を最小で１ビットにすることができ、これにより、さ
らに復号効率が大幅に向上する効果がある。さらに、符
号登録手段による復号する符号の新規登録は、常に１つ
ずつ行なわれるので、常に符号木の高次には再現性を持
ったデータのみが登録されることになり、符号木に登録
はしたが実際には使われていないデータが存在するため
に生じる復号効率の低下を防止でき、これにより十分に
データの登録がなされた後の復号効率が大幅に向上する
効果がある。そして、このような効果によりデータ復元
装置の性能が飛躍的に向上する効果がある。According to the data restoration apparatus of the present invention as set forth in claim 10, the data restoration apparatus which decodes the code obtained by coding the input data according to the history of the past input data indicates the data unregistered in advance. A code tree holding means for holding a code tree in which an escape code defined as data is held, a context holding means for holding a context tree in which a combination of decoded data and context is registered, and a code from the data decoded up to immediately before The code tree determining means for determining the code tree, the decoding means for decoding the code by scanning from the root that means the vertices of the code tree to the leaf as the data storage point according to the code, and the leaf that arrived was the escape code In this case, the context change means for changing the context and the leaf of the decrypted data and escape code are used as other leaves or nodes as branch points. Code updating means for rearranging, code registration means for newly registering the decoded data by branching the escape code leaf when the escape code is decoded, and data registered by the code registration means are registered in the context tree of the context holding means. Since it is configured with the context tree registration means for performing the above, if the data is such that an escape code is decoded in a relatively large amount, or the context held in the context tree holding means is insufficient in the initial stage, etc. In, there is an effect that a high decoding rate can be obtained. If the code updating means performs the code updating before the code registering means registers the code, the code length of the code and the specific code is at least 2.
If the code is updated by the code updating means after registering the bit and the code by the code registering means, the code of the escape code can be reduced to 1 bit at the minimum, thereby further improving the decoding efficiency. . Further, since the new code to be decoded by the code registration means is always registered one by one, only the data having reproducibility is registered in the higher order of the code tree, and the code tree is not registered. However, it is possible to prevent a decrease in the decoding efficiency caused by the existence of data that is not actually used, and this has the effect of significantly improving the decoding efficiency after the data is sufficiently registered. Then, such an effect has the effect of dramatically improving the performance of the data restoration device.

【０３６５】さらに、請求項１１記載の本発明のデータ
復元装置によれば、入力データを過去の入力データの履
歴に応じて符号化した符号を復号するデータ復元装置に
おいて、予めデータ未登録を示すデータとして定義され
るエスケープコードを登録した符号木を保持する符号木
保持手段と復号したデータと文脈との組み合わせを登録
した文脈木を保持する文脈保持手段と、直前までに復号
したデータから符号の符号木を決定する符号木決定手段
と、符号に従って符号木の頂点を意味するルートからデ
ータ格納点としてのリーフへと走査して符号を復号する
復号手段と、到達したリーフがエスケープコードであっ
た場合、文脈を変更する文脈変更手段と復号したデータ
及びエスケープコードのリーフを他のリーフあるいは分
岐点としてのノードと組み替える符号更新手段と、符号
木内の最長の符号長を持つリーフの位置を検索する分岐
位置検索手段と、エスケープコードを符号化したのち分
岐位置検索手段で検索されたリーフを分岐してデータを
新規に登録する符号登録手段と、符号登録手段で登録し
たデータを文脈保持手段の文脈木に登録する文脈木登録
手段とをそなえて構成されているので、請求項１０記載
の本発明のデータ復元装置における効果に加えて、あま
り使われない出現頻度の最も低いデータの符号長を長く
することができ、これにより符号長が１ビット伸びるこ
とによる符号化効率の低下を最小限に抑えてデータ復元
の処理速度を大幅に向上させることができる効果があ
り、さらにデータ復元装置の性能が飛躍的に向上する効
果もある。Further, according to the data restoration apparatus of the present invention as set forth in claim 11, in the data restoration apparatus which decodes the code obtained by coding the input data in accordance with the history of the past input data, the data non-registration is shown in advance. A code tree holding means for holding a code tree in which an escape code defined as data is held, a context holding means for holding a context tree in which a combination of decoded data and context is registered, and a code from the data decoded up to immediately before The code tree determining means for determining the code tree, the decoding means for decoding the code by scanning from the root that means the vertices of the code tree to the leaf as the data storage point according to the code, and the leaf that arrived was the escape code In this case, the context changing means for changing the context and the leaf of the decrypted data and escape code are used as other leaf or branch points. Code updating means for recombination with, branch position searching means for searching the position of the leaf having the longest code length in the code tree, and encoding the escape code and then branching the leaves searched by the branch position searching means to obtain data. The data restoration of the present invention according to claim 10, which comprises a code registration means for newly registering and a context tree registration means for registering the data registered by the code registration means in the context tree of the context holding means. In addition to the effect in the device, the code length of the data that is rarely used and has the lowest appearance frequency can be lengthened, thereby minimizing the decrease in coding efficiency due to the extension of the code length by 1 bit and restoring the data. There is an effect that the processing speed of (1) can be significantly improved, and further, there is an effect that the performance of the data restoration device is dramatically improved.

【０３６６】また、請求項１２記載の本発明のデータ復
元装置によれば、入力データを過去の入力データの履歴
に応じて符号化した符号を復号するデータ復元装置にお
いて、予めデータ未登録を示すデータとして定義される
エスケープコードを登録した符号木を保持する符号木保
持手段と復号したデータと文脈との組み合わせを登録し
た文脈木を保持する文脈保持手段と、直前までに復号し
たデータから符号の符号木を決定する符号木決定手段
と、符号に従って符号木の頂点を意味するルートからデ
ータ格納点としてのリーフへと走査して符号を復号する
復号手段と、到達したリーフがエスケープコードであっ
た場合、文脈を変更する文脈変更手段と復号したデータ
及びエスケープコードのリーフを他のリーフあるいは分
岐点としてのノードと組み替える符号更新手段と、符号
木に新規に登録されたリーフの位置を保持する分岐位置
保持手段と、エスケープコードを符号化したのち、分岐
位置保持手段に保持されている位置にあるリーフを分岐
してデータを新規に登録する符号登録手段と、符号登録
手段で登録したデータを文脈保持手段の文脈木に登録す
る文脈木登録手段とをそなえて構成されているので、上
述の請求項３４記載の本発明のデータ復元装置による効
果に加えて、データの新規登録を最後に登録したデータ
が格納されているリーフに行なうことができ、この最後
に登録したリーフに格納されているデータは比較的符号
長の長いデータとして近似できることから、符号長が１
ビット伸びることによる復号化効率の低下を最小限に抑
えてデータ復元の処理速度を大幅に向上させることがで
きる効果があり、さらにデータ復元装置の性能が飛躍的
に向上する効果もある。Further, according to the data restoration apparatus of the present invention as set forth in claim 12, the data restoration apparatus which decodes the code obtained by coding the input data according to the history of the past input data indicates the data unregistered in advance. A code tree holding means for holding a code tree in which an escape code defined as data is held, a context holding means for holding a context tree in which a combination of decoded data and context is registered, and a code from the data decoded up to immediately before The code tree determining means for determining the code tree, the decoding means for decoding the code by scanning from the root that means the vertices of the code tree to the leaf as the data storage point according to the code, and the leaf that arrived was the escape code In this case, the context change means for changing the context and the leaf of the decrypted data and escape code are used as other leaves or nodes as branch points. The code updating means for rearrangement, the branch position holding means for holding the position of the leaf newly registered in the code tree, the escape code are encoded, and then the leaf at the position held by the branch position holding means is branched. 35. The code registration means for newly registering data by means of the code registration means, and the context tree registration means for registering the data registered by the code registration means in the context tree of the context holding means. In addition to the effect of the data restoration device of the present invention, new registration of data can be performed in the leaf in which the last registered data is stored, and the data stored in this last registered leaf is relatively coded. Since it can be approximated as long data, the code length is 1
There is an effect that the reduction of the decoding efficiency due to the bit expansion can be suppressed to the minimum and the processing speed of the data recovery can be significantly improved, and further the performance of the data recovery device can be dramatically improved.

[Brief description of drawings]

【図１】本発明に関連する技術を説明するためのブロッ
ク図である。FIG. 1 is a block diagram illustrating a technique related to the present invention.

【図２】本発明に関連する技術を説明するためのブロッ
ク図である。FIG. 2 is a block diagram illustrating a technique related to the present invention.

【図３】本発明に関連する技術を説明するためのブロッ
ク図である。FIG. 3 is a block diagram illustrating a technique related to the present invention.

【図４】本発明に関連する技術を説明するためのブロッ
ク図である。FIG. 4 is a block diagram illustrating a technique related to the present invention.

【図５】本発明に関連する技術を説明するためのブロッ
ク図である。FIG. 5 is a block diagram illustrating a technique related to the present invention.

【図６】本発明に関連する技術を説明するためのブロッ
ク図である。FIG. 6 is a block diagram illustrating a technique related to the present invention.

【図７】本発明に関連する技術を説明するためのブロッ
ク図である。FIG. 7 is a block diagram illustrating a technique related to the present invention.

【図８】本発明に関連する技術を説明するためのブロッ
ク図である。FIG. 8 is a block diagram illustrating a technique related to the present invention.

【図９】本発明の原理ブロック図である。FIG. 9 is a principle block diagram of the present invention.

【図１０】本発明の原理ブロック図である。FIG. 10 is a principle block diagram of the present invention.

【図１１】本発明の原理ブロック図である。FIG. 11 is a principle block diagram of the present invention.

【図１２】本発明の原理ブロック図である。FIG. 12 is a principle block diagram of the present invention.

【図１３】本発明の原理ブロック図である。FIG. 13 is a principle block diagram of the present invention.

【図１４】本発明の原理ブロック図である。FIG. 14 is a principle block diagram of the present invention.

【図１５】本発明に関連する技術１としてのデータ圧縮
装置及びデータ復元装置の構成を示すブロック図であ
る。FIG. 15 is a block diagram showing a configuration of a data compression apparatus and a data decompression apparatus as technology 1 related to the present invention.

【図１６】（ａ）は文脈木の格納形式の一例を示す図で
ある。（ｂ）は辞書の親子関係を示す図である。FIG. 16A is a diagram showing an example of a storage format of a context tree. (B) is a figure which shows the parent-child relationship of a dictionary.

【図１７】符号木の初期状態を説明するための図であ
る。FIG. 17 is a diagram for explaining an initial state of a code tree.

【図１８】符号木を格納する配列の一例を示す図であ
る。FIG. 18 is a diagram showing an example of an array that stores a code tree.

【図１９】（ａ），（ｂ）はそれぞれスプレイ符号の符
号更新の基本操作およびスプレイ符号の符号更新の一例
を説明するための図である。19A and 19B are diagrams for explaining an example of a basic operation for updating the code of the splay code and an example of updating the code of the splay code, respectively.

【図２０】関連技術１にかかる符号化の手順を説明する
ためのフローチャートである。FIG. 20 is a flowchart for explaining a coding procedure according to Related Technique 1.

【図２１】（ａ），（ｂ）は、関連技術１にかかる文脈
木と符号木の更新手順を説明するための図である。21A and 21B are diagrams for explaining a procedure for updating a context tree and a code tree according to Related Technique 1.

【図２２】（ａ），（ｂ）は、関連技術１にかかる文脈
木と符号木の更新手順を説明するための図である。22 (a) and 22 (b) are diagrams for explaining a procedure for updating a context tree and a code tree according to Related Technique 1.

【図２３】（ａ），（ｂ）は、関連技術１にかかる文脈
木と符号木の更新手順を説明するための図である。23 (a) and 23 (b) are diagrams for explaining a procedure for updating a context tree and a code tree according to Related Technique 1. FIG.

【図２４】（ａ），（ｂ）は、関連技術１にかかる文脈
木と符号木の更新手順を説明するための図である。24 (a) and 24 (b) are diagrams for explaining a procedure for updating a context tree and a code tree according to Related Technique 1.

【図２５】（ａ），（ｂ）は、関連技術１にかかる文脈
木と符号木の更新手順を説明するための図である。25 (a) and 25 (b) are diagrams for explaining a procedure for updating a context tree and a code tree according to Related Technique 1.

【図２６】（ａ），（ｂ）は、関連技術１にかかる文脈
木と符号木の更新手順を説明するための図である。26 (a) and 26 (b) are diagrams for explaining a procedure for updating a context tree and a code tree according to Related Technique 1.

【図２７】（ａ），（ｂ）は、関連技術１にかかる文脈
木と符号木の更新手順を説明するための図である。27A and 27B are diagrams for explaining a procedure for updating a context tree and a code tree according to Related Technique 1.

【図２８】（ａ），（ｂ）は、関連技術１にかかる文脈
木と符号木の更新手順を説明するための図である。28 (a) and 28 (b) are diagrams for explaining a procedure for updating a context tree and a code tree according to Related Technique 1.

【図２９】関連技術１にかかる文字列の符号化後の例を
説明するための図である。[Fig. 29] Fig. 29 is a diagram for describing an example after encoding of a character string according to Related Technique 1.

【図３０】（ａ），（ｂ）は文脈木の作成手順のアルゴ
リズムを示す図である。30 (a) and 30 (b) are diagrams showing an algorithm of a procedure for creating a context tree.

【図３１】関連技術１にかかる復号化の手順を説明する
ためのフローチャートである。FIG. 31 is a flowchart for explaining a decoding procedure according to Related Technique 1.

【図３２】本発明に関連する技術２にかかるデータ圧縮
装置およびデータ復元装置の構成を示すブロック図であ
る。FIG. 32 is a block diagram showing a configuration of a data compression device and a data decompression device according to Technology 2 related to the present invention.

【図３３】関連技術２にかかるデータ圧縮装置の構成を
示すブロック図である。FIG. 33 is a block diagram showing a configuration of a data compression device according to Related Technique 2.

【図３４】関連技術２にかかる符号化部の構成を示すブ
ロック図である。FIG. 34 is a block diagram showing a configuration of an encoding unit according to Related Technique 2.

【図３５】関連技術２にかかる符号化の手順を説明する
ためのフローチャートである。[Fig. 35] Fig. 35 is a flowchart for describing an encoding procedure according to Related Technique 2.

【図３６】関連技術２にかかる符号の出力手順を説明す
るためのフローチャートである。FIG. 36 is a flowchart for explaining a code output procedure according to Related Technique 2;

【図３７】関連技術２にかかる符号木の組み替え手順を
説明するためのフローチャートである。FIG. 37 is a flowchart for explaining a code tree rearrangement procedure according to Related Technique 2;

【図３８】関連技術２にかかるデータ復元装置の構成を
説明するためのブロック図である。FIG. 38 is a block diagram for explaining the configuration of a data restoration device according to Related Technique 2;

【図３９】関連技術２にかかる復号部の構成を説明する
ためのブロック図である。FIG. 39 is a block diagram for explaining a configuration of a decoding unit according to Related Technique 2.

【図４０】関連技術２にかかる復号化の手順を説明する
ためのフローチャートである。[Fig. 40] Fig. 40 is a flowchart for describing a decoding procedure according to Related Technique 2.

【図４１】関連技術２にかかる復号化の手順を説明する
ためのフローチャートである。FIG. 41 is a flowchart for explaining a decoding procedure according to Related Technique 2.

【図４２】関連技術２の第１の変形例にかかるデータ圧
縮装置の構成を説明するためのブロック図である。FIG. 42 is a block diagram for explaining a configuration of a data compression device according to a first modification of Related Technique 2.

【図４３】関連技術２の第１の変形例にかかる符号化の
手順を説明するためのフローチャートである。[Fig. 43] Fig. 43 is a flowchart for describing an encoding procedure according to a first modified example of related art 2.

【図４４】関連技術２の第１の変形例にかかるデータ復
元装置の構成を説明するためのブロック図である。FIG. 44 is a block diagram for explaining a configuration of a data restoration device according to a first modification of related art 2.

【図４５】関連技術２の第１の変形例にかかる復号化の
手順を説明するためのフローチャートである。[Fig. 45] Fig. 45 is a flowchart for describing a decoding procedure according to a first modified example of related art 2.

【図４６】関連技術２の第２の変形例にかかるデータ圧
縮装置の構成を説明するためのブロック図である。FIG. 46 is a block diagram for explaining a configuration of a data compression device according to a second modification of related art 2.

【図４７】関連技術２の第２の変形例にかかる符号化の
手順を説明するためのフローチャートである。[Fig. 47] Fig. 47 is a flowchart for describing an encoding procedure according to a second modified example of related art 2.

【図４８】関連技術２の第２の変形例にかかるデータ復
元装置の構成を説明するためのブロック図である。FIG. 48 is a block diagram for explaining a configuration of a data restoration device according to a second modified example of the related art 2.

【図４９】関連技術２の第２の変形例にかかる復号化の
手順を説明するためのフローチャートである。[Fig. 49] Fig. 49 is a flowchart for describing a decoding procedure according to a second modified example of the related art 2.

【図５０】関連技術２の第３の変形例にかかる符号化の
手順を説明するためのフローチャートである。[Fig. 50] Fig. 50 is a flowchart for describing an encoding procedure according to a third modified example of related art 2.

【図５１】関連技術２の第３の変形例にかかる復号化の
手順を説明するためのフローチャートである。[Fig. 51] Fig. 51 is a flowchart for describing a decoding procedure according to a third modified example of related art 2.

【図５２】本発明の一実施形態にかかるデータ圧縮装置
の構成を説明するためのブロック図である。FIG. 52 is a block diagram illustrating a configuration of a data compression device according to an embodiment of the present invention.

【図５３】本実施形態にかかる符号登録部及び符号木保
持部の構成を説明するためのブロック図である。[Fig. 53] Fig. 53 is a block diagram for describing configurations of a code registration unit and a code tree holding unit according to the present embodiment.

【図５４】本実施形態にかかるデータ圧縮装置の動作を
説明するための図である。FIG. 54 is a diagram for explaining the operation of the data compression apparatus according to the present embodiment.

【図５５】本実施形態にかかるデータ圧縮装置の動作を
説明するための図である。FIG. 55 is a diagram for explaining the operation of the data compression apparatus according to the present embodiment.

【図５６】本実施形態にかかるデータ圧縮装置の動作を
説明するための図である。FIG. 56 is a diagram for explaining the operation of the data compression apparatus according to the present embodiment.

【図５７】（ａ），（ｂ）はそれぞれ符号木および文脈
木の一例を示す図である。57 (a) and (b) are diagrams showing an example of a code tree and a context tree, respectively.

【図５８】本実施形態にかかる符号化の手順を説明する
ためのフローチャートである。[Fig. 58] Fig. 58 is a flowchart for describing an encoding procedure according to the present embodiment.

【図５９】本実施形態にかかる符号化の手順を説明する
ためのフローチャートである。[Fig. 59] Fig. 59 is a flowchart for explaining a coding procedure according to the present embodiment.

【図６０】（ａ），（ｂ）は本実施形態にかかる符号の
新規登録の状態を示す図である。FIGS. 60 (a) and 60 (b) are diagrams showing a state of newly registering a code according to the present embodiment.

【図６１】本実施形態にかかる符号化の他の手順を説明
するためのフローチャートである。FIG. 61 is a flowchart for explaining another encoding procedure according to the present embodiment.

【図６２】本実施形態にかかるデータ復元装置の構成を
説明するためのブロック図である。FIG. 62 is a block diagram for explaining the configuration of the data restoration device according to the present embodiment.

【図６３】本実施形態にかかるデータ復元装置の動作を
説明するための図である。FIG. 63 is a diagram for explaining the operation of the data restoration device according to the present embodiment.

【図６４】本実施形態にかかるデータ復元装置の動作を
説明するための図である。FIG. 64 is a diagram for explaining the operation of the data restoration device according to the present embodiment.

【図６５】本実施形態にかかる復号化の手順を説明する
ためのフローチャートである。FIG. 65 is a flowchart for explaining a decoding procedure according to the present embodiment.

【図６６】本実施形態の第１の変形例にかかる符号登録
部及び符号木保持部の構成を説明するためのブロック図
である。[Fig. 66] Fig. 66 is a block diagram for describing configurations of a code registration unit and a code tree holding unit according to a first modification of the present embodiment.

【図６７】本実施形態の第１の変形例にかかる符号化の
手順を説明するためのフローチャートである。[Fig. 67] Fig. 67 is a flowchart for explaining an encoding procedure according to the first modification of the present embodiment.

【図６８】本実施形態の第２の変形例にかかる符号登録
部及び符号木保持部の構成例を説明するためのブロック
図である。[Fig. 68] Fig. 68 is a block diagram for describing a configuration example of a code registration unit and a code tree holding unit according to a second modification of the present embodiment.

【図６９】本実施形態の第２の変形例にかかる符号化の
手順を説明するためのフローチャートである。[Fig. 69] Fig. 69 is a flowchart for describing an encoding procedure according to a second modification of the present embodiment.

【図７０】（ａ），（ｂ）は多値算術符号化の原理を説
明するための図である。FIGS. 70 (a) and 70 (b) are diagrams for explaining the principle of multilevel arithmetic encoding.

【図７１】（ａ），（ｂ）は従来の文字単位に圧縮する
多値算術符号化の手順を示すフローチャートである。71 (a) and 71 (b) are flowcharts showing a procedure of conventional multi-valued arithmetic coding for compressing in character units.

【図７２】多値算術符号化のアルゴリズムの一例を示す
図である。[Fig. 72] Fig. 72 is a diagram illustrating an example of an algorithm for multilevel arithmetic encoding.

【図７３】（ａ），（ｂ）はスプレイ符号化の原理を説
明するための図である。73 (a) and 73 (b) are diagrams for explaining the principle of spray encoding.

【図７４】確率統計型符号化の原理を説明するための図
である。[Fig. 74] Fig. 74 is a diagram for describing the principle of probability statistical coding.

【図７５】（ａ），（ｂ）は文脈の木の登録例を示す図
である。75 (a) and (b) are diagrams showing an example of registration of a context tree.

[Explanation of symbols]

１，３データ圧縮装置２，４データ復元装置１１，２２文脈収集過程１２，２１スプレイ符号化過程４１上位ノード判別部４２ノード番号管理部（メモリ）４３位置判別部４４，４８ラッチ４５スタック４６下位ノード判別部４７葉／節判別部６１新規ノードＩＤ発生部６２ラッチ６３親子情報更新部６４外部節点（リーフＩＤ）保持部６５内部節点（ノードＩＤ）保持部６６ＥＳＣ−ＩＤ保持部６７符号木管理部６８外部節点／ＥＳＣ−ＩＤ（リーフＩＤ）保持部６９最長符号検出部（分岐位置検索手段）７０最新登録ＩＤ保持部（分岐位置保持手段）１００，２００前置データ保持手段１００Ａ−１〜１００Ａ−ｎ，２００Ａ−１〜２００Ａ
−ｎ前置データ保持部１０１，２０１履歴保持手段１０１Ａ，２０１Ａ文脈履歴保持部１０２，２０２，１０７，２０７，３０１，４０１符
号木保持手段１０２Ａ，２０２Ａ，１０７Ａ，２０７Ａ，３０１Ｂ，
４０１Ｂ符号木保持部１０３，２０３，４０３符号木決定手段１０３Ａ，２０３Ａ符号木決定部１０４符号出力手段１０４Ａ，１０４Ａ′，３０６Ｂ符号化部１０５，２０５符号長変更手段１０５Ａ，２０５Ａ符号木更新部１０６，２０６前置データ更新手段１０６Ａ，２０６Ａ文脈更新部１０８文脈判別手段１０８Ａ文脈判別部１０９エスケープコード出力手段１１０，２０８，３０５，４０５文脈変更手段１１０Ａ，２１０Ａ，３０５Ｂ，４０３Ｂ文脈変更部１１１符号出力手段１１２，２０９履歴登録手段１１３，１１４，１１５，２１２，３０４，３０９，３
１１，４０７，４１０，４１２符号登録手段１１２Ａ，２１２Ａ，３２２Ｂ，４０７Ｂ符号登録部１１６，１１７，２１３制御手段２０４，４０４復号手段２０４Ａ，２０４Ａ′，４０４Ｂ復号部３０３文脈登録手段３０３Ｂ，４０８Ｂ文脈登録部３０２，４０２文脈木保持手段３０２Ｂ，４０２Ｂ文脈木保持部３０６符号化手段３０７，４０６符号更新手段４０６Ｂ符号変更部３０８，４０９分岐位置保持手段３１０，４１１分岐位置検索手段３２１Ｂ，４２１Ｂ文脈保持部４０８，４１３，４１４文脈木登録手段４０９Ｂラッチ５１１文脈収集５１２動的可変長符号化1, 3 Data compression device 2, 4 Data decompression device 11, 22 Context collection process 12, 21 Spray coding process 41 Upper node discriminating unit 42 Node number management unit (memory) 43 Position discriminating unit 44, 48 Latch 45 Stack 46 Lower Node discriminating unit 47 Leaf / node discriminating unit 61 New node ID generating unit 62 Latch 63 Parent-child information updating unit 64 External node (leaf ID) holding unit 65 Internal node (node ID) holding unit 66 ESC-ID holding unit 67 Code tree management Unit 68 External node / ESC-ID (leaf ID) storage unit 69 Longest code detection unit (branch position search unit) 70 Latest registered ID storage unit (branch position storage unit) 100, 200 Prefix data storage unit 100A-1 to 100A -N, 200A-1 to 200A
-N prefix data holding unit 101, 201 history holding unit 101A, 201A context history holding unit 102, 202, 107, 207, 301, 401 code tree holding unit 102A, 202A, 107A, 207A, 301B,
401B code tree holding unit 103, 203, 403 code tree determining unit 103A, 203A code tree determining unit 104 code output unit 104A, 104A ', 306B coding unit 105, 205 code length changing unit 105A, 205A code tree updating unit 106, 206 Prefix Data Updating Means 106A, 206A Context Updating Unit 108 Context Discriminating Unit 108A Context Discriminating Unit 109 Escape Code Outputting Units 110, 208, 305, 405 Context Changing Units 110A, 210A, 305B, 403B Context Changing Unit 111 Code Outputting Unit 112 , 209 History registration means 113, 114, 115, 212, 304, 309, 3
11, 407, 410, 412 Code registration means 112A, 212A, 322B, 407B Code registration section 116, 117, 213 Control means 204, 404 Decoding means 204A, 204A ', 404B Decoding section 303 Context registration means 303B, 408B Context registration section 302, 402 Context tree holding means 302B, 402B Context tree holding section 306 Encoding means 307, 406 Code updating means 406B Code changing sections 308, 409 Branch position holding means 310, 411 Branch position searching means 321B, 421B Context holding section 408, 413, 414 Context tree registration means 409B Latch 511 Context collection 512 Dynamic variable length coding

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開平５−218881（ＪＰ，Ａ) 特開平６−149537（ＪＰ，Ａ) 特開平６−315089（ＪＰ，Ａ) 特開平７−336237（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) H03M 7/40 G06F 5/00 ─────────────────────────────────────────────────── ─── Continuation of the front page (56) Reference JP-A-5-218881 (JP, A) JP-A-6-149537 (JP, A) JP-A-6-315089 (JP, A) JP-A-7- 336237 (JP, A) (58) Fields investigated (Int.Cl. ⁷ , DB name) H03M 7/40 G06F 5/00

Claims

(57) [Claims]

1. A data compression method for encoding and compressing input data according to a history that has appeared in the past, wherein a context tree in which a combination of the input data and a context consisting of n continuous data is registered is created. The context tree holding process to hold, the code tree holding process to hold an independent code tree for each context, and the combination of the input data and the context is not held in the context tree holding process, the context tree When the context tree new registration process of newly registering the above data in the context tree of the holding process and the combination of the above input data and context is not held in the context tree holding process, the code tree of the above code tree holding process The code tree new registration process of storing the above data in the new leaf obtained by branching the leaf as the data storage point of A code that outputs a code according to a context changing process that changes the context when not held in the holding process, and a branch from the vertex of the code tree to the leaf in which the input data or the specific code in the code tree is registered It has an output process and a code length changing process for replacing a leaf in which a specific code in the input data or the code tree is registered with another leaf or a node defined as a branch point other than the vertex of the code tree. In the code tree new registration process, a leaf in which the specific code is registered is branched, and the specific code and new data are registered in the obtained two new leaves.

2. A data compression method for encoding and compressing input data according to a history that has appeared in the past, and a code tree holding process for holding a code tree in which an escape code defined as data indicating unregistered is registered in advance. And a context tree holding process for holding a context tree in which a combination of input data and a context made up of consecutive n data is registered, and a combination of the above input data and context becomes the above context tree holding process. When not held, when the context tree new registration process of newly registering the above data in the context tree of the above context tree holding process and the combination of the above input data and context is not held in the above context tree holding process A code tree new registration process of storing the data in a new leaf obtained by branching a leaf as a data storage point of the code tree in the code tree holding process, From the vertex of the code tree to the leaf in which the input data or escape code is registered, when the combination of the input data and the context of is not held in the context tree holding process, the context changing process. A code output process for outputting a code according to a branch, and a code length changing process for replacing a leaf in which the above input data or escape code is registered with another leaf or a node defined as a branch point other than the vertex of the above code tree In the code tree new registration process, the leaf in which the escape code is registered is branched, and the escape code and the new data are registered in the obtained two new leaves. Data compression method.

3. In the code tree new registration process, among leaves under the same context, a leaf having the longest distance from a root defined as a vertex of the code tree is branched to obtain 2
The data compression method according to claim 2, wherein the data stored in the branched leaf and the new data are registered in one new leaf.

4. In the code tree new registration process, among the leaves under the same context, the leaf registered last is branched, and the two new leaves obtained are stored with the data stored in the branched leaf. 3. The data compression method according to claim 2, wherein the new data is registered.

5. A data restoration method for decoding a code obtained by coding input data according to the history of past input data, and a context tree holding process for holding a context tree in which a combination of decoded data and context is registered. It means a code tree holding process for holding each independent code tree according to the context, a code tree determining process for determining the code tree of the above code from the data decoded up to immediately before, and a vertex of the above code tree according to the above code. The decoding process of scanning the code from the root to the leaf as the data storage point and decoding the code, and if the leaf that arrived is a specific code in the code tree,
The context changing process of changing the context, the decoded data and the code length changing process of recombining the leaf of the specific code with another leaf or a node as a branch point, and the data decoded into the code tree when the specific code is decoded There is a new registration process of newly registering, and a context tree registration process of registering the data registered in the new registration process in the context tree of the context tree holding process. In the new registration process, the encoding side selects a branch. A data restoration method, characterized by branching the same leaf as the created leaf and registering new data.

6. A data restoration method for decoding a code obtained by encoding input data according to a history of past input data, and holding a code tree in which an escape code defined as data indicating unregistered data is registered in advance. A code tree holding process, a context tree holding process for holding a context tree in which a combination of decoded data and a context is registered, a code tree determining process for determining a code tree of the above code from the data decoded up to immediately before, According to the code, the decoding process of scanning the path from the root meaning the vertex of the code tree to the leaf as the data storage point and decoding the code, and when the reached leaf is the escape code,
The context changing process of changing the context, the decoded data and the code length changing process of recombining the leaf of the escape code with another leaf or a node as a branch point, and the data decoded into the code tree when the escape code is decoded There is a new registration process of newly registering, and a context tree registration process of registering the data registered in the new registration process in the context tree of the context tree holding process. In the new registration process, the encoding side selects a branch. A data restoration method, characterized by branching the same leaf as the created leaf and registering new data.

7. A data compression apparatus for encoding input data in accordance with a history of past appearances, and a code tree holding means for holding a code tree in which an escape code defined as data indicating unregistered data is registered in advance. , A context tree holding means for holding a context tree in which a combination of input data and a context is registered, a context registration means for newly registering the data in the context tree after encoding the escape code, and the escape code Code is stored in the context tree, and a code registration means for branching a leaf as a data storage point of the escape code of the code tree and newly registering the data is stored in the context tree. If there is not, register the above input data or escape code from the top of the code tree and the context changing means to change the context. It comprises a coding means for outputting a code according to a branch to a certain leaf, and a code updating means for replacing a leaf in which the coded data and the escape code are registered with another leaf or node. A data compression device characterized.

8. A data compression device for encoding input data according to a history of past appearances, and a code tree holding means for holding a code tree in which an escape code defined as data indicating unregistered data is registered in advance. , Context tree holding means for holding a context tree in which a combination of input data and context is registered; context registration means for newly registering the above data in the context tree after encoding the escape code; The branch position searching means for searching the leaf having the longest code length and the escape code are encoded, and the leaf as the data storage point searched by the branch position searching means is branched and the above data is newly added. Code registration means for registering, context changing means for changing the context when the combination of the input data and the context is not held in the context tree, Encoding means for outputting a code in accordance with a branch from a vertex of the code tree to input data or a leaf in which the escape code is registered, a leaf in which the encoded data and the escape code are registered, and another leaf or node And a code updating means for replacing the above.

9. A data compression device for encoding input data according to a history of past appearances, and a code tree holding means for holding a code tree in which an escape code defined as data indicating unregistered data is registered in advance. , Context tree holding means for holding a context tree in which a combination of input data and context is registered, context registration means for newly registering the above data in the context tree after encoding the above escape code, and in the above code tree Branch position holding means for holding the position of a leaf as a newly registered data storage point, and after encoding the escape code, branching the leaf at the position held by the branch position holding means Code registration means for newly registering data, and changing the branch when the combination of the above input data and context is not held in the context tree Context changing means, encoding means for outputting a code according to a branch from the top of the code tree to the input data or the leaf in which the escape code is registered, and the encoded data and the leaf in which the escape code is registered A data compression apparatus, comprising a code updating means for replacing another leaf or a node as a branch point.

10. A data restoration device that decodes a code obtained by encoding input data according to a history of past input data, and holds a code tree in which an escape code defined as data indicating unregistered data is registered in advance. Code tree holding means, context tree holding means for holding a context tree in which a combination of decoded data and context is registered, code tree determining means for determining the code tree of the above code from the data decoded up to immediately before, Decoding means for decoding the code by scanning from the root meaning the apex of the code tree to the leaf as the data storage point according to the code, and when the arrived leaf is the escape code,
Context changing means for changing the context, code updating means for recombining the decoded data and the leaf of the escape code with another leaf or a node as a branch point, and when the escape code is decoded, the leaf of the escape code is branched Code registration means for newly registering the decoded data and a context tree registration means for registering the data registered by the code registration means in the context tree of the context tree holding means. , Data recovery device.

11. A data restoration apparatus for decoding a code obtained by encoding input data according to a history of past input data, and holding a code tree in which an escape code defined as data indicating unregistered data is registered in advance. Code tree holding means, context tree holding means for holding a context tree in which a combination of decoded data and context is registered, code tree determining means for determining the code tree of the above code from the data decoded up to immediately before, Decoding means for decoding the code by scanning from the root meaning the apex of the code tree to the leaf as the data storage point according to the code, and when the arrived leaf is the escape code,
Context changing means for changing the context, code updating means for recombining the decoded data and the leaf of the escape code with another leaf or a node as a branch point, and searching the position of the leaf with the longest code length in the code tree Branch position searching means for coding the escape code, branching the leaf searched by the branch position searching means and newly registering the data, and data registered by the code registering means. A data restoration device comprising a context tree registration means for registering in a context tree of a context tree holding means.

12. A data restoration device for decoding a code obtained by coding input data according to a history of past input data, and holding a code tree in which an escape code defined as data indicating unregistered data is registered in advance. Code tree holding means, context tree holding means for holding a context tree in which a combination of decoded data and context is registered, code tree determining means for determining the code tree of the above code from the data decoded up to immediately before, Decoding means for decoding the code by scanning from the root meaning the apex of the code tree to the leaf as the data storage point according to the code, and when the arrived leaf is the escape code,
Context changing means for changing the context, code updating means for changing the decoded data and the leaf of the escape code with another leaf or a node as a branch point, and holding the position of the leaf newly registered in the code tree A branch position holding means, a code registration means for branching a leaf at a position held by the branch position holding means after coding the escape code, and newly registering the data; and the code registration means. A data restoration device comprising: a context tree registration means for registering registered data in a context tree of a context tree holding means.