JP2769057B2

JP2769057B2 - Data compression device

Info

Publication number: JP2769057B2
Application number: JP3254540A
Authority: JP
Inventors: 正敬細野; 隆太郎田村
Original assignee: Alps Electric Co Ltd
Current assignee: Alps Alpine Co Ltd
Priority date: 1991-09-06
Filing date: 1991-09-06
Publication date: 1998-06-25
Anticipated expiration: 2013-06-25
Also published as: JPH0566919A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明はキャラクタデータを圧縮
する装置に関し、特に、日本語の文字データを圧縮する
装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an apparatus for compressing character data, and more particularly to an apparatus for compressing Japanese character data.

【０００２】[0002]

【従来の技術】ＪＩＳコード体系やシフトＪＩＳ体系な
どにより、日本語の文字データは２バイトコードで表現
されているが、これらの２バイトコードを記録、伝送す
る際には、データ量を極力少なくすることがメモリ効率
上及びデータ伝送効率上望ましい。このため、漢字、ひ
らがな、片かな、記号の各文字種毎に制御コードを用意
し、同一文字種が連続するテキストデータの場合には、
同一文字種の先頭の文字データに制御コードを付加し、
この制御コードにより連続する同一文字種の第１バイト
を省略して、データ量を減少させるデータ圧縮装置が用
いられることが多い。2. Description of the Related Art According to the JIS code system and the shift JIS system, Japanese character data is represented by two-byte codes. When recording and transmitting these two-byte codes, the data amount is minimized. It is desirable in terms of memory efficiency and data transmission efficiency. For this reason, a control code is prepared for each character type of kanji, hiragana, katakana, and symbol, and in the case of text data of the same character type,
A control code is added to the first character data of the same character type,
In many cases, a data compression device that reduces the amount of data by omitting consecutive first bytes of the same character type using this control code is often used.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、日本語
の文章はひらがな，漢字等の文字種が混在する率が高
く、同一文字種の連続性が乏しいので、従来装置では文
字種が代わるたびに異なる制御コードを付加する必要が
あり、このため、データの圧縮効率は低下する。However, Japanese sentences have a high rate of mixture of character types such as hiragana and kanji, and the continuity of the same character type is poor. Therefore, in the conventional apparatus, a different control code is used each time the character type is changed. It is necessary to add data, and the data compression efficiency is reduced.

【０００４】本発明は上記従来の事情に鑑みてなされた
ものであり、その目的は、文字種の混在度が高く、同一
文字種の連続性が乏しい日本語の文章であっても、高い
データ圧縮効率を維持できるデータ圧縮装置を提供する
ことにある。SUMMARY OF THE INVENTION The present invention has been made in view of the above-mentioned circumstances, and has as its object to provide a high data compression efficiency even for Japanese sentences having a high degree of mixture of character types and poor continuity of the same character type. It is an object of the present invention to provide a data compression device capable of maintaining the data compression.

【０００５】[0005]

【課題を解決するための手段】上記目的を達成するため
に，請求項１記載のデータ圧縮装置は図１のように構成
されている。すなわち，順次入力されるひらがなまたは
記号を示す２バイトデータの第１バイトを圧縮モードが
切り替わるまで省略し第２バイトを数値変換して送出
し，漢字を示す２バイトデータをそのまま送出する第１
圧縮モード１０と，順次入力される片かなまたは記号を
示す２バイトデータの第１バイトを圧縮モードが切り替
わるまで省略し第２バイトを数値変換して送出し，漢字
を示す２バイトデータをそのまま送出する第２圧縮モー
ド１２と，漢字を示す２バイトデータまたは１バイトデ
ータが入力されたときに２バイトデータまたは１バイト
データのまま送出する第３圧縮モード１４と，を備えた
圧縮手段１６と，ひらがなまたは記号を示す２バイトデ
ータが入力されたときに第１制御コードを送出し，片か
なまたは記号を示す２バイトデータが入力されたときに
第２制御コードを送出し，１バイトデータが入力された
ときに第３制御コードを送出して圧縮モードを切り替え
る切替手段１８とを有して構成される。In order to achieve the above object, a data compression apparatus according to the present invention is configured as shown in FIG. That is, the first byte of 2-byte data indicating a hiragana or a symbol that is sequentially input is omitted until the compression mode is switched, and the second byte is converted to a numerical value and transmitted.
And the two-byte data indicating the kanji is sent as it is.
In the compression mode 10, the first byte of 2-byte data indicating a sequentially input kana or symbol is omitted until the compression mode is switched , the second byte is converted to a numerical value and transmitted , and the 2-byte data representing a kanji is transmitted as it is. A compression means 16 having a second compression mode 12 for performing a second compression mode and a third compression mode 14 for transmitting two-byte data or one-byte data as kanji when the two-byte data or one-byte data is input. A first control code is transmitted when 2-byte data indicating a hiragana or a symbol is input, and a second control code is transmitted when 2-byte data indicating a hiragana or a symbol is input, and 1-byte data is input. And a switching means 18 for transmitting the third control code when the operation is performed to switch the compression mode.

【０００６】[0006]

【作用】本発明では、記号を示す２バイトデータを複数
の圧縮モードで圧縮することができるので、ひらがなま
たは片かなの圧縮モードのときに、記号が入力されても
制御コードが送出されることがない。また、漢字を示す
２バイトデータも圧縮モードに関係なくそのまま送出さ
れるので、漢字が入力されたことに対する制御コードを
省略できる。According to the present invention, two-byte data representing a symbol can be compressed in a plurality of compression modes. Therefore, in the hiragana or katakana compression mode, a control code is transmitted even if a symbol is input. There is no. Further, since the 2-byte data indicating the kanji is transmitted as it is regardless of the compression mode, the control code for the input of the kanji can be omitted.

【０００７】[0007]

【実施例】以下、図に基づいて本発明にかかる装置の好
適な実施例について説明する。図２にはデータ圧縮装置
２０が示されており、データ圧縮装置２０には入力部２
２、モード切替部２４、圧縮部２６、出力部３４が設け
られている。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS A preferred embodiment of the apparatus according to the present invention will be described below with reference to the drawings. FIG. 2 shows a data compression device 20.
2, a mode switching unit 24, a compression unit 26, and an output unit 34 are provided.

【０００８】圧縮部２６には、順次入力されるひらがな
または記号を示す２バイトデータの第１バイトを圧縮モ
ードが切り替わるまで省略して送出し、漢字を示す２バ
イトデータをそのまま送出する第１圧縮モード２８と、
順次入力される片かなまたは記号を示す２バイトデータ
の第１バイトを圧縮モードが切り替わるまで省略して送
出し、漢字を示す２バイトデータをそのまま送出する第
２圧縮モード３０と、漢字を示す２バイトデータまたは
１バイトデータが入力されたときに２バイトデータまた
は１バイトデータのまま送出する第３圧縮モード３２と
が備えらている。[0008] The first compression unit 26 sends the first byte of the two-byte data indicating the hiragana or the symbol, which is sequentially input, until the compression mode is switched, and sends the two-byte data indicating the kanji as it is. Mode 28,
A second compression mode 30 in which the first byte of two-byte data indicating a sequentially input one-byte or symbol is omitted until the compression mode is switched, and a two-byte data indicating a kanji is directly transmitted; A third compression mode 32 is provided in which, when byte data or 1-byte data is inputted, 2-byte data or 1-byte data is transmitted as it is.

【０００９】そして、これらの第１圧縮モード２８、第
２圧縮モード３０、第３圧縮モード３２は、ひらがな，
漢字等の文字種毎がコード領域に区分されている入力デ
ータに応じてモード切替部２４により切り替えられる。The first compression mode 28, the second compression mode 30, and the third compression mode 32 correspond to hiragana,
Each mode is switched by the mode switching unit 24 in accordance with input data that is divided into code areas for each character type such as kanji.

【００１０】例えば、図３で示される日本語漢字シフト
ＪＩＳのコード体系のように、ひらがな（８２９Ｆ〜８
２Ｆ１）、片かな（８３４１〜８３９３）、記号（８１
４０〜８１７Ｆ）毎にコード領域が各々設定されてお
り、これらの設定領域に基づいて、ひらがなまたは記号
を示す２バイトデータが入力されたときに第１制御コ−
ド０Ｆ（１６進）が、片かなまたは記号を示す２バイト
データが入力されたときに第２制御コ−ド１３（１６
進）が、１バイトデータが入力されたときは第３制御コ
−ドがモード切替部２４から圧縮部２６へ送出され、圧
縮部２６ではこれらの制御データによりに第１圧縮モー
ド２８、第２圧縮モード３０、第３圧縮モード３２が切
り替えられる。。For example, as shown in the Japanese Kanji Shift JIS code system shown in FIG. 3, hiragana (829F-8
2F1), one-sided kana (8341-8393), symbol (81
40 to 817F), a code area is set for each of them, and based on these set areas, the first control code is input when 2-byte data indicating a hiragana or a symbol is input.
When 2-byte data indicating a one-sided kana or a symbol is input to the second control code 13 (16 hexadecimal),
When one byte data is input, a third control code is sent from the mode switching unit 24 to the compression unit 26, and the compression unit 26 uses the control data to perform the first compression mode 28 and the second compression mode. The compression mode 30 and the third compression mode 32 are switched. .

【００１１】図４ではデータ圧縮装置２０のデータ圧縮
作用が説明されており、入力部２２を介してモード切替
部２４に入力データの先頭の１バイトが与えられ（ステ
ップ４０１）、この１バイトが日本語シフトＪＩＳコー
ド体系で設定されている２バイト表現データの第１バイ
トを示すデータか否かが判断される（ステップ４０
３）。FIG. 4 illustrates the data compression operation of the data compression device 20. The first byte of input data is given to the mode switching unit 24 via the input unit 22 (step 401), and this one byte is It is determined whether or not the data indicates the first byte of the 2-byte representation data set in the Japanese shift JIS code system (step 40).
3).

【００１２】入力された１バイトデータが日本語シフト
ＪＩＳコード体系で設定された２バイトデータの第１バ
イトを示す場合（ステップ４０３でＹＥＳ），さらにも
う１バイト読み込まれる（ステップ４０５）。そして，
読み込まれた２バイトはひらがなを示すデータであるか
否かが設定されているコード領域に従ってモード切替部
２４で判断される（ステップ４０７）。ひらがなを示す
データであった場合で（ステップ４０７でＹＥＳ），現
在の圧縮モードが第１圧縮モード２８以外のときには，
モード切替部２４から第１制御コード０Ｆ（１６進）が
圧縮部２６に送出され，圧縮部２６の圧縮モードが第１
圧縮モード２８に切り替えられる。さらに，圧縮部２６
ではひらがなを示す入力データの第１バイトが省略さ
れ，第１制御コード０Ｆ（１６進）と第２バイト値−７
Ｅ（１６進）のデータ（第２バイト値から７Ｅ（１６
進）を減算した値）とが出力される（ステップ４０
９）。また，現在の圧縮モードが第１圧縮モード２８の
ときには，圧縮部２６では入力データの第１バイトが省
略され，第２バイト値−７Ｅ（１６進）のデータのみが
出力される（ステップ４０９）。この第２バイトの値を
演算することにより，９Ｆ（１６進）〜Ｆ１（１６進）
の値のものが，２１（１６進）〜７３（１６進）に変換
される。これらの変換後の値は漢字を示す２バイトデー
タの第１バイトとは異なるため，第１圧縮モード２８の
場合でもひらがなと漢字とは区別が可能となる。 If the input 1-byte data indicates the first byte of 2-byte data set in the Japanese shift JIS code system (YES in step 403), another 1-byte is read (step 405). And
The mode switching unit 24 determines whether the read two bytes are data indicating hiragana or not according to the set code area (step 407). If the data indicates hiragana (YES in step 407) and the current compression mode is other than the first compression mode 28,
The first control code 0F (hexadecimal) is transmitted from the mode switching unit 24 to the compression unit 26, and the compression mode of the compression unit 26 is set to the first mode.
The mode is switched to the compression mode 28. Further, the compression unit 26
The first byte of the input data indicating Hiragana is omitted, and the first control code 0F (hexadecimal) and the second byte value -7 are omitted.
E (hexadecimal) data (from the second byte value to 7E (16
) Is output (step 40).
9). Further, the current compression mode when <br/> of the first compression mode 28 is omitted first byte of the input data in the compression section 26, only the data of the second byte value -7E (16 hex) is It is output (step 409). The value of this second byte
By calculation, 9F (hexadecimal) to F1 (hexadecimal)
Is converted to 21 (hexadecimal) to 73 (hexadecimal)
Is done. These converted values are 2-byte data indicating Kanji.
Since it is different from the first byte of the data,
Even in this case, it is possible to distinguish between hiragana and kanji.

【００１３】読み込まれた２バイトがひらがなではない
場合（ステップ４０７でＮＯ），そのデータが片かなを
示すデータであるか否かが設定されているコード領域に
従ってモード切替部２４で判断される（ステップ４１
１）。そして，片かなを示すデータであった場合で（ス
テップ４１１でＹＥＳ），現在の圧縮モードが第２圧縮
モード３０以外のときには，モード切替部２４から第２
制御コード１３（１６進）が圧縮部２６に送出され，圧
縮部２６の圧縮モードが切り替えられる。さらに，圧縮
部２６で入力データの第１バイトが省略され，第２制御
コード１３（１６進）と第２バイト値−２０（１６進）
のデータ（第２バイト値から２０（１６進）を減算した
値）とが出力される（ステップ４１３）。また，現在の
圧縮モードが第２圧縮モード３０のときには，圧縮部２
６で入力データの第１バイトが省略され，第２バイト値
−２０（１６進）のみが出力される（ステップ４１
３）。この第２バイトの値を演算することにより，４１
（１６進）〜９３（１６進）の値のものが，２１（１６
進）〜７３（１６進）に変換され，ひらがなにおける第
２バイト値の変換後のものと共有される。しかしながら
それぞれは第１圧縮モード２８ならひらがな，第２圧縮
モード３０なら片かなと明確に区別される。さらに，第
１圧縮モード２８と同様に第２圧縮モード３０の場合で
も片かなと漢字とは区別が可能である。 If the two bytes read are not hiragana (NO in step 407), the mode switching unit 24 determines whether or not the data is data indicating one-sided kana according to the set code area ( Step 41
1). If the data indicates one-sided kana (YES in step 411) and the current compression mode is other than the second compression mode 30, the mode switching unit 24
The control code 13 (hexadecimal) is sent to the compression unit 26, and the compression mode of the compression unit 26 is switched. Further, the first byte of the input data is omitted in the compression unit 26, and the second control code 13 (hexadecimal) and the second byte value -20 (hexadecimal)
Data (20 (hexadecimal) is subtracted from the second byte value)
Values) are output (step 413). Also, when the compression mode the current is in the second compression mode 30, the compression section 2
In step 6, the first byte of the input data is omitted, and only the second byte value -20 (hexadecimal) is output (step 41).
3). By calculating the value of the second byte, 41
(Hexadecimal) to 93 (hexadecimal)
Hex) to 73 (hexadecimal).
Shared with the two byte value after conversion. However
Hiragana for the first compression mode 28, second compression
In mode 30, it is clearly distinguished as a one-piece. In addition,
In the case of the second compression mode 30 as in the case of the first compression mode 28,
It is possible to distinguish Mochikana from Kanji.

【００１４】入力された２バイトがひらがな及び片かな
ではない場合（ステップ４０７でＮＯ，ステップ４１１
でＮＯ），その入力データは記号を示すデータであるか
否かが判断され（ステップ４１５），記号を示すデータ
である場合で（ステップ４１５でＹＥＳ），第１圧縮モ
ード２８または第２圧縮モード３０のときには，その第
１バイトが省略されて第２バイト値＋６０（１６進）の
データ（第２バイト値に６０（１６進）を加算した値）
のみが出力される（ステップ４１７）。現在の圧縮モー
ドが第１圧縮モード２８及び第２圧縮モード３０以外の
ときには，モード切替部２４から第１制御コード０Ｆ
（１６進）が送出されて圧縮部２６の圧縮モードが第１
圧縮モード２８に切り替えられ，第１制御コード０Ｆ
（１６進）と第２バイト値＋６０（１６進）のデータが
出力される（ステップ４１７）。この第２バイトの値を
演算することにより，４０（１６進）〜７Ｆ（１６進）
の値のものが，Ａ０（１６進）〜ＤＦ（１６進）に変換
される。この記号を示すデータは第１圧縮モード２８あ
るいはまた第２圧縮モード３０として扱いながらも，第
２バイト値の変換後の値はひらがな及び片かなの第２バ
イト値の変換後の値とはオーバーラップしないため，ひ
らがな及び片かなとは明確に区別される。さらには，こ
れらの変換後の値は漢字を示す２バイトデータの第１バ
イトとは異なるため，記号と漢字とは区別が可能とな
る。 If the input 2 bytes are not hiragana or katakana (NO in step 407, step 411)
NO), it is determined whether or not the input data is data indicating a symbol (step 415). If the input data is data indicating a symbol (YES in step 415), the first compression mode 28 or the second compression mode is determined. In the case of 30, the first byte is omitted and the second byte value + 60 (hexadecimal)
Data (value obtained by adding 60 (hexadecimal) to the second byte value)
Is output (step 417). When the current compression mode is other than the first compression mode 28 and the second compression mode 30, the first control code 0F
(Hexadecimal) is transmitted and the compression mode of the compression unit 26 is set to the first
Switching to the compression mode 28, the first control code 0F
(Hexadecimal) and the data of the second byte value +60 (hexadecimal) are output (step 417). The value of this second byte
By calculation, 40 (hex) to 7F (hex)
Is converted from A0 (hexadecimal) to DF (hexadecimal)
Is done. The data indicating this symbol is in the first compression mode 28
Or, while treating as the second compression mode 30,
The converted value of the 2-byte value is the second byte of Hiragana and Katakana.
Since the value does not overlap with the converted value,
It is clearly distinguished from kana and kana. Furthermore, this
These converted values are the first byte of 2-byte data indicating Kanji.
It is possible to distinguish between symbols and kanji
You.

【００１５】入力された２バイトがひらがな、片かな及
び記号ではない場合（ステップ４０７でＮＯ、ステップ
４１１でＮＯ、ステップ４１５でＮＯ）、その入力デー
タは他の２バイト系の文字（例えば漢字）なので、現在
の圧縮モードと関係なく２バイトがそのまま出力される
（ステップ４１９）。なお、現在の圧縮モードが第１圧
縮モード２８及び第２圧縮モード３０のときには、モー
ド切替部２４から第３制御コ−ド０Ｅ（１６進）が送出
されて圧縮部２６の圧縮モードが第３圧縮モードに切り
替えられる（ステップ４１９）。If the input 2 bytes are not hiragana, katakana or symbol (NO in step 407, NO in step 411, NO in step 415), the input data is another 2-byte character (for example, kanji). Therefore, 2 bytes are output as they are regardless of the current compression mode (step 419). When the current compression mode is the first compression mode 28 or the second compression mode 30, a third control code 0E (hexadecimal) is transmitted from the mode switching unit 24, and the compression mode of the compression unit 26 is set to the third compression mode. The mode is switched to the compression mode (step 419).

【００１６】入力部２２を介してモード切替部２４に与
えられた１バイトが（ステップ４０１）、日本語シフト
ＪＩＳコード体系で設定されている１バイト系のデータ
であると判断されると（ステップ４０３でＮＯ）、その
１バイトが第１制御コード０Ｆ（１６進）または第２制
御コード１３（１６進）と同一であるか否かが判断され
る（ステップ４２１）。このコードが第１制御コード０
Ｆ（１６進）または第２制御コード１３（１６進）の場
合で（ステップ４２１でＹＥＳ）、現在の圧縮モードが
第１圧縮モード２８及び第２圧縮モード３０のときに
は、モード切替部２４から第３制御コ−ド０Ｅ（１６
進）が圧縮部２６に出力され、第３圧縮モード３２に切
り替えられる。そして、同じ第３制御コード０Ｅ（１６
進）が連続して２バイト分出力される（ステップ４２
６）。これにより、圧縮モード切替データとしての第１
制御コ−ド０Ｆ（１６進）及び第２制御コ−ド１３（１
６進）と通常の１バイト文字データとしての０Ｆ（１６
進）及び１３（１６進）とが区別される。When one byte provided to the mode switching unit 24 via the input unit 22 (step 401) is determined to be 1-byte data set in the Japanese shift JIS code system (step 401). (NO in 403), it is determined whether or not the one byte is the same as the first control code 0F (hexadecimal) or the second control code 13 (hexadecimal) (step 421). This code is the first control code 0
In the case of F (hexadecimal) or the second control code 13 (hexadecimal) (YES in step 421), and when the current compression mode is the first compression mode 28 and the second compression mode 30, the mode switching unit 24 3 control code 0E (16
Hex) is output to the compression unit 26, and the mode is switched to the third compression mode 32. Then, the same third control code 0E (16
Hex) are continuously output for 2 bytes (step 42).
6). Thus, the first mode as the compression mode switching data
The control code 0F (hexadecimal) and the second control code 13 (1
Hexadecimal) and 0F (16
Hex) and 13 (hexadecimal).

【００１７】また、与えられた１バイトが第１制御コー
ド０Ｆ（１６進）または第２制御コード１３（１６進）
と同一でない場合（ステップ４２１でＮＯ）、他の１バ
イト系の制御コ−ドであるか否かが判断され（ステップ
４２５）、他の制御コ−ド２０（１６進）以下であると
判断されると（ステップ４２５でＹＥＳ）、データ圧縮
されないので、その１バイトはそのまま出力される（ス
テップ４１１）。The given one byte is the first control code 0F (hexadecimal) or the second control code 13 (hexadecimal).
If it is not the same as the above (NO in step 421), it is determined whether or not it is another one-byte control code (step 425), and it is determined that it is equal to or less than another control code 20 (hexadecimal). If this is done (YES in step 425), since the data is not compressed, the one byte is output as it is (step 411).

【００１８】さらに、与えらえた１バイトが第１制御コ
ード０Ｆ（１６進）または第２制御コード１３（１６
進）と同一でない場合で（ステップ４２１でＮＯ）、他
の１バイト系の制御コードでないとき（ステップ４２５
でＮＯ）、データは一般の１バイト文字なので、第１圧
縮モード２８または第２圧縮モード３０であれば第３制
御コ−ド０Ｅ（１６進）がモード切替部２４から圧縮部
２６に与えられ、第３圧縮モード３２に切り替えられる
と共に、与えられた１バイトはそのまま出力される（ス
テップ４２９）。Further, the given one byte is the first control code 0F (hexadecimal) or the second control code 13 (16
Hex) (NO in step 421) and when it is not another one-byte control code (step 425)
, NO), since the data is a general one-byte character, if the first compression mode 28 or the second compression mode 30, the third control code 0E (hexadecimal) is given from the mode switching unit 24 to the compression unit 26. Is switched to the third compression mode 32, and the given one byte is output as it is (step 429).

【００１９】なお、これらのデータ圧縮は入力データの
先頭の１バイトが受け取られなくなるまで、繰り返し行
われる。また、ここでは、与えられた第２バイトが各圧
縮モードで、ひらがな等のデータをを２１（１６進）か
ら７３（１６進）までに整えており、これにより、他の
コード設定領域をより有効に利用することが可能とな
る。These data compressions are repeated until the first byte of the input data is no longer received. Also, here, the given second byte arranges data such as hiragana from 21 (hexadecimal) to 73 (hexadecimal) in each compression mode, thereby making other code setting areas more compact. It can be used effectively.

【００２０】図５では実施例による日本語文章のデータ
圧縮例が示されており、第５図（Ａ）の『この「アルゴ
リズム」は日本語テキストに対し、かなりのデータ圧縮
を行う。』という全体で７２バイトある文章が、図６
（Ｂ）のように５０バイトまで圧縮される。FIG. 5 shows an example of data compression of a Japanese sentence according to the embodiment. “This“ algorithm ”in FIG. 5A performs considerable data compression on Japanese text. The sentence with a total of 72 bytes is shown in FIG.
It is compressed to 50 bytes as shown in FIG.

【００２１】図６はデータ圧縮装置２０（図２参照）で
圧縮されたデータを伸長復元するデータ伸長装置４０の
概略構成図である。図６において、データ伸長装置４０
は入力部４２、モード切替部４４、伸長部４６、出力部
６４を備えており、伸長部４６はデータ圧縮装置２０の
第１圧縮モード２８に対応した第１伸長モード４８、第
２圧縮モード３０に対応した第２伸長モード５０、第３
圧縮モード３２に対応した第３伸張モード５２を選択実
行できるものとなっている。FIG. 6 is a schematic block diagram of a data decompression device 40 for decompressing and decompressing data compressed by the data compression device 20 (see FIG. 2). In FIG. 6, the data decompression device 40
Has an input section 42, a mode switching section 44, an expansion section 46, and an output section 64. The expansion section 46 includes a first expansion mode 48 and a second compression mode 30 corresponding to the first compression mode 28 of the data compression device 20. Second decompression mode 50 corresponding to
The third decompression mode 52 corresponding to the compression mode 32 can be selectively executed.

【００２２】図７ではデータ伸長装置４０によるデータ
伸長作用が説明されており、与えられた圧縮データのう
ち先頭の１バイトがモード切替部４４で受け取られ（ス
テップ７０１）、この１バイトが第１制御コ−ド０Ｆ
（１６進）であるか否かが判断される（ステップ７０
３）。そして、この１バイトが第１制御コ−ド０Ｆ（１
６進）であると判断されると（ステップ７０３でＹＥ
Ｓ）、現在の伸長モードが第１伸長モード４８以外のモ
ードの場合、第１伸長モード４８に切替え、第１伸長モ
ード４８である場合にはモード切替部４４は第１バイト
を第１制御コ−ド０Ｆ（１６進）として復元部４６に出
力し、第３伸長モード５２に切り替える（ステップ７０
５）。FIG. 7 illustrates the data decompression operation of the data decompression device 40. The first byte of the given compressed data is received by the mode switching unit 44 (step 701), and this one byte is the first byte. Control code 0F
(Hexadecimal) is determined (step 70).
3). This one byte is the first control code 0F (1
(Hexadecimal) (YE in step 703).
S) If the current decompression mode is a mode other than the first decompression mode 48, the mode is switched to the first decompression mode 48. If the current decompression mode is the first decompression mode 48, the mode switching unit 44 sets the first byte to the first control command. −0F (hexadecimal) to the restoring unit 46 and switch to the third decompression mode 52 (step 70).
5).

【００２３】また、入力データの先頭１バイトが第１制
御コ−ド０Ｆ（１６進）でないと判断されると（ステッ
プ７０３でＮＯ）、その１バイトが第２制御コ−ド１３
（１６進）であるか否かが判断される（ステップ７０
７）。そして、第２制御コ−ド１３（１６進）であると
判断されると（ステップ７０７でＹＥＳ）、現在の伸長
モードが第２伸長モード５０以外のモードの場合、第２
伸長モード５０に切替えられ、第２伸長モード５０であ
る場合には、モード切替部４４はその第１バイトを第２
制御コ−ド１３（１６進）として伸長部４６に出力し、
第３伸長モード５２にモードに切り替える（ステップ７
０９）。If it is determined that the first byte of the input data is not the first control code 0F (hexadecimal) (NO in step 703), that one byte is used as the second control code 13.
(Hexadecimal) is determined (step 70).
7). If it is determined that the second control code is 13 (hexadecimal) (YES in step 707), and if the current decompression mode is a mode other than the second decompression mode 50, the second
When the mode is switched to the decompression mode 50 and the mode is the second decompression mode 50, the mode switching unit 44 stores the first byte in the second byte.
The control code 13 (hexadecimal) is output to the expansion unit 46,
The mode is switched to the third decompression mode 52 (step 7).
09).

【００２４】また、１バイトが第２制御コ−ド１３（１
６進）でないと判断されると（ステップ７０７でＮ
Ｏ）、その１バイトが第３制御コ−ド０Ｅ（１６進）で
あるか否かが判断される（ステップ７０７）。この１バ
イトが第３制御コ−ド０Ｅ（１６進）であると判断され
ると（ステップ７１１でＹＥＳ）、現在の伸長モードが
第３伸長モード５２以外の場合、第３伸長モード５２に
切替えられ、第３伸長モード５２である場合には第１バ
イトを復元部４６に出力される（ステップ７１３）。One byte is the second control code 13 (1
When it is determined that the value is not hexadecimal (N in step 707)
O) It is determined whether or not the one byte is the third control code 0E (hexadecimal) (step 707). If it is determined that this one byte is the third control code 0E (hexadecimal) (YES in step 711), if the current decompression mode is other than the third decompression mode 52, the mode is switched to the third decompression mode 52. In the third decompression mode 52, the first byte is output to the restoration unit 46 (step 713).

【００２５】入力された１バイトが第１制御コード０Ｆ
（１６進），第２制御コード１３（１６進），第３制御
コード０Ｅ（１６進）のいづれでもない場合（ステップ
７１１でＮＯ），現在の伸長モードが第１伸長モード４
８であるか否かが判断され（ステップ７１５），第１伸
長モード４８である場合で（ステップ７１５でＹＥ
Ｓ），第１バイトがひらがなのときには，日本語シフト
ＪＩＳコード体系のひらがな第１バイトに相当する８２
（１６進）と第２バイトとして第１バイト値＋７Ｅ（１
６進）を出力し，第１バイトが記号のときには，日本語
シフトＪＩＳコード体系の記号第１バイトに相当する８
１（１６進）と第２バイトとして第１バイト値−６０
（１６進）を出力し，第１バイトがひらがな及び記号以
外のコードに相当するときには，そのまま第１バイトが
出力される（ステップ７１７）。The input 1 byte is the first control code 0F.
(Hexadecimal), the second control code 13 (hexadecimal), when the third control code 0E (hexadecimal) also Izure of not Na (NO at step 711), the current extension mode first extending mode 4
8 is determined (step 715), and in the case of the first decompression mode 48 (YE in step 715).
S) When the first byte is Hiragana, it corresponds to the Hiragana first byte of the Japanese shift JIS code system.
(Hexadecimal) and the first byte value + 7E (1
Hexadecimal), and when the first byte is a symbol, 8 equivalent to the first byte of the symbol in the Japanese shift JIS code system
1 (hexadecimal) and the first byte value -60 as the second byte
(Hexadecimal) is output, and if the first byte corresponds to a code other than hiragana and a symbol, the first byte is output as it is (step 717).

【００２６】また，現在の伸長モードが第１伸長モード
４８でない場合（ステップ７１５でＮＯ），現在の伸長
モードが第２伸長モード５０であるか否かが判断される
（ステップ７１９）。そして，第２伸長モード５０であ
る場合で（ステップ７１９でＹＥＳ），第１バイトが片
かなのときには，日本語シフトＪＩＳコード体系の片か
な第１バイトに相当する８３（１６進）と第２バイトと
して第１バイト値＋２０（１６進）を出力し，第１バイ
トが記号のときには，日本語シフトＪＩＳコード体系の
記号第１バイトである８１（１６進）と第２バイトとし
て第１バイト値−６０（１６進）を出力し，第１バイト
が片かな及び記号以外のときには，そのまま第１バイト
が出力される（ステップ７２１）。If the current decompression mode is not the first decompression mode 48 (NO in step 715), it is determined whether the current decompression mode is the second decompression mode 50 (step 719). If the first decompression mode is the second decompression mode 50 (YES in step 719) and the first byte is one-sided, 83 (hexadecimal) corresponding to the one-sided first byte of the Japanese shift JIS code system and the second Bytes and
And outputs the first byte value +20 (hexadecimal). If the first byte is a symbol, the first byte is 81 (hexadecimal), which is the first byte of the Japanese Shift JIS code system, and the second byte.
To output the first byte value -60 (hexadecimal), and when the first byte is other than a one-byte kana or symbol, the first byte is output as it is (step 721).

【００２７】現在の伸長モードが第１伸長モード４８及
び第２伸長モード５０でない場合（ステップ７１９でＮ
Ｏ）、そのまま第１バイトが出力される（ステップ７２
３）。このように、前述したデータ圧縮と逆の作用によ
りデータの伸長を行うことができる。なお、データ圧縮
装置２０及びデータ復元装置４０では日本語シフトＪＩ
Ｓコード体系に対応したデータの圧縮伸長について説明
したが、他のコード体系でもこのデータ圧縮伸長は可能
である。また、前述した制御コ−ドも適宜変更すること
ができる。If the current decompression mode is not the first decompression mode 48 or the second decompression mode 50 (N in step 719)
O), the first byte is output as it is (step 72)
3). In this manner, data can be expanded by the reverse operation of the above-described data compression. In the data compression device 20 and the data decompression device 40, the Japanese shift JI
Although the data compression and decompression corresponding to the S code system has been described, the data compression and decompression can be performed with other code systems. Further, the above-described control codes can be appropriately changed.

【００２８】以上説明したように本実施例によれば、記
号を示す２バイトデータを第１圧縮モード２８と第２圧
縮モード３０の圧縮モードで圧縮することができるの
で、日本語に多用される句読点等のデータが入力されて
も圧縮モードを切り替える必要がない。したがって、記
号が入力されるたびにそれらに対する特定の制御コード
を送出することがないので、データの圧縮効率を高く維
持することができる。また、漢字を示す２バイトコード
は圧縮モードに関係なくそのまま送出されるので、漢字
とひらがなまたは片かなが混在する文章であっても、漢
字に対する特定の制御コードを送出することがない。こ
のため、ひらなが，漢字等の文字種の混在度が高く、同
一文字種の連続性が乏しい日本語の文章であっても、デ
ータの圧縮効率を高く維持できる。As described above, according to the present embodiment, two-byte data representing a symbol can be compressed in the first compression mode 28 and the second compression mode 30. Therefore, it is frequently used in Japanese. There is no need to switch the compression mode even if data such as punctuation is input. Therefore, a specific control code for symbols is not sent every time a symbol is input, so that high data compression efficiency can be maintained. Also, since the two-byte code indicating the kanji is sent as it is regardless of the compression mode, even if the sentence is a mixture of kanji and hiragana or katakana, no specific control code for the kanji is sent. For this reason, even in the case of Hiragana, a Japanese sentence with a high degree of mixture of character types such as kanji and poor continuity of the same character type, high data compression efficiency can be maintained.

【００２９】[0029]

【発明の効果】以上説明したように本発明によれば、記
号を示す２バイトコードを複数の圧縮モードで圧縮する
ことができ、漢字を示す２バイトコードはそのまま送出
されるので、漢字または記号が入力されるたびにそれら
に対する特定の制御コードを送出することがない。した
がって、ひらなが，漢字等の文字種の混在度が高く、同
一文字種の連続性が乏しい日本語の文章であっても、高
いデータ圧縮効率を維持できる。As described above, according to the present invention, a two-byte code representing a symbol can be compressed in a plurality of compression modes, and a two-byte code representing a kanji is transmitted as it is. Does not send out a specific control code for them each time they are entered. Accordingly, high data compression efficiency can be maintained even for Japanese sentences in which hiragana has a high degree of mixture of character types such as kanji and poor continuity of the same character type.

[Brief description of the drawings]

【図１】発明の原理説明図である。FIG. 1 is a diagram illustrating the principle of the present invention.

【図２】実施例の構成説明図である。FIG. 2 is a diagram illustrating the configuration of an embodiment.

【図３】シフトＪＩＳコード体系の概略説明図である。FIG. 3 is a schematic explanatory diagram of a shift JIS code system.

【図４】実施例のデータ圧縮作用を説明するフローチャ
ートである。FIG. 4 is a flowchart illustrating a data compression operation of the embodiment.

【図５】データ圧縮された具体例の説明図である。FIG. 5 is an explanatory diagram of a specific example of data compression.

【図６】データ伸長装置の概略構成図である。FIG. 6 is a schematic configuration diagram of a data decompression device.

【図７】データ伸長装置のデータ伸長作用を説明するフ
ローチャートである。FIG. 7 is a flowchart illustrating a data decompression operation of the data decompression device.

[Explanation of symbols]

２０データ圧縮装置２２，４２入力部２４，４４モード切替部２６圧縮部２８第１圧縮モード３０第２圧縮モード３２第３圧縮モード３４，５４出力部４０データ伸長装置 Reference Signs List 20 data compression device 22, 42 input unit 24, 44 mode switching unit 26 compression unit 28 first compression mode 30 second compression mode 32 third compression mode 34, 54 output unit 40 data decompression device

Claims

(57) [Claims]

The first byte of 2-byte data indicating a hiragana or a symbol which is sequentially input is omitted until the compression mode is switched , the second byte is converted into a numerical value and transmitted , and the 2-byte data representing a kanji is transmitted as it is. 1st compression mode (10) and 2 indicating a sequentially input kana or symbol
The second compression mode (1) in which the first byte of the byte data is omitted until the compression mode is switched , the second byte is converted into a numerical value and transmitted , and the 2-byte data indicating the kanji is transmitted as it is
Compression means (16) having 2) and a third compression mode (14) for transmitting 2-byte data or 1-byte data as kanji when the 2-byte data or 1-byte data is input, without change; A first control code is transmitted when 2-byte data indicating a hiragana or a symbol is input, and a second control code is transmitted when 2-byte data indicating a hiragana or a symbol is input, and 1-byte data is input. And a switching means (18) for transmitting a third control code when switching is performed to switch the compression mode.