JPH06195063A

JPH06195063A - Character pattern data compressing device

Info

Publication number: JPH06195063A
Application number: JP4357883A
Authority: JP
Inventors: Shin Mikuni; 伸三国
Original assignee: Casio Computer Co Ltd
Current assignee: Casio Computer Co Ltd
Priority date: 1992-12-25
Filing date: 1992-12-25
Publication date: 1994-07-15

Abstract

PURPOSE:To compress character pattern data with the use of a compressing method which suppresses redundancy by making the appearance frequency of data partial while making the combination pattern of one data constituting the character pattern data and last data correspond to fixed data for reproduc tion when the combination pattern appears frequently. CONSTITUTION:An uncompressed font pattern data storage part 101 aims at respective data constituting a character pattern and detects the frequency of the combination of the aimed data and its last data and a key word selection part 105 generates a key word table when the combination is high in frequency. A data appearance frequency counting part 107 makes this key word table correspond to command codes for reproduction to allow the appearance frequency of the data to be deviated. A compressed code table generation part 109 generates a compressed code string having its redundant part cut corresponding to the appearance frequency of the data. A compressive encoding part 112 substitutes the compressed code string for the character pattern data.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は、ワードプロセッサ等
の文書作成装置に組み込まれる文字パターンデータを圧
縮する文字パターンデータ圧縮装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character pattern data compression device for compressing character pattern data incorporated in a document creation device such as a word processor.

【０００２】[0002]

【従来の技術】一般に、ワードプロセッサにおいて、入
力作成された文書データはテキストメモリにコード形式
で記憶されており、文書印刷指令が与えられると、文字
パターンメモリを参照することによってテキストメモリ
内の文書データが１文字ずつドットパターンに変換され
て印字出力される。ここで、文字パターンメモリは半導
体読み出し専用メモリ（ＲＯＭ）等により構成され、こ
の文字パターンメモリ内に標準装備されている書体の文
字パターンで文書印刷が行われる。2. Description of the Related Art Generally, in a word processor, input and created document data is stored in a code format in a text memory, and when a document print command is given, the document data in the text memory is referred to by referring to a character pattern memory. Is converted into a dot pattern character by character and printed out. Here, the character pattern memory is composed of a semiconductor read-only memory (ROM) or the like, and a document is printed with a character pattern of a typeface provided as standard in the character pattern memory.

【０００３】[0003]

【発明が解決しようとする課題】ところで、文字パター
ンデータは例えば４８ドットフォントの場合、１文字に
含まれるドット数は４８×４８＝２３０４ドットとな
り、１文字２８８バイト要することになる。これが、例
えば、ＪＩＳ第１、第２水準の漢字、仮名記号全体では
７０００文字以上となり、その容量は２Ｍバイトと大変
巨大なものとなってしまう。この為、書体の異なる複数
種の文字パターンを予め文字パターンメモリ内に標準装
備しておくことは、文字パターンメモリが膨大化し、コ
スト高となる。また、フロッピーディスク等の外部記憶
装置から文字パターンを供給するものにおいても、書体
毎に複数の外部記憶装置が必要となり、コスト高とな
る。そこで、従来においては、文字パターンデータを圧
縮して記憶させておき、文書印刷時にこの圧縮パターン
を元の文字パターンに復元するようにしたものが知られ
ている。この種のものは、例えば、文字パターンデータ
を構成する“０”のビットデータが連続する場合、その
連続する数をその“０”のビット列データに置き換えて
記憶させることによって文字パターンを圧縮するように
しているが、このような圧縮法では圧縮率が低く、余り
実効が上らないのが実状であった。この発明の課題は、
文字パターンデータを構成するあるデータとその前のデ
ータとの組み合せパターンが頻繁に出現する場合にこれ
を再生用の固定データに対応付けてデータの出現頻度を
偏らせておくことにより冗長性を抑制する圧縮手法を用
いて文字パターンデータを圧縮できるようにすることで
ある。By the way, when the character pattern data is, for example, a 48-dot font, the number of dots contained in one character is 48 × 48 = 2304 dots, and one character requires 288 bytes. This is, for example, 7,000 characters or more for the JIS kanji and JIS kana symbols as a whole of JIS, and the capacity is very huge, 2 Mbytes. For this reason, if a plurality of types of character patterns having different typefaces are provided as standard equipment in the character pattern memory in advance, the character pattern memory becomes huge and the cost becomes high. Further, even in the case of supplying a character pattern from an external storage device such as a floppy disk, a plurality of external storage devices are required for each typeface, resulting in high cost. Therefore, conventionally, it is known that the character pattern data is compressed and stored, and the compressed pattern is restored to the original character pattern when the document is printed. In this type, for example, when the bit data of “0” forming the character pattern data is continuous, the character pattern is compressed by replacing the continuous number with the bit string data of “0” and storing it. However, with such a compression method, the compression rate is low, and the actual situation is that the effectiveness is not so high. The subject of this invention is
Redundancy is suppressed by associating this with fixed data for reproduction and biasing the appearance frequency of data when a combination pattern of certain data that constitutes character pattern data and the data before it frequently appears. That is, the character pattern data can be compressed by using the compression method.

【０００４】[0004]

【課題を解決するための手段】この発明の手段は次の通
りである。ビットマップ方式の各文字パターンデータを
それぞれ圧縮して文書作成装置に組み込む文字パターン
データ圧縮装置において、組み合せ頻度検出手段は文字
パターンデータを構成する各データに順次着目し、この
着目データとその前のデータとの組み合せに応じてその
組み合せ頻度を検出する動作を全ての文字に対して行
う。テーブル作成手段はこの組み合せ頻度検出手段によ
って頻度の高い組み合せが検出された場合に、前のデー
タがこの組み合せならば着目データはこれであるという
データ候補を各組み合せ毎に記憶する候補テーブルを作
成する。圧縮符号発生手段は前記候補テーブルを再生用
の固定データに対応させることでデータの出現頻度を偏
らせ、このデータの出現頻度に応じて冗長部分を抑制し
た圧縮符号列を発生する。圧縮手段は各文字パターンデ
ータを前記圧縮符号発生手段によって発生された圧縮符
号列に置き換えることで文字パターンデータを圧縮す
る。なお、例えば、前記組み合せ頻度検出手段によって
頻度の高い組み合せが検出された場合に、前記テーブル
作成手段は前のデータの１つの組み合せに対して着目デ
ータの異なる複数の候補テーブルを作成し、前記圧縮符
号発生手段は複数の候補テーブルのそれぞれに固定デー
タを対応させるようにしてもよい。また、例えば、文字
パターンデータを構成する各データの単位は、ビットマ
ップ方式でデータを持つ際の１アドレスを占有する１ワ
ードを構成するビット群となるようにしてもよい。The means of the present invention are as follows. In a character pattern data compression device that compresses each character pattern data of the bit map system and incorporates it in the document creation device, the combination frequency detection means sequentially focuses on each data forming the character pattern data, and The operation of detecting the combination frequency is performed for all characters according to the combination with the data. When a combination having a high frequency is detected by the combination frequency detecting means, the table creating means creates a candidate table for storing, for each combination, a data candidate that the data of interest is this if the previous data is this combination. . The compression code generation means biases the appearance frequency of the data by associating the candidate table with the fixed data for reproduction, and generates the compression code string in which the redundant portion is suppressed according to the appearance frequency of the data. The compressing means compresses the character pattern data by replacing each character pattern data with the compression code string generated by the compression code generating means. It should be noted that, for example, when the combination frequency detecting means detects a combination having a high frequency, the table creating means creates a plurality of candidate tables having different focused data for one combination of the previous data, and compresses The code generating means may associate fixed data with each of the plurality of candidate tables. Further, for example, the unit of each data forming the character pattern data may be a bit group forming one word occupying one address when the data is held by the bitmap method.

【０００５】[0005]

【作用】この発明の手段の作用は次の通りである。先
ず、文字パターンデータを構成する各データに順次着目
し、この着目データとその前のデータとの組み合せに応
じてその組み合せ頻度を検出する動作が全ての文字に対
して行われる。これによって頻度の高い組み合せが検出
されると、前のデータがこの組み合せならば着目データ
はこれであるというデータ候補を各組み合せ毎に記憶す
る候補テーブルが作成される。すると、この候補テーブ
ルを再生用の固定データに対応させることでデータの出
現頻度を偏らせ、このデータ出現頻度に応じて冗長部分
を抑制した圧縮符号列を発生させて各文字パターンデー
タをこの圧縮符号列で置き換えることで文字パターンデ
ータを圧縮する。したがって、文字パターンデータを構
成するあるデータとその前のデータとの組み合せパター
ンが頻繁に出現する場合にこれを再生用の固定データに
対応付けてデータの出現頻度を偏らせておくことにより
冗長性を抑制する圧縮手法を用いて文字パターンデータ
を圧縮することができる。The operation of the means of the present invention is as follows. First, attention is sequentially paid to each data forming the character pattern data, and an operation of detecting the combination frequency is performed for all the characters in accordance with the combination of the focused data and the previous data. When a combination with a high frequency is detected in this way, a candidate table is created for storing, for each combination, a data candidate that the data of interest is this if the previous data is this combination. Then, by making this candidate table correspond to the fixed data for reproduction, the appearance frequency of the data is biased, and the compression code string in which the redundant part is suppressed is generated according to the appearance frequency of the data to compress each character pattern data. Character pattern data is compressed by replacing it with a code string. Therefore, when a combination pattern of a certain data forming the character pattern data and the data before it frequently appears, it is associated with the fixed data for reproduction and the frequency of appearance of the data is biased to achieve redundancy. Character pattern data can be compressed using a compression method that suppresses

【０００６】[0006]

【第１実施例】以下、図１〜図１６を参照して第１実施
例を説明する。図１は文字パターンデータの圧縮／再生
過程を概略的に示したもので、文字パターンデータ圧縮
装置１００は文書作成装置２００の製品完成時におい
て、例えば１３０００文字の文字パターンデータを本実
施例特有の規則にしたがってデータ圧縮し、それによっ
て得られた各種のデータ（後述する圧縮フォントデー
タ、インデックステーブル、逆変換テーブル、キーワー
ドテーブル）を文書作成装置２００に組み込む。つま
り、文字パターンデータ圧縮装置１００と文書作成装置
２００とは別個独立した装置であり、文字パターンデー
タ圧縮装置１００は文書作成装置２００に圧縮文字デー
タを格納させる為の装置で、文書作成装置２００の製品
完成時に圧縮文字データの書き込みが行われる。[First Embodiment] A first embodiment will be described below with reference to FIGS. FIG. 1 schematically shows a process of compressing / reproducing character pattern data. The character pattern data compressing device 100, when the product of the document creating device 200 is completed, converts the character pattern data of, for example, 13000 characters into a peculiar embodiment. Data is compressed according to the rules, and various data (compressed font data, index table, inverse conversion table, keyword table described later) obtained by the compression are incorporated in the document creating apparatus 200. That is, the character pattern data compression device 100 and the document creation device 200 are separate and independent devices. The character pattern data compression device 100 is a device for storing compressed character data in the document creation device 200. The compressed character data is written when the product is completed.

【０００７】文書作成装置２００は本実施例特有の文字
パターン再生装置２０１と通常備えられている文書作成
機主装置２０２とを有する構成で、文字パターン再生装
置２０１には圧縮文字データメモリ２０３が設けられて
いる。この圧縮文字データメモリ２０３は圧縮フォント
格納部２０４、インデックステーブル格納部２０５、逆
変換テーブル格納部２０６、キーワードテーブル格納部
２０７を有する構成で、文字パターンデータ圧縮装置１
００から発生出力された圧縮フォントデータ、インデッ
クステーブル、逆変換テーブル、キーワードテーブルが
圧縮文字データメモリ２０３内の対応する格納部２０４
〜２０７に書き込まれる。文書印刷時において文字パタ
ーン再生装置２０１は圧縮文字パターンを本実施例特有
の規則にしたがって逆変換し、通常と同様の無圧縮文字
パターンに復元再生し、文書作成機主装置２０２に与え
る。これによって通常と同様の印字結果を得るようにし
ている。The document creating apparatus 200 has a character pattern reproducing apparatus 201 peculiar to this embodiment and a document creating machine main apparatus 202 which is usually provided. The character pattern reproducing apparatus 201 is provided with a compressed character data memory 203. Has been. The compressed character data memory 203 includes a compressed font storage unit 204, an index table storage unit 205, an inverse conversion table storage unit 206, and a keyword table storage unit 207.
The compressed font data, the index table, the inverse conversion table, and the keyword table generated and output from 00 correspond to the corresponding storage unit 204 in the compressed character data memory 203.
To 207. At the time of printing the document, the character pattern reproducing device 201 reversely converts the compressed character pattern according to the rules peculiar to the present embodiment, restores and reproduces the same uncompressed character pattern as usual, and gives it to the document creator main unit 202. As a result, the same printing result as usual is obtained.

【０００８】なお、上述した「圧縮フォントデータ」、
「インデックステーブル」、「逆変換テーブル」、「キ
ーワードテーブル」については後で詳細に説明するの
で、ここでは簡単に説明しておくものとする。「圧縮フ
ォントデータ」は文字パターンデータを構成するデータ
のうち頻繁に出現するデータパターンを特定のコード
（固定値）に対応させることによってデータの出現頻度
を偏らせ、データの出現頻度に応じて冗長部分をカット
する特性の圧縮方法を用いて圧縮することによって得ら
れたデータであり、１文字毎に通常の文字パターンデー
タは上述した圧縮手法にしたがって圧縮され、文書作成
装置２００内の圧縮フォント格納部２０４に全文字分の
圧縮フォントデータが書き込まれる。「インデックステ
ーブル」はどの文字が「圧縮フォントデータ」の中の何
バイト目から入っているかを示すテーブルである。「逆
変換テーブル」および「キーワードテーブル」は圧縮フ
ォントデータを通常の文字パターンデータに復元再生す
る際に用いられるテーブルである。The above-mentioned "compressed font data",
The “index table”, the “inverse conversion table”, and the “keyword table” will be described in detail later, and therefore will be briefly described here. "Compressed font data" biases the appearance frequency of data by associating frequently appearing data patterns of the data forming the character pattern data with a specific code (fixed value), and makes them redundant according to the appearance frequency of data. Data obtained by compression using a compression method having a characteristic of cutting a part, and normal character pattern data is compressed for each character according to the compression method described above, and stored in the compressed font in the document creation device 200. Compressed font data for all characters is written in the section 204. The "index table" is a table indicating which character is included in which byte of the "compressed font data". The "inverse conversion table" and the "keyword table" are tables used when the compressed font data is restored to normal character pattern data and reproduced.

【０００９】図２は文字パターンデータ圧縮装置１００
のブロック構成図である。無圧縮フォントパターンデー
タ格納部１０１には文書作成装置２００に搭載すべき全
ての文字（例えば１３０００文字）が収められている。
図３は、無圧縮フォントパターンデータ格納部１０１に
収められている文字パターンのデータ構造を示してい
る。文字パターンは１文字４８×４８ドット構成で、８
ビット構成のメモリを縦方向に配置し、それを横に４８
バイト並べ、更にこれを縦方向に６段重ねた構造のメモ
リにドットマップ方式で格納されている。したがって、
１文字分の文字パターンは縦８ビット×６段＝４８ドッ
ト、横４８ドットで２８８バイト構成となっている。な
お、この場合のアドレスで一番若いのは文字パターンの
左上隅部に対応するアドレスであり、次に若いのは右
隣、……最後は右下隅部に対応するアドレスとなる。こ
の右下隅部のアドレスに続くアドレス領域には次の文字
の左上隅部に対応するデータが格納されている。また、
ビット並びはＭＳＢが図中、上に来るように配置されて
いる。FIG. 2 shows a character pattern data compression device 100.
It is a block configuration diagram of. The uncompressed font pattern data storage unit 101 stores all the characters (for example, 13000 characters) to be installed in the document creating apparatus 200.
FIG. 3 shows the data structure of character patterns stored in the uncompressed font pattern data storage unit 101. The character pattern consists of 48 x 48 dots per character, 8
Arrange the bit-structured memory vertically and set it horizontally 48
The data is stored in a dot-map manner in a memory having a structure in which bytes are arranged and further vertically stacked in six stages. Therefore,
A character pattern for one character has a structure of 288 bytes consisting of vertical 8 bits × 6 rows = 48 dots and horizontal 48 dots. In this case, the youngest address is the address corresponding to the upper left corner of the character pattern, the youngest is the address next to the right, and the last is the address corresponding to the lower right corner. Data corresponding to the upper left corner of the next character is stored in the address area following the address of the lower right corner. Also,
The bit arrangement is arranged so that the MSB is at the top in the figure.

【００１０】ソース・バッファ１０２は無圧縮フォント
パターンデータ格納部１０１から読み出された１文字分
の文字パターンデータを一時的に記憶するもので、無圧
縮フォントパターンデータ格納部１０１の内容が１文字
ずつソース・バッファ１０２に格納される。このように
処理対象の文字パターンデータを１文字ずつ無圧縮フォ
ントパターンデータ格納部１０１から取り出してソース
・バッファ１０２に一時記憶させることにより処理速度
の低下を防止するようにしている。なお、ソース・バッ
ファ１０２には図３で示したデータ構造のまま文字パタ
ーンデータが格納される。The source buffer 102 temporarily stores the character pattern data for one character read from the uncompressed font pattern data storage unit 101, and the content of the uncompressed font pattern data storage unit 101 is one character. Are stored in the source buffer 102 one by one. In this way, the character pattern data to be processed is taken out character by character from the uncompressed font pattern data storage unit 101 and temporarily stored in the source buffer 102 to prevent the processing speed from decreasing. The source buffer 102 stores the character pattern data with the data structure shown in FIG.

【００１１】キーワード選定用計数制御部１０３は２つ
の機能を有し、その１つはソース・バッファ１０２の内
容に基づいてキーワード選定用カウンタアレイ１０４に
対するカウンタアドレスを算出する機能、他の１つは算
出したアドレスによって指定されるカウンタの値をカウ
ントアップする機能である。The keyword selection counting control unit 103 has two functions, one of which is for calculating a counter address for the keyword selection counter array 104 based on the contents of the source buffer 102, and the other is for the other. It is a function of counting up the value of the counter designated by the calculated address.

【００１２】図４はキーワード選定用計数制御部１０３
の第１機能を説明する為の図である。キーワード選定用
計数制御部１０３はソース・バッファ１０２から連続す
る３バイトのデータ、つまり、隣り合う８ビット×３の
データを抽出し、現在着目しているデータを「ＤＲ」、
１つ前（左隣）のデータを「ＤＭ」、更にもう１つ前の
データを「ＤＬ」とすると、キーワード選定用計数制御
部１０３は先ず、２つ前のデータ「ＤＬ」の上位４ビッ
トと下位４ビットのイクスクルーシブオアＸＯＲ（記号
「∀」を付して示す）を取り、これをキーワード選定用
カウンタアレイ１０４をアクセスする為の第１要素アド
レスの最上位４ビットとする。この第１要素アドレスは
本実施例の場合、１２ビットアドレスで、この下位８ビ
ットは１つ前のデータ「ＤＭ」そのものとなる。つま
り、第１要素アドレスは２つ前のデータ「ＤＬ」のイク
スクルージブオアＸＯＲの演算結果と１つ前のデータ
「ＤＭ」を組み合せたものとなる。ここで、図示の例で
は２つ前のデータ「ＤＬ」は「７０ｈ」（ｈは１６進表
記を示す記号）、そのイクスクルーシブオアＸＯＲの演
算結果は「７ｈ」、１つ前のデータ「ＤＭ」が「３ｃ
ｈ」である為、第１要素アドレスは「７３ｃｈ」の１２
ビットアドレスとなる。一方、キーワード選定用計数制
御部１０３は着目データ「ＤＲ」そのものをキーワード
選定用カウンタアレイ１０４の第２要素アドレス（８ビ
ットアドレス）とする。したがって、図４の例ではアド
レス算出機能によって「７３ｃｈ、１Ｅｈ」というアド
レスが算出される。このようなアドレス算出処理はソー
ス・バッファ１０２内の３バイト目（１、２バイト目を
除く）から最後の２８８バイト目まで１バイト毎に着目
しながら行われる。なお、３バイト目から開始するのは
２バイト前からデータを参照する為であり、しかも文字
パターンデータの最初の２バイトは空白であることが多
く、特に問題にならないからである。FIG. 4 is a count control unit 103 for keyword selection.
3 is a diagram for explaining the first function of FIG. The keyword selection counting control unit 103 extracts continuous 3 bytes of data from the source buffer 102, that is, adjacent 8 bits × 3 data, and the currently focused data is “DR”,
Assuming that the immediately preceding data (on the left) is “DM” and the further previous data is “DL”, the keyword selection counting control unit 103 first determines the upper 4 bits of the immediately preceding data “DL”. And the lower 4 bits of the exclusive OR XOR (indicated by the symbol “∀”) are used as the upper 4 bits of the first element address for accessing the keyword selection counter array 104. In the case of the present embodiment, this first element address is a 12-bit address, and the lower 8 bits are the previous data "DM" itself. That is, the first element address is a combination of the operation result of the exclusive OR XOR of the data "DL" two before and the data "DM" one before. Here, in the illustrated example, the data “DL” two data before is “70h” (h is a symbol indicating hexadecimal notation), the operation result of the exclusive OR XOR is “7h”, and the data before one is “ "DM" is "3c
Since it is "h", the first element address is 12 of "73ch".
It becomes a bit address. On the other hand, the keyword selection counting control unit 103 sets the focused data “DR” itself as the second element address (8-bit address) of the keyword selection counter array 104. Therefore, in the example of FIG. 4, the address “73ch, 1Eh” is calculated by the address calculation function. Such address calculation processing is performed while paying attention to each byte from the third byte (excluding the first and second bytes) to the last 288th byte in the source buffer 102. The reason for starting from the 3rd byte is that the data is referred to from 2 bytes before, and moreover, the first 2 bytes of the character pattern data are often blank, which is not a problem.

【００１３】キーワード選定用計数制御部１０３の第２
機能はアドレス算出機能によって求められたアドレスで
キーワード選定用カウンタアレイ１０４をアクセスし、
該当するカウンタの値に「１」を加算するもので、該当
カウンタをアクセスする毎にその内容をカウントアップ
してゆく。Second of count control unit 103 for keyword selection
The function is to access the keyword selection counter array 104 at the address obtained by the address calculation function,
"1" is added to the value of the corresponding counter, and the content is incremented each time the corresponding counter is accessed.

【００１４】キーワード選定用カウンタアレイ１０４は
連続する３バイトのデータの組み合せ頻度を計数するカ
ウンタ群で、前２つのデータがこの組み合せならば次に
来るデータはこれであるという定義表（キーワードテー
ブル）を作成する際に用いられる。ここで、無圧縮フォ
ントパターンデータ格納部１０１内の全文字について、
データの組み合せ頻度が調べられる。つまり、キーワー
ド選定用計数制御部１０３は全文字が１３０００文字で
あれば、１文字毎にその３バイト目から２８８バイト目
までのデータに順次着目し、連続する３バイトのデータ
の組み合せ頻度を調べる動作を１３０００文字分繰り返
す。The keyword selection counter array 104 is a group of counters for counting the combination frequency of consecutive 3-byte data. If the previous two data are this combination, the next data is this definition table (keyword table). Used when creating a. Here, for all characters in the uncompressed font pattern data storage unit 101,
The frequency of data combination is examined. That is, if all the characters are 13000, the keyword selection counting control unit 103 sequentially looks at the data from the 3rd byte to the 288th byte for each character, and checks the combination frequency of consecutive 3 bytes of data. The operation is repeated for 13,000 characters.

【００１５】図５はキーワード選定用カウンタアレイ１
０４の構成を示したもので、キーワード選定用カウンタ
アレイ１０４は２次元の広がりを持っており、キーワー
ド選定用計数制御部１０３からの第１要素アドレス（１
２ビット）と第２要素アドレス（８ビット）とによって
１つのカウンタが指定されてその値がカウントアップさ
れる。なお、このカウントアレイ１０４は列方向に２⁸
＝２５６列、行方向に２¹²＝４０９６行のカウンタ群に
よって構成されている。また各カウンタの初期値として
本実施例においては「０」がセットされている。FIG. 5 shows a keyword selection counter array 1.
In the configuration of No. 04, the keyword selection counter array 104 has a two-dimensional spread, and the first element address (1
One counter is designated by 2 bits) and the second element address (8 bits), and the value is incremented. The count array 104 is 2 ^{8 in the} column direction.
= 256 columns and 2 ¹² = 4096 rows in the row direction. Further, "0" is set as the initial value of each counter in this embodiment.

【００１６】キーワード選定部１０５は無圧縮フォント
パターンデータ格納部１０１内の全文字分のデータの組
み合せ頻度がキーワード選定用カウンタアレイ１０４に
計数されると、キーワード選定用カウンタアレイ１０４
の第１要素アドレス毎にカウント値が最大な第２要素ア
ドレス（着目データ「ＤＲ」の値）を調べ、それをキー
ワードテーブル格納部１０６の第１候補キーワードとし
て登録する。図６はキーワードテーブルの構成例で、図
示の例ではキーワード選定用計数制御部１０３からのア
ドレス「７３Ｃｈ、１Ｅｈ」でアドレッシングされるカ
ウンタ「Ｃ７３Ｃ１Ｅ（先頭の「Ｃ」はカウンタ識別用
の記号）」が第１要素アドレス「７３Ｃｈ」のカウンタ
群の中で１番大きな値を持っていた場合である。この場
合、第１候補キーワードの「７３Ｃｈ」の項目に着目デ
ータＤＲ「１Ｅｈ」が登録され、２バイト前のイクスク
ルーシブオアＸＯＲの演算結果が「７ｈ」、１バイト前
のデータが「３Ｃｈ」であれば、着目データは「１Ｅ
ｈ」である可能性が大きいことを表わしている。更に２
番目に大きな値を持っている第２要素アドレスの値が第
２候補キーワードに、同様に３番目、４番目に大きなカ
ウント値を持っている第２要素アドレスの値が第３候
補、第４候補キーワードとして登録される。このように
本実施例においてはデータの組み合せに応じて４種類の
データ候補を登録するようにしている。When the keyword selection counter array 104 counts the combination frequency of the data for all characters in the uncompressed font pattern data storage unit 101, the keyword selection counter array 104 outputs the keyword selection counter array 104.
The second element address (value of the focused data “DR”) having the maximum count value is checked for each of the first element addresses of, and it is registered as the first candidate keyword in the keyword table storage unit 106. FIG. 6 shows an example of the configuration of the keyword table. In the illustrated example, the counter "C73C1E (the leading" C "is a counter identification symbol)" addressed by the address "73Ch, 1Eh" from the keyword selection counting control unit 103. Has the largest value in the counter group of the first element address “73Ch”. In this case, the focused data DR “1Eh” is registered in the item “73Ch” of the first candidate keyword, the operation result of the exclusive OR XOR two bytes before is “7h”, and the data one byte before is “3Ch”. If so, the data of interest is "1E
It means that there is a high possibility that it is “h”. 2 more
The value of the second element address having the second largest value is the second candidate keyword, and the values of the second element address having the third and fourth largest count values are the third and fourth candidates. Registered as a keyword. As described above, in this embodiment, four types of data candidates are registered according to the combination of data.

【００１７】キーワードテーブル格納部１０６内に格納
されたキーワードテーブルは前２つのデータがこの組み
合せの場合には次に来る着目データはこれであるという
定義表であり、前２つのデータから１つのデータを導き
出す為のテーブルであるので、本来は前２つ分の１６ビ
ットのアドレスを持つことになる。しかし、そのような
テーブルは容量が６４Ｋバイトにもなってしまう。ま
た、文字パターンにはこのようなテーブルを埋め尽くす
全ての組み合せが存在する可能性は低く、テーブル自体
に冗長性が出てしまう。そこで、本実施例ではテーブル
容量を通常の１６分の１に縮小する手法が用いられてい
る。The keyword table stored in the keyword table storage unit 106 is a definition table in which when the previous two data are this combination, the next data of interest is this definition table. Since it is a table for deriving, the address originally has 16 bits for the previous two. However, such a table has a capacity of 64 Kbytes. Further, it is unlikely that the character pattern has all the combinations that fill up such a table, and the table itself becomes redundant. Therefore, in this embodiment, a method of reducing the table capacity to 1/16 of the normal size is used.

【００１８】即ち、キーワード選定用カウンタアレイ１
０４の第１要素アドレスは着目データ「ＤＲ」に対して
１つ前のデータ「ＤＭ」と２つ前のデータ「ＤＬ」とを
組み合せたものであるが、２つ前のデータ「ＤＬ」は上
位４ビットと下位４ビットとのイクスクルーシブオアＸ
ＯＲの演算によって得られた４ビットデータに縮小され
る為、本来の１６ビットアドレスは１２ビットアドレス
となり、これによってキーワードテーブルの容量は２¹²
／２¹⁶＝１／１６となる。ここで、２つ前のデータ「Ｄ
Ｌ」を縮小したのは着目データ「ＤＲ」に対する影響が
１つ前のデータ「ＤＭ」よりも少ないからであり、また
イクスクルーシブオアＸＯＲの演算を選んだのは全ての
ビットの影響が結果の４ビットに残るからである。更に
上位と下位の４ビットで演算を行うのはパターンデータ
の連続性を考慮した為である。なお、この方法によれ
ば、同一アドレスとなってしまう組み合せが１６種存在
することになるが、その内、頻度の高い４種類の組み合
せでそのほとんどをカバーできることが実験的に確認さ
れた。なお、２種類でも十分であるが、本実施例では４
種類の組み合せを選定するようにしている。That is, the keyword selection counter array 1
The first element address 04 is a combination of the previous data "DM" and the previous two data "DL" with respect to the data of interest "DR", but the previous two data "DL" is Exclusive or X of upper 4 bits and lower 4 bits
Since the original 16-bit address becomes a 12-bit address because it is reduced to 4-bit data obtained by the OR operation, the capacity of the keyword table is 2 ¹²
/ 2 ¹⁶ = 1/16. Here, the previous data “D
L is reduced because it has less influence on the focused data “DR” than the previous data “DM”. Also, the exclusive OR XOR operation is selected because of the influence of all bits. This is because the remaining 4 bits remain. Further, the reason that the upper 4 bits and the lower 4 bits are calculated is because the continuity of the pattern data is taken into consideration. According to this method, there are 16 kinds of combinations having the same address, but it has been experimentally confirmed that most of them can be covered by 4 kinds of combinations which have a high frequency. Two types are sufficient, but in this embodiment, four
I try to select a combination of types.

【００１９】データ出現回数計数部１０７はキーワード
テーブルが作成された後に動作するもので、キーワード
テーブルだけでは全てのデータがキーワードにヒットす
るとは限らない為、キーワード選定用計数制御部１０３
と略同様に再度、データの出現回数を計数する。ここ
で、ヒットしなかったデータについてもその出現回数が
計数される。データ出現回数計数部１０７が動作を開始
すると、無圧縮フォントパターンデータ格納部１０１内
の全ての文字が１文字ずつソース・バッファ１０２に格
納されてゆき、データ出現回数計数部１０７によって１
文字ずつ処理される。The data appearance frequency counting unit 107 operates after the keyword table is created, and not all the data hits the keyword only with the keyword table. Therefore, the keyword selection counting control unit 103
The number of appearances of data is counted again in the same manner as. Here, the number of appearances of the data that did not hit is counted. When the data appearance number counting unit 107 starts its operation, all the characters in the uncompressed font pattern data storage unit 101 are stored in the source buffer 102 one by one, and the data appearance number counting unit 107 sets the number of characters to 1.
It is processed character by character.

【００２０】即ち、データ出現回数計数部１０７はキー
ワード選定用計数制御部１０３で述べた方法で着目デー
タ「ＤＲ」に対して前２つのデータ「ＤＬ」および１つ
前のデータ「ＤＭ」から１２ビットアドレスを求め、こ
の１２ビットアドレスでキーワードテーブル格納部１０
６をアクセスし、着目データ「ＤＲ」が第１候補キーワ
ード〜第４候補キーワードに一致するか否かをチェック
する。ここで、着目データ「ＤＲ」が第１候補キーワー
ド〜第４候補キーワードの何れかに一致する場合には、
それに対応する回数カウントアレイ１０８内のカウンタ
の値をカウントアップ「＋１」させる。また、何れのキ
ーワードにも一致しない場合にはその着目データに対応
する回数カウントアレイ１０８内のカウンタの値をカウ
ントアップ「＋１」させる。That is, the data appearance frequency counting unit 107 uses the method described in the keyword selection counting control unit 103 to select the data from the previous two data "DL" and the previous data "DM" to the target data "DR". The bit address is obtained, and the keyword table storage unit 10 is calculated using this 12-bit address.
6 is accessed to check whether the focused data “DR” matches the first to fourth candidate keywords. Here, when the focused data “DR” matches any of the first to fourth candidate keywords,
The value of the counter in the count counter array 108 corresponding thereto is incremented by "+1". If none of the keywords matches, the value of the counter in the number-of-times count array 108 corresponding to the data of interest is incremented by "+1".

【００２１】図７は回数カウントアレイ１０８の構成を
示したもので、回数カウントアレイ１０８には着目デー
タ「ＤＲ」がキーワードに一致しない場合において着目
データ「００」〜「ＦＦ」に対応してそのデータの出現
回数を計数するカウンタ「Ｃ０００ｈ」、「Ｃ００１
ｈ」……「Ｃ０ＦＦｈ」と、第１〜第４候補のキーワー
ドの何れかに一致する場合のデータの出現回数を計数す
るカウンタ「Ｃ１００ｈ」、「Ｃ１０１ｈ」、「Ｃ１０
２ｈ」、「Ｃ１０３ｈ」が設けられている。ここで、各
カウンタは特定のコード「０００ｈ」〜「０ＦＦｈ」、
「１００ｈ」、「１０１ｈ」、「１０２ｈ」、「１０３
ｈ」に対応付けられており、各コードは圧縮文字データ
を元の文字パターンデータに復元再生させる為の再生コ
マンドを定義している。FIG. 7 shows the structure of the frequency count array 108. In the frequency count array 108, when the data of interest "DR" does not match the keyword, the data of interest "00" to "FF" are displayed. Counters "C000h" and "C001" that count the number of times data appears
h "..." C0FFh "and counters" C100h "," C101h ", and" C10 "that count the number of appearances of data in the case of matching with any of the first to fourth candidate keywords.
2h ”and“ C103h ”are provided. Here, each counter has a specific code “000h” to “0FFh”,
"100h", "101h", "102h", "103"
Each code is associated with “h” and defines a reproduction command for restoring and reproducing the compressed character data to the original character pattern data.

【００２２】図８は各コマンドコードとその実行コマン
ドの内容を示したもので、コマンドコード「０００ｈ」
〜「０ＦＦｈ」は“コマンドコードの下位８ビットの内
容（着目データ）をストアせよ”を意味し、コマンドコ
ード「１００ｈ」は“第１候補キーワードを参照してそ
の内容（着目データ）をストアせよ”を意味する。更
に、コマンドコード「１０１ｈ」は“第２候補キーワー
ドを参照してその内容をストアせよ”を意味し、またコ
マンドコード「１０２ｈ」は“第３候補のキーワードを
参照してその内容をストアせよ”を意味し、コマンドコ
ード「１０３ｈ」は“第４候補のキーワードを参照して
その内容をストアせよ”を意味する。このようにコマン
ドの内容はコマンドデータの一部をそのままデータとし
て再生させるものと、キーワードテーブルを参照してそ
の内容をデータとして再生させるものとがある。FIG. 8 shows the contents of each command code and its execution command. Command code "000h"
~ "0FFh" means "store the contents (focused data) of the lower 8 bits of the command code", and the command code "100h" refers to the first candidate keyword and stores the contents (focused data). Means. Furthermore, the command code "101h" means "store the contents by referring to the second candidate keyword", and the command code "102h""stores the contents by referring to the third candidate keyword". The command code “103h” means “store the contents by referring to the fourth candidate keyword”. As described above, as for the content of the command, there is one in which a part of the command data is directly reproduced as data, and the other in which the content is reproduced as data by referring to the keyword table.

【００２３】このようにデータ出現回数計数部１０７は
着目データ「ＤＲ」が第１候補〜第４候補のキーワード
に一致するときには、対応するカウンタの値をカウント
アップするが、かなりのデータは第１候補〜第４候補の
キーワードと一致する為、回数カウントアレイ１０８内
のカウント値はコマンドコード「１００ｈ」〜「１０３
ｈ」に対応するカウンタ値が極端に大きくなり、偏りが
発生する。このようにデータの出現頻度に偏りが発生す
る為、効果的なデータ圧縮を実現することが可能とな
る。As described above, the data appearance frequency counting unit 107 counts up the value of the corresponding counter when the focused data “DR” matches the keywords of the first to fourth candidates, but a considerable amount of data is the first. Since the keywords match the candidates to the fourth candidate, the count values in the count array 108 are command codes “100h” to “103”.
The counter value corresponding to “h” becomes extremely large, and a bias occurs. Since the occurrence frequency of data is biased in this way, effective data compression can be realized.

【００２４】圧縮符号表発生部１０９はデータ出現回数
計数部１０７が全文字についての処理を終了した時点で
回数カウントアレイ１０８内のカウント値を用いて圧縮
符号を発生する。この場合、例えばハフマン符号の発生
アルゴリズムなどの冗長性をカットする特性の符号であ
れば、その内容は問わない。なお、ハフマン符号化法な
どは広く知られた技術であるので、ここではその説明は
省略するものとする。この圧縮符号表発生部１０９によ
って発生された符号表は圧縮符号表格納部１１０に書き
込まれる。The compression code table generation unit 109 generates a compression code using the count value in the number count array 108 when the data appearance frequency counting unit 107 finishes processing for all characters. In this case, as long as the code has a characteristic of cutting redundancy such as a Huffman code generation algorithm, its content does not matter. Note that the Huffman coding method and the like are widely known techniques, and therefore description thereof will be omitted here. The code table generated by the compression code table generating unit 109 is written in the compression code table storage unit 110.

【００２５】図９は圧縮符号表を示し、圧縮符号表はコ
マンドコードに対応して圧縮符号列とそのビット数とか
ら成り、この圧縮符号列は可変長データであり、ハフマ
ン符号化法にしたがってデータの出現頻度の大きいもの
は少ないビット数の圧縮符号列で表わされている。な
お、圧縮符号表の内容は図中、見易くする為にビット数
の少ないものから並べてあるが、実際にはこのように並
べる必要はない。この圧縮符号表は２つの使われ方をす
る。その１つは無圧縮フォントパターンを圧縮コマンド
列に変換するときであり、他の１つは圧縮コマンド列を
伸長するときに使われる逆変換テーブルの作成時であ
る。FIG. 9 shows a compression code table. The compression code table is composed of a compression code string and the number of bits thereof corresponding to a command code. This compression code string is variable length data, and according to the Huffman coding method. A data having a high appearance frequency is represented by a compression code string having a small number of bits. It should be noted that the contents of the compression code table are arranged in order from the one having the smallest number of bits in order to make it easy to see in the figure, but it is not actually necessary to arrange them in this way. This compression code table has two uses. One is when converting a non-compressed font pattern into a compressed command string, and the other is when creating an inverse conversion table used when decompressing a compressed command string.

【００２６】圧縮符号化部１１２はデータ出現回数計数
部１０７と略同様の動作を行うものであるが、データ出
現回数計数部１０７で回数カウントアレイ１０８のカウ
ンタをカウントアップしていた代わりにキーワードテー
ブル格納部１０６の内容を参照しながら、圧縮符号表格
納部１１０に格納されているビット列（圧縮符号列）を
データ書き込み部１１３を介して圧縮フォント格納部１
１４に吐き出す動作を行う。The compression encoding unit 112 performs substantially the same operation as the data appearance number counting unit 107, but instead of counting up the counter of the number counting array 108 in the data appearance number counting unit 107, the keyword table is used. Referring to the contents of the storage unit 106, the bit sequence (compression code sequence) stored in the compression code table storage unit 110 is stored in the compressed font storage unit 1 via the data writing unit 113.
The operation of exhaling to 14 is performed.

【００２７】即ち、無圧縮フォントパターンデータ格納
部１０１内の全ての文字が１文字ずつソース・バッファ
１０２に格納されてゆき、圧縮符号化部１１２によって
１文字ずつ処理される。この場合、圧縮符号化部１１２
は上述した方法で着目データ「ＤＲ」に対して前２つの
データ「ＤＬ」、「ＤＭ」から１２ビットアドレスを求
め、この１２ビットアドレスでキーワードテーブル格納
部１０６をアクセスし、着目データ「ＤＲ」が第１候補
キーワード〜第４候補キーワードの何れかに一致するか
否かをチェックする。ここで、着目データ「ＤＲ」が第
１候補キーワード〜第４候補キーワードの何れかに一致
する場合には圧縮符号表を参照し、該当候補に対応する
コマンドコードを指定してその圧縮符号列を読み出す。
この圧縮符号列はデータ書き込み部１１３に与えられ圧
縮フォント格納部１１４に書き込まれる。一方、不一致
の場合には着目データに対応するコマンドコードを指定
してその圧縮符号列を読み出す。この圧縮符号列はデー
タ書き込み部１１３に与えられ、圧縮フォント格納部１
１４に書き込まれる。このようにして圧縮フォント格納
部１１４内に書き込まれた１文字分の圧縮符号列は可変
長データとなる。そして、１文字分の圧縮符号列が圧縮
フォント格納部１１４に書き込まれると、データ書き込
み部１１３はその文字が圧縮フォント格納部１１４の何
バイト目から書き始められているかを示す為のアドレス
がインデックステーブル格納部１１５にセットされる。That is, all the characters in the uncompressed font pattern data storage unit 101 are stored character by character in the source buffer 102 and processed by the compression encoding unit 112 character by character. In this case, the compression encoding unit 112
Obtains a 12-bit address from the previous two data "DL" and "DM" for the data of interest "DR" by the method described above, accesses the keyword table storage unit 106 with this 12-bit address, and obtains the data of interest "DR". Checks whether any of the first to fourth candidate keywords matches. Here, when the focused data “DR” matches any of the first candidate keyword to the fourth candidate keyword, the compression code table is referred to, the command code corresponding to the candidate is designated, and the compression code string is selected. read out.
This compressed code string is given to the data writing unit 113 and written in the compressed font storage unit 114. On the other hand, if they do not match, the command code corresponding to the data of interest is designated and the compression code string is read. This compressed code string is given to the data writing unit 113, and the compressed font storage unit 1
14 is written. The compression code string for one character written in the compressed font storage unit 114 in this manner becomes variable length data. Then, when a compression code string for one character is written in the compressed font storage unit 114, the data writing unit 113 uses an index as an address for indicating from which byte of the compressed font storage unit 114 the character is started to be written. It is set in the table storage unit 115.

【００２８】逆変換テーブル発生部１１６は圧縮符号表
を構成する圧縮符号列を数値に見立ててその数値の小さ
い順に並べ変える。この時、逆変換テーブル発生部１１
６は最大ビット長を持つ符号に合せて残りは右側に
「０」を付加して数値に見立てる。例えばコマンドコー
ド「１０１ｈ」の符号列は「１０」であるが、最大ビッ
ト長が１６ビットであるならば、この右側に「０」を１
４個付加して「８０００ｈ」として取り扱う。このよう
にして圧縮符号表格納部１１０の内容を数値的な重み順
に並べ変えると、図１０に示す様な並びとなる。The inverse conversion table generating section 116 rearranges the compression code sequences forming the compression code table into numerical values and rearranges them in ascending order of numerical values. At this time, the inverse conversion table generating unit 11
6 is regarded as a numerical value by adding "0" to the right side according to the code having the maximum bit length. For example, the code string of the command code “101h” is “10”, but if the maximum bit length is 16 bits, “0” is set to 1 to the right of this.
Add four and handle as "8000h". When the contents of the compression code table storage unit 110 are rearranged in the numerical order in this manner, the arrangement is as shown in FIG.

【００２９】ここで、圧縮フォント格納部１１４の内容
を復元再生する際において、あるビット列が与えられた
とき、そのコマンドコードが何かを調べるにはそのビッ
ト列の先頭から１ビットずつ取り出して調べてゆき、コ
マンドコードの符号列と一致したところで解析終了とな
れば処理速度等の点において極めて好都合である。そこ
で、図１０を見てみると、先頭のビットが「０」ならば
それはコマンドコード「１００ｈ」であると結論できる
ことが解る。しかし、「１」ならば次のビットを見なけ
ればならない。いま、次のビットを見た場合、それが
「０」ならばコマンドコード「１０１ｈ」であるが、
「１」であれば更に次のビットを見なければならない。
ここで、図１１は図１０の圧縮符号表を木リスト構造で
表わしたものである。Here, when a certain bit string is given when the contents of the compressed font storage unit 114 are restored and reproduced, in order to check what the command code is, one bit is taken out from the beginning of the bit string and checked. At the end, if the analysis ends when the code string of the command code matches, it is extremely convenient in terms of processing speed. Therefore, looking at FIG. 10, it is understood that if the leading bit is “0”, it can be concluded that it is the command code “100h”. However, if it is a "1", then the next bit must be seen. Now, when looking at the next bit, if it is "0", the command code is "101h",
If it is "1", then the next bit must be seen.
Here, FIG. 11 shows the compression code table of FIG. 10 in a tree list structure.

【００３０】この２進の木リスト構造において、「ｎｉ
ｌ」と書かれている分岐点はそこでコマンドコードが確
定するところを表し、また「ｄｉｒ」と書かれている分
岐点は次のビットを見る必要があるということを表わし
ている。そこで、「ｄｉｒ」の部分を図１２に示す様な
構造で表わしてやると非常に効率の良い逆変換テーブル
を得ることが可能となる。ここで、「ｄｉｒ」は対象と
するビットが「０」の場合と「１」の場合の２通りに
「振舞い」が分けられる。よって、１つの「ｄｉｒ」を
表わすのに１６ビットのデータを２組、即ち４バイトで
表わすことにする。「アドレス＋０（１６ビット）」は
「０」のときの振舞い、「アドレス＋２（１６ビッ
ト）」は「１」のときの振舞いを表わす。また、テーブ
ルアドレスはリストの根元（Ｒｏｏｔ）より「００
０」、「００４」、「００８」……に割り当てられてい
る。図１３は逆変換テーブルにおける振舞いデータの構
造を示し、「振舞い」にはそこで「ｎｉｌ」となり、デ
ータが確定する場合と、また新たに「ｄｉｒ」となる場
合とがある。そこで、図１３に示す様に１６ビットデー
タのＭＳＢに「ｄｉｒ」か「ｎｉｌ」の識別子を置き、
残りの１５ビットで次の「ｄｉｒ」が記述されている逆
変換テーブル上のアドレスを表わすか、コマンドコード
の値を表わす。なお、ＭＳＢの識別子が「１」のときに
は「ｎｉｌ」で、続く１５ビットはコマンドコードを表
わし、ＭＳＢの識別子が「０」のときには「ｄｉｒ」
で、続く１５ビットは次の「ｄｉｒ」アドレスであるこ
とを表わす。図１４は逆変換テーブルの内容を具体的に
示したもので、逆変換テーブル発生部１１６は図１０で
示した圧縮符号表にしたがって図１４に示す様な構造の
逆変換テーブルを作成し、逆変換テーブル格納部１１７
に格納する。In this binary tree list structure, "ni
The branch point labeled "l" indicates where the command code is fixed, and the branch point labeled "dir" indicates that the next bit needs to be seen. Therefore, if the "dir" portion is represented by a structure as shown in FIG. 12, it is possible to obtain a very efficient inverse conversion table. Here, as for “dir”, “behavior” is divided into two cases, where the target bit is “0” and “1”. Therefore, one set of "dir" is represented by two sets of 16-bit data, that is, four bytes. "Address + 0 (16 bits)" represents the behavior when "0", and "Address + 2 (16 bits)" represents the behavior when "1". The table address is "00" from the root of the list.
It is assigned to 0 ”,“ 004 ”,“ 008 ”.... FIG. 13 shows the structure of the behavior data in the inverse conversion table. The "behavior" may be "nil" there, and the data may be fixed or may be newly "dir". Therefore, as shown in FIG. 13, an identifier of "dir" or "nil" is placed in the MSB of 16-bit data,
The remaining 15 bits represent the address on the inverse conversion table in which the next "dir" is described, or the value of the command code. When the MSB identifier is "1", it is "nil", and the following 15 bits represent a command code, and when the MSB identifier is "0", it is "dir".
Then, the following 15 bits represent the next "dir" address. FIG. 14 specifically shows the contents of the inverse conversion table. The inverse conversion table generator 116 creates an inverse conversion table having a structure as shown in FIG. 14 according to the compression code table shown in FIG. Conversion table storage unit 117
To store.

【００３１】次に、文書作成装置２００の構成を図１
５、図１６を参照して説明する。図１５は文書作成装置
２００の全体構成を示したブロック図である。主制御部
２１０は各種プログラムにしたがってこの文書作成装置
２００の全体動作を制御するもので、キー操作部２１１
から入力された文書データは主制御部２１０に取り込ま
れ、表示制御部２１２を介して表示装置２１３から表示
出力されたり、文書メモリ２１４に格納される。この文
書メモリ２１４は文書データをコード形式で記憶するも
ので、その内容はキー操作部２１１からの登録指令に応
答して外部記憶制御装置２１５に与えられ、外部記憶装
置２１６に登録保存されたり、キー操作部２１１からの
印字指令に応答して読み出され、文字パターン再生装置
２０１によって対応する文字パターンデータに変換され
たのち、印刷制御部２１７を介して印刷装置２１８から
印字出力される。Next, the configuration of the document creating apparatus 200 is shown in FIG.
5, and will be described with reference to FIG. FIG. 15 is a block diagram showing the overall configuration of the document creation device 200. The main control unit 210 controls the overall operation of the document creating apparatus 200 according to various programs, and the key operation unit 211.
The document data input from the main control unit 210 is fetched, displayed and output from the display device 213 via the display control unit 212, or stored in the document memory 214. The document memory 214 stores the document data in a code format, the content of which is given to the external storage control device 215 in response to a registration command from the key operation unit 211, registered and stored in the external storage device 216, The data is read in response to a print command from the key operation unit 211, converted into corresponding character pattern data by the character pattern reproducing device 201, and then printed out from the printing device 218 via the print control unit 217.

【００３２】文字パターン再生装置２０１は上述した圧
縮文字データメモリ２０３の他、展開制御部２１９、文
字パターンバッファ２２０を有する構成で、この圧縮文
字データメモリ２０３には図１で示した様に文字パター
ンデータ圧縮装置１００から供給された各種のデータを
記憶する圧縮フォント格納部２０４、インデックステー
ブル格納部２０５、逆変換テーブル格納部２０６、キー
ワードテーブル格納部２０７が設けられており、展開制
御部２１９はこれらのデータに基づいて圧縮文字データ
を通常の文字パターンデータに復元再生し、文字パター
ンバッファ２２０に書き込む。The character pattern reproducing device 201 is configured to have a decompression control unit 219 and a character pattern buffer 220 in addition to the above-mentioned compressed character data memory 203. In the compressed character data memory 203, as shown in FIG. A compressed font storage unit 204, an index table storage unit 205, an inverse conversion table storage unit 206, and a keyword table storage unit 207, which store various data supplied from the data compression apparatus 100, are provided. The compressed character data is restored to normal character pattern data on the basis of the above data and reproduced and written in the character pattern buffer 220.

【００３３】図１６は文字パターン再生装置２０１を詳
細に示したブロック構成図である。展開制御部２１９は
文字コードレジスタ２２１、インデックステーブルアク
セス部２２２、圧縮データアクセス部２２３、圧縮パタ
ーンバッファ２２４、再生コマンド発生部２２５、再生
コマンド実行部２２６を有する構成で、文書作成機主装
置２０２から印刷対象の文字コードが文字コードレジス
タ２２１にセットされると、インデックステーブルアク
セス部２２２はその文字が圧縮フォント格納部２０４の
どこに書き込まれているかをアクセスする為にインデッ
クステーブル格納部２０５を検索する。この場合、イン
デックステーブル格納部２０５の内容はＪＩＳの並び順
にしたがっている為、インデックステーブルアクセス部
２２２は文字コードに基づいてインデックステーブル格
納部２０５から該当アドレスを検索することができる。
このようにして検索されたアドレスは圧縮データアクセ
ス部２２３に送られる。圧縮データアクセス部２２３は
このアドレスにしたがって圧縮フォント格納部２０４を
アクセスし、１文字分の圧縮文字データを読み出して圧
縮パターンバッファ２２４に転送する。FIG. 16 is a block diagram showing in detail the character pattern reproducing device 201. The expansion control unit 219 has a character code register 221, an index table access unit 222, a compressed data access unit 223, a compression pattern buffer 224, a reproduction command generation unit 225, and a reproduction command execution unit 226. When the character code to be printed is set in the character code register 221, the index table access unit 222 searches the index table storage unit 205 to access where in the compressed font storage unit 204 the character is written. In this case, since the contents of the index table storage unit 205 follow the order of JIS, the index table access unit 222 can retrieve the corresponding address from the index table storage unit 205 based on the character code.
The address thus retrieved is sent to the compressed data access unit 223. The compressed data access unit 223 accesses the compressed font storage unit 204 according to this address, reads the compressed character data for one character, and transfers it to the compressed pattern buffer 224.

【００３４】再生コマンド発生部２２５は逆変換テーブ
ル用のポインタ（図示せず）を持ち、圧縮パターンバッ
ファ２２４内の圧縮文字データ（圧縮符号列）を逆変換
テーブル格納部２０６の内容にしたがってコマンドコー
ドに変換するもので、これによって変換されたコマンド
コードは１コマンド毎に再生コマンド実行部２２６に送
られる。The reproduction command generation section 225 has a pointer (not shown) for the inverse conversion table, and the compressed character data (compression code string) in the compression pattern buffer 224 is converted into a command code according to the contents of the inverse conversion table storage section 206. The command code thus converted is sent to the reproduction command execution section 226 for each command.

【００３５】再生コマンド実行部２２６は図８で示した
実行コマンドの内容にしたがったストア処理を再生コマ
ンド発生部２２５からのコマンドコードにしたがって逐
次実行するもので、その際、キーワードテーブル格納部
２０７から必要に応じてデータを抽出し、そのデータを
文字パターンバッファ２２０に書き込む。ここで、コマ
ンドコードが「１００ｈ」〜「１０３ｈ」、つまり、そ
の内容が“第１候補キーワード〜第４候補キーワードを
参照してストアせよ”の場合には、文字パターンバッフ
ァ２２０に書き込まれている２つ前のデータを文字パタ
ーンバッファ２２０から読み出すと共に、このデータか
ら上述した方法でキーワードテーブルのアドレスを生成
し、このアドレスで指定されるテーブル内容を再生デー
タとして文字パターンバッファ２２０に書き込む。これ
により文字パターンバッファ２２０には復元再生された
文字パターンデータが格納される。ここで、文書作成機
主装置２０２は通常と同様に文字パターンバッファ２２
０から文字パターンデータを読み出して印字出力させ
る。The reproduction command execution unit 226 sequentially executes the store processing according to the contents of the execution command shown in FIG. 8 in accordance with the command code from the reproduction command generation unit 225. Data is extracted as necessary, and the data is written in the character pattern buffer 220. Here, when the command code is “100h” to “103h”, that is, the content is “store with reference to first candidate keyword to fourth candidate keyword”, it is written in the character pattern buffer 220. The data two bytes before is read from the character pattern buffer 220, the address of the keyword table is generated from this data by the method described above, and the table contents designated by this address are written in the character pattern buffer 220 as reproduction data. As a result, the restored and reproduced character pattern data is stored in the character pattern buffer 220. Here, the document creator main unit 202 uses the character pattern buffer 22 as usual.
The character pattern data is read from 0 and printed out.

【００３６】次に、本実施例の動作を説明する。先ず、
文字パターンデータ圧縮装置１００によるパターン圧縮
動作について説明する。ソース・バッファ１０２には無
圧縮フォントパターンデータ格納部１０１にビットマッ
プ方式で格納されている各種の文字パターンデータをそ
の先頭文字から１文字ずつ転送される。すると、キーワ
ード選定用計数制御部１０３は先ずソース・バッファ１
０２から１バイト目〜３バイト目のデータを切り出すと
共に３バイト目のデータに着目し、この着目データ「Ｄ
Ｒ」に対して１つ前の２バイト目のデータを「ＤＲ」、
更に１つ前の１バイト目のデータを「ＤＬ」とする。こ
こで、キーワード選定用計数制御部１０３は２つ前のデ
ータ「ＤＬ」の上位４ビットと下位４ビットとのイクス
クルーブオアＸＯＲを取り、これをキーワード選定用カ
ウンタアレイ１０４をアクセスする為の第１要素アドレ
スの最上位４ビットとする。いま、図４の例では２つ前
のデータ、つまり１バイト目のデータは「０１１１００
００（７０ｈ）」で、そのＸＯＲの結果は「０１１１
（７ｈ）」となる。そして、１つ前の２バイト目のデー
タ「ＤＭ」が「００１１１１００（３Ｃｈ）」である
為、第１要素アドレスは「０１１１００１１１１００
（７３Ｃｈ）」の１２ビットアドレスとなる。一方、キ
ーワード選定用計数制御部１０３は着目している３バイ
ト目のデータ「ＤＲ」そのものをキーワード選定用カウ
ンタアレイ１０４の第２要素アドレス「０００１１１１
０（１Ｅｈ）」とする。したがって、図４の例では「７
３Ｃｈ、１Ｅｈ」というアドレスが算出されることにな
る。Next, the operation of this embodiment will be described. First,
A pattern compression operation by the character pattern data compression device 100 will be described. Various character pattern data stored in the uncompressed font pattern data storage unit 101 by the bitmap method is transferred to the source buffer 102 one by one from the first character. Then, the keyword selection counting control unit 103 first determines the source buffer 1
The data of the 1st to 3rd bytes is cut out from 02, and the data of the 3rd byte is focused.
The 2nd byte data immediately before “R” is “DR”,
The data of the immediately preceding first byte is “DL”. Here, the keyword selection counting control unit 103 takes an exclusive OR XOR of the upper 4 bits and the lower 4 bits of the data “DL” two before, and uses this to access the keyword selection counter array 104. The most significant 4 bits of the first element address. Now, in the example of FIG. 4, the second previous data, that is, the data of the first byte is "011100".
00 (70h) ”, the result of the XOR is“ 0111 ”.
(7h) ”. Then, since the data “DM” of the immediately preceding second byte is “00111100 (3Ch)”, the first element address is “011100111100”.
(73 Ch) ”is a 12-bit address. On the other hand, the keyword selection counting control unit 103 sets the focused third byte data “DR” itself to the second element address “0001111” of the keyword selection counter array 104.
0 (1 Eh) ". Therefore, in the example of FIG.
The address “3Ch, 1Eh” is calculated.

【００３７】次に、キーワード選定用計数制御部１０３
はソース・バッファ１０２の４バイト目に着目し、２つ
前の２バイト目のデータと１つ前の３バイト目のデータ
とに基づいてキーワード選定用カウンタアレイ１０４の
アドレスを算出する。このようなアドレス算出処理は１
バイト毎に最後の２８８バイト目まで１バイト毎に繰り
返し実行される。これと同時にキーワード選定用計数制
御部１０３は算出アドレスにしたがってキーワード選定
用カウンタアレイ１０４をアクセスし、該当するカウン
タ値をカウントアップしてゆく。これによってデータの
組み合せ頻度がキーワード選定用カウンタアレイ１０４
で計数されることになる。Next, the keyword selection counting control unit 103
Pays attention to the 4th byte of the source buffer 102, and calculates the address of the keyword selection counter array 104 based on the 2nd previous data of the 2nd byte and the 1st previous data of the 3rd byte. Such address calculation processing is 1
It is repeatedly executed byte by byte up to the last 288th byte. At the same time, the keyword selection counting control unit 103 accesses the keyword selection counter array 104 according to the calculated address and counts up the corresponding counter value. As a result, the combination frequency of the data is determined by the keyword selection counter array 104.
Will be counted.

【００３８】このようにして無圧縮フォントパターンデ
ータ格納部１０１内の全ての文字についての組み合せ頻
度を計数し終ると、キーワード選定部１０５はキーワー
ド選定用カウンタアレイ１０４の内容を解析し、キーワ
ードテーブルを作成する。即ち、キーワード選定部１０
５はキーワード選定用カウンタアレイ１０４の第１要素
アドレス毎に最も大きいカウンタ値の第２要素アドレ
ス、つまり、組み合せ頻度が最も高い着目データを第１
候補キーワードとして登録し、次に大きいカウンタ値の
第２要素アドレスを第２候補キーワードとして登録し、
同様に３番目、４番目に大きなカウンタ値の第２要素ア
ドレスを第３候補キーワード、第４候補キーワードとし
て登録する。このようにデータの組み合せ頻度に基づい
てそれに続く着目データはこれであるという候補が４種
類抽出されてキーワードテーブル格納部１０６に格納さ
れる。After counting the combination frequencies of all the characters in the uncompressed font pattern data storage unit 101 in this way, the keyword selection unit 105 analyzes the contents of the keyword selection counter array 104 and opens the keyword table. create. That is, the keyword selection unit 10
5 is the second element address having the largest counter value for each first element address of the keyword selection counter array 104, that is, the focused data having the highest combination frequency is the first
Register as a candidate keyword, register the second element address of the next largest counter value as the second candidate keyword,
Similarly, the second element address having the third and fourth largest counter values is registered as the third candidate keyword and the fourth candidate keyword. As described above, four types of candidates that the following data of interest is this are extracted based on the combination frequency of the data and stored in the keyword table storage unit 106.

【００３９】いま、キーワードテーブルの作成が終わる
と、それを参照しながらデータ出現回数計数部１０７が
動作を始める。この場合にも無圧縮フォントパターンデ
ータ格納部１０１からソース・バッファ１０２に１文字
ずつ文字パターンデータが順次格納されてゆく。ここ
で、データ出現回数計数部１０７は上述した様にキーワ
ード選定用計数制御部１０３と同様にデータの出現回数
を計数するが、今度はキーワードテーブル格納部１０６
内のキーワードに一致する場合における第１候補〜第４
候補毎の出現回数と、キーワードに一致しなかった場合
におけるデータ毎の出現回数を計数する。Now, when the creation of the keyword table is completed, the data appearance frequency counting section 107 starts its operation while referring to it. Also in this case, the character pattern data is sequentially stored in the source buffer 102 from the uncompressed font pattern data storage unit 101 one character at a time. Here, the data appearance number counting unit 107 counts the number of times the data appears as in the keyword selection counting control unit 103 as described above, but this time, the keyword table storage unit 106.
First to fourth candidates in the case of matching with the keyword in
The number of appearances for each candidate and the number of appearances for each data when the keywords do not match are counted.

【００４０】即ち、データ出現回数計数部１０７は着目
データに対して前２つのデータから求めた１２ビットア
ドレスでキーワードテーブル格納部１０６をアクセス
し、着目データが第１候補〜第４候補の何れかのキーワ
ードに一致する場合にはそれに対応する回数カウントア
レイ１０８内のカウンタ値をカウントアップさせる。例
えば、１２ビットアドレスが「７３Ｃｈ」で、図６に示
す如く第１候補キーワードの「７３Ｃｈ」の項目に「１
Ｅｈ」が登録されている場合、着目データが「１Ｅｈ」
であれば、この着目データは第１候補キーワードと一致
するので、回数カウントアレイ１０８内のカウンタ「Ｃ
１００ｈ」の値が「＋１」される。一方、何れのキーワ
ードにも一致しなかった場合にはその着目データに対応
する回数カウントアレイ１０８内のカウンタ値がカウン
トアップされる。例えば、着目データが「０１ｈ」であ
れば、回数カウントアレイ１０８内のカウンタ「Ｃ００
１ｈ」の値が「＋１」される。このような動作は１文字
の最後まで１文字毎に全文字分繰り返し実行される。こ
の結果、かなりのデータがキーワードと一致する様にな
る為、カウンタ「Ｃ１００ｈ」〜「Ｃ１０３ｈ」の値が
極端に大きくなり、データ圧縮に最適な偏りを発生す
る。ここで、回数カウントアレイ１０８内の各カウンタ
は圧縮文字データを元の文字パターンデータに再生させ
る際に用いられる再生コマンドコード（図８参照）に対
応付けられている為、かなりのデータがこの再生コマン
ド「１００ｈ」〜「１０３ｈ」に書き換えることが可能
となる。That is, the data appearance number counting unit 107 accesses the keyword table storage unit 106 with the 12-bit address obtained from the previous two data with respect to the data of interest, and the data of interest is one of the first to fourth candidates. If it matches the keyword, the counter value in the number-of-times count array 108 corresponding to the keyword is incremented. For example, the 12-bit address is "73Ch", and the first candidate keyword "73Ch" has an item "1" as shown in FIG.
When “Eh” is registered, the data of interest is “1Eh”
If this is the case, this focused data matches the first candidate keyword, so the counter “C
The value of “100h” is incremented by “+1”. On the other hand, if none of the keywords matches, the counter value in the number-of-times count array 108 corresponding to the focused data is counted up. For example, if the data of interest is "01h", the counter "C00 in the count counter array 108
The value of "1h" is incremented by "+1". Such an operation is repeatedly executed for every character until the end of one character. As a result, since a considerable amount of data matches the keyword, the values of the counters "C100h" to "C103h" become extremely large, and the optimum bias for data compression occurs. Here, since each counter in the count counter array 108 is associated with a reproduction command code (see FIG. 8) used when reproducing the compressed character data into the original character pattern data, a considerable amount of data is reproduced. It is possible to rewrite the commands "100h" to "103h".

【００４１】次に、圧縮符号表発生部１０９は回数カウ
ントアレイ１０８の計数データを用いて圧縮符号をハフ
マン符号化法などの様にデータの出現頻度に応じて冗長
部分を抑制する方法で発生し、圧縮符号表格納部１１０
に格納する。このようにして発生された圧縮符号表にお
いて、データの出現頻度の大きいものは図９で示す様に
少ないビット数の符号列で表わされることになる。Next, the compression code table generator 109 uses the count data of the number count array 108 to generate a compression code by a method such as the Huffman coding method that suppresses the redundant portion according to the appearance frequency of the data. , Compression code table storage unit 110
To store. In the compression code table generated in this way, a code string having a high data appearance frequency is represented by a code string having a small number of bits as shown in FIG.

【００４２】このようにして圧縮符号表が完成すると、
圧縮符号化部１１２が動作し始める。この場合において
も無圧縮フォントパターンデータ格納部１０１内の文字
パターンデータが１文字ずつソース・バッファ１０２に
順次格納されてゆく。ここで、圧縮符号化部１１２は着
目データに対して前２つのデータから求めた１２ビット
アドレスでキーワードテーブル格納部１０６をアクセス
し、着目データが第１候補〜第４候補のキーワードと一
致する場合には、圧縮符号表を参照し、該当候補に対応
するコマンドコードを指定してその圧縮符号列を読み出
す。一方、不一致の場合には着目データに対応するコマ
ンドコードを指定してその圧縮符号列を読み出す。When the compression code table is completed in this way,
The compression encoding unit 112 starts operating. Also in this case, the character pattern data in the uncompressed font pattern data storage unit 101 is sequentially stored character by character in the source buffer 102. Here, when the compression encoding unit 112 accesses the keyword table storage unit 106 with the 12-bit address obtained from the previous two data for the data of interest, and the data of interest matches the keywords of the first to fourth candidates. , The compression code table is referred to, the command code corresponding to the candidate is designated, and the compression code string is read. On the other hand, if they do not match, the command code corresponding to the data of interest is designated and the compression code string is read.

【００４３】いま、例えば、文字パターンデータの左上
のデータ列がコマンドコードで表現した場合に「１００
ｈ＋１００ｈ＋０ＦＦｈ＋１０２ｈ＋１０１ｈ＋……」
という並び方をしているものとする。この場合、圧縮符
号表を参照し、コマンドコードを対応する圧縮符号列で
書き換えると、上述したコマンド列は「０＋０＋１１０
１００００＋１１００＋１０＋……」となる。これを８
ビット毎に区切って表現すると「００１１０１００＋０
０１１００１０＋……」となり、これを１６進で表わす
と「３４ｈ＋３２ｈ＋……」となる。なお、本実施例で
は文字パターンデータの３バイト目から着目するように
しているが、最初の２バイトは文字の左上の１６ドット
となり、「０」（空白）であることが多い。そこで、特
例的にこのエリアが「０」のときは第１候補キーワード
を参照するコマンド（図８の例では「１００ｈ」）に書
き換えることとしている。このようにして再生コマンド
を実行するときに最初の２バイトでこのコマンドコード
「１００ｈ」が来たら「０」を書き込む約束にしておけ
ば何の問題も生じない。Now, for example, if the upper left data string of the character pattern data is represented by a command code, "100
h + 100h + 0FFh + 102h + 101h + ... ”
It is assumed that they are arranged. In this case, referring to the compression code table and rewriting the command code with the corresponding compression code string, the above-mentioned command string becomes "0 + 0 + 110".
10000 + 1100 + 10 + ... ”. This 8
When expressed by dividing each bit, "00110100 + 0"
"0110010 + ...", and when expressed in hexadecimal, it becomes "34h + 32h + ...". In the present embodiment, attention is focused on the third byte of the character pattern data, but the first two bytes are the upper left 16 dots of the character, and are often "0" (blank). Therefore, as a special case, when this area is “0”, it is rewritten to a command (“100h” in the example of FIG. 8) that refers to the first candidate keyword. In this way, when the playback command is executed, if the command code "100h" comes in the first 2 bytes, if "0" is promised to be written, no problem will occur.

【００４４】このように圧縮符号化部１１２は文字パタ
ーンデータの左上から順次処理してゆき、１文字２８８
バイトを処理したところでいったん区切られ、１文字の
最後のビット列が８ビットとなるまで「０」が挿入され
る。これは文字毎にデータのスタートビットを揃えるた
めであり、このようにしないと、各文字の「インデック
ステーブル」の内容は「アドレス××××の第×ビット
から」という様に多くのデータを持たねばならず、テー
ブル容量が多くなり、その結果、全体の圧縮率に悪い影
響を与えるのみならず、展開時の処理速度にも不都合を
きたすからである。In this way, the compression encoding unit 112 sequentially processes the character pattern data from the upper left, and one character 288 is processed.
When the bytes are processed, they are separated once, and "0" is inserted until the last bit string of one character is 8 bits. This is because the start bits of the data are aligned for each character. If this is not done, the contents of the "index table" for each character will be a lot of data, such as "from the xth bit of the address XXXXX". This is because the storage capacity must be increased and the table capacity increases, resulting in not only adversely affecting the overall compression ratio, but also inconvenient for the processing speed during expansion.

【００４５】このようにして得られた圧縮符号列は１文
字毎にデータ書き込み部１１３に与えられて圧縮フォン
ト格納部１１４に書き込まれてゆく。この際、データ書
き込み部１１３はインデックステーブルを作成してイン
デックステーブル格納部１１５に書き込む。このインデ
ックステーブルには圧縮フォント格納部１１４に格納さ
れた各文字の最初のアドレスが記憶される。他方、逆変
換テーブル発生部１１６は圧縮符号表格納部１１０の内
容を数値的な重み順に並べ換えて図１０に示す様な圧縮
符号表に変換すると共に、この圧縮符号表を２進の木リ
スト構造を持った図１４に示す様な逆変換テーブルに変
換し、逆変換テーブル格納部１１７に格納する。The compression code string thus obtained is supplied to the data writing unit 113 for each character and written in the compressed font storage unit 114. At this time, the data writing unit 113 creates an index table and writes it in the index table storage unit 115. The index table stores the first address of each character stored in the compressed font storage unit 114. On the other hand, the inverse conversion table generating unit 116 rearranges the contents of the compression code table storage unit 110 in numerical weight order and converts the contents into a compression code table as shown in FIG. 10, and also converts this compression code table into a binary tree list structure. 14 is stored in the reverse conversion table storage unit 117.

【００４６】この結果、文字パターンデータ圧縮装置１
００内の圧縮フォント格納部１１４、インデックステー
ブル格納部１１５、逆変換テーブル格納部１１７、キー
ワードテーブル格納部１０６にそれぞれデータが揃う
と、文字パターンデータ圧縮装置１００の動作は終了す
る。この圧縮フォント格納部１１４、インデックステー
ブル格納部１１５、逆変換テーブル格納部１１７、キー
ワードテーブル格納部１０６の内容は文書作成装置２０
０の製品完成時において、通常のビットマップ方式の文
字パターンデータに代えて文書作成装置２００内の対応
する圧縮フォント格納部２０４、インデックステーブル
格納部２０５、逆変換テーブル格納部２０６、キーワー
ドテーブル格納部２０７に組み込まれる。As a result, the character pattern data compression device 1
When the compressed font storage unit 114, the index table storage unit 115, the inverse conversion table storage unit 117, and the keyword table storage unit 106 in 00 all have data, the operation of the character pattern data compression device 100 ends. The contents of the compressed font storage unit 114, the index table storage unit 115, the inverse conversion table storage unit 117, and the keyword table storage unit 106 are the contents of the document creation device 20.
When the product of No. 0 is completed, the corresponding compressed font storage unit 204, index table storage unit 205, inverse conversion table storage unit 206, keyword table storage unit in the document creation apparatus 200 are replaced with the normal bitmap character pattern data. It is installed in 207.

【００４７】上述のようにして圧縮フォントデータ、イ
ンデックステーブル、逆変換テーブル、キーワードテー
ブルが組み込まれた状態において、文書作成装置２００
は次の如く動作する。いま、文書印刷時において、印刷
対象のコードは文字文書メモリ２１４から読み出されて
主制御部２１０を介して主制御部２１０に与えられる。
すると、この文字コードは文字パターン再生装置２０１
内の文字コードレジスタ２２１にセットされるので、イ
ンデックステーブルアクセス部２２２はこの文字コード
に基づいてインデックステーブル格納部２０５から該当
アドレスを検索し、圧縮データアクセス部２２３はこの
アドレスに対応する１文字分の圧縮文字データ（圧縮符
号列）を読み出して圧縮パターンバッファ２２４に転送
する。With the compressed font data, index table, inverse conversion table, and keyword table incorporated as described above, the document creation device 200
Operates as follows. When printing a document, the code to be printed is read from the character document memory 214 and given to the main control unit 210 via the main control unit 210.
Then, this character code corresponds to the character pattern reproducing device 201.
Since it is set in the character code register 221, the index table access unit 222 retrieves the corresponding address from the index table storage unit 205 based on this character code, and the compressed data access unit 223 stores one character corresponding to this address. The compressed character data (compressed code string) is read and transferred to the compressed pattern buffer 224.

【００４８】ここで、再生コマンド発生部２２５は逆変
換テーブル格納部２０６の内容を参照し、圧縮パターン
バッファ２２４内の圧縮符号列を対応するコマンドコー
ドに変換するが、この場合、再生コマンド発生部２２５
内には逆変換テーブル用のポインタ（図示せず）が設け
られており、先ず、木リスト構造のＲｏｏｔの「ｄｉ
ｒ」アドレスである「０」をポインタの初期値として設
定しておく。この状態において、圧縮パターンバッファ
２２４内の圧縮符号列の先頭から１ビットのデータを読
み出し、それが“０”か“１”かのチェックを行う。い
ま、先頭ビットが“０”であれば、「ポインタ値＋０」
でアドレッシングされる逆変換テーブルの内容を読み出
し、その１６ビットデータのＭＳＢが“０”か否かをチ
ェックする。ここで、ＭＳＢが“１”であれば「振舞
い」は「ｎｉｌ」となり、そこでデータが確定し、その
下位１５ビットのデータはコマンドコードとなり、これ
を再生コマンド実行部２２６に与える。一方、ＭＳＢが
“０”であれば、「振舞い」は「ｄｉｒ」となり、次の
ビットを見る必要がある為、その下位１５ビットのデー
タは次のテーブルアドレスとなり、これをポインタに格
納する。Here, the reproduction command generator 225 refers to the contents of the inverse conversion table storage 206 and converts the compression code string in the compression pattern buffer 224 into a corresponding command code. In this case, the reproduction command generator 225 is used. 225
A pointer (not shown) for the inverse conversion table is provided inside, and first, "di" of Root of the tree list structure is provided.
The “r” address “0” is set as the initial value of the pointer. In this state, 1-bit data is read from the head of the compression code string in the compression pattern buffer 224, and it is checked whether it is "0" or "1". If the first bit is "0", "pointer value + 0"
The contents of the inverse conversion table addressed by are read, and it is checked whether the MSB of the 16-bit data is "0". Here, if the MSB is “1”, the “behavior” becomes “nil”, the data is fixed there, and the data of the lower 15 bits becomes a command code, which is given to the reproduction command execution unit 226. On the other hand, if the MSB is "0", the "behavior" is "dir" and the next bit needs to be seen. Therefore, the data of the lower 15 bits becomes the next table address, which is stored in the pointer.

【００４９】他方、圧縮符号の先頭ビットが“１”であ
れば、「ポインタ値＋２」でアドレッシングされる逆変
換テーブルの内容を読み出し、その１６ビットデータの
ＭＳＢが“０”か否かをチェックする。この場合におい
てもＭＳＢが“１”であれば「振舞い」は「ｎｉｌ」と
なり、その下位１５ビットのデータはコマンドコードと
なり、これを再生コマンド実行部２２６に与えるが、
“０”であれば「振舞い」は「ｄｉｒ」となり、その下
位１５ビットのデータは次のテーブルアドレスとなり、
これをポインタに格納する。このような動作を圧縮符号
列に対して１ビットずつ繰り返し実行される。On the other hand, if the first bit of the compression code is "1", the contents of the inverse conversion table addressed by "pointer value + 2" are read and it is checked whether the MSB of the 16-bit data is "0". To do. Even in this case, if the MSB is “1”, the “behavior” becomes “nil”, and the lower 15 bits of the data become a command code, which is given to the reproduction command execution unit 226.
If it is “0”, the “behavior” becomes “dir”, and the lower 15 bits of data become the next table address,
This is stored in the pointer. Such an operation is repeatedly performed on the compression code string bit by bit.

【００５０】いま、例えば、圧縮符号列が「００１１０
１００＋００１１００１０＋……」であり、逆変換テー
ブル格納部２０６の内容が図１４に示すデータ構造であ
るものとする。この場合、圧縮符号列の１ビット目は
「０」、「ポインタ値＋０」でアドレッシングされるテ
ーブル内容は「８１００ｈ」、したがって、このテーブ
ル内容の下位１５ビットのデータ「１００ｈ」がコマン
ドコードとして送出される。次に、２ビット目の符号も
「０」であるから上述と同様にコマンドコード「１００
ｈ」が送出される。次に、３ビット目の符号は「１」で
あり、最初にアドレッシングされるテーブル内容は「０
００４ｈ」、したがってテーブル内容の下位１５ビット
のデータが次のテーブルアドレス「０００４」としてポ
インタにセットされる。ここで、次の４ビット目の符号
も「１」であるから「ポインタ値（０００４）＋２」で
アドレッシングされるテーブル内容が読み出されるが、
この場合、テーブル内容は「０００８ｈ」、したがっ
て、このテーブル内容の下位１５ビットのデータが次の
テーブルアドレス「０００８」としてポインタにセット
される。次に、５ビット目のデータは「０」であり、
「ポインタ値（０００８）＋０」でアドレッシングされ
るテーブル内容「０００Ｃｈ」が読み出され、その下位
１５ビットのデータが次のテーブルアドレス「０００
Ｃ」としてポインタにセットされる。Now, for example, the compression code string is "00110".
100 + 00110010 + ... ”, and the contents of the inverse conversion table storage unit 206 have the data structure shown in FIG. In this case, the first bit of the compression code string is "0", the table content addressed by "pointer value + 0" is "8100h", and therefore the lower 15-bit data "100h" of this table content is transmitted as a command code. To be done. Next, since the code of the second bit is also “0”, the command code “100
h "is sent. Next, the code of the third bit is "1", and the content of the table addressed first is "0".
004h ”, that is, the lower 15-bit data of the table contents is set in the pointer as the next table address“ 0004 ”. Here, since the code of the next 4th bit is also "1", the table content addressed by "pointer value (0004) +2" is read out.
In this case, the table content is "0008h", and therefore the lower 15 bits of data of this table content are set in the pointer as the next table address "0008". Next, the data of the 5th bit is “0”,
The table content “000Ch” addressed by “pointer value (0008) +0” is read, and the lower 15 bits of data are read to the next table address “000”.
"C" is set in the pointer.

【００５１】以下、同様に、次の６ビット目の符号は
「１」であり、テーブルアドレスは「０００Ｃ」から
「００１０」番地にジャンプし、更に７ビット目〜９ビ
ット目の符号も「０」であるからテーブルアドレスは
「００１４番地」、「００１８番地」、「００１Ｃ番
地」に順次ジャンプする。そして、１０ビット目の符号
も「０」であるが、この場合、テーブル内容は「８０Ｆ
Ｅｈ」であり、その下位１５ビットのデータ「０ＦＥ
ｈ」がコマンドコードとして送出される。このように圧
縮符号列の３ビット目〜１０ビット目のデータはコマン
ドコード「０ＦＥｈ」に変換される。同様に、圧縮符号
列の１１ビット目〜１４ビット目のデータ（１１００）
を順次読み出すことによってテーブルアドレスは「００
００番地」から「０００４番地」、「０００８番地」、
「０００Ｃ番地」に順次ジャンプし、「０００Ｃ番地」
のテーブル内容「８１０２ｈ」が読み出された際に、そ
の下位１５ビットのデータ「１０２ｈ」がコマンドコー
ドとして送出される。このように再生コマンド発生部２
２５は圧縮パターンバッファ２２４の内容を逆変換テー
ブルを参照することによってコマンドコードに順次変換
してゆく。Similarly, the code of the next 6th bit is "1", the table address jumps from "000C" to the address "0010", and the code of the 7th to 9th bits is "0". Therefore, the table address jumps to “address 0014”, “address 0018”, and “address 001C” in order. The code of the 10th bit is also "0", but in this case, the table content is "80F.
Eh ", and the lower 15 bits of data" 0FE "
h ”is sent as a command code. In this way, the data of the third bit to the 10th bit of the compression code string is converted into the command code “0FEh”. Similarly, the 11th to 14th bits of the compression code string (1100)
Table address is read as "00".
"00" to "0004", "0008",
Jump to "000C address" in sequence, and go to "000C address"
When the table content "8102h" of the above is read, the lower 15-bit data "102h" is sent out as a command code. In this way, the reproduction command generator 2
25 sequentially converts the contents of the compression pattern buffer 224 into command codes by referring to the inverse conversion table.

【００５２】すると、再生コマンド実行部２２６はキー
ワードテーブル格納部２０７の内容を参照しながらコマ
ンドコードにしたがった処理を実行する。即ち、再生コ
マンド実行部２２６は再生コマンド発生部２２５から１
コマンド毎に送られてくるコマンドコードにしたがって
図８で示す実行コマンドの内容にしたがった処理を実行
してゆく。ここで、１文字の最初の２バイトについて上
述した様にコマンドコード「１００ｈ」が発生された場
合には、固定値「０（空白）」を文字パターンバッファ
２２０に書き込むものとする。いま、コマンドコードが
「０００ｈ」〜「０ＦＦｈ」であればこのコマンドコー
ドの下位８ビットの内容がそのまま再生データとなり、
文字パターンバッファ２２０にストアされる。一方、コ
マンドコードが「１００ｈ」、「１０１ｈ」、「１０２
ｈ」、「１０３ｈ」であれば、再生コマンド実行部２２
６は文字パターンバッファ２２０から前２つのデータを
読み出し、このデータに基づいてキーワードテーブルの
アドレスを算出する。そして、キーワードテーブル格納
部２０７をアクセスし、コマンドコード「１００ｈ」、
「１０１ｈ」、「１０２ｈ」、「１０３ｈ」に対応する
第１候補キーワード〜第４候補キーワードを指定すると
共に、指定されたキーワードの中から算出アドレスに対
応するデータを抽出する。このデータがコマンドコード
「１００ｈ」〜「１０３ｈ」についての再生データとな
り、文字パターンバッファ２２０にストアされる。この
ような処理が１文字２８８バイト分繰り出されることに
よって文字パターンバッファ２２０には１文字分のパタ
ーンデータが再生されることになる。Then, the reproduction command execution section 226 executes the processing according to the command code while referring to the contents of the keyword table storage section 207. That is, the reproduction command execution unit 226 outputs the reproduction command generation unit 225
The processing according to the contents of the execution command shown in FIG. 8 is executed according to the command code sent for each command. Here, when the command code "100h" is generated for the first two bytes of one character as described above, the fixed value "0 (blank)" is written in the character pattern buffer 220. Now, if the command code is "000h" to "0FFh", the contents of the lower 8 bits of this command code become the reproduction data as they are,
It is stored in the character pattern buffer 220. On the other hand, the command codes are "100h", "101h", "102".
If h ”and“ 103h ”, the reproduction command execution unit 22
6 reads the previous two data from the character pattern buffer 220 and calculates the address of the keyword table based on this data. Then, the keyword table storage unit 207 is accessed, and the command code “100h”,
The first to fourth candidate keywords corresponding to "101h", "102h", and "103h" are designated, and data corresponding to the calculated address is extracted from the designated keywords. This data becomes the reproduction data for the command codes “100h” to “103h” and is stored in the character pattern buffer 220. By repeating such processing for one character of 288 bytes, one character of pattern data is reproduced in the character pattern buffer 220.

【００５３】以上の如く、文字パターンデータ圧縮装置
１００は頻度の高いデータの組み合せ毎に第１候補〜第
４候補のキーワードテーブルを作成して各キーワードテ
ーブルをコマンドコード「１００ｈ」〜「１０３ｈ」に
対応させておくことにより、コマンドのデータ出現頻度
を極端に偏らせ、それを冗長性をカットする特性を持つ
ハフマン符号化法によって圧縮することにより高い圧縮
率を得ることができる。つまり、かなり多くのデータが
コマンドコード「１００ｈ」〜「１０３ｈ」に対応する
少ないビット数の圧縮符号列に置き換えられる為、圧縮
率を極めて高くすることができる。この場合、本実施例
においては第１候補〜第４候補までのキーワードテーブ
ルを作成するので、キーワードテーブルがそれよりも少
ない場合よりもヒットする確率が高くなり、高圧縮が可
能となる。例えば、無圧縮フォントパターンデータ格納
部１０１の容量が約７Ｍバイト（１３０００文字×文字
×２書体）とすると、圧縮フォント格納部２０４は２Ｍ
バイト、インデックステーブル格納部２０５は３バイト
×文字数、逆変換テーブル格納部２０６は１Ｋバイト／
書体、つまり２Ｋバイト、キーワードテーブル格納部２
０７は１６Ｋバイト／書体、つまり、３２Ｋバイトとな
り、全体で略１／３のメモリ容量となり、極めて高いデ
ータ圧縮が可能となる。ここで、キーワードテーブルの
アドレスを１６ビットアドレスから１２ビットアドレス
に縮小したので、１６ビットアドレスに比べてキーワー
ドテーブルの容量は１６分の１となり、テーブル容量の
大幅な縮小が可能となり、全体の圧縮率の向上に寄与し
ている。また、このようにして圧縮された圧縮フォント
データを元の文字パターンデータに再生する際、文書作
成装置２００は圧縮符号をコマンドコードに変換したの
ち、このコマンドを実行するだけで容易に復元再生する
ことが可能である。この場合、２進の木リストにしたが
って圧縮符号をコマンドコードに変換するので、高速再
生が可能となる。As described above, the character pattern data compression apparatus 100 creates the keyword table of the first candidate to the fourth candidate for each combination of frequently used data, and converts each keyword table into command codes "100h" to "103h". By making them correspond to each other, the data appearance frequency of the command is extremely biased, and it is possible to obtain a high compression rate by compressing it by the Huffman coding method having a characteristic of cutting redundancy. That is, a considerable amount of data is replaced with a compression code string having a small number of bits corresponding to the command codes “100h” to “103h”, so that the compression rate can be made extremely high. In this case, in the present embodiment, since the keyword tables of the first to fourth candidates are created, the probability of hitting is higher than in the case where the number of keyword tables is smaller, and high compression is possible. For example, when the capacity of the uncompressed font pattern data storage unit 101 is about 7 Mbytes (13000 characters × characters × 2 typefaces), the compressed font storage unit 204 has 2M bytes.
Bytes, the index table storage unit 205 is 3 bytes x the number of characters, and the inverse conversion table storage unit 206 is 1 Kbyte /
Typeface, that is, 2K bytes, keyword table storage 2
07 is 16 Kbytes / typeface, that is, 32 Kbytes, which has a memory capacity of about 1/3 as a whole and enables extremely high data compression. Here, since the address of the keyword table is reduced from the 16-bit address to the 12-bit address, the capacity of the keyword table is 1/16 of that of the 16-bit address, and the table capacity can be significantly reduced, resulting in the entire compression. It contributes to the improvement of the rate. Further, when reproducing the compressed font data compressed in this way into the original character pattern data, the document creation apparatus 200 converts the compression code into a command code, and then simply executes this command to easily restore and reproduce. It is possible. In this case, since the compression code is converted into the command code according to the binary tree list, high speed reproduction is possible.

【００５４】[0054]

【第２実施例】次に、図１７および図１８を参照して第
２実施例を説明する。ここで、本実施例においては文字
パターンデータ圧縮装置１００によって生成された１組
の書体データ、つまり、圧縮フォントデータ、インデッ
クステーブル、逆変換テーブル、キーワードテーブルを
予め文書作成装置２００に組み込んでおくようにした
が、本実施例においては、多書体システムの文書作成装
置に適用したもので、２組分の書体データを予め文書作
成装置２００内に組み込んでおく他、必要に応じて他の
書体データをフロッピーディスク等の外部記憶装置から
供給するようにしたものである。この場合、外部記憶装
置からは圧縮フォントデータとそのインデックステーブ
ルのみを供給するようにしたのが、本実施例の特徴であ
る。Second Embodiment Next, a second embodiment will be described with reference to FIGS. 17 and 18. Here, in this embodiment, one set of typeface data generated by the character pattern data compression apparatus 100, that is, compressed font data, an index table, an inverse conversion table, and a keyword table should be incorporated in the document creation apparatus 200 in advance. However, in the present embodiment, the present invention is applied to the document creating apparatus of the multi-face type system, and two sets of typeface data are incorporated in the document creating apparatus 200 in advance, and other typeface data may be added as necessary. Is supplied from an external storage device such as a floppy disk. In this case, the feature of the present embodiment is that only the compressed font data and its index table are supplied from the external storage device.

【００５５】図１７は本実施例に係る文書作成装置３０
０のブロック構成図である。文書作成装置３００は文書
作成機主装置および文字パターン再生装置３０１と、圧
縮文字パターンメモリを構成するＲＯＭ３０２、３０３
と、圧縮パターンバッファを構成するＲＡＭ３０４と、
フロッピーディスク等の外部記憶装置３０５を有する構
成となっている。ＲＯＭ３０２には２種類の書体に対応
して２組の圧縮フォントデータＡ−１、Ａ−２とインデ
ックステーブルＢ−１、Ｂ−２が格納されており、また
ＲＯＭ３０３には４種類の書体に対応して４組のキーワ
ードテーブルＣ−１、Ｃ−２、Ｃ−３、Ｃ−４、逆変換
テーブルＤ−１、Ｄ−２、Ｄ−３、Ｄ−４が格納されて
いる。ここで、圧縮フォントデータＡ−１、インデック
ステーブルＢ−１、キーワードテーブルＣ−１、逆変換
テーブルＤ−１は第１の書体データに対応し、また圧縮
フォントデータＡ−２、インデックステーブルＢ−２、
キーワードテーブルＣ−２、逆変換テーブルＤ−２は第
２の書体データに対応するものであり、第１の書体ある
いは第２の書体が選択された場合には、上記第１実施例
と同様に対応するテーブル内容を参照することによって
復元再生処理が行われる。FIG. 17 shows a document creating apparatus 30 according to this embodiment.
It is a block block diagram of 0. The document creation device 300 includes a document creation device main device and a character pattern reproduction device 301, and ROMs 302 and 303 forming a compressed character pattern memory.
And a RAM 304 forming a compression pattern buffer,
It is configured to have an external storage device 305 such as a floppy disk. The ROM 302 stores two sets of compressed font data A-1, A-2 and index tables B-1, B-2 corresponding to two types of fonts, and the ROM 303 supports four types of fonts. Then, four sets of keyword tables C-1, C-2, C-3, C-4 and inverse conversion tables D-1, D-2, D-3, D-4 are stored. Here, the compressed font data A-1, the index table B-1, the keyword table C-1, and the inverse conversion table D-1 correspond to the first typeface data, and the compressed font data A-2 and the index table B-. 2,
The keyword table C-2 and the inverse conversion table D-2 correspond to the second typeface data, and when the first typeface or the second typeface is selected, the same as in the first embodiment. Restoration reproduction processing is performed by referring to the corresponding table contents.

【００５６】ここで、本実施例において第３書体以降は
外部記憶装置３０５から供給されるものであり、任意の
文字が指定されると、外部記憶装置３０５内のインデッ
クステーブルＢ−３が参照されて圧縮フォントデータＡ
−３から１文字分のデータが読み出され、圧縮パターン
バッファを構成するＲＡＭ３０４にセットされる。ここ
で、文字パターン再生装置は上記第１実施例と同様にＲ
ＡＭ３０４内のデータを逆変換テーブルを参照してコマ
ンドコードに変換すると共に、キーワードテーブルを参
照してコマンドを実行してゆくが、その際、どの逆変換
テーブル、キーワードテーブルを参照するかは、４種類
の全てのものが参照可能であり、その指定は外部記憶装
置３０５の中の固定アドレスの１バイト（テーブル指定
情報）を参照することにより行われる。これは外部記憶
装置３０５からの多くの書体供給に対応させるときに、
文書作成装置３００に内蔵した各種のテーブルのうち供
給書体に最適なもの（高圧縮が可能なもの）を選ぶこと
ができる様にする為であり、これにより、外部記憶装置
３０５には圧縮フォントデータＡ−３、インデックステ
ーブルＢ−３のみを記憶させておけばよいことになる。In the present embodiment, the third and subsequent typefaces are supplied from the external storage device 305, and when any character is designated, the index table B-3 in the external storage device 305 is referred to. Compressed font data A
Data for one character is read from -3 and set in the RAM 304 which constitutes the compression pattern buffer. Here, the character pattern reproducing apparatus is the same as the above-mentioned first embodiment.
The data in the AM 304 is converted into a command code by referring to the inverse conversion table, and the command is executed by referring to the keyword table. At that time, which inverse conversion table or keyword table is referred to is 4 All types can be referenced, and the designation is performed by referring to 1 byte (table designation information) of a fixed address in the external storage device 305. When dealing with many typefaces supplied from the external storage device 305,
This is because it is possible to select the most suitable table (the one that can be highly compressed) for the supplied typeface from the various tables built in the document creation apparatus 300, whereby the compressed font data is stored in the external storage device 305. It is sufficient to store only A-3 and the index table B-3.

【００５７】なお、文書作成装置３００に内蔵された圧
縮フォントデータＡ−１、Ａ−２、インデックステーブ
ルＢ−１、Ｂ−２、キーワードテーブルＣ−１、Ｃ−
２、Ｃ−３、Ｃ−４、逆変換テーブルＤ−１、Ｄ−２、
Ｄ−３、Ｄ−４は上記第１実施例と同様に文字パターン
圧縮装置から転送されて組み込まれたものであるが、第
３書体、第４書体に対応する圧縮フォントデータやイン
デックステーブルは文書作成装置３００内に常駐させず
に削除されている。The compressed font data A-1, A-2, the index tables B-1, B-2, the keyword tables C-1, C- built in the document creating apparatus 300 are included.
2, C-3, C-4, inverse conversion tables D-1, D-2,
D-3 and D-4 are transferred and incorporated from the character pattern compression device as in the first embodiment, but the compressed font data and index table corresponding to the third and fourth typefaces are the documents. It is deleted without being made resident in the creating apparatus 300.

【００５８】また、上記第１実施例においては、文字パ
ターンデータ圧縮装置１００を構成する回数カウントア
レイ１０８の初期値として「０」を設定するようにした
が、微妙に違う多くの書体を外部記憶装置３０５から供
給し、キーワードテーブルと逆変換テーブルだけをＲＯ
Ｍに持たせて共有するときには、回数カウントアレイ１
０８の初期値として「１」を設定しておく。これはどの
ようなデータでも最低１回はデータが存在したものとみ
なす為であり、第１実施例の様にカウンタ回数値の初期
値を「０」としておくと、そのコマンドに対する圧縮符
号列が存在しなくなるという冗長性をカットする特性の
圧縮符号発生手法による不都合を回避するためである。
これによって、データが出現しなかったデータに対して
も圧縮符号列が発生されるので、微妙に相違する書体の
文字パターンデータを外部供給してもそれに対応できる
ようになる。Further, in the first embodiment described above, "0" is set as the initial value of the number-of-times count array 108 which constitutes the character pattern data compression apparatus 100. However, many subtly different typefaces are externally stored. Only the keyword table and the inverse conversion table supplied from the device 305 are RO
When it is shared with M, the count array 1
“1” is set as the initial value of 08. This is because it is assumed that data exists at least once in any data, and if the initial value of the counter count value is set to "0" as in the first embodiment, the compression code string for the command is This is to avoid the inconvenience caused by the compression code generation method having the characteristic of cutting the redundancy that it does not exist.
As a result, the compression code string is generated even for the data in which the data does not appear, so that the character pattern data of a slightly different typeface can be dealt with even if supplied externally.

【００５９】図１８は外部記憶装置３０５に対して圧縮
フォントデータやインデックステーブルを書き込む際の
処理過程を示している。先ず、第１書体〜第４書体に対
応する無圧縮フォントパターンデータＰ−１、Ｐ−２、
Ｐ−３、Ｐ−４を上記第１実施例における文字パターン
データ圧縮装置１００を用いて圧縮すると、各書体毎に
圧縮フォントデータ、インデックステーブル、逆変換テ
ーブル、キーワードテーブルが生成されると共に、この
生成過程において圧縮符号表が生成される。このうち、
各書体毎にキーワードテーブルＸ−１、Ｘ−２、Ｘ−
３、Ｘ−４と圧縮符号表Ｙ−１、Ｙ−２、Ｙ−３、Ｙ−
４を取り出しておく。FIG. 18 shows a process of writing compressed font data and an index table in the external storage device 305. First, uncompressed font pattern data P-1, P-2 corresponding to the first to fourth typefaces,
When P-3 and P-4 are compressed using the character pattern data compression apparatus 100 in the first embodiment, compressed font data, index table, inverse conversion table, and keyword table are generated for each typeface, and A compression code table is generated in the generation process. this house,
Keyword tables X-1, X-2, X- for each typeface
3, X-4 and compression code tables Y-1, Y-2, Y-3, Y-
Take 4 out.

【００６０】次に、外部記憶装置３０５によって文書作
成装置３００に供給する書体（無圧縮フォントパターン
データＰ−ｎ）を上記第１実施例の文字パターンデータ
圧縮装置１００によって圧縮する際に、第１書体〜第４
書体に対応して予め取り出しておいた各キーワードテー
ブルＸ−１〜Ｘ−４と圧縮符号表Ｙ−１〜Ｙ−４のう
ち、外部記憶装置３０５によって供給する書体に最も適
したもの（つまり、実際に各書体毎に圧縮してみて最も
圧縮率が高かったもの）を選択装置４００によって選択
する。そして、選択されたキーワードテーブルと圧縮符
号表を上記第１実施例の文字パターンデータ圧縮装置１
００を構成するキーワードテーブル格納部１０６、圧縮
符号表格納部１１０に格納しておく。この状態におい
て、圧縮符号化部１１２から動作を始めさせる。これに
よって得られた圧縮フォントデータｎ、インデックステ
ーブルｎは外部記憶装置３０５に書き込まれる。Next, when compressing the typeface (uncompressed font pattern data P-n) supplied to the document creating apparatus 300 by the external storage device 305 by the character pattern data compressing apparatus 100 of the first embodiment, Typeface-4th
Among the keyword tables X-1 to X-4 and the compression code tables Y-1 to Y-4 that have been extracted in advance corresponding to the typeface, the one most suitable for the typeface supplied by the external storage device 305 (that is, The compression device having the highest compression rate when actually compressed for each typeface is selected by the selection device 400. Then, the selected keyword table and compression code table are used as the character pattern data compression apparatus 1 of the first embodiment.
00 is stored in the keyword table storage unit 106 and the compression code table storage unit 110. In this state, the compression coding unit 112 starts the operation. The compressed font data n and the index table n thus obtained are written in the external storage device 305.

【００６１】このように構成された本実施例においても
上記第１実施例と同様の効果を有する他、外部記憶装置
３０５から圧縮フォントデータとインデックステーブル
のみを供給するだけでよく、外部記憶装置３０５にはキ
ーワードテーブル、逆変換テーブルが不要となる為、そ
れだけ外部記憶装置３０５の容量を効果的に使用するこ
とが可能となる。また、回数カウントアレイ１０８の初
期値として「１」を設定したからデータの出現頻度を調
べたときに出現しなかったデータに対しても圧縮符号を
発生させることができる。In this embodiment having the above-mentioned structure, the same effects as those of the first embodiment can be obtained. In addition, only the compressed font data and the index table need be supplied from the external storage device 305, and the external storage device 305 can be used. Since the keyword table and the inverse conversion table are not required for this, the capacity of the external storage device 305 can be effectively used. Further, since “1” is set as the initial value of the number-of-times count array 108, the compression code can be generated even for data that does not appear when the appearance frequency of the data is checked.

【００６２】なお、上記実施例においては第１候補キー
ワードから第４候補キーワードを用意したが、２種類の
キーワードだけでもよく、その数は任意である。また、
上記実施例は着目データに対して前２つのデータとの組
み合せ頻度を調べたが、１つ前のデータとの組み合せ頻
度を調べたり、前３つのデータとの組み合せ頻度を調べ
るようにしてもよい。更に、上記実施例はキーワードテ
ーブルの第１要素アドレスを縮小する手法として２つ前
のデータ「ＤＬ」の上位４ビットと下位４ビットとのイ
クスクルーシブオアＸＯＲを取る方法を用いたが、オア
演算によって縮小するようにしてもよい。この場合、例
えば、１ビットずつ交互にオア演算を行う等の方法が考
えられる。また、この発明はプログラム制御によるソフ
トウェアでも構成することができ、これによって高速で
ローコストな文字パターンデータ圧縮装置１００を提供
することが可能である。Although the first to fourth candidate keywords are prepared in the above embodiment, only two kinds of keywords may be used and the number thereof is arbitrary. Also,
In the above embodiment, the combination frequency of the data of interest with the previous two data is checked, but the combination frequency with the previous data or the combination frequency with the previous three data may be checked. . Further, in the above-described embodiment, as a method of reducing the first element address of the keyword table, a method of taking an exclusive OR XOR of the upper 4 bits and the lower 4 bits of the data "DL" two before is used. You may make it reduce by calculation. In this case, for example, a method of alternately performing an OR operation bit by bit can be considered. Further, the present invention can also be configured by software controlled by a program, which makes it possible to provide a high-speed and low-cost character pattern data compression device 100.

【００６３】[0063]

【発明の効果】この発明によれば、文字パターンデータ
を構成するあるデータとその前のデータとの組み合せパ
ターンが頻繁に出現する場合にこれを再生用の固定デー
タに対応付けてデータの出現頻度を偏らせておくことに
より冗長性を抑制する圧縮手法を用いて文字パターンデ
ータを圧縮することができるので、高圧縮が可能とな
り、圧縮フォントデータを文書作成装置のＲＯＭ等に組
み込んだ場合、容量の少ないメモリに効果的に圧縮フォ
ントデータを組み込むことが可能となる。According to the present invention, when a combination pattern of a certain data forming the character pattern data and the preceding data frequently appears, this is associated with the fixed data for reproduction and the appearance frequency of the data. Since the character pattern data can be compressed by using a compression method that suppresses redundancy by biasing the, the high compression becomes possible, and when the compressed font data is incorporated in the ROM of the document creation device, It is possible to effectively incorporate compressed font data into a memory with less memory.

[Brief description of drawings]

【図１】第１実施例において、文字パターンデータの圧
縮／再生過程を概略的に示した図。FIG. 1 is a diagram schematically showing a compression / reproduction process of character pattern data in the first embodiment.

【図２】文字パターンデータ圧縮装置１００のブロック
構成図。FIG. 2 is a block configuration diagram of a character pattern data compression device 100.

【図３】無圧縮フォントパターンデータ格納部１０１に
収められている文字パターンのデータ構造を示した図。FIG. 3 is a diagram showing a data structure of a character pattern stored in an uncompressed font pattern data storage unit 101.

【図４】キーワード選定用計数制御部１０３の第１機能
を説明する為の図。FIG. 4 is a diagram for explaining a first function of a keyword selection counting control unit 103.

【図５】キーワード選定用カウンタアレイ１０４の構成
を示した図。FIG. 5 is a diagram showing a configuration of a keyword selection counter array 104.

【図６】キーワードテーブルの構成を示した図。FIG. 6 is a diagram showing a configuration of a keyword table.

【図７】回数カウントアレイ１０８の構成を示した図。FIG. 7 is a diagram showing a configuration of a frequency counting array 108.

【図８】各コマンドコードとその実行コマンドの内容を
示した図。FIG. 8 is a diagram showing the contents of each command code and its execution command.

【図９】圧縮符号表を示した図。FIG. 9 is a diagram showing a compression code table.

【図１０】数値的な重み順に並べ変えられた圧縮符号表
を示した図。FIG. 10 is a diagram showing a compression code table sorted in numerical weight order.

【図１１】図１０の圧縮符号表を木リスト構造で表わし
た図。11 is a diagram showing the compression code table of FIG. 10 in a tree list structure.

【図１２】逆変換テーブルの構成図。FIG. 12 is a configuration diagram of an inverse conversion table.

【図１３】逆変換テーブルにおける「振舞い」データの
構造を示した図。FIG. 13 is a diagram showing the structure of “behavior” data in the inverse conversion table.

【図１４】逆変換テーブルの内容を具体的に示した図。FIG. 14 is a diagram specifically showing the contents of an inverse conversion table.

【図１５】文書作成装置２００の全体構成を示した図。FIG. 15 is a diagram showing the overall configuration of a document creation device 200.

【図１６】文字パターン再生装置２０１のブロック構成
図。16 is a block configuration diagram of a character pattern reproducing device 201. FIG.

【図１７】第２実施例において、文書作成装置３００の
ブロック構成図。FIG. 17 is a block configuration diagram of the document creation device 300 in the second embodiment.

【図１８】外部記憶装置３０５に対して圧縮フォントデ
ータやインデックステーブルを書き込む際の処理過程を
示した図。FIG. 18 is a diagram showing a process of writing compressed font data and an index table in the external storage device 305.

[Explanation of symbols]

１００文字パターンデータ圧縮装置１０１無圧縮フォントパターンデータ格納部１０３キーワード選定用計数制御部１０４キーワード選定用カウンタアレイ１０５キーワード選定部１０６キーワードテーブル格納部１０７データ出現回数計数部１０８回数カウントアレイ１０９圧縮符号表発生部１１０圧縮符号表格納部１１２圧縮符号化部１１３データ書き込み部１１４圧縮フォント格納部１１５インデックステーブル格納部１１６逆変換テーブル発生部１１７逆変換テーブル格納部２００文書作成装置 100 Character Pattern Data Compressor 101 Uncompressed Font Pattern Data Storage Unit 103 Keyword Selection Counting Control Unit 104 Keyword Selection Counter Array 105 Keyword Selection Unit 106 Keyword Table Storage Unit 107 Data Appearance Counting Unit 108 Number Counting Array 109 Compression Code Table Generation unit 110 Compression code table storage unit 112 Compression coding unit 113 Data writing unit 114 Compressed font storage unit 115 Index table storage unit 116 Inverse conversion table generation unit 117 Inverse conversion table storage unit 200 Document creation device

Claims

[Claims]

1. A character pattern data compression apparatus for compressing each character pattern data of a bit map system and incorporating it in a document creation apparatus, focusing on each data constituting the character pattern data sequentially,
A combination frequency detecting means for detecting the combination frequency of all the characters in accordance with the combination of the data of interest and the preceding data, and a case where a high frequency combination is detected by the combination frequency detecting means. If the previous data is this combination, the data of interest is this table creation means for creating a candidate table that stores data candidates for each combination, and by making the candidate table correspond to the fixed data for reproduction. A compression code generation means for generating a compression code string in which the appearance frequency of the data is biased and a redundant portion is suppressed according to the appearance frequency of the data, and a compression code string generated by the compression code generation means for each character pattern data. A character pattern comprising: a compression unit that compresses the character pattern data by replacing Data compression device.

2. When a combination having a high frequency is detected by the combination frequency detecting means, the table creating means creates a plurality of candidate tables having different data of interest for one combination of previous data, 4. The character pattern data compression apparatus according to claim 1, wherein the compression code generation means is adapted to associate fixed data with each of the plurality of candidate tables.

3. The unit of each data forming the character pattern data is a bit group forming one word occupying one address when the data is held by the bit map method. Character pattern data compression device described.