JPH05127648A

JPH05127648A - Compression device for outline font data

Info

Publication number: JPH05127648A
Application number: JP3313877A
Authority: JP
Inventors: Hiroaki Suzuki; 博顕鈴木
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1991-10-31
Filing date: 1991-10-31
Publication date: 1993-05-25

Abstract

PURPOSE:To decrease the capacity of a memory for holding outline font data. CONSTITUTION:A shared pattern extracting means 11 extracts the number of a character, which can be considered to be synthesized from a registered character list and extracts the character numbers of the main body and crown part constituting a composite character and a restoration information calculating means 12 expands the composite character to obtain the base points of the main body and crown part, expands the main body to obtain its base point, and expands the crown part to obtain its base point. The offset values of the main body and crown part are calculated to correct their offsets. An information adding means 13 additionally stores the offset values in the table when coordinate data on the main body and crown part of the composite character match each other.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、文字を印字データを構
成するアウトラインフォントデータの圧縮装置に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a device for compressing outline font data which constitutes print data of characters.

【０００２】[0002]

【従来の技術】一般に、日本語のデスクトップ・パブリ
ッシング（ＤＴＰ）システムを構築する場合に幾つかの
課題が残されているが、その一つとしてフォントの容量
をあげることができる。ＤＴＰでは多くの書体や文字サ
イズのフォントを取り扱うことが必須条件であるが、ア
ウトラインフォントは、文字サイズに注目するとビット
マップフォントに比べてデータ容量が小さいという利点
があり、また、変倍や回転を行う場合に文字の品質を維
持することができるという利点がある。このアウトライ
ンフォントは、図３９に示すようなベゼー曲線により構
成されている場合、スタートポイントＳと、エンドポイ
ントＥと、コントロールポイントＣ１，Ｃ２の合計４つ
のポイントにより一つの曲線を構成する。2. Description of the Related Art Generally, some problems remain when constructing a Japanese desktop publishing (DTP) system, and one of them is the font capacity. In DTP, it is indispensable to handle fonts of many typefaces and character sizes, but outline fonts have the advantage that the data capacity is smaller than bitmap fonts when attention is paid to character size, and scaling and rotation are also important. When doing, there is an advantage that the character quality can be maintained. When this outline font is composed of Beze curves as shown in FIG. 39, a start point S, an end point E, and a total of four points C1 and C2 form one curve.

【０００３】[0003]

【発明が解決しようとする課題】しかしながら、このア
ウトラインフォントは、マップフォントに比べてデータ
容量が小さいとはいえ、日本語の一書体ではＪＩＳ第
１、第２水準で約８０００文字分より構成されるので、
データ量が膨大になる。また、アウトラインフォントは
図３９に示すように、一つの曲線が四つのポイントによ
り構成されるので、複雑な漢字の場合には多くのデータ
量となり、一書体で約４Ｍバイトのデータ量となる。し
たがって、ＤＴＰシステムでは、使用頻度が高い数書体
分のアウトラインフォントデータをＲＯＭ（リードオン
リメモリ）により内蔵し、他の書体のアウトラインフォ
ントデータをハードディスク装置等により保持する方法
が採用されている。However, although the outline font has a smaller data capacity than the map font, one typeface in Japanese is composed of about 8000 characters according to the JIS 1st and 2nd levels. So
The amount of data becomes huge. In addition, as shown in FIG. 39, the outline font has a large amount of data in the case of a complicated Chinese character because one curve is composed of four points, and the amount of data in one typeface is about 4 Mbytes. Therefore, in the DTP system, a method is adopted in which outline font data for several typefaces which are frequently used are built in a ROM (read only memory) and outline font data of other typefaces are held by a hard disk device or the like.

【０００４】本発明は上記従来の問題点に鑑み、アウト
ラインフォントデータを保持するためのメモリの容量を
低減することができるアウトラインフォントデータの圧
縮装置を提供することを目的とする。In view of the above conventional problems, it is an object of the present invention to provide an outline font data compression apparatus capable of reducing the capacity of a memory for holding outline font data.

【０００５】[0005]

【課題を解決するための手段】第１の手段は上記目的を
達成するために、文字の一部を共有化可能な図形のアウ
トラインフォントデータを抽出する図形抽出手段と、前
記図形抽出手段により抽出された図形により原文字を復
元するための情報を算出する復元情報算出手段と、前記
図形抽出手段により抽出された図形により構成される文
字であることを示す情報を付加する情報付加手段と、前
記図形抽出手段により抽出された図形と、前記復元情報
算出手段により算出された復元情報と、前記情報付加手
段により付加された情報を所定のフォーマットに構成し
て圧縮する圧縮手段とを備えたことを特徴とする。In order to achieve the above object, the first means is to extract the outline font data of a graphic in which a part of a character can be shared, and the graphic extracting means. Restoration information calculating means for calculating information for restoring the original character by the drawn graphic, information adding means for adding information indicating that the character is composed of the graphic extracted by the graphic extracting means, A graphic unit extracted by the graphic extraction unit; the restoration information calculated by the restoration information calculation unit; and a compression unit configured to compress the information added by the information addition unit into a predetermined format. Characterize.

【０００６】第２の手段は、文字のアウトラインフォン
トデータから１バイトを越えるデータを抽出する抽出手
段と、文字のデータ幅に対する出現頻度の分布の広がり
が小さくなるようなＸ進数を予め算出するＸ進数算出手
段と、前記抽出手段により抽出されたデータを前記Ｘ進
数で圧縮する圧縮手段とを備えたことを特徴とする。The second means is an extracting means for extracting data exceeding 1 byte from the outline font data of the character, and an X-adic number for preliminarily calculating an X-adic number such that the spread of the distribution of the appearance frequency with respect to the data width of the character becomes small. It is characterized in that it is provided with a base number calculation means and a compression means for compressing the data extracted by the extraction means with the X base number.

【０００７】第３の手段は、文字のアウトラインフォン
トデータをその情報ごとに階層化する階層化手段と、前
記階層化手段により階層化された各階層のデータの出現
頻度を算出する算出手段と、前記算出手段により算出さ
れた各階層の出現頻度により階層別の圧縮用置換テーブ
ルを作成して階層毎に圧縮する圧縮手段とを備えたこと
を特徴とする。A third means is a hierarchizing means for hierarchizing the character outline font data for each information, and a calculating means for calculating the appearance frequency of the data of each hierarchy hierarchized by the hierarchizing means, And a compression unit that creates a compression replacement table for each layer based on the appearance frequency of each layer calculated by the calculation unit and compresses each layer.

【０００８】第４の手段は、予めオリジナルのアウトラ
インフォントデータの曲線データを抽出してアウトライ
ンフォントデータをグループ化するグループ化手段と、
オリジナルのアウトラインフォントデータと前記グルー
プ化されたアウトラインフォントデータの差を算出して
圧縮する圧縮手段とを備えたことを特徴とする。The fourth means is a grouping means for extracting the curve data of the original outline font data in advance and grouping the outline font data,
It is characterized by comprising compression means for calculating and compressing a difference between the original outline font data and the grouped outline font data.

【０００９】第５の手段は、第１ないし第４の手段の圧
縮手段により圧縮された符号を更に可逆符号化により圧
縮する手段を備えたことを特徴とする。The fifth means is characterized by further comprising means for further compressing the code compressed by the compression means of the first to fourth means by reversible encoding.

【００１０】[0010]

【作用】第１の手段では上記構成により、文字の一部を
共有化可能な図形のアウトラインフォントデータが抽出
されて符号化されるので、文字を分解して圧縮すること
ができ、したがって、アウトラインフォントデータを保
持するためのメモリの容量を低減することができる。According to the first means, the outline font data of the figure capable of sharing a part of the character is extracted and coded by the above-mentioned structure, so that the character can be decomposed and compressed. The capacity of the memory for holding the font data can be reduced.

【００１１】第２の手段では、文字のデータ幅に対する
出現頻度の分布の広がりが小さくなるようなＸ進数が予
め算出され、１バイトを越える分のデータがこのＸ進数
で圧縮されるので、圧縮効率を向上させることができ
る。In the second means, an X-adic number is calculated in advance so that the spread of the appearance frequency distribution with respect to the character data width is reduced, and data exceeding 1 byte is compressed with this X-adic number. The efficiency can be improved.

【００１２】第３の手段では、文字のアウトラインフォ
ントデータがその情報ごとに階層化されて各階層のデー
タの出現頻度が算出され、この各階層の出現頻度により
階層別の圧縮用置換テーブルにより階層毎に圧縮される
ので、圧縮効率を向上させることができる。In the third means, the outline font data of characters is hierarchized according to its information to calculate the appearance frequency of the data of each hierarchy, and the appearance frequency of each hierarchy is used to determine the hierarchy by the compression replacement table for each hierarchy. Since each is compressed, the compression efficiency can be improved.

【００１３】第４の手段では、オリジナルのアウトライ
ンフォントデータとグループ化されたアウトラインフォ
ントデータの差が符号化されるので、圧縮効率を向上さ
せることができる。In the fourth means, since the difference between the original outline font data and the grouped outline font data is encoded, the compression efficiency can be improved.

【００１４】第５の手段では、第１ないし第４の手段の
圧縮手段により圧縮された符号を更に可逆符号化により
圧縮するので、可逆復号化により元のデータに復元する
ことができる、In the fifth means, the code compressed by the compressing means of the first to fourth means is further compressed by lossless encoding, so that the original data can be restored by lossless decoding.

【００１５】[0015]

【実施例】以下、図面を参照して本発明の実施例を説明
する。図１は本発明に係るアウトラインフォントデータ
の圧縮装置の一実施例を示すブロック図、図２は文字の
本体と冠を示す説明図、図３はアウトラインフォントデ
ータのオフセットを示す説明図、図４は復元動作の概略
を示す説明図、図５はアウトラインフォントデータの全
体構成を示す説明図、図６はアウトラインフォントデー
タの基点を示す説明図、図７は従来例と本実施例のアウ
トラインフォントデータの違いを示す説明図、図８およ
び図９は圧縮動作を説明するためのフローチャート、図
１０は復元動作を説明するためのフローチャート、図１
１は可逆符号化を示す説明図である。Embodiments of the present invention will be described below with reference to the drawings. 1 is a block diagram showing an embodiment of an outline font data compression apparatus according to the present invention, FIG. 2 is an explanatory diagram showing a main body and a crown of a character, FIG. 3 is an explanatory diagram showing an offset of outline font data, and FIG. Is an explanatory view showing the outline of the restoring operation, FIG. 5 is an explanatory view showing the overall structure of the outline font data, FIG. 6 is an explanatory view showing the base point of the outline font data, and FIG. 7 is the outline font data of the conventional example and this embodiment. 8 is a flow chart for explaining the compression operation, FIG. 10 is a flow chart for explaining the decompression operation, and FIG.
1 is an explanatory diagram showing lossless encoding.

【００１６】以下、説明を容易にするために欧文書体を
例にして説明する。なお、欧文書体と日本語書体の大き
な違いは、文字の複雑さに起因するデータの容量差であ
るが、書体データのフォーマットは両者とも等しいの
で、欧文書体を圧縮する方法により日本語書体を圧縮す
ることができる。まず、欧文書体においてはアルファベ
ットの上部に図形を付加した文字が存在し、例えば図２
に示すようにドイツ語等における「ａウムラウト」と呼
ばれる文字がある。なお、漢字などは、より複雑な組み
合わせにより構成される。In order to facilitate the explanation, a European typeface will be described below as an example. The major difference between the European typeface and the Japanese typeface is the difference in data volume due to the complexity of the characters, but since the typeface data format is the same for both, the Japanese typeface is compressed by the method of compressing the European typeface. can do. First, in a European typeface, there are characters with graphics added to the upper part of the alphabet.
There is a character called "a umlaut" in German etc. as shown in. It should be noted that kanji and the like are configured by more complicated combinations.

【００１７】ここで、アルファベット部「Ａ」，「ａ」
を本体と呼び、この本体の上部に付加される図形を冠と
呼ぶことにすると、もし冠のオフセットを除く座標デー
タが等しいものとすると、図２（ａ）に示す大文字の冠
を図２（ｂ）に示す小文字に利用することができる。す
なわち、このオフセットについて説明すると、図２
（ａ）に示す大文字の冠を図２（ｂ）に示す小文字に利
用する場合には、ｘｙ方向に共に負の方向に移動しなけ
れば一致しない。Here, the alphabet parts "A" and "a"
Is called the main body, and the figure added to the upper part of this main body is called the crown. If the coordinate data excluding the offset of the crown is the same, the uppercase crown shown in FIG. It can be used for lowercase letters shown in b). That is, the offset will be described with reference to FIG.
When the uppercase crown shown in (a) is used for the lowercase letters shown in FIG. 2 (b), they do not match unless they both move in the negative direction in the xy directions.

【００１８】ところで、アウトラインフォントを構成す
るフォーマットの一例として図３に示すように、基点
（ｘｓ，ｙｓ）を基準として、この基準を移動しないか
ぎり、前点との相対値がコード化されている場合があ
り、オフセットの違いとは、この基点の違いを意味す
る。図４（ａ）に示すように冠だけの文字が登録されて
いる場合、この冠と図４（ｂ）に示す本体により図２
（ａ）に示す大文字を復元することができ、同様にこの
冠と図４（ｃ）に示す本体により図２（ｂ）に示す小文
字を復元することができる。また、図４（ａ）に示す冠
だけの文字が登録されていない場合には、図２（ａ）に
示す大文字から冠データを抜き出して登録し、図２
（ａ）に示すオリジナルデータを削除することができ
る。By the way, as shown in FIG. 3 as an example of the format of the outline font, the base point (xs, ys) is used as a reference, and unless this reference is moved, the relative value to the previous point is coded. In some cases, the difference in offset means the difference in this base point. When characters of only a crown are registered as shown in FIG. 4A, the crown and the main body shown in FIG.
The uppercase letters shown in (a) can be restored, and similarly, the lowercase letters shown in FIG. 2 (b) can be restored by this crown and the main body shown in FIG. 4 (c). Further, in the case where the characters of only the crown shown in FIG. 4 (a) are not registered, the crown data is extracted from the capital letters shown in FIG. 2 (a) and registered.
The original data shown in (a) can be deleted.

【００１９】本実施例では図１に示すように、図形抽出
手段１１により文字の一部を共有化可能な図形のアウト
ラインフォントデータを抽出し、復元情報算出手段１２
により、図形抽出手段１１により抽出された図形により
原文字を復元するための情報を算出し、情報付加手段１
３により、図形抽出手段１１により抽出された図形によ
り構成される文字であることを示す情報を付加し、図形
抽出手段１１により抽出された図形と、復元情報算出手
段１２により算出された復元情報と、情報付加手段１３
により付加された情報を圧縮手段１４により所定のフォ
ーマットに構成して圧縮するように構成されている。し
たがって、このように共通の冠を有する文字が多いほど
データ量が少なくなる。本実施例ではまた、この圧縮デ
ータを可逆符号化手段１５により更に圧縮して媒体に記
録し、出力の際に可逆復号化手段１６により復号し、文
字展開手段１７により元のアウトラインフォントデータ
に展開するように構成されている。In the present embodiment, as shown in FIG. 1, the outline extracting font data of a graphic capable of sharing part of a character is extracted by the graphic extracting means 11, and the restoring information calculating means 12 is used.
The information adding means 1 calculates the information for restoring the original character by the graphic extracted by the graphic extracting means 11.
3, information indicating that it is a character composed of the figure extracted by the figure extracting means 11 is added, and the figure extracted by the figure extracting means 11 and the restoration information calculated by the restoration information calculating means 12 are added. , Information adding means 13
The information added by is compressed into a predetermined format by the compression means 14 and compressed. Therefore, the larger the number of characters having such a common crown, the smaller the amount of data. In the present embodiment, the compressed data is further compressed by the lossless encoding means 15 and recorded on the medium. When output, the lossless decoding means 16 decodes the compressed data and the character expanding means 17 expands the original outline font data. Is configured to.

【００２０】つぎに、冠と本体を合成する場合、座標デ
ータにおけるオフセット以外の相対データが等しくない
場合には、オリジナルの文字を構成することができない
が、１文字分のアウトラインフォントデータには、図５
（ｂ）に示すように座標データの他に文字固有のデータ
が付加されている。なお、図５（ａ）は１つの書体の全
データを示し、書体の名前、作成年月日、登録文字数等
のタイプフェース情報と、登録文字別の文字番号、デー
タの参照場所等の文字索引情報と、登録文字別のアウト
ラインフォントデータである文字情報により構成されて
いる。そして、図５（ｂ）はこの文字情報の詳細な構造
を示し、文字補正用の固有データ「０」「１」と座標デ
ータにより構成されている。すなわち、この固有データ
は座標データと異なり、合成することができないので、
本実施例では座標データのみを圧縮するように構成され
ている。Next, when the crown and the main body are combined, if the relative data other than the offset in the coordinate data are not equal, the original character cannot be constructed, but the outline font data for one character is Figure 5
As shown in (b), character-specific data is added in addition to the coordinate data. Note that FIG. 5A shows all the data of one typeface. Typeface information such as typeface name, creation date, number of registered characters, etc., and character index for each registered character, character reference such as data reference location, etc. It is composed of information and character information which is outline font data for each registered character. Then, FIG. 5B shows a detailed structure of this character information, which is composed of unique data “0” “1” for character correction and coordinate data. That is, unlike the coordinate data, this unique data cannot be combined, so
In this embodiment, only the coordinate data is compressed.

【００２１】つぎに、実際のオフセットについて説明す
ると、図６に示すようにアウトラインデータの基点は、
１文字中に殆ど複数存在するので、冠と本体のオフセッ
トを算出するための基点が必要になる。ここで、図６に
示すような文字のように、データの格納順は、本体から
始まることが圧倒的に多く、この合成文字は基点ｓ０→
ｓ１→ｓ２→ｓ３の順、本体は基点ｓ０→ｓ１の順、冠
は基点ｓ２→ｓ３の順で登録される。したがって、これ
らの基点の順が違っていてもオフセットを算出すること
ができる。なお、例外として、冠の書き始め座標が合成
文字と冠単体の文字で異なる場合があるので、このよう
な文字のオフセットは算出しない。Next, the actual offset will be explained. As shown in FIG. 6, the base point of the outline data is
Since there are almost a plurality of characters in one character, a base point for calculating the offset between the crown and the main body is required. Here, as in the case of the characters shown in FIG. 6, the data storage order is overwhelmingly started from the main body, and this synthetic character is the base point s0 →
The main body is registered in the order of s1 → s2 → s3, the main body is registered in the order of base points s0 → s1, and the crown is registered in the order of base points s2 → s3. Therefore, the offset can be calculated even if these base points are out of order. In addition, as an exception, since the writing start coordinates of the crown may be different between the combined character and the character of the crown alone, the offset of such a character is not calculated.

【００２２】具体的には、各基点の間のデータを展開
し、図形を囲むことができる最小の長方形座標（例えば
左下と右上の２つの座標）を求める。図６における基点
ｓ０〜ｓ３の座標を下記の式（１）のようにすると、ｓｎ：左下座標（ｘｌｎ，ｙｌｎ），（右上座標（ｘｕｎ，ｙｕｎ）（但し、ｎ＝０，１，２，３） …（１）本体の基点ｓ０の長方形が最大になり、基点ｓ１の長方
形を包含することになる。そこで、この最大の長方形に
含まれる図形を本体と判定し、基点ｓ０を本体のオフセ
ットを算出するために用いる。Specifically, the data between each base point is developed, and the minimum rectangular coordinates (for example, two coordinates, lower left and upper right) that can surround the figure are obtained. When the coordinates of the base points s0 to s3 in FIG. 6 are represented by the following formula (1), sn: lower left coordinate (xln, yln), (upper right coordinate (xun, yun) (where n = 0, 1, 2, 3) (1) The rectangle of the base point s0 of the main body becomes the maximum, and the rectangle of the base point s1 is included.Therefore, the figure included in this maximum rectangle is determined as the main body, and the base point s0 is the offset of the main body. Used to calculate

【００２３】また、本体と冠は、最大の長方形の外に存
在する総ての長方形を冠とみなすことにより区別するこ
とができ、したがって、図６に示す例では基点ｓ２、ｓ
３の各長方形を冠と判別することができる。なお、図６
に示すように冠が複数の部品により構成される場合に
は、最も左に位置する長方形の基点ｓ２を冠のオフセッ
トを算出するために用いる。したがって、図６に示す文
字の本体と冠の各オフセットを次のように求めることが
できる。Further, the main body and the crown can be distinguished by considering all the rectangles existing outside the largest rectangle as the crowns, and therefore, in the example shown in FIG.
Each rectangle of 3 can be discriminated as a crown. Note that FIG.
When the crown is composed of a plurality of parts as shown in, the leftmost rectangular base point s2 is used to calculate the offset of the crown. Therefore, each offset of the main body and the crown of the character shown in FIG. 6 can be calculated as follows.

【００２４】本体：（ｘｌ０，ｙｌ０） …（２）冠：（ｘｌ２，ｙｌ２） …（３）また、図２に示す文字を合成する場合、図４（ａ），
（ｂ）に示すデータから同様な長方形の座標を求めるこ
とにより本体のオフセットを（ｘｌ０’，ｙｌ０’）、
冠のオフセットを（ｘｌ２’，ｙｌ２’）として求める
ことができるので、平行移動するためのオフセット量を
式（４），（５）により求めることができる。Body: (xl0, yl0) (2) Crown: (xl2, yl2) (3) When the characters shown in FIG. 2 are combined, the characters shown in FIG.
By calculating the coordinates of a similar rectangle from the data shown in (b), the offset of the body is (xl0 ', yl0'),
Since the offset of the crown can be calculated as (xl2 ′, yl2 ′), the offset amount for the parallel movement can be calculated by the equations (4) and (5).

【００２５】本体：ｘｈ＝ｘｌ０−ｘｌ０’，ｙｈ＝ｙｌ０−ｙｌ０’ …（４）冠：ｘｃ＝ｘｌ２−ｘｌ２’，ｘｃ＝ｙｌ２−ｙｌ２’ …（５）つぎに、この移動量と、組み合わされる文字が格納され
ているエリアを変更して再登録する。ここで、図５
（ｂ）に示す座標データを詳細に説明すると、アウトラ
インフォントデータの座標は一般に、１０００×１００
０を基準として格納されている。したがって、最大相対
値は、１バイトで表現可能な数「２５５」を越えるので
２バイトを要する場合がある。また、直線はｘｙ方向の
一方の相対値が「０」になるので、座標を０〜２バイト
の範囲で可変長で符号化することにより低容量化するこ
とができるが、このように可変長で行う場合には長さを
決定するフラグが必要になる。すなわち、座標データは
式（６）に示す情報が繰り返されるが、式（６）のフォ
ーマットは式（７）に変換される。Body: xh = xl0-xl0 ', yh = yl0-yl0' (4) Crown: xc = xl2-xl2 ', xc = yl2-yl2' (5) Next, this movement amount and combination Change the area in which the characters to be stored are stored and register again. Here, FIG.
To explain the coordinate data shown in (b) in detail, the coordinates of outline font data are generally 1000 × 100.
It is stored based on 0. Therefore, since the maximum relative value exceeds the number "255" that can be expressed by 1 byte, 2 bytes may be required. Also, since one of the relative values of the straight line in the xy direction is “0”, the capacity can be reduced by encoding the coordinates with a variable length in the range of 0 to 2 bytes. In the case of (1), a flag that determines the length is required. That is, the coordinate data repeats the information shown in Expression (6), but the format of Expression (6) is converted into Expression (7).

【００２６】座標データ：フラグ＋座標＋補正情報 …（６）フラグ＋ｘｈ＋ｙｈ＋参照場所＋フラグ＋ｘｃ＋ｙｃ＋参照場所 …（７）式（７）に示すフラグは、特殊フォーマットを認識する
ために用いられるが、可変長の判定や他の情報がビット
のオン、オフで設定されているので、空いているビット
の組合せで実現することができる。この処理によりデー
タ長を変更する必要があるが、この場合には新しいフォ
ーマットにより書き換えを行う間にバイト数をカウント
することにより、新たなデータ長を古いデータ長が格納
されているエリアに上書きすることができる。Coordinate data: flag + coordinate + correction information (6) Flag + xh + yh + reference location + flag + xc + yc + reference location (7) The flag shown in equation (7) is used for recognizing the special format, but variable. Since length determination and other information are set by turning bits on and off, it can be realized by a combination of vacant bits. Although it is necessary to change the data length by this process, in this case, the new data length is overwritten in the area where the old data length is stored by counting the number of bytes while rewriting with the new format. be able to.

【００２７】また、新フォーマットによりデータ量が低
減し、データの参照場所が変化するので、図５（ａ）に
示す文字索引情報の参照場所データを変更する必要があ
るが、全ての不要データをシフトすることにより新しい
フォントデータを作成することができる。すなわち、合
成文字は全く別のフォーマットで構成されるが、図１に
示す文字展開手段１７により本体の参照場所から本体デ
ータを読み込み、オフセット移動量分だけ展開し、ま
た、冠も同様に処理することにより、オリジナルフォン
トデータと等価のデータを得ることができる。Further, since the data amount is reduced and the data reference location is changed by the new format, it is necessary to change the reference location data of the character index information shown in FIG. 5A, but all unnecessary data is deleted. A new font data can be created by shifting. That is, although the synthetic character is constructed in a completely different format, the character expansion means 17 shown in FIG. 1 reads the main body data from the reference location of the main body, expands it by the offset movement amount, and similarly processes the crown. As a result, data equivalent to the original font data can be obtained.

【００２８】ここで、従来例と本実施例の違いについて
説明すると、従来の生成課程では図７（ａ）に示すよう
に、文字情報を１回参照場所へ移動することにより文字
を生成するが、本実施例では図７（ｂ）に示すように、
合成文字の情報を３回移動することにより文字を生成す
る。Here, the difference between the conventional example and this embodiment will be described. In the conventional generation process, as shown in FIG. 7A, a character is generated by moving character information once to a reference place. In this embodiment, as shown in FIG.
A character is generated by moving the information of the composite character three times.

【００２９】つぎに、図８を参照して上記圧縮動作を説
明すると、まず、図８に示す合成文字のテーブルを作成
するルーチンにおいて、登録文字リストから合成可能と
思われる文字番号を抽出し（ステップＳ１）、合成文字
を構成する本体と冠の文字番号を抽出し（ステップＳ
２）、文字数Ｎ、合成文字ＰＮ、本体ＭＮ、冠ＣＮの文
字番号テーブルを作成する（ステップＳ３）。Next, the compression operation will be described with reference to FIG. 8. First, in the routine for creating the table of composite characters shown in FIG. 8, character numbers which are considered to be compositable are extracted from the registered character list ( In step S1), the character numbers of the main body and the crown forming the composite character are extracted (step S1).
2) Create a character number table for the number of characters N, the composite character PN, the main body MN, and the crown CN (step S3).

【００３０】ついで、文字数Ｎを示すカウンタｉをクリ
アした後（ステップＳ４，Ｓ５）、カウンタｉが示す番
号の合成文字ＰＮを図６に示すように展開して本体Ｍ
Ｎ、冠ＣＮの基点を得（ステップＳ６）、本体ＭＮを展
開してその基点を得（ステップＳ７）、また、冠ＣＮを
展開してその基点を得る（ステップＳ８）。そして、数
２により本体ＭＮ、冠ＣＮのオフセット値を算出し（ス
テップＳ９）、数３によりオフセットを補正する（ステ
ップＳ１０）。Then, after the counter i indicating the number of characters N is cleared (steps S4 and S5), the composite character PN of the number indicated by the counter i is expanded as shown in FIG.
The base points of N and the crown CN are obtained (step S6), the main body MN is expanded to obtain the base point (step S7), and the crown CN is expanded to obtain the base point (step S8). Then, the offset values of the main body MN and the crown CN are calculated by Equation 2 (step S9), and the offset is corrected by Equation 3 (step S10).

【００３１】ついで、合成文字ＰＮの本体ＭＮ、冠ＣＮ
の座標データが一致しているか否かを判別し（ステップ
Ｓ１１）、一致している場合にはテーブルにオフセット
値を追加して格納し（ステップＳ１２）、他方、一致し
ていない場合にはテーブルから削除する（ステップＳ１
３）。そして、カウンタｉをインクリメントし（ステッ
プＳ１４）、ステップＳ５に戻って次の文字について同
様な処理を行い、全ての文字の処理を完了すると、この
合成文字テーブル作成ルーチンを終了する（ステップＳ
１５）。Next, the main body MN of the composite character PN and the crown CN
It is determined whether or not the coordinate data of the two match (step S11), and if they match, the offset value is added and stored in the table (step S12). Deleted from (step S1
3). Then, the counter i is incremented (step S14), the process returns to step S5, the same process is performed for the next character, and when the process for all the characters is completed, this synthetic character table creation routine is ended (step S).
15).

【００３２】図９に示す新規フォントファイル作成ルー
チンでは、まず、オリジナルファイルＲＦから非圧縮部
の文字索引情報を最後まで読み込み、また、登録文字数
Ｍを得た後（ステップＳ２１）、その情報を新規ファイ
ルＷＦにコピーする（ステップＳ２２）。そして、登録
文字数Ｍを示すカウンタｉをクリアした後（ステップＳ
２３，Ｓ２４）、ｉ番目の文字索引部から参照場所など
の情報を読み込み（ステップＳ２５）、前述した合成文
字テーブル内にその文字が存在する場合（ステップＳ２
６）にはステップＳ２８以下に進み、存在しない場合に
はオリジナルファイルＲＦのデータを新規ファイルＷＦ
に書き込む（ステップＳ２７）。In the new font file creation routine shown in FIG. 9, first, the character index information of the uncompressed portion is read from the original file RF to the end, and after the number of registered characters M is obtained (step S21), that information is newly created. Copy to the file WF (step S22). Then, after clearing the counter i indicating the number of registered characters M (step S
23, S24), information such as a reference location is read from the i-th character index portion (step S25), and the character exists in the above-mentioned composite character table (step S2).
In step 6), the data of the original file RF is replaced with the new file WF if it does not exist.
(Step S27).

【００３３】ステップＳ２８以下ではオリジナルファイ
ルＲＦの残照場所から図５（ｂ）に示すような固有デー
タを読み込み、新規ファイルＷＦにコピーし（ステップ
Ｓ２９）、新規ファイルＷＦに数５に示すような新フォ
ーマットを作成して書き込み（ステップＳ３０）、デー
タ長や参照番号等の変更データを書き込む（ステップＳ
３１）。そして、カウンタｉをインクリメントし（ステ
ップＳ３２）、ステップＳ２４に戻って次の文字につい
て同様な処理を行い、全ての文字の処理を完了すると、
ステップＳ３３に分岐する。ステップＳ３３以下では可
逆符号により文字毎の圧縮変換テーブルを作成し、参照
場所を書き換え（ステップＳ３４）、この新規フォント
ファイル作成ルーチンを終了する（ステップＳ３５）。In step S28 and thereafter, the unique data as shown in FIG. 5B is read from the afterglow location of the original file RF, copied to the new file WF (step S29), and the new file WF is updated as shown in Formula 5. Create a format and write it (step S30), and write change data such as data length and reference number (step S30).
31). Then, the counter i is incremented (step S32), the process returns to step S24, the same process is performed for the next character, and when the process for all the characters is completed,
It branches to step S33. In step S33 and thereafter, a compression conversion table for each character is created by the reversible code, the reference location is rewritten (step S34), and this new font file creation routine is finished (step S35).

【００３４】つぎに、図１０を参照してこの圧縮フォン
トデータの復元動作を説明する。まず、新規ファイルＷ
Ｆにおける書体が指定され（ステップＳ４１）、その書
体の指定文字ｉが入力すると（ステップＳ４２）、文字
索引部から文字番号ｉのデータを読み込み（ステップＳ
４３）、参照場所のデータを上記圧縮変換テーブルによ
り置換してバッファに書き込み（ステップＳ４４）、固
有データを文字展開部に送出する（ステップＳ４５）。
ついで、ステップＳ４６でこの文字が特殊フォーマット
である場合にはステップＳ４８以下に進み、特殊フォー
マットでない場合にはステップＳ４７に分岐して残りの
データを文字展開部に送出し、処理を終了する。Next, the restoring operation of the compressed font data will be described with reference to FIG. First, the new file W
When the typeface in F is designated (step S41) and the designated character i of the typeface is input (step S42), the data of the character number i is read from the character index portion (step S41).
43), the data at the reference location is replaced by the compression conversion table and written in the buffer (step S44), and the unique data is sent to the character expansion unit (step S45).
Then, in step S46, if this character is in the special format, the process proceeds to step S48 and below, and if it is not the special format, the process branches to step S47 to send the remaining data to the character expanding unit, and the process is ended.

【００３５】ステップＳ４８以下では、参照場所のデー
タを変換テーブルにより全置換してバッファに書き込
み、本体の座標データのオフセットを補正して文字展開
部に送出し（ステップＳ４９）、ついで、参照場所のデ
ータを変換テーブルにより全置換してバッファに書き込
み（ステップＳ５０）、冠の座標データのオフセットを
補正して文字展開部に送出し（ステップＳ５１）、この
復元処理を終了する。In step S48 and subsequent steps, the reference location data is completely replaced by the conversion table and written in the buffer, the offset of the coordinate data of the main body is corrected and sent to the character expanding section (step S49), and then the reference location data is stored. The data is completely replaced by the conversion table and written in the buffer (step S50), the offset of the coordinate data of the crown is corrected and sent to the character expansion unit (step S51), and this restoration processing is ended.

【００３６】ここで、本実施例におけるアウトラインフ
ォントデータの圧縮は、復号化する場合に完全に復元す
る必要があるので、図１に示すように可逆符号化が用い
られている。この場合、圧縮データが現データとは全く
異なっていても、図１１に示すようにアウトラインフォ
ントを展開する装置から図示矢印方向に見たデータが現
データと変わらなければ圧縮データを完全に復元するこ
とができる。また、図４（ａ）においてタイプフェース
情報や文字索引部を圧縮すると、この情報を全て復号化
しないと文字情報を検索できないので、この実施例では
文字情報のみを圧縮するように構成されている。Here, since the outline font data compression in this embodiment needs to be completely restored when decoding, lossless encoding is used as shown in FIG. In this case, even if the compressed data is completely different from the current data, if the data seen in the direction of the arrow in the figure from the device for expanding the outline font as shown in FIG. 11 is not the current data, the compressed data is completely restored. be able to. Further, when the typeface information and the character index portion are compressed in FIG. 4A, the character information cannot be retrieved unless all of this information is decoded, so that in this embodiment only the character information is compressed. ..

【００３７】ここで、複数の書体の文字が混ざって出力
する場合には展開時間が問題になり、この問題は特に、
登録文字数が多い日本語の場合に顕著となる。また、圧
縮された文字情報を１かたまりで取り出さなければなら
ないが、圧縮データが可変長で符号化されるので、１文
字の最後の１バイトに満たない符号はビットを付加して
バイト単位に変換しなければならない。Here, when the characters of a plurality of typefaces are mixed and output, the development time becomes a problem, and this problem is
This becomes noticeable when Japanese has a large number of registered characters. In addition, the compressed character information must be extracted as one block, but since the compressed data is encoded in a variable length, the code that is less than the last 1 byte of one character is converted into byte units by adding bits. Must.

【００３８】図１２は全横が１バイト単位で構成されて
ビット単位に８個に分割された例を示し、全ての文字を
一度に符号化すると、図１２上方に示すようにビット単
位で連続しなくなる。そこで、図１２下方に示すように
バイト単位で切り出すことができるように構成すること
により、参照場所からデータ長に相当する文字情報を欠
落することなく読み込むことができる。FIG. 12 shows an example in which all sides are composed of 1-byte units and divided into 8 bit units. When all characters are encoded at once, as shown in the upper part of FIG. Will not do. Therefore, as shown in the lower part of FIG. 12, the character unit corresponding to the data length can be read from the reference location without omission by arranging to be able to cut out in byte units.

【００３９】つぎに、この１バイトを単位として符号化
する第２の実施例について説明する。図１２は従来例と
第２の実施例におけるデータの違いを示す説明図、図１
３は第２の実施例を示すブロック図、図１４はデータを
バイト単位で示す説明図、図１５，１６は第２の実施例
の圧縮動作を説明するためのフローチャート、図１７は
復元動作を説明するためのフローチャートである。Next, a description will be given of a second embodiment in which this 1 byte is encoded as a unit. FIG. 12 is an explanatory diagram showing a difference in data between the conventional example and the second embodiment, and FIG.
3 is a block diagram showing the second embodiment, FIG. 14 is an explanatory diagram showing data in byte units, FIGS. 15 and 16 are flow charts for explaining the compression operation of the second embodiment, and FIG. 17 is a decompression operation. It is a flow chart for explaining.

【００４０】アウトラインフォントデータは一般に、
１，２，４バイト長を単位として記録されているが、フ
ォントを形成するためには、あるフォーマットに準じて
データを読み込むことが必要であるので、読み込み時点
のバイト長は既知である。そこで、もしフォントデータ
が全て１バイト単位であると仮定すると、その１バイト
は符号無しで０〜２５５のデータ幅を有するので、エン
トロピを小さくするためにデータ幅を狭める処理が必要
になる。すなわち、０〜２５５のデータ幅を横軸として
そのデータの出現頻度（個数）の分布を縦軸とした場
合、任意のデータに集中した正規分布を形成し、かつ分
布の広がりが小さいという条件、すなわち標準偏差が小
さいという条件を満足するほど圧縮効率を向上すること
ができる。Outline font data is generally
The length is recorded in units of 1, 2, and 4 bytes, but in order to form a font, it is necessary to read data in accordance with a certain format, and therefore the byte length at the time of reading is known. Therefore, if all the font data is assumed to be in 1-byte units, the 1-byte has a data width of 0 to 255 without a code. Therefore, it is necessary to reduce the data width in order to reduce entropy. That is, when the horizontal axis is the data width of 0 to 255 and the vertical axis is the distribution of the appearance frequency (number) of the data, a condition that a normal distribution concentrated on arbitrary data is formed and the spread of the distribution is small, That is, the compression efficiency can be improved as the condition that the standard deviation is small is satisfied.

【００４１】そこで、最小単位である１バイトの情報が
それ自体操作することができないので、この第２の実施
例は図１３に示すように、文字のアウトラインフォント
データから１バイトを越えるデータを抽出するデータ抽
出手段２１と、文字のデータ幅に対する出現頻度の分布
の広がりが小さくなるようなＸ進数を予め算出するＸ進
数算出手段２２と、データ抽出手段２１により抽出され
たデータをこのＸ進数で圧縮する圧縮手段２３を有し、
１バイトより大きい単位の情報を操作することにより、
エントロピを軽減するように構成されている。また、こ
の圧縮データを可逆符号化手段２４により更に圧縮して
媒体に記録し、出力の際に可逆復号化手段２５により復
号し、文字展開手段２６により元のアウトラインフォン
トデータに展開する。Therefore, since the minimum unit of 1 byte of information cannot be manipulated by itself, the second embodiment extracts data exceeding 1 byte from the outline font data of a character as shown in FIG. Data extracting means 21, an X-adic number calculating means 22 for pre-calculating an X-adic number such that the spread of the distribution of the appearance frequency with respect to the data width of the character becomes small, and the data extracted by the data extracting means 21 with this X-adic number. Having compression means 23 for compressing,
By manipulating information in units larger than 1 byte,
It is configured to reduce entropy. Further, this compressed data is further compressed by the lossless encoding means 24 and recorded on the medium, and upon output, it is decoded by the lossless decoding means 25 and expanded by the character expanding means 26 into the original outline font data.

【００４２】図１４に示すように２バイトの情報は、符
号無しで０〜６５５３５個の範囲であり、符号化の際に
上位、下位の各１バイトの個数がカウントされるので、
上位バイトと下位バイトの各値が同様な値に近づけるこ
とによりエントロピを軽減することができる。そこで、
１バイトを越える情報をＸ進数表現のデータに置換する
ために、まず最適なＸ進数値を求める。なお、任意の値
は式（８），（９）により表すことができる。As shown in FIG. 14, the 2-byte information is in the range of 0 to 65535 without a code, and the number of each upper and lower 1 byte is counted at the time of encoding.
Entropy can be reduced by bringing the values of the upper byte and the lower byte close to similar values. Therefore,
In order to replace information exceeding 1 byte with X-adic data, the optimum X-adic value is first obtained. Note that any value can be expressed by equations (8) and (9).

【００４３】ａＸ＋ｂ＝ｃ …（８）ａ＜Ｘ，ｂ＜Ｘ …（９）（但し、ａ：上位バイト値、ｂ：下位バイト値）ここで、アウトラインフォントの基本単位が「１００
０」でありこの値より大きく逸脱する座標関係のデータ
が存在しないので、上記値ａ，ｂは符号無しで「１００
０」前後が最大値となる。また、他のデータ長について
も同様に、１文字当たりの値は「１０００」に満たな
い。座標データは数４により、相対値が負、または基点
が負から始まる場合にもフラグが正負情報を含むので、
符号無し情報として扱うことができる。AX + b = c (8) a <X, b <X (9) (where a: upper byte value, b: lower byte value) Here, the basic unit of the outline font is "100".
Since there is no coordinate-related data that deviates significantly from this value, the above values a and b are set to "100" without a sign.
The maximum value is around "0". Similarly, for other data lengths, the value per character is less than "1000". According to equation 4, the coordinate data includes positive / negative information even if the relative value is negative or the base point starts from negative.
It can be handled as unsigned information.

【００４４】なお、図５に示す文字情報中のごく一部に
負のデータが存在し、符号ありの場合には最上位ビット
が「１」になるのでデータ幅が広がる。そこで、Ｘ進数
値を求める前に、これら少数の負の情報を正の情報に置
換するために、負の情報の最小値を抽出してこの最小値
が「０」になるようにオフセットを加える。なお、実際
に圧縮を行う文字情報は、１，２バイト単位で表現さ
れ、４バイト単位のデータが圧縮領域内に存在しない。Note that negative data exists in a small part of the character information shown in FIG. 5, and when there is a sign, the most significant bit becomes "1", so that the data width widens. Therefore, in order to replace these small numbers of negative information with positive information before obtaining the X-adic value, the minimum value of the negative information is extracted and an offset is added so that this minimum value becomes "0". .. The character information that is actually compressed is expressed in units of 1 and 2 bytes, and data in units of 4 bytes does not exist in the compression area.

【００４５】つぎに、全てが符号無しの２バイト情報か
ら最大値を抽出する。この最大値を数６の値ｃとし、ま
た、ａ＝ｂ＝Ｘ−１とすると、Ｘは式（１０）を満足す
る最小値により求めることができる。Next, the maximum value is extracted from the 2-byte information, all of which are unsigned. If this maximum value is set to the value c of Equation 6 and a = b = X−1, X can be obtained by the minimum value that satisfies the expression (10).

【００４６】（Ｘ−１）・（Ｘ＋１）＞ｃ …（１０）式（１０）において例えばｃ＝１０５０とすると、Ｘ＝
３３となり、また、通常は２５６進数でａ、ｂが最大値
「２５５」となるが、このＸ進数では最大がＸ−１であ
るので、分布の広がりを抑えることができる。(X−1) · (X + 1)> c (10) If, for example, c = 1050 in the equation (10), X =
33, and normally a and b have a maximum value of "255" in a 256-ary number, but the maximum in this X-ary number is X-1, so the spread of the distribution can be suppressed.

【００４７】つぎに、図１５を参照してこの圧縮動作を
説明すると、まず、図１５のようにして上記Ｘ値と負数
の補正値を算出する。すなわちオリジナルファイルから
登録文字数Ｎを得（ステップＳ６１）、この登録文字数
Ｎを示すカウンタｉをクリアする（ステップＳ６２，Ｓ
６３）。ついで、ｉ番目の文字の索引情報を検索部から
読み込み（ステップＳ６４）、参照場所へ移行して最大
負数と２バイトの最大値を得る（ステップＳ６５）。そ
して、カウンタｉをインクリメントし（ステップＳ６
６）、ステップＳ６３に戻って各文字の最大負数と２バ
イトの最大値を算出し（ステップＳ６７）、ついで、最
大負数の絶対値を正数変換値とし、また、２バイトの最
大値からＸ値を算出する（ステップＳ６８）。Next, the compression operation will be described with reference to FIG. 15. First, the X value and the negative correction value are calculated as shown in FIG. That is, the registered character number N is obtained from the original file (step S61), and the counter i indicating the registered character number N is cleared (steps S62, S).
63). Then, the index information of the i-th character is read from the search unit (step S64), and the process moves to the reference location to obtain the maximum negative number and the maximum value of 2 bytes (step S65). Then, the counter i is incremented (step S6
6) Return to step S63, calculate the maximum negative number of each character and the maximum value of 2 bytes (step S67), then use the absolute value of the maximum negative number as a positive conversion value, and convert the maximum value of 2 bytes to X. The value is calculated (step S68).

【００４８】そして、図１６においてオリジナルファイ
ルＲＦから非圧縮部の文字索引情報を最後まで読み込
み、また、登録文字数Ｎを得た後（ステップＳ７１）、
その情報を新規ファイルＷＦにコピーする（ステップＳ
７２）。ついで、カウンタｉをクリアし（ステップＳ７
３，Ｓ７４）、ｉ番目の文字情報部から参照場所などの
情報を読み込み（ステップＳ７５）、文字情報部内の２
バイトデータをＸ進数で置換する（ステップＳ７６）。
そして、カウンタｉをインクリメントし（ステップＳ７
７）、ステップＳ７４に戻って各文字をＸ進数で置換
し、ついで、第１の実施例において説明した可逆符号に
より文字毎の圧縮変換テーブルを作成し（ステップＳ７
８）、参照場所を書き換え（ステップＳ７９）、この新
規フォントファイル作成ルーチンを終了する（ステップ
Ｓ８０）。Then, in FIG. 16, the character index information of the non-compressed portion is read to the end from the original file RF, and after the registered character number N is obtained (step S71).
Copy that information to the new file WF (step S
72). Then, the counter i is cleared (step S7
3, S74), information such as a reference place is read from the i-th character information section (step S75), and 2 in the character information section is read.
The byte data is replaced with an X-adic number (step S76).
Then, the counter i is incremented (step S7
7) Returning to step S74, each character is replaced with an X-adic number, and then a compression conversion table for each character is created by the lossless code described in the first embodiment (step S7).
8) The reference location is rewritten (step S79), and the new font file creation routine is finished (step S80).

【００４９】つぎに、図１７を参照してこの圧縮フォン
トデータの復元動作を説明する。まず、図１０に示す場
合と同様に、新規ファイルＷＦにおける書体が指定され
（ステップＳ８１）、その書体の指定文字ｉが入力する
と（ステップＳ８２）、文字索引部から文字番号ｉのデ
ータを読み込み（ステップＳ８３）、参照場所のデータ
を上記圧縮変換テーブルにより置換してバッファに書き
込む（ステップＳ８４）。ついで、フォーマットに準じ
てデータを取り込んで２バイトデータを２５６進数に変
換し（ステップＳ８５）、全ての文字データの変換を完
了するとそのデータを文字展開部に送出する（ステップ
Ｓ８６）。Next, the operation of restoring the compressed font data will be described with reference to FIG. First, as in the case shown in FIG. 10, when the typeface in the new file WF is designated (step S81) and the designated character i of the typeface is input (step S82), the data of the character number i is read from the character index portion ( In step S83), the data at the reference location is replaced by the compression conversion table and written in the buffer (step S84). Then, the data is taken in according to the format, the 2-byte data is converted into a 256-ary number (step S85), and when the conversion of all the character data is completed, the data is sent to the character expansion section (step S86).

【００５０】つぎに、第３の実施例を説明する。図１８
は第３の実施例を示すブロック図、図１９はアウトライ
ンフォントデータの出現頻度を示す説明図、図２０はフ
ラグ階層の出現頻度を示す説明図、図２１は第３の実施
例の圧縮動作を説明するためのフローチャート、図２２
は復元動作を説明するためのフローチャートである。Next, a third embodiment will be described. FIG.
Is a block diagram showing the third embodiment, FIG. 19 is an explanatory diagram showing the appearance frequency of outline font data, FIG. 20 is an explanatory diagram showing the appearance frequency of flag hierarchies, and FIG. 21 is the compression operation of the third embodiment. FIG. 22 is a flowchart for explaining.
Is a flow chart for explaining the restoration operation.

【００５１】第１の実施例において説明した図５及び式
（６）を参照すると、圧縮領域は１）固有データ
「０」、２）固有データ「１」、３）フラグ、４）座
標、５）補正情報の５階層に分類することができる。こ
の５階層を分類しない出現頻度の分布は、図１９に示す
ように正規分布状に減衰せず、逆に符号Ｔにより示すよ
うに増加するひげ状部分が見られる。このひげ状部分は
両側の出現頻度を比べて異常であるので、いずれかの階
層の特徴が現れていると考えられる。Referring to FIG. 5 and the equation (6) described in the first embodiment, the compression area is 1) unique data “0”, 2) unique data “1”, 3) flag, 4) coordinates, 5 ) It can be classified into five layers of correction information. The distribution of the appearance frequencies in which the five layers are not classified does not attenuate like a normal distribution as shown in FIG. 19, and conversely there is a whisker-like portion that increases as indicated by the symbol T. Since the whiskers are abnormal in appearance frequency on both sides, it is considered that the feature of one of the layers appears.

【００５２】したがって、この分布を利用して高圧縮率
を実現することができないので、この第３の実施例では
図１８に示すように、階層化手段３１が文字のアウトラ
インフォントデータをその情報ごとに階層化し、算出手
段３２が階層化手段３１により階層化された各階層のデ
ータの出現頻度を算出し、圧縮手段３３が算出手段３２
により算出された各階層の出現頻度により階層別の圧縮
用置換テーブルを作成して階層毎に圧縮するように構成
されている。そして、この圧縮データを可逆符号化手段
３４により更に圧縮して媒体に記録し、出力の際に可逆
復号化手段３５により復号し、文字展開手段３６により
元のアウトラインフォントデータに展開する。Therefore, since it is not possible to realize a high compression rate by utilizing this distribution, in the third embodiment, as shown in FIG. 18, the layering means 31 sets the outline font data of the character for each information. And the calculating means 32 calculates the appearance frequency of the data of each hierarchy hierarchized by the hierarchizing means 31, and the compressing means 33 calculates by the calculating means 32.
A compression substitution table for each layer is created based on the appearance frequency of each layer calculated by the above, and compression is performed for each layer. Then, the compressed data is further compressed by the lossless encoding means 34 and recorded on the medium, and when output, decoded by the lossless decoding means 35, and expanded by the character expanding means 36 into the original outline font data.

【００５３】この階層は全文字について相関が強く、１
文字についての各分布は類似傾向を示すが、この５階層
の中でデータ数が多い文字の場合にひげ状部分Ｔに大き
な影響を与える。すなわち、階層自体が全体に占める割
合が小さい場合に、本実施例を適用しない方法も考えら
れるが、この実施例では５階層の全てに適用する。５階
層毎の分布を取ると、いずれかが必ず正規分布と全く異
なる異常分布を示すはずであるが、データの性質上、
３）フラグ、５）補正情報の階層が異常分布を示す。こ
の理由は、フラグの性質上派生し、ビット操作によりデ
ータを形成しているからである。This layer has a strong correlation for all characters and is 1
Although the distributions of the characters show similar tendencies, the whiskers T have a great influence in the case of a character having a large amount of data in the five layers. That is, when the ratio of the hierarchy itself to the whole is small, a method in which this embodiment is not applied can be considered, but in this embodiment, it is applied to all five hierarchies. If you take the distribution for each of the five layers, one of them should show an abnormal distribution that is completely different from the normal distribution, but due to the nature of the data,
The hierarchy of 3) flags and 5) correction information indicates the abnormal distribution. The reason is that it is derived from the nature of the flag and forms data by bit manipulation.

【００５４】これらのデータは図２０（ａ）に示すよう
に、所々欠いた櫛のような分布となるので、本実施例で
は各階層の分布をソートして図２０（ｂ）に示すような
置換テーブルを作成し、この置換テーブルにより当該階
層のソースデータを書き換える。そして、この置換テー
ブルにより図１９に示すひげ上部分Ｔを消滅させて正規
分布に近い分布に補正し、全階層の置換終了後に再度圧
縮対象範囲の全分布を取り、符号化する。As shown in FIG. 20 (a), these data have a comb-like distribution that is missing in some places. Therefore, in this embodiment, the distribution of each layer is sorted and as shown in FIG. 20 (b). A replacement table is created, and the source data of the layer is rewritten by this replacement table. Then, by using this replacement table, the whiskers T shown in FIG. 19 are eliminated and corrected to a distribution close to a normal distribution, and after the replacement of all layers is completed, the entire distribution of the compression target range is taken again and encoded.

【００５５】つぎに、図２１を参照してこの圧縮動作を
説明する。まず、図２１において図１５の場合と同様
に、オリジナルファイルから登録文字数Ｎを得（ステッ
プＳ９１）、この登録文字数Ｎを示すカウンタｉをクリ
アし（ステップＳ９２、Ｓ９３）、ｉ番目の文字の索引
情報を検索部から読み込む（ステップＳ９４）。つい
で、フォーマットに準じてデータを読み込んで階層別に
データのヒストグラムをバイト単位で取り（ステップＳ
９５）、この処理を各文字について行う（ステップＳ９
６→Ｓ９３）。そして、各ヒストグラムを大きい順にソ
ートし、最大データを「０」に設定することにより（ス
テップＳ９７）この階層別置換テーブルを作成する（ス
テップＳ９８）。Next, this compression operation will be described with reference to FIG. First, in FIG. 21, as in the case of FIG. 15, the number N of registered characters is obtained from the original file (step S91), the counter i indicating the number N of registered characters is cleared (steps S92, S93), and the index of the i-th character is obtained. Information is read from the search unit (step S94). Then, the data is read according to the format, and the histogram of the data is obtained byte by byte for each layer (step S
95), this process is performed for each character (step S9).
6 → S93). Then, the respective histograms are sorted in descending order, and the maximum data is set to "0" (step S97) to create this hierarchical replacement table (step S98).

【００５６】そして、図２２において図１６に示す場合
と同様に、オリジナルファイルＲＦから非圧縮部の文字
索引情報を最後まで読み込み、また、登録文字数Ｎを得
た後（ステップＳ１０１）、その情報を新規ファイルＷ
Ｆにコピーし（ステップＳ１０２）、カウンタｉをクリ
アし（ステップＳ１０３，Ｓ１０４）、ｉ番目の文字情
報部から参照場所などの情報を読み込む（ステップＳ１
０５）。ついで、フォーマットに準じてデータを読み込
んでその階層の置換テーブルにより置換し（ステップＳ
１０６）、この処理を各文字について行う（ステップＳ
１０７→Ｓ１０４）。そして、図１６に示す場合と同様
に、可逆符号により文字毎の圧縮変換テーブルを作成し
（ステップＳ１０８）、参照場所を書き換え（ステップ
Ｓ１０９）、この新規フォントファイル作成ルーチンを
終了する（ステップＳ１１０）。Then, in FIG. 22, as in the case shown in FIG. 16, the character index information of the non-compressed portion is read from the original file RF to the end, and after the registered character number N is obtained (step S101), that information is read. New file W
Copy to F (step S102), clear counter i (steps S103 and S104), and read information such as a reference location from the i-th character information section (step S1).
05). Then, the data is read according to the format and replaced by the replacement table of that layer (step S
106), this process is performed for each character (step S
107 → S104). Then, similarly to the case shown in FIG. 16, a compression conversion table for each character is created by the lossless code (step S108), the reference location is rewritten (step S109), and this new font file creation routine is finished (step S110). ..

【００５７】つぎに、図２３を参照してこの復元動作を
説明する。まず、図１７に示す場合と同様に、新規ファ
イルＷＦにおける書体が指定され（ステップＳ１１
１）、その書体の指定文字ｉが入力すると（ステップＳ
１１２）、文字索引部から文字番号ｉのデータを読み込
み（ステップＳ１１３）、参照場所のデータを上記圧縮
変換テーブルにより置換してバッファに書き込む（ステ
ップＳ１１４）。ついで、フォーマットに準じてデータ
を取り込んでその階層の置換テーブルにより元のデータ
に変換し（ステップＳ１１５）、全ての文字データの変
換を完了するとそのデータを文字展開部に送出する（ス
テップＳ１１６）。Next, this restoration operation will be described with reference to FIG. First, as in the case shown in FIG. 17, the typeface in the new file WF is designated (step S11).
1) When the designated letter i of the typeface is input (step S
112), the data of the character number i is read from the character index portion (step S113), the data at the reference location is replaced by the compression conversion table and written in the buffer (step S114). Then, the data is taken in according to the format and converted into the original data by the replacement table of the layer (step S115), and when the conversion of all the character data is completed, the data is sent to the character expansion section (step S116).

【００５８】また、図１９に示すように第１〜第３の実
施例を組み合わせてブロック化、Ｘ進数化、階層別置換
化するように構成してもよい。なお、図２４において、
階層別置換化した後Ｘ進数化すると、せっかくのソート
が無意味であり、また、破線で示すブロック化は、容易
にブロック化することができない書体で行わない方がよ
い。Further, as shown in FIG. 19, the first to third embodiments may be combined to form a block, an X-adic number, or a hierarchical permutation. In addition, in FIG.
If it is converted into X-adic numbers after permutation according to hierarchy, it is meaningless to sort it, and the blocking shown by the broken line should not be performed with a typeface that cannot be easily blocked.

【００５９】ここで、第３の実施例のように階層別にヒ
ストグラムをソーティングして置換すると、分布のエン
トロピが低下するが、圧縮効率を更に高めるために４）
座標データ部に着目すると、座標データが他の階層と異
なって異常分布を示さず、正規分布を示す。したがっ
て、座標データを予測し、予測誤差を符号化することに
より、座標データ量を縮小して高圧縮率を実現すること
ができる。Here, if the histograms are sorted and replaced for each layer as in the third embodiment, the entropy of the distribution decreases, but in order to further improve the compression efficiency, 4).
Focusing on the coordinate data part, the coordinate data does not show an abnormal distribution unlike the other layers, but shows a normal distribution. Therefore, by predicting the coordinate data and encoding the prediction error, the amount of coordinate data can be reduced and a high compression rate can be realized.

【００６０】つぎに、第４の実施例を説明する。図２５
は第４の実施例を示すブロック図、図２６はベゼー曲線
を示す説明図、図２７は単位パターンを示す説明図、図
２８は図２７の各単位パターンを示す説明図、図２９は
第１単位パターンを示す説明図、図３０は第２単位パタ
ーンを示す説明図、図３１は第３単位パターンを示す説
明図、図３２は第１〜第３単位パターンの組み合わせ例
を示す説明図、図３３は第４の実施例におけるパターン
統合化の第１段階を説明するためのフローチャート、図
３４はパターン統合化の第２段階とパターンファイル作
成を説明するためのフローチャート、図３５はソースフ
ォントファイルの書き換え動作を説明するためのフロー
チャート、図３６は圧縮動作を説明するためのフローチ
ャート、図３７は復号化プロセスを説明するためのフロ
ーチャート、図３８は差分算出方法を示す説明図であ
る。Next, a fourth embodiment will be described. Figure 25
26 is a block diagram showing a fourth embodiment, FIG. 26 is an explanatory diagram showing a Beze curve, FIG. 27 is an explanatory diagram showing a unit pattern, FIG. 28 is an explanatory diagram showing each unit pattern of FIG. 27, and FIG. 29 is a first diagram. 30 is an explanatory diagram showing a unit pattern, FIG. 30 is an explanatory diagram showing a second unit pattern, FIG. 31 is an explanatory diagram showing a third unit pattern, and FIG. 32 is an explanatory diagram showing a combination example of first to third unit patterns. 33 is a flow chart for explaining the first stage of pattern integration in the fourth embodiment, FIG. 34 is a flow chart for explaining the second stage of pattern integration and pattern file creation, and FIG. 35 is for the source font file. FIG. 36 is a flowchart for explaining a rewriting operation, FIG. 36 is a flowchart for explaining a compression operation, FIG. 37 is a flowchart for explaining a decoding process, and FIG. Is an explanatory diagram showing a difference calculation process.

【００６１】この実施例では図２５に示すように、グル
ープ化手段４１により例えば明朝体、教科書体をそれぞ
れ第１、第２パターンとして各曲線に近いパターンを選
出し、誤差圧縮手段４２により近似パターンとの差を符
号化するように構成されている。まず、ベゼー曲線は図
２６に示すように点Ｐ０〜Ｐ３で決定され、この前点と
の差分ｄｘｉ，ｄｙｉ（ｉ＝０〜２）を符号化すると、
通常ｄｘｉ＊ｄｙｉ＝０とした場合に水平線と垂直線が
特異パターンとなり、したがって、この特異パターンを
考慮しないと情報量が増加する。すなわち、水平線の場
合（ｄｙ０＝０）にｘ座標情報が１〜２バイト、ｙ座標
情報が０バイトとなるので、ｙ座標情報が増加する。In this embodiment, as shown in FIG. 25, the grouping means 41 selects, for example, the Mincho font and the textbook font as the first and second patterns, and patterns close to the respective curves are selected. It is configured to encode the difference from the pattern. First, the Beze curve is determined at points P0 to P3 as shown in FIG. 26, and when the differences dxi and dyi (i = 0 to 2) from the previous points are encoded,
Normally, when dxi * dyi = 0, the horizontal line and the vertical line form a peculiar pattern. Therefore, if this peculiar pattern is not taken into consideration, the amount of information increases. That is, in the case of a horizontal line (dy0 = 0), the x coordinate information is 1 to 2 bytes and the y coordinate information is 0 bytes, so that the y coordinate information increases.

【００６２】また、曲線をパターン化するためには、任
意の曲線においてそのパターンが既知でなければならな
いので、本実施例では１バイトのパターン情報のビット
「０」〜「５」をパターン番号とし、ビット「６」，
「７」をｘｙ座標の第１〜第４象限の情報として用い
る。すなわち、ビット「０」〜「５」をパターン番号は
最大６４種類となる。したがって、図３３のステップＳ
１２０において説明するようにパターン統合化の第１段
階として曲線の特徴を踏まえてパターンの分類数を大き
くし、ついで、図３４に示すようにパターン統合化の第
２段階として最大パターン数（＝６４）以下となるよう
にパターンを統合化する。例えば図２７に示す単位パタ
ーンを用い、第１単位パターンをｄｘ０＞０，ｄｙ０＝
０とし、第２単位パターンをｄｘ１＞０，ｄｙ１＝０と
し、第３単位パターン群を１０個のパターン（ｄｙ２＜０，ｄｘ２＝０）（ｄｙ２＜０，ｄｘ２＞０，｜ｄｙ２｜＞ｄｘ２）（ｄｙ２＜０，ｄｘ２＞０，｜ｄｙ２｜＝ｄｘ２）（ｄｙ２＜０，ｄｘ２＞０，｜ｄｙ２｜＜ｄｘ２）（ｄｙ２＝０，ｄｘ２＞０）（ｄｙ２＞０，ｄｘ２＞０，ｄｙ２＜ｄｘ２）（ｄｙ２＞０，ｄｘ２＞０，ｄｙ２＝ｄｘ２）（ｄｙ２＞０，ｄｘ２＞０，ｄｙ２＞ｄｘ２）（ｄｙ２＞０，ｄｘ２＝０）（上記以外のｄｘ２，ｄｙ２）にグループ化する。Further, in order to pattern a curve, the pattern must be known in an arbitrary curve, so in the present embodiment, bits "0" to "5" of 1-byte pattern information are set as pattern numbers. , Bit “6”,
"7" is used as information of the first to fourth quadrants of the xy coordinates. That is, the maximum number of pattern numbers of the bits "0" to "5" is 64. Therefore, step S in FIG.
As described in 120, as the first step of pattern integration, the number of pattern classifications is increased in consideration of the characteristics of the curve, and then, as shown in FIG. 34, the maximum number of patterns (= 64) is set as the second step of pattern integration. ) Integrate the patterns so that: For example, using the unit pattern shown in FIG. 27, the first unit pattern is dx0> 0, dy0 =
0, the second unit pattern is dx1> 0, dy1 = 0, and the third unit pattern group is 10 patterns (dy2 <0, dx2 = 0) (dy2 <0, dx2> 0, | dy2 |> dx2 ) (Dy2 <0, dx2> 0, | dy2 | = dx2) (dy2 <0, dx2> 0, | dy2 | <dx2) (dy2 = 0, dx2> 0) (dy2> 0, dx2> 0, dy2 <Dx2) (dy2> 0, dx2> 0, dy2 = dx2) (dy2> 0, dx2> 0, dy2> dx2) (dy2> 0, dx2 = 0) (dx2, dy2 other than the above) ..

【００６３】また、書体で使用される曲線がグループに
存在し、かつ少数である場合には、グループ内の最大数
を有するパターンに統合する。この場合、少数の判定に
は片側検定（統計）を利用し、例えば分布に棄却域５％
のときに、少数判定の閾値ｔｈ、総パターン数Ｔとして
ｔｈ＝０．０５＊Ｔ／０．９５とする。そして、１）この閾値ｔｈ以上のパターンを登録し、２）１）を満足しないが少数のパターンが存在するとき
にグループ内の最大パターンを示すパターンを登録し、３）２）において登録数＞６４のときに２）を削除し、４）３）において登録数＞６４のときに１）を再統合す
る。If the curve used in the typeface exists in the group and is small in number, it is integrated into the pattern having the maximum number in the group. In this case, a one-sided test (statistics) is used for a small number of judgments, and for example, the rejection range is 5%.
At this time, the threshold value th for the small number determination and the total pattern number T are th = 0.05 * T / 0.95. Then, 1) register a pattern that is equal to or larger than the threshold th, 2) register a pattern that does not satisfy 1) but shows the maximum pattern in the group when a small number of patterns exist, and 3) register the number in 2)> 2) is deleted when 64, and 4) 1) is reintegrated when the number of registrations> 64 in 3).

【００６４】この単位パターンについて詳細に説明する
と、本実施例では図２８〜図３１に示すように、１曲線
を３個の単位パターンの集合としてある条件を設定し、
曲線を適度の数に分類する。例えば図２６に示す点Ｐ
０、Ｐ１から成る直線を第１単位パターンとして図２９
に示すように０°≦ａｎｇ＜９０°の第１象限にあると
仮定する。この理由は、第１単位パターンが図示ｆのよ
うに９０°≦ａｎｇ＜１８０°の第２象限にあるときに
ｙ軸対象位置とし、図示ｇのように１８０°≦ａｎｇ＜
°２７０の第３象限にあるときに原点対象位置とし、図
示ｈのように２７０°≦ａｎｇ＜°３６０の第４象限に
あるときにｘ軸対象位置とすることにより、配置情報に
より回転することができるからである。This unit pattern will be described in detail. In this embodiment, as shown in FIGS. 28 to 31, one curve is set as a set of three unit patterns, and a certain condition is set.
Classify the curve into a reasonable number. For example, point P shown in FIG.
The straight line consisting of 0 and P1 is used as the first unit pattern in FIG.
It is assumed that 0 ° ≦ ang <90 ° in the first quadrant as shown in FIG. The reason for this is that when the first unit pattern is in the second quadrant of 90 ° ≦ ang <180 ° as shown in the figure f, the y-axis target position is set, and 180 ° ≦ ang <as in the figure g.
Rotate according to the placement information by setting the origin target position when in the third quadrant of ° 270 and the x-axis target position when in the fourth quadrant of 270 ° ≦ ang <° 360 as shown in FIG. Because you can.

【００６５】つぎに、図２６に示す点Ｐ１，Ｐ２から成
る直線を第２単位パターンとしてこの存在場所を図３０
に示すように第１、第４象限に限定する。また、点Ｐ
２、Ｐ３から成る直線を第３単位パターンとし、この第
３単位パターンの存在予測範囲を第２単位パターンの角
度に依存させる。例えば図３１に示す太線が第２単位パ
ターンである場合、第３単位パターンは第２単位パター
ンの入射角度に対して±９０°の範囲に仮定する。Next, the straight line formed by the points P1 and P2 shown in FIG. 26 is used as the second unit pattern, and its existing location is shown in FIG.
It is limited to the first and fourth quadrants as shown in. Also, point P
A straight line composed of 2 and P3 is used as a third unit pattern, and the existence prediction range of this third unit pattern is made to depend on the angle of the second unit pattern. For example, when the thick line shown in FIG. 31 is the second unit pattern, it is assumed that the third unit pattern is within a range of ± 90 ° with respect to the incident angle of the second unit pattern.

【００６６】図３２は単位パターンの組み合わせ例を示
し、上記ルールによれば３６０種類程度に分類すること
ができ、この場合に上記仮定を満足しない曲線を除外パ
ターンとする。また、分類を行う場合にパターンの分類
数、隣接２点間距離の総和、角度の総和を保持する（図
３４のステップＳ１２１〜Ｓ１２８）。そして、１書体
において第１段階の終了後、該当パターン数が多いもの
からパターン化する。図３２において線ｉ、ｊが非常に
少数の場合、隣接する太線パターンに代表させる。この
処理をパターン数が６４以下になるように行って番号を
付し（第２段階）、最終パターンの決定後に各単位パタ
ーンの平均距離、平均角度すなわち極座標データを算出
する（図３４のステップ１２９〜Ｓ１３１）。ここで、
平均距離Ｌｉ、平均角度ａｎｇｉ、係数ｋとすると、単
位パターンの統計ｘｙ長ｄｘｉ',ｄｙｉ' は式（１１）
により求めることができる。FIG. 32 shows an example of a combination of unit patterns, which can be classified into about 360 types according to the above rule, and in this case, a curve that does not satisfy the above assumption is used as an exclusion pattern. Further, when the classification is performed, the number of pattern classifications, the sum of the distances between two adjacent points, and the sum of the angles are held (steps S121 to S128 in FIG. 34). Then, in the one typeface, after the completion of the first step, the pattern having the corresponding number of patterns is patterned. When the number of lines i and j is very small in FIG. 32, they are represented by adjacent thick line patterns. This process is performed so that the number of patterns is 64 or less and numbered (second step), and after the final pattern is determined, the average distance and average angle of each unit pattern, that is, polar coordinate data are calculated (step 129 in FIG. 34). ~ S131). here,
Assuming the average distance Li, the average angle angi, and the coefficient k, the statistical xy lengths dxi ′ and dyi ′ of the unit pattern are given by the equation (11).
Can be obtained by

【００６７】ｔａｎ^-1（ａｎｇｉ）＝ｄｙｉ’／ｄｘｉ’ …（１１）ｄｙｉ’＝ｋｄｘｉ’ …（１２）ｄｘｉ^'2＋ｄｙｉ^'2＝Ｌｉ² …（１３）したがって、上記処理により、（第１単位パターンの
ｘ，ｙ値＋第２単位パターンのｘ，ｙ値＋第３単位パタ
ーンのｘ，ｙ値）＊パターン数のパターンファイルを形
成することができる（ステップＳ１３２）。Tan ⁻¹ (angi) = dyi ′ / dxi ′ (11) dyi ′ = kdxi ′ (12) dxi ^'2 + dyi ^{' 2} = Li ² (13) Therefore, according to the above process, (first) It is possible to form a pattern file of (x, y values of unit pattern + x, y values of second unit pattern + x, y values of third unit pattern) * the number of patterns (step S132).

【００６８】つぎに、図３５を参照してオリジナルのソ
ースアウトラインフォントデータを展開する場合につい
て説明すると、まず、図２３に示すように曲線フォーマ
ットは、点Ｐ０が、１）ｍｏｖｏｔｏ２）ｃｕｒｖｅｔｏ３）ｌｉｎｅｔｏであり、点Ｐ１、Ｐ２が制御点であり、点Ｐ３がｃｕｒ
ｖｅｔｏであるので、曲線のみデータを抽出することは
容易である。Next, the case of expanding the original source outline font data will be described with reference to FIG. 35. First, as shown in FIG. 23, in the curve format, the point P0 is 1) motoro 2) curveto 3). lineto, points P1 and P2 are control points, and point P3 is cur
Since it is veto, it is easy to extract data only from the curve.

【００６９】書き換え処理はまず、オリジナル曲線を展
開して点Ｐ０が原点になるように曲線を平行移動し（ス
テップＳ１４１〜Ｓ１４５）、ｄｘ０，ｄｙ０から第１
単位パターンの角度を算出して配置情報を決定して第１
パターンが第１象限になるように回転し（ステップＳ１
４６）、第１〜第３単位パターンによりパターン番号を
決定する（ステップＳ１４７）。そして、オリジナルパ
ターンとのｘｙ値の差を算出し、点Ｐ１のデータの前に
パターン情報を付加し（ステップＳ１４８）、オリジナ
ルとの差分を符号化し（ステップＳ１４９）文字データ
の先頭を表すポインタやデータ長等を書き換える（ステ
ップＳ１５０）。この処理を各文字について行うと、フ
ァイルをクローズし（ステップＳ１５１）、新ファイル
に書き換える（ステップＳ１５２）。In the rewriting process, first, the original curve is expanded and the curve is translated so that the point P0 becomes the origin (steps S141 to S145), and the first from dx0, dy0.
First, the angle of the unit pattern is calculated to determine the placement information.
Rotate the pattern so that it is in the first quadrant (step S1
46), the pattern number is determined by the first to third unit patterns (step S147). Then, the difference between the xy value and the original pattern is calculated, the pattern information is added before the data of the point P1 (step S148), the difference from the original is encoded (step S149), a pointer indicating the beginning of the character data, The data length and the like are rewritten (step S150). When this process is performed for each character, the file is closed (step S151) and rewritten with a new file (step S152).

【００７０】また、図３６に示すように第３の実施例
（図２１，図２２）と同様に、階層ごとにグループ分け
して置換テーブルにより置換し（ステップＳ１６１、Ｓ
１６２）、可逆符号化することにより圧縮効率を向上す
ることができる。つぎに、図３７を参照して復号化プロ
セスを簡単に説明すると、文字番号が指定されると（ス
テップＳ１７１）、可逆復号化し（ステップＳ１７
２）、置換テーブルにより階層ごとに復号化した後（ス
テップＳ１７３〜Ｓ１７５）、付加情報を読み込み（ス
テップＳ１７６）、指定パターンを回転し（ステップＳ
１７７）、ソースデータのｘｙ座標値を求めることによ
り（ステップＳ１７８）、新座標値におけるアウトライ
ンフォントデータに展開する（ステップＳ１７９）。Further, as shown in FIG. 36, similarly to the third embodiment (FIGS. 21 and 22), the layers are grouped and replaced by the replacement table (steps S161, S).
162), the compression efficiency can be improved by the lossless encoding. Next, the decoding process will be briefly described with reference to FIG. 37. When a character number is designated (step S171), lossless decoding is performed (step S17).
2) After decoding for each layer by the replacement table (steps S173 to S175), the additional information is read (step S176) and the designated pattern is rotated (step S).
177), by obtaining the xy coordinate values of the source data (step S178), the data is expanded to outline font data at the new coordinate values (step S179).

【００７１】ここで、差分の算出方法は、基準の取り方
により２通りがある。すなわち、図３８に示すようにオ
リジナル曲線の開始点（細線）とパターン開始点（太
線）を一致させた後、図３８（ａ）に示すように全体の
パターンで扱う方法と、図３８（ｂ）に示すように各点
ごとに基準点を移動して単位パターンで扱う方法が考え
られる。また、１つのパターンは３つの固有の距離Ｌ０
〜Ｌ２を有するので、オリジナルと大きく異なるおそれ
があるが、オリジナルの第１単位パターンの距離との比
により第２、第３単位パターンを変倍することにより、
この問題点を解決することができる。また、第３の実施
例とこの第４の実施例を組み合わせることにより、圧縮
効率を更に向上することができる。Here, there are two methods of calculating the difference, depending on how the reference is taken. That is, as shown in FIG. 38, after the start point (thin line) of the original curve and the pattern start point (thick line) are made to coincide with each other, a method of handling the entire pattern as shown in FIG. It is conceivable to move the reference point for each point as shown in FIG. Further, one pattern has three unique distances L0.
Since it has ~ L2, it may differ greatly from the original, but by scaling the second and third unit patterns according to the ratio with the distance of the original first unit pattern,
This problem can be solved. In addition, the compression efficiency can be further improved by combining the third embodiment and the fourth embodiment.

【００７２】[0072]

【発明の効果】以上説明したように、請求項１記載の発
明は、文字の一部を共有化可能な図形のアウトラインフ
ォントデータを抽出する図形抽出手段と、前記図形抽出
手段により抽出された図形により原文字を復元するため
の情報を算出する復元情報算出手段と、前記図形抽出手
段により抽出された図形により構成される文字であるこ
とを示す情報を付加する情報付加手段と、前記図形抽出
手段により抽出された図形と、前記復元情報算出手段に
より算出された復元情報と、前記情報付加手段により付
加された情報を所定のフォーマットに構成して圧縮する
圧縮手段とを備えたので、文字を分解して圧縮すること
ができ、したがって、アウトラインフォントデータを保
持するためのメモリの容量を低減することができる。As described above, according to the first aspect of the invention, the figure extracting means for extracting the outline font data of the figure capable of sharing a part of the character, and the figure extracted by the figure extracting means. Restoration information calculating means for calculating information for restoring the original character by means of, information adding means for adding information indicating that the character is composed of the figure extracted by the figure extracting means, and the figure extracting means Since the graphic extracted by the above, the restoration information calculated by the restoration information calculation means, and the compression means for compressing the information added by the information adding means in a predetermined format are compressed, the character is decomposed. It is possible to reduce the amount of memory for holding the outline font data.

【００７３】請求項２記載の発明は、文字のアウトライ
ンフォントデータから１バイトを越えるデータを抽出す
る抽出手段と、文字のデータ幅に対する出現頻度の分布
の広がりが小さくなるようなＸ進数を予め算出するＸ進
数算出手段と、前記抽出手段により抽出されたデータを
前記Ｘ進数で圧縮する圧縮手段とを備えたので、１バイ
トを越える分のデータがこのＸ進数で圧縮され、したが
って、圧縮効率を向上させることができる。According to the second aspect of the invention, the extraction means for extracting the data exceeding 1 byte from the outline font data of the character, and the X-adic number which preliminarily calculates the spread of the distribution of the appearance frequency with respect to the data width of the character are calculated in advance. Since the X-adic number calculating means and the compressing means for compressing the data extracted by the extracting means with the X-adic number are provided, data exceeding 1 byte is compressed with the X-adic number, and therefore the compression efficiency is improved. Can be improved.

【００７４】請求項３記載の発明は、文字のアウトライ
ンフォントデータをその情報ごとに階層化する階層化手
段と、前記階層化手段により階層化された各階層のデー
タの出現頻度を算出する算出手段と、前記算出手段によ
り算出された各階層の出現頻度により階層別の圧縮用置
換テーブルを作成して階層毎に圧縮する圧縮手段とを備
えたので、この各階層の出現頻度により階層別の圧縮用
置換テーブルにより階層毎に圧縮され、したがって、圧
縮効率を向上させることができる。According to a third aspect of the present invention, a hierarchizing means for hierarchizing the character outline font data for each information, and a calculating means for calculating the appearance frequency of the data of each hierarchy hierarchized by the hierarchizing means. And a compression unit that creates a compression replacement table for each layer based on the appearance frequency of each layer calculated by the calculation unit and compresses each layer, compression based on the appearance frequency of each layer. The compression table is used to compress each layer, and thus the compression efficiency can be improved.

【００７５】請求項４記載の発明は、予めオリジナルの
アウトラインフォントデータの曲線データを抽出してア
ウトラインフォントデータをグループ化するグループ化
手段と、オリジナルのアウトラインフォントデータと前
記グループ化されたアウトラインフォントデータの差を
算出して圧縮する圧縮手段とを備えたので、圧縮効率を
向上させることができる。According to a fourth aspect of the invention, grouping means for extracting the curve data of the original outline font data in advance to group the outline font data, the original outline font data and the grouped outline font data. Since the compression means for calculating and compressing the difference is provided, the compression efficiency can be improved.

【００７６】請求項５記載の発明は、請求項１ないし４
の圧縮手段により圧縮された符号を更に可逆符号化によ
り圧縮する手段を備えたので、可逆復号化により元のデ
ータに復号することができる。The invention according to claim 5 is the invention according to claims 1 to 4.
Since the code compressed by the compression means is further provided by means of lossless encoding, the original data can be decoded by lossless decoding.

[Brief description of drawings]

【図１】本発明に係るアウトラインフォントデータの圧
縮装置の一実施例を示すブロック図である。FIG. 1 is a block diagram showing an embodiment of an outline font data compression apparatus according to the present invention.

【図２】文字の本体と冠を示す説明図である。FIG. 2 is an explanatory diagram showing a main body and a crown of a character.

【図３】アウトラインフォントデータのオフセットを示
す説明図である。FIG. 3 is an explanatory diagram showing an offset of outline font data.

【図４】復元動作の概略を示す説明図である。FIG. 4 is an explanatory diagram showing an outline of a restoration operation.

【図５】アウトラインフォントデータの全体構成を示す
説明図である。FIG. 5 is an explanatory diagram showing an overall configuration of outline font data.

【図６】アウトラインフォントデータの基点を示す説明
図である。FIG. 6 is an explanatory diagram showing base points of outline font data.

【図７】従来例と第１の実施例のアウトラインフォント
データの違いを示す説明図である。FIG. 7 is an explanatory diagram showing a difference in outline font data between the conventional example and the first embodiment.

【図８】第１の実施例の圧縮動作を説明するためのフロ
ーチャートである。FIG. 8 is a flow chart for explaining a compression operation of the first embodiment.

【図９】第１の実施例の圧縮動作を説明するためのフロ
ーチャートである。FIG. 9 is a flowchart for explaining a compression operation of the first embodiment.

【図１０】第１の実施例の復元動作を説明するためのフ
ローチャートである。FIG. 10 is a flow chart for explaining a restoration operation of the first embodiment.

【図１１】可逆符号化を示す説明図である。FIG. 11 is an explanatory diagram showing lossless encoding.

【図１２】従来例と第２の実施例におけるデータの違い
を示す説明図である。FIG. 12 is an explanatory diagram showing a difference in data between the conventional example and the second example.

【図１３】第２の実施例を示すブロック図である。FIG. 13 is a block diagram showing a second embodiment.

【図１４】データをバイト単位で示す説明図である。FIG. 14 is an explanatory diagram showing data in units of bytes.

【図１５】第２の実施例の圧縮動作を説明するためのフ
ローチャートである。FIG. 15 is a flowchart for explaining a compression operation of the second embodiment.

【図１６】第２の実施例の圧縮動作を説明するためのフ
ローチャートである。FIG. 16 is a flowchart for explaining a compression operation of the second embodiment.

【図１７】第２の実施例の復元動作を説明するためのフ
ローチャートである。FIG. 17 is a flow chart for explaining a restoring operation of the second embodiment.

【図１８】第３の実施例を示すブロック図である。FIG. 18 is a block diagram showing a third embodiment.

【図１９】アウトラインフォントデータの出現頻度を示
す説明図である。FIG. 19 is an explanatory diagram showing the appearance frequency of outline font data.

【図２０】フラグ階層の出現頻度を示す説明図である。FIG. 20 is an explanatory diagram showing an appearance frequency of a flag hierarchy.

【図２１】第３の実施例の圧縮動作を説明するためのフ
ローチャートである。FIG. 21 is a flow chart for explaining a compression operation of the third embodiment.

【図２２】第３の実施例の圧縮動作を説明するためのフ
ローチャートである。FIG. 22 is a flow chart for explaining the compression operation of the third embodiment.

【図２３】第３の実施例の復元動作を説明するためのフ
ローチャートである。FIG. 23 is a flow chart for explaining a restoration operation of the third embodiment.

【図２４】第１〜第３の実施例を組み合わせた例を示す
ブロック図である。FIG. 24 is a block diagram showing an example in which the first to third embodiments are combined.

【図２５】第４の実施例を示すブロック図であるFIG. 25 is a block diagram showing a fourth embodiment.

【図２６】ベゼー曲線を示す説明図である。FIG. 26 is an explanatory diagram showing a Beze curve.

【図２７】単位パターンを示す説明図であるFIG. 27 is an explanatory diagram showing a unit pattern.

【図２８】図２７の各単位パターンを示す説明図であ
る。28 is an explanatory diagram showing each unit pattern of FIG. 27. FIG.

【図２９】第１単位パターンを示す説明図である。FIG. 29 is an explanatory diagram showing a first unit pattern.

【図３０】第２単位パターンを示す説明図である。FIG. 30 is an explanatory diagram showing a second unit pattern.

【図３１】第３単位パターンを示す説明図である。FIG. 31 is an explanatory diagram showing a third unit pattern.

【図３２】第１〜第３単位パターンの組み合わせ例を示
す説明図である。FIG. 32 is an explanatory diagram showing an example of a combination of first to third unit patterns.

【図３３】第４の実施例におけるパターン統合化の第１
段階を説明するためのフローチャートである。FIG. 33 is a first diagram of pattern integration in the fourth embodiment.
6 is a flowchart for explaining steps.

【図３４】第４の実施例におけるパターン統合化の第２
段階とパターンファイル作成を説明するためのフローチ
ャートである。FIG. 34 is a second diagram of pattern integration in the fourth embodiment.
6 is a flowchart for explaining steps and pattern file creation.

【図３５】第４の実施例におけるソースフォントファイ
ルの書き換え動作を説明するためのフローチャートであ
る。FIG. 35 is a flow chart for explaining a source font file rewriting operation in the fourth embodiment.

【図３６】第４の実施例における圧縮動作を説明するた
めのフローチャートである。FIG. 36 is a flow chart for explaining a compression operation in the fourth embodiment.

【図３７】第４の実施例における復号化動作を説明する
ためのフローチャートである。FIG. 37 is a flow chart for explaining a decoding operation in the fourth embodiment.

【図３８】差分算出方法を示す説明図である。FIG. 38 is an explanatory diagram showing a difference calculation method.

【図３９】ベゼー曲線のポイントを示す説明図である。FIG. 39 is an explanatory diagram showing points of a Beze curve.

[Explanation of symbols]

１１共有化図形抽出手段１２復元情報算出手段１３情報付加手段１４圧縮手段１５可逆符号化手段２１データ抽出手段２２Ｘ進数算出手段２３Ｘ進数圧縮手段３１階層化手段３２出現頻度算出手段３３階層別圧縮手段４１曲線別グループ化手段４２誤差圧縮手段 11 shared figure extracting means 12 restoration information calculating means 13 information adding means 14 compression means 15 lossless encoding means 21 data extracting means 22 X-adic number calculating means 23 X-adic number compressing means 31 layering means 32 appearance frequency calculating means 33 layer-wise compression Means 41 Curve grouping means 42 Error compression means

Claims

[Claims]

1. A figure extracting means for extracting outline font data of a figure capable of sharing a part of a character, and restoration information for calculating information for restoring an original character by the figure extracted by the figure extracting means. Calculating means, information adding means for adding information indicating that the character is composed of the graphic extracted by the graphic extracting means, graphic extracted by the graphic extracting means, and calculated by the restoring information calculating means An outline font data compression device comprising: the restored information and a compression unit configured to compress the information added by the information adding unit into a predetermined format.

2. Extraction means for extracting data exceeding 1 byte from character outline font data, and X-adic number calculation means for pre-calculating an X-adic number such that the spread of the distribution of appearance frequency with respect to the character data width is reduced. An outline font data compression apparatus comprising: a compression unit that compresses the data extracted by the extraction unit using the X-adic number.

3. A hierarchizing means for hierarchizing the character outline font data for each information, a calculating means for calculating the appearance frequency of the data of each hierarchy hierarchized by the hierarchizing means, and the calculating means. An outline font data compression apparatus, comprising: a compression unit that creates a compression replacement table for each layer based on the calculated appearance frequency of each layer and compresses each layer.

4. A grouping means for extracting curve data of original outline font data in advance to group the outline font data, and calculating a difference between the original outline font data and the grouped outline font data. A compressor for compressing outline font data, comprising: compression means for compressing.

5. The outline font data compression apparatus according to claim 1, further comprising means for further compressing the code compressed by said compression means by reversible encoding.