JPH0221032B2

JPH0221032B2 -

Info

Publication number: JPH0221032B2
Application number: JP56047069A
Authority: JP
Inventors: Hideaki Sugawara; Eiichiro Yamamoto
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1981-03-30
Filing date: 1981-03-30
Publication date: 1990-05-11
Also published as: JPS57161987A

Description

【発明の詳細な説明】本発明は文字認識装置に係り特に多数のサンプ
ルから効率良く複数の辞書を作成する方式に関す
る。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a character recognition device, and particularly to a method for efficiently creating a plurality of dictionaries from a large number of samples.

通常文字認識装置において入力信号とのマツチ
ングを行なう辞書は、１カテゴリに対して１辞書
であつた。 Normally, in a character recognition device, there is one dictionary for each category for matching input signals.

又、１カテゴリ当たりの辞書を複数にし、より
正確な文字認識を行なう試みが成されているもの
の、各辞書内でのサンプル間に重複が生じる、各
構成サンプル数に著しい差がある等適切な辞書の
作成が困難であるばかりでなく、辞書作成に複雑
な手順を要するものであつた。 In addition, although attempts have been made to use multiple dictionaries per category to perform more accurate character recognition, there are cases where there are overlaps between samples within each dictionary, significant differences in the number of constituent samples, etc. Not only was it difficult to create a dictionary, but it also required complicated procedures.

本発明は、上記複数辞書作成に関し、新規な方
式を提供するものであり、以下これに詳細な説明
を加える。 The present invention provides a novel method for creating multiple dictionaries, which will be described in detail below.

第１図乃至第４図は複数辞書作成工程の１例で
10個のサンプルａ乃至ｊを用いてそのクラス分け
について示すものである。図中、各サンプル間の
２次元的距離は、それぞれ特徴を抽出した後相互
の距離マトリツクスを求め、それを模式的に表わ
している。即ち各サンプル間の距離が大きいもの
ほど、その特徴が異なるものである。 Figures 1 to 4 are examples of the process of creating multiple dictionaries.
The classification is shown using 10 samples a to j. In the figure, the two-dimensional distance between each sample is calculated by calculating a mutual distance matrix after extracting the respective features, and then schematically representing it. That is, the larger the distance between each sample, the more different the characteristics.

第１図は、従来１カテゴリに対して１つの辞書
を作成する場合のクラス分けである。単一の辞書
作成に関しては、全てのサンプルの平均Ａを求め
そこから辞書を作成するものである。これは、本
発明に関する複数辞書作成においても同様であ
り、それぞれのクラスの構成サンプルを選出した
後、該構成サンプルの平均から辞書を作成する。 FIG. 1 shows the classification when one dictionary is conventionally created for one category. Regarding the creation of a single dictionary, the average A of all samples is found and a dictionary is created from there. This also applies to the creation of multiple dictionaries according to the present invention; after selecting constituent samples of each class, a dictionary is created from the average of the constituent samples.

本発明では、Ｎ個のサンプルからＫ個の複数辞
書を作成するに際し、第１番目から、第｛Ｋ−Ｎ
＋（n₁＊Ｋ）｝番目の辞書についてはn₁＝｜Ｎ／Ｋ
｜個の、第｛Ｋ−Ｎ＋（n₁＊Ｋ）＋１｝番目から第
Ｋ番目の辞書については（n₁＋１）個の構成サン
プルを選出する。又、構成サンプルの選出はそれ
ぞれ各辞書の核となるサンプルS₁，S₂を定め該
S₁，S₂に近いサンプルごとにそれぞれ重複しない
ように行なう。即ち、初めに最も距離を隔てた２
つのサンプルを核として定め、該サンプルを核と
して各辞書の構成サンプルを選出した後は、既に
選出の終了したサンプルについてマスクをかけ、
次工程よりは、残されたサンプルのみをクラス分
けの対象とするものである。 In the present invention, when creating K plural dictionaries from N samples, from the first to {K-N
+(n ₁ *K)}th dictionary, n ₁ = |N/K
For the | {K-N+(n ₁ *K)+1}-th to K-th dictionaries, (n ₁ +1) constituent samples are selected. In addition, the selection of constituent samples involves determining samples S ₁ and S ₂ , which are the core of each dictionary.
This is done for each sample close to S ₁ and S ₂ so as not to overlap each other. That is, the two most distant initially
After selecting one sample as a core and selecting constituent samples of each dictionary using this sample as the core, mask the samples that have already been selected.
From the next step onwards, only the remaining samples will be classified.

上記の式において、Ｎ……１カテゴリにおけるサンプル数Ｋ……１カテゴリに対して作成する辞書の数 n₁……１辞書におけるサンプル数それぞれ示す。 In the above equation, N...Number of samples in one category K...Number of dictionaries created for one category _n1 ...Number of samples in one dictionary, respectively.

尚、上記Ｎ個のサンプルからＫ個の辞書パター
ンを作成する場合に、ＮがＫの整数倍であれば各
辞書パターンは全てＮ／Ｋ個（整数）のサンプル
を用いて複数辞書を作成すれば良いので、何等問
題はない。 In addition, when creating K dictionary patterns from the above N samples, if N is an integer multiple of K, multiple dictionaries should be created using N/K (integer) samples for each dictionary pattern. It's fine, so there's no problem.

一方、必ずＮがＫの整数倍であるということが
ない場合も考えられることから、そのような状態
が発生した時についてもなるべく各辞書パターン
を作成するサンプルの数を平均化したいとの希望
がある。 On the other hand, since there may be cases where N is not always an integral multiple of K, it is desirable to average the number of samples used to create each dictionary pattern even when such a situation occurs. be.

従つて、ＮがＫの整数倍でない時は、〔Ｎ／Ｋ〕
個のサンプルから作成される辞書パターンと、〔Ｎ／Ｋ〕＋１個のサンプルから作成される辞
書パターンの２種類をつくれば、各辞書パターン
を作成するサンプルの数は略平均化されている。 Therefore, when N is not an integral multiple of K, [N/K]
By creating two types of dictionary patterns: one created from N/K samples and one created from [N/K]+1 samples, the number of samples used to create each dictionary pattern is approximately averaged.

即ち、上記２つの式は、〔Ｎ／Ｋ〕個のサンプ
ルから作成する辞書パターンの数、及び〔Ｎ／
Ｋ〕＋１個のサンプルから作成する辞書パターン
の数を求めるためのものである。 That is, the above two equations represent the number of dictionary patterns to be created from [N/K] samples, and [N/K]
This is to find the number of dictionary patterns to be created from K]+1 samples.

第２図にＫ＝２、即ち、10個のサンプルから２
個の辞書を作成する場合について示す。ここで、
Ｎ＝10、Ｋ＝２であるためn₁＝｜10／２｜＝５と
なり、又Ｋ−Ｎ＋（n₁＊Ｋ）＝２−10＋５×２＝
２、より第１番目より第２番目の辞書（全ての辞
書）についてそれぞれ５個づつの構成サンプルを
選出する。 In Figure 2, K=2, that is, 2 out of 10 samples.
This example shows how to create dictionaries. here,
Since N=10 and K=2, n ₁ =|10/2|=5, and K-N+(n ₁ *K)=2-10+5×2=
2. Select five constituent samples for each of the first and second dictionaries (all dictionaries).

初めに、サンプルｘとｙとの距離について入力
文字をビデオ信号に変換し、入力ビデオから種々
の特徴を抽出することにより以下のように設定す
る。 First, the distance between samples x and y is set as follows by converting an input character into a video signal and extracting various features from the input video.

ｄ（ｘ，ｙ）＝_n 〓^l=1 ｜x^l−y^l｜（但しｌはｍから成る次元数）この様にして10個のサンプルについて、それぞ
れのサンプル間距離を求め距離マトリツクスを形
成したものである。 d(x,y)= _n 〓 ^l=1 ｜x ^l −y ^l ｜ (where l is the number of dimensions consisting of m) In this way, calculate the inter-sample distance for each of the 10 samples and form a distance matrix. This is what I did.

第２図において、最大サンプル間距離を有する
ものは、ａ−ｃ間であるため、ここでは辞書の核
としてａとｃを選ぶ。次いで、ａ又はｃのどちら
かから、該サンプルに最も近接するサンプル、即
ち、サンプル間距離の小さいものをそれぞれ５個
づつ選出する。 In FIG. 2, since the distance between a and c has the maximum distance between samples, a and c are selected here as the core of the dictionary. Next, from either a or c, five samples are each selected that are closest to the sample, that is, those with a small inter-sample distance.

例えば、先にａを核とする辞書を作成する場
合、ａに関してサンプル間距離の小さいｈ，ｂ，
ｆ，ｇをそれぞれ選出する。又、ここでは作成す
る辞書数が２であるため、前記ａを核とする辞書
の構成サンプルを選出した後、残存するサンプル
を全て他のもう１つの辞書の構成サンプルとすれ
ば良い。 For example, when first creating a dictionary with a as the core, h, b, which has a small distance between samples with respect to a,
Select f and g, respectively. In addition, since the number of dictionaries to be created here is two, after selecting the constituent samples of the dictionary with the above-mentioned a as the core, all remaining samples may be used as constituent samples of another dictionary.

各辞書の構成サンプルを選出した後は、従来の
辞書作成の手段により、各々の平均をとり、各辞
書を作成する。かかる工程は、単に従来の技術を
適用したに過ぎないため以下の説明においては省
略する。 After selecting constituent samples of each dictionary, each dictionary is created by taking the average of each using conventional dictionary creation means. Since such a step is merely an application of a conventional technique, it will be omitted in the following description.

第３図に10個のサンプルから３個の複数辞書を
作成する場合について示す。前記第２図と同様の
工程と同様に第１番目の辞書作成のためのクラス
k₁をサンプルａを核として、第２番目のクラスk₂
をサンプルｃを核としてそれぞれ構成サンプルを
選出した後、残存するサンプルから第３番目のク
ラスk₃を作成する。 FIG. 3 shows the case where three multiple dictionaries are created from ten samples. A class for creating the first dictionary in the same process as in Figure 2 above.
k ₁ with sample a as the core, second class k ₂
After selecting constituent samples using sample c as a core, a third class _k3 is created from the remaining samples.

ここでは、n₁＝｜10／３｜＝３，Ｋ−Ｎ＋（n₁
＊Ｋ）＝２より、k₁，k₂についてはそれぞれ３個
づつの構成サンプルを、k₃については（n₁＋１）
（＝４）個の構成サンプルを選出する。 Here, n ₁ = | 10/3 | = 3, K-N + (n ₁
*K) = 2, for k ₁ and k ₂ , 3 configuration samples each, for k ₃ , (n ₁ + 1)
(=4) configuration samples are selected.

第４図は、10個のサンプルから４個の複数辞書
を作成する場合について示している。 FIG. 4 shows a case where four plural dictionaries are created from ten samples.

前記第１図、第２図での説明と同様に核ａ，ｃ
を選出し、クラスk₁，k₂をそれぞれ作成する。次
いで、核クラスk₁，k₂の構成サンプルであるａ，
ｈ，ｊ，ｃにマスクをする。 Nuclei a and c are similar to the explanation in FIGS. 1 and 2 above.
, and create classes k ₁ and k ₂ respectively. Next, a, which is a sample of the core classes k ₁ and k ₂ ,
Mask h, j, and c.

尚ここでn₁＝２，Ｋ−Ｎ＋（n₁＊Ｋ）＝２のため
第１番目のクラスk₁、第２番目のクラスk₂の構成
サンプル数はそれぞれ２個、第３番目のクラス
k₃、第４番目のクラスk₄の構成サンプル数はそれ
ぞれ（n₁＋１＝）３個とする。 Here, since n ₁ = 2, K - N + (n ₁ * K) = 2, the number of constituent samples for the first class k ₁ and the second class k ₂ is 2 each, and the number of samples for the third class is 2.
The number of constituent samples of k ₃ and the fourth class k ₄ are each (n ₁ +1=) 3.

前記クラスk₁，k₂の構成サンプルにマスクをし
た後前記と同様の工程により残存する。ｂ，ｅ，
ｄ，ｆ，ｇ，ｉ，のサンプルを対象としてクラス
分けを行なう。ここで、最もサンプル間距離の大
きいものはｇ，ｉであるため、これらをクラス分
けの核とする。 After the constituent samples of the classes k ₁ and k ₂ are masked, they remain in the same process as above. b, e,
Classification is performed for samples d, f, g, and i. Here, since g and i have the largest inter-sample distance, these are used as the core of classification.

前述のように、第３番目及び第４番目の辞書作
成のための構成サンプル数は３個であるため例え
ば第３番目の核としてｉを選びサンプルｂ，ｆ，
ｉから成るクラスk₃を形成する。従つてクラスk₄
はサンプルｄ，ｅ，ｇから構成されるものとな
る。 As mentioned above, the number of constituent samples for creating the third and fourth dictionaries is three, so for example, i is selected as the third kernel and samples b, f,
Form a class k ₃ consisting of i. So class k ₄
is composed of samples d, e, and g.

又、ここで第３番目の核としてサンプルｇを選
ぶことも可能である。サンプルｇを第３番目の核
とした場合にはそのクラスの構成はｂ，ｅ，ｇと
なり、第４番目のクラスは残存するｄ，ｆ，ｉか
ら構成されることになる。かかる構成サンプルの
相違は、最大のサンプル間距離を有する２つのサ
ンプルのうち、どちらを先に核として構成サンプ
ルを決定するかにより生じるものであるが、本発
明の適用に於いてはそれぞれのサンプル間距離の
和が小さくなるように、即ち、近接するサンプル
ごとで各クラスを構成することが好ましい。 It is also possible to select sample g as the third nucleus here. If sample g is used as the third nucleus, its class will consist of b, e, and g, and the fourth class will consist of the remaining d, f, and i. Such a difference in the constituent samples occurs depending on which of the two samples with the largest inter-sample distance is used as the core to determine the constituent samples first, but in applying the present invention, each sample It is preferable to configure each class so that the sum of the distances between them is small, that is, each class is composed of adjacent samples.

以上本発明による複数辞書作成は、第５図にそ
の概要を示すように入力文字をビデオ信号に変換
する観測部からの信号により、その特徴を抽出す
る特徴抽出部と、該特徴から各サンプル間の距離
を前述の手段により算出する距離マトリツクス生
成部と、各々の辞書の核を検出する検出部と、該
核となるサンプルから各々の辞書を構成するサン
プルを検出する構成サンプル検出部と、、前記核
となるサンプルに関する距離マトリツクスにマス
クをかけるマスクド距離マトリツクス生成部と、
各々のクラスに分類された構成サンプルの特徴の
平均からの複数の辞書を作成する辞書作成部を具
備して成る複数辞書作成装置により行なわれる。 As described above, the creation of multiple dictionaries according to the present invention consists of a feature extraction section that extracts the characteristics of input characters based on a signal from an observation section that converts input characters into a video signal, and an interval between each sample based on the characteristics. a distance matrix generation unit that calculates the distance of each dictionary by the above-mentioned means; a detection unit that detects the core of each dictionary; and a constituent sample detection unit that detects the samples that constitute each dictionary from the core sample. a masked distance matrix generation unit that masks the distance matrix regarding the core sample;
This is performed by a multiple dictionary creation device comprising a dictionary creation unit that creates a plurality of dictionaries from the average of the features of constituent samples classified into each class.

以下第６図を参照して各部の基本的動作原理に
ついて説明をする。 The basic operating principle of each part will be explained below with reference to FIG.

図中１は距離マトリツクス、２はセレクタ部、
３は第１検出部、４は第２検出部、５は第１レジ
スタ、６は構成サンプル検出部、７はマスクド距
離マトリツクス生成部、８はマスクド距離マトリ
ツクス、９は第２レジスタをそれぞれ表わしてい
る。又、C_L１〜C_L３はそれぞれクロツク信号の
入力を示すものである。 In the figure, 1 is the distance matrix, 2 is the selector section,
3 represents a first detection unit, 4 represents a second detection unit, 5 represents a first register, 6 represents a configuration sample detection unit, 7 represents a masked distance matrix generation unit, 8 represents a masked distance matrix, and 9 represents a second register. There is. Further, _CL1 to _CL3 each indicate the input of a clock signal.

前述の如く観測部でビデオ信号に変換された入
力文字の特徴を特徴抽出部において抽出し、距離
マトリツクス１を形成する。該工程は、従来の単
数辞書作成装置においても行なわれていたもので
あり、その手段は特に問わない。 As described above, the features of the input characters converted into video signals by the observation unit are extracted by the feature extraction unit to form a distance matrix 1. This step has been carried out in conventional singular dictionary creation devices, and the method used is not particularly limited.

次いで、セレクタ部２では作成する辞書が第１
番目のそれである場合には前記距離マトリツクス
１からの信号を、第２番目以降の場合にはマスク
ド距離マトリツクス８からの信号を選出する。こ
こで、該選出を行なわせる手段としては、上記セ
レクタ部２に現在何番目の辞書を作成しているか
を確認するカウンタｋを内蔵若しくは接続すれば
良い。 Next, the selector unit 2 selects the first dictionary to be created.
If it is the second one, the signal from the distance matrix 1 is selected, and if it is the second one or more, the signal from the masked distance matrix 8 is selected. Here, as a means for making the selection, a counter k for checking the number of the dictionary currently being created may be built into or connected to the selector section 2.

第１検出部３では作成する辞書が奇数番目のそ
れである（カウンタｋの値が奇数）場合、即ち２
回の入力のうち１回のみクロツク２（C_L２以下
同様とする）で動作し、距離マトリツクス１若し
くはマスクド距離マトリツクス８内の最大サンプ
ル間距離を求め核となるサンプルS₁，S₂の番号を
検出する。例えば第６図に示す例ではｃ―ｆ間の
距離が56で最大となるため辞書作成の核としてサ
ンプルｃとｆを検出する。 In the first detection unit 3, if the dictionary to be created is an odd numbered dictionary (the value of the counter k is an odd number), that is, 2
It operates at clock 2 (the same applies for C _L 2 and below) only once among the input times, and calculates the maximum inter-sample distance in distance matrix 1 or masked distance matrix 8, and calculates the numbers of core samples S ₁ and S ₂ . Detect. For example, in the example shown in FIG. 6, the distance between c and f is maximum at 56, so samples c and f are detected as the core for dictionary creation.

第１検出部での検出が終了するとC_L３はONに
なり第１検出部からの信号を第２検出部でとり込
み、核となるサンプルと他のサンプルとの距離を
第１レジスタ５へ転送する。ここではサンプルｃ
と他のサンプルとの距離を第１レジスタ５へ転送
している。 When the detection at the first detection section is completed, C _L 3 is turned ON, the signal from the first detection section is taken in by the second detection section, and the distance between the core sample and other samples is stored in the first register 5. Forward. Here sample c
The distance between the sample and the other sample is transferred to the first register 5.

構成サンプル検出部６は、前記第１レジスタ５
のデータのうち第１番目から第｛Ｋ−Ｎ＋（n₁＊
Ｋ）｝番目の辞書作成においてはn₁（＝｜Ｎ／Ｋ
｜）個のサンプルを（ｋ＝１〜｛Ｋ−Ｎ＋（n₁＊
Ｋ）｝の場合）、第｛Ｋ−Ｎ＋（n₁＊Ｋ）＋１｝番目
から第Ｋ番目の辞書作成においては（n₁＋１）個
のサンプルをそれぞれ距離の小さいものから検出
する。 The configuration sample detection unit 6 includes the first register 5
The first to {K-N+(n ₁ *
K)}-th dictionary creation, n ₁ (=|N/K
|) samples (k=1~{K-N+(n ₁ *
In the case of {K-N+(n ₁ *K)+1}-th to K-th dictionary creation, (n ₁ +1) samples are detected from the one with the smallest distance.

ここでは、Ｎ＝７，Ｋ＝３としており、又ｋ＝
１の場合であるためn₁＝｜７／３｜＝２で、小さ
いものから２個サンプルを検出する。検出された
サンプルはｂとｃである。 Here, N=7, K=3, and k=
Since this is the case of 1, n ₁ =|7/3|=2, and two samples from the smallest one are detected. The detected samples are b and c.

この結果は第２レジスタ９へ転送され、保持さ
れる。 This result is transferred to the second register 9 and held there.

マスクド距離マトリツクス生成部７は先に検出
したサンプルに関する距離マトリツクスに対して
Ｏのマスクをかける。即ち、同じサンプルを重複
して作成する辞書の構成サンプルとしないためで
ある。 The masked distance matrix generation unit 7 applies a mask of O to the distance matrix regarding the previously detected sample. That is, this is to prevent the same sample from being used as a constituent sample of a dictionary that is created repeatedly.

該マスクド距離マトリツクス８のデータは前記
セレクタ部２へと転送される。 The data of the masked distance matrix 8 is transferred to the selector section 2.

その後、第２レジスタ９のデータは辞書生成部
（図示せず）へ送られその内容に従つて辞書を作
成する。 Thereafter, the data in the second register 9 is sent to a dictionary generation section (not shown) to create a dictionary according to its contents.

以上実施例により明確となつたように本発明に
よれば適切な複数辞書作成を単純に、短時間で行
なうことが可能である。 As has become clear from the above embodiments, according to the present invention, it is possible to create appropriate multiple dictionaries simply and in a short time.

又、上記実施例に於いては第１番目から第｛Ｋ
−Ｎ＋（n₁＊Ｋ）｝番目の辞書についてはn₁個の構
成サンプルを、第｛Ｋ−Ｎ＋（n₁＊Ｋ）｝番目から
第Ｋ番目の辞書については（n₁＋１）個の構成サ
ンプルを選出していたが、本発明の適用はこれに
限るものではない。 Further, in the above embodiment, from the first to the {K
−N+(n ₁ *K)}th dictionary, use n ₁ configuration samples, and for {K−N+(n ₁ *K)}th to Kth dictionaries, use (n ₁ +1) samples. Although a configuration sample was selected, the application of the present invention is not limited to this.

即ち、第１番目から第｛Ｎ−（n₁＊Ｋ）｝番目の
辞書については（n₁＋１）個の、第｛Ｎ−（n₁＊
Ｋ）＋１｝番目から第Ｋ番目の辞書についてはn₁
個の構成サンプルを選出する等、任意の｛Ｎ−
（n₁＊Ｋ）｝個の辞書において（n₁＋１）個の構成
サンプルを選出することが可能である。 That is, for the first to {N-(n ₁ *K)}-th dictionaries, there are (n ₁ +1) {N-(n ₁ *K)}-th dictionaries.
n ₁ for the Kth dictionary from
For example, select {N−
It is possible to select (n ₁ +1) constituent samples in (n ₁ *K)} dictionaries.

[Brief explanation of drawings]

第１図は従来の単数辞書作成におけるクラス分
けを、第２図乃至第４図は本発明の複数辞書作成
におけるクラス分けをそれぞれ示している。又、
第５図は、本発発明による辞書作成方式の主たる
構成を、第６図はその詳細をそれぞれ示してい
る。図中１は距離マトリツクス、２はセレクタ部、
３は第１検出部、４は第２検出部、５は第１レジ
スタ、６は構成サンプル検出部、７はマスクド距
離マトリツクス生成部、８はマスクド距離マトリ
ツクス、９は第２レジスタである。 FIG. 1 shows the classification in the conventional creation of a single dictionary, and FIGS. 2 to 4 show the classification in the creation of a plurality of dictionaries according to the present invention. or,
FIG. 5 shows the main structure of the dictionary creation method according to the present invention, and FIG. 6 shows its details. In the figure, 1 is the distance matrix, 2 is the selector section,
3 is a first detection section, 4 is a second detection section, 5 is a first register, 6 is a configuration sample detection section, 7 is a masked distance matrix generation section, 8 is a masked distance matrix, and 9 is a second register.

Claims

[Claims] 1. In the step of creating multiple dictionaries of a character recognition device,
A feature extraction unit that extracts the characteristics of each of the N dictionary creation character samples from the input video; a distance matrix generation unit that calculates the distance between each sample from the features; and a distance matrix generation unit that calculates the distance between each sample based on the number of the dictionary to be created. a selector section that selects a signal from the sensor section, a detection section that selects a sample S separated by the maximum distance from the signal from the sensor section and a distance between the sample S and other samples, and a detection section that selects a signal from the sensor section. a constituent sample detection unit that detects a specific number of samples starting from those having a small distance between the samples; a dictionary generation unit that generates a dictionary by calculating the average of the features of the constituent samples from the output of the constituent sample detection unit; 1. A plurality of dictionary creation device comprising: a masked distance matrix generation unit that masks signals related to sample S among signals from a selector unit to generate a masked distance matrix and transfers the masked distance matrix to the selector unit. 2. When creating K multiple dictionaries, the configuration sample detection unit selects the first to {K-N+
(n ₁ *K)}th dictionary is n ₁ (=|N/K
|) configuration samples as {K−N+(n ₁ *K)
+1}th to Kth dictionaries are (n ₁ +
1) A plurality of dictionary creation device according to claim 1, wherein each of the configuration samples is selected.