JPS592192A - Character recognizing system - Google Patents

Character recognizing system

Info

Publication number
JPS592192A
JPS592192A JP57112215A JP11221582A
Authority
JP
Japan
Prior art keywords
character
signal
contraction
standard
expansion
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP57112215A
Other languages
Japanese (ja)
Inventor
Atsushi Tsukumo
津雲 淳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Nippon Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp, Nippon Electric Co Ltd filed Critical NEC Corp
Priority to JP57112215A priority Critical patent/JPS592192A/en
Publication of JPS592192A publication Critical patent/JPS592192A/en
Pending legal-status Critical Current

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/75Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
    • G06V10/751Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
    • G06V10/7515Shifting the patterns to accommodate for positional errors

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Character Discrimination (AREA)

Abstract

PURPOSE: To enable character recognition by matching that absorbs two-dimensional expansion and contraction, by first performing horizontal one-directional expansion/contraction normalization based on the projection information of a character pattern onto the horizontal axis, and then performing expansion/contraction matching in the vertical direction.

CONSTITUTION: An input pattern is read from an input character pattern memory 1 as a signal 101, and its projection information histogram on the horizontal axis is obtained and output as a projection information signal 102. A mapping function generating device 3 obtains a mapping function between the projection information signal 102 and a standard projection information signal 104 and outputs it as a mapping function signal 103. A two-dimensional expansion and contraction matching device 5 reads the signal 101, performs one-directional expansion/contraction normalization using the mapping function of each character type to be read, read in as the signal 103, reads the standard character pattern signal 106 of the character type corresponding to that mapping function from a standard character pattern memory 6, performs expansion/contraction matching in the vertical direction to obtain the difference, and outputs the difference between the input character pattern and each character type to be read as a signal 105. A discriminating device 7 reads the signal 105 and outputs, for example, the character with the least difference as the recognition result, as a signal 107.

Description

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to a method for recognizing characters composed of many strokes, such as kanji, hiragana, katakana, and alphanumeric characters.

The development of optical character recognition technology has been remarkable in recent years. Systems that recognize alphanumeric characters, both handwritten and printed, have been commercialized and put into practical use. For systems that recognize Japanese characters, including kanji and hiragana, prototype machines have already been announced, although they are limited to a single printed font.

To recognize handwritten characters such as kanji, hiragana, katakana, and alphanumeric characters, approaches have been tried that extend handwritten alphanumeric recognition methods to handwritten kanji, or that start from printed kanji recognition, but nothing effective has yet been achieved. One reason is that kanji have more complex shapes than alphanumeric characters, which makes it difficult to select feature information; another is that their many deformations make it difficult to obtain stable feature information.

Meanwhile, experiments combining a blurring effect with the template matching method have been attempted on small amounts of data, but at present good results have not been obtained.

The variations in handwritten characters are considered to have two causes: (1) positional deviation (expansion and contraction) and (2) rotation. Because the strokes making up a handwritten character undergo variations (1) and (2) independently of one another, the character pattern as a whole becomes distorted.

Regarding the positional deviation of (1), the basic cause is that, since a character pattern is two-dimensional information, positional deviation occurs in two directions. By contrast, in speech recognition processing the input is essentially one-dimensional information along the time axis, so the DP matching method is used to perform expansion/contraction matching along the time axis and thereby solve the positional-deviation problem (for example, Japanese Patent Publication No. 49-45943, "Pattern Similarity Calculation Device").

The present invention provides a high-accuracy character recognition method that realizes, in a pseudo two-dimensional manner, expansion/contraction matching along two different directions, and thereby absorbs the two-dimensional positional deviations that have conventionally been considered difficult to handle.

The present invention is explained in detail below with reference to the drawings, taking the horizontal and vertical directions as an example of the two different directions.

These directions are chosen because they are easy to explain and are the pair most frequently adopted when handling two-dimensional patterns; the same effect can be obtained with any other pair of different directions.

FIG. 1 is a diagram for intuitively explaining two-dimensional expansion/contraction matching. (a) is a standard character pattern, (b) is an input character pattern, (c) is a one-direction normalized character pattern obtained by expanding and contracting the input character pattern (b) in the horizontal direction so that the expansion/contraction matching between the projection of (b) onto the horizontal axis and the projection of the standard character pattern (a) onto the horizontal axis becomes optimal, and (d) is a two-direction normalized character pattern obtained by expanding and contracting (c) in the vertical direction so that the vertical expansion/contraction matching between the one-direction normalized character pattern (c) and the standard character pattern (a) becomes optimal. When matching the standard character pattern (a) with the input character pattern (b), the present invention in effect performs the matching between the standard character pattern (a) and the two-direction normalized character pattern (d); as a result, characters can be recognized while absorbing two-dimensional positional deviations of the strokes.

FIGS. 2(a) to 2(i) are diagrams for explaining the means of realizing two-dimensional expansion/contraction matching and its effect. In FIG. 2(a), 21 is a standard character pattern; in FIG. 2(b), 22 is an input character pattern; in FIG. 2(c), 23 is the standard projection information of the standard character pattern 21 onto the horizontal axis; in FIG. 2(d), 24 is the input projection information of the input character pattern 22 onto the horizontal axis; FIG. 2(e) shows the standard projection information 23 and the input projection information 24 being matched by expansion and contraction in the horizontal direction to obtain a mapping function 25; FIG. 2(f) shows the mapping function 25 being used to obtain a one-direction expansion/contraction normalized pattern 26 of the input character pattern 22 in the horizontal direction; and FIG. 2(g) shows the one-direction normalized pattern 26 and the standard character pattern 21 being matched by expansion and contraction in the vertical direction, the mapping function in that case being C.

In the above explanation, the character patterns used for expansion/contraction matching consist of an M x N matrix; if M is the number of pixels in the horizontal direction and N is the number of pixels in the vertical direction, a pattern is regarded as a sequence of N M-dimensional vectors, and its projection onto the horizontal axis is described as a one-dimensional vector, that is, a sequence of M scalar values. The horizontal one-direction expansion/contraction normalization of the input character pattern (b) shown in (f) normalizes each of the N M-dimensional vectors in turn.
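By way of illustration only (this sketch is not part of the original specification), the projection onto the horizontal axis might be computed as follows in Python; the list-of-rows layout and the function name are assumptions of the sketch.

    def horizontal_projection(pattern):
        """Project a binary N x M character pattern (a list of N rows, each a
        list of M zeros and ones) onto the horizontal axis: the count of black
        pixels in each of the M columns, i.e. a sequence of M scalar values."""
        n_cols = len(pattern[0])
        return [sum(row[col] for row in pattern) for col in range(n_cols)]

    # Toy 4 x 6 pattern (N = 4 rows, M = 6 columns).
    pattern = [
        [0, 1, 1, 0, 0, 0],
        [0, 1, 0, 1, 0, 0],
        [0, 1, 0, 1, 0, 0],
        [0, 1, 1, 0, 0, 0],
    ]
    print(horizontal_projection(pattern))  # -> [0, 4, 2, 2, 0, 0]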

As the above explanation shows, the present invention attempts to realize two-dimensional expansion/contraction matching by performing one-direction expansion/contraction matching twice.

FIG. 2(h) is a diagram showing the effect of the horizontal one-direction expansion/contraction normalization. In the figure, the black portions of 21 and 22 are the portions brought into register by vertical expansion and contraction, while the white portions are the portions that remain unmatched even after vertical expansion and contraction. Comparing FIG. 2(h) with FIG. 2(g) shows the effect of applying the horizontal one-direction expansion/contraction normalization to the input character pattern 22 before the vertical expansion/contraction matching.

FIG. 2(i) is a diagram for explaining the case where the two-dimensional pattern information itself, rather than the projection information, is used for the horizontal one-direction expansion/contraction normalization. In the figure, the black portions of 21 and 22 are the portions brought into register by horizontal expansion and contraction, and the white portions are the portions that remain unmatched even after horizontal expansion and contraction; it can be seen that this matching is very unstable with respect to stroke positional deviations. Since the horizontal one-direction normalization greatly affects the accuracy of the subsequent vertical expansion/contraction matching, a stable match is needed to obtain the mapping function. It is therefore necessary to use the projection information, which absorbs stroke positional deviations even though some information about the character pattern is lost.

FIGS. 3(a), (b), (c), and (d) are diagrams for explaining, as an example of one-direction expansion/contraction matching, the DP matching method used in speech recognition.

Assume that the standard pattern A0 consists of a sequence of M-dimensional vectors A01, A02, ..., A0N, and that the input pattern A consists of a sequence of M-dimensional vectors A1, A2, ..., AN. Let d(i, j) denote the distance between an arbitrary vector Ai of the input pattern and an arbitrary vector A0j of the standard pattern. With simple matching, the dissimilarity D(A, A0) between the input pattern A and the standard pattern A0 is obtained, for example, by the following formula.
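The formula itself is lost in the OCR of the original text; based on the following paragraph, which states that simple matching pairs Ai with A0i along the mapping function j = i, a plausible reconstruction (an assumption, not a quotation) is

    D(A, A_0) = \sum_{i=1}^{N} d(i, i)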

This formula, shown in FIG. 3(a), pairs Ai with A0i along the mapping function j = i to obtain the dissimilarity between the two patterns. If, instead, Ai can be paired with A0j along the mapping function j = ψ(i) shown in the same figure, the input pattern A can be partially expanded and contracted to match the standard pattern A0 when obtaining the dissimilarity between the two patterns.

The DP matching method is a technique for matching by partially expanding and contracting the input pattern. For example, in FIG. 3(b), by obtaining g(N, N) from the initial value and recurrence formula below, Ai and A0j can be paired and matched along the mapping function j = ψ(i).

    g(1, 1) = d(1, 1)
    g(i, j) = d(i, j) + min( g(i-1, j), g(i-1, j-1), g(i-1, j-2) )

where d(i, j) = ∞ when i ≤ 0 or j ≤ 0.

FIG. 3(c) is a diagram showing an example of DP matching using the above recurrence formula. The input pattern is a sequence of five one-dimensional vectors, that is, scalar values (1, 2, 4, 5, 5), and the standard pattern is likewise a sequence of five values (1, 2, 3, 4, 5); expansion/contraction matching is performed along the mapping function in which (i, j) takes the values (1, 1), (2, 2), (3, 4), (4, 5), (5, 5).

FIG. 3(d) shows that, to reduce the amount of computation of the above recurrence, the recurrence is evaluated only within the range i − Δ ≤ j ≤ i + Δ. In the DP matching method this range is generally called the matching window, and it is used in practice to make the computation more efficient.
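As a sketch only (not the patent's circuit; the function and variable names are assumptions), the recurrence and the matching window might be coded as follows, with the absolute difference standing in for d(i, j) on scalar projection values:

    INF = float("inf")

    def dp_tables(a, a0, window=None, dist=None):
        """Fill the DP matching tables for input sequence `a` and standard
        sequence `a0`.  g[i][j] is the accumulated distance of the recurrence
        above; h[i][j] records which of j, j-1, j-2 was chosen at step i, for
        later backtracking.  The optional band |i - j| <= window is the
        matching window.  `dist` defaults to the absolute difference, which
        suits scalar projection values."""
        if dist is None:
            dist = lambda u, v: abs(u - v)
        n = len(a)
        assert len(a0) == n, "both sequences are assumed to have length N"
        d = lambda i, j: dist(a[i - 1], a0[j - 1])

        g = [[INF] * (n + 1) for _ in range(n + 1)]
        h = [[0] * (n + 1) for _ in range(n + 1)]
        g[1][1] = d(1, 1)
        for i in range(2, n + 1):
            lo = 1 if window is None else max(1, i - window)
            hi = n if window is None else min(n, i + window)
            for j in range(lo, hi + 1):
                # d(i, j) = infinity for i <= 0 or j <= 0 is handled by
                # simply skipping non-positive predecessor columns.
                candidates = [(g[i - 1][jj], jj) for jj in (j, j - 1, j - 2) if jj >= 1]
                best, h[i][j] = min(candidates)
                g[i][j] = d(i, j) + best
        return g, h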

The above recurrence formula by itself only yields the dissimilarity. However, if the function h(i, j) = j* is recorded, where j* is whichever of j, j-1, j-2 attains

    min( g(i-1, j), g(i-1, j-1), g(i-1, j-2) ) = g(i-1, j*),

then after the dissimilarity has been obtained the mapping function can be recovered by tracing the values of h in turn from h(N, N) back to h(1, 1). For example, in the case of FIG. 3(c), h(5, 5) = 5, h(4, 5) = 4, h(3, 4) = 2, and h(2, 2) = 1, so the mapping function (i, j) is obtained as (1, 1), (2, 2), (3, 4), (4, 5), (5, 5).
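Continuing the sketch above, the backtracking of h from h(N, N) down to h(1, 1) might be written as follows; the usage reproduces the FIG. 3(c) example (the dp_tables helper and its names remain assumptions of the sketch).

    def mapping_function(h, n):
        """Trace h(N, N) back to h(1, 1) to recover the mapping j = psi(i),
        returned as the list of matched pairs (i, j)."""
        pairs = [(n, n)]
        j = n
        for i in range(n, 1, -1):
            j = h[i][j]
            pairs.append((i - 1, j))
        return list(reversed(pairs))

    # FIG. 3(c) example: input (1, 2, 4, 5, 5) against standard (1, 2, 3, 4, 5).
    g, h = dp_tables([1, 2, 4, 5, 5], [1, 2, 3, 4, 5])
    print(g[5][5])                 # accumulated dissimilarity g(N, N)
    print(mapping_function(h, 5))  # [(1, 1), (2, 2), (3, 4), (4, 5), (5, 5)]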

FIG. 4 shows an example of expansion/contraction normalization processing. X(i) (1 ≤ i ≤ 16) is the input pattern, Y(j) (1 ≤ j ≤ 16) is the expansion/contraction normalized pattern, and j = ψ(i) is the mapping function used for the normalization. In this example Y(j) is determined by the following rules.

(1) When j = ψ(i) > ψ(i-1) and ψ(i) < ψ(i+1), then Y(j) = X(i).
(2) When j = ψ(i) = ψ(i-1) + 2, then Y(j-1) = X(i).
(3) When j = ψ(i) = ψ(i-1) < ψ(i+1), then Y(j) = X(i).

FIG. 5 is a block diagram showing one embodiment of an apparatus for realizing the method of the present invention. 100 is the input character pattern signal, and 1 is the input character pattern memory that stores the input character pattern. 2 is the projection information extraction means; it reads the input pattern from the input character pattern memory 1 as a signal 101, obtains the projection information histogram on the horizontal axis, and outputs it as a projection information signal 102. 3 is the mapping function generation means; it obtains the mapping function between the projection information signal 102 and the standard projection information signal 104 of each character type to be read, which is stored in the standard projection information memory 4 in the same format as the projection information signal 102, and outputs it as a mapping function signal 103. 5 is the two-dimensional expansion/contraction matching means; it reads the input character pattern signal 101, performs one-direction expansion/contraction normalization using the mapping function of each character type to be read, read in as the signal 103, reads from the standard character pattern memory 6 the standard character pattern signal 106 of the character type corresponding to that mapping function, performs expansion/contraction matching in the vertical direction to obtain the dissimilarity, and outputs the dissimilarity between the input character pattern and each character type to be read as a signal 105. The discrimination means 7 reads the dissimilarity to each character type as the signal 105 and outputs the recognition result as a signal 107 by the usual methods of character recognition: for example, simply taking the character type with the smallest dissimilarity as the output, or taking the character type with the smallest dissimilarity as the output when the difference between the smallest and second-smallest dissimilarities is at least a certain value and outputting a reject otherwise.

In the above description, the input pattern memory 1 and the projection information extraction means 2 may be those commonly used in pattern processing.

FIG. 6 is a block diagram showing an example of the configuration of the mapping function generation means 3. The processing here computes the recurrence formula g(i, j) described for the DP matching method, obtains the trajectory h(i, j) resulting from the recurrence computation, and obtains the mapping function from h(i, j).

102 is the projection information signal, corresponding to the sequence of scalar values A1, ..., Ai, ..., AM, and 104 is the standard projection information signal, corresponding to the sequence of scalar values A01, A02, ..., A0M of each character type to be read. 31 is the distance calculation unit; it takes these two signals as input, computes d(i, j), and outputs it as a signal 311. 32 is the recurrence calculation unit that computes the recurrence formula given above; it takes d(i, j) as the signal 311 and min(g(i-1, j), g(i-1, j-1), g(i-1, j-2)) as a signal 341, and outputs the result g(i, j) as a signal 321 to the accumulated value memory 33.

34 is the minimum value selection unit; it reads g(i-1, j), g(i-1, j-1), and g(i-1, j-2) from the accumulated value memory 33 as signals 331, 332, and 333, outputs min(g(i-1, j), g(i-1, j-1), g(i-1, j-2)) as the signal 341, and outputs h(i, j) as a signal 342 to the mapping trajectory memory 35. When the recurrence computation is finished, the mapping function is output from the mapping trajectory memory 35 as the signal 103.

FIG. 7 is a block diagram showing an example of the configuration of the two-dimensional expansion/contraction matching means 5. 51 is the one-direction expansion/contraction normalization means; from the input character pattern signal 101 and the mapping function signal 103 corresponding to each character type to be read, it outputs a one-direction expansion/contraction normalized pattern signal 510 corresponding to each character type to be read. 52 is the character pattern expansion/contraction matching means; from the one-direction expansion/contraction normalized character pattern signal 510 and the standard character pattern signal 106 corresponding to each character type to be read, it outputs the dissimilarity corresponding to each character type to be read as the signal 105.

The one-direction expansion/contraction normalization means 51 reads the input character pattern 101 as a sequence of vectors A1, A2, ..., AN and, for each vector, applies the mapping function determined by the signal 103 according to the rules explained with reference to FIG. 4, outputting the sequence of vectors A'1, A'2, ..., A'N as the signal 510; this is the one-direction expansion/contraction normalized character pattern. Such a one-direction normalized character pattern is obtained for each character type to be read; that is, for the mapping function corresponding to each character type to be read, a one-direction expansion/contraction normalized character pattern is output in turn as the signal 510.
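A sketch of the FIG. 4 rules applied row by row, as the means 51 is described as doing. The reading of the OCR-garbled rules, the boundary handling for ψ(0) and ψ(N+1), and the use of None for never-assigned cells are assumptions of this sketch, not statements of the patent.

    def stretch_normalize(x, psi):
        """One-direction expansion/contraction normalization of one row.

        x   : list of values X(1)..X(M) for one row of the input pattern
        psi : list with psi[i-1] = the mapped index j for input index i
        Returns Y(1)..Y(M); cells that no rule assigns are left as None."""
        m = len(x)
        y = [None] * (m + 1)  # y[1..m]; index 0 unused
        for i in range(1, m + 1):
            j = psi[i - 1]
            prev = psi[i - 2] if i > 1 else 0              # psi(i-1); psi(0) assumed 0
            nxt = psi[i] if i < m else psi[i - 1] + 1      # psi(i+1); open right edge assumed
            if prev < j < nxt:          # rule (1): strictly advancing
                y[j] = x[i - 1]
            if j == prev + 2:           # rule (2): the skipped column repeats X(i)
                y[j - 1] = x[i - 1]
            if j == prev and j < nxt:   # rule (3): plateau that then advances
                y[j] = x[i - 1]
        return y[1:]

    # With the FIG. 3(c) mapping psi = (1, 2, 4, 5, 5):
    print(stretch_normalize([1, 2, 4, 5, 5], [1, 2, 4, 5, 5]))  # -> [1, 2, 4, 4, 5]

    # Means 51 would apply this to each of the N row vectors of the input pattern:
    # one_dir_normalized = [stretch_normalize(row, psi) for row in input_rows]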

FIG. 8 is a block diagram showing an example of the configuration of the character pattern expansion/contraction matching means. 521 is the vector distance calculation unit; it reads the one-direction expansion/contraction normalized pattern and the standard character pattern corresponding to each character type to be read, each as a sequence of vectors, as the signals 510 and 106, computes the distance d(i, j) of the DP matching method, and outputs it as a signal 5211. 522 is the recurrence calculation unit, which may be the same as the recurrence calculation unit 32 of the mapping function generation means 3; it computes the recurrence formula, taking d(i, j) as the signal 5211 and min(g(i-1, j), g(i-1, j-1), g(i-1, j-2)) as a signal 5241, and outputs g(i, j) as a signal 5221 to the accumulated dissimilarity memory 523.

524 is the minimum dissimilarity selection unit; it reads g(i-1, j), g(i-1, j-1), and g(i-1, j-2) from the accumulated dissimilarity memory 523 as signals 5231, 5232, and 5233, and outputs min(g(i-1, j), g(i-1, j-1), g(i-1, j-2)) as the signal 5241. When the recurrence computation is finished, the accumulated dissimilarity memory 523 outputs the dissimilarity g(N, N) as the signal 105. Through the above processing, the dissimilarity corresponding to each character type to be read is output in turn as the signal 105.
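Reusing the dp_tables sketch given earlier, the vertical matching of FIG. 8 might be prototyped as below. The Euclidean row distance is only one possible choice for the vector distance of unit 521 (the text does not fix a metric), and the row data here are made up for illustration.

    import math

    def euclidean(u, v):
        # One possible vector distance for unit 521; an assumption, not the
        # patent's prescription.
        return math.sqrt(sum((p - q) ** 2 for p, q in zip(u, v)))

    # Vertical step: DP matching over the N row vectors of the one-direction
    # normalized pattern and of the standard pattern of one character type.
    normalized_rows = [[0, 1, 1, 0], [0, 1, 0, 1], [0, 1, 0, 1], [0, 1, 1, 0]]
    standard_rows   = [[0, 1, 1, 0], [0, 1, 0, 1], [0, 1, 1, 0], [0, 1, 1, 0]]
    g, _ = dp_tables(normalized_rows, standard_rows, dist=euclidean)
    print(g[len(normalized_rows)][len(standard_rows)])  # g(N, N), placed on signal 105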

FIG. 9 shows an example of other information that can be used as the projection information: the character pattern is scanned in the vertical direction, and the number of times the scan crosses the character part is taken as the projection information. It is handled in the same way as the projection information described earlier.
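A sketch of this alternative projection information; counting 0-to-1 transitions down each column is an assumption about the exact counting convention.

    def crossing_counts(pattern):
        """For each column of a binary pattern, scan from top to bottom and
        count how many times the scan enters the character part."""
        counts = []
        for col in range(len(pattern[0])):
            crossings, prev = 0, 0
            for row in range(len(pattern)):
                cur = pattern[row][col]
                if cur == 1 and prev == 0:
                    crossings += 1
                prev = cur
            counts.append(crossings)
        return counts

    # The 4 x 6 toy pattern used earlier gives [0, 1, 2, 1, 0, 0]; its third
    # column has two separate runs of black pixels, hence two crossings.
    print(crossing_counts([
        [0, 1, 1, 0, 0, 0],
        [0, 1, 0, 1, 0, 0],
        [0, 1, 0, 1, 0, 0],
        [0, 1, 1, 0, 0, 0],
    ]))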

As explained above, according to the present invention, character recognition by matching that absorbs two-dimensional expansion and contraction can be realized by first performing one-direction expansion/contraction normalization in the horizontal direction from the projection of the character pattern onto the horizontal axis, and then performing expansion/contraction matching in the vertical direction.
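Tying the sketches above together, the overall flow described in this specification might be prototyped as follows. All names are illustrative, the input and every standard pattern are assumed to share the same M x N mesh, and unassigned normalized cells are filled with zero as a further assumption.

    def recognize(input_pattern, standards, window=None):
        """End-to-end sketch: horizontal projection, DP matching of the
        projections to get psi, horizontal normalization of each row, then
        vertical DP matching against each standard pattern.  `standards`
        maps a character label to its standard pattern (list of rows); the
        dissimilarities play the role of signal 105 and the returned label
        plays the role of signal 107."""
        proj_in = horizontal_projection(input_pattern)
        scores = {}
        for label, std_pattern in standards.items():
            proj_std = horizontal_projection(std_pattern)
            _, h = dp_tables(proj_in, proj_std, window=window)
            psi = [j for (_, j) in mapping_function(h, len(proj_in))]
            normalized = [stretch_normalize(row, psi) for row in input_pattern]
            normalized = [[0 if v is None else v for v in row] for row in normalized]
            g, _ = dp_tables(normalized, std_pattern, window=window, dist=euclidean)
            scores[label] = g[len(normalized)][len(normalized)]
        return min(scores, key=scores.get), scores

A reject rule of the kind the discrimination means 7 describes, accepting the best character type only when its margin over the runner-up exceeds a threshold, could be layered on the returned dissimilarities.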

Conversely to the above processing, character recognition by matching that absorbs two-dimensional expansion and contraction can likewise be realized by performing one-direction expansion/contraction normalization in the vertical direction from the projection of the character pattern onto the vertical axis, and then performing expansion/contraction matching in the horizontal direction. The two different directions are, moreover, not limited to the vertical and horizontal directions described above.

Character recognition methods in general often apply preprocessing such as position and size normalization, smoothing of the character pattern, or blurring in order to bring out the effect of the recognition method; the character recognition method according to the present invention can likewise obtain the same benefit as other methods by applying such preprocessing to the input character pattern.

Various DP matching methods have also been published to date, and the invention is not limited to the method described in this specification.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGS. 1(a), (b), (c), and (d) are diagrams for intuitively explaining two-dimensional expansion/contraction matching; FIGS. 2(a) to (i) are diagrams for explaining the means of realizing two-dimensional expansion/contraction matching and its effect; FIGS. 3(a), (b), (c), and (d) are diagrams showing an example of the DP matching method; FIG. 4 is a diagram showing an example of expansion/contraction normalization processing; FIG. 5 is a block diagram showing an embodiment of an apparatus for realizing the method of the present invention; FIG. 6 is a block diagram showing an example of the configuration of the mapping function generation means; FIG. 7 is a block diagram showing an example of the configuration of the two-dimensional expansion/contraction matching means; FIG. 8 is a block diagram showing an example of the configuration of the character pattern expansion/contraction matching means; and FIG. 9 is a diagram showing an example of other information that can be used as the projection information. In the figures, 1 is the input character pattern memory, 2 is the projection information extraction means, 3 is the mapping function generation means, 4 is the standard projection information memory, 5 is the two-dimensional expansion/contraction matching means, 6 is the standard character pattern memory, 7 is the discrimination means, 31 is the distance calculation unit, 32 is the recurrence calculation unit, 33 is the accumulated value memory, 34 is the minimum value selection unit, 35 is the mapping trajectory memory, 51 is the one-direction expansion/contraction normalization means, 52 is the character pattern expansion/contraction matching means, 521 is the vector distance calculation unit, 522 is the recurrence calculation unit, and 523 is the accumulated dissimilarity memory.

Claims (1)

[Claims] A character recognition method for recognizing an input character pattern represented as two-dimensional mesh-like information, comprising: input character pattern storage means for storing the input character pattern; projection information extraction means for extracting, from the input character pattern, projection information forming a one-dimensional series of information for one of two predetermined different directions; standard projection information storage means storing standard projection information prepared in advance for each character type in the same format as the projection information of the input character pattern; mapping function generation means which receives the projection information of the input character pattern and the standard projection information and obtains, by expansion/contraction matching of the two, a mapping function that maximizes a measure of their agreement; standard character pattern storage means storing two-dimensional mesh-like standard character patterns prepared in advance for each character type; two-dimensional expansion/contraction matching means which, using the input character pattern, the mapping function, and the standard character pattern, matches the input character pattern in the one direction by expansion and contraction according to the mapping function and then performs expansion/contraction matching in a direction different from the one direction; and discrimination means for outputting a recognition result from the dissimilarity, obtained as a result of the expansion/contraction matching, between the input character pattern and the standard character pattern of each character type to be read; whereby matching can be performed while absorbing two-dimensional expansion/contraction variations.
JP57112215A 1982-06-29 1982-06-29 Character recognizing system Pending JPS592192A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP57112215A JPS592192A (en) 1982-06-29 1982-06-29 Character recognizing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP57112215A JPS592192A (en) 1982-06-29 1982-06-29 Character recognizing system

Publications (1)

Publication Number Publication Date
JPS592192A true JPS592192A (en) 1984-01-07

Family

ID=14581132

Family Applications (1)

Application Number Title Priority Date Filing Date
JP57112215A Pending JPS592192A (en) 1982-06-29 1982-06-29 Character recognizing system

Country Status (1)

Country Link
JP (1) JPS592192A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100383017B1 (en) * 1999-08-06 2003-05-09 가부시끼가이샤 도시바 Pattern string matching device and pattern string matching method

Similar Documents

Publication Publication Date Title
Li et al. Trocr: Transformer-based optical character recognition with pre-trained models
US5343537A (en) Statistical mixture approach to automatic handwriting recognition
El-Yacoubi et al. An HMM-based approach for off-line unconstrained handwritten word modeling and recognition
US7437001B2 (en) Method and device for recognition of a handwritten pattern
Chen et al. A new off-line signature verification method based on graph
Kovalevsky Image pattern recognition
JP4941791B2 (en) Two-dimensional pattern matching method, feature extraction method, apparatus and program used therefor
Fornés et al. Rotation invariant hand-drawn symbol recognition based on a dynamic time warping model
US5392367A (en) Automatic planar point pattern matching device and the matching method thereof
Tomai et al. Discriminatory power of handwritten words for writer recognition
CN107016319A (en) A kind of key point localization method and device
Rajalakshmi et al. Pattern recognition-recognition of handwritten document using convolutional neural networks
US6023529A (en) Handwritten pattern recognizer with selective feature weighting
Fornés et al. A keyword spotting approach using blurred shape model-based descriptors
Schenk et al. Selecting features in on-line handwritten whiteboard note recognition: SFS or SFFS?
Lamghari et al. Template matching for recognition of handwritten Arabic characters using structural characteristics and Freeman code
JPS592192A (en) Character recognizing system
JPS59116885A (en) Character recognition system
JPS5969875A (en) Method for recognizing character
Ye et al. Off-line handwritten signature verification with inflections feature
Srisuk et al. A new shape matching measure for nonlinear distorted object recognition
JP2000137766A (en) Device and method for word recognition
Touj et al. Global feature extraction of off-line arabic handwriting
Williams et al. Word spotting in bitmapped fax documents
Chatwiriya Off-line Thai handwriting recognition in legal amount