JPH0322094A

JPH0322094A - Character recognizing system

Info

Publication number: JPH0322094A
Application number: JP1156440A
Authority: JP
Inventors: Takafumi Enami; 隆文枝並
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1989-06-19
Filing date: 1989-06-19
Publication date: 1991-01-30

Abstract

PURPOSE:To obtain a stable character recognition rate by performing the feature quantity adapting processing where the size of each divided area of a recognition object character is changed for each of prescribed comparison object characters to minimize the difference among feature quantities and using the difference among feature quantities for the discrimination processing. CONSTITUTION:By the control of a feature quantity adapting means 17, the size of each divided area of the recognition object character is changed for prescribed comparison object characters to minimize differences among feature quantities. A dictionary retrieving means 15 compares minimized differences among feature quantities of respective prescribed comparison object characters to output the comparison object character, which corresponds to the minimum difference between feature quantities, as the recognition result. The size of each divided area of the recognition object character is changed to approximate this character to the comparison object character stored in a dictionary 13 in this manner. Thus, deviation and deformation of the recognition object character are easily absorbed when being minute for comparison object characters, and the stable character recognition rate is obtained.

Description

【発明の詳細な説明】〔概　要〕認識対象文字の特徴量と辞書に格納されている比較対象
文字の特徴量とを比較して文字の認識を行う文字認識方
式に関し、認識対象文字の位置ずれや変形が微小である場合にそれ
を吸収し、安定した文字認識率を得ることを目的とし、認識対象文字のイメージ情報を複数の領域に分割し、各
分割領域ごとに抽出された特徴量から認識対象文字の特
徴量を得る特徴量抽出手段と、比較対象文字の特徴量が
格納される辞書と、認識対象文字の特徴量と比較対象文
字の特徴量とを比較し、特徴量間の相違が最小となる比
較対象文字を認識結果として出力する辞書検索手段とを
備えた文字認識方式において、所定の比較対象文字ごと
に、認識対象文字の各分割領域の大きさを変更して特ｒ
ｌｌ量間の相違を最小とする特徴量適応化処理を行い、
そのときの各特ｔｔｌ．ｍ間の相違を識別処理に供する
特徴量適応化手段を備え構戒する二〔産業上の利用分野
〕本発明は、イメージスキャナその他の光学的読み取り装
置で読み取った認識対象文字のイメージ情報から得られ
る特徴量と、辞書に格納されている比較対象文字の特徴
量とを比較して文字の！！識を行う文字認識方式に関す
る。[Detailed Description of the Invention] [Summary] Regarding a character recognition method that recognizes a character by comparing the feature amount of the recognition target character with the feature amount of a comparison target character stored in a dictionary, the position of the recognition target character is In order to absorb minute deviations and deformations and obtain a stable character recognition rate, the image information of the character to be recognized is divided into multiple regions, and the feature values extracted for each divided region are A feature extraction means that obtains the features of the character to be recognized from a dictionary that stores the features of the character to be compared; and a dictionary that stores the features of the character to be compared; In a character recognition method, the size of each divided area of the recognition target character is changed for each predetermined comparison target character, and the size of each divided area of the recognition target character is changed for each predetermined comparison target character.
Perform feature amount adaptation processing to minimize the difference between ll amounts,
Each special ttl. [Industrial Application Field] The present invention provides a feature adaptation means for subjecting the differences between characters m to identification processing. [Industrial Application Field] Compare the features of the characters stored in the dictionary with the features of the comparison target characters stored in the dictionary. ! This paper relates to a character recognition method for character recognition.

[Conventional technology]

イメージ情報として入力される文字についての従来の文
字認識方式では、まず文書全体のイメージ情報から所定
の切り出し処理により個々の文字に対応するイメージ情
報（文字イメージ）を得る（第５図（ｌ））。In the conventional character recognition method for characters input as image information, image information (character images) corresponding to individual characters is first obtained from the image information of the entire document through a predetermined cutting process (Fig. 5 (l)). .

続いて、個々の文字イメージの濃淡、輪郭線などから数
値列として表される特ｆ！ｌｉｔを抽出するが、従来方
式では文字認識率を高めるために、まず外郭を含む矩形
領域を８×８の小領域に分割し、各分割領域ごとの特ｆ
ｔｋｌを抽出する（第５図（２））。Next, the special f! is expressed as a numerical string from the shading and outline of each character image. However, in the conventional method, in order to increase the character recognition rate, the rectangular area including the outline is first divided into 8 x 8 small areas, and the characteristic f of each divided area is extracted.
tkl is extracted (Figure 5 (2)).

この特徴量は、例えば文字イメージの輪郭線上の各輪郭
点の接続状態に応じて４方向の方向コード（特徴素！）
を与え（第５図（３））、各方向コードについてその個
数（頻度）を積算したものであり、各分割領域ごとにそ
の特徴量は（Ａ　ｏ，Ａ　＋　，　Ａ　！＋　Ａ　３）
として抽出される。なお、第５図（２）に示す小領域内
の特徴量は、（１，４，Ｏ，Ｏ）である。This feature amount is, for example, a direction code (feature element!) in four directions depending on the connection state of each contour point on the contour line of a character image.
(Figure 5 (3)), and the number (frequency) of each direction code is integrated, and the feature amount for each divided area is (A o, A + , A ! + A 3)
is extracted as Note that the feature amount in the small area shown in FIG. 5(2) is (1, 4, O, O).

同様にして、６４の各分割領域からそれぞれ特徴量を抽
出し、文字イメージの特徴量（４Ｘ８Ｘ８−２５６次元
ベクトル）が得られる。Similarly, the feature amounts are extracted from each of the 64 divided regions, and the feature amounts (4×8×8-256 dimensional vector) of the character image are obtained.

このようにして抽出された文字イメージの特徴量は、辞
書検索部において、辞書に格納されている比較対象文字
の特徴量との比較が行われ、特徴量間の相違（距離）が
最小のものを認識文字とし、その文字コードを出力して
文字認識処理を終了する．〔発明が解決しようとする課題〕このような従来方式は、文字イメージを固定の分割領域
からそれぞれの特徴量を抽出し、各特徴素量ごとに辞書
の対応する特徴量と比較してその相違を積算し、特徴量
間の相違を求める構成である。したがって、認識対象文
字のフォントが各分割領域単位で比較対象文字のフォン
トとわずかに異なっても、それが積算されて大きな値に
なることがあった。The feature quantities of the character images extracted in this way are compared with the feature quantities of the comparison target characters stored in the dictionary in the dictionary search section, and the one with the smallest difference (distance) between the feature quantities is is the recognized character, outputs its character code, and terminates the character recognition process. [Problems to be Solved by the Invention] Such a conventional method extracts each feature from a fixed divided region of a character image, and compares each feature with the corresponding feature in a dictionary to determine the difference. This is a configuration that calculates the difference between feature quantities by integrating the values. Therefore, even if the font of the recognition target character differs slightly from the font of the comparison target character in each divided area, the difference may be accumulated to a large value.

すなわち、認識対象文字のフォント（第５図（４）．（
５））が辞書に格納されている比較対象文字のフォント
（第５図（１）．　（２））と異なる場合や、文字位置
に微小なずれ（分割領域のずれ）があったり、その読み
取り条件（１度、かすれ、その他）の変化で文字に変形
が生じている場合には、そのフォント相違に起因する特
徴量の微変動を吸収することができず、認識率の低下を
引き起こしていた。In other words, the font of the character to be recognized (Figure 5 (4).
5)) is different from the font of the comparison target characters stored in the dictionary (Fig. 5 (1). (2)), or there is a slight shift in the character position (shift in the divided area), or the reading When characters are deformed due to changes in conditions (one degree, blurred, etc.), it is not possible to absorb slight fluctuations in the feature amount due to the difference in font, causing a decrease in recognition rate. .

本発明は、文字認識率がフォント変動に大きく依存して
いることに鑑み、認識対象文字の位置ずれや変形が微小
である場合にはそれを吸収し、安定した文字認識率を得
ることができる文字認識方式を提供することを目的とす
る。In view of the fact that the character recognition rate is highly dependent on font fluctuations, the present invention can absorb minute positional deviations and deformations of characters to be recognized and obtain a stable character recognition rate. The purpose is to provide a character recognition method.

[Means to solve the problem]

第１図は、本発明の原理ブロック図である。 FIG. 1 is a block diagram of the principle of the present invention.

図において、特徴量抽出手段１１は、認識対象文字のイ
メージ情報を複数の領域に分割し、各分割領域ごとに抽
出された特徴量から認識対象文字の特徴量を得る。In the figure, a feature amount extracting means 11 divides image information of a character to be recognized into a plurality of regions, and obtains a feature amount of the character to be recognized from the feature amount extracted for each divided region.

辞書ｌ３は、比較対象文字の特徴量が格納される。The dictionary l3 stores feature amounts of characters to be compared.

辞書検索手段１５は、認識対象文字の特徴量と比較対象
文字の特徴量とを比較し、特徴量間の相違が最小となる
比較対象文字を認識結果として出力する。The dictionary search means 15 compares the feature amount of the recognition target character with the feature amount of the comparison target character, and outputs the comparison target character with the smallest difference between the feature values as a recognition result.

特徴量適応化手段１７は、所定の比較対象文字ごとに、
認識対象文字の各分割領域の大きさを変更して特徴量間
の相違を最小とする特徴量適応化処理を行い、そのとき
の各特徴量間の相違を識別処理に供する制御を行う。The feature amount adaptation means 17 performs the following for each predetermined comparison target character:
A feature amount adaptation process is performed to minimize the difference between the feature amounts by changing the size of each divided area of the character to be recognized, and control is performed to subject the difference between the feature amounts to the identification process.

[For production]

本発明は、特徴量適応化千段１７の制御により、所定の
比較対象文字に対して、それぞれ特徴量間の相違が最小
になるように認識対象文字の各分割領域の大きさを変更
する。辞書検索手段１５では、所定の比較対象文字ごと
にそれぞれ最小となった特徴量間の相違を比較し、その
中で最小の特徴量間の相違に対応する比較対象文字を認
識結果として出力する。According to the present invention, the size of each divided region of a character to be recognized is changed by controlling the feature value adaptation stage 17 so that the difference between the respective feature amounts is minimized for a predetermined character to be compared. The dictionary search means 15 compares the minimum difference between the feature amounts for each predetermined comparison target character, and outputs the comparison target character corresponding to the minimum difference between the feature amounts as a recognition result.

すなわち、本発明方式は、当初の分割領域においてある
比較対象文字との特徴量間の相違が最小になっても、特
＠．量適応化処理により他の比較対象文字に対する特徴
量間の相違を最小にすることができた場合には、その比
較対象文字を認識結果とする方式である。That is, in the method of the present invention, even if the difference between the feature amounts with a certain comparison target character in the initial divided area is minimized, the special @. In this method, if the difference between the feature amounts for other comparison target characters can be minimized through the amount adaptation process, that comparison target character is used as the recognition result.

このように、認識対象文字の各分割領域の大きさを変更
し、辞書ｌ３に格納されている比較対象文字に近似させ
ることができるので、認識対象文字のずれや変形が比較
対象文字に対して微小なものであればその吸収は容易で
あり、分割領域が固定であるときに比べて安定した文字
認識率を得ることが可能になる。In this way, the size of each divided area of the recognition target character can be changed to approximate the comparison target character stored in the dictionary l3, so that the displacement or deformation of the recognition target character with respect to the comparison target character can be avoided. If it is minute, it is easily absorbed, and it becomes possible to obtain a more stable character recognition rate than when the divided regions are fixed.

〔Example〕

以下、図面に基づいて本発明の実施例について詳細に説
明する。Hereinafter, embodiments of the present invention will be described in detail based on the drawings.

第２図は、本発明方式を実現する文字認識装置の実施例
構或を示すブロック図である。FIG. 2 is a block diagram showing an embodiment of a character recognition device implementing the method of the present invention.

なお、認識対象文字の特ｔＵｔの抽出処理、および辞書
検索結果により特徴量間の相違から文字認識を行い、対
応する文字コードを出力する処理については従来方弐と
同様とする。Note that the process of extracting the characteristic tUt of the character to be recognized, the process of character recognition based on the difference between feature amounts based on the dictionary search results, and the process of outputting the corresponding character code are the same as in the conventional method 2.

すなわち、文字切り出し部２１では、光学的読み取り装
置から人力される認識対象文字のイメージ情報に対して
、所定の切り出し処理を行い個々の文字イメージに変換
する。特徴量抽出部２３では、個々の文字イメージを小
領域に分δリし、各分割領域ごとにその特徴量を抽出し
、一つの文字イメージから例えば２５６次元ベクトルの
特徴量を取り出す。辞書２５には、比較対象文字の特徴
量およびその文字コードが格納される．辞書検索部２７では、辞書２５に格納されている比較対
象文字の特徴量を検索し、文字イメージの特１ｖｉｌと
各特徴素量ごとに比較して特徴量間の相違が最小のもの
を割り出し、対応する文字コードを認識結果として出力
する。出力部２９では、辞書検索部２７から出力される
文字コードを所定の出力媒体に対応するコードに変換し
て出力する。That is, the character cutout section 21 performs a predetermined cutout process on image information of characters to be recognized manually inputted from an optical reading device, and converts the image information into individual character images. The feature amount extraction unit 23 divides each character image into small regions δ, extracts the feature amount for each divided region, and extracts, for example, a 256-dimensional vector feature amount from one character image. The dictionary 25 stores feature amounts of characters to be compared and their character codes. The dictionary search unit 27 searches the feature quantities of the characters to be compared stored in the dictionary 25, compares each feature quantity with the characteristic of the character image, and determines the one with the smallest difference between the feature quantities. Output the corresponding character code as the recognition result. The output unit 29 converts the character code output from the dictionary search unit 27 into a code corresponding to a predetermined output medium and outputs the code.

本発明は、認識対象文字の特徴量と比較対象文字の特徴
量との比較において、各分割領域の大きさを可変にする
ことにより、文字の読み込み時に発生する位置ずれやフ
ォントの違いに起因する特徴量の変動を吸収し、文字認
識率の向上を図るものである。The present invention makes the size of each divided area variable when comparing the feature values of the character to be recognized with the feature values of the character to be compared. This is intended to absorb variations in feature amounts and improve character recognition rates.

第３図は、特徴量抽出部および辞書検索部における本発
明の実施例処理手順を示すフローチャートである。FIG. 3 is a flowchart showing the processing procedure of the embodiment of the present invention in the feature quantity extraction section and the dictionary search section.

認識対象文字の文字イメージが人力される特徴量抽出部
２３では、各分割領域の特徴量を抽出しする。辞書検索
部２５では、この特徴量を用いて大分類処理を行う。こ
の大分類処理では、比較対象文字の特ａｔが格納されて
いる辞書を検索するときに、ある程度粗い特１ｆｉｌを
用いて計算量を軽減し、１０〜２０文字程度の候補文字
を選別する。この大分類処理は、次の詳細識別処理にお
いて文字候補数を少なくし、文字認識のための処理量を
軽減させることを目的としている。A feature amount extraction unit 23, into which a character image of a recognition target character is manually input, extracts a feature amount of each divided region. The dictionary search unit 25 performs major classification processing using this feature amount. In this major classification process, when searching a dictionary in which the special at of the character to be compared is stored, a somewhat rough special 1fil is used to reduce the amount of calculation, and about 10 to 20 candidate characters are selected. The purpose of this major classification process is to reduce the number of character candidates in the next detailed identification process and reduce the amount of processing required for character recognition.

特徴量抽出部２３および辞書検索部２５では、左右方向
特徴量適応化処理、上下方向特徴量適応化処理および再
識別処理による詳細識別処理を行う．すなわち、分割領
域を仕切る分割線を左右方向および上下方向に移動させ
、その都度認識対象文字の特＠．量と候補文字（比較対
象文字）の特徴量との比較を行い、最も近似できる分割
線の位置を決定し、候補文字ごとにそれぞれ最小となる
特ｆ！Ｉ［１間の相違（距離）を求める。The feature extraction unit 23 and the dictionary search unit 25 perform detailed identification processing using horizontal feature adaptation processing, vertical feature adaptation processing, and re-identification processing. That is, by moving the dividing line that separates the divided areas horizontally and vertically, the special @. The quantity is compared with the feature quantity of the candidate character (comparison target character), the position of the dividing line that can be most approximated is determined, and the minimum characteristic f! is determined for each candidate character. Find the difference (distance) between I[1.

ここで、第４図を用いて、左右方向特徴量適応化処理の
具体的手法について説明する。Here, a specific method of the horizontal direction feature amount adaptation processing will be explained using FIG. 4.

文字イメージの左右方向（Ｘ方向）の長さをＭ（ドット
）、上下方向（ｙ方向）の長さをＮ（ドット）、各方向
の分割数をともに８とすると、特徴素量ＸＯ（κ．ｙ）
、特徴量Ｘ（ｔ＋ｊ＋ｋ）　、辞書の特徴量Ｘ　Ｄ　（
ｉ，ｊ，ｋ）はそれぞれＸＯ（ｘ，ｙ）　　（ｘ＝０．
１，−，Ｍ−１、ｙ＝０．１，・・・，Ｎ−１）Ｘ　（
ｉ，ｊ，ｋ）　　（ｉ＝ｏ，ｌ，・・・，７、ｊ＝０．
１，・・・．７、ｋ＝０．１，２．３　｝Ｘ　Ｄ　（ｉ，Ｌｋ）　（ｉ＝ｏ，ｌ，・・・．７、ｊ
＝０．１，・・・，７、ｋ＝０．１，２．３　）となる。ここで、ｋ　（＝０．１，２．３）は、方向コ
ードに対応する特徴素ＩＸｏ（ｘ，ｙ）の取り得る値で
ある。Assuming that the length of the character image in the horizontal direction (X direction) is M (dots), the length in the vertical direction (y direction) is N (dots), and the number of divisions in each direction is 8, the feature quantity XO (κ .y)
, feature amount X(t+j+k), dictionary feature amount X D (
i, j, k) are respectively XO(x, y) (x=0.
1,-,M-1,y=0.1,...,N-1)X (
i, j, k) (i=o, l,..., 7, j=0.
1,... 7, k=0.1,2.3 } X D (i, Lk) (i=o, l,...7, j
=0.1,...,7, k=0.1,2.3). Here, k (=0.1, 2.3) is a possible value of the feature element IXo(x,y) corresponding to the direction code.

特１ｉＳ！量Ｘ　（ｉ，　ｊ，ｋ）は、各分割領域（ｉ
　．　ｊ　）ごとにｋについての特徴素量ＸＯ（×，ｙ
）を積算したものであり、と表すことができる。ここで、Ｚ　（Ｘ　Ｏ　（ｘ，ｙ
），ｋ）は、特徴素量ＸＯ（ｘ．ｙ）がｋであるときに
ＩＳｋでないときに０とする関数である。Special 1iS! The quantity X (i, j, k) is
．． j), the feature quantity XO(×, y
), and can be expressed as . Here, Z (X O (x, y
), k) is a function that is set to 0 when the feature quantity XO (x.y) is k and is not ISk.

また、ｓｅｐｘ（４）はＸ方向ｉ番目の分割線に対応す
るＸの値であり、Ｓｅｐｙ　（ｊ　）はｙ方向ｊ番目の
分割線に対応するｙの値である。たとえば、Ｍ＝７２、
Ｎ＝５６の場合には、８×８に分割される各分割領域は
９×７（ドット）であり、ｓｅｐｘ（０）　＝０＋　ｓ
ｅｐｘ（１）＝９，　ｓｅｐｘ（２）＝１８．　・・・
，　ｓｅｐｘ（８）＝７２、ｓｅｐｙ　（０）＝Ｏ．　
ｓｅｐｙ（１）＝７，　ｓｅｐｙ（２）＝１４，　・−
，　ｓｅｐｙ（８）一５６となる。したがって、ｓｅｐ
ｘ　（０）およびｓｅｐｘ　（８）−１は文字イメージ
の左端０および右端７１であり、ｓｅｐン（０）および
ｓｅｐｘ　（８）　　１は文字イメージの下端Ｏおよび
上端５５である。Further, sepx (4) is the value of X corresponding to the i-th dividing line in the X direction, and Sepy (j) is the value of y corresponding to the j-th dividing line in the y direction. For example, M=72,
In the case of N=56, each divided area divided into 8×8 is 9×7 (dots), and sepx(0) =0+s
epx(1)=9, sepx(2)=18. ...
, sepx(8)=72, sepy(0)=O.
sepy(1)=7, sepy(2)=14, ・−
, sepy(8)-56. Therefore sep
x(0) and sepx(8)-1 are the left end 0 and right end 71 of the character image, and sepn(0) and sepx(8)1 are the lower end O and upper end 55 of the character image.

文字の認識に用いる特徴量間の相違（評価関数）は、辞
書の特徴ｆｆｉＸＤ（ｉ，Ｌｋ）と抽出された特徴量Ｘ
（ｉ，ｊ，ｋ）のユークリッド距離あるいは市街地距離
により定義される。たとえば、市街地距離を用いた評価
関数Ｄは、Ｄ＝　ΣΣ：Ｅ．　　ＩＸＤ（ｔ，ｊ．ｋ）　一Ｘ（ｉ
，３．ｋ）１▲”６　　Ｊ−６　　ｋ−６と表すことができる．さて、第４図（１）において、Ｘ方向のｍ番目（ｍ−１
．２．・・・，７）の分割線をｄ　．（＝０．　１．２
）だけ右方向に移動させたときに、その分割線をはさむ
分割領域の特徴量Ｘ（ｍ−１，ｊ＋ｋ）およびＸ（＋＊
，ｊ，ｋ）の変化量ΔＸ（ｍ，ｊ，ｋ）は、となり、各特１ｖＩ．量はそれぞれＸ　（ｍ−１．Ｌｋ）−　Ｘ　（ｍ−１．Ｌｋ）十ΔＸ
（＋ｅ，ｊ．ｋ）Ｘ（ｍ，ｊ，ｋ）　　←Ｘ（ｍ，ｊ，
ｋ）一ΔＸ　（＋ｍ．　Ｊ　ｌ　ｋ）に変換される。The difference (evaluation function) between the features used for character recognition is the dictionary feature ffiXD(i,Lk) and the extracted feature X
It is defined by the Euclidean distance or urban area distance of (i, j, k). For example, the evaluation function D using city distance is D=ΣΣ:E. IXD(t,j.k) 1X(i
,3. k)1▲”6 J-6 k-6 Now, in Fig. 4 (1), the m-th (m-1
．． 2. ..., 7) with the dividing line d. (=0. 1.2
) to the right, the feature values X(m-1,j+k) and X(+*
, j, k) is as follows, and each characteristic 1vI. The amount is each X (m-1.Lk) - X (m-1.Lk) + ΔX
(+e, j.k)X(m,j,k) ←X(m,j,
k) - ΔX (+m. J l k).

また、Ｘ方向のｍ番目の分割線をｄ．だけ左方向に移動
させたときの変化量ΔＸ’　（ｍ＋　Ｊ　ｌ　ｋ）は、
となり、各特徴量はそれぞれＸ　（ｓ＋−１，ｊ．ｋ）←Ｘ（ａ＋−Ｌｊ，ｋ）一Δ
Ｘ’（ｍ．ｊ＋ｋ）Ｘ　（ｍ．ｊ．ｋ）　　４−Ｘ　（
ａ＋，ｊ，ｋ）十Δｘ’　（ｍ＋　ｊ＋　ｋ）に変換さ
れる。Also, the m-th dividing line in the X direction is d. The amount of change ΔX' (m+J l k) when moving to the left by
and each feature amount is X (s+-1,j.k)←X(a+-Lj,k)-Δ
X'(m.j+k)X (m.j.k) 4-X (
a+, j, k) is converted into Δx' (m+ j+ k).

なお、ｄ．だけ移動する分割線をはさむ分割領域以外の
特徴量については不変であり、分割線の移動によりその
特ｆ！ｌ＋１が変化する分割領域部分における評価関数
Ｄｄは、と表すことができるので、Ｘ方向のｍ番目の分割線につ
いて、このＤ４が最小になる（辞書の特徴量に最も近似
できる）移動Ｉｄ．を決定する。In addition, d. The feature values other than the divided areas sandwiching the dividing line that moves by the same amount remain unchanged, and as the dividing line moves, the characteristic f! The evaluation function Dd in the divided region portion where l+1 changes can be expressed as follows. Therefore, for the m-th dividing line in the X direction, the movement Id. Determine.

左右方向特徴量適応化処理では、以上の処理をＸ方向の
すべての分割線について行い、それぞれの評価関数Ｄｄ
が最小となる分割領域に基づいて適応化された特徴量を
算出する。In the horizontal direction feature adaptation processing, the above processing is performed for all dividing lines in the X direction, and each evaluation function Dd is
The adapted feature amount is calculated based on the divided region where the minimum value is obtained.

また、上下方向特徴量適応化処理では、ｙ方向のすべて
の分割線について同様の処理を行い適応化された特徴量
を算出する．なお、左右方向および上下方向の各特徴量
適応化処理は互いに関与しているので、総合的な特徴量
適応化処理が望ましい（第４図（２））．再識別処理では、各候補文字ごとに適応化された特徴量
に基づいて、それぞれ最小となるように変更された分割
領域に応じた評価関数（特ｌｔＩ［量間の相違）を求め
る。In addition, in the vertical feature adaptation process, similar processing is performed for all dividing lines in the y direction to calculate adapted feature quantities. Note that since the horizontal and vertical feature adaptation processes are related to each other, comprehensive feature adaptation processing is desirable (Fig. 4 (2)). In the re-identification process, an evaluation function (special ltI [difference between quantities)] is calculated based on the feature quantity adapted for each candidate character, according to the divided area that has been changed to the minimum.

以上の処理をすべての候補文字について行った後に、認
識文字の決定処理では、各候補文字に対応する評価関数
（特１！Ｉｒ量間の相違）についてソートを行い、その
中で最小の評価関数に対応する候補文字をＬ’２　１ｋ
文字として決定する。After performing the above processing for all candidate characters, in the recognition character determination process, the evaluation functions (Special 1! Differences between Ir amounts) corresponding to each candidate character are sorted, and the minimum evaluation function among them is sorted. The candidate character corresponding to L'2 1k
Determine as a character.

このように、認識対象文字の特徴量は、辞書の対応する
文字の特徴量に自動的に近似されるので、辞書の比較対
象文字に対するｖＱ識対象文字の微小なずれや変形を吸
収することが容易である。In this way, the features of the character to be recognized are automatically approximated to the features of the corresponding character in the dictionary, so it is possible to absorb minute deviations and deformations of the target character for vQ recognition with respect to the character to be compared in the dictionary. It's easy.

〔Effect of the invention〕

上述したように、本発明によれば、文字の読み込み条件
の変動やフォント変化に応じた微小なずれや変形による
誤認識が減少し、安定した文字認識率を得ることができ
る。As described above, according to the present invention, erroneous recognition due to minute shifts or deformations due to variations in character reading conditions or font changes is reduced, and a stable character recognition rate can be obtained.

なお、認識対象文字としてプリンタ出力文字を使用し、
岩田細明朝体の文字から作成した辞書を使用し、約１４
００文字について行った文字認識処理の結果は、従来方
式では８０〜８５％であったものが、本発明方式では９
５％程度に向上させることができ、実用的には極めて有
用である。Note that printer output characters are used as recognition target characters,
Using a dictionary created from Iwata Hoso Mincho typeface characters, approximately 14
The result of character recognition processing performed on the 00 character was 80-85% in the conventional method, but 9% in the method of the present invention.
This can be improved to about 5%, which is extremely useful in practice.

[Brief explanation of the drawing]

第１図は本発明の原理ブロック図、第２同は本発明方式を実現する文字認識装置の実施例構
戒を示すブロック図、第３図は実施例処理手順を示すフローチャート、第４図
は特１！ｌ［量適応化処理の具体的手法について説明す
る図、第５図は文字イメージの特徴量抽出処理を説明する図で
ある。図において、１は特徴量抽出手段、３は辞書、５は辞書検索手段、７は特徴量適応化手段、ｌは文字切り出し部、３は特徴量抽出部、５は辞書、２７は辞書検索部、２９は出力部である。本発明原理ブロック図第１図文字認識装置のｌＩｅ．例を示すブロノク図第２図ｍ−１ｍ＝１（１）（２）特徴量適応化処理の具体的手法について説明する図実施
例処理手順を示すフローチャート第４図第３図（１）（２）２（３）（４）（５）文字イメージの特徴量抽出処理を説明する図第５図Fig. 1 is a block diagram of the principle of the present invention, Fig. 2 is a block diagram showing the structure of an embodiment of a character recognition device that implements the method of the present invention, Fig. 3 is a flowchart showing the processing procedure of the embodiment, and Fig. 4 is Special 1! FIG. 5 is a diagram illustrating a feature amount extraction process of a character image. In the figure, 1 is a feature extraction means, 3 is a dictionary, 5 is a dictionary search means, 7 is a feature adaptation means, l is a character extraction section, 3 is a feature extraction section, 5 is a dictionary, 27 is a dictionary search section , 29 is an output section. Block diagram of the principle of the present invention Figure 1 Character recognition device IIe. Bronok diagram showing an example Fig. 2 m-1 m=1 (1) (2) Fig. 4 explaining a specific method of feature adaptation processing Flowchart showing an example processing procedure Fig. 4 Fig. 3 Fig. (1) ( 2) 2 (3) (4) (5) Figure 5 is a diagram illustrating feature extraction processing of character images.

Claims

[Claims]

(1) A feature amount extraction means (11) that divides image information of a character to be recognized into a plurality of regions and obtains a feature amount of the character to be recognized from the feature amount extracted for each divided region, and a feature of a character to be compared. a dictionary (13) in which quantities are stored; and a dictionary that compares the feature quantities of the recognition target character with the feature quantities of the comparison target character and outputs the comparison target character with the smallest difference between the feature quantities as a recognition result. A character recognition method comprising a search means (15), characterized in that, for each predetermined comparison target character, the size of each divided region of the recognition target character is changed to minimize the difference between the feature values. 1. A character recognition method characterized by comprising a feature amount adaptation means (17) that performs a conversion process and subjects differences between respective feature amounts at that time to an identification process.