JP3277977B2

JP3277977B2 - Character recognition method

Info

Publication number: JP3277977B2
Application number: JP24814795A
Authority: JP
Inventors: 紀博萩田; 美奈子澤木
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 1995-09-26
Filing date: 1995-09-26
Publication date: 2002-04-22
Anticipated expiration: 2015-09-26
Also published as: JPH0991373A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は入力画像が「０」ま
たは「１」で表される２値画像、特に、かすれ、つぶれ
などの雑音が加わった文書画像、縞模様などのデザイン
処理を施してある文書画像、さらには白抜き文字画像な
どが混在する文書画像等に対する文字認識方法に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention performs a design process on a binary image in which an input image is represented by "0" or "1", particularly a document image to which noise such as blurring or crushing has been added, or a stripe pattern. The present invention relates to a method for recognizing characters in a document image, such as a document image, and a document image in which outline character images are mixed.

【０００２】[0002]

【従来の技術】従来、漢字ＯＣＲ（Optical Character
Reader）などの文字認識処理装置では、まず入力された
２値の文書画像パターンに対して、水平または垂直方向
に走査して、各走査線ごとに黒画素または白画素の数を
射影して、これらの量の投影分布を求め、見出し、子見
出し、文字列などを抽出する。次に、抽出された文字列
画像に対して、同じく、水平または垂直方向に走査し
て、各走査線ごとに黒画素または白画素の数を射影し
て、これらの量の投影分布に基づいて個々の文字を切り
出す。そして、最後に切り出された文字ごとに認識のた
めの特徴をベクトルの形で抽出し、予め作成してある標
準辞書内の各カテゴリの標準パターンベクトルとの間で
類似度または相違度などの識別関数を求めて、最も類似
した文字または図形カテゴリを認識結果とする方法が知
られている。2. Description of the Related Art Kanji OCR (Optical Character)
A character recognition processing device such as a Reader) first scans an input binary document image pattern in the horizontal or vertical direction, and projects the number of black or white pixels for each scanning line. The projection distribution of these amounts is obtained, and headings, child headings, character strings, and the like are extracted. Next, the extracted character string image is similarly scanned in the horizontal or vertical direction, and the number of black pixels or white pixels is projected for each scanning line, based on the projection distribution of these amounts. Cut out individual characters. Then, a feature for recognition is extracted in the form of a vector for each character cut out last, and a similarity or a difference between the standard pattern vector of each category in the standard dictionary created in advance is identified. A method is known in which a function is obtained and the most similar character or graphic category is used as a recognition result.

【０００３】この方法では、文字パターンが黒文字で、
背景部分が白画素からなる文書画像の場合には上記の黒
または白画素の投影分布を見れば、文書画像の中から文
字列領域を判定することや、文字列画像の中から個別文
字の領域を判定することが可能である。しかし、新聞見
出し、週刊誌等の雑誌の見出しなどのように、文字が黒
文字または白文字で背景に網点などのデザイン処理され
たパターンをもつ画像や、文字部分の一部に斜め線など
のデザイン処理が行われた文書画像などに対しては、背
景のデザイン処理の投影分布と文字部分の投影分布が似
ているため、うまく文字列や個別文字の領域を抽出でき
ない問題があった。In this method, the character pattern is a black character,
In the case of a document image in which the background portion is composed of white pixels, the character distribution area can be determined from the document image or the individual character area can be determined from the character string image by looking at the projection distribution of the black or white pixels. Can be determined. However, such as newspaper headlines, headlines of magazines such as weekly magazines, etc., characters with black or white characters and a background with a pattern that has been subjected to design processing, such as halftone dots, and diagonal lines etc. For a document image or the like on which a design process has been performed, the projection distribution of the background design process is similar to the projection distribution of the character portion.

【０００４】この問題に対して、背景のデザイン処理に
使われる個々の線分の幅は文字パターンのストロークの
線幅に比べて細いなど、デザイン処理部分と文字パター
ン部分の黒または白領域の違いを仮定した画像処理を用
いて、背景のデザイン処理部分を取り除く方法が提案さ
れている（たとえば、岡本、林：「膨張／収縮を用いた
地模様のある見出しからの文字抽出」電子情報通信学会
技術研究報告、パターン認識と理解研究会PRU90-151,p
p.47-54,1991.）。しかし、実際の文書画像ではデザイ
ン処理部分と文字パターン部分を分ける手がかりとなる
上記の仮定を満足しない例外が多く発生し、画像処理に
よって認識に重要なストロークが欠落したり、デザイン
処理部分の線分が除去されずに残ってしまい、後段の文
字認識の性能力が低下する問題があった。In order to solve this problem, the width of each line segment used in the background design process is smaller than the stroke width of the character pattern. There has been proposed a method of removing a background design processing portion by using image processing assuming the following (for example, Okamoto, Hayashi: "Character extraction from a headline with a ground pattern using expansion / contraction" IEICE) Technical Research Report, Pattern Recognition and Understanding Workshop PRU90-151, p
p.47-54,1991.). However, in actual document images, there are many exceptions that do not satisfy the above assumptions, which are clues for separating the design processing part from the character pattern part, and strokes important for recognition are missing due to image processing, line segments in the design processing part However, there is a problem in that the lingering ability remains without being removed, and the abilities of character recognition in the subsequent stage are reduced.

【０００５】[0005]

【発明が解決しようとする課題】上記のように、従来の
ＯＣＲをはじめとする文字認識技術は、背景にデザイン
処理を行った文字列画像や、かすれ、つぶれのある文字
列画像、白抜き文字混在などからなる文字列画像から文
字カテゴリを正しく認識できる手段が十分に確立してい
なかった。As described above, the conventional OCR and other character recognition techniques use a character string image that has been subjected to design processing in the background, a character string image with blurred or crushed characters, and white characters. Means for correctly recognizing a character category from a character string image composed of mixed characters has not been sufficiently established.

【０００６】本発明の目的はこのような問題点に鑑み、
かすれやよごれなどの雑音やデザイン処理された文字列
に対しても、文字列領域と背景領域の違いを反映する計
数値を用いて文字列の領域および個別文字の候補画像を
抽出する方法と、個別文字の候補画像に対してこの計数
値を用いて雑音の程度を推定し、認識、棄却の判定に用
いる識別関数のしきい値を雑音の程度に応じて変化させ
る方法とを提供することで従来法に比べて精度の高い文
字認識を実現することである。[0006] The object of the present invention is to address such problems.
A method for extracting a character string area and individual character candidate images using a count value that reflects a difference between a character string area and a background area, even for noise such as blurring and dirt and a design-processed character string, A method of estimating the degree of noise using the count value for a candidate image of an individual character, and changing a threshold value of a discriminant function used for recognition / rejection determination according to the degree of noise. It is to realize character recognition with higher accuracy than the conventional method.

【０００７】[0007]

【課題を解決するための手段】上記目的を達成するため
に、本発明では、新聞見出しのようなデザイン処理され
た文字列画像パターンから文字領域と背景領域を分離す
るために、入力される２値画像パターンに対して、複数
方向に走査して、走査線上にある、黒から黒に変化する
黒黒変化数と白から白に変化する白白変化数および黒か
ら白に変わる黒白変化数と白から黒に変化する白黒変化
数を計数し、これらの計数値を利用して文字列領域を抽
出することを特徴とする。In order to achieve the above object, according to the present invention, a character string and a background area are inputted to separate a character area and a background area from a character string image pattern which has been designed and processed, such as a newspaper headline. The value image pattern is scanned in a plurality of directions, and the number of black-black changes that change from black to black, the number of white-white changes that change from white to white, and the number of black-white changes that change from black to white are on the scanning line. The number of black-and-white changes from black to black is counted, and a character string area is extracted using these count values.

【０００８】なお、本明細書においては、所定の単位
（例えば画素単位）長の走査において２値画像パターン
の値が隣合う単位間で変化しない場合に、その２値の値
が黒のとき黒黒変化、２値の値が白のとき白白変化であ
ると呼ぶことととする。In the present specification, when the value of a binary image pattern does not change between adjacent units in scanning of a predetermined unit (for example, pixel unit) length, when the binary value is black, When the black value and the binary value are white, it is referred to as a white-white change.

【０００９】また、本発明では、抽出した文字列領域の
画像パターンの情報をもとにして、複数方向に走査し
て、走査線上にある、黒から黒に変化する黒黒変化数と
白から白に変化する白白変化数および黒から白に変わる
黒白変化数と白から黒に変化する白黒変化数を計数し、
これらの計数値を利用して、個別文字の候補領域を抽出
する方法や、文字列中で個別文字カテゴリが照合により
確定した文字カテゴリの位置と認識情報をもとに、残り
の未処理の個別文字の候補領域を抽出する方法を備えた
ことを特徴とする。Further, according to the present invention, scanning is performed in a plurality of directions on the basis of information on the image pattern of the extracted character string region, and the number of black-black changes on the scanning line, which changes from black to black, and from white to black. Count the number of white-white changes that change to white, the number of black-white changes that change from black to white, and the number of black-and-white changes that change from white to black,
These counts are used to extract candidate areas for individual characters, and the remaining unprocessed individual A method for extracting a character candidate area is provided.

【００１０】さらに、個別候補画像パターンと辞書の参
照パターンとの照合において、例えば公知であるような
所定の識別関数を用いて識別関数値を求め、この値と個
別候補画像パターンの雑音の程度によって定まるしきい
値とをもとに、認識または棄却の判定処理を行うこと
で、多くの候補画像パターンの中から、正しい文字カテ
ゴリとその位置を選択することを特徴とする。Further, in the comparison between the individual candidate image pattern and the reference pattern in the dictionary, a discriminant function value is obtained by using a known discriminant function, for example. By performing recognition or rejection determination processing based on a determined threshold, a correct character category and its position are selected from many candidate image patterns.

【００１１】また、入力された２値画像パターンの情報
をもとに、辞書の参照パターンと認識・棄却判定のしき
い値を新規作成・追加・更新できる特徴をもつ。[0011] The present invention is characterized in that a reference pattern of a dictionary and a threshold for recognition / rejection determination can be newly created / added / updated based on input binary image pattern information.

【００１２】本発明では、入力された２値画像パターン
に対して、文書画像全体の傾き補正処理やシェーディン
グ補正処理などの前処理が行われた後の文書等の画像パ
ターンを対象として、文書中の文字列領域を抽出し、各
文字列内に含まれる文字カテゴリを辞書の参照パターン
との照合により、認識する。また、誤認識や未学習の文
字の画像パターンを辞書の参照パターンに追加作成また
は更新する。According to the present invention, an image pattern of a document or the like after the input binary image pattern is subjected to a pre-processing such as a skew correction processing and a shading correction processing of the entire document image is targeted. Is extracted, and the character category included in each character string is recognized by collating with the reference pattern of the dictionary. In addition, an image pattern of an erroneously recognized or unlearned character is additionally created or updated as a reference pattern in the dictionary.

【００１３】[0013]

【発明の実施の形態】以下、本発明の一実施の形態を図
１〜図７を参照して説明する。図１は本発明の方法を適
用する文字認識装置の構成を示すブロック図であり、こ
の図において１は２値画像パターン記憶回路、２は白黒
計数回路、３は文字列抽出回路、４は個別文字抽出回
路、５は個別文字照合回路、６は参照パターン学習回
路、７は画像入出力回路である。図２は図１の文字認識
装置の文字列領域を抽出するフローチャート、図３は個
別文字の候補画像を抽出するフローチャート、図４は個
別文字の候補画像パターンと参照パターンの照合処理を
行うフローチャートをそれぞれ示す図である。図５は白
黒計数回路２の動作例を説明する説明図、図６は文字列
抽出回路３の動作例を説明する説明図、図７は個別文字
抽出回路４の動作例を説明する説明図である。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS One embodiment of the present invention will be described below with reference to FIGS. FIG. 1 is a block diagram showing a configuration of a character recognition apparatus to which the method of the present invention is applied. In this figure, 1 is a binary image pattern storage circuit, 2 is a black and white counting circuit, 3 is a character string extraction circuit, and 4 is an individual A character extracting circuit, 5 is an individual character matching circuit, 6 is a reference pattern learning circuit, and 7 is an image input / output circuit. 2 is a flowchart for extracting a character string region of the character recognition apparatus of FIG. 1, FIG. 3 is a flowchart for extracting a candidate image of an individual character, and FIG. 4 is a flowchart for matching processing between an individual character candidate image pattern and a reference pattern. FIG. 5 is an explanatory diagram for explaining an operation example of the monochrome counting circuit 2, FIG. 6 is an explanatory diagram for explaining an operation example of the character string extracting circuit 3, and FIG. 7 is an explanatory diagram for explaining an operation example of the individual character extracting circuit 4. is there.

【００１４】図１において、２値画像パターン記憶回路
１は認識する２値画像パターンを記憶している。２値画
像パターンはＭ画素からなり、例えば白画素は「０」、
黒画素は「１」で示されている。白黒計数回路２は、２
値画像パターン記憶回路１から取り出されたＭ画素から
なる２値画像パターンを入力し、複数方向から走査して
文字列領域と背景領域を区別するために、各走査線上に
ある、隣合う画素の２値の値がどちらも黒である黒黒変
化数（黒から黒への変化の数）と同様にどちらも白であ
る白白変化数（白から白への変化の数）および黒から白
に変わる黒白変化数と白から黒に変化する白黒変化数を
計数し、出力する。文字列抽出回路３は、白黒計数回路
２によって得られた各走査線上の白黒の計数値をもとに
文字列領域と背景領域を区別し、入力画像中から文字列
領域の画像パターンを出力する。In FIG. 1, a binary image pattern storage circuit 1 stores a binary image pattern to be recognized. The binary image pattern is composed of M pixels, for example, white pixels are “0”,
Black pixels are indicated by "1". The black and white counting circuit 2
A binary image pattern composed of M pixels extracted from the value image pattern storage circuit 1 is input and scanned from a plurality of directions to distinguish between a character string region and a background region. Both the white-white change number (the number of changes from white to white) and the change from black to white, as well as the black-black change numbers (the number of changes from black to black), both of which are black. The number of changing black and white and the number of changing black and white from white to black are counted and output. The character string extraction circuit 3 distinguishes a character string region from a background region based on the black and white count value on each scanning line obtained by the black and white counting circuit 2, and outputs an image pattern of the character string region from the input image. .

【００１５】個別文字抽出回路４は文字列抽出回路３で
得られた文字列領域の画像パターンを入力し、個別文字
照合回路５で照合すべき個別文字が含まれる候補画像を
選択し、出力する。個別文字照合回路５は、個別文字抽
出回路４から選ばれた、個別文字の候補画像パターンを
入力し、所定の識別関数を用いて辞書の参照パターンと
照合し、各文字カテゴリの識別関数値と候補画像パター
ンの雑音の程度によって決まるしきい値と比較して、こ
の候補画像パターンの中に認識文字カテゴリがあるかど
うかを判定し、ある場合にはこれらの結果を出力する。
参照パターン学習回路６は、画像入出力回路７から出力
される画像情報および２値画像パターン記憶回路１の２
値画像パターンおよび、個別文字照合回路５内の辞書の
参照パターンを入力し、参照パターンの新規作成・更新
を行い、個別文字照合回路５の辞書の参照パターンに追
加・更新する。The individual character extracting circuit 4 inputs the image pattern of the character string area obtained by the character string extracting circuit 3, selects a candidate image including an individual character to be collated by the individual character collating circuit 5, and outputs the candidate image. . The individual character matching circuit 5 receives the individual character candidate image pattern selected from the individual character extraction circuit 4, matches the image pattern with a reference pattern in a dictionary using a predetermined identification function, and determines the identification function value of each character category. A comparison is made with a threshold value determined by the degree of noise in the candidate image pattern to determine whether or not the candidate image pattern includes a recognized character category.
The reference pattern learning circuit 6 stores the image information output from the image input / output circuit 7 and the binary information of the binary image pattern storage circuit 1.
A value image pattern and a reference pattern of a dictionary in the individual character matching circuit 5 are input, a new reference pattern is created and updated, and the reference pattern of the dictionary of the individual character matching circuit 5 is added and updated.

【００１６】画像入出力回路７は個別文字照合回路５か
ら出力される文字カテゴリ番号、識別関数値、２値画像
パターン等の認識結果の情報を入力し、これらの情報を
表示し、また、マウスやキーボードなどによって画像情
報を編集する。さらに、画像入出力回路７は参照パター
ン学習回路６に参照パターンの新規作成または更新に必
要な情報を送出する。The image input / output circuit 7 inputs information on recognition results such as a character category number, a discriminant function value, and a binary image pattern output from the individual character collating circuit 5, displays the information, and displays a mouse. Edit image information using a keyboard or keyboard. Further, the image input / output circuit 7 sends information necessary for newly creating or updating the reference pattern to the reference pattern learning circuit 6.

【００１７】次に図１において、入力された２値画像パ
ターンから文字列領域を抽出する処理について説明す
る。まず２値画像パターン記憶回路１から認識対象とな
る２値文書画像パターンが読み出され、白黒計数回路２
に入力される。そして、Ｍ画素（横Ｎ_x画素×縦Ｎ_y画
素）からなる入力２値画像パターンに対して、例えばＮ
_x＞Ｎ_yならば横書き、Ｎ_x≦Ｎ_yならば縦書きと判定す
る。横書きならば水平方向に、縦書きならば垂直方向に
文字を走査し、各走査線上の黒から黒に変化する黒黒変
化の点の総数ａと白から白に変化する白白変化の点の総
数ｅ、白から黒に変わる白黒変化数ｂと、黒から白に変
わる黒白変化数ｃを計数する。Next, referring to FIG. 1, a process for extracting a character string region from an input binary image pattern will be described. First, a binary document image pattern to be recognized is read out from the binary image pattern storage circuit 1,
Is input to Then, for an input binary image pattern composed of M pixels (horizontal _Nx pixels × vertical _Ny pixels), for example, N
_x> N _y if horizontal determines that N _x ≦ N _y if vertical. Characters are scanned in the horizontal direction for horizontal writing and in the vertical direction for vertical writing, and the total number of black-black change points a changing from black to black on each scanning line and the total number of white-white changing points changing from white to white e, the number of black and white changes b from white to black and the number of black and white changes c from black to white are counted.

【００１８】ここでは例えば、黒点は画素値＝１に、白
点は画素値＝０とする。各走査線上で、ａ，ｂ，ｃ，ｅ
の各計数値を用いてHere, for example, it is assumed that a black point has a pixel value of 1 and a white point has a pixel value of 0. On each scan line, a, b, c, e
Using each count value of

【００１９】[0019]

【数１】 (Equation 1)

【００２０】なるｐの値を求める。分子のａ・ｅ値は走
査線上にある黒黒変化点の総数と白白変化点の総数の積
であり、文字部分が黒文字でも白文字でも走査線上に黒
ラン長または白ラン長の長い線分が多く存在すれば、ｐ
値は大きい値をとる。一方、ｂ・ｃ値は黒白変化数の総
数と白黒変化数の総数の積であり、走査線上で横切る線
分数、すなわち線分の複雑さに対応した量で、文字の背
景部分に網点などのデザイン処理が行われた部分を走査
した場合には、このｂ・ｃ値が高くなり、ｐ値は小さい
値をとる。また、ｐ値は分母の正規化項により、-１≦
ｐ≦１の範囲の値をとる。The value of p is determined. The a · e value of the numerator is the product of the total number of black-and-white transition points and the total number of white-and-white transition points on the scanning line. If many exist, p
The value takes a large value. On the other hand, the bc value is the product of the total number of black-and-white changes and the total number of black-and-white changes, and is the number of line segments traversing on a scanning line, that is, the amount corresponding to the complexity of the line segments. When the portion where the design processing is performed is scanned, the bc value becomes high and the p value becomes a small value. In addition, the p-value is −1 ≦ by the normalized term of the denominator.
It takes a value in the range of p ≦ 1.

【００２１】このｐ値を、各走査方向に対応した投影軸
に投影する。位置ｙの水平方向走査で得られるｐ値を射
影値ｈ_yとおくと、水平方向走査で投影分布Ｐ_y＝
（ｈ₁，ｈ₂，...，ｈ_y，...，ｈ_Ny）が得られる。位置
ｘの水平方向走査で得られるｐ値を射影値ｖ_xとおく
と、水平方向走査で投影分布Ｐ_x＝（ｖ₁，ｖ₂，...，ｖ
_x，...，ｖ_Nx）が得られる。これ以外の走査方向につい
ても計数可能である。ここで、各走査線上でａ，ｂ，
ｃ，ｅを求める場合に、特定の区間だけについてこれら
の総数を求めることも可能である。The p value is projected on a projection axis corresponding to each scanning direction. If the p value obtained by horizontal scanning of the position y is defined as a projection value h _y , the projection distribution P _y =
(H ₁ , h ₂ ,..., H _y ,..., H _Ny ) are obtained. Assuming that a p value obtained by horizontal scanning at the position x is a projection value v _x , a projection distribution P _x = (v ₁ , v ₂ ,..., V
_x ,..., v _Nx ). Counting can be performed in other scanning directions. Here, a, b,
When obtaining c and e, it is also possible to obtain the total number of these for only a specific section.

【００２２】次に、文字列抽出回路３は、白黒計数回路
２から得られた投影分布Ｐ_yと投影分布Ｐ_xを入力し、も
し、Ｎ_x＞Ｎ_yならば、水平方向の走査で得られた投影分
布Ｐ_yを、予め設定したＮ区間になるように、Ｎ等分し
て、新しい投影分布Ｑ＝（ｑ₁，ｑ₂，...，ｑ_j，...，
ｑ_N）を求める。さもなければ、垂直方向の走査で得ら
れた投影分布Ｐ_xを、予め設定したＮ区間になるよう
に、Ｎ等分して、新しい投影分布Ｑ＝（ｑ₁，
ｑ₂，...，ｑ_j，...，ｑ_N）を求める。Next, the character string extraction circuit 3 receives the projection distribution P _y and the projection distribution P _x obtained from the black-and-white counting circuit 2 and, if N _x > N _y , obtains by horizontal scanning. The obtained projection distribution P _y is divided into N equal parts so as to have a preset N section, and a new projection distribution Q = (q ₁ , q ₂ ,..., Q _j,.
q _N ). Otherwise, the projection distribution P _x obtained by scanning in the vertical direction is divided into N equal parts so as to have a preset N section, and a new projection distribution Q = (q ₁ ,
_{_{q 2, ..., q j,}} ..., seek q _N).

【００２３】今、Ｎx＞Ｎyの場合について説明する。新
しい投影分布Ｑに対して、予め設定したしきい値ＴQと
比較し、ｑ＞ＴQとなるｊの連続区間［Ｊs，Ｊe］（ｊs
≦ｊ≦ｊe）から文字列の高さＷy（＝ｊe−ｊs＋１）を
求める。この高さＷy画素と幅Ｎx画素によって囲まれる
画像領域を文字列画像パターンＷとして出力する。ここ
で、文字列が複数行からなる場合は連続区間が複数個出
力されるので、複数個の文字列画像パターンＷを出力す
る。 Now, the case where Nx> Ny will be described. The new projection distribution Q is compared with a predetermined threshold value TQ, and j continuous sections [Js, Je] (js
The height Wy (= je-js + 1) of the character string is obtained from ≤j≤je. An image area surrounded by the height Wy pixels and the width Nx pixels is output as a character string image pattern W. Here, when the character string is composed of a plurality of lines, a plurality of continuous sections are output, so that a plurality of character string image patterns W are output .

【００２４】次に、個別文字抽出回路４では、文字列抽
出回路３で得られた、Ｎ_x×Ｗ_y画素の文字列画像パター
ンＷを高さがＮ画素なるように正規化処理を行い、Ｎ_x
×Ｎ画素の文字列画像パターンＷ_Nを求める。この正規
化文字列画像パターンＷ_Nに対して、白黒計数回路２の
処理手順でのべた、垂直方向走査によるｐ値の投影分布
Ｐ_xを求め、この投影分布Ｐ_xに対して、予め設定したし
きい値Ｔ_xと比較し、ｖ_j＞Ｔ_xとなるｘの連続区間
［ｘ_s，ｘ_e］（ｘ_s≦ｘ≦ｘ_e）を求める。Next, the individual character extraction circuit 4 normalizes the character string image pattern W of N _x × W _y pixels obtained by the character string extraction circuit 3 so that the height becomes N pixels. N _x
A character string image pattern W _N of × N pixels is obtained. For this normalized character string image pattern W _N , the projection distribution P _x of the p-value by the vertical scanning, which is obtained by the processing procedure of the monochrome counting circuit 2, is obtained, and the projection distribution P _{x is} set in advance. is compared with the threshold value T _{_x,} v _j> T _x become x continuous interval [x _s, x _e] Request _{_{(x s ≦ x ≦ x e}} ).

【００２５】ｋ個の連続区間［ｘ_s1，ｘ_e1］，［ｘ_s2，
ｘ_e2］，...，［ｘ_sk，ｘ_ek］に対して、参照パターン
に記憶されている文字幅の存在範囲であるｚ₁画素から
ｚ₂画素の中にはいる、ｚ₁≦ｘ_ej−ｘ_si≦ｚ₂（ｉ≦
ｊ）を満たす区間ｉと区間ｊを求め、個別文字の候補画
像パターンＧの文字幅Ｗ_x（＝ｘ_ej−ｘ_si＋１）を求め
る。これによりＷ_x×Ｎ画素からなる個別文字の候補画
像パターンＧが複数個得られる。もし区間ｉでｘ_ei−ｘ
_si＞ｚ₂となる場合は、候補画像パターンＧがＷ_x＝Ｎで
あると仮定して、候補画像パターンＧの左枠の位置ｘ_si
を１画素単位にずらしてＧの右枠が位置ｘ_eiになるまで
複数個のＧを作成する。The k continuous sections [x _s1 , x _e1 ], [x _s2 ,
x _e2 ],..., [x _sk , x _ek ], z ₁ ≤x which falls within the range from z ₁ pixel to z ₂ pixel which is the range of the character width stored in the reference pattern. _ej− x _si ≦ z ₂ (i ≦
j) and the section j that satisfies j) are obtained, and the character width W _x (= x _ej −x _si +1) of the individual character candidate image pattern G is obtained. As a result, a plurality of individual character candidate image patterns G each composed of W _x N pixels are obtained. If in section i x _ei -x
_{When si} > z ₂ , the position x _si of the left frame of the candidate image pattern G is assumed on the assumption that the candidate image pattern G is W _x = N.
Are shifted in units of one pixel, and a plurality of Gs are created until the right frame of G is at the position _xei .

【００２６】[0026]

【００２７】また、個別文字照合回路５で「ある文字カ
テゴリに認識した」と判定された個別文字の候補画像パ
ターンＧの区間から文字のピッチ（間隔）を推定し、残
りの領域から個別文字候補画像パターンＧを抽出する方
法も適用可能である。The character pitch (interval) is estimated from the section of the individual character candidate image pattern G determined to be "recognized in a certain character category" by the individual character matching circuit 5, and the individual character candidate is estimated from the remaining area. A method of extracting the image pattern G is also applicable.

【００２８】なお、これまでは横に長い２値画像が入力
された場合に関して主に説明したが、縦に長い２値画像
パターンの場合も、ｘ方向とｙ方向の処理を逆にすれば
同様に処理できる。In the above description, a case where a horizontally long binary image is input has been mainly described. However, a vertically long binary image pattern can be similarly obtained by reversing the processing in the x and y directions. Can be processed.

【００２９】次に、個別文字照合回路５の処理の詳細に
ついて説明する。まず、Ｗ_x×Ｎ画素からなる個別文字
候補画像パターンＧを入力し、ｎ（＝Ｎ×Ｎ）画素の正
規化２値文字パターンＸ＝（ｘ₁，ｘ₂，...，
ｘ_i，...，ｘ_n）を求める。このとき、Ｇが画像Ｘの中
心にくるように平行移動しておく。白黒計数回路２を用
いて、この正規化文字パターンＸに対する水平方向走査
の投影分布Ｐ_yを求め、次式で示すｈ_yの総和ＨNext, the processing of the individual character matching circuit 5 will be described in detail. First, an individual character candidate image pattern G composed of W _x × N pixels is input, and a normalized binary character pattern X = (x ₁ , x ₂ ,...) Of n (= N × N) pixels
x _i ,..., x _n ). At this time, translation is performed so that G is at the center of the image X. Using monochrome counting circuit 2 obtains a projection distribution P _y horizontal scanning of the normalized character pattern X, the sum H of h _y represented by the following formula

【００３０】[0030]

【数２】 (Equation 2)

【００３１】を求める。次に、この正規化文字パターン
Ｘを辞書の２値参照パターンＴ＝（ｔ₁，ｔ₂，...，
ｔ_i，...，ｔ_n）と比較するために公知の識別関数であ
る次式の類似度Ｓ（Ｘ，Ｔ）を用いる。Is obtained. Next, this normalized character pattern X is converted into a binary reference pattern T = (t ₁ , t ₂ ,.
For comparison with t _i ,..., t _n ), the similarity S (X, T) of the following equation, which is a known discriminant function, is used.

【００３２】[0032]

【数３】 (Equation 3)

【００３３】ここで、ＡはＸとＴの内積でｘ_i＝１かつ
ｔ_i＝１となる総数、Ｔ’はＴの黒点総数、Ｘ’はＸの
黒点総数である。各文字カテゴリｍの参照パターンに対
して、Ｘとの類似度Ｓ（Ｘ，Ｔ_m）と上記Ｈの値によっ
て決まる参照パターンのしきい値Ｓ_mを用いて、次のよ
うに認識または棄却の判定を行う。このＨとしきい値Ｓ
_mの関係テーブルは参照パターン学習回路６によって予
め学習されている。Here, A is the inner product of X and T and is the total number where x _i = 1 and t _i = 1, T ′ is the total number of black spots of T, and X ′ is the total number of black spots of X. For the reference pattern of each character category m, the similarity S (X, T _m ) to X and the threshold value S _{m of the} reference pattern determined by the value of H are used for recognition or rejection as follows. Make a decision. This H and threshold S
The relation table of _m is learned in advance by the reference pattern learning circuit 6.

【００３４】（条件１）もし、Ｓ（Ｘ，Ｔ_m）＞Ｓ_mなら
ば、この文字カテゴリｍとして認識できたと判定する。（条件２）さもなければ、この文字カテゴリｍではない
と判定する。(Condition 1) If S (X, T _m )> S _m , it is determined that the character category m has been recognized. (Condition 2) Otherwise, it is determined that the character category is not m.

【００３５】すべての参照パターンに対して、複数のカ
テゴリに対して条件１を満たした場合は類似度が最も大
きいカテゴリを認識カテゴリとして出力する。また、条
件１を満たすカテゴリが１つもない場合は、この個別文
字候補画像パターンＧに対しては認識カテゴリがなかっ
たとして棄却する。If the condition 1 is satisfied for a plurality of categories for all reference patterns, the category having the highest similarity is output as the recognition category. If there is no category that satisfies the condition 1, the individual character candidate image pattern G is rejected as having no recognition category.

【００３６】最後に参照パターン学習回路６と画像入出
力回路７を用いて、個別文字照合回路５の辞書の参照パ
ターンを追加・更新する処理を説明する。画像入出力装
置７は個別文字照合回路５から出力される認識文字カテ
ゴリ番号、識別関数値、２値画像パターン等の分類結果
の情報を入力し、これらの情報を表示し、また、マウス
やキーボードなどによって画像情報を編集する。認識結
果に誤認識がある場合や、新規の文字カテゴリの参照パ
ターンを作成する場合は、画像入出力装置７を用いて、
入力の２値画像パターンから、追加、更新すべき認識カ
テゴリｍの２値参照パターンＴ_mを編集・作成する。参
照パターン学習回路６では作成された２値参照パターン
と個別文字照合回路５の辞書の参照パターンを用いて、
この新規のｎ（＝Ｎ×Ｎ）画素からなる正規化２値参照
パターンＴ_mに対する認識棄却判定のしきい値Ｓ_mを学習
する。Finally, a process of adding / updating a reference pattern of the dictionary of the individual character matching circuit 5 using the reference pattern learning circuit 6 and the image input / output circuit 7 will be described. The image input / output device 7 inputs classification character information such as a recognition character category number, a discrimination function value, and a binary image pattern output from the individual character matching circuit 5, displays the information, and displays a mouse or keyboard. Edit the image information by, for example, When there is a misrecognition in the recognition result or when a reference pattern of a new character category is created, the image input / output device 7
From the binary image pattern of input, add, edit, create a binary reference pattern T _m of recognition category m to be updated. The reference pattern learning circuit 6 uses the created binary reference pattern and the reference pattern of the dictionary of the individual character matching circuit 5 to
To learn this new n (= N × N) threshold S _m of recognition rejection judgment for the normalized binary reference pattern T _m including pixels.

【００３７】今、ｎ（＝Ｎ×Ｎ）画素からなる２値のラ
ンダム雑音パターンＲ＝（ｒ₁，ｒ₂，.，ｒ_i，...，
ｒ_n）を作成し、辞書の各２値参照パターンＴに重畳し
て、学習のための入力画像パターンＸを作成する。すな
わち、位置ｉの画素値ｘ_i＝ｔ_iｖｒ_iとなる（ｖ＝論理
和）。新規のカテゴリｍの２値参照パターンＴ_mに対す
る入力画像パターンＸの類似度Ｓ（Ｘ，Ｔ_m）を計算
し、すべての参照パターンの中で、ｍ以外のカテゴリで
最大の類似度をもとめ、その値をＳ_mとして、個別文字
照合回路５の辞書に、新規カテゴリｍの２値参照パター
ンＴ_mとそのしきい値Ｓ_mを格納する。Now, a binary random noise pattern R consisting of n (= N × N) pixels R = (r ₁ , r ₂ ,..., R _i ,.
Create a r _n), superimposed on each binary reference pattern T dictionary to create input image pattern X for learning. That is, the pixel value x _i = t _i vr _{i at} the position i is obtained (v = logical sum). The similarity S (X, T _m ) of the input image pattern X to the binary reference pattern T _m of the new category m is calculated, and among all the reference patterns, the maximum similarity is calculated for the category other than m. the value as S _m, the dictionary of individual characters matching circuit 5, and stores the threshold S _m a binary reference pattern T _m of a new category m.

【００３８】次に、上述した各回路における処理の流れ
を図２〜４に示すフローチャートを参照して説明する。
図１に示す文字認識装置は、文字列領域を抽出する処理
では、図２に示すように、まず、認識対象となる２値文
書画像パターン（Ｍ画素＝横Ｎ_x画素×縦Ｎ_y画素）を入
力する（ステップ１０２）。そして、横と縦の大きさか
ら横書きか縦書きかを判定する（ステップ１０３）。こ
の判定で、例えば、横書きと判定された場合は、水平方
向に走査して、走査線上に各白／黒変化数ａ，ｂ，ｃ，
ｅを計数して射影値ｈ_yを求める（ステップ１０４）。Next, the flow of processing in each circuit described above will be described with reference to the flowcharts shown in FIGS.
Character recognition device shown in FIG. 1 is a process of extracting a character string area, as shown in FIG. 2, first, a binary document image pattern for recognition (M pixels = horizontal N _x pixels × vertical N _y pixels) Is input (step 102). Then, whether horizontal writing or vertical writing is determined from the horizontal and vertical sizes (step 103). In this determination, for example, when horizontal writing is determined, scanning is performed in the horizontal direction, and the number of white / black changes a, b, c,
The projection value h _y is obtained by counting e (step 104).

【００３９】この水平走査が２値文書画像パターン全体
で終了した後に（ステップ１０５）、ｐ値の投影分布Ｐ
_y＝（ｈ₁，ｈ₂，...，ｈ_y，...，ｈ_Nx）を予め設定した
Ｎ区間になるようにＮ等分して、新しい投影分布Ｑ＝
（ｑ₁，ｑ₂，...，ｑ_j，...，ｑ_N）を求める（ステップ
１０６）。投影分布Ｑに対して、予め設定したしきい値
Ｔ_Qよりも大きい領域を文字列画像パターンＷとして求
める（ステップ１０７）。また、ステップ１０４の判定
で縦書きと判定された場合は、垂直方向に走査して、走
査線上の白黒変化数ａ，ｂ，ｃ，ｅを計数して射影値ｖ
_xを求める（ステップ１０８）。After this horizontal scanning is completed for the entire binary document image pattern (step 105), the p-value projection distribution P
_y = (h ₁ , h ₂ ,..., h _y ,..., h _Nx ) is divided into N equal parts so as to have a preset N section, and a new projection distribution Q =
(Q ₁ , q ₂ ,..., Q _j ,..., Q _N ) are obtained (step 106). The projection distribution Q, obtains the region greater than the threshold value T _Q set in advance as the character string image pattern W (step 107). If vertical writing is determined in step 104, scanning is performed in the vertical direction, the number of black and white changes a, b, c, and e on the scanning line is counted, and the projection value v is calculated.
_x is obtained (step 108).

【００４０】この垂直走査が２値文書画像パターン全体
で終了した後に（ステップ１０９）、ｐ値の投影分布Ｐ
_x＝（ｖ₁，ｖ₂，...，ｖ_x，...，ｖ_Nx）を予め設定した
Ｎ区間になるようにＮ等分して、新しい投影分布Ｑ＝
（ｑ₁，ｑ₂，...，ｑ_j，...，ｑ_N）を求める（ステップ
１１０）。投影分布Ｑに対して、予め設定したしきい値
Ｔ_Qよりも大きい領域を文字列画像パターンＷとして求
める（ステップ１０７）。After this vertical scanning is completed for the entire binary document image pattern (step 109), the p-value projection distribution P
_x = (v ₁ , v ₂ ,..., v _x ,..., v _Nx ) is divided into N equal parts so as to have a preset N section, and a new projection distribution Q =
(Q ₁ , q ₂ ,..., Q _j ,..., Q _N ) are obtained (step 110). The projection distribution Q, obtains the region greater than the threshold value T _Q set in advance as the character string image pattern W (step 107).

【００４１】次に、文字列画像パターンＷから個別文字
の候補画像を求める処理は、図３に示すように、まず文
字列画像パターンＷを入力する（ステップ２０２）。例
えば、横書きの文字列画像パターンについて説明すれ
ば、横Ｎ_x×縦Ｗ_y画素の正規化文字列画像パターンＷを
高さがＮ画素になるように縦横等倍で正規化処理を行
い、ｎ_x×Ｎ画素の文字列画像パターンＷ_Nを求める（ス
テップ２０３）。この正規化文字列画像パターンＷ_Nに
対して、ステップ１０８から１０９でのべた処理手順で
垂直方向走査による射影値ｖ_xの投影分布Ｐ_xを求め（ス
テップ２０４）、この投影分布Ｐ_xに対して、予め設定
したしきい値Ｔ_xよりも大きいｘの連続区間［ｘ_s，
ｘ_e］（ｘ_s≦ｘ≦ｘ_e）を求める（ステップ２０５）。Next, in the process of obtaining a candidate image of an individual character from the character string image pattern W, first, as shown in FIG. 3, the character string image pattern W is input (step 202). For example, in the case of a horizontally written character string image pattern, a normalized character string image pattern W of horizontal _Nx × vertical _Wy pixels is subjected to normalization processing in a vertical and horizontal unity so that the height becomes N pixels, and n A character string image pattern W _N of _x × N pixels is obtained (step 203). For this normalized character string image pattern W _N, obtains the projection distribution P _x of projection values v _x by vertically scanning the processing procedure mentioned in steps 108 109 (step 204), with respect to the projection distribution P _x Thus, a continuous section of x greater than a preset threshold T _x [x _s ,
_xe ] ( _xs ≦ x ≦ _xe ) is obtained (step 205).

【００４２】となりあう連続区間から照合するにふさわ
しい文字幅Ｗ_x画素を選び、個別文字の候補画像パター
ンＧを求める（ステップ２０６）。また、個別文字の候
補画像パターンと参照パターンの照合処理においては、
まず、横Ｗ_x×縦Ｎ画素からなる個別文字候補画像パタ
ーンＧを入力する（ステップ３０２）。このＧに対し
て、位置及び大きさの正規化処理を行いｎ（＝Ｎ×Ｎ）
画素の正規化２値文字パターンＸ＝（ｘ₁，ｘ₂，...，
ｘ_i，...，ｘ_n）を求める（ステップ３０３）。このＸ
と文字カテゴリｍの参照パターンＴ_mとの間で公知の類
似度Ｓ（Ｘ，Ｔ_m）を計算し（ステップ３０４）、併せ
てＧに対してステップ１０４及びステップ１０５の処理
手順に従い、水平方向走査の射影値ｈ_yの累積値Ｈを求
める（ステップ３０５）。A character width W _x pixel suitable for collation is selected from the adjacent continuous sections, and an individual character candidate image pattern G is obtained (step 206). In the matching process between the individual character candidate image pattern and the reference pattern,
First, an individual character candidate image pattern G consisting of W _x N pixels is input (step 302). This G is subjected to position and size normalization processing, and n (= N × N)
Pixel normalized binary character pattern X = (x ₁ , x ₂ ,...,
x _i ,..., x _n ) are obtained (step 303). This X
The known similarity S (X, T _m ) is calculated between the reference pattern T _m and the reference pattern T m of the character category m (step 304). The cumulative value H of the scanning projection value h _y is obtained (step 305).

【００４３】そして、この累積値Ｈに対するしきい値を
関係テーブルを参照して、文字カテゴリｍのしきい値Ｓ
_mを求める（ステップ３０６）。類似度Ｓ（Ｘ，Ｔ_m）が
しきい値Ｓ_mを比較し（ステップ３０７）、類似度Ｓ
（Ｘ，Ｔ_m）がしきい値Ｓ_mよりも大きければ、文字カテ
ゴリｍを認識カテゴリと判定し（ステップ３０８）、さ
もなければ棄却と判定する（ステップ３０９）。ステッ
プ３０４からステップ３０８の処理をすべての参照パタ
ーンとの間で行い（ステップ３０９）、終了する。The threshold value for the cumulative value H is determined by referring to the relation table with reference to the threshold value S for the character category m.
_m is obtained (step 306). Similarity S (X, T _m) is compared with a threshold S _m (step 307), the similarity S
If (X, T _m ) is larger than the threshold value S _m , the character category m is determined to be a recognition category (step 308), otherwise it is determined to be rejected (step 309). The processing from step 304 to step 308 is performed for all the reference patterns (step 309), and the process ends.

【００４４】次に、本発明の主要部分である白黒計数回
路２の具体的な動作例として、２値画像パターン記憶回
路１に図５に示すような横２０画素×縦１４画素（Ｎ_x
＝２０，Ｎ_y＝１４）からなる２値文書画像パターンの
場合について説明する。図には「文字」とかかれた文字
列画像パターンの背景に網点のデザイン処理が施された
入力画像パターンを表している。また、５−１および５
−２はそれぞれ水平および垂直方向の走査に対する投影
軸である。Next, as a specific operation example of the monochrome counting circuit 2 which is a main part of the present invention, the binary image pattern storage circuit 1 stores 20 pixels horizontally × 14 pixels vertically (N _x
= 20, N _y = 14) in the case of a binary document image pattern. The figure shows an input image pattern in which a halftone dot design process is performed on the background of a character string image pattern in which “character” is written. 5-1 and 5
-2 is a projection axis for horizontal and vertical scanning, respectively.

【００４５】今、この２値文書画像パターンが読み出さ
れ、白黒計数回路２に入力されたとする。この入力２値
画像パターンに対して、横幅の方が縦の高さよりも大き
いので、横書きと判定して水平方向に文字を走査し、式
１で示したｐ値をそれぞれ、投影軸５−１に投影する。
走査の例として図５で、ｙ＝２における水平方向走査５
−３では、ａ＝０，ｂ＝７，ｃ＝８，ｅ＝４が計数さ
れ、ｙ＝２におけるｐ値はNow, it is assumed that the binary document image pattern has been read out and input to the monochrome counting circuit 2. Since the width of the input binary image pattern is larger than the height of the input binary image pattern, it is determined that the image is written horizontally, and the character is scanned in the horizontal direction. Projected onto
In FIG. 5 as an example of scanning, horizontal scanning 5 at y = 2
At -3, a = 0, b = 7, c = 8, e = 4 are counted, and the p value at y = 2 is

【００４６】[0046]

【数４】 (Equation 4)

【００４７】となる。また、ｙ＝６における水平方向走
査２−３では、ａ＝５，ｂ＝３，ｃ＝４，ｅ＝７が計数
され、ｙ＝６におけるｐ値はIs as follows. In the horizontal scanning 2-3 at y = 6, a = 5, b = 3, c = 4, and e = 7 are counted, and the p value at y = 6 is

【００４８】[0048]

【数５】 (Equation 5)

【００４９】となる。この結果から、網点からなる背景
部分のｐ値は文字線部分の走査で得られるｐ値に比べて
小さい値をとることを示し、式１で示すｐ値が文字部と
背景部分とをわけるのに有効な尺度であることがわか
る。Is as follows. From this result, it is shown that the p-value of the background portion composed of halftone dots is smaller than the p-value obtained by scanning the character line portion, and the p-value represented by Expression 1 separates the character portion from the background portion. It turns out that this is an effective measure.

【００５０】ｙ＝１からｙ＝１４までの水平方向走査で
投影分布Ｐyが求められる。文字列抽出回路３ではｙ＝
１からｙ＝１４までの水平方向走査で投影分布Ｐｙを入
力し、水平方向の走査で得られた投影分布Ｐｙを、予め
設定したＮ区間になるようにＮ等分して、新しい投影分
布Ｑ＝（ｑ1，ｑ2，...，ｑj，...，ｑN）を求める。こ
こで、Ｎ＝１４画素とすると、投影分布Ｑの中で、予め
設定してあるしきい値ＴQ＝０．０以上となる区間は、
ｊ＝２からｊ＝１２までの区間となり、この結果、横２
０画素×縦１１画素の文字列画像パターンＷが抽出され
る。The projection distribution Py is obtained by horizontal scanning from y = 1 to y = 14. In the character string extraction circuit 3, y =
The projection distribution Py is input in the horizontal scanning from 1 to y = 14, and the projection distribution Py obtained by the horizontal scanning is divided into N equal parts so as to have N sections set in advance, and a new projection distribution Q = (Q1, q2, ..., qj, ..., qN). Here, when N = 14 pixels, in the projection distribution Q, advance
The section where the set threshold value TQ = 0.0 or more is
The section is from j = 2 to j = 12.
A character string image pattern W of 0 pixels × 11 pixels vertically is extracted.

【００５１】個別文字抽出回路４では、文字列抽出回路
３で得られた、横２０画素×縦１１画素の文字列画像パ
ターンＷを高さがＷ＝１１画素なるように正規化処理を
行い、２０×１１画素の文字列画像パターンＷ_Nを求め
る。図６に示す例では、この場合Ｗ_x＝Ｎ＝１１画素な
のでＷとＷ_Nは同じ画像パターンになる。この正規化文
字列画像パターンＷ_Nに対して、白黒計数回路２の処理
手順でのべたように、ｘ＝１からｘ＝２０までの垂直方
向走査によるｐ値を投影軸６−１に投影し、投影分布Ｐ
_xを求める。この投影分布Ｐ_xに対して、予め設定したし
きい値Ｔ_X＝−０．２５とした場合、ｖ_j＞Ｔ_xとなるｘ
の連続区間は［ｘ_s＝３，ｘ_e＝２０］となる。図６で６
−２はｘ＝２における垂直方向の走査線の例を示す。The individual character extraction circuit 4 performs a normalization process on the character string image pattern W of 20 horizontal pixels × 11 vertical pixels obtained by the character string extraction circuit 3 so that the height becomes W = 11 pixels. A character string image pattern W _N of 20 × 11 pixels is obtained. In the example shown in FIG. 6, since W _x = N = 11 pixels in this case, W and W _N have the same image pattern. For this normalized character string image pattern W _N, as mentioned in the procedure of white counting circuit 2, and projects the p value by the vertical scanning from x = 1 to x = 20 to the projection axis 6-1 , Projection distribution P
_{Find x} . For this projection distribution P _x, when the threshold value T _X = -0.25 previously set, v _j> T _x become x
Is [x _s = 3, x _e = 20]. 6 in FIG.
-2 indicates an example of a vertical scanning line at x = 2.

【００５２】今、参照パターン中で文字幅がｚ₁＝２画
素とｚ₂＝１１画素の範囲に決められているとすると、
ｘ_e−ｘ_s＞ｚ₂を満たすので、１１×１１画素個別文字
の候補画像パターンＧとして、Ｇの左枠の位置ｘ_s＝２
から１画素単位にずらしてＧの右枠の位置ｘ_e＝２０に
なるまで８個の個別文字候補画像パターンＧが作成され
る。第７図の７−１はＧの左枠がｘ＝３である場合を表
し、７−２はＧの左枠がｘ＝４である場合を表す。Now, assuming that the character width is determined in the range of z ₁ = 2 pixels and z ₂ = 11 pixels in the reference pattern.
Since x _e −x _s > z ₂ is satisfied, the position x _s = 2 of the left frame of G is set as a candidate image pattern G of 11 × 11 pixel individual characters.
, Eight individual character candidate image patterns G are created until the right frame position of G becomes x _e = 20. 7-1 in FIG. 7 shows the case where the left frame of G is x = 3, and 7-2 shows the case where the left frame of G is x = 4.

【００５３】[0053]

【発明の効果】以上説明したように、請求項１記載の発
明によれば、従来技術では抽出が難しい、かすれやよご
れのような雑音や背景に網点などのテクスチャが重畳し
た２値画像パターンや黒文字または白文字からなる２値
画像パターンに対しても、画像パターンを走査して新し
い計数値を用いて、文字列領域を簡易にかつ正しく抽出
することができる。As described above, according to the first aspect of the present invention, a binary image pattern in which texture such as halftone dots is superimposed on noise or background such as blur or dirt, which is difficult to extract with the prior art. Even for a binary image pattern composed of white or black characters or white characters, the character string region can be easily and correctly extracted by scanning the image pattern and using a new count value.

【００５４】請求項２に記載の発明によれば、雑音を含
む文字列画像パターンにこの新規の計数値を適用して、
個別文字候補画像を抽出することができるため、従来の
文字認識法が行っていた、雑音やデザイン除去の前処理
を行わなくてすむだけでなく雑音やデザイン除去に誤っ
て文字ストロークを削除してしまうことがないため、後
段に行われる文字認識を高精度に行うことができるとい
う利点がある。According to the second aspect of the present invention, the new count value is applied to a character string image pattern containing noise,
Because individual character candidate images can be extracted, not only do the pre-processing of noise and design removal, which was performed by the conventional character recognition method, but also character strokes are mistakenly deleted for noise and design removal. There is an advantage that character recognition performed in the subsequent stage can be performed with high accuracy.

【００５５】請求項３記載の発明によれば、個別文字候
補画像パターンと参照パターンとの間で所定の識別関数
を用いて照合し、認識か棄却かを判定する処理におい
て、雑音を含む個別文字候補画像パターンの雑音の程度
を請求項１記載の発明による方法を利用して計測するこ
とで、雑音の程度に応じて識別関数値のしきい値を文字
カテゴリごとに設定できるので、雑音のない文字から雑
音のある文字に対しても認識判定は正しくできる利点が
ある。According to the third aspect of the present invention, in the process of collating between the individual character candidate image pattern and the reference pattern using a predetermined discriminant function and determining whether to recognize or reject, the individual character including noise By measuring the degree of noise of the candidate image pattern using the method according to the first aspect of the present invention, the threshold value of the discriminant function value can be set for each character category according to the degree of noise. There is an advantage that recognition determination can be performed correctly even for characters having noise from characters.

【００５６】。請求項４記載の発明によれば、雑音を含
む文字列画像パターンに対して、雑音背景下で認識が難
しい、文字幅が小さいかまたは文字の高さが低い句読点
や記号などの文字カテゴリに対しても、文字列画像中
で、すでに認識が終了した画像領域を除いた領域とそれ
らの認識結果の情報に着目して、再度、未処理の画像パ
ターンを辞書と比較することで、これらの句読点や記号
も正しく認識できる利点がある。[0056] According to the fourth aspect of the present invention, a character string image pattern including noise is difficult to recognize under a noise background, and a character category such as a punctuation mark or a symbol having a small character width or a low character height is difficult to recognize. However, by paying attention to the area of the character string image excluding the image area for which recognition has already been completed and the information of those recognition results, comparing the unprocessed image pattern with the dictionary again, these punctuation marks And symbols can be recognized correctly.

【００５７】請求項５の発明によれば、雑音を含む文字
列画像パターンに対して、新規に文字カテゴリを追加し
たり、参照パターンを更新する場合に、雑音の程度に応
じて決定される認識判定のしきい値を容易に学習できる
手段を提供しているので、これら新規作成・追加・更新
の処理が容易である利点がある。According to the fifth aspect of the present invention, when a new character category is added to a character string image pattern including noise or a reference pattern is updated, a recognition determined in accordance with the degree of noise. Since the means for easily learning the threshold value of the judgment is provided, there is an advantage that the processing of new creation / addition / update is easy.

[Brief description of the drawings]

【図１】本発明による文字認識装置の構成例を示すブロ
ック図である。FIG. 1 is a block diagram illustrating a configuration example of a character recognition device according to the present invention.

【図２】図１の文字認識装置の文字列領域を抽出するフ
ローチャートである。FIG. 2 is a flowchart for extracting a character string region of the character recognition device of FIG. 1;

【図３】図１の文字認識装置の個別文字の候補画像を抽
出するフローチャートである。FIG. 3 is a flowchart of extracting an individual character candidate image of the character recognition device of FIG. 1;

【図４】図１の文字認識装置の個別文字の候補画像パタ
ーンと参照パターンの照合処理を行うフローチャートで
ある。FIG. 4 is a flowchart illustrating a process of comparing an individual character candidate image pattern with a reference pattern by the character recognition apparatus of FIG. 1;

【図５】本発明による入力の２値画像パターンに対する
白黒計数処理の一例である。FIG. 5 is an example of a monochrome counting process for an input binary image pattern according to the present invention.

【図６】本発明による文字列抽出処理の一例である。FIG. 6 is an example of a character string extraction process according to the present invention.

【図７】本発明による個別文字候補画像パターンの一例
である。FIG. 7 is an example of an individual character candidate image pattern according to the present invention.

[Explanation of symbols]

１…２値画像パターン記憶回路２…白黒計数回路３…文字列抽出回路４…個別文字抽出回路５…個別文字照合回路６…参照パターン学習回路７…画像入出力装置５−１…水平方向走査の投影軸５−３、５−４…水平方向走査の例６−１…垂直方向走査の投影軸６−２…垂直方向走査の例７−１、７−２…個別文字候補画像パターンの例 DESCRIPTION OF SYMBOLS 1 ... Binary image pattern memory circuit 2 ... Black-and-white counting circuit 3 ... Character string extraction circuit 4 ... Individual character extraction circuit 5 ... Individual character collation circuit 6 ... Reference pattern learning circuit 7 ... Image input / output device 5-1 ... Horizontal scanning 5-3, 5-4: Example of horizontal scanning 6-1: Projection axis of vertical scanning 6-2: Example of vertical scanning 7-1, 7-2: Example of individual character candidate image pattern

Claims

(57) [Claims]

1. An input binary image pattern,
By scanning in at least one direction or more in a plurality of directions, the number of black-black changes from black to black, the number of white-white changes from white to white, and the number of black-white changes from black to white on the scan line A black-and-white counting procedure for counting the number of black-and-white changes that change to black; information about the product of the count value for the black-black change number and the count value for the white-white change number obtained by the black-and-white counting procedure; A character string extraction procedure for extracting a character string area using information of a product of a numerical value and a count value regarding the number of black and white changes, and a candidate image of an individual character from the character string area
Individual character candidate image pattern to be extracted
A character recognition method comprising a separate character extraction procedure .

2. A character string image pattern corresponding to a character string region extracted by the character string extraction procedure is scanned in at least one direction in a plurality of directions based on information of the character string image pattern. From the black on the scan line
Counting the white number change changing from Shirajira change number and black-and-white change count and white change from black to white, which changes from black to white black discoloration number and white changes to black to black, the count values for said black black discoloration Number An individual character extraction procedure is provided for extracting an individual character candidate image pattern using information on the product of the count values regarding the number of white-white changes and information on the product of the count value regarding the number of black-white changes and the count value regarding the number of black-and-white changes. 2. The character recognition method according to claim 1, wherein:

3. An individual character candidate image pattern obtained by the individual character extraction procedure is collated by using a predetermined identification function between the individual character candidate image pattern and a reference pattern of a dictionary. Based on the function value and a threshold determined using the information of the individual character candidate image pattern,
3. The character recognition method according to claim 2, further comprising an individual character collation procedure for performing recognition or rejection determination processing.

4. The position of a character category determined to be recognized by the individual character matching procedure with respect to a character string image pattern corresponding to a character string area extracted by the individual character extracting procedure. 4. The character recognition method according to claim 3, further comprising an individual character extraction procedure for extracting an individual character candidate image pattern whose recognition judgment has not been processed in the character string image based on the information of the recognition result and the recognition result.

5. The input binary image pattern information and the individual character collating procedure and at least a character category and 2
Based on the information obtained by the image pattern input / output procedure for inputting / outputting and editing the value image pattern, a predetermined identification function is applied to perform recognition / 5. The character recognition method according to claim 3, further comprising a reference pattern learning procedure for newly creating, adding, and updating a rejection determination threshold value.