JPH06251196A

JPH06251196A - Character segmenting device

Info

Publication number: JPH06251196A
Application number: JP5038093A
Authority: JP
Inventors: Kenji Kurosu; 健二黒須; Hiroshi Yoshida; 浩史吉田
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1993-02-26
Filing date: 1993-02-26
Publication date: 1994-09-09

Abstract

PURPOSE:To accurately perform character segmentation and to reduce a time required for data input and a cost by solving a problem in a conventional character segmenting device to divide one character into plural numbers. CONSTITUTION:When a character cluster segmenting part 131 segments a character cluster from character row image data, a character number judging part 132 verifies axial symmetry for the character cluster, and decides whether or not the number of characters in the character cluster is one. When it is decided that the number of characters in the character cluster is not one, the character cluster is further separated to a character pattern by every character at a contact character separation part 133.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】この発明は文字切り出し装置に関
するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character slicing device.

【０００２】[0002]

【従来の技術】媒体上の文章画像から該文章が持つ文字
情報を正確に得ることができれば、種々の情報処理装置
（例えば、文字情報を記憶する装置や文字情報を音声に
変換する装置等）の実現が可能となる。このような情報
処理装置を実現するためには、媒体上の入力文字列から
得た入力文字列データより、１文字ずつを正確に切り出
す必要がある。しかし、入力文字列によっては文字同士
の間隔が狭いものや、接触するものがあり、分離が困難
である。例えば、欧米の文章においては、隣接した文字
同士が接触する場合が頻出する。このような接触文字を
分離する装置として、特願平０３−２７５４７０に開示
されているものがあった。この文献に記載された従来の
文字切り出し装置では、接触文字は文字同士の接触部分
でくびれている事が多い事に着目し、該くびれ部分で文
字を分離するものであり、以下に具体的に説明する。2. Description of the Related Art Various information processing devices (for example, a device for storing character information or a device for converting character information into speech) if the character information of the sentence can be accurately obtained from a text image on a medium. Can be realized. In order to realize such an information processing apparatus, it is necessary to accurately cut out each character from the input character string data obtained from the input character string on the medium. However, depending on the input character string, there are characters that are close to each other and characters that are in contact with each other, which makes separation difficult. For example, in Western texts, adjacent characters often come into contact with each other. As a device for separating such contact characters, there is one disclosed in Japanese Patent Application No. 03-275470. In the conventional character slicing device described in this document, attention is paid to the fact that contact characters are often constricted at the contact parts between the characters, and the characters are separated at the constricted parts. explain.

【０００３】従来の文字切り出し装置を用いた文字認識
装置のブロック図を図２に示す。文字認識装置２０は、
画像入力部２１、文字行切り出し部２２、文字切り出し
装置２３、文字認識部２４、文字コード出力端子２５か
らなる。文字、図形、記号等（以下文字という）が記載
された帳票からの光信号ｓは画像入力部２１に入力され
る。画像入力部２１は、入力された光信号ｓを文字部は
黒画素、背景部は白画素の白黒に二値化した電気信号
（以下帳票画像データと称する）に光電変換し、該帳票
画像データを文字行切り出し部２２に出力する。文字行
切り出し部２２は、帳票画像データより、１行ずつの文
字行を切り出し（以下文字行画像データと称する）、該
文字行画像データを文字切り出し装置２３に出力する。
文字切り出し装置２３では文字行画像データより個々の
文字を切り出し（以下文字パタンと称する）、該文字パ
タンを文字認識部２４に出力する。文字認識部２４は前
記文字パタンより文字を認識し、文字コードを得、該文
字コードを文字コード出力端子２５に出力する。FIG. 2 is a block diagram of a character recognition device using a conventional character slicing device. The character recognition device 20 is
The image input unit 21, the character line cutout unit 22, the character cutout device 23, the character recognition unit 24, and the character code output terminal 25. An optical signal s from a form in which characters, figures, symbols, etc. (hereinafter referred to as characters) are written is input to the image input unit 21. The image input unit 21 photoelectrically converts the input optical signal s into an electric signal (hereinafter referred to as “form image data”) that is binarized into black and white with black pixels in the character portion and white pixels in the background portion, and the form image data Is output to the character line cutout unit 22. The character line cutout unit 22 cuts out character lines one by one from the form image data (hereinafter referred to as character line image data), and outputs the character line image data to the character cutout device 23.
The character cutting device 23 cuts out individual characters from the character line image data (hereinafter referred to as character patterns), and outputs the character patterns to the character recognition unit 24. The character recognition unit 24 recognizes a character from the character pattern, obtains a character code, and outputs the character code to the character code output terminal 25.

【０００４】以下、従来の文字切り出し装置２３につい
て説明する。従来の文字切り出し装置２３は文字塊切り
出し部２３１、くびれ検出部２３２、文字パタン切り出
し部２３３とを備えている。文字塊切り出し部２３１は
帳票画像データを、垂直方向を主走査方向（以後ｙ方向
という場合もある）、水平方向を副走査方向（以後ｘ方
向という場合もある）として走査し、黒画素の分布を作
成する。さらに、該黒画素の分布が「０」から「１」以
上に変化する位置から、「１」以上から「０」に変化す
る直前の位置までを文字塊として切り出し、該文字塊の
画像データを、くびれ検出部２３２に出力する。くびれ
検出部２３２においては入力された文字塊の画像データ
より、文字線のくびれ部分を検出し、該くびれ部分を分
離点とし、該分離点を文字パタン切り出し部２３３に出
力する。文字パタン切り出し部２３３は前記分離点にて
文字塊の分離を行い、分離した文字パタンを文字認識部
２４に出力する。The conventional character clipping device 23 will be described below. The conventional character cutout device 23 includes a character block cutout unit 231, a constriction detection unit 232, and a character pattern cutout unit 233. The character block clipping unit 231 scans the form image data in the vertical direction as the main scanning direction (hereinafter also referred to as the y direction) and in the horizontal direction as the sub scanning direction (hereinafter also referred to as the x direction) to distribute the black pixels. To create. Further, from the position where the distribution of the black pixels changes from "0" to "1" or more to the position immediately before the change from "1" or more to "0" is cut out as a character block, and the image data of the character block is extracted. , To the necking detection unit 232. The constriction detection unit 232 detects the constricted portion of the character line from the input image data of the character block, sets the constricted portion as a separation point, and outputs the separation point to the character pattern cutout unit 233. The character pattern cutout unit 233 separates the character blocks at the separation points and outputs the separated character patterns to the character recognition unit 24.

【０００５】尚、前記くびれ検出部２３２での文字線の
くびれの検出は以下のように行う。即ち、切り出した文
字塊中の全ての黒画素に対して、水平方向、垂直方向、
左斜め４５゜の方向、右斜め４５゜の方向の連続黒画素
数を数え、式（１）のいずれかを満たしたときに、その
黒画素の位置を文字線幅のくびれ部分として検出する。ＫH ＜ＴH ＫV ＜ＴV ＫL ＜ＴL （１）ＫR ＜ＴR ここで、ＫH、ＫV、ＫL、ＫRはそれぞれ水平方向、垂直
方向、左斜め４５゜の方向、右斜め４５゜の方向の連続
画素数である。また、ＴH、ＴV、ＴL、ＴRはそれぞれ水
平方向、垂直方向、左斜め４５゜の方向、右斜め４５゜
の方向のしきい値であり、予め定めた固定値である。The detection of the constriction of the character line by the constriction detection section 232 is performed as follows. That is, for all black pixels in the extracted character block, the horizontal direction, the vertical direction,
The number of continuous black pixels in the direction of 45 ° to the left and 45 ° to the right is counted, and when any one of the expressions (1) is satisfied, the position of the black pixel is detected as a constricted portion of the character line width. KH <TH KV <TV KL <TL (1) KR <TR where KH, KV, KL, and KR are the number of continuous pixels in the horizontal direction, the vertical direction, the 45 ° left diagonal direction, and the 45 ° right diagonal direction, respectively. Is. Further, TH, TV, TL, and TR are threshold values in the horizontal direction, the vertical direction, the diagonal direction of 45 ° to the left, and the diagonal direction of 45 ° to the right, and are predetermined fixed values.

【０００６】従来の文字切り出し装置を使って接触文字
を分離する一例を図３に示す。図３（ａ）は帳票３１に
記載された文字３２を示したものであり、３３は文字
“ｒ”と“ｓ”との接触部分を表わしている。図３
（ｂ）はこの文字の帳票画像データである。帳票画像デ
ータにおいて黒画素の部分は“０”で示してある（以後
の帳票画像データでは黒画素を“０”で表わす）。くび
れ検出部２３２は図３（ｂ）の３aの部分を分離の候補
として選び、文字パタン切り出し部２３３にてこのくび
れ部分で文字塊を分離し、分離した文字パタンを文字認
識部２４に出力する。FIG. 3 shows an example of separating contact characters using a conventional character cutting device. FIG. 3A shows a character 32 described in the form 31, and 33 represents a contact portion between the characters "r" and "s". Figure 3
(B) is the form image data of this character. In the form image data, black pixels are indicated by "0" (black pixels are indicated by "0" in the following form image data). The constriction detection unit 232 selects the portion 3a in FIG. 3B as a separation candidate, the character pattern cutout unit 233 separates the character block at this constricted portion, and outputs the separated character pattern to the character recognition unit 24. .

【０００７】[0007]

【発明が解決しようとする課題】しかし、従来の文字切
り出し装置では文字塊中の文字数が１文字であっても、
文字線幅にくびれがあれば、該くびれ部分で１文字を複
数の部分に分割してしまうという問題があった。例えば
図４（ａ）に示す文字塊“ｍ”では、従来の文字切り出
し装置はくびれ４aと４bを見つけ、この二点で“ｍ”を
三分割している。“ｍ”の分割結果図４（ｂ）の４cの
部分のみを見ると“ｌ（エル）”の文字と同じ形をして
おり、切り出し結果を修正をするオペレータは、切り出
しの間違いに気がつかず、修正ができない。また、他の
文字塊の分離例を図５に挙げたが、例えば図５（ｃ）の
文字塊“Ｗ”の分割例では、分割結果の５dの部分を
“Ｖ”と間違えてしまう問題点があった。However, in the conventional character slicing device, even if the number of characters in the character block is one,
If the character line width has a constriction, there is a problem that one character is divided into a plurality of parts at the constricted part. For example, in the character block "m" shown in FIG. 4 (a), the conventional character segmentation device finds the constrictions 4a and 4b, and divides "m" into three at these two points. Result of division of "m" Looking only at the part 4c in Fig. 4 (b), it has the same shape as the character of "l (el)", and the operator who corrects the cutting result does not notice the mistake of cutting. , I can't fix it. Further, although an example of separating another character block is given in FIG. 5, for example, in the division example of the character block “W” in FIG. 5C, there is a problem that the 5d part of the division result is mistaken for “V”. was there.

【０００８】以上のように従来の文字切り出し装置で
は、１文字を複数の部分に分割してしまうことにより、
誤った切り出し結果が生じ、正確なデータ入力が出来な
いという問題点があった。さらに誤った切り出し結果を
直すにも、オペレータによる修正作業が必要であるが、
この作業は膨大な切り出し結果の中から誤った切り出し
結果を見つけるもので、煩雑であり時間と費用を要する
ものであった。このため、文字切り出し装置本来の目的
であるデータ入力時間の短縮、それに要するコストの低
減をはかれないという問題点もあった。さらにこの文字
切り出し装置を文字認識装置に適用した場合には、間違
った認識結果の修正が必要だが、この作業には時間と費
用がかかるものであった。As described above, in the conventional character slicing device, by dividing one character into a plurality of parts,
There was a problem that incorrect data could not be output due to incorrect cutting results. In addition, the operator needs to make corrections to correct the incorrect cutout result.
This work is to find an erroneous cutting result from a huge amount of cutting results, which is complicated and requires time and cost. Therefore, there is a problem in that the original purpose of the character segmentation device, namely, the data input time and the cost required therefor cannot be reduced. Further, when the character slicing device is applied to a character recognizing device, it is necessary to correct an erroneous recognition result, but this work takes time and cost.

【０００９】本発明では、１文字を複数に分割してしま
うという従来の文字切り出し装置の問題点を解決し、正
確に文字切り出しを行い、データ入力に要する時間とコ
ストを低減する事、さらに前述の種々の情報処理装置、
例えば高性能な文字認識装置を提供することを目的とす
る。According to the present invention, the problem of the conventional character slicing device that one character is divided into a plurality of characters is solved, the character is accurately segmented, and the time and cost required for data input are reduced. Various information processing devices,
For example, it is an object to provide a high-performance character recognition device.

【００１０】[0010]

【課題を解決するための手段】この発明は、前記課題を
解決する為に、文字行画像データから文字塊を切り出す
文字塊切り出し部と、前記文字塊について線対称性を検
証し、前記文字塊中の文字数が１文字であるか否かを判
定する文字数判断部と、前記文字数判断部より当該文字
塊中の文字数が１文字でないと判定された場合に当該文
字塊を更に１文字ずつの文字パタンに分離する接触文字
分離部とを具備したことを特徴とする。SUMMARY OF THE INVENTION In order to solve the above problems, the present invention provides a character block cutout section for cutting out a character block from character line image data and line symmetry of the character block to verify the character block. A character number determination unit that determines whether or not the number of characters in the character block is one character; and if the character number determination unit determines that the number of characters in the character block is not one character It is characterized in that it is provided with a contact character separating section for separating into patterns.

【００１１】[0011]

【作用】図６及び図７は本発明の文字切り出し装置の原
理説明図であって、以下本発明の文字切り出し装置の原
理説明を行う。本発明の文字切り出し装置は、文字数判
断部において、文字行画像データより切り出した文字塊
の線対称性を検証し、これにより前記文字塊中の文字数
が１文字であるか否かを判断し、前記判断結果に基づい
て、前記文字塊を分離するか否かを選択することを特徴
とする。このように文字数判断部により、従来の文字切
り出し装置の１文字を分割してしまう問題点を解決し、
正確な文字切り出しを実現するものである。6 and 7 are explanatory views of the principle of the character cutting device of the present invention. The principle of the character cutting device of the present invention will be described below. The character slicing device of the present invention, in the character number determination unit, verifies the line symmetry of the character block cut out from the character line image data, thereby determining whether or not the number of characters in the character block is one character, It is characterized in that whether to separate the character block is selected based on the determination result. Thus, the problem of dividing one character of the conventional character slicing device by the character number determination unit is solved,
It realizes accurate character segmentation.

【００１２】文字数の判断に文字の線対称性を用いる根
拠は、以下の通りである。欧文文字で文字の中心軸に対
して左右線対称な形のものが他の文字と接触した場合を
考えると、その文字塊の形は左右に線対称にはならな
い。いくつかの例を図６及び図７に挙げる。例えば図６
（ａ）では、“ｗ”と“ｉ”が接触している。“ｗ”と
“ｉ”は共に左右に線対称だが、接触した文字塊の形は
線対称ではない。そのほかにも線対称な文字を含む接触
文字の例を図６（ｂ）以降図７（ａ）〜（ｃ）に挙げた
が、どの文字塊も線対称ではない。また、欧文文字には
文字の中心を軸として、左右線対称なもの（ｉ，ｏ，
ｕ，ｖ，Ｍ，Ａ等）と、ほぼ線対称なもの（Ｂ、Ｃ、
Ｄ、Ｑ、ａ、ｅ、ｆ等）が多い。以上の事から、切り出
した文字塊の形を見て左右に線対称ならば、その文字塊
が含む文字数は１だといえる。The rationale for using the line symmetry of characters to judge the number of characters is as follows. Considering the case where a Western character that is symmetrical with respect to the central axis of a character contacts another character, the shape of the character block is not symmetrical with respect to the left and right. Some examples are given in FIGS. 6 and 7. For example, in FIG.
In (a), "w" and "i" are in contact. Both “w” and “i” are line-symmetrical to the left and right, but the shapes of the contacted character blocks are not line-symmetrical. In addition, examples of contact characters including line-symmetrical characters are shown in FIG. 6 (b) and subsequent FIGS. 7 (a) to 7 (c), but no character block is line-symmetrical. In addition, Roman characters have left-right line symmetry (i, o,
u, v, M, A, etc.) and a line symmetry (B, C,
D, Q, a, e, f, etc.). From the above, it can be said that the number of characters contained in the character block is 1 if the cut-out character block has line symmetry to the left and right.

【００１３】[0013]

【実施例】本発明の文字切り出し装置を用いた文字認識
装置の一実施例を図１（ａ）に示す。文字認識装置１０
は、画像入力部１１、文字行切り出し部１２、文字切り
出し装置１３、文字認識部１４、文字コード出力端子１
５から成る。以下、本発明の文字切り出し装置を文字認
識装置に用いた実施例について説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS FIG. 1A shows an embodiment of a character recognition device using the character cutting device of the present invention. Character recognition device 10
Is an image input unit 11, a character line cutout unit 12, a character cutout device 13, a character recognition unit 14, and a character code output terminal 1.
It consists of 5. Hereinafter, an example in which the character segmentation device of the present invention is used in a character recognition device will be described.

【００１４】文字が記載された帳票からの光信号ｓを画
像入力部１１に入力する。画像入力部１１は、光信号ｓ
を光電変換し、帳票画像データを得、該帳票画像データ
を文字行切り出し部１２に出力する。文字行切り出し部
１２は、帳票画像データより、１行ずつの文字行を切り
出し、文字行画像データを得、該文字行画像データを文
字切り出し装置１３に出力する。文字切り出し装置１３
では前記文字行画像データから一文字ずつの文字を切り
出し、文字パタンを得、該文字パタン文字認識部１４に
出力する。文字認識部１４では、前記文字パタンより、
文字コードを得、該文字コードを文字コード出力端子１
５に出力する。An optical signal s from a form in which characters are written is input to the image input unit 11. The image input unit 11 uses the optical signal s
Is photoelectrically converted to obtain form image data, and the form image data is output to the character line cutout unit 12. The character line cutout unit 12 cuts out character lines one by one from the form image data, obtains the character line image data, and outputs the character line image data to the character cutout device 13. Character cutting device 13
Then, each character is cut out from the character line image data to obtain a character pattern, and the character pattern is output to the character recognition unit 14. In the character recognition unit 14, from the character pattern,
Obtain the character code, and output the character code to the character code output terminal 1
Output to 5.

【００１５】以下、図１（ｂ）の文字切り出し装置１３
について説明する。文字切り出し装置１３は文字塊切り
出し部１３１、文字数判断部１３２、接触文字分離部１
３３から成る。文字行画像データを文字塊切り出し部１
３１に入力し、文字塊切り出し部１３１は文字行画像デ
ータをｙ方向とｘ方向に走査し、黒画素の分布を作成す
る。さらに、該黒画素の分布が「０」から「１」以上に
変化する位置から、「１」以上から「０」に変化する直
前の位置までを文字塊として切り出し、該文字塊の画像
データを、文字数判断部１３２に出力する。文字数判断
部１３２では文字塊の形が線対称であるか否かを判断
し、線対称と判断した文字塊の画像データは、該文字塊
の画像データを文字認識部１４に出力する。形が線対称
でないと判断した文字塊については、該文字塊の画像デ
ータを接触文字分離部１３３に出力する。接触文字離部
１３３は、従来の文字切り出し装置２３と同じ方法で接
触文字のくびれ部分を見つけ、該くびれ部分で接触文字
を分離し、分離した文字パタンを文字認識部１４に出力
する。Hereinafter, the character extracting device 13 shown in FIG.
Will be described. The character cutout device 13 includes a character block cutout unit 131, a character number determination unit 132, and a contact character separation unit 1.
It consists of 33. Character block image data extraction unit 1
31. The character block cutout unit 131 scans the character line image data in the y direction and the x direction to create a black pixel distribution. Further, from the position where the distribution of the black pixels changes from "0" to "1" or more to the position immediately before the change from "1" or more to "0" is cut out as a character block, and the image data of the character block is extracted. , To the character number determination unit 132. The character number determination unit 132 determines whether or not the shape of the character block is line-symmetrical, and the image data of the character block determined to be line-symmetrical is output to the character recognition unit 14. For a character block whose shape is determined not to be line symmetrical, the image data of the character block is output to the contact character separation unit 133. The contact character separation unit 133 finds the constricted portion of the contact character by the same method as the conventional character cutting device 23, separates the contact character at the constricted portion, and outputs the separated character pattern to the character recognition unit 14.

【００１６】前記文字数判断部１３２は文字塊の線対称
性の判断を以下のように行う。まず、文字塊の左右の
“縁”にあたる黒画素の位置を検索する。この処理は、
文字塊の画像データ上で、走査線のｙ値を一定としてｘ
の正の方向に走査したとき黒画素の最初にあらわれる位
置のｘ座標と、黒画素の最後に現れる位置のｘ座標を検
出することである。該処理を走査線のｙ値を文字塊のｙ
方向の最小値ｙ＝ｙminからｙ方向の最大値ｙ＝ｙmaxま
で変えて行い、前記のｘ座標をｘl［ｉ］、ｘr［ｉ］
（ｉ＝１，２，・・，ｎ；ｎ＝ｙmax−ｙmin＋１（ｎは
文字塊の高さに相当する））として、記憶する。次に、
式（２）のｗ、Ｌ、Ｈの値を求める。式（２）におい
てｔの値は書体などにより変え得るパラメータであり、
０＜ｔ≦０．５の範囲の値をとる。ｗ＝ｘmax − ｘmin Ｌ＝ｘmin ＋ｔ × ｗ（２）Ｈ＝ｘmax − ｔ × ｗ但し、ｘminは文字塊のｘ方向の最小値であり、ｘmaxは
文字塊のｘ方向の最大値である。The character number determination unit 132 determines the line symmetry of the character block as follows. First, the positions of the black pixels corresponding to the “edges” on the left and right of the character block are searched. This process
On the image data of a character block, x is assumed to be a constant y value of the scanning line.
Is to detect the x-coordinate of the position that first appears in the black pixel and the x-coordinate of the position that appears last in the black pixel when scanned in the positive direction. The y value of the scanning line is set to the y of the character block
The minimum value in the direction y = ymin is changed to the maximum value in the y direction y = ymax, and the x coordinate is xl [i], xr [i].
(I = 1, 2, ..., N; n = ymax-ymin + 1 (n corresponds to the height of the character block)) and is stored. next,
The values of w, L, and H in equation (2) are calculated. In equation (2), the value of t is a parameter that can be changed depending on the typeface, etc.
It takes a value in the range of 0 <t ≦ 0.5. w = xmax−xmin L = xmin + t × w (2) H = xmax−t × w where xmin is the minimum value of the character block in the x direction and xmax is the maximum value of the character block in the x direction.

【００１７】次に、文字塊の左縁ｘl［ｉ］と右縁ｘr
［ｉ］の中点をｃ［ｉ］とする（式（３）参照）。ｘ＝
Ｌとｘ＝Ｈの間に、ｃ［ｉ］が幾つあるかを数え（式
（４）参照）、その数をｍとする。ｍが式（５）の条件
を満足したときその文字は線対称と判断する。ｃ［ｉ］＝（ｘl［ｉ］＋ｘr［ｉ］）／２（i=1,2,…,n）（３）Ｌ ≦ ｃ［ｉ］≦Ｈ（i=1,2,…,n）（４）ｍ ≧ ａ × ｎ（ｎ＝ｙmax−ｙmin＋１）（５）但し、ａは文字の書体などにより適当な値をとる固定値
（０＜ａ≦１）である。Next, the left edge xl [i] and the right edge xr of the character block are
Let the midpoint of [i] be c [i] (see equation (3)). x =
The number of c [i] between L and x = H is counted (see the equation (4)), and the number is m. When m satisfies the condition of Expression (5), the character is determined to be line symmetric. c [i] = (xl [i] + xr [i]) / 2 (i = 1,2, ..., n) (3) L ≦ c [i] ≦ H (i = 1,2, ..., n) (4) m ≧ a × n (n = ymax−ymin + 1) (5) However, a is a fixed value (0 <a ≦ 1) that takes an appropriate value depending on the typeface of the character.

【００１８】以上の文字数判断部１３２の処理を、図４
の文字塊“ｍ”の場合について具体的に説明する。前記
文字塊“ｍ”において、ｘl［ｉ］、ｘr［ｉ］を求める
様子を、文字パタンに座標を付けた図８を用いて説明す
る。線７４は走査線を表す。該走査線にて文字塊の画像
データをｘの正方向に走査すると、最初に現れる黒画素
は７aで、最後に現れる黒画素は７bである。この場合、
前記走査線７４のｙ座標は、文字塊のｙ方向の最小値ｙ
min（線７３のｙ座標）より１０大きいので、黒画素７a
のｘ座標を数列ｘl［１１］に、黒画素７bのｘ座標をｘ
r［１１］に記憶させる。該処理を走査線のｙ値をｙmin
から文字塊のｙ方向の最大値ｙmaxまで変えて行う。The above-described processing of the character number determination unit 132 is shown in FIG.
The case of the character block “m” will be specifically described. How to obtain xl [i] and xr [i] in the character block “m” will be described with reference to FIG. 8 in which character patterns are coordinated. Line 74 represents a scan line. When the image data of the character block is scanned in the positive x direction by the scanning line, the first black pixel that appears is 7a and the last black pixel that appears is 7b. in this case,
The y coordinate of the scanning line 74 is the minimum value y in the y direction of the character block.
Black pixel 7a because it is 10 larger than min (y coordinate of line 73)
X-coordinate of the black pixel 7b in the sequence xl [11]
Store in r [11]. The y value of the scanning line is processed by ymin
To the maximum value ymax in the y direction of the character block.

【００１９】文字数判断部１３２での図８の文字塊の対
称性の判断は、以下のようになる。図８より前記ｘmi
n、ｘmax、ｙmin、ｙmax、ｗ、ｎの値は、ｘmin（線７１のｘ値）＝９、ｘmax（線７２のｘ値）
＝４０ｙmin（線７３のｙ値）＝１７、ｙmax（線７５のｙ値）
＝３５となるため、ｗ＝ｘmax − ｘmin ＝３１ｎ＝ｙmax − ｙmin ＋１＝１９となり、式（２）に於て、ｔの値を１／３、式（５）で
のａの値を４／５とすると、Ｌ、Ｈの値は、Ｌ＝９＋３１ × １／３＝１９Ｈ＝４０ − ３１ × １／３＝２９となる。ｘ＝Ｌとｘ＝Ｈの文字塊“ｍ”の画像データ
に対する位置を図９に示す。図９において、線８２のｘ
値がＬの値、線８３のｘ値がＨの値である。線８１のｘ
値がｘmin、線８４のｘ値がｘmax、線８５のｙ値がｙmi
n、線８６のｙ値がｙmaxを示す（図９ではｘl［ｉ］と
ｘr［ｉ］にあたる黒画素を“＃”で表わす）。図９で
はｎ＝１９、ｍ＝１９（つまり図９において、ｃ［ｉ］
の点全てが線８２と線８３の間にある）なので式（５）
の条件を満たす。The determination of the symmetry of the character block of FIG. 8 by the character number determination unit 132 is as follows. From FIG. 8, the xmi
The values of n, xmax, ymin, ymax, w, n are: xmin (x value of line 71) = 9, xmax (x value of line 72)
= 40 ymin (y value of line 73) = 17, ymax (y value of line 75)
= 35, w = xmax−xmin = 31 n = ymax−ymin + 1 = 19, so that in equation (2), the value of t is 1/3, and the value of a in equation (5) is 4 If it is set to / 5, the values of L and H will be L = 9 + 31 * 1/3 = 19H = 40-31 * 1/3 = 29. FIG. 9 shows the positions of the character block “m” of x = L and x = H with respect to the image data. In FIG. 9, line x
The value is an L value, and the x value of the line 83 is an H value. X on line 81
The value is xmin, the x value of line 84 is xmax, and the y value of line 85 is ymi.
The y value of n and the line 86 indicates ymax (black pixels corresponding to xl [i] and xr [i] are represented by "#" in FIG. 9). In FIG. 9, n = 19 and m = 19 (that is, in FIG. 9, c [i]
Since all the points of are between the line 82 and the line 83), the formula (5)
Satisfy the condition of.

【００２０】以上のように、文字塊“ｍ”は式（５）の
条件を満たすので文字数判断部１３２は、前記文字塊
“ｍ”を線対称な１文字と判断し、接触文字の分離処理
を行わない。よって、文字数判断部１３２は、前記文字
塊“ｍ”の文字パタンをそのまま文字認識部１４に出力
する。また、図５（ａ）の文字塊“ｕ”では式（５）ｍ
の値は２４、ｎの値は２４、図５（ｂ）の文字塊“ｎ”
では式（５）ｍの値は２７、ｎの値は２７、図５（ｃ）
の文字塊“Ｗ”では式（５）ｍの値は２６、ｎの値は２
６で、いずれも式（５）の条件を満たす。よって、文字
数判断部１３２はこれらの文字に対しては接触文字分離
の処理を行わず、文字パタンをそのまま文字認識部１４
に出力する。As described above, since the character block "m" satisfies the condition of the expression (5), the character number determination unit 132 determines that the character block "m" is one line-symmetrical character, and the contact character separation processing is performed. Do not do. Therefore, the character number determination unit 132 outputs the character pattern of the character block “m” to the character recognition unit 14 as it is. Further, in the character block “u” in FIG.
Is 24, the value of n is 24, and the character block “n” in FIG.
Then, in Expression (5), the value of m is 27, the value of n is 27, and FIG.
In the character block “W” of formula (5), the value of m is 26 and the value of n is 2
6 all satisfy the condition of Expression (5). Therefore, the character number determination unit 132 does not perform contact character separation processing on these characters, and the character pattern is used as it is.
Output to.

【００２１】また、図１０に“ｐ”と“ｅ”の接触文字
の文字塊に、本発明の文字切り出し装置を用いた結果を
示した。図１０に示した文字パタンの場合、式（５）の
ｍの値は２５、ｎの値は３４で、式（５）の条件を満た
さない。よって、文字数判断部１３２は前記文字塊の画
像データを接触文字分離部１３３に出力する。接触文字
分離部１３３は、文字塊をくびれの位置９aで分離し、
分離した文字パタンを文字認識部１４に出力する。FIG. 10 shows the result of using the character slicing device of the present invention for a character block of contact characters "p" and "e". In the case of the character pattern shown in FIG. 10, the value of m in expression (5) is 25 and the value of n is 34, which does not satisfy the condition of expression (5). Therefore, the character number determination unit 132 outputs the image data of the character block to the contact character separation unit 133. The contact character separation unit 133 separates the character block at the constricted position 9a,
The separated character pattern is output to the character recognition unit 14.

【００２２】文字認識部１４の動作を以下に説明する。
まず、文字パタンより文字平均線幅を算出する。前記文
字平均線幅の算出は、文字パタンを２×２の窓で走査し
たときに２×２の窓の全ての点が黒画素となる点の個数
Ｑと、入力文字パタンの全ての黒画素数Ａとを計数し、
式（６）に示す文字平均線幅の近似式に基づいて文字パ
タン中の文字の文字平均線幅Ｗを算出することにより行
う。Ｗ＝Ａ／（Ａ−Ｑ）（６）The operation of the character recognition unit 14 will be described below.
First, the character average line width is calculated from the character pattern. The calculation of the character average line width is performed by calculating the number Q of points at which all the points in the 2 × 2 window are black pixels when the character pattern is scanned through the 2 × 2 window, and all the black pixels in the input character pattern. Count the number A and
This is performed by calculating the character average line width W of the character in the character pattern based on the approximate expression of the character average line width shown in the equation (6). W = A / (A-Q) (6)

【００２３】次に、文字パタンより水平、垂直、左斜
め、右斜めの４方向の線素を抽出した４個のサブパタン
を抽出する。前記、サブパタン抽出の処理は、例えば水
平サブパタンの場合は、文字パタンを水平方向に走査し
黒画素の連続を検出し、黒画素の連続数ＬHが式（７）
を満足するときに当該黒画素の連続を水平方向のサブパ
タンとして抽出するものである。同様に式（７）より垂
直、左斜め、右斜めサブパタンの抽出も行う。ＬH ＞２ × ＷＬV ＞２ × ＷＬL ＞２^1/2 × Ｗ（７）ＬR ＞２^1/2 × Ｗ但し、ＬH、ＬV、ＬL、ＬRは各々水平、垂直、左斜め、
右斜め方向の連続黒画数である。Next, four sub-patterns, which are line elements extracted in four directions of horizontal, vertical, left diagonal, and right diagonal, are extracted from the character pattern. In the sub pattern extraction process, for example, in the case of a horizontal sub pattern, the character pattern is scanned in the horizontal direction to detect the succession of black pixels, and the succession number LH of black pixels is calculated by the formula (7).
When the above condition is satisfied, the succession of black pixels is extracted as a horizontal sub-pattern. Similarly, the vertical, left diagonal, and right diagonal sub-patterns are also extracted from the equation (7). RH> 2 x W LV> 2 x W LL> 2 ^1/2 x W (7) LR> 2 ^1/2 x W where LH, LV, LL, and LR are horizontal, vertical, diagonal to the left,
The number of continuous black strokes in the right diagonal direction.

【００２４】次に前記水平、垂直、左斜め、右斜めのサ
ブパタンを、小領域に分割し各サブパタンの各領域の黒
画素数を計数し、前記黒画素計結果及び前記平均線幅よ
り、式（８）に基づいて水平、垂直、左斜め及び右斜め
の特徴マトリクスを抽出する。ＫH（ｍ，ｎ）＝ＢH（ｍ，ｎ）／ＷＫV（ｍ，ｎ）＝ＢV（ｍ，ｎ）／ＷＫL（ｍ，ｎ）＝ＢL（ｍ，ｎ）／Ｗ（８）ＫR（ｍ，ｎ）＝ＢR（ｍ，ｎ）／Ｗ但し、ＫH、ＫV，ＫL、ＫRは各々水平、垂直、左斜め、
右斜めの特徴マトリクス、ＢH、ＢV，ＢL、ＢRは各々水
平、垂直、左斜め、右斜めの黒画素マトリクス、（ｍ，
ｎ）は各マトリクスの要素番号である前記小領域の分割
は本実施例においては、入力文字パタン外接枠を水平、
垂直方向に５等分して作成される５×５の２５の小領域
に分割をするものとする。Next, the horizontal, vertical, left diagonal, and right diagonal sub-patterns are divided into small areas, the number of black pixels in each area of each sub-pattern is counted, and from the black pixel meter result and the average line width, Based on (8), the horizontal, vertical, diagonal left and diagonal right feature matrices are extracted. KH (m, n) = BH (m, n) / W KV (m, n) = BV (m, n) / W KL (m, n) = BL (m, n) / W (8) KR ( m, n) = BR (m, n) / W However, KH, KV, KL, and KR are horizontal, vertical, left diagonal,
The right diagonal feature matrix, BH, BV, BL, and BR are horizontal, vertical, left diagonal, and right diagonal black pixel matrices, respectively (m,
In this embodiment, n) is the element number of each matrix. In the present embodiment, the input character pattern circumscribing frame is horizontal,
It is assumed that the image is divided into 25 small areas of 5 × 5, which are created by dividing the area into 5 in the vertical direction.

【００２５】そして文字認識部１４においては、前記特
徴マトリクスと、予め文字認識部１４内に備えた辞書内
の標準文字の特徴マトリクスとを式（９）に基づいて照
合し、式（９）の距離値ｄが最も小さくなる標準文字の
文字コードを、当該入力文字パタンの認識結果として文
字コード出力端子１５より出力するものである。ｄ＝（Σ（ｇi−ｋi）²）^1/2 （９）但し、ｇiは標準文字の特徴マトリクスの要素、ｋiは入
力文字パタンの特徴マトリクスの要素である。Then, in the character recognition unit 14, the feature matrix is collated with the feature matrix of the standard characters in the dictionary provided beforehand in the character recognition unit 14 based on the formula (9), and the formula (9) The character code of the standard character having the smallest distance value d is output from the character code output terminal 15 as the recognition result of the input character pattern. d = (Σ (gi-ki) ² ) ^1/2 (9) where gi is an element of the standard character feature matrix and ki is an element of the input character pattern feature matrix.

【００２６】以上の説明では、欧文文字に本発明の文字
切り出し装置を用いた場合について説明したが、他の文
字、記号であってもこの手法が有効であることは明かで
ある。In the above description, the case where the character slicing device of the present invention is used for Roman characters has been described, but it is clear that this method is effective even for other characters and symbols.

【００２７】[0027]

【発明の効果】以上詳細に説明したように、本発明の文
字切り出し装置によれば、欧文文字に左右が線対称な文
字が多いことを利用して、切り出した文字塊に含まれる
文字数が１文字か否かを判断し、該判断に基づき文字線
幅のくびれ部分で接触文字の分離を行う。従って、誤っ
て１文字を複数の部分に分割すること無く正確な文字切
り出しが出来る。また本発明によれば、正確な文字切り
出し、データ入力ができるためデータ入力に要する修正
作業、時間と費用の低減がはかれる。よって、種々の情
報処理装置の実現が可能となる。As described above in detail, according to the character slicing device of the present invention, the number of characters included in a sliced character block is 1 by utilizing the fact that there are many characters which are line-symmetrical in the Roman alphabet. Whether or not it is a character is determined, and based on the determination, the contact character is separated at the constricted portion of the character line width. Therefore, accurate character cutting can be performed without accidentally dividing one character into a plurality of parts. Further, according to the present invention, since accurate character cutting and data input can be performed, correction work required for data input, time and cost can be reduced. Therefore, various information processing devices can be realized.

[Brief description of drawings]

【図１】本発明の文字切り出し装置を組み込んだ文字認
識装置の実施例を示すブロック図である。FIG. 1 is a block diagram showing an embodiment of a character recognition device incorporating a character clipping device of the present invention.

【図２】従来の文字切り出し装置を組み込んだ文字認識
装置の実施例を示すブロック図である。FIG. 2 is a block diagram showing an embodiment of a character recognition device incorporating a conventional character cutting device.

【図３】従来の文字切り出し装置による欧文接触文字の
分離の説明図である。FIG. 3 is an explanatory diagram of separation of European-language contact characters by a conventional character clipping device.

【図４】従来の文字切り出し装置による１文字の分割例
を示す図である。FIG. 4 is a diagram showing an example of dividing one character by a conventional character cutting device.

【図５】従来の文字切り出し装置による１文字の分割例
を示す図である。FIG. 5 is a diagram showing an example of dividing one character by a conventional character cutting device.

【図６】接触文字で左右線対称の文字を含む例を示す図
である。FIG. 6 is a diagram showing an example in which touch characters include left-right symmetrical characters.

【図７】接触文字で左右線対称の文字を含む例を示す図
である。FIG. 7 is a diagram showing an example in which touch characters include left-right symmetrical characters.

【図８】文字塊の左端ｘl［ｉ］と右端ｘr［ｉ］の求め
方の説明図である。FIG. 8 is an explanatory diagram of how to obtain a left end xl [i] and a right end xr [i] of a character block.

【図９】文字塊の対称性の判断方法の説明図である。FIG. 9 is an explanatory diagram of a method of determining the symmetry of a character block.

【図１０】接触文字を分離する例を示す図である。FIG. 10 is a diagram showing an example of separating contact characters.

[Explanation of symbols]

１０文字認識装置１１画像入力部１２文字行切り出し部１３本発明の文字切り出し装置１４文字認識部１５文字コード出力端子１３１文字塊切り出し部１３２文字数判断部１３３接触文字分離部 DESCRIPTION OF SYMBOLS 10 character recognition device 11 image input unit 12 character line cutout unit 13 character cutout device 14 character recognition unit 15 character recognition unit 15 character code output terminal 131 character block cutout unit 132 character number determination unit 133 contact character separation unit

Claims

[Claims]

1. A character block cutout unit for cutting out a character block from character line image data, and a character number determination for verifying line symmetry with respect to the character block and determining whether or not the number of characters in the character block is one character. And a contact character separation unit that further separates the character block into character patterns one by one when the character number determination unit determines that the number of characters in the character block is not one character. Character cutting device.

2. The character slicing device according to claim 1, wherein the line symmetry verification detects a left edge pixel and a right edge pixel from image data of a character block for each scanning line in the sub-scanning direction, A character slicing device characterized by performing distribution of the midpoint positions of the two pixels.