JPH02195430A - Character segmenting circuit - Google Patents

Character segmenting circuit

Info

Publication number
JPH02195430A
Authority
JP
Japan
Prior art keywords
white
memory
character
scanning line
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP1549689A
Other languages
Japanese (ja)
Inventor
Yuji Shinozaki
祐司 篠崎
Toshifumi Yamauchi
山内 俊史
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1989-01-24
Filing date
1989-01-24
Publication date
1990-08-02
Application filed by NEC Corp
Priority to JP1549689A
Publication of JPH02195430A
Legal status: Pending

Landscapes

  • Executing Machine-Instructions (AREA)

Abstract

PURPOSE: To separate overlapping characters that cannot be separated by the projection method, and to segment the separated characters, by providing the circuit with a segmenting means that cuts out a character pattern using separation position information obtained for each scanning line. CONSTITUTION: An address generating circuit 21 in the character segmenting circuit is connected to a memory 11 through a bus 71 and generates addresses for partially scanning an input image stored in the memory 11. A changing point detecting part 31 is connected to memories 11 and 12 through buses 71 and 72; it detects the points at which the data read out from the memory 11 changes from black to white and from white to black, and stores in the memory 12 the positional information of each white block obtained from the addresses of the changing points. A structure extracting part 41 is connected to the memories 12 and 13 through the buses 72 and 73; it reads out the positional information of the white blocks from the memory 12, extracts contact information between the white blocks by deciding whether each pair of white blocks is in contact, and stores it in the memory 13.

Description

DETAILED DESCRIPTION OF THE INVENTION

Field of Industrial Application
The present invention relates to a character segmenting circuit for a character recognition device, and in particular to a technique for segmenting characters.

Prior Art
As shown in FIG. 2, the character segmenting circuit of a conventional character reader extracts the projection pattern of the input binary image, judges each projected black pattern to be valid or invalid from character frame position information given in advance, and cuts out the pattern in the range where valid black blocks exist as the character pattern.
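The projection method described above can be sketched as follows. This is a minimal illustration, not the circuit of FIG. 2: the function names `column_projection` and `black_blocks` are hypothetical, black pixels are assumed to be 1 and white pixels 0, and the valid/invalid judgment against the character frame position information is omitted.

```python
def column_projection(image):
    """Count the black pixels (1) in each column of a binary image (list of rows)."""
    return [sum(column) for column in zip(*image)]


def black_blocks(profile):
    """Return (start, end) column ranges, end exclusive, where the profile is non-zero."""
    blocks, start = [], None
    for x, count in enumerate(profile):
        if count and start is None:
            start = x                      # a projected black block begins
        elif not count and start is not None:
            blocks.append((start, x))      # the block ends at the first empty column
            start = None
    if start is not None:                  # block reaches the right edge
        blocks.append((start, len(profile)))
    return blocks


image = [
    [1, 1, 0, 0, 1, 0],
    [1, 0, 0, 0, 1, 1],
    [1, 1, 0, 0, 0, 1],
]
print(black_blocks(column_projection(image)))  # [(0, 2), (4, 6)]
```

Two patterns are separated only when at least one column between them is entirely white, which is why a slanted gap between overlapping characters yields a single block.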

Problems to Be Solved by the Invention
However, because the conventional character segmenting circuit described above segments characters from projection information alone, it cannot separate characters or noise that overlap along the projection direction.

For example, in the input pattern of FIG. 3(a), the characters "¥" and "4" overlap in the X direction, so the corresponding projection pattern cannot separate them. Likewise, the noise at the top and the character "¥" overlap in the Y direction, so the corresponding projection pattern cannot separate the character "¥" from the noise.

In such cases, some conventional character segmenting circuits forcibly split the character pattern at the position indicated by the character frame position information given in advance. As FIG. 3(b) shows, however, the cut-out pattern then contains characters deformed by the cut, together with leftover fragments of adjacent characters and noise, which degrades the character recognition performed after segmentation.

The present invention has been made in view of the above circumstances. Its object is therefore to provide a novel character segmenting circuit that eliminates the above drawbacks inherent in the conventional technology.

Means for Solving the Problems
To achieve the above object, the character segmenting circuit according to the present invention comprises: a memory capable of storing binary image information for one character or more; means for partially scanning the input image stored in the memory and detecting the positions at which the scanned binary image changes from black to white and from white to black; means for storing the position information of the white blocks on each scanning line obtained by the detecting means; means for reading out the position information, determining whether each pair of white blocks is in contact, and extracting contact information between the white blocks; means for searching, on the basis of the contact information, for white blocks that are successively in contact, starting from a white block on the first scanning line, and extracting the paths that can reach a white block on the final scanning line; means for extracting, from the paths obtained by the extracting means, separation position information for each scanning line for separating characters; and means for cutting out a character pattern according to the separation position information for each scanning line.

Embodiment
A preferred embodiment of the present invention will now be described in detail with reference to the drawings.

FIG. 1 is a schematic block diagram showing an embodiment of the present invention.

Referring to FIG. 1, the embodiment comprises memories 11, 12, 13, 14, and 15, an address generation section 21, a change point detection section 31, a structure extraction section 41, a structure analysis section 51, a pattern cutting section 61, and buses 71, 72, 73, 74, and 75.

The address generation section 21 is connected to the memory 11 via the bus 71 and generates addresses for partially scanning the input image stored in the memory 11.

The change point detection section 31 is connected to the memories 11 and 12 via the buses 71 and 72. It detects the points at which the data read out from the memory 11 changes from black to white and from white to black, and stores in the memory 12 the position information of each white block obtained from the addresses of the change points.
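The change point detection can be sketched in software as a single pass over one scanning line. This is an illustrative sketch only: the function name `white_blocks`, the encoding (black = 1, white = 0), and the half-open `(start, end)` representation of a white block are assumptions, and the pixel before the start of the line is treated as black.

```python
def white_blocks(scan_line):
    """Return (start, end) ranges, end exclusive, of consecutive white pixels (0)."""
    blocks, start = [], None
    previous = 1  # assumption: the pixel before the line counts as black
    for x, pixel in enumerate(scan_line):
        if previous == 1 and pixel == 0:
            start = x                      # black-to-white change point
        elif previous == 0 and pixel == 1:
            blocks.append((start, x))      # white-to-black change point
            start = None
        previous = pixel
    if start is not None:                  # white run reaches the end of the line
        blocks.append((start, len(scan_line)))
    return blocks


print(white_blocks([1, 0, 0, 1, 0]))  # [(1, 3), (4, 5)]
```

Applying this to every scanned line gives the per-line position information that the later stages read back.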

The structure extraction section 41 is connected to the memories 12 and 13 via the buses 72 and 73. It reads out the white block position information stored in the memory 12, determines whether each pair of white blocks is in contact, and stores the resulting contact information in the memory 13. A white block is judged to be in contact with another white block when any of its white points has a white point of the other block immediately above or immediately below it, as for the marked block pairs in FIG. 4(a). It is judged not to be in contact when none of its white points has a white point of the other block above or below it, as in FIG. 4(b). Reading out the contact information stored in the memory 13 therefore shows whether any two blocks are in contact. For example, it can be used to draw a connection diagram such as those in FIGS. 4(a) and 4(b), in which each white block is a node and two nodes are joined when their blocks are in contact. FIG. 4(a) is an example in which two pairs of white blocks are in contact, and FIG. 4(b) is an example in which none of the white blocks shown are in contact.
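The contact determination reduces to an overlap test between white blocks on adjacent scanning lines. The sketch below assumes each block is a half-open `(start, end)` pixel range and that `lines[y]` lists the white blocks of scanning line `y`; the names `in_contact` and `contact_table` are hypothetical.

```python
def in_contact(block_a, block_b):
    """True if some white point of one block lies directly above or below
    a white point of the other, i.e. the two ranges share a pixel column."""
    return block_a[0] < block_b[1] and block_b[0] < block_a[1]


def contact_table(lines):
    """Map (line, block index) to the indices of touching blocks on the next line."""
    table = {}
    for y in range(len(lines) - 1):
        for i, upper in enumerate(lines[y]):
            table[(y, i)] = [
                j for j, lower in enumerate(lines[y + 1]) if in_contact(upper, lower)
            ]
    return table


lines = [[(0, 3)], [(2, 5), (6, 8)]]
print(contact_table(lines))  # {(0, 0): [0]}
```

Blocks that meet only diagonally, such as `(0, 2)` and `(2, 4)`, are not in contact under this test, which matches the "immediately above or below" rule in the text.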

The structure analysis section 51 is connected to the memories 13 and 14 via the buses 73 and 74. While reading out the contact information stored in the memory 13, it starts from the white blocks on the first scanning line, searches for white blocks that are successively in contact, and examines every path that can reach a white block on the final scanning line. If two or more paths lead from the first scanning line to the final scanning line, it selects as the optimal path the one for which the maximum, taken over its white blocks, of the distance between the scanning center and the block's white point nearest to the scanning center is smallest, that is, the path whose white blocks swing least from the scanning center. If only one path exists, that path is the optimal path. For every white block on the optimal path, the position of its white point nearest to the scanning center is stored in the memory 14 as separation position information for separating the characters.
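The path search and the selection of the optimal path can be sketched as follows. This is a software illustration under stated assumptions, not the circuit itself: blocks are half-open `(start, end)` ranges, `lines[y]` lists the white blocks of scanning line `y`, `center` is the pixel position of the scanning center, and all function names are hypothetical. Paths are enumerated exhaustively for clarity, which can become expensive when many blocks touch.

```python
def in_contact(a, b):
    """True if two white blocks on adjacent lines share a pixel column."""
    return a[0] < b[1] and b[0] < a[1]


def nearest_white_point(block, center):
    """Position of the white point in the block that is closest to the scanning center."""
    start, end = block
    return min(max(center, start), end - 1)


def all_paths(lines):
    """Yield every chain of touching white blocks from the first to the last line."""
    def extend(path):
        if len(path) == len(lines):
            yield path
            return
        for block in lines[len(path)]:
            if in_contact(path[-1], block):
                yield from extend(path + [block])

    for first in lines[0]:
        yield from extend([first])


def separation_positions(lines, center):
    """Pick the path whose largest distance from the center is smallest and
    return one separation position per scanning line (None if no path exists)."""
    if not lines:
        return None

    def swing(path):
        return max(abs(nearest_white_point(b, center) - center) for b in path)

    paths = list(all_paths(lines))
    if not paths:
        return None
    best = min(paths, key=swing)
    return [nearest_white_point(b, center) for b in best]


lines = [[(0, 2), (4, 8)], [(1, 3), (5, 8)], [(2, 4), (6, 8)]]
print(separation_positions(lines, center=4))  # [4, 5, 6]
```

In this example two paths exist; the left one swings up to 3 pixels from the center and the right one at most 2, so the right one is chosen.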

FIG. 5 shows an example of a scanned pattern and the separation positions obtained for it.

The pattern cutting section 61 is connected to the memories 11, 14, and 15 via the buses 71, 74, and 75. After the separation position extraction described above has been performed around the top, bottom, left, and right of the character frame indicated by the character frame position information given in advance, it reads the resulting separation position information from the memory 14, cuts out the image inside the separation positions from the image stored in the memory 11 as the character image, and stores the cut-out character image in the memory 15.
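The cut-out step can be sketched for the left and right boundaries only. The sketch assumes one separation position per image row on each side (`left[y]` and `right[y]`), keeps the pixels strictly between them, and sets everything else to white (0); the name `cut_character` and the exclusive boundaries are assumptions, and the top and bottom boundaries would be handled the same way on columns.

```python
def cut_character(image, left, right):
    """Keep pixels with left[y] < x < right[y] on each row; make the rest white (0)."""
    return [
        [pixel if left[y] < x < right[y] else 0 for x, pixel in enumerate(row)]
        for y, row in enumerate(image)
    ]


image = [
    [1, 0, 1, 1, 0, 1],
    [1, 1, 0, 1, 0, 1],
]
print(cut_character(image, left=[1, 2], right=[4, 4]))
# [[0, 0, 1, 1, 0, 0], [0, 0, 0, 1, 0, 0]]
```

Because the boundary may differ from row to row, the cut can follow a slanted gap that a single vertical cut line would miss.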

FIGS. 6(a), 6(b), 6(c), and 6(d) show, respectively, the input pattern, the scanning areas, the separation positions obtained, and the cut-out pattern when the pattern of FIG. 3(a) is input to this circuit.

Effects of the Invention
As explained above, the present invention comprises: a memory capable of storing binary image information for one character or more; means for partially scanning the input image stored in the memory and detecting the positions at which the scanned binary image changes from black to white and from white to black; means for storing the position information of the white blocks obtained by the detecting means; means for reading out the position information, determining whether each pair of white blocks is in contact, and extracting contact information between the white blocks; means for searching, on the basis of the contact information, for white blocks that are successively in contact, starting from a white block on the first scanning line, and extracting the paths that can reach a white block on the final scanning line; means for extracting separation position information for each scanning line for separating characters from the result of the extracting means; and means for cutting out a character pattern according to the separation position information for each scanning line. As a result, overlapping characters that cannot be separated by the projection method can be separated and cut out. For example, characters that the projection method cannot separate, such as those in FIG. 3(a), are completely separated and cut out by the circuit of the present invention, as shown in FIG. 6, which improves the performance of the character recognition performed after segmentation.

In addition, when the circuit of the present invention is used in an optical character reader, overlap between characters is tolerated, so the spacing between the characters written on a form can be reduced and more characters can be written on a form of the same size.

Brief Description of the Drawings

FIG. 1 is a schematic block diagram showing an embodiment of the present invention. FIGS. 2 and 3 are explanatory diagrams of the operation of a conventional character segmenting circuit. FIG. 4 is an explanatory diagram of the contact determination between white blocks in the circuit of the embodiment. FIG. 5 is an explanatory diagram of the separation position determination in the circuit of the embodiment. FIG. 6 is an explanatory diagram of the operation of the circuit of the embodiment.

11, 12, 13, 14, 15: memories; 21: address generation section; 31: change point detection section; 41: structure extraction section; 51: structure analysis section; 61: pattern cutting section; 71, 72, 73, 74, 75: buses.

Patent applicant: NEC Corporation
Agent: Patent attorney Yutaro Kumagai

Claims (1)

[Claims]
A character segmenting circuit for a character reader, characterized by comprising: a memory capable of storing binary image information for one character or more; means for partially scanning the input image stored in the memory and detecting the positions at which the scanned binary image changes from black to white and from white to black; means for storing the position information of the runs of consecutive white points on each scanning line (hereinafter referred to as white blocks) obtained by the detecting means; means for reading out the position information, determining whether each pair of white blocks is in contact, and extracting contact information between the white blocks; means for searching, on the basis of the contact information, for white blocks that are successively in contact, starting from a white block on the first scanning line, and extracting the paths that can reach a white block on the final scanning line; means for extracting, from the paths obtained by the extracting means, separation position information for each scanning line for separating characters; and means for cutting out a character pattern according to the separation position information for each scanning line.
JP1549689A 1989-01-24 1989-01-24 Character segmenting circuit Pending JPH02195430A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP1549689A JPH02195430A (en) 1989-01-24 1989-01-24 Character segmenting circuit

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP1549689A JPH02195430A (en) 1989-01-24 1989-01-24 Character segmenting circuit

Publications (1)

Publication Number Publication Date
JPH02195430A true JPH02195430A (en) 1990-08-02

Family

ID=11890421

Family Applications (1)

Application Number Title Priority Date Filing Date
JP1549689A Pending JPH02195430A (en) 1989-01-24 1989-01-24 Character segmenting circuit

Country Status (1)

Country Link
JP (1) JPH02195430A (en)

Similar Documents

Publication Publication Date Title
JPS62254282A (en) Method and apparatus for separating overlapped pattern
JPH02195430A (en) Character segmenting circuit
JP3431267B2 (en) Table processing method and apparatus
JPS58106665A (en) Character line cutout method
JP2722549B2 (en) Optical character reader
JPH0373916B2 (en)
JPH0225553B2 (en)
JP4580520B2 (en) Character recognition method and character recognition apparatus
JPH11338969A (en) Optical character reader
JPH09114925A (en) Optical character reader
JPS6361387A (en) Character segmenting system
JPH04276888A (en) Character reader
JP4320868B2 (en) Character reader
JPH0271379A (en) Picture processor
JPH04367087A (en) Optical character reader
JPS61196382A (en) Character segmenting system
JPH02189696A (en) Optical character reader
JPH04330587A (en) Recognizing method for photographic area and character string
JP2002215616A (en) Image processor with agate giving function, method and its program
JPH08241376A (en) Device and method for character recognition
JP2001307022A (en) Character recognizing device and character recognizing method
JPS6149554A (en) Image segmenting circuit
JPH06333088A (en) Optical character reader
JPS59177686A (en) Character recognizing device
JPH0310144B2 (en)