JPS61196382A

JPS61196382A - Character segmenting system

Info

Publication number: JPS61196382A
Application number: JP60036574A
Authority: JP
Inventors: Shigeru Goto; 茂後藤; Shinji Narita; 成田　真二; Yoshiyuki Yamashita; 山下　義征
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1985-02-27
Filing date: 1985-02-27
Publication date: 1986-08-30
Also published as: JPH0433074B2

Abstract

PURPOSE: To segment characters accurately with a simple configuration by column-scanning the circumscribing frame of a character pattern from its upper and lower sides toward the respective opposite sides, classifying the scanned points, and then row-scanning the frame on the basis of that classification. CONSTITUTION: The quantized character pattern string is scanned in the column direction and the width of the column-direction black-point histogram is detected; this width is compared with the average character width to determine the area in which character segmentation is performed. The area is then row-scanned to create a row-direction black-point histogram, and the character circumscribing frame is determined from the column-direction and row-direction histograms. The frame is row-scanned to detect the change points at which the classification changes among the four classes; the change points are stored in sequence and compared with prescribed combinations of change points to find matches. The character segmentation position is determined from the coordinates of the matching change points.

Description

[Detailed Description of the Invention] (Field of Industrial Application) The present invention relates to a character segmentation system, and more specifically to a character segmentation system that reads characters written on a form and separates and extracts the resulting character pattern string one character area at a time.

(Prior Art) An optical character recognition device (hereinafter abbreviated as OCR) scans the characters written on a form line by line, converts the optical signal into an image signal with a photoelectric converter, and stores it in a line buffer. The line buffer is read out sequentially, the character pattern string is separated into single-character areas, and recognition is performed on the separated character patterns. The segmentation method used to extract a single-character area from the character pattern string therefore greatly affects OCR performance.

Next, a conventional character segmentation method for separating a single-character area from the character-string pattern data stored in the OCR line buffer is described.

In the OCR, the line buffer holding one character string is scanned one column at a time from its top end to its bottom end, and the column is advanced sequentially in the direction perpendicular to this scan, so that the character pattern in the line buffer is read out.

While each column is scanned, the black points are counted (character portions are black points, background portions are white points) to create a histogram, and the area of one character is determined by referring to this black-point histogram.

FIG. 10 shows a pattern string processed with the conventional black-point histogram. In the figure, 100 and 101 are character patterns stored in the line buffer of the OCR, and 102 is the column-direction black-point histogram of character patterns 100 and 101. Reading starts from a specified position at the left end of the line buffer. While one column is read, the black-point histogram of that column is created and compared with a threshold α (α: a constant). The column at which the histogram exceeds α is taken as the start point, the column at which it next falls below α is taken as the end point, and the span from the start point to the end point is segmented as the area of one character.
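The conventional procedure above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the image is a list of rows of 0/1 values, the function names `column_histogram` and `conventional_segments` are hypothetical, and a block still open at the end of the line is simply closed there, which the text does not specify.

```python
def column_histogram(image):
    """Count the black points (1s) in each column of a binary image given as a list of rows."""
    return [sum(column) for column in zip(*image)]


def conventional_segments(hist, alpha):
    """Return (start, end) column pairs for the threshold rule described in the text.

    A segment starts at the first column whose count exceeds alpha and ends at
    the next column whose count falls below alpha (end is exclusive).
    """
    segments = []
    start = None
    for x, count in enumerate(hist):
        if start is None and count > alpha:
            start = x
        elif start is not None and count < alpha:
            segments.append((start, x))
            start = None
    if start is not None:
        # Assumption: a segment still open at the end of the line is closed there.
        segments.append((start, len(hist)))
    return segments


# Two well-separated characters: the histogram drops to zero between them.
separate = [
    [1, 1, 0, 0, 1, 1],
    [1, 1, 0, 0, 1, 1],
    [1, 1, 0, 0, 1, 1],
]
# Two characters that overlap in the column direction: no gap in the histogram.
overlapping = [
    [1, 1, 1, 0, 0, 0],
    [1, 1, 1, 1, 1, 1],
    [0, 0, 1, 1, 1, 1],
]
separate_result = conventional_segments(column_histogram(separate), alpha=1)
overlapping_result = conventional_segments(column_histogram(overlapping), alpha=1)
```

With these toy inputs the separated pair yields two segments, while the overlapping pair yields a single segment spanning both characters, which is the failure mode the next section discusses.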

(Problems to Be Solved by the Invention) With the conventional method described above, however, adjacent handwritten characters may overlap because the writer slanted the characters, wrote outside the character entry frame, or finished a stroke with a flick, and a character pattern of two or more characters is then segmented as a single character. As FIG. 10 shows, character patterns 100 and 101 overlap in the column direction, so their black-point histogram 102 forms a single area. Furthermore, even if the length from the start point to the end point of the black-point histogram is measured, the block is judged to contain two or more characters, and the position corresponding to the average character width is used as the segmentation point, part of a neighboring character is mixed in or part of the character itself is lost.

The present invention is intended to solve these problems, and its object is to provide a highly accurate character segmentation system with a simple configuration.

(Means for Solving the Problems) To solve the above problems, the present invention is configured with the following means in a character segmentation system that separates and extracts, one character at a time, a quantized character pattern string obtained by photoelectrically converting a character string written on a form.

The invention comprises: means for storing the quantized character pattern string in a line buffer memory, scanning the line buffer memory one column at a time in the direction corresponding to the column direction of the character string to create a column-direction black-point histogram, repeating this scan for each successive column, and detecting the width of the column-direction black-point histogram; means for comparing the prescribed average character width of the target characters with the width of the column-direction black-point histogram to detect how many characters that width corresponds to; means for setting, on the basis of the number of characters in the black-point histogram, the area to be subjected to character segmentation, scanning the character pattern string in that area row by row to create a row-direction black-point histogram, and detecting the character circumscribing frame of the character pattern string from the column-direction and row-direction black-point histograms; storage means for holding the character pattern string within the character circumscribing frame; detection means for reading the contents of the character pattern string from the storage means while scanning from each of the upper and lower sides of the character circumscribing frame toward the opposite side, and detecting whether the contents are a character portion or a background portion; classification means for classifying the character pattern string within the character circumscribing frame into four types, namely the background portion detected by the scan from the upper side, the background portion detected by the scan from the lower side, the character portion, and the remaining background portion; change-point detection means for performing a row scan again, detecting the change points at which the classification changes, storing them in sequence, and comparing them successively with prescribed combinations of change points to detect matching change points; and means for determining the character segmentation position on the basis of the detected change points.

(Operation) The character segmentation system configured as described above operates as follows.

The quantized character pattern string is scanned column by column in the column direction to detect the width of the column-direction black-point histogram, and this width is compared with the average character width to determine the area on which the subsequent character segmentation is performed. That area is then row-scanned to create a row-direction black-point histogram, and the character circumscribing frame is determined from the column-direction and row-direction black-point histograms. Next, the inside of the character circumscribing frame is scanned from each of its upper and lower sides toward the opposite side, and its points are classified into the four types described above. The inside of the frame is then row-scanned again, the change points at which the classification changes are detected and stored in sequence, and they are compared successively with the prescribed combinations of change points to detect matching change points. Finally, the character segmentation position is determined from the coordinates of the matching change points.

(Embodiment) An embodiment of the present invention is described below with reference to the drawings.

FIG. 1 is a block diagram showing an embodiment of the present invention. In the figure, 200 is the image signal from a photoelectric conversion unit (not shown), 201 is a line buffer, and 202 comprises a black-point histogram creation circuit 220, a circumscribing frame detection circuit 221, and a character determination circuit 222. 203 is a data switching circuit, 204 is a pattern memory, and 205 and 206 are the x counter for the X direction and the y counter for the Y direction, which generate the addresses for the pattern memory. 207 is a control circuit. 208 is a pattern area classification circuit, and 209 is a circuit that detects change points from white points to black points. 210 is a pattern area change-point detection circuit, 211 is a segmentation area detection circuit, and 212 to 214 are registers for determining the segmentation area.

The operation of this embodiment is described below with reference to the block diagram of FIG. 1.

The character string on the form is converted by the photoelectric converter into a binarized image signal 200 and stored in the line buffer 201. The following processing is performed under the control of the control circuit 207. The control circuit 207 reads the image signal stored in the line buffer 201 one column at a time from the head position of the line buffer 201, advances the column sequentially, and finishes when all the character pattern data for one line has been read. At the same time as it reads one column of pattern data from the line buffer 201, the control circuit 207 starts the black-point histogram creation circuit 220. The black-point histogram creation circuit 220 counts the black points while one column is being read, thereby creating the black-point histogram of that column, and stores it in the histogram memory 230 included in the circuit 220. This processing is repeated, and it ends when the black-point histograms of all columns of one line have been stored in the histogram memory 230.

After the black-point histogram for one line has been created, the histogram memory 230 in the black-point histogram creation circuit 220 is read from its head, and blocks are detected by referring to the histogram. The control circuit 207 reads the black-point histogram values sequentially from the histogram memory 230 and compares each with a threshold α (α: a constant; α = 1 in this embodiment). If the histogram value is greater than α, the column becomes a start-point candidate for a character block. The storage address is then advanced and the columns whose histogram value exceeds α are counted; when β such columns occur consecutively (β: a constant; β = 2 in this embodiment), the start-point candidate becomes the start point. The column continues to advance, and the first column after the start point at which the histogram value falls below α is taken as the end point. The area spanning from the start point to the end point is a block.

Next, the control circuit 207 starts the character determination circuit 222, which compares the length of the detected block with thresholds γ1 and γ2 derived from the average width of the characters being read (γ1 and γ2 are constants; γ1 = 75 and γ2 = 125 in this embodiment). When the block length W is less than γ1, the block is judged to be one character; when γ1 ≤ W ≤ γ2, two characters; and when W > γ2, three or more characters. After this judgment, the control circuit 207 starts the circumscribing frame detection circuit 221 for the block to detect its circumscribing frame. Once the frame is detected, the character pattern inside it is transferred to the pattern memory 204.

When W > γ2, that is, when the block is judged to contain three or more characters, segmentation is first performed on the span from the start point to γ2 to split the first and second characters; the resulting segmentation point then becomes the new start point for splitting the second and third characters, and segmentation proceeds in this way up to W. In this case, the part of the circumscribing frame from the start point to γ2 is transferred to the pattern memory 204 first, and the remainder is transferred once the segmentation point between the start point and γ2 has been determined by the processing below. As shown in FIG. 2, described later, the left end of the upper side of the character circumscribing frame is the origin, the position of the lower side is PB, and the position of the right side is PR.
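The block detection and character-count judgment can be sketched as follows. The threshold values are those given for the embodiment; everything else is an assumption made for illustration: the function names `detect_blocks` and `count_characters` are hypothetical, a start-point candidate is discarded if the run of columns above α breaks before β columns are reached, and a block still open at the end of the line is closed there. The text does not state either of those two behaviors.

```python
ALPHA, BETA = 1, 2          # thresholds alpha and beta given for the embodiment
GAMMA1, GAMMA2 = 75, 125    # width thresholds gamma1 and gamma2 given for the embodiment


def detect_blocks(hist, alpha=ALPHA, beta=BETA):
    """Return (start, end) column pairs of character blocks (end exclusive)."""
    blocks = []
    candidate = None   # start-point candidate
    run = 0            # consecutive columns above alpha since the candidate
    start = None       # confirmed start point
    for x, count in enumerate(hist):
        if start is None:
            if count > alpha:
                if run == 0:
                    candidate = x
                run += 1
                if run >= beta:
                    start = candidate
            else:
                # Assumption: a broken run discards the candidate.
                candidate, run = None, 0
        elif count < alpha:
            blocks.append((start, x))
            start, candidate, run = None, None, 0
    if start is not None:
        # Assumption: a block open at the end of the line is closed there.
        blocks.append((start, len(hist)))
    return blocks


def count_characters(width, gamma1=GAMMA1, gamma2=GAMMA2):
    """Return 1, 2, or 3 (meaning three or more) for a block of the given width."""
    if width < gamma1:
        return 1
    if width <= gamma2:
        return 2
    return 3


# A single isolated column above alpha (index 1) is rejected as noise;
# the run starting at index 3 becomes a block.
example_blocks = detect_blocks([0, 5, 0, 3, 4, 2, 0, 0])
```

Requiring β consecutive columns keeps an isolated noisy column from opening a block, which appears to be the purpose of the start-point candidate in the text.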

Next, the processing of a block judged by the above character determination to contain two or more characters is described with reference to FIG. 1.

The control circuit 207 sets the x counter 205 and the y counter 206, which supply the address of the pattern memory 204, to the left end of the upper side of the character circumscribing frame, and increments the y counter 206 to scan toward the lower side of the frame. The address of the pattern memory 204 is written (x, y) with respect to the X and Y axes, using the values of the x counter and the y counter respectively, and the content of the pattern memory 204 at that address is written PM(x, y). In this embodiment, a white point is PM(x, y) = 0, a black point is PM(x, y) = 1, a C point detected during the scan from the upper side is PM(x, y) = 2, and a D point detected during the scan from the lower side is PM(x, y) = 4. The pattern memory 204 of this embodiment therefore has a data width of 3 bits per mesh. The pattern area classification circuit 208 sets the address to the left end of the upper side of the character circumscribing frame and reads the character pattern from the pattern memory 204.

When PM(x, y) = 0, the value (PM(x, y) OR 2) becomes the new PM(x, y) and is written to the corresponding address of the pattern memory 204 via the switching circuit 203.

When the white-to-black change-point detection circuit 209 detects a black point, PM(x, y) = 1, the control circuit 207 terminates the scan of that column, increments the x counter 205 by one, and scans the next column from the upper side of the character circumscribing frame. The scan of a column is likewise terminated, and the next column scanned, when the scan from the upper side reaches the lower side. These scans are repeated in sequence and end once the rightmost column of the character circumscribing frame has been processed. When the scan from the upper side is finished, the control circuit 207 sets the x counter and the y counter to the left end of the lower side of the character circumscribing frame and scans from the lower side toward the upper side, processing each column in the same way as in the scan from the upper side, except that when PM(x, y) = 0 the value (PM(x, y) OR 4) is stored in the pattern memory 204 as PM(x, y). As with the scan from the upper side, this scan ends once the rightmost column has been processed. When both scans are finished and the pattern inside the character circumscribing frame has been classified, the control circuit 207 sets the x counter 205 and the y counter 206 to the left end of the upper side of the character circumscribing frame and performs horizontal scans to detect the segmentation area.
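The two column scans can be sketched as below. This is an illustrative reading, not the circuit itself: the pattern memory is modeled as a list of rows indexed `pm[y][x]`, the function name `classify_regions` is hypothetical, and the code follows the literal condition in the text (the flag is ORed in only when PM(x, y) = 0), so a column with no black point stays marked 2 after the second scan.

```python
WHITE, BLACK, C_POINT, D_POINT = 0, 1, 2, 4   # PM values used in the embodiment


def classify_regions(pattern):
    """Return a copy of a 0/1 pattern with C points (2) and D points (4) marked."""
    pm = [row[:] for row in pattern]
    height, width = len(pm), len(pm[0])

    # Scan every column from the upper side toward the lower side.
    for x in range(width):
        for y in range(height):
            if pm[y][x] == BLACK:
                break                      # stop this column at the first black point
            if pm[y][x] == WHITE:
                pm[y][x] |= C_POINT

    # Scan every column from the lower side toward the upper side.
    for x in range(width):
        for y in range(height - 1, -1, -1):
            if pm[y][x] == BLACK:
                break
            if pm[y][x] == WHITE:          # literal condition PM(x, y) = 0 from the text
                pm[y][x] |= D_POINT
    return pm


pattern = [
    [0, 0, 1],
    [1, 0, 0],
    [0, 0, 0],
    [1, 0, 0],
    [0, 0, 1],
]
classified = classify_regions(pattern)
```

In the left column of this toy pattern the white point enclosed between two black points is reached by neither scan and keeps the value 0; in the right column the three white points between the black points do the same.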

An example makes the column scan concrete. FIG. 2 shows a specific example of the top and bottom scans of this embodiment. In the figure, 100 and 101 are character patterns, 103 is the scan direction from the upper side to the lower side, and 104 is the scan direction from the lower side to the upper side. FIG. 3 shows the result of the column scans of FIG. 2. In FIG. 3, each white point detected during the scan from the upper side to the lower side (character portions are black points, background portions are white points) is a C point, and the set of C points is the C region. If a black point is detected during this scan, the scan of that column stops there and the next column is processed. The set of black points is the A region. The same processing is applied during the scan from the lower side to the upper side: each white point detected is a D point, and the set of D points is the D region. The white points that are neither C points nor D points, that is, those not reached by either scan, are B points, and their set is the B region.

Next, the detection of the single-character segmentation area is described with reference to FIG. 1.

First, when started by the control circuit 207, the pattern area change-point detection circuit 210 reads the character pattern data from the pattern memory 204 and row-scans the inside of the circumscribing frame. While it processes the data of the current point, the circuit 210 holds the data of the immediately preceding point and compares the two. If the comparison shows a change, the coordinate of the preceding point is detected and held. That is, PM(x-1, y) and PM(x, y) are compared, and if they are not equal, the X coordinate x-1 is stored in xREG I 214. When the circuit 210 detects a change point, the segmentation area detection circuit 211 holds PM(x-1, y) in a register. The circuit 211 has three state registers (not shown) for holding PM(x-1, y), arranged so that each time a change point is detected the contents of each register shift to the adjacent register. When a change point has been detected and the shift is complete, the circuit checks whether the contents of the three state registers match any of the following states stored in the control circuit 207. Denoting the state registers ST1, ST2, and ST3, the states are: ST1 = 4 and ST2 = 0 and ST3 = 2; ST1 = 2 and ST2 = 0 and ST3 = 4; ST2 = 2 and ST3 = 4; or ST2 = 4 and ST3 = 2. Here, one of the state registers (printed illegibly as "Sr1" in the source, so either ST2 or ST3) holds the content of the current coordinate position.

If the state registers match one of these combinations, a decision signal from the segmentation area detection circuit 211 is supplied to yREG 212 and xREG II 213, and the contents of the x counter 205 or the y counter 206 held in those registers at that time are output as y1 and x2. x1 is the content of the x counter 205 that was stored in xREG I 214 when state register ST3 of the pattern area change-point detection circuit 210 was a C point or a D point. Also, when the state registers match one of the combinations, the horizontal scan of that row is terminated, the y counter is incremented, and the horizontal scan of the next row is performed. When these horizontal scans have been completed for the whole circumscribing frame, the segmentation position is determined from x1 (xREG I), x2 (xREG II), and y1 (yREG).
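The state-register matching can be sketched as follows. This is a simplified illustration with several assumptions that the text leaves open or illegible: ST1 is taken to be the oldest entry and ST3 the most recent, the value shifted in at each change point is PM(x-1, y), and the function simply reports the coordinate (x-1, y) of the first change point at which a listed combination matches. It does not reproduce how xREG I, xREG II, and yREG latch x1, x2, and y1, and the name `find_change_point_match` is hypothetical.

```python
# State combinations listed in the text (4 = D point, 0 = B point, 2 = C point).
THREE_REGISTER_STATES = {(4, 0, 2), (2, 0, 4)}
TWO_REGISTER_STATES = {(2, 4), (4, 2)}      # only ST2 and ST3 are compared


def find_change_point_match(pm):
    """Row-scan classified data pm[y][x]; return (x - 1, y) of the first match, or None."""
    for y, row in enumerate(pm):
        st1 = st2 = st3 = None              # registers are cleared for each row (assumption)
        for x in range(1, len(row)):
            previous, current = row[x - 1], row[x]
            if previous == current:
                continue                    # no change point here
            # Shift the registers and hold PM(x-1, y) in the newest one.
            st1, st2, st3 = st2, st3, previous
            if (st1, st2, st3) in THREE_REGISTER_STATES or (st2, st3) in TWO_REGISTER_STATES:
                return (x - 1, y)
    return None


# Row 1 passes through a D run, a B run, and a C run before reaching black again.
rows = [
    [1, 1, 1, 1, 1, 1, 1, 1],
    [1, 4, 4, 0, 0, 2, 2, 1],
]
match = find_change_point_match(rows)
```

Under these assumptions a row that runs through D, B, and C regions in that order, or directly from C to D or D to C, is treated as a gap between two overlapping characters.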

The pattern transfer method is described below using the example pattern of FIG. 4, for which the segmentation position has been determined. FIG. 4 shows the patterns stored in the pattern memory 204 of the block diagram of FIG. 1 together with the segmentation position. The horizontal axis is the X axis and the vertical axis is the Y axis, and the pattern memory 204 is taken to lie in the fourth quadrant. XM and YM indicate the size of the pattern memory 204; in this embodiment XM = YM = 128 meshes. PR and PB indicate the circumscribing frame of the pattern stored in the pattern memory 204, which is bounded by the four straight lines X = 0, X = PR, Y = 0, and Y = PB. In FIG. 4, 300 and 301 are patterns, and the straight lines Y = y1, X = x1, and X = x2 indicate the segmentation position.

In the pattern memory of this embodiment, the data representing one mesh is structured as shown in FIG. 5. In FIG. 5, bit (1) is 1 if the point was a white point during the column scan from the lower side to the upper side, and 0 otherwise. Bit (2) is 1 if the point was a white point during the column scan from the upper side to the lower side, and 0 otherwise. Bit (3) is 1 for a black point and 0 for a white point. The pattern data to be transferred is therefore only the data of bit (3).

The meshes on the straight line X = 0 are transferred from the point Y = 0 to the point Y = PB by incrementing the Y coordinate one at a time. After one column has been transferred, the X coordinate is incremented. This transfer is repeated column by column. Once the column X = x1 has been transferred, for the columns from the next column through X = x2 the pattern data at Y coordinates from y1 to PB is masked and the fixed value 0 is transferred instead. The transfer of pattern 300 ends when the column X = x2 has been transferred. Pattern 301 can be transferred in the same way, as can the pattern inside a circumscribing frame that contains only one character.
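The transfer of the first pattern can be sketched as below. The sketch assumes the pattern memory is a list of rows `pm[y][x]` whose last row is Y = PB, that the black-point bit is the least significant bit (value 1, as in the PM encoding given earlier), and that the masked range includes Y = y1 itself; the function name `transfer_first_pattern` is hypothetical.

```python
def transfer_first_pattern(pm, x1, x2, y1):
    """Copy columns 0..x2 of the black-point bit, masking rows y1..PB for columns after x1."""
    height = len(pm)                       # rows 0..PB
    out = [[0] * (x2 + 1) for _ in range(height)]
    for x in range(x2 + 1):                # one column at a time, as in the text
        for y in range(height):
            masked = x > x1 and y >= y1    # region belonging to the neighboring character
            out[y][x] = 0 if masked else pm[y][x] & 1
    return out


# Classified memory contents: 1 = black, 2 = C point, 4 = D point, 0 = B point.
memory = [
    [1, 2, 1, 1],
    [1, 0, 0, 2],
    [4, 4, 1, 1],
]
first_pattern = transfer_first_pattern(memory, x1=1, x2=2, y1=2)
```

In this toy input the black point at column 2 of the bottom row lies in the masked region, so it is replaced by 0 in the transferred pattern, while the classification flags 2 and 4 are dropped everywhere because only the black-point bit is copied.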

Next, the processing flow of this embodiment is described in detail with reference to the flowcharts of FIGS. 6, 7, and 8. FIG. 6 shows the overall flow; FIGS. 7 and 8 show, respectively, the classification of the pattern regions by the two scans from top and bottom, and the determination of the segmentation area. The overall flow of FIG. 6 is described first. At S400 the reading operation starts. At S401 one column of the pattern data stored in the line buffer is read, and the black-point histogram creation circuit 220 of FIG. 1 creates the black-point histogram and stores it in the histogram memory 230. S402 checks whether the black-point histogram has been created for the entire line, and the processing of S401 is repeated until it has. S403 keeps track of the processed characters, and the following processing is repeated until all characters in the line have been segmented.

At S404 the black-point histogram is read from the histogram memory, and its start point and end point are detected to define a block. The length of the block is compared with the thresholds γ1 and γ2, and the number of characters the block contains is retained. At S405 the circumscribing frame detection circuit 221 of FIG. 1 detects the circumscribing frame of the block, and the pattern data inside the frame is transferred to the pattern memory 204. At S406, depending on the retained result of the block-length judgment, if the block is one character the pattern data in the pattern memory 204 is transferred to the output stage and processing moves on to the next character; if it is two or more characters, the following processing is performed.
, γ2 and retains how many characters the block consists of. In 5405, the circumscribing frame of the block is detected by the circumscribing frame detection circuit 221 in FIG. 1, and the pattern data within the circumscribing frame is stored. is transferred to the pattern memory 204. At 8406, based on the judgment result of the length of the held block, if it is one character, the pattern data in the pattern memory 204 is transferred to the output stage and processing proceeds to the next character; if it is two or more characters, the following is executed. Perform processing.
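
The block detection and length judgment of S404 can be sketched as below. The text states only that the block length is compared with γ1 and γ2; the specific rule used here (at most γ1 means one character, at most γ2 means two, otherwise three or more) is an assumption for illustration, as are the function names.

```python
def find_blocks(histogram):
    """Return (start, end) column pairs of consecutive non-zero runs, end inclusive."""
    blocks = []
    start = None
    for x, count in enumerate(histogram):
        if count > 0 and start is None:
            start = x                      # start point of a block
        elif count == 0 and start is not None:
            blocks.append((start, x - 1))  # end point of a block
            start = None
    if start is not None:                  # block running to the last column
        blocks.append((start, len(histogram) - 1))
    return blocks


def estimate_character_count(block, gamma1, gamma2):
    """Assumed rule: compare the block length with the two thresholds."""
    start, end = block
    length = end - start + 1
    if length <= gamma1:
        return 1
    if length <= gamma2:
        return 2
    return 3  # stands for "three or more" in this sketch


histogram = [0, 3, 2, 0, 0, 3, 1, 4, 2, 5, 0]
blocks = find_blocks(histogram)
print(blocks)  # [(1, 2), (5, 9)]
print([estimate_character_count(b, gamma1=3, gamma2=6) for b in blocks])  # [1, 2]
```

In this sketch only blocks judged to hold two or more characters would go on to the classification and cutout steps; single-character blocks are passed through unchanged, as in S406.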

In S407, column scanning is performed from the upper side and from the lower side of the circumscribing frame, each toward the opposite side, to classify the pattern regions, and the results are stored in the pattern memory. In S408, the inside of the circumscribing frame is row-scanned, the classification results are read out from the pattern memory, the cutout region is detected, and the cutout position is determined. In S409, the pattern in the pattern memory is transferred according to the cutout position. Once the entire pattern in the pattern memory has been transferred, the next character is processed.

Next, the processing of S407 and S408 in FIG. 6 is shown as detailed flowcharts in FIGS. 7 and 8, and its operation is described in order.

FIG. 7 shows the flow of classifying the character pattern regions and detecting the change points from white points to black points.

When character pattern data is input in S500, S501 and S502 perform initialization: the x and y coordinates of the pattern memory are set to the left end of the upper side of the character circumscribing frame, and the value U/D indicating the scanning direction is set to 2, because scanning proceeds from the upper side toward the lower side. In S503, the contents of the pattern memory are examined; if PM(x, y) = 1 (black point), processing moves to S507. If PM(x, y) ≠ 1 (white point), S504 sets the pattern memory contents to PM(x, y) = (PM(x, y) .OR. U/D).

In S505, the y counter is incremented or decremented according to the scanning direction. S506 keeps track of the column, and processing returns to S503 and repeats until the column has been fully scanned. When PM(x, y) = 1 is detected or the scan of one column is finished, S507 resets the y counter to the scan start point (on the upper side or on the lower side) and S508 increments the x counter; the processing from S503 is repeated until the x counter matches the right end (PR) of the character circumscribing frame. When the x counter matches the right end in this scan, the scanning direction is changed to run from the lower side toward the upper side, and the above processing is repeated.

At this point, S511 sets U/D to 4. The same processing is performed for the scan with U/D = 4, and when it has finished, the processing shown in the flowchart of FIG. 8 is performed.
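
The two column scans of FIG. 7 can be sketched as follows, as a software analogue rather than the circuit of the embodiment. The marks 2 and 4 and the OR operation follow the text; the grid layout, the constant names, and the function name are assumptions of this sketch.

```python
BLACK = 1
FROM_TOP = 2     # U/D value for the scan from the upper side toward the lower side
FROM_BOTTOM = 4  # U/D value for the scan from the lower side toward the upper side


def classify_background(pm):
    """Mark white cells reached by column scans from the upper and lower sides.

    `pm` is a list of rows covering the circumscribing frame (0 = white, 1 = black).
    Returns a new grid; the input is left unchanged.
    """
    out = [row[:] for row in pm]
    height = len(out)
    width = len(out[0]) if height else 0
    for ud in (FROM_TOP, FROM_BOTTOM):
        ys = range(height) if ud == FROM_TOP else range(height - 1, -1, -1)
        for x in range(width):
            for y in ys:
                if out[y][x] == BLACK:
                    break          # first black point ends the scan of this column
                out[y][x] |= ud    # PM(x, y) = PM(x, y) OR U/D
    return out


frame = [
    [0, 1, 0],
    [1, 0, 0],
    [0, 1, 1],
    [0, 0, 0],
]
for row in classify_background(frame):
    print(row)
# [2, 1, 2]
# [1, 0, 2]
# [4, 1, 1]
# [4, 4, 4]
```

The cell left at 0 lies between two black points in its column, so neither scan reaches it. In this sketch, a white cell in a column with no black point at all would receive both marks (2 OR 4 = 6).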

FIG. 8 shows the flow of detecting change points between pattern regions and determining the cutout region. The inside of the character circumscribing frame in the pattern memory is scanned horizontally, starting from the left end of the upper side, to determine the cutout region. S600 and S601 initialize the y counter and the x counter, respectively. S602 initializes the status registers ST1 to ST3, which hold the region changes encountered during the row scan: ST3 indicates the state at the current position, ST2 indicates the region preceding the current region, and ST1 indicates the region preceding that of ST2. In S603, the pattern memory contents PM(x, y) are compared with ST3; if they match, processing proceeds to S605. If they do not match, the coordinate is a change point, so S604 shifts the contents of ST2 and ST3 into ST1 and ST2, respectively. In S605, the contents of PM(x, y) are stored in ST3.
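
The register handling of S603 to S605 can be sketched for a single row as follows. The text does not state the values to which ST1 to ST3 are initialized; `None` is used here as an assumption, and the generator name is hypothetical.

```python
def track_states(row, initial=None):
    """Yield (x, ST1, ST2, ST3) after each cell of one row scan.

    ST3 holds the class at the current position, ST2 the region before it,
    and ST1 the region before that of ST2. `initial` is an assumed reset value.
    """
    st1 = st2 = st3 = initial
    for x, value in enumerate(row):
        if value != st3:         # change point: class differs from the current region
            st1, st2 = st2, st3  # shift ST2 and ST3 into ST1 and ST2
        st3 = value              # store PM(x, y) in ST3
        yield x, st1, st2, st3


classified_row = [2, 2, 1, 1, 4]
for state in track_states(classified_row):
    print(state)
# (0, None, None, 2)
# (1, None, None, 2)
# (2, None, 2, 1)
# (3, None, 2, 1)
# (4, 2, 1, 4)
```

After the last cell the registers hold the three most recent regions in order, which is the triple that the flow then compares against its predetermined combinations.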

In S606, it is determined whether the status register ST3 holds a 0 point or a D point; if so, the contents of the x counter are stored in xREG1. In S608, the combination of the states in status registers ST1, ST2, and ST3 is checked against the combinations shown in S618. If it matches, S611 to S613 store xREG1 in x1, the contents of the x counter in x2, and the contents of the y counter in y1. In S614 and S615, the y counter is incremented, and processing returns to S601 and repeats until the y counter matches the lower side of the character circumscribing frame. However, when the combination found in S608 is one of the latter two entries of S618, S611 stores in x1 the contents of xREG1 minus 1.

If, in S608, the contents of status registers ST1 to ST3 do not match any combination shown in S618, S609 and S610 increment the x counter, and processing returns to S603 and repeats until the x counter matches the right side of the character circumscribing frame. When the value of the y counter matches the lower side of the character circumscribing frame in S615, the cutout point is set to the values of x1, x2, and y1. FIG. 9 shows an example of a pattern cut out according to this embodiment; A-A' indicates the position at which the pattern is divided.

As described above, according to this embodiment, even when the preceding or following character pattern overlaps the character pattern of interest, the character pattern can be cut out without losing any part of it and without mixing in part of the neighboring character patterns.

Furthermore, although this embodiment describes the case in which two characters overlap, the same effect can be obtained when three or more characters overlap by determining the cutout points successively, taking two characters at a time from the head of the overlapping characters.

(Effects of the Invention) As described above, according to the present invention, column scanning from the upper and lower sides of the circumscribing frame of a character pattern, each toward the opposite side, classifies the background into regions according to scanning direction; based on this classification, the inside of the circumscribing frame is row-scanned to detect the cutout region and determine the cutout position, so characters can be cut out with high accuracy. Because this is achieved by scanning inside the circumscribing frame of the pattern and detecting change points, it can be implemented with a simple circuit configuration. Furthermore, because the invention allows characters to be cut out even when adjacent characters overlap, the spacing between character entry boxes can be reduced and the number of readable characters per line increased. The invention can therefore accommodate many types of forms and allows greater freedom in form design, with the effect that a high-performance OCR can be realized.

[Brief explanation of drawings]

FIG. 1 is a block diagram showing an embodiment of the present invention; FIG. 2 shows a specific example of column scanning in this embodiment; FIG. 3 shows the result of the column scanning of FIG. 2; FIG. 4 shows an example of a pattern for which the cutout position in FIG. 2 has been determined; FIG. 5 shows the configuration of the pattern memory in this embodiment; FIGS. 6, 7, and 8 are flowcharts showing the processing flow of this embodiment; FIG. 9 shows an example of a pattern cut out according to this embodiment; and FIG. 10 shows an example of a pattern using the conventional black point histogram.

200: image signal, 201: line buffer, 202: circumscribing frame creation circuit, 203: switching circuit, 204: pattern memory, 205: x counter, 206: y counter, 207: control circuit, 208: pattern region classification circuit, 209: white point to black point change point detection circuit, 210: pattern region change point detection circuit, 211: cutout region detection circuit, 212: y REG, 213: x REG ■, 214: x REG 1.

Patent applicant: Oki Electric Industry Co., Ltd.

Claims

[Claims]

A character extraction method in which a quantized character pattern string, obtained by photoelectrically converting a character string written on a form, is separated and extracted one character at a time, the method comprising: means for storing the character pattern string in a line buffer memory, creating a black point histogram in the column direction by scanning the line buffer memory one column at a time in the column direction of the character string, and detecting the width of the column-direction black point histogram by performing this scanning sequentially for each column; means for detecting how many characters the width of the black point histogram corresponds to, by comparing the average character width of the designated target characters with the width of the column-direction black point histogram; means for creating, based on the number of characters contained in the black point histogram, a black point histogram in the row direction by scanning the character pattern string in that region row by row in the row direction, and for detecting a character circumscribing frame from the black point histograms in the column and row directions; storage means for retaining the character pattern string within the character circumscribing frame; detecting means for reading the contents of the character pattern string from the storage means and detecting whether the contents are a character portion or a background portion; classification means for classifying the character pattern string within the character circumscribing frame, using the result of the detecting means, into four types: a background portion detected by scanning from the upper side, a background portion detected by scanning from the lower side, a character portion, and a background portion other than these; change point detection means for row-scanning only the inside of the character circumscribing frame again to detect change points at which the classification by the classification means changes, storing the change points sequentially, comparing them in sequence with predetermined combinations of change points, and detecting the matching change points; and means for determining a character cutout position based on the change points detected by the change point detection means.