JP3145264B2

JP3145264B2 - Character extraction device

Info

Publication number: JP3145264B2
Application number: JP02665195A
Authority: JP
Inventors: 好憲大熊; 晃治伊東
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1995-02-15
Filing date: 1995-02-15
Publication date: 2001-03-12
Anticipated expiration: 2016-03-12
Also published as: JPH08221516A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】この発明は帳票の画像パタンから
文字パタンを切り出すための文字切出し装置に関するも
のである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character extracting device for extracting a character pattern from a form image pattern.

【０００２】[0002]

【従来の技術】従来の文字認識においては、文字パタン
を認識用辞書と照合するために、１文字単位に文字パタ
ンを切り出す。文字１個分の記入領域を画成する文字枠
が印刷されている帳票の場合は、個々の文字枠の位置が
予め判っており従ってこの枠位置を切出し位置に用いる
ことができるので、切出し位置を検出するための処理を
行なわずに済む。これに対し、文字複数個分の記入領域
を画成する文字欄は印刷されているが文字枠は印刷され
ていない帳票の場合には、文字欄に記載されている個々
の文字毎に切出し位置を検出するための処理が必要とな
る。2. Description of the Related Art In conventional character recognition, a character pattern is cut out in units of one character in order to collate the character pattern with a recognition dictionary. In the case of a form in which a character frame defining an entry area for one character is printed, the position of each character frame is known in advance, and this frame position can be used as a cutout position. Does not need to be performed. On the other hand, in the case of a form in which a character column that defines an entry area for a plurality of characters is printed but a character frame is not printed, a cutout position is set for each character described in the character column. Requires a process for detecting.

【０００３】このような文字枠が印刷されていない帳票
の文字パタンを切り出すための従来装置として、例えば
特開昭６１−１９５４７４号公報に開示されているもの
がある。A conventional apparatus for cutting out a character pattern of a form on which such a character frame is not printed is disclosed, for example, in Japanese Patent Application Laid-Open No. 61-195474.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら上述の公
報に開示されている従来装置にあっては、ａ）複数個の
文字パタンを含む文字列パタンから基本パタンを検出す
る；ｂ）切出し用辞書と整合する基本パタンを検出す
る；ｃ）切出し用辞書と整合しなかった基本パタンに関
しては、単独の基本パタンについて切出し評価値を求め
ると共に、組み合わせ可能な全ての基本パタンの組み合
わせについて個々に切出し評価値を求める；ｃ）切出し
評価値を参照して最適な切出し位置を決定するという処
理により、切出し位置の検出を行なう。このように基本
パタンを切出し用辞書と照合する処理と、基本パタン単
独について切出し評価値を求める処理と、組み合わせ可
能な全ての基本パタンの組み合わせについて切出し評価
値を求める処理とが必要であるので、切出し位置検出処
理が複雑になり従って処理速度が遅くなる。また切出し
用辞書を記憶するために容量の大きなメモリが必要にな
る。However, in the conventional apparatus disclosed in the above-mentioned publication, a) a basic pattern is detected from a character string pattern including a plurality of character patterns; C) detecting a matching basic pattern; c) for a basic pattern that does not match the cut-out dictionary, obtains a cut-out evaluation value for a single basic pattern and individually obtains a cut-out evaluation value for all possible combinations of basic patterns; C) The extraction position is detected by a process of determining the optimum extraction position with reference to the extraction evaluation value. As described above, the process of matching the basic pattern with the extraction dictionary, the process of obtaining the extraction evaluation value for the basic pattern alone, and the process of obtaining the extraction evaluation value for all possible combinations of the basic patterns are necessary. The cutout position detection process becomes complicated, and the processing speed is reduced. In addition, a large-capacity memory is required to store the extraction dictionary.

【０００５】この発明の目的は上述した従来の問題点を
解決するため、より単純な処理で切出し位置を検出する
ことのできる文字切出し装置を提供することにある。An object of the present invention is to provide a character extracting apparatus which can detect an extracting position by a simpler process in order to solve the above-mentioned conventional problems.

【０００６】[0006]

【課題を解決するための手段】この目的を達成するた
め、この発明の文字切出し装置は、切出し対象文字が記
入される記入領域と当該記入領域に対応する属性判別領
域とを有する帳票の画像パタンから、切出し対象文字パ
タンを切り出す文字切出し装置において、属性判別領域
の画像パタンを走査して周辺分布を作成し、この属性判
別領域の周辺分布を用いて属性判別領域のチェックの有
無を検出し当該検出結果に対応した切出し許容個数の上
限値及び下限値を設定する切出し許容数設定手段と、記
入領域の画像パタンから文字要素パタンを検出し、この
文字要素パタンの検出位置を用いて仮の切出し位置を設
定し、この仮の切出し位置を用いて記入領域の仮の切出
し文字総個数を求め、仮の切出し文字総個数が、切出し
許容個数の上限値よりも大きいとき及び切出し許容個数
の下限値よりも小さいときは、仮の切出し位置を補正す
ると共に当該補正位置を用いて仮の切出し文字総個数を
求め直し、仮の切出し文字総個数が、切出し許容個数の
上限値以下であってかつ切出し許容個数の下限値以上と
なったとき、当該仮の切出し文字総個数を得た仮の切出
し位置を、対象パタン切出し位置として決定する切出し
位置決定手段と、対象パタン切出し位置を用いて切出し
対象文字パタンを切り出すパタン読出し手段とを備えて
成ることを特徴とする。In order to achieve this object, a character extracting apparatus according to the present invention provides an image pattern of a form having an entry area in which a character to be extracted is entered and an attribute determination area corresponding to the entry area. Then, in a character extraction device that extracts an extraction target character pattern, a peripheral distribution is created by scanning an image pattern of an attribute determination area, and the presence or absence of a check of an attribute determination area is detected using the peripheral distribution of the attribute determination area to detect A cutout allowable number setting means for setting an upper limit value and a lower limit value of the cutout allowable number corresponding to the detection result, and a character element pattern is detected from the image pattern of the entry area, and provisional cutout is performed using the detected position of the character element pattern. Set the position and calculate the total number of temporary cutout characters in the entry area using this temporary cutout position. Is larger than the lower limit of the allowable number of cutout characters, the temporary cutout position is corrected, and the total number of temporary cutout characters is calculated again using the corrected position. When the number is equal to or less than the upper limit of the number and equal to or greater than the lower limit of the allowable number of cutouts, a cutout position determining unit that determines a tentative cutout position that obtains the tentative cutout character total number as a target pattern cutout position, Pattern reading means for extracting a character pattern to be extracted using the target pattern extraction position.

【０００７】[0007]

【作用】このような構成の発明によれば、帳票の使用に
関し、以下に述べる状況１）、２）が存在することを利
用する。According to the invention having such a configuration, the use of the form utilizes the fact that the following situations 1) and 2) exist.

【０００８】１）帳票の記入者は、記入領域に切出し対
象文字を記入すると共に、当該切出し対象文字の属性に
応じて選択した一又は複数の属性判別領域にチェックを
付す。従って切出し対象文字は属性判別領域のチェック
の有無に応じた属性を有する。1) A person who fills out a form writes a character to be cut out in a writing area and checks one or a plurality of attribute discrimination areas selected according to the attribute of the character to be cut out. Therefore, the extraction target character has an attribute according to whether or not the attribute determination area is checked.

【０００９】２）記入される切出し対象文字の総個数と
属性判別領域のチェックの有無との間には、相関関係が
存在し、この相関関係に基づいて、記入される切出し対
象文字の総個数の上限値Ｇ_max 及び下限値Ｇ_min を蓄積
しておくことができる。2) There is a correlation between the total number of characters to be extracted to be written and whether or not the attribute discrimination area is checked, and the total number of characters to be extracted to be written is based on this correlation. it can be the previously accumulated upper limit value G _max and the lower limit value G _min.

【００１０】このような状況１）、２）が存在する典型
的な帳票の例は、住所を記入するようにした帳票であ
る。A typical example of a form in which such situations 1) and 2) exist is a form in which an address is entered.

【００１１】例えば、記入者が在住する都道府県の名称
を記入するようにした記入領域と当該領域に対する４個
の属性判別領域とを有し、都、道、府及び県の文字がそ
れぞれ個別の属性判別領域に印刷されている帳票の場合
を考える。この場合、４つの属性判別領域のうちのいず
れかひとつのみにチェックが付されていることとなる。
東京在住の記入者であれば、記入領域の切出し対象文字
として東京を記入し、そして都が印刷されている属性判
別領域にチェックを付すこととなる。また都が印刷され
た属性判別領域にチェックが付される場合、道が印刷さ
れた属性判別領域にチェックが付される場合、及び、府
が印刷された属性判別領域にチェックが付される場合の
それぞれにおいて、記入される切出し対象文字の総個数
の上限値Ｇ_max 及び下限値Ｇ_min はＧ_max ＝Ｇ_min ＝２
個となる。県が印刷された属性判別領域にチェックが付
される場合においては、切出し対象文字が和歌山、埼玉
或はそのほかの県名であるので、上限値Ｇ_max 及び下限
値Ｇ_min はＧ_max ＝３個及びＧ_min ＝２個となる。この
ように、属性判別領域の有無と、上限値Ｇ_max 及び下限
値Ｇ_min との間には相関関係が存在し、この相関関係に
基づいて、上限値Ｇ_max 及び下限値Ｇ_min をデータとし
て蓄積しておくことができる。[0011] For example, it has an entry area in which the name of the prefecture where the entrant lives is entered and four attribute discrimination areas corresponding to the entry area, and the characters of the capital, the road, the prefecture, and the prefecture are individual. Consider a form printed in the attribute determination area. In this case, only one of the four attribute determination areas is checked.
If the resident is a resident of Tokyo, the user enters Tokyo as a character to be cut out of the entry area, and checks the attribute determination area where the capital is printed. When the attribute discrimination area where the capital is printed is checked, the attribute discrimination area where the road is printed is checked, and the attribute discrimination area where the prefecture is printed is checked. In each of the above, the upper limit value G _max and the lower limit value G _min of the total number of cut-out target characters to be written are G _max = G _min = 2.
Individual. When the attribute discrimination area in which the prefecture is printed is checked, the character to be extracted is Wakayama, Saitama or another prefecture name, so the upper limit G _max and the lower limit G _min are G _max = 3 And G _min = 2. Thus, the presence or absence of attribute discrimination area, there is a correlation between the upper limit value G _max and the lower limit value G _min, on the basis of this correlation, the upper limit value G _max and the lower limit value G _min as data Can be stored.

【００１２】ここに例示したように、記入される切出し
対象文字の総個数は上限値Ｇ_max 以下であってかつ下限
値Ｇ_min 以上であるので、記入される切出し対象文字の
総個数が、上限値Ｇ_max よりも大きくなることはなくか
つ下限値Ｇ_min よりも小さくなることはない。従って仮
の切出し位置を用いて求めた仮の切出し文字総個数が、
上限値Ｇ_max 及び下限値Ｇ_min の数値範囲内にあるか否
かを判定することによって、当該仮の切出し位置を、対
象パタン切出し位置（切出し対象文字の切出し位置）に
用いることが適切か否かを判定でき、これがため上限値
Ｇ_max 及び下限値Ｇ_min を、切出し許容個数の上限値及
び下限値に用いることができる。[0012] As illustrated herein, the total number of cut object character to be entered is the upper limit value G _max or less was it and the lower limit value G _min or more, the total number of cut object character to be entered is, the upper limit It does not become larger than the value _Gmax and does not become smaller than the lower limit _Gmin . Therefore, the total number of temporary cut characters obtained using the temporary cut position is
Whether by determining whether it is within the numerical range of the upper limit value G _max and the lower limit value G _min, the cropping position of the temporary target pattern or extraction position (cut target character extraction position) for use is properly or it can be determined, which the upper limit value G _max and the lower limit value G _min, it is possible to use the upper and lower limits of the cut-out tolerance number.

【００１３】さらに属性判別領域の周辺分布は、当該領
域にチェックを付した状態では大きな値となり、当該領
域にチェックを付していない状態では小さな値となるの
で、このチェックの有無に応じて周辺分布が変化するこ
とを利用することにより、属性判別領域の有無を検出で
きる。従って属性判別領域のチェックの有無を検出し、
当該検出結果に対応した切出し許容個数の上限値及び下
限値を、前述のデータとして蓄積してある上限値Ｇ_max
及び下限値Ｇ_min のなかから、選択し設定できる。Further, the peripheral distribution of the attribute determination area has a large value when the area is checked, and has a small value when the area is not checked. By utilizing the fact that the distribution changes, the presence or absence of the attribute determination area can be detected. Therefore, the presence or absence of the check of the attribute determination area is detected,
The upper limit value and the lower limit value of the cutout allowable number corresponding to the detection result are stored in the upper limit value G _max stored as the above-described data.
And the lower limit _Gmin .

【００１４】[0014]

【実施例】以下、図面を参照し、この発明の実施例につ
いて説明する。尚、図面は発明が理解できる程度に概略
的に示してあるにすぎず、従って発明を図示例に限定す
るものではない。Embodiments of the present invention will be described below with reference to the drawings. The drawings are only schematically shown to the extent that the invention can be understood, and thus the invention is not limited to the illustrated examples.

【００１５】図１はこの発明の実施例の全体構成を概略
的に示す機能ブロック図である。同図に示すこの実施例
の文字切出し装置１０は、画像記憶手段１２、フォーマ
ット記憶手段１４、切出し許容数設定手段１６、切出し
位置決定手段１８及びパタン読出し手段２０を備える。FIG. 1 is a functional block diagram schematically showing an entire configuration of an embodiment of the present invention. The character extracting apparatus 10 of this embodiment shown in FIG. 1 includes an image storage unit 12, a format storage unit 14, an allowable number of extraction units setting unit 16, an extraction position determination unit 18, and a pattern reading unit 20.

【００１６】画像記憶手段１２は、切出し対象文字が記
入される記入領域と当該記入領域に対応する属性判別領
域とを有する帳票の画像パタンを格納する。帳票の記入
者は、伝達したい情報を記入領域の切出し対象文字によ
って表し、当該伝達情報の属性を属性判別領域のチェッ
クの有無によって表す。The image storage means 12 stores an image pattern of a form having an entry area in which a character to be extracted is entered and an attribute determination area corresponding to the entry area. The person who fills out the form indicates the information to be transmitted by the character to be cut out of the entry area, and the attribute of the transmission information by the presence or absence of the check of the attribute determination area.

【００１７】フォーマット記憶手段１４は、少なくと
も、記入領域の画像パタンを格納した領域のアドレスと
属性判別領域の画像パタンを格納した領域のアドレスと
を、フォーマット情報として格納する。The format storage means 14 stores at least the address of the area storing the image pattern of the entry area and the address of the area storing the image pattern of the attribute determination area as format information.

【００１８】切出し許容数設定手段１６は、属性判別領
域の画像パタンを走査して周辺分布を作成し、この属性
判別領域の周辺分布を用いて属性判別領域のチェックの
有無を検出し、当該検出結果に対応した切出し許容個数
の上限値及び下限値を設定する。The cutout allowable number setting means 16 scans the image pattern of the attribute discrimination area to create a peripheral distribution, detects whether or not the attribute discrimination area is checked using the peripheral distribution of the attribute discrimination area, and performs the detection. The upper limit and the lower limit of the allowable number of cutouts corresponding to the result are set.

【００１９】切出し位置決定手段１８は、記入領域の画
像パタンから文字要素パタンを検出し、この文字要素パ
タンの検出位置を用いて仮の切出し位置を設定する。さ
らに切出し位置決定手段１８は、仮の切出し位置を用い
て記入領域の仮の切出し文字総個数を求め、そして仮の
切出し文字総個数が、切出し許容個数の上限値よりも大
きいとき及び切出し許容個数の下限値よりも小さいとき
は、仮の切出し位置を補正すると共に当該補正位置を用
いて仮の切出し文字総個数を求め直し、また仮の切出し
文字総個数が、切出し許容個数の上限値以下であってか
つ切出し許容個数の下限値以上となったとき、当該仮の
切出し文字総個数を得た仮の切出し位置を、対象パタン
切出し位置として決定する。The cutout position determining means 18 detects a character element pattern from the image pattern of the entry area, and sets a temporary cutout position using the detected position of the character element pattern. Further, the cut-out position determining means 18 calculates the provisional cut-out character total number in the entry area using the provisional cut-out position, and when the provisional cut-out character total number is larger than the upper limit of the cut-out allowable number and the cut-out allowable number. Is smaller than the lower limit value, the provisional clipping position is corrected, and the total number of provisional clipping characters is calculated again using the correction position. When the number of cut-out characters exceeds the lower limit of the cut-out allowable number, the tentative cut-out position at which the tentative cut-out character total number is obtained is determined as the target pattern cut-out position.

【００２０】パタン切出し手段２０は、対象パタン切出
し位置を用いて切出し対象文字パタンを切り出す。The pattern extracting means 20 extracts a character pattern to be extracted using the target pattern extracting position.

【００２１】（帳票）図２はこの実施例で用いる帳票の
一例を示す。この実施例では、帳票２２は、帳票記入者
の伝達情報として住所を記入するようにした帳票の例で
あって、この帳票２２は、都、道、府或は県の名称を記
入する記入領域２４１及び当該領域に対応する４個の属
性判別領域２６１と、区、市或は郡の名称を記入する記
入領域２４２及び当該領域に対応する３個の属性判別領
域２６２と、区、町或は村の名称を記入する記入領域２
４３及び当該領域に対応する３個の属性判別領域２６３
とを有する。図中、これら記入領域２４１、２４２、２
４３及び属性判別領域２６１、２６２、２６３の位置を
一点鎖線で表したが、この一点鎖線は帳票２２には印刷
されていない。(Form) FIG. 2 shows an example of a form used in this embodiment. In this embodiment, the form 22 is an example of a form in which an address is entered as information transmitted to a form writer, and the form 22 has an entry area for entering the name of a city, a road, a prefecture, or a prefecture. 241 and four attribute determination areas 261 corresponding to the area, an entry area 242 for entering the name of a ward, a city or a county, and three attribute determination areas 262 corresponding to the area, a ward, a town or Entry area 2 for entering the name of the village
43 and three attribute determination areas 263 corresponding to the area
And In the figure, these entry areas 241, 242, 2
The positions of the reference numeral 43 and the attribute determination areas 261, 262, 263 are indicated by dashed lines, but the dashed lines are not printed on the form 22.

【００２２】一方、帳票２２には、住所記入欄を表す線
ここでは実線２８と、住所記入欄のなかを区分する線こ
こでは点線３０と、ご住所欄という項目名称とを印刷し
てある。On the other hand, the form 22 is printed with a line representing an address entry column, here a solid line 28, a line dividing the address entry column, here a dotted line 30, and an item name of an address column.

【００２３】記入領域２４１、属性判別領域２６１、記
入領域２４２、属性判別領域２６２、記入領域２４３及
び属性判別領域２６３を、文字列方向Ｘに沿って順次に
配列し、これら領域２４１、２４２、２４３、２６１、
２６２、２６３を、実線２８で囲む。そして相隣合う記
入領域２４１、２４２の間を点線３０で区切ると共に、
相隣合う記入領域２４２、２４３の間を他の点線３０で
区切る。The writing area 241, the attribute determining area 261, the writing area 242, the attribute determining area 262, the writing area 243, and the attribute determining area 263 are sequentially arranged along the character string direction X, and these areas 241, 242, 243 are arranged. 261
262 and 263 are surrounded by a solid line 28. Then, a space between adjacent entry areas 241 and 242 is separated by a dotted line 30, and
The adjacent writing areas 242 and 243 are separated by another dotted line 30.

【００２４】さらに帳票２２には、記入領域２４１の各
属性判別領域２６１内にそれぞれ属性判別用文字３２１
を印刷してある。ここでは、各属性判別領域２６１にそ
れぞれ異なる種類の属性判別用文字３２１を印刷してお
り、これら属性判別用文字３２１を、都、道、府及び県
の４種としている。属性判別領域２６１は、当該領域２
６１内に印刷された属性判別用文字３２１周辺の余白領
域を含む。Further, the form 22 has an attribute determining character 321 in each attribute determining area 261 of the entry area 241.
Is printed. Here, different types of attribute determination characters 321 are printed on the respective attribute determination regions 261, and these four types of attribute determination characters 321 are a city, a road, a prefecture, and a prefecture. The attribute determination area 261 is the area 2
61 includes a margin area around the attribute discrimination character 321 printed.

【００２５】同様にして、記入領域２４２の各属性判別
領域２６２内にそれぞれ属性判別用文字３２２を印刷し
てある。ここでは、各属性判別領域２６２にそれぞれ異
なる種類の属性判別用文字３２２を印刷しており、これ
ら属性判別用文字３２２を区、市及び郡の３種としてい
る。属性判別領域２６２は、当該領域２６２内に印刷さ
れた属性判別用文字３２２周辺の余白領域を含む。Similarly, an attribute determination character 322 is printed in each attribute determination area 262 of the entry area 242. Here, different types of attribute determination characters 322 are printed in the respective attribute determination regions 262, and these three types of attribute determination characters 322 are ward, city, and county. The attribute determination area 262 includes a margin area around the attribute determination character 322 printed in the area 262.

【００２６】また記入領域２４３の各属性判別領域２６
３内にそれぞれ属性判別用文字３２３を印刷してある。
ここでは、各属性判別領域２６３にそれぞれ異なる種類
の属性判別用文字３２３を印刷しており、これら属性判
別用文字３２３を区、町及び村の３種としている。属性
判別領域２６３は、当該領域２６３内に印刷された属性
判別用文字３２３周辺の余白領域を含む。Each attribute discrimination area 26 of the entry area 243
3, the character 323 for attribute determination is printed.
Here, different types of attribute determination characters 323 are printed in the respective attribute determination regions 263, and these three types of attribute determination characters 323 are ward, town, and village. The attribute determination area 263 includes a margin area around the attribute determination character 323 printed in the area 263.

【００２７】帳票２２の記入者は、その住所に応じて、
都、道、府或は県の名称を表す切出し対象文字３４１を
記入領域２４１に記入し、当該対象文字３４１に対応す
る属性判別用文字３２１が印刷された属性判別領域２６
１に、チェック３６１を記入する。同様にして、区、市
或は郡の名称を表す切出し対象文字３４２を記入領域２
４２に記入し、当該対象文字３４２に対応する属性判別
用文字３２２が印刷された属性判別領域２６２に、チェ
ック３６２を記入する。さらに区、町或は村の名称を表
す切出し対象文字３４３を記入領域２４３に記入し、当
該対象文字３４３に対応する属性判別用文字３２３が印
刷された属性判別領域２６３に、チェック３６３を記入
する。The person who fills out the form 22 according to the address,
An extraction target character 341 representing the name of a city, a road, a prefecture or a prefecture is entered in the entry area 241, and the attribute determination character 321 corresponding to the target character 341 is printed on the attribute determination area 26.
A check 361 is entered in 1. Similarly, the extraction target character 342 representing the name of a ward, a city or a county is entered in the entry area 2.
42, and a check 362 is entered in the attribute determination area 262 in which the attribute determination character 322 corresponding to the target character 342 is printed. Further, a cut-out target character 343 representing the name of a ward, a town, or a village is entered in the entry area 243, and a check 363 is entered in the attribute determination area 263 on which the attribute determination character 323 corresponding to the target character 343 is printed. .

【００２８】例えば図示例では、切出し対象文字３４１
として都の名称を表す東京の２文字、切出し対象文字３
４２として市の名称を表すＸＸの２文字、切出し対象文
字３４３として町の名称を表す△△△の３文字、チェッ
ク３６１〜３６３として〇を記入している。For example, in the illustrated example, the character 341 to be cut out
2 characters of Tokyo representing the name of the city, 3 characters to be extracted
42, two characters XX representing the name of the city, three characters 切 representing the name of the town as the extraction target character 343, and 町 as the checks 361-363.

【００２９】（画像記憶手段）この実施例では、画像記
憶手段１２は、帳票２２の画像パタンをスキャナ３８か
ら入力し、当該入力パタンを格納（記憶）する。(Image Storage Means) In this embodiment, the image storage means 12 inputs the image pattern of the form 22 from the scanner 38 and stores (stores) the input pattern.

【００３０】スキャナ３８は帳票２２を光学的に走査し
て、帳票２２からの光信号を、画素単位に量子化された
電気信号に変換する。そしてスキャナ３８はこの電気信
号を帳票２２の画像パタンとして画像記憶手段１２に記
憶する。ここでは、画像パタンは、文字又は文字背景を
表す２値の電気信号である。The scanner 38 optically scans the form 22 and converts an optical signal from the form 22 into an electric signal quantized in pixel units. Then, the scanner 38 stores the electric signal in the image storage unit 12 as an image pattern of the form 22. Here, the image pattern is a binary electric signal representing a character or a character background.

【００３１】図３は帳票の画像パタンの説明に供する図
である。図にあっては、帳票２２の画像パタン４０を二
点鎖線で囲んで示し、この画像パタン４０のうち、文字
を表す画像パタンを黒色で及び文字背景を表す画像パタ
ンを白色で表している。FIG. 3 is a diagram for explaining the image pattern of a form. In the figure, an image pattern 40 of the form 22 is shown by being surrounded by a two-dot chain line, and among the image patterns 40, an image pattern representing a character is represented in black and an image pattern representing a character background is represented in white.

【００３２】ここでは、帳票２２の実線２８と点線３０
とご住所欄という項目名称とを、スキャナ３８による読
取り不能な色（ドロップアウトカラー）例えば赤色で印
刷してあり、従ってこれら実線２８、点線３０及び項目
名称の光信号は文字背景を表す画像パタン４０に変換さ
れる。Here, the solid line 28 and the dotted line 30 of the form 22
The item name of the address column is printed in a color (dropout color) that cannot be read by the scanner 38, for example, red. Therefore, the solid line 28, the dotted line 30, and the light signal of the item name are image patterns representing a character background. Converted to 40.

【００３３】また属性判別用文字３２１〜３２３を、ス
キャナ３８による読取り可能な色例えば黒色で印刷する
と共に、切出し対象文字３４１〜３４３及びチェック３
６１〜３６３をスキャナ３８による読取り可能な色例え
ば黒色で記入してあり、従ってこれら文字３２１〜３２
３、３４１〜３４３及びチェック３６１〜３６３の光信
号は文字を表す画像パタン４０に変換される。The attributes determining characters 321 to 323 are printed in a color readable by the scanner 38, for example, black, and the characters 341 to 343 to be extracted and the check 3
61 to 363 are written in a color readable by the scanner 38, for example, black.
The optical signals of 3, 341 to 343 and checks 361 to 363 are converted into image patterns 40 representing characters.

【００３４】画像記憶手段１２の格納領域上には、仮想
的に、Ｘ−Ｙ座標系を設定してあり、これら座標位置Ｘ
及びＹで表される画素位置の画像パタン４４を、読み出
すことができるように、画像記憶手段１２を構成してい
る。そして帳票２２の文字列方向ＸがＸ軸方向と平行と
なるように、画像パタン４４を格納している。An XY coordinate system is virtually set on the storage area of the image storage means 12, and these coordinate positions X
The image storage means 12 is configured so that the image pattern 44 at the pixel position represented by Y and Y can be read. The image pattern 44 is stored such that the character string direction X of the form 22 is parallel to the X-axis direction.

【００３５】例えば、スキャナ３８の主走査方向を帳票
２２の文字列方向Ｘとほぼ平行となるように、帳票２２
をスキャナ３８にセッティングして、帳票２２を光学的
に走査することにより、文字列方向ＸがＸ軸方向と平行
となるように画像パタン４４を格納する。For example, the form 22 is set so that the main scanning direction of the scanner 38 is substantially parallel to the character string direction X of the form 22.
Is set on the scanner 38, and the form 22 is optically scanned to store the image pattern 44 so that the character string direction X is parallel to the X-axis direction.

【００３６】（フォーマット記憶手段）この実施例で
は、フォーマット記憶手段１４は、記入領域２４１、２
４２、２４３の画像パタン４０をそれぞれ各領域毎に個
別に画像記憶手段１２から読み出すためのアドレスと、
属性判別領域２６１、２６２、２６３の画像パタン４０
をそれぞれ各領域毎に個別に画像記憶手段１２から読み
出すためのアドレスとを記憶する。ここでは、これらア
ドレスを、画像記憶手段１２の格納領域上に設定した座
標位置Ｘ、Ｙで表す（以下、このアドレスをアドレス
Ｘ、Ｙと表す）。(Format storage means) In this embodiment, the format storage means 14 stores the entry areas 241, 2
Addresses for individually reading out the image patterns 40 of 42 and 243 from the image storage means 12 for each area;
Image pattern 40 of attribute determination areas 261, 262, 263
And an address for reading from the image storage unit 12 for each area. Here, these addresses are represented by coordinate positions X and Y set on the storage area of the image storage unit 12 (hereinafter, these addresses are represented as addresses X and Y).

【００３７】さらにフォーマット記憶手段１４は、属性
判別領域２６１、２６２、２６３の周辺分布を正規化す
るための定数Ａ_n と、文字要素パタンを検出するための
閾値ＴＨＬ１とを記憶する。Furthermore format storage unit 14 stores the constants A _n for normalizing the marginal distribution of the attribute discrimination region 261, 262, 263, and a threshold value THL1 for detecting a character element pattern.

【００３８】（切出し許容数設定手段）この実施例で
は、切出し許容数設定手段１６は、周辺分布作成手段１
６ａ、チェック領域検出手段１６ｂ及び許容数記憶手段
１６ｃを有する。(Allowable Extraction Number Setting Means) In this embodiment, the allowable extraction number setting means 16 includes the margin distribution creation means 1.
6a, a check area detecting means 16b and an allowable number storing means 16c.

【００３９】周辺分布作成手段１６ａは、各属性判別領
域毎に個別に周辺分布を作成し、各周辺分布を正規化す
る。The margin distribution creating means 16a creates margin distributions individually for each attribute discrimination area, and normalizes each margin distribution.

【００４０】チェック領域検出手段１６ｂは、各記入領
域毎に、正規化した周辺分布のなかで最大となる周辺分
布を検出し、正規化した周辺分布が最大となる属性判別
領域を、当該判別領域に対応した記入領域に関しチェッ
クが付された属性判別領域として検出する。各記入領域
毎に、正規化した周辺分布が最大となる属性判別領域
を、チェックが付された属性判別領域（チェック有りの
属性判別領域）と判定すると共に正規化した周辺分布が
最大とならない属性判別領域を、チェックが付されてい
ない属性判別領域（チェック無しの属性判別領域）と判
定する。そしてチェック領域検出手段１６ｂは、各記入
領域毎に、チェックの有無に対応する切出し許容数の上
限値Ｇ_max 及びＧ_min を設定する。The check area detecting means 16b detects, for each entry area, the peripheral distribution which is the largest among the normalized peripheral distributions, and determines the attribute discriminating area where the normalized peripheral distribution is the largest. Is detected as an attribute discrimination area with a check for the entry area corresponding to. For each entry area, the attribute discrimination area in which the normalized peripheral distribution is the largest is determined as a checked attribute discrimination area (the checked attribute discrimination area), and the attribute in which the normalized peripheral distribution is not the largest. The determination area is determined as an attribute determination area that is not checked (an attribute determination area without a check). Then, the check area detecting means 16b sets the upper limit values G _max and G _min of the cutout allowable number corresponding to the presence or absence of the check for each entry area.

【００４１】許容数記憶手段１６ｃは、各記入領域毎
に、チェックの有無の検出結果に対応した切出し許容数
の上限値Ｇ_max 及び下限値Ｇ_min を記憶しており、チェ
ック領域検出手段１６ｂは、チェックの有無に対応した
上限値Ｇ_max 及び下限値Ｇ_ｍｉｎを、許容数記憶手段１
６ｃから読み出す。The permissible number storage means 16c stores the upper limit value G _max and the lower limit value G _min of the permissible number of cutouts corresponding to the detection result of the check for each entry area. , The upper limit G _max and the lower limit G _min corresponding to the presence or absence of the check,
6c.

【００４２】チェックの有無の検出について、一例を挙
げて、より具体的に説明する。ここでは、属性判別領域
２６３に着目して説明する。図４及び図５はその説明に
供する図である。図４及び図５の分図（Ａ）はチェック
無し及びチェック有りの場合における属性判別領域２６
３の画像パタンを表す図であって、これら図にあっては
図３と同様にして文字及び文字背景を表す画像パタンを
示してある。また図４及び図５の分図（Ｂ）はチェック
無し及びチェック有りの場合における属性判別領域２６
３の周辺分布を示す図であって、これら図にあっては横
軸に副走査位置Ｙ及び縦軸に累積文字画素数ｆ_ｎ（Ｙ）
を取って示してある。The detection of the presence / absence of the check will be described more specifically with an example. Here, the description will focus on the attribute determination area 263. 4 and 5 are diagrams for explanation. FIGS. 4A and 4B show the attribute discrimination area 26 when there is no check and when there is a check.
3A and 3B are diagrams showing image patterns, and in these figures, image patterns showing characters and character backgrounds are shown in the same manner as in FIG. 3. FIGS. 4 and 5 show the attribute discrimination area 26 when there is no check and when there is a check.
3 is a diagram showing the peripheral distribution of No. 3 in which the horizontal axis represents the sub-scanning position Y and the vertical axis represents the cumulative number of character pixels f _n (Y)
Is shown.

【００４３】この実施例の帳票２２にあっては、記入領
域２４３に対し３個の属性判別領域２６３を設定してお
り、属性判別用文字３２３として区、町及び村がそれぞ
れ異なる属性判別領域２６３内に印刷してある。ここで
は、区、町及び村が印刷されている属性判別領域２６３
をそれぞれ、第１番目、第２番目及び第３番目の属性判
別領域２６３とする。In the form 22 of this embodiment, three attribute discrimination areas 263 are set for the entry area 243, and the attribute discrimination areas 263 having different wards, towns and villages are used as the attribute discrimination characters 323. Printed inside. Here, the attribute determination area 263 on which the ward, town, and village are printed
Are the first, second, and third attribute determination areas 263, respectively.

【００４４】そして文字列方向Ｘにおける属性判別領域
２６３の始端及び終端の位置をＸ_L及びＸ_R 、また文字
列方向Ｘと交差する方向Ｙにおける属性判別領域２６３
の始端及び終端の位置をＹ_T 及びＹ_B と表せば、第１番
目の属性判別領域２６３にあっては、Ｘ_L ＝Ｘ１、Ｘ_R
＝Ｘ２、Ｙ_T ＝Ｙ１及びＹ_B ＝Ｙ２とし、Ｘ１≦Ｘ≦Ｘ
２かつＹ１≦Ｙ≦Ｙ２なる範囲を、第１番目の属性判別
領域２６３の画像パタン４０を読み出すためのアドレス
Ｘ、Ｙとする。また第２番目の属性判別領域２６３にあ
っては、Ｘ_L ＝Ｘ１、Ｘ_R ＝Ｘ２、Ｙ_T ＝Ｙ２及びＹ_B
＝Ｙ３とし、Ｘ１≦Ｘ≦Ｘ２かつＹ２≦Ｙ≦Ｙ３なる範
囲を、第２番目の属性判別領域２６３の画像パタン４０
を読み出すためのアドレスＸ、Ｙとする。さらに第３番
目の属性判別領域２６３にあっては、Ｘ_L ＝Ｘ１、Ｘ_R
＝Ｘ２、Ｙ_T ＝Ｙ３及びＹ_B ＝Ｙ４とし、Ｘ１≦Ｘ≦Ｘ
２かつＹ３≦Ｙ≦Ｙ４なる範囲を、第３番目の属性判別
領域２６３の画像パタン４０を読み出すためのアドレス
Ｘ、Ｙとしている。The start and end positions of the attribute discrimination area 263 in the character string direction X are X _L and X _R , and the attribute discrimination area 263 in the direction Y intersecting the character string direction X.
Expressed in the position of the start and end with Y _T and Y _B, In the first-th attribute discrimination region _{263, X L = X1, X} R
= X2, and Y _T = Y1 and _{Y B = Y2, X1 ≦ X} ≦ X
The range of 2 and Y1 ≦ Y ≦ Y2 is defined as addresses X and Y for reading the image pattern 40 in the first attribute determination area 263. Also In the first second attribute determination area _{263, X L = X1, X} R = X2, Y T = Y2 and Y _B
= Y3, and the range of X1 ≦ X ≦ X2 and Y2 ≦ Y ≦ Y3 is defined as the image pattern 40 of the second attribute determination area 263.
Are the addresses X and Y for reading out the data. Further, in the third attribute determination area 263, X _L = X1, X _R
= X2, Y _T = Y3 and Y _B = Y4, and X1 ≦ X ≦ X
The range of 2 and Y3 ≦ Y ≦ Y4 is defined as addresses X and Y for reading the image pattern 40 in the third attribute determination area 263.

【００４５】まず、周辺分布作成手段１６ａは、第ｎ番
目（ｎはｎ≧１なる自然数であって、ここではｎ＝１、
２、３）の属性判別領域２６３のアドレスＸ、Ｙをフォ
ーマット記憶手段１４から読み出し、そして当該アドレ
スＸ、Ｙに対応する属性判別領域２６３の画像パタン４
０を、画像記憶手段１２から読み出す。First, the marginal distribution creating means 16a determines the n-th (n is a natural number satisfying n ≧ 1; here, n = 1,
The addresses X and Y of the attribute determination area 263 of (2) and (3) are read from the format storage unit 14, and the image pattern 4 of the attribute determination area 263 corresponding to the addresses X and Y is read.
0 is read from the image storage unit 12.

【００４６】次いで周辺分布作成手段１６ａは、主走査
方向を文字列方向Ｘ及び副走査方向を文字列方向Ｘと交
差する方向Ｙとして、第ｎ番目の属性判別領域２６３の
画像パタン４０を走査し、各副走査位置Ｙ毎に、走査線
上の累積文字画素数f_n(Y) を求める。累積文字画素数f_n
(Y) は、副走査位置Ｙの走査線上に存在しかつ第ｎ番目
の属性判別領域２６３内に存在する文字画素の総個数で
ある。Next, the peripheral distribution creating means 16a scans the image pattern 40 in the n-th attribute discrimination area 263 with the main scanning direction as the character string direction X and the sub-scanning direction as the direction Y intersecting with the character string direction X. , The cumulative number of character pixels f _n (Y) on the scanning line is determined for each sub-scanning position Y. Cumulative character pixel number f _n
(Y) is the total number of character pixels existing on the scanning line at the sub-scanning position Y and existing in the n-th attribute determination area 263.

【００４７】次いで周辺分布作成手段１６ａは第ｎ番目
の属性判別領域２６３の周辺分布∫f_n(Y) dYを求める。
周辺分布∫f_n(Y) dYは、第ｎ番目の属性判別領域２６３
の始端位置Ｙ_T から終端位置Ｙ_B までの累積文字画素数
f_n(Y) の総和である。Next, the peripheral distribution creating means 16a obtains a peripheral distribution Δf _n (Y) dY of the n-th attribute discrimination area 263.
The marginal distribution ∫f _n (Y) dY is the n-th attribute determination area 263
Number of character pixels from the start position Y _T to the end position Y _B
It is the sum of f _n (Y).

【００４８】次いで周辺分布作成手段１６ａは、第ｎ番
目の属性判別領域２６３の周辺分布∫f_n(Y) dYを正規化
するための定数Ａ_n を、フォーマット記憶手段１４から
読み出し、第ｎ番目の属性判別領域２６３の周辺分布∫
f_n(Y) dYを定数Ａ_n で正規化することにより、正規化し
た周辺分布1/A_n・∫f_n(Y) dYを求める。[0048] Then the peripheral distribution creation unit 16a is the constant A _n for normalizing the marginal distribution ∫f _n (Y) dY of the n-th attribute discrimination region 263, read from the format storage unit 14, the n-th Distribution around the attribute discrimination area 263 of
By normalizing f _n (Y) dY with a constant _An , a normalized marginal distribution 1 / A _n · ∫f _n (Y) dY is obtained.

【００４９】周辺分布∫f_n(Y) dYを正規化するための正
規化定数Ａ_n は次式（数１）で表される。The normalization constant A _n for normalizing the marginal distribution [integral] F _n (Y) dY is represented by the following equation (Equation 1).

【００５０】1/A_n・∫F_n(Y) dY＝Ｃ・・・・（数１）但し、∫F_n(Y) dY：チェックを付さない状態で予め求め
た第ｎ番目の属性判別領域の周辺分布∫f_n(Y) dY Ｃ：正の整数である定数この実施例の帳票２２では属性判別領域２６３に関して
はｎ＝１、２、３としているので、ｎ＝１、２、３とし
て（数１）を書き改めると、次式（数２）の如くなる。[0050] _{_{1 / A n · ∫F n (}} Y) dY = C ···· ( number 1) However, ∫F _n (Y) dY: n-th attributes previously obtained with no added check Peripheral distribution ∫f _n (Y) dY C of the discrimination area C: a constant that is a positive integer In the form 22 of this embodiment, n = 1, 2, 3 for the attribute discrimination area 263, so that n = 1, 2, Rewriting (Formula 1) as 3 gives the following formula (Formula 2).

【００５１】 1/A₁・∫F₁(Y) dY＝1/A₂・∫F₂(Y) dY＝1/A₃・∫F₃(Y) dY＝Ｃ・・（数２）チェック３６３を付していない状態で各属性判別領域２
６３毎に周辺分布∫F_n(Y) dYを得、各周辺分布∫F_n(Y)
dYを定数Ｃと等しくする正規化定数Ａ_n を求める。この
ように正規化定数Ａ_n は、各属性判別領域２６３毎に個
別に予め求められ、そしてフォーマット記憶手段１４に
予め記憶されるものである。1 / A ₁ · ∫F ₁ (Y) dY = 1 / A ₂ · ∫F ₂ (Y) dY = 1 / A ₃ · ∫F ₃ (Y) dY = C (Equation 2) Check Each attribute discriminating area 2 without 363
The marginal distribution ∫F _n (Y) dY is obtained for each 63, and each marginal distribution ∫F _n (Y)
A normalization constant _An that makes dY equal to the constant C is obtained. As described above, the normalization constant _An is individually obtained in advance for each attribute determination area 263, and is stored in the format storage unit 14 in advance.

【００５２】次にチェック領域検出手段１６ｂは、各属
性判別領域２６３毎に求めた正規化周辺分布1/A_n・∫f_n
(Y) dYのなかから、最大の正規化周辺分布1/A_n・∫f
_n(Y) dYを検出する。そしてチェック領域検出手段１６
ｂは、正規化周辺分布1/A_n・∫f_n(Y) dYが最大となる属
性判別領域２６３を、当該判別領域２６３に対応する記
入領域２４３に関し、チェック３６３が付されている属
性判別領域２６３として検出する。Next, the check area detecting means 16b calculates the normalized marginal distribution 1 / A _n · ∫f _n obtained for each attribute discrimination area 263.
(Y) From dY, the largest normalized marginal distribution 1 / A _n
_n (Y) dY is detected. Then, the check area detecting means 16
b indicates the attribute discrimination area 263 in which the normalized marginal distribution 1 / A _n · ∫f _n (Y) dY is the maximum, and the attribute discrimination area 243 corresponding to the discrimination area 263 is marked with a check 363. It is detected as an area 263.

【００５３】この実施例で用いる帳票２２にあっては、
記入領域２４３に対して設けた複数の属性判別領域２６
３のいずれかひとつに、チェック３６３を付す。これが
ため当該記入領域２４３に関して、正規化周辺分布1/A_n
・∫f_n(Y) dYが最大となる属性判別領域２６３を、チェ
ック３６３が付された属性判別領域（チェック有りの属
性判別領域）２６３として検出し、かつ、正規化周辺分
布1/A_n・∫f_n(Y) dYが最大とならない属性判別領域２６
３を、チェック２６３が付されていない属性判別領域
（チェック無しの属性判別領域）２６３として検出する
ことができる。In the form 22 used in this embodiment,
A plurality of attribute determination areas 26 provided for the entry area 243
A check 363 is attached to any one of the three. Therefore, with respect to the entry area 243, the normalized marginal distribution 1 / A _n
∫f _n (Y) The attribute discrimination area 263 having the maximum dY is detected as the attribute discrimination area 263 with the check 363 (the attribute discrimination area with the check) 263, and the normalized marginal distribution 1 / A _n・ ∫f _n (Y) Attribute discrimination area 26 where dY is not maximum
3 can be detected as an attribute determination area 263 to which no check 263 is attached (an attribute determination area without a check) 263.

【００５４】このように正規化した周辺分布1/A_n・∫f_n
(Y) dYが最大となるか否かによって、記入領域２４３に
対応する各属性判別領域２６３に関し、チェック３６３
の有無を検出できる。The marginal distribution 1 / A _n · ∫f _n thus normalized
(Y) A check 363 is performed for each attribute determination area 263 corresponding to the entry area 243 depending on whether or not dY is the maximum.
Can be detected.

【００５５】またこの実施例では、周辺分布∫f_n(Y) dY
を正規化し、そして正規化した周辺分布1/A_n・∫f_n(Y)
dYが最大となる属性判別領域２６３を、チェック２６３
が付されている属性判別領域２６３と判定する。In this embodiment, the marginal distribution ∫f _n (Y) dY
And the normalized marginal distribution 1 / A _n・ ∫f _n (Y)
The attribute determination area 263 where dY is the maximum is checked 263
Is determined to be the attribute determination area 263 marked with.

【００５６】このように正規化した周辺分布1/A_n・∫f_n
(Y) dYを用いるので、属性判別用文字３２３を構成する
文字画素の総個数が各属性判別領域２６３毎に相違する
場合でも、またチェック３６３が当該チェック３６３を
付すべき属性判別領域２６３からはみ出て隣接する他の
属性判別領域２６３内に記入されてしまった場合でも、
精度良く、属性判別領域２６３のチェック３６３の有無
を検出できる。The marginal distribution 1 / A _n · ∫f _n thus normalized
(Y) Since dY is used, even when the total number of character pixels constituting the attribute determination character 323 is different for each attribute determination area 263, the check 363 protrudes from the attribute determination area 263 to which the check 363 should be attached. Even if it is written in another attribute determination area 263 adjacent to the
The presence or absence of the check 363 in the attribute determination area 263 can be detected with high accuracy.

【００５７】同様に記入領域２４１に関しても、記入領
域２４１に対して設けられた各属性判別領域２６１毎に
個別に、周辺分布∫f_n(Y) dYを作成しそして正規化した
周辺分布1/A_n・∫f_n(Y) dYを求め、正規化した周辺分布
1/A_n・∫f_n(Y) dYが最大となる属性判別領域２６１を、
当該判別領域２６１に対応する記入領域２４１に関しチ
ェック３６１が付された属性判別領域２６１として検出
する。属性判別領域２６１の正規化した周辺分布1/A_n・
∫f_n(Y) dYが最大となるか否かにより、チェック３６１
の有無を検出できる。Similarly, for the entry area 241, a marginal distribution ∫f _n (Y) dY is created for each attribute discrimination area 261 provided for the entry area 241, and the normalized marginal distribution 1 / A _n・ ∫f _n (Y) dY is obtained and normalized marginal distribution
1 / A _n · ∫f _n (Y) dY,
The entry area 241 corresponding to the determination area 261 is detected as an attribute determination area 261 with a check 361 attached. Normalized marginal distribution 1 / A _n · of attribute discrimination area 261
Check 361 depending on whether or not ∫f _n (Y) dY is maximum
Can be detected.

【００５８】さらに記入領域２４２に関しても、記入領
域２４２に対して設けられた各属性判別領域２６２毎に
個別に、周辺分布∫f_n(Y) dYを作成しそして正規化した
周辺分布1/A_n・∫f_n(Y) dYを求め、正規化した周辺分布
1/A_n・∫f_n(Y) dYが最大となる属性判別領域２６２を、
当該判別領域２６２に対応する記入領域２４２に関しチ
ェック３６２が付された属性判別領域２６２として検出
する。属性判別領域２６２の正規化した周辺分布1/A_n・
∫f_n(Y) dYが最大となるか否かにより、チェック３６２
の有無を検出できる。Further, with respect to the entry area 242, a marginal distribution ∫f _n (Y) dY is created individually for each attribute determination area 262 provided for the entry area 242, and the normalized marginal distribution 1 / A _n・ ∫f _n (Y) dY is calculated and normalized marginal distribution
1 / A _n · ∫f _n (Y) dY,
The entry area 242 corresponding to the determination area 262 is detected as an attribute determination area 262 with a check 362 added. Normalized marginal distribution 1 / A _n · of attribute discrimination area 262
Check 362 depending on whether or not ∫f _n (Y) dY is maximum.
Can be detected.

【００５９】次に切出し許容個数の上限値Ｇ_max 及びＧ
_min の設定について、一例を挙げて、より具体的に説明
する。ここでは、記入領域２４１に関する切出し許容個
数の上限値Ｇ_max 及びＧ_min に着目して説明する。Next, the upper limit values G _max and G of the allowable number of cutouts
The setting of _min will be described more specifically with an example. Here, description will be given focusing on the upper limit values G _max and G _min of the allowable number of cutouts regarding the entry area 241.

【００６０】この実施例の帳票２２を用いる場合、帳票
２２の記入者は、都、道、府或は県の名称を表す切出し
対象文字３４１を記入領域２４１に記入し、そして当該
名称に対応する属性判別用文字３２１ここでは都、道、
府或は県が印刷された属性判別領域２６１にチェック３
６１を付すこととなる。When the form 22 of this embodiment is used, the person who fills in the form 22 writes a cut-out target character 341 representing the name of a city, a road, a prefecture, or a prefecture in the entry area 241 and corresponds to the name. Character 321 for attribute discrimination
Check 3 in the attribute discrimination area 261 where prefecture or prefecture is printed
61 will be attached.

【００６１】そこで記入領域２４１に記入される切出し
対象文字３４１の総個数（以下、記入文字総個数）の上
限値Ｇ_max 及び下限値Ｇ_min に着目すると、都が印刷さ
れた属性判別領域２６１にチェック３６１を付す場合
（以下、チェックの有無の態様１）にあっては、記入領
域２４１に記入される切出し対象文字３４１は東京とな
り従って記入文字総個数の上限値Ｇ_max 及び下限値Ｇ
_min はＧ_max ＝Ｇ_min ＝２個となる。道が印刷された属
性判別領域２６１にチェック３６１を付す場合（以下、
チェックの有無の態様２）にあっては、記入領域２４１
に記入される切出し対象文字３４１は北海となり従って
記入文字総個数の上限値Ｇ_max 及び下限値Ｇ_min はＧ
_max ＝Ｇ_min ＝２個となる。府が印刷された属性判別領
域２６１にチェック３６１を付す場合（以下、チェック
の有無の態様３）にあっては、記入領域２４１に記入さ
れる切出し対象文字３４１は京都或は大阪となり従って
記入文字総個数の上限値Ｇ_max 及び下限値Ｇ_min はＧ
_max ＝Ｇ_min ＝２個となる。さらに県が印刷された属性
判別領域２６１にチェック３６１を付す場合（以下、チ
ェックの有無の態様４）にあっては、記入領域２４１に
記入される切出し対象文字３４１は和歌山、埼玉或はそ
のほかの県名を表す文字であり、従って記入文字総個数
の上限値Ｇ_max 及び下限値Ｇ_min はＧ_max ＝３、Ｇ_min
＝２個となる。Focusing on the upper limit G _max and the lower limit G _min of the total number of characters to be cut out 341 (hereinafter referred to as the total number of characters) to be written in the writing area 241, In the case where the check 361 is added (hereinafter referred to as “checked presence / absence mode 1”), the cut-out target character 341 written in the writing area 241 is Tokyo, so the upper limit G _max and the lower limit G of the total number of characters to be entered are set.
_min is G _max = G _min = 2. When a check 361 is attached to the attribute determination area 261 on which the road is printed (hereinafter, referred to as “check 361”)
In the case 2) with or without the check, the entry area 241
Upper limit value G _max and the lower limit value G _min of the cut-out object character 341 is entered becomes the North Sea thus fill characters total number in the G
_max = _Gmin = 2. In the case where the check mark 361 is added to the attribute discrimination area 261 printed by the government office (hereinafter, whether or not there is a check), the cutout target character 341 to be written in the entry area 241 is Kyoto or Osaka, and thus the input character The upper limit G _max and the lower limit G _min of the total number are G
_max = _Gmin = 2. Further, in the case where a check 361 is attached to the attribute discrimination area 261 on which the prefecture is printed (hereinafter referred to as the presence / absence of check 4), the cutout target character 341 to be entered in the entry area 241 is Wakayama, Saitama, or another character. It is a character representing a prefecture name. Therefore, the upper limit value G _max and the lower limit value G _min of the total number of entered characters are G _max = 3, G _min
= 2.

【００６２】このように属性判別領域２６１のチェック
３６１の有無と、記入文字総個数の上限値Ｇ_max 及びＧ
_min との間には、予め判明している相関関係が存在す
る。従って各属性判別領域２６１のチェック３６１の有
無の各態様毎に、ここでは上述した態様１〜４の各態様
毎に、記入文字総個数の上限値Ｇ_max 及び下限値Ｇ_min
をデータとして蓄積しておくことができる。As described above, the presence or absence of the check 361 in the attribute determination area 261 and the upper limit values G _max and G
There is a correlation that is known in advance between the _min and the _min . Therefore, the upper limit value G _max and the lower limit value G _{min of the} total number of characters to be entered are provided for each aspect of the presence or absence of the check 361 in each attribute determination area 261, here, for each of the aspects 1 to 4 described above.
Can be stored as data.

【００６３】そして後述するように切出し対象文字３４
１の切出し位置を検出する場合にあっては、仮の切出し
位置を用いて求めた仮の切出し文字総個数Ｍが、記入文
字総個数の上限値Ｇ_max 及び下限値Ｇ_min の範囲外の値
となるときは、当該仮の切出し位置は切出し対象文字３
４１の切出し位置として不適切であると判定できる。ま
た仮の切出し位置を用いて求めた仮の切出し総個数Ｍ
が、属性判別領域２６１のチェック３６１の有無に対応
した記入文字総個数の上限値Ｇ_max 及び下限値Ｇ_min の
範囲内の値となるとき、当該仮の切出し位置は切出し対
象文字３４１の切出し位置として適切であると判定でき
る。As will be described later, the character 34 to be cut out
1 In the case of detecting the extraction position, cutout characters total number M of provisional obtained using the extraction position of the provisional, outside the range of values of the upper limit value G _max and the lower limit value G _min of fill characters total number , The provisional extraction position is the extraction target character 3
It can be determined that the cutout position 41 is inappropriate. Also, the total number of temporary cuts M obtained using the temporary cut positions
Is within the range of the upper limit value G _max and the lower limit value G _min of the total number of entered characters corresponding to the presence / absence of the check 361 in the attribute determination area 261, the provisional extraction position is the extraction position of the extraction target character 341. Can be determined to be appropriate.

【００６４】従って予め判明している相関関係に基づい
て得た記入文字総個数の上限値Ｇ_max 及び下限値Ｇ_min
を、上述した切出し許容数の上限値Ｇ_max 及び下限値Ｇ
_minとして用いることができる。Accordingly, the upper limit value G _max and the lower limit value G _{min of the} total number of entered characters obtained based on the previously known correlation.
With the upper limit value G _max and the lower limit value G of the cutout allowable number described above.
Can be used as _min .

【００６５】このように予め判明している切出し許容数
の上限値Ｇ_max 及び下限値Ｇ_min を、各チェックの有無
の態様毎に分類して、許容数記憶手段１６ｃに記憶して
おく。The upper limit value G _max and the lower limit value G _min of the permissible number of cut-outs that have been determined in advance are classified according to the presence or absence of each check and stored in the permissible number storage means 16c.

【００６６】そしてチェック領域検出手段１６ｂは、記
入領域２４１に関しチェックの有無の検出結果（すなわ
ちチェックの態様）を得ると、当該検出結果に対応した
切出し許容数の上限値Ｇ_max 及び下限値Ｇ_min を、許容
数記憶手段１６ｃから読み出し、読み出した上限値Ｇ
_max 及び下限値Ｇ_min を、当該記入領域２４１に関する
切出し許容数の上限値Ｇ_max 及び下限値Ｇ_min として設
定（記憶）する。When the check area detecting means 16b obtains the detection result of the presence / absence of the check regarding the entry area 241 (that is, the mode of the check), the upper limit value G _max and the lower limit value G _{min of} the cutout allowable number corresponding to the detection result. From the allowable number storage means 16c, and the read upper limit G
The _max and the lower limit value G _min, is set as cut allowed number of upper limit value G _max and the lower limit value G _min relating to the entry region 241 (storage).

【００６７】同様にして、記入領域２４２に関しても、
予め判明している切出し許容数の上限値Ｇ_max 及び下限
値Ｇ_min を、各チェックの有無の態様毎に分類して、許
容数記憶手段１６ｃに記憶しておく。そしてチェック領
域検出手段１６ｂは、記入領域２４２に関しチェックの
有無の検出結果（すなわちチェックの態様）を得ると、
当該検出結果に対応した切出し許容数の上限値Ｇ_max 及
び下限値Ｇ_min を、許容数記憶手段１６ｃから読み出
し、読み出した上限値Ｇ_max 及び下限値Ｇ_min を、当該
記入領域２４２に関する切出し許容数の上限値Ｇ_max 及
び下限値Ｇ_min として設定（記憶）する。Similarly, regarding the entry area 242,
The upper limit value G _max and the lower limit value G _min of the permissible number of cutouts that have been determined in advance are classified according to each check mode, and stored in the permissible number storage unit 16c. When the check area detection unit 16b obtains the detection result of the presence / absence of the check regarding the entry area 242 (that is, the mode of the check),
The upper limit value G _max and the lower limit value G _min of cut allowable number corresponding to the detection result, reading from the allowable number storage unit 16c reads the upper limit value G _max and the lower limit value G _min, cut allowable number relating to the entry region 242 _Are set (stored) as the upper limit value _Gmax and the lower limit value _Gmin .

【００６８】また記入領域２４３に関しても、予め判明
している切出し許容数の上限値Ｇ_max 及び下限値Ｇ_min
を、各チェックの有無の態様毎に分類して、許容数記憶
手段１６ｃに記憶しておく。そしてチェック領域検出手
段１６ｂは、記入領域２４３に関しチェックの有無の検
出結果（すなわちチェックの態様）を得ると、当該検出
結果に対応した切出し許容数の上限値Ｇ_max 及び下限値
Ｇ_min を、許容数記憶手段１６ｃから読み出し、読み出
した上限値Ｇ_max 及び下限値Ｇ_min を、当該記入領域２
４３に関する切出し許容数の上限値Ｇ_max 及び下限値Ｇ
_min として設定する。Regarding the entry area 243, the upper limit value G _max and the lower limit value G _{min of the} number of permissible cutouts that have been determined in advance are known.
Are classified according to each check mode, and stored in the allowable number storage unit 16c. Then check area detecting unit 16b obtains the detection result of the presence or absence of the check relates entry region 243 (i.e., aspects of the check), the upper limit G _max and the lower limit value G _min of cut allowable number corresponding to the detection result, the allowable The upper limit value G _max and the lower limit value G _min read from the number storage unit 16 c are stored in the entry area 2.
Upper limit value G _max and lower limit value G of the permissible number of cuts related to 43
Set as _min .

【００６９】（切出し位置決定手段）この実施例では、
切出し位置決定手段１８は、文字要素検出手段１８ａ、
ピッチ推定手段１８ｂ、切出しパラメータ記憶手段１８
ｃ、終了位置検出手段１８ｄ及び位置設定制御手段１８
ｅを有する。(Cutout Position Determination Means) In this embodiment,
The cutout position determining means 18 includes a character element detecting means 18a,
Pitch estimation means 18b, cut-out parameter storage means 18
c, end position detecting means 18d and position setting controlling means 18
e.

【００７０】文字要素検出手段１８ａは、各記入領域毎
に、文字列方向Ｘにおける文字要素パタンの始端位置Ｘ
_L 及び終端位置Ｘ_R を検出する。文字要素パタンは文字
画素が連結して存在する領域の画像パタンであり、切出
し対象文字の画像パタンすなわち対象文字パタンは１個
又は複数個の文字要素パタンを含む。ここでは１個の記
入領域には、切出し対象文字が一列のみ記入される。The character element detecting means 18a calculates the start position X of the character element pattern in the character string direction X for each entry area.
Detecting the _L and end position X _R. The character element pattern is an image pattern of an area in which character pixels are connected, and the image pattern of a character to be extracted, that is, the target character pattern includes one or a plurality of character element patterns. Here, only one line of characters to be cut out is entered in one entry area.

【００７１】ピッチ推定手段１８ｂは、文字要素パタン
の始端位置Ｘ_L 及び終端位置Ｘ_R を用いて、各記入領域
内において、文字列方向Ｘにおける文字要素幅Ｗ_B のう
ち最大の文字要素幅Ｗ_Bmaxと文字列方向Ｘにおける文字
要素間隔Ｗ_S のうち最小となる離間間隔Ｗ_Sminとを求
め、各記入領域毎に、最大の幅Ｗ_Bmax及び最小の幅Ｗ
_Sminの和を推定文字ピッチｐの初期値として設定する。
文字要素幅Ｗ_B は文字要素パタンの幅、文字要素間隔Ｗ
_S は相隣接する文字要素パタンの離間間隔すなわち相隣
接する文字要素が挟む余白パタンの幅である。余白パタ
ンは文字背景画素が連結して存在する領域の画像パタン
である。The pitch estimating means 18b uses the start position X _L and the end position X _R of the character element pattern to set the maximum character element width W _B of the character element width W _B in the character string direction X in each entry area. _Bmax and the minimum spacing W _Smin of the character element spacing W _S in the character string direction X are obtained, and the maximum width W _Bmax and the minimum width W are determined for each entry area.
The sum of _Smin is set as the initial value of the estimated character pitch p.
Character element width W _B is the character element pattern width, character element interval W
_S is the space between adjacent character element patterns, that is, the width of a margin pattern sandwiched between adjacent character elements. The margin pattern is an image pattern of an area in which character background pixels are connected.

【００７２】切出しパラメータ記憶手段１８ｃは、文字
要素パタンの始端位置Ｘ_L 及び終端位置Ｘ_R と推定文字
ピッチｐと仮の切出し開始位置Ｘ_S 及び仮の切出し終了
位置Ｘ_E とをそれぞれ、読み出し及び書き換えの自由に
記憶する。The extraction parameter storage means 18c reads out and reads the start position X _L and end position X _R of the character element pattern, the estimated character pitch p, the provisional extraction start position X _S and the provisional extraction end position X _E , respectively. Remember freely for rewriting.

【００７３】終了位置検出手段１８ｄは、仮の切出し開
始位置Ｘ_S から、文字切出し方向へほぼ推定文字ピッチ
ｐだけ離間した位置を、仮の切出し終了位置Ｘ_E として
算出する。The end position detecting means 18d calculates a position separated from the tentative cut-out start position X _S by the estimated character pitch p in the character cut-out direction as a tentative cut-out end position X _E.

【００７４】位置設定制御手段１８ｅは、仮の切出し終
了位置Ｘ_E から、文字切出し方向へ向けてΔＸ（ΔＸは
正の整数）だけ離間した位置を、次の仮の切出し開始位
置Ｘ_S として設定する。文字切出し方向を正の方向とす
るときは同一記入領域内に存在する文字要素パタンの始
端位置Ｘ_L のうち最小の始端位置Ｘ_L を、最初の仮の切
出し開始位置Ｘ_S とし、文字切出し方向を負の方向とす
るときは同一記入領域内に存在する文字要素パタンの終
端位置のうち最大の終端位置Ｘ_R を、最初の仮の切出し
開始位置Ｘ_S とする。The position setting control means 18e sets a position separated by ΔX (ΔX is a positive integer) in the character extraction direction from the temporary extraction end position X _E as the next temporary extraction start position X _S. I do. The minimum starting end position X _L of the starting end position X _L of the character element pattern existing in the same entry area when the character extraction direction is a positive direction, the cut-out starting position X _S of the first formal, character segmentation direction when the negative direction up to the end position X _R of the end position of the character element pattern existing in the same entry area, and clipping start position X _S of the first formal.

【００７５】また位置設定制御手段１８ｅは、終了位置
検出手段１８ｄが算出した仮の切出し終了位置Ｘ_E が文
字要素領域内の位置となるときは（但しＸ_E ＝Ｘ_R とな
るときを除く）仮の切出し終了位置Ｘ_E を、当該文字要
素領域に隣接する文字要素間領域内の位置若しくは当該
文字要素領域の終端位置Ｘ_R に補正する。文字要素領域
は文字要素パタンが存在する領域、文字要素間領域は相
隣接する文字要素パタンが挟む領域すなわち余白パタン
が存在する領域である。[0075] The positioning control unit 18e, when it cut the end position X _E of the provisional end position detecting means 18d has calculated the position of the character element region (excluding the case where the where X _E = X _R) a temporary cut ending position X _E, corrects the end position X _R position or the character element area of the character element between the region adjacent to the character element region. The character element area is an area where a character element pattern exists, and the area between character elements is an area where adjacent character element patterns are interposed, that is, an area where a margin pattern exists.

【００７６】さらに位置設定制御手段１８ｅは、各記入
領域毎に、仮の切出し開始位置Ｘ_S及び又は仮の切出し
終了位置の検出総個数を記入領域内の切出し文字総個数
Ｍをとし、そして各記入領域毎に、切出し文字総個数Ｍ
と切出し許容個数の上限値Ｇ _max 及び下限値Ｇ_min との
比較結果に応じて次に述べる１）〜３）の処理を行な
う。Further, the position setting control means 18 e
For each area, a temporary cutout start position X_SAnd / or temporary cutout
The total number of detected end positions is the total number of cutout characters in the entry area
M, and for each entry area, the total number of cutout characters M
And the upper limit G of the allowable number of cutouts _max And lower limit G_min With
The following processes 1) to 3) are performed according to the comparison result.
U.

【００７７】処理１）；切出し文字総個数Ｍが切出し許
容個数の下限値Ｇ_min よりも小さいときは、推定文字ピ
ッチｐに正の補正値Δｐを加算して新たな推定文字ピッ
チｐを設定し、この新たな推定文字ピッチｐを用いて仮
の切出し開始位置Ｘ_S 及び仮の切出し終了位置Ｘ_R を設
定し直すべく、終了位置検出手段１８ｄを再起動する。[0077] Process 1); when cut characters total number M is smaller than the lower limit value G _min of cut allowable number adds a positive correction value Δp sets a new estimated character pitch p of the estimated character pitch p the new by using the estimated character pitch p to reset the cut-out start position X _S and cutout end position X _R provisional provisional restarting the end position detection means 18d.

【００７８】処理２）；切出し文字総個数Ｍが切出し許
容個数の上限値Ｇ_max よりも大きいときは、推定文字ピ
ッチｐに負の補正値Δｐを加算して新たな推定文字ピッ
チｐを設定し、この新たな推定文字ピッチｐを用いて仮
の切出し開始位置Ｘ_S 及び仮の切出し終了位置Ｘ_R を設
定し直すべく、終了位置検出手段１８ｄを再起動する。[0078] Process 2); when cut characters total number M is larger than the upper limit value G _max of cut allowable number adds a negative correction value Δp of the estimated character pitch p is set to a new estimated character pitch p the new by using the estimated character pitch p to reset the cut-out start position X _S and cutout end position X _R provisional provisional restarting the end position detection means 18d.

【００７９】処理３）；切出し文字総個数Ｍが切出し許
容個数の下限値Ｇ_min 以上かつ切出し許容個数の上限値
Ｇ_max 以下となるとき、当該切出し文字総個数Ｍを得た
仮の切出し開始位置Ｘ_S 及び仮の切出し終了位置Ｘ_E
を、対象パタン切出し位置として決定する。Process 3); When the total number M of cut-out characters is equal to or more than the lower limit value G _{min of the} allowable number of cut-outs and equal to or less than the upper limit G _{max of the} allowable number of cut-out characters, a temporary cut-out start position at which the total number M of cut-out characters is obtained. X _S and temporary cut-out end position X _E
Is determined as the target pattern cutout position.

【００８０】次に切出し位置決定手段１８の動作の流れ
につき、より具体的に一例を挙げて説明する。図６及び
図７はその説明に供する図である。図６は文字要素パタ
ンの検出及び推定文字ピッチの設定の説明に供する図で
あって、図６の分図（Ａ）にあっては記入領域３４３の
画像パタンを、図３と同様にして示してある。また図６
の分図（Ｂ）にあっては横軸に副走査位置Ｘ及び縦軸に
副走査位置Ｘにおける累積文字画素数f_n(X) を取って、
記入領域３４３内の累積文字画素数f_n(X) の分布状態を
示してある。図７は位置設定制御手段１８ｅに着目した
動作の流れを示す図である。Next, the flow of the operation of the cut-out position determining means 18 will be described more specifically by way of an example. 6 and 7 are diagrams for explanation. FIG. 6 is a diagram for explaining the detection of the character element pattern and the setting of the estimated character pitch. In FIG. 6A, the image pattern of the entry area 343 is shown in the same manner as in FIG. It is. FIG.
In the diagram (B), the horizontal axis indicates the sub-scanning position X and the vertical axis indicates the cumulative number of character pixels f _n (X) at the sub-scanning position X.
The distribution state of the accumulated character pixel number f _n (X) in the entry area 343 is shown. FIG. 7 is a diagram showing the flow of the operation focusing on the position setting control unit 18e.

【００８１】まず文字要素検出手段１８ａは、記入領域
２４３のアドレスＸ、Ｙ及び閾値ＴＨＬ１を、フォーマ
ット記憶手段１４から読み出し、然る後、記入領域２４
１の画像パタン４０を、このアドレスＸ、Ｙを用いて画
像記憶手段１２から読み出す。ここでは、記入領域２４
３は、文字列方向Ｘにおける始端位置Ｘ_L 及び終端位置
Ｘ_R をＸ_L ＝Ｘ３及びＸ_R ＝Ｘ４、文字列方向Ｘと直交
する方向Ｙにおける始端位置Ｙ_T 及び終端位置Ｙ_B をＹ
_T ＝Ｙ５及びＹ_B ＝Ｙ６とした、Ｘ_L ≦Ｘ≦Ｘ_R かつＹ
_T ≦Ｙ≦Ｙ_B の範囲の領域であって、この記入領域２４
３のアドレスＸ、ＹをＸ_L ≦Ｘ≦Ｘ_R かつＹ_T ≦Ｙ≦Ｙ
_B とする。First, the character element detecting means 18a reads the addresses X and Y of the writing area 243 and the threshold value THL1 from the format storage means 14, and then reads the writing area 24.
One image pattern 40 is read from the image storage unit 12 using the addresses X and Y. Here, the entry area 24
3, the starting end position X _L and the end position X _R in the character string direction X X _L = X3 and X _R = X4, the starting end position Y _T and end position Y _B in the direction Y perpendicular to the character string direction X Y
X _L ≦ X ≦ X _R and Y, where _T = Y5 and Y _B = Y6
A region in the range of _T ≦ Y ≦ Y _B, the entry region 24
3 addresses X, Y and X _L ≦ X ≦ X _R and Y _T ≦ Y ≦ Y
_B.

【００８２】次いで文字要素検出手段１８ａは、主走査
方向を文字列方向Ｘと直交する方向Ｙ及び副走査方向を
文字列方向Ｘとして、記入領域２４３の画像パタン４０
を走査し、各副走査位置Ｘ毎に、走査線上の累積文字画
素数f_n(X) を求める。累積文字画素数f_n(X) は、副走査
位置Ｘの走査線上に存在しかつ記入領域２４３内に存在
する文字画素の総個数である。Next, the character element detecting means 18a sets the image pattern 40 of the writing area 243 as the direction Y perpendicular to the character string direction X and the sub-scanning direction as the character string direction X.
To obtain the cumulative number of character pixels f _n (X) on the scanning line for each sub-scanning position X. The cumulative number of character pixels f _n (X) is the total number of character pixels existing on the scanning line at the sub-scanning position X and existing in the writing area 243.

【００８３】次いで文字要素検出手段１８ａは、各副走
査位置Ｘ毎に、累積文字画素数f_n(X) を閾値ＴＨＬ１と
比較し、f_n(X) ＞ＴＨＬ１となる領域を文字要素領域及
びf_n(X) ≦ＴＨＬ１となる領域を文字要素間領域と見做
して、文字要素間領域から文字要素領域に変化したとき
の副走査位置Ｘを文字要素領域の始端位置Ｘ_L として及
び文字要素領域から文字要素間領域に変化したときの副
走査位置Ｘを文字要素領域の終端位置Ｘ_R として検出す
る。そして文字要素検出手段１８ａは、記入領域２４３
内の各文字要素領域毎に、始端位置Ｘ_L 及び終端位置Ｘ
_R を切出しパラメータ記憶手段１８ｃに格納する。図６
にあってはＴＨＬ１＝０とした場合に検出される始端位
置Ｘ_L 及びＸ_R を示してある。Next, the character element detecting means 18a compares the cumulative number of character pixels f _n (X) with the threshold value THL1 for each sub-scanning position X, and determines the area _where f _n (X)> THL1 as a character element area and a THL1 area. the f _n (X) ≦ THL1 become region regarded as the character elements between the regions, and the sub-scanning position X when the changes from between characters element regions in the character element region and start position X _L of the character element regions and character detecting the sub-scanning position X when the change between characters element regions from the element region as a terminal position X _R of the character element region. Then, the character element detecting means 18 a
Each character element each area of the inner, starting end position X _L and the end position X
_R is stored in the extraction parameter storage unit 18c. FIG.
In the is shown a starting position X _L and X _R is detected when the THL1 = 0.

【００８４】次にピッチ推定手段１８ｂは、記入領域２
４３内の各文字要素幅Ｗ_B と記入領域２４３内の各文字
要素間隔Ｗ_S とを求める。文字要素幅Ｗ_B は文字要素領
域の始端位置Ｘ_L 及び終端位置Ｘ_R の離間距離に等し
く、文字要素間隔Ｗ_S は相隣接する文字要素領域の離間
距離に等しい。Next, the pitch estimating means 18b sets the entry area 2
Request and each character element width W _B in 43 and the character element spacing W _S in entry region 243. Character element width W _B is equal to the distance of the starting position X _L and the end position X _R of the character elements region, character element spacing W _S is equal to the distance between the character element region adjacent phases.

【００８５】次いでピッチ推定手段１８ｂは、記入領域
２４３内の文字要素幅Ｗ_B のうち最大の幅Ｗ_Bmaxを検出
すると共に、記入領域２４３内において最大幅Ｗ_Bmaxを
得た文字要素パタンに隣接する文字要素間隔Ｗ_S のうち
最小の間隔Ｗ_Sminを検出し、これら最大幅Ｗ_Bmax及び最
小間隔Ｗ_Sminの和を推定文字ピッチｐとして求める。そ
してピッチ推定手段１８ｂは、求めた推定文字ピッチｐ
を切出しパラメータ記憶手段１８ｄに格納する。Next, the pitch estimating means 18b detects the maximum width W _Bmax of the character element widths W _B in the writing area 243, and is adjacent to the character element pattern in which the maximum width W _Bmax is obtained in the writing area 243. The minimum interval W _Smin among the character element intervals W _S is detected, and the sum of the maximum width W _Bmax and the minimum interval W _Smin is obtained as the estimated character pitch p. The pitch estimating means 18b calculates the estimated character pitch p
Is stored in the extraction parameter storage means 18d.

【００８６】位置設定制御手段１８ｅは、ピッチ推定手
段１８ｂが推定文字ピッチｐを格納し終えると、切出し
パラメータ記憶手段１８ｃから記入領域２４３の文字要
素領域の始端位置Ｘ_S を読み出す。そして位置設定制御
手段１８ｅは、記入領域２４３内の最小の始端位置Ｘ_S
を検出し、当該最小の始端位置Ｘ_S を記入領域２４３の
最初の仮の切出し開始位置Ｘ_S として切出しパラメータ
記憶手段１８ｃに格納し、然る後、終了位置検出手段１
８ｄを起動する（図７の開始）。When the pitch estimating means 18b has stored the estimated character pitch p, the position setting control means 18e reads the starting position X _S of the character element area of the writing area 243 from the cut-out parameter storing means 18c. Then, the position setting control unit 18 e determines the minimum start position X _S in the entry area 243.
Detects, and stores the cut parameter storage unit 18c the minimum starting end position X _S as the first cut-out start position X _S of the temporary entry area 243, thereafter, the end position detecting means 1
8d is started (start of FIG. 7).

【００８７】起動された終了位置検出手段１８ｄは、切
出しパラメータ記憶手段１８ｃから記入領域２４３に関
する最初の仮の切出し開始位置Ｘ_S と推定文字ピッチｐ
とを読み出し、最初の仮の切出し開始位置Ｘ_E としてＸ
_S ＝Ｘ_s ＋ｐ−１を算出する。The activated end position detecting means 18d stores the first temporary cut start position X _S and estimated character pitch p with respect to the entry area 243 from the cut parameter storage means 18c.
Reading the door, X as cut start position X _E of the first formal
To calculate the _{_{S = X s + p-1}} .

【００８８】次に位置設定制御手段１８ｅは、終了位置
検出手段１８ｄが算出した仮の切出し終了位置Ｘ_E が文
字要素領域内の位置及び文字要素間領域内の位置のいず
れであるかを、検定し、この検定結果に応じた仮の切出
し終了位置Ｘ_E を切出しパラメータ記憶手段１８ｃに格
納する。仮の切出し終了位置Ｘ_E が文字間領域内の位置
である場合には、当該終了位置Ｘ_E を補正せずにそのま
ま切出しパラメータ記憶手段１８ｃに格納する。また仮
の切出し終了位置Ｘ_E が文字領域内の位置である場合に
は、仮の切出し終了位置Ｘ_E を当該文字領域の終端位置
Ｘ_R 若しくは当該文字領域に隣接する文字間領域内の位
置に補正し、補正した仮の切出し終了位置Ｘ_E を切出し
パラメータ記憶手段１８ｃに格納する（図７のＳ１）。
次いで位置設定制御手段１８ｅは、切出し文字総個数Ｍ
（Ｍの初期値はＭ＝０）に１を加算して、切出し文字総
個数Ｍをカウントする（図７のＳ２）。Next, the position setting control means 18e checks whether the provisional cut end position X _E calculated by the end position detecting means 18d is a position in the character element area or a position in the inter-character element area. and stores the provisional cut end position X _E in accordance with the test result to the cut parameter storage unit 18c. Provisional cut end position X _E is the case where the position of the inter-character area stores as it is cut parameter storage unit 18c without correcting the end position X _E. In the case the tentative cutout end position X _E is the position of a character region, a temporary cut end position X _E to the position of the inter-character region adjacent to the end position X _R or the character region of the character regions It corrected, cut the corrected temporary cut end position X _E are stored in the parameter storage unit 18c (S1 in FIG. 7).
Next, the position setting control unit 18e determines the total number M of cutout characters.
By adding 1 to (the initial value of M is M = 0), the total number M of cut-out characters is counted (S2 in FIG. 7).

【００８９】次いで位置設定制御手段１８ｅは、記入領
域２４３について仮の切出し位置の設定終了したか否か
を判定する（図７のＳ３）。切出し終了位置Ｘ_E を記入
領域２４３内の文字要素領域の終端位置Ｘ_R のうち最大
の終了位置Ｘ_Rmaxと比較し、Ｘ_E ＜Ｘ_Rmaxとなる場合は
設定未終了と判定し、Ｘ_E ≧Ｘ_Rmaxとなる場合は設定終
了と判定する。Next, the position setting control unit 18e determines whether the setting of the temporary cutout position has been completed for the entry area 243 (S3 in FIG. 7). Compared with the maximum end position X _Rmax of the end position X _R of the character element regions of the cutout end position X _E in writing area 243, if the X _E <X _Rmax is determined not ended and configuration, X _E ≧ If X _Rmax is reached, it is determined that the setting has been completed.

【００９０】設定未終了と判定した場合は、位置設定制
御手段１８ｅは、次の仮の切出し開始位置Ｘ_S としてＸ
_S ＝Ｘ_E ＋ΔＸを算出して、次の仮の切出し位置Ｘ_S を
切出しパラメータ記憶手段１８ｃに格納し、然る後、終
了位置検出手段１８ｄを起動する（図７のＳ４）。例え
ばΔＸ＝１である。起動された終了位置検出手段１８ｄ
は、次の仮の切出し開始位置Ｘ_S を切出しパラメータ記
憶手段１８ｃから読み出し、次の仮の切出し終了位置Ｘ
_E を算出する。次いで位置設定制御手段１８ｅは、終了
位置検出手段１８ｄが算出した次の仮の切出し終了位置
Ｘ_E の検定及び格納を行なう（図７のＳ１）。[0090] When it is determined set not ended and the position setting control section 18e is X as cut start position X _S of the next temporary
Calculate the _S = X _E + _ΔX, and stored in the parameter storage unit 18c cuts out the cut position X _S of the next temporary, thereafter, starts the end position detecting means 18 d (S4 in FIG. 7). For example, ΔX = 1. Activated end position detecting means 18d
Reads out the next provisional extraction start position X _S from the extraction parameter storage unit 18c, and outputs the next provisional extraction end position X S
Calculate _E. Then positioning control unit 18e performs test and storage of the cut end position X _E of the next provisional end position detecting means 18d is calculated (S1 in FIG. 7).

【００９１】また設定終了と判定した場合は、位置設定
制御手段１８ｅは、切出し文字総個数Ｍを、記入領域２
４３に関する切出し許容個数の下限値Ｇ_min 及び上限値
Ｇ_max と比較する（図７のＳ５）。If it is determined that the setting has been completed, the position setting control means 18e sets the total number M of cutout characters in the entry area 2
43 relates to comparing the lower limit value G _min and an upper limit G _max of cut allowable number (S5 in Fig. 7).

【００９２】切出し文字総個数Ｍが下限値Ｇ_min より小
さい場合（Ｍ＜Ｇ_min なる場合）は位置設定制御手段１
８ｅは、推定文字ピッチｐに負のピッチ補正値Δｐを加
算したピッチを新たな推定文字ピッチｐとして算出し、
切出しパラメータ記憶手段１８ｃの推定文字ピッチｐ
を、この新たな推定文字ピッチｐに書き換える。然る
後、位置設定制御手段１８ｅは、最初の仮の切出し終了
位置を算出すべく終了位置検出手段１８ｄを起動する
（図７のＳ６）。起動された終了位置検出手段１８ｄ
は、最初の切出し終了位置Ｘ_E を算出する。然る後、位
置設定制御手段１８ｅは、終了位置検出手段１８が算出
した最初の仮の切出し終了位置Ｘ_E の検定及び格納を行
ない（図７のＳ１）、以後、切出しパラメータ記憶手段
１８ｃの仮の切出し開始位置Ｘ_S 及び仮の切出し終了位
置Ｘ_E を、新たな推定文字ピッチｐにより求めた仮の切
出し開始位置Ｘ_S 及び仮の切出し終了位置Ｘ_E に書き換
える。[0092] When cut characters total number M is smaller than the lower limit value G _min (M <may become G _min) position setting control unit 1
8e calculates a pitch obtained by adding the negative pitch correction value Δp to the estimated character pitch p as a new estimated character pitch p,
Estimated character pitch p of cut-out parameter storage means 18c
To the new estimated character pitch p. Thereafter, the position setting control unit 18e activates the end position detecting unit 18d to calculate the first temporary cutout end position (S6 in FIG. 7). Activated end position detecting means 18d
Calculates a first cut end position X _E. Thereafter, positioning control unit 18e performs a test and stored in the first tentative cutout end position X _E where the end position detecting means 18 is calculated (S1 in FIG. 7), hereinafter, cutout parameter storage unit 18c tentative rewriting of the cutout start position X _S and the temporary cut-out end position X _E, the provisional cut start position X _S and the temporary cut-out end position X _E obtained by the new estimated character pitch p.

【００９３】切出し文字総個数Ｍが上限値Ｇ_max より大
きい場合（Ｍ＞Ｇ_max なる場合）は、位置設定制御手段
１８ｅは、推定文字ピッチｐに正のピッチ補正値Δｐを
加算したピッチを新たな推定文字ピッチｐとして算出
し、切出しパラメータ記憶手段１８ｃの推定文字ピッチ
ｐを、この新たな推定文字ピッチｐに書き換える。然る
後、位置設定制御手段１８ｅは、最初の仮の切出し終了
位置を算出すべく終了位置検出手段１８ｄを起動する
（図７のＳ７）。起動された終了位置検出手段１８ｄ
は、最初の切出し終了位置Ｘ_E を算出する。然る後、位
置設定制御手段１８ｅは終了位置検出手段１８ｄが算出
した最初の仮の切出し終了位置Ｘ_E の検定及び格納を行
ない（図７のＳ１）、以後、切出しパラメータ記憶手段
１８ｃの仮の切出し開始位置Ｘ_S 及び仮の切出し終了位
置Ｘ_E を、新たな推定文字ピッチｐにより求めた仮の切
出し開始位置Ｘ_S 及び仮の切出し終了位置Ｘ_E に書き換
える。[0093] cutout characters total number M (if made M> G _max) upper limit G if _max greater than, positioning control unit 18e newly pitch obtained by adding a positive pitch correction value Δp of the estimated character pitch p The estimated character pitch p is calculated as a new estimated character pitch p, and the estimated character pitch p in the cut-out parameter storage unit 18c is rewritten to the new estimated character pitch p. Thereafter, the position setting control unit 18e activates the end position detecting unit 18d to calculate the first temporary cutout end position (S7 in FIG. 7). Activated end position detecting means 18d
Calculates a first cut end position X _E. Thereafter, positioning control unit 18e performs a test and stored in the first tentative cutout end position X _E where the end position detecting means 18d is calculated (in FIG. 7 S1), subsequently, the temporary cut-out parameter storage unit 18c the cut start position X _S and the temporary cut-out end position X _E, rewrites the temporary cut start position X _S and the temporary cut-out end position X _E obtained by the new estimated character pitch p.

【００９４】また切出し文字総個数Ｍが下限値Ｇ_min 以
上であって上限値Ｇ_max 以下となる場合（Ｇ_min ≦Ｍ≦
Ｇ_max なる場合）は、位置設定制御手段１８ｅは、当該
切出し文字総個数Ｍを得た各仮の切出し開始位置Ｘ_S 及
び各仮の切出し終了位置Ｘ_Eを、記入領域２４３の対象
パタン切出し位置として決定し、当該記入領域２４３の
切出し対象パタンを切り出すべくパタン読み出し手段２
０を起動し（図７のＳ８）、然る後、当該記入領域２４
３に関わる対象パタン切出し位置を検出するための処理
を終了する（図７の終了）。When the total number M of cut-out characters is not less than the lower limit value G _{min and not} more than the upper limit value G _max (G _min ≦ M ≦
If made G _max), the position setting control unit 18e is the cut characters total number clipping start position of each provisional give the M X _S and the temporary cut end position X _E, the target pattern extraction position of the entry area 243 In order to cut out the pattern to be cut out of the writing area 243.
0 (S8 in FIG. 7), and then the entry area 24
The processing for detecting the target pattern cutout position related to No. 3 is ended (end of FIG. 7).

【００９５】同様にして、切出し位置決定手段１８は、
他の記入領域２４１及び２４２についても、個々の記入
領域毎に、対象パタン切出し位置を検出する。Similarly, the cut-out position determining means 18
For the other entry areas 241 and 242, the target pattern cutout position is detected for each entry area.

【００９６】（パタン読み出し手段）この実施例では、
パタン読み出し手段２０は、切出し位置決定手段２０に
より起動されると、対象パタン切出し位置の検出を終了
した記入領域に関する対象パタン切出し位置を、切出し
パラメータ記憶手段１８ｃから読み出すと共に、当該記
入領域の始端位置Ｙ_T 及び終端位置Ｔ_B をフォーマット
記憶手段１４から読み出す。然る後、これら対象パタン
切出し位置と記入領域の始端位置Ｙ_T 及び終端位置Ｙ_B
とを用いて、切出し対象パタンを画像記憶手段１２から
切り出し、後処理手段４０例えば文字認識手段に出力す
る。(Pattern reading means) In this embodiment,
When activated by the cut-out position determining means 20, the pattern read-out means 20 reads out from the cut-out parameter storage means 18c the target pattern cut-out position relating to the entry area for which the detection of the target pattern cut-out position has been detected, and starts the start position of the entry area. It reads the Y _T and the end position T _B from the format storage unit 14. Thereafter, starting end position of these target pattern extraction position entry area Y _T and end position Y _B
By using the above, the extraction target pattern is extracted from the image storage unit 12 and output to the post-processing unit 40, for example, the character recognition unit.

【００９７】この実施例では、Ａ）仮の切出し開始位置
Ｘ_S からほぼ推定文字ピッチｐだけ離れた位置を仮の切
出し終了位置Ｘ_e とし、仮の切出し終了位置Ｘ_E に隣接
する位置を次の仮の切出し開始位置Ｘ_S として、順次に
仮の切出し位置Ｘ_S 、Ｘ_E を検出し、Ｂ）記入領域内の
仮の切出し開始位置Ｘ_S 又は仮の切出し終了位置Ｘ
_Eを、切出し文字総個数Ｍとし、切出し文字総個数Ｍと
切出し許容数の下限値Ｇ_min 、上限値Ｇ_max とを比較
し、Ｃ）この比較結果に応じて、仮の切出し位置Ｘ_S、
Ｘ_E を対象パタン切出し位置として決定し若しくは推定
文字ピッチを補正して再度仮の切出し位置位置Ｘ_S 、Ｘ
_E を検出を行なう。In this embodiment, A) a position substantially apart from the provisional cut-out start position X _S by the estimated character pitch p is set as a provisional cut-out end position X _e, and a position adjacent to the provisional cut-out end position X _E is as cut start position X _S provisional of, sequentially tentative extraction position X _S, detects X _E, B) cut start position of the temporary entry region X _S or provisional cutout end position X
Let _{E be the} total number M of cut-out characters, compare the total number M of cut-out characters with the lower limit G _min and the upper limit G _{max of the} allowable number of cut-outs, and C) according to the comparison result, the temporary cut-out position X _S ,
X _E is determined as the target pattern cut-out position or the estimated character pitch is corrected, and the temporary cut-out position X _S , X
_E is detected.

【００９８】このように対象パタン切出し位置を切出し
文字総個数Ｍと切出し許容数の上限値Ｇ_max 及び下限値
Ｇ_min との比較結果に応じて決定するので、対象パタン
切出し位置の決定を簡単で高精度に行なえる。これがた
め、切出し対象文字パタンの切出し処理を高速化でき、
またこれに加えて装置のハード化に当っては装置構成を
簡単化し装置規模の小型化を図れるという利点がある。[0098] Since the decision in accordance with the comparison result of the upper limit value G _max and the lower limit value G _min of the thus cut object pattern extraction position characters total number M and cut allowable number, a simple determination of the target pattern extraction position Can be performed with high precision. Because of this, it is possible to speed up the extraction process of the extraction target character pattern,
In addition to this, there is an advantage that the hardware configuration of the device can be simplified and the size of the device can be reduced.

【００９９】図８は帳票の他の例を示す図である。上述
した実施例で用いた帳票２２では、属性判別用文字とし
て都道府県等の漢字を用いたが、図８にも示すように属
性判別用文字として、記入領域内に記入される文字の総
個数を表す数字を用いるようにしても良い。例えば図８
の例にあっては、記入者は、記入領域２４１内に東京都
の３文字が切出し対象文字３４１として記入する場合、
属性判別用文字３６１としての３にチェック３６１を付
す。このように記入領域内に記入される文字の総個数を
表す属性判別用文字を用いる場合でも、上述した実施例
と同様に、対象パタン切出し位置の検出を行なえる。FIG. 8 is a diagram showing another example of a form. In the form 22 used in the above-described embodiment, the kanji of the prefecture or the like is used as the attribute discrimination character. However, as shown in FIG. 8, the total number of characters to be entered in the entry area is used as the attribute discrimination character. May be used. For example, FIG.
In the example of the above, when the writer enters three characters of Tokyo as the extraction target character 341 in the entry area 241,
A check 361 is added to 3 as the character 361 for attribute determination. As described above, even when the attribute discrimination character representing the total number of characters to be entered in the entry area is used, the target pattern cutout position can be detected.

【０１００】この発明は上述した実施例にのみ限定され
るものではなく、この発明の趣旨の範囲内において種々
の変更を行なえる。The present invention is not limited to the embodiment described above, and various changes can be made within the scope of the present invention.

【０１０１】例えば、図４及び図５の分図（Ｂ）からも
理解できるように、チェック３６３が付された状態での
個別周辺分布∫f_n(Y) dYは、チェック３６３が付されて
いない状態での個別周辺分布∫f_n(Y) dYすなわち周辺分
布∫F_n(Y) dYよりも大きくなる。従って個別周辺分布∫
f_n(Y) dYを任意好適に定めた閾値ＴＨＬ２と比較し、そ
の比較結果に応じて属性判別領域２６３のチェック３６
３の有無を検出することもできる。すなわち個別周辺分
布∫f_n(Y) dYが閾値ＴＨＬ２以上となる属性判別領域２
６３を、チェック３６３が付された属性判別領域２６３
として検出し、個別周辺分布∫f_n(Y) dYが閾値ＴＨＬ２
未満となる属性判別領域２６３を、チェック３６３が付
されていない属性判別領域２６３として検出すれば良
い。For example, as can be understood from FIG. 4B and FIG. 5B, the individual marginal distribution Δf _n (Y) dY in the state where the check 363 is added is checked 363. The individual marginal distribution ∫f _n (Y) dY in the absence state, that is, larger than the marginal distribution ∫F _n (Y) dY. Therefore individual marginal distribution ∫
f _n (Y) dY is compared with a threshold value THL2 which is arbitrarily determined, and a check 36 of the attribute determination area 263 is performed according to the comparison result.
3 can also be detected. That is, the attribute discrimination area 2 in which the individual marginal distribution ∫f _n (Y) dY is _{equal to} or larger than the threshold value THL2.
63 to the attribute determination area 263 with a check 363
And the individual marginal distribution ∫f _n (Y) dY is _equal to the threshold value THL2.
The attribute discrimination area 263 that is less than may be detected as the attribute discrimination area 263 to which the check 363 is not attached.

【０１０２】この場合には、閾値ＴＨＬ２（ＴＨＬ
２_n）を、例えばＴＨＬ２_n＝∫F_n(Y) dY（∫F_n(Y) dYは
チェック３６３が付されていない状態で得た個別周辺分
布∫f_n(Y) dYである）としたり、ＴＨＬ２_n＝∫F_n(Y) d
Y＋α（αは定数）としたりすることができる。In this case, the threshold value THL2 (THL
2 _n ) is, for example, THL2 _n = ∫F _n (Y) dY (∫F _n (Y) dY is the individual marginal distribution ∫f _n (Y) dY obtained without the check 363) Or THL2 _n = ∫F _n (Y) d
Y + α (α is a constant).

【０１０３】図５にも示すように、チェック３６３が対
応する属性判別領域２６１からはみ出ている場合でも、
αを任意好適な大きさの正の整数とすることにより、精
度良く、チェック３６３の有無を検出できる。As shown in FIG. 5, even when the check 363 is out of the corresponding attribute discrimination area 261,
By setting α to a positive integer of any suitable size, the presence or absence of the check 363 can be accurately detected.

【０１０４】またこの場合には、１個の属性判別領域３
６３にチェック３６３が付される場合のみならず複数個
の属性判別領域３６３にチェック３６３が付される場合
にも、いずれの属性判別領域２６３にチェック３６３が
付されているか検出できる。例えば、帳票２２におい
て、区、町及び村に代えて１、２及び３の各数字を属性
判別用文字３２３に用いる場合を考える。この場合に、
記入領域２４３に記入した文字の総個数が、属性判別用
文字３２３のなかから選択した１個と等しい場合には、
当該選択文字３２３に対応した１個の属性判別領域３２
３にチェック３６３を付し、また記入領域３４３に記入
した文字の総個数が、属性判別用文字３２３のなかから
選択した複数個の和と等しい場合には、当該複数の選択
文字３２３にそれぞれ対応する複数個の属性判別領域３
２３にチェック３６３を付すものとする。例えば、記入
領域２４３に記入した切出し対象文字３４３の総個数が
５となる場合には、帳票２２の記入者がアラビア数字３
及び２に対応する２個の属性判別領域２６３にそれぞれ
チェック３６３を付す。このような場合にも、個別周辺
分布∫f_n(Y) dYが閾値ＴＨＬ２以上となる属性判別領域
３２３を、チェック３６３が付された属性判別領域３２
３として検出することができる。この場合、定数に正の
整数を用いることにより、チェック３６３の有無を検出
できる。In this case, one attribute discrimination area 3
When not only the case where the check 363 is given to the 63 but also the case where the check 363 is given to a plurality of attribute determination areas 363, it is possible to detect which attribute determination area 263 is given the check 363. For example, consider a case in which each number 1, 2, and 3 is used as the attribute determination character 323 in the form 22 instead of the ward, town, and village. In this case,
If the total number of characters entered in the entry area 243 is equal to one selected from the attribute determination characters 323,
One attribute determination area 32 corresponding to the selected character 323
3 is checked 363, and when the total number of characters entered in the entry area 343 is equal to the sum of a plurality of characters selected from the attribute discrimination characters 323, each of the plurality of selected characters 323 is corresponded. Attribute determination areas 3
It is assumed that a check 363 is added to 23. For example, when the total number of cutout target characters 343 entered in the entry area 243 is 5, the person who fills the form 22 has the Arabic numeral 3
A check 363 is attached to each of the two attribute discrimination areas 263 corresponding to and. Even in such a case, the attribute discrimination area 323 where the individual marginal distribution ∫f _n (Y) dY is _{equal to or} larger than the threshold value THL2 is replaced with the attribute discrimination area 32 with the check 363 attached.
3 can be detected. In this case, the presence or absence of the check 363 can be detected by using a positive integer as the constant.

【０１０５】このように各属性判別領域毎に個別に周辺
分布を作成し、各記入領域毎に、所定の閾値以上となる
属性判別領域を検出し、所定の閾値以上となる周辺分布
を得た属性判別領域を、当該判別領域に対応した記入領
域に関しチェックが付された属性判別領域として検出す
るようにしても良い。As described above, a marginal distribution is individually created for each attribute discrimination area, and an attribute discrimination area that exceeds a predetermined threshold is detected for each entry area, and a marginal distribution that exceeds a predetermined threshold is obtained. The attribute discrimination area may be detected as an attribute discrimination area in which a check has been made on the entry area corresponding to the discrimination area.

【０１０６】[0106]

【発明の効果】上述した説明からも明らかなように、こ
の発明の文字切出し装置によれば、記入領域内に記入さ
れる文字の総個数と当該記入領域に対応した属性判別領
域のチェックの有無との間に存在する相関関係に基づい
て、記入領域の切出し許容個数の下限値Ｇ_min 及び上限
値Ｇ_max を予め調べデータとして保持しておく。そして
記入領域内の仮の切出し開始位置又は仮の切出し終了位
置の検出総個数を、記入領域内の切出し文字総個数Ｍと
し、切出し文字総個数Ｍが切出し許容数の下限値Ｇ_min
より小さいか切出し許容数の上限値Ｇ_max より大きいと
きは、当該切出し文字総個数を得た仮の切出し開始位置
及び仮の切出し終了位置は、対象パタン切出し位置すな
わち切出し対象文字パタンの切出し位置として不適切で
あると判定し、仮の切出し開始位置及び仮の切出し終了
位置を補正すべく再度仮の切出し位置の検出を行なう。
また記入領域内の切出し文字総個数Ｍが切出し許容数の
下限値Ｇ_min 以上であってかつ上限値Ｇ_max 以下となる
とき、当該切出し文字総個数を得た仮の切出し開始位置
及び仮の切出し終了位置は、対象パタン切出し位置とし
て適切であると判定し、当該仮の切出し開始位置及び仮
の切出し終了位置を対象パタン切出し位置と決定する。As is clear from the above description, according to the character extracting apparatus of the present invention, the total number of characters to be entered in the entry area and the presence / absence of the attribute discrimination area corresponding to the entry area are checked. holds based on the correlation that exists, as previously examined data the lower limit G _min and an upper limit G _max of cut allowable number of the entry region between the. The detected total number of the provisional cutout start position or the provisional cutout end position in the entry area is defined as the total number M of cutout characters in the entry area, and the total number M of cutout characters is the lower limit Gmin of the allowable number of _cutouts.
Less than or cut when the allowable number of larger than the upper limit G _max is cut start position and cut the end position of the provisional provisional obtain the clipped characters total number as cut position of the target pattern extraction position i.e. cut target character pattern It is determined that it is inappropriate, and the temporary cutout position is detected again to correct the temporary cutout start position and the temporary cutout end position.
When the total number M of cut-out characters in the entry area is equal to or more than the lower limit value G _min and equal to or less than the upper limit value G _{max of the} allowable number of cut-out characters, the provisional cut-out start position and the provisional cut-out where the total number of cut-out characters are obtained. The end position is determined to be appropriate as the target pattern cutout position, and the temporary cutout start position and the temporary cutout end position are determined as the target pattern cutout positions.

【０１０７】このように対象パタン切出し位置を切出し
文字総個数Ｍと切出し許容数の上限値Ｇ_max 及び下限値
Ｇ_min との比較結果に応じて決定するので、対象パタン
切出し位置の決定を簡単に行なえる。これがため、切出
し対象文字パタンの切出し処理を高速化でき、またこれ
に加えて装置のハード化に当っては装置構成を簡単化し
装置規模の小型化を図れるという利点がある。[0107] Since the decision in accordance with the comparison result of the upper limit value G _max and the lower limit value G _min of the thus cut object pattern extraction position characters total number M and cut allowable number, easy determination of the target pattern extraction position I can do it. Therefore, there is an advantage that the processing for extracting the character pattern to be extracted can be speeded up, and in addition to this, the hardware configuration of the apparatus can simplify the apparatus configuration and reduce the size of the apparatus.

[Brief description of the drawings]

【図１】実施例の構成を概略的に示す機能ブロック図で
ある。FIG. 1 is a functional block diagram schematically showing a configuration of an embodiment.

【図２】実施例の文字切出し装置で用いることのできる
帳票の一例を示す図である。FIG. 2 is a diagram illustrating an example of a form that can be used in the character cutout device of the embodiment.

【図３】実施例で用いる帳票の画像パタンの一例を示す
図である。FIG. 3 is a diagram illustrating an example of an image pattern of a form used in the embodiment.

【図４】（Ａ）及び（Ｂ）はチェック無しの場合におけ
る属性判別領域の画像パタン及び当該画像パタンに関す
る累積文字画素数を示す図である。FIGS. 4A and 4B are diagrams showing an image pattern of an attribute determination area and a cumulative number of character pixels relating to the image pattern when no check is made.

【図５】（Ａ）及び（Ｂ）はチェック有りの場合におけ
る属性判別領域の画像パタン及び当該画像パタンに関す
る累積文字画素数を示す図である。FIGS. 5A and 5B are diagrams showing an image pattern of an attribute determination area and a cumulative number of character pixels relating to the image pattern when there is a check;

【図６】（Ａ）及び（Ｂ）は記入領域の画像パタン及び
累積文字画素数を示す図である。FIGS. 6A and 6B are diagrams showing an image pattern and a cumulative number of character pixels in an entry area.

【図７】実施例の位置設定制御手段に着目した動作の流
れを示す図である。FIG. 7 is a diagram showing a flow of operation focusing on the position setting control means of the embodiment.

【図８】実施例の文字切出し装置で用いることのできる
帳票の他の例を示す図である。FIG. 8 is a diagram showing another example of a form that can be used in the character cutout device of the embodiment.

[Explanation of symbols]

１０：文字切出し装置１２：画像記憶手段１４：フォーマット記憶手段１６：切出し許容数設定手段１８：切出し位置決定手段２０：パタン読出し手段 10: Character extraction device 12: Image storage unit 14: Format storage unit 16: Allowable extraction number setting unit 18: Extraction position determination unit 20: Pattern reading unit

フロントページの続き (58)調査した分野(Int.Cl.⁷，ＤＢ名) G06K 9/00 - 9/82 Continuation of the front page (58) Field surveyed (Int.Cl. ⁷ , DB name) G06K 9/00-9/82

Claims

(57) [Claims]

1. A character extracting apparatus for extracting a character pattern to be extracted from an image pattern of a form having an entry area in which a character to be extracted is entered and an attribute determination area corresponding to the entry area. To create a marginal distribution, detect presence / absence of the attribute discrimination area using the marginal distribution of the attribute discrimination area, and set an upper limit value and a lower limit value of the permissible number of pieces corresponding to the detection result. An allowable number setting unit, detecting a character element pattern from the image pattern of the writing area, setting a temporary cutout position using the detected position of the character element pattern, and using the temporary cutout position to temporarily set the temporary setting of the writing area. When the total number of cut-out characters is determined, the provisional cut-out character total number is larger than the upper limit of the cut-out allowable number and smaller than the lower limit of the cut-out allowable number. Correcting the provisional cutout position and recalculating the provisional cutout character total number using the corrected position, and the provisional cutout character total number is equal to or less than the upper limit of the cutout allowable number and the cutout allowable number. And a cutout position determining means for determining, as a target pattern cutout position, a tentative cutout position at which the tentative cutout character total number is obtained, and a cutout target character pattern using the target pattern cutout position. A character readout unit for extracting a character.

2. The character extracting apparatus according to claim 1, wherein the marginal distribution created individually for each attribute discrimination area is normalized, and the marginal distribution which is the largest among the normalized marginal distributions for each entry area. A character segmentation device that detects the attribute discrimination area that has obtained the maximum peripheral distribution as an attribute discrimination area that has been checked for an entry area corresponding to the discrimination area.

3. A character segmentation device according to claim 1, wherein a marginal distribution is created individually for each attribute discrimination area, and an attribute discrimination area exceeding a predetermined threshold is detected for each entry area.
The attribute discrimination area that has obtained the marginal distribution that is equal to or more than the predetermined threshold value,
A character cutout device, which detects an entry area corresponding to the determination area as an attribute determination area with a check.