JPH0368431B2

JPH0368431B2 -

Info

Publication number: JPH0368431B2
Application number: JP58240335A
Authority: JP
Inventors: Yoshitake Tsuji; Hiroshi Asai
Original assignee: Nippon Electric Co Ltd
Current assignee: NEC Corp
Priority date: 1983-12-20
Filing date: 1983-12-20
Publication date: 1991-10-28
Also published as: JPS60132281A

Description

【発明の詳細な説明】本発明は、紙面上に記載された文字列イメージ
を個々の文字に分離する文字分離装置に関するも
のである。DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a character separation device that separates a character string image written on paper into individual characters.

各種印刷文字群を光学的に読み取る装置（以
下、OCRと呼ぶ）において、一連の文字を認識
する場合、各文字を１字毎に分離して文字認識部
に送出してやる必要がある。ところで、郵便物や
大量の文書をOCRにより読み取る場合、印刷物
の品質や印字スタイルが多種存在しており、それ
等を読取対象として取り扱う必要が生じる。この
ような場合、紙面上の文字列イメージに、文字間
に接触が生じたり、１文字が２文字以上に分離す
る状態が生じ、これらを効率良く取り扱うことが
できる文字分離手法が要求される。従来、このよ
うな読取り対象の制約条件が緩和された人力デー
タも含まれる場合、個々のケースに有効と思われ
る機能を追加した文字分離手法が適用されてい
る。しかし、このように個々のケースに対応した
機能を適宜追加することは文字分離の精度を低下
させたり、あるいは、個々の対象物毎に異なつた
機能を持つ文字分離装置を開発する必要性が生じ
る。 When recognizing a series of characters in a device that optically reads various groups of printed characters (hereinafter referred to as OCR), it is necessary to separate each character one by one and send it to a character recognition unit. By the way, when reading mail or a large amount of documents using OCR, there are many different qualities and printing styles of printed matter, and it is necessary to handle these as objects to be read. In such a case, characters may come into contact with each other in the character string image on the paper, or one character may be separated into two or more characters, and a character separation method that can efficiently handle these situations is required. Conventionally, when human data with relaxed constraints on the reading target is included, character separation methods have been applied that add functions that are considered effective in each case. However, adding functions corresponding to individual cases as appropriate may reduce the accuracy of character separation, or it may become necessary to develop character separation devices with different functions for each individual object. .

そこで、本発明の目的は、上記従来の問題点を
解決するために文字ピツチ及び空白情報を用い
て、順次、分離候補区間を設定し、各分離候補区
間の間で各分離候補位置間の距離の分散や平均距
離と文字ピツチとのずれ量に関する分散を最小と
するような文字分離候補位置の系列を動的計画法
を用いて算出することによつて、文字間に接触す
るイメージが生じた場合や１文字が２文字以上に
分離する状態が生じても特殊な機能を追加するこ
となく容易にしかも精度良く文字分離候補位置を
決定することができる文字分離装置を提供するこ
とにある。 SUMMARY OF THE INVENTION In order to solve the above-mentioned conventional problems, it is an object of the present invention to sequentially set separation candidate sections using character pitch and blank information, and to determine the distance between each separation candidate position between each separation candidate section. By using dynamic programming to calculate a series of character separation candidate positions that minimize the variance of the variance and the variance of the deviation between the average distance and character pitch, an image of contact between characters was created. To provide a character separation device capable of easily and accurately determining a character separation candidate position without adding any special function even when a character is separated into two or more characters.

以下、本発明について図面を用いて詳細に説明
する。 Hereinafter, the present invention will be explained in detail using the drawings.

第１図ａ，ｂ，ｃは、部分文字列イメージの一
例を用いて、本発明における文字分離の分離候補
区間設定方法の一例を説明するための図である。
なお、印刷物の品質や印字スタイルが多種存在す
る読取対象における文字分離を行う場合、最初に
文字ピツチを検出する必要がある。文字ピツチ検
出装置としては、例えば、同一出願人による特願
昭58−160763号明細書「文字ピツチ検出装置」
（以下引用例１とする）で示されるような文字ピ
ツチ検出装置があり、このような装置を用いて一
連の文字行イメージから文字ピツチを検出するこ
とができる。 FIGS. 1a, b, and c are diagrams for explaining an example of a separation candidate section setting method for character separation in the present invention using an example of a partial character string image.
Note that when performing character separation in a reading target where there are various print qualities and printing styles, it is necessary to first detect the character pitch. As a character pitch detection device, for example, Japanese Patent Application No. 160763/1983 filed by the same applicant, “Character Pitch Detection Device”
There is a character pitch detection device as shown in (hereinafter referred to as Cited Example 1), and by using such a device, character pitch can be detected from a series of character line images.

第１図ａは、文字間の接触イメージや１文字が
２文字にスプリツトする場合を含む文字列イメー
ジを斜線部で示し、図中Ｐは文字ピツチを示して
いる。第１図ａの部分文字列イメージを垂直投影
すると、第１図ｂで示す投影分布が得られ、黒地
領域（図中、斜線部）、白地領域に分離すること
ができる。文字ピツチＰは、第１図ａで示した文
字列イメージから例えば前記の引用例１で示され
ているような「文字ピツチ検出装置」を用いても
良いし、予め既知であれば、その既定値を用いて
も良い。ここで、第１図ａで示した接触文字
（amu）及びスプリツトした文字ｈを正しく１文
字単位に分離するためには、その文字の分離開始
位置を正しく予測することが必要である。例え
ば、図において、接触文字イメージ“ａ”と
“ｍ”とでは、ａの方がわずかに小さいため、そ
の切り出し開始位置は、接触文字イメージ
（amu）の始端より少し左の方になる。このよう
な接触文字の位置ずれの補正は、従来、例えば文
字イメージを参照することによつて行われてい
た。そのため、文字イメージの接触の度合等によ
り、イメージを参照することによつて処理時間を
要するにもかかわらず、正しく分離位置を決定す
ることができない場合が生じる。そこで、本発明
における文字分離位置の決定方法はまず、第１図
ａで示したような文字行イメージにおいて、第１
図ｂで示したような投影分布等から抽出された白
地領域（以下、空白と呼ぶ）及び黒地領域（以
下、文字塊と呼ぶ）の位置及び大きさを抽出す
る。例えば、第１図ｂに示すような文字塊幅Vi
（ｉ＝１，…４）及び空白サイズWi（ｉ＝１，…
４）、更には、それらの位置情報が公知の技術に
よつて、得られる。次に、前もつて得られた文字
ピツチＰ及び閾値T₁、T₂を用いて、文字分離位
置を設定することが可能な区間（以下、許容区間
と呼ぶ）を、例えば、次に示す条件(1)、(2)を用い
て設定することができる。 In FIG. 1a, a character string image including a contact image between characters and a case where one character is split into two characters is shown by diagonal lines, and P in the figure shows a character pitch. When the partial character string image in FIG. 1a is vertically projected, the projection distribution shown in FIG. 1b is obtained, and it can be separated into a black background area (shaded area in the figure) and a white background area. The character pitch P may be determined from the character string image shown in FIG. A value may also be used. In order to correctly separate the touching characters (amu) and the split character h shown in FIG. 1A into individual characters, it is necessary to accurately predict the separation start position of the characters. For example, in the figure, between the touching character images "a" and "m", since a is slightly smaller, its extraction start position is slightly to the left of the starting edge of the touching character image (amu). Conventionally, correction of such positional deviation of touching characters has been performed by, for example, referring to character images. Therefore, depending on the degree of contact between the character images, it may not be possible to correctly determine the separation position, even though it takes processing time to refer to the images. Therefore, in the method of determining character separation positions in the present invention, first, in a character line image as shown in FIG.
The positions and sizes of white background areas (hereinafter referred to as blanks) and black background areas (hereinafter referred to as character blocks) extracted from the projection distribution etc. as shown in FIG. 2 are extracted. For example, the character block width Vi as shown in Figure 1b
(i=1,...4) and blank size Wi(i=1,...
4) Furthermore, their location information is obtained using known techniques. Next, using the previously obtained character pitch P and threshold values T ₁ and T ₂ , an interval (hereinafter referred to as an allowable interval) in which a character separation position can be set is determined, for example, under the following conditions. It can be set using (1) and (2).

条件(1) 空白となる区間は許容区間とする。Condition (1) Blank sections are allowed sections.

条件(2) 任意の文字塊幅Viにおいて、Vi＞Ｐ＋
T₁を満たす黒地領域のうち、文字塊幅Viの両
端からT₂までの黒地領域を除いた区間は、許
容区間とする。Condition (2) For any character block width Vi, Vi>P+
Among the black areas that satisfy T ₁ , the section excluding the black area from both ends of the character block width Vi to T ₂ is an allowable section.

上述した条件(1)及び条件(2)を満足する許容区間
は例えば、第１図ａの文字列イメージに対して、
第１図ｃの図中A₁，A₂，A₃，A₄，A₅で示す区間
として得られる。尚、上述した閾値T₁及びT₂は、
文字ピツチＰの関数として、与えても良い。ま
た、閾値T₁は前記引用例１に示される文字ピツ
チＰを推定した時の推定誤差等に基づいて設定し
ても良い。次に、本発明における文字分離位置の
決定は、前述した許容区間内で、文字ピツチＰ及
び閾値T₃を用いて、順次第１図ｃで示すような
分離候補区間ｋ（但し、ｋ≧０）を設定し、各分
離候補区間ｋ内の分離候補位置ｘ（ｋ，i_k）（但
し、i_k≧１であり、分離候補区間ｋ内の相対番号
を表わす）に対して、それぞれの分離候補位置間
の距離ｄ（ｋ，ｋ＋１；i_k，i_k+1）を算出し、各分
離候補区間ｋにおいて算出される距離ｄ（ｋ，ｋ
＋１；i_k，i_k+1）の分散σ² _d及び平均値μ_dと文字ピ
ツチＰとのずれの２乗誤差（μ_d−Ｐ）²からなる評
価尺度Ｕを計算し、その評価尺度を最小とするよ
うな分離候補位置ｘ（ｋ，i_k）（ｋ≧０）の系列を
求めることによつて行われる。そこで、上述した
分離候補区間ｋの設定方法の一例について、第１
図ｃを用いて説明する。 For example, the permissible interval that satisfies the above-mentioned conditions (1) and (2) is as follows for the character string image in Figure 1a:
These are obtained as sections A ₁ , A ₂ , A ₃ , A ₄ , and A ₅ in the diagram of FIG. 1c. In addition, the threshold values T ₁ and T ₂ mentioned above are
It may be given as a function of character pitch P. Further, the threshold value T ₁ may be set based on the estimation error when estimating the character pitch P shown in Cited Example 1 above. Next, character separation positions in the present invention are determined within the above-mentioned allowable interval using the character pitch P and the threshold value T ₃ in order of separation candidate interval k (however, k≧0 ₎ , and _each separation is The distance d (k, k+1; i _k , i _k+1 ) between candidate positions is calculated, and the distance d (k, k
+1; i _k , i _k+1 ) and ^the squared error (μ _{d - P) of the deviation between the average value μ d} ^and _the _character pitch P. This is done by finding a sequence of separation candidate positions x(k, i _k ) (k≧0) that minimizes Therefore, regarding an example of the method of setting the separation candidate section k described above, the first
This will be explained using Figure c.

第１図ｃにおける黒点は、分離候補区間ｋ（但
し、ｋ＝０，１，２，３，４）内の各分離候補位
置ｘ（ｋ，i_k）を示しており、上述した許容区間
内で、下記に示す式(3)の関係を満たす位置とし
て、順次設定される。 The black dots in FIG _. The positions are sequentially set as positions that satisfy the relationship of equation (3) shown below.

｜ｄ（ｋ，ｋ＋１：i_k，i_k+1）−Ｐ｜≦T₃ ……(1) 式(1)において、距離ｄ（ｋ，ｋ＋１；i_k，i_k+1）
は、分離候補区間ｋ＋１の分離候補位置ｘ（ｋ＋
１，i_k+1）と分離候補区間ｋの分離候補位置ｘ
（ｋ，i_k）との距離ｘ（ｋ＋１，i_k+1）−ｘ（ｋ＋１，
i_k）を示している。例えば、第１図ｃにおける分
離候補位置ｘ（１，１）と分離候補位置ｘ（２，
２）との距離ｄ（１，２；１，２）において、｜Ｐ
−ｄ（１，２；１，２）｜≦Ｐ＋T₃を満足してい
る。また、分離候補区間０の分離候補位置ｘ（０，
１）から式(1)で示した関係式を満足する位置は、
第１図ｃの白点で示すような６個の位置がある
が、上述した許容区間内に属する白点は、第１図
ｃの黒点で示すように、２個となり、分離候補区
間１の分離候補位置はｘ（１，１），ｘ（１，２）
の２つの位置として求まる。尚、上述した閾値
T₃は前述したような閾値T₁のように文字ピツチ
Ｐの関数として与えることができる。次に、評価
尺度Ｕを用いて、最適な分離候補位置を求めるに
あたり、終端状態となる分離候補区間は、例え
ば、文字行の終端となる空白内のみ設定できると
しても良いし、前述した空白サイズW_iが文字ピ
ツチＰに対して、Wi＞T₄・Ｐ（但し、T₄は閾値
である）を満たす許容区間内に設定できるとして
も良い。後者の方法を採用すると、例えば、第１
図ｃにおける領域Ｅが終端状態となる許容区間と
して検出される。更に、評価尺度Ｕを計算するに
あたり、次に設定すべき分離候補区画（ｋ＋１）
を分離候補区間ｋより前述した式(1)を用いて設定
する場合、式(1)を満たす分離候補区間（ｋ＋１）
がすべて許容区間でなければ、分離候補区間ｋを
終端状態となる分離候補区間ｋとすることができ
る。一方、評価尺度Ｕを用いて最適な分離候補位
置を求めるための始端状態となる分離候補区間
は、すでに検出された終端状態となる分離候補区
間の位置に基づいて設定することができる。|d(k, k+1: i _k , i _k+1 )−P |≦T ₃ ...(1) In equation (1), the distance d(k, k+1; i _k , i _k+1 )
is the separation candidate position x(k+
1, i _k+1 ) and separation candidate position x of separation candidate section k
Distance x(k+ ₁ ,i _k+1 )−x(k+1,
i _k ). For example, the separation candidate position x (1, 1) and the separation candidate position x (2,
2), at the distance d(1,2;1,2), |P
-d(1,2;1,2)|≦P+T ₃ is satisfied. Also, separation candidate position x(0,
From 1) to the position that satisfies the relational expression shown in equation (1),
There are six positions as shown by the white dots in Figure 1c, but there are two white dots that belong to the above-mentioned tolerance interval as shown by the black dots in Figure 1c. Separation candidate positions are x (1, 1), x (1, 2)
It can be found as two positions. In addition, the threshold value mentioned above
T ₃ can be given as a function of the character pitch P, like the threshold T ₁ described above. Next, when determining the optimal separation candidate position using the evaluation scale U, the separation candidate section that becomes the terminal state may be set only within the blank space that is the end of the character line, or the blank size described above may be set. It is also possible that W _i can be set within a permissible interval that satisfies Wi>T ₄ ·P (where T ₄ is a threshold) with respect to character pitch P. If the latter method is adopted, for example, the first
Region E in FIG. c is detected as a permissible section in which the terminal state occurs. Furthermore, in calculating the evaluation scale U, the separation candidate section (k+1) to be set next is
is set using the above-mentioned formula (1) from the separation candidate interval k, the separation candidate interval (k+1) that satisfies formula (1)
If all of the separation candidate sections k are not permissible sections, the separation candidate section k can be set as the separation candidate section k that becomes the terminal state. On the other hand, the separation candidate section that becomes the starting end state for determining the optimal separation candidate position using the evaluation scale U can be set based on the position of the separation candidate section that becomes the end state that has already been detected.

尚、本発明の文字分離における分離候補区間設
定方法は、上述した条件以外でも、文字ピツチ
Ｐ、文字塊幅Vi、空白サイズWiに基づいて設定
できることは言うまでもない。 It goes without saying that the separation candidate section setting method for character separation of the present invention can be set based on the character pitch P, the character block width Vi, and the blank size Wi in addition to the conditions described above.

第２図は本発明における最適な文字分離位置を
抽出するための原理を説明するための図である。
図において、黒点で示した位置は、第１図ｃで示
した分離候補区間ｋ（ｋ＝０，…４）の各分離候
補位置ｘ（ｋ，i_k）の値を示している。また、文
字ピツチＰは20である場合を示している。更に、
本発明の原理を簡単に説明するために、分離対象
となる文字列イメージは、第１図ｃにおける分離
候補区間０から分離候補区間４までとする。ま
ず、記号の定義を行う。記号μ_d（ｒ，ｎ；i_r，i_o）
（但し、１≦ｒ＜ｎ）は、分離候補区間ｒの分離
候補位置ｘ（ｒ，i_r）から分離候補区間ｎの分離
候補位置ｘ（ｎ，i_o）に到るまでの各分離候補区
間で任意に選択されたｎ−ｒ＋１個の分離候補位
置ｘ（ｒ，i_r），ｘ（ｒ＋１，i_r+1），…，ｘ（ｎ，i_o）
から得られるｎ−ｒ個の距離ｄ（ｒ，ｒ＋１；i_r，
i_r+1）ｄ（ｒ＋１，ｒ＋２；i_r+1，i_r+2）…，ｄ（ｎ
−１，ｎ；i_o-1，i_o）の平均値を示す。記号σ² _d
（ｒ，ｎ；i_r，i_o）（但し、１≦ｒ≦ｎ）は、分離
候補区間ｒの分離候補位置ｘ（ｒ，i_r）から分離
候補区間ｎの分離候補位置ｘ（ｎ，i_o）に到るま
での各分離候補区間で任意に選択されたｎ−ｒ＋
１個の分離候補位置ｘ（ｒ，i_r），ｘ（ｒ＋１，
i_r+1），…，ｘ（ｎ，i_o）から得られるｎ−ｒ個の
距離ｄ（ｒ，ｒ＋１；i_r，i_r+1），ｄ（ｒ＋１，ｒ＋
２；i_r+1，i_r+2）の前述した平均値μ_d（ｒ，ｎ；i_r，
i_o）における分散を示す。そこで、始端状態とな
る分離候補区間ｒ（第２図においてはｒ＝０であ
る）から終端状態となる分離候補区間ｎ（第２図
においてはｎ＝４）に到るまでの文字分離位置
は、式(2)で示す評価尺度Ｕを最小とするような分
離候補位置ｘ（ｒ，i_r），ｘ（ｒ＋１，i_r+1），…，
ｘ（ｎ，i_o）を求めることによつて得られる。 FIG. 2 is a diagram for explaining the principle for extracting optimal character separation positions in the present invention.
In the figure, the positions indicated by black dots indicate the values of each separation candidate position x(k, i _k ) in the separation candidate section k (k=0, . . . 4) shown in FIG. 1c. Further, the case where the character pitch P is 20 is shown. Furthermore,
In order to simply explain the principle of the present invention, character string images to be separated are from separation candidate section 0 to separation candidate section 4 in FIG. 1c. First, let's define the symbols. Symbol μ _d (r, n; i _r , i _o )
(However, 1≦r<n) represents each separation candidate from separation candidate position x (r, i _r ) of separation candidate section r to separation candidate position x (n, i _o ) of separation candidate section n. n−r+1 separation candidate positions x(r, i _r ), x(r+1, i _r+1 ), ..., x(n, i _o )
n-r distances d(r, r+1; i _r ,
i _r+1 ) d(r+1, r+2; i _r+1 , i _r+2 )…, d(n
-1, n; _io-1 , _io ). symbol σ ² _d
(r, n; i _r , i _o ) (1≦r≦n) is calculated from the separation candidate position x (r, i _r ) in the separation candidate section r to the separation candidate position x (n, n−r+ arbitrarily selected in each separation candidate interval up to i _o )
One separation candidate position x(r, i _r ), x(r+1,
n-r distances _d (r, r+1; i _r _, i _r+1 ), d(r+1, r+
2; i _r+1 , i _r+2 ), the above-mentioned average value μ _d (r, n; i _r ,
i _o ). Therefore, the character separation position from the separation candidate section r (r = 0 in Fig. 2) which is the start state to the separation candidate section n (n = 4 in Fig. 2) which is the end state is , separation candidate positions x(r, i _r ), x(r+1, i _r+1 ), ..., which minimize the evaluation scale U shown in equation (2).
It can be obtained by finding x(n, _io ).

Ｕ(r,n)＝β・σ² _d（ｒ，ｎ；i_r，i_o）＋(1+β)・（μ_d（ｒ，ｎ；i_r，i_o）−Ｐ）² ……(2) 式(2)における重み係数βは、０≦β≦１を満た
す。ここで、式(2)を最小にするような分離候補位
置のより具体的な実現方法は、以下に示す動的計
画法を用いて、メモリ容量を費やすことなく行う
ことができる。そこで、分離候補区間ｋ＋１にお
ける任意の分離候補位置ｘ（ｋ＋１，i_k+1）にお
いて、１つ前の分離候補区間ｋ（但し、ｋ＝０は、
始端状態を含む分離候補区間とする。）の任意の
分離候補位置ｘ（ｋ，i_k）（但し、i_k＝１，２…h_k
とし、h_k≧１とする）から式(2)を満足する分離候
補位置ｘ（ｋ＋１，i_k）に到る最適な分離候補位
置ｘ（ｋ，i_k）を後述する漸化式を用いて求める
ことができる。まず、距離ｄ（ｋ，ｋ＋１；i_k，
i_k+1）（但し、i_k＝１，２…h_k）を求め、以下に示
す式（３−１），式（３−２），式（３−３）を計
算する。U(r,n)=β・σ ² _d (r, n; i _r , i _o ) +(1+β)・(μ _d (r, n; i _r , i _o )−P) ² ... (2) The weighting coefficient β in equation (2) satisfies 0≦β≦1. Here, a more specific method for realizing separation candidate positions that minimizes Equation (2) can be performed without consuming memory capacity using the dynamic programming method described below. Therefore, at any separation candidate position x (k+1, i _k+1 ) in separation candidate section k+1, the previous separation candidate section k (however, if k=0,
This is the separation candidate section that includes the start state. ) for any separation candidate position x(k, i _k ) (where i _k =1,2...h _k
The optimal separation candidate position x (k, i _k ) that satisfies equation (2) is determined from the separation candidate position x (k + 1, i _k ) from h _k ≥ 1) using the recurrence formula described below. can be found. First, the distance d(k, k+1; i _k ,
i _k+1 ) (where i _k =1, 2...h _k ), and calculate equations (3-1), (3-2), and (3-3) shown below.

μ_d（０，ｋ＋１；i₀，i_k+1）＝１／ｋ＋１｛ｋ・μ^* _d（０，ｋ；i₀，i_k）＋ｄ（ｋ，ｋ＋１；i_k，i_k+1）｝ ……（３−１）Ｄ（ｋ＋１）＝D^*(k)＋d² （ｋ，ｋ＋１；i_k，i_k+1） ……（３−２）Ｕ（０，ｋ＋１）＝β・（Ｄ（ｋ＋１）／ｋ＋１−μ² _d （０，ｋ＋１；i₀，i_k））² ＋(1-β)（μ_d（０，ｋ＋１；i₀，i_k）−Ｐ）²
……（３−３）分離候補区間ｋのh_k個の分離候補位置ｘ（ｋ，
１），…ｘ（ｋ，h_k）に対して、式（３−３）の評
価尺度Ｕ（０，ｋ＋１）を最小とする分離候補位
置ｘ（ｋ，i_k）が分離候補区間ｋ＋１の任意の分
離候補位置ｘ（ｋ＋１，i_k+1）への最適な分離候
補位置となる。μ _d (0, k+1; i ₀ , i _k+1 )=1/k+1 {k・μ ^* _d (0, k; i ₀ , i _k ) +d(k, k+1; i _k , i _k+1 ) } ...(3-1) D(k+1)=D ^* (k)+d ² (k, k+1; i _k , i _k+1 ) ...(3-2) U(0, k+1)=β・( D(k+1)/k+1−μ ² _d (0, k+1; i ₀ , i _k )) ² +(1−β)(μ _d (0, k+1; i ₀ , i _k )−P) ²
...(3-3) h _k separation candidate positions x(k,
1), ... x (k, h _k ), the separation candidate position x (k, i k ) that minimizes the evaluation scale U (0, k + 1) of equation (3-3) is the separation candidate position x (k, i _k ) of the separation candidate section k + 1. This becomes the optimal separation candidate position for an arbitrary separation candidate position x (k+1, i _k+1 ).

ここで、始端状態となる分離候補区間０におけ
る各分離候補位置ｘ（０，i₀）（第２図においてi₀
＝１となる）において、式（３−１）に示す最適
な平均値μ^* _d（０，０；i₀，i₀）＝０，式（３−２）
に示す最適な距離ｄ（−１，０；i_-1，i₀）の２乗
累積和D^*（０）＝０とする。分離候補区間ｋの各
分離候補位置ｘ（ｋ，i_k）には上述した最適な平
均値μ^* _d（０，ｋ；i_k，i₀）及び最適な距離ｄ（ｋ−
１，ｋ；i_k-1，i_k）の２乗の累積和D^*（ｋ−１，
ｋ）を記憶しておけば、次の分離候補区間ｋ＋１
の各分離候補位置ｘ（ｋ＋１，i_k+1）における分
離候補区間ｋの最適な分離候補位置ｘ（ｋ，i_k）
が求められる。尚、式（３−３）における１項
は、式(2)で示した分散σ² _d（（０，ｋ＋１；i₀，i_k+1
）
の別の表現方法になつている。 Here, each separation candidate position x (0, i ₀ ) in separation candidate section 0, which is the start state (i ₀ in FIG.
= 1), the optimal average value μ ^* _d (0, 0; i ₀ , i ₀ ) = 0 shown in equation (3-1), equation (3-2)
Assume that the cumulative sum of squares of the optimal distance d (-1, 0; i _-1 , i ₀ ) shown in is D ^* (0)=0. Each separation candidate position x (k, i _k ) of separation candidate section k has the above-mentioned optimal average value μ ^* _d (0, k; i _k , i ₀ ) and optimal distance d (k−
1, k; i _k-1 , i _k ) cumulative sum D ^* (k-1,
k), the next separation candidate section k+1
The optimal separation candidate position x (k, i _{k ) of separation candidate section k at each separation candidate position x (k+1} , i _k+1 )
is required. Note that the first term in equation (3-3) is the variance σ ² _d ((0, k+1; i ₀ , i _k+1
)
It has become a different way of expressing.

次に、第２図を用いて、式（３−１），式（３
−３）の計算過程を説明する。図において、カツ
コで示した値は、それぞれ各分離候補区間ｋ（ｋ
＝０，１，２，３，４）の各分離候補位置ｘ（ｋ，
i_k）において、式（３−１）及び式（３−３）の
漸化式で示された平均値μ^* _d（０，ｋ；i₀，i_k）及び
評価尺度Ｕ（０，ｋ）を示しており、分離候補位
置ｘ（ｋ−１，i_k-1）からの最適な値として算出
したものである。尚、本説明では、式（３−３）
における重み係数βは、0.5とした場合について
述べる。また、図における矢印は、それぞれ、最
適な分離候補位置の系列を示している。例えば、
分離候補位置ｘ（２，１）は位置39であり、分離
候補位置ｘ（１，１）との距離ｄ（１，１；１，
１）は19となる。そこで、分離候補位置ｘ（１，
１）を通る分離候補位置ｘ（２，１）における平
均値μ_d（０，２）は、式（３−１）及び図より
１／２・（１×20＋19）となり、値19.5となる。次に、分離候補位置ｘ（１，１）には、式（３−２）
で示されるD^*(1)＝20²が記憶されている（図中、
省略）のため、式（３−２）を用いて、Ｄ(2)＝
20²＋19²となる。そこで、分離候補位置ｘ（１，
１）を通る分離候補位置ｘ（２，１）における評
価尺度Ｕ（０，２）はＵ（０，２）＝0.5（20²＋19²／
２ −19.5²）＋0.5・（19.5−20）²となり、値0.38とな
る。同様に、分離候補位置ｘ（１，２）を通る分
離候補位置ｘ（２，１）における評価尺度Ｕ（０，
２）も計算され（但し、計算は省略する）、値
1.26となる。そこで、分離候補位置ｘ（２，１）
に対して２つの評価尺度Ｕ（０，２）のうち、最
小値をとると、１つ前の分離候補区間１における
最適な分離候補位置はｘ（１，１）となり、また、
平均値μ^* _d（０，２）＝19.5、評価尺度Ｕ（０，２）＝
0.38が選択される。以下、同様な操作を式（３−
１）、式（３−２）、式（３−３）で示した漸化式
を用いて、行うことによつて、第２図で示したよ
うに、各分離候補位置ｘ（ｋ，i_k）（ｋ＝０，１，
２，３，４）における評価尺度Ｕ（０，ｋ）（但
し、ｋ＝０，１，２，３，４）が計算される。 Next, using Figure 2, equation (3-1) and equation (3
-3) The calculation process will be explained. In the figure, the values shown in brackets are for each separation candidate section k (k
=0,1,2,3,4) for each separation candidate position x(k,
i _k ), the average value μ ^* _d (0, k; i ₀ , i _k ) and the evaluation scale U (0, k ), which is calculated as the optimal value from the separation candidate position x (k-1, i _k-1 ). In addition, in this explanation, formula (3-3)
The case where the weighting coefficient β in is set to 0.5 will be described. Further, each arrow in the figure indicates a series of optimal separation candidate positions. for example,
The separation candidate position x (2, 1) is the position 39, and the distance from the separation candidate position x (1, 1) is d (1, 1; 1,
1) becomes 19. Therefore, the separation candidate position x(1,
The average value μ _d (0,2) at the separation candidate position x(2,1) passing through 1) is 1/2·(1×20+19) from equation (3-1) and the figure, and has a value of 19.5. Next, for the separation candidate position x (1, 1), formula (3-2) is used.
D ^* (1) = 20 ² shown by is stored (in the figure,
(omitted), using equation (3-2), D(2)=
20 ² + 19 ² . Therefore, the separation candidate position x(1,
The evaluation scale U (0, 2) at the separation candidate position x (2, 1) passing through 1) is U (0, 2) = 0.5 (20 ² + 19 ² /
2 -19.5 ² ) + 0.5・(19.5 - 20) ² , giving a value of 0.38. Similarly, the evaluation scale U(0,
2) is also calculated (however, the calculation is omitted), and the value
It becomes 1.26. Therefore, separation candidate position x (2, 1)
If the minimum value is taken from the two evaluation scales U(0,2) for
Average value μ ^* _d (0, 2) = 19.5, evaluation scale U (0, 2) =
0.38 is selected. Below, similar operations are performed using the formula (3-
1), equation (3-2), and equation (3-3), each separation candidate position x(k, i _k ) (k=0,1,
2, 3, 4), the evaluation scale U(0, k) (where k=0, 1, 2, 3, 4) is calculated.

次に、前述したように、終端状態となる分離候
補区間内の分離候補位置ｘ（４，２），ｘ（４，
３），ｘ（４，４）のうち、評価尺度Ｕ（０，４）
が最小となる分離候補位置ｘ（４，２）を文字分
離の終了位置として選択する。そこで、最適な分
離候補位置の系列を文字分離の終了位置ｘ（４，
２）より逆にたどることによつて、ｘ（４，２）＝
81，ｘ（３，３）＝60，ｘ（２，２）＝20，ｘ（１，
１）＝20，ｘ（０，１）＝０として求めることがで
きる。 Next, as described above, separation candidate positions x(4,2), x(4,
3), among x(4,4), the evaluation scale U(0,4)
The separation candidate position x (4, 2) with the minimum is selected as the end position of character separation. Therefore, we set the series of optimal separation candidate positions to the character separation end position x(4,
2) By tracing backwards, x(4,2)=
81, x (3, 3) = 60, x (2, 2) = 20, x (1,
1)=20, x(0,1)=0.

第３図は、本発明の具体的一実施例を示す論理
ブロツク図である。走査部１は、紙面上に記載さ
れた文字列イメージを光学的に走査して、電気信
号に変換し、２値量子化後、文字列イメージメモ
リ２へ書き込む。文字塊抽出部３は、文字列イメ
ージメモリ２に格納された文字列イメージから文
字塊を順次抽出し、各文字塊の位置及び幅及び高
さを文字塊情報レジスタ２１へ格納する。尚、こ
のような文字塊抽出部３は、公知の技術を用いて
求めることができる。文字ピツチ検出部４は、文
字塊情報レジスタ２１に格納された各文字塊の位
置及び文字塊幅、更には文字の高さを用いて、文
字ピツチＰを推定し、文字ピツチ情報レジスタ２
２に格納する。尚、このような文字ピツチ検出部
４は、同一出願人による前記引用例１の明細書で
示されている技術を用いて求めることができる
し、また予め文字ピツチＰが既知であれば、与え
られた文字ピツチＰを用いても良い。パラメータ
情報レジスタ３０は、本発明で用いる種々の閾値
や重み係数であるパラメータT₁，T2，T₃，T₄，
T₅，βを格納する。許容区間抽出部５は、第１
図を用いて述べた条件(1)及び条件(2)を満足する許
容区間を抽出する。最初に、条件(1)で示した空白
となる許容区間は、文字塊情報レジスタ２１に格
納された複数個の文字塊の位置及び文字塊幅Vi
を用いて、空白となる位置及び空白サイズが比較
回路等によつて抽出される。次に、条件(2)で示し
た黒地領域内の許容区間は、最初に、各文字塊幅
Viが、文字ピツチ情報レジスタ２２に格納され
た文字ピツチＰとパメータ情報レジスタ３０に格
納されたパラメータT₁との和Ｐ＋T₁より大きい
か否かを比較し、大きければ、各文字塊幅Viの
両端からパラメータ情報レジスタT₂で示された
値までを除いて、文字塊幅Viを含む区間を許容
区間として抽出する。以上のようにして、抽出さ
れた条件(1)を満たす空白となる許容区間及び条件
(2)を満たす黒地領域内の許容区間が抽出され、許
容区間情報レジスタ２３に、抽出された各許容区
間の位置及び幅が格納される。終端候補区間抽出
部６は、文字行イメージに対応して、順次許容区
間レジスタ２３に格納された許容区間のうち、空
白となる許容区間Wiについて、パラメータ情報
レジスタ３０に格納されたパラメータT₄及び文
字ピツチＰとの積T₄・Ｐを算出し、積T₄・Ｐと
空白となる許容区間Wiとを比較することによつ
て、積T₄・Ｐよりも大きくなる許容区間Wiを検
出する。次に、許容区間Wiの始端から、パラメ
ータT₅と文字ピツチＰとの積T₅・Ｐ（但しT₅≦
T₄）までの許容区間を算出し、更に許容区間Wi
の直前に存在する文字塊幅Vi始端から文字ピツ
チＰ及びパラメータT₁との和Ｐ＋T₁内の許容区
間を算出して、上述した２つの許容区間の論理和
を、終端候補区間として、順次、終端候補区間レ
ジスタ２４に格納する。 FIG. 3 is a logic block diagram showing a specific embodiment of the present invention. A scanning unit 1 optically scans a character string image written on a paper surface, converts it into an electrical signal, and writes it into a character string image memory 2 after binary quantization. The character block extraction unit 3 sequentially extracts character blocks from the character string image stored in the character string image memory 2, and stores the position, width, and height of each character block in the character block information register 21. Note that such a character chunk extraction unit 3 can be obtained using a known technique. The character pitch detection unit 4 estimates the character pitch P using the position and character block width of each character block stored in the character block information register 21, and furthermore, the height of the character.
Store in 2. Incidentally, such a character pitch detection section 4 can be obtained using the technique shown in the specification of Cited Example 1 by the same applicant, and if the character pitch P is known in advance, The character pitch P may be used. The parameter information register 30 stores parameters T ₁ , T2, T ₃ , T ₄ , which are various threshold values and weighting coefficients used in the present invention.
Store T ₅ and β. The permissible interval extraction unit 5
A permissible interval that satisfies condition (1) and condition (2) described using the diagram is extracted. First, the allowable blank interval shown in condition (1) is based on the positions of multiple character blocks and the character block width Vi stored in the character block information register 21.
Using this, the blank position and blank size are extracted by a comparison circuit or the like. Next, the permissible interval within the black background area shown in condition (2) is determined by the width of each character block.
It is compared whether Vi is larger than the sum P+T ₁ of the character pitch P stored in the character pitch information register 22 and the parameter T ₁ stored in the parameter information register 30, and if it is, the width of each character block Vi is Excluding the values from both ends to the value indicated by the parameter information register _T2 , the section including the character block width Vi is extracted as a permissible section. As described above, the blank allowable interval and conditions that satisfy the extracted condition (1)
Tolerable sections within the black background area that satisfy (2) are extracted, and the position and width of each extracted allowable section are stored in the allowable section information register 23. The terminal candidate section extraction unit 6 extracts the parameters T 4 and 4 stored in the parameter information register 30 for the blank allowable section Wi among the allowable sections sequentially stored in the allowable section register ₂₃ corresponding to the character line image. By calculating the product T ₄・P with the character pitch P and comparing the product T ₄・P with the blank allowable interval Wi, detect the allowable interval Wi that is larger than the product T ₄・P. . Next, from the start of the allowable interval Wi, the product T ₅ · P of the parameter T ₅ and the character pitch P (however, T ₅ ≦
Calculate the allowable interval up to T ₄ ) and further calculate the allowable interval Wi
Calculate the permissible interval within the sum P + T ₁ of the character pitch P and the parameter T ₁ from the character block width Vi that exists immediately before, and use the logical sum of the two permissible intervals described above as the end candidate interval, and sequentially calculate It is stored in the end candidate section register 24.

第４図ａ及びｂに終端候補区間抽出部６によつ
て、抽出される終端候補区間の一例を示す。第４
図ａの場合、終端区間は、図中T₅・Ｐで示した
区間として求められる。第４図ｂの場合、終端区
間は、図中最後の矢印で示した区間であり、
T₅・ＰとＰ＋T₁との論理和のうち、空白となる
許容区間となつている。分離候補区間抽出部７
は、第１図ｃを用いて説明したような分離候補区
間ｋの各分離候補位置ｘ（ｋ，i_k）を許容区間情
報レジスタ２３及びパラメータ情報レジスタ３０
に格納された許容区間及びパラメータを用いて、
順次抽出する。尚、文字分離開始位置を含む始端
分離候補区間０の各分離候補位置ｘ（０，i₀）（但
し、i₀＝１，２…h₀）は制御部１０によつて、最
初に、文字列イメージの始端から文字ピツチＰに
基づいて設定される一定範囲の空白となる許容区
間内の各分離候補位置より求められ、最適分離位
置情報レジスタ２６に格納されているものとす
る。そこで、分離候補区間抽出部７は、すでに、
抽出され最適分離候補位置レジスタ２６に格納さ
れた分離候補区間ｋ（但し、ｋ＝０，１，２…）
の分離候補位置ｘ（ｋ，i_k）（但し、i_k＝１，２…
h_k）から式(1)を満たす許容区間内に属する分離候
補位置ｘ（ｋ＋１，i_k+1）を算出する。即ち、分
離候補区間ｋの第１番目の分離候補位置ｘ（ｋ，
１）から文字ピツチＰとパラメータ情報レジスタ
３０に格納されたパラメータT₃を用いてｘ（ｋ，
１）＋Ｐ−T₃となる位置を算出し、更に分離候補
区間ｋの最後の分離候補位置ｘ（ｋ，h_k）から文
字ピツチＰとパラメータT₃を用いて、ｘ（ｋ，
h_k）＋Ｐ＋T₃となる位置を算出する。上記２つの
位置ｘ（ｋ，１）＋Ｐ−T₃，ｘ（ｋ，h_k）＋Ｐ＋T₃
によつて得られる区間の各分離候補位置のうち、
論理積をとることによつて前述した許容区間に属
する分離候補位置を分離候補区間ｋ＋１の各分離
候補位置ｘ（ｋ＋１，i_k+1）（但し、i_k+1＝１，２
…h_k+1）として抽出し、分離候補位置情報レジス
タ２５に格納される。評価尺度演算部８に、分離
候補位置情報レジスタ２５の内容が入力された
時、最適分離位置情報レジスタ２６には、すでに
演算された分離候補区間０から分離候補区間ｋま
での各分離候補位置ｘ（０，i₀）（但し、i₀＝１…
h₀），ｘ（１，i₁）（但し、i₁＝１…h₁）_，…ｘ（ｋ，
i_k）（但し、i_k＝１…h_k）が格納されている。更
に、分離候補区間ｍ（ｍ＝０…ｋ）の各分離候補
位置ｘ（ｍ，i_n）（但し、i_n＝１…h_n）に対応し
て、評価尺度演算部８によつて、式（３−１）よ
り計算された平均値μ^* _d（０，ｍ；i₀，i_n）、式（３
−２）より計算された距離の２乗の累積和D^*(k)、
式（３−３）より計算された評価尺度Ｕ（０，ｍ）
及び直前の分離候補区間ｍ−１の最適な分離候補
位置ｘ（ｍ−１，i_n-1）が格納されている。尚、
制御部１０によつて、分離候補区間０の各分離候
補位置ｘ（０，i₀）が格納された時、各分離候補
位置ｘ（０，i₀）に対応して格納される平均値μ^* _d
（０，０；i₀，i₀）及び距離の２乗の累積和D^*
（０）は０が格納されているものとする。そこで、
評価尺度演算部８は、分離候補位置情報レジスタ
２５より順次転送される分離候補位置ｘ（ｋ＋１，
i_k）において、最初に最適分離位置情報レジスタ
２６に格納された分離候補区間ｋの各分離候補位
置ｘ（ｋ，i_k）（但し、i_k＝１，２…h_k）における
距離ｄ（ｋ，ｋ＋１；i_k，i_k+1）を算出し、更にそ
の平均値μ^* _d（０，ｋ；i₀，i_k）、距離の２乗累積和
D^*(k)及びパラメータ情報レジスタ３０に記憶さ
れたパラメータβを用いて、順次、式（３−１）
で示した漸化式μ_d（０，ｋ＋１；i₀，i_k+1）＝
１／ｋ＋１｛ｋ・μ^* _d（０，ｋ；i₀，i_k）＋ｄ（ｋ，ｋ
＋１；i_ki_k+1）、式（３−２）で示した漸化式Ｄ（ｋ
＋１）＝D^*(k)＋d²（ｋ，ｋ＋１；i_k，i_k+1）、式（３
−３）で示した漸化式Ｕ（０，ｋ＋１）＝β・
（Ｄ（ｋ＋１）／ｋ＋１−μ² _d（０，ｋ＋１；i₀，i_k+1））²＋（１ −β）・（μ_d（０，ｋ＋１）；i₀，i_k+1）−Ｐ）²を計
算
することによつて、直前の分離候補区間ｋの分離
候補位置ｘ（ｋ，i_k）（但し、i_k＝１…h_k）に対す
る評価尺度Ｕ（０，ｋ＋１）を算出する。 FIGS. 4a and 4b show an example of a terminal candidate section extracted by the terminal candidate section extraction section 6. Fourth
In the case of figure a, the terminal section is determined as the section indicated by _T5 ·P in the figure. In the case of Fig. 4b, the terminal section is the section indicated by the last arrow in the figure,
Of the logical sum of T ₅ ·P and P + T ₁ , it is a blank allowable interval. Separation candidate section extraction unit 7
is the separation candidate position x(k, i _k ) of the separation candidate area k as explained using FIG.
Using the tolerance interval and parameters stored in
Extract sequentially. Note that each separation candidate position x (0, i ₀ ) (however, i ₀ = 1, 2...h ₀ ) of the starting edge separation candidate section 0 including the character separation start position is determined by the control unit 10 first. It is assumed that the separation candidate positions are determined from each separation candidate position within a certain range of blank allowable intervals set based on the character pitch P from the starting end of the column image, and are stored in the optimum separation position information register 26. Therefore, the separation candidate section extraction unit 7 has already
Separation candidate section k extracted and stored in the optimal separation candidate position register 26 (k = 0, 1, 2...)
Separation candidate position x (k, i _k ) (where i _k = 1, 2...
From h _k ), a separation candidate position x (k+1, i _k+1 ) belonging to the allowable interval that satisfies equation (1) is calculated. That is, the first separation candidate position x(k,
1), x( _k ,
1) Calculate _the position where +P-T ₃ , and then calculate x( _k ,
Calculate the position where h _k )+P+T ₃ . The above two positions x (k, 1) + P - T ₃ , x (k, h _k ) + P + T ₃
Of each separation candidate position in the section obtained by
By performing a logical product, the separation candidate positions belonging to the above-mentioned permissible interval are determined as each separation candidate position x (k+1, i _k+1 ) in the separation candidate interval k+1 (where i _k+1 =1, 2
... h _k+1 ) and stored in the separation candidate position information register 25. When the contents of the separation candidate position information register 25 are input to the evaluation scale calculation unit 8, the optimum separation position information register 26 contains the already calculated separation candidate positions x from separation candidate section 0 to separation candidate section k. (0, i ₀ ) (however, i ₀ = 1...
h ₀ ), x (1, i ₁ ) (where i ₁ = 1...h ₁ ) _, ...x(k,
i _k ) (where i _k =1...h _k ) are stored. Furthermore, corresponding to each separation candidate position x (m, i _n ) (where i _n =1... h _n ) of the separation candidate section m (m=0...k), the evaluation scale calculation unit 8 calculates the following: The average value μ ^* _d (0, m; i ₀ , i _n ) calculated from equation (3-1), equation (3
−2) Cumulative sum of squared distances D ^* (k),
Evaluation scale U (0, m) calculated from formula (3-3)
and the optimal separation candidate position x(m-1,in _-1 ) of the immediately preceding separation candidate section m-1 are stored. still,
When each separation candidate position x (0, i ₀ ) of separation candidate section 0 is stored by the control unit 10, the average value μ stored corresponding to each separation candidate position x (0, i ₀ ) ^* _d
(0, 0; i ₀ , i ₀ ) and the cumulative sum of the square of the distance D ^*
It is assumed that 0 is stored in (0). Therefore,
The evaluation scale calculation unit 8 receives separation candidate positions x(k+1,
i _k ₎ , _the distance _d ( k, k+1; i _k , i _k+1 ), and further calculate its average value μ ^* _d (0, k; i ₀ , i _k ), and the cumulative sum of squares of the distances.
Using D ^* (k) and the parameter β stored in the parameter information register 30, formula (3-1)
The recurrence formula μ _d (0, k+1; i ₀ , i _k+1 )=
1/k+1{k・μ ^* _d (0, k; i ₀ , i _k )+d(k, k
+ 1; i _k i _k+1 ), recurrence formula D (k
+1) = D ^* (k) + d ² (k, k+1; i _k , i _k+1 ), equation (3
-3) recurrence formula U(0,k+1)=β・
(D(k+1)/k+1−μ ² _d (0, k+1; i ₀ , i _k+1 )) ² + (1 − β)・(μ _d (0, k+1); i ₀ , i _k+1 ) − P) ² , the separation candidate position x of the previous separation candidate section k is calculated. An evaluation scale U(0, k+1) for (k, i _k ) (where i _k =1...h _k ) is calculated.

次に、h_k個の分離候補位置ｘ（ｋ，i_k）のうち、
評価尺度Ｕ（０，ｋ＋１）が最小となる分離候補
位置ｘ（ｋ，i_k）を分離候補位置ｘ（ｋ＋１，i_k+1）
へ到達する１つ前の最適な分離候補位置ｘ（ｋ，
i_k）として求め、更に、評価尺度Ｕ（０，ｋ＋１）
の最小値及び評価尺度Ｕ（０，ｋ＋１）が最小値
となる平均値μ^* _d（０，ｋ＋１；i₀，i_k）、及び距離
の２乗累積和D^*（ｋ＋１）をそれぞれ分離候補位
置ｘ（ｋ＋１，i_k）と共に、最適分離位置情報レ
ジスタ２６に格納する。評価尺度演算部８におい
て、以上述べた演算処理を分離候補位置情報レジ
スタ２５より順次転送されるすべての分離候補位
置ｘ（ｋ＋１，i_k+1）に対して行われると、制御
部１０は、分離候補区間抽出部６に次の分離候補
区間ｋ＋２の各分離候補位置ｘ（ｋ＋２，i_k+2）
を抽出するように要求し、前述した同様な操作が
繰り返される。ここで、制御部１０は、評価尺度
演算部８によつて、最適分離位置情報レジスタ２
６に転送された分離候補区間ｋ＋１の分離候補位
置ｘ（ｋ＋１，i_k+1）が終端区間情報レジスタ２
４に格納された終端候補区間に到着したか否かを
調べ、到達しない場合には、上述した要求のみを
分離候補区間抽出部６に出力する。一方、分離候
補位置ｘ（ｋ＋１，i_k+1）が終端候補区間に到着
した場合、制御部１０は、上述した要求を分離候
補区間抽出部６に出力し、次の分離候補区間ｋ＋
２の各分離候補位置ｘ（ｋ＋２，i_k+2）（但し、
i_k+2＝１…h_k+2）が前述したように評価尺度演算
部８で評価された後、制御部１０によつて、最適
分離位置情報レジスタ２６に記憶された分離候補
位置ｘ（ｋ，i_k）のうち、上述した終端候補区間
内にある複数個の分離候補位置ｘ（ｎ，i_k）の評
価尺度Ｕ（０，ｎ）が最小となる分離候補位置ｘ
（ｎ，i_o）を評価した区間内の終点位置として検
出され、終点位置となる分離候補位置ｘ（ｎ，i_o）
に到達する最適な分離候補位置の系列が、最適分
離位置情報レジスタ２６を用いて、終点位置とな
る分離候補位置ｘ（ｎ，i_o）から順次、分離候補
位置ｘ（ｎ−１，i_o-1），…ｘ（０，i₀）と逆にたど
ることによつて抽出され、文字分離位置レジスタ
２７に記憶される。 Next, among the h _k separation candidate positions x(k, i _k ),
The separation candidate position x (k, i _k ) where the evaluation scale U (0, k + 1) is the minimum is determined as the separation candidate position x (k + 1, i _{k +1} )
The optimal separation candidate position x(k,
i _k ), and further, the evaluation scale U(0, k+1)
The average value μ ^* _d (0, k + 1; i ₀ , i _k ) at which ^the minimum value of It is stored in the optimum separation position information register 26 together with the position x(k+1, i _k ). When the evaluation scale calculation unit 8 performs the above-mentioned calculation processing on all separation candidate positions x (k+1, i _k+1 ) sequentially transferred from the separation candidate position information register 25, the control unit 10 The separation candidate section extraction unit 6 inputs each separation candidate position x (k+2, i _k+2 ) of the next separation candidate section k+2.
, and the same operations described above are repeated. Here, the control unit 10 uses the evaluation scale calculation unit 8 to control the optimum separation position information register 2.
The separation candidate position x (k+1, i _k+1 ) of the separation candidate section k+1 transferred to the end section information register 2
It is checked whether the end candidate section stored in section 4 has been reached, and if it has not been reached, only the above-mentioned request is output to the separation candidate section extraction section 6. On the other hand, when the separation candidate position x (k+1, i _k+1 ) arrives at the terminal candidate section, the control unit 10 outputs the above-mentioned request to the separation candidate section extraction section 6, and selects the next separation candidate section k+
2, each separation candidate position x (k+2, i _k+2 ) (however,
After _the separation candidate position _x ( k, i _k ), the separation candidate position x at which the evaluation scale U (0, n) of the plurality of separation candidate positions x (n, i _k ) within the above-mentioned terminal candidate section is the minimum.
Separation candidate position x (n, i _o ) that is detected as the end point position in the interval where (n, i _o ) is evaluated and becomes the end point position
The series of optimal separation candidate positions that reach the separation candidate position x (n-1, _i _o _-1 ₎ , .

次に制御部１０は、前述した終点位置ｘ（ｎ，
i_o）から最初に検出される文字塊の始端までの空
白となる許容区間内で、文字ピツチＰに基づいて
設定される一定範囲を次に分離すべき部分文字列
イメージの始端となる分離候補区間０として、最
適分離位置情報レジスタ２６に格納し、前述した
ような操作を行うように、指令する。このように
して、文字列イメージメモリ２に格納された文字
列イメージの文字分離位置が文字分離位置レジス
タ２７に格納され、上述した文字塊情報レジスタ
２１に記憶された各文字塊の高さ及び文字分離位
置レジスタ２７に格納された文字分離位置を用い
ることによつて、１文字単位に分離することがで
きる。 Next, the control unit 10 controls the aforementioned end point position x(n,
A separation candidate that will be the start of the substring image to be separated next in a certain range set based on the character pitch P within the blank allowable interval from i _o ) to the start of the first detected character block. It is stored in the optimum separation position information register 26 as section 0, and commands are given to perform the operations described above. In this way, the character separation position of the character string image stored in the character string image memory 2 is stored in the character separation position register 27, and the height and character of each character block stored in the above-mentioned character block information register 21 are stored. By using the character separation position stored in the separation position register 27, it is possible to separate each character.

第５図は、第３図における評価尺度演算部８の
具体的な一実施例を示す論理ブロツク図である。 FIG. 5 is a logical block diagram showing a specific embodiment of the evaluation scale calculating section 8 in FIG.

前述したように分離候補位置情報レジスタ２５
に分離候補区間ｋ＋１の各分離候補位置ｘ（ｋ＋
１，i_k+1）（但し、i_k+1＝１…h_k+1）が格納される
と、第３図で示した制御部１０によつて、分離候
補位置ｘ（ｋ＋１，i_k+1）が距離算出部８１及び
分離候補位置群レジスタ２６１の所定の位置へ転
送され、分離候補区間ｋ＋１はステージレジスタ
８０及び分離候補位置群レジスタ２６１の所定の
位置に格納される。距離算出部８１に分離候補位
置ｘ（ｋ＋１，i_k+1）が格納されると、制御部１
０によつて、分離候補位置群レジスタ２６１に格
納された分離候補区間ｋの各分離候補位置ｘ（ｋ，
i_k）（但し、i_k＝１…h_k）が順次、距離算出部８１
に転送される。ここで第３図で示した最適分離位
置情報レジスタ２６は分離候補位置群レジスタ２
６１，連接情報群レジスタ２６２，最適統計量群
レジスタ２６３、最適評価値群レジスタ２６４か
ら構成される。距離算出部８は分離候補位置ｘ
（ｋ＋１，i_k+1）と順次転送される分離候補位置
との距離ｄ（ｋ，ｋ＋１；i_k，i_k+1）＝ｘ（ｋ＋１，
i_k+1）−ｘ（ｋ，i_k）を算出する。 As mentioned above, the separation candidate position information register 25
, each separation candidate position x(k+
1, i _k+1 ) (where i _k+1 =1...h _k+1 ), the control unit 10 shown in _FIG . ₊₁ ) is transferred to a predetermined position in the distance calculation unit 81 and separation candidate position group register 261, and separation candidate section k+1 is stored in a predetermined position in the stage register 80 and separation candidate position group register 261. When the separation candidate position x (k+1, i _k+1 ) is stored in the distance calculation unit 81, the control unit 1
0, each separation candidate position x(k,
i _k ) (where i _k =1...h _k ) are sequentially calculated by the distance calculation unit 81
will be forwarded to. Here, the optimum separation position information register 26 shown in FIG. 3 is the separation candidate position group register 2.
61, a connection information group register 262, an optimal statistics group register 263, and an optimal evaluation value group register 264. The distance calculation unit 8 calculates the separation candidate position x
Distance d(k, k+ ₁ ; i _k , i _k+1 )=x(k+1,
i _k+1 )−x(k, i _k ).

統計量算出部８２は、前述した式（３−１）及
び軸（３−２）で示した漸化式に基づいて、平均
値μ_d（０，ｋ＋１；i₀，i_k+1）及び距離の２乗累積
和Ｄ(k)を算出する。即ち、平均値μ_d（０，ｋ＋
１；i₀，i_k+1）は読み出された最適統計群レジス
タ２６３に格納された分離候補位置ｘ（ｋ，i_k）
における平均値μ^* _d（０，ｋ；i₀，i_k）、距離算出部
８１の出力ｄ（ｋ，ｋ＋１；i_k，i_k+1）ステージレ
ジスタ８０の内容である分離区間ｋ＋１及びｋを
用いて、計算式 μ_d(0,k+1;i₀，i_k+1）＝１／ｋ＋１｛ｋ・μ^* _d （０，ｋ；i₀，i_k）+d(k,k+1；i_k，i_k+1）により算出される。一方、距離の２乗累積和Ｄ
（ｋ＋１）は読み出された最適統計群レジスタ２
６３に格納された分離候補位置ｘ（ｋ，i_k）にお
ける距離の２乗累積和D^*(k)と距離算出部８１の
出力ｄ（ｋ，ｋ＋１；i_k，i_k+1）を用いて、計算式Ｄ（ｋ＋１）＝D^*(k)＋ｄ（ｋ，ｋ＋１；i_k，i_k+1）により算出される。統計量算出部８２より算出さ
れた平均値μ_d（０，ｋ＋１；i₀，i_k+1）、及び距離
の２乗累積和Ｄ（ｋ＋１）はそれぞれ統計量格納
レジスタ８２に格納される。評価値算出部８４
は、前述した式（３−２）に基づいて、評価尺度
Ｕ（０，ｋ＋１）の値を算出する。即ち、評価値
Ｕ（０，ｋ＋１）は、第３図で示した文字ピツチ
Ｐ及びパラメータ情報レジスタ３０に格納された
パラメータβ及び統計量格納レジスタ８３の内容
及びステージレジスタ８０の内容を用いて、計算
式Ｕ(0,k+1)＝β・（Ｄ（ｋ＋１）／ｋ＋１−μ² _d （０，ｋ＋１；i₀，i_k+1））²＋(1-β) ・（μ_d（０，ｋ＋１；i₀，i_k+1）−Ｐ）² により算出される。次に、比較部８５において、
評価算出部８４の出力である評価値と最小評価値
レジスタ８６の内容を比較し、評価値算出部８４
の出力が、最小評価値レジスタ８６の内容よりも
小さければ、その出力信号線８５１の出力信号８
５１Ｓを“ON”にする。尚、最小評価値レジス
タ８６の内容は、最初非常に大きな値がセツトさ
れているものとする。 The statistics calculation unit 82 calculates the average value μ _d (0, k+1; i ₀ , i _k+1 ) and Calculate the cumulative sum of squared distances D(k). That is, the average value μ _d (0, k+
1; i ₀ , i _k+1 ) is the separation candidate position x (k, i _k ) stored in the read optimal statistical group register 263
The average value μ ^* _d (0, k; i ₀ , i _k ), the output d (k, k+1; i _k , i _k+1 ) of the distance calculation unit 81 and the separated sections k+1 and k which are the contents of the stage register 80 Using the calculation formula μ _d (0,k+1;i ₀ ,i _k+1 )=1/k+1{k・μ ^* _d (0,k;i ₀ ,i _k )+d(k,k +1; i _k , i _k+1 ). On the other hand, the cumulative sum of squared distances D
(k+1) is the read optimal statistical group register 2
Using the squared cumulative sum D ^* (k) of the distance at the separation candidate position x(k, i _k ) stored in 63 and the output d(k, k+1; i _k , i _k+1 ) of the distance calculation unit 81, It is calculated using the calculation formula D(k+1)=D ^* (k)+d (k, k+1; i _k , i _k+1 ). The average value μ _d (0, k+1; i ₀ , i _k+1 ) and the cumulative sum of squares of distances D(k+1) calculated by the statistics calculation unit 82 are stored in the statistics storage register 82, respectively. Evaluation value calculation unit 84
calculates the value of the evaluation scale U(0, k+1) based on the above-mentioned formula (3-2). That is, the evaluation value U(0, k+1) is calculated using the character pitch P shown in FIG. Calculation formula U(0,k+1)=β・(D(k+1)/k+1−μ ² _d (0, k+1; i ₀ , i _k+1 )) ² +(1−β) ・(μ _d ( 0, k+1; i ₀ , i _k+1 )−P) ² . Next, in the comparison section 85,
The evaluation value output from the evaluation calculation unit 84 is compared with the contents of the minimum evaluation value register 86, and the evaluation value calculation unit 84
If the output is smaller than the contents of the minimum evaluation value register 86, the output signal 8 of the output signal line 851 is
Turn 51S “ON”. It is assumed that the contents of the minimum evaluation value register 86 are initially set to a very large value.

出力信号線８５１Ｓが“ON”になると、ゲー
ト回路５３が開いて、評価値算出部８４の出力が
最小評価値レジスタ８６に転送される。 When the output signal line 851S is turned “ON”, the gate circuit 53 is opened and the output of the evaluation value calculation unit 84 is transferred to the minimum evaluation value register 86.

また、出力信号８５１Ｓが“ON”になると、
統計量格納レジスタ８３に格納された平均値μ_d
（０，ｋ＋１；i₀，i_k+1）及び距離の２乗累積和Ｄ
（ｋ＋１）が、ゲート回路５２が開くことによつ
て、最小統計量レジスタ８８に格納される。 Also, when the output signal 851S turns “ON”,
Average value μ _d stored in statistics storage register 83
(0, k+1; i ₀ , i _k+1 ) and cumulative sum of squares of distance D
(k+1) is stored in the minimum statistic register 88 by opening the gate circuit 52.

更に、出力信号８５１Ｓが“ON”になると、
ゲート回路５１が開くことによつて、距離算出部
８１に転送された分離候補区間ｋの分離候補位置
ｘ（ｋ，i_k）における位置情報ｋ及びi_kが連接情報
レジスタ８７に格納される。 Furthermore, when the output signal 851S becomes “ON”,
By opening the gate circuit 51, the position information k and i _k at the separation candidate position x (k, i _k ) of the separation candidate section k transferred to the distance calculation unit 81 is stored in the connection information register 87.

以上の操作を最適分離位置情報２６１に格納さ
れた分離候補区間ｋの分離候補位置ｘ（ｋ，i_k）
（但し、i_k＝１…h_kに対して行われる。 The above operation is performed at the separation candidate position x (k, i _k ) of the separation candidate section k stored in the optimal separation position information 261.
(However, this is performed for i _k =1...h _k .

次に、第３図で示した制御部１０は分離候補レ
ジスタ２５より距離算出部に転送された分離候補
位置ｘ（ｋ＋１，i_k+1）における最適な平均値μ^*
（０，ｋ＋１；i₀，i_k+1）及び最適な距離の２乗累
積和D^*（ｋ＋１）として、最小統計量レジスタ８
８の内容を最小統計量群レジスタ２６３に転送
し、分離候補位置ｘ（ｋ＋１，i_k+1）の最適な評
価値として、最小評価値レジスタ８６の内容を最
小評価値群レジスタ２６４へ転送し、更に、分離
候補区間ｋにおける分離候補位置ｘ（ｋ＋１，i_k）
への最適な分離パス情報として、連接情報レジス
タ８７の内容を連接情報群レジスタ２６３へ転送
する。 Next, _the control unit 10 shown in FIG ^.
(0, k+1; i ₀ , i _k+1 ) and the minimum statistics register 8 as the cumulative sum of squares of the optimal distance D ^* (k+1)
8 is transferred to the minimum statistics group register 263, and the contents of the minimum evaluation value register 86 are transferred to the minimum evaluation value group register 264 as the optimal evaluation value of the separation candidate position x (k+1, i _k+ 1 ). , Furthermore, the separation candidate position x (k+1, i _k ) in the separation candidate section k
The contents of the linkage information register 87 are transferred to the linkage information group register 263 as the optimal separation path information to the linkage information register 263.

次に、最小評価値レジスタ８６の内容に、初期
値（非常に大きな値）をセツトする。 Next, the contents of the minimum evaluation value register 86 are set to an initial value (a very large value).

以上の操作を繰り返すことによつて分離候補区
間ｋ＋１のすべての分離候補位置ｘ（ｋ＋１，
i_k+1）（但し、i_k+1＝１，２…h_k+1）に対して、最
適な評価値及び最適な分離パスが得られる。 By repeating the above operations, all separation candidate positions x(k+1,
i _k+1 ) (where i _k+1 =1, 2...h _k+1 ), an optimal evaluation value and an optimal separation path can be obtained.

尚、本発明の具体的な別の実現方法として、通
常のマイクロコンピユータを用いて、実現できる
ことは言うまでもない。 It goes without saying that another specific method of implementing the present invention is to use an ordinary microcomputer.

以上、述べたように、本発明を適用することに
よつて、文字間の接触が生じてもまた、１文字が
２文字以上にスプリツトする場合が生じても、容
易にしかも安定に、一文字単位の分離を行うこと
が可能となる。 As described above, by applying the present invention, even if there is contact between characters or if one character is split into two or more characters, it is possible to easily and stably It becomes possible to separate the

[Brief explanation of drawings]

第１図は部分文字列イメージの一例を用いて、
本発明における文字分離の分離候補区間設定方法
の一例を説明するための図、第２図は本発明にお
ける最適な文字分離位置を抽出するための原理を
説明するための図、第３図は本発明の具体的一実
施例を示す論理ブロツク図、第４図は第３図にお
ける終端候補区間抽出部６によつて、抽出される
終端候補区間の一例を示す。第５図は第３図にお
ける評価尺度演算部８の具体的な一実施例を示す
論理ブロツク図である。図において、１は走査部、２は文字列イメージ
メモリ、３は文字塊抽出部、２１は文字塊情報レ
ジスタ、４は文字ピツチ検出部、２２は文字ピツ
チ情報レジスタ、３０はパラメータ情報レジス
タ、５は許容区間抽出部、２３は許容区間情報レ
ジスタ、６は終端候補区間抽出部、２４は終端候
補区間レジスタ、７は分離候補区間抽出部、２５
は分離候補位置情報レジスタ、８は評価尺度演算
部、２６は最適分離位置情報レジスタ、２７は文
字分離位置レジスタ、１０は制御部、８０はステ
ージレジスタ、８１は距離算出部、２６１は分離
候補位置群レジスタ、２６２は連接情報群レジス
タ、２６３は最適統計量群レジスタ、２６４は最
適評価値群レジスタ、８２は統計量算出部、８３
は統計量格納レジスタ、８４は評価値算出部、８
５は比較部、８６は最小評価値レジスタである。 Figure 1 uses an example of a partial string image,
A diagram for explaining an example of a separation candidate section setting method for character separation in the present invention, FIG. 2 is a diagram for explaining the principle for extracting an optimal character separation position in the present invention, and FIG. FIG. 4, a logical block diagram illustrating a specific embodiment of the invention, shows an example of a terminal candidate section extracted by the terminal candidate section extracting section 6 in FIG. FIG. 5 is a logical block diagram showing a specific embodiment of the evaluation scale calculating section 8 in FIG. In the figure, 1 is a scanning unit, 2 is a character string image memory, 3 is a character block extraction unit, 21 is a character block information register, 4 is a character pitch detection unit, 22 is a character pitch information register, 30 is a parameter information register, 5 23 is a permissible interval extraction unit, 23 is an allowable interval information register, 6 is a terminal candidate interval extractor, 24 is a terminal candidate interval register, 7 is a separation candidate interval extractor, 25
8 is a separation candidate position information register, 8 is an evaluation scale calculation unit, 26 is an optimal separation position information register, 27 is a character separation position register, 10 is a control unit, 80 is a stage register, 81 is a distance calculation unit, 261 is a separation candidate position group register, 262 is a connection information group register, 263 is an optimal statistics group register, 264 is an optimal evaluation value group register, 82 is a statistics calculation unit, 83
is a statistics storage register, 84 is an evaluation value calculation unit, 8
5 is a comparison section, and 86 is a minimum evaluation value register.

[Claims]

１現金の収集時に装置の外部に引きだし可能な
現金収納部と、該現金収納部と、現金の入出金口との間で現金
の収納、繰り出し、及び搬送を行う現金出納機構
と、該現金出納機構の現金搬送経路において搬送途
上の現金の種別を鑑別する鑑別部と、該鑑別部よりの鑑別結果信号を受信することに
より、該現金収納部に収容される現金の数量を計
数する計数手段とを有する現金取扱装置におい
て、該現金収納部が装置の外部に引き出されたこと
を検出する検出手段と、該検出手段の発する検出出力を受信したことに
より、該計数手段の計数値をクリヤするクリヤ手
段とを備えることを特徴とする現金取扱装置。 1. A cash storage section that can be pulled out to the outside of the device when collecting cash; a cash accounting mechanism that stores, dispenses, and transports cash between the cash storage section and a cash deposit/withdrawal port; and the cash transaction mechanism. a discrimination section that discriminates the type of cash being conveyed on the cash conveyance route of the mechanism; and a counting means that counts the amount of cash stored in the cash storage section by receiving a discrimination result signal from the discrimination section. A cash handling device having: a detection means for detecting that the cash storage section has been pulled out of the device; and a clearing means for clearing the count value of the counting means upon receiving a detection output emitted by the detection means. A cash handling device comprising: means.