JPS60164879A - Character separating device - Google Patents

Character separating device

Info

Publication number
JPS60164879A
JPS60164879A JP59020300A JP2030084A JPS60164879A JP S60164879 A JPS60164879 A JP S60164879A JP 59020300 A JP59020300 A JP 59020300A JP 2030084 A JP2030084 A JP 2030084A JP S60164879 A JPS60164879 A JP S60164879A
Authority
JP
Japan
Prior art keywords
character
pitch
distance
blocks
evaluation value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP59020300A
Other languages
Japanese (ja)
Inventor
Yoshitake Tsuji
辻 善丈
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Nippon Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp, Nippon Electric Co Ltd filed Critical NEC Corp
Priority to JP59020300A priority Critical patent/JPS60164879A/en
Priority to US06/683,576 priority patent/US4635290A/en
Priority to DE8484115985T priority patent/DE3486104T2/en
Priority to DE3486241T priority patent/DE3486241T2/en
Priority to EP91100048A priority patent/EP0428499B1/en
Priority to EP84115985A priority patent/EP0146147B1/en
Publication of JPS60164879A publication Critical patent/JPS60164879A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To separate a character row image into a single character by calculating the dispersion of the distance between separated characters, and calculating a dispersed evaluation value consisting of plural dispersion maximum likelihood linear sum to identify a character pitch. CONSTITUTION:A scan device 1 optically scans a character row image described on paper, converts it into an electric signal and quantizes it into binary to write it in a character row image memory 2. A character string extraction device 3 sequentially extracts a character string from said image, and stores start point position and size of each of said strings in a character string register 4. A controller 7 calculates the distance between the character strings from the start and end positions to be transferred, and increments the number of frequencies which are the contents of address position of said corresponding distance of a frequency distribution table 6. In such a way, the device forms the frequency distribution of said distance of the table 6 constituted of a memory.

Description

【発明の詳細な説明】 (技術分野) 本発明は文字分離装置に関し、特に紙面上に記載された
文字行イメージを個個の文字に分離して文字ピッチの識
別を行うのに適応的な文字分離装置に関するものである
[Detailed Description of the Invention] (Technical Field) The present invention relates to a character separation device, and particularly to a character separation device that is suitable for separating a character line image written on paper into individual characters and identifying the character pitch. This relates to a separation device.

(先行技術) 各種印刷文字等を光学的に読み取る装置(以下OCRと
呼ぶ)において一連の文字を認識する場合、各文字を1
字毎に分離して文字認識部に送シ込む必要がある。各文
字t−i字毎に分離するために必要となる情報としては
文字ピッチがあるが、これはOCRの読み取9対象とな
る印刷物の大きさや種類が限定されれば前もって与える
ことができる。
(Prior Art) When recognizing a series of characters with a device that optically reads various printed characters (hereinafter referred to as OCR), each character is
It is necessary to separate each character and send it to the character recognition unit. The information required to separate each character t-i is the character pitch, but this can be provided in advance if the size and type of printed matter to be read by OCR 9 is limited.

しかし、最近のようにOCRにおける読み取シ対象が不
特定な文字ピッチを持つ郵便物や文書のような床机な適
用範囲のものに及ぶようになると、あらかじめ文7″′
・−′Pf−知ること力゛できな?で紙面上の文字行イ
メージから文字ピッチを推定する必要が生じる。従来、
文字ピッチの推定方法としては例えば、平均値等が用い
られていた。しかし、英印字文字のようにフォントや文
字カテゴリーによって個個の文字幅が大きく異なる場合
や接触する文字数が増加した場合には、上述した平均値
と実際の文字ピッチとでは1文字の分離を行う時に生じ
る誤差が無視できなくなる。そのため、例えば前記平均
値を用いて文字間で接触が生じた多くの文字を含む文字
行イメージ全分離する場合は、接触した文字の個数を誤
ったり不正確な分離位置で切断されたシするという問題
があった。また上述したOCRの読み取シ条件下では、
通常、等ピッチ印字データだけでなく文字ピッチが可変
となる可変ピッチ印字データ、時には手書き文字も含ま
れる場合がある。そこで例えば、欧文可変ピッチ印字デ
ータ間で文字間の接触が生じると、等ピッチ印字データ
と同様な文字分離処理の形態を採用することが困難とな
るという第2の問題があった。
However, in recent years, OCR has begun to read items such as mail and documents with unspecified character pitch, such as floor desks, etc.
・-'Pf- Can't you have the power to know? Therefore, it becomes necessary to estimate the character pitch from the character line image on the paper. Conventionally,
For example, an average value has been used as a method for estimating character pitch. However, when the width of individual characters differs greatly depending on the font or character category, such as in the case of English printed characters, or when the number of characters that touch each other increases, the above average value and the actual character pitch are separated by one character. The errors that sometimes occur cannot be ignored. Therefore, for example, if you use the above average value to separate an entire character line image that includes many characters that are in contact with each other, it is possible that the number of characters in contact may be incorrect or that characters may be cut at incorrect separation positions. There was a problem. Also, under the OCR reading conditions mentioned above,
Usually, not only uniform pitch print data but also variable pitch print data in which the character pitch is variable, and sometimes handwritten characters may also be included. Therefore, for example, if contact occurs between characters in European variable pitch print data, there is a second problem in that it becomes difficult to employ the same character separation process as in uniform pitch print data.

シれは、文字分離のプロセスが欧文可変ピッチで代表さ
れる印字データ集合と等ピッチ印字データ集合とではそ
れぞれ異なっていることを意味する。
This means that the character separation process is different between a print data set represented by variable pitch European characters and a uniform pitch print data set.

従って、通常の1文字単位の分離処理を行う以前に前述
した文字ピッチの識別を行うことが重要となる。即ち、
文字分離装置には、1文字の分離を行うための文字ピッ
チを検出すると共に文字分離対象となる文字行イメージ
が推定された文字ピッチに対してどの程度のばらつきを
持って文字イメージが印字されているかを示す分散評価
値も検出することが要求される。そこで、このような分
散評価値が算出されれば、例えば同一出願人によシ昭和
58年12月20日に出願された発明(以下先願発明と
呼ぶ)の「文字分離装置」を用いる場合、文字ピッチや
文字パターンイメージの特徴を用いて、1文字単位に分
離するためのパラメータを決定するために利用すること
ができるし、更に、欧文タイプライタのような可変ピッ
チ印字文字や文字枠を意識せずに書かれた手書き文字の
ような文字ピッチを主体とした文字分離が困難なデータ
集合が混在する場合にも上述した分散評価値を用いて識
別することが可能となシ、よシ汎用的な6CRの文字読
取対象に対しても適用できる文字分離装置を実現するこ
とができる之゛ (発明の目的) 本発明は上記問題点を解決するためになされたもので、
その目的は、最良な文字ピッチを検出すると同時に、文
字行イメージが検出された文字ピッチに対してどの程度
のばらつきを持って印字されているかを示す分散評価尺
度をも算出することによって文字ピッチの性質を把握し
、適応的な文字分離処理を提供することにある。また本
発明の他の目的は、文字間に接触を含む場合や1文字が
2文字以上に分離する場合等の種種の文字読取対象条件
下でも、安定にしかも最良の文字ピッチを検出すること
が可能な文字分離装置を提供することにある。
Therefore, it is important to identify the character pitches described above before performing normal character-by-character separation processing. That is,
The character separation device detects the character pitch for separating one character and also determines how much variation the character line image to be separated has with respect to the estimated character pitch. It is also required to detect a variance evaluation value indicating whether the Therefore, if such a dispersion evaluation value is calculated, for example, when using a "character separation device" of an invention filed on December 20, 1982 by the same applicant (hereinafter referred to as the "earlier invention"). , it can be used to determine parameters for separating individual characters by using the characteristics of the character pitch and character pattern image, and furthermore, it can be used to determine the parameters for separating characters into individual characters. Even when there are mixed data sets that are difficult to separate characters based on character pitch, such as handwritten characters written unconsciously, it is possible to identify them using the variance evaluation value described above. It is possible to realize a character separation device that can be applied to general-purpose 6CR character reading objects. (Objective of the Invention) The present invention has been made to solve the above problems.
The purpose is to detect the best character pitch and at the same time calculate the variance evaluation measure that shows how much variation the character line image is printed with respect to the detected character pitch. The goal is to understand the characteristics and provide adaptive character separation processing. Another object of the present invention is to stably detect the best character pitch even under various character reading conditions, such as when there is contact between characters or when one character is separated into two or more characters. The object of the present invention is to provide a possible character separator.

<amo構成) 本発明によれば、一連の文字行イメージを走査 5− し1文字に分離する文字分離装置において、前記一連の
文字行イメージから複数個の文字塊を順次検出する手段
と、前記複数個の文字塊から文字塊間距離を算出し該文
字塊間距離の頻度分布を記憶する手段と、文字ピッチの
存在区間を予測する手段と、前記頻度分布を複数個の領
域に分割し該複数個の領域における該文字ピッチ推定誤
差の最尤線形和によって構成される推定誤差評価尺度が
最小と壜る前記存在区間に含まれる候補文字ピッチを検
出する手段と、前記文字ピッチを用馳て分割された複数
個の前記領域内に含まれる前記文字塊間距離の分散を算
出し複数個の該分散の最尤線形和によって構成される・
分散評価値を算出する手段と、該分散評価値に基づいて
文字ピッチの識別を行い該文字行イメージ′f:1文字
に分離する手段とを備えることを特徴とする文字分離装
置が得られる。
<amo configuration) According to the present invention, in a character separation device that scans a series of character line images and separates them into single characters, means for sequentially detecting a plurality of character chunks from the series of character line images; means for calculating distances between character clusters from a plurality of character clusters and storing a frequency distribution of the distances between character clusters; means for predicting an interval in which a character pitch exists; and means for dividing the frequency distribution into a plurality of regions, means for detecting a candidate character pitch included in the existing interval in which an estimation error evaluation scale constituted by a maximum likelihood linear sum of the character pitch estimation errors in a plurality of regions is the minimum; Calculate the variance of the distances between the character blocks included in the plurality of divided regions, and configure by the maximum likelihood linear sum of the plurality of variances.
A character separation device is obtained, comprising means for calculating a variance evaluation value, and means for identifying a character pitch based on the variance evaluation value and separating the character line image 'f: one character.

(実施例) ゛ 次に図面を参照して本発明について説明する。(Example) Next, the present invention will be explained with reference to the drawings.

第1図は本発明の文字ピッチ検出装置で用いる6− 文字塊間距離を説明するための文字行イメージの一例を
示す図である。同図において、斜線で示した白地で分離
可能な文字イメージすなわち文字塊を破線図示の矩形領
域で示している。ここで参照記号TJt、t+t (但
しi −1,L−6) ハfjJ!、 i 番目。
FIG. 1 is a diagram showing an example of a character line image for explaining the distance between six character blocks used in the character pitch detection device of the present invention. In the figure, a separable character image, that is, a block of characters, is shown by a rectangular area shown by a broken line on a white background shown by diagonal lines. Here, reference symbol TJt, t+t (however, i -1, L-6) HafjJ! , i-th.

文字塊の始端から第i+1番目の文字塊の始端までを文
字塊間距離としてめたものであり、また参照記号U′1
−+□(但し’ ” 1. a・・・6)は第1番目の
文字塊の終端から第i+1番目の文字塊の終端までを文
字塊間距離としてめたものである。
The distance between the character blocks is defined as the distance from the start of the character block to the start of the i+1th character block, and the reference symbol U'1
-+□ (where ''' 1.a...6) is the distance between character blocks from the end of the first character block to the end of the i+1th character block.

これらはすべて文字塊間距離として後述する文字塊間距
離の頻度分布を構成する観測値として利用することがで
きる。同様に、参照記号U、、、(但しi=1.2.・
・・6 、i(jであシ、図ではU□、3及びU09.
のみ示している)は第1番目の文字塊のl= x、z・
a 、 i <j テあシ、図ではU’、3(7)みも
のであり、通常、観測量が多ければ統計的には安定なの
で後述する文字塊間距離の頻度分布を構成する観測量と
して利用することができる。
All of these can be used as observed values constituting a frequency distribution of distances between character blocks, which will be described later as distances between character blocks. Similarly, reference symbol U, , (where i=1.2.・
...6, i (j), in the figure U□, 3 and U09.
) is the first character block l = x, z・
a , i < j teashi, in the figure U', 3 (7). Normally, the larger the number of observations, the more statistically stable it is, so the number of observations that make up the frequency distribution of the distance between character clusters, which will be described later. It can be used as

なお頻度分布を構成するための観測値となる上述した文
字塊間距離は、例えば文字塊間距離UI。
Note that the above-mentioned distance between character blocks, which is an observed value for configuring the frequency distribution, is, for example, the distance between character blocks UI.

j(但しj= i+1 、i+2.i+3)、文字塊間
距離U’1 、3 (j = i+1 、 i+2 、
 i+3)までを利用するというように制限することが
できる。また、ピリオド(・)、カンマ(1)等(いず
れも図示していない)のような文字イメージは、例えば
複数個の文字塊の平均高さHlを用いてその文字塊幅及
び高さを調べることによシ、文字塊間距離の観測量から
除去しても良い。更に、前述の平均高さHoに比べて大
きな空白が検出された場合も、その空白を含めた文字塊
間距離を観測量として用いないようにしても良い。
j (however, j=i+1, i+2.i+3), distance between character blocks U'1, 3 (j=i+1, i+2,
It is possible to limit the use of up to i+3). Also, for character images such as periods (・), commas (1), etc. (none of which are shown), the width and height of the character blocks are checked using, for example, the average height Hl of multiple character blocks. In particular, it may be removed from the observed amount of distance between character blocks. Furthermore, even if a blank space larger than the above-mentioned average height Ho is detected, the distance between character blocks including the blank space may not be used as the observed amount.

次に第2図は第1図で示したような等ピッチデータにお
ける一連の文字塊間距離の頻度分布の一例を示す図であ
る。同図において、頻度分布の横軸Uは文字塊間距離U
、、、の値を示しておシ、縦軸NUMは任意の文字塊間
距離における頻度数を示している。
Next, FIG. 2 is a diagram showing an example of the frequency distribution of distances between a series of character blocks in equal pitch data as shown in FIG. 1. In the same figure, the horizontal axis U of the frequency distribution is the distance between character blocks U
, , , and the vertical axis NUM indicates the frequency at a given distance between character blocks.

次に第3図囚及び(6)はそれぞれ等ピッチデータ及び
欧文可変ピッチデータにおける一連の文字塊間距離の頻
度分布の第1及び第2の例を示す図である。第3図囚は
等ピッチデータを、また第3図■は欧文可変ピッチデー
タを示している。
Next, Figures 3 and 6 are diagrams showing first and second examples of frequency distributions of distances between a series of character blocks in constant pitch data and Roman variable pitch data, respectively. Figure 3 shows constant pitch data, and Figure 3 shows variable pitch data in Roman languages.

続いて第2図及び第3図を用いて本発明の原理について
説明する。複数個の文子塊の高さから平均高さH□を算
出し、係数α□、α2(但しα1〈−α、)を用いて文
字ピッチの存在区間(α1・H□。
Next, the principle of the present invention will be explained using FIGS. 2 and 3. The average height H□ is calculated from the heights of multiple sentence blocks, and the character pitch existence interval (α1·H□) is calculated using the coefficients α□ and α2 (however, α1<−α,).

α2・H,)’t−設定する。ここで、前述した存在区
間(α、・HHtα2・H,)内のすべての文字塊間距
離を候補文字ピッチP1 としても良いが、以下に述べ
るような処理を用いれば候補文字ピッチPlの数を減少
させ処理時間の向上をはかることができる。すなわち前
述した存在区間(α□・Hm。
α2・H, )'t-set. Here, all the distances between character clusters in the existence interval (α, ・HHtα2・H,) described above may be taken as the candidate character pitch P1, but if the process described below is used, the number of candidate character pitches Pl can be It is possible to improve the processing time by reducing the amount of time required. That is, the existence interval (α□・Hm) mentioned above.

α2・H,)内で、一定許容幅Δτ(但しΔτ≧1)で
最頻値をとる文字塊間距離U (1) t−頻度分布か
ら算出し、文字ピッチの存在区間の下限値をMAX(9
− α1・Hm、(1−α3)・U(1))(但し係数α3
はO≦α3≦1を満たす)、文字ピッチの存在区間の上
限値MIN(α2・Hm、(1+α3)・U (1) 
)とすることができる。第2図では、区間C1が文字ピ
ッチの存在区間となり、区間C1に含まれる文字塊間距
離が候補文字ピッチPI(但しMAX(α1句H,,(
1−α3)・U(1))≦PK≦MIN(α2・HrI
、、(1+α3)−U(1))を満たす)となる。
The distance U between character clusters that takes the mode within a certain allowable width Δτ (however, Δτ ≥ 1) within α2・H, ) (1) Calculated from the t-frequency distribution, and MAX (9
− α1・Hm, (1−α3)・U(1)) (however, coefficient α3
satisfies O≦α3≦1), the upper limit value of the character pitch existence section MIN (α2・Hm, (1+α3)・U (1)
). In Fig. 2, section C1 is the section where character pitches exist, and the distance between character blocks included in section C1 is the candidate character pitch PI (however, MAX (α1 phrase H,, (
1-α3)・U(1))≦PK≦MIN(α2・HrI
, , (1+α3)−U(1)).

次に、任意の候補文字ピッチP、 を基量として頻度分
布は図中破線で示したような領域f’l+f2+・・・
f pに分割される。ここで各領域f′k(但しに=1
.2.−n)の境界点8 (f′に−1+ f′k)は
領域f4−1帥心(k−1)・P、と領域f′、の中心
に* psとの間に存在し、各領域f′にの境界点S 
(fJ、 、 flヤ□)は領域f(、の中心に−Pl
と領域f’に+1の中心(k+1)・PIとの間に存在
する。そこで、例えば境界点S (f′に−1+ ’′
k)は領域f’に−1の中心(k−1)・Plと領域f
l、の中心k −P、の中間点kf(の中心k −p、
と領域f’に+1の中心(k+1)10− 次に、領域、f′k(但しに=1.2.・・・n)内に
存在するnk個(但しn、≧0)の文字塊間距離の平均
値P(k、nk)を算出し該平均値P (1c+ ”k
)を整数にで除算することによって、候補文字ピッチP
1 を基準とした場合の領域f′kから観測される文字
ピッチに相当する量が検出される。そこで、nk)と候
補文字ピッチP、とのずれをすべての領域f′kにわた
って最小にする候補文字ピッチP。
Next, using an arbitrary candidate character pitch P, as a base quantity, the frequency distribution is in the area f'l+f2+... as shown by the broken line in the figure.
It is divided into f p. Here, each area f'k (where = 1
.. 2. -n) boundary point 8 (-1+f'k at f') exists between the region f4-1 center (k-1) P and * ps at the center of the region f', and each Boundary point S in area f'
(fJ, , flya□) is −Pl at the center of the area f(,
and the center of +1 (k+1)·PI in region f'. Therefore, for example, the boundary point S (−1+'' for f'
k) is the center of -1 (k-1) Pl in the area f' and the area f
The center of l, k - P, and the midpoint kf (center of k - P,
and the center of +1 in area f' (k+1) 10- Next, nk (n, ≧0) character blocks existing in area f'k (where = 1.2...n) Calculate the average value P (k, nk) of the distance between
) by dividing the candidate character pitch P by an integer.
An amount corresponding to the character pitch observed from the area f'k when 1 is used as a reference is detected. Therefore, the candidate character pitch P minimizes the deviation between the candidate character pitch P and the candidate character pitch P over all the regions f'k.

を推定文字ピッチPとする最尤線形推定手法を適用する
ことが有効となる。そのため、例えば、最尤推定を行う
ための評価基準となる推定誤差評価こで、係数C(k、
nk)はサンプル数nk及び整数k(但しに−1,2,
・n)の関数であシ、″”xCk=1 (Jnk)=1を満たす。具体的な係数C(k。
It is effective to apply a maximum likelihood linear estimation method in which P is the estimated character pitch. Therefore, for example, in the estimation error evaluation, which is the evaluation standard for performing maximum likelihood estimation, the coefficient C(k,
nk) is the number of samples nk and the integer k (however, -1, 2,
・It is a function of n), and satisfies ""xCk=1 (Jnk)=1. The specific coefficient C(k.

”5c)(D−例としては、C(k、nk)−k −n
k/Σ k2・nkを用いることができる。なお、上述
に=1 した推定誤差評価尺度Tは推定誤差の分散量とな − っているが、ずれの絶対量1−・P (k + ”k)
−P+ Iに基づく評価基準を用いることもできる。こ
のようにして、最適な推定文字ピッチPを推定できると
共に、文字塊間距離の頻度分布のクラスター化も同時に
行われる。
"5c) (D-For example, C(k, nk)-k -n
k/Σ k2·nk can be used. Note that the estimation error evaluation scale T, which is equal to 1 above, is the amount of variance of the estimation error, but the absolute amount of deviation is 1-・P (k + ``k)
An evaluation criterion based on -P+I can also be used. In this way, the optimal estimated character pitch P can be estimated, and the frequency distribution of distances between character blocks can be clustered at the same time.

次に第3図囚、@を用いて本発明における文字ピッチ識
別手法について説明する。第3図囚、@に示したように
文字塊間距離の頻度分布は、等ピッチ印字データと可変
ピッチ印字データでは異なることがわかる。この相異は
第2図で示した最適す文字ピッチPによってクラスター
化された各領域fk(但しに=1.2.・・・n)にお
ける推定文字ピッチPのに倍からの各文字塊間距離の分
散σ”(fsc)をすべての領域f1.f2.・・・に
わたって評価することによって識別することができる。
Next, the character pitch identification method according to the present invention will be explained using FIG. 3 and @. As shown in Figure 3, the frequency distribution of the distance between character blocks is different between uniform pitch print data and variable pitch print data. This difference is between each character block from twice the estimated character pitch P in each region fk (where = 1.2...n) clustered by the optimal character pitch P shown in Figure 2. It can be identified by evaluating the distance variance σ'' (fsc) over all regions f1, f2, .

そこで、クラスター化された各領域fk(但しに−1,
2,・・・n)の分散σ” (fk)の線形和で構成さ
れる分散評価を用いることができる。ここで、上述した
分散量) 価値ε2における係数C(k、nk)は、文
字塊間距離のサンプル数nk及び整数にの関数でありX
Ck=1 (k、nk)=11満たす。具体的な係数C(k。
Therefore, each clustered region fk (−1,
2,...n) can be used. Here, the coefficient C(k, nk) in the value ε2 (the above-mentioned variance amount) can be used. It is a function of the number of samples nk of the distance between clusters and an integer, and
Ck=1 (k, nk)=11 is satisfied. The specific coefficient C(k.

nk)の−例としては、C(k、nk)−に2・nk/
)k2・nkを用いることができる。なお、上述した分
散評価値ε2の代わりに誤差評価値6を用いることもで
きることは言うまでもない。
nk) - For example, C(k, nk) - has 2・nk/
) k2·nk can be used. Note that it goes without saying that the error evaluation value 6 can be used instead of the above-mentioned variance evaluation value ε2.

次に、分散評価値ε2あるいは誤差評価値eが、あらか
じめ設けられた閾値α4よシも大きければ上述した推定
文字ピッチPを主体とする文字分離が可能な場合、すな
わち等ピッチ印字データであシ、前記閾値α4よシも小
さければ上述した推定文字ピッチを主体とする文字分離
が適用できない場合、すなわち欧文可変ピッチ印字デー
タであることが判明し、文字ピッチの性賛ヲ識別するこ
とができる。このようにして、文字ピッチの性質を識別
することが可能になると、文字ピッチを主体として文字
分離が可能な場合には、例えば前記先願発明の「文字分
離装置゛」を用いて安定に文字分離を行うことができる
し、また上述した以外の公13− 知の文字分離技術を用いることもできる。一方、文字ピ
ッチを主体とする文字分離が適用できない場合には、例
えば容易に実現できる公知の文字行イメージの空白によ
る文字分離技術を用いて文字分離を行うこともできる。
Next, if the variance evaluation value ε2 or the error evaluation value e is larger than the preset threshold α4, character separation based on the above-mentioned estimated character pitch P is possible, that is, equal pitch print data is used. If the threshold value α4 is also smaller than the above-mentioned threshold value α4, it is found that the above-mentioned character separation based mainly on the estimated character pitch cannot be applied, that is, the data is European variable pitch print data, and the nature of the character pitch can be identified. In this way, if it becomes possible to identify the characteristics of the character pitch, if character separation is possible based on the character pitch, for example, the characters can be stably separated using the "character separation device" of the prior invention. Separation can be performed, and other well-known character separation techniques other than those described above can also be used. On the other hand, if character separation based mainly on character pitch cannot be applied, character separation can be performed using, for example, a known character separation technique using blank spaces in character line images that can be easily realized.

第4図は本発明の文字分離装置の具体的一実施例を示す
論理ブロック図である。走査装置1は紙面上に記載され
た文字行イメージを光学的に走査して、電気信号に変換
し、2値量子化後、文字行イメージメモリ2へ書き込む
。文字塊抽出装置3は、文字行イメージメモリ2に格納
された複数個の文字行イメージから文字塊を順次抽出し
、各文字塊の始点位置及び大きさを文字塊レジスタ4へ
格納する。なお、文字塊の大きさは、文字塊の幅及び高
さを表わすものとする。また、このような文字塊抽出装
置は、例えば、同一出願人による特願昭56−2751
2号明細書で示されている技術を用いることができる。
FIG. 4 is a logical block diagram showing a specific embodiment of the character separation device of the present invention. A scanning device 1 optically scans a character line image written on a paper surface, converts it into an electrical signal, and writes it into a character line image memory 2 after binary quantization. The character block extraction device 3 sequentially extracts character blocks from a plurality of character line images stored in the character line image memory 2, and stores the starting position and size of each character block in the character block register 4. Note that the size of a character block represents the width and height of the character block. Further, such a character block extraction device is disclosed in, for example, Japanese Patent Application No. 56-2751 filed by the same applicant.
The technique shown in the specification of No. 2 can be used.

制御装置7は、順次、文字塊レジスタ4から転送される
文字塊の始端位置及び終端位置から文字塊間距離を算出
し、頻度分14− 布テーブル6の対応する文字塊間距離のアドレス位置の
内容である頻度数をインクリメントする。
The control device 7 sequentially calculates the distance between character blocks from the start and end positions of the character blocks transferred from the character block register 4, and calculates the address position of the corresponding distance between character blocks in the frequency section 14-cloth table 6. Increment the frequency number that is the content.

このようにして、メモリから構成される頻度分布テーブ
ル6に、第2図で示したような文字塊間距離の頻度分布
が生成される。なお、頻度分布テーブル6は、最初Oに
初期化されているとする。次に、制御装置7によって文
字塊レジスタ4に格納された複数個の文字塊の高さから
平均高さHlが算出され、存在区間検出部50へ転送さ
れる。定数レジスタ19は、第2図で示した係数α□、
α2(但しα□くα2)、α3及び一定許容幅Δτ(但
しΔτ≧1)の各定数を格納する。存在区間検出部50
は、最初に定数レジスタ19から係数α□及びα2を入
力し、α□・Hm及びα2・Hoを前述した文字ピッチ
の存在区間の下限値α1・Hm及び上限値α2・Hoと
して設定する。次いで存在区間検出部50は、前述した
存在区間の下限値α1・Hoと上限値α2・H,内に属
する文字塊間距離の頻度値を制御装置7を介して順次頻
度分布テーブル6から読み出し、定数レジスタ19から
転送された一定許容幅Δτ(但しΔτ≧1)で最頻値と
なる文字塊間距離U (1) ?算出する。存在区間演
算部51は、存在区間検出部50においてめられた文字
塊間距離U(1)と定数レジスタ19から入力した係数
α3を用いて、値(1−α3)・U(1)及び(1+α
、)・U(1)を算出した後、MAX(α、・Hl、l
l、(1−α3)・U(1) ) ?前述した文字ピッ
チの存在区間の下限値PL、MIN(α2・H,n、(
1+α3)・U(1))を前述した文字ピッチの存在区
間の上限値pUとして、それぞれ存在区間レジスタ8に
格納する。なお、前述した文字塊間距離U(1)が存在
区間検出部50において検出されなかった場合には、文
字塊間距離U(1)として平均高さHlが存在区間検出
部50においてセットされるものとする。次いで制御装
置7によって存在区間レジスタ8に格納された文字ピッ
チの下限値PLがカウンタ14にセットされる。カウン
タ14は、下限値PLから上限値PUマで順次以下に述
べる動作が終了後にカウントアツプされ、そのカウント
値Pl(但しPL≦P、≦Ptr)を平均値算出部9に
転送する。平均値算出部9は、カウンタ14から転送さ
れるカウント値すなわち候補文字ピッチp、 ′tl−
基単と基量、頻度分布のn個の領域範囲すなわち領域f
k(但して頻度分布テーブル6を参照することによって
領域fkに属する文字塊間距離の数nk及び領域fkに
属する文字塊間距離の千吟値P(k、nk)を算出する
。以上の処理をn個の領域にわたシ行い、候補客字ピッ
チP、及び各領域fk(但しに=1.λ・・・n)の文
字塊間距離の数nk(但し*=1.z・・・n)及び平
均値P(k、nk)、(但しに= 1.2.・n )が
推定誤差評価値演算部10へ転送される。推定誤差評価
値演算部10は、前述した推定誤差評価値Tf平均値算
出部9から転送される情報を用いて演算することにより
てめられる。評価値レジスタ12には最小となる推定誤
差評価値Tが記憶される。カお最尤文字ピッチレジスタ
15にはちら17− かじめ推定誤差評価値として非常に大きな値がセットさ
れているものとする。比較部11において、推定誤差評
価値演算部10から出力された推定誤差評価値Tと評価
値レジスタ12の内容とを比較し、推定誤差評価値演算
部10の出力値が評価値レジスタ12の内容よシも小さ
ければ推定誤差評価値演算部10の出力値を評価値レジ
スタ12へ書き込み、!に候補文字ピッチPIO値を最
尤文字ピッチレジスタ15へ書き込み、制御装置7′f
:介してカウンタ14が1カウントアツプする。一方、
比較部11において推定誤差評価値演算部10の出力値
が評価値レジスタ12の内容よシも大きければ制御装置
7t−寒してカウンタ14が1カウントア、プされる。
In this way, a frequency distribution of distances between character blocks as shown in FIG. 2 is generated in the frequency distribution table 6 constructed from memory. It is assumed that the frequency distribution table 6 is initially initialized to O. Next, the average height Hl is calculated by the control device 7 from the heights of the plurality of character blocks stored in the character block register 4, and is transferred to the existing section detection unit 50. The constant register 19 stores the coefficient α□ shown in FIG.
The constants α2 (however, α□×α2), α3, and a constant allowable width Δτ (however, Δτ≧1) are stored. Existence section detection unit 50
First inputs the coefficients α□ and α2 from the constant register 19, and sets α□·Hm and α2·Ho as the lower limit value α1·Hm and upper limit value α2·Ho of the character pitch existing interval. Next, the existence section detection unit 50 sequentially reads the frequency values of the distances between character blocks that belong to the lower limit α1·Ho and the upper limit α2·H of the existence interval from the frequency distribution table 6 via the control device 7, Distance between character blocks U (1) that has the most frequent value with the constant allowable width Δτ (however, Δτ≧1) transferred from the constant register 19? calculate. Existence interval calculation unit 51 uses the distance between character blocks U(1) determined by existence interval detection unit 50 and coefficient α3 input from constant register 19 to calculate the values (1−α3)・U(1) and ( 1+α
, )・U(1), MAX(α,・Hl,l
l, (1-α3)・U(1) )? The lower limit value PL,MIN(α2・H,n,(
1+α3)·U(1)) are respectively stored in the existence interval register 8 as the upper limit pU of the character pitch existence interval. Note that when the above-mentioned distance between character blocks U(1) is not detected by the existing area detection unit 50, the average height Hl is set as the distance between character blocks U(1) in the existing area detection unit 50. shall be taken as a thing. Next, the lower limit value PL of the character pitch stored in the existence interval register 8 is set in the counter 14 by the control device 7. The counter 14 counts up after the operations described below are completed sequentially from the lower limit value PL to the upper limit value PU, and transfers the count value Pl (PL≦P,≦Ptr) to the average value calculation unit 9. The average value calculation unit 9 calculates the count value transferred from the counter 14, that is, the candidate character pitch p, 'tl-
n area ranges of base units, base quantities, and frequency distributions, that is, area f
k (However, by referring to the frequency distribution table 6, the number nk of distances between character blocks belonging to the area fk and the 1,000-degree value P(k, nk) of the distances between character blocks belonging to the area fk are calculated.The above processing is carried out over n areas, and the candidate character pitch P and the number of distances between character clusters nk (however, *=1.z... n) and the average value P(k, nk), (where = 1.2.·n) are transferred to the estimated error evaluation value calculation unit 10.The estimated error evaluation value calculation unit 10 is The value Tf is determined by calculation using the information transferred from the average value calculation unit 9. The minimum estimated error evaluation value T is stored in the evaluation value register 12. The maximum likelihood character pitch register 15 17- It is assumed that a very large value is set as the preliminary estimation error evaluation value.In the comparator 11, the estimation error evaluation value T output from the estimation error evaluation value calculation unit 10 and the evaluation value register 12, and if the output value of the estimated error evaluation value calculation unit 10 is smaller than the content of the evaluation value register 12, the output value of the estimated error evaluation value calculation unit 10 is written to the evaluation value register 12, and ! The candidate character pitch PIO value is written to the maximum likelihood character pitch register 15, and the control device 7'f
: The counter 14 increments by one. on the other hand,
In the comparison section 11, if the output value of the estimated error evaluation value calculation section 10 is larger than the content of the evaluation value register 12, the control device 7t is turned off and the counter 14 is counted up by one.

以上の動作をカウンタ14の値が文字ピッチの上限値P
Uに達するまで行うことによって、最適な文字ピッチP
が最尤文字ピッチレジスタ15に格納されるととになる
。分散評価値演算部13ゆ、最尤文字ピッチレジスタ1
号に最適な推定文字ピッチPがセットされると、推定文
字ピッチルt最尤文字ピッチレジスタ15か18− ら読み出し、推定文字ピッチP?基量として頻度分布の
n個の領域範囲、即ち領域fk(但しに=1゜テーブル
6を参照することによって領域fkに属する文字塊間距
離のサンプル数nk及び領域f、に属する文字塊間距離
の値に−Pにおける分散σ2(fk)を算出し、前述し
たよう表分散評価値62=分散評価値演算部13におい
て検出される分散σ3(fk)は平均値算出部9におい
て算出するようにすることも可能である。分散評価値演
算部13において算出された分散評価値C2は分散評価
値レジスタ16に格納される。閾値レジスタ18には前
記文字ピッチの識別を行うための閾値α4t−記憶する
。比較部17は前記分散評価値レジスタ16に格納され
た分散評価値ε2と閾値レジスタ18に格納された閾値
α4どを比較することKよりて文字ピッチの性質を識別
する。すなわち、分散評価値ε2が閾値α4よシも大き
くなれば推定文字ピッチPを主体とした文字分離位置の
決定方法が適用できないと判定され、分散評価値ε2が
閾値α4よυも小さければ推定文字ピッチPを主体とし
た文字分離位置の決定方法が適用できると判定される。
The above operation is performed until the value of the counter 14 is the upper limit value P of the character pitch.
By repeating until reaching U, the optimum character pitch P
is stored in the maximum likelihood character pitch register 15. Variance evaluation value calculation unit 13, maximum likelihood character pitch register 1
When the estimated character pitch P that is most suitable for the number is set, the estimated character pitch is read from the maximum likelihood character pitch register 15 or 18-, and the estimated character pitch P? By referring to n area ranges of the frequency distribution as a base quantity, that is, area fk (where = 1° Table 6), the number of samples nk of the distance between character blocks belonging to area fk and the distance between character blocks belonging to area f can be determined by referring to table 6. The variance σ2(fk) at −P is calculated for the value of , and the variance σ3(fk) detected in the table variance evaluation value 62=dispersion evaluation value calculation unit 13 as described above is calculated in the average value calculation unit 9. The variance evaluation value C2 calculated by the variance evaluation value calculation unit 13 is stored in the variance evaluation value register 16.The threshold value register 18 stores a threshold value α4t-storage for identifying the character pitch. The comparison unit 17 identifies the nature of the character pitch by comparing the variance evaluation value ε2 stored in the variance evaluation value register 16 with the threshold value α4 stored in the threshold value register 18. That is, the character pitch characteristic is identified by comparing the variance evaluation value ε2 stored in the variance evaluation value register 16 with the threshold value α4 stored in the threshold value register 18. If the value ε2 is larger than the threshold α4, it is determined that the character separation position determining method based mainly on the estimated character pitch P cannot be applied, and if the variance evaluation value ε2 is smaller than the threshold α4 by υ, the method for determining character separation positions based on the estimated character pitch P is determined as the main method. It is determined that the character separation position determination method described above is applicable.

参照符号20は1文字分離位置決定部であシ、前述した
ような公知の技術あるいはそれらを組合わせて用いるこ
とができる。す表わち、例えば前記先願発明の「文字分
離装置」と最も容易に実現できる空白による文字分離を
用いることができる。
Reference numeral 20 is a one-character separation position determination unit, and the known techniques as described above or a combination thereof can be used. In other words, it is possible to use, for example, the "character separation device" of the invention of the prior application and character separation using blank spaces, which can be realized most easily.

すなわち、比較部17において前述した分散評価値ε3
から推定文字ピッチPを主体とした文字分離が適用され
ると判定されると、制御装置7によって最尤文字ピッチ
レジスタ15に格納された推定文字ピッチP及び分散評
価値レジスタ16に格納された分散評価値ε!を例えば
前記先願発明の「文字分離装置」よシ構成された1文字
分離位置決定部20に転送する。一方、比較部17にお
いて推定文字ピッチPt−主体とした文字分離が適用で
きないと判定されると、前述した文字塊レジスタ4に格
納された複数個の文字塊イメージを1文字と見なし、1
文字分離位置決定部20の出力値とする。なお、1文字
分離位置決定部20は、上述の実施例以外のものでも公
知の技術を用いて構成できることは言うまでもない。
That is, the above-mentioned variance evaluation value ε3 in the comparing section 17
When it is determined that character separation based on the estimated character pitch P is applied, the control device 7 uses the estimated character pitch P stored in the maximum likelihood character pitch register 15 and the variance stored in the variance evaluation value register 16. Evaluation value ε! is transferred to the one-character separation position determination unit 20, which is configured, for example, as the "character separation device" of the prior invention. On the other hand, if the comparison unit 17 determines that the character separation based on the estimated character pitch Pt cannot be applied, it considers the plurality of character block images stored in the character block register 4 as one character, and
This is the output value of the character separation position determining section 20. Note that it goes without saying that the single character separation position determining section 20 can be constructed using a known technique other than the above-described embodiments.

(発明の効果) 以上述べたように本発明の文字分離位置によれば、あら
かじめ文字ピッチがわからなくとも、また文字塊の接触
や文字間の分離を含む文字イメージが含まれていても、
正確に文字ピッチを測定すると共に、文字ピッチの性質
を把握し適応性の高い文字分離装置が容易に実現できる
という効果が生じる。
(Effects of the Invention) As described above, according to the character separation position of the present invention, even if the character pitch is not known in advance or even if a character image including contact of character blocks or separation between characters is included,
The effect is that it is possible to accurately measure character pitch, grasp the characteristics of character pitch, and easily realize a highly adaptable character separation device.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明の文字分離装置で用いる文字塊間距離の
文字行イメージの一例を示す図、第2図は等ピッチデー
タにおける一連の文字塊間距離の頻度分布の一例を示す
図、第3図(ト)、@はそれぞ21− れ等ピッチ、可変ピッチデータにおける一連の文字塊間
距離の頻度分布の第1.第2の例を示す図及び第4図は
本発明の文字分離装置の具体的一実施例を示す論理ブロ
ック図である。 図において、1・・・・・・走査装置、2・・・・・・
文字行イメージメモリ、3・・・・・・文字塊抽出装置
、4・・・・・・文字塊レジスタ、50・・・・・・存
在区間検出部、51・・・・・・存在区間演算部、6・
・・・・・頻度分布テーブル、7・・・・・・制御装置
、8・・・・・・存在区間レジスタ、9・・・・・・平
均値算出部、10・・・・・・推定誤差評価値演算部、
11.17・・・・・・比較部、12・・・・・・評価
値レジスタ、13・・・・・・分散評価値演算部、14
・・・・・・カウンタ、15・・・・・・最尤文字ピッ
チレジスタ、16・・・・・・分散評価値レジスタ、1
8・・・・・・閾値レジスタ、19・・・・・・定数レ
ジスタ、20・・・・・・1文字分離位置決定部。 22−
FIG. 1 is a diagram showing an example of a character line image of the distance between character blocks used in the character separation device of the present invention, FIG. 2 is a diagram showing an example of the frequency distribution of the distance between a series of character blocks in equal pitch data, In Figure 3 (g), @ indicates the first frequency distribution of a series of distances between character blocks in constant pitch and variable pitch data. The second example and FIG. 4 are logical block diagrams showing a specific embodiment of the character separation device of the present invention. In the figure, 1... scanning device, 2......
Character line image memory, 3...Character block extraction device, 4...Character block register, 50...Existence section detection unit, 51...Existence section calculation Department, 6・
... Frequency distribution table, 7 ... Control device, 8 ... Existence interval register, 9 ... Average value calculation unit, 10 ... Estimation error evaluation value calculation unit,
11.17...Comparison unit, 12...Evaluation value register, 13...Dispersion evaluation value calculation unit, 14
... Counter, 15 ... Maximum likelihood character pitch register, 16 ... Variance evaluation value register, 1
8...Threshold value register, 19...Constant register, 20...1 character separation position determination unit. 22-

Claims (1)

【特許請求の範囲】[Claims] 一連の文字行イメージを走査し1文字に分離する文字分
離装置において、前記一連の文字行イメージから複数個
の文字塊を順次検出する手段と、前記複数個の文字塊か
ら文字塊間距離を算出し該文字塊間距離の頻度分布を記
憶する手段と、文字ピッチの存在区間を予測する手段と
、前記頻度分布を複数個の領域に分割し該複数個の領域
における該文字ピッチ推定誤差の最尤線形和によって構
成される推定誤差評価尺度が最小となる前記存在区間に
含まれる候補文字ピッチを検出する手段と、前記文字ビ
、チを用いて分割された複数個の前記領域内に含まれる
前記文字塊間距離の分散を算出し複数個の該分散の最尤
線形和によって構成される分散評価値を算出する手段と
、該分散評価値に基づいて文字ピッチの識別を行い該文
字行イメージを1文字に分離する手段とを備えることを
特徴とする文字分離装置。
A character separation device that scans a series of character line images and separates them into single characters, comprising means for sequentially detecting a plurality of character blocks from the series of character line images, and calculating distances between character blocks from the plurality of character blocks. means for storing the frequency distribution of the distance between character blocks; means for predicting the existing interval of character pitch; and means for dividing the frequency distribution into a plurality of regions and calculating the maximum of the character pitch estimation error in the plurality of regions. means for detecting a candidate character pitch included in the existing interval in which an estimated error evaluation measure constituted by a likelihood linear sum is minimized; means for calculating the variance of the distance between the character blocks and calculating a variance evaluation value constituted by the maximum likelihood linear sum of a plurality of said variances; and means for identifying the character pitch based on the variance evaluation value and generating the character line image. A character separating device comprising means for separating into one character.
JP59020300A 1983-12-20 1984-02-07 Character separating device Pending JPS60164879A (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
JP59020300A JPS60164879A (en) 1984-02-07 1984-02-07 Character separating device
US06/683,576 US4635290A (en) 1983-12-20 1984-12-19 Sectioning apparatus and method for optical character reader systems
DE8484115985T DE3486104T2 (en) 1983-12-20 1984-12-20 SEPARATOR AND METHOD FOR OPTICAL CHARACTER READING DEVICES.
DE3486241T DE3486241T2 (en) 1983-12-20 1984-12-20 Device and method for character spacing for optical character recognition systems.
EP91100048A EP0428499B1 (en) 1983-12-20 1984-12-20 Character pitch detector apparatus and method for optical character reader systems
EP84115985A EP0146147B1 (en) 1983-12-20 1984-12-20 Sectioning apparatus and method for optical character reader system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP59020300A JPS60164879A (en) 1984-02-07 1984-02-07 Character separating device

Publications (1)

Publication Number Publication Date
JPS60164879A true JPS60164879A (en) 1985-08-27

Family

ID=12023297

Family Applications (1)

Application Number Title Priority Date Filing Date
JP59020300A Pending JPS60164879A (en) 1983-12-20 1984-02-07 Character separating device

Country Status (1)

Country Link
JP (1) JPS60164879A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0326427A (en) * 1989-06-23 1991-02-05 Honda Motor Co Ltd Supply of cylinder parts and its device
JPH0368431A (en) * 1989-08-08 1991-03-25 Asahi Chem Ind Co Ltd Bridged cellulose laminated semipermeable film and its production
JPH0468669A (en) * 1990-07-03 1992-03-04 Matsushita Electric Ind Co Ltd Pll circuit

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0326427A (en) * 1989-06-23 1991-02-05 Honda Motor Co Ltd Supply of cylinder parts and its device
JPH0368431A (en) * 1989-08-08 1991-03-25 Asahi Chem Ind Co Ltd Bridged cellulose laminated semipermeable film and its production
JPH0468669A (en) * 1990-07-03 1992-03-04 Matsushita Electric Ind Co Ltd Pll circuit

Similar Documents

Publication Publication Date Title
US4594732A (en) Letter pitch detection system
US4411015A (en) Method and apparatus for automatic recognition of image and text/graphics areas on a master
US5841902A (en) System and method for unconstrained on-line alpha-numerical handwriting recognition
EP1016033B1 (en) Automatic language identification system for multilingual optical character recognition
US4635290A (en) Sectioning apparatus and method for optical character reader systems
US9047655B2 (en) Computer vision-based methods for enhanced JBIG2 and generic bitonal compression
EP0436819B1 (en) Handwriting recognition employing pairwise discriminant measures
KR100383858B1 (en) Character extracting method and device
JPS60164879A (en) Character separating device
JPH0430070B2 (en)
JPH0660224A (en) Optical character reader
Mahastama et al. Improving Projection Profile for Segmenting Characters from Javanese Manuscripts
JPS60164878A (en) Device for detecting character pitch
JPH0365584B2 (en)
JPH0468669B2 (en)
JP2630261B2 (en) Character recognition device
Yoo et al. Information extraction from a skewed form document in the presence of crossing characters
JPH0259502B2 (en)
JPH03126188A (en) Character recognizing device
JP2859307B2 (en) Character extraction device
JPH0471232B2 (en)
JP4231476B2 (en) An image processing method for collating an input image inputted with a printed matter or a sticker label of a container with a master image registered in advance
JPS58214969A (en) Character reading device
JPH05114047A (en) Device for segmenting character
JP2778436B2 (en) Character segmentation device