JPS6048582A - Character cutting-out method of character recognizer - Google Patents

Character cutting-out method of character recognizer

Info

Publication number
JPS6048582A
JPS6048582A JP58155415A JP15541583A JPS6048582A JP S6048582 A JPS6048582 A JP S6048582A JP 58155415 A JP58155415 A JP 58155415A JP 15541583 A JP15541583 A JP 15541583A JP S6048582 A JPS6048582 A JP S6048582A
Authority
JP
Japan
Prior art keywords
character
cutting
pattern
circuit
observation pattern
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP58155415A
Other languages
Japanese (ja)
Inventor
Yoshihisa Fujii
敬久 藤井
▲はい▼ 東善
Touzen Hai
Yukikazu Kaburayama
蕪山 幸和
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd filed Critical Fujitsu Ltd
Priority to JP58155415A priority Critical patent/JPS6048582A/en
Publication of JPS6048582A publication Critical patent/JPS6048582A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To improve character cutting-out accuracy by dividing again a divided observation pattern in the direction of a character writing row in response to the projection distribution of each divided observation pattern and therefore cutting-out a character by division and redivision. CONSTITUTION:The projection distribution obtained by a projection distribution calculating circuit 2 is fed to a cutting-out line detecting circuit 4. Then the areas having smaller value than the threshold value stored in a register 3 are detected to deliver the 1st character cutting-out line to divide an observation pattern in the character writing row direction. The output of the circuit 4 is supplied to a cutting-out line detecting circuit 6 via a mean value calculating circuit 5 to deliver the 2nd character cutting-out line to divide again the divided observation pattern in the character writing row direction. In addition, the cutting-out lines to be deleted and added are decided by the character cutting-out line detecting circuits 8 and 9. The outputs of these circuits 4, 6, 8 and 9 are supplied to a cutting-out circuit 10 to cut-out the observation pattern.

Description

【発明の詳細な説明】 (a)発明の技術分野 本発明は文字認識方法に係り、とくにフリーフォーマン
ト帳票を用いる場合の文字認識装置の文字切出し方法に
関する。
DETAILED DESCRIPTION OF THE INVENTION (a) Technical Field of the Invention The present invention relates to a character recognition method, and more particularly to a character segmentation method for a character recognition device when using a free-formant form.

(b)技術の背景 文字認識装置は2通常、観測01;として光電変換走査
装置などを備え、帳票(文字記入シート)上に記入され
た文字行の中から一文字分ずつの文字パターンを観測し
て量子化データによって表される観測パターンに変換し
、あるいは帳票上に記入された一行分の文字パターンを
観測し量子化信号によって表される観測パターンに変換
したのちその中から一文字分すフの観測パターンを取り
出し。
(b) Background of the technology Character recognition devices are usually equipped with a photoelectric conversion scanning device, etc. as observation 01; and observe the character pattern of each character from a line of characters written on a form (character entry sheet). Convert it into an observed pattern represented by quantized data, or observe a character pattern for one line written on a form and convert it into an observed pattern represented by a quantized signal, and then convert it into an observed pattern represented by a quantized signal. Extract the observed pattern.

−文字分ずつ観測パターンの認識を行う。この−文字分
ずつの文字パターンを観測あるいは一文字分ずつの観測
パターンを取り出すことを切出しまたはセグメンテーシ
ョンと称し、この切出しを容易にするため、これまで、
帳票」二に一文字毎に文字記入枠を設けていた。
- Recognize observation patterns character by character. Observing the character pattern for each character or extracting the observed pattern for each character is called extraction or segmentation, and in order to facilitate this extraction, until now,
A space was provided for each character on the second page of the form.

しかし最近では、一般にフリーフメーマ・ノド帳票と称
される帳票、ずなわら文字記入間隔を指定する文字枠を
設けず文字記人行のみを設けた帳票が用いられるように
なった。
However, recently, forms commonly referred to as free-memory forms have come into use, forms in which only character lines are provided without character frames for specifying the spacing between Zunawara characters.

(C)従来技術と問題点 フリ−フォーマノ1〜帳票を使用する方式の文字認識装
置においては、帳票−ヒに記入された一行分の文字パタ
ーンを観測したのらに、その中から一文字分に相当する
観測パターンを切り出して認識を行うのであるが、観測
パターンの切り出しは。
(C) Prior art and problems In character recognition devices that use free-formano 1 to form, after observing the character pattern for one line written on the form, Recognition is performed by cutting out the corresponding observed pattern, but cutting out the observed pattern is difficult.

従来2次のようにして行われていた。Conventionally, this was done in a secondary manner.

ずなわら、第1図に例示するように2文字行に平行な軸
(Y輔)に関する文字パターン形成画素(通常は黒画素
)の投影分布、すなわちY軸に関する黒画素のヒストグ
ラムをめ、まず最初に8黒画素の投影分布の値の小さい
個所に↓印kO・kl・k2・k3・・・・・klにて
示すような仮の文字切り出し線を付し、これらの仮の文
字切り出し線の間隔の平均値をめる。
First, as shown in Fig. 1, we first calculate the projection distribution of character pattern forming pixels (usually black pixels) with respect to the axis parallel to two character lines (Y axis), that is, the histogram of black pixels with respect to the Y axis. First, temporary character cutting lines as shown by ↓ marks kO, kl, k2, k3...kl are attached to the parts where the value of the projection distribution of 8 black pixels is small, and these temporary character cutting lines are Calculate the average value of the intervals.

次に、前記仮の文字切り出し線の各間隔と前記平均値と
を比狡し、前記仮の文字すJり出し線の一部の削1徐(
k4)あるいは補充(口1およびn2)を施すことによ
って文字切り出し位置を決定し2−ζいた。
Next, each interval of the temporary character cutting line and the average value are compared, and a part of the temporary character cutting line is removed (
The character cutting position was determined by applying k4) or replenishment (mouth 1 and n2) and 2-ζ.

ところが、隣接する文字が相J1に入り組んでいる場所
が文字行中に多数ある場合6.−c;I止MYな文字切
り出し位16を得ることか非當に困ゲ1fであった。
However, if there are many places in the character line where adjacent characters have complicated phase J1, 6. -c; It was extremely difficult to obtain the desired character cutout position of 16.

(d)発明の目的 本発明の目的は、フリーフォーマノ1−帳票を使用する
方式の文字認識装置におりる文字りJり出し2精度を高
めるごとにある。
(d) Object of the Invention An object of the present invention is to improve the precision of character recognition in a character recognition device using a free form character form.

(e)発明の構成 本発明になる文字認識装置の文字切出し方法は。(e) Structure of the invention What is the character extraction method of the character recognition device according to the present invention?

文字記入間隔を指定−1,4′J’文字記人行のめを指
定ずろ帳票に記入された文字を認識する文字認識装置に
おい′乙前記帳票の文字証人?−iに記入された文字行
を観測して得られる観測パターンの前記文字記人行に関
する投影分布によゲこ前記観測パターンを前記文字記入
行方向に分割し、前記分割によって御られる分割観/J
lllパターン旬の前記投影分布によって該分割観測パ
ターンを前記文字記入行方向に再分割し、前記分割と前
記再分割とによってniI記観ll1tl+パターンの
切り出しを行うものである。
Specify the character entry interval - 1,4'J' Specify the character entry line.A character recognition device that recognizes characters written on the form. - Divide the observation pattern in the direction of the character entry line and control the division view by the division/J
The divided observation pattern is subdivided in the character entry line direction according to the projection distribution of the llll pattern, and the niIkiviewllltl+ pattern is cut out by the division and the redivision.

(f)発明の実施例 以1・2本発明の要旨を実施例によって具体的に説明す
る。
(f) Examples of the Invention 1 and 2 The gist of the present invention will be specifically explained by examples.

第2図は本発明一実施例の構成をボずブI’、+ 7り
図であり1図においζ1はフリーフオーーンノト帳票−
」−の火字記入行に記入された文字行を観測し前記文字
Lr毎の観測パターンを得る観/1i11部、2は観測
部1によって得られる観測パターンの11ii記文字記
人1]に関する投影分布を得る投影分布d1′#−回路
Figure 2 is a diagram showing the configuration of one embodiment of the present invention.
`` - Observing the character line written in the character entry line and obtaining the observation pattern for each character Lr/1i11 part, 2 is the projection of the observation pattern obtained by observation part 1 in 11ii character recorder 1] Projection distribution d1'#-circuit to obtain the distribution.

3は後記第一の切出し7線検出回路4において用いるし
きい値を記憶するレジスタ、4は投影夕″14J01算
回路2によゲで得られる投影分布からレジスタ3に記憶
するしきい値より小ざい個Jす「を検出することによっ
て、前記観測パターンを前記文字記入方向に分^りする
ために伺ず第一の文字切出し綿を得る第一のリノ出し線
検出回路、54;l第一の切出し7線検出回路4によっ
て得られる第一の文字切出し線によって前記観測パター
ンを前記文字記入41方向に分割して得られる分;’;
+J観測バ観測バク−曲毎投影分布の平均値を得る平均
値計算回路、6は前記分割観測パターンの投影分布から
、前記分割観測パターン毎の投影分4jの平均値に1よ
り小さい所定数を乗することによって得られるしきい値
より小さいf1^(所を検出することによって、該分割
観測パターンを前記文字記入行方向に再分割するために
(jず第二の切出し線を得る第二の切出し線検出回路、
7は前記第一の切出し線および前記第二の切出し線によ
っ゛ζ切出される各分割観測パターンの間隔の平均イ1
αを得る平均値61洲回路、8は前記第一の切出し線お
よび前記第二の切出し線によって切出される各分割観測
パターンの間隔と平均値計算回路7によっζ得られる1
11均値とを比鮫し。
3 is a register that stores a threshold value used in the first cut-out 7-line detection circuit 4, which will be described later; a first lino drawing line detection circuit for dividing the observed pattern in the character writing direction by detecting the first character cutting line; 54; l first; The portion obtained by dividing the observed pattern in the character entry 41 direction by the first character cutting line obtained by the cutting 7 line detection circuit 4;';
+J observation bar observation bag - an average value calculation circuit that obtains the average value of the projection distribution for each song; 6 is a circuit that calculates a predetermined number smaller than 1 from the projection distribution of the divided observation pattern to the average value of the projection portion 4j for each divided observation pattern In order to redivide the divided observation pattern in the character entry line direction by detecting f1^(, which is smaller than the threshold value obtained by multiplying Cutout line detection circuit,
7 is the average interval of each divided observation pattern cut out by the first cutting line and the second cutting line.
The average value 61 circuit for obtaining α, 8 is the interval of each divided observation pattern cut out by the first cutting line and the second cutting line, and ζ obtained by the average value calculation circuit 7.
Compare with 11 average value.

前記第一の切出し線および前記第二の切出し線の中から
削除すべき切出し線を決定する第三の切出し線検出回路
、9はiiJ記第−・の切出し線および前記第二の切出
し線によって切出される各分IGII観測パターンの間
隔と平均値計算回路7によって1牙られる平均値とを比
軸し、前記第=−の切出し線および前記第二、の切出し
線に追加すべき切出し線を決定する第四の切出し線検出
回路210は第一の切出し線検出回路4・第二の切出し
線検出回路ら 第三の切出し線検出回路8および第四の
切出し線検出回路9によって検出または決定される切出
し線によって前記観測パターンを前記文字記入行方向に
分割する切出し回路、11は切出し回路10によって切
出された分割観測パターン毎の特徴を抽出する特徴抽出
回路、12は文字種毎の杼〆(1′−特徴を記1aする
辞書メモリ、13は特徴抽出回路11によって得られる
分割観測パターンの特徴を辞書メモ1月2と照合するご
とによって前記分割観測バクーンの認識を行う認識回路
である。
A third cutting line detection circuit 9 determines a cutting line to be deleted from among the first cutting line and the second cutting line; The interval between the IGII observation patterns to be cut out and the average value calculated by the average value calculation circuit 7 are compared, and the cutting line to be added to the =-th cutting line and the second cutting line is calculated. The fourth cutout line detection circuit 210 to be determined is detected or determined by the first cutout line detection circuit 4, the second cutout line detection circuit, the third cutout line detection circuit 8, and the fourth cutout line detection circuit 9. 11 is a feature extraction circuit that extracts features for each divided observation pattern cut out by the cutting circuit 10; 12 is a shuttle finisher for each character type; 1' is a dictionary memory for recording features 1a; 13 is a recognition circuit that recognizes the divided observation pattern by comparing the features of the divided observation pattern obtained by the feature extraction circuit 11 with the dictionary memo 1/2;

ずなわら1 まず第一の切出し線検出回路4ては。Zunawara 1 First, let's look at the first cutting line detection circuit 4.

第3図ta+に↓印によって示すように黒画素の投影分
布の値が所定のしきい値より小さい個所に幻する第一の
切出し線として1(0・kl・k2・・・・klが得ら
れる。
As shown by the ↓ mark in Fig. 3 ta+, 1 (0, kl, k2...kl is obtained as the first cutting line that appears at the point where the value of the projection distribution of black pixels is smaller than a predetermined threshold value). It will be done.

続いて第二の切出し線検出回路6では、第3図(blの
ように、 kOとに1との間、 k2とに3との間およ
びに6とに7との間に、それぞれ、第二の切出し線とし
てml・m2およびm3が得られる。なを、このとき。
Next, in the second cutting line detection circuit 6, as shown in FIG. ml・m2 and m3 are obtained as the second cutting line.At this time.

klとに2との間およびに5と1(6との間には第二の
切出し線が得られない。
The second cutting line cannot be obtained between kl and 2 and between 5 and 1 (6).

k]とに2との間およびに5とに6との間に第二の切出
し線が得られない理由は、第一の切出し線によつ得られ
る分割観測パターン毎の投影分布の平均値に1より小さ
い所定数を乗することによって得られるしきい値を用い
ることによるものである。
The reason why the second cutting line cannot be obtained between 2 and 2 and between 5 and 6 is that the average value of the projection distribution for each divided observation pattern obtained by the first cutting line This is by using a threshold value obtained by multiplying by a predetermined number smaller than 1.

ごのあと第三の切出し線検出回路8では、第3図(C1
のように、 k4が削除すべき切出し線として決定され
、また第四の切出し線検出回路9ではに1とに2との間
およびに5とに6との間に、それぞれ、追加すべき切出
し線n1およびn2が決定される。
After that, in the third cutting line detection circuit 8, as shown in FIG.
As shown in FIG. Lines n1 and n2 are determined.

上記実施例によれば、−律なしきい(i#jを用いて切
出し線をめるのみならず、第二の切出し線検出回路6に
よって1分割観測パターンfυの投影分布の平均値から
得られるしきい値を用いて切出し線をめたのちに、各分
割観測パターンの間隔の平均値を用いて切出し線の削除
あるいは追加を行っており、したがって止面な文字切り
出し位置を得ることができる。
According to the above embodiment, the cutout line is not only determined using an arbitrary threshold (i#j) but also obtained from the average value of the projection distribution of the 1-segment observation pattern fυ by the second cutout line detection circuit 6. After the cutting line is set using a threshold value, the cutting line is deleted or added using the average value of the interval between each divided observation pattern, so that a uniform character cutting position can be obtained.

(g)発明の詳細 な説明したように9本発明によれはフリーフメーマノト
帳票を使用する方式の文字認識精度°におりる文字りJ
り出し精度を向上し文字認識精度をit’liめること
かできる。
(g) As described in the detailed description of the invention, the present invention improves the character recognition accuracy of the method using a free form.
It is possible to improve the extraction accuracy and improve the character recognition accuracy.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は従来例の説明図、第2図は本発明一実施例の構
成を示すブロック図、第3図は前記実施例の説明図であ
る。 図中、1は観測部、2は投影分布計算回路、4ば第一の
りJ出し線検出回路、5は平均値計算回路。 6は第二の切出し線検出回I烙、10は切出し回路であ
る。 芥 2 町 ! 授チ妙 2 蟹、J!、l #P f県訂路 切本り嶽 t 1″″′7y 験、1:、回路 −1;ヨ)ロワイm 、。 gTch巨:[路 エフ7本しぎK ど 二r@ ≦C5櫂111j路 平均イ1 に1発 2 ? 、q M’3:* 、*JL H粘j4 恥萌 昼 蕎、1.j // /θ 将軟 炉職回路 ri’lct’Bk 、、上、、。 水了唄 (α) ト′ 2/ ト−
FIG. 1 is an explanatory diagram of a conventional example, FIG. 2 is a block diagram showing the configuration of an embodiment of the present invention, and FIG. 3 is an explanatory diagram of the embodiment. In the figure, 1 is an observation unit, 2 is a projection distribution calculation circuit, 4 is a first slope J outline detection circuit, and 5 is an average value calculation circuit. 6 is a second cutting line detection circuit I, and 10 is a cutting circuit. 2 towns! Juchi Tae 2 Crab, J! , l #P f prefectural correction route Kirimoto Ritake t 1'''''7y Experiment, 1:, Circuit-1; yo) Roy m,. gTch huge: [RoF7honshigiK Dojir@≦C5 paddle 111j Road average I1 1 shot 2? , q M'3: * , * JL H sticky j4 shame moe lunch soba, 1. j // /θ Shosoft Furnace Job Circuit ri'lct'Bk,, upper,,. Suiryouuta (α) To' 2/ To

Claims (2)

【特許請求の範囲】[Claims] (1)文字記入間隔を指定せず文字記入行のみを指定す
る帳票に記入された文字を認識する文字認識装置におい
て、前記帳票の文字記入行に記入された文字行を観1j
ii l、て得られる観/1tllパターンの前記文字
記入行に関する投影分布によって前記観測パターンを前
記文字記入行方向に分割し、前記分割によって得られる
分割観Δ111バクーン旬の前記投影分布によって該分
割観測パターンを前記文字記入行方向に再分割し、前記
分割と前記再分割とによって前記観測パターンの切り出
しを行うことを特徴とする文字認識装置の文字切出し方
法。
(1) In a character recognition device that recognizes characters written on a form that specifies only the character entry line without specifying the character entry interval, the character recognition device recognizes the characters written in the character entry line of the form.
ii) Divide the observation pattern in the direction of the character entry line using the projection distribution for the character entry line of the view/1tll pattern obtained by dividing the divided observation pattern using the projection distribution of the divided view Δ111 Bakun Jun obtained by the division. A character cutting method for a character recognition device, characterized in that the pattern is subdivided in the character writing line direction, and the observed pattern is cut out by the dividing and the re-dividing.
(2)前記分割観測パターンの再分割は該分割観測パタ
ーン毎の前記投影分布の平均値から得られるしきい値に
よって行うことを特徴とする特許請求の範囲第1項記載
の文字認識装置の文字切出し方法。
(2) The character recognition device according to claim 1, wherein the subdivision of the divided observation pattern is performed using a threshold value obtained from the average value of the projection distribution for each divided observation pattern. Cutting method.
JP58155415A 1983-08-25 1983-08-25 Character cutting-out method of character recognizer Pending JPS6048582A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP58155415A JPS6048582A (en) 1983-08-25 1983-08-25 Character cutting-out method of character recognizer

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP58155415A JPS6048582A (en) 1983-08-25 1983-08-25 Character cutting-out method of character recognizer

Publications (1)

Publication Number Publication Date
JPS6048582A true JPS6048582A (en) 1985-03-16

Family

ID=15605493

Family Applications (1)

Application Number Title Priority Date Filing Date
JP58155415A Pending JPS6048582A (en) 1983-08-25 1983-08-25 Character cutting-out method of character recognizer

Country Status (1)

Country Link
JP (1) JPS6048582A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH01196685A (en) * 1988-02-01 1989-08-08 Fuji Electric Co Ltd Method for detecting character
JPH0231286A (en) * 1988-07-21 1990-02-01 Fuji Electric Co Ltd Detecting method for special character row
JPH02255995A (en) * 1988-04-28 1990-10-16 Seiko Epson Corp Character segmenting method
US5275672A (en) * 1990-11-10 1994-01-04 Wilkinson Sword Gesellschaft Mit Beschrankter Haftung Razor blade steel having high corrosion resistance and differential residual austenite content
JPH07319998A (en) * 1988-04-28 1995-12-08 Seiko Epson Corp Method for segmenting character

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH01196685A (en) * 1988-02-01 1989-08-08 Fuji Electric Co Ltd Method for detecting character
JPH02255995A (en) * 1988-04-28 1990-10-16 Seiko Epson Corp Character segmenting method
JPH07319998A (en) * 1988-04-28 1995-12-08 Seiko Epson Corp Method for segmenting character
JP2570415B2 (en) * 1988-04-28 1997-01-08 セイコーエプソン株式会社 Character extraction method
JPH0231286A (en) * 1988-07-21 1990-02-01 Fuji Electric Co Ltd Detecting method for special character row
US5275672A (en) * 1990-11-10 1994-01-04 Wilkinson Sword Gesellschaft Mit Beschrankter Haftung Razor blade steel having high corrosion resistance and differential residual austenite content

Similar Documents

Publication Publication Date Title
US4204193A (en) Adaptive alignment for pattern recognition system
JPH06504866A (en) Survey scanning system with expandable answer mark area for efficient scanning and mark detection
Carter et al. Automatic recognition of printed music
JPS59216285A (en) Character recognizing device
JPS6048582A (en) Character cutting-out method of character recognizer
CN106056575A (en) Image matching method based on object similarity recommended algorithm
JPH05159099A (en) Slip character recognition area specification method and slip in optical character recognition device
JPS6310472B2 (en)
JP3552196B2 (en) Print control device, print control method, and recording medium
JPS58134748A (en) Settling system for letter data compression block
JPH01249482A (en) Data output device
CN111161247B (en) Detection method for variable code reading character quality verification
US4466191A (en) Method for creating a plat from a metes and bounds property description
JPS63101983A (en) Character string extracting system
Clarke et al. Recognizing musical text
JPH02178765A (en) Document preparing device
Sanei et al. Fuzzy detection of craniofacial landmarks
JPS6195483A (en) Character recognition method
Drevin et al. Determining the skew and scale in images of Compton-Wollan-Bennett ionization chamber recordings
JPS63208990A (en) Character pattern segmenting device
JPS5533254A (en) Skew compensation circuit
CN112560836A (en) Component identification method and device and computer readable storage medium
JPS62223621A (en) Estimating method for flow axis of ocean current
JPS63205510A (en) Water depth surveying device
JPS6278690A (en) Character recognizing device