JPH0330191B2 - - Google Patents

Info

Publication number
JPH0330191B2
JPH0330191B2 JP59057420A JP5742084A JPH0330191B2 JP H0330191 B2 JPH0330191 B2 JP H0330191B2 JP 59057420 A JP59057420 A JP 59057420A JP 5742084 A JP5742084 A JP 5742084A JP H0330191 B2 JPH0330191 B2 JP H0330191B2
Authority
JP
Japan
Prior art keywords
character
character type
dictionary
handwritten
characters
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired
Application number
JP59057420A
Other languages
Japanese (ja)
Other versions
JPS60201486A (en
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed filed Critical
Priority to JP59057420A priority Critical patent/JPS60201486A/en
Publication of JPS60201486A publication Critical patent/JPS60201486A/en
Publication of JPH0330191B2 publication Critical patent/JPH0330191B2/ja
Granted legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/10Image acquisition

Description

【発明の詳細な説明】 (技術分野) 本発明は高速で精度の良い手書文書の読取方法
に関するものである。
DETAILED DESCRIPTION OF THE INVENTION (Technical Field) The present invention relates to a method for reading handwritten documents at high speed and with high accuracy.

(背景技術) これまでに手書文書の読取方法として記入した
文字の文字種を指定する記入枠内の指定の有無を
検出し、指定を検出したカラムの文字の読取を指
定された文字種の辞書のみを参照して行う方法が
提案されている。しかしながら、この方法では、
文字種の指定がなかつた場合、全ての辞書を参照
しなければならず処理速度が遅くなるという問題
があつた。
(Background technology) As a method for reading handwritten documents, the presence or absence of a specification in a writing frame that specifies the character type of the written character is detected, and the characters in the column where the specification is detected are read only in a dictionary of the specified character type. A method has been proposed that refers to However, with this method,
If a character type is not specified, all dictionaries must be referenced, resulting in a slow processing speed.

(発明の目的および概要) 本発明の目的は従来の技術の上記欠点を改善し
て高速で精度のよい手書文書の読取方法を提供す
ることにあり、その特徴は、文字の文字線量を検
出しそれを当該文字の複雑度とし、字種の指定が
なかつた場合、その複雑度により当該文字の含ま
れる文字種の辞書を選択して識別を行うことにあ
る。
(Objective and Summary of the Invention) An object of the present invention is to improve the above-mentioned drawbacks of the conventional technology and provide a high-speed and highly accurate handwritten document reading method. This is taken as the complexity of the character, and if a character type is not specified, the character is identified by selecting a dictionary of the character type that includes the character based on the complexity.

(発明の実施例) 第1図は本発明による手書日本語文書読取方法
の一実施例を示す構成図である。図において、1
は光電変換部、2はパタンレジスタ、3は特徴抽
出部、4は文字線量検出部、5は文字種検出部、
6は識別部、7は文字名出力、8はひらがな辞書
メモリ、9はカタカナ辞書メモリ、10は英字数
字記号辞書メモリ、11は漢字辞書メモリであ
る。
(Embodiment of the Invention) FIG. 1 is a block diagram showing an embodiment of a handwritten Japanese document reading method according to the present invention. In the figure, 1
is a photoelectric conversion unit, 2 is a pattern register, 3 is a feature extraction unit, 4 is a character dose detection unit, 5 is a character type detection unit,
6 is an identification unit, 7 is a character name output, 8 is a hiragana dictionary memory, 9 is a katakana dictionary memory, 10 is an alphanumeric symbol dictionary memory, and 11 is a kanji dictionary memory.

また、第2図は本実施例に使用した帳票例を示
す図で、21は帳票、22は文字種指定記入枠
で、その中の23は漢字指定欄、24はひらがな
指定欄、25はカタカナ指定欄、26は英字数字
記号指定欄、27は文字記入枠、28は文字種指
定記入行、29は文字記入行で、例えば第3図の
帳票記入例のように記入しておく。
Fig. 2 is a diagram showing an example of a form used in this example, where 21 is a form, 22 is a character type specification entry frame, 23 is a kanji specification field, 24 is a hiragana specification field, and 25 is a katakana specification field. 26 is an alphanumeric symbol designation field, 27 is a character entry frame, 28 is a character type designation entry line, and 29 is a character entry line. For example, entries are made as in the form entry example shown in FIG. 3.

以下、この帳票例を用いて本発明の動作を次に
説明する。
The operation of the present invention will be explained below using this example of the form.

まず第2図の帳票21の文字種指定記入行28
の行について文字記入行29の各文字に対応した
字種指定の有無を検出し、識別部6へ字種指定の
有無を送出する。その動作は光電変換部1により
文字種指定記入行28について光電変換を行ない
2値の量子化された電気信号に変換し、1文字分
の領域を切り出してパタンレジスタ2に格納す
る。文字種指定検出部5はパタンレジスタ2を文
字種指定記入枠22に対応する様に4個の領域に
分割し、各領域内の黒点数(文字線部を黒点とす
る。)を計数し、閾値と比較してそれぞれの文字
種の指定の有無を検出し、前記文字種即ち漢字、
ひらがな、カタカナ、記号等の指定の有無を文字
種検出部5内の文字種指定メモリに格納する。以
上の動作により文字記入行29の各文字に対応し
た字種指定の有無を検出する。次に、第2図の文
字記入行29の読取りを行なう。その動作は光電
変換部1により文字記入行29について光電変換
を行ない、2値の量子化した電気信号に変換し、
1文字分の領域を切出してパタンレジスタ2に格
納する。特徴抽出部3はパタンレジスタ2内の文
字パタンより各種特徴を抽出し、該特徴を識別部
6へ送出する。
First, enter line 28 for character type specification in form 21 in Figure 2.
The presence or absence of character type designation corresponding to each character in the character entry line 29 is detected for the line , and the presence or absence of character type designation is sent to the identification unit 6. In this operation, the photoelectric conversion unit 1 performs photoelectric conversion on the character type designation entry line 28 to convert it into a binary quantized electric signal, cuts out an area for one character, and stores it in the pattern register 2. The character type designation detection unit 5 divides the pattern register 2 into four areas corresponding to the character type designation entry frame 22, counts the number of black dots in each area (character line parts are black dots), and calculates the threshold value. The comparison is made to detect whether or not each character type is designated, and the character type, that is, kanji,
The presence or absence of designation of hiragana, katakana, symbol, etc. is stored in the character type designation memory in the character type detection unit 5. Through the above operations, it is detected whether or not a character type has been specified corresponding to each character in the character entry line 29. Next, the character entry line 29 in FIG. 2 is read. The operation is such that the photoelectric conversion unit 1 performs photoelectric conversion on the character entry line 29, converting it into a binary quantized electric signal,
An area corresponding to one character is cut out and stored in the pattern register 2. The feature extraction section 3 extracts various features from the character patterns in the pattern register 2 and sends the features to the identification section 6.

同時に文字線量検出部4ではパタンレジスタ2
内の文字パタンより文字線量を検出して文字の大
きさで正規化することにより文字の複雑度Dとす
る。複雑度は次式によつて表わされる。
At the same time, the pattern register 2 in the character dose detection unit 4
The character complexity level D is determined by detecting the character radiation dose from the character pattern within and normalizing it by the character size. The complexity is expressed by the following equation.

D=A×K/WL×(PB+PR) 但しKはDを整数化するための定数、Aは文字
枠内の全黒点数、PBは文字の外接枠のうち高さ
方向の大きさ、同様にPRは幅方向の大きさを示
すものである。WLは文字の線幅で次式によつて
求める。
D=A×K/WL×(PB+PR) However, K is a constant to convert D into an integer, A is the total number of black dots in the character frame, PB is the size of the circumscribed frame of the character in the height direction, and similarly PR indicates the size in the width direction. WL is the line width of the character and is calculated using the following formula.

WL=A/A−Q 但しQは、文字枠内を2×2の窓で全点観測
し、4点とも黒点である個数を表わす。
WL=A/A-Q However, Q represents the number of points in which all four points are black points when observing all points within the character frame using a 2×2 window.

文字の複雑度Dが検出されたら、文字種指定検
出部5へ複雑度Dを送出する。文字種指定検出部
5では前記文字種指定メモリを順次参照し前記文
字種指定を識別部6へ送出し文字種指定領域で第
3図の22に示すごとく文字種指定が検出できな
かつた場合、前記複雑度Dを用い以下の条件を判
定し、文字種を決定し識別部6へ送出する。
When the complexity level D of the character is detected, the complexity level D is sent to the character type designation detection section 5. The character type designation detection unit 5 sequentially refers to the character type designation memory and sends the character type designation to the identification unit 6. If the character type designation cannot be detected in the character type designation area as shown at 22 in FIG. The following conditions are determined, the character type is determined, and the character type is sent to the identification unit 6.

D<a 全ての辞書を参照する。D<a See all dictionaries.

D≧a 字種は漢字であるとし漢字の辞書を参照
する。
D≧a The character type is assumed to be a kanji, and a kanji dictionary is referred to.

但し本実施例においてはa=10、K=5とし
た。
However, in this example, a=10 and K=5.

識別部6は特徴抽出部3より送出された特徴と
辞書とを照合し、最終的に1文字のカラゴリ名を
文字名出力7へ出力する。
The identification unit 6 compares the features sent from the feature extraction unit 3 with a dictionary, and finally outputs a one-character color name to the character name output 7.

識別部6において使用する辞書メモリは、ひら
がな辞書メモリ8、カタカナ辞書メモリ9、英字
数字記号辞書メモリ10及び漢字辞書メモリ11
の4種が用意されているが、前記特徴抽出部3よ
り送出された特徴と辞書との照合は、前記あらか
じめ各文字に対応する字種指定があつた文字種の
辞書メモリを使用して行う。
The dictionary memories used in the identification unit 6 include a hiragana dictionary memory 8, a katakana dictionary memory 9, an alphanumeric symbol dictionary memory 10, and a kanji dictionary memory 11.
Four types are prepared, and the features sent from the feature extraction section 3 are compared with the dictionary using the dictionary memory of the character types in which the character types corresponding to each character have been designated in advance.

(発明の効果) 本発明は以上詳細に説明したようにあらかじめ
字種指定の検出を行い、前記指定のない文字につ
いては、文字の文字線量を検出して、字種の選択
を行い字種に適した辞書により文字の識別を行つ
ているので高速で精度の高い読取が出来、従つて
高速で精度の良い手書日本語文書の読取が可能と
なる効果がある。
(Effects of the Invention) As described above in detail, the present invention detects the character type designation in advance, and for characters without the above designation, the character type is selected by detecting the character dose of the character. Since characters are identified using a suitable dictionary, reading can be performed at high speed and with high precision, and therefore, handwritten Japanese documents can be read at high speed and with high precision.

【図面の簡単な説明】[Brief explanation of drawings]

第1図は本発明による手書文書読取方法の一実
施例を示す構成図、第2図は本発明の実施例で使
用した帳票例を示す図、第3図はその帳票記入例
を示す図である。 1……光電変換部、2……パタンレジスタ、3
……特徴抽出部、4……文字線量検出部、5……
文字種指定検出部、6……識別部、7……文字名
出力、8……ひらがな辞書メモリ、9……カタカ
ナ辞書メモリ、10……英字数字記号辞書メモ
リ、11……漢字辞書メモリ、21……帳票、2
2……文字種指定記入枠、23……ひらがな指定
欄、24……カタカナ指定欄、25……英字数字
記号指定欄、26……漢字指定欄、27……文字
記入枠、28……文字種指定記入行、29……文
字記入行。
FIG. 1 is a block diagram showing an embodiment of the handwritten document reading method according to the present invention, FIG. 2 is a diagram showing an example of a form used in the embodiment of the present invention, and FIG. 3 is a diagram showing an example of filling out the form. It is. 1...Photoelectric conversion unit, 2...Pattern register, 3
...Feature extraction unit, 4...Character dose detection unit, 5...
Character type designation detection unit, 6...Identification unit, 7...Character name output, 8...Hiragana dictionary memory, 9...Katakana dictionary memory, 10...Alphabet/numeric symbol dictionary memory, 11...Kanji dictionary memory, 21... …Form, 2
2...Character type specification entry box, 23...Hiragana specification field, 24...Katakana specification field, 25...Alphabet, numeric symbol specification field, 26...Kanji specification field, 27...Character entry box, 28...Character type specification Entry line, 29...Character entry line.

Claims (1)

【特許請求の範囲】 1 手書き日本語文書において文字を記入する文
字枠と、該文字枠の近傍にもうけられ文字種を指
定する文字種指定領域を有し、文字種指定領域で
指定された文字種の辞書により文字枠に記入され
た手書文字を認識する手書文書読取方法におい
て、文字種の指定がない場合に、手書文字の文字
線量を文字の複雑度として求め、該複雑度に対応
する文字種の辞書を選択し、該選択された辞書に
より手書文字を認識することを特徴とする手書文
書読取方法。 2 前記辞書がひらがな、かたかな、英数字、及
び漢字に対し各々もうけられることを特徴とする
特許請求の範囲第1項記載の手書文書読取方法。
[Scope of Claims] 1. A handwritten Japanese document has a character frame in which characters are written, and a character type specification area provided near the character frame to specify a character type, and a dictionary of the character type specified in the character type specification area. In a handwritten document reading method that recognizes handwritten characters written in a character frame, when the character type is not specified, the character dose of the handwritten character is determined as the complexity of the character, and a dictionary of character types corresponding to the complexity is obtained. 1. A handwritten document reading method comprising: selecting a dictionary; and recognizing handwritten characters using the selected dictionary. 2. The handwritten document reading method according to claim 1, wherein the dictionary is created for hiragana, katakana, alphanumeric characters, and kanji, respectively.
JP59057420A 1984-03-27 1984-03-27 Handwritten document reading method Granted JPS60201486A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP59057420A JPS60201486A (en) 1984-03-27 1984-03-27 Handwritten document reading method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP59057420A JPS60201486A (en) 1984-03-27 1984-03-27 Handwritten document reading method

Publications (2)

Publication Number Publication Date
JPS60201486A JPS60201486A (en) 1985-10-11
JPH0330191B2 true JPH0330191B2 (en) 1991-04-26

Family

ID=13055154

Family Applications (1)

Application Number Title Priority Date Filing Date
JP59057420A Granted JPS60201486A (en) 1984-03-27 1984-03-27 Handwritten document reading method

Country Status (1)

Country Link
JP (1) JPS60201486A (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS62214486A (en) * 1986-03-17 1987-09-21 Sanyo Electric Co Ltd Character recognizing device
JPS63782A (en) * 1986-06-20 1988-01-05 Ricoh Co Ltd Pattern recognizing device

Also Published As

Publication number Publication date
JPS60201486A (en) 1985-10-11

Similar Documents

Publication Publication Date Title
EP0054842B1 (en) Figure recognizing system
GB1498221A (en) Character recognition unit
Kompalli et al. Design and comparison of segmentation driven and recognition driven Devanagari OCR
JPH0330191B2 (en)
JPH0430070B2 (en)
JPS58101378A (en) Manuscript document reading method
Ting et al. A syntactic business form classifier
CN115410207B (en) Detection method and device for vertical text
JP2616995B2 (en) Character recognition device
JPH0436432B2 (en)
JPS60160487A (en) Optical character reader
Kar et al. Recognition of Odia Character in an Image by Dividing the Image into Four Quadrants
JP2708604B2 (en) Character recognition method
JPS58149574A (en) Registering device of standard pattern
JPS63263588A (en) Character reader
JPH0475557B2 (en)
JPS60237584A (en) Character recognition system
JP2578767B2 (en) Image processing method
JPS6129976A (en) Character recognition
JPH01154296A (en) Character segmenting method
JPH0259504B2 (en)
JPH07254046A (en) Character reader and character reading method
JPS60138689A (en) Character recognizing method
JPH04211884A (en) Method for segmenting character
JPH02242391A (en) Character recognizing system