JPH0196779A

JPH0196779A - Word reader

Info

Publication number: JPH0196779A
Application number: JP62255167A
Authority: JP
Inventors: Yasuhiro Okada; 康裕岡田; Kozo Tomono; 伴野　浩三
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1987-10-09
Filing date: 1987-10-09
Publication date: 1989-04-14

Abstract

PURPOSE:To realize the high-speed collation of words without deteriorating the accuracy of word reading, by providing a reading object word deciding means. CONSTITUTION:A reading object word deciding means 6 is provided to decide the word where the character of a specific position in a word dictionary 7 is included in recognition candidate characters as a reading object word. The word decided by the means 6 is collated with said candidate characters. While the dictionary 7 is entirely collated with the candidate characters only in case no word is decided. Therefore, it is possible to perform the collation of words at a high speed without deteriorating the word reading accuracy since the collation is carried out first among a small number of words having high possi bility for satisfactory collation and then performing the collation among all words of the dictionary 7 only when the first collation fails.

Description

【発明の詳細な説明】〔産業上の利用分野〕この発明は、住所・氏名などの単語を読み取って認識す
る単語読取装置、特に単語を構成する文字を１文字毎に
認識し、その認識結果を用いて単語を修正する単語読取
装置の照会時間短縮化に関するものである。[Detailed Description of the Invention] [Field of Industrial Application] This invention relates to a word reading device that reads and recognizes words such as addresses and names, and in particular, recognizes each character that constitutes a word, and uses the recognition results. This invention relates to shortening the query time of a word reading device that corrects words using .

[Conventional technology]

近年、計算機などへの大量かつ高速のデータエン）　Ｉ
Ｊ−手段として、単語読取装置か注目されており１日本
語においては、かな文字や英数字などか複雑に混在して
いることから、高精度に単語の読み取りかできる単語読
取装置の開発か望まれている。この単語読取装置として
は１光学式文字読取装ｆｉｌｌ（ＯＣＲ）か知られてお
り、第２図に０例えば“昭和Ｓ１年度電子通信学会ｉ合
全国大会（ｔａ４ｔ）ｒ手書漢字認識における単語情報
の利用」、などに示された。従来の単語読取装置の構成
−を示す。In recent years, large amounts of high-speed data transmission to computers, etc.) I
Word reading devices are attracting attention as a means of J-1.Since Japanese has a complex mixture of kana characters, alphanumeric characters, etc., there is hope for the development of word reading devices that can read words with high accuracy. It is rare. This word reading device is known as an optical character reading device (OCR), and as shown in Fig. 2, for example, “Word information in handwritten kanji recognition” "Use of", etc. The configuration of a conventional word reading device is shown.

図において、（口は文字か記載されている帳票。In the figure, (the opening is a form with letters written on it.

（２）は前記帳票（口を読取装置に読み取らせるための
走査手段、（３）は入力文字を１文字毎に切り出して認
識し、認識候補文字を出力する文字認識手段。(2) is a scanning means for causing a reading device to read the form (original part); (3) is a character recognition means for cutting out and recognizing input characters character by character, and outputting recognition candidate characters.

１４１は帳票（１）に記入される単語を格納した単語辞
書。141 is a word dictionary that stores words to be entered in the form (1).

（５１は前記文字認識手段（３）から出力された認識候
補文字と前記単語辞書（４１内の複数の単語等を照合し
て単語を決定する単語決定手段である。、従来の単語読
取装置は以上のように構成され、下記に示す判定条件に
従い、前記認識候補文字のうち順位か上位で、かつ、単
語辞書（４−に登碌された単語を優先的に選択したもの
を、単語読み取りの結果とし゛て出力するものである。(51 is a word determining means that determines a word by comparing the recognition candidate character outputted from the character recognition means (3) with a plurality of words in the word dictionary (41).) Conventional word reading devices With the above structure, according to the judgment conditions shown below, words that are ranked high among the recognition candidate characters and have been entered in the word dictionary (4-) are preferentially selected for word reading. This is what is output as a result.

く判定条件〉ｍ個の認識候補文字か文字認識手段（３）から出力され
、ｎ文字からなる単語か入力された時、ｎ文字長の姓の
ｊ番目の単語Ａｊに対して０次式によりＰｊ　を計算す
る。Judgment conditions> When m recognition candidate characters are output from the character recognition means (3) and a word consisting of n characters is input, the jth word Aj of a last name with n characters in length is determined by the zero-order formula. Calculate Pj.

そして。and.

ｐｋ＝　［ｎｉｎ　（Ｐｊ　ＩＡｊはｎ文字単ｇＪ＜ｎ
ＸＴ（Ｔは閾値）なるｋ　（Ａｋはｎ文字単語）か存在するならば、姓の
ｎ文字長の辞書のに番目の項目を照合結果として出力す
る。pk= [nin (Pj IAj is n characters single gJ<n
If there exists a word k (Ak is an n-character word) such as XT (T is a threshold), the second item in the n-character dictionary of the last name is output as the matching result.

第３因は、単語読取装置にて読み取られる帳票（１）の
−例を示す説明図である。図において、（８１は入力文
字“水上”、（９）は入力文字１氷山″である。The third factor is an explanatory diagram showing an example of a form (1) read by a word reading device. In the figure, (81 is the input character "Minakami", and (9) is the input character 1 "Iceberg").

この各入力文字に対して１文字認識手段（３１から出力
される認識候補文字の一例か第４図、第５図に示されて
いる。Examples of recognition candidate characters output from the character recognition means (31) for each input character are shown in FIGS. 4 and 5.

第４図は、入力文字１水上”（８）に対して０文字認識
手段（３：で選択される認識候補文字を示しており、佛
４図の（ｌ［ｌに示されるように、入力文字第１文字目
の１水”に対しては、“水”、″氷”の２個の認識候補
文字か、続いて入力文字第２文字目の１上”に対しては
、″″土１．″″上”、＠止”の３個の認識候補文字か
存在する。Figure 4 shows the recognition candidate characters selected by the 0 character recognition means (3:) for the input character 1 "Minakami" (8). For the first character ``1 water'', there are two recognition candidate characters ``water'' and ``ice'', or for the second input character 1 upper'', there are ``earth''. 1. There are three recognition candidate characters: ``up'' and @stop.

第５図は、入力文字１永山″（９）に対して０文字認識
手段（３）で選択される認識候補文字を示しており、ｇ
ｓｒｓの（Ｉｌｌに示されるように、入力文字第１文字
目の１永”に対しては、＠氷”、＠水”の２個の認識候
補文字か、続いて入力文字第２文字目の１山”に対して
は０ｍ山１．１小”、１Ｊ１ビの３個の認識候補文字か
存在する。Figure 5 shows the recognition candidate characters selected by the 0 character recognition means (3) for the input character 1 Nagayama'' (9), and g
As shown in (Ill) of srs, for the first input character ``1ei'', there are two recognition candidate characters @ice'' and @water'', or the second input character There are three recognition candidate characters for ``1 mountain'', 0 m mountain, 1.1 small'', and 1J1 bi.

また、第６図は、単語読取装置における姓の単語辞書内
の、単語長２の単語の一例を示す説明図である。図にお
いて、ＬＬｚは１永井１なる単語、　（１ｍは１水山”
なる単語、　（１４は１水上”なる単語、ａジは“氷島
”なる単語、ａｅは１水原”なる単語である。Further, FIG. 6 is an explanatory diagram showing an example of words with word length 2 in the word dictionary of surnames in the word reading device. In the diagram, LLz is the word 1 Nagai 1, (1 m is 1 Mizuyama)
(14 is the word 1 water, aji is the word ``ice island,'' and ae is the word 1 water.)

次に、前記従来装置の動作を、前記判定条件のｍ＝３．
ｎ＝２．Ｏ＝１０．Ｔ＝８の場合について説明する。第
４図中の帳票上に記入された１水上”（８１の認識候補
文字ａｌに対して、Ｎ８図に示す単語辞書（４１内の単
語長２の単語との照合を、単語決定手段（５１にて行う
。まず、単語辞書内の単語長２の単語１永井”（１２の
各文字位置において、認識候補文字ＱＩお照合をとる。Next, the operation of the conventional device is determined based on the determination condition m=3.
n=2. O=10. The case where T=8 will be explained. The word determination means (51 First, the recognition candidate character QI is checked at each character position of the word 1 Nagai" (12) with word length 2 in the word dictionary.

すなわち、第１文字目の１氷”と、入力文字第１文字目
の一誠候補文字七の照合をとる。ところか入力文字第１
文字目のｍＲ候袖文字中に１永”かないので、ｐ１１＝
ｔ。In other words, the first character "1 ice" is compared with the first input character Issei candidate character 7. However, the first input character
Since there is no ``1ei'' in the mR sleeve letter, p11=
t.

となる。続いて、第２文字目の１井”と、入力文字第２
文字目の認識候補文字との照合をとる。ここでも入力文
字第２文字目の認識候補文字中に１井“が存在しないの
で、Ｐ２１＝１０となり、ＰＩＴ−２０となる。次いで
、単語辞書内の単語１水山”ａｊと照合を行う。照合操
作は、前記と同様にして行う。各文字位置で、第１文字
目１水”はｗｔ＊候補文字中に存在せず、また第２文字
目１山”も認誠候補文字中に存在しないので、　Ｐ２　
＝Ｐ１２　＋　Ｐ２２＝１０＋１０＝２０となる。becomes. Next, input the second character ``1i'' and the second input character.
Match the recognition candidate character with the first character. Here too, since ``1i'' does not exist in the recognition candidate characters for the second input character, P21=10, resulting in PIT-20. Next, the word 1 Mizuyama''aj in the word dictionary is compared. The matching operation is performed in the same manner as described above. At each character position, the first character "1 water" does not exist in the wt* candidate characters, and the second character "1 mountain" does not exist in the verification candidate characters, so P2
=P12+P22=10+10=20.

次いで、単語辞書内の単語“水上”Ｉの照合を行う。各
文字位置で、第１文字目１水″は認識候補文字中第１位
に、第２文字目１上”は認識候補文字内第２位に存在す
るので、　ｐｇ　＝ｐ＋５　＋　Ｐ２５＝１＋２＝３と
なる。Next, the word "minakami" I in the word dictionary is compared. At each character position, the first character "1 water" is in the first position among the recognition candidate characters, and the second character "1 top" is in the second position among the recognition candidate characters, so pg = p + 5 + P25 = 1 + 2 = It becomes 3.

次いで、単語辞書内の単語１水島’１９（！−照会を行
う。各文字位置で、第１文字目１水ｍはＢ識候補文字中
第１位に存在し、第２文字目１島ｍは認識候補文字中に
存在しないので、　　Ｐ４　”　Ｐｊ４　＋　Ｐ２４＝
１＋１０＝１１となる。Then, a query is made for the word 1 Mizushima'19 (!-) in the word dictionary. At each character position, the first character 1 mizu m exists in the first place among the B recognition candidate characters, and the second character 1 mizushima m does not exist in the recognition candidate characters, so P4 ” Pj4 + P24=
1+10=11.

次いで、単語辞書内の単語“氷原−ｔｔｅと照合を行う
。各文字位置で、第１文字目“水”は認識候補文字中第
１位に存在し、第２文字目１原”は認識候補文字中に存
在しないので、　　Ｐ５　”ｐｔｓ　＋　Ｐ２５＝１＋
１０＝１１となる。この結果Ｐｊ（ｊ＝１〜５）の中で
、ｉ＆小値はＰ５＝３　（（ｎＸＴ　（・＝１１１）］
となり、照合結果として正解の１水上”か得られる。Next, a match is made with the word "Ibara-tte" in the word dictionary. At each character position, the first character "Water" is in the first place among the recognition candidate characters, and the second character "1hara" is a recognition candidate. Since it does not exist in the characters, P5 ”pts + P25=1+
10=11. As a result, among Pj (j=1 to 5), i & small value is P5=3 ((nXT (・=111))
As a result of the matching, the correct answer ``1 Mizukami'' is obtained.

また、第５図中の帳票上た記入された１永山”（９）の
認識候補文字０旧ζ対して、第６囚に示す単語辞書内の
単語長２の単語、′永井’（１２，″″永山（１３゜１
水上＃１．′氷島’Ｑ９．”氷原”ａｅと前記と同様の
照合操作を行う。In addition, for the recognition candidate character 0 old ζ of 1 Nagai' (9) written on the form in Figure 5, the word length 2 in the word dictionary shown in prisoner 6, 'Nagai' (12, ″″Nagayama (13゜1
Water #1. 'Ice Island' Q9. Perform the same matching operation as above with "Ice Field" ae.

この結果、′永井’ａｓＨｃ対してｐ、＝２Ｑ。As a result, p, = 2Q for 'Nagai' asHc.

“氷山’　（１３に対してｐ２＝ｌｌ、＠水上”Ｉに対
してＰ５＝１２．”水Ｍ”器に対してＰ４＝１２゜１水
原”ｔｔｅ　ニ対しテＰｓ　＝１２　ｔ！：、す’Ｑ、
　　Ｐｊ　　（ｊ＝１〜５）の中で、最小値はＰ２＝１
１　（（ｎＸＴ（＝１６））　　となり、照合結果とし
て正解の“氷山”が得られる。“Iceberg” (p2=ll for 13, P5=12 for @Mizukami “I” P4=12゜1 Suwon for “Water M” tte Ps = 12 t!:, 'Q,
Among Pj (j=1 to 5), the minimum value is P2=1
1 ((nXT(=16)), and the correct answer "Iceberg" is obtained as a matching result.

[The problem that the invention attempts to solve]

しかしながら、上記従来の照合方式では、記入文字の認
識候補文字と、記入文字数と同一文字数をもつ単語すべ
てとの照合をとる必要かあり、多大な時間か必要である
七いう間馳点かあった。However, in the conventional matching method described above, it is necessary to match the recognition candidate characters of the entered characters with all the words that have the same number of characters as the number of entered characters, which has the disadvantage of requiring a large amount of time. .

この発明は、かかる問題点を解決するためになされたも
ので、単語の読取精度を鯰下させることなく、高速に単
語照会を行うことかできる単語読取装置を得ることを目
的とする。The present invention was made to solve this problem, and an object of the present invention is to provide a word reading device that can perform word inquiries at high speed without reducing word reading accuracy.

[Means for solving problems]

この発明に係る単語読取装置は、単語辞書内の特定の文
字位置の文字か、認識候補文字内に含まれる単語を、読
取対象単語として決定する読取対象単語決定手段を設け
、前記読取対象単語決定手段により選ばれた単語と、認
識候補文字とを照合し、単語か決定できなかった場合に
だけ、単語辞書全体と、Ｓ！！繊候補文字とを照合する
ようにしたものである。The word reading device according to the present invention is provided with a reading target word determining means for determining a character at a specific character position in a word dictionary or a word included in a recognition candidate character as a reading target word, and the reading target word determining means is provided. The word selected by the means is compared with the recognition candidate characters, and only when the word cannot be determined, the entire word dictionary and S! ! It is designed to match against weak candidate characters.

[Effect]

この発明における単語読取装置は、認識候補文字と単語
辞書内の単語とを照合する際に、まず。The word reading device in this invention first compares recognition candidate characters with words in the word dictionary.

照合成立の可能性の高い少数の単語と照合を行い。Matches a small number of words with a high probability of matching.

この照合か不成功となった時にのみ、単°語辞書内の全
単語との照合を行うので、単語の読取精度を低下させる
ことなく、高速に単語照合を行うことかできる。Only when this matching is unsuccessful, matching against all words in the word dictionary is performed, so word matching can be performed at high speed without reducing word reading accuracy.

[Embodiments of the invention]

以下、この発明の一東施例を図面に基づいて説明する。 Hereinafter, an embodiment of the present invention will be described based on the drawings.

第１図は、この発明による単語読取装置の一実一例を示
す構成因であり、（ｌ）〜（３）及び（５１は、第２図
に示した従来の単語読取装置と同一の構成要素からなる
、図において、（６）は単語辞書内の単語の中で、あらか
じめ定められた特定の文字位置の文字か１文字認識手段
（３）から得られる認識候補文字内に含まれる単語を、
読取対象単語七して、単語辞書から読み出す読取対象単
語決定手段である。FIG. 1 shows the components of an example of the word reading device according to the present invention, and (l) to (3) and (51) are the same components as the conventional word reading device shown in FIG. In the figure, (6) is a word that is included in a character at a predetermined specific character position or a recognition candidate character obtained from the single character recognition means (3) among the words in the word dictionary.
The reading target word 7 is a reading target word determining means that reads out the reading target word from a word dictionary.

また（））は、単語辞書（４）に、あらかじめ定められ
た特定の文字位置の文字をキーとして、所定の条件を満
たす単語を抽出できるようにインデックスか付加された
単語辞書である。Further, ()) is a word dictionary in which an index is added to the word dictionary (4) so that words satisfying predetermined conditions can be extracted using characters at predetermined specific character positions as keys.

第７図は、単語読取装置における姓の単語辞書（７）内
の、単語長２の単語の一例を示す説明−である。−にお
いて、　ａＸ８は１永井”なる単語、　（１３Ｇ２１永
山１なる単語、　（１４１は１水上”なる単語、餞は１
水島″なる単語、　ａＳは１水原”なる単語である。FIG. 7 is an explanation showing an example of a word with a word length of 2 in the surname word dictionary (7) in the word reading device. -, aX8 is the word 1 Nagai, (13G21 is the word Nagai 1, (141 is the word 1 Minakami), and 1 is the word 1.
The word ``Mizushima'', aS is the word ``Suwon''.

さて、αηは単線の先頭文字をキーとするインデツクス
、Ｕは前記インブックスミη内のキー１永”。Now, αη is an index whose key is the first letter of a single line, and U is the key 1 in the in-book system η.

α９は前記インデックス（１７＋内のキー１水Ｚ　ｃｌ
Ｑは前記インデックスαη内のキー１氷”であるつまた
。α9 is the index (key 1 in 17+ Z cl
Q is the key 1 in the index αη.

Ｑυはインブックスミｎ内の１氷＃ａｌのキーから。Qυ is from the key of 1 ice #al in Inbooksumi n.

１永井”なる単語ａｚを指示するポインタ、ｃ１３はイ
ンデックスαη内の１永＃−のキーから、′氷山”なる
単語Ｑ３を指示するポインタ、＠はインデックスｕｌ内
の１水”ａＳのキーから、′水上”なる単語Ｉを指示す
るポインタ、ｃ！４はインデックスＱＤ内の１水″ｆ１
９のキーから、“水鳥″なる単語霞を指示するポインタ
、（ハ）はインデックスＱη内の１水”σ９のキーから
、＠″水凍原１る単語ａｓを指示するポインタである。A pointer pointing to the word ``1 Nagai'' az, c13 is a pointer pointing to the word ``Iceberg'' from the key 1 Nagai#- in the index αη, a pointer pointing to the word Q3 ``Iceberg'', @ is a key pointing to the word ``1 water'' aS in the index ul, Pointer pointing to the word ``water'' I, c! 4 is 1 water "f1" in index QD
From the key 9, a pointer points to the word ``water bird'', and (c) points from the key σ9, ``1 water'' in the index Qη, to the word ``@'' water freeze field 1, as.

なお（ホ）はインデックス鰭内の１氷”偶のキーに対す
るポインタで、キー１氷”偶に対する単語か存在しない
ことを示す。Note that (e) is a pointer to the key 1 ice even in the index fin, and indicates that there is no word for the key 1 ice even.

次に動作について説明する。第４−中の帳票上に記入さ
れた１水上″（８）の認識候補文字舖に対して、第７図
に示す単語辞書（７）内の単語長２の単語との照合操作
を、読取対象単語決定手段（６）と、単語決定小股（５
）を用いて行う。Next, the operation will be explained. The recognition candidate character ``1 Minakami'' (8) written on the middle form in Section 4 is compared with a word with a word length of 2 in the word dictionary (7) shown in Figure 7. Target word determining means (6) and word determining means (5)
).

まず、読取対象単語決定手段（６）は、第４図中の入力
文字１水上ｍ（８）の認識候補文字１水の第１文字目、
第１位の認識候補文字１水′を参照し、第１文字目か”
水″の単語を単語辞書（７）から抽出する。この抽出に
際しては０ｍ７図に示す単語辞書のインデックスａＴＩ
内の１水”０のキーを参照し。First, the reading target word determining means (6) selects the first character of the recognition candidate character 1 water of the input character 1 water m (8) in FIG.
Referring to the first recognition candidate character 1 ', is it the first character?
The word "water" is extracted from the word dictionary (7).In this extraction, the index aTI of the word dictionary shown in Figure 0m7 is
Please refer to the 1 water "0" key.

ポインタ（２）、（財）、＠で指示される単語、′水よ
”Ｉ、＠氷島＃似、＠氷原″（ＬＬｌを抽出する。Extract the words indicated by pointer (2), (goods), @, ``Water'' I, @ice island #similarity, @ice field'' (LLl).

次に、読取対象単語決定手段（ａｌ　４１　、第４因中
の入力文字１水上ｍ（８）の認識候補文字ａｌ中の第１
文字目、第２位の認識候補文字１氷″を参照し、第１文
字目か“氷”の単語を単飴辞＠　（？）から抽出する。Next, the reading target word determining means (al 41 , the first character in the recognition candidate character al of the input character 1 Mizukami m (8) in the fourth factor)
The first character or the word "ice" is extracted from the single candy word @ (?) by referring to the second-ranked recognition candidate character "1 ice".

抽出に際して、第１図に示す辞書のインブックスミ力内
の１永ｍ偽のキーを参照すると、ポインタ（２）は単語
か存在しないことを示しているので。When extracting, if you refer to the false key in the dictionary in the dictionary shown in Figure 1, the pointer (2) indicates that the word does not exist.

１氷”を１文字目とする単語は抽出できない。Words whose first character is "1 ice" cannot be extracted.

かくして、読取対象単語は、＠水上”１．＠水晶”ａＳ
、１水原″ａｅに決定される。さて、読取対象単語か決
定されたので０次に単語決定手段（５）にて照合を行う
。この照合操作は、従来装置と同様にして行う。ただし
、前記従来ｆｃｔの判定条件の定数は、ｍ＝３．ｎ＝２
．Ｃ＝１０．Ｔ＝＝３とする。従来装置の照合と同様に
１文字位置毎に認識候補文字と照合対象単語とを照合す
ると、′水上”Ｉに対して　ｐＨ＝３．−氷島″霞に対
して　Ｐ２　＝’１１．”’氷原”（１［Ｐｊｃ対して
Ｐ５　＝　１１　トナＱ、コの結果Ｐｊ　＝　（ｊ　＝
　１〜３）の中で最小値は、Ｐ１＝　３　（（ｎＸＴ　
（＝８　）　）となまり、照合結果として正解の１水上
”か得られる。このように、従来製置では、単語辞書全
体の単語と照合を行ったのに対し、この発明では、ｙ取
対象単語決定手段（６）により、３個の単語（１水上”
、＠″水氷島。Thus, the words to be read are @Minakami"1.@Suzuki"aS
, 1 Mizuhara"ae is determined.Now that the word to be read has been determined, the zero-order word determination means (5) performs a verification.This verification operation is performed in the same manner as in the conventional device.However, The constants for the judgment conditions of the conventional fct are m=3.n=2
．． C=10. Let T==3. Similar to the comparison with the conventional device, when the recognition candidate character is compared with the target word for each character position, for 'Mizukami' I, pH = 3. - for Hyojima' Kasumi, P2 = '11. ``Ice field'' (1 [Pjc vs. P5 = 11 Tona Q, Ko's result Pj = (j =
1 to 3), the minimum value is P1=3 ((nXT
(=8)), and the correct answer is 1 Mizukami" as a matching result. In this way, in the conventional system, matching was performed with words in the entire word dictionary, but in this invention, the correct answer is "1 Minakami". The word determination means (6) selects 3 words (1.
, @″Mizuhyo Island.

１水原”）に絞って照合を行うだけで正解か得られる。You can get the correct answer just by narrowing down your search to ``Suwon'').

次に、第５図中の帳票上に記入された１氷山”（９）の
認識候補文字ａ旧と対して、第７’ｌｌに示す単語辞書
（７）内の単語長２の単語との照合８．読取対象単語決
定手段（６）と、単語決定手段（５）を用いて行う場合
について説明する。Next, for the recognition candidate character a old of ``1 iceberg'' (9) written on the form in Figure 5, the word length 2 in the word dictionary (7) shown in Figure 7'll is compared. Verification 8. A case in which the reading target word determining means (6) and the word determining means (5) are used will be described.

前記と同様に、まず読取対象単語決定手段（６）は。As above, first, the reading target word determining means (6).

第４図中の入力文字１永山１（９）の認識候補文字＋１
１）中の第１文字目、第１位の認識候補文字“氷”を参
照するか、第１文字目か１氷”の単語は前記のとおり存
在しない。Recognition candidate character for input character 1 Nagayama 1 (9) in Figure 4 +1
1) Referring to the first character, the first recognition candidate character "Ice", or the word "Ice" as the first character does not exist, as described above.

次に、第４図中の入力文字１永山ｍ（９）の認識候補文
字ａＤ中の第１文字目、第２位の認識候補文字１水′″
を参照すると、前記と同様に１水上”ａｌ。Next, the first character in the recognition candidate character aD of the input character 1 Nagayama m (9) in Figure 4, and the second recognition candidate character 1 water'''
Referring to ``1 Minakami'' al., as above.

“水鳥”ａＳ、′氷原”ａｅの３個の単語か抽出される
。さらに、前記と同様に、従来装置の判定条件の定数を
、ｍ＝３．ｎ＝２．Ｏ＝１０．Ｔ＝３として照合すると
、′水上”Ｑ４に対してＰ１＝１２゜１水島１５に対し
て　ｐ２＝ｉ２．＠氷原″（Ｌｌｌに対してＰｓ＝１２
となり、何れもＰ、１（ｊ＝１〜３）〉ｎ　ＸＴ　（＝
６　）であるので、照合結果は得られない。Three words, “waterfowl” aS and “ice field” ae, are extracted.Furthermore, in the same way as above, the constants of the judgment conditions of the conventional device are set as m=3.n=2.O=10.T=3 When collated as follows, P1=12°1 for 'Mizukami' Q4, p2=i2 for Mizushima 15. @Iceland” (Ps=12 for Lll
Therefore, both P, 1 (j=1 to 3)〉n XT (=
6), no matching results can be obtained.

この場合には、インデックス鼎を用いて単語辞書を参照
せずに、従来製置と同様に、単語辞書（７）内の全単語
と照合を行う。菫た。この時１判定条件の定数は０ｍ＝
３．ｎ＝２．Ｏ＝１０．Ｔ＝８として照合を行う。これ
により、従来製置の照合と同様の結果か得られ、＠″水
井υに対してＰ１＝２０．″″永山（Ｉ３に対してＰ２
＝１１．＠水上”Ｉに対してｐ３＝１２．−氷島＃（Ｌ
９に対してＰ４＝１２．”’氷原″ａｅに対してＰｓ＝
１２ａなり。In this case, all the words in the word dictionary (7) are compared using the index function without referring to the word dictionary, as in the conventional setting. Sumire. At this time, the constant of 1 judgment condition is 0m=
3. n=2. O=10. Verification is performed with T=8. As a result, the same results as the conventional comparison were obtained, and @"P1=20 for Mizui υ.""Nagayama (P2 for I3
=11. @Mizukami”p3=12 for I.-Ice Island #(L
P4=12 for 9. Ps= for ``'Ice Field'' ae
It's 12a.

Ｐ、１（ｊ　＝　１〜５　）の中で最小値はＰ２　＝１
１　［（ｎＸＴ（＝１６））と求まり、照合結果として
正解あ“氷山″か得られる。The minimum value among P, 1 (j = 1 to 5) is P2 = 1
1 [(nXT(=16)), and the correct answer is ``iceberg'' as a matching result.

このように、読取対象単語決定手段（６）を用いること
により、認識候補文字と単語辞書（７）内の単語とを照
付する際に、まず、照合成立の可能性の高い少数の単語
と照合を行い、照合か不成功となった時にのみ、単語辞
書（７）内の全単語と照合を行うので、単語の読取精度
を低下させることなく、高速に単語照合を行うことかで
きる。In this way, by using the reading target word determining means (6), when matching recognition candidate characters with words in the word dictionary (7), it first selects a small number of words with a high possibility of matching. Verification is performed, and only when the verification is unsuccessful, all words in the word dictionary (7) are verified, so word verification can be performed at high speed without reducing word reading accuracy.

なお、上記実施例で４２．単語の先頭１文字をキーとし
て読取対象単語を抽出したか、キーとなる文字の文字位
置・文字数は自由に設定できる。In addition, in the above example, 42. Whether the word to be read is extracted using the first character of the word as a key, or the character position and number of key characters can be set freely.

（例えば、単語の先頭１文字と、最後の１文字のどちら
力１か、ｕ踵候補文字中に含まれている単語を抽出する
等。）また、上記実施例では、漢字の姓について述べたか、カ
タカナ・英字などの単語１氏名・会社名などの単語に関
してもこの発明を適用することかできる。(For example, whether the first character or the last character of a word is 1, extracting the words contained in the u-heel candidate characters, etc.) Also, in the above example, the surname of the kanji character was mentioned. This invention can also be applied to words such as 1, names, company names, etc., such as katakana and English characters.

〔Effect of the invention〕

以上のように、この発明によれば、１１１！取対象単語
決定手段を設けたことにより、認識候補文字七単語辞書
内の全単語とを常時照合する必要かな（なり、単語の読
取精度を低下させることなく、高速に単語照合を行うこ
とができる。As described above, according to this invention, 111! By providing a means for determining target words, it becomes necessary to constantly check all the words in the seven-word dictionary for recognition candidate characters (this makes it possible to perform word checking at high speed without reducing word reading accuracy. .

[Brief explanation of the drawing]

第１図はこの発明の一実施例による単語読取装置の構成
図、第２図は従来の単語読取装置の構成図、第３図は単
語読取装置の帳票の一例を示す説明図、＠４図及び第５
図は単語読取装置における入力文字及び認識候補文字の
一例を示す説ＢＡＶ。第６図は従来の単語読取装置における単語辞書内の単語
長側の単語列の一例を示す説明図、第７図はこの発明の
一実施例による単語読取装置における単語辞簀内の単語
側の一例を示すａ側御である。図中、（３）は文字認識手段、（５）は単語決定手段。（６）は読取対象単語決定手段、（７）は単語辞書であ
る。なお１図中、同一符号は同−又は相当部分を示す。Fig. 1 is a block diagram of a word reading device according to an embodiment of the present invention, Fig. 2 is a block diagram of a conventional word reading device, and Fig. 3 is an explanatory diagram showing an example of a form of the word reading device. and fifth
The figure shows an example BAV of input characters and recognition candidate characters in a word reading device. FIG. 6 is an explanatory diagram showing an example of a word string on the word length side in a word dictionary in a conventional word reading device, and FIG. This is an example of A-side control. In the figure, (3) is a character recognition means, and (5) is a word determination means. (6) is a reading target word determining means, and (7) is a word dictionary. In addition, in FIG. 1, the same reference numerals indicate the same or corresponding parts.

Claims

[Claims]

In a word reading device that recognizes and reads characters constituting a word, a character recognition means selects, for each input character, a plurality of recognition candidate characters ordered in descending order of similarity as input characters, and a word written on a form. A word dictionary that stores
a word determining means for determining a word by comparing the recognition candidate characters with the word dictionary; and a word in the word dictionary in which a character at a specific character position is included in the recognition candidate characters as a reading target word. a reading target word determining means that reads out a word from a word dictionary; first, the recognition candidate character is compared with the word read by the reading target word determining means, and only if a word cannot be determined at this stage; A word reading device characterized in that it is configured to determine a word by comparing recognition candidate characters with all words in a word dictionary.