JPH0196779A - Word reader - Google Patents

Word reader

Info

Publication number
JPH0196779A
JPH0196779A JP62255167A JP25516787A JPH0196779A JP H0196779 A JPH0196779 A JP H0196779A JP 62255167 A JP62255167 A JP 62255167A JP 25516787 A JP25516787 A JP 25516787A JP H0196779 A JPH0196779 A JP H0196779A
Authority
JP
Japan
Prior art keywords
word
character
dictionary
recognition candidate
words
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP62255167A
Other languages
Japanese (ja)
Inventor
Yasuhiro Okada
康裕 岡田
Kozo Tomono
伴野 浩三
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Mitsubishi Electric Corp
Original Assignee
Mitsubishi Electric Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mitsubishi Electric Corp filed Critical Mitsubishi Electric Corp
Priority to JP62255167A priority Critical patent/JPH0196779A/en
Publication of JPH0196779A publication Critical patent/JPH0196779A/en
Pending legal-status Critical Current

Links

Abstract

PURPOSE:To realize the high-speed collation of words without deteriorating the accuracy of word reading, by providing a reading object word deciding means. CONSTITUTION:A reading object word deciding means 6 is provided to decide the word where the character of a specific position in a word dictionary 7 is included in recognition candidate characters as a reading object word. The word decided by the means 6 is collated with said candidate characters. While the dictionary 7 is entirely collated with the candidate characters only in case no word is decided. Therefore, it is possible to perform the collation of words at a high speed without deteriorating the word reading accuracy since the collation is carried out first among a small number of words having high possi bility for satisfactory collation and then performing the collation among all words of the dictionary 7 only when the first collation fails.

Description

【発明の詳細な説明】 〔産業上の利用分野〕 この発明は、住所・氏名などの単語を読み取って認識す
る単語読取装置、特に単語を構成する文字を1文字毎に
認識し、その認識結果を用いて単語を修正する単語読取
装置の照会時間短縮化に関するものである。
[Detailed Description of the Invention] [Field of Industrial Application] This invention relates to a word reading device that reads and recognizes words such as addresses and names, and in particular, recognizes each character that constitutes a word, and uses the recognition results. This invention relates to shortening the query time of a word reading device that corrects words using .

〔従来の技術〕[Conventional technology]

近年、計算機などへの大量かつ高速のデータエン) I
J−手段として、単語読取装置か注目されており1日本
語においては、かな文字や英数字などか複雑に混在して
いることから、高精度に単語の読み取りかできる単語読
取装置の開発か望まれている。この単語読取装置として
は1光学式文字読取装fill(OCR)か知られてお
り、第2図に0例えば“昭和S1年度電子通信学会i合
全国大会(ta4t)r手書漢字認識における単語情報
の利用」、などに示された。従来の単語読取装置の構成
−を示す。
In recent years, large amounts of high-speed data transmission to computers, etc.) I
Word reading devices are attracting attention as a means of J-1.Since Japanese has a complex mixture of kana characters, alphanumeric characters, etc., there is hope for the development of word reading devices that can read words with high accuracy. It is rare. This word reading device is known as an optical character reading device (OCR), and as shown in Fig. 2, for example, “Word information in handwritten kanji recognition” "Use of", etc. The configuration of a conventional word reading device is shown.

図において、(口は文字か記載されている帳票。In the figure, (the opening is a form with letters written on it.

(2)は前記帳票(口を読取装置に読み取らせるための
走査手段、(3)は入力文字を1文字毎に切り出して認
識し、認識候補文字を出力する文字認識手段。
(2) is a scanning means for causing a reading device to read the form (original part); (3) is a character recognition means for cutting out and recognizing input characters character by character, and outputting recognition candidate characters.

141は帳票(1)に記入される単語を格納した単語辞
書。
141 is a word dictionary that stores words to be entered in the form (1).

(51は前記文字認識手段(3)から出力された認識候
補文字と前記単語辞書(41内の複数の単語等を照合し
て単語を決定する単語決定手段である。、従来の単語読
取装置は以上のように構成され、下記に示す判定条件に
従い、前記認識候補文字のうち順位か上位で、かつ、単
語辞書(4−に登碌された単語を優先的に選択したもの
を、単語読み取りの結果とし゛て出力するものである。
(51 is a word determining means that determines a word by comparing the recognition candidate character outputted from the character recognition means (3) with a plurality of words in the word dictionary (41).) Conventional word reading devices With the above structure, according to the judgment conditions shown below, words that are ranked high among the recognition candidate characters and have been entered in the word dictionary (4-) are preferentially selected for word reading. This is what is output as a result.

く判定条件〉 m個の認識候補文字か文字認識手段(3)から出力され
、n文字からなる単語か入力された時、n文字長の姓の
j番目の単語Ajに対して0次式によりPj を計算す
る。
Judgment conditions> When m recognition candidate characters are output from the character recognition means (3) and a word consisting of n characters is input, the jth word Aj of a last name with n characters in length is determined by the zero-order formula. Calculate Pj.

そして。and.

pk= [nin (Pj IAjはn文字単gJ<n
XT(Tは閾値) なるk (Akはn文字単語)か存在するならば、姓の
n文字長の辞書のに番目の項目を照合結果として出力す
る。
pk= [nin (Pj IAj is n characters single gJ<n
If there exists a word k (Ak is an n-character word) such as XT (T is a threshold), the second item in the n-character dictionary of the last name is output as the matching result.

第3因は、単語読取装置にて読み取られる帳票(1)の
−例を示す説明図である。図において、(81は入力文
字“水上”、(9)は入力文字1氷山″である。
The third factor is an explanatory diagram showing an example of a form (1) read by a word reading device. In the figure, (81 is the input character "Minakami", and (9) is the input character 1 "Iceberg").

この各入力文字に対して1文字認識手段(31から出力
される認識候補文字の一例か第4図、第5図に示されて
いる。
Examples of recognition candidate characters output from the character recognition means (31) for each input character are shown in FIGS. 4 and 5.

第4図は、入力文字1水上”(8)に対して0文字認識
手段(3:で選択される認識候補文字を示しており、佛
4図の(l[lに示されるように、入力文字第1文字目
の1水”に対しては、“水”、″氷”の2個の認識候補
文字か、続いて入力文字第2文字目の1上”に対しては
、″″土1.″″上”、@止”の3個の認識候補文字か
存在する。
Figure 4 shows the recognition candidate characters selected by the 0 character recognition means (3:) for the input character 1 "Minakami" (8). For the first character ``1 water'', there are two recognition candidate characters ``water'' and ``ice'', or for the second input character 1 upper'', there are ``earth''. 1. There are three recognition candidate characters: ``up'' and @stop.

第5図は、入力文字1永山″(9)に対して0文字認識
手段(3)で選択される認識候補文字を示しており、g
srsの(Illに示されるように、入力文字第1文字
目の1永”に対しては、@氷”、@水”の2個の認識候
補文字か、続いて入力文字第2文字目の1山”に対して
は0m山1.1小”、1J1ビの3個の認識候補文字か
存在する。
Figure 5 shows the recognition candidate characters selected by the 0 character recognition means (3) for the input character 1 Nagayama'' (9), and g
As shown in (Ill) of srs, for the first input character ``1ei'', there are two recognition candidate characters @ice'' and @water'', or the second input character There are three recognition candidate characters for ``1 mountain'', 0 m mountain, 1.1 small'', and 1J1 bi.

また、第6図は、単語読取装置における姓の単語辞書内
の、単語長2の単語の一例を示す説明図である。図にお
いて、LLzは1永井1なる単語、 (1mは1水山”
なる単語、 (14は1水上”なる単語、aジは“氷島
”なる単語、aeは1水原”なる単語である。
Further, FIG. 6 is an explanatory diagram showing an example of words with word length 2 in the word dictionary of surnames in the word reading device. In the diagram, LLz is the word 1 Nagai 1, (1 m is 1 Mizuyama)
(14 is the word 1 water, aji is the word ``ice island,'' and ae is the word 1 water.)

次に、前記従来装置の動作を、前記判定条件のm=3.
n=2.O=10.T=8の場合について説明する。第
4図中の帳票上に記入された1水上”(81の認識候補
文字alに対して、N8図に示す単語辞書(41内の単
語長2の単語との照合を、単語決定手段(51にて行う
。まず、単語辞書内の単語長2の単語1永井”(12の
各文字位置において、認識候補文字QIお照合をとる。
Next, the operation of the conventional device is determined based on the determination condition m=3.
n=2. O=10. The case where T=8 will be explained. The word determination means (51 First, the recognition candidate character QI is checked at each character position of the word 1 Nagai" (12) with word length 2 in the word dictionary.

すなわち、第1文字目の1氷”と、入力文字第1文字目
の一誠候補文字七の照合をとる。ところか入力文字第1
文字目のmR候袖文字中に1永”かないので、p11=
t。
In other words, the first character "1 ice" is compared with the first input character Issei candidate character 7. However, the first input character
Since there is no ``1ei'' in the mR sleeve letter, p11=
t.

となる。続いて、第2文字目の1井”と、入力文字第2
文字目の認識候補文字との照合をとる。ここでも入力文
字第2文字目の認識候補文字中に1井“が存在しないの
で、P21=10となり、PIT−20となる。次いで
、単語辞書内の単語1水山”ajと照合を行う。照合操
作は、前記と同様にして行う。各文字位置で、第1文字
目1水”はwt*候補文字中に存在せず、また第2文字
目1山”も認誠候補文字中に存在しないので、 P2 
=P12 + P22=10+10=20となる。
becomes. Next, input the second character ``1i'' and the second input character.
Match the recognition candidate character with the first character. Here too, since ``1i'' does not exist in the recognition candidate characters for the second input character, P21=10, resulting in PIT-20. Next, the word 1 Mizuyama''aj in the word dictionary is compared. The matching operation is performed in the same manner as described above. At each character position, the first character "1 water" does not exist in the wt* candidate characters, and the second character "1 mountain" does not exist in the verification candidate characters, so P2
=P12+P22=10+10=20.

次いで、単語辞書内の単語“水上”Iの照合を行う。各
文字位置で、第1文字目1水″は認識候補文字中第1位
に、第2文字目1上”は認識候補文字内第2位に存在す
るので、 pg =p+5 + P25=1+2=3と
なる。
Next, the word "minakami" I in the word dictionary is compared. At each character position, the first character "1 water" is in the first position among the recognition candidate characters, and the second character "1 top" is in the second position among the recognition candidate characters, so pg = p + 5 + P25 = 1 + 2 = It becomes 3.

次いで、単語辞書内の単語1水島’19(!−照会を行
う。各文字位置で、第1文字目1水mはB識候補文字中
第1位に存在し、第2文字目1島mは認識候補文字中に
存在しないので、  P4 ” Pj4 + P24=
1+10=11となる。
Then, a query is made for the word 1 Mizushima'19 (!-) in the word dictionary. At each character position, the first character 1 mizu m exists in the first place among the B recognition candidate characters, and the second character 1 mizushima m does not exist in the recognition candidate characters, so P4 ” Pj4 + P24=
1+10=11.

次いで、単語辞書内の単語“氷原−tteと照合を行う
。各文字位置で、第1文字目“水”は認識候補文字中第
1位に存在し、第2文字目1原”は認識候補文字中に存
在しないので、  P5 ”pts + P25=1+
10=11となる。この結果Pj(j=1〜5)の中で
、i&小値はP5=3 ((nXT (・=111)]
となり、照合結果として正解の1水上”か得られる。
Next, a match is made with the word "Ibara-tte" in the word dictionary. At each character position, the first character "Water" is in the first place among the recognition candidate characters, and the second character "1hara" is a recognition candidate. Since it does not exist in the characters, P5 ”pts + P25=1+
10=11. As a result, among Pj (j=1 to 5), i & small value is P5=3 ((nXT (・=111))
As a result of the matching, the correct answer ``1 Mizukami'' is obtained.

また、第5図中の帳票上た記入された1永山”(9)の
認識候補文字0旧ζ対して、第6囚に示す単語辞書内の
単語長2の単語、′永井’(12,″″永山(13゜1
水上#1.′氷島’Q9.”氷原”aeと前記と同様の
照合操作を行う。
In addition, for the recognition candidate character 0 old ζ of 1 Nagai' (9) written on the form in Figure 5, the word length 2 in the word dictionary shown in prisoner 6, 'Nagai' (12, ″″Nagayama (13゜1
Water #1. 'Ice Island' Q9. Perform the same matching operation as above with "Ice Field" ae.

この結果、′永井’asHc対してp、=2Q。As a result, p, = 2Q for 'Nagai' asHc.

“氷山’ (13に対してp2=ll、@水上”Iに対
してP5=12.”水M”器に対してP4=12゜1水
原”tte ニ対しテPs =12 t!:、す’Q、
  Pj  (j=1〜5)の中で、最小値はP2=1
1 ((nXT(=16))  となり、照合結果とし
て正解の“氷山”が得られる。
“Iceberg” (p2=ll for 13, P5=12 for @Mizukami “I” P4=12゜1 Suwon for “Water M” tte Ps = 12 t!:, 'Q,
Among Pj (j=1 to 5), the minimum value is P2=1
1 ((nXT(=16)), and the correct answer "Iceberg" is obtained as a matching result.

〔発明か解決しようとする問題点〕[The problem that the invention attempts to solve]

しかしながら、上記従来の照合方式では、記入文字の認
識候補文字と、記入文字数と同一文字数をもつ単語すべ
てとの照合をとる必要かあり、多大な時間か必要である
七いう間馳点かあった。
However, in the conventional matching method described above, it is necessary to match the recognition candidate characters of the entered characters with all the words that have the same number of characters as the number of entered characters, which has the disadvantage of requiring a large amount of time. .

この発明は、かかる問題点を解決するためになされたも
ので、単語の読取精度を鯰下させることなく、高速に単
語照会を行うことかできる単語読取装置を得ることを目
的とする。
The present invention was made to solve this problem, and an object of the present invention is to provide a word reading device that can perform word inquiries at high speed without reducing word reading accuracy.

〔問題点を解決するための手段〕[Means for solving problems]

この発明に係る単語読取装置は、単語辞書内の特定の文
字位置の文字か、認識候補文字内に含まれる単語を、読
取対象単語として決定する読取対象単語決定手段を設け
、前記読取対象単語決定手段により選ばれた単語と、認
識候補文字とを照合し、単語か決定できなかった場合に
だけ、単語辞書全体と、S!!繊候補文字とを照合する
ようにしたものである。
The word reading device according to the present invention is provided with a reading target word determining means for determining a character at a specific character position in a word dictionary or a word included in a recognition candidate character as a reading target word, and the reading target word determining means is provided. The word selected by the means is compared with the recognition candidate characters, and only when the word cannot be determined, the entire word dictionary and S! ! It is designed to match against weak candidate characters.

〔作用〕[Effect]

この発明における単語読取装置は、認識候補文字と単語
辞書内の単語とを照合する際に、まず。
The word reading device in this invention first compares recognition candidate characters with words in the word dictionary.

照合成立の可能性の高い少数の単語と照合を行い。Matches a small number of words with a high probability of matching.

この照合か不成功となった時にのみ、単°語辞書内の全
単語との照合を行うので、単語の読取精度を低下させる
ことなく、高速に単語照合を行うことかできる。
Only when this matching is unsuccessful, matching against all words in the word dictionary is performed, so word matching can be performed at high speed without reducing word reading accuracy.

〔発明の実施例〕[Embodiments of the invention]

以下、この発明の一東施例を図面に基づいて説明する。 Hereinafter, an embodiment of the present invention will be described based on the drawings.

第1図は、この発明による単語読取装置の一実一例を示
す構成因であり、(l)〜(3)及び(51は、第2図
に示した従来の単語読取装置と同一の構成要素からなる
、 図において、(6)は単語辞書内の単語の中で、あらか
じめ定められた特定の文字位置の文字か1文字認識手段
(3)から得られる認識候補文字内に含まれる単語を、
読取対象単語七して、単語辞書から読み出す読取対象単
語決定手段である。
FIG. 1 shows the components of an example of the word reading device according to the present invention, and (l) to (3) and (51) are the same components as the conventional word reading device shown in FIG. In the figure, (6) is a word that is included in a character at a predetermined specific character position or a recognition candidate character obtained from the single character recognition means (3) among the words in the word dictionary.
The reading target word 7 is a reading target word determining means that reads out the reading target word from a word dictionary.

また())は、単語辞書(4)に、あらかじめ定められ
た特定の文字位置の文字をキーとして、所定の条件を満
たす単語を抽出できるようにインデックスか付加された
単語辞書である。
Further, ()) is a word dictionary in which an index is added to the word dictionary (4) so that words satisfying predetermined conditions can be extracted using characters at predetermined specific character positions as keys.

第7図は、単語読取装置における姓の単語辞書(7)内
の、単語長2の単語の一例を示す説明−である。−にお
いて、 aX8は1永井”なる単語、 (13G21永
山1なる単語、 (141は1水上”なる単語、餞は1
水島″なる単語、 aSは1水原”なる単語である。
FIG. 7 is an explanation showing an example of a word with a word length of 2 in the surname word dictionary (7) in the word reading device. -, aX8 is the word 1 Nagai, (13G21 is the word Nagai 1, (141 is the word 1 Minakami), and 1 is the word 1.
The word ``Mizushima'', aS is the word ``Suwon''.

さて、αηは単線の先頭文字をキーとするインデツクス
、Uは前記インブックスミη内のキー1永”。
Now, αη is an index whose key is the first letter of a single line, and U is the key 1 in the in-book system η.

α9は前記インデックス(17+内のキー1水Z cl
Qは前記インデックスαη内のキー1氷”であるつまた
α9 is the index (key 1 in 17+ Z cl
Q is the key 1 in the index αη.

Qυはインブックスミn内の1氷#alのキーから。Qυ is from the key of 1 ice #al in Inbooksumi n.

1永井”なる単語azを指示するポインタ、c13はイ
ンデックスαη内の1永#−のキーから、′氷山”なる
単語Q3を指示するポインタ、@はインデックスul内
の1水”aSのキーから、′水上”なる単語Iを指示す
るポインタ、c!4はインデックスQD内の1水″f1
9のキーから、“水鳥″なる単語霞を指示するポインタ
、(ハ)はインデックスQη内の1水”σ9のキーから
、@″水凍原1る単語asを指示するポインタである。
A pointer pointing to the word ``1 Nagai'' az, c13 is a pointer pointing to the word ``Iceberg'' from the key 1 Nagai#- in the index αη, a pointer pointing to the word Q3 ``Iceberg'', @ is a key pointing to the word ``1 water'' aS in the index ul, Pointer pointing to the word ``water'' I, c! 4 is 1 water "f1" in index QD
From the key 9, a pointer points to the word ``water bird'', and (c) points from the key σ9, ``1 water'' in the index Qη, to the word ``@'' water freeze field 1, as.

なお(ホ)はインデックス鰭内の1氷”偶のキーに対す
るポインタで、キー1氷”偶に対する単語か存在しない
ことを示す。
Note that (e) is a pointer to the key 1 ice even in the index fin, and indicates that there is no word for the key 1 ice even.

次に動作について説明する。第4−中の帳票上に記入さ
れた1水上″(8)の認識候補文字舖に対して、第7図
に示す単語辞書(7)内の単語長2の単語との照合操作
を、読取対象単語決定手段(6)と、単語決定小股(5
)を用いて行う。
Next, the operation will be explained. The recognition candidate character ``1 Minakami'' (8) written on the middle form in Section 4 is compared with a word with a word length of 2 in the word dictionary (7) shown in Figure 7. Target word determining means (6) and word determining means (5)
).

まず、読取対象単語決定手段(6)は、第4図中の入力
文字1水上m(8)の認識候補文字1水の第1文字目、
第1位の認識候補文字1水′を参照し、第1文字目か”
水″の単語を単語辞書(7)から抽出する。この抽出に
際しては0m7図に示す単語辞書のインデックスaTI
内の1水”0のキーを参照し。
First, the reading target word determining means (6) selects the first character of the recognition candidate character 1 water of the input character 1 water m (8) in FIG.
Referring to the first recognition candidate character 1 ', is it the first character?
The word "water" is extracted from the word dictionary (7).In this extraction, the index aTI of the word dictionary shown in Figure 0m7 is
Please refer to the 1 water "0" key.

ポインタ(2)、(財)、@で指示される単語、′水よ
”I、@氷島#似、@氷原″(LLlを抽出する。
Extract the words indicated by pointer (2), (goods), @, ``Water'' I, @ice island #similarity, @ice field'' (LLl).

次に、読取対象単語決定手段(al 41 、第4因中
の入力文字1水上m(8)の認識候補文字al中の第1
文字目、第2位の認識候補文字1氷″を参照し、第1文
字目か“氷”の単語を単飴辞@ (?)から抽出する。
Next, the reading target word determining means (al 41 , the first character in the recognition candidate character al of the input character 1 Mizukami m (8) in the fourth factor)
The first character or the word "ice" is extracted from the single candy word @ (?) by referring to the second-ranked recognition candidate character "1 ice".

抽出に際して、第1図に示す辞書のインブックスミ力内
の1永m偽のキーを参照すると、ポインタ(2)は単語
か存在しないことを示しているので。
When extracting, if you refer to the false key in the dictionary in the dictionary shown in Figure 1, the pointer (2) indicates that the word does not exist.

1氷”を1文字目とする単語は抽出できない。Words whose first character is "1 ice" cannot be extracted.

かくして、読取対象単語は、@水上”1.@水晶”aS
、1水原″aeに決定される。さて、読取対象単語か決
定されたので0次に単語決定手段(5)にて照合を行う
。この照合操作は、従来装置と同様にして行う。ただし
、前記従来fctの判定条件の定数は、m=3.n=2
.C=10.T==3とする。従来装置の照合と同様に
1文字位置毎に認識候補文字と照合対象単語とを照合す
ると、′水上”Iに対して pH=3.−氷島″霞に対
して P2 =’11.”’氷原”(1[Pjc対して
P5 = 11 トナQ、コの結果Pj = (j =
 1〜3)の中で最小値は、P1= 3 ((nXT 
(=8 ) )となまり、照合結果として正解の1水上
”か得られる。このように、従来製置では、単語辞書全
体の単語と照合を行ったのに対し、この発明では、y取
対象単語決定手段(6)により、3個の単語(1水上”
、@″水氷島。
Thus, the words to be read are @Minakami"1.@Suzuki"aS
, 1 Mizuhara"ae is determined.Now that the word to be read has been determined, the zero-order word determination means (5) performs a verification.This verification operation is performed in the same manner as in the conventional device.However, The constants for the judgment conditions of the conventional fct are m=3.n=2
.. C=10. Let T==3. Similar to the comparison with the conventional device, when the recognition candidate character is compared with the target word for each character position, for 'Mizukami' I, pH = 3. - for Hyojima' Kasumi, P2 = '11. ``Ice field'' (1 [Pjc vs. P5 = 11 Tona Q, Ko's result Pj = (j =
1 to 3), the minimum value is P1=3 ((nXT
(=8)), and the correct answer is 1 Mizukami" as a matching result. In this way, in the conventional system, matching was performed with words in the entire word dictionary, but in this invention, the correct answer is "1 Minakami". The word determination means (6) selects 3 words (1.
, @″Mizuhyo Island.

1水原”)に絞って照合を行うだけで正解か得られる。You can get the correct answer just by narrowing down your search to ``Suwon'').

次に、第5図中の帳票上に記入された1氷山”(9)の
認識候補文字a旧と対して、第7’llに示す単語辞書
(7)内の単語長2の単語との照合8.読取対象単語決
定手段(6)と、単語決定手段(5)を用いて行う場合
について説明する。
Next, for the recognition candidate character a old of ``1 iceberg'' (9) written on the form in Figure 5, the word length 2 in the word dictionary (7) shown in Figure 7'll is compared. Verification 8. A case in which the reading target word determining means (6) and the word determining means (5) are used will be described.

前記と同様に、まず読取対象単語決定手段(6)は。As above, first, the reading target word determining means (6).

第4図中の入力文字1永山1(9)の認識候補文字+1
1)中の第1文字目、第1位の認識候補文字“氷”を参
照するか、第1文字目か1氷”の単語は前記のとおり存
在しない。
Recognition candidate character for input character 1 Nagayama 1 (9) in Figure 4 +1
1) Referring to the first character, the first recognition candidate character "Ice", or the word "Ice" as the first character does not exist, as described above.

次に、第4図中の入力文字1永山m(9)の認識候補文
字aD中の第1文字目、第2位の認識候補文字1水′″
を参照すると、前記と同様に1水上”al。
Next, the first character in the recognition candidate character aD of the input character 1 Nagayama m (9) in Figure 4, and the second recognition candidate character 1 water'''
Referring to ``1 Minakami'' al., as above.

“水鳥”aS、′氷原”aeの3個の単語か抽出される
。さらに、前記と同様に、従来装置の判定条件の定数を
、m=3.n=2.O=10.T=3として照合すると
、′水上”Q4に対してP1=12゜1水島15に対し
て p2=i2.@氷原″(Lllに対してPs=12
となり、何れもP、1(j=1〜3)〉n XT (=
6 )であるので、照合結果は得られない。
Three words, “waterfowl” aS and “ice field” ae, are extracted.Furthermore, in the same way as above, the constants of the judgment conditions of the conventional device are set as m=3.n=2.O=10.T=3 When collated as follows, P1=12°1 for 'Mizukami' Q4, p2=i2 for Mizushima 15. @Iceland” (Ps=12 for Lll
Therefore, both P, 1 (j=1 to 3)〉n XT (=
6), no matching results can be obtained.

この場合には、インデックス鼎を用いて単語辞書を参照
せずに、従来製置と同様に、単語辞書(7)内の全単語
と照合を行う。菫た。この時1判定条件の定数は0m=
3.n=2.O=10.T=8として照合を行う。これ
により、従来製置の照合と同様の結果か得られ、@″水
井υに対してP1=20.″″永山(I3に対してP2
=11.@水上”Iに対してp3=12.−氷島#(L
9に対してP4=12.”’氷原″aeに対してPs=
12aなり。
In this case, all the words in the word dictionary (7) are compared using the index function without referring to the word dictionary, as in the conventional setting. Sumire. At this time, the constant of 1 judgment condition is 0m=
3. n=2. O=10. Verification is performed with T=8. As a result, the same results as the conventional comparison were obtained, and @"P1=20 for Mizui υ.""Nagayama (P2 for I3
=11. @Mizukami”p3=12 for I.-Ice Island #(L
P4=12 for 9. Ps= for ``'Ice Field'' ae
It's 12a.

P、1(j = 1〜5 )の中で最小値はP2 =1
1 [(nXT(=16))と求まり、照合結果として
正解あ“氷山″か得られる。
The minimum value among P, 1 (j = 1 to 5) is P2 = 1
1 [(nXT(=16)), and the correct answer is ``iceberg'' as a matching result.

このように、読取対象単語決定手段(6)を用いること
により、認識候補文字と単語辞書(7)内の単語とを照
付する際に、まず、照合成立の可能性の高い少数の単語
と照合を行い、照合か不成功となった時にのみ、単語辞
書(7)内の全単語と照合を行うので、単語の読取精度
を低下させることなく、高速に単語照合を行うことかで
きる。
In this way, by using the reading target word determining means (6), when matching recognition candidate characters with words in the word dictionary (7), it first selects a small number of words with a high possibility of matching. Verification is performed, and only when the verification is unsuccessful, all words in the word dictionary (7) are verified, so word verification can be performed at high speed without reducing word reading accuracy.

なお、上記実施例で42.単語の先頭1文字をキーとし
て読取対象単語を抽出したか、キーとなる文字の文字位
置・文字数は自由に設定できる。
In addition, in the above example, 42. Whether the word to be read is extracted using the first character of the word as a key, or the character position and number of key characters can be set freely.

(例えば、単語の先頭1文字と、最後の1文字のどちら
力1か、u踵候補文字中に含まれている単語を抽出する
等。) また、上記実施例では、漢字の姓について述べたか、カ
タカナ・英字などの単語1氏名・会社名などの単語に関
してもこの発明を適用することかできる。
(For example, whether the first character or the last character of a word is 1, extracting the words contained in the u-heel candidate characters, etc.) Also, in the above example, the surname of the kanji character was mentioned. This invention can also be applied to words such as 1, names, company names, etc., such as katakana and English characters.

〔発明の効果〕〔Effect of the invention〕

以上のように、この発明によれば、111!取対象単語
決定手段を設けたことにより、認識候補文字七単語辞書
内の全単語とを常時照合する必要かな(なり、単語の読
取精度を低下させることなく、高速に単語照合を行うこ
とができる。
As described above, according to this invention, 111! By providing a means for determining target words, it becomes necessary to constantly check all the words in the seven-word dictionary for recognition candidate characters (this makes it possible to perform word checking at high speed without reducing word reading accuracy. .

【図面の簡単な説明】[Brief explanation of the drawing]

第1図はこの発明の一実施例による単語読取装置の構成
図、第2図は従来の単語読取装置の構成図、第3図は単
語読取装置の帳票の一例を示す説明図、@4図及び第5
図は単語読取装置における入力文字及び認識候補文字の
一例を示す説BAV。 第6図は従来の単語読取装置における単語辞書内の単語
長側の単語列の一例を示す説明図、第7図はこの発明の
一実施例による単語読取装置における単語辞簀内の単語
側の一例を示すa側御である。 図中、(3)は文字認識手段、(5)は単語決定手段。 (6)は読取対象単語決定手段、(7)は単語辞書であ
る。 なお1図中、同一符号は同−又は相当部分を示す。
Fig. 1 is a block diagram of a word reading device according to an embodiment of the present invention, Fig. 2 is a block diagram of a conventional word reading device, and Fig. 3 is an explanatory diagram showing an example of a form of the word reading device. and fifth
The figure shows an example BAV of input characters and recognition candidate characters in a word reading device. FIG. 6 is an explanatory diagram showing an example of a word string on the word length side in a word dictionary in a conventional word reading device, and FIG. This is an example of A-side control. In the figure, (3) is a character recognition means, and (5) is a word determination means. (6) is a reading target word determining means, and (7) is a word dictionary. In addition, in FIG. 1, the same reference numerals indicate the same or corresponding parts.

Claims (1)

【特許請求の範囲】[Claims] 単語を構成する文字を認識して読取る単語読取装置にお
いて、入力文字として類似度の高い順に順序づけされた
複数の認識候補文字を入力文字毎に選択する文字認識手
段と、帳票上に記入される単語を格納した単語辞書と、
前記認識候補文字と前記単語辞書を照合して単語を決定
する単語決定手段と、単語辞書内の単語の中で特定の文
字位置の文字が前記認識候補文字内に含まれる単語を読
取対象単語として単語辞書から読み出す読取対象単語決
定手段とを具備し、まず、前記認識候補文字と前記読取
対象単語決定手段で読み出された単語とを照合し、この
段階で単語が決定できない場合に限り、前記認識候補文
字と単語辞書内の全単語とを照合して単語を決定するよ
うに構成したことを特徴とする単語読取装置。
In a word reading device that recognizes and reads characters constituting a word, a character recognition means selects, for each input character, a plurality of recognition candidate characters ordered in descending order of similarity as input characters, and a word written on a form. A word dictionary that stores
a word determining means for determining a word by comparing the recognition candidate characters with the word dictionary; and a word in the word dictionary in which a character at a specific character position is included in the recognition candidate characters as a reading target word. a reading target word determining means that reads out a word from a word dictionary; first, the recognition candidate character is compared with the word read by the reading target word determining means, and only if a word cannot be determined at this stage; A word reading device characterized in that it is configured to determine a word by comparing recognition candidate characters with all words in a word dictionary.
JP62255167A 1987-10-09 1987-10-09 Word reader Pending JPH0196779A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP62255167A JPH0196779A (en) 1987-10-09 1987-10-09 Word reader

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP62255167A JPH0196779A (en) 1987-10-09 1987-10-09 Word reader

Publications (1)

Publication Number Publication Date
JPH0196779A true JPH0196779A (en) 1989-04-14

Family

ID=17274989

Family Applications (1)

Application Number Title Priority Date Filing Date
JP62255167A Pending JPH0196779A (en) 1987-10-09 1987-10-09 Word reader

Country Status (1)

Country Link
JP (1) JPH0196779A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6697511B1 (en) * 1999-05-13 2004-02-24 Currency Systems International Partial read confirmation method
CN104215265A (en) * 2014-08-18 2014-12-17 中国空间技术研究院 PSD (position sensitive detector) signal noise reduction method with high precision

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6697511B1 (en) * 1999-05-13 2004-02-24 Currency Systems International Partial read confirmation method
CN104215265A (en) * 2014-08-18 2014-12-17 中国空间技术研究院 PSD (position sensitive detector) signal noise reduction method with high precision

Similar Documents

Publication Publication Date Title
US20200143575A1 (en) Method and device for displaying explanation of reference numeral in patent drawing image using artificial intelligence technology based machine learning
US20230342400A1 (en) Document search device, document search program, and document search method
JPH0520500A (en) Document recognizing device
JPH0196779A (en) Word reader
JP3979288B2 (en) Document search apparatus and document search program
JPH11219367A (en) Connection processing method and device for different kinds of data by address information
KR102550868B1 (en) verification system for achievements of faculty
JPH0256086A (en) Method for postprocessing for character recognition
JPS63138479A (en) Character recognizing device
JP2982244B2 (en) Character recognition post-processing method
JP2746345B2 (en) Post-processing method for character recognition
JPH0340434B2 (en)
JPS63233489A (en) Character reader
JPS63188284A (en) Character reader
JPS61114388A (en) Character input device
JPS6061875A (en) Generation system of standard pattern
JPH0338787A (en) Character recognition processor
JP2001266074A (en) Device for recognizing character
JPH06180767A (en) Character recognizing device
JPH11120294A (en) Character recognition device and medium
JPH05108703A (en) Machine translator
JPH0652367A (en) Post-processing method for character recognition result
JPH0436885A (en) Optical character reader
JPH04242491A (en) Optical character reader
JPH05189621A (en) On-line hand-written character recognizing device