JPS5839378A

JPS5839378A - Post processing system for character recognition

Info

Publication number: JPS5839378A
Application number: JP56136144A
Authority: JP
Inventors: Hideaki Sugawara; 菅原　秀明; Eiichiro Yamamoto; 山本　栄一郎
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1981-09-01
Filing date: 1981-09-01
Publication date: 1983-03-08
Also published as: JPH0119195B2

Abstract

PURPOSE: To perform accurate post-processing by weighting candidate characters according to their rank, matching them against a word dictionary, and finding the best-matching word. CONSTITUTION: Candidate characters recognized and output by a recognition part 1 are output to a character matrix register 5 in order of recognition rank. A matching circuit 7 collates words read out of a word dictionary 8 with the 1st- to 5th-rank candidate characters held in the register 5, and when a character of a word matches one of the candidate characters, it outputs a matching level corresponding to that candidate's recognition rank. A matching result output register 9 holds the matching levels (degrees of coincidence) output from the circuit 7, word by word. A result decision circuit 10 outputs the best-matching word to an output register 11 on the basis of the matching results from the circuit 7.

Description

DETAILED DESCRIPTION OF THE INVENTION The present invention relates to a character recognition post-processing system. In particular, it relates to a system in which input characters entered through a character reading means are first recognized against a character dictionary (for example, a kanji dictionary), and the recognition results are then weighted according to their rank and matched against a word dictionary, so that the input word can be recognized accurately.

In a conventional character recognition system, as shown for example in Fig. 1, recognition unit 1 extracts the features of each input character, compares them with a file, and outputs the candidate with the highest recognition rank to output register 2. Then, as character recognition post-processing, when it is known in advance that the three characters output to output register 2 represent a prefecture name, matching circuit 4 compares these characters in sequence with prefecture dictionary 3 so that the input characters are recognized accurately.

That is, in Fig. 1, a data entry form (not shown) with three characters written in its prefecture-name field is read by, for example, an OCR device (not shown). From the resulting data, recognition unit 1 extracts features for each character and outputs the highest-ranked candidates "宮", "埼", and "県" to output register 2. Matching circuit 4 compares these in sequence with the prefecture names set in prefecture dictionary 3 and outputs the prefecture name with the highest degree of match as the read output.

With this post-processing system, however, when recognition unit 1 outputs "宮", "埼", "県" as in Fig. 1 and these are matched against the prefecture names, two names, "宮崎県" (Miyazaki Prefecture) and "宮城県" (Miyagi Prefecture), come out with the same priority, and the system could not automatically select one of them.
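The tie described above can be reproduced with a minimal sketch. The three-entry dictionary and the helper `match_count` are illustrative assumptions, not part of the patent; the score is simply the number of positions at which a dictionary word agrees with the top-ranked recognition output.

```python
# Top-ranked recognition output from Fig. 1 and an illustrative dictionary subset.
recognized = "宮埼県"
dictionary = ["宮城県", "秋田県", "宮崎県"]

def match_count(word, text):
    """Count positions where the dictionary word equals the recognized text."""
    return sum(w == t for w, t in zip(word, text))

scores = {word: match_count(word, recognized) for word in dictionary}
best = max(scores.values())
tied = [word for word, score in scores.items() if score == best]

print(scores)  # per-word match counts
print(tied)    # words sharing the best count
```

Both 宮城県 and 宮崎県 agree with the output in two of three positions, so a top-1 comparison alone gives no basis for choosing between them.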

For this reason, as shown in Fig. 2, it has been proposed that the recognition unit output candidate characters of several ranks when it recognizes an input character. Suppose the recognition unit recognizes a three-character prefecture name and, as shown in Fig. 2, the candidates for the first character are "科" (first rank), "秩" (second rank), "秋" (third rank), "材" (fourth rank), and "林" (fifth rank); the first- to fifth-rank candidates for the second character are "田", "内", "口", "円", "由"; and those for the third character are "具", "県", "目", "且", "旦". Each of these candidate characters is compared in sequence with the prefecture names. First, "北海道" (Hokkaido) is read out of prefecture dictionary 3, and its first character "北" is compared with "科, 秩, 秋, 材, 林", its second character "海" with "田, 内, 口, 円, 由", and its third character "道" with "具, 県, 目, 且, 旦"; none of them match. Next, the same matching is performed for the second word, "青森県" (Aomori Prefecture); here the third character "県" matches the second-rank candidate "県" when compared with "具, 県, 目, 且, 旦". Then, for the third word, "秋田県" (Akita Prefecture), the first character "秋" matches one of "科, 秩, 秋, 材, 林", the second character "田" matches one of "田, 内, 口, 円, 由", and the third character "県" matches one of "具, 県, 目, 且, 旦".

Thus, for "秋田県" every character matches one of the candidate characters, so "秋田県", which has the best degree of matching, is output as the read result.

However, when candidate characters of several ranks are simply compared in this way, the read output sometimes cannot be narrowed down to a single word, as in the example of Fig. 3. There, the first- to fifth-rank candidates for the first character are "宮, 官, 富, 呂, 宙", those for the second character are "埼, 崎, 峠, 城, 地", and those for the third character are "県, 具, 目, 且, 旦". When these are output from character matrix register 5 to rank register 6, three characters at a time for each rank, and compared with the words of prefecture dictionary 3 in matching circuit 4, "宮崎県" and "宮城県" have the same degree of matching and cannot be distinguished.
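The failure of unweighted multi-rank comparison can be sketched as follows, using the Fig. 3 candidates quoted above. The function name `positions_matched` is an illustrative assumption; it counts a position as matched whenever the dictionary character appears anywhere among that position's five candidates, regardless of rank.

```python
# Fig. 3 candidates: one string per character position, ranks 1..5 left to right.
candidates = ["宮官富呂宙", "埼崎峠城地", "県具目且旦"]

def positions_matched(word, candidates):
    """Count positions whose dictionary character appears among the candidates."""
    return sum(ch in cands for ch, cands in zip(word, candidates))

miyazaki = positions_matched("宮崎県", candidates)
miyagi = positions_matched("宮城県", candidates)
print(miyazaki, miyagi)
```

Because rank is ignored, "崎" (second rank) and "城" (fourth rank) count equally, and both words match in all three positions.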

To remedy this problem, the object of the present invention is to provide a character recognition post-processing system in which the candidate outputs from the recognition unit are weighted according to their rank before being matched against a word dictionary, so that the word that matches at the highest ranks is found. To this end, the character recognition post-processing system of the invention comprises character recognition means for recognizing read characters, word holding means in which words are held, and matching means for detecting that the characters recognized by the character recognition means match a word held in the word holding means. The character recognition means outputs recognized-character candidates of a plurality of ranks, and the matching means matches the words held in the word holding means against these candidates. Weighting output means is provided which, when one character of the recognized-character candidates matches one character of a word, assigns and outputs a weight according to that candidate's recognition rank, so that a matching rank degree reflecting the recognition ranks is obtained. The system is characterized in that the word with the highest matching rank degree is selected and output.

An embodiment of the present invention is described below with reference to Fig. 4.

In Fig. 4, the same reference numerals as in the other figures denote the same parts; 7 is a matching circuit, 8 is a word dictionary, 9 is a matching result output register, 10 is a result determination circuit, and 11 is an output register.

Matching circuit 7 compares each word read out of word dictionary 8 with the first- to fifth-rank recognition candidate characters that recognition unit 1 has output to character matrix register 5. When a matching character exists, it outputs a value according to that candidate's recognition rank: for example, "0" for a match with the first-rank candidate, "1" for the second rank, "2" for the third rank, "3" for the fourth rank, and "4" for the fifth rank. When the character matches none of the first- to fifth-rank candidates, it outputs "5".
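The rank-to-value mapping just described can be sketched as a small function. The name `match_level` and the representation of one position's candidates as a five-character string are assumptions made for illustration; the output values 0 to 5 follow the example in the text.

```python
NO_MATCH = 5  # value output when none of the five candidates matches

def match_level(char, ranked_candidates):
    """Return 0 for a rank-1 match, 1 for rank 2, ..., 4 for rank 5, else 5."""
    for level, candidate in enumerate(ranked_candidates[:5]):
        if candidate == char:
            return level
    return NO_MATCH

first_position = "科秩秋材林"  # ranks 1..5 for the first character (Fig. 2)
print(match_level("秋", first_position))
print(match_level("北", first_position))
```

Lower values mean the dictionary character was found at a higher recognition rank.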

Word dictionary 8 is a file of several classified word lists needed for post-processing, for example a word list of prefecture names and, for each prefecture, a word list such as the names of the districts, cities, towns, and villages in Akita Prefecture. In response to the per-word control signal C8 from matching circuit 7, it outputs the words of a given classification one after another in a fixed order.

Matching result output register 9 is a register that holds, word by word, the degree of match between the candidate characters set in character matrix register 5 and each word output from word dictionary 8.

Result determination circuit 10 selects and outputs the word with the highest degree of matching among the results of the matching performed by matching circuit 7.

Next, the operation of the arrangement in Fig. 4 is described.

(1) The recognition candidate characters output from recognition unit 1 are output to character matrix register 5 in order of recognition rank. For example, the first- to fifth-rank candidates "科, 秩, 秋, 材, 林" are output for the first character, "田, 内, 口, 円, 由" for the second character, and "具, 県, 目, 且, 旦" for the third character.

Since it is known in advance that the output of recognition unit 1 is a prefecture name, the word list file for prefecture names is read out of word dictionary 8 word by word. First, "北海道" is read out in response to the per-word control signal from matching circuit 7. Then, in response to sequence control signal C1-0 from matching circuit 7, the first-rank string "科田具" is set in rank register 6 and compared with "北海道"; the two agree only in having no fourth character, and nothing else matches. Next, in response to sequence control signal C1-1 from matching circuit 7, the second-rank string "秩内県" is set in rank register 6 and likewise compared with "北海道". In the same way, sequence control signals C1-2 to C1-4 from matching circuit 7 set the third-rank string "秋口目", the fourth-rank string "材円且", and the fifth-rank string "林由旦" in rank register 6 in turn, and each is matched against "北海道". None of the characters match; the only agreement is the absence of a fourth character. Consequently, "0" is written in (4) of section 1 of matching result output register 9, and "5" is written in (1) to (3) of section 1.

(2) When matching against the first word "北海道" has finished in this way, matching circuit 7 outputs the per-word control signal so that the second word, "青森県", is read out. It then outputs sequence control signals C1-0 to C1-4, setting the first-rank string "科田具" through the fifth-rank string "林由旦" in rank register 6 in turn, and matches each against "青森県".

This time there is agreement on two points: the "県" in the second-rank string "秩内県", and the absence of a fourth character. Matching circuit 7 therefore writes "0" in (4), "1" in (3), and "5" in each of (2) and (1) of section 2 of matching result output register 9.

(3) Next, matching circuit 7 uses the per-word control signal to have the third word, "秋田県", read out, and then, as in (1) and (2), sets "科田具" through "林由旦" in rank register 6 in turn and compares each with "秋田県". In this case matches are obtained for "田" in the first-rank string "科田具", "県" in the second-rank string "秩内県", and "秋" in the third-rank string "秋口目", as well as for the absence of a fourth character. Accordingly, "0" is written in (2) and (4) of section 3 of matching result output register 9, "1" in (3), and "2" in (1).

(4) When matching against all prefecture names has finished in this way, result determination circuit 10 finds the section of matching result output register 9 with the smallest total score and selects and outputs it as the word with the highest degree of matching.

In this case section 3 has the smallest total score, 3, so the third prefecture name, "秋田県", is output to output register 11 as the final read output. In this way "秋田県" can be extracted accurately by post-processing.
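Steps (1) to (4) can be traced with a short sketch. The four-slot section layout, with `None` standing for the empty fourth character position, and the names `pad`, `section`, and `WIDTH` are illustrative assumptions; the candidates and the three dictionary words are those quoted in the text. The totals 15 and 11 for 北海道 and 青森県 are not stated in the text but follow from the slot values it gives for sections 1 and 2.

```python
MATRIX = ["科秩秋材林", "田内口円由", "具県目且旦"]  # ranks 1..5 per position
WORDS = ["北海道", "青森県", "秋田県"]
WIDTH = 4      # slots (1)..(4) of one register section
NO_MATCH = 5

def pad(chars, width=WIDTH):
    """Pad a word or rank string with None for missing character positions."""
    return list(chars) + [None] * (width - len(chars))

def section(word):
    """Return the slot values (1)..(4) written for one dictionary word."""
    # zip(*MATRIX) yields the rank strings: 科田具, 秩内県, 秋口目, 材円且, 林由旦.
    rank_strings = [pad(rank) for rank in zip(*MATRIX)]
    slots = [NO_MATCH] * WIDTH
    for rank, candidate in enumerate(rank_strings):
        for pos, (w, c) in enumerate(zip(pad(word), candidate)):
            if w == c:
                slots[pos] = min(slots[pos], rank)
    return slots

totals = {word: sum(section(word)) for word in WORDS}
best = min(totals, key=totals.get)
print(totals)
print(best)
```

The smallest total belongs to 秋田県, which agrees with the result stated for section 3.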

Alternatively, as shown in Fig. 5, result determination circuit 10' may be provided with a first input register 12, a second input register 13, and a comparison control unit 14. The matching state of each section from matching circuit 7 is input to first input register 12 and compared with the matching state of the section already held in second input register 13. If the matching degree newly delivered to first input register 12 is greater (with the scoring of Fig. 4, if its total score is smaller), it is written into second input register 13; if it is smaller, the matching degree for the next word is simply input to first input register 12. With this configuration, matching result output register 9 of Fig. 4 becomes unnecessary, and even a large number of words read out of the word dictionary for matching can be handled with a simple configuration.
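The two-register arrangement of Fig. 5 amounts to keeping a running best while the words stream past. A minimal sketch follows; the function name `select_streaming` and the (word, total score) pairs are illustrative assumptions, and keeping the earlier word on a tie is a choice made here, since the text does not say how ties are handled.

```python
def select_streaming(scored_words):
    """Keep only the best (word, total) pair seen so far; lower total is better."""
    held = None  # plays the role of second input register 13
    for incoming in scored_words:  # each item plays the role of first input register 12
        if held is None or incoming[1] < held[1]:
            held = incoming  # the new word matches better, so it replaces the held one
    return held

# Total scores for the three words of the Fig. 4 walk-through.
stream = [("北海道", 15), ("青森県", 11), ("秋田県", 3)]
print(select_streaming(stream))
```

Only one held result is stored regardless of dictionary size, which is why a per-word result register is no longer needed.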

Thus, according to the present invention, as shown in Fig. 6, even when "宮埼県" through "宙地旦" are output to character matrix register 5 as the first- to fifth-rank strings, matching circuit 7 enters a total score of "1" in section 9-0 of the register for "宮崎県", whereas for "宮城県" a total score of "3" is likewise entered in its own section. As a result, "宮崎県" is output as the post-processing result.
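The two totals quoted for Fig. 6 can be checked with a compact sketch. The helper `total` is an illustrative assumption; it uses the 0 to 5 values of the embodiment and omits the empty fourth slot, which contributes 0 for every three-character word.

```python
MATRIX = ["宮官富呂宙", "埼崎峠城地", "県具目且旦"]  # ranks 1..5 per position

def total(word, matrix=MATRIX, no_match=5):
    """Sum the rank-based values (0 for rank 1 ... 4 for rank 5, 5 for no match)."""
    return sum(
        cands.index(ch) if ch in cands else no_match
        for ch, cands in zip(word, matrix)
    )

print(total("宮崎県"))  # 宮 rank 1, 崎 rank 2, 県 rank 1
print(total("宮城県"))  # 宮 rank 1, 城 rank 4, 県 rank 1
```

The difference comes entirely from the second character: "崎" is the second-rank candidate while "城" is only the fourth.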

Stated generally, as shown in Fig. 8, let the input word be $L_1, L_2, \ldots, L_n$ (a word made up of $n$ characters), and let the recognition candidates for $L_i$ be $L_i^{(1)}, L_i^{(2)}, \ldots, L_i^{(5)}$ (when the first to fifth ranks are adopted as recognition candidates). Let the weights for the first to fifth ranks be $W^{(1)}$ to $W^{(5)}$ and the weight for the sixth rank and below be $W^{(6)}$, with $W^{(1)} < W^{(2)} < \cdots < W^{(6)}$ and with $W^{(1)}$ to $W^{(5)}$ varying roughly linearly. The dissimilarity $D$ between a standard word $S_1, S_2, \ldots, S_n$ registered in the word dictionary (corresponding to 北海道, 青森県, and so on) and the input word $L_1, L_2, \ldots, L_n$ is expressed as the sum of the per-character dissimilarities. For example, for $S_1$, if a match is obtained with recognition candidate $L_1^{(4)}$, the dissimilarity $d(S_1, L_1)$ is $W^{(4)}$; if no recognition candidate matches, the dissimilarity is $W^{(6)}$. The dissimilarity of the word is therefore

$$D = \sum_{i=1}^{n} d(S_i, L_i),$$

and the word in the word dictionary for which $D$ is smallest is judged to be the correct one.
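As a worked instance of this sum, take the candidates of Fig. 2 and the dictionary word 秋田県, and assume the weights of the embodiment, that is, a weight of $k-1$ for a match at rank $k$ ($k = 1, \ldots, 5$) and 5 for no match. The superscript notation $W^{(k)}$ for the rank-$k$ weight is an assumption introduced here for readability.

```latex
% Assumed weights: W^{(k)} = k - 1 for k = 1..5, and W^{(6)} = 5 for no match.
% 秋 is the 3rd-rank candidate, 田 the 1st-rank, 県 the 2nd-rank.
\begin{align*}
D(\text{秋田県}) &= d(\text{秋}, L_1) + d(\text{田}, L_2) + d(\text{県}, L_3) \\
                 &= W^{(3)} + W^{(1)} + W^{(2)} \\
                 &= 2 + 0 + 1 = 3
\end{align*}
```

This agrees with the total score of 3 obtained for 秋田県 in the embodiment.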

The above description dealt with an example in which candidate characters up to the fifth rank are selected, but the invention is not limited to this. Likewise, the description dealt with an example in which higher ranks are given smaller weights, but the same holds in the reverse case.

As described above, according to the present invention, candidate characters are weighted according to their candidate rank and the sum of these weights is obtained for each word, so that unambiguous post-processing can be performed.

[Brief explanation of drawings]

Figs. 1 to 3 are explanatory diagrams of conventional post-processing, Fig. 4 is a block diagram of an embodiment of the present invention, Fig. 5 shows another embodiment of its result determination circuit, and Figs. 6 to 8 are diagrams explaining the present invention in detail. In the figures, 1 is the recognition unit, 2 is the output register, 3 is the prefecture dictionary, 4 is the matching circuit, 5 is the character matrix register, 6 is the rank register, 7 is the matching circuit, 8 is the word dictionary, 9 is the matching result output register, 10 is the result determination circuit, and 11 is the output register. Patent applicant: Fujitsu Ltd. Agent: patent attorney 山谷晧榮 (Akira Yamatani)

Claims

[Claims]

(1) A character recognition post-processing system comprising character recognition means for recognizing read characters, word holding means in which words are held, and matching means for detecting that characters recognized by the character recognition means match a word held in the word holding means, characterized in that: the character recognition means outputs recognized-character candidates of a plurality of ranks; the matching means matches the words held in the word holding means against the recognized-character candidates of the plurality of ranks; and weighting output means is provided which, when one character of the recognized-character candidates matches one character of the word, assigns and outputs a weight according to its recognition rank, so that a matching rank degree corresponding to the recognition rank is obtained and the word with the highest matching rank degree is selected and output.