JPH0696287A - Word collation pre-processing system - Google Patents
Info
- Publication number
- JPH0696287A
- Authority
- JP
- Japan
- Prior art keywords
- character
- word
- candidate
- collation
- appearance
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Character Discrimination (AREA)
Description
[0001]
[Field of Industrial Application] The present invention relates to a word collation pre-processing system, and more particularly to a word collation pre-processing system that performs pre-processing for the word collation carried out after recognition in a character recognition system.
[0002]
[Prior Art] In a conventional character recognition system, a plurality of candidate characters are designated for each character of the word to be recognized. When word collation is performed on the resulting character strings, the word dictionary is searched and collated while the combinations of all the given candidate characters are varied exhaustively.
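The exhaustive collation described above can be sketched roughly as follows. This is only an illustration: the function name `brute_force_collate`, the use of a Python set as the word dictionary, and the sample data are assumptions, not details given in the patent.

```python
from itertools import product


def brute_force_collate(candidates, dictionary):
    """Try every combination of per-position candidates against the dictionary.

    candidates: list of lists, one list of candidate characters per position.
    dictionary: set of known words (a stand-in for the word dictionary).
    Returns the matching words and the number of dictionary lookups made.
    """
    matches = []
    lookups = 0
    for combo in product(*candidates):
        lookups += 1
        word = "".join(combo)
        if word in dictionary:
            matches.append(word)
    return matches, lookups


# Hypothetical data: three positions with three candidates each.
candidates = [["c", "e", "o"], ["a", "o", "u"], ["t", "l", "f"]]
dictionary = {"cat", "cot", "oaf", "dog"}

matches, lookups = brute_force_collate(candidates, dictionary)
print(matches, lookups)  # ['cat', 'cot', 'oaf'] 27
```

Every one of the 27 combinations costs a lookup, even those built from characters that occur in no dictionary word.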
[0003]
[Problems to Be Solved by the Invention] In this conventional word collation method, the words in the word dictionary are collated against exhaustive combinations of all the candidate characters given for each character of the word to be recognized. The dictionary is therefore searched even for candidate characters that cannot occur in it in the first place, and a drop in collation efficiency is unavoidable.
[0004] An object of the present invention is to eliminate the above drawback and to provide a word collation pre-processing system that performs pre-processing restricting the candidate characters to only those characters that can appear in the character string, and that can thereby markedly improve collation efficiency.
[0005]
[Means for Solving the Problems] The word collation pre-processing system of the present invention comprises: appearing-word registration means that restricts, as appearing words, the words that are expressed as character strings containing a plurality of characters and that may appear as recognition targets, and registers them in advance; and word collation means that, prior to word collation, applies pre-processing that deletes, from the plurality of candidate characters for each character of the character string, those not present in the character group formed by the appearing words, and then performs word collation.
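As a minimal sketch of the registration side, the "character group formed by the appearing words" can be modeled as the set of all characters occurring in any registered word. The function name `register_appearing_words` and the position-independent set representation are assumptions; the text does not say how the character group is stored.

```python
def register_appearing_words(words):
    """Collect every character that occurs in any registered appearing word.

    Assumption: the character group is a plain, position-independent set.
    """
    appearing_chars = set()
    for word in words:
        appearing_chars.update(word)
    return appearing_chars


# Hypothetical appearing words that are expected as recognition targets.
appearing_chars = register_appearing_words(["TOKYO", "OSAKA", "KYOTO"])
print(sorted(appearing_chars))  # ['A', 'K', 'O', 'S', 'T', 'Y']
```

A set gives constant-time membership tests, which suits a step that runs once per candidate character before collation.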
[0006]
[Embodiment] Next, the present invention will be described with reference to the drawings.
[0007] FIG. 1 is a configuration diagram of an embodiment of the present invention.
[0008] The character recognition system 1 of the embodiment shown in FIG. 1 comprises an appearing-word registration unit 2 and an appearing-character management table 6, which constitute the appearing-word registration means, and a character recognition unit 3, a word collation pre-processing unit 4, and a word collation unit 5, which constitute the word collation means. FIG. 1 also shows a word dictionary 7, an image data character string 101, candidate characters 102, limited candidate characters 103, and a candidate word 104.
[0009] Next, the operation of this embodiment will be described.
[0010] The character recognition unit 3 performs character recognition on the image data character string 101 and designates candidate characters 102 for each character. In this embodiment, eight candidates are designated per character.
[0011] In a conventional character recognition system, the word collation unit 5 performed word collation directly on these candidate characters 102 while referring to the word dictionary, and thereby determined the candidate word 104.
[0012] In this embodiment, the words that can occur in the image data character string 101 are registered in advance by the appearing-word registration unit 2, so that the characters that can appear in the image data character string 101 are accumulated in the appearing-character management table 6. The word collation pre-processing unit 4 refers to this appearing-character management table 6, deletes from the candidate characters 102 any character that is absent from it, that is, any character that cannot appear, and designates the limited candidate characters 103. As a result, in this embodiment the candidates for every character are narrowed to two or three.
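The pre-processing step can be sketched as a simple filter over the per-character candidate lists. The function name `limit_candidates`, the set used in place of the appearing-character management table, and the sample candidates are all hypothetical; the patent gives no concrete data.

```python
def limit_candidates(candidates, appearing_chars):
    """Drop candidates that are absent from the appearing-character table.

    candidates: list of lists, one list of candidate characters per position.
    appearing_chars: set of characters that occur in the registered words.
    The original candidate order within each position is preserved.
    """
    return [
        [ch for ch in position if ch in appearing_chars]
        for position in candidates
    ]


# Hypothetical table built from the words TOKYO, OSAKA and KYOTO.
appearing_chars = {"T", "O", "K", "Y", "S", "A"}

# Eight hypothetical candidates per character, as in the embodiment.
candidates = [
    ["T", "I", "1", "7", "Y", "J", "L", "F"],
    ["O", "0", "Q", "D", "C", "U", "G", "A"],
    ["K", "X", "H", "R", "Y", "V", "N", "A"],
]

limited = limit_candidates(candidates, appearing_chars)
print(limited)  # [['T', 'Y'], ['O', 'A'], ['K', 'Y', 'A']]
```

If every candidate for some position is removed, that position ends up empty and no word can be formed; the text does not describe how this case is handled, so the sketch leaves it to the caller.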
[0013] The word collation unit 5 performs collation on the candidate characters limited in this way, and can therefore collate with markedly higher efficiency than the conventional method.
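To give a rough sense of the saving, the number of combinations an exhaustive collation must try is the product of the candidate counts per character. The sketch below compares eight candidates per character with two or three. The four-character word length and the function name `combination_count` are assumptions made only for illustration.

```python
from math import prod


def combination_count(candidates_per_position):
    """Number of character combinations an exhaustive collation would try."""
    return prod(candidates_per_position)


word_length = 4  # assumed word length, not stated in the patent

before = combination_count([8] * word_length)    # eight candidates each
after_3 = combination_count([3] * word_length)   # limited to three each
after_2 = combination_count([2] * word_length)   # limited to two each

print(before, after_3, after_2)  # 4096 81 16
```

Under these assumptions the search space shrinks by a factor of roughly 50 to 250, and the gap widens as words get longer.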
[0014]
[Effects of the Invention] As described above, in a character recognition system in which a plurality of candidate characters are designated for each character of the character string forming a word, the present invention limits the candidate characters to only those characters that can appear in the character string to be recognized, and thereby has the effect of markedly improving word collation efficiency.
[FIG. 1] A configuration diagram of an embodiment of the present invention.
1 Character recognition system
2 Appearing-word registration unit
3 Character recognition unit
4 Word collation pre-processing unit
5 Word collation unit
6 Appearing-character management table
7 Word dictionary
Claims (1)
1. A word collation pre-processing system comprising: appearing-word registration means for restricting, as appearing words, words that are expressed as character strings containing a plurality of characters and that may appear as recognition targets, and registering them in advance; and word collation means for performing word collation after applying, prior to the word collation, pre-processing that deletes, from the plurality of candidate characters for each character of the character string, those not present in the character group formed by the appearing words.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP4246996A JPH0696287A (en) | 1992-09-17 | 1992-09-17 | Word collation pre-processing system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP4246996A JPH0696287A (en) | 1992-09-17 | 1992-09-17 | Word collation pre-processing system |
Publications (1)
Publication Number | Publication Date |
---|---|
JPH0696287A (en) | 1994-04-08 |
Family
ID=17156827
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP4246996A (Pending) JPH0696287A (en) | 1992-09-17 | 1992-09-17 | Word collation pre-processing system |
Country Status (1)
Country | Link |
---|---|
JP (1) | JPH0696287A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2010009440A (en) * | 2008-06-30 | 2010-01-14 | Fujitsu Frontech Ltd | Character recognition program, character recognition apparatus, and character recognition method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JPH01246678A (en) | Pattern recognizing device | |
JPH08235341A (en) | Method and device for document filing | |
JPH0696287A (en) | Word collation pre-processing system | |
CN116229484A (en) | Text recognition method, list scanning method and device | |
JPH09204492A (en) | Slip processor | |
RU2251736C2 (en) | Method for identification of crossed symbols during recognition of hand-written text | |
JP3725635B2 (en) | Character recognition method and apparatus | |
KR100421683B1 (en) | Person identifying method using image information | |
JP2746345B2 (en) | Post-processing method for character recognition | |
JPH09128484A (en) | Character recognizing method | |
JPH03160585A (en) | Character recognizing method | |
JP3115139B2 (en) | Character extraction method | |
KR100473660B1 (en) | Word recognition method | |
JP3151866B2 (en) | English character recognition method | |
JPS60173688A (en) | Pattern processing device | |
JPH09179935A (en) | Character recognition device and control method therefor | |
JPH0634259B2 (en) | Character recognition device | |
JPH076213A (en) | Character string recognition device | |
JPH10334190A (en) | Character recognition method and device and recording medium | |
JP2905334B2 (en) | Online handwritten character recognition dictionary creation method and online handwritten character recognition dictionary creation device | |
JPS63100584A (en) | Character recognition processing system | |
JPS6295687A (en) | Character recognizing system | |
JPH0844829A (en) | Input pattern/character string registering method for character recognizing device | |
JPS5668879A (en) | Real-time hand-written character recognition system | |
JPH0348379A (en) | Character recognizing system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
1999-05-25 | A02 | Decision of refusal | Free format text: JAPANESE INTERMEDIATE CODE: A02; effective date: 1999-05-25 |