JPH0696287A - Word collation pre-processing system - Google Patents

Word collation pre-processing system

Info

Publication number
JPH0696287A
JPH0696287A JP4246996A JP24699692A JPH0696287A JP H0696287 A JPH0696287 A JP H0696287A JP 4246996 A JP4246996 A JP 4246996A JP 24699692 A JP24699692 A JP 24699692A JP H0696287 A JPH0696287 A JP H0696287A
Authority
JP
Japan
Prior art keywords
character
word
candidate
collation
appearance
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP4246996A
Other languages
Japanese (ja)
Inventor
Shinobu Sasaki
忍 佐々木
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to JP4246996A priority Critical patent/JPH0696287A/en
Publication of JPH0696287A publication Critical patent/JPH0696287A/en
Pending legal-status Critical Current

Links

Landscapes

  • Character Discrimination (AREA)

Abstract

PURPOSE:To efficiently execute a collation by excludng a character which cannot exist as a correct solution from a reference object at the time of collating a word, in a character recognizing system for designating plural candidate characters at every character of a character-string for forming a word. CONSTITUTION:A character recognizing system 1 registers a word which can appear in an appearance word registering part 2 in advance, and accumulates it in an appearance character management table 6. At the time of collating a word, with respect to a character candidate 102 obtained as a result of recognition of a character recognizing part 3, a word collation pre-processing part 4 refers to the appearance character management table 6 and deletes the character candidate which cannot appear, and designates a limited candidate character 103. A word collating part 5 executes a word collation processing with regard to this limited candidate character.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】本発明は単語照合前処理方式に関
し、特に文字認識システムにおける認識後の単語照合の
前処理を行なう単語照合前処理方式に関する。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a word collation preprocessing system, and more particularly to a word collation preprocessing system for preprocessing post-recognition word collation in a character recognition system.

【0002】[0002]

【従来の技術】従来の文字認識システムでは、認識対象
とする単語の含む文字ごとに複数の候補文字が指定さ
れ、複数の文字列を対象として単語照合を行う場合に
は、与えられたすべての候補文字を総当り的に組み合せ
を変えながら単語辞書を検索し、照合を行っている。
2. Description of the Related Art In a conventional character recognition system, a plurality of candidate characters are designated for each character included in a word to be recognized, and when performing word matching on a plurality of character strings, all of the given characters are given. The word dictionary is searched and collated while changing the combination of candidate characters in a brute-force manner.

【0003】[0003]

【発明が解決しようとする課題】この従来の単語照合方
式では、認識対象とする単語の含む各文字に対して与え
られた複数の候補文字すべてについて単語辞書の単語と
の総当り的組合せによる照合動作を行なうため、もとも
と単語辞書内に存在しないはずの候補文字についても単
語辞書検索を行うことになり、照合効率が低下すること
が避けられないという欠点があった。
According to this conventional word matching method, all candidate characters given to each character included in a word to be recognized are matched with a word in a word dictionary by a brute force combination. Since the operation is performed, the word dictionary is searched for the candidate character that should not originally exist in the word dictionary, and there is a drawback that the collation efficiency is unavoidably deteriorated.

【0004】本発明の目的は上述した欠点を除去し、文
字列とし出現しうる文字のみに候補文字を限定した前処
理を行ない、照合効率を著しく改善しうる単語照合前処
理方式を提供することにある。
An object of the present invention is to eliminate the above-mentioned drawbacks and provide a word matching pre-processing method capable of performing a pre-processing in which candidate characters are limited to only those characters that can appear as a character string and remarkably improving the matching efficiency. It is in.

【0005】[0005]

【課題を解決するための手段】本発明の単語照合前処理
方式は、複数の文字を含む文字列として表現され、かつ
認識対象として出現の可能性のある単語を出現単語とし
て限定し、あらかじめ登録しておく出現単語登録手段
と、単語照合に先立って前記文字列の含む文字ごとの複
数の候補文字から前記出現単語による文字群に存在しな
いものを削除する前処理を施した後に単語照合を行なう
単語照合手段とを備えた構成を有する。
In the word matching preprocessing method of the present invention, a word that is expressed as a character string including a plurality of characters and that may appear as a recognition target is limited as an appearance word and registered in advance. The word matching is performed after performing a pre-processing for deleting the word that does not exist in the character group by the appearing word from the plurality of candidate characters for each character included in the character string prior to the word matching. And a word matching means.

【0006】[0006]

【実施例】次に、本発明について図面を参照して説明す
る。
DESCRIPTION OF THE PREFERRED EMBODIMENTS Next, the present invention will be described with reference to the drawings.

【0007】図1は本発明の一実施例の構成図である。FIG. 1 is a block diagram of an embodiment of the present invention.

【0008】図1に示す実施例の文字認識システム1
は、出現単語登録手段を構成する出現単語登録部2と、
出現文字管理テーブル6単語照合手段を構成する文字認
識部3と単語照合前処理部4と、単語照合部5とを含ん
で成り、図1にはなお、単語辞書7と、イメージデータ
文字列101,候補文字102,限定候補文字103お
よび候補単語104を併記して示す。
The character recognition system 1 of the embodiment shown in FIG.
Is an appearance word registration unit 2 which constitutes an appearance word registration unit,
The appearance character management table 6 includes a character recognition unit 3, a word matching preprocessing unit 4, and a word matching unit 5 which form a word matching unit. In FIG. 1, the word dictionary 7 and the image data character string 101 are still included. , Candidate character 102, limited candidate character 103, and candidate word 104 are shown together.

【0009】次に、本実施例の動作について説明する。Next, the operation of this embodiment will be described.

【0010】文字認識部3は、イメージデータ文字列1
01に対して文字認識を行ない、文字ごとに候補文字1
02を指定する。本実施例では、1文字当り8個の候補
が指定される。
The character recognition unit 3 uses the image data character string 1
Character recognition is performed for 01 and each character is a candidate character 1
Specify 02. In this embodiment, eight candidates are designated for each character.

【0011】従来の文字認識システムでは、この候補文
字102に対して、直接単語照合部5が単語辞書を参照
しながら単語照合を行ない、候補単語104を決定して
いた。
In the conventional character recognition system, the word collating unit 5 directly collates the word with respect to the candidate character 102 while referring to the word dictionary to determine the candidate word 104.

【0012】本実施例では、イメージデータ文字列7に
存在しうる単語をあらかじめ出現単語登録部2で登録し
ておくことにより、イメージデータ文字列7に出現しう
る文字が出現文字管理テーブル6に蓄積されており、単
語照合前処理部は、この出現文字管理テーブル6を参照
して、存在しない、すなわち出現するはずのない文字を
候補文字102から削除し、限定候補文字103を指定
する。これにより、本実施例では、いずれの文字につい
ても2ないし3文字候補に限定される。
In the present embodiment, the words that can exist in the image data character string 7 are registered in advance in the appearance word registration unit 2, so that the characters that can appear in the image data character string 7 are stored in the appearance character management table 6. The word collation preprocessing unit refers to the appearing character management table 6 and deletes a character that does not exist, that is, a character that should not appear, from the candidate character 102, and specifies the limited candidate character 103. Thus, in this embodiment, any character is limited to 2 or 3 character candidates.

【0013】単語照合部5は、このように限定された候
補文字に対して照合を行ない、従来の方法に比して著し
く照合効率を向上させた照合を行なうことができる。
The word collation unit 5 collates the limited candidate characters in this way, and can perform collation with significantly improved collation efficiency as compared with the conventional method.

【0014】[0014]

【発明の効果】以上説明したように本発明は、単語を形
成する文字列の各文字ごとに複数の候補文字が指定され
る文字認識システムにおいて、認識対象の文字列に出現
するはずの文字のみに候補文字を限定することにより、
単語照合効率を著しく向上させることができる効果を有
する。
As described above, according to the present invention, in a character recognition system in which a plurality of candidate characters are designated for each character of a character string forming a word, only characters that should appear in the character string to be recognized are recognized. By limiting the candidate characters to
This has the effect of significantly improving the word matching efficiency.

【図面の簡単な説明】[Brief description of drawings]

【図1】本発明の一実施例の構成図である。FIG. 1 is a configuration diagram of an embodiment of the present invention.

【符号の説明】[Explanation of symbols]

1 文字認識システム 2 出現単語登録部 3 文字認識部 4 単語照合前処理部 5 単語照合部 6 出現文字管理テーブル 7 単語辞書 1 Character recognition system 2 Appearing word registration unit 3 Character recognition unit 4 Word matching preprocessing unit 5 Word matching unit 6 Appearing character management table 7 Word dictionary

Claims (1)

【特許請求の範囲】[Claims] 【請求項1】 複数の文字を含む文字列として表現さ
れ、かつ認識対象として出現の可能性のある単語を出現
単語として限定し、あらかじめ登録しておく出現単語登
録手段と、単語照合に先立って前記文字列の含む文字ご
との複数の候補文字から前記出現単語による文字群に存
在しないものを削除する前処理を施した後に単語照合を
行なう単語照合手段とを備えることを特徴とする単語照
合前処理方式。
1. An appearance word registration means for preliminarily registering a word, which is expressed as a character string including a plurality of characters and has a possibility of appearing as a recognition target, as an appearance word, and prior to word matching. Before word matching, characterized in that it comprises a word matching means for performing word matching after performing a pre-process of deleting a candidate character that does not exist in the character group by the appearing word from a plurality of candidate characters for each character included in the character string. Processing method.
JP4246996A 1992-09-17 1992-09-17 Word collation pre-processing system Pending JPH0696287A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP4246996A JPH0696287A (en) 1992-09-17 1992-09-17 Word collation pre-processing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP4246996A JPH0696287A (en) 1992-09-17 1992-09-17 Word collation pre-processing system

Publications (1)

Publication Number Publication Date
JPH0696287A true JPH0696287A (en) 1994-04-08

Family

ID=17156827

Family Applications (1)

Application Number Title Priority Date Filing Date
JP4246996A Pending JPH0696287A (en) 1992-09-17 1992-09-17 Word collation pre-processing system

Country Status (1)

Country Link
JP (1) JPH0696287A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2010009440A (en) * 2008-06-30 2010-01-14 Fujitsu Frontech Ltd Character recognition program, character recognition apparatus, and character recognition method

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2010009440A (en) * 2008-06-30 2010-01-14 Fujitsu Frontech Ltd Character recognition program, character recognition apparatus, and character recognition method

Similar Documents

Publication Publication Date Title
JPH01246678A (en) Pattern recognizing device
JPH08235341A (en) Method and device for document filing
JPH0696287A (en) Word collation pre-processing system
CN116229484A (en) Text recognition method, list scanning method and device
JPH09204492A (en) Slip processor
US20040114803A1 (en) Method of stricken-out character recognition in handwritten text
JP3725635B2 (en) Character recognition method and apparatus
KR100421683B1 (en) Person identifying method using image information
JP2746345B2 (en) Post-processing method for character recognition
JPH09128484A (en) Character recognizing method
TWI747172B (en) Foreign word management system
JPH03160585A (en) Character recognizing method
JP3115139B2 (en) Character extraction method
KR100473660B1 (en) Word recognition method
JP3151866B2 (en) English character recognition method
JPS60173688A (en) Pattern processing device
JPH09179935A (en) Character recognition device and control method therefor
JPH0634259B2 (en) Character recognition device
JPH076213A (en) Character string recognition device
JPH0484380A (en) Character recognizing device
JPH10334190A (en) Character recognition method and device and recording medium
JP3290110B2 (en) Handwritten character recognition device
JP2905334B2 (en) Online handwritten character recognition dictionary creation method and online handwritten character recognition dictionary creation device
JPS63100584A (en) Character recognition processing system
JPS6295687A (en) Character recognizing system

Legal Events

Date Code Title Description
A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 19990525