JP2000339412A

JP2000339412A - Recognition information correction system

Info

Publication number: JP2000339412A
Application number: JP11151024A
Authority: JP
Inventors: Tadamitsu Ryu; 忠光龍
Original assignee: Individual
Current assignee: Individual
Priority date: 1999-05-31
Filing date: 1999-05-31
Publication date: 2000-12-08

Abstract

PROBLEM TO BE SOLVED: To provide an efficient recognition system where erroneous recognition which frequently occurs is reduced in recognition of an OCR device and a sound recognition device and correction work by a manual operation is suppressed to be minimum. SOLUTION: An OCR device 101, a word judgement means 102, a word dictionary 103, an example collation processing part 104, an example data base 105, an automatic correction means 106, an input correction part 107, a syntax judgment means 108, a syntax dictionary 109, an example collation processing part 110, an example data base Ill, an automatic correction means 112, an input correction part 113 and an output means 114, are included. Correction work by a manual operation can be suppressed to be minimum even while a recognition device by such conventional technology is used.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、ＯＣＲ（OPTICAL CHAR
ACTER READER）、音声認識装置から選ばれた少なくとも
１つの認識装置の誤認識を修正する認識情報修正システ
ムに関する。The present invention relates to an OCR (Optical Character)
ACTER READER), and relates to a recognition information correction system that corrects erroneous recognition of at least one recognition device selected from speech recognition devices.

【０００２】[0002]

【従来技術】文書処理装置においては、ＯＣＲを使って
新聞紙や雑誌などの紙面を読みとらせ、その結果を保存
したり、機械翻訳、校正支援といった文書処理をおこな
っている。しかしながらＯＣＲによる読みとりには誤認
識が多くその修正には人手による手直しが欠かせない。2. Description of the Related Art In a document processing apparatus, an OCR is used to read a sheet of paper such as a newspaper or a magazine, and the result is stored, and a document process such as machine translation and proofreading support is performed. However, reading by OCR often involves misrecognition and manual correction is indispensable for correction.

【０００３】また、相談などの会話の内容が文書化され
れば、ユーザ情報や相談内容の管理に役立つ。近年、音
声認識技術が進歩し、音声認識による書き起こしを行う
可能性が現れた（参照文献：日経エレクトロニクス１９
９９年１月１１日号 pp.43-50）。しかし、音声認識に
よる書き起こしの精度は、しばしば実用のために十分で
はない。[0003] If the contents of conversations such as consultations are documented, it is useful for managing user information and consultation contents. In recent years, speech recognition technology has advanced, and the possibility of performing transcription by speech recognition has appeared (see Reference: Nikkei Electronics 19).
January 11, 1999, pp.43-50). However, the accuracy of transcription by speech recognition is often not sufficient for practical use.

【０００４】[0004]

【発明が解決しようとする課題】このように従来の技術
では認識装置の認識結果に対して人手による修正が欠か
せないものであり、しばしば必要とする人件費が莫大な
ものとなってしまい実用的ではなかった。As described above, in the prior art, the recognition result of the recognition device must be manually corrected, and the labor cost that is often required becomes enormous, so that practical use is not possible. It was not a target.

【０００５】本発明は、従来技術による認識装置を用い
ながらも、人手による修正作業を最小限に抑え、トータ
ルのシステムとして実用的な認識システム（＝認識情報
修正システム）を提供することを目的としている。An object of the present invention is to provide a practical recognition system (= recognition information correction system) as a total system while minimizing manual correction work while using a conventional recognition device. I have.

【０００６】[0006]

【課題を解決するための手段】本発明では、上記の課題
を解決するため、ＯＣＲ（OPTICAL CHARACTER READE
R）、音声認識装置から選ばれた少なくとも１つの認識
装置と、前記認識装置の認識結果であるテキスト情報を
分析し、修正の必要性を判定する判定手段と、前記判定
手段の判定結果として修正の必要が判明した部分を操作
者に表示する表示手段と、認識の誤りを修正した事例を
格納した事例データベースと、前記事例を参照し事例が
あれば自動修正し、なければ操作者の修正入力によって
前記認識結果を修正する修正手段と、前記操作者の修正
入力を事例として前記事例データベースに登録する学習
手段とを構成要素として備えたことを特徴とする認識情
報修正システムを採用した。According to the present invention, in order to solve the above-mentioned problems, an OCR (optical character reader) is provided.
R), at least one recognition device selected from the voice recognition device, a determination unit that analyzes text information as a recognition result of the recognition device and determines necessity of correction, and corrects the determination result of the determination unit. Display means for displaying to the operator the part where the necessity has been found, a case database storing cases in which recognition errors have been corrected, and automatically correcting any cases by referring to the cases; And a learning means for registering the correction result of the operator as a case in the case database as a case.

【０００７】前記修正必要性の判定は、予め用意した単
語辞書を参照して、その単語辞書にない文字列を未知語
として抽出し、この未知語の有無によっておこなわれ
る。The determination of the necessity of the correction is performed by referring to a word dictionary prepared in advance, extracting a character string that is not in the word dictionary as an unknown word, and determining whether or not the unknown word exists.

【０００８】この単語辞書は、該システムを使用する業
務環境の資料を分析して抽出された単語を集めて作られ
たものであれば、単語辞書のサイズを必要最小限に抑え
ることが出来て、未知語の抽出が効率的である。If the word dictionary is made by collecting words extracted by analyzing materials of a business environment using the system, the size of the word dictionary can be minimized. The extraction of unknown words is efficient.

【０００９】さらには修正必要性の判定を、予め用意し
た構文辞書を参照して、前記認識結果であるテキスト情
報から構文辞書に登録された構造に合致しない構文を抽
出し、この未知構文の有無によってもおこなうのも良い
方法である。[0009] Further, the determination of the necessity of correction is performed by referring to a syntax dictionary prepared in advance and extracting a syntax that does not match the structure registered in the syntax dictionary from the text information as the recognition result. It is also a good way to do this.

【００１０】学習手段においては、修正入力で前記単語
辞書にない単語、前記構文辞書にな構文から選ばれる少
なくとも１つが抽出された場合、前記単語辞書もしくは
前記構文辞書に登録することができれば、学習によるシ
ステムの能率アップの度合いが高まる。In the learning means, if at least one selected from a word not in the word dictionary and a syntax in the syntax dictionary is extracted by the correction input, if it can be registered in the word dictionary or the syntax dictionary, the learning means Increases the efficiency of the system.

【００１１】また、テキスト化の対象とする信号を一時
的に記憶し、その信号を認識し認識結果を判定する処理
を少なくとも２回行い、それら複数の判定処理結果に対
して比較処理をおこなう対比処理部を備え、修正不要部
分を選択し接合しておけば、必要最小限の修正処理で信
号をテキスト化できる。[0011] Further, a process of temporarily storing a signal to be converted into a text, performing a process of recognizing the signal and determining a recognition result at least twice, and performing a comparison process on the plurality of determination process results is performed. By providing a processing unit and selecting and joining portions that do not require correction, signals can be converted to text with the minimum necessary correction processing.

【００１２】[0012]

【実施例１】以下、本発明の第１の実施例を詳細に説明
する。図１は本発明のシステムのブロック図である。Ｏ
ＣＲ装置１０１，単語判定手段１０２，単語辞書１０
３，事例照合処理部１０４，事例データベース１０５，
自動修正手段１０６，入力修正部１０７，構文判定手段
１０８，構文辞書１０９，事例照合処理部１１０，事例
データベース１１１，自動修正手段１１２，入力修正部
１１３，出力手段１１４を備えている。Embodiment 1 Hereinafter, a first embodiment of the present invention will be described in detail. FIG. 1 is a block diagram of the system of the present invention. O
CR device 101, word determination means 102, word dictionary 10
3, case collation processing unit 104, case database 105,
An automatic correction unit 106, an input correction unit 107, a syntax determination unit 108, a syntax dictionary 109, a case collation processing unit 110, a case database 111, an automatic correction unit 112, an input correction unit 113, and an output unit 114 are provided.

【００１３】ＯＣＲ装置１０１ではスキャナーで読み込
んだ画像データに対してテキスト化をおこない、テキス
ト情報を抽出する。このテキスト情報は単語判定手段１
０２に送出される。ここで認識手段としてＯＣＲ装置を
採用したが、音声認識装置でもよい。The OCR device 101 converts the image data read by the scanner into text and extracts text information. This text information is stored in the
02. Here, an OCR device is used as the recognition means, but a voice recognition device may be used.

【００１４】単語判定部１０２では単語辞書１０３を参
照し認識データを形態素分析し、辞書にない単語を未知
語として抽出し、未知語の有無によって修正の必要性を
判定する。修正の必要がないならそのまま１０８の構文
判定手段に該テキスト情報を送る。修正の必要がある場
合、単語事例照合部１０４で事例データベース１０５を
参照し、事例があれば自動修正手段１０６に送られ修正
される。事例がない場合は入力修正部１０７にて操作者
の入力により修正される。修正後のデータは構文判定手
段１０８におくられる。The word judging unit 102 refers to the word dictionary 103, performs morphological analysis of the recognition data, extracts words not in the dictionary as unknown words, and judges the necessity of correction based on the presence or absence of the unknown words. If no correction is necessary, the text information is sent to the syntax determination means 108 as it is. If the word needs to be corrected, the word case matching unit 104 refers to the case database 105, and if there is a case, it is sent to the automatic correcting unit 106 and corrected. If there is no case, the input correction unit 107 corrects the case according to the input by the operator. The corrected data is sent to the syntax determining means 108.

【００１５】入力修正部における入力は修正事例として
事例データベース１０５に登録される。このとき修正入
力が未登録単語の場合、単語辞書１０３に登録される。The input in the input correction unit is registered in the case database 105 as a correction case. At this time, if the correction input is an unregistered word, it is registered in the word dictionary 103.

【００１６】構文判定手段１０８では構文辞書１０９を
参照し、送られてきたデータから構文辞書にない構文を
未知構文として抽出し、未知構文の有無によって修正の
必要性を判定する。修正の必要がないのなら出力手段１
１４に送られる。修正が必要な場合、事例データベース
１１１を参照し、事例があれば自動修正手段１１２に送
られ自動修正される。事例がない場合は入力修正部１１
３にて操作者の入力によって修正される。修正後のデー
タは出力手段に送られ出力される。The syntax determining means 108 refers to the syntax dictionary 109, extracts a syntax that is not in the syntax dictionary from the sent data as an unknown syntax, and determines the necessity of correction based on the presence or absence of the unknown syntax. Output means 1 if no correction is needed
14 is sent. If correction is necessary, the case database 111 is referred to, and if there is a case, it is sent to the automatic correction means 112 and automatically corrected. If there is no case, input correction unit 11
At 3 the correction is made by the input of the operator. The corrected data is sent to the output means and output.

【００１７】図２は入力修正部（図１の１０７，１１
３）の構成を示したものである。事例判定手段によって
事例がないと判定されたデータを受け取ると、表示手段
２０１で操作者に対して修正入力を促す表示をおこな
う。表示はモニター等の表示装置が望ましいが、合成音
声による修正部分の指摘と入力指示でも問題ない。操作
者はキーボード、音声入力、タッチパネルから選ばれた
少なくとも１つの入力手段２０２によって修正入力をお
こなう。登録手段２０３では、、辞書にないパターンの
単語、構文は辞書２０４へ，修正入力の事例は事例デー
タベース２０５へそれぞれ登録される。FIG. 2 shows an input correction unit (107, 11 in FIG. 1).
This shows the configuration of 3). Upon receiving the data determined by the case determining means that there is no case, the display means 201 displays a message prompting the operator to make a correction input. The display is desirably a display device such as a monitor, but there is no problem in pointing out a correction portion and input instruction by a synthesized voice. The operator performs a correction input using at least one input means 202 selected from a keyboard, a voice input, and a touch panel. In the registration means 203, the words and the syntax of the pattern not in the dictionary are registered in the dictionary 204, and the case of the correction input is registered in the case database 205, respectively.

【００１８】[0018]

【実施例２】図３は第２の実施形態を示したフローチャ
ートである。音声信号を図示しないマイクで収録し、音
声データを一時記憶処理３０１により保存しておく。そ
の一時保存されたデータに対して音声認識処理３０４す
る。判定処理３０５では、その認識結果に対して図示し
ない辞書を参照して修正の必要部分を抽出する（抽出処
理３０２Ａ）。[Embodiment 2] FIG. 3 is a flowchart showing a second embodiment. The audio signal is recorded by a microphone (not shown), and the audio data is stored by the temporary storage processing 301. The voice recognition processing 304 is performed on the temporarily stored data. In the determination processing 305, a part requiring correction is extracted from the recognition result with reference to a dictionary (not shown) (extraction processing 302A).

【００１９】さらに、一時保存されたデータに対して抽
出処理３０２Ｂを行い、対比処理３０３Ａをおこなう。
対比処理では抽出処理３０２Ａの結果として抽出された
修正必要部分を３０３Ｂの相当部分で認識に成功したと
判定された部分（修正必要と判定されなかった部分）に
置き換える。Further, extraction processing 302B is performed on the temporarily stored data, and comparison processing 303A is performed.
In the comparison process, the portion requiring correction extracted as a result of the extraction process 302A is replaced with a portion corresponding to 303B determined to have been successfully recognized (portion not determined to require correction).

【００２０】対比処理３０３Ａで生成されたデータに対
して、抽出処理３０３Ｃの結果を使って対比処理３０３
Ｂをおこなう。こうして生成されたデータを修正処理３
０６に送る。修正処理３０６では図示しない事例データ
ベースを参照し、事例があれば自動修正、事例がなけれ
ば入力修正をおこなう。出力手段３０７で成果物として
のテキスト情報を出力する。The comparison processing 303A is performed on the data generated in the comparison processing 303A using the result of the extraction processing 303C.
Perform B. Correction processing 3 for the data thus generated
Send to 06. In the correction process 306, a case database (not shown) is referred to, and if there is a case, automatic correction is performed, and if there is no case, input correction is performed. The output unit 307 outputs text information as a product.

【００２１】[0021]

【発明の効果】科学分野の文献を資料として１０００回
読み込ませ学習させた第１の実施形態の装置を使って、
３１６語のテスト文章を入力したところ、修正処理をす
る前は、平均して９２．７％の正解率あったのが、９
９．３％の正解率に向上した。According to the apparatus of the first embodiment, which has been learned by reading the literature in the scientific field 1000 times as a material,
When a test sentence of 316 words was input, the correct answer rate of 92.7% on average before correction processing was 9
The accuracy rate was improved to 9.3%.

【００２２】また同じテスト文章を読み上げ、その音声
データを第２の実施形態の装置を使ったところ、対比処
理をしない方法では８３．５％の正解率であったのが、
９１．９％の正解率に改善された。このように従来技術
による認識装置を用いながらも、人手による修正作業を
最小限に抑えることができる。When the same test sentence was read aloud and its voice data was used by the apparatus of the second embodiment, the correct answer rate was 83.5% in the method without the comparison processing.
The answer rate was improved to 91.9%. As described above, it is possible to minimize the manual correction work while using the recognition device according to the related art.

【００２３】[0023]

[Brief description of the drawings]

【図１】第１の実施形態のブロック図FIG. 1 is a block diagram of a first embodiment;

【図２】入力修正部の構成図FIG. 2 is a configuration diagram of an input correction unit

【図３】第２の実施形態を示したブロック図FIG. 3 is a block diagram showing a second embodiment.

Claims

[Claims]

1. A system for converting at least one signal selected from an image and a sound into a text, comprising:
CAL CHARACTER READER), at least one recognition means selected from a voice recognition device, a determination means for analyzing text information as a recognition result of the recognition means and determining the necessity of correction, and a determination result of the determination means Display means for displaying to the operator a portion that has become necessary to be corrected, a case database storing a case in which the recognition error has been corrected,
A correction means for automatically correcting if there is a case with reference to the case, and correcting the recognition result by a correction input of an operator if not, and a learning means for registering the correction input of the operator as a case in the case database. A recognition information correction system provided as a component.

2. The method according to claim 1, wherein the determination of the necessity of the correction is performed by referring to a word dictionary prepared in advance, extracting a character string not in the word dictionary as an unknown word, and determining whether or not the unknown word exists. The recognition information correcting system according to claim 1.

3. The word dictionary according to claim 2, wherein the word dictionary is created by collecting words extracted by analyzing data of a business environment using the system. Recognition information correction system.

4. The determination of the necessity of correction is performed by extracting a sentence that does not match a syntax registered in the syntax dictionary from text information as the recognition result by referring to a syntax dictionary prepared in advance. 2. The recognition information correction system according to claim 1, wherein the recognition information correction is performed based on the presence or absence of a syntax.

5. When at least one selected from a word that is not in the word dictionary and a semantic structure that is not in the syntax dictionary is extracted by the correction input in the learning means, the word is registered in the word dictionary or the syntax dictionary. 2. The recognition information correcting system according to claim 1, wherein:

6. A process for temporarily storing a signal to be converted to text, performing a process of recognizing the signal and determining a recognition result at least twice, and performing a comparison process on the plurality of determination processes. 2. The recognition information correcting system according to claim 1, further comprising a comparison processing unit, and a step of selecting and joining unrequired portions.