JPS63159900A - Voice information input system - Google Patents

Voice information input system

Info

Publication number
JPS63159900A
JPS63159900A JP61306408A JP30640886A JPS63159900A JP S63159900 A JPS63159900 A JP S63159900A JP 61306408 A JP61306408 A JP 61306408A JP 30640886 A JP30640886 A JP 30640886A JP S63159900 A JPS63159900 A JP S63159900A
Authority
JP
Japan
Prior art keywords
voice
ocr
input
information input
voice information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP61306408A
Other languages
Japanese (ja)
Inventor
潔 長澤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hitachi Ltd
Original Assignee
Hitachi Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hitachi Ltd filed Critical Hitachi Ltd
Priority to JP61306408A priority Critical patent/JPS63159900A/en
Publication of JPS63159900A publication Critical patent/JPS63159900A/en
Pending legal-status Critical Current

Links

Abstract

(57)【要約】本公報は電子出願前の出願データであるた
め要約のデータは記録されません。
(57) [Summary] This bulletin contains application data before electronic filing, so abstract data is not recorded.

Description

【発明の詳細な説明】 〔産業上の利用分野〕 本発明は文字読取装置に用いて入力作業の効率向上に好
適な音声情報入力方式に関する。
DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to a voice information input method suitable for use in a character reading device to improve the efficiency of input work.

〔従来の技術〕[Conventional technology]

従来°の装置は、特開昭57−200100号のように
、文字読取装置と音声認識のハードを一部共有化して組
合せることが考えられていた。しかし、音声入力時の認
識範囲については配慮されていなかった。
In conventional devices, as in Japanese Patent Laid-Open No. 57-200100, it was considered to combine a character reading device and voice recognition hardware by sharing part of the hardware. However, no consideration was given to the recognition range during voice input.

〔発明が解決しようとする問題点〕[Problem that the invention seeks to solve]

上記従来技術は認識範囲の点については配慮がされてお
らず、特にOCRでの文字読取は一語単位で行なわれる
ため、音声入力も必然的に音節単位のL&mとなるが、
これは良く知られているように、認識精度(認識率)の
点で問題となる。
The above conventional technology does not take into consideration the recognition range, and in particular, character reading in OCR is performed in units of words, so voice input is inevitably L&M in units of syllables.
As is well known, this poses a problem in terms of recognition accuracy (recognition rate).

本発明の目的はOCRにて入力できながった文字を音声
にて安定かつ速やかに入力することにある。
An object of the present invention is to stably and quickly input characters by voice that cannot be input by OCR.

〔問題点を解決するための手段〕[Means for solving problems]

上記目的は、OCRにて取り込んだ情報を認識範囲に反
映することにより達成される。
The above objective is achieved by reflecting the information captured by OCR in the recognition range.

〔作用〕[Effect]

OCRにて読み込まれた文字入力データは、標準文字パ
ターンとの類似度計算(マツチング)を行なった後、判
定部で結果が求められる。この時CRT等の外部表示は
第1位のものだけであるとしても、内部的には第n位(
nは別に与えられるパラメータ)までの結果をメモリに
保持しておき、音声入力による訂正や再入力の際にはこ
れらの候補単語を認識範囲とする。このようにすると、
入力文字の種類が数10〜数100  あっても音声で
の入力対象となるのはn語であるので、正確な認識が期
待できる。また、演算量も減少するので処理時間も短縮
できる。
The character input data read by OCR is subjected to similarity calculation (matching) with a standard character pattern, and then the result is determined by a determination unit. At this time, even if the external display such as a CRT is only for the first place, internally the nth place (
The results up to (n is a parameter given separately) are held in memory, and these candidate words are used as the recognition range when correcting or re-inputting by voice input. In this way,
Even if there are tens to hundreds of types of input characters, only n words are input by voice, so accurate recognition can be expected. Furthermore, since the amount of calculations is reduced, the processing time can also be shortened.

〔実施例〕〔Example〕

以下、本発明の一実施例を第1図により説明する。図中
、1は文字データを電気信号に変換する光電変換部であ
り、ここで読み取られた文字情報は2の特徴抽出部で特
徴を抽出された後、3の類似度演算部に送られる。類似
度演算部では入力データと4の文字標準パターンメモリ
ー中の標準文字データとの伸縮マツチングを行ない、類
似度を求める。5の判定部は類似度判定部で求められた
ガ1似度が充分高く、かつ2位以下の候補の類似度と充
分な差がある場合には1位の文字コードを上位装置やC
RT等に出力するが、類似度が低い場合や2位以下の候
補と類似度の差があまりない場合には音声入力による再
入力をうながす。操作者により発生された音声は6の音
声入力部で増幅やAD変換等が行なわれた後、7の特徴
抽出部で特、徴パラメータが求められ、9の類似度演算
部でこの特徴パラメータと8の音声標準パターンメモリ
中の標準音声データとのマツチングが行なわれる。
An embodiment of the present invention will be described below with reference to FIG. In the figure, numeral 1 is a photoelectric conversion unit that converts character data into electrical signals, and the character information read here has its features extracted by a feature extraction unit 2, and then sent to a similarity calculation unit 3. The similarity calculation section performs expansion/contraction matching between the input data and the standard character data in the character standard pattern memory 4 to determine the similarity. If the degree of similarity determined by the similarity degree determination section is sufficiently high and there is a sufficient difference from the degree of similarity of the second or lower candidates, the determination section 5 transmits the character code of the first place to the host device or C.
It is output to RT, etc., but if the degree of similarity is low or there is not much difference in degree of similarity from the second or lower candidate, re-input by voice input is prompted. After the voice generated by the operator is amplified and AD converted in the voice input section 6, features and characteristic parameters are obtained in the feature extraction section 7, and these feature parameters are calculated in the similarity calculation section 9. Matching with the standard voice data in the voice standard pattern memory of No. 8 is performed.

この時全ての標準パターンとのマツチングを行なわずに
、10の認識範囲制御部によりOCR部であらかじめ候
補にあげられた単語に制限される。
At this time, without performing matching with all standard patterns, the recognition range control section 10 limits the words to words that have been selected as candidates in advance by the OCR section.

OCRで読み込む文字は通常、数字、アルファベット、
カタカナの数10〜数100文字におよび、これらの中
には「2」、「二J、rEJ等の互いに発音が似かよっ
たものが含まれているため、これらを音声で再入力、あ
るいは訂正しようとしても高い認識率は期待できないが
1本実施例によれば、認識の対象となる文字数はかなり
制限されるので、認識精度が高まると共に、9の類似度
演算部での処理量が減少するので、処理時間が短縮され
るという効果がある。
The characters read by OCR are usually numbers, alphabets,
There are 10 to 100 katakana characters, and some of these include words with similar pronunciations, such as ``2'', ``niJ'', and rEJ, so let's re-enter or correct them aloud. However, according to this embodiment, the number of characters to be recognized is considerably limited, so the recognition accuracy is increased and the amount of processing in the similarity calculation section 9 is reduced. This has the effect of shortening processing time.

〔発明の効果〕〔Effect of the invention〕

本発明によれば、入力すべき文字数が多くても音声によ
る認識範囲はあらかじめOCRによって選ばれた候補に
限られるので、認識精度が高まり演算処理に要する時間
も短縮できるという効果がある。
According to the present invention, even if there are a large number of characters to be input, the voice recognition range is limited to candidates selected in advance by OCR, so that the recognition accuracy is increased and the time required for arithmetic processing is reduced.

【図面の簡単な説明】[Brief explanation of the drawing]

第1図は本発明の一実施例を示すブロック図である。 1・・・光電変換装置、3・・・類似度演算部、4・・
・文字標準パターンメモリー、5・・・判定部、6・・
・音声入力部、9・・・類似度演算部、10・・・認識
範囲制御部。 Xi 図
FIG. 1 is a block diagram showing one embodiment of the present invention. 1... Photoelectric conversion device, 3... Similarity calculation unit, 4...
・Character standard pattern memory, 5... Judgment section, 6...
- Voice input section, 9... Similarity calculation section, 10... Recognition range control section. Xi diagram

Claims (1)

【特許請求の範囲】[Claims] 1、音声認識装置を備えたOCR(光学的文字読取装置
)において、OCRで読み誤った、あるいはリジェクト
した文字を音声入力により修正する時に、音声の認識範
囲をOCRで求めた上位候補に制限することにより認識
精度および速度を向上させることを特徴とする音声情報
入力方式。
1. In an OCR (optical character reader) equipped with a voice recognition device, when correcting characters misread or rejected by the OCR using voice input, limit the voice recognition range to the top candidates determined by the OCR. A voice information input method characterized by improving recognition accuracy and speed.
JP61306408A 1986-12-24 1986-12-24 Voice information input system Pending JPS63159900A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP61306408A JPS63159900A (en) 1986-12-24 1986-12-24 Voice information input system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP61306408A JPS63159900A (en) 1986-12-24 1986-12-24 Voice information input system

Publications (1)

Publication Number Publication Date
JPS63159900A true JPS63159900A (en) 1988-07-02

Family

ID=17956655

Family Applications (1)

Application Number Title Priority Date Filing Date
JP61306408A Pending JPS63159900A (en) 1986-12-24 1986-12-24 Voice information input system

Country Status (1)

Country Link
JP (1) JPS63159900A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007048177A (en) * 2005-08-12 2007-02-22 Canon Inc Information processing method and information processing device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007048177A (en) * 2005-08-12 2007-02-22 Canon Inc Information processing method and information processing device
JP4708913B2 (en) * 2005-08-12 2011-06-22 キヤノン株式会社 Information processing method and information processing apparatus

Similar Documents

Publication Publication Date Title
US6671403B1 (en) Pattern recognition apparatus and method utilizing conversion to a common scale by a linear function
JPS63159900A (en) Voice information input system
JP2801602B2 (en) Word recognition device
JPS58192180A (en) Character reader
JP3198218B2 (en) Online handwriting recognition method
JP2851865B2 (en) Character recognition device
JPH01201789A (en) Character reader
KR900005141B1 (en) Handwritter character recognizing device
JPS5969878A (en) On-line character recognizing method
JPS6115288A (en) Optical character reader
JPS6222186A (en) Drawing reader
JPH01311390A (en) Character substitution control system
JPS6095689A (en) Optical character reader
JPS63155389A (en) On-line character recognizing device
JPS5914078A (en) Reader of business form
JPH0437971A (en) Character reading device
JP3021708B2 (en) Line image analyzer
JP2665488B2 (en) Personal dictionary registration method
JPS6022793B2 (en) character identification device
JPS6190641A (en) Indivisual identification apparatus
JPH0682402B2 (en) Character recognition device
JPH03123989A (en) Character recognition device
JPS62236090A (en) Pattern collating system
JPS5668879A (en) Real-time hand-written character recognition system
JPS6379191A (en) Character recognizing device