JPH10254489A

JPH10254489A - Speech recognition system for numerals

Info

Publication number: JPH10254489A
Application number: JP10059827A
Authority: JP
Inventors: Stephan Gamm; Dr. Nils Lenke; Joerg Ockel
Original assignee: Philips Electronics NV
Current assignee: Koninklijke Philips NV
Priority date: 1997-03-11
Filing date: 1998-03-11
Publication date: 1998-09-25
Anticipated expiration: 2018-03-11
Also published as: DE59808726D1; EP0865031B1; US6078887A; EP0865031A3; DE19709990A1; JP4216361B2; EP0865031A2; DE19709990C2

Abstract

PROBLEM TO BE SOLVED: To provide a speech recognition system that prevents incorrectly recognized digits from being passed on. SOLUTION: The system has a control device 33 that recognizes at least one digit sequence and outputs the recognized digits of that sequence. If at least one digit of a first digit sequence has been recognized incorrectly, the control device 33 compares a second digit sequence with the first digit sequence. If the second digit sequence contains fewer digits than the first, the control device 33 determines the portion of the first digit sequence that has the most digits matching the digits of the second digit sequence, and replaces the digits of that portion with the non-matching digits of the second digit sequence.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical field of the invention] The present invention relates to a speech recognition system for numerals, comprising a control device that recognizes at least one digit sequence and outputs the recognized digits of the at least one digit sequence.

[0002]

[Prior art] Such a system is known, for example, from WO 95/06309 A1 and comprises a remote control unit that includes a microphone and a circuit for forming a modulated infrared signal. The user's voice input picked up by the microphone is transmitted via the infrared signal to a control device, which converts the voice input into code words and sends them to an evaluation circuit that forms control commands for, for example, a video cassette recorder or a television receiver. Specific functions of the television receiver or video cassette recorder can be carried out by individual voice inputs or voice commands. For example, a channel can be selected, the volume level can be set, or playback of a video tape can be stopped. That document also describes time programming of the video cassette recorder, in which the channel, date, start time and end time are entered in a fixed, predefined sequential order. With voice input, digits have to be entered. After a digit has been entered, for example for a channel or a time, a response is produced that effects the corresponding control of the video cassette recorder or television receiver. A comparison with stored patterns is then carried out. If a digit is recognized incorrectly and assigned to the wrong pattern, faulty control results.

[0003]

[Problem to be solved by the invention] It is therefore an object of the present invention to provide a speech recognition device that avoids passing on incorrectly recognized digits.

[0004]

[Means for solving the problem] This object is achieved by a system of the type defined in the opening paragraph in which, if at least one digit of a first digit sequence has been recognized incorrectly, the control device compares a dictated second digit sequence with the first digit sequence; if the second digit sequence contains fewer digits than the first digit sequence, the control device determines the portion of the first digit sequence that has the most digits matching the digits of the second digit sequence; and the control device substitutes the non-matching digits of the second digit sequence for the digits of the determined portion of the first digit sequence.

[0005] In the system according to the invention, the user verifies the voice input, and those digits that were not recognized correctly are corrected selectively. The speech recognition may follow the method known from the publication "Hermann Ney, Volker Steinbiss, Xavier Aubert, Reinhold Haeb-Umbach: Progress in Large Vocabulary, Continuous Speech Recognition, in: H. Niemann, R. de Mori, G. Hanrieder: Progress and Prospects of Speech Research and Technology, 1994, pp. 75 to 92". According to this method, a sequence of connected digits is recognized with the aid of hidden Markov models. After the entered digit sequence has been output for verification, the user can accept or reject the recognized digit sequence and can then enter specific digits once more. The digits are output either by speech synthesis in the control device or from digits that were entered and stored beforehand. The control device is arranged to understand the digits 0 to 9 and specific control inputs such as "yes" and "no".

[0006] Once the first digit sequence has been recognized, the user is asked whether the sequence was understood correctly. If it was not, the user is asked to provide a further voice input. The user may then enter a completely new digit sequence or only a partial digit sequence. The first digit sequence is then compared with the newly entered second digit sequence, and the control device determines the portion of the first digit sequence that has the most digits matching the digits of the second digit sequence. A precondition is that the second digit sequence contains fewer digits than the first digit sequence. The non-matching digits of the second digit sequence are then substituted for the corresponding digits of that portion of the first digit sequence.

[0007] Such a system forms, for example, a section of a telephone in which telephone numbers are composed by voice input. The system according to the invention may also be used in enhanced services (for example, language selection in a network).

[0008] The system according to the invention has the advantage that, in the case of a correction, the user only enters the digits concerned together with their respective context. For example, the user speaks only the misunderstood digit and the digits immediately before and after it. This form of correction matches the natural behavior users are accustomed to and is faster than entering the whole digit sequence again. In addition, this form of correction is more likely to succeed, because entering a partial digit sequence carries a smaller risk of recognition errors.

[0009] During the evaluation process, the control device determines the number of digits in the first and second digit sequences and determines, for all relevant portions of the first digit sequence, which digits match the digits of the second digit sequence. If several portions (sub-sequences) of the first digit sequence have the same number of matches, one of these sub-sequences is selected for correction. The first of the sub-sequences that have the same number of digits matching the second digit sequence may be selected.

[0010] Furthermore, the control device is used to mark at least one digit of the first digit sequence that is replaced by a digit of the second digit sequence and to pronounce the marked digit. The other digits are also pronounced with a specific stress. The control device renders digits at odd positions in the digit sequence with rising stress, and digits at even positions and the digit at the last position with falling stress. This natural form of pronunciation with pairwise prosody can improve retention of the digits. Pronouncing the corrected digits with emphasis (a specific stress) makes it easier to verify that the correction succeeded.

[0011] By outputting the corrected first digit sequence after the first and second digit sequences have been evaluated, the control device poses the user the question whether the digit sequence has been recognized correctly.

[0012] The invention also relates to a speech recognition method for numerals in which at least one digit sequence is recognized and the recognized digits of the at least one digit sequence are output. In this method, in the event that at least one digit of a first digit sequence is recognized incorrectly, a dictated second digit sequence is compared with the first digit sequence; if the second digit sequence contains fewer digits than the first digit sequence, the portion of the first digit sequence that has the most digits matching the digits of the second digit sequence is determined for correction; and the non-matching digits of the second digit sequence are substituted for the digits of the determined portion of the first digit sequence.

[0013] These and other aspects of the invention will become apparent from the embodiments described below.

[0014]

[Embodiments of the invention] FIG. 1 shows a preferred embodiment of a speech recognition system for numerals, comprising a microphone 1, two amplifiers 2 and 3, a speech recognition device 4, an evaluation circuit 5 and a loudspeaker 6. The speech recognition device 4 and the evaluation circuit 5 form a control device 33. The user's voice input is supplied to the microphone 1. The voice inputs handled by the system are specific digit sequences (for example, "3 8 7 4 2 1 6") and control inputs, which are supplied to the speech recognition device 4 via the amplifier 2. The speech recognition device 4 may comprise, for example, a signal processor with suitable peripherals whose program performs the speech recognition. Such programs are known; the method on which they are based may be taken, for example, from the publication "Hermann Ney, Volker Steinbiss, Xavier Aubert, Reinhold Haeb-Umbach: Progress in Large Vocabulary, Continuous Speech Recognition, in: H. Niemann, R. de Mori, G. Hanrieder: Progress and Prospects of Speech Research and Technology, 1994, pp. 75 to 92". The digit sequence entered by the user is recognized and supplied to the evaluation circuit 5 as code words (for example, in ASCII code). The evaluation circuit 5 includes a voice response unit that forms a voice response from the recognized digits. The voice response unit may be a speech synthesizer that supplies synthesized digits to the amplifier 3, or it may retrieve stored speech segments of a speaker from a memory and supply these to the amplifier 3.

[0015] The voice response is played to the user via the loudspeaker 6 so that the user can check it. The evaluation circuit also produces specific announcements or phrases, such as "Was this digit sequence understood correctly?". The user may then make a correction in the event that a digit or the digit sequence was misunderstood.

[0016] The evaluation circuit 5 further includes a microprocessor with suitable peripherals. The microprocessor runs software modules that process the recognized control inputs and digits and control the voice response unit. FIG. 2 shows a flowchart of the main steps in recognizing digits. A digit sequence is recognized by the speech recognition device 4, which is represented by the abbreviation ERKZN in block 7 of FIG. 2. An analysis and accent marking (ANAK, block 8) is then carried out for the voice response unit. Digits at odd positions in the digit sequence are marked "b" and digits at even positions are marked "e". The last digit of the sequence is marked "e" regardless of whether its position is odd or even. This means that the digits at the first, third, fifth, etc. positions of the digit sequence are characterized by "b" and the digits at the second, fourth, sixth, etc. positions by "e". The response is thereby given a pairwise prosody. For example, the digit sequence "3 8 7 4 2 1 6" is marked as "3b 8e 7b 4e 2b 1e 6e".
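The accent marking of block 8 can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `mark_accents` and the string representation of digit sequences are assumptions made for the example, and only the "b"/"e" rule and the sample sequence come from the text.

```python
def mark_accents(digits: str) -> list[str]:
    """Tag each digit with 'b' (rising stress) or 'e' (falling stress).

    Hypothetical helper: digits at the 1st, 3rd, 5th, ... positions get 'b',
    digits at the 2nd, 4th, 6th, ... positions get 'e', and the last digit
    always gets 'e'.
    """
    last = len(digits) - 1
    marked = []
    for i, digit in enumerate(digits):
        # i is zero-based, so i = 0, 2, 4 are the 1st, 3rd, 5th digits.
        tag = "e" if i == last or i % 2 == 1 else "b"
        marked.append(digit + tag)
    return marked


print(" ".join(mark_accents("3874216")))  # 3b 8e 7b 4e 2b 1e 6e
```

The last-digit rule is checked first so that a sequence with an odd number of digits still ends on falling stress.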

[0017] In the flowchart of FIG. 2, the step following block 8 is represented by block 9 (AUKO). This block stands for playing back the recognized digit sequence and asking whether the digit sequence was recognized correctly. When playing back the digit sequence, the voice response unit of the evaluation circuit 5 uses two variants of each utterance: a digit is output with either rising or falling stress. The rising-stress variant is used for digits marked "b" and the falling-stress variant for digits marked "e". The voice response unit thus produces a pairwise prosodic pattern that corresponds to natural human speech.

[0018] Once the user has answered the system's question about what it recognized (block 10, ERKA), query block 11 (OK?) tests what the answer is. If the user answered "yes", the digit sequence has been recognized and the input is terminated. The recognized digit sequence can then be used for further processing. If the answer is "no", the system asks the user for a correction, which is represented by the abbreviation AUFR in block 12. The user may then enter either a completely new digit sequence or a partial digit sequence. The subsequent speech recognition and analysis of the newly entered digit sequence is represented by the abbreviation ERKZK in block 13. After the speech recognition and analysis in block 13, the accent analysis and marking described above is carried out again (block 8).

[0019] The analysis represented by block 13 is explained further with the aid of the flowcharts of FIGS. 3 and 4. The start of the analysis in FIG. 3 is denoted ST. First, it is tested whether the length L(Z1) of the previous digit sequence Z1 is shorter than the length L(Z2) of the new digit sequence Z2 (block 14: L(Z1) < L(Z2)). If it is, the new digit sequence Z2 takes the place of the previous digit sequence Z1, which is indicated by Z1→Z2 in block 15, and the analysis ends (EN). If, however, the digit sequence Z1 is longer than or equal in length to the digit sequence Z2, the variables m, mT and mS are set to zero, as shown in block 16 (m = 0, mT = 0, mS = 0).

[0020] Next, the part of the flowchart is described that finds which portion of the previous digit sequence Z1 has the most in common with the new digit sequence Z2. At the start of the first loop, it is tested whether the digit sequence Z2 has already been compared with every portion of the digit sequence Z1 (block 17). Block 17 checks whether the value of the variable m is less than or equal to the difference between the lengths of the digit sequences Z1 and Z2: m ≤ L(Z1) − L(Z2). If, for example, Z1 consists of the digits "3 8 7 4 2 1 6" and Z2 of the digits "7 5 2", the length of Z1 is 7 and the length of Z2 is 3. The first loop must therefore be passed through five times in total in order to compare the five portions of Z1 ("387", "874", "742", "421" and "216") with Z2. If the comparison in block 17 shows that the value of m is greater than the difference between the lengths of Z1 and Z2, the first loop ends and processing switches to the second loop, whose flowchart is shown in FIG. 4. The transition to the second loop is indicated by the mark "A" (block 18).
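The portions that the first loop visits can be listed with a short sketch. The helper name `portions` is hypothetical; the offsets m = 0 … L(Z1) − L(Z2) and the example values are taken from the paragraph above.

```python
def portions(z1: str, z2: str) -> list[str]:
    """List every portion of z1 that has the same length as z2.

    Hypothetical helper: one portion per offset m with
    0 <= m <= len(z1) - len(z2), in the order the first loop visits them.
    """
    width = len(z2)
    return [z1[m:m + width] for m in range(len(z1) - width + 1)]


print(portions("3874216", "752"))  # ['387', '874', '742', '421', '216']
```

With L(Z1) = 7 and L(Z2) = 3 the offsets run from 0 to 4, which gives the five passes mentioned in the text.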

[0021] If the comparison m ≤ L(Z1) − L(Z2) is true, two further variables n and t are set to zero, as shown in block 19. The variable n indicates the position of a digit in the digit sequence Z2, and the variable t indicates the number of digits that match between the portion of Z1 under comparison and the digit sequence Z2. The next query block 20, together with blocks 21, 22 and 23, forms a sub-loop. Block 20 checks whether the value of the variable n is smaller than the length of the digit sequence Z2. If so, query block 21 asks whether the digit at position m + n of the digit sequence Z1 is equal to the digit at position n of the digit sequence Z2 (Z1(m+n) = Z2(n)). If the answer is affirmative, the variable t is incremented (block 22). If the answer is negative, processing jumps to block 23, just as it does after block 22 has been processed. Block 23 represents the incrementing of the variable n. Processing then continues at block 20.

[0022] When the value of the variable n is greater than or equal to the length of the digit sequence Z2 (block 20), processing continues at query block 24, which tests whether the value of the variable t is greater than the value of the variable mT. If so, the variable mT is set equal to t and the variable mS is set equal to m (block 25). The variable mS identifies the portion of the digit sequence Z1 that has the most digits matching the digit sequence Z2. The variable mT is equal to the number of matching digits. In the next step, after a negative result of the query in block 24 or after the variables mT and mS have been set in block 25, the variable m is incremented in block 26. This concludes the first loop, which determines the portion of the digit sequence Z1 that corresponds most closely to the digit sequence Z2. In the example given above, the portion with the digits "7 4 2" in the digit sequence Z1 with the digits "3 8 7 4 2 1 6" corresponds most closely to the digit sequence Z2 with the digits "7 5 2".
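The first loop (blocks 16 to 26) can be sketched as below. The variable names m, n, t, mT and mS follow the text; the function name `best_offset`, the tuple return value and the use of strings for digit sequences are assumptions made for this illustration.

```python
def best_offset(z1: str, z2: str) -> tuple[int, int]:
    """Return (mS, mT): start of the best-matching portion and its match count.

    Sketch of the first loop, assuming len(z1) >= len(z2).
    """
    m, m_t, m_s = 0, 0, 0                    # block 16
    while m <= len(z1) - len(z2):            # block 17
        n, t = 0, 0                          # block 19
        while n < len(z2):                   # block 20
            if z1[m + n] == z2[n]:           # block 21
                t += 1                       # block 22
            n += 1                           # block 23
        if t > m_t:                          # block 24
            m_t, m_s = t, m                  # block 25
        m += 1                               # block 26
    return m_s, m_t


print(best_offset("3874216", "752"))  # (2, 2)
```

Because block 24 uses a strict comparison, an equally good later portion never displaces an earlier one, so the first of several tied portions is kept.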

[0023] The second loop, shown in the flowchart of FIG. 4, marks those digits in the portion of the digit sequence Z1 that differ from the digits of the digit sequence Z2. The flowchart of FIG. 4 starts at the mark "A" in block 27. Before the second loop starts, the variable n is set to zero, as shown in block 28. This variable n indicates the position of a digit in the digit sequence Z2. The second loop comprises query blocks 29 and 30 and further blocks 31 and 32. Query block 29 asks whether the value of the variable n is smaller than the length of the digit sequence Z2 (n < L(Z2)). If it is not, the analysis ends. If it is, block 30 tests whether the digit in the portion of the digit sequence Z1 is equal to the assigned digit of the digit sequence Z2. The mathematical expression for this is Z1(m+n) = Z2(n). If the digit at position m + n of the digit sequence Z1 corresponds to the digit at position n of the digit sequence Z2, processing continues at block 32. Otherwise, if the digits do not correspond, the digit at position n of the digit sequence Z2 is substituted for the digit at position n + m of the digit sequence Z1. This is represented in block 31 by the expression Z1(m+n)→Z2(n). In addition, the replaced digit is given the reference mark "a". This is represented in block 31 by the expression aZ1(m+n). In the next step, the variable n is incremented, as shown in block 32. Processing then returns to query block 29.
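The second loop (blocks 28 to 32) can be sketched in the same style. The function name `apply_correction` and the returned list of replaced positions are assumptions. The text writes the offset as m; the sketch takes it as an explicit `offset` argument and assumes that the start position mS found by the first loop is what is meant.

```python
def apply_correction(z1: str, z2: str, offset: int) -> tuple[str, list[int]]:
    """Substitute the non-matching digits of z2 into z1, starting at offset.

    Returns the corrected sequence and the zero-based positions that were
    replaced (the digits that would carry the mark "a").
    """
    digits = list(z1)
    replaced = []
    n = 0                                    # block 28
    while n < len(z2):                       # block 29
        if digits[offset + n] != z2[n]:      # block 30
            digits[offset + n] = z2[n]       # block 31: substitute the digit
            replaced.append(offset + n)      # block 31: mark it with "a"
        n += 1                               # block 32
    return "".join(digits), replaced


print(apply_correction("3874216", "752", 2))  # ('3875216', [3])
```

Only the mismatching digit is touched; the matching digits "7" and "2" serve purely as context that locates the correction.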

[0024] After the analysis in block 13 (FIG. 2), the new digit sequence Z1 is supplied to block 8. For example, a new digit sequence Z1 with the digits "3 8 7 5 2 1 6" is formed from the previous digit sequence Z1 with the digits "3 8 7 4 2 1 6" and the digits "7 5 2" of the digit sequence Z2. Here the digit "5" is substituted for the digit "4". In addition, block 8 receives from block 13 the marking of the substituted digits, that is, the digits denoted by the letter "a". In block 8, these digits are also marked with the letters "b" and "e" as described above. A voice response corresponding to the marked digit sequence is produced in block 9. Digits marked "b" are pronounced with rising stress and digits marked "e" with falling stress. Digits marked "a" are given additional emphasis to indicate the change to the user. For example, the new digit sequence Z1 is marked as "3b 8e 7b a5e 2b 1e 6e".
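Putting the pieces together, the path from a misrecognized sequence and a partial correction to the marked playback string might look like the sketch below. The function names `correct` and `mark` are hypothetical, the best offset is found with Python's built-in `max` rather than the explicit flowchart loops, and leaving a fully replaced sequence without "a" marks is an assumption, since the text does not say how that case is marked.

```python
def correct(z1: str, z2: str) -> tuple[str, set[int]]:
    """Return the corrected sequence and the zero-based positions that changed."""
    if len(z1) < len(z2):
        # Z2 is longer than Z1, so it replaces Z1 as a whole.
        # Assumption: no individual digit is marked in this case.
        return z2, set()

    def matches(m: int) -> int:
        # Number of digits of z2 that agree with z1 when aligned at offset m.
        return sum(a == b for a, b in zip(z1[m:], z2))

    # max() returns the first of several equally good offsets.
    best = max(range(len(z1) - len(z2) + 1), key=matches)
    digits = list(z1)
    changed = set()
    for n, digit in enumerate(z2):
        if digits[best + n] != digit:
            digits[best + n] = digit
            changed.add(best + n)
    return "".join(digits), changed


def mark(digits: str, changed: set[int]) -> str:
    """Add 'b'/'e' stress tags and prefix replaced digits with 'a'."""
    last = len(digits) - 1
    out = []
    for i, digit in enumerate(digits):
        stress = "e" if i == last or i % 2 == 1 else "b"
        out.append(("a" if i in changed else "") + digit + stress)
    return " ".join(out)


new_z1, changed = correct("3874216", "752")
print(new_z1)                 # 3875216
print(mark(new_z1, changed))  # 3b 8e 7b a5e 2b 1e 6e
```

The printed marking reproduces the example in the text: the substituted "5" keeps its positional stress tag "e" and additionally carries the emphasis mark "a".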

[0025] The loop shown in the flowchart of FIG. 2, comprising blocks 8 to 13, is passed through until the user approves the result.
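The confirmation loop of blocks 8 to 13 can be outlined as follows. This is a rough sketch under simplifying assumptions: speech input and output are replaced by a scripted list of replies, each reply is either "yes" or the digits spoken after a "no", and the `merge` callable stands in for the analysis of block 13. None of these names appear in the source.

```python
from typing import Callable, Iterable


def run_dialogue(first: str,
                 replies: Iterable[str],
                 merge: Callable[[str, str], str]) -> str:
    """Repeat playback and correction until the user confirms.

    Hypothetical driver: `replies` simulates the user, `merge` combines the
    current sequence with a newly spoken full or partial sequence.
    """
    z1 = first
    for reply in replies:
        # Blocks 8 to 11: mark accents, play z1 back, ask whether it is correct.
        if reply == "yes":
            return z1
        # Blocks 12 and 13: ask for a correction, recognize and analyze it.
        z1 = merge(z1, reply)
    raise RuntimeError("dialogue ended without confirmation")


# Usage: the user rejects the first result and re-enters the whole sequence.
result = run_dialogue("3874216", ["3875216", "yes"], lambda old, new: new)
print(result)  # 3875216
```

Passing the merge step in as a parameter keeps the loop independent of how the partial correction is aligned with the previous sequence.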

[0026] It should further be noted that the control device 33 may be arranged as a computer system that performs the functions of the speech recognition device 4 and the evaluation circuit 5.

[Brief description of the drawings]

FIG. 1 is a block diagram of the speech recognition system.

FIG. 2 is a flowchart explaining the speech recognition system.

FIG. 3 is a flowchart explaining the speech recognition system.

FIG. 4 is a flowchart explaining the speech recognition system.

[Explanation of symbols]

1 microphone
2, 3 amplifiers
4 speech recognition device
5 evaluation circuit
6 loudspeaker
33 control device

Continued from the front page
(72) Inventor: Nils Lenke, Rathausstraße 8, 53332 Bornheim, Federal Republic of Germany
(72) Inventor: Joerg Ockel, Bentstrasse 27a, 52066 Aachen, Federal Republic of Germany

Claims

[Claims]

1. A speech recognition system for numerals, comprising a control device that recognizes at least one digit sequence and outputs the recognized digits of the at least one digit sequence, characterized in that, if at least one digit of a first digit sequence has been recognized incorrectly, the control device compares a dictated second digit sequence with the first digit sequence; if the second digit sequence contains fewer digits than the first digit sequence, the control device determines the portion of the first digit sequence that has the most digits matching the digits of the second digit sequence; and the control device substitutes the non-matching digits of the second digit sequence for the digits of the determined portion of the first digit sequence.

2. The system according to claim 1, characterized in that the control device: determines the number of digits in the first and second digit sequences; determines, for all relevant portions of the first digit sequence, which digits match the second digit sequence; and, if several portions of the first digit sequence have the same number of matching digits, selects the first of the portions of the first digit sequence having the most matches.

3. The system according to claim 1, characterized in that the control device is used to mark at least one digit of the first digit sequence that is replaced by a digit of the second digit sequence and to pronounce the marked digit with emphasis.

4. The system according to claim 1, characterized in that, by outputting the corrected first digit sequence after evaluating the first and second digit sequences, the control device poses the user the question whether the digit sequence has been recognized correctly.

5. The system according to claim 1, characterized in that the control device is used to output the digits at odd positions of the digit sequence with rising stress and the digits at even positions of the digit sequence with falling stress.

6. A speech recognition method for numerals in which at least one digit sequence is recognized and the recognized digits of the at least one digit sequence are output, characterized in that, in the event that at least one digit of a first digit sequence is recognized incorrectly, a dictated second digit sequence is compared with the first digit sequence; if the second digit sequence contains fewer digits than the first digit sequence, the portion of the first digit sequence that has the most digits matching the digits of the second digit sequence is determined; and the non-matching digits of the second digit sequence are substituted for the digits of the determined portion of the first digit sequence.