JPH10510081A

JPH10510081A - Apparatus and voice control device for equipment

Info

Publication number: JPH10510081A
Application number: JP9513042A
Authority: JP
Inventors: ラウケフォルカー
Original assignee: ブラウプンクト−ヴェルケゲゼルシャフトミットベシュレンクテルハフツング
Priority date: 1995-09-26
Filing date: 1995-09-26
Publication date: 1998-09-29
Also published as: EP0793819A1; DE59509345D1; EP0793819B1; WO1997012302A1

Abstract

(57)【要約】本発明は、装置及び機器の動作を制御する命令を用いた、装置及び機器の音声制御用装置に関する。本発明では表示装置上にそのつどの動作状態に依存して、総体的に設けられた命令の一部が表示され、この場合１つの入力が、表示された命令のそれぞれ１つの発声によって行われる。メモリは、マイクロフォンを介して受け取られた各音声信号又はそこから導出された信号の記憶のために設けられている。記憶された信号は、表示された各命令に相応する、音声に係わる信号と比較される。この比較で正の結果となった命令は、選択結果として受入れられる。 (57) [Summary] The present invention relates to a device for voice control of a device and a device using an instruction for controlling the operation of the device and the device. According to the invention, depending on the respective operating state, a part of the collectively provided commands is displayed on the display device, wherein one input is made by means of one utterance of each of the displayed commands. . A memory is provided for storing each audio signal received via the microphone or a signal derived therefrom. The stored signals are compared with audio-related signals corresponding to each displayed command. An instruction having a positive result in this comparison is accepted as a selection result.

Description

【発明の詳細な説明】装置及び機器の音声制御用装置従来の技術本発明は、装置又は機器の作動を制御する命令を用いた、装置及び機器の音声制御用装置に関する。装置及び機器の制御に対してはますます音声入力手段が用いられている。この場合はマイクロフォンを介して受入れられたユーザの命令が、音声識別手法を用いて識別される。この場合話し手に依存する音声識別手法と話し手に依存しない音声識別手法との間では違いがある。話し手に依存しない音声識別では、限定された固定の命令セットがあり、この命令セットはユーザによってトレーニングされなければならない。それに対しては学習フェーズにおいて各命令がユーザによって複数回繰り返され、それによって音声識別装置がユーザの音声に適合化される。話し手に依存しない音声識別も固定的に定義される限定された命令セットで動作する。この場合はいずれにしても音声識別手法はもはやトレーニングの必要がないようにインテリジェンスなものである。前記２つの手法で共通しているのは、唯１つの限られた所定の固定命令セットしか識別できないことである。これは通常ユーザが暗記して学習しなければならないものである。この命令の数が多ければ多いほど、音声識別手法とユーザの記憶力に対する要求も高くなる。このような公知の音声識別手法の欠点は、例えばカーラジオの音声制御を著しく困難にする。このような音響機器では、必要となる命令の数も比較的多くなり、とりわけ長距離走行中は１つのカーラジオに対して多くの放送局がチューニングされなければならない。本発明の課題は、装置及び機器の音声制御用装置において、前述した公知装置の欠点に鑑みこれを解消すべく改善を行うことであり、特にユーザに対して多くの命令の記憶学習の負担が軽減されるように改善することである。上記課題は本発明により、表示装置にそのつどの動作状態に依存して、総体的に設けられた命令の一部が表示され、前記表示された命令のそれぞれ１つの発声によって１つの入力が行われ、マイクロフォンを介して受入れられた各音声信号又はそこから導出された信号の記憶のためにメモリが設けられており、記憶された信号が、表示された各命令に相応する、音声に係わる信号と比較され、この比較で正の結果となった命令が、選択結果として受入れられるように構成されて解決される。本発明による装置は一方で次のような利点を有している。すなわちそのつどの目下の装置と機器の作動状態のもとでどの命令が入力可能かがユーザに表示される。この場合はそのような選択の利点としてだけではなく、個々の命令の視認性も得られる。これによりユーザは、所定の言葉を役立てることにくみできる。それにより例えばユーザによる音声識別系にとって識別不可能な同義語の誤った使用が避けられる。他方では本発明による装置は次のような利点を有する。すなわち音声識別装置が、受け取った音声信号を総体的に可能な命令が多数にもかかわらず、そのつどのメニュー表示された少ない命令との比較だけでよい利点を有する。これにより簡単で確実な音声識別手法が選択される。本発明の別の有利な実施例によれば、音声に係わる信号が、表示された各命令の音声合成によって形成され、さらなる別のメモリにファイルされる。これによって次のような利点が得られる。すなわち、装置及び機器の製造者と、場合によってはユーザが、コマンド選択リスト（メニュー）ないしは個々の命令の変更の際に、新たな命令をテキスト形式（例えばいわゆるＡＳＣＩI文字等、これらが比較すべき音声に係わる信号に変換される）で入力するだけでよい利点が得られる。さらに本発明の別の実施例によれば、メモリと、さらなる別のメモリは、アナログメモリであり、アナログ信号との比較が行われるか、又は前記メモリと、さらなる別のメモリは、デジタルメモリであり、デジタル信号との比較が行われる。さらに別の有利な実施例によれば、総体的に設けられた全ての命令の、音声に係わる比較すべき信号がメモリにファイルされており、比較のための該メモリへのアクセスが前記表示された各命令に応じて制御される。さらに別の実施例によれば、前記音声に係わる信号は、そのつどの命令の発音のもとでの基本変調を表している。これにより、記憶されている信号と、音声に係わる信号との簡単な比較が可能となる。実施例次に本発明を図面に基づき以下に詳細に説明する。図示の実施例では本発明による装置によってカーラジオが音声制御されている。このカーラジオは、アンテナ２を備えた受信部１、信号処理回路３、２つの出力段４,５、スピーカ６,７によって概略的に示されている。信号処理回路３は公知のように、ステレオデコーダ、ラジオデータ信号デコーダ、交通情報デコーダ、音量及び音質調整器を含んでいる。受信部１と信号処理回路３は、マイクロコンピュータ８によって制御されている。このマイクロコンピュータは、信号処理回路３から様々なデータ、例えば復号化されたラジオデータ信号等を受け取る。マイクロコンピュータ８の出力側は、表示装置（ディスプレイ）９と接続されている。この表示装置は、そのつどのカーラジオの作動状態において実行可能な命令のメニューを表示する。これは例えば“カセット”、“ＦＭ ”、“中波”、“交通情報”等の設定リストであってもよく、又そのつどの受信すべき放送局の選局情報であってもよい。音声制御方式でない公知の入力装置では、複数の放送局からの１つの選局が、局名の横に配置されているキーボタンのプッシュによって行われる。本発明による、音声制御用装置では、マイクロフォン１０が設けられている。このマイクロフォン１０の出力信号が増幅器１１を介してメモリ１２に供給される。表示装置９の他にマイクロコンピュータ８には、音声符号器１３が接続されている。この音声符号器１３の出力信号は音声合成信号を表し、メモリ１４内に書き込み可能である。音声符号化のための方法は、例えばそれ自体公知のコンピュータ“Amiga”用のコンピュータプログラムＳＡＹ等がある。メモリ１２及び１４の内容は、比較装置１５において比較される。音声符号器１３内で形成される信号の１つとメモリ１２内にある信号とが（許容偏差範囲も含めて）一致する場合には、これが音声に係わる信号としてマイクロコンピュータ８に通知される。すなわち入力された命令と一致する、表示メニューからの命令が通知される。その後で相応の機能がマイクロコンピュータ８によって実行される。その後は場合によってその他のメニューが表示装置９で視認できるようにされてもよい。この場合はメニュー内で構築されたデータが音声符号器１３に供給され、それに対して新たな音声入力が可能となる。DETAILED DESCRIPTION OF THE INVENTION Apparatus and voice control device for equipment Conventional technology The present invention relates to audio of devices and equipment using instructions to control the operation of the equipment or equipment. The present invention relates to a control device. Increasingly, voice input is used for controlling devices and equipment. this If the user's instruction received via the microphone uses voice identification techniques Is identified. In this case, speaker-dependent speech recognition and speaker-independent There is a difference between this and the speech recognition method. Limited speaker-independent speech identification There is a fixed instruction set that is trained by the user. Must be done. In the learning phase, each instruction is Multiple times, thereby adapting the speech recognition device to the user's speech. You. Speaker-independent speech identification also operates with a limited set of fixed instructions. Make. In any case, the speech recognition method no longer needs training Not as intelligent. What the two approaches have in common is that there is only one limited fixed instruction set It can only be identified. This usually requires the user to memorize and learn Not something. The greater the number of these instructions, the more the voice The demands on cognition also increase. Disadvantages of such known voice identification techniques are, for example, the difficulty in controlling voice on car radios. Make it difficult. Such audio equipment requires a relatively large number of instructions. Many broadcasters tune to one car radio, especially when driving long distances. Must be An object of the present invention is to provide an apparatus and a device for controlling audio of a device, wherein In order to solve this problem, it is necessary to make improvements to solve this problem. To reduce the memory learning burden of the instruction. According to the present invention, the above-mentioned object is achieved by a display device depending on the respective operating state. Are displayed, and each one of the displayed instructions is uttered. Makes one input and receives each audio signal through the microphone Or a memory is provided for the storage of signals derived therefrom, Is compared with the audio signal corresponding to each command displayed. Instructions that yield a positive result in the comparison are configured and accepted to be accepted as a selection result. Is decided. On the one hand, the device according to the invention has the following advantages. That is, Operating status of current equipment and equipment Which command can be input under the condition is displayed to the user. In this case such Not only as an advantage of the selection, but also the visibility of the individual instructions. This allows you Users can use certain words to help. This allows, for example, Incorrect use of synonyms that are indistinguishable by a speech recognition system. On the other hand The device according to the invention has the following advantages. That is, the voice recognition device Despite the large number of commands that can collectively output the audio signal, It has the advantage that only a comparison with the few instructions displayed is sufficient. This makes it simple and reliable Is selected. According to another advantageous embodiment of the invention, the signal relating to the audio is represented by each displayed command. And filed in yet another memory. This Thus, the following advantages can be obtained. That is, in some cases, In other words, the user can change the command selection list (menu) or individual commands. At this time, new instructions are written in text format (for example, so-called ASCII characters, etc.) Is converted to a signal related to the sound to be compared). You. According to yet another embodiment of the present invention, the memory and the further memory are A log memory, where a comparison with an analog signal is made or Another memory consisting of is a digital memory, where a comparison with a digital signal is made . According to yet another advantageous embodiment, the audio of all instructions provided collectively is provided. The relevant signal to be compared is stored in a memory and stored in the memory for comparison. Is controlled according to each of the displayed instructions. According to yet another embodiment, the voice-related signal is the pronunciation of the respective instruction. Represents the basic modulation under. This allows the stored signal and audio A simple comparison with the signals concerned is made possible. Example Next, the present invention will be described in detail below with reference to the drawings. In the embodiment shown, the car radio is voice-controlled by the device according to the invention. . This car radio has a receiving unit 1 having an antenna 2, a signal processing circuit 3, and two outputs. Schematically represented by power stages 4,5 and speakers 6,7. The signal processing circuit 3 is public As you know, stereo decoder, radio data signal decoder, traffic information decoder , Volume and tone control. The receiving unit 1 and the signal processing circuit 3 are controlled by a microcomputer 8. You. The microcomputer receives various data from the signal processing circuit 3, for example, Receives encoded radio data signals and the like. The output side of the microcomputer 8 , Display device (display 9). This indicator shows the operating status of the respective car radio. Display a menu of executable instructions. This is, for example, "cassette", "FM , "Medium wave", "traffic information", etc. It may be channel selection information of a broadcast station to be performed. A known input device that is not a voice control system Is one of the key buttons located next to the station name. Done by push. In the voice control device according to the present invention, a microphone 10 is provided. The output signal of the microphone 10 is supplied to the memory 12 via the amplifier 11. You. A speech encoder 13 is connected to the microcomputer 8 in addition to the display device 9. ing. The output signal of the speech coder 13 represents a speech synthesis signal, Writable. Methods for speech coding include, for example, compilations known per se. There is a computer program SAY for the computer "Amiga". Memory 12 and The contents of 14 are compared in the comparison device 15. Formed in the speech encoder 13 One of the signals in the memory 12 matches the signal in the memory 12 (including the allowable deviation range). In this case, this is notified to the microcomputer 8 as a signal relating to voice. . That is, a command from the display menu that matches the input command is notified. After that, the corresponding function is performed by the microcomputer 8 Be executed. Thereafter, other menus may be made visible on the display device 9 in some cases. You may. In this case, the data constructed in the menu is supplied to the speech encoder 13. Then, a new voice input becomes possible.

Claims

[Claims] 1. For voice control of devices and equipment using instructions to control the operation of the equipment and equipment In the device, Depending on the respective operating state of the display device (9), the instructions provided as a whole are Partial display, one input by each one utterance of the displayed instruction Is performed and each audio signal received therethrough via the microphone (10) or A memory (12) is provided for storing the signals derived therefrom; The stored signal is compared with the audio signal corresponding to each command displayed. Note that the instruction with a positive result in this comparison is accepted as the selection result. A device for controlling voice of devices and equipment. 2. The signal related to the voice is formed by voice synthesis of each displayed instruction. Device according to claim 1, wherein the file is stored in a further memory (14). . 3. The memory (12) and still another memory (14) are analog memories 3. Apparatus and apparatus according to claim 2, wherein the comparison with an analog signal is performed. Equipment for controlling the sound of vessels. 4. The memory (12) and still another memory (14) are digital memories 3. Apparatus and apparatus according to claim 2, wherein the comparison with a digital signal is performed. Equipment for controlling the sound of vessels. 5. Signals to be compared related to voice of all instructions provided as a whole are stored in memory. And access to the memory for comparison is displayed on each of the displayed The apparatus for controlling voice of an apparatus and apparatus according to claim 1, wherein the apparatus is controlled in accordance with a command. Place. 6. The signal relating to the voice represents the basic modulation under the pronunciation of the respective instruction. The voice control of the apparatus and the apparatus according to any one of claims 1 to 5, wherein Equipment.