JP2002341893A

JP2002341893A - Speech recognition device

Info

Publication number: JP2002341893A
Application number: JP2001146205A
Authority: JP
Inventors: Mikio Oda; 幹夫小田
Original assignee: Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Holdings Corp
Priority date: 2001-05-16
Filing date: 2001-05-16
Publication date: 2002-11-29

Abstract

PROBLEM TO BE SOLVED: To provide a sound transmitter for speech recognition without apocope in which power consumption is reduced, an operating time is extended. SOLUTION: The supply of a power source to a main processing part for speech transmission is continued for a period when a talk switch is pushed and for several seconds even when the talk switch is opened, and thereby a speech signal without the apocope is transmitted on radio. Accordingly, the reduction of power consumption of a power source battery is realized, the operating time is extended, and a speech is reliably transmitted.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、無線により送信さ
れた音声信号から、入力音声を認識する音声認識装置に
おいて、電池を電源とした音声送信機の消費電力を低減
し、電池の動作使用時間の拡大を図り、音声送信を確実
に行う音声認識装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice recognition apparatus for recognizing an input voice from a voice signal transmitted wirelessly, the power consumption of a voice transmitter powered by a battery being reduced, and the operating time of the battery being used. The present invention relates to a voice recognition device that ensures voice transmission by enlarging a voice.

【０００２】[0002]

【従来の技術】昨今の音声認識技術は、デジタル信号処
理技術の向上、処理ＬＳＩの高性能化、低価格化などに
より、民生機器の音声認識によるリモコン化が図られて
おり、機器の操作性向上に役立っている。しかし、現状
のＴＶやＶＴＲなどのリモコン送信機は、そのポータブ
ルさから電池を電源としており、電池寿命は約１年以上
と長いが、音声認識によるリモコン送信機では、電池の
寿命は極端に短くなる。2. Description of the Related Art In recent voice recognition technologies, remote control by voice recognition of consumer devices has been attempted due to improvements in digital signal processing technology, higher performance of processing LSIs, lower prices, and the like. Helps to improve. However, current remote control transmitters such as TVs and VTRs use a battery as a power source because of their portability, and the battery life is as long as about one year or more. However, the battery life of a remote control transmitter using voice recognition is extremely short. Become.

【０００３】通常、音声認識処理は、マイクロコンピュ
ータ、ＤＳＰなどで処理されており、数百mAの消費電流
が必要である。一方電池の技術開発も盛んに行われてお
り、単３電池で1500ｍAｈなどの高容量のものがある。Normally, speech recognition processing is performed by a microcomputer, DSP, or the like, and requires a current consumption of several hundred mA. On the other hand, battery technology is also being actively developed, and there are AA batteries with a high capacity of 1500 mAh or the like.

【０００４】しかしながら、この種の電池を使用して
も、数百mAの消費電流では、数時間しか動作しないとい
う課題を有する。ＡＣ電源を使用すればこの課題は解決
できるが、ポータブルさは損なわれ、実用性に欠ける。[0004] However, there is a problem that even with this kind of battery, operation is performed only for several hours at a current consumption of several hundred mA. The use of an AC power supply can solve this problem, but its portability is impaired and it is not practical.

【０００５】この課題を解決する音声認識装置として、
特開平５−１８１４９３に示す構成が提案されている。
以下図６、図７を参照しながら音声認識装置の一例につ
いて説明する。[0005] As a voice recognition device for solving this problem,
A configuration shown in Japanese Patent Application Laid-Open No. 5-181493 has been proposed.
Hereinafter, an example of the speech recognition device will be described with reference to FIGS.

【０００６】図６は第１の実施の形態を示すものであ
り、図６において、符号２０は発声された音声信号を電
気信号に変換するマイク、２１は入力されたマイク信号
出力から音声の始端及び終端を検出する音声区間検出
部、２２は音声区間検出部が検出した始端及び終端の音
声区間の音声信号を受信系へ無線で送信する無線送信部
である。FIG. 6 shows a first embodiment. In FIG. 6, reference numeral 20 denotes a microphone for converting an uttered voice signal into an electric signal, and reference numeral 21 denotes a starting point of voice from an input microphone signal output. An audio section detection unit 22 for detecting the end and the end is a wireless transmission unit that wirelessly transmits the audio signals of the start and end audio sections detected by the audio section detection unit to the receiving system.

【０００７】以上のように構成された第１の実施例の音
声認識装置について、その動作を説明する。使用者が音
声認識を利用して機器を制御する場合、まず、図６で構
成された音声送信機に向かって、音声認識処理できる単
語を発声する。The operation of the speech recognition apparatus of the first embodiment configured as described above will be described. When a user controls a device using voice recognition, first, a word that can be subjected to voice recognition processing is uttered to the voice transmitter configured in FIG.

【０００８】マイク２０に音声が入力されると音声区間
検出部２１で、まず始端を検出し、発声が終了すると今
度は終端を検出する。この始端と終端の区間だけ無線送
信部２２に電源を供給し、無線送信部２２で入力された
音声を搬送波に乗せて送信する。When a voice is input to the microphone 20, the voice section detector 21 first detects the beginning, and when the voice is finished, detects the end. Power is supplied to the wireless transmission unit 22 only in the start and end sections, and the voice input by the wireless transmission unit 22 is transmitted on a carrier wave.

【０００９】送信された音声は、受信検波し、音声信号
を音声認識させ、発声された音声に沿う機器の制御を行
い、一連の音声認識処理を完了する。The transmitted voice is received and detected, and the voice signal is subjected to voice recognition, the apparatus is controlled in accordance with the uttered voice, and a series of voice recognition processing is completed.

【００１０】また図７は第２の実施の形態を示すもので
あり、図７において、符号２０は発声された音声信号を
電気信号に変換するマイク、２１は入力されたマイク信
号出力から音声の始端及び終端を検出する音声区間検出
部、２２は音声区間検出部が検出した始端及び終端の音
声区間の音声信号を受信系へ無線で送信する無線送信
部、２４は音声アナログ信号をデジタル信号に変換する
A／Ｄコンバータ、２３はA／Ｄコンバータ２４でデジタ
ルに変換された音声信号を記憶するリングメモリ、２５
は記憶された音声デジタル信号をアナログに変換するD
／Aコンバータである。FIG. 7 shows a second embodiment. In FIG. 7, reference numeral 20 denotes a microphone for converting an uttered voice signal into an electric signal, and reference numeral 21 denotes a voice from an input microphone signal output. A voice section detector for detecting the start and end of the voice section 22 is a wireless transmission section for wirelessly transmitting the voice signals of the start and end voice sections detected by the voice section detector to the receiving system, and 24 is a digital signal for converting the audio analog signal to a digital signal. Convert
A / D converter 23 is a ring memory for storing the audio signal converted to digital by A / D converter 24, 25
D converts the stored audio digital signal to analog
/ A converter.

【００１１】以上のように構成された第２の実施例の音
声認識装置について、その動作を説明する。The operation of the speech recognition apparatus of the second embodiment configured as described above will be described.

【００１２】使用者が音声認識を利用して機器を制御す
る場合、まず、図７で構成された音声送信機に向かって
音声認識処理できる単語を発声する。マイク２０に音声
が入力されるとA／Ｄコンバータ２４でデジタル信号に
変換され、リングメモリ２３に順次書き込み記憶され
る。When a user controls a device using voice recognition, first, a word that can be subjected to voice recognition processing is uttered toward the voice transmitter configured in FIG. When sound is input to the microphone 20, it is converted into a digital signal by the A / D converter 24, and is sequentially written and stored in the ring memory 23.

【００１３】また音声区間検出部２１は音声が入力され
た始端を検出し、発声が終了すると今度は終端を検出す
る。この始端と終端の区間だけリングメモリ２３から順
次読み出し、D／Aコンバータ２５でアナログ信号に変換
し、無線送信部２２で入力された音声を搬送波に乗せて
送信する。送信された音声は、受信検波し、音声信号を
音声認識させ、発声された音声に沿う機器の制御を行
い、一連の音声認識処理を完了する。The voice section detection unit 21 detects the start end of the input voice, and detects the end when the utterance ends. Only the start and end sections are sequentially read from the ring memory 23, converted into analog signals by the D / A converter 25, and the voice input by the wireless transmission unit 22 is transmitted on a carrier wave. The transmitted voice is received and detected, the voice signal is subjected to voice recognition, and devices are controlled in accordance with the uttered voice, thereby completing a series of voice recognition processes.

【００１４】[0014]

【発明が解決しようとする課題】しかしながら前記従来
の構成では、音声認識処理が必要な音声区間のみ、無線
送信部２２に電源を供給するので、消費電力の低減には
なるが、実用性に欠ける。However, in the above-described conventional configuration, power is supplied to the radio transmission section 22 only in a voice section requiring voice recognition processing, so that power consumption is reduced, but lacks practicality. .

【００１５】すなわち、第１の実施例では、音声区間検
出部２１において、その始端及び終端検出の方法は、入
力される音声レベルを検出するものであり、不要な会話
雑音、電話の呼び出し音などの生活雑音によっても無線
送信部２２に電源が供給されることになり、マイク感度
は微妙な設定が要求され、また誤動作も多くなる。That is, in the first embodiment, the method of detecting the start and end of the voice section detection section 21 is to detect the input voice level, and unnecessary speech noise, telephone ringing sound, etc. As a result, power is supplied to the wireless transmission unit 22 due to the living noise, and delicate setting of the microphone sensitivity is required, and erroneous operation increases.

【００１６】第２の実施例では音声区間検出部２１はデ
ジタル処理で行う方式だが、アナログ処理の第１の実施
例と同様、マイク感度の微妙な設定、誤動作問題があ
り、またデジタルに変換するA／Dコンバータには常に電
源が必要で、消費電力が増加し、システム構成が複雑に
なる。In the second embodiment, the voice section detection unit 21 performs digital processing. However, similar to the first embodiment of analog processing, there is a problem of delicate setting of microphone sensitivity and malfunction, and conversion to digital is performed. The A / D converter always requires a power supply, which increases power consumption and complicates the system configuration.

【００１７】わざわざ手でスイッチを入れ、発声すると
いう従来一般的な方法より使い勝手は良くなるが、音声
区間検出部のレベル設定、雑音との切り分けなど、実用
上課題が多い。つまり、音声レベルだけで電源供給のオ
ンオフをさせるのは、現実問題として課題が多い。Although the usability is improved as compared with the conventional general method of turning on the switch manually and uttering, there are many practical problems such as setting the level of the voice section detection unit and separating the noise from the noise. That is, turning on / off the power supply only by the audio level has many problems as a practical problem.

【００１８】[0018]

【課題を解決するための手段】前記課題を解決するため
に本発明の音声認識装置は、トークスイッチの押された
区間と、トークスイッチを開放しても数秒間、音声送信
主処理部に電源を供給し続け、語尾切れのない音声信号
を無線で送信することで、電源電池の消費電力の低減を
図り、動作使用時間を拡大し、音声送信を確実に行うこ
とを特徴としたものである。In order to solve the above-mentioned problems, a voice recognition apparatus according to the present invention provides a power supply to a voice transmission main processing unit for a section where a talk switch is pressed and for several seconds even when the talk switch is opened. By continuously transmitting voice signals and transmitting endless audio signals wirelessly, power consumption of the power supply battery is reduced, operation time is extended, and sound transmission is performed reliably. .

【００１９】本発明によれば、音声認識の発声を行う時
は、トークスイッチを押し、周辺の雑音に影響される事
なく、確実に音声信号を無線で送信し、また言い終わる
と同時に切るトークスイッチの操作に対しても、予め設
定された時定数保持部で保持された時間だけ、音声送信
主処理部に電源を供給し続け、語尾切れのない音声信号
を無線で送信することで、電源電池の消費電力の低減を
図り、動作使用時間を拡大し、音声送信を確実に行える
音声認識装置を提供することが可能となる。According to the present invention, when uttering speech recognition, the talk switch is pressed, and the speech signal is transmitted wirelessly without being affected by the surrounding noise. For the operation of, the power source is continuously supplied to the audio transmission main processing unit for the time held in the preset time constant holding unit, and the audio signal without ending is wirelessly transmitted, so that the power supply battery is It is possible to provide a speech recognition device that can reduce power consumption, extend operation use time, and reliably perform speech transmission.

【００２０】[0020]

【発明の実施の形態】本発明の請求項１に記載の音声認
識装置は、トークスイッチの押された区間と、トークス
イッチを開放しても数秒間、音声送信主処理部に電源を
供給し続け、語尾切れのない音声信号を無線で送信する
ことで、電源電池の消費電力の低減を図り、動作使用時
間を拡大し、音声送信を確実に行える音声認識を実現し
うるものである。DESCRIPTION OF THE PREFERRED EMBODIMENTS The voice recognition apparatus according to the first aspect of the present invention supplies power to the voice transmission main processing section for a section where the talk switch is pressed and for several seconds even when the talk switch is released. Continuously, by transmitting a speech signal without end of the word by radio, the power consumption of the power supply battery can be reduced, the operation time can be extended, and voice recognition that can reliably transmit voice can be realized.

【００２１】つぎに、本発明の請求項２に記載された音
声認識装置は、入力された音声信号を電気信号に変換す
るマイクと、前記マイク出力を増幅する増幅器と、前記
増幅器で増幅された音声信号を変調、送信する変調送信
部からなる音声送信主処理部を有すると共に、音声発声
の操作を知らしめるトークスイッチと、前記トークスイ
ッチを押している時間の時定数を保持する時定数保持部
と、前記時定数保持部で設定された時間幅のみ前記音声
送信主処理部に電源を供給する電源制御部を具備し、前
記電源制御部出力を入力とし、電源供給時を表示する表
示部で構成することで、トークスイッチの押された区間
と、トークスイッチを開放しても数秒間、音声送信主処
理部に電源を供給し続け、語尾切れのない音声信号を無
線で送信し、電源電池の消費電力の低減を図り、動作使
用時間を拡大し、音声送信を確実に行える音声認識を実
現しうるものである。Next, according to a second aspect of the present invention, there is provided a voice recognition device for converting an input voice signal into an electrical signal, an amplifier for amplifying the microphone output, and an amplifier amplified by the amplifier. A voice signal is modulated and has a voice transmission main processing unit including a modulation transmission unit that transmits a voice signal, a talk switch that informs an operation of voice utterance, and a time constant holding unit that holds a time constant of a time when the talk switch is pressed. A power control unit that supplies power to the main voice transmission processing unit only for a time width set by the time constant holding unit, and a display unit that receives power from the power control unit and displays power supply time. By doing so, the power is continuously supplied to the voice transmission main processing unit for a few seconds even when the talk switch is pressed and the talk switch is opened, and a sound signal without any ending is wirelessly transmitted. Achieving a reduction in power consumption of the pond, to expand the operation using time, those capable of realizing a voice recognition can be reliably performed voice transmission.

【００２２】つぎに、本発明の請求項３に記載された音
声認識装置は、入力された音声信号を電気信号に変換す
るマイクと、前記マイク出力を増幅する増幅器と、前記
増幅器で増幅された音声信号を変調、送信する変調送信
部と、前記増幅器出力を入力とし、音声入力レベルを検
出するレベル検出器からなる音声送信主処理部を有する
と共に、音声発声の操作を知らしめるトークスイッチ
と、前記トークスイッチと前記レベル検出器のORをとる
OR回路と、前記トークスイッチを押している時間と前記
レベル検出器で検出した音声区間のORをとったOR回路出
力パルスの時定数を保持する時定数保持部と、前記時定
数保持部で設定された時間幅のみ前記音声送信主処理部
に電源を供給する電源制御部を具備し、前記電源制御部
出力を入力とし、電源供給時を表示する表示部で構成す
ることで、トークスイッチの押された区間と、トークス
イッチを開放しても発声が終了するまでの数秒間、音声
送信主処理部に電源を供給し続け、語尾切れのない音声
信号を無線で送信し、電源電池の消費電力の低減を図
り、動作使用時間を拡大し、音声送信を確実に行える音
声認識を実現しうるものである。Next, a voice recognition device according to a third aspect of the present invention provides a microphone for converting an input voice signal into an electric signal, an amplifier for amplifying the microphone output, and an amplifier amplified by the amplifier. A modulation transmission unit that modulates and transmits an audio signal, and has an audio output main processing unit including a level detector that receives the amplifier output and detects an audio input level, and a talk switch that informs an operation of audio utterance, OR the talk switch and the level detector
An OR circuit, a time constant holding unit that holds a time constant of an OR circuit output pulse obtained by ORing a time during which the talk switch is pressed and a voice section detected by the level detector, and a time constant holding unit that sets the time constant. A power control unit for supplying power to the voice transmission main processing unit only for the time width set as input, the output of the power control unit being input, and a display unit for displaying when power is supplied, the talk switch is pressed. Power is continuously supplied to the voice transmission main processing unit for several seconds until the utterance ends even if the talk switch is released, and a voice signal without any ending is wirelessly transmitted, and the power consumption of the power battery is reduced. It is possible to realize voice recognition that can reduce the time, extend the operation time, and reliably transmit voice.

【００２３】以下本発明の実施の形態について、図１か
ら図５を用いて説明する。An embodiment of the present invention will be described below with reference to FIGS.

【００２４】（実施の形態１）以下に、本発明の請求項
１及び請求項２に記載された発明の実施の形態につい
て、図１、図２、図３を用いて説明する。(Embodiment 1) An embodiment of the present invention described in claims 1 and 2 of the present invention will be described below with reference to FIGS. 1, 2 and 3. FIG.

【００２５】図１は、本発明の一実施例における音声認
識装置の音声送信機のブロック構成図を示す。図１にお
いて、符号7は入力された音声信号を電気信号に変換す
るマイク、8は前記マイク出力を増幅する増幅器、9は前
記増幅器で増幅された音声信号を変調、送信する変調送
信部である。FIG. 1 is a block diagram showing a voice transmitter of a voice recognition apparatus according to one embodiment of the present invention. In FIG. 1, reference numeral 7 denotes a microphone that converts an input audio signal into an electric signal, 8 denotes an amplifier that amplifies the microphone output, and 9 denotes a modulation transmission unit that modulates and transmits the audio signal amplified by the amplifier. .

【００２６】また、１はマイク７、増幅器８、変調送信
部９からなる音声送信主処理部、２は音声発声の操作を
知らしめるトークスイッチ、３は前記トークスイッチ２
を押してる時間の時定数を保持する時定数保持部、４は
前記時定数保持部で設定された時間幅のみ前記音声送信
主処理部１に電源を供給する電源制御部、６は前記電源
制御部４で電源供給される時を表示する表示部、５は音
声送信機全体に電源を供給する電池部である。Reference numeral 1 denotes an audio transmission main processing unit including a microphone 7, an amplifier 8, and a modulation transmission unit 9. Reference numeral 2 denotes a talk switch for notifying an operation of audio utterance.
A time constant holding unit for holding a time constant of a pressing time, a power control unit 4 for supplying power to the voice transmission main processing unit 1 only for a time width set by the time constant holding unit, and a power control unit 6. A display unit 5 that indicates when power is supplied by the unit 4 is a battery unit that supplies power to the entire audio transmitter.

【００２７】以上のように構成された音声認識装置につ
いて、その動作を説明する。音声認識でＴＶやＶＴＲな
どの機器を動作制御する場合、図１に示す音声送信機が
使用者の近くにあり、音声信号のみを送信した方が、音
声認識全体をユニット化するより、電源の消費電力は削
減できる。The operation of the speech recognition apparatus configured as described above will be described. When controlling the operation of a device such as a TV or a VTR by voice recognition, the voice transmitter shown in FIG. 1 is close to the user, and transmitting only the voice signal is more effective than the unitization of the entire voice recognition. Power consumption can be reduced.

【００２８】使用者はまず、この音声送信機のトークス
イッチ２を押し続けて、音声認識に登録されている単語
を発声する。このトークスイッチ２を押すと時定数保持
部３はＨパルスを出力し、時定数保持部３の出力がＨの
区間、電源制御部４は電池部５から供給される電池電源
を、マイク７、増幅器８、変調送信部９で構成される音
声送信主処理部１に電源を供給する。First, the user keeps pressing the talk switch 2 of the voice transmitter to utter a word registered for voice recognition. When the talk switch 2 is pressed, the time constant holding unit 3 outputs an H pulse, and the output of the time constant holding unit 3 is in a period of H, and the power supply control unit 4 supplies the battery power supplied from the battery unit 5 to the microphone 7, Power is supplied to the main audio transmission processing unit 1 including the amplifier 8 and the modulation transmission unit 9.

【００２９】電源を供給された音声主処理部１は、マイ
ク７で発声者の音声を電気信号に変換し、増幅器８でそ
のレベルを増幅し、変調送信部９で発声音声を無線送信
する。送信された音声は、ＴＶやＶＴＲなどの近くに設
置または内蔵された受信機で復調し、音声認識処理され
て発声された音声に沿う機器の制御を行い、一連の音声
認識処理が完了する。The voice main processing unit 1 supplied with power converts the voice of the speaker into an electric signal by the microphone 7, amplifies the level by the amplifier 8, and wirelessly transmits the voice by the modulation transmitting unit 9. The transmitted voice is demodulated by a receiver installed or built in the vicinity of a TV or VTR, and is subjected to voice recognition processing to control devices according to the uttered voice, thereby completing a series of voice recognition processing.

【００３０】使用者は発声が終わるとトークスイッチ２
を開放し、開放されたことを確認した時定数保持部３は
数秒後にＨパルスからＬパルスを電源制御部４に入力
し、電源制御部４は電池部５からの電源を、音声送信主
処理部１に供給することを停止する。表示部６はこの電
源供給時を表示する。変調送信部９は通常、ＦＭ変調な
どが一般的に利用され、また送信媒体としては、赤外
線、微少電波などが考えられる。When the user has finished speaking, the talk switch 2
After a few seconds, the time constant holding unit 3 inputs the H-pulse to the L-pulse to the power control unit 4, and the power control unit 4 supplies the power from the battery unit 5 to the voice transmission main process. The supply to the unit 1 is stopped. The display unit 6 displays the power supply time. The modulation transmission unit 9 generally uses FM modulation or the like, and the transmission medium may be an infrared ray, a minute radio wave, or the like.

【００３１】トークスイッチ２は発声する単語が完全に
終了するまで押し続けるのであれば、問題は発生しない
が、通常、人間の動作として、発声完了と同時にトーク
スイッチ２を開放することが多々ある。If the talk switch 2 is kept depressed until the word to be uttered is completely finished, no problem occurs, but usually, as a human operation, the talk switch 2 is often released at the same time as the completion of the utterance.

【００３２】図２は図１の各部の動作波形を示すタイム
チャートであり、時間t0で発声するためのトークスイッ
チ２を押し、単語例「イッチャンネル」を発声し、時間
t1で発声完了として、トークスイッチ２を開放する。何
も対策してなければ、すぐに電源制御部４からの電源が
供給されなくなり、最後の発声語の語尾が切れることに
なるが、時定数保持部３でＬパルスになるのに数秒の時
間遅れを発生させ、電源制御部４から供給される電源を
その間、時間拡大する。FIG. 2 is a time chart showing the operation waveforms of the respective parts in FIG. 1. The talk switch 2 for uttering at time t0 is pressed, and the word example "I-Channel" is uttered.
At t1, the utterance is completed, and the talk switch 2 is opened. If no countermeasures are taken, the power supply from the power control unit 4 is not immediately supplied, and the ending of the last uttered word is cut off. A delay is generated, and the power supplied from the power supply control unit 4 is expanded during that time.

【００３３】図３は図１の時定数保持部３、電源制御部
４の具体的回路例であり、トークスイッチ２を押すとRC
の時定数で積分されてトランジスタＱ１のベース電圧が
Ｈとなり、Ｑ１が導通し、トランジスタＱ２のベース電
圧がＬとなり、Ｑ２も導通状態となり電源電池の電圧が
Ｑ２のコレクタに流れる。FIG. 3 is a specific circuit example of the time constant holding unit 3 and the power supply control unit 4 in FIG.
, The base voltage of the transistor Q1 becomes H, the transistor Q1 becomes conductive, the base voltage of the transistor Q2 becomes L, the transistor Q2 becomes conductive, and the voltage of the power supply battery flows to the collector of the transistor Q2.

【００３４】発声が終わってトークスイッチ２を開放す
ると、ＲＣの時定数により積分され、Ｑ１のベース電圧
がＣに蓄えられた電位が自然放電されるまでＨとなり、
その後Ｌになるので、開放より遅れてＱ１が非導通とな
り、Ｑ２も遅れて電源供給を停止する。このように簡単
な回路で時定数保持、電源制御が可能である。When the talk switch 2 is opened after the utterance ends, integration is performed by the time constant of RC, and the base voltage of Q1 becomes H until the potential stored in C is spontaneously discharged,
After that, since it becomes L, Q1 becomes non-conductive later than opening, and Q2 also stops power supply later. Thus, the time constant can be maintained and the power supply can be controlled with a simple circuit.

【００３５】つまり、常に音声送信機全体に電源を供給
する必要はなく、トークスイッチの押された区間と、ト
ークスイッチを開放しても数秒間、音声送信主処理部に
電源を供給し続けることで、語尾切れのない音声信号を
無線で送信し、電源電池の消費電力の低減を図り、動作
使用時間を拡大し、音声送信を確実に行える音声認識を
実現できる。That is, it is not necessary to always supply power to the entire voice transmitter, and it is necessary to supply power to the voice transmission main processing unit for a few seconds even when the talk switch is pressed and when the talk switch is opened. Thus, it is possible to wirelessly transmit a speech signal without a suffix, reduce power consumption of a power supply battery, extend operation use time, and realize speech recognition that can reliably perform speech transmission.

【００３６】（実施の形態２）つぎに、本発明の請求項
３に記載された発明の実施の形態について、図４、図５
を用いて説明する。(Embodiment 2) Next, an embodiment of the invention described in claim 3 of the present invention will be described with reference to FIGS.
This will be described with reference to FIG.

【００３７】図４は、本発明の一実施例における音声認
識装置の音声送信機のブロック構成図を示す。図４にお
いて、符号7は入力された音声信号を電気信号に変換す
るマイク、8は前記マイク出力を増幅する増幅器、9は前
記増幅器で増幅された音声信号を変調、送信する変調送
信部、１２は前記増幅器で増幅された音声信号レベルを
検出し、レベル検出時はＨパルスを出力するレベル検出
部である。FIG. 4 is a block diagram showing a voice transmitter of the voice recognition apparatus according to one embodiment of the present invention. 4, reference numeral 7 denotes a microphone that converts an input audio signal into an electric signal, 8 denotes an amplifier that amplifies the microphone output, 9 denotes a modulation transmission unit that modulates and transmits the audio signal amplified by the amplifier, 12 Is a level detection unit that detects the level of the audio signal amplified by the amplifier and outputs an H pulse when the level is detected.

【００３８】また、１３はマイク７、増幅器８、変調送
信部９、レベル検出部１２からなる音声送信主処理部、
２は音声発声の操作を知らしめるトークスイッチ、３は
前記トークスイッチ２を押してる時間の時定数を保持す
る時定数保持部、４は前記時定数保持部で設定された時
間幅のみ前記音声送信主処理部１３に電源を供給する電
源制御部、６は前記電源制御部４で電源供給される時を
表示する表示部、５は音声送信機全体に電源を供給する
電池部、１０、１１はダイオードで構成されたトークス
イッチ２とレベル検出部１２のOＲをとるOＲ回路であ
る。Reference numeral 13 denotes an audio transmission main processing unit including the microphone 7, the amplifier 8, the modulation transmission unit 9, and the level detection unit 12,
Reference numeral 2 denotes a talk switch for notifying an operation of voice utterance. Reference numeral 3 denotes a time constant holding unit for holding a time constant of a time when the talk switch 2 is pressed. Reference numeral 4 denotes the voice transmission only for a time width set by the time constant holding unit. A power control unit for supplying power to the main processing unit 13, a display unit 6 for displaying when power is supplied by the power control unit 4, a battery unit 5 for supplying power to the entire audio transmitter, and 10, 11 This is an OR circuit that takes the OR of the talk switch 2 and the level detection unit 12 which are configured by diodes.

【００３９】以上のように構成された音声認識装置につ
いて、その動作を説明する。使用者はまず、この音声送
信機のトークスイッチ２を押し続けて、音声認識に登録
されている単語を発声する。The operation of the speech recognition apparatus configured as described above will be described. First, the user keeps pressing the talk switch 2 of the voice transmitter to utter a word registered for voice recognition.

【００４０】このトークスイッチ２を押すと時定数保持
部３はＨパルスを出力し、時定数保持部３がＨの区間、
電源制御部４は電池部５から供給される電池電源を、マ
イク７、増幅器８、変調送信部９、レベル検出部１２で
構成される音声送信主処理部１３に電源を供給する。電
源を供給された音声送信主処理部１３は、マイク７で発
声者の音声を電気信号に変換し、増幅器８でそのレベル
を増幅し、変調送信部９で発声音声を無線送信する。When the talk switch 2 is pressed, the time constant holding unit 3 outputs an H pulse.
The power control unit 4 supplies the battery power supplied from the battery unit 5 to the audio transmission main processing unit 13 including the microphone 7, the amplifier 8, the modulation transmission unit 9, and the level detection unit 12. The voice transmission main processing unit 13 supplied with power converts the voice of the speaker into an electric signal with the microphone 7, amplifies the level with the amplifier 8, and wirelessly transmits the voice with the modulation transmission unit 9.

【００４１】送信された音声は、ＴＶやＶＴＲなどの近
くに設置または内蔵された受信機で復調し、音声認識処
理されて発声された音声に沿う機器の制御を行い、一連
の音声認識処理が完了する。使用者は発声が終わるとト
ークスイッチ２を開放し、開放されたことを確認した時
定数保持部３は数秒後にＨパルスからＬパルスを電源制
御部４に入力し、電源制御部４は電池部５からの電源
を、音声送信主処理部１３に供給することを停止する。The transmitted voice is demodulated by a receiver installed or built in the vicinity of a TV or VTR, and subjected to voice recognition processing to control equipment in accordance with the uttered voice, and a series of voice recognition processing is performed. Complete. The user releases the talk switch 2 when the utterance ends, and after confirming that the talk switch 2 is released, the time constant holding unit 3 inputs an H pulse to an L pulse to the power control unit 4 after a few seconds, and the power control unit 4 controls the battery unit. The supply of the power from the power supply 5 to the audio transmission main processing unit 13 is stopped.

【００４２】トークスイッチ２は発声する単語が完全に
終了するまで押し続けるのであれば、問題は発生しない
が、通常人間の動作として、発声完了と同時にトークス
イッチ２を開放することが多々ある。図５は図４の各部
の動作波形を示すタイムチャートであり、時間t0で発声
するためのトークスイッチ２を押し、単語例「イッチャ
ンネル」を発声し、時間t1で発声完了として、トークス
イッチ２を開放する。If the talk switch 2 is kept depressed until the word to be uttered is completely finished, there is no problem. However, as a human operation, the talk switch 2 is often released simultaneously with the completion of the utterance. FIG. 5 is a time chart showing the operation waveforms of the respective parts of FIG. 4. The talk switch 2 for uttering at time t0 is pressed, the word example "I-Channel" is uttered, and the utterance is completed at time t1. To release.

【００４３】トークスイッチ２を開放しても、発声が完
全に終了してない場合、レベル検出部１２から音声信号
検出のＨパルスが出力され続けており、完全に終了した
時間t2でレベル検出部１２の出力パルスはＬになる。If the utterance is not completely terminated even after the talk switch 2 is opened, the H pulse for detecting the audio signal is continuously output from the level detection unit 12, and the level detection unit is output at time t2 when the speech signal is completely terminated. Twelve output pulses become L.

【００４４】このＬパルスを入力した時定数保持部３
は、Ｌパルスになるのに数秒の時間遅れを発生させ、電
源制御部４から供給される電源をその分、時間拡大す
る。何も対策してなければ、すぐに電源制御部４から電
源が供給されなくなり、最後の発声語の語尾が切れるこ
とになるが、語尾が完全に終了したことでもって電源供
給を停止する。The time constant holding unit 3 receiving the L pulse
Causes a time delay of several seconds to become an L pulse, and expands the power supplied from the power supply control unit 4 by that amount. If no countermeasures are taken, power is not immediately supplied from the power control unit 4 and the ending of the last uttered word is cut off, but the power supply is stopped when the ending is completely completed.

【００４５】つまり、常に音声送信機全体に電源を供給
する必要はなく、トークスイッチの押された区間と、ト
ークスイッチを開放しても音声入力の有無を検出し、そ
の検出区間と時定数保持部３で設定した数秒間、音声送
信主処理部に電源を供給し続けることで、語尾切れのな
い音声信号を無線で送信し、電源電池の消費電力の低減
を図り、動作使用時間を拡大し、音声送信を確実に行え
る音声認識を実現できる。In other words, it is not necessary to always supply power to the entire voice transmitter. The section where the talk switch is pressed and the presence or absence of voice input even when the talk switch is opened are detected, and the detected section and the time constant are held. By continuing to supply power to the audio transmission main processing unit for several seconds set in the unit 3, the endless audio signal is transmitted wirelessly, the power consumption of the power supply battery is reduced, and the operation time is extended. In addition, it is possible to realize voice recognition that can reliably perform voice transmission.

【００４６】また実施の形態１では、トークスイッチの
開放を判断して、開放後、時定数保持部で設定した時間
だけ電源供給を持続させている為、時定数設定にはかな
りの余裕が必要となる欠点があるが、実施の形態２で
は、一度電源供給が始まれば、音声レベルが無くなるま
で電源供給が持続され、より確実な語尾対策となる。い
ずれにしても、常に音声送信機全体に電源を供給する必
要はなく、トークスイッチの押された区間と、トークス
イッチを開放して、発声が終了するまでの数秒間、音声
送信主処理部に電源を供給し続けることで、語尾切れの
ない音声信号を無線で送信し、電源電池の消費電力の低
減を図り、動作使用時間を拡大し、音声送信を確実に行
える音声認識を実現できる。In the first embodiment, since the power supply is maintained for the time set in the time constant holding unit after the release of the talk switch is determined after opening, a considerable margin is required for setting the time constant. However, in the second embodiment, once the power supply is started, the power supply is continued until the sound level disappears, and a more reliable ending countermeasure is provided. In any case, it is not necessary to always supply power to the entire voice transmitter, and the voice transmission main processing unit performs for a few seconds until the talk switch is pressed and the talk switch is released and the utterance ends. By continuously supplying power, it is possible to wirelessly transmit a sound signal without a suffix, reduce power consumption of a power supply battery, extend operation use time, and realize voice recognition capable of reliably transmitting voice.

【００４７】[0047]

【発明の効果】以上のように、本発明の音声認識装置
は、トークスイッチの押された区間と、トークスイッチ
を開放しても数秒間、音声送信主処理部に電源を供給し
続け、語尾切れのない音声信号を無線で送信すること
で、電源電池の消費電力の低減を図り、動作使用時間を
拡大し、音声送信を確実に行える音声認識装置を提供す
ることが可能となる。As described above, the voice recognition apparatus of the present invention continues to supply power to the voice transmission main processing section for several seconds even when the talk switch is depressed and when the talk switch is opened, By transmitting an uninterrupted voice signal wirelessly, it is possible to provide a voice recognition device capable of reducing power consumption of a power supply battery, extending operation use time, and reliably transmitting voice.

[Brief description of the drawings]

【図１】本発明の第１の実施の形態における音声認識装
置の音声送信機のブロック構成図FIG. 1 is a block diagram of a voice transmitter of a voice recognition device according to a first embodiment of the present invention.

【図２】図１の各部の動作波形を示す図FIG. 2 is a diagram showing operation waveforms of each unit in FIG. 1;

【図３】図１の時定数保持部、電源制御部の回路例を示
す図FIG. 3 is a diagram illustrating a circuit example of a time constant holding unit and a power supply control unit in FIG. 1;

【図４】本発明の第２の実施の形態における音声認識装
置の音声送信機のブロック構成図FIG. 4 is a block diagram of a voice transmitter of a voice recognition device according to a second embodiment of the present invention.

【図５】図４の各部の動作波形を示す図FIG. 5 is a diagram showing operation waveforms of each unit in FIG. 4;

【図６】従来の第１実施例の音声認識装置の音声送信機
のブロック構成図FIG. 6 is a block diagram of a voice transmitter of the voice recognition device according to the first conventional example.

【図７】従来の第２実施例の音声認識装置の音声送信機
のブロック構成図FIG. 7 is a block diagram of a speech transmitter of a speech recognition apparatus according to a second conventional example.

[Explanation of symbols]

１音声送信主処理部２トークスイッチ３時定数保持部４電源制御部５電池部６表示部７マイク８増幅器９変調送信部１０ダイオード１１ダイオード１２レベル検出部１３音声送信主処理部２０マイク２１音声区間検出部２２無線送信部２３リングメモリー２４ A/D変換器２５ D/A変換器 Reference Signs List 1 voice transmission main processing unit 2 talk switch 3 time constant holding unit 4 power supply control unit 5 battery unit 6 display unit 7 microphone 8 amplifier 9 modulation transmission unit 10 diode 11 diode 12 level detection unit 13 voice transmission main processing unit 20 microphone 21 voice Section detection unit 22 Wireless transmission unit 23 Ring memory 24 A / D converter 25 D / A converter

Claims

[Claims]

1. By continuously supplying power to a voice transmission main processing unit for a section in which a talk switch is pressed and for several seconds even when the talk switch is opened, and transmitting a sound signal without end of a word by radio, A speech recognition device that reduces the power consumption of the power battery, extends the operating time, and reliably transmits speech.

2. An audio transmission main process comprising a microphone for converting an input audio signal into an electric signal, an amplifier for amplifying the microphone output, and a modulation transmission unit for modulating and transmitting the audio signal amplified by the amplifier. A talk switch for notifying an operation of voice utterance, a time constant holding unit for holding a time constant of a time when the talk switch is pressed, and the voice transmission only for a time width set by the time constant holding unit. A speech recognition apparatus, comprising: a power supply control unit that supplies power to a main processing unit; and a display unit that receives an output of the power supply control unit and displays power supply time.

3. A microphone for converting an input audio signal into an electric signal, an amplifier for amplifying the microphone output, a modulation transmitting unit for modulating and transmitting the audio signal amplified by the amplifier, and It has an audio transmission main processing unit comprising a level detector for detecting an audio input level as an input, and a talk switch for notifying an operation of voice utterance, and an OR of the talk switch and the level detector is taken.
An OR circuit, a time constant holding unit that holds a time constant of an OR circuit output pulse obtained by ORing a time during which the talk switch is pressed and a voice section detected by the level detector, and a time constant holding unit that sets the time constant. A power control unit for supplying power to the voice transmission main processing unit only for a specified duration, and a display unit for receiving power from the power control unit and displaying when power is supplied. .