JPH11282477A

JPH11282477A - Music playing device

Info

Publication number: JPH11282477A
Application number: JP10079519A
Authority: JP
Inventors: Shoji Kuriki; 章次栗木
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 1998-03-26
Filing date: 1998-03-26
Publication date: 1999-10-15

Abstract

PROBLEM TO BE SOLVED: To share a generally used voice input means for voice recognition and for singing. SOLUTION: A control part 7 identifies the operation modes of plural stages such as the inter-song mode of not reproducing music data and an introduction mode relating to the time of reproduction, etc., and switches the output destination of voice signals inputted from a microphone to a microphone signal switching part 3 to one of a mixer part 6 and a voice recognition part 8 corresponding to the identified operation mode.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、カラオケ装置，
通信カラオケ装置等の音楽演奏装置に関する。The present invention relates to a karaoke apparatus,
The present invention relates to a music performance device such as a communication karaoke device.

【０００２】[0002]

【従来の技術】近年、キーボードやマウス等の入力装置
を使用しないコンピュータの入力手段として、音声認識
技術を用いて各種の操作指示を音声入力できるようにし
たものが一般的であるが、カラオケ音楽を再生するカラ
オケ装置，通信カラオケ装置等の音楽演奏装置に音声認
識技術を用いた入力手段を採用すれば、カラオケ曲の選
曲等の操作がし易くなり、利用者のユーザインタフェー
スを向上させることができる。2. Description of the Related Art In recent years, as an input means of a computer which does not use an input device such as a keyboard or a mouse, it is general to use a voice recognition technique to input various operation instructions by voice. If an input means using voice recognition technology is adopted in a music performance device such as a karaoke device for reproducing music, a communication karaoke device, etc., operations such as selection of karaoke songs can be easily performed, and a user interface of a user can be improved. it can.

【０００３】そこで、従来は歌唱用のマイクと選曲用の
マイクを備え、選曲用のマイクから入力された音声から
リクエスト曲を認識するようにしたカラオケ装置（例え
ば、特開平６−１６１４８２号公報参照）や、音声認識
機能モードとカラオケ機能モードの切替スイッチを設け
た専用のマイクを使用し、音声認識機能モードに切り替
えられた後に集音した音声によるリクエスト曲や音量調
整等を認識するようにしたカラオケ装置（例えば、特開
平６−８９０９６号公報参照）が提案されている。Therefore, a karaoke apparatus conventionally provided with a microphone for singing and a microphone for selecting music, and recognizing a requested music from voice input from the microphone for music selection (for example, see Japanese Patent Application Laid-Open No. Hei 6-161482). ) And a dedicated microphone provided with a switch for switching between the voice recognition function mode and the karaoke function mode, and recognizes the requested song or volume adjustment by the collected voice after switching to the voice recognition function mode. A karaoke apparatus (for example, see Japanese Patent Application Laid-Open No. 6-89096) has been proposed.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、上述し
た前者のようなカラオケ装置では、使用者が歌唱時と選
曲時とでマイクを持ち代える必要があって不便であると
いう問題があった。また、後者のようなカラオケ装置で
は、音声認識機能モードとカラオケ機能モードの切替ス
イッチを設けた専用のマイクが必要であり、マイク毎に
異なる音質の中から好みのものを使用することが多い使
用者にとって、普段使用している愛用のマイクを使用で
きないという問題があった。とくに、一般的なワイヤレ
スマイクを使用できないので、カラオケの歌唱時に不便
を感じてしまう。However, the above-mentioned karaoke apparatus has a problem that the user needs to change the microphone between singing and selecting a tune, which is inconvenient. Also, in the latter karaoke apparatus, a dedicated microphone provided with a switch for switching between the voice recognition function mode and the karaoke function mode is required, and in many cases, a preferred one from among different sound qualities is used for each microphone. People could not use their favorite microphones. In particular, since a general wireless microphone cannot be used, it is inconvenient when singing karaoke.

【０００５】さらに、音声入力による指示内容には、選
曲の他に音量調整やテンポ調整等のコマンド制御指示も
含まれており、このように認識対象の内容が多岐に渡っ
ていると認識率の低下を招くという問題もあった。[0005] Furthermore, the instruction content by voice input includes, in addition to music selection, command control instructions such as volume adjustment and tempo adjustment. There was also a problem of causing a decrease.

【０００６】この発明は上記の点に鑑みてなされたもの
であり、一般に使用される音声入力手段を音声認識用と
歌唱用に共用できるようにすることを目的とする。さら
に、使用者からの音声による指示内容の認識精度を向上
させることも目的とする。The present invention has been made in view of the above points, and has as its object to enable commonly used voice input means to be shared for voice recognition and singing. It is another object of the present invention to improve the accuracy of recognizing the contents of an instruction given by a voice from a user.

【０００７】[0007]

【課題を解決するための手段】この発明は上記の目的を
達成するため、音楽データに基づいて音楽信号を再生す
る音楽再生手段と、音声信号を入力する音声入力手段
と、上記音楽再生手段によって再生された音楽信号によ
る音楽と上記音声入力手段によって入力された音声信号
による音声とを合わせて出力する音楽・音声出力手段
と、上記音声入力手段によって入力された音声信号に基
づく音声の指示内容を認識する音声認識手段と、その手
段によって認識された指示内容に対応する処理を実行す
る手段と、上記音楽データを再生していないときと再生
時にかかわる複数段階の動作モードを識別する動作モー
ド識別手段と、その手段によって識別された動作モード
に応じて上記音声入力手段によって入力される音声信号
の出力先を前記音楽・音声出力手段と音声認識手段のい
ずれかに切り替える音声出力先切替手段を備えた音楽演
奏装置を提供する。In order to achieve the above object, the present invention provides a music reproducing means for reproducing a music signal based on music data, a voice input means for inputting an audio signal, and the music reproducing means. Music / speech output means for outputting the music of the reproduced music signal together with the sound of the sound signal input by the sound input means; and a sound instruction content based on the sound signal input by the sound input means. Speech recognition means for recognizing, means for executing processing corresponding to the instruction content recognized by the means, and operation mode identification means for identifying a plurality of stages of operation modes when the music data is not reproduced and when the music data is reproduced And the output destination of the audio signal input by the audio input means in accordance with the operation mode identified by the means. Providing music performance apparatus having a sound output destination switching means for switching to either the voice output unit and the voice recognition means.

【０００８】さらに、上記音声出力先切替手段による音
声信号の出力先を知らせる情報を出力する手段を設ける
となおよい。Further, it is more preferable to provide a means for outputting information notifying the output destination of the audio signal by the audio output destination switching means.

【０００９】また、音楽データに基づいて音楽信号を再
生する音楽再生手段と、音声信号を入力する音声入力手
段と、上記音楽再生手段によって再生された音楽信号に
よる音楽と上記音声入力手段によって入力された音声信
号による音声とを合わせて出力する音楽・音声出力手段
と、複数種類の音声認識用辞書を記憶した音声認識用辞
書記憶手段と、その手段に記憶された各音声認識用辞書
を参照して上記音声入力手段によって入力された音声信
号に基づく音声の指示内容を認識する音声認識手段と、
その手段によって認識された指示内容に対応する処理を
実行する手段と、上記音楽データを再生していないとき
と再生時にかかわる複数段階の動作モードを識別する動
作モード識別手段と、その手段によって識別された動作
モードに応じて上記音声認識手段が参照する上記音声認
識用辞書記憶手段の音声認識用辞書の種類を切り替える
音声認識用辞書切替手段を備えた音楽演奏装置にすると
よい。Also, a music reproducing means for reproducing a music signal based on music data, a voice input means for inputting an audio signal, a music based on the music signal reproduced by the music reproducing means, and a music input by the voice input means. Music / speech output means for outputting the combined speech with the speech signal, speech recognition dictionary storage means for storing a plurality of types of speech recognition dictionaries, and the respective speech recognition dictionaries stored in the means. Voice recognition means for recognizing a voice instruction content based on a voice signal input by the voice input means,
Means for executing a process corresponding to the instruction content recognized by the means, operation mode identification means for identifying a plurality of stages of operation modes when the music data is not reproduced and when the music data is reproduced, and operation mode identification means for identifying the music data. The music performance device may include a voice recognition dictionary switching unit that switches the type of the voice recognition dictionary in the voice recognition dictionary storage unit referred to by the voice recognition unit according to the operation mode.

【００１０】さらに、上記音声認識用辞書切替手段によ
って切り替えた音声認識用辞書の種類を知らせる情報を
出力する手段を設けるとなおよい。Further, it is more preferable to provide a means for outputting information notifying the type of the speech recognition dictionary switched by the speech recognition dictionary switching means.

【００１１】また、音楽データに基づいて音楽信号を再
生する音楽再生手段と、音声信号を入力する音声入力手
段と、上記音楽再生手段によって再生された音楽信号に
よる音楽と上記音声入力手段によって入力された音声信
号による音声とを合わせて出力する音楽・音声出力手段
と、複数種類の音声認識用辞書を記憶した音声認識用辞
書記憶手段と、その手段に記憶された各音声認識用辞書
を参照して上記音声入力手段によって入力された音声信
号に基づく音声の指示内容を認識する音声認識手段と、
その手段によって認識された指示内容に対応する処理を
実行する手段と、上記音楽データを再生していないとき
と再生時にかかわる複数段階の動作モードを識別する動
作モード識別手段と、その手段によって識別された動作
モードに応じて上記音声入力手段によって入力される音
声信号の出力先を上記音楽・音声出力手段と音声認識手
段のいずれかに切り替える音声出力先切替手段と、上記
動作モード識別手段によって識別された動作モードに応
じて上記音声認識手段が参照する上記音声認識用辞書記
憶手段の音声認識用辞書の種類を切り替える音声認識用
辞書切替手段を備えた音楽演奏装置にするとよい。[0011] Also, a music reproducing means for reproducing a music signal based on music data, a voice input means for inputting an audio signal, a music based on the music signal reproduced by the music reproducing means, and a music input by the voice input means Music / speech output means for outputting the combined speech with the speech signal, speech recognition dictionary storage means for storing a plurality of types of speech recognition dictionaries, and the respective speech recognition dictionaries stored in the means. Voice recognition means for recognizing a voice instruction content based on a voice signal input by the voice input means,
Means for executing a process corresponding to the instruction content recognized by the means, operation mode identification means for identifying a plurality of stages of operation modes involved when the music data is not reproduced and when the music data is reproduced, and operation mode identification means for identifying the music data. Voice output destination switching means for switching an output destination of a voice signal input by the voice input means to one of the music / voice output means and voice recognition means in accordance with the operation mode, and the operation mode identification means. The music performance device may include a voice recognition dictionary switching unit that switches the type of the voice recognition dictionary in the voice recognition dictionary storage unit referred to by the voice recognition unit according to the operation mode.

【００１２】さらに、上記音声出力先切替手段による音
声信号の出力先を知らせる情報と、上記音声認識用辞書
切替手段によって切り替えた音声認識用辞書の種類を知
らせる情報とを出力する手段を設けるとなおよい。Further, a means for outputting information notifying the output destination of the voice signal by the voice output destination switching means and information notifying the type of the voice recognition dictionary switched by the voice recognition dictionary switching means is provided. Good.

【００１３】この発明の請求項１の音楽演奏装置は、音
楽の曲間，前奏，曲中，中奏，及び後奏等の各種の動作
モードを識別し、その動作モードに応じて音声入力手段
から入力される音声信号の出力先を音声認識手段と音楽
・音声出力手段のいずれかに切り替えるので、カラオケ
等の音楽を再生していないときと、前奏，中奏，及び後
奏時には選曲，音量，テンポ等の調整指示の音声のため
の音声入力を可能にし、音楽の曲中のときには歌唱音声
入力を可能にすることができる。According to a first aspect of the present invention, there is provided a music performance device for identifying various operation modes such as inter-music, prelude, mid-music, middle and last, and voice input means according to the operation mode. The output destination of the audio signal input from the device is switched to either the voice recognition means or the music / voice output means, so that when music such as karaoke is not being reproduced, and when the prelude, middle, and subsequent performances are selected, the volume and volume are selected. , A tempo, etc., and a singing voice can be input during a music song.

【００１４】したがって、音声入力手段として一般的な
マイクを音声認識用と歌唱用に共用することができ、使
用者は選曲と歌唱と歌唱中の音量及びテンポの調整をマ
イクの交換等の煩雑な作業を行なわなくても行なうこと
ができ、快適な歌唱環境を得ることができる。Therefore, a general microphone can be used as a voice input means for voice recognition and singing, and the user can select a tune, sing and adjust the volume and tempo during singing by complicated procedures such as changing microphones. It can be performed without performing work, and a comfortable singing environment can be obtained.

【００１５】また、この発明の請求項２の音楽演奏装置
は、音声信号の出力先が音楽・音声出力手段と音声認識
手段のいずれであるかの情報を表示するので、使用者に
対して音声入力手段から選曲や音量，テンポ等の調整指
示のための音声入力が可能なときと、歌唱音声の入力が
可能なときを容易に判断させることができる。したがっ
て、使用者は選曲と歌唱と歌唱の合間の音量やテンポの
調整をタイミング良く行なうことができ、歌唱をより楽
しむことができる。Further, the music performance device according to the second aspect of the present invention displays information indicating whether the output destination of the audio signal is the music / speech output means or the speech recognition means. It is possible to easily determine when a voice input for an instruction to adjust a tune, a volume, a tempo, etc. is possible from the input means and when a singing voice is input. Therefore, the user can adjust the volume and tempo between the song selection, the singing, and the singing with good timing, and can enjoy the singing more.

【００１６】さらに、この発明の請求項３の音楽演奏装
置は、音楽の曲間，前奏，曲中，中奏，及び後奏等の各
種の動作モードを識別し、その動作モードに応じて音声
認識手段が参照する音声認識用辞書の種類を切り替える
ので、リクエストが行なわれる可能性が高い曲間と後奏
時に入力された音声の認識には、選曲のための曲名や曲
番号の認識に必要な曲名認識用の辞書を参照し、音量や
テンポの調整がなされる可能性が高い前奏と中奏時に入
力された音声の認識には、音量やテンポの調整のための
制御コマンドの認識に必要な制御コマンド認識用の辞書
を参照することができる。したがって、各動作モード毎
に最も為される確率の高い指示内容の認識に絞り込むこ
とにより、認識精度を向上させることができる。Further, the music performance apparatus according to a third aspect of the present invention distinguishes between various operation modes such as inter-music, prelude, mid-course, middle, and subsequent, and outputs sound according to the operation mode. Since the type of voice recognition dictionary to be referred to by the recognition means is switched, it is necessary to recognize the song name and song number for song selection in order to recognize between songs that are likely to be requested and voice input during the subsequent performance It is necessary to recognize the control commands for adjusting the volume and tempo for recognizing voices input during preludes and middle plays that are likely to be adjusted in volume and tempo by referring to a dictionary for song title recognition It is possible to refer to a dictionary for recognizing various control commands. Therefore, the recognition accuracy can be improved by narrowing down to the recognition of the instruction content that is most likely to be performed for each operation mode.

【００１７】また、この発明の請求項４の音楽演奏装置
は、音声認識に使用する辞書の種類がいずれであるかの
情報を表示するので、使用者に対して選曲と音量，テン
ポ等の調整指示とのいずれを行なうのが適当なのかを判
断させることができる。例えば、使用者に対して音楽が
再生されていないときと再生時の後奏時には選曲を行な
わせ、再生開始から終了までの中奏時などで音量やテン
ポの調整を行なわせることができる。したがって、使用
者が意図する指示動作を確実に行なうことができ、使用
者により快適な歌唱環境を提供することができる。Further, the music performance apparatus according to the fourth aspect of the present invention displays information as to which type of dictionary is used for voice recognition. It is possible to determine which of the instructions is appropriate. For example, the user can be caused to select a music when the music is not being reproduced and at the time of the subsequent performance during the reproduction, and to adjust the volume and the tempo at the time of the middle performance from the start to the end of the reproduction. Therefore, the instruction operation intended by the user can be reliably performed, and a more comfortable singing environment can be provided for the user.

【００１８】さらに、この発明の請求項５の音楽演奏装
置は、音楽の曲間，前奏，曲中，中奏，及び後奏等の各
種の動作モードを識別し、その動作モードに応じて音声
入力手段から入力される音声信号の出力先を音声認識手
段と音楽・音声出力手段のいずれかに切り替えると共
に、その動作モードに応じて音声認識手段が参照する音
声認識用辞書の種類を切り替えるので、カラオケ等の音
楽を再生していないときと後奏時に入力された音声から
選曲された曲名等の認識を実行し、再生時の前奏と中奏
時に入力された音声から音量，テンポ等の調整指示の認
識を実行し、曲中に入力された音声を歌唱音声として音
楽と共に出力することができる。Further, the music performance apparatus according to a fifth aspect of the present invention identifies various operation modes such as inter-music, prelude, mid-music, middle and last, and outputs sound according to the operation mode. Since the output destination of the voice signal input from the input means is switched to either the voice recognition means or the music / voice output means, and the type of the voice recognition dictionary referred to by the voice recognition means is switched according to the operation mode, Recognition of the selected song, etc., from the voice input during karaoke or other music not played and during the subsequent performance, and instructions for adjusting the volume, tempo, etc. based on the voice input during the prelude and the middle during playback. , And the voice input during the song can be output as singing voice along with the music.

【００１９】したがって、使用者が１つの音声入力手段
によって歌唱と音声による選曲や音量及びテンポの調整
等の指示を容易に行なうことができ、その音声による指
示内容を確実に実行することができる。Therefore, the user can easily give an instruction such as selection of a song by singing and voice, adjustment of a volume and a tempo by one voice input means, and the content of the instruction by the voice can be surely executed.

【００２０】また、この発明の請求項６の音楽演奏装置
は、音声信号の出力先が音楽・音声出力手段と音声認識
手段のいずれであるかの情報と共に、音声認識に使用す
る辞書の種類がいずれであるかの情報を表示するので、
使用者に対して歌唱，選曲，音量やテンポ等の調整指示
とのいずれを行なうのが適当なのかを判断させることが
できる。したがって、使用者が歌唱，選曲，音量及びテ
ンポの調整指示に手間取ることなく歌唱を楽しむことが
できる。According to a sixth aspect of the present invention, there is provided a music performance apparatus, wherein the type of dictionary used for voice recognition is determined along with information indicating whether the output destination of the voice signal is music / voice output means or voice recognition means. Since it displays information about which one is
It is possible to make the user determine whether it is appropriate to perform a singing, a song selection, or an instruction to adjust the volume or the tempo. Therefore, the user can enjoy singing without having to spend time singing, selecting songs, adjusting volume and tempo.

【００２１】[0021]

【発明の実施の形態】以下、この発明の実施の形態を図
面に基づいて具体的に説明する。図１は、この発明の一
実施形態であるカラオケ装置の構成を示すブロック図で
ある。Embodiments of the present invention will be specifically described below with reference to the drawings. FIG. 1 is a block diagram showing a configuration of a karaoke apparatus according to an embodiment of the present invention.

【００２２】このカラオケ装置は、ＣＰＵ，ＲＯＭ，及
びＲＡＭ等からなるマイクロコンピュータによって実現
され、マイク１，マイクアンプ部２，マイク信号切替部
３，カラオケ再生部４，カラオケ曲データ記憶部５，ミ
キサー部６，制御部７，音声認識部８，スピーカ９，及
び表示部１０からなる。This karaoke apparatus is realized by a microcomputer including a CPU, a ROM, a RAM, and the like, and includes a microphone 1, a microphone amplifier 2, a microphone signal switching unit 3, a karaoke reproducing unit 4, a karaoke music data storage unit 5, a mixer. It comprises a unit 6, a control unit 7, a voice recognition unit 8, a speaker 9, and a display unit 10.

【００２３】マイク１は、歌唱音声と選曲や音量及びテ
ンポ等の調整指示の音声とを入力する音声入力装置であ
る。マイクアンプ部２は、マイク１から入力された音声
信号を増幅してマイク信号切替部３へ出力する。The microphone 1 is a voice input device for inputting a singing voice and voices for selecting songs, adjusting volume, tempo, and the like. The microphone amplifier unit 2 amplifies the audio signal input from the microphone 1 and outputs the amplified audio signal to the microphone signal switching unit 3.

【００２４】マイク信号切替部３は、制御部７からの出
力先切替信号に基づいてマイク１から入力された音声信
号をミキサー部６と音声認識部８のいずれかに出力する
ための切り替え処理を行なう。カラオケ再生部４は、カ
ラオケ曲データ記憶部５からリクエストされた曲の音楽
データ（「曲データ」とも称する）を参照して、その音
楽データから再生した音楽信号をミキサー部６へ出力す
るＭＩＤＩ音源等のカラオケ音楽再生装置である。The microphone signal switching unit 3 performs a switching process for outputting an audio signal input from the microphone 1 to one of the mixer unit 6 and the audio recognition unit 8 based on an output destination switching signal from the control unit 7. Do. The karaoke playback unit 4 refers to music data (also referred to as “song data”) of the song requested from the karaoke song data storage unit 5 and outputs a music signal reproduced from the music data to the mixer unit 6. And the like.

【００２５】カラオケ曲データ記憶部５は、多種類のカ
ラオケ曲の音楽データ（例えば、ＭＩＤＩデータ）を記
憶したハードディスク等の記憶装置であり、各音楽デー
タには予めカラオケ再生時の複数段階の動作モードを識
別するための情報が付加されている。ミキサー部６は、
カラオケ再生部４から受け取った音楽信号とマイク信号
切替部３から受け取った音声信号とを合わせてスピーカ
９へ出力する。The karaoke song data storage unit 5 is a storage device such as a hard disk which stores music data (for example, MIDI data) of various types of karaoke songs. Information for identifying the mode is added. The mixer unit 6
The music signal received from the karaoke reproducing unit 4 and the audio signal received from the microphone signal switching unit 3 are combined and output to the speaker 9.

【００２６】制御部７は、このカラオケ装置全体の制御
を司り、カラオケ音楽を再生していないときの動作モー
ドと、カラオケ音楽の再生時にカラオケ再生部４で再生
されるカラオケ曲データ記憶部５の音楽データを参照
し、その音楽データ中の各動作モードを識別するための
情報に基づいて各段階の動作モードを識別して、各動作
モードに応じてマイク信号切替部３へマイク入力信号の
出力先切替信号を出力する処理と、音声認識部８から受
け取った認識結果に対応する選曲や音量及びテンポの調
整処理のコマンド制御処理と、マイク１から入力された
音声の出力先がスピーカ９か音声認識部８かを知らせる
メッセージを表示する処理も実行する。The control unit 7 controls the entire karaoke apparatus. The operation mode when the karaoke music is not reproduced and the karaoke music data storage unit 5 reproduced by the karaoke reproduction unit 4 when the karaoke music is reproduced. The music data is referred to, the operation mode of each stage is identified based on information for identifying each operation mode in the music data, and the microphone input signal is output to the microphone signal switching unit 3 according to each operation mode. A process of outputting a destination switching signal, a command control process of a music selection corresponding to the recognition result received from the voice recognition unit 8 and a process of adjusting a volume and a tempo, and an output destination of the voice input from the microphone 1 is the speaker 9 or the voice. A process of displaying a message notifying the recognition unit 8 is also executed.

【００２７】音声認識部８は、マイク信号切替部３から
受け取った音声信号を予め登録されている辞書に基づい
て選曲された曲名や音量及びテンポの調整処理のコマン
ド制御指示の内容を認識する処理を実行し、その認識結
果を制御部７へ出力する。スピーカ９は、カラオケ音楽
と共に歌唱音声を聴取可能に出力する。表示部１０は、
カラオケ使用時の各種の操作ガイダンスやマイク１から
入力された音声の出力先がスピーカ９か音声認識部８か
を知らせるメッセージ等の各種の情報を表示するＬＣ
Ｄ，ＣＲＴ等の表示装置である。The voice recognizing unit 8 recognizes the content of the command of the music name, volume and tempo selected based on the dictionary registered in advance, based on the voice signal received from the microphone signal switching unit 3. And outputs the recognition result to the control unit 7. The speaker 9 outputs the singing voice together with the karaoke music so as to be audible. The display unit 10
LC that displays various information such as various operation guidance when using karaoke and a message indicating that the output destination of the voice input from the microphone 1 is the speaker 9 or the voice recognition unit 8
A display device such as a D or CRT.

【００２８】すなわち、上記カラオケ再生部４が音楽デ
ータに基づいて音楽信号を再生する音楽再生手段の機能
を果たし、上記マイク１が音声信号を入力する音声入力
手段の機能を果たし、上記ミキサー部６及びスピーカ９
がカラオケ再生部４によって再生された音楽信号による
音楽とマイク１によって入力された音声信号による音声
とを合わせて出力する音楽・音声出力手段の機能を果た
す。That is, the karaoke reproducing section 4 functions as a music reproducing means for reproducing a music signal based on music data, the microphone 1 functions as a sound input means for inputting a sound signal, and the mixer section 6 And speaker 9
Functions as a music / speech output unit that outputs the music based on the music signal reproduced by the karaoke reproducing unit 4 and the sound based on the audio signal input by the microphone 1 together.

【００２９】また、上記音声認識部８がマイク１によっ
て入力された音声信号に基づく音声の指示内容を認識す
る音声認識手段の機能を果たし、制御部７が音声認識部
８によって認識された指示内容に対応する処理を実行す
る手段と、上記音楽データを再生していないときと再生
時にかかわる複数段階の動作モードを識別する動作モー
ド識別手段の機能を果たす。The voice recognition section 8 functions as voice recognition means for recognizing voice instructions based on a voice signal input by the microphone 1, and the control section 7 controls the voice content recognized by the voice recognition section 8. And a function of an operation mode identifying means for identifying a plurality of stages of operation modes involved when the music data is not reproduced and when the music data is reproduced.

【００３０】さらに、上記マイク信号切替部３が制御部
７によって識別された動作モードに応じてマイク１によ
って入力される音声信号の出力先をミキサー部６及びス
ピーカ９と音声認識部８とのいずれかに切り替える音声
出力先切替手段の機能を果たす。そして、上記表示部１
０がマイク信号切替部３による音声信号の出力先を知ら
せる情報を出力する手段の機能を果たす。Further, the microphone signal switching unit 3 determines whether the output destination of the audio signal input by the microphone 1 according to the operation mode identified by the control unit 7 is any one of the mixer unit 6, the speaker 9, and the voice recognition unit 8. The function of the audio output destination switching means for switching between the crab and the crab is performed. Then, the display unit 1
0 functions as a means for outputting information notifying the output destination of the audio signal by the microphone signal switching unit 3.

【００３１】次に、このカラオケ装置における処理を説
明する。まず、カラオケ音楽の再生時には、曲間，前
奏，曲中，中奏，及び後奏等の複数段階の動作モードが
ある。曲間モードは、カラオケ音楽が再生されていない
ときである。前奏モードはカラオケ音楽の再生を準備
し、再生を開始し、最初の歌詞の歌い出し（イントロ）
までである。曲中モードはカラオケ歌唱部分（例えば、
１番，２番など）である。中奏モードは曲中の歌わない
部分（例えば、１番と２番の間）である。後奏モード
は、歌詞を歌い終わってカラオケ音楽の再生が入力する
までである。Next, processing in the karaoke apparatus will be described. First, at the time of reproducing the karaoke music, there are a plurality of stages of operation modes such as inter-song, prelude, mid-song, middle, and after. The inter-song mode is when karaoke music is not being played. Prelude mode prepares for playback of karaoke music, starts playback, and starts singing the first lyrics (intro)
Up to. In song mode, the karaoke singing part (for example,
No. 1 and No. 2). The middle performance mode is a portion of the music that is not sung (for example, between No. 1 and No. 2). In the after-play mode, the singing of lyrics ends and the playback of karaoke music is input.

【００３２】このカラオケ装置は、予めカラオケ曲デー
タ記憶部５に記憶する音楽データ（カラオケ曲データ）
の各所に上記各動作モードを識別するためのタイミング
データを付加している。このタイミングデータを付加す
る位置は、例えば、１番の歌詞の歌い出しの箇所，１番
の歌の終了箇所，２番の歌詞を歌う開始箇所，２番の歌
の終了箇所などである。In the karaoke apparatus, music data (karaoke music data) stored in the karaoke music data storage unit 5 in advance.
The timing data for identifying each of the above operation modes is added to each of the sections. The locations to which the timing data is added are, for example, the first song singing location, the first song ending location, the second song singing start location, the second song ending location, and the like.

【００３３】そして、制御部７は、カラオケ曲データを
再生していないときには曲モードと判断し、カラオケ曲
データの再生時は、カラオケ曲データ中のタイミングデ
ータに基づいて前奏モード，曲中モード，中奏モード，
及び後奏モードを識別することができる。When the karaoke music data is not being reproduced, the control section 7 judges that the music mode is the music mode. When the karaoke music data is reproduced, the control section 7 performs the prelude mode, the music mode, and the music reproduction mode based on the timing data in the karaoke music data. Middle mode,
And the trailing mode can be identified.

【００３４】このカラオケ装置は、マイク１から入力さ
れた音声信号をマイクアンプ部２で増幅すると、その増
幅した音声信号をマイク信号切替部３へ送信する。マイ
ク信号切替部３は、制御部７から出力先をミキサー部６
に切り替える出力先切替信号を受信すると、マイクアン
プ部２から受信した音声信号を信号線ａを介してミキサ
ー部６へ出力し、出力先を音声認識部８に切り替える出
力先切替信号を受信すると、マイクアンプ部２から受信
した音声信号を信号線ｂを介して音声認識部８へ出力す
る。In the karaoke apparatus, when the audio signal input from the microphone 1 is amplified by the microphone amplifier unit 2, the amplified audio signal is transmitted to the microphone signal switching unit 3. The microphone signal switching unit 3 sends an output destination from the control unit 7 to the mixer unit 6.
When the output destination switching signal for switching to the audio amplifier is received, the audio signal received from the microphone amplifier unit 2 is output to the mixer unit 6 via the signal line a, and the output destination switching signal for switching the output destination to the audio recognition unit 8 is received. The voice signal received from the microphone amplifier unit 2 is output to the voice recognition unit 8 via the signal line b.

【００３５】一方、カラオケ再生部４は、制御部７によ
って選曲されたカラオケ曲データ記憶部５のカラオケ曲
データを参照し、そのカラオケ曲データによってカラオ
ケ音楽の音楽信号を再生してミキサー部６へ出力し、ミ
キサー部６はカラオケ再生部４から受信した音楽信号と
マイク信号切替部３から受信した音声信号を合わせて
（ミックスして）スピーカ９へ出力する。On the other hand, the karaoke reproducing section 4 refers to the karaoke music data stored in the karaoke music data storage section 5 selected by the control section 7, reproduces the music signal of the karaoke music based on the karaoke music data, and sends it to the mixer section 6. The mixer unit 6 combines (mixes) the music signal received from the karaoke reproducing unit 4 and the audio signal received from the microphone signal switching unit 3 and outputs the combined signal to the speaker 9.

【００３６】また、音声認識部８は、予め登録されてい
る辞書に基づいてマイク信号切替部３から受信した音声
信号から選曲された曲名や番号、又は音量やテンポ等の
調整のコマンド制御を認識し、その認識結果を制御部７
へ出力して、制御部７は音声認識部８から受信した認識
結果に基づいて選曲や音量，テンポ等の調整を行なうと
共に、表示部１０に音声信号の出力先を知らせるメッセ
ージを表示する。The voice recognition unit 8 recognizes a song name or number selected from a voice signal received from the microphone signal switching unit 3 or a command control for adjusting volume, tempo, etc., based on a dictionary registered in advance. The recognition result is sent to the control unit 7
The control unit 7 adjusts the music selection, volume, tempo, and the like based on the recognition result received from the voice recognition unit 8, and displays a message on the display unit 10 informing the output destination of the voice signal.

【００３７】さらに説明する。曲間モードのとき、制御
部７はマイク信号切替部３の出力先を音声認識部８へ切
り替え、マイク１からの音声信号を音声認識部８へ出力
させる。この音声認識部８で音声信号からリクエストの
曲名を認識すると、その曲名を制御部７へ出力し、制御
部７はカラオケ再生部４にカラオケ曲データ記憶部５の
該当するカラオケ曲データを再生させる。Further description will be given. In the inter-song mode, the control unit 7 switches the output destination of the microphone signal switching unit 3 to the voice recognition unit 8 and outputs the voice signal from the microphone 1 to the voice recognition unit 8. When the voice recognition section 8 recognizes the song title of the request from the voice signal, it outputs the song title to the control section 7, and the control section 7 causes the karaoke playback section 4 to play the corresponding karaoke song data in the karaoke song data storage section 5. .

【００３８】そして、カラオケ再生部４によるカラオケ
音楽の再生時、制御部７はカラオケ曲データのタイミン
グデータに基づいて前奏モードが終了するタイミングを
認知すると、マイク信号切替部３の出力先をミキサー部
６へ切り替え、マイク１からの音声信号をミキサー部６
へ出力させる。したがって、ミキサー部６によってカラ
オケ音楽の音楽信号と歌唱音声の音声信号がミックスさ
れてスピーカ９から出力される。When the karaoke reproducing section 4 reproduces the karaoke music, the control section 7 recognizes the timing of the end of the prelude mode based on the timing data of the karaoke music data, and changes the output destination of the microphone signal switching section 3 to the mixer section. 6 and the audio signal from the microphone 1 is
Output to Accordingly, the music signal of the karaoke music and the audio signal of the singing voice are mixed by the mixer section 6 and output from the speaker 9.

【００３９】また、制御部７はカラオケ音楽の１番が終
了して中奏モードになったタイミングを認知すると、マ
イク信号切替部３の出力先を再び音声認識部８へ切り替
え、マイク１からの音声信号を音声認識部８へ出力させ
る。この音声認識部８で音声信号から「音量アップ」や
「テンポダウン」等の調整のコマンド制御を認識する
と、そのコマンド制御を制御部７へ出力し、制御部７は
カラオケ再生部４やミキサー部６に対する各種の調整の
コマンド制御を実行する。When the control unit 7 recognizes the timing at which the karaoke music is finished and the karaoke music is in the middle mode, the control unit 7 switches the output destination of the microphone signal switching unit 3 to the voice recognition unit 8 again, and The voice signal is output to the voice recognition unit 8. When the voice recognition unit 8 recognizes the command control for the adjustment such as “volume up” or “tempo down” from the voice signal, the command control is output to the control unit 7, and the control unit 7 outputs the karaoke playback unit 4 and the mixer unit. 6 is executed for various adjustment commands.

【００４０】さらに、制御部７は中奏モードが終了して
２番の開始のタイミングを認知すると、マイク信号切替
部３の出力先を再びミキサー部６へ切り替え、マイク１
からの音声信号をミキサー部６へ出力させる。したがっ
て、ミキサー部６によってカラオケ音楽の音楽信号と歌
唱音声の音声信号がミックスされてスピーカ９から出力
される。Further, when the control unit 7 recognizes the start timing of the second after the middle mode ends, the control unit 7 switches the output destination of the microphone signal switching unit 3 to the mixer unit 6 again, and
Is output to the mixer unit 6. Accordingly, the music signal of the karaoke music and the audio signal of the singing voice are mixed by the mixer section 6 and output from the speaker 9.

【００４１】さらにまた、制御部７はカラオケ音楽の後
奏モードになったタイミングを認知すると、マイク信号
切替部３の出力先を再び音声認識部８へ切り替え、マイ
ク１からの音声信号を音声認識部８へ出力させる。この
音声認識部８で音声信号からリクエストの曲名を認識す
ると、その曲名を制御部７へ出力し、制御部７はカラオ
ケ再生部４にカラオケ曲データ記憶部５の該当するカラ
オケ曲データを再生させる。Further, when the control unit 7 recognizes the timing at which the karaoke music enters the after-play mode, it switches the output destination of the microphone signal switching unit 3 to the voice recognition unit 8 again, and recognizes the voice signal from the microphone 1 by voice recognition. Output to the unit 8. When the voice recognition unit 8 recognizes the song title of the request from the audio signal, it outputs the song title to the control unit 7, and the control unit 7 causes the karaoke playback unit 4 to play the corresponding karaoke song data in the karaoke song data storage unit 5. .

【００４２】そして、制御部７は音声信号の出力先を音
声認識部８へ切り替えたときには、表示部１０に「マイ
クで選曲ができます」「マイクで音量やテンポを調整で
きます」等のメッセージを表示し、出力先をミキサー部
６へ切り替えたときには、表示部１０に「マイクで歌え
ます」等のメッセージを表示する。When the control unit 7 switches the output destination of the audio signal to the audio recognition unit 8, the display unit 10 displays a message such as "You can select music with a microphone", "You can adjust the volume and tempo with a microphone". Is displayed, and when the output destination is switched to the mixer section 6, a message such as "You can sing with a microphone" is displayed on the display section 10.

【００４３】このようにして、一般のマイクで煩雑な切
り替え操作をしなくても、カラオケの歌唱と選曲や音
量，テンポ，キー等の調整を容易に行なえるので、使用
者の歌唱環境を向上させることができる。In this way, singing of karaoke and selection of karaoke and adjustment of volume, tempo, keys, etc. can be easily performed without complicated switching operation with a general microphone, thereby improving the singing environment of the user. Can be done.

【００４４】次に、この発明の他の実施形態について説
明する。一般に音声認識では、認識対象が少ないほど認
識精度が向上するものである。カラオケ装置を制御する
場合、大きく分けて選曲とコマンド操作に分けることが
できる。選曲の場合、音声による認識対象は曲名や曲番
号であり、コマンド制御の場合、音声による認識対象は
音量調整の制御や、テンポコントロール，キーコントロ
ールの制御である。Next, another embodiment of the present invention will be described. Generally, in speech recognition, the smaller the number of recognition targets, the higher the recognition accuracy. When the karaoke apparatus is controlled, it can be roughly divided into music selection and command operation. In the case of song selection, the recognition target by voice is a song name or a song number, and in the case of command control, the recognition target by voice is control of volume adjustment, tempo control, and key control.

【００４５】そして、一般に選曲は曲間モード時に、コ
マンド操作は選曲後の前奏モード時にそれぞれ行なわれ
ることが多いので、カラオケ装置において、曲間モード
時の音声認識に使用する辞書を曲名（あるいは曲番号）
用辞書にし、前奏モード時の音声認識に使用する辞書を
コマンド制御用辞書に切り替えるようにすれば、認識対
象を絞り込んで認識精度を向上させることができる。そ
こで、動作モードに応じて音声認識に使用する辞書を切
り替えるカラオケ装置を提供する。In general, song selection is often performed in the inter-song mode, and command operation is often performed in the prelude mode after the song selection. Therefore, in the karaoke apparatus, the dictionary used for voice recognition in the inter-song mode is designated by the song name (or song name). number)
If the dictionary used for voice recognition in the prelude mode is switched to the command control dictionary, recognition targets can be narrowed and recognition accuracy can be improved. Therefore, a karaoke apparatus for switching a dictionary used for voice recognition according to an operation mode is provided.

【００４６】図２は、この発明の他の実施形態である動
作モードに応じて音声認識に使用する辞書を切り替える
カラオケ装置の構成を示す図であり、図１と共通する部
分には同一符号を付し、その説明を省略する。FIG. 2 is a diagram showing a configuration of a karaoke apparatus for switching a dictionary used for speech recognition according to an operation mode according to another embodiment of the present invention. And description thereof is omitted.

【００４７】このカラオケ装置には、図１に示したカラ
オケ装置の機能部の他に、新たに辞書切替部１１，辞書
記憶部１２を設けている。辞書切替部１１は、制御部７
からの辞書切替信号に基づいて音声認識部８が音声認識
時に参照する辞書記憶部１２の辞書を切り替える処理を
行なう。辞書記憶部１２は、音声認識部８が参照するリ
クエスト曲を認識するための曲名辞書１３と音量，テン
ポ，キー等の調整のコマンド制御の内容を認識するため
のコマンド辞書１４を備えている。This karaoke apparatus is provided with a dictionary switching section 11 and a dictionary storage section 12 in addition to the functional sections of the karaoke apparatus shown in FIG. The dictionary switching unit 11 includes the control unit 7
The speech recognition unit 8 performs a process of switching the dictionary of the dictionary storage unit 12 to be referred to at the time of speech recognition, based on the dictionary switching signal from. The dictionary storage unit 12 includes a song name dictionary 13 for recognizing the requested song referenced by the voice recognition unit 8 and a command dictionary 14 for recognizing the contents of the command control for adjusting the volume, tempo, key and the like.

【００４８】すなわち、上記辞書記憶部１２が複数種類
の音声認識用辞書を記憶した音声認識用辞書記憶手段に
相当し、上記音声認識部８が辞書記憶部１２に記憶され
た各音声認識用辞書を参照してマイク１によって入力さ
れた音声信号に基づく音声の指示内容を認識する音声認
識手段の機能を果たす。また、上記辞書切替部１１が制
御部７によって識別された動作モードに応じて音声認識
部８が参照する辞書記憶部１２の音声認識用辞書の種類
を切り替える音声認識用辞書切替手段の機能を果たす。That is, the dictionary storage unit 12 corresponds to a voice recognition dictionary storage unit storing a plurality of types of voice recognition dictionaries, and the voice recognition unit 8 corresponds to each of the voice recognition dictionaries stored in the dictionary storage unit 12. And performs the function of voice recognition means for recognizing the content of a voice instruction based on the voice signal input by the microphone 1 with reference to FIG. Further, the dictionary switching unit 11 functions as a voice recognition dictionary switching unit that switches the type of the voice recognition dictionary in the dictionary storage unit 12 referred to by the voice recognition unit 8 according to the operation mode identified by the control unit 7. .

【００４９】さらに、上記制御部７及び表示部１０が辞
書切替部１１によって切り替えた音声認識用辞書の種類
を知らせる情報を出力する手段の機能を果たす。さらに
また、上記制御部７及び表示部１０は、マイク信号切替
部３による音声信号の出力先を知らせる情報と、辞書切
替部１１によって切り替えた音声認識用辞書の種類を知
らせる情報とを出力する手段の機能も果たす。Further, the control section 7 and the display section 10 function as a means for outputting information notifying the type of the speech recognition dictionary switched by the dictionary switching section 11. Furthermore, the control unit 7 and the display unit 10 output information for notifying the output destination of the audio signal by the microphone signal switching unit 3 and information for notifying the type of the voice recognition dictionary switched by the dictionary switching unit 11. Also performs the function of

【００５０】次に、このカラオケ装置における処理を説
明する。このカラオケ装置において、制御部７は、図１
で説明したようにして音声信号の出力先を切り替えると
共に、各動作モードに応じて信号線ｄを介して辞書切替
部１１へ辞書切替信号を出力し、音声認識部８が音声認
識時に参照する辞書記憶部１２の曲名辞書１３とコマン
ド辞書１４を切り替えさせる。Next, processing in the karaoke apparatus will be described. In this karaoke device, the control unit 7
As described above, the output destination of the voice signal is switched, and a dictionary switching signal is output to the dictionary switching unit 11 via the signal line d according to each operation mode, and the dictionary which the voice recognition unit 8 refers to at the time of voice recognition. The song dictionary 13 and the command dictionary 14 in the storage unit 12 are switched.

【００５１】さらに説明する。曲間モードのとき、制御
部７はマイク信号切替部３の出力先を音声認識部８へ切
り替え、マイク１からの音声信号を音声認識部８へ出力
させると共に、辞書切替部１１に音声認識部８が曲名辞
書１３を参照するように切り替えさせる。この音声認識
部８は曲名辞書１３を参照して音声信号からリクエスト
の曲名を認識すると、その曲名を制御部７へ出力し、制
御部７はカラオケ再生部４にカラオケ曲データ記憶部５
の該当するカラオケ曲データを再生させる。こうして、
選曲のための音声入力がされることが多い曲間モード時
には、曲名を認識するための辞書を使用して曲名の認識
精度を高めることができる。Further description will be given. In the inter-song mode, the control unit 7 switches the output destination of the microphone signal switching unit 3 to the voice recognition unit 8, outputs the voice signal from the microphone 1 to the voice recognition unit 8, and causes the dictionary switching unit 11 to output the voice recognition unit. 8 is switched so as to refer to the song name dictionary 13. When the voice recognition unit 8 recognizes the requested song name from the audio signal with reference to the song name dictionary 13, it outputs the song name to the control unit 7, and the control unit 7 sends the karaoke song data storage unit 5 to the karaoke playback unit 4.
The karaoke song data corresponding to is reproduced. Thus,
In the inter-song mode in which voice input is frequently performed for song selection, a dictionary for recognizing song titles can be used to improve song title recognition accuracy.

【００５２】また、カラオケ再生部４によるカラオケ音
楽の再生時、制御部７はカラオケ曲データのタイミング
データに基づいて前奏モードを認知すると、辞書切替部
１１に音声認識部８がコマンド辞書１４を参照するよう
に切り替えさせ、音声認識部８はコマンド辞書１４を参
照して音声信号から音量，テンポ，キー等の調整のコマ
ンド制御内容を認識すると、そのコマンド制御の内容を
制御部７へ出力し、制御部７はカラオケ再生部４及びミ
キサー部６に対するカラオケ曲データ再生時のコマンド
制御を実行する。When the karaoke reproducing section 4 reproduces the karaoke music, the control section 7 recognizes the prelude mode based on the timing data of the karaoke tune data, and the voice recognition section 8 refers to the command dictionary 14 to the dictionary switching section 11. When the voice recognition unit 8 recognizes the command control content for adjusting the volume, tempo, key and the like from the voice signal with reference to the command dictionary 14, the voice recognition unit 8 outputs the command control content to the control unit 7, The control unit 7 executes a command control for the karaoke reproducing unit 4 and the mixer unit 6 at the time of reproducing the karaoke music data.

【００５３】こうして、コマンド制御のための音声入力
がされることが多い前奏モード時には、コマンド制御の
内容を認識するための辞書を使用してコマンド制御の認
識精度を高めることができる。Thus, in the prelude mode in which voice input for command control is frequently performed, the recognition accuracy of command control can be improved by using a dictionary for recognizing the contents of command control.

【００５４】そして、カラオケ再生部４によるカラオケ
音楽の再生時、制御部７はカラオケ曲データのタイミン
グデータに基づいて前奏モードが終了するタイミングを
認知すると、マイク信号切替部３の出力先をミキサー部
６へ切り替え、マイク１からの音声信号をミキサー部６
へ出力させる。したがって、ミキサー部６によってカラ
オケ音楽の音楽信号と歌唱音声の音声信号がミックスさ
れてスピーカ９から出力される。When the karaoke reproducing section 4 reproduces the karaoke music, the control section 7 recognizes the timing of the end of the prelude mode based on the timing data of the karaoke music data, and changes the output destination of the microphone signal switching section 3 to the mixer section. 6 and the audio signal from the microphone 1 is
Output to Accordingly, the music signal of the karaoke music and the audio signal of the singing voice are mixed by the mixer section 6 and output from the speaker 9.

【００５５】また、制御部７はカラオケ音楽の１番が終
了して中奏モードになったタイミングを認知すると、マ
イク信号切替部３の出力先を再び音声認識部８へ切り替
え、マイク１からの音声信号を音声認識部８へ出力させ
る。音声認識部８はコマンド辞書１４を参照して音声信
号から「音量アップ」や「テンポダウン」等の調整のコ
マンド制御を認識すると、そのコマンド制御を制御部７
へ出力して、制御部７はカラオケ再生部４やミキサー部
６に対する各種の調整のコマンド制御を実行する。When the control unit 7 recognizes the timing at which the karaoke music number 1 ends and the mode changes to the middle mode, the control unit 7 switches the output destination of the microphone signal switching unit 3 to the voice recognition unit 8 again, and The voice signal is output to the voice recognition unit 8. When the voice recognition unit 8 recognizes the command control for the adjustment such as “volume up” or “tempo down” from the voice signal with reference to the command dictionary 14, the command recognition is performed by the control unit 7.
The control unit 7 executes command control for various adjustments to the karaoke reproducing unit 4 and the mixer unit 6.

【００５６】こうして、前奏モードと共にコマンド制御
のための音声入力がされることが多い中奏モード時に
も、コマンド制御の内容を認識するための辞書を使用し
てコマンド制御の認識精度を高めることができる。As described above, even in the intermediate mode where the voice for command control is frequently input together with the prelude mode, the recognition accuracy of the command control can be improved by using the dictionary for recognizing the contents of the command control. it can.

【００５７】さらに、制御部７は中奏モードが終了して
２番の開始のタイミングを認知すると、マイク信号切替
部３の出力先を再びミキサー部６へ切り替え、マイク１
からの音声信号をミキサー部６へ出力させる。したがっ
て、ミキサー部６によってカラオケ音楽の音楽信号と歌
唱音声の音声信号がミックスされてスピーカ９から出力
される。Further, when the control section 7 recognizes the timing of the start of the second after the end of the middle mode, the control section 7 switches the output destination of the microphone signal switching section 3 to the mixer section 6 again, and
Is output to the mixer unit 6. Accordingly, the music signal of the karaoke music and the audio signal of the singing voice are mixed by the mixer section 6 and output from the speaker 9.

【００５８】さらにまた、制御部７はカラオケ音楽の後
奏モードになったタイミングを認知すると、マイク信号
切替部３の出力先を再び音声認識部８へ切り替え、マイ
ク１からの音声信号を音声認識部８へ出力させると共
に、辞書切替部１１に音声認識部８が曲名辞書１３を参
照するように切り替えさせる。この音声認識部８は曲名
辞書１３を用いて音声信号からリクエストの曲名を認識
すると、その曲名を制御部７へ出力し、制御部７はカラ
オケ再生部４にカラオケ曲データ記憶部５の該当するカ
ラオケ曲データを再生させる。Further, when the control unit 7 recognizes the timing at which the karaoke music enters the after-play mode, the output destination of the microphone signal switching unit 3 is switched to the voice recognition unit 8 again, and the voice signal from the microphone 1 is recognized by voice. In addition to the output to the section 8, the dictionary switching section 11 switches the voice recognition section 8 to refer to the song name dictionary 13. When the voice recognizing unit 8 recognizes the song title of the request from the audio signal using the song title dictionary 13, it outputs the song title to the control unit 7, and the control unit 7 corresponds to the karaoke song data storage unit 5 to the karaoke playback unit 4. Play karaoke song data.

【００５９】こうして、曲間モードと共にコマンド制御
のための音声入力がされることが多い後奏モード時に
は、リクエストの曲名を認識するための辞書を使用して
選曲の認識精度を高めることができる。このように辞書
を切り替える場合、歌唱用のマイクとは異なる音声認識
用のマイクを使用することもできる。Thus, in the after-play mode in which voice input for command control is frequently performed together with the inter-song mode, a dictionary for recognizing the title of the requested song can be used to improve the recognition accuracy of the song selection. When the dictionary is switched in this manner, a microphone for voice recognition different from a microphone for singing can be used.

【００６０】そして、制御部７は音声信号の認識に使用
する辞書を曲名辞書１３に切り替えたときには、表示部
１０に「マイクで選曲ができます」を表示し、コマンド
辞書１４に切り替えたときには、表示部１０に「マイク
で音量やテンポを調整できます」等のメッセージを表示
する。When the dictionary used for voice signal recognition is switched to the song name dictionary 13, the control unit 7 displays “You can select a song with a microphone” on the display unit 10, and when it switches to the command dictionary 14, A message such as “You can adjust the volume and tempo with a microphone” is displayed on the display unit 10.

【００６１】このようにして、音声による選曲と音量，
テンポ，及びキー等の調整の認識精度を向上させること
ができるので、使用者が音声で指示した内容を確実に実
行することができ、誤認識によって指示された操作が為
されなかったり、使用者の意図と異なる操作がされたり
するような歌唱環境が悪化することを防止し、より快適
な歌唱環境を提供することができる。In this manner, the music selection and the sound volume by voice,
Since the accuracy of recognizing the adjustment of the tempo, the key, and the like can be improved, the content instructed by the user by voice can be reliably executed, and the operation instructed by the erroneous recognition is not performed, or the user is not performed. It is possible to prevent a singing environment in which an operation different from the intended operation is performed from being deteriorated, and to provide a more comfortable singing environment.

【００６２】なお、上述の実施形態では、カラオケ装置
について説明したが、センタ装置から公衆回線を介して
カラオケ曲データや各種のサービスを受けられる通信カ
ラオケ装置や、カラオケ曲データを再生可能なパーソナ
ルコンピュータなどにおいても上述と同じように実施す
ることができる。Although the karaoke apparatus has been described in the above embodiment, a communication karaoke apparatus that can receive karaoke music data and various services from a center apparatus via a public line, and a personal computer that can reproduce karaoke music data And the like can be implemented in the same manner as described above.

【００６３】[0063]

【発明の効果】以上説明してきたように、この発明の音
楽演奏装置によれば、一般に使用される音声入力手段を
音声認識用と歌唱用に共用できるようにすることができ
る。さらに、使用者からの音声による指示内容の認識精
度を向上させることもできる。As described above, according to the music performance apparatus of the present invention, the commonly used voice input means can be shared for voice recognition and singing. Further, it is possible to improve the accuracy of recognizing the instruction content by voice from the user.

[Brief description of the drawings]

【図１】この発明の一実施形態であるカラオケ装置の構
成を示すブロック図である。FIG. 1 is a block diagram showing a configuration of a karaoke apparatus according to an embodiment of the present invention.

【図２】この発明の他の実施形態であるカラオケ装置の
構成を示すブロック図である。FIG. 2 is a block diagram showing a configuration of a karaoke apparatus according to another embodiment of the present invention.

[Explanation of symbols]

１：マイク２：マイクアンプ部３：マイク信号切替部４：カラオケ再生部５：カラオケ曲データ記憶部６：ミキサー部７：制御部８：音声認識部９：スピーカ１０：表示部１１：辞書切替部１２：辞書記憶部１３：曲名辞書１４：コマンド辞書 1: microphone 2: microphone amplifier section 3: microphone signal switching section 4: karaoke playback section 5: karaoke song data storage section 6: mixer section 7: control section 8: voice recognition section 9: speaker 10: display section 11: dictionary switching Part 12: Dictionary storage part 13: Song name dictionary 14: Command dictionary

Claims

[Claims]

1. A music reproducing means for reproducing a music signal based on music data, a voice input means for inputting an audio signal, a music based on the music signal reproduced by the music reproducing means, and a music input by the voice input means. Music / speech output means for outputting the combined speech with the speech signal, speech recognition means for recognizing the speech instruction content based on the speech signal input by the speech input means, and the instruction content recognized by the means Means for executing processing corresponding to the above, operation mode identification means for identifying a plurality of stages of operation modes involved when the music data is not being reproduced and when the music data is being reproduced, and the sound corresponding to the operation mode identified by the means. Voice output for switching the output destination of the voice signal input by the input means to one of the music / voice output means and voice recognition means A music performance device comprising: a first switching means.

2. The music performance apparatus according to claim 1, further comprising means for outputting information indicating an output destination of the audio signal by said audio output destination switching means.

3. A music reproducing means for reproducing a music signal based on music data, a voice input means for inputting an audio signal, and music input by the music signal reproduced by the music reproducing means and input by the voice input means. Music / speech output means for outputting the combined speech with the speech signal, speech recognition dictionary storage means for storing a plurality of types of speech recognition dictionaries, and speech recognition dictionaries stored in the means. Voice recognition means for recognizing a voice instruction content based on a voice signal input by the voice input means, means for executing processing corresponding to the content of the instruction recognized by the voice input means, and reproducing the music data. Operation mode identification means for identifying a plurality of stages of operation modes involved in the absence and reproduction, and the voice recognition means according to the operation mode identified by the means. A voice recognition dictionary switching means for switching a type of a voice recognition dictionary in the voice recognition dictionary storage means referred to by a row.

4. The music performance apparatus according to claim 3, further comprising means for outputting information for notifying the type of the voice recognition dictionary switched by said voice recognition dictionary switching means.

5. A music reproducing means for reproducing a music signal based on music data, a voice input means for inputting an audio signal, and music input by the music signal reproduced by the music reproducing means and input by the voice input means. Music / speech output means for outputting the combined speech with the speech signal, speech recognition dictionary storage means for storing a plurality of types of speech recognition dictionaries, and speech recognition dictionaries stored in the means. Voice recognition means for recognizing a voice instruction content based on a voice signal input by the voice input means, means for executing processing corresponding to the content of the instruction recognized by the voice input means, and reproducing the music data. Operation mode identification means for identifying a plurality of stages of operation modes involved in the absence and reproduction of the voice input means; and the voice input means according to the operation mode identified by the means. Voice output destination switching means for switching the output destination of a voice signal input by a stage to one of the music / voice output means and voice recognition means; and the voice recognition in accordance with the operation mode identified by the operation mode identification means. And a voice recognition dictionary switching means for switching a type of the voice recognition dictionary in the voice recognition dictionary storage means referred to by the means.

6. The music performance device according to claim 5, wherein information for notifying an output destination of the audio signal by the audio output destination switching means and a type of the voice recognition dictionary switched by the voice recognition dictionary switching means are notified. A music performance device comprising means for outputting information.