JP2004246379A

JP2004246379A - Karaoke device

Info

Publication number: JP2004246379A
Application number: JP2004122846A
Authority: JP
Inventors: Kanehisa Tsurumi; 兼久鶴見
Original assignee: Yamaha Corp
Current assignee: Yamaha Corp
Priority date: 2004-04-19
Filing date: 2004-04-19
Publication date: 2004-09-02
Anticipated expiration: 2017-07-18
Also published as: JP3982514B2

Abstract

PROBLEM TO BE SOLVED: To provide a karaoke device in which relative merits in singing skill are graded.

SOLUTION: When first and second evaluation function computing sections 510 and 520 respectively compute grading results Q1a and Q1b based on difference data Diffa and Diffb, a first comparison section 550 compares the results Q1a and Q1b. When third and fourth evaluation function computing sections 530 and 540 respectively compute grading results Q2a and Q2b based on difference data Diffa and Diffb, a second comparison section 560 compares the results Q2a and Q2b. A random number generating section 570 generates random numbers M. A discrimination section 580 discriminates the relative merits of singing skill based on the comparison results of the sections 550 and 560 and the random numbers M. When either one of the comparison results of the sections 550 and 560 indicates coincidence, the relative merits of the singing skill are discriminated based on the other comparison result. When both comparison results indicate coincidence, the relative merits of the singing skill are discriminated based on the random numbers M.

COPYRIGHT: (C)2004,JPO&NCIPI
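The tie-breaking rule in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names, the convention that a larger grading result is better, the use of a number M in [0, 1) split at 0.5, and the precedence of the first comparison when both comparisons name a winner are all assumptions that the abstract does not state.

```python
def compare(a, b):
    """Return 1 if a is larger, -1 if b is larger, 0 on coincidence."""
    return (a > b) - (a < b)


def discriminate(q1a, q1b, q2a, q2b, m):
    """Decide the winner ("A" or "B") so that a draw never occurs.

    q1a, q1b: grading results compared by the first comparison section 550
    q2a, q2b: grading results compared by the second comparison section 560
    m: random number, assumed here to lie in [0, 1)
    """
    first = compare(q1a, q1b)
    second = compare(q2a, q2b)
    if first != 0:
        # Assumption: the first comparison takes precedence when it decides.
        return "A" if first > 0 else "B"
    if second != 0:
        # First comparison coincided, so the other comparison decides.
        return "A" if second > 0 else "B"
    # Both comparisons coincided: fall back to the random number M.
    return "A" if m < 0.5 else "B"
```

Because the last branch always returns a winner, the function cannot produce a draw, which is the property the abstract aims for.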

Description

The present invention relates to a karaoke apparatus having a function of scoring a user's singing ability.

Various karaoke apparatuses having a function of scoring a singer's ability have been developed. In general, this type of karaoke apparatus compares the singer's voice with a reference for the vocal part included in the karaoke music information in terms of volume, pitch, and the like, and scores the singing according to the size of the volume difference and the pitch difference. In addition, when a singer sings, the karaoke apparatus amplifies the singing voice input from a microphone and outputs it from a speaker.

One known type of song sung on a karaoke apparatus is the battle song, in which two singers sing at the same time and compete in singing ability. In a battle song, both singers are scored by the same evaluation function, and the winner is decided from the two scoring results. The outcome is displayed on a monitor. This livens up the atmosphere and lets the singers and the audience enjoy the singing more actively. However, if the two scoring results are identical, no winner can be decided and the contest ends in a draw, which halves the fun.

In addition, when two singers are scored independently, as in a battle song, the scoring and the win/lose decision are tied to the choice of microphone. However, because there was no way to tell which singer was using which microphone, the decision could cause confusion.

A duet song often consists of male singing sections sung only by the man, female singing sections sung only by the woman, and mixed singing sections in which both sing together. Singers who are not used to duets are sometimes unsure whether it is their turn to sing and become flustered.

The present invention has been made against this background, and its object is to ensure that a winner is always decided when two or more singers sing. Another object is to let a singer know which microphone he or she is singing into.

To solve the above problems, the invention according to claim 1 is a karaoke apparatus that plays a song and displays lyrics on a monitor based on music data, comprising: selection means for mixing or selecting a singing voice signal captured from a first microphone and a singing voice signal captured from a second microphone and outputting the result from a first output terminal and a second output terminal; first detection means for detecting a singing volume based on the singing voice signal output from the first output terminal; second detection means for detecting a singing volume based on the singing voice signal output from the second output terminal; display control means for varying the shape of a first character according to the singing volume detected by the first detection means and displaying it on the monitor, and for varying the size of a second character according to the singing volume detected by the second detection means and displaying it on the monitor; and control means for controlling, in synchronization and based on the music data, the switching of the selection means and the setting of the first and second characters.

In the invention according to claim 2, which depends on claim 1, when the control means detects that the music data consists of a mixed singing section sung by two singers and a solo singing section sung by one of the singers, it operates as follows. In the mixed singing section, it controls the selection means so that the mixed singing voice signal is output from the first and second output terminals, and varies the shapes of the first and second characters according to the singing volumes detected by the first and second detection means. In the solo singing section, it controls the selection means so that the singing voice signal from the first microphone is output from the first output terminal and the singing voice signal from the second microphone is output from the second output terminal, and varies the shape of the corresponding character according to the singing volume detected from the singing voice signal of the one singer.

According to the present invention, a winner can always be decided when two singers sing. Since the singing volume is displayed on the monitor, each singer can see his or her own singing volume. Furthermore, when two singers sing, each singer can recognize which microphone to sing into.

Embodiments of the present invention will be described below with reference to the drawings.

<A: Overall Configuration of Embodiment>
FIG. 1 is a block diagram showing the overall configuration of a karaoke apparatus according to one embodiment of the present invention. In the figure, reference numeral 30 denotes a CPU that controls each unit of the apparatus. Connected to the CPU 30 via a bus BUS are a ROM 31, a RAM 32, a hard disk drive (HDD) 37, a communication control unit 36, a remote control receiving unit 33, a display panel 34, a panel switch 35, a sound source device 38, an audio data processing unit 39, an effect DSP 40, a character display unit 43, an LD changer 44, a display control unit 45, and a voice processing DSP 49.

The ROM 31 stores an initial program needed to start the karaoke apparatus. When the apparatus is powered on, this initial program loads the system program and application programs stored on the HDD 37 into the RAM 32. The HDD 37 stores the system program, the application programs, a music data file 370 holding music data for about 10,000 songs played during karaoke performance, moving image data for animations played during battle songs, and various character data.

Here, the contents of the music data will be described with reference to FIGS. 2 to 4. FIG. 2 is a diagram showing the format of the music data for one song. FIGS. 3 and 4 are diagrams showing the contents of each track of the music data.
In FIG. 2, the music data consists of a header, a musical sound track, a guide melody track, a lyrics track, an audio track, an effect track, and an audio data section. The header holds various information about the music data, such as the song number, song title, genre, release date, and playing time (length) of the song.

Each of the tracks from the musical sound track through the effect track consists, as shown in FIGS. 3 and 4, of sequence data made up of multiple event data items and duration data Δt indicating the time interval between events. During a karaoke performance, the CPU 30 reads the data of all tracks in parallel using a sequence program (an application program for karaoke performance). When reading the sequence data of a track, it counts Δt using a predetermined tempo clock and, when the count completes, reads the event data that follows and outputs it to the appropriate processing unit. As shown in FIG. 3, the musical sound track contains tracks for various parts, including a melody track and a rhythm track.
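The parallel reading of Δt-delimited tracks described above can be sketched as below. This is an illustration under stated assumptions: each track is modeled as a list of hypothetical `(delta_t, event)` pairs in which Δt (in tempo-clock ticks) precedes its event, and the track names and event strings are invented for the example.

```python
import heapq


def absolute_ticks(name, track):
    """Convert (delta_t, event) pairs into (tick, track_name, event) tuples."""
    tick = 0
    for delta_t, event in track:
        tick += delta_t  # count off the duration before the next event
        yield (tick, name, event)


def read_in_parallel(tracks):
    """Merge several tracks into one stream ordered by tempo-clock tick."""
    streams = (absolute_ticks(name, track) for name, track in tracks.items())
    return list(heapq.merge(*streams))


tracks = {
    "melody": [(0, "note_on C4"), (4, "note_off C4")],
    "lyrics": [(2, "show line 1")],
}
merged = read_in_parallel(tracks)
```

Each per-track stream is already sorted by tick, so `heapq.merge` can interleave them lazily, which mirrors one sequencer dispatching events from all tracks against a shared clock.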

As shown in FIG. 4, the guide melody track holds sequence data for the melody of the vocal part of the karaoke song, that is, the melody the singer should sing. Based on this data, the CPU 30 generates reference pitch data and volume data and compares them with the singing voice. When there are multiple vocal parts (for example, a main melody and a chorus melody), as in a duet song, there is a guide melody track for each part.

The lyrics track consists of sequence data for displaying lyrics on the monitor 46. This sequence data is not musical sound data, but the track is also written in MIDI data format to unify the implementation and simplify the production workflow. The data type is the system exclusive message. A lyrics track normally consists of the character codes for one line of lyrics displayed on the monitor, their display coordinates on the monitor screen, the display time, and wipe sequence data. Wipe sequence data is sequence data for changing the display color of the lyrics as the song progresses; it records, in order over the length of one line, the timing at which the display color changes (the time since the lyrics were displayed) and the change position (coordinates).

The audio track is a sequence track that specifies, among other things, when each item of audio data n (n = 1, 2, 3, ...) stored in the audio data section is to be sounded. The audio data section stores human voices, such as backing chorus, that are difficult for the sound source device 38 to synthesize. The audio track holds audio designation data and duration data Δt, which specifies the interval at which the audio designation data is read, that is, the timing at which audio data is output to the audio data processing unit 39 to form an audio signal. The audio designation data consists of an audio data number, pitch data, and volume data. The audio data number is the identification number n of an item of audio data recorded in the audio data section. The pitch data and volume data specify the pitch and volume of the audio to be formed. Wordless backing chorus such as "ah" or "wa-wa-wa-wa" can be reused many times by changing its pitch and volume, so one copy is stored at a basic pitch and volume and is used repeatedly, shifted in pitch and volume according to this data. The audio data processing unit 39 sets the output level based on the volume data and sets the pitch of the audio signal by changing the read interval of the audio data based on the pitch data.
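Setting the pitch by changing the read interval amounts to resampling the stored waveform. The sketch below illustrates the idea only: the function name, the use of a semitone offset as the pitch data, linear interpolation, and a plain gain factor as the volume data are assumptions, since the text does not give the encoding of either value.

```python
def render_voice(samples, semitones, gain):
    """Re-read a stored waveform at a different interval to shift its pitch.

    samples: stored waveform at its basic pitch (list of numbers)
    semitones: assumed pitch offset; +12 reads twice as fast (one octave up)
    gain: assumed linear output-level factor derived from the volume data
    """
    step = 2.0 ** (semitones / 12.0)  # read interval relative to the original
    out = []
    pos = 0.0
    while pos < len(samples) - 1:
        i = int(pos)
        frac = pos - i
        # Linear interpolation between the two neighbouring stored samples.
        value = (1.0 - frac) * samples[i] + frac * samples[i + 1]
        out.append(gain * value)
        pos += step
    return out
```

Reading faster raises the pitch but also shortens the sound, which is why this simple scheme suits short, wordless chorus fragments.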

DSP control data for controlling the effect DSP 40 is written in the effect track. The effect DSP 40 applies reverberation-type effects such as reverb to the signals input from the sound source device 38 and the audio data processing unit 39. The DSP control data consists of data specifying the type of effect and data specifying the degree of the effect, such as delay time and echo level.
This music data is read from the HDD 37 at the start of a karaoke performance and loaded into the RAM 32.

Next, the memory map of the RAM 32 will be described with reference to FIG. 5. As shown in the figure, the RAM 32 contains a program storage area 324 for the loaded system program and application programs, an execution data storage area 323 for the music data used in the karaoke performance, a MIDI buffer 320 that temporarily stores the guide melody, a reference data register 321 that stores the reference data extracted from the guide melody, and a difference data storage area 322 that accumulates the difference data obtained by comparing the reference with the singing voice. The reference data register 321 consists of a pitch data register 321a and a volume data register 321b. The difference data storage area 322 consists of a pitch difference data storage area 322a and a volume difference data storage area 322b.

Returning to FIG. 1, the description of the configuration of the karaoke apparatus continues. In the figure, the communication control unit 36 downloads music data and the like from a host computer (not shown) via an ISDN line, and its internal DMA controller transfers the received music data directly to the HDD 37 without going through the CPU 30.
The remote control receiving unit 33 receives the infrared signal sent from the remote controller 51 and restores the input data. The remote controller 51 has command switches such as a song selection switch, numeric key switches, and the like. When the user operates one of these switches, the remote controller transmits an infrared signal modulated with a code corresponding to the operation.
The display panel 34 is provided on the front of the karaoke apparatus and displays the code of the song currently being played, the number of reserved songs, and the like. The panel switch 35 is also provided on the front of the karaoke apparatus and includes a song code input switch, a key change switch, and the like. The scoring function can be turned on or off from the remote controller 51 or the panel switch 35.

The sound source device 38 forms musical tone signals based on the data in the musical sound track of the music data. The music data is read by the CPU 30 during a karaoke performance, and the guide melody track, which serves as comparison data, is read in parallel with the musical sound track. The sound source device 38 reads the data of the individual tracks within the musical sound track in parallel and forms the musical tone signals of multiple parts simultaneously.

The audio data processing unit 39 forms an audio signal of a specified length and pitch based on the audio data included in the music data. The audio data consists of signal waveforms, such as backing chorus, that are difficult for the sound source device 38 to generate electronically, stored directly as ADPCM data. The musical tone signal formed by the sound source device 38 and the audio signal formed by the audio data processing unit 39 together make up the karaoke performance sound, which is input to the effect DSP 40. The effect DSP 40 applies effects such as reverb and echo to the karaoke performance sound. The karaoke performance sound with effects applied is converted into an analog signal by the D/A converter 41 and then output to the amplifier speaker 42.

Reference numerals 47a and 47b denote microphones for singing. The singing voice signals V1 and V2 input from the microphones 47a and 47b are amplified by a preamplifier (not shown) and then input to both the amplifier speaker 42 and the selector 48.

Under the control of the CPU 30, the selector 48 selects the singing voice signals V1 and V2 and outputs them to the voice processing DSP 49. The selector 48 has two switching modes: a straight mode, in which the singing voice signal V1 supplied to input terminal X1 is output from output terminal Y1 and the singing voice signal V2 supplied to input terminal X2 is output from output terminal Y2; and a mix mode, in which the singing voice signals V1 and V2 supplied to input terminals X1 and X2 are mixed and then output to output terminals Y1 and Y2.
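The two selector modes can be sketched as a small function. This is illustrative only: the name `select`, the string mode labels, and mixing by sample-wise addition are assumptions, since the text does not say how the two signals are mixed.

```python
def select(v1, v2, mode):
    """Route signals from inputs X1/X2 to outputs Y1/Y2.

    v1, v2: singing voice signals as equal-length lists of samples
    mode: "straight" passes each input to its own output;
          "mix" sends the same mixed signal to both outputs
    """
    if mode == "straight":
        return list(v1), list(v2)
    if mode == "mix":
        # Assumption: mixing is modeled as sample-wise addition.
        mixed = [a + b for a, b in zip(v1, v2)]
        return mixed, list(mixed)
    raise ValueError(f"unknown selector mode: {mode!r}")
```

In straight mode each singer can be scored separately, whereas in mix mode both outputs carry the combined voices.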

The mode is determined by the combination of the music data and the operation of the remote controller 51. For example, some songs include data for a harmony part, but whether to use the harmony function is left to the user. A user who wants to sing with the harmony function enters that choice on the remote controller 51, and both the harmony part and the main vocal part are played; if no such operation is made, only the main vocal part is played. In this case, straight mode is selected when the harmony function is used, and mix mode when it is not. In other words, the mode is selected according to the music data as configured by the user, including the various effects.

Each of the singing voice signals V1 and V2 input to the voice processing DSP 49 is converted into a digital signal and then undergoes signal processing for scoring. The voice processing DSP 49 together with the CPU 30 implements the function of the scoring processing unit 50 described later. The amplifier speaker 42 amplifies the input karaoke performance sound and the singing voice signals, applies effects such as echo to the singing voice signals, and then emits the sound from the speaker.

The voice processing DSP 49 also detects the levels of the digitized singing voice signals V1 and V2 and generates volume data. Based on this volume data, the CPU 30 varies the size of the characters displayed on the monitor 46. Specifically, as shown in FIG. 6, an animated human face may be used as a character, with the size of the face varied according to the level indicated by the volume data. The character data is, for example, transmitted from outside and stored on the HDD 37 via the communication control unit 36. The CPU 30 reads the character data from the HDD 37 and expands it in the RAM 32. It then applies image processing to the character data, using the volume data as a magnification, to generate display data. The display data is transferred to the display control unit 45.
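Using the volume data as a magnification can be sketched as a mapping from level to scale factor. The level range of 0 to 127, the scale limits, and the function names below are all assumptions chosen for illustration; the text only says that the volume data serves as the magnification.

```python
def character_scale(level, max_level=127, min_scale=0.5, max_scale=2.0):
    """Map a volume level to a magnification for the character image."""
    level = max(0, min(level, max_level))  # clamp out-of-range levels
    return min_scale + (max_scale - min_scale) * level / max_level


def scaled_size(base_width, base_height, level):
    """Return the character's display size in pixels for a volume level."""
    scale = character_scale(level)
    return round(base_width * scale), round(base_height * scale)
```

Calling `scaled_size` once per microphone with that microphone's own level gives each singer a separate character that grows as they sing louder.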

This makes it possible to show the input level of the singing voice on the monitor 46 by means of a character. If the singers pick the wrong microphones 47a and 47b when singing a duet song, accurate scoring is not possible, and if they pick the wrong ones when singing a battle song, the scoring results are swapped. In this embodiment, therefore, two kinds of characters are prepared and associated with the microphones 47a and 47b respectively. When a singer vocalizes into microphone 47a or 47b, the size of the corresponding character changes. The singer can thus easily check whether he or she is about to sing with the correct microphone.

Next, when a character code is input, the character display unit 43 shown in FIG. 1 reads the corresponding font data, such as a song title or lyrics, from an internal ROM (not shown) and outputs it. The LD changer 44 plays the background video of the corresponding LD based on the input video selection data (chapter number). The video selection data is determined from the genre data of the karaoke song. This genre data is written in the header of the music data and is read by the CPU 30 at the start of the karaoke performance. The CPU 30 decides which background video to play based on the genre data and outputs video selection data designating that background video to the LD changer 44. The LD changer 44 holds about five laser discs and can play about 120 scenes of background video. One background video is selected from these by the video selection data and output as video data. This video data and the font data such as lyrics output from the character display unit 43 are superimposed by the display control unit 45, and the composite image is displayed on the monitor 46. When the scoring processing unit 50 calculates a scoring result, a character corresponding to the result is output from the character display unit 43 and displayed on the monitor 46.

When the display data indicating the microphone input level is transferred to the display control unit 45, the display control unit 45 superimposes that display data on the video data. Furthermore, when a battle song is sung, moving image data showing a scene in which animated characters fight is read from the HDD 37 and supplied to the display control unit 45. When the scoring processing unit 50 calculates a scoring result, a character corresponding to the result is output from the character display unit 43 and displayed on the monitor 46.

<B: Scoring Processing Unit 50>
Next, the scoring processing unit 50 of this embodiment will be described. The scoring processing unit 50 is implemented by hardware such as the voice processing DSP 49 and the CPU 30 described above, together with scoring software. FIG. 7 is a block diagram showing the configuration of the scoring processing unit 50. In the figure, the scoring processing unit 50 consists of a first scoring unit 50A, a second scoring unit 50B, a combining unit 50C, and an evaluation unit 50D.
The first and second scoring units 50A and 50B are made up of a pair of A/D converters 501a and 501b, data extraction units 502a and 502b, comparison units 503a and 503b, and filters 504a and 504b.

The A/D converters 501a and 501b each convert a singing voice signal output from the selector 48 into a digital signal. The data extraction units 502a and 502b extract pitch data and volume data from each digitized singing voice signal every 100 ms. The comparison units 503a and 503b compare the pitch data and volume data extracted from each singing voice signal with the pitch data and volume data of the reference melody data #A and #B respectively, calculate the differences, and output them as difference data Diffa and Diffb.

Here, the difference data Diffa and Diffb consist of the following data.
Ti: measurement time data (measured in relative time of the performance clock)
ΔT: duration data (time elapsed since the previous measurement time)
Mi: reference melody state data (whether singing is required; "1" in a singing section, "0" in a non-singing section)
Si: singing state data (presence or absence of singing; "1" while singing, "0" while not singing)
Fi: pitch difference data (pitch difference on a log scale, in cents)
Li: volume difference data (volume difference on a log scale, in dB)
Here, "i" indicates the i-th sample.
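One difference-data sample can be modeled as a small record, with the cent and dB conversions written out. The field and function names are hypothetical, and treating the pitch data as a frequency in Hz and the volume data as a linear amplitude is an assumption. The cent formula (1200 times the base-2 logarithm of the frequency ratio) and the amplitude dB formula (20 times the base-10 logarithm of the amplitude ratio) are the standard definitions of those units.

```python
import math
from dataclasses import dataclass


@dataclass
class DiffSample:
    t: int      # Ti: measurement time in performance-clock units
    dt: int     # ΔT: time since the previous measurement
    m: int      # Mi: 1 in a singing section of the reference, else 0
    s: int      # Si: 1 while the user is singing, else 0
    f: float    # Fi: pitch difference in cents
    l: float    # Li: volume difference in dB


def pitch_diff_cents(sung_hz, ref_hz):
    """Pitch difference on a log scale; 1200 cents equal one octave."""
    return 1200.0 * math.log2(sung_hz / ref_hz)


def volume_diff_db(sung_amp, ref_amp):
    """Volume difference on a log scale for linear amplitudes."""
    return 20.0 * math.log10(sung_amp / ref_amp)


sample = DiffSample(t=100, dt=100, m=1, s=1,
                    f=pitch_diff_cents(880.0, 440.0),
                    l=volume_diff_db(10.0, 1.0))
```

On these scales a ratio becomes a simple difference, so errors of the same musical size look the same regardless of the absolute pitch or loudness.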

Because the pitch difference data Fi and the volume difference data Li are expressed on a log scale, the computation in the downstream combining unit 50C can be simplified.
The reference melody state data Mi is generated by the CPU 30 based on the music data for each part recorded in the guide melody track. Specifically, it is generated from the note-on and note-off statuses in that music data.
The singing state data Si is generated by each of the comparison units 503a and 503b by comparing the volume data supplied from the data extraction units 502a and 502b with a predetermined threshold. The threshold is set to a level at which it is possible to determine whether the user is singing.

The singing voice data, the reference data, and the difference data Diff will now be described with reference to FIG. 8. FIGS. 8(A) and 8(B) show an example of a guide melody used as the reference. FIG. 8(A) shows the guide melody in staff notation, and FIG. 8(B) shows the content of that notation converted into pitch data and volume data with a gate time of about 80 percent. The volume rises and falls according to the marking mp → crescendo → mp. FIG. 8(C), in contrast, shows an example of a singing voice. Both the pitch and the volume deviate slightly from the values given by the reference. As shown in the figure, the singing state data Si is "1" when the volume data exceeds the threshold and "0" otherwise. The evaluation unit 50D, described later, does not treat samples whose singing state data Si is "0" as valid samples. Low-volume portions are ignored in this way because, in those sections, noise accounts for a larger proportion of the pitch difference data Fi and the volume difference data Li, which would degrade the scoring accuracy.

The pitch difference data Fi and the volume difference data Li normally vary within a certain range. When these values change abruptly, it can be assumed that an erroneous calculation occurred, for example because of a malfunction caused by noise. If singing ability were scored from pitch difference data Fi and volume difference data Li affected by noise, the singer's ability could not be evaluated fairly. The filters 504a and 504b are provided to invalidate the pitch difference data Fi and the volume difference data Li in such cases.

Each of the filters 504a and 504b contains a buffer, a subtractor, and a comparator. The buffer stores the pitch difference data F(i-1) and the volume difference data L(i-1) calculated for the immediately preceding sample. When the pitch difference data Fi and the volume difference data Li for the current sample are input, the subtractor calculates ΔLi = |Li - L(i-1)| and ΔFi = |Fi - F(i-1)|. The comparator compares ΔLi and ΔFi with predetermined thresholds Lr and Fr, respectively, and outputs a control signal that is "1" when a value exceeds its threshold and "0" when it falls below it. The thresholds are determined from various measured data so that invalid samples can be identified. When the control signal is "1", the filters 504a and 504b invalidate the current pitch difference data Fi and volume difference data Li.
As a result, samples that change greatly from the previous sample are invalidated, and the singer's ability can be evaluated fairly.
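The buffer, subtractor, and comparator arrangement can be sketched as follows. This is only an illustration under stated assumptions: the class name and threshold values are hypothetical, a sample is rejected when either difference exceeds its threshold (the description does not say whether one or both must exceed), the first sample is always accepted, and the buffer is updated even when a sample is rejected.

```python
class SpikeFilter:
    """Invalidates samples that jump too far from the previous sample."""

    def __init__(self, pitch_limit, volume_limit):
        self.pitch_limit = pitch_limit    # Fr (hypothetical value, cents)
        self.volume_limit = volume_limit  # Lr (hypothetical value, dB)
        self.prev = None                  # buffer: (F(i-1), L(i-1))

    def accept(self, pitch_cents, volume_db):
        """Return True if the sample is valid, False if invalidated."""
        valid = True
        if self.prev is not None:
            # Subtractor: absolute change from the previous sample.
            delta_f = abs(pitch_cents - self.prev[0])
            delta_l = abs(volume_db - self.prev[1])
            # Comparator: control signal "1" means invalidate.
            # Assumption: either threshold being exceeded is enough.
            control = delta_f > self.pitch_limit or delta_l > self.volume_limit
            valid = not control
        # Assumption: the buffer always holds the latest input.
        self.prev = (pitch_cents, volume_db)
        return valid
```

Under these assumptions a single spike rejects two samples: the spike itself and the sample that follows it, because the return to normal is also a large jump. A variant that keeps the last valid sample in the buffer would reject only the spike.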

Next, the combining unit 50C refers to the measurement time data Ti, combines the difference data Diffa and Diffb that share the same time, and generates combined difference data Diffc. In addition to the measurement time data Ti and the duration data ΔT, the combined difference data Diffc consists of combined reference melody state data Mi', combined singing state data Si', combined pitch difference data Fi', and combined volume difference data Li'.

Let the suffix "1" denote data belonging to the difference data Diffa and the suffix "2" denote data belonging to the difference data Diffb. The combined reference melody state data Mi' is then calculated as the logical OR of Mi1 and Mi2, and the combined singing state data Si' as the logical OR of Si1 and Si2. The combined pitch difference data Fi' and the combined volume difference data Li' are calculated by the following formulas, depending on Mi1, Mi2, Si1, and Si2.

1) When Mi1 * Mi2 * Si1 * Si2 = 1
In this case, for both scoring units the sample lies in a valid singing section and the singer is singing. The average of the difference data is therefore calculated.
Fi' = (Fi1 + Fi2) / 2
Li' = (Li1 + Li2) / 2

2) When Mi1 * Si1 = 1 and Mi2 * Si2 = 0
In this case, the sample handled by the second scoring unit 50B falls in a non-singing section or in a period when the singer is not singing, whereas the sample handled by the first scoring unit 50A falls in a valid singing section while the singer is singing. The difference data Diffb is therefore ignored.
Fi' = Fi1
Li' = Li1

3) When Mi1 * Si1 = 0 and Mi2 * Si2 = 1
In this case, the sample handled by the first scoring unit 50A falls in a non-singing section or in a period when the singer is not singing, whereas the sample handled by the second scoring unit 50B falls in a valid singing section while the singer is singing. The difference data Diffa is therefore ignored.
Fi' = Fi2
Li' = Li2
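The three cases can be sketched as a single function. The function name and argument order are illustrative. The description does not define the result when neither input is valid, so returning `None` for the combined differences in that case is an assumption.

```python
def combine(m1, s1, f1, l1, m2, s2, f2, l2):
    """Combine one pair of same-time samples into (Mi', Si', Fi', Li')."""
    m = m1 | m2            # Mi' = Mi1 OR Mi2
    s = s1 | s2            # Si' = Si1 OR Si2
    valid1 = m1 & s1       # input 1: singing section and singer singing
    valid2 = m2 & s2       # input 2: singing section and singer singing
    if valid1 and valid2:
        # Case 1: both valid, take the average.
        f, l = (f1 + f2) / 2, (l1 + l2) / 2
    elif valid1:
        # Case 2: ignore Diffb.
        f, l = f1, l1
    elif valid2:
        # Case 3: ignore Diffa.
        f, l = f2, l2
    else:
        # Assumption: nothing to score for this sample.
        f, l = None, None
    return m, s, f, l
```

For example, `combine(1, 1, 10.0, 2.0, 1, 0, 90.0, 8.0)` falls under case 2 and passes the first input through unchanged.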

With the combining unit 50C configured in this way, if, for example, the male singer sings correctly in a mixed singing section of a duet song and the female singer does not sing, the portion in which the female singer did not sing is excluded from scoring, and the ability of the male singer who sang correctly is taken as the ability of the pair.
Likewise, in a solo singing section of a duet song, a voice that is not supposed to be singing is not scored, so an accurate scoring result can be obtained from the intended singing voice alone.

Next, the evaluation unit 50D, which includes a storage unit and other elements (not shown), calculates a scoring result from the difference data Diffa and Diffb or from the combined difference data Diffc. When the difference data Diffa and Diffb or the combined difference data Diffc is input, it is accumulated in the storage unit (that is, the difference data storage area 322 of the RAM 32). The CPU 30 controls which of Diffa, Diffb, and Diffc is accumulated in the storage unit. This accumulation continues throughout the performance of the song.

When the performance of the song ends, the evaluation unit 50D sequentially reads out the difference data accumulated in the storage unit, accumulates it separately for each musical element (pitch and volume), and derives a deduction for each element from its accumulated value. It then subtracts each deduction from a perfect score (100 points) to obtain a score for each musical element and outputs the average of these scores as the scoring result.
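A minimal sketch of this end-of-song calculation follows. The description does not state how an accumulated value is turned into a deduction, so the rule used here (mean absolute difference multiplied by a per-element rate, capped at 100) and the rate values are assumptions for illustration. Samples with Si = 0 are skipped, as described earlier for the evaluation unit.

```python
def element_score(diffs, points_per_unit):
    """Score one musical element from its list of differences."""
    total = sum(abs(d) for d in diffs)          # accumulated value
    mean = total / len(diffs) if diffs else 0.0
    # Assumption: deduction is proportional to the mean, capped at 100.
    deduction = min(100.0, mean * points_per_unit)
    return 100.0 - deduction                    # subtract from full marks


def song_score(samples, pitch_rate=0.5, volume_rate=2.0):
    """samples: iterable of (Si, Fi, Li); rates are hypothetical."""
    valid = [(f, l) for (s, f, l) in samples if s == 1]
    pitch = element_score([f for f, _ in valid], pitch_rate)
    volume = element_score([l for _, l in valid], volume_rate)
    return (pitch + volume) / 2                 # average of element scores
```

With `[(1, 20.0, 2.0), (1, -20.0, -2.0), (0, 500.0, 30.0)]` the third sample is ignored, the pitch score is 90, the volume score is 96, and the result is 93.0.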

Among the songs sung on a karaoke apparatus are battle songs, which are provided so that two singers can compete in singing ability. When a battle song is sung, the selector 48 is set to the straight mode, the first scoring unit 50A and the second scoring unit 50B score separately, and difference data Diffa and Diffb are generated. In this case, the evaluation unit 50D can be represented by the block diagram shown in FIG. 9.

As shown in the figure, the evaluation unit 50D consists of first to fourth evaluation function calculation units 510 to 540, first and second comparison units 550 and 560, a random number generation unit 570, and a determination unit 580. The first and second evaluation function calculation units 510 and 520 score using an evaluation function Q1(X) and calculate scoring results Q1a and Q1b. The third and fourth evaluation function calculation units 530 and 540 score using an evaluation function Q2(X) and calculate scoring results Q2a and Q2b. The evaluation functions Q1(X) and Q2(X) differ from each other. For example, if Q1(X) evaluates with emphasis on the volume difference and Q2(X) evaluates with emphasis on the pitch difference, the evaluation functions are expressed as follows.
Q1(X) = k1 * Fi + k2 * Li
Q2(X) = k3 * Fi + k4 * Li
where k1 > k3 and k2 < k4.

The scoring results Q1a, Q1b, Q2a, and Q2b are expressed as follows.
Q1a = 100 - (k1 * Fi1 + k2 * Li1)
Q1b = 100 - (k1 * Fi2 + k2 * Li2)
Q2a = 100 - (k3 * Fi1 + k4 * Li1)
Q2b = 100 - (k3 * Fi2 + k4 * Li2)

Next, the first comparison unit 550 compares the scoring results Q1a and Q1b and determines which is larger. There are three possible comparison results: Q1a larger, Q1b larger, and equal. In the first comparison unit 550, Q1a = Q1b when Fi1 = Fi2 and Li1 = Li2, or when the following equation holds.
k1 / k2 = (Li2 - Li1) / (Fi1 - Fi2)
In these cases the scoring results are identical, so it is not possible to decide which singer is better. A battle song, however, is sung so that the result livens up the karaoke mood for the singers and the audience. If the result is a draw, the singing loses much of its interest. In the present embodiment, therefore, the third and fourth evaluation function calculation units 530 and 540 are provided so that singing ability can also be evaluated with the second evaluation function Q2(X).

Next, the second comparison unit 560 compares the scoring results Q2a and Q2b and determines which is larger. There are three possible comparison results: Q2a larger, Q2b larger, and equal. The random number generation unit 570 generates a binary random number and outputs its least significant bit as the random number M. The random number may be generated using, for example, an M-sequence code generation circuit.
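An M-sequence (maximal-length sequence) is commonly produced with a linear feedback shift register. The sketch below uses a 4-bit register with feedback polynomial x^4 + x^3 + 1, which cycles through all 15 non-zero states. The register width, polynomial, seed, and class name are choices made for this illustration and are not taken from the description.

```python
class MSequence4:
    """4-bit Fibonacci LFSR, polynomial x^4 + x^3 + 1 (period 15)."""

    def __init__(self, seed=0b0001):
        if not 1 <= seed <= 0b1111:
            raise ValueError("seed must be a non-zero 4-bit value")
        self.state = seed

    def next_bit(self):
        """Advance one step and return the register's least significant bit."""
        # Feedback is the XOR of the two lowest bits of the register.
        feedback = (self.state ^ (self.state >> 1)) & 1
        # Shift right and feed the new bit in at the top.
        self.state = (self.state >> 1) | (feedback << 3)
        return self.state & 1   # used as the random number M
```

Starting from seed 1, the register visits 8, 4, 2, 9, and so on, and returns to 1 after 15 steps. A practical circuit would use a much longer register.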

Next, the determination unit 580 decides which singer is better on the basis of the comparison results of the first and second comparison units 550 and 560 and the random number M. First, when the comparison result of the first comparison unit 550 is Q1a larger or Q1b larger, the decision is based on that result. If Q1a is larger, the determination unit generates a result in which the singer using the microphone 47a wins. If Q1b is larger, it generates a result in which the singer using the microphone 47b wins.

When the comparison result of the first comparison unit 550 is equal and the comparison result of the second comparison unit 560 is Q2a larger or Q2b larger, the decision is based on the result of the second comparison unit 560. Specifically, if Q2a is larger, the singer using the microphone 47a wins, and if Q2b is larger, the singer using the microphone 47b wins.

When both the first and second comparison units 550 and 560 report equal results, the determination unit 580 decides on the basis of the random number M. Specifically, if the random number M is "1", the singer using the microphone 47a wins, and if it is "0", the singer using the microphone 47b wins.

Thus, when a battle song is sung, even if the two singers receive the same evaluation under the evaluation function Q1(X), the better singer can be determined with the evaluation function Q2(X). Even if the evaluations under both Q1(X) and Q2(X) are the same, a winner can still be decided by the random number M. As a result, a winner is always decided, which livens up the singing.
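The three-stage decision can be sketched as follows. The integer weights are hypothetical values chosen only to satisfy k1 > k3 and k2 < k4 and to keep the arithmetic exact. Treating Fi and Li as non-negative accumulated magnitudes is also an assumption. The return values "a" and "b" stand for the singers on microphones 47a and 47b.

```python
def compare(score_a, score_b):
    """Comparison unit: 'a' larger, 'b' larger, or 'tie'."""
    if score_a > score_b:
        return "a"
    if score_b > score_a:
        return "b"
    return "tie"


def score(f, l, k_pitch, k_volume):
    """Scoring result: full marks minus the weighted differences."""
    return 100 - (k_pitch * f + k_volume * l)


def judge(f1, l1, f2, l2, m, k1=3, k2=2, k3=2, k4=3):
    """Return 'a' or 'b'; m is the random bit M (1 means 'a' wins)."""
    # Stage 1: first comparison unit, evaluation function Q1.
    first = compare(score(f1, l1, k1, k2), score(f2, l2, k1, k2))
    if first != "tie":
        return first
    # Stage 2: second comparison unit, evaluation function Q2.
    second = compare(score(f1, l1, k3, k4), score(f2, l2, k3, k4))
    if second != "tie":
        return second
    # Stage 3: both tied, so the random number decides.
    return "a" if m == 1 else "b"
```

For instance, `judge(12, 10, 10, 13, m=0)` ties under Q1 (44 against 44) because k1 * (Fi1 - Fi2) equals k2 * (Li2 - Li1), but Q2 gives 46 against 41, so "a" wins without consulting M.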

<C: Operation of the Embodiment>
Next, the operation of the present embodiment will be described. In these examples, unless otherwise noted, the singer is assumed to be singing in every section that should be sung, so that the singing state data Si = 1.

<C-1: When singing a battle song>
First, the case in which two singers sing a battle song will be described with reference to FIG. 10. As shown in FIG. 10, the battle song in this example consists of prelude and interlude sections t1 and t5 and first to third singing sections t2 to t4. Because each singer must be scored individually, the selector 48 is set to the straight mode, as shown in FIG. 10(B).

When a battle song is selected with the remote controller 51, the CPU 30 detects this from the signal supplied by the remote control receiving unit 33. The CPU 30 then reads a plurality of character data from the HDD 37 and displays the characters on the monitor 46. Each singer selects a preferred character from those displayed on the monitor 46 by operating the remote controller 51. The CPU 30 then displays the competing characters at the upper left and upper right of the screen of the monitor 46. The size of each character changes according to the input level of the singing voice, so each singer can identify his or her own character by speaking into the microphone 47a or 47b.

The same reference melody data #A is supplied to the first scoring unit 50A and the second scoring unit 50B shown in FIG. 7. When the singing voice signals V1 and V2 are input to the first and second scoring units 50A and 50B, the two units generate the difference data Diffa and Diffb, respectively. Because each singer must be scored individually in this case, the evaluation unit 50D generates one scoring result from the difference data Diffa and another from the difference data Diffb. On that basis it decides which singer is better for each singing section and makes an overall decision at the end of the song. An animation is then displayed on the monitor 46 according to the determination result.

The display operation of the monitor 46 in each of the singing sections t2 to t4 is as follows. When the CPU 30 detects a singing section from the presence or absence of guide melody data in the song data, it reads moving image data from the HDD 37 and displays it on the monitor 46. As shown in FIG. 10(C), this moving image data represents a scene S1 in which the characters are fighting. The characters used here are the same as those whose size indicates the microphone input level. The CPU 30 therefore reads the moving image data from the HDD 37 according to the pair of characters specified through the remote controller 51 as described above.

At the end of each of the singing sections t2 to t4, moving image data showing which character won or lost is displayed according to the determination result generated by the determination unit 580. For example, if two singers compete using a boy character and a girl character and the singer using the girl character sings better, a scene S2 in which the girl wins is displayed on the monitor 46, as shown in FIG. 10(C). To do this, the CPU 30 reads the moving image data from the HDD 37 according to the determination result and the pair of characters.

<C-2: When singing a normal song>
Next, the case in which one singer sings a normal song will be described. The difference data could be generated by just one of the scoring units, but in the present embodiment, to reduce noise, the first and second scoring units 50A and 50B process the signal simultaneously and the score is based on the average of their outputs.
For this purpose, the selector 48 is set to the mix mode, and the same reference melody data #A is supplied to the first scoring unit 50A and the second scoring unit 50B. The combining unit 50C calculates the average of the difference data Diffa and the difference data Diffb and outputs it as the combined difference data Diffc.

In general, the noise component is random noise, so averaging the two outputs reduces it by 3 dB. The signal component, in contrast, is unchanged by averaging. The S/N ratio of the combined pitch difference data Fi' and the combined volume difference data Li' in the combined difference data Diffc is therefore 3 dB better than that of the difference data Diffa and Diffb.
This reduces noise components caused by, for example, quantization errors in the A/D converters 501a and 501b and errors in pitch detection, so singing ability can be scored with high accuracy.
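The 3 dB figure follows from averaging two noise sources that are independent, zero-mean, and of equal power: the noise power of the average is half that of either input, and 10 * log10(2) is about 3.01 dB. The short simulation below checks this numerically. The Gaussian noise model, sample count, and seed are assumptions made for the illustration.

```python
import math
import random


def noise_power(values):
    """Mean square of a zero-mean noise sequence."""
    return sum(v * v for v in values) / len(values)


def averaging_gain_db(n=50_000, seed=1):
    """Noise reduction in dB from averaging two independent noise channels."""
    rng = random.Random(seed)
    # Independent unit-variance noise in each scoring path (assumption).
    noise_a = [rng.gauss(0.0, 1.0) for _ in range(n)]
    noise_b = [rng.gauss(0.0, 1.0) for _ in range(n)]
    averaged = [(a + b) / 2 for a, b in zip(noise_a, noise_b)]
    return 10.0 * math.log10(noise_power(noise_a) / noise_power(averaged))


if __name__ == "__main__":
    print(f"noise reduction: {averaging_gain_db():.2f} dB")
```

The improvement holds only to the extent that the errors in the two paths are uncorrelated. Any error common to both paths is not reduced by averaging.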

<C-3: When singing a duet song>
Next, the case in which a male singer and a female singer sing a duet song will be described. A duet song generally contains male singing sections in which only the man sings, female singing sections in which only the woman sings, mixed singing sections in which both sing at the same time, and prelude and interlude sections in which neither sings. In a mixed section both sing at once, so scoring must be performed separately in the first and second scoring units 50A and 50B. In a male or female singing section, scoring would be possible with difference data from just one unit. In the present embodiment, however, both scoring units generate difference data in this case as well, and the combining unit 50C averages them to obtain the combined difference data, in order to improve scoring accuracy.

This will be described concretely with reference to FIG. 11. In this example, the man sings with the microphone 47a and the woman sings with the microphone 47b. FIG. 11(A) shows an example of how a duet song progresses. The duet song in this example proceeds in the order prelude section T1 → male singing section T2 → female singing section T3 → mixed singing section T4 → interlude section T5. FIG. 11(B) shows the mode of the selector 48, and FIG. 11(C) shows the character display. In the following, #M denotes the reference melody data for the male part and #W the reference melody data for the female part. The CPU 30 identifies each section from the section information in the song data.

First, the prelude section T1 and the interlude section T5 are not singing sections, so there is no guide melody and they are excluded from scoring. The selector 48 could therefore be in either the straight mode or the mix mode, but it is set to the straight mode so that the microphones 47a and 47b can be checked easily. A duet song is generally sung by a man and a woman together. Preset male and female characters are therefore displayed on the monitor 46, and the characters are changed through the remote controller 51 only when a singer wishes to change them. In this example, the microphone 47a corresponds to the male character and the microphone 47b to the female character.

In the prelude section T1 and the interlude section T5, the characters Ca and Cb are displayed at the upper left and upper right of the monitor 46, as shown in FIG. 11(C). If a singer speaks into the microphone 47a, the male character Ca grows larger and changes to the character Ca'. Thus, if the female singer speaks into the microphone 47a, she can see that the microphones have been mixed up.

Next, in the male singing section T2, the selector 48 is set to the mix mode. The CPU 30 controls the selector 48 so that its input terminal X1 is connected to the output terminals Y1 and Y2 and its input terminal X2 is left open. The male singing voice signal V1 output from the microphone 47a is therefore supplied to both the first scoring unit 50A and the second scoring unit 50B. In this section, the reference melody data #M is supplied to the first and second scoring units 50A and 50B. The male singing voice signal V1 is thus compared with the male-part reference melody data #M by the two scoring units 50A and 50B, and the combining unit 50C generates the average of the results. The evaluation unit 50D scores the section from the combined difference data Diffc supplied by the combining unit 50C. The combined difference data Diffc in this case has a better S/N ratio than the difference data Diffa and Diffb.

This section is for the man alone, but singers who are not used to the song may not realize this. In the present embodiment, therefore, a character Cbs, a reduced version of the female character Cb, is displayed on the monitor 46 so that the singers can see that this is a section for the man to sing. The data for the character Cbs is generated by the CPU 30, which applies image reduction processing to the data read from the HDD 37. The size of the character Cbs does not change even if someone speaks into the microphone 47b.

Next, in the female singing section T3, the selector 48 is set to the mix mode, as in the male singing section T2, but its internal connections differ. The CPU 30 controls the selector 48 so that its input terminal X2 is connected to the output terminals Y1 and Y2 and its input terminal X1 is left open. The male singing voice signal V1 is therefore not output from the selector 48. In a section where only one of the two singers should sing, the two singing voice signals are not mixed and output to the output terminals Y1 and Y2. Instead, the input from the other microphone is left open. The reason is that if, for example, the man claps along during the female singing section T3, the clapping would be mixed in as noise and the woman's singing ability could not be evaluated fairly.

When the female singing voice signal V2 is supplied to the first and second scoring units 50A and 50B in this way, the two units compare it with the reference melody data #W. The comparison results are averaged by the combining unit 50C and output as the combined difference data Diffc, and the evaluation unit 50D scores the section from it. As in the male singing section T2, the combined difference data Diffc has a better S/N ratio than the difference data Diffa and Diffb.

Conversely to the male singing section, a character Cas, a reduced version of the male character Ca, is displayed on the monitor 46 in this section so that the singers can see that it is a section for the woman alone (see FIG. 11(C)). The data for the character Cas is generated by the CPU 30, which applies image reduction processing to the data read from the HDD 37. The size of the character Cas does not change even if someone speaks into the microphone 47a, just as the size of the character Cbs does not change in the male singing section.

Next, in the mixed singing section, the selector 48 is set to the straight mode. In this case, the CPU 30 controls the selector 48 so that its input terminal X1 is connected to the output terminal Y1 and its input terminal X2 is connected to the output terminal Y2. The male singing voice signal V1 is therefore supplied to the first scoring unit 50A, and the female singing voice signal V2 to the second scoring unit 50B. In this section, the reference melody data #M and #W are supplied to the first and second scoring units 50A and 50B, respectively. The first and second scoring units 50A and 50B consequently output different difference data Diffa and Diffb. The combining unit 50C calculates the average of the two and generates the combined difference data Diffc.
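The routing performed by the selector 48 in the three kinds of section can be summarized in a minimal Python sketch. The names `Section` and `route` are hypothetical and used only for illustration. The connection for the male singing section (X1 to both outputs, reference #M to both scoring units) is assumed here by symmetry with the female singing section T3.

```python
from enum import Enum


class Section(Enum):
    MALE = "male"      # only the male part is to be sung (T2)
    FEMALE = "female"  # only the female part is to be sung (T3)
    MIXED = "mixed"    # both parts are sung together


def route(section, v1, v2):
    """Return (y1, y2, ref1, ref2).

    y1, y2     : signals at output terminals Y1 and Y2 of the selector
    ref1, ref2 : reference melody data given to scoring units 50A and 50B
    """
    if section is Section.MALE:
        # Mix mode with X2 left open (assumed by symmetry with T3).
        return v1, v1, "#M", "#M"
    if section is Section.FEMALE:
        # Mix mode with X1 left open: V1 never reaches the outputs.
        return v2, v2, "#W", "#W"
    # Straight mode: X1 -> Y1 and X2 -> Y2.
    return v1, v2, "#M", "#W"
```

For example, `route(Section.FEMALE, "V1", "V2")` sends only the female signal to both scoring units, so clapping picked up by the other microphone cannot affect the score.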

If the woman does not sing during part of this section, then for that period the combining unit 50C does not calculate the average but instead outputs the pitch difference data Fi1 and the volume difference data Li1 generated by the first scoring unit 50A as the combined difference data Diffc. The overall score for that period is thus based on the man's singing alone.
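The behavior of the combining unit 50C in the mixed singing section can be sketched as follows. This is only an illustration under stated assumptions: `Diff` and `combine` are hypothetical names, a silent singer is represented by `None`, and the case in which the man is the one who does not sing is handled symmetrically, which the text does not state explicitly.

```python
from typing import NamedTuple, Optional


class Diff(NamedTuple):
    pitch: float   # pitch difference data (Fi in the text)
    volume: float  # volume difference data (Li in the text)


def combine(diff_a: Optional[Diff], diff_b: Optional[Diff]) -> Optional[Diff]:
    """Combine the outputs of scoring units 50A and 50B into Diffc.

    None stands for "this singer is not singing in this period"
    (an assumption made for this sketch).
    """
    if diff_a is None:
        return diff_b  # symmetric case, assumed
    if diff_b is None:
        return diff_a  # only 50A's data is used, as described above
    # Both singers are singing: take the average of the two.
    return Diff((diff_a.pitch + diff_b.pitch) / 2,
                (diff_a.volume + diff_b.volume) / 2)
```

Passing the silent singer's data through the average would pull the combined result toward a large difference, which is why the fallback uses the singing party's data alone.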

In the mixed singing section, the normal-size characters Ca and Cb are displayed at the upper left and upper right of the monitor 46. The sizes of the characters Ca and Cb vary according to the level of the singing voice.

As described above, according to this embodiment, a battle song is scored using a plurality of evaluation functions to determine the winner, so draws become less frequent. Furthermore, when the scoring results of all the evaluation functions coincide, the winner is determined using the random number M, so draws are eliminated entirely.
In addition, since the input levels of the microphones 47a and 47b are displayed as the sizes of the characters, the volume can be seen at a glance. The singers can also easily tell which microphone is to be sung into.
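The draw-free determination for a battle song can be illustrated with a short Python sketch. It is not the embodiment's implementation: the function name `judge`, the labels "A" and "B", the rule that a higher result wins, the priority given to the first evaluation function when both comparisons are decisive, and the mapping of the random number to a winner are all assumptions made for illustration.

```python
import random
from typing import Optional


def judge(q1a: float, q1b: float, q2a: float, q2b: float,
          rng: Optional[random.Random] = None) -> str:
    """Return "A" or "B" as the winner; never a draw.

    q1a, q1b : results of the first evaluation function for singers A and B
    q2a, q2b : results of the second evaluation function for singers A and B
    """
    rng = rng or random.Random()
    # Use the first comparison that is not a tie
    # (priority order is an assumption of this sketch).
    for a, b in ((q1a, q1b), (q2a, q2b)):
        if a != b:
            return "A" if a > b else "B"
    # Both comparisons coincide: fall back to a random number.
    return "A" if rng.random() < 0.5 else "B"
```

With `judge(75, 75, 60, 65)` the first comparison is a tie, so the second comparison decides and "B" wins; only when both pairs are equal does the random number come into play.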

Furthermore, based on the combination of the music data and the operation of the remote controller 51, the CPU 30 controls the switching of the selector 48 and the reference guide melody data supplied to the first and second scoring units 50A and 50B. The first and second scoring units 50A and 50B are thus used effectively, and accurate and appropriate scoring results can be calculated.
That is, when a single singer sings, the scoring result is obtained from the combined difference data Diffc with an improved SN ratio. For a duet song, the operation of the combining unit 50C is switched according to the nature of each singing section, so that accurate and appropriate scoring results can be calculated.

<D: Modifications>
The present invention is not limited to the embodiment described above, and various modifications such as the following are possible.
(1) For example, the embodiment describes karaoke performance of a duet song, but the invention is not limited to this and can be extended to chorus singing with three or more vocal parts. In that case, the scoring processing unit 50 is extended to as many channels as there are parts, and as many guide melody tracks are prepared as there are parts.
(2) Instead of obtaining the average over the musical elements as the scoring result as in the embodiment, the pitch, volume, and rhythm scores may be output as separate scoring results for each musical element.

(3) In the embodiment, scoring is performed all at once after the song ends. Alternatively, a basic evaluation may be made per phrase or per note and the results totaled after the song ends. The scoring result may also be displayed on the monitor 46 for each phrase, with the final scoring result displayed after the song ends.
(4) In the embodiment, the average of the scores obtained for the individual vocal parts of a duet song is output. The scores may instead be output individually, or both the average and the individual scores may be output. For individual output, the evaluation unit 50D calculates a scoring result from each of the difference data Diffa and Diffb.
(5) In addition, users' enjoyment can be further increased by adopting various display modes, such as highlighting the score of the singer with the highest scoring result among the plurality of singing voices.
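Modifications (3) and (5) can be sketched together in Python. The sketch assumes that the per-phrase results are totaled by taking their mean, which the text leaves open, and the names `final_score` and `top_scorer` are hypothetical.

```python
from typing import Dict, List


def final_score(phrase_scores: List[float]) -> float:
    """Total the per-phrase scores after the song ends.

    The mean is used here as one possible way of totaling (assumption).
    """
    if not phrase_scores:
        return 0.0
    return sum(phrase_scores) / len(phrase_scores)


def top_scorer(final_scores: Dict[str, float]) -> str:
    """Return the singer whose final score should be highlighted."""
    return max(final_scores, key=final_scores.get)
```

Each element of `phrase_scores` could be shown on the monitor as soon as its phrase ends, while `final_score` is computed only once, after the last phrase.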

(6) In the embodiment, the first and second evaluation function calculation units 510 and 520 perform the calculation with the evaluation function Q1(x), and the third and fourth evaluation function calculation units 530 and 540 perform the calculation with the evaluation function Q2(x). Alternatively, the first and second evaluation function calculation units 510 and 520 may be operated in a time-division manner so that one of them can be omitted. Similarly, the third and fourth evaluation function calculation units 530 and 540 may be operated in a time-division manner so that one of them can be omitted. These functions may also be performed by the CPU 30.

(7) In the embodiment, when a duet song is sung, the character corresponding to the singing voice signal that is not scheduled to sing in the male singing section or the female singing section is displayed on the monitor 46 at a reduced size. Alternatively, that character may be omitted from the monitor 46 altogether. In this case, the CPU 30 detects the male and female singing sections from the music data and selects the character to be displayed on the monitor 46 based on the detection result.

(8) In the embodiment, the first and second evaluation function calculation units 510 and 520 are provided. Alternatively, only a single evaluation function calculation unit may be used, and when its result does not establish a winner, the winner may be determined by the random number generated by the random number generation unit 570.

FIG. 1 is a block diagram showing the configuration of a karaoke apparatus according to an embodiment of the present invention.
FIG. 2 is a diagram showing the data format of the music data in the embodiment.
FIG. 3 is a diagram showing the configuration of the musical tone track of the music data.
FIG. 4 is a diagram showing the configuration of the tracks of the music data other than the musical tone track.
FIG. 5 is a diagram showing the contents of the memory map of the RAM in the karaoke apparatus.
FIG. 6 is a diagram showing the relationship between the singing volume level and the character size in the karaoke apparatus.
FIG. 7 is a block diagram showing the configuration of the scoring processing unit in the karaoke apparatus.
FIG. 8(A) is a diagram showing an example of a guide melody in the embodiment in staff notation, FIG. 8(B) is a diagram showing the reference pitch data and volume data based on the guide melody, and FIG. 8(C) is a diagram showing the pitch data, volume data, and singing state data of a singing voice.
FIG. 9 is a block diagram showing the functions of the evaluation unit 50D when a battle song is sung in the karaoke apparatus.
FIG. 10 is a timing chart for the case where a battle song is sung in the karaoke apparatus.
FIG. 11 is a timing chart for the case where a duet song is sung in the karaoke apparatus.

Explanation of reference numerals

30: CPU (control means, scoring means), 31: ROM, 32: RAM, 37: hard disk device, 38: sound source device, 46: monitor, 47a, 47b: microphones (first and second microphones), 48: selector (selecting means), 49: DSP for voice processing, 50: scoring processing unit, 501a, 501b: A/D converters, 502a, 502b: data extraction units (first and second extraction means), 503a, 503b: comparison units (first and second comparison means), 510, 520: first and second evaluation function calculation units (first evaluation unit), 550: first comparison unit (first evaluation unit), 530, 540: third and fourth evaluation function calculation units (second evaluation unit), 560: second comparison unit (second evaluation unit), 570: random number generation unit, 580: determination unit

Claims

1. A karaoke apparatus that performs a song based on music data and displays lyrics on a monitor, comprising:
selecting means for mixing or selecting a singing voice signal taken in from a first microphone and a singing voice signal taken in from a second microphone and outputting the result from a first output terminal and a second output terminal;
first detecting means for detecting a singing volume based on the singing voice signal output from the first output terminal;
second detecting means for detecting a singing volume based on the singing voice signal output from the second output terminal;
display control means for changing the shape of a first character in accordance with the singing volume detected by the first detecting means and displaying it on the monitor, and for changing the size of a second character in accordance with the singing volume detected by the second detecting means and displaying it on the monitor; and
control means for controlling, based on the music data, the switching of the selecting means and the setting of the first and second characters in synchronization with each other.

2. The karaoke apparatus according to claim 1, wherein, when the control means detects that the music data is composed of a mixed singing section sung by two singers and a single singing section sung by one singer,
in the mixed singing section, the control means controls the selecting means so as to output the mixed singing voice signal from the first and second output terminals, and changes the shapes of the first and second characters in accordance with the singing volumes detected by the first and second detecting means, and
in the single singing section, the control means controls the selecting means so that the singing voice signal from the first microphone is output from the first output terminal and the singing voice signal from the second microphone is output from the second output terminal, and changes the shape of the corresponding character in accordance with the singing volume detected from the singing voice signal of the one singer.