JP6177027B2

JP6177027B2 - Singing scoring system

Info

Publication number: JP6177027B2
Application number: JP2013137607A
Authority: JP
Inventors: 橘　聡
Original assignee: Daiichikosho Co Ltd
Current assignee: Daiichikosho Co Ltd
Priority date: 2013-06-29
Filing date: 2013-06-29
Publication date: 2017-08-09
Anticipated expiration: 2033-06-29
Also published as: JP2015011243A

Description

The present invention relates to a singing scoring system. In particular, it relates to a singing scoring system that sets the scoring sections subject to singing scoring so that each is shorter than the singing time of each word of the karaoke song's lyrics, and that calculates a singing score in each scoring section by comparing the singing voice signal input from a microphone with scoring reference data.

Karaoke systems in widespread use today have a singing scoring function that calculates a singing score by comparing the singing voice signal input from a microphone with scoring reference data. A technique relating to such a singing scoring function is described, for example, in Patent Document 1.

In the karaoke apparatus described in Patent Document 1, a karaoke performance is produced by a sequencer that reads out karaoke performance data and inputs it to a musical tone generator. The karaoke singer sings along with the performance, and the singing voice signal is input via a microphone to an amplifier and also to an A/D converter, which converts it into digital data.

A data extraction unit then extracts pitch data and volume data from the digitized singing voice signal and inputs the extracted pitch data and volume data (the singing voice) to a comparison unit. In parallel with the karaoke performance data, the sequencer reads out a guide melody, which serves as comparison data, and inputs it to the comparison unit. The comparison unit compares the extracted pitch data and volume data with the guide melody to score and evaluate the skill of the singer's singing.

Patent Document 1: Japanese Patent Laid-Open No. H10-69216

As described above, a singing score can be calculated by comparing the singing voice signal, input from a microphone and digitized by an A/D converter, with singing scoring reference data such as a guide melody. However, in a singing scoring system that sets scoring sections shorter than the singing time of each word of the karaoke song's lyrics and calculates a singing score in each scoring section by comparing the singing voice signal with the scoring reference data, pitch data sometimes cannot be extracted, depending on the kind of word in the lyrics.

In a scoring section where pitch data cannot be extracted, there is no data to compare against the scoring reference data, so the singing score comes out low even when the singer is singing well. The problem is that accurate singing scoring cannot be performed.

That is, with the pitch detection methods used in current karaoke systems, pitch cannot in principle be detected in the consonant portions (s, t, k, and so on) of the words in the lyrics. In addition, the duration of a consonant depends on its type. Specifically, comparing "su" and "ki" of the same overall length, the consonant portion "s" in "su" lasts longer than the consonant portion "k" in "ki". Therefore, when pitch detection is performed for each scoring section, a sa-row sound (sa, shi, su, se, so) has the consonant portion "s" occupying a large part of the section, and pitch detection accuracy is lower than for a ka-row sound (ka, ki, ku, ke, ko). For a ka-row sound, by contrast, the consonant portion "k" occupies only a short part of the section, so pitch detection accuracy hardly decreases.

With reference to the drawings, the following describes the length of the consonant portion when "su" and "ki" are pronounced, the result of FFT analysis of the frequency characteristics of the consonant "s" in "su" (hereinafter abbreviated as FFT, also in the drawings), the FFT of the vowel "u", the FFT of the consonant "k" in "ki", and the FFT of the vowel "i". In ordinary singing, the lengths of the consonant portions, that is, the "s" in "su" and the "k" in "ki", may each be regarded as roughly constant. The lengths of the vowel portions, that is, the "u" in "su" and the "i" in "ki", on the other hand, vary depending on whether the whole syllable is sung short or long, as with "su" and "kii". FIG. 6 is an explanatory diagram showing the length of the consonant in the pronunciation of "su", FIG. 7 is an explanatory diagram showing the FFT of the consonant "s", FIG. 8 is an explanatory diagram showing the FFT of the vowel "u", FIG. 9 is an explanatory diagram showing the length of the consonant in the pronunciation of "ki", FIG. 10 is an explanatory diagram showing the FFT of the consonant "k", and FIG. 11 is an explanatory diagram showing the FFT of the vowel "i".

As shown in FIG. 6, when "su" is pronounced, the consonant "s" lasts about 110 msec. As shown in FIG. 7, the consonant "s" shows no clear fundamental or harmonic series, so it is difficult to detect a pitch. In contrast, as shown in FIG. 8, the vowel "u" contains a fundamental with a relatively high level at about 310 Hz and a harmonic series at approximately integer multiples of that frequency, so the pitch can be detected. Thus, for a sa-row sound (for example, "su"), the consonant portion "s" occupies a long time within the scoring section, and accurate pitch detection is not possible. To raise scoring accuracy in scoring sections that contain sa-row sounds, the scoring process therefore needs to be adapted.

In contrast, as shown in FIG. 9, when "ki" is pronounced, the consonant "k" lasts about 25 msec, less than a quarter of the duration of the consonant "s". As shown in FIG. 10, the consonant "k" shows no clear fundamental or harmonic series, so it is difficult to detect a pitch. As shown in FIG. 11, however, the vowel "i" contains a fundamental with a relatively high level at about 300 Hz and a harmonic series at approximately integer multiples of that frequency, so the pitch can be detected. Thus, for a ka-row sound (for example, "ki"), the consonant portion "k" occupies only a very short time within a scoring section of a given length. Even though pitch cannot be detected in the consonant portion "k", the pitch detection needed for singing scoring can be performed in the vowel portion "i". The accuracy of the scoring process therefore does not drop sharply in scoring sections that contain ka-row sounds.

As is clear from FIGS. 6 to 11, for both the sa row and the ka row the vowel portion shows clean peaks at the integer harmonics, so the pitch can be detected from the fundamental. In the consonant portion, there is no peak in the frequency range of the fundamental, so pitch detection (measurement of a single wavelength) is not possible. Furthermore, because the consonant portion "k" of the ka row is shorter than the consonant portion of the sa row, pitch detection can be performed while ignoring the consonant portion "k".
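The effect of consonant duration on a short scoring section can be illustrated with a small calculation. The sketch below uses the consonant durations reported for FIG. 6 and FIG. 9 (about 110 msec for "s" and about 25 msec for "k"). The 200 msec section length and the function name `voiced_fraction` are assumptions made only for illustration, since the text gives no section length, and the sketch further assumes that the section starts at the consonant onset.

```python
# Consonant durations reported for FIG. 6 and FIG. 9, in milliseconds.
CONSONANT_MS = {"s": 110.0, "k": 25.0}


def voiced_fraction(consonant: str, section_ms: float) -> float:
    """Fraction of a scoring section left for the pitched vowel.

    Assumes the section begins at the consonant onset (illustrative only).
    """
    unvoiced = min(CONSONANT_MS[consonant], section_ms)
    return (section_ms - unvoiced) / section_ms


SECTION_MS = 200.0  # hypothetical section length; not given in the source
for consonant in ("s", "k"):
    print(consonant, round(voiced_fraction(consonant, SECTION_MS), 3))
```

Under these assumptions, less than half of the section remains available for pitch detection with "su", whereas most of the section remains available with "ki".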

The present invention has been proposed in view of the above circumstances. Its object is to provide a singing scoring system capable of accurate singing scoring even when, depending on the type of consonant contained in a word of the lyrics, there are scoring sections from which pitch data cannot be extracted.

The singing scoring system of the present invention has been proposed in view of the above circumstances and has the following features. That is, the singing scoring system of the present invention sets the scoring sections subject to singing scoring so that each is shorter than the singing time of each word of the karaoke song's lyrics, and calculates a singing score in each scoring section by comparing the singing voice signal input from a microphone with scoring reference data. The system is characterized by comprising scoring processing means that, for a scoring section containing a consonant, performs a scoring process that differs according to the type of the consonant.

In the singing scoring system of the present invention, the scoring processing means preferably has a pitch detection function for the singing voice, used for scoring and capable of detecting the pitch of the vowel contained in each scoring section, and, for a scoring section containing a consonant, preferably performs a scoring process that differs according to the duration of the consonant.

When the singing voice pitch detection function is provided, the scoring processing means can be configured to replace the singing score of a scoring section containing a consonant of at least a predetermined duration with the score of another scoring section that does not contain a consonant of at least the predetermined duration.

When the singing voice pitch detection function is provided, the scoring processing means can be configured not to treat a scoring section containing a consonant of at least a predetermined duration as a section subject to scoring.

When the singing voice pitch detection function is provided, the scoring processing means can be configured to calculate the singing score of a scoring section containing a consonant of at least a predetermined duration by interpolation based on the scores of other scoring sections that do not contain a consonant of at least the predetermined duration.

When the singing voice pitch detection function is provided, the scoring processing means can be configured to calculate the singing score of a scoring section containing a consonant of at least a predetermined duration based on the proportion and the score of the sung portion other than the consonant.

In a singing scoring system configured in this way, when a user sings along with the performance of a karaoke song, the singing voice signal input from the microphone is digitized by the A/D converter to generate the data to be scored. The scoring processing means then compares the data to be scored with the scoring reference data for each predetermined scoring section to calculate a singing score.

In doing so, the scoring processing means performs, for a scoring section containing a consonant, a scoring process that differs according to the type of the consonant. That is, when the scoring processing means has a singing voice pitch detection function for use in scoring, it replaces the score of a scoring section containing a consonant of at least a predetermined duration with the score of another scoring section that does not contain such a consonant (for example, the preceding or following scoring section). Alternatively, a scoring section containing a consonant of at least the predetermined duration may be excluded from the sections subject to scoring. The score for such a section may also be interpolated from the scores of other scoring sections. Furthermore, the score of such a section may be calculated based on the proportion and the score of the sung portion other than the consonant.
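As one illustration of the exclusion alternative described above, the following minimal sketch computes an overall score from only those sections that do not contain a long consonant. The function name `overall_score_excluding`, the use of a plain mean, and the list-based data layout are assumptions for illustration; the text does not specify how section scores are combined.

```python
from statistics import mean


def overall_score_excluding(scores, has_long_consonant):
    """Average the section scores, skipping sections flagged as containing
    a consonant of at least the predetermined duration.

    Returns None when every section is flagged (nothing left to score).
    """
    kept = [s for s, flagged in zip(scores, has_long_consonant) if not flagged]
    if not kept:
        return None
    return mean(kept)


# The low score of the flagged middle section does not affect the result.
print(overall_score_excluding([80, 20, 90], [False, True, False]))
```

Excluding the flagged section keeps a pitch-detection failure from being counted as a singing error, at the cost of ignoring whatever the singer did in that section.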

According to the singing scoring system of the present invention, performing a scoring process that differs according to the type of consonant for a scoring section containing a consonant makes accurate singing scoring possible even when pitch data cannot be extracted for that section.

In particular, when the scoring sections subject to singing scoring are set shorter than the singing time of each word of the karaoke song's lyrics, pitch data sometimes cannot be extracted. The singing scoring system of the present invention can perform accurate singing scoring even when such scoring sections exist.

FIG. 1: Block diagram showing the configuration of a karaoke system to which the singing scoring system according to an embodiment of the present invention is applied.
FIG. 2: Explanatory diagram showing Example 1 of the singing scoring process in the singing scoring system according to the embodiment of the present invention.
FIG. 3: Explanatory diagram showing Example 2 of the singing scoring process in the singing scoring system according to the embodiment of the present invention.
FIG. 4: Explanatory diagram showing Example 3 of the singing scoring process in the singing scoring system according to the embodiment of the present invention.
FIG. 5: Explanatory diagram showing Example 4 of the singing scoring process in the singing scoring system according to the embodiment of the present invention.
FIG. 6: Explanatory diagram showing the length of the consonant in the pronunciation of "su".
FIG. 7: Explanatory diagram showing the FFT of the consonant "s".
FIG. 8: Explanatory diagram showing the FFT of the vowel "u".
FIG. 9: Explanatory diagram showing the length of the consonant in the pronunciation of "ki".
FIG. 10: Explanatory diagram showing the FFT of the consonant "k".
FIG. 11: Explanatory diagram showing the FFT of the vowel "i".

An embodiment of the singing scoring system of the present invention is described below with reference to the drawings. FIGS. 1 to 5 show the singing scoring system according to the embodiment of the present invention. FIG. 1 is a block diagram showing the configuration of a karaoke system to which the singing scoring system is applied, and FIGS. 2 to 5 are explanatory diagrams showing examples of the singing scoring process.

<Overview of the singing scoring system>
The singing scoring system according to the embodiment of the present invention sets the scoring sections subject to singing scoring so that each is shorter than the singing time of each word of the karaoke song's lyrics, and calculates a singing score in each scoring section by comparing the singing voice signal input from a microphone with scoring reference data. As shown in FIG. 1, this singing scoring system 10 is incorporated into a karaoke system (a system including the karaoke performance device 20) to realize its function, and is configured as part of the singing scoring function. The singing scoring system 10 of this embodiment includes scoring processing means 38 as the functional means for realizing the singing scoring function.

In the following description, the term "program" covers not only software that is stored in a RAM or the like and performs its function when executed by hardware such as a CPU, but also logic circuits capable of performing an equivalent function.

<Karaoke performance device>
As shown in FIG. 1, the karaoke performance device 20 to which the singing scoring system 10 according to the embodiment of the present invention is applied includes a karaoke main body 21, a speaker 22, a microphone 23, a display device 24, a mixing amplifier 25, and a karaoke remote control device 26. Although not shown, the karaoke performance device 20 may be connected over a network to a management server via a router and a data communication line.

<Karaoke remote control device>
The karaoke remote control device 26 has a user interface function and exchanges data with the local transmission/reception means 36 of the karaoke main body 21 by wire or wirelessly. The karaoke remote control device 26 includes a program that functions as music search means 26a, a music index database 26b, a data storage unit 26c for storing various data, an input/output display unit 26d for data input and output, and so on. Song selection and similar operations are performed by operating the switches on the karaoke remote control device 26 or the various icons displayed on the input/output display unit 26d.

<Music search means / music index database>
The music search means 26a is a program that searches for songs by referring to the music index database 26b in response to user instructions. The music index database 26b describes attribute information for the karaoke songs that the karaoke performance device 20 can perform. It contains various attributes such as song number, song title, artist name, opening lyrics, period of popularity, music genre, and whether the song is a duet.

<Microphone>
The microphone 23 is a device for inputting the singing voice. The singing voice signal input from the microphone 23 is mixed by the mixing amplifier 25 with the performance audio signal sent from the music reproduction control means 39, amplified, and output to the speaker 22. The audio input signal from the microphone 23 is also digitized by the A/D converter 40 and used for singing scoring and the like in the scoring processing means 38.

<Display device>
The display device 24 displays background video, lyric captions, and the like associated with the karaoke song, and consists of, for example, a liquid crystal display.

<Karaoke main body>
The karaoke main body 21 includes network transmission/reception means 31, central control means 32, a ROM 33, a RAM 34, an HDD 35, local transmission/reception means 36, reservation management means 37, scoring processing means 38, music reproduction control means 39, an A/D converter 40, and video reproduction control means 41.

<Central control means>
The central control means 32 provides overall control of the karaoke main body 21. It consists of, for example, a CPU and its peripheral devices, and performs its control function when the CPU operates according to a program stored in the ROM 33 or the like.

<ROM / RAM>
The ROM 33 stores the program data and numerical data used to control the devices that make up the karaoke main body 21, and consists of, for example, a semiconductor memory. The RAM 34 functions as a temporary storage area for programs and various data, and likewise consists of, for example, a semiconductor memory.

In this embodiment, a reservation queue 34a is stored in the RAM 34. The reservation queue 34a is a data table that lists the music IDs of reserved karaoke songs in performance order, and it may also be linked to other identification data such as the user ID of the person who reserved each song. Singing scores may also be stored in the RAM 34.

<HDD>
The HDD 35 functions as a mass storage device and stores a music database 35a and a video database 35b. A mass storage device such as a rewritable DVD may be used instead of, or together with, the HDD 35.

<Music database / video database>
The music database 35a stores music data, each item consisting of synchronized performance control data (MIDI-standard data) and lyrics rendering data, in association with a music ID. The performance control data is digital data for controlling the performance of each song, and the lyrics rendering data contains display timing data and color-change data for the lyric characters, synchronized with the performance. The video database 35b stores a predetermined number of background videos for the karaoke songs to be performed, as video files associated with the music IDs of those songs.

In this embodiment, the performance control data and lyrics rendering data contained in the music database 35a serve as the scoring reference data for the scoring processing means 38. For more detailed scoring, other scoring elements may be added to the scoring reference data.

<Transmission/reception means>
The local transmission/reception means 36 consists of an electronic circuit and a program for exchanging data between the karaoke main body 21 and the karaoke remote control device 26. In this embodiment, data is exchanged between the karaoke main body 21 and the karaoke remote control device 26 by infrared communication. This embodiment also includes network transmission/reception means 31 for exchanging data with a router (not shown). The karaoke remote control device 26 may also be configured to exchange data with the network transmission/reception means 31 of the karaoke main body 21 via the router (not shown).

<Reservation management means>
The reservation management means 37 is a program that, when a user reserves a song, creates and manages the reservation queue 34a containing the music ID of the selected karaoke song. That is, the reservation management means 37 arranges the music IDs selected by users through the music search means 26a in performance order to create the reservation queue 34a, and stores and manages this queue in the RAM 34. If the reservation queue 34a is to include the user ID of the person who selected each song, the user ID must be acquired.

The user ID may be acquired by reading it from a user ID card with a card reader, or from a user ID and password entered on the input/output display unit 26d of the karaoke remote control device 26. If the system supports reservations from a portable information terminal carried by the user, the user ID linked to the device ID of that terminal may be acquired. A temporary user ID may also be assigned to a user when the karaoke performance device 20 is used.

<Scoring processing means>
The scoring processing means 38 is a program that, for a scoring section containing a consonant, performs a scoring process that differs according to the type of the consonant. Preferably, the scoring processing means 38 has a pitch detection function for the singing voice, used for scoring and capable of detecting the pitch of the vowel contained in each scoring section, and performs a scoring process that differs according to the duration of the consonant for a scoring section containing a consonant. Scoring sections that contain a consonant of at least a predetermined duration can be identified from the lyrics rendering data. That is, the lyrics rendering data is synchronized with the scoring sections, so when the lyrics rendering data contains a consonant of at least the predetermined duration, such as "s" or "h", the scoring section corresponding to that lyrics rendering data can be treated as a scoring section containing a consonant of at least the predetermined duration.
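The identification of sections from the lyrics data can be sketched as follows. This is a minimal illustration only: it assumes that the lyrics for each scoring section are already available as romanized syllables, and it uses a simple prefix test against the two consonants named in the text ("s" and "h"). The names `LONG_CONSONANTS` and `flag_sections`, the data layout, and the prefix rule are all hypothetical; the actual format of the lyrics rendering data is not described here.

```python
# Consonants the text names as examples of long consonants.
# The complete set used by a real system is an assumption.
LONG_CONSONANTS = ("s", "h")


def flag_sections(section_syllables):
    """Return one boolean per scoring section: True when any syllable sung
    in that section begins with a long consonant.

    section_syllables: list of lists of romanized syllables, one inner list
    per scoring section (hypothetical layout).
    """
    return [
        any(syllable.startswith(LONG_CONSONANTS) for syllable in syllables)
        for syllables in section_syllables
    ]


sections = [["ki"], ["su"], ["ha", "na"], ["a"]]
print(flag_sections(sections))
```

Because the flags are derived from the lyrics rather than from the audio, they can be computed before the performance starts.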

<Scoring process (Example 1)>
Example 1 of the scoring process in the scoring processing means 38 is described with reference to FIG. 2. In Example 1, a scoring section containing a consonant of at least a predetermined time length is not scored on the basis of its own pitch detection result. Instead, its scoring value is replaced or interpolated using scoring values based on the pitch detection results of other scoring sections.

That is, as shown in FIG. 2, when there is a scoring section containing a consonant of at least the predetermined time length, such as "s" or "h", singing scoring is not performed for that section. The scoring value of that section is then replaced or interpolated using the scoring values based on the pitch detection results of the scoring sections before and after it.

In the example shown in FIG. 2, singing scoring is not performed in the scoring sections containing the consonants "s" and "h", which are at least the predetermined time length. Instead, the mean, standard deviation, or similar statistic of the scoring values of the one section before and the one section after is used as the scoring value of the unscored section, and singing scoring is then performed for each predetermined scoring section and for the entire song.

The scoring value substituted for a scoring section containing a consonant of at least the predetermined time length may be taken from the section immediately before it, the section immediately after it, the one section on each side of it, multiple sections on each side of it, or any other suitable choice. If the immediately preceding or following section also contains a consonant of at least the predetermined time length, the scoring value of a further section other than that one is used as the substitute scoring value.

Specifically, as shown in FIG. 2, section (1) contains the consonant "s", which is at least the predetermined time length, so it is not scored. Because section (1) has no preceding section, the scoring value of the following section (2), 80 points, is substituted as the scoring value of section (1). Section (3) contains the consonant "k", but "k" is short in duration and the pitch can be detected adequately in the part other than "k", so the actual scoring value of 75 points is used for section (3). Section (5) contains the consonant "h", which is at least the predetermined time length, so it is not scored. Because section (5) has both a preceding and a following section, the average of the scoring value of section (4), 78 points, and that of section (6), 82 points, is taken, and 80 points is interpolated as the scoring value of section (5).
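The replacement and interpolation of Example 1 can be sketched as follows. This is a minimal illustration, not the patented implementation: the function name `fill_skipped` and the use of `None` to mark an unscored section are assumptions, and only the variant that averages the nearest scored neighbor on each side is shown.

```python
from typing import Optional


def fill_skipped(scores: list[Optional[float]]) -> list[float]:
    """Fill unscored sections (None) from the nearest scored neighbors.

    If a scored section exists on both sides, their mean is used
    (interpolation). If only one side has a scored section, its value
    is copied (replacement).
    """
    filled = []
    for i, score in enumerate(scores):
        if score is not None:
            filled.append(score)
            continue
        # Search outward so that adjacent unscored sections are skipped as well.
        before = next((scores[j] for j in range(i - 1, -1, -1)
                       if scores[j] is not None), None)
        after = next((scores[j] for j in range(i + 1, len(scores))
                      if scores[j] is not None), None)
        neighbors = [v for v in (before, after) if v is not None]
        if not neighbors:
            raise ValueError("no scored section available for substitution")
        filled.append(sum(neighbors) / len(neighbors))
    return filled


# Values from FIG. 2: sections (1) and (5) are not scored.
fig2 = [None, 80, 75, 78, None, 82]
result = fill_skipped(fig2)
```

For the FIG. 2 values, section (1) takes 80 points from section (2), and section (5) takes the mean of 78 and 82, which is 80 points.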

<Scoring process (Example 2)>
Example 2 of the scoring process in the scoring processing means 38 is described with reference to FIG. 3. In Example 2, a scoring section containing a consonant of at least a predetermined time length is excluded from the scoring sections to be scored.

That is, as shown in FIG. 3, when there is a scoring section containing a consonant of at least the predetermined time length, such as "s" or "h", singing scoring is not performed for that section. Singing scoring for each predetermined scoring section and for the entire song is then based on the scoring values of the other scoring sections.

In the example shown in FIG. 3, singing scoring is not performed in the scoring sections containing the consonants "s" and "h", which are at least the predetermined time length, and singing scoring for each predetermined scoring section and for the entire song is based on the scoring values of the remaining scoring sections.

Specifically, as shown in FIG. 3, section (1) contains the consonant "s", which is at least the predetermined time length, so even if it is scored, its scoring value is not reflected in the overall scoring result. As in Example 1, section (1) need not be scored at all. Section (3) contains the consonant "k", but "k" is short in duration and the pitch can be detected adequately in the part other than "k", so the actual scoring value of 75 points is used for section (3) and is reflected in the overall scoring result. Section (5) contains the consonant "h", which is at least the predetermined time length, so even if it is scored, its scoring value is not reflected in the overall scoring result. As with section (1), section (5) need not be scored at all.
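A minimal sketch of the exclusion in Example 2 follows. The specification does not state how the overall result is aggregated, so the arithmetic mean used here is an assumption, as are the function name `overall_score` and all section values other than the 75 points of section (3).

```python
def overall_score(scores: dict[int, float], excluded: set[int]) -> float:
    """Average the scoring values of all sections not in `excluded`.

    Assumption: the overall result is a plain arithmetic mean.
    """
    kept = [value for index, value in scores.items() if index not in excluded]
    if not kept:
        raise ValueError("every section was excluded")
    return sum(kept) / len(kept)


# Hypothetical per-section values. Sections 1 and 5 contain long consonants,
# so any value measured there is ignored.
scores = {1: 40, 2: 80, 3: 75, 4: 78, 5: 35, 6: 82}
total = overall_score(scores, excluded={1, 5})
```

Here the low values measured in sections 1 and 5 do not pull the result down, since only sections 2, 3, 4, and 6 contribute.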

<Scoring process (Example 3)>
Example 3 of the scoring process in the scoring processing means 38 is described with reference to FIG. 4. In Example 3, the scoring value of a scoring section containing a consonant of at least a predetermined time length is calculated from the proportion and the scoring value of the sung portion other than that consonant.

As shown in FIG. 4, when there is a scoring section containing a consonant of at least the predetermined time length, such as "s" or "h", the following scoring process is applied to that section. First, the time ratio between the sung portion occupied by the consonant and the sung portion in which pitch detection is possible is calculated. The consonant portion is awarded full marks in proportion to its time ratio. For the portion in which pitch detection is possible, a scoring value is calculated from its time ratio and its scoring value. The scoring value of the consonant portion and that of the remaining portion, each weighted by its time ratio, are then added to give the scoring value of the section.

In the example shown in FIG. 4, for each scoring section containing the consonant "s" or "h", which are at least the predetermined time length, the time ratio between the consonant portion and the portion in which pitch detection is possible is calculated, a scoring value is calculated for each portion, and the two are summed. This allows reasonably accurate singing scoring even for scoring sections containing the consonants "s" and "h".

Specifically, as shown in FIG. 4, section (1) contains the consonant "s", which is at least the predetermined time length, so the sung portion of "s" is awarded full marks in proportion to its time ratio. The part of section (1) other than the sung portion of "s" is scored, and its scoring value is weighted by its time ratio. In the example shown in FIG. 4, the sung portion of "s" occupies 30% of section (1), the remainder occupies 70%, and the scoring value obtained in section (1) from pitch detection is 60 points. The scoring value of section (1) is therefore {100 points × (30/100)} + {60 points × (70/100)} = 72 points.

Section (3) contains the consonant "k", but "k" is short in duration and the pitch can be detected adequately in the part other than "k", so the actual scoring value of 75 points is used for section (3).

Section (5) contains the consonant "h", which is at least the predetermined time length, so the sung portion of "h" is awarded full marks in proportion to its time ratio. The part of section (5) other than the sung portion of "h" is scored, and its scoring value is weighted by its time ratio. In the example shown in FIG. 4, the sung portion of "h" occupies 40% of section (5), the remainder occupies 60%, and the scoring value obtained in section (5) from pitch detection is 55 points. The scoring value of section (5) is therefore {100 points × (40/100)} + {55 points × (60/100)} = 73 points.
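The weighted sum of Example 3 can be written as a short function. This is a sketch under stated assumptions: the name `score_with_full_marks`, the use of integer percentages, and the default of 100 points for full marks are chosen for illustration.

```python
def score_with_full_marks(consonant_percent: int, voiced_score: float,
                          full_marks: float = 100.0) -> float:
    """Scoring value of a section containing a long consonant (Example 3).

    The consonant portion earns full marks weighted by its share of the
    section. The remaining portion earns its pitch-based score weighted
    by the remaining share.
    """
    if not 0 <= consonant_percent <= 100:
        raise ValueError("consonant_percent must be between 0 and 100")
    voiced_percent = 100 - consonant_percent
    return (full_marks * consonant_percent + voiced_score * voiced_percent) / 100


# Values from FIG. 4.
section1 = score_with_full_marks(consonant_percent=30, voiced_score=60)
section5 = score_with_full_marks(consonant_percent=40, voiced_score=55)
```

The two calls reproduce the 72 points of section (1) and the 73 points of section (5). Because the consonant portion always earns full marks, this approach can only raise a section's value relative to its pitch-based score.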

<Scoring process (Example 4)>
Example 4 of the scoring process in the scoring processing means 38 is described with reference to FIG. 5. Example 4 is the same as Example 3 in that the scoring value of a scoring section containing a consonant of at least a predetermined time length is calculated from the proportion and the scoring value of the sung portion other than that consonant.

Only the method of calculating the scoring value, which differs from Example 3, is described below. In Example 4, as shown in FIG. 5, section (1) contains the consonant "s", which is at least the predetermined time length, so the sung portion of "s" is treated as not having been scorable and a corrected scoring value is adopted. That is, the part of section (1) other than the sung portion of "s" is scored, and its scoring value is calculated according to its time ratio. For the sung portion of "s", a scoring value is then estimated from the scoring value of the part other than "s", according to the time ratio of the "s" portion. In the example shown in FIG. 5, the sung portion of "s" occupies 30% of section (1), the remainder occupies 70%, and the scoring value of the part other than "s" is 52 points. The corrected scoring value of section (1) is therefore (estimated scoring value of the consonant part = 22 points) + (actual scoring value of the part other than the consonant = 52 points) = 74 points.

In section (5), the part other than the sung portion of the consonant "h", which is at least the predetermined time length, is scored, and its scoring value is calculated according to its time ratio. For the sung portion of "h", a scoring value is then estimated from the scoring value of the part other than "h", according to the time ratio of the "h" portion. In the example shown in FIG. 5, the sung portion of "h" occupies 40% of section (5), the remainder occupies 60%, and the scoring value of the part other than "h" is 45 points. The corrected scoring value of section (5) is therefore (estimated scoring value of the consonant part = 30 points) + (actual scoring value of the part other than the consonant = 45 points) = 75 points.
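One reading that reproduces the FIG. 5 figures is that the actual scoring value (52 or 45 points) is the number of points earned in the pitch-detectable share of the section, and that the consonant share is credited at the same rate of points per unit time. The sketch below follows that reading. The interpretation, the function name `score_with_estimate`, and the rounding to whole points are assumptions, since the specification states only the resulting values.

```python
def score_with_estimate(consonant_percent: int,
                        voiced_points: float) -> tuple[int, float]:
    """Corrected scoring value of a section with a long consonant (Example 4).

    `voiced_points` is taken to be the points earned in the portion where
    pitch detection works. The consonant portion is credited at the same
    rate, scaled by the ratio of the two time shares.
    Returns (estimated consonant points, corrected section score).
    """
    voiced_percent = 100 - consonant_percent
    if not 0 < voiced_percent <= 100:
        raise ValueError("the section needs a pitch-detectable portion")
    # Assumption: estimates are rounded to whole points, as in FIG. 5.
    estimated = round(voiced_points * consonant_percent / voiced_percent)
    return estimated, estimated + voiced_points


# Values from FIG. 5.
section1 = score_with_estimate(consonant_percent=30, voiced_points=52)
section5 = score_with_estimate(consonant_percent=40, voiced_points=45)
```

Under this reading, section (1) yields an estimate of 22 points and a corrected value of 74 points, and section (5) yields 30 points and 75 points. Unlike Example 3, the consonant portion is credited according to how well the rest of the section was sung rather than with full marks.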

<Music playback control means>
The music playback control means 39 is an electronic circuit that, based on performance control data extracted from the performance data according to the song ID, digitally plays back the sound source data, converts it to analog, and outputs it to the mixing amplifier 25. As described above, the mixing amplifier 25 mixes the singer's singing voice signal input from the microphone 23 with the performance audio signal sent from the music playback control means 39, amplifies the result, and outputs it from the speaker 22.

<Video playback control means>
During the performance of a karaoke song, the video playback control means 41 outputs to the display device 24 the background video data extracted from the video database 35b and the lyric characters generated from the lyric rendering data contained in the performance data, in synchronization with the performance data of that karaoke song.

<Other embodiments>
The devices and means constituting the system of the present invention and its peripheral devices are not limited to those described above. Depending on the purpose of use, the system may be configured with only the necessary devices and means, or other devices and means may be added as appropriate. In addition, rather than being configured separately, the means may be configured as a single means integrating multiple functions.

Description of symbols
10 Singing scoring system
20 Karaoke performance device
21 Karaoke main unit
22 Speaker
23 Microphone
24 Display device
25 Mixing amplifier
26 Karaoke remote control device
26a Music search means
26b Music index database
26c Data storage unit
26d Input/output display unit
31 Network transmission/reception means
32 Central control means
33 ROM
34 RAM
34a Reservation queue
35 HDD
35a Music database
35b Video database
36 Local transmission/reception means
37 Reservation management means
38 Scoring processing means
39 Music playback control means
40 A/D converter
41 Video playback control means

Claims

A singing scoring system that sets scoring sections subject to singing scoring so that each is shorter than the singing time of each word of the lyrics of a karaoke song, and that, in each scoring section, compares a singing voice signal input from a microphone with scoring reference data to calculate a singing scoring value,
the singing scoring system comprising scoring processing means for performing, on a scoring section containing a consonant, scoring processing that differs according to the type of the consonant.

The singing scoring system according to claim 1, wherein the scoring processing means has a pitch detection function for the singing voice used in scoring, capable of detecting the pitch of the vowels contained in each scoring section, and performs, on the scoring section containing the consonant, scoring processing that differs according to the time length of the consonant.

The singing scoring system wherein the scoring processing means replaces the singing scoring value of a scoring section containing a consonant of at least a predetermined time length with the scoring value of a scoring section that does not contain a consonant of at least the predetermined time length.

The singing scoring system according to claim 2, wherein the scoring processing means excludes a scoring section containing a consonant of at least a predetermined time length from the scoring sections to be scored.

The singing scoring system according to claim 2, wherein the scoring processing means calculates the singing scoring value of a scoring section containing a consonant of at least a predetermined time length by interpolation based on the scoring values of scoring sections that do not contain a consonant of at least the predetermined time length.

The singing scoring system wherein the scoring processing means calculates the singing scoring value of a scoring section containing a consonant of at least a predetermined time length based on the proportion and the scoring value of the sung portion other than the consonant.