JP2016009153A - Karaoke device

Info

Publication number: JP2016009153A
Application number: JP2014131024A
Authority: JP
Inventors: 豪矢吹; Takeshi Yabuki
Original assignee: Daiichikosho Co Ltd
Current assignee: Daiichikosho Co Ltd
Priority date: 2014-06-26
Filing date: 2014-06-26
Publication date: 2016-01-18
Anticipated expiration: 2034-06-26
Also published as: JP6316112B2

Abstract

PROBLEM TO BE SOLVED: To provide a karaoke device that produces model voice data in the singer's own voice, from a single recorded performance, within the range the singer can sing.

SOLUTION: A karaoke device comprises: determination means 30, which detects the singing pitch of the singing voice data for each singing section of a song, compares it with the reference pitch to calculate a pitch difference, and determines whether the singing pitch of each singing section requires correction according to whether the pitch difference is within an allowable range; correction means 31, which creates reference model voice data in which each singing pitch requiring correction is corrected based on the reference data; specifying means 32, which, when the pitch of the singing voice data corresponding to the highest reference pitch requires correction, specifies the singing upper-limit pitch the singer can sing; and transposing means 33, which transposes all singing sections of the reference model voice data downward by a transposition amount in semitone units based on the pitch difference between the highest reference pitch and the singing upper-limit pitch.

Description

The present invention relates to a karaoke apparatus having means for creating and providing model singing voice data for a singer's performance.

In recent years, as karaoke apparatuses have become more sophisticated, they have come to offer a variety of services. One such service, intended to improve a singer's ability, creates model singing voice data for the singer's performance and makes it available for listening. When such a model performance is provided, it is desirable that it lie within the range the singer can actually sing.

Karaoke apparatuses are known that score a user's singing voice data by comparing it with model voice data and that can also store the singing voice data. For example, a technique is known that stores singing voice data input through a microphone while the accompaniment is played, evaluates singing ability by comparing the singing voice data with model voice data, and lets the user compare the singing voice with the model voice for sections that were not sung correctly (Patent Document 1).

In this case, the model voice data is usually sung by a skilled singer such as a professional. For improving singing ability, however, it is preferable that the model voice used for comparison be sung in the user's own voice rather than a professional's. A technique has therefore been disclosed that corrects a user's singing voice data based on the pitch of model singing data (Patent Document 2). Applying the technique of Patent Document 2 to that of Patent Document 1 lets the user compare his or her singing with a model voice sung in the user's own voice.

Users also have their own vocal range when singing. A karaoke apparatus with a function for measuring and announcing the singer's vocal range has been provided (Patent Document 3), and the key can be set according to the measured range.

Patent Document 1: JP 2008-233736 A
Patent Document 2: JP H11-038987 A (Japanese Patent Laid-Open No. 11-038987)
Patent Document 3: JP 2004-326133 A

However, when the pitch of a singing voice is corrected using the technique of Patent Document 2, accurate pitch alone is not enough; the result must also be usable as a model for practice. If the corrected pitch exceeds the user's vocal range (in particular the user's upper-limit note), the user cannot sing that part, and the result is unsuitable as a model performance.

It is also conceivable to measure the singer's vocal range beforehand using a technique such as that of Patent Document 3, set the key based on the measurement, and then sing. However, singing under that technique is aimed at measuring the vocal range and involves repeated key changes during the song. It is therefore unsuitable as a source for generating model voice data from the singing voice data, and the singer would have to sing again in the optimal key.

The present invention was made in view of the above problems. Its object is to provide a karaoke apparatus that can create, from a single performance by the singer, model voice data in the singer's own voice that is singable within the singer's vocal range.

To solve the above problems, the invention of claim 1 is a karaoke apparatus provided with reference data that, for each singing section of each song, serves as an analysis standard for analyzing a singer's performance in that section, at least in terms of the relationship between the sung pitch and a reference pitch indicating the pitch that should be sung. The apparatus comprises: recording means for recording the singing voice of a given song by a singer as singing voice data; determination means for detecting, for each singing section of the song, the singing pitch of the recorded singing voice data, calculating a pitch difference by comparison with the corresponding reference pitch in the reference data, and determining whether the singing pitch of each singing section requires correction according to whether the pitch difference is within an allowable range; correction means for creating reference model voice data in which each singing pitch determined by the determination means to require correction is corrected based on the corresponding reference pitch in the reference data; specifying means for specifying, based on the determination result, when the singing pitch of the singing voice data corresponding to the highest reference pitch among all singing sections of the song requires correction, the singing pitch of the singing voice data corresponding to that reference pitch as the singing upper-limit pitch the singer can sing; and transposing means for transposing all singing sections of the reference model voice data downward, by a transposition amount in semitone units based on the pitch difference between the highest reference pitch among all singing sections of the song and the singing upper-limit pitch specified by the specifying means, to produce transposed model voice data.

The invention of claim 2 is a karaoke apparatus provided with reference data that, for each singing section of each song, serves as an analysis standard for analyzing a singer's performance in that section, at least in terms of the relationship between the sung pitch and a reference pitch indicating the pitch that should be sung. The apparatus comprises: recording means for recording the singing voice of a given song by a singer as singing voice data; determination means for detecting, for each singing section of the song, the singing pitch of the recorded singing voice data, calculating a pitch difference by comparison with the corresponding reference pitch in the reference data, and determining whether the singing pitch of each singing section requires correction according to whether the pitch difference is within an allowable range; correction means for creating reference model voice data in which each singing pitch determined by the determination means to require correction is corrected based on the corresponding reference pitch in the reference data; specifying means for specifying, based on the determination result, when the singing pitch of the singing voice data corresponding to the highest reference pitch among all singing sections of the song requires correction, the highest singing pitch among those determined not to require correction as the singing upper-limit pitch the singer can sing; and transposing means for transposing all singing sections of the reference model voice data downward, by a transposition amount in semitone units based on the pitch difference between the highest reference pitch among all singing sections of the song and the singing upper-limit pitch specified by the specifying means, to produce transposed model voice data.

In the inventions of claims 3 and 4, the transposing means further transposes all singing sections of the recorded singing voice data by the transposition amount and transposes the performance data of the song by the same amount, and the apparatus comprises selection means for selectively playing back either the transposed singing voice data with the transposed performance data, or the transposed model voice data with the transposed performance data.

According to the inventions of claims 1 and 2, the singing pitch of the singing voice data is detected for each singing section of the song and compared with the reference pitch to calculate a pitch difference; whether each section's singing pitch requires correction is determined from whether that difference is within an allowable range; and reference model voice data is created in which the singing pitches requiring correction are corrected based on the reference data. In addition, based on the determination result, when the singing pitch corresponding to the highest reference pitch among all singing sections of the song requires correction, either that singing pitch or the highest singing pitch determined not to require correction is specified as the singing upper-limit pitch the singer can sing. All singing sections of the reference model voice data are then transposed downward, by a transposition amount in semitone units based on the pitch difference between that reference pitch and the specified singing upper-limit pitch, to produce transposed model voice data. Because the pitch of the highest note of the corrected reference model voice data then no longer exceeds the singer's upper-limit pitch, model voice data in the singer's own voice, singable within the singer's vocal range, can be provided from a single performance by the singer.

According to the inventions of claims 3 and 4, the transposing means transposes all singing sections of the singing voice data by the determined transposition amount and has the performance reproducing means transpose the performance data of the song by the same amount, and either the transposed singing voice data with the transposed performance data, or the transposed model voice data with the transposed performance data, is selectively played back. This lets the user compare his or her own performance with a singable model performance, which is convenient for the user.

FIG. 1 is a block diagram of the karaoke apparatus according to the present invention.
FIG. 2 shows a processing flowchart of the determination means of FIG. 1 and an explanatory diagram of the determination result.
FIG. 3 is a processing flowchart of the specifying means of FIG. 1.
FIG. 4 is a processing flowchart of the transposing means of FIG. 1.
FIG. 5 is an explanatory diagram of the processing of the transposing means of FIG. 4.
FIG. 6 is another processing flowchart of the specifying means of FIG. 1.
FIG. 7 is an explanatory diagram of the processing of the transposing means based on the processing of FIG. 6.

Embodiments of the present invention are described below with reference to the drawings.
FIG. 1 shows a block diagram of the karaoke apparatus according to the present invention. In FIG. 1, each karaoke terminal 11 has a karaoke main body 12 as its main device, to which a display unit 13, a mixing amplifier 14, a microphone 15, a speaker 16, and a remote input/output device 17 are externally connected by wire or wirelessly.

The display unit 13 shows the normal song-selection screen, background video during karaoke performance, and so on; a liquid crystal display (LCD), a plasma display panel (PDP), or any of various other displays may be used. The mixing amplifier 14 mixes the voice signal from the microphone 15 into the music performance signal sent from the karaoke main body 12, amplifies the result, and outputs it from the speaker 16.

The remote input/output device 17 exchanges data with the karaoke main body 12 through a terminal transceiver (not shown) over a wired or wireless link (for example, an IR link or a Bluetooth (registered trademark) piconet connection), and includes at least a terminal display unit 17A and a song selection registration means 17B.

The terminal display unit 17A is a liquid crystal display (LCD) laminated with a touch sensor for input and output. It provides a GUI through which data such as a song selection can be entered by touching displayed icons and the like.

The song selection registration means 17B is a program, equipped with a table, for searching for and selecting songs. A selected song is registered in a reservation queue (41) in the karaoke main body 12, described later. At selection time, a display serving as selection means lets the user request, for after the song has been sung, either the creation and playback of model voice data for the user's own performance or the playback of voice data in which the user's own performance is transposed to a singable pitch. If one of these is chosen, the choice is attached to the selected song and registered in the reservation queue.

The karaoke main body 12 comprises a bus 20, a central control unit 21, a ROM 22, a RAM 23, video display control means 24, a music performance control unit 25, a sound source (synthesizer) 26, a transceiver unit 27, an A/D conversion unit 28, a storage unit 29, determination means 30, correction means 31, specifying means 32, transposing means 33, and performance reproducing means 34.

Storage areas for a reservation queue 41, singing voice data 42, performance data 43, a determination result 44, and reference model voice data 45 are formed in the RAM 23. The storage unit 29 stores a song database (song DB) 51, a video database (video DB) 52, and a reference database (reference DB) 53. All of the components are described here, including those not directly related to the gist of the invention, to show that most of them can be carried over from conventional karaoke systems.

The central control unit 21 is a physical CPU that controls the system as a whole and executes algorithmic processing according to programs stored in the ROM 22. In addition to holding the storage areas for the reservation queue 41, singing voice data 42, performance data 43, determination result 44, and reference model voice data 45, the RAM 23 serves as a work area in which those programs are loaded and executed. It is composed of, for example, semiconductor memory, and the concept also covers memory built virtually on a hard disk.

The video display control means 24 is a program or electronic circuit that, during a performance, outputs to the display unit 13 the background video extracted from the video DB 52 and the lyrics data of the song extracted from the song DB 51. The music performance control unit 25 includes, for example, a sequencer program and drives the sound source (synthesizer) 26 according to note data, that is, the performance data extracted from the song DB 51 by song ID. The output of the sound source 26 is sent to the mixing amplifier 14 as a performance signal. The note data (performance data) is stored in the performance data storage area 43 of the RAM 23.

The transceiver unit 27 consists of the electronic circuits and programs for exchanging data with the remote input/output device 17 over a wired or wireless link (for example, an IR link or a Bluetooth (registered trademark) piconet connection). The A/D conversion unit 28 is a program that digitizes the voice signal passed from the microphone 15 through the mixing amplifier 14; the digitized voice signal is recorded as singing voice data in the singing voice data storage area 42 of the RAM 23. The A/D conversion unit 28 and the singing voice data storage area 42 together constitute the recording means.

The song DB 51 stored in the storage unit 29 holds performance data (note data) and lyrics data for each song. Specifically, it has a song table that associates song ID, song title, and artist ID (artist name), and for each song it stores a song data file, made up of karaoke note data in a given format managed by song ID (for example, MIDI (registered trademark) note data) and the like, with the song ID as the file name.

The video DB 52 stored in the storage unit 29 is a database that stores the background video data for each song, with the song ID as the file name. The reference DB 53 stored in the storage unit 29 is a database that associates each karaoke song in the song DB 51 with reference data, which is used as the evaluation standard for evaluating and analyzing a singer's performance in each sufficiently small singing section of that song. For each song, the reference data includes at least a reference pitch indicating the pitch to be sung, according to note number and velocity (loudness).

With the reference DB 53 in place, the karaoke apparatus 11 may also be provided with scoring means that scores singing using the reference data. The scoring means analyzes and scores the singing voice for each singing section of the song. It is a program that scores the voice signal, which passes from the microphone 15 through the mixing amplifier 14 and is digitized by the A/D conversion unit 28, based on the scoring reference data included in the song data. Specifically, the technique described in Japanese Patent No. 4222915, for example, can be used.

The determination means 30 is a program that, for a sung song, detects the singing pitch of the recorded singing voice data for each sufficiently small singing section as described above, extracts the corresponding reference data from the reference DB 53, calculates a pitch difference by comparison with the pitch of the note in the corresponding singing section, and determines whether the singing pitch of each singing section requires correction according to whether the pitch difference is within an allowable range (details are described with reference to FIG. 2). The determination result is stored in the determination result storage area 44 of the RAM 23 (described with reference to FIG. 5).

The correction means 31 is a program that creates reference model voice data in which each singing pitch determined by the determination means 30 to require correction is corrected based on the reference data extracted by the determination means 30. The created reference model voice data is stored in the reference model voice data storage area 45 of the RAM 23.

As one method, the specifying means 32 is a program that, based on the determination result, when the singing pitch of the singing voice data corresponding to the highest reference pitch among all singing sections of the song requires correction, specifies the singing pitch of the singing voice data corresponding to that reference pitch as the singing upper-limit pitch the singer can sing (described with reference to FIGS. 4 and 5).

As another method, the specifying means 32 is a program that, when the singing pitch of the singing voice data corresponding to the highest reference pitch among all singing sections of the song requires correction, specifies the highest singing pitch among those determined not to require correction as the singing upper-limit pitch the singer can sing (described with reference to FIGS. 6 and 7).
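The alternative method can be sketched as follows. This is only an illustration under stated assumptions: pitches are expressed in cents (100 per semitone), each judgement is a hypothetical `(reference_cents, diff_cents, verdict)` tuple, and returning `None` when no section was judged "OK" is an assumption, since the text here does not say what happens in that case.

```python
from typing import Iterable, Optional, Tuple

# Hypothetical record: (reference pitch in cents, sung minus reference in cents, "OK"/"NG")
Judgement = Tuple[float, float, str]


def specify_upper_limit_alt(judgements: Iterable[Judgement]) -> Optional[float]:
    """Pick the highest sung pitch that needed no correction.

    Returns None when the highest reference pitch was sung correctly
    (no transposition needed) or, by assumption, when nothing was "OK".
    """
    items = list(judgements)
    top = max(items, key=lambda j: j[0])  # section with the highest reference pitch
    if top[2] == "OK":
        return None
    ok_sung = [ref + diff for ref, diff, verdict in items if verdict == "OK"]
    return max(ok_sung) if ok_sung else None
```

Unlike the first method, the limit here comes from a note the singer actually hit within tolerance, so it is usually lower and leads to a larger downward transposition.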

The transposing means 33 is a program that, for a sung song, reads the determination result of the determination means 30 and the reference model voice data created by the correction means 31 from the determination result storage area 44 and the reference model voice data storage area 45 of the RAM 23, and creates transposed model voice data by transposing all singing sections of the reference model voice data downward by a transposition amount in semitone units. The amount is based on the pitch difference between the highest reference pitch among all singing sections of the sung song, obtained from the determination result, and the singing upper-limit pitch specified by the specifying means 32 (described with reference to FIGS. 4 and 5). The downward transposition amount may be zero (this point is also described with reference to FIGS. 4 and 5).

The transposing means 33 further includes a program that transposes all singing sections of the read singing voice data for the song by the same transposition amount and controls the performance reproducing means 34 so that the performance data of the song is transposed by that amount and played back.

The performance reproducing means 34 is a program that transposes the performance data of a song by a given transposition amount and plays it back (outputs it to the music performance control unit 25); the transposition amount is controlled by the transposing means 33.

FIG. 2 shows a processing flowchart of the determination means of FIG. 1 and an explanatory diagram of the determination result. In FIG. 2(A), after a given song has been sung, the singing voice data sung for that song and recorded in the singing voice data storage area 42 of the RAM 23 is acquired, together with the corresponding reference data from the reference DB 53 (step 1 (S1)), and the singing pitch of the singing voice data is detected for each singing section (S2).

Next, each singing pitch detected per singing section is compared with the pitch of the corresponding singing section in the reference data (the reference pitch) to calculate a pitch difference (S3). With the allowable range of the pitch difference set to, for example, less than ±10 cents, it is determined whether the calculated pitch difference is within the allowable range (S4). If it is outside the range, the section is judged to require correction ("NG") (S5-1); if it is within the range, it is judged not to require correction ("OK") (S5-2). These determinations are carried out for all singing sections (S6).
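Steps S3 to S6 can be sketched as below. The sketch assumes that detected and reference pitches are available as frequencies in Hz, which the text does not state; the function names are hypothetical. The cent difference uses the standard definition of 1200 cents per octave.

```python
import math

TOLERANCE_CENTS = 10.0  # example allowable range from the text: less than ±10 cents


def cents_between(sung_hz: float, reference_hz: float) -> float:
    """Signed pitch difference in cents (negative means the singer was flat)."""
    return 1200.0 * math.log2(sung_hz / reference_hz)


def judge_sections(sung_hz, reference_hz, tolerance=TOLERANCE_CENTS):
    """Return one (difference_in_cents, verdict) pair per singing section."""
    results = []
    for sung, ref in zip(sung_hz, reference_hz):
        diff = cents_between(sung, ref)                     # S3: pitch difference
        verdict = "OK" if abs(diff) < tolerance else "NG"   # S4, then S5-2 or S5-1
        results.append((diff, verdict))
    return results                                          # S6: every section judged
```

A section sung exactly on the reference pitch yields a difference of 0 cents and "OK"; one sung 30 cents flat yields roughly -30 and "NG".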

In the determination result of the determination means 30, a note that has been sung at the correct pitch even once anywhere in the song (that is, with a difference from the reference pitch within the allowable range and a pitch determination of "OK") is regarded as singable, in other words as being at or below the singing upper-limit note. When the same note is judged more than once, the smallest of the resulting differences is used.
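The per-note rule above might be sketched as follows. It assumes that "smallest difference" means smallest in absolute value, which the text does not spell out; under that reading, a note is "OK" exactly when at least one of its occurrences was within tolerance. The function name and the tuple layout are hypothetical.

```python
def summarize_by_note(judgements, tolerance=10.0):
    """Collapse repeated judgements of the same note.

    judgements: iterable of (note_name, diff_cents) pairs, one per occurrence.
    Returns {note_name: (best_diff_cents, verdict)}.
    """
    best = {}
    for note, diff in judgements:
        # Keep the occurrence closest to the reference pitch.
        if note not in best or abs(diff) < abs(best[note]):
            best[note] = diff
    return {
        note: (diff, "OK" if abs(diff) < tolerance else "NG")
        for note, diff in best.items()
    }
```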

FIG. 2(B) shows an example of the determination result of the determination means 30: the pitch differences and determination results for, for example, five notes, in descending order from the highest note to be sung, "G5". In MIDI equipment, it is usual to denote the middle "do" (C) of the piano keyboard (88 keys) as "C4", the highest key, a "do", as "C8", and the lowest key, a "la", as "A0". Although not shown, the values of the reference pitch and singing pitch that were judged are attached to the determination result.
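The note naming convention described above can be made concrete with a small helper. It assumes the common mapping in which "C4" is MIDI note number 60, which is consistent with "A0" and "C8" being the ends of an 88-key keyboard; the 440 Hz tuning for A4 is a further assumption not taken from the text, and the function names are hypothetical.

```python
NOTE_OFFSETS = {
    "C": 0, "C#": 1, "D": 2, "D#": 3, "E": 4, "F": 5,
    "F#": 6, "G": 7, "G#": 8, "A": 9, "A#": 10, "B": 11,
}


def note_to_midi(name: str) -> int:
    """Convert a name such as 'G5' or 'F#5' to a MIDI note number (C4 = 60)."""
    pitch_class, octave = name[:-1], int(name[-1])
    return 12 * (octave + 1) + NOTE_OFFSETS[pitch_class]


def midi_to_hz(note_number: int, a4_hz: float = 440.0) -> float:
    """Equal-tempered frequency of a MIDI note number (A4 = note 69)."""
    return a4_hz * 2 ** ((note_number - 69) / 12)
```

With this mapping, one semitone corresponds to a step of 1 in note number and 100 cents in pitch, so "F#5" is the note directly below "G5".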

FIG. 3 shows a processing flowchart of the specifying means of FIG. 1. In FIG. 3, the specifying means 32 reads from the RAM 23 the determination result produced by the determination means 30 for the sung song, and first refers to the determination result for the singing pitch of the singing voice data corresponding to the highest pitch (reference pitch) among all singing sections of the song (S11). If that determination result is "no correction required" (S12), the process ends on the basis that the reference model voice data created by the correction means 31 is not to be transposed.

If, on the other hand, correction is required (S12), the pitch of the singing voice data corresponding to the highest reference pitch among all singing sections of the song is specified as the singing upper limit pitch that the singer can sing (S13). In the determination result shown in FIG. 2(B), the pitch difference between the highest reference pitch among all singing sections (the pitch of note number "G5") and the singing pitch of the corresponding singing voice data is "-30 cents", and the pitch determination is "NG" (correction required). The singing pitch of that singing voice data (the pitch of note number "G5" minus 30 cents) is therefore specified as the singing upper limit pitch that the singer can sing.

This way of specifying the singing upper limit pitch rests on the following reasoning: a singer who reached "-30 cents" at note number "G5" should be able to sing "F#5", one semitone (100 cents) below "G5", and every note below it without difficulty. Accordingly, even if the pitch determinations for note numbers "F5" and "E5" are "NG", those determinations have no bearing on the singing upper limit pitch.

FIG. 4 shows a processing flowchart of the transposing means of FIG. 1, and FIG. 5 is an explanatory diagram of the processing of the transposing means of FIG. 4. In FIG. 4, the transposing means 33 reads the determination result produced by the determination means 30 from the determination result storage area 44 of the RAM 23 (S21). From the read determination result, it determines the transposition amount on the basis of the singing upper limit pitch specified by the specifying means 32, that is, on the basis of the pitch difference of the singing pitch of the singing voice data corresponding to the highest reference pitch among all singing sections (S22). As shown in FIG. 5, the pitch difference between the reference pitch of the highest note number "G5" among all singing sections and the singing pitch of the corresponding singing voice data is obtained as "-30 cents", and the transposition amount that covers this pitch difference is determined to be one semitone, "100 cents".

Then, using the determined transposition amount, all singing sections of the reference model voice data corrected by the correction means are transposed downward to create the transposition model voice data (S23). As shown in FIG. 5, all singing sections of the reference model voice data are transposed downward by the determined one semitone (100 cents), so that the highest note number becomes "F#5". In other words, transposing down by one semitone (100 cents) makes "F#5" the highest note to be sung, so that even after the pitch of the highest note, which was "NG", is corrected, the corrected pitch does not exceed the singer's singing upper limit pitch. The result is useful as transposition model voice data that the singer can actually sing.

If the pitch difference between note number "G5" and the corresponding singing pitch were, for example, "-110 cents (NG)", transposing so that the highest note number becomes "F#5" would still leave the singer, in theory, limited to "-10 cents" relative to "F#5" and unable to sing at the correct pitch. In that case the data is transposed down by two semitones (200 cents) so that the highest note number becomes "F5".

Note that the downward transposition amount here includes a transposition amount of zero. For example, if the pitch difference between the singing pitch of the highest note and the reference pitch is positive and the determination is "NG" (correction required), the singing pitch is corrected downward, so the corrected pitch remains at or below the singing upper limit and no transposition is needed (transposition amount zero). In practice, therefore, all singing sections of the reference model voice data are transposed downward by a semitone-unit transposition amount only when the pitch difference between the highest reference pitch and the singing upper limit pitch is negative, and the amount is based on that difference.
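The transposition rule of this first approach can be summarized in a short sketch. It assumes that "the transposition amount that covers the pitch difference" means rounding the magnitude of a negative difference up to the next whole semitone, which matches the two worked examples (-30 cents gives one semitone, -110 cents gives two); `transposition_semitones` is a hypothetical name.

```python
import math


def transposition_semitones(diff_cents: float, verdict: str) -> int:
    """Semitones to transpose downward, given the judgement of the highest note.

    diff_cents: singing pitch minus reference pitch for the highest
    reference note, in cents. verdict: "OK" or "NG".
    """
    if verdict == "OK":
        return 0  # highest note sung correctly: no transposition
    if diff_cents >= 0:
        return 0  # singer was sharp: correction lowers the pitch
    # Singer was flat: cover the shortfall with whole semitones.
    return math.ceil(-diff_cents / 100)
```

Multiplying the returned value by 100 gives the transposition amount in cents used in the description.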

Next, FIG. 6 shows another processing flowchart of the specifying means of FIG. 1, and FIG. 7 is an explanatory diagram of the processing of the transposing means based on the processing of FIG. 6. In FIG. 6, the specifying means 32 reads from the RAM 23 the determination result produced by the determination means 30 for the sung song, and first refers to the determination result for the singing pitch of the singing voice data corresponding to the highest pitch (reference pitch) among all singing sections of the song (S31). If that determination result is "no correction required" (S32), the process ends on the basis that the reference model voice data created by the correction means 31 is not to be transposed.

If, on the other hand, correction is required (S32), the highest singing pitch among the singing pitches judged not to require correction in the determination result is specified as the singing upper limit pitch that the singer can sing (S33). From the determination result shown in FIG. 7 (identical to FIG. 2(B)), the highest of the singing pitches not requiring correction is specified as the singing upper limit pitch, and note number "D5" becomes the basis for determining the transposition amount.

This way of specifying the singing upper limit pitch rests on the following reasoning: although the singer reached "-30 cents" at "G5", the singer never once sang "F5" or "E5" at the correct pitch, that is, the difference from the reference pitch was outside the allowable range and the pitch determination result was "NG". This indicates that, at least when singing the melody of this song, the singer has difficulty singing notes at or above "D#5" at the correct pitch, and that "D5" is the highest note the singer can sing at the correct pitch.

Accordingly, as shown in FIG. 7, the transposing means 33 calculates from the determination result the pitch difference between the pitch value of the singing upper limit pitch specified by the specifying means and the pitch value of the highest reference pitch among all singing sections, and determines, in semitone units corresponding to that pitch difference, a transposition amount of five semitones ("500 cents") that lowers note "G5" to "D5".

Then, using the determined transposition amount, all singing sections of the reference model voice data corrected by the correction means are transposed downward to create the transposition model voice data. As shown in FIG. 7, transposing down by five semitones (500 cents) makes "D5" the highest note to be sung, so that none of the corrected pitches of the notes that were "NG" exceeds the singer's singing upper limit pitch (the pitch of note "D5"). The result is useful as transposition model voice data that the singer can actually sing.
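The second approach (FIG. 6 and FIG. 7) can be sketched as below. The description gives only some of the values in FIG. 2(B), so the differences used for "F5", "E5", "D5" and "C5" in the usage note are hypothetical, as are the function names. Returning `None` when no note was judged "OK" is also an assumption, since the description does not cover that case.

```python
NOTE_OFFSETS = {
    "C": 0, "C#": 1, "D": 2, "D#": 3, "E": 4, "F": 5,
    "F#": 6, "G": 7, "G#": 8, "A": 9, "A#": 10, "B": 11,
}


def note_to_midi(name: str) -> int:
    """Note name to MIDI note number, assuming C4 = 60."""
    return 12 * (int(name[-1]) + 1) + NOTE_OFFSETS[name[:-1]]


def transposition_from_highest_ok(results):
    """results: {note_name: (diff_cents, "OK" or "NG")} for the whole song.

    Returns the number of semitones to transpose downward, or None if
    no note was judged "OK" (a case the description does not address).
    """
    highest_ref = max(results, key=note_to_midi)
    if results[highest_ref][1] == "OK":
        return 0  # S32: highest note needs no correction
    ok_notes = [n for n, (_, verdict) in results.items() if verdict == "OK"]
    if not ok_notes:
        return None
    upper_limit = max(ok_notes, key=note_to_midi)  # S33
    return note_to_midi(highest_ref) - note_to_midi(upper_limit)
```

With `{"G5": (-30, "NG"), "F5": (-40, "NG"), "E5": (-25, "NG"), "D5": (5, "OK"), "C5": (-3, "OK")}` as input, the upper limit note is "D5" and the function returns 5, corresponding to the 500 cent transposition in the description.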

In the song selection registration means 17B of the remote input/output device 17, a display is provided as selection means that allows the user, at the time of song selection, to choose either the creation and playback of model voice data for the user's own singing after the song, or the playback of voice data in which the user's own singing has been transposed to a singable pitch. For example, when playback of the transposition model voice data and the transposed performance data is selected, the transposing means 33 outputs the created transposition model voice data to the music performance control unit 25 and controls the performance reproduction means 34 so that it transposes the performance data of the song by the determined transposition amount. The performance reproduction means 34 reads the performance data of the song from the performance data storage area 43 of the RAM 23, transposes it by the transposition amount instructed by the transposing means 33, and outputs it to the music performance control unit 25 for playback.

When playback of the transposed singing voice data and performance data is selected, the transposing means 33 reads the singing voice data from the singing voice data storage area 42, transposes it by the transposition amount, and outputs it to the music performance control unit 25. It also controls the performance reproduction means 34 so that it transposes the performance data of the song by the determined transposition amount. The performance reproduction means 34 reads the performance data of the song from the performance data storage area 43 of the RAM 23, transposes it by the transposition amount instructed by the transposing means 33, and outputs it to the music performance control unit 25 for playback.
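As a rough illustration of applying one transposition amount to both kinds of data, the sketch below assumes that the performance data can be treated as a list of MIDI-style note numbers and that the voice data is shifted by a frequency ratio. Neither the data format nor the pitch-shifting method is specified in this passage, so both are assumptions and the function names are hypothetical.

```python
def semitone_ratio(semitones_down: int) -> float:
    """Frequency ratio for a downward shift in equal temperament."""
    return 2.0 ** (-semitones_down / 12.0)


def transpose_note_numbers(note_numbers, semitones_down: int):
    """Shift every note number of the performance data downward."""
    return [n - semitones_down for n in note_numbers]
```

A five-semitone shift, for example, maps note numbers 79, 77 and 76 to 74, 72 and 71. Shifting recorded audio by `semitone_ratio` without changing its duration would additionally require a pitch-shifting algorithm, which is outside the scope of this sketch.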

That is, the user can compare by ear his or her own singing before and after correction with a singable model performance. Because playback uses the user's own voice in a singable vocal range, this is convenient for the user.

As described above, the transposition model voice data is created so that the pitch of the highest note of the corrected reference model voice data does not exceed the singer's upper limit pitch. As a result, from a single performance by the singer, model voice data in the singer's own voice can be provided in a form that is singable within the singer's vocal range.

The karaoke apparatus of the present invention is applicable to industries that manufacture and use apparatuses provided with means for generating and providing model singing voice data for a singer's singing.

DESCRIPTION OF SYMBOLS
11 Karaoke apparatus
12 Karaoke main body
17 Remote input/output device
25 Music performance control unit
28 A/D conversion unit
30 Determination means
31 Correction means
32 Specifying means
33 Transposing means
34 Performance reproduction means
44 Determination result storage area
45 Reference model voice data storage area
53 Reference database

Claims

1. A karaoke apparatus provided with reference data serving as an analysis standard for analyzing a singer's singing for each singing section, the reference data associating at least a reference pitch, which indicates the pitch to be sung, with each singing section of a song, the karaoke apparatus comprising:
recording means for recording the singer's singing voice for a predetermined song as singing voice data;
determination means for detecting, for each singing section of the song, the singing pitch of the recorded singing voice data, calculating a pitch difference by comparison with the corresponding reference pitch of the reference data, and determining, according to whether the pitch difference is within an allowable range, whether the singing pitch of the singing section requires correction or does not require correction;
correction means for creating reference model voice data in which the singing pitch of each singing section judged by the determination means to require correction is corrected on the basis of the corresponding reference pitch of the reference data;
specifying means for specifying, when the determination result indicates that the singing pitch of the singing voice data corresponding to the highest reference pitch among all singing sections of the song requires correction, the singing pitch of the singing voice data corresponding to that reference pitch as the singing upper limit pitch that the singer can sing; and
transposing means for creating transposition model voice data by transposing all singing sections of the reference model voice data downward by a semitone-unit transposition amount based on the pitch difference between the highest reference pitch among all singing sections of the song and the singing upper limit pitch specified by the specifying means.

2. A karaoke apparatus provided with reference data serving as an analysis standard for analyzing a singer's singing for each singing section, the reference data associating at least a reference pitch, which indicates the pitch to be sung, with each singing section of a song, the karaoke apparatus comprising:
recording means for recording the singer's singing voice for a predetermined song as singing voice data;
determination means for detecting, for each singing section of the song, the singing pitch of the recorded singing voice data, calculating a pitch difference by comparison with the corresponding reference pitch of the reference data, and determining, according to whether the pitch difference is within an allowable range, whether the singing pitch of the singing section requires correction or does not require correction;
correction means for creating reference model voice data in which the singing pitch of each singing section judged by the determination means to require correction is corrected on the basis of the corresponding reference pitch of the reference data;
specifying means for specifying, when the determination result indicates that the singing pitch of the singing voice data corresponding to the highest reference pitch among all singing sections of the song requires correction, the highest singing pitch among the singing pitches judged in the determination result not to require correction as the singing upper limit pitch that the singer can sing; and
transposing means for creating transposition model voice data by transposing all singing sections of the reference model voice data downward by a semitone-unit transposition amount based on the pitch difference between the highest reference pitch among all singing sections of the song and the singing upper limit pitch specified by the specifying means.

3. The karaoke apparatus according to claim 1 or 2, further comprising performance reproduction means for transposing and reproducing the performance data of the song by the transposition amount determined by the transposing means,
wherein the transposing means further transposes all singing sections of the recorded singing voice data by the transposition amount and controls the performance reproduction means so as to transpose and reproduce the performance data of the song by the transposition amount.

4. The karaoke apparatus according to claim 3, further comprising selection means for selecting, for listening, either the transposed singing voice data and performance data or the transposition model voice data and the transposed performance data.