JPH08286684A

JPH08286684A - Interval evaluation device and karaoke evaluation device

Info

Publication number: JPH08286684A
Application number: JP7088452A
Authority: JP
Inventors: Takeshi Motai; 健馬渡; Yukashi Shimokawa; 由加志下川; Toshiaki Izawa; 利明伊澤
Original assignee: Pioneer Electronic Corp
Current assignee: Pioneer Corp
Priority date: 1995-04-13
Filing date: 1995-04-13
Publication date: 1996-11-01
Anticipated expiration: 2019-01-19
Also published as: CN1140293A; CN1107942C; JP3487950B2

Abstract

PURPOSE: To evaluate karaoke singing using, as the reference, a source other than an audio-multiplex signal. CONSTITUTION: The device comprises a tone detection means 2 that detects the pitch of a first audio signal corresponding to a comparison voice; a plurality of band-pass filters 5, each having a bandwidth obtained by dividing, at a prescribed musical interval, a frequency range of prescribed width that includes the pitch of the first audio signal, and each passing the band signal within its bandwidth from a second audio signal corresponding to a reference voice; a level comparison means 6 that compares the amplitudes of the band signals passed by the filters 5; and an evaluation means 1a that evaluates the interval of the comparison voice relative to the reference voice on the basis of the result from the means 6.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a so-called karaoke playback device, and more particularly to a karaoke scoring device having a function of scoring a user's singing.

[0002]

[Prior Art] Karaoke scoring devices that score the quality of a user's singing on a karaoke playback device are known. A conventional karaoke scoring device scores or evaluates the user's singing by detecting a guide vocal that is contained in only one channel of the stereo audio reproduced from an optical disc or similar medium recorded in an audio-multiplex format for karaoke.

[0003] FIG. 10 shows an example of a conventional karaoke scoring device. Karaoke audio reproduced from a medium such as a laser disc is stereo audio with a right channel (Rch) and a left channel (Lch) recorded in an audio-multiplex format. For example, Lch contains only the accompaniment, while Rch contains the accompaniment together with a guide vocal that leads the user's singing. The signals of both channels are converted into digital signals by A/D converters 99a and 99b. The subtracter 91 then subtracts one channel from the other, which separates the guide vocal from the accompaniment, and the first tone detection means 93 detects the fundamental frequency of the guide vocal.

[0004] The user's singing is picked up by the microphone 92 and converted into a digital signal by the A/D converter 94, after which the second tone detection means 95 detects the fundamental frequency of the user's singing.

[0005] The first and second tone detection means 93 and 95 detect, for example, the spacing between peak values or between zero-crossing points in the input audio signal and convert it into a fundamental frequency. The comparison means 96 compares the fundamental frequency of the guide vocal with the frequency of the user's singing to detect the interval (that is, the difference in pitch between the two sounds). The calculation means 97 computes a score from the interval between the two voices and displays the scoring result on the display device 98.
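
The zero-crossing approach mentioned above can be sketched as follows. This is a minimal illustration under stated assumptions, not the circuit of FIG. 10: the function names, the use of rising crossings only, the linear interpolation, and the expression of the interval in equal-tempered semitones are all choices made here for the example.

```python
import math

def zero_crossing_frequency(samples, sample_rate):
    """Estimate the fundamental frequency from the spacing of rising zero crossings."""
    crossings = []
    for i in range(1, len(samples)):
        prev, cur = samples[i - 1], samples[i]
        if prev < 0.0 <= cur:
            # Linear interpolation gives a sub-sample crossing position.
            crossings.append((i - 1) + (-prev) / (cur - prev))
    if len(crossings) < 2:
        return None  # not enough crossings to measure a period
    mean_period = (crossings[-1] - crossings[0]) / (len(crossings) - 1)
    return sample_rate / mean_period

def interval_semitones(f_user, f_guide):
    """Signed interval of the user's pitch relative to the guide, in semitones."""
    return 12.0 * math.log2(f_user / f_guide)

# Hypothetical test tone: a clean 440 Hz sine sampled at 8 kHz.
RATE = 8000
tone = [math.sin(2 * math.pi * 440.0 * n / RATE + 0.3) for n in range(800)]
f_est = zero_crossing_frequency(tone, RATE)
```

On a clean single tone the estimate lands close to the true frequency. On a mixture of voice and accompaniment the crossings no longer track a single period, which is one reason the conventional device must first isolate the guide vocal.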

[0006] When several people use such a conventional karaoke scoring device, each person is scored in turn and the results are then used to decide who sang better.

[0007]

[Problems to Be Solved by the Invention] As described above, the conventional karaoke scoring device subtracts the audio signals of the left and right channels. It is therefore difficult to separate the guide vocal reliably unless audio-multiplex karaoke software is used in which the guide vocal is recorded on only one of the stereo channels.

[0008] Even with audio-multiplex karaoke software, however, the accompaniment level may differ between the left and right channels, or their phases may shift, during recording and playback. Some karaoke software also records entirely different performance content on the left and right channels. With such software, simply subtracting the left and right audio signals from each other does not extract the guide vocal.
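
The weakness of channel subtraction can be illustrated numerically. The sketch below assumes synthetic sine waves for the accompaniment and the guide vocal and a hypothetical 20 % level mismatch between channels; none of these values come from the patent.

```python
import math

def subtract_channels(rch, lch):
    """Conventional extraction: Rch (accompaniment + guide vocal) minus Lch (accompaniment)."""
    return [r - l for r, l in zip(rch, lch)]

def rms(x):
    return math.sqrt(sum(v * v for v in x) / len(x))

RATE, N = 8000, 800
accomp = [0.5 * math.sin(2 * math.pi * 220.0 * n / RATE) for n in range(N)]
vocal = [0.8 * math.sin(2 * math.pi * 440.0 * n / RATE) for n in range(N)]

lch = accomp
rch_ideal = [a + v for a, v in zip(accomp, vocal)]           # identical accompaniment level
rch_mismatch = [0.8 * a + v for a, v in zip(accomp, vocal)]  # assumed 20 % level mismatch

# Residual accompaniment left in the "extracted" guide vocal.
err_ideal = rms([d - v for d, v in zip(subtract_channels(rch_ideal, lch), vocal)])
err_mismatch = rms([d - v for d, v in zip(subtract_channels(rch_mismatch, lch), vocal)])
```

With matched channels the residual is essentially zero. With the mismatch, a fifth of the accompaniment survives the subtraction, and a phase shift or different channel content would leave even more.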

[0009] A user may also wish to have karaoke scored against software containing an ordinary music recording. Moreover, karaoke is usually enjoyed by several people, and it is entertaining for several people to sing at the same time and compete on singing skill. The conventional karaoke scoring device, however, can score only one singer at a time, so singers have had to be ranked by comparing their individual results afterwards. This does not allow a style of play in which singers compete simultaneously.

[0010] In view of these problems, a first object of the present invention is to provide an interval evaluation device and a karaoke scoring device that can evaluate an audio signal regardless of the type of audio signal. A second object is to provide a karaoke scoring device with which several people can compare their singing.

[0011]

[Means for Solving the Problems] The interval evaluation device according to claim 1 comprises:
(a) a first tone detection means that receives a first audio signal corresponding to a first voice and detects the pitch of the first voice (for example, by computing the frequency from the spacing of amplitude peaks or of zero-crossing points in the audio signal, or by using a group of band-pass filters);
(b) a plurality of band-pass filters (for example, a plurality of digital filters configured on a DSP (Digital Signal Processor)), each having a bandwidth obtained by dividing, at a predetermined interval (a quarter tone, a semitone, etc.), a frequency band of predetermined width that includes the pitch of the first voice (for example, a range of ±1 whole tone or ±1 semitone centered on the pitch of the first voice), and each passing the band signal within its bandwidth from a second audio signal corresponding to a supplied second voice (for example, karaoke audio reproduced by a playback device, an instrumental performance, or an ordinary music source);
(c) a second tone detection means that compares the amplitudes of the band signals passed by the band-pass filters and detects the band signal with the maximum amplitude; and
(d) an evaluation means (for example, a computer) that evaluates the interval of the first voice on the basis of the pitch of the first voice and the band signal with the maximum amplitude.
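
Elements (b) to (d) can be sketched end to end as follows. This is an illustration under assumptions, not the claimed implementation: the Goertzel algorithm stands in for each band-pass filter, the pitch of the first voice is taken as already detected by element (a), the bands are spaced in equal-tempered semitones over ±1 whole tone, and all function names are hypothetical.

```python
import math

def goertzel_amplitude(samples, sample_rate, freq):
    """Amplitude of the component near `freq` (stand-in for one band-pass filter)."""
    coeff = 2.0 * math.cos(2.0 * math.pi * freq / sample_rate)
    s1 = s2 = 0.0
    for x in samples:
        s1, s2 = x + coeff * s1 - s2, s1
    power = s1 * s1 + s2 * s2 - coeff * s1 * s2
    return 2.0 * math.sqrt(max(power, 0.0)) / len(samples)

def band_centers(first_pitch_hz, step_semitones=1.0, span_semitones=2.0):
    """(b) Band centers covering +/- span around the first voice's pitch."""
    count = int(round(span_semitones / step_semitones))
    offsets = [k * step_semitones for k in range(-count, count + 1)]
    return [(off, first_pitch_hz * 2.0 ** (off / 12.0)) for off in offsets]

def evaluate_interval(first_pitch_hz, second_signal, sample_rate):
    """(c)+(d) Offset, in semitones, of the strongest band relative to the first voice."""
    bands = band_centers(first_pitch_hz)
    amplitudes = [goertzel_amplitude(second_signal, sample_rate, f) for _, f in bands]
    best = max(range(len(bands)), key=lambda i: amplitudes[i])
    return bands[best][0]

# Hypothetical second voice: guide vocal one semitone above 440 Hz plus a quieter accompaniment note.
RATE = 8000
guide_hz = 440.0 * 2.0 ** (1.0 / 12.0)
reference = [math.sin(2 * math.pi * guide_hz * n / RATE)
             + 0.5 * math.sin(2 * math.pi * 349.23 * n / RATE) for n in range(800)]
offset = evaluate_interval(440.0, reference, RATE)
```

In this example the strongest band sits one semitone above the singer's detected pitch, so the singer would be judged one semitone flat. The accompaniment note lies outside the narrow band and does not influence the result.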

[0012] The second audio signal may be a signal reproduced from a recording medium on which an audio-multiplex signal is recorded, or an ordinary stereo or monaural signal. The evaluation performed by the evaluation means may take the form of a score assigned according to whether the pitch of the first voice matches the pitch of the second voice, or a display of the result of judging whether the two pitches match.

[0013] When a guide vocal or performance contained in an audio-multiplex signal is used as the first voice, and a singer's singing or a practice performance input from a microphone is used as the second voice, the second voice may be evaluated instead of the first. In other words, the voice from which the desired sound can be extracted more reliably (the voice with less contamination from other sounds) is taken as the first voice, and the voice whose pitch is unstable and needs evaluating is taken as the subject of evaluation.

[0014] Any computation may be used for the evaluation by the evaluation means. For example, if the first voice is singing, the device can be applied to karaoke scoring. In the interval evaluation device according to claim 2, the first tone detection means comprises: (a) a plurality of band-pass filters (for example, digital filters configured on a DSP), each having a bandwidth corresponding to a predetermined interval (a quarter tone, a semitone, etc.) and each passing the band signal within its bandwidth from the first audio signal corresponding to the supplied first voice (for example, a singer's singing or an instrumental performance); and (b) a tone identification means that identifies the pitch of the first voice by comparing the amplitudes of the band signals passed by the plurality of band-pass filters.

[0015] The interval evaluation device according to claim 3 is the device of claim 1 or claim 2 in which the frequency band of predetermined width that includes the pitch of the first voice is set approximately symmetrically about the pitch of the first voice. For example, it is set to a range of ±1 whole tone or ±1 semitone relative to the pitch of the first voice.

[0016] The karaoke scoring device according to claim 4 includes the interval evaluation device according to any one of claims 1 to 3, uses a singer's singing as the first voice and an externally supplied reference voice as the second voice, and has the evaluation means score the singer's singing on the basis of the interval obtained by the comparison.

[0017] The karaoke scoring device according to claim 5 is the device of claim 4 further comprising a plurality of sound conversion means (for example, microphones or electronic musical instruments) that each convert the singing voice of one of a plurality of singers into a singing voice signal, and a selection means (a selector or the like) that selects one of the plurality of singing voice signals and outputs it as the first voice. Each time one note of the first voice is detected, the evaluation means switches the selection means to the next input, and scores the singing separately for the singing voice of each singer.

[0018] The evaluation means of claim 4 or claim 5 may judge the deviation between the pitches of the two voices and score by, for example, displaying that deviation. Other scoring methods are also possible, such as deducting more points the further the singing voice departs from the reference voice, or adding points whenever the singing voice matches the reference voice.

[0019]

[Operation] In the interval evaluation device according to claim 1, (a) the first tone detection means receives the first audio signal corresponding to the first voice and detects the pitch of the first voice.

[0020] Known methods can be used to detect this pitch, for example detecting the frequency from the spacing of signal amplitude peaks or of zero-crossing points, or using a group of band-pass filters. The first voice is preferably one that is unlikely to be contaminated by other sounds, such as a singer's singing or an instrumental performance.

[0021] (b) The plurality of band-pass filters each have a bandwidth obtained by dividing, at a predetermined interval, a frequency band of predetermined width that includes the pitch of the first voice. Each filter passes the band signal within its bandwidth from the second audio signal corresponding to the supplied second voice.

[0022] Because the frequency band of predetermined width is a narrow band that includes the pitch of the first voice, the band signals are detected well regardless of the type of the second voice (audio-multiplex signal or otherwise), provided the pitch of the second voice is close to that of the first voice (as is the case, for example, for a karaoke guide vocal and the singer's singing).

[0023] (c) The second tone detection means compares the amplitudes of the band signals passed by the band-pass filters and detects the band signal with the maximum amplitude. A preferred way to report the result is to supply the evaluation means with an index number assigned to each band signal for convenience.

[0024] (d) The evaluation means evaluates the second voice on the basis of the pitch of the first voice and the band signal with the maximum amplitude. For example, it checks whether the pitch corresponding to the detected band signal matches the pitch of the first voice, or how large the interval between them is.

[0025] The device is therefore suited to interval evaluation in cases where the first voice (singing) follows a reference second voice (guide vocal), as in a karaoke scoring device. In the first tone detection means of the interval evaluation device according to claim 2, (a) the plurality of band-pass filters each have a bandwidth corresponding to a predetermined interval and each pass the band signal within its bandwidth from the first audio signal corresponding to the supplied first voice.

[0026] (b) The tone identification means identifies the pitch of the first voice by comparing the amplitudes of the band signals passed by the plurality of band-pass filters. The narrower the bandwidth used to detect each band signal, the more precisely the pitch of the first voice can be identified. Once the pitch of the first voice has been detected, the second voice lying within the frequency band of predetermined width centered on that pitch is detected from the amplitude of the band signal in each bandwidth.

[0027] In the interval evaluation device according to claim 3, the frequency range in which the second voice is detected is set approximately symmetrically toward the higher and lower sides of the pitch of the first voice. It is therefore suited to evaluating, for example, a singing voice (first voice) that follows a guide vocal (second voice) but drifts sharp or flat.

[0028] In the karaoke scoring device according to claim 4, the singer's singing used as the first voice is normally input from a microphone or the like, so it contains little contamination from other sounds and is well suited to pitch detection. The reference voice used as the second voice is normally supplied from a playback device or the like, so its tone is stable; it corresponds to, for example, the guide vocal that the singer uses as a model.

[0029] Because the singer follows this guide vocal, the singer's voice and the reference voice (guide vocal or the like) have pitches close to each other. The very narrow frequency band set around the pitch of the singer's voice (first voice) therefore contains the pitch of the guide vocal in the reference voice. The other sounds in the reference voice (accompaniment and the like) are separated from the sound to be detected (guide vocal or the like) by a consonant interval (at least a minor third), so they are not detected as the second voice. The actual detection of the first and second voices proceeds as described above for claims 1 to 3. The evaluation means scores the singer's singing on the basis of the interval obtained by the comparison.

[0030] In the karaoke scoring device according to claim 5, the plurality of sound conversion means each convert the singing voice of one of the plurality of singers into a singing voice signal. The selection means selects one of the plurality of singing voice signals and outputs it as the first voice.

[0031] Each time one note of the first voice is detected, the evaluation means switches the selection means to the next input, and scores the singing separately for the singing voice of each singer. In addition to the scoring methods described above, the scores may be compared after the reference performance has ended, with the singer having the higher (or lower) score declared the winner (or loser).
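
The note-by-note switching of the selection means can be sketched as a round-robin over the singers. The data layout (one dictionary of match flags per detected note) and the percentage score are assumptions made for this example; the patent does not specify them.

```python
def multiplexed_scores(match_by_note, singers):
    """match_by_note[i][s] is True if singer s matched the reference on note i.

    Only the singer currently selected is evaluated on each note,
    mirroring a selector that advances every time one note is detected.
    """
    tally = {s: [0, 0] for s in singers}  # [matches, evaluated notes]
    for i, note in enumerate(match_by_note):
        selected = singers[i % len(singers)]  # selector advances per detected note
        tally[selected][1] += 1
        if note[selected]:
            tally[selected][0] += 1
    return {s: (100.0 * m / n if n else 0.0) for s, (m, n) in tally.items()}

# Hypothetical session with two singers and four detected notes.
singers = ["A", "B"]
notes = [
    {"A": True, "B": False},
    {"A": True, "B": True},
    {"A": True, "B": True},
    {"A": True, "B": False},
]
scores = multiplexed_scores(notes, singers)
winner = max(scores, key=scores.get)
```

Each singer is judged on only a share of the notes, which is the price of evaluating several singers with a single tone detection path.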

[0032]

[Embodiments] Preferred embodiments of the device of the present invention are described below with reference to the drawings.

(I) First Embodiment

The first embodiment relates to the karaoke scoring device of claim 4 that includes the interval evaluation device of claim 1 or claim 3.

Description of Configuration

FIG. 1 shows the configuration of the karaoke scoring device 100 of the first embodiment.

[0033] The microphone M₀ converts the singer's voice into an electrical signal. The microcomputer 1a determines the interval between the two voices from the tone signal T_C detected by the tone detection unit 2 and the tone signal T_S output by the level comparison unit 6, and performs scoring. It also outputs to the band-pass filter group 5 a filter instruction signal S_FS that specifies the filter characteristics (center frequency and bandwidth).

[0034] The tone detection unit 2 detects the pitch of the singer's voice from the audio signal, which has been digitized via an amplifier and the A/D converter AD₀, and supplies it to the microcomputer 1a as the tone signal T_C.

[0035] The reference voice is karaoke audio reproduced from a video disc, a compact disc (CD), or the like. The adder 3 sums the two stereo audio signals (Lch and Rch) supplied via amplifiers, converting them into a monaural signal. The supplied stereo audio signals are digital signals.

[0036] The delay circuit 4 compensates for the difference in processing time between tone detection for the comparison voice and tone detection for the reference voice. The delay circuit 4 may be inserted on the comparison-voice side as well as on the reference-voice side, if necessary.
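
A delay of this kind is commonly realized in digital form as a fixed-length first-in, first-out buffer. The sketch below is one such realization and is an assumption; the patent does not describe the internal structure of the delay circuit 4, and the class name and delay length are hypothetical.

```python
from collections import deque

class DelayLine:
    """Delays a sample stream by a fixed number of samples."""

    def __init__(self, delay_samples):
        # Pre-fill with silence so the first outputs are zeros.
        self._buf = deque([0.0] * delay_samples)

    def process(self, x):
        if not self._buf:
            return x  # zero delay: pass the sample straight through
        self._buf.append(x)
        return self._buf.popleft()

delay = DelayLine(3)
delayed = [delay.process(x) for x in [1.0, 2.0, 3.0, 4.0, 5.0]]
```

The delay length would be chosen to match the time the comparison-voice path needs to produce a pitch estimate, so that both paths refer to the same note.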

[0037] The band-pass filter group 5 and the level comparison unit 6 together form a tone detection unit. The band-pass filter group 5 is implemented on a DSP. Based on the filter control signal S_FS supplied from the microcomputer 1a, the DSP realizes a plurality of band-pass filters as digital filters, all of which operate simultaneously. Each band-pass filter passes, from the reference audio signal, the frequency components in a band of predetermined width whose center frequency is a different pitch, and outputs the band audio signal corresponding to that band. A DSP is used for the band-pass filters because the filter characteristics must be changed in real time according to the content of the audio signal. In this embodiment, five band-pass filters are configured at a time (each band-pass filter configured in this way is hereinafter called a "sub-filter").

[0038] The level comparison unit 6 compares the outputs of the sub-filters with one another and outputs the number of the sub-filter with the highest level to the microcomputer 1a as the tone signal T_S.

Description of Operation

The operation of the first embodiment is described next, divided into the two tone detection operations and the scoring process.

[0039] Assume that the note A is being supplied as the karaoke reference voice and that the singer is singing along with it.

i) Tone detection for the comparison voice

First, the tone detection unit 2 detects the pitch of the singer's voice input through the microphone M₀, that is, the pitch of the comparison voice. Known methods can be used for this, such as detecting the frequency from the spacing of signal amplitude peaks or of zero-crossing points. The fundamental frequency or pitch detected by the tone detection unit 2 is output to the microcomputer 1a as a tone signal.

[0040] ii) Tone detection for the reference voice

Once the pitch of the comparison voice has been detected by the above procedure, the pitch of the supplied reference voice is detected.

[0041] The microcomputer 1a sets, in the band-pass filter group 5, the frequency band (detection range) in which the reference voice is to be detected, with the detected note of the comparison voice (the note A) as the center note. For example, it configures a group of band-pass filters spaced at a predetermined interval (every quarter tone, every semitone, etc.).

[0042] In FIG. 4, a frequency band covering a range of ±1 whole tone relative to the center note R is set. (A) shows sub-filters placed every quarter tone and (B) shows sub-filters placed every semitone. When the number of sub-filters that can be configured at once is limited (for example, five at a time as in this embodiment), the band-pass filter group is configured in two passes, as shown in (A).
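
The sub-filter layout of FIG. 4 can be worked out numerically. The sketch assumes equal temperament (100 cents per semitone, 50 cents per quarter tone) and a center note of A = 440 Hz; how the nine quarter-tone filters are actually split across the two passes is not stated in the text, so the simple chunking used here is hypothetical.

```python
def subfilter_centers(center_hz, step_cents, span_cents=200):
    """Center frequencies covering +/- span_cents around center_hz."""
    steps = span_cents // step_cents
    return [center_hz * 2.0 ** (k * step_cents / 1200.0)
            for k in range(-steps, steps + 1)]

def batches(items, max_per_pass=5):
    """Split the filter list into passes of at most max_per_pass sub-filters."""
    return [items[i:i + max_per_pass] for i in range(0, len(items), max_per_pass)]

semitone = subfilter_centers(440.0, 100)  # FIG. 4(B): five sub-filters, one pass
quarter = subfilter_centers(440.0, 50)    # FIG. 4(A): nine sub-filters
passes = batches(quarter)                 # two passes when only five fit at once
```

With semitone spacing the five centers fit in a single pass and the third sub-filter sits on the center note. With quarter-tone spacing nine centers are needed, which is why two passes are required when only five sub-filters are available.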

[0043] When the reference voice contains accompaniment or other sounds besides the guide vocal that serves as the reference, widening the frequency band of the band-pass filter group 5 beyond ±1 whole tone is undesirable because it raises the risk of falsely detecting the accompaniment. As shown in FIG. 5, accompaniment notes are generally consonant with the guide vocal. Accompaniment and guide vocal are therefore at least a minor third apart, so as long as the band is kept within ±1 whole tone the accompaniment is not detected in place of the guide vocal. In addition, the accompaniment is quieter than the guide vocal, which further reduces the risk of its being detected.
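
The margin between the detection range and the nearest consonant accompaniment note can be checked with a short calculation. It assumes equal temperament, a center note of 440 Hz, and sub-filters one semitone wide, so that the outermost sub-filter extends half a semitone beyond ±1 whole tone; none of these values is given in the text.

```latex
\begin{align*}
\text{outermost centers } (\pm 2 \text{ semitones}): \quad & 440 \cdot 2^{\pm 2/12} \approx 493.9\ \text{Hz},\ 392.0\ \text{Hz} \\
\text{outermost band edges } (\pm 2.5 \text{ semitones}): \quad & 440 \cdot 2^{\pm 2.5/12} \approx 508.4\ \text{Hz},\ 380.8\ \text{Hz} \\
\text{minor third } (\pm 3 \text{ semitones}): \quad & 440 \cdot 2^{\pm 3/12} \approx 523.3\ \text{Hz},\ 370.0\ \text{Hz}
\end{align*}
```

Under these assumptions a note a minor third away still falls outside the outermost sub-filter, which is consistent with the statement that the accompaniment is not picked up as long as the range stays within ±1 whole tone.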

[0044] The output of the band-pass filter group 5 is input to the level comparison unit 6. The level comparison unit 6 compares the amplitudes of the reference audio signal passed by each sub-filter and outputs to the microcomputer 1a a tone signal T_S indicating the number of the sub-filter that produced the maximum amplitude. For example, when the frequency band with a detection range of ±1 whole tone shown in FIG. 4(B) is set in the band-pass filter group 5 and the third sub-filter shows the maximum amplitude, a tone signal T_S indicating "3" is output. The microcomputer 1a then judges that the pitch of the singer's voice exactly matches the pitch of the reference voice.

[0045] A threshold is set in advance in the level comparison unit 6 so that inputs at or below a given level are rejected. When every input from the band-pass filter group 5 is at or below the threshold, the unit judges either that the singer's pitch is far off the pitch of the guide vocal or that no guide vocal is being sung at that moment.
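
The behavior of the level comparison unit 6 can be sketched as a thresholded maximum search. The amplitude values, the threshold, and the function names are hypothetical; the 1-based numbering and the center position 3 follow the FIG. 4(B) example in the text.

```python
def level_compare(amplitudes, threshold):
    """Return the 1-based number of the loudest sub-filter.

    Returns None when every output is at or below the threshold, i.e. the
    singer is far off pitch or no guide vocal is present at that moment.
    """
    if all(a <= threshold for a in amplitudes):
        return None
    return max(range(len(amplitudes)), key=lambda i: amplitudes[i]) + 1

def deviation_steps(subfilter_number, center_number=3):
    """Signed distance, in sub-filter steps, of the reference from the singer's pitch."""
    return subfilter_number - center_number

t_s = level_compare([0.10, 0.20, 0.90, 0.30, 0.10], threshold=0.15)
silent = level_compare([0.05, 0.10, 0.10, 0.02, 0.00], threshold=0.15)
```

A result of 3 corresponds to an exact match in the semitone layout, while 4 would mean the reference lies one semitone above the singer's detected pitch.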

[0046] iii) Scoring process

The overall scoring process of this embodiment is shown in the flowchart of FIG. 6. After detecting the pitch of the comparison voice (steps 601 to 602), the microcomputer 1a sets the frequency band of the band-pass filter group 5 (step 603). It then compares the output levels of the sub-filters (step 604). If the comparison succeeds (step 605: YES), it computes deviation points from the interval between the reference voice and the comparison voice, that is, from the amount of deviation, and stores them in a buffer (step 606). At the same time it stores the running count of comparisons in a separate buffer (step 607).

[0047] When the performance ends (step 608: YES), the microcomputer calculates the accuracy rate, that is, the fraction of all comparisons in which the two pitches matched (step 609). It then converts this accuracy rate into a score, for example on a 100-point scale (step 610), and displays the score on an external display device (step 611).
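Steps 609 and 610 amount to simple arithmetic, sketched below. The function names, the convention that `None` marks a note for which no comparison was possible (step 605: NO), and the rounding are assumptions made for illustration.

```python
from typing import Optional, Sequence


def accuracy_rate(tone_signals: Sequence[Optional[int]], center: int = 3) -> float:
    """Fraction of successful comparisons in which the pitches matched (step 609)."""
    # Only notes that could actually be compared are counted.
    compared = [t for t in tone_signals if t is not None]
    if not compared:
        return 0.0
    matches = sum(1 for t in compared if t == center)
    return matches / len(compared)


def to_score(rate: float, full_marks: int = 100) -> int:
    """Convert the accuracy rate into a score out of full_marks (step 610)."""
    return round(rate * full_marks)
```

For the tone signals `[3, 3, 2, None, 3, 5]`, five comparisons were possible and three matched, so the accuracy rate is 0.6 and the score is 60.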

[0048] Singers differ in how their singing goes off pitch. When several singers use the device in turn, they can therefore compete on how far each strays from the correct pitch.

Description of effects
As described above, the first embodiment allows karaoke scoring with an ordinary performance as the reference voice, without audio-multiplexed karaoke software. In addition, because a DSP capable of high-speed computation is used, the filter characteristics can be changed quickly enough that the scoring of one note is completed in a time sufficiently shorter than the length of one note in ordinary singing.

(II) Second embodiment
This second embodiment relates to the karaoke scoring device of claim 5, which includes the pitch evaluation device of claims 1 to 3. According to this embodiment, several singers can compete on their singing skill at the same time.

Description of configuration
FIG. 7 shows the configuration of the karaoke scoring device of the second embodiment.

[0049] In the karaoke scoring device 101 of this embodiment, two singers use separate microphones M₁ and M₂. The A/D converters AD₁ and AD₂ convert the respective voices into audio signals.

[0050] The switch SW selects between the two audio signals according to the selection signal S_SW from the microcomputer 1b. The rest of the configuration is the same as in the first embodiment, so its description is omitted. The microcomputer 1b differs from the microcomputer 1a of the first embodiment, however, in that it supplies the selection signal S_SW to the switch SW and operates according to FIG. 8.

Description of operation
Next, the operation of the second embodiment is described with reference to the flowchart shown in FIG. 8.

[0051] First, the microcomputer 1b switches the switch SW to the microphone M₁ side and checks whether sound from the microphone M₁ is detected (steps 801 to 802). If no sound is detected, it judges that the microphone M₁ is not in use and moves on to tone detection for the other microphone M₂ (step 802: NO). If sound from the microphone M₁ is detected as the comparison voice (step 802: YES), it sets in the bandpass filter group 5 the frequency band for detecting the reference voice, centered on the detected pitch of the comparison voice (step 803). The audio signal for the microphone M₁ that has passed through the bandpass filter group 5 is input to the level comparison unit 6, which compares the output levels of the sub-filters (steps 804 to 805). The comparison result is sent to the microcomputer 1b, which stores the pitch deviation points in a buffer (step 806).

[0052] The same processing as for the microphone M₁ is then performed for the microphone M₂ (steps 807 to 812). In step 813, the microcomputer 1b calculates the accuracy rate of each singer and converts it into a score. As long as the performance continues (as long as the determination in step 815 is NO), the scores obtained in real time are shown on the display device (step 814).

[0053] When the performance ends (step 815: YES), the total score for the whole performance is calculated. The result, indicating which singer had the higher accuracy rate based on the total scores, is output to the display device (step 816), and the karaoke contest ends.
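The two-microphone flow of FIG. 8 can be illustrated with a small simulation. This is a sketch under stated assumptions, not the embodiment's firmware: the input is a list of per-note tone signals for (M₁, M₂), `None` stands for "no sound detected on this microphone", and the score is taken as the accuracy rate scaled to 100 points.

```python
from typing import List, Optional, Sequence, Tuple

CENTER = 3  # sub-filter number that means "pitch matches"


def run_contest(
    rounds: Sequence[Tuple[Optional[int], Optional[int]]],
) -> Tuple[List[int], Optional[int]]:
    """Return ([score_M1, score_M2], winner); winner is 1, 2, or None for a tie."""
    hits = [0, 0]
    tries = [0, 0]
    for signals in rounds:
        # The switch SW selects M1 first, then M2, for every note.
        for mic, t_s in enumerate(signals):
            if t_s is None:
                # No sound on this microphone: move on to the other one.
                continue
            tries[mic] += 1
            if t_s == CENTER:
                hits[mic] += 1
    scores = [round(100 * h / t) if t else 0 for h, t in zip(hits, tries)]
    if scores[0] == scores[1]:
        return scores, None
    return scores, 1 if scores[0] > scores[1] else 2
```

For the rounds `[(3, 2), (3, 3), (None, 3), (4, 3)]`, the first singer matches 2 of 3 notes (score 67) and the second matches 3 of 4 (score 75), so the second singer wins.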

[0054] The deviation points calculated in step 806 or 812 are assigned, for example, as follows. The result of the tone comparison is divided into three levels, and points of -1.0, +0.5, and +1.0 are given in order from the largest deviation. This evaluation is made for every note, and the total score is calculated in step 816.

[0055] These points correspond to the sub-filters set in the bandpass filter group 5, for example, as follows.

[Correspondence between deviation points and sub-filter numbers]
Deviation points    Sub-filter number
-1.0                "1" or "5"
+0.5                "2" or "4"
+1.0                "3"

These points are then multiplied by a coefficient to convert them to a form suited to display on a 100-point scale. As for how the displayed score is updated, the score may be raised (or lowered) only when the pitches match, or lowered (or raised) only when the pitch drifts off.

Description of effects
As described above, according to the second embodiment, scoring can be completed while both singers are singing the same note at the same time. For example, an up-tempo pop song is played at about 120 quarter notes per minute. The shortest note in the vocal part is then a sixteenth note, which lasts about 0.125 seconds. Processing within this time is well within the capability of recent microcomputers and DSPs.
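The point table and the timing figure can be checked with a short sketch. The point values come from the text; the particular coefficient (full marks divided by the number of notes) and the clamp at zero are assumptions, since the source only says that the points are multiplied by a coefficient.

```python
from typing import Sequence

# Deviation points per sub-filter number, as given in the text.
DEVIATION_POINTS = {1: -1.0, 2: 0.5, 3: 1.0, 4: 0.5, 5: -1.0}


def total_score(tone_signals: Sequence[int], full_marks: float = 100.0) -> float:
    """Sum the per-note points and scale so that all-correct singing gives full marks."""
    if not tone_signals:
        return 0.0
    raw = sum(DEVIATION_POINTS[t] for t in tone_signals)
    coefficient = full_marks / len(tone_signals)  # assumed scaling
    return max(0.0, raw * coefficient)            # assumed floor at zero


def note_duration_s(quarter_notes_per_minute: float, notes_per_quarter: int) -> float:
    """Duration in seconds of a note that divides a quarter note into equal parts."""
    return 60.0 / quarter_notes_per_minute / notes_per_quarter
```

Four notes with tone signals `[3, 3, 2, 1]` give raw points 1.5 and a score of 37.5. At 120 quarter notes per minute, `note_duration_s(120, 4)` gives 0.125 seconds for a sixteenth note, matching the figure in the text.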

[0056] Moreover, because the score display changes in real time while the singers are singing, the contest with the opponent becomes much livelier. How badly a singer is off pitch can be seen directly from how violently the score display changes, so the relative merits of the singers' performances are shown in an entertaining way.

(III) Third embodiment
This third embodiment relates to the karaoke scoring device of claim 5 to which the pitch evaluation device of claim 2 is applied. It extends the second embodiment so that many singers can compete simultaneously. It also gives examples of detecting the voice both when the pitch of the singer's voice lies within the detectable frequency band and when it lies below that band.

[0057] FIG. 9 shows the configuration of the karaoke scoring device of the third embodiment. As shown in FIG. 9, the karaoke scoring device 102 of this embodiment lets n users (n being a natural number) compete in singing at the same time.

[0058] There are n microphones M₁ to M_n, and n A/D converters are provided to match. The switch SW₁ selects one of the n microphones according to the selection signal S_SW1 supplied from the microcomputer 1c. If the switch SW₁ is placed before the A/D conversion stage instead of after it, a single A/D converter suffices for the microphones M₁ to M_n.

[0059] When the reference voice is reproduced from an audio-multiplexed disc, the microcomputer 1c controls the switch SW₂ to select the audio channel. If the reference voice is judged to be an audio-multiplexed signal, the inverted Lch of the reference audio signal is selected and added to the Rch in the adder 3, which separates the guide vocal from the accompaniment. If the reference voice is not an audio-multiplexed signal, the non-inverted Lch is selected and simply added to the Rch.
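The effect of adding the inverted Lch to the Rch can be seen in a minimal numeric sketch. It assumes, for illustration only, that the accompaniment is identical in both channels and that the guide vocal is present only in the Rch; the function name and sample values are hypothetical.

```python
from typing import List, Sequence


def mix_reference(lch: Sequence[float], rch: Sequence[float], multiplexed: bool) -> List[float]:
    """Model of switch SW2 plus adder 3.

    multiplexed=True  -> inverted Lch is added to Rch (Rch - Lch)
    multiplexed=False -> Lch is added to Rch as it is (Rch + Lch)
    """
    sign = -1.0 if multiplexed else 1.0
    return [r + sign * l for l, r in zip(lch, rch)]


# Assumed test signal: accompaniment common to both channels, vocal only in Rch.
accompaniment = [0.25, -0.5, 0.125]
guide_vocal = [0.5, 0.25, -0.75]
lch = accompaniment
rch = [a + v for a, v in zip(accompaniment, guide_vocal)]

separated = mix_reference(lch, rch, multiplexed=True)
```

Under these assumptions the common accompaniment cancels and `separated` equals the guide vocal samples.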

[0060] Furthermore, in this embodiment a single tone detection unit is shared, which, for example, simplifies the configuration of the DSP constituting the tone detection unit. It is assumed that this DSP can be configured with five filters at a time.

[0061] Next, tone detection in this embodiment is described. Tone detection for the reference voice is the same as in the first embodiment, so its description is omitted. A woman's voice has fundamental frequency components in a narrow range from about 330 Hz to 660 Hz. The microcomputer 1c changes the characteristics of the five sub-filters in sequence so that this frequency range can be covered. With the central note taken as the reference note R, the five sub-filters form consecutive bandpass filters whose center frequencies differ by a semitone each, as shown in FIG. 3B.
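Center frequencies spaced a semitone apart can be computed as below. The sketch assumes twelve-tone equal temperament, in which each semitone is a frequency ratio of 2^(1/12); the source does not state the tuning, and the function name is hypothetical.

```python
from typing import List


def subfilter_centers(reference_hz: float, count: int = 5) -> List[float]:
    """Center frequencies of `count` sub-filters spaced one semitone apart.

    The middle sub-filter sits on the reference note R; the others lie
    symmetrically below and above it.
    """
    half = count // 2
    return [reference_hz * 2 ** (k / 12) for k in range(-half, half + 1)]
```

For a reference note of 440 Hz, this gives five centers from roughly 392 Hz to 494 Hz, with the third sub-filter at exactly 440 Hz.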

[0062] In FIG. 2, the bandpass filter group 5 is first set to BPF#1, centered on the note D. The microcomputer 1c detects which sub-filter outputs the largest amplitude in the BPF#1 configuration. It likewise identifies the sub-filter that outputs the maximum amplitude for BPF#2, centered on F#, and for BPF#3, centered on A#. Suppose, for example, that the detection results are as follows.

[0063]
BPF#1 = "1"
BPF#2 = "5"
BPF#3 = "2"

The farther the frequency of a bandpass filter is from that of the input audio signal, the smaller the amplitude it outputs. From this combination of tone signals, the microcomputer 1c therefore detects the reference voice currently being input as the note A. The level comparison unit 6 also compares the amplitude of each sub-filter with a predetermined threshold, and when the outputs of all sub-filters are at or below this threshold, it judges that no audio signal is being input.
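The combination of tone signals for the note A can be reproduced with a simplified model. Several things here are assumptions rather than statements from the source: the octave placement of the three groups (F#4, A#4, and D5, chosen so that together they span about 330 Hz to 660 Hz), the use of MIDI note numbers with A4 = 69 = 440 Hz, and the rule that the sub-filter nearest the input in semitones has the largest output.

```python
import math
from typing import Dict

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]

# Assumed center notes as MIDI numbers: D5 = 74, F#4 = 66, A#4 = 70.
GROUP_CENTERS = {"BPF#1": 74, "BPF#2": 66, "BPF#3": 70}


def tone_signal_for(center_midi: int, input_hz: float) -> int:
    """1-based number of the sub-filter (of five) closest to the input pitch."""
    input_midi = 69 + 12 * math.log2(input_hz / 440.0)
    centers = [center_midi + k for k in range(-2, 3)]
    distances = [abs(c - input_midi) for c in centers]
    return distances.index(min(distances)) + 1


def detect(input_hz: float) -> Dict[str, int]:
    """Tone signal of each group when the groups are set one after another."""
    return {name: tone_signal_for(c, input_hz) for name, c in GROUP_CENTERS.items()}


def note_of(group: str, t_s: int) -> str:
    """Note name of sub-filter t_s within the given group."""
    return NOTE_NAMES[(GROUP_CENTERS[group] - 3 + t_s) % 12]
```

Under these assumptions, `detect(440.0)` gives "1" for BPF#1, "5" for BPF#2, and "2" for BPF#3, the same combination as in the text, and `note_of("BPF#3", 2)` is "A".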

[0064] Note that a quoted number "n" denotes a sub-filter number. A man's voice, on the other hand, has a low frequency of 330 Hz or less, so it is detected by its overtones. A human voice normally contains, besides the fundamental frequency, harmonics at integer multiples of that fundamental. Thus, even when the fundamental is below 330 Hz, detection is possible because its overtone components lie at 330 Hz or above. For example, a male voice with a fundamental frequency of 110 Hz (note A) has a spectral distribution like that shown in FIG. 3C. Although the fundamental (110 Hz) itself is outside the detection range, the harmonics at 330 Hz (note E), 440 Hz (note A), 550 Hz (note C#), and 660 Hz (note E) can be detected. The tone signals in this case are as follows.
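The harmonics that fall inside the detection range can be listed with a few lines of code. The 330 Hz to 660 Hz range and the 110 Hz example come from the text; the mapping of each frequency to the nearest equal-tempered note (A4 = 440 Hz) and the function names are assumptions made for illustration.

```python
import math
from typing import List, Tuple

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def nearest_note(hz: float) -> str:
    """Name of the equal-tempered note closest to the given frequency."""
    midi = round(69 + 12 * math.log2(hz / 440.0))
    return NOTE_NAMES[midi % 12]


def detectable_harmonics(
    fundamental_hz: float, low_hz: float = 330.0, high_hz: float = 660.0
) -> List[Tuple[int, float, str]]:
    """Return (harmonic number, frequency, nearest note) for harmonics in range."""
    found = []
    n = 1
    while n * fundamental_hz <= high_hz:
        f = n * fundamental_hz
        if f >= low_hz:
            found.append((n, f, nearest_note(f)))
        n += 1
    return found
```

For a 110 Hz fundamental, the 3rd to 6th harmonics (330, 440, 550, and 660 Hz) fall in range and map to E, A, C#, and E.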

[0065]
BPF#1 = "2" or "5"
BPF#2 = "1"
BPF#3 = "2"

When the fundamental frequency lies within the detectable range, the two bandpass filter groups other than the one that directly detects the fundamental always show their largest output at sub-filter "1" or "5". When the fundamental is low and only harmonic components lie in the detection range, however, this rule breaks down. In that case the microcomputer 1c therefore performs a calculation to infer the fundamental frequency of the audio signal, which presumably lies outside the detection range.

[0066] In general, the amplitude of a harmonic component decreases as its order increases. For this reason, a bandpass filter like that of FIG. 3A may be prepared so that it can be inserted ahead of the tone detection unit, and when the microcomputer 1c judges that harmonic components have been detected, it may apply control such as increasing the gain of the sub-filters.

[0067] The karaoke scoring operation in this embodiment simply extends the two-singer contest of the second embodiment, shown in FIG. 8, to n singers. For example, the processing blocks of steps 801 to 806 and steps 807 to 812 in FIG. 8 are repeated once per singer. Processing after the song ends follows steps 815 and 816.

[0068] As described above, according to the third embodiment, the singing of two or more people can be compared, making karaoke even more enjoyable.

(IV) Other modifications
The present invention is not limited to the above embodiments and can be applied in various ways.

[0069] For example, although the pitch evaluation device is used as a karaoke scoring device in the above embodiments, it can also be used for other kinds of pitch evaluation. For instance, with an instrumental performance as the comparison voice, it can be used to evaluate practice when a player practices along with a reference instrumental performance.

【００７０】[0070]

[Effects of the Invention] According to the pitch evaluation device of claim 1 or claim 2, the pitch interval can be evaluated regardless of the type of the second voice (for example, regardless of whether it is multiplexed, stereo, or monaural audio).

[0071] According to the pitch evaluation device of claim 3, the second voice is detected in a frequency band that extends around the first voice, so the device is well suited to detecting a first voice (such as a singing voice in karaoke) whose pitch varies as it follows the second voice (such as a guide vocal).

[0072] According to the karaoke scoring device of claim 4, in addition to the effects of claims 1 to 3, a singer's singing can be evaluated freely. In particular, because detection takes only a short time, the singing can be evaluated in short cycles and scored accurately.

[0073] According to the karaoke scoring device of claim 5, in addition to the effect of claim 3, many people can compare the quality of one another's singing in real time, which offers a new way of enjoying karaoke.

[Brief description of drawings]

FIG. 1 is a configuration diagram of the karaoke scoring device of the first embodiment.

FIG. 2 is an explanatory diagram showing the tone detection operation for the comparison voice.

FIG. 3 is an explanatory diagram of the bandpass filters.

FIG. 4 is an explanatory diagram showing the tone detection operation for the reference voice.

FIG. 5 is an explanatory diagram of the bandpass filter settings.

FIG. 6 is a flowchart of karaoke scoring in the first embodiment.

FIG. 7 is a configuration diagram of the karaoke scoring device of the second embodiment.

FIG. 8 is a flowchart of karaoke scoring in the second embodiment.

FIG. 9 is a configuration diagram of the karaoke scoring device of the third embodiment.

FIG. 10 is a configuration diagram of a conventional karaoke scoring device.

[Explanation of symbols]

1a to 1c: Microcomputer
2: Tone detection unit
3: Adder
4: Delay circuit
5: Bandpass filter group
6: Level comparison unit

Claims

[Claims]

1. A pitch evaluation device comprising: first tone detection means that receives a first audio signal corresponding to a first voice and detects the pitch of the first voice; a plurality of second band-pass filters, each having a bandwidth obtained by dividing, at frequency intervals corresponding to a predetermined pitch interval, a frequency band of predetermined width that includes the pitch of the first voice, and each passing the band signal in its bandwidth from a second audio signal corresponding to a second voice; second tone detection means that compares the amplitudes of the band signals that have passed through the plurality of second band-pass filters and detects the band signal showing the maximum amplitude; and evaluation means that evaluates the pitch interval of the second voice based on the pitch of the first voice and the band signal showing the maximum amplitude.

2. The pitch evaluation device according to claim 1, wherein the first tone detection means comprises: a plurality of first band-pass filters, each having a bandwidth corresponding to a predetermined pitch interval and each passing the band signal in its bandwidth from the supplied first audio signal corresponding to the first voice; and means that specifies the pitch of the first voice by comparing the amplitudes of the band signals that have passed through the plurality of first band-pass filters.

3. The pitch evaluation device according to claim 1, wherein the frequency band of predetermined width including the pitch of the first voice is set substantially symmetrically about the pitch of the first voice.

4. A karaoke scoring device comprising the pitch evaluation device according to claim 1, wherein a singing voice of a singer is used as the first voice, a reference voice supplied from outside is used as the second voice, and the evaluation means scores the singing of the singer based on the pitch interval obtained by the comparison.

5. The karaoke scoring device according to claim 4, further comprising: a plurality of voice conversion means, each converting the singing voice of one of a plurality of singers into a singing voice signal; and selection means that selects any one of the plurality of singing voice signals and outputs the selected singing voice signal as the first voice, wherein the evaluation means switches the selection means sequentially each time one note of the first voice is detected and scores the singing for each singing voice corresponding to each singer.