JPH08115097A - Acoustic reproduction device - Google Patents

Acoustic reproduction device

Info

Publication number
JPH08115097A
JPH08115097A JP6249340A JP24934094A JPH08115097A JP H08115097 A JPH08115097 A JP H08115097A JP 6249340 A JP6249340 A JP 6249340A JP 24934094 A JP24934094 A JP 24934094A JP H08115097 A JPH08115097 A JP H08115097A
Authority
JP
Japan
Prior art keywords
pitch
tempo
input voice
difference
sound
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP6249340A
Other languages
Japanese (ja)
Other versions
JP3263546B2 (en
Inventor
Hiroki Onishi
宏樹 大西
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sanyo Electric Co Ltd
Original Assignee
Sanyo Electric Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sanyo Electric Co Ltd filed Critical Sanyo Electric Co Ltd
Priority to JP24934094A priority Critical patent/JP3263546B2/en
Publication of JPH08115097A publication Critical patent/JPH08115097A/en
Application granted granted Critical
Publication of JP3263546B2 publication Critical patent/JP3263546B2/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Landscapes

  • Reverberation, Karaoke And Other Acoustics (AREA)

Abstract

PURPOSE: To provide an acoustic reproduction device in which the tempo and the interval of an accompaniment are compensated in accordance with the interval of a singer even when there exists a tempo difference, which is within an allowable range, between the voice of the singer and the accompaniment. CONSTITUTION: The device consists of a first interval extracting means 3 which extracts the interval of a first inputted voice, a second interval extracting means 6 which extracts the interval of a second inputted voice, storage means 4 and 7 which store the time histories of each interval, a computing means 8 which computes the differences in tempo and interval between the first and the second inputted voices employing the time histories of the intervals and a nonlinear pattern matching method, compensating means 9 and 10 which compensate the tempo and the interval of the second inputted voice so that they are made approximately equal to the tempo and the interval of the first inputted voice and a means 13 which reproduces the compensated second inputted voice that is compensated by the means 9 and 10 or a third inputted voice that is compensated for its tempo and interval differences.

Description

【発明の詳細な説明】Detailed Description of the Invention

【0001】[0001]

【産業上の利用分野】本発明は、例えば、歌い手の音声
のテンポや音程に合わせて、伴奏音のテンポや音程を補
正する音響再生装置に関する。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a sound reproducing apparatus for correcting the tempo and pitch of an accompaniment sound in accordance with the tempo and pitch of the voice of a singer, for example.

【0002】[0002]

【従来の技術】従来、カラオケ装置等の音響再生装置に
音程調整機を接続して、伴奏音の曲のテンポを変えずに
音程のみを変化させることが行われていた。
2. Description of the Related Art Heretofore, a pitch adjuster has been connected to a sound reproducing device such as a karaoke device to change only the pitch without changing the tempo of the accompaniment tune.

【0003】音響再生装置の伴奏音の音程の調整に関し
ては、特開平5−35286号公報に示された如く、伴
奏音の途中であっても、あらかじめ設定された時間内で
歌い手の音声の音程と伴奏音の音程との間の音程差を検
出し、その後、自動的に伴奏音の音程を歌い手の音程に
近似するように補正する方法があった。
Regarding the adjustment of the pitch of the accompaniment sound of the sound reproducing device, as shown in Japanese Patent Laid-Open No. 5-35286, the pitch of the voice of the singer within a preset time even during the accompaniment sound. There is a method of detecting a pitch difference between the pitch of the accompaniment sound and the pitch of the accompaniment sound, and then automatically correcting the pitch of the accompaniment sound so as to approximate the pitch of the singer.

【0004】[0004]

【発明が解決しようとする課題】しかしながら、従来の
如く、歌い手の音声の音程に近似するように自動的に伴
奏音の音程を補正する方法では、歌い手の音声と伴奏音
との時間的なずれ、すなわち、歌い手の音声と伴奏音と
の間でテンポ差はないことを前提としており、この状況
下で、歌い手の音声と伴奏音との間にテンポ差が生じて
しまうと、前記伴奏音の音程が誤って補正されるという
問題があった。
However, in the conventional method of automatically correcting the pitch of the accompaniment sound so as to approximate the pitch of the voice of the singer, the time difference between the voice of the singer and the accompaniment sound is increased. That is, it is premised that there is no tempo difference between the singer's voice and the accompaniment sound, and in this situation, if a tempo difference occurs between the singer's voice and the accompaniment sound, There was a problem that the pitch was erroneously corrected.

【0005】そこで、本発明は前述の問題点に鑑み為さ
れたものであり、歌い手の音声と伴奏音との間に許容差
内のテンポ差があっても、該歌い手の音程に合わせて、
前記伴奏音のテンポ及び音程を補正する音響再生装置を
提供することを目的とする。
Therefore, the present invention has been made in view of the above-described problems, and even if there is a tempo difference within a permissible difference between the voice of the singer and the accompaniment sound,
An object of the present invention is to provide a sound reproducing device that corrects the tempo and pitch of the accompaniment sound.

【0006】[0006]

【課題を解決するための手段】本発明による音響再生装
置は、第1の入力音声の音程を抽出する第1音程抽出手
段と、第2の入力音声の音程を抽出する第2音程抽出手
段と、前記各々の音程の時間履歴を記憶する記憶手段
と、該音程の時間履歴を用いて、非線形パターンマッチ
ング手法により第1の入力音声と第2の入力音声との間
のテンポ差および音程差を算出する計算手段と、第2の
入力音声のテンポおよび音程を第1の入力音声のテンポ
および音程に近似させるよう補正する補正手段と、該補
正手段により補正された第2の入力音声或は、前記算出
されたテンポ差および音程差の補正を行った第3の入力
音声を再生する手段と、を具備することを特徴としてい
る。加えて、前記非線形パターンマッチング手法にDP
マッチングを用い、該DPマッチングより求められる時
間正規化関数の傾きにより、前記第1の入力音声と第2
の入力音声とのテンポ差を算出し、この後、斯かるテン
ポ差に応じて補正された第2の入力音声の音程の時間履
歴と第1の入力音声の音程の時間履歴から算出される平
均音程差を第1の入力音声と第2の入力音声との音程差
として算出することを特徴とする。
A sound reproducing apparatus according to the present invention comprises a first pitch extracting means for extracting a pitch of a first input voice, and a second pitch extracting means for extracting a pitch of a second input voice. , A tempo difference and a pitch difference between the first input voice and the second input voice by a non-linear pattern matching method using a storage means for storing the time history of each pitch and the time history of the pitch. A calculating means for calculating, a correcting means for correcting the tempo and pitch of the second input voice to approximate the tempo and pitch of the first input voice, and the second input voice or the second input voice corrected by the correcting means, Means for playing back the third input sound in which the calculated tempo difference and pitch difference are corrected. In addition, the nonlinear pattern matching method has a DP
Using the matching, the slope of the time normalization function obtained from the DP matching is used to detect the first input voice and the second input voice.
An average calculated from the time history of the pitch of the second input voice and the time history of the pitch of the first input voice corrected according to the tempo difference. It is characterized in that the pitch difference is calculated as the pitch difference between the first input voice and the second input voice.

【0007】また、前記DPマッチングより求められる
時間正規化関数の傾きに制限を設け、前記第2の入力音
声のテンポに対する前記第1の入力音声のテンポ差を設
定値以上に補正しないことを特徴とする。
Further, the slope of the time normalization function obtained from the DP matching is limited so that the tempo difference of the first input voice with respect to the tempo of the second input voice is not corrected to a set value or more. And

【0008】さらに、前記第2の入力音声のテンポ及び
音程を前記第1の入力音声のテンポおよび音程に近似さ
せるように補正する補正期間を設定し、該補正期間後
は、前記テンポ差および音程差の補正量を一定に保つよ
うに第2の入力音声或は、前記第3の入力音声を再生す
ることを特徴とする。
Further, a correction period for correcting the tempo and pitch of the second input voice to approximate the tempo and pitch of the first input voice is set, and after the correction period, the tempo difference and pitch are set. The second input voice or the third input voice is reproduced so that the difference correction amount is kept constant.

【0009】一方、前記第2の入力音声或は、第3の入
力音声のテンポ及び音程の補正の有無を指示するための
手元スイッチを備えており、該手元スイッチにより、操
作者が前記補正期間を指定できることを特徴とする。
On the other hand, a hand switch for instructing the presence / absence of correction of the tempo and pitch of the second input voice or the third input voice is provided, and the operator can use the hand switch to perform the correction period. The feature is that you can specify.

【0010】[0010]

【作用】本発明の音響再生装置は、操作者の指示によ
り、第2の入力音声或は、第3の入力音声のテンポ及び
音程を補正するように補正開始の命令をうけ、まず、第
1の入力音声及び第2の入力音声の音程を抽出し、斯か
る各々の音声の一定期間の音程の時間履歴を記憶する。
The sound reproducing apparatus of the present invention receives a command to start correction so as to correct the tempo and pitch of the second input voice or the third input voice according to the operator's instruction, and firstly, the first The pitches of the input voice and the second input voice are extracted, and the time history of the pitch of each of the voices for a certain period is stored.

【0011】次に、該音程の時間履歴を用いて、非線形
パターンマッチング手法により第1の入力音声と第2の
入力音声との間のテンポ差および音程差を算出する。
Next, using the time history of the pitch, the tempo difference and the pitch difference between the first input voice and the second input voice are calculated by the non-linear pattern matching method.

【0012】斯かる非線形パターンマッチング手法にD
Pマッチングを用いると、該DPマッチングより求めら
れる時間正規化関数の傾きにより、第1の入力音声と第
2の入力音声とのテンポ差が算出される。この時、該時
間正規化関数の傾きは、あらかじめ設定されたある許容
範囲内であれば、該傾きを第1の入力音声と第2の入力
音声のテンポ差として算出し、許容範囲外であれば、該
テンポ差は補正不可であると判断する。
D is applied to such a nonlinear pattern matching method.
When P matching is used, the tempo difference between the first input voice and the second input voice is calculated from the slope of the time normalization function obtained from the DP matching. At this time, if the slope of the time normalization function is within a certain allowable range set in advance, the slope is calculated as the tempo difference between the first input voice and the second input voice, and if it is outside the allowable range. For example, it is determined that the tempo difference cannot be corrected.

【0013】一方、テンポ差が許容範囲内にあれば、前
記時間正規化関数によって第2の入力音声の音程の時間
履歴が第1の入力音声の音程の時間履歴に対し、近似な
テンポになるよう補正され、該補正された第2の入力音
声の音程の時間履歴と第1の入力音声の音程の時間履歴
とから算出される平均音程差を第1の入力音声と第2の
入力音声との音程差として算出する。
On the other hand, if the tempo difference is within the allowable range, the time history of the pitch of the second input voice is approximated to the time history of the pitch of the first input voice by the time normalization function. The average pitch difference calculated from the corrected time history of the pitch of the second input voice and the corrected time history of the pitch of the first input voice is corrected between the first input voice and the second input voice. It is calculated as the pitch difference of.

【0014】最後に、前記算出されたテンポ差および音
程差により、第2の入力音声のテンポ及び音程を第1の
入力音声のテンポおよび音程に近似させるように第2の
入力音声を補正して再生するか或は、前記算出されたテ
ンポ差および音程差により、第3の入力音声のテンポ及
び音程を第1の入力音声のテンポおよび音程に近似させ
るように第3の入力音声を補正して再生する。
Finally, the second input voice is corrected by the calculated tempo difference and pitch difference so as to approximate the tempo and pitch of the second input voice to the tempo and pitch of the first input voice. The third input sound is reproduced or is corrected by the calculated tempo difference and pitch difference so that the tempo and pitch of the third input sound are approximated to the tempo and pitch of the first input sound. Reproduce.

【0015】[0015]

【実施例】図1は、本発明による音響再生装置をカラオ
ケ装置に適用した場合の概略構成図である。通常、歌い
手の音声は、マイク1を介して、アナログ音声信号に変
換され、ミキシング回路12に送信される。一方、楽音
情報記録媒体5にはMIDI(Musical Instrument Dig
ital Interface)情報と呼ばれるディジタルの伴奏音の
信号が記憶されており、該ディジタル伴奏音信号は、テ
ンポ差補正回路9および音程差補正回路10を介するも
のの、補正されずに通過し、D/A変換器11によりア
ナログ伴奏音信号に変換され、ミキシング回路12に送
信される。ミキシング回路12では、該アナログ伴奏音
信号と前記アナログ音声信号とがミキシング、増幅さ
れ、スピーカー13により、歌い手の音声と伴奏音とが
混在した音として再生される。
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS FIG. 1 is a schematic diagram of the case where the sound reproducing device according to the present invention is applied to a karaoke device. Usually, the voice of the singer is converted into an analog voice signal via the microphone 1 and transmitted to the mixing circuit 12. On the other hand, the musical sound information recording medium 5 has a MIDI (Musical Instrument Dig.
A digital accompaniment sound signal called ital interface) information is stored. The digital accompaniment sound signal passes through the tempo difference correction circuit 9 and the pitch difference correction circuit 10, but is not corrected, and the D / A It is converted into an analog accompaniment sound signal by the converter 11 and transmitted to the mixing circuit 12. In the mixing circuit 12, the analog accompaniment sound signal and the analog audio signal are mixed and amplified, and reproduced by the speaker 13 as a sound in which the voice of the singer and the accompaniment sound are mixed.

【0016】ここで、歌い手の操作により、伴奏音のテ
ンポ及び音程を補正して再生する場合を図2に示す処理
手順に沿って説明する。
Now, a case where the tempo and pitch of the accompaniment sound are corrected and reproduced by the operation of the singer will be described with reference to the processing procedure shown in FIG.

【0017】本実施例では、第1の入力音声を歌い手の
音声、第2の入力音声を楽音情報記憶媒体5に記憶され
ている歌い手の手本となる教師音信号、及び第3の入力
音声を楽音情報記憶媒体5に記憶されている伴奏音信号
として説明する。
In this embodiment, the first input voice is the voice of the singer, the second input voice is the teacher sound signal serving as the model of the singer stored in the musical tone information storage medium 5, and the third input voice. Will be described as an accompaniment sound signal stored in the musical sound information storage medium 5.

【0018】ステップS1では、マイク1に取りつけら
れた手元スイッチを歌い手が操作することにより、本発
明によるカラオケ装置は伴奏音を補正する補正開始命令
を受ける。
In step S1, the singer operates the hand switch attached to the microphone 1, whereby the karaoke apparatus according to the present invention receives a correction start command for correcting the accompaniment sound.

【0019】しかる後、直ちに(第1の入力音声であ
る)歌い手のアナログ音声信号は、前述した通常信号経
路とは別に、A/D変換器2によって、ディジタル音声
信号に変換される。A/D変換の際のA/D変換器2の
標本化周波数は2kHzとし、標本化の前に音声信号は
カットオフ周波数1kHzのローパスフィルタを通過す
る。
Immediately thereafter, the analog voice signal of the singer (which is the first input voice) is converted into a digital voice signal by the A / D converter 2 separately from the above-mentioned normal signal path. The sampling frequency of the A / D converter 2 at the time of A / D conversion is 2 kHz, and the audio signal passes through a low-pass filter having a cutoff frequency of 1 kHz before sampling.

【0020】そして、ステップS2では、前記ディジタ
ル音声信号は、音程抽出回路3により時系列的に順次、
自己相関法などの信号処理技術を用いて、歌い手の音声
の音程を算出し、該歌い手の音程の時間履歴が、バッフ
ァメモリ4に記憶される。実施例では、音程抽出回路3
は、計測時間20msec毎に平均された音程を抽出
し、バッファメモリ4には、6secの時間長の前記歌
い手の音程の時間履歴が記憶される。
Then, in step S2, the digital voice signal is sequentially time-sequentially output by the pitch extraction circuit 3.
The pitch of the voice of the singer is calculated using a signal processing technique such as the autocorrelation method, and the time history of the pitch of the singer is stored in the buffer memory 4. In the embodiment, the pitch extraction circuit 3
Extracts a pitch averaged every 20 msec of the measurement time, and the buffer memory 4 stores a time history of the pitch of the singer having a time length of 6 sec.

【0021】一方、楽音情報記憶媒体5には、前述の如
く、予め音程、音色、音量などの信号が分離された状態
でMIDI情報と呼ばれる(第2の入力音声である)歌
い手の手本となる教師音信号及び(第3の入力音声であ
る)ディジタル伴奏音信号が記憶されている。従って、
音程抽出回路6により教師音の音程が抽出され、斯かる
教師音の音程の時間履歴を5secの時間長でバッファ
メモリ7に記憶する。バッファメモリ7に記憶する教師
音の音程の時間履歴は、前記歌い手の音程時間履歴同
様、計測時間20msec毎の平均音程となるよう間引
き或は、線形補間される。
On the other hand, in the tone information storage medium 5, as described above, a sample of a singer called MIDI information (which is the second input voice) in a state where signals such as pitch, tone color and volume are separated in advance. The teacher sound signal and the digital accompaniment sound signal (which is the third input sound) are stored. Therefore,
The pitch of the teacher sound is extracted by the pitch extraction circuit 6, and the time history of the pitch of the teacher sound is stored in the buffer memory 7 with a time length of 5 sec. The time history of the pitches of the teacher's notes stored in the buffer memory 7 is thinned out or linearly interpolated so as to have an average pitch every 20 msec of the measurement time, like the pitcher's time history of the singer.

【0022】従って、バッファメモリ4には前記補正開
始命令より時間起算した6sec分の前記歌い手の音程
の時間履歴が記憶され、バッファメモリ7には前記補正
開始命令より時間起算した5sec分の前記教師音の音
程の時間履歴が記憶されており、斯かる各々の音程の時
間履歴を用いて、非線形パタ−ンマッチング回路8によ
り、前記歌い手の音声と教師音とのテンポ差及び音程差
を算出する。
Therefore, the buffer memory 4 stores the time history of the pitch of the singer for 6 seconds calculated from the correction start command, and the buffer memory 7 stores the teacher for 5 seconds calculated from the correction start command. The time history of the pitch of the sound is stored, and the non-linear pattern matching circuit 8 is used to calculate the tempo difference and the pitch difference between the voice of the singer and the teacher's sound by using the time history of each pitch. .

【0023】次に、ステップS3における前記テンポ差
及び音程差の算出方法について述べる。本実施例では、
DPマッチング手法により前記歌い手の音声と教師音と
のテンポ差及び音程差を算出する。DPマッチング手法
に関しては、日本音響学会誌Vol.27 No.9 pp483-487 に
記載されており、本実施例では、歌い手の音程の時間履
歴と教師音の音程の時間履歴とにより、両者の音程差の
絶対値が前記補正開始命令発生より時間起算した5se
cの間の個々の時点で、その時点に至る累積音程差が最
小となるような時間軸正規化関数を求める。
Next, a method of calculating the tempo difference and the pitch difference in step S3 will be described. In this embodiment,
The tempo difference and the pitch difference between the voice of the singer and the teacher's sound are calculated by the DP matching method. The DP matching method is described in Journal of Acoustical Society of Japan, Vol.27 No.9 pp483-487, and in the present embodiment, the pitches of the singer's pitch and the pitch of the teacher's pitch are both recorded. The absolute value of the difference is 5se calculated from the time when the correction start command is issued.
At each time point between c, a time axis normalization function that minimizes the cumulative pitch difference up to that time point is obtained.

【0024】実施例では、教師音の音程時間履歴は、5
sec分記憶され、音程計測時間は20msec単位で
あるので、 A1 ,A2 ,・・・・・,A250 なる250個の音程の時系列データとして表される。同
様に、歌い手の音程時間履歴は、6sec分記憶されて
いるので、 B1 ,B2 ,・・・・・,B300 なる音程の時系列データとして表される。図3はDPマ
ッチング手法による時間軸正規化関数の計算例を示す。
時間軸正規化関数は、格子点(A1,B1)を起点とし、
この起点からすべての格子点における累積音程差の最小
値を累積距離として求め、パターンマッチングを終了す
る格子点(A250,Btr)から、逆時間方向に累積距離
が小さくなる経路をたどることによって求める。尚、B
trとは、教師音の音程データA250に対応する歌い手の
音程データをいう。
In the embodiment, the interval time history of the teacher sound is 5
Since it is stored for sec, and the pitch measurement time is in 20 msec unit, it is represented as time series data of 250 pitches A 1 , A 2 , ..., A 250 . Similarly, since the singer's pitch time history is stored for 6 seconds, it is represented as time series data of the pitches B 1 , B 2 , ..., B 300 . FIG. 3 shows a calculation example of the time axis normalization function by the DP matching method.
The time-axis normalization function starts from the grid point (A 1 , B 1 ),
From this starting point, the minimum value of the cumulative pitch difference at all grid points is obtained as the cumulative distance, and the path from the grid point (A 250 , B tr ) at which the pattern matching is terminated becomes smaller in the reverse time direction. Ask. Incidentally, B
tr means the pitch data of the singer corresponding to the pitch data A 250 of the teacher sound.

【0025】図3は横軸に教師音の音程時間履歴をと
り、縦軸に歌い手の音程時間履歴をとっており、前述の
格子点と呼ばれる座標より構成される。ここで、時間軸
正規化関数31は、DPマッチング手法により、線分3
5上の(A250,B200)から(A250,B300)までの各
格子点における、格子点(A1,B1)からの累積音程差
を最小とする累積距離と呼ばれる値の内、最も小さい累
積距離の値を有する格子点(A250,Btr)を始点とし
て、時間逆進行方向に累積距離が小さくなる経路をたど
ることによって求められる。実施例では、歌い手の音程
測定データB200は、前記補正開始命令から4sec後
にあたり、求められた時間軸正規化関数31より、前記
補正命令開始直後の歌い手の歌い出しが教師音に対して
遅れたにもかかわらず、該補正命令開始5秒後の教師音
に対して早く歌い終わっていることがわかる。図3中、
線分32は(A1,B1)及び(A250,B250)を通る傾
き1の線分であり、時間軸正規化関数が線分32と一致
すれば、教師音と歌い手の音声の時間的なずれ、すなわ
ち、テンポ差はない。これに対し、線分33
((A25 0,B300)を通り傾き1の線分)及び、線分3
4((A250,B200)を通り傾き1の線分)は時間軸整
合窓と呼ばれる座標領域を形成し、時間軸正規化関数が
線分33及び線分34の間の時間整合窓から外れた場合
は、非現実的であるとして、前記テンポ差は補正しない
ものと考える。但し、実施例では、時間軸正規化関数
は、前述の如き逆時間方向に経路を探索する際、前記時
間整合窓の範囲を越えようとすれば、強制的に時間整合
窓内での最も累積距離が小さい他の格子点を選択するの
で、必ず時間軸整合窓の範囲の中での時間軸正規化関数
が求まる。実施例では、時間軸正規化関数31に対し
て、最小二乗法により近似直線36を得、直線36の傾
きをテンポ差補正量とした。
In FIG. 3, the abscissa represents the pitch time history of the teacher's tone and the ordinate represents the pitch time history of the singer, which is composed of the coordinates called grid points. Here, the time axis normalization function 31 uses the DP matching method to calculate the line segment 3
Of the cumulative distance that minimizes the cumulative pitch difference from the grid point (A 1 , B 1 ) at each grid point from (A 250 , B 200 ) to (A 250 , B 300 ) in 5 above. , The grid point (A 250 , B tr ) having the smallest value of the cumulative distance is used as a starting point, and the route is calculated by tracing the path in which the cumulative distance becomes smaller in the time reverse direction. In the embodiment, the singer's pitch measurement data B 200 is 4 seconds after the correction start command, and the singer's singing immediately after the start of the correction command is delayed with respect to the teacher sound by the obtained time axis normalization function 31. However, it can be seen that, despite the fact that the teacher's sound is five seconds after the start of the correction command, the song is finished singing quickly. In FIG.
The line segment 32 is a line segment having a slope of 1 passing through (A 1 , B 1 ) and (A 250 , B 250 ). If the time axis normalization function matches the line segment 32, the teacher sound and the voice of the singer are There is no time lag, that is, tempo difference. In contrast, line segment 33
((A 25 0, B 300 ) of a line segment of street slope 1) and, line segment 3
4 (a line segment that passes through (A 250 , B 200 ) and has a slope of 1) forms a coordinate area called a time-axis matching window, and the time-axis normalization function If it is not correct, it is considered unrealistic and the tempo difference is not corrected. However, in the embodiment, the time-axis normalizing function is forced to maximize the accumulation within the time-matching window if an attempt is made to exceed the range of the time-matching window when searching for a route in the reverse time direction as described above. Since another grid point having a small distance is selected, the time axis normalization function is always found within the time axis matching window range. In the embodiment, an approximate straight line 36 is obtained for the time axis normalization function 31 by the method of least squares, and the inclination of the straight line 36 is used as the tempo difference correction amount.

【0026】次に、ステップS5におけるテンポ差補正
回路9での処理について述べる。テンポ差補正回路9で
は、非線形パターンマッチング回路8で得られたテンポ
差補正量を用い、伴奏音の通常再生速度を該テンポ差補
正量で除した値を伴奏音の補正再生速度とする。実施例
では、テンポ差補正量0.97が得られた。
Next, the processing in the tempo difference correction circuit 9 in step S5 will be described. The tempo difference correction circuit 9 uses the tempo difference correction amount obtained by the non-linear pattern matching circuit 8 and sets the value obtained by dividing the normal reproduction speed of the accompaniment sound by the tempo difference correction amount as the corrected reproduction speed of the accompaniment sound. In the example, a tempo difference correction amount of 0.97 was obtained.

【0027】最後に、ステップS6における音程差補正
回路10での処理について述べる。音程差補正回路10
では、図3に示す非線形パターンマッチング回路8で得
られた時間軸正規化関数31の経路に沿って、前記歌い
手の音程時間履歴に対して時間軸上、逐次的に対応する
教師音の音程差の総和より平均を求め、その平均音程差
を音程差補正量として伴奏音の音程を補正する。具体的
に、図4を用いて説明する。図4の上段にバッファメモ
リ7に記憶されている歌い手の音程時間履歴の実施例を
示し、下段にバッファメモリ4に記憶されている教師音
の音程時間履歴の実施例を示す。得られた時間軸正規化
関数31の経路をたどることにより、A 250にはBtr
215にはB205、・・・,A1 にはB17といった具合
に、時間的に非線形な対応が得られる。音程差補正量
は、前記それぞれの時間軸に対応する音程差の平均値に
より算出される。
Finally, the pitch difference correction in step S6
The processing in the circuit 10 will be described. Pitch difference correction circuit 10
Then, using the non-linear pattern matching circuit 8 shown in FIG.
Along the path of the time axis normalization function 31
Corresponding to the pitch time history of the hand sequentially on the time axis
Average from the sum of the pitch differences of the teacher's sound, and the average pitch difference
Is used as the pitch difference correction amount to correct the pitch of the accompaniment sound. concrete
First, description will be made with reference to FIG. Buffer memo at the top of Figure 4
An example of the pitch time history stored in Li 7
And the teacher sounds stored in the buffer memory 4 at the bottom.
An example of the interval time history of is shown. Obtained time base normalization
By following the path of function 31, A 250To Btr,
A215To B205・ ・ ・, A1 To B17Such as
Thus, a non-linear response in time is obtained. Pitch difference correction amount
Is the average value of the pitch difference corresponding to each time axis
It is calculated from

【0028】以上のテンポ差及び音程差の補正処理を楽
音情報記憶媒体5より得られるディジタル伴奏音信号に
施し、以後、一定の補正量により補正されたディジタル
伴奏音信号は、D/A変換器11を経てアナログ信号と
なり、ミキシング回路12で歌い手の音声信号とミキシ
ングされ、スピーカー13によって再生される。
The above-described correction processing of the tempo difference and the pitch difference is applied to the digital accompaniment sound signal obtained from the musical sound information storage medium 5, and thereafter, the digital accompaniment sound signal corrected by a constant correction amount is converted into a D / A converter. After passing through 11, it becomes an analog signal, is mixed with the voice signal of the singer in the mixing circuit 12, and is reproduced by the speaker 13.

【0029】一方、前記補正開始命令が実施されない場
合は、楽音情報記憶媒体5から供給されるディジタル伴
奏音信号は、テンポ差補正回路9及び音程差補正回路1
0を補正せずに通過して、D/A変換器11を経てアナ
ログ信号となり、ミキシング回路12で歌い手の音声信
号とミキシングされ、スピーカー13によって再生され
る。実施例では、D/A変換器11の標本化速度を4
4.1kHzとし、D/A変換された直後の伴奏音信号
はカットオフ周波数20kHzのローパスフィルタを通
過する。
On the other hand, when the correction start command is not executed, the digital accompaniment sound signal supplied from the musical tone information storage medium 5 is the tempo difference correction circuit 9 and the pitch difference correction circuit 1.
0 passes through without correction, passes through the D / A converter 11, becomes an analog signal, is mixed with the voice signal of the singer in the mixing circuit 12, and is reproduced by the speaker 13. In the embodiment, the sampling rate of the D / A converter 11 is set to 4
At 4.1 kHz, the accompaniment sound signal immediately after D / A conversion is passed through a low-pass filter with a cutoff frequency of 20 kHz.

【0030】ここで、楽音情報記憶媒体5に、(第2の
入力音声である)歌い方の手本となる教師音信号が記憶
されていない場合でも、(第3の入力音声である)伴奏
音信号の音程時間履歴を教師音信号の音程時間履歴の代
わりに用いることによって、(第1の入力音声である)
歌い手の音声とのテンポ差及び音程を算出し、伴奏音自
信のテンポ及び音程が補正可能であることは言うまでも
ない。
Here, even when the musical tone information storage medium 5 does not store a teacher sound signal as a model for singing (which is the second input voice), the accompaniment (which is the third input voice) By using the interval time history of the sound signal instead of the interval time history of the teacher sound signal (the first input voice)
Needless to say, the tempo difference and pitch of the singer's voice can be calculated to correct the tempo and pitch of the accompaniment sound.

【0031】[0031]

【発明の効果】本発明による音響再生装置によれば、操
作者の希望するタイミングで、第1の入力音声のテンポ
及び音程に合わせて、第2の入力音声或は、第3の入力
音声を補正することが可能になるので、例えば、第1の
入力音声を歌い手の音声として、補正対象を伴奏音とし
たようなカラオケ装置に適用した場合、歌い手の希望す
るタイミングで、歌い手のテンポ及び音程に合わせて補
正された伴奏音が再生されるため、歌い手は気持ち良く
歌を歌うことができ、カラオケの娯楽性が高められる。
According to the sound reproducing apparatus of the present invention, the second input sound or the third input sound is matched with the tempo and pitch of the first input sound at the timing desired by the operator. Since the correction can be performed, for example, when the first input sound is applied to a karaoke device in which the correction target is an accompaniment sound as the voice of the singer, the tempo and pitch of the singer at the desired timing of the singer. Since the accompaniment sound corrected according to is reproduced, the singer can comfortably sing a song, and the entertainment of karaoke is enhanced.

【0032】また、歌い手は、手元スイッチを操作する
ことによって、前述の補正開始をカラオケ装置に指示で
きるので、非常に良い操作性が得られる。加えて、テン
ポ差及び音程差の補正量に制限があるため、伴奏音が補
正されすぎて、歌い手に違和感を生じさせるようなこと
なくカラオケが楽しめるといった効果を奏する。
Further, since the singer can instruct the karaoke apparatus to start the above-mentioned correction by operating the hand switch, very good operability can be obtained. In addition, since the correction amount of the tempo difference and the pitch difference is limited, the accompaniment sound is overcorrected, and the karaoke can be enjoyed without causing the singer to feel uncomfortable.

【図面の簡単な説明】[Brief description of drawings]

【図1】本発明の音響再生装置の概略構成図である。FIG. 1 is a schematic configuration diagram of a sound reproducing device of the present invention.

【図2】本発明の音響再生装置の概略処理手順を示す図
である。
FIG. 2 is a diagram showing a schematic processing procedure of the sound reproducing device of the present invention.

【図3】本発明の音響再生装置の構成をなす非線形パタ
ーンマッチング回路によるDPマッチング計算結果例で
ある。
FIG. 3 is an example of a DP matching calculation result by a non-linear pattern matching circuit that constitutes the configuration of the sound reproducing device of the present invention.

【図4】教師音の音程時間履歴及び歌い手の音程の時間
履歴を示す図である。
FIG. 4 is a diagram showing a pitch time history of a teacher sound and a time history of a singer's pitch.

【符号の説明】[Explanation of symbols]

1 ・・・マイク 2 ・・・A/D変換器 3,6・・・音程抽出回路 4,7・・・バッファメモリ 8 ・・・非線形パターンマッチング回路 9 ・・・テンポ差補正回路 10 ・・・音程差補正回路 11 ・・・D/A変換器 12 ・・・ミキシング回路 13 ・・・スピーカー 1 ... Microphone 2 ... A / D converter 3, 6 ... Pitch extraction circuit 4, 7 ... Buffer memory 8 ... Non-linear pattern matching circuit 9 ... Tempo difference correction circuit 10 ...・ Pitch difference correction circuit 11 ・ ・ ・ D / A converter 12 ・ ・ ・ Mixing circuit 13 ・ ・ ・ Speaker

Claims (5)

【特許請求の範囲】[Claims] 【請求項1】 第1の入力音声の音程を抽出する第1音
程抽出手段と、第2の入力音声の音程を抽出する第2音
程抽出手段と、前記各々の音程の時間履歴を記憶する記
憶手段と、該音程の時間履歴を用いて、非線形パターン
マッチング手法により第1の入力音声と第2の入力音声
との間のテンポ差および音程差を算出する計算手段と、
第2の入力音声のテンポおよび音程を第1の入力音声の
テンポおよび音程に近似させるよう補正する補正手段
と、該補正手段により補正された第2の入力音声或は、
前記算出されたテンポ差および音程差の補正を行った第
3の入力音声を再生する手段と、を具備することを特徴
とする音響再生装置。
1. A first pitch extracting means for extracting a pitch of a first input voice, a second pitch extracting means for extracting a pitch of a second input voice, and a memory for storing a time history of each pitch. And a calculating means for calculating a tempo difference and a pitch difference between the first input voice and the second input voice by a non-linear pattern matching method using the time history of the pitch.
Correction means for correcting the tempo and pitch of the second input sound so as to approximate the tempo and pitch of the first input sound; and the second input sound or the second input sound corrected by the correction means,
Means for playing back the third input sound in which the calculated tempo difference and pitch difference are corrected, and a sound reproducing device.
【請求項2】 前記非線形パターンマッチング手法にD
Pマッチングを用い、該DPマッチングより求められる
時間正規化関数の傾きにより、前記第1の入力音声と第
2の入力音声とのテンポ差を算出し、この後、斯かるテ
ンポ差に応じて補正された第2の入力音声の音程の時間
履歴と第1の入力音声の音程の時間履歴から算出される
平均音程差を第1の入力音声と第2の入力音声との音程
差として算出することを特徴とする請求項1記載の音響
再生装置。
2. The non-linear pattern matching method includes D
Using P matching, the tempo difference between the first input voice and the second input voice is calculated from the slope of the time normalization function obtained from the DP matching, and then corrected according to the tempo difference. Calculating an average pitch difference calculated from the recorded time history of the pitch of the second input voice and the time history of the pitch of the first input voice as the pitch difference between the first input voice and the second input voice. The sound reproducing device according to claim 1, wherein
【請求項3】 前記DPマッチングより求められる時間
正規化関数の傾きに制限を設け、前記第2の入力音声の
テンポに対する前記第1の入力音声のテンポ差を設定値
以上に補正しないことを特徴とする請求項2記載の音響
再生装置。
3. The slope of the time normalization function obtained from the DP matching is limited, and the tempo difference of the first input voice with respect to the tempo of the second input voice is not corrected to a set value or more. The sound reproducing device according to claim 2.
【請求項4】 前記第2の入力音声のテンポ及び音程を
前記第1の入力音声のテンポおよび音程に近似させるよ
うに補正する補正期間を設定し、該補正期間後は、前記
テンポ差および音程差の補正量を一定に保つように第2
の入力音声或は、前記第3の入力音声を再生することを
特徴とする請求項1記載の音響再生装置。
4. A correction period for correcting the tempo and pitch of the second input voice to approximate the tempo and pitch of the first input voice, and after the correction period, the tempo difference and pitch. Second to keep the difference correction amount constant
2. The sound reproducing apparatus according to claim 1, wherein the input sound of the above or the third input sound is reproduced.
【請求項5】 前記第2の入力音声或は、第3の入力音
声のテンポ及び音程の補正を指示するための手元スイッ
チを備えており、該手元スイッチにより、操作者が前記
補正期間を指定できることを特徴とする請求項4記載の
音響再生装置。
5. A hand switch for instructing correction of the tempo and pitch of the second input voice or the third input voice is provided, and the operator specifies the correction period by the hand switch. The sound reproducing device according to claim 4, which can be performed.
JP24934094A 1994-10-14 1994-10-14 Sound reproduction device Expired - Fee Related JP3263546B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP24934094A JP3263546B2 (en) 1994-10-14 1994-10-14 Sound reproduction device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP24934094A JP3263546B2 (en) 1994-10-14 1994-10-14 Sound reproduction device

Publications (2)

Publication Number Publication Date
JPH08115097A true JPH08115097A (en) 1996-05-07
JP3263546B2 JP3263546B2 (en) 2002-03-04

Family

ID=17191567

Family Applications (1)

Application Number Title Priority Date Filing Date
JP24934094A Expired - Fee Related JP3263546B2 (en) 1994-10-14 1994-10-14 Sound reproduction device

Country Status (1)

Country Link
JP (1) JP3263546B2 (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004069815A (en) * 2002-08-02 2004-03-04 Yamaha Corp System, method, and program for editing content
JP2004170891A (en) * 2002-11-15 2004-06-17 Takao Ushiyama Karaoke system with automatic adjusting function for key and singing start delay
US7435169B2 (en) * 2006-03-10 2008-10-14 Nintendo Co., Ltd. Music playing apparatus, storage medium storing a music playing control program and music playing control method
JP2009014923A (en) * 2007-07-03 2009-01-22 Yamaha Corp Musical performance clock generating device, data reproducing device, musical performance clock generating method, data reproducing method, and program
JP2009014978A (en) * 2007-07-04 2009-01-22 Yamaha Corp Musical performance clock generating device, data reproducing device, musical performance clock generating method, data reproducing method, and program
JP2009020179A (en) * 2007-07-10 2009-01-29 Yamaha Corp Performance clock generation device, data reproducing device, performance clock generation method, data reproducing method, and program
JP2010504563A (en) * 2006-09-26 2010-02-12 ジョテック インコーポレイテッド Automatic sound adjustment method and system for music accompaniment apparatus
JP2010139571A (en) * 2008-12-09 2010-06-24 Fujitsu Ltd Voice processing apparatus and voice processing method
JP2011048335A (en) * 2009-08-25 2011-03-10 Inst For Information Industry Singing voice synthesis system, singing voice synthesis method and singing voice synthesis device
JP2011053588A (en) * 2009-09-04 2011-03-17 Yamaha Corp Acoustic processing device and program
JP2011053589A (en) * 2009-09-04 2011-03-17 Yamaha Corp Acoustic processing device and program
JP2011053590A (en) * 2009-09-04 2011-03-17 Yamaha Corp Acoustic processing device and program
JP2012123230A (en) * 2010-12-09 2012-06-28 Yamaha Corp Information processor
WO2015002238A1 (en) * 2013-07-02 2015-01-08 ヤマハ株式会社 Mixing management device and mixing management method

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004069815A (en) * 2002-08-02 2004-03-04 Yamaha Corp System, method, and program for editing content
JP2004170891A (en) * 2002-11-15 2004-06-17 Takao Ushiyama Karaoke system with automatic adjusting function for key and singing start delay
US7435169B2 (en) * 2006-03-10 2008-10-14 Nintendo Co., Ltd. Music playing apparatus, storage medium storing a music playing control program and music playing control method
JP2010504563A (en) * 2006-09-26 2010-02-12 ジョテック インコーポレイテッド Automatic sound adjustment method and system for music accompaniment apparatus
JP2009014923A (en) * 2007-07-03 2009-01-22 Yamaha Corp Musical performance clock generating device, data reproducing device, musical performance clock generating method, data reproducing method, and program
JP2009014978A (en) * 2007-07-04 2009-01-22 Yamaha Corp Musical performance clock generating device, data reproducing device, musical performance clock generating method, data reproducing method, and program
JP2009020179A (en) * 2007-07-10 2009-01-29 Yamaha Corp Performance clock generation device, data reproducing device, performance clock generation method, data reproducing method, and program
JP2010139571A (en) * 2008-12-09 2010-06-24 Fujitsu Ltd Voice processing apparatus and voice processing method
JP2011048335A (en) * 2009-08-25 2011-03-10 Inst For Information Industry Singing voice synthesis system, singing voice synthesis method and singing voice synthesis device
JP2011053588A (en) * 2009-09-04 2011-03-17 Yamaha Corp Acoustic processing device and program
JP2011053589A (en) * 2009-09-04 2011-03-17 Yamaha Corp Acoustic processing device and program
JP2011053590A (en) * 2009-09-04 2011-03-17 Yamaha Corp Acoustic processing device and program
JP2012123230A (en) * 2010-12-09 2012-06-28 Yamaha Corp Information processor
WO2015002238A1 (en) * 2013-07-02 2015-01-08 ヤマハ株式会社 Mixing management device and mixing management method
JP2015012592A (en) * 2013-07-02 2015-01-19 ヤマハ株式会社 Mixing management device

Also Published As

Publication number Publication date
JP3263546B2 (en) 2002-03-04

Similar Documents

Publication Publication Date Title
JP3900580B2 (en) Karaoke equipment
JP3709631B2 (en) Karaoke equipment
JP3263546B2 (en) Sound reproduction device
JPH10319964A (en) Electronic musical instrument
US20050257667A1 (en) Apparatus and computer program for practicing musical instrument
US5862232A (en) Sound pitch converting apparatus
JPH0481880A (en) Karaoke device
JP4802857B2 (en) Musical sound synthesizer and program
US7122731B2 (en) Musical information processing terminal, control method therefor, and program for implementing the method
JPH10319947A (en) Pitch extent controller
JP2924208B2 (en) Electronic music playback device with practice function
JPH0962257A (en) Musical sound signal processing device
JP3562068B2 (en) Karaoke equipment
JP4049465B2 (en) Pitch control device for waveform reproduction device
JP2000047677A (en) Karaoke device
JP2001060089A (en) Karaoke device
JP2001155031A (en) Input data processing method and data input device for music retrieval system
JP4081859B2 (en) Singing voice generator and karaoke device
JP2674452B2 (en) Music player
JP4802947B2 (en) Performance method determining device and program
JP3834963B2 (en) Voice input device and method, and storage medium
JP3903492B2 (en) Karaoke equipment
JPH11143480A (en) Karaoke device, and medium
JP2001125582A (en) Method and device for voice data conversion and voice data recording medium
JP3166621B2 (en) Karaoke processor and musical instrument practice processor

Legal Events

Date Code Title Description
LAPS Cancellation because of no payment of annual fees