JP2005195968A

JP2005195968A - Pitch converting device

Info

Publication number: JP2005195968A
Application number: JP2004003197A
Authority: JP
Inventors: Atsushi Hoshiai; 厚星合
Original assignee: Roland Corp
Current assignee: Roland Corp
Priority date: 2004-01-08
Filing date: 2004-01-08
Publication date: 2005-07-21
Anticipated expiration: 2024-01-08
Also published as: JP4565846B2

Abstract

<P>PROBLEM TO BE SOLVED: To provide a pitch converting device capable of converting the pitch of an audio signal such as a speech signal and a musical sound signal while leaving natural fluctuations with simple constitution in such a case. <P>SOLUTION: A desired section is cut out of sampling data stored in a ring memory as a phoneme based upon cycles corresponding to a pitch detected by a pitch detecting means 30, and the cut-out section is read out with time at a readout speed corresponding to a formant change coefficient FORMANT-VR; and the pitch detected by the pitch detecting means 30 is smoothed by a pitch smoothing means 32 and the phoneme is put together with the smoothed pitch in cycles corresponding to target pitch information SPITCH. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

本発明は、音声信号や楽音信号等のピッチを変換するピッチ変換装置に関する。 The present invention relates to a pitch converter for converting the pitch of an audio signal, a musical sound signal, or the like.

従来、ピッチ変換装置としては、種々のものが提案されている。例えば特許第３３２９６３５号公報（特許文献１）には、入力されたオーディオ信号をメモリに記憶するとともに、そのオーディオ信号のピッチ（周期）を検出し、その検出された周期に対応する区間をメモリから切り出して、新たに指定されたピッチに応じた周期でその切り出した区間を合成することによりピッチを変換して再生する効果装置が開示されている。なお、本明細書中では、新たに指定されたピッチを目標ピッチといい、この目標ピッチは、入力されたオーディオ信号のピッチとは関係のない絶対音高を指すものとする。ピッチを指定する方法としては、入力されたオーディオ信号のピッチに対する相対値で指定する方法もあり、この方法と区別するためである。 Conventionally, various pitch converters have been proposed. For example, in Japanese Patent No. 3329635 (Patent Document 1), an input audio signal is stored in a memory, a pitch (period) of the audio signal is detected, and a section corresponding to the detected period is stored in the memory. An effect device is disclosed in which a pitch is converted and reproduced by synthesizing the segmented segments in a cycle according to the newly designated pitch. In the present specification, a newly designated pitch is referred to as a target pitch, and this target pitch refers to an absolute pitch that is not related to the pitch of the input audio signal. As a method of designating the pitch, there is a method of designating by a relative value with respect to the pitch of the input audio signal, for the purpose of distinguishing from this method.

また、特許第３２８７２３０号公報（特許文献２）には、歌唱者が発生すべきボーカル音のピッチを指定する情報およびそのボーカル音に付与すべきハーモニー音のピッチを指定する情報を入力し、歌唱者が発声したボーカル音のピッチを検出して、そのピッチと歌唱者が発生すべきボーカル音のピッチとの比率を求め、その比率に圧縮処理を施すことにより得られる情報をボーカル音に付与すべきハーモニー音のピッチを指定する情報に付与してハーモニー音のピッチを変換するようにしたコーラス効果装置が開示されている。したがって、この装置では歌唱者が発声したボーカル音のピッチが揺らいでいる場合には、歌唱者のピッチと発声すべきボーカル音のピッチとの比率により歌唱者のピッチの揺らぎが抽出され、この抽出された揺らぎがハーモニー音に付与される。
特許第３３２９６３５号公報（図１０等）特許第３２８７２３０号公報 In addition, in Japanese Patent No. 3287230 (Patent Document 2), information specifying the pitch of a vocal sound to be generated by a singer and information specifying the pitch of a harmony sound to be given to the vocal sound are input, and singing is performed. Detects the pitch of the vocal sound uttered by the person, finds the ratio of the pitch to the pitch of the vocal sound that the singer should generate, and gives the vocal sound the information obtained by compressing the ratio A chorus effect device is disclosed in which the pitch of a harmony sound is converted by adding to the information specifying the pitch of the harmony sound. Therefore, in this apparatus, when the pitch of the vocal sound uttered by the singer is fluctuating, the fluctuation of the singer's pitch is extracted by the ratio of the pitch of the singer and the pitch of the vocal sound to be uttered. The added fluctuation is added to the harmony sound.
Japanese Patent No. 3329635 (FIG. 10 etc.) Japanese Patent No. 3287230

しかしながら、特許文献１に記載されている効果装置では、切り出されたオーディオ信号を、新たに指定された目標ピッチに応じた周期で合成するように構成されているため、変換された音声は一定のピッチで発生され、あたかもロボットが発声しているかのような音声になっていた。特に、カラオケなどで効果装置として用いられるコーラス付加装置の場合は、付加されるコーラスのピッチが一定となるため、不自然で違和感があるという問題点があった。 However, the effect device described in Patent Document 1 is configured to synthesize the clipped audio signal at a period corresponding to the newly designated target pitch, and thus the converted sound is constant. It was generated at the pitch and sounded as if the robot was speaking. In particular, in the case of a chorus adding device used as an effect device in karaoke or the like, the pitch of the added chorus is constant, and thus there is a problem that it is unnatural and uncomfortable.

また、特許文献２に記載されているコーラス効果付与装置では、歌唱者が発生すべきボーカル音のピッチと、歌唱者が発生したボーカル音のピッチとの比率を求め、この比率に基づいてハーモニー音に揺らぎを与えるので、歌唱者が発生すべきボーカル音のピッチを指定する情報が必要であるという問題点があった。 Moreover, in the chorus effect imparting device described in Patent Document 2, the ratio of the pitch of the vocal sound that the singer should generate and the pitch of the vocal sound that the singer generated is obtained, and the harmony sound is calculated based on this ratio. This gives a problem that information specifying the pitch of the vocal sound that the singer should generate is necessary.

本発明は、上記問題点を解決するためになされたもので、音声信号や楽音信号等のオーディオ信号のピッチを変換するに際し、簡単な構成で自然な揺らぎを残したまま音声のピッチを変換することができるピッチ変換装置を提供することを目的とする。 The present invention has been made to solve the above-described problems. When converting the pitch of an audio signal such as an audio signal or a musical tone signal, the pitch of the audio is converted with a simple configuration while leaving natural fluctuations. An object of the present invention is to provide a pitch conversion device that can perform the above-described operation.

上記の目的を達成するために、本発明の請求項１記載のピッチ変換装置は、オーディオ信号を入力する入力手段と、その入力手段に入力されたオーディオ信号のピッチを順次検出するピッチ検出手段と、そのピッチ検出手段により検出されたピッチを平滑化し、平滑化ピッチ情報を求めるピッチ平滑手段と、目標ピッチを指定する目標ピッチ指定手段と、前記入力手段に入力されたオーディオ信号のピッチを変換するピッチ変換手段と
、前記ピッチ平滑手段により平滑化された平滑化ピッチ情報と前記目標ピッチ指定手段により指定された目標ピッチとに基づいて前記ピッチ変換手段を制御する制御手段とを備えている。 In order to achieve the above object, a pitch conversion apparatus according to claim 1 of the present invention includes an input means for inputting an audio signal, and a pitch detection means for sequentially detecting the pitch of the audio signal input to the input means. The pitch detection means for smoothing the pitch detected by the pitch detection means, the pitch smoothing means for obtaining smoothed pitch information, the target pitch specifying means for specifying the target pitch, and the pitch of the audio signal input to the input means are converted. Pitch conversion means; and control means for controlling the pitch conversion means based on the smoothed pitch information smoothed by the pitch smoothing means and the target pitch specified by the target pitch specifying means.

この請求項１記載のピッチ変換装置によれば、入力されたオーディオ信号のピッチを順次検出し、そのピッチを平滑化する平滑化手段を備え、平滑化されたピッチと指定された目標ピッチとに基づいてピッチ変換を行うものである。 According to the pitch conversion apparatus of the first aspect, the pitch converting apparatus includes a smoothing unit that sequentially detects the pitch of the input audio signal and smoothes the pitch, and the smoothed pitch and the designated target pitch are set. Based on this, pitch conversion is performed.

請求項２記載のピッチ変換装置は、請求項１記載のピッチ変換装置において、前記平滑化手段は、順次検出されたピッチに低域通過フィルタ処理を施すものである。 The pitch converter according to claim 2 is the pitch converter according to claim 1, wherein the smoothing means performs low-pass filter processing on the sequentially detected pitches.

請求項３記載のピッチ変換装置は、請求項１又は２記載のピッチ変換装置において、前記平滑化手段は、前記ピッチ検出手段により今回検出されたピッチの値から前回平滑化された値を引いた差分値に所定の係数を乗算することにより平滑化ピッチ情報を得るものである。 The pitch conversion device according to claim 3 is the pitch conversion device according to claim 1 or 2, wherein the smoothing means subtracts a previously smoothed value from a pitch value detected this time by the pitch detection means. Smoothing pitch information is obtained by multiplying the difference value by a predetermined coefficient.

請求項４記載のピッチ変換装置は、請求項１から３のいずれかに記載のピッチ変換装置において、前記ピッチ変換手段は、前記ピッチ検出手段により検出されたピッチに基づいて、前記オーディオ信号の所定区間を切り出し、その切り出した区間を前記ピッチ平滑手段により平滑化された平滑化ピッチ情報と前記目標ピッチ指定手段により指定された目標ピッチとに応じた周期で合成するものである。 The pitch conversion device according to claim 4 is the pitch conversion device according to any one of claims 1 to 3, wherein the pitch conversion unit is configured to perform predetermined processing of the audio signal based on the pitch detected by the pitch detection unit. The section is cut out, and the cut out section is synthesized with a period according to the smoothed pitch information smoothed by the pitch smoothing means and the target pitch specified by the target pitch specifying means.

請求項５記載のピッチ変換装置は、請求項１から３のいずれかに記載のピッチ変換装置において前ピッチ変換手段は、波形記憶手段に記憶されたオーディオ信号を前記ピッチ平滑手段により平滑化された平滑化ピッチ情報と前記目標ピッチ指定手段により指定された目標ピッチとに応じた速さで読み出すものである。 The pitch conversion device according to claim 5 is the pitch conversion device according to any one of claims 1 to 3, wherein the front pitch conversion means smoothes the audio signal stored in the waveform storage means by the pitch smoothing means. Data is read at a speed corresponding to the smoothed pitch information and the target pitch specified by the target pitch specifying means.

請求項１記載のピッチ変換装置によれば、入力されたオーディオ信号のピッチを順次検出し、そのピッチを平滑化するピッチ平滑手段を備え、平滑化されたピッチと指定された目標ピッチとに基づいてピッチ変換を行うので、入力されたオーディオ信号のピッチが指定された目標ピッチに変換されるとともに、入力されたオーディオ信号のピッチが揺らいでいる場合には、変換されたオーディオ信号のピッチも揺らぐようにすることができるという効果がある。 According to the pitch conversion device of the first aspect, pitch smoothing means for sequentially detecting the pitch of the input audio signal and smoothing the pitch is provided, and based on the smoothed pitch and the designated target pitch. Therefore, the pitch of the input audio signal is converted to the specified target pitch, and if the pitch of the input audio signal fluctuates, the pitch of the converted audio signal also fluctuates. There is an effect that can be made.

請求項２記載のピッチ変換装置によれば、請求項１記載のピッチ変換装置の奏する効果に加え、前記ピッチ平滑手段は、順次検出されたピッチに低域通過フィルタ処理を施すものであるので、簡単な処理により平滑化することができるという効果がある。 According to the pitch conversion device of claim 2, in addition to the effect of the pitch conversion device of claim 1, the pitch smoothing means performs low-pass filter processing on the sequentially detected pitch, There is an effect that it can be smoothed by a simple process.

請求項３記載のピッチ変換装置は、請求項１又は２に記載のピッチ変換装置の奏する効果に加え、前記ピッチ平滑手段は、前記ピッチ検出手段により今回検出されたピッチの値から前回平滑化された値を引いた差分値に所定の係数を乗算することにより平滑化ピッチ情報を得るものであるので、単純な演算により平滑化処理を行うことができるという効果がある。 According to a third aspect of the present invention, in addition to the effect produced by the first aspect of the present invention, the pitch smoothing means is previously smoothed from the pitch value detected this time by the pitch detecting means. Since the smoothed pitch information is obtained by multiplying the difference value obtained by subtracting the obtained value by a predetermined coefficient, there is an effect that the smoothing process can be performed by a simple calculation.

請求項４記載のピッチ変換装置は、請求項１から３のいずれかに記載のピッチ変換装置の奏する効果に加え、前記ピッチ変換手段は、前記ピッチ検出手段により検出されたピッチに基づいて、前記オーディオ信号の所定区間を切り出し、その切り出した区間を前記ピッチ平滑手段により平滑化された平滑化ピッチ情報と前記目標ピッチ指定手段により指定された目標ピッチとに応じた周期で合成するものであるので、入力されたオーディオ信号のフォルマントを変更せずにピッチだけを変換することができるという効果がある。 According to a fourth aspect of the present invention, in addition to the effect produced by the pitch conversion device according to any one of the first to third aspects, the pitch conversion unit is configured to generate the pitch conversion unit based on the pitch detected by the pitch detection unit. A predetermined section of the audio signal is cut out, and the cut out section is synthesized with a period according to the smoothed pitch information smoothed by the pitch smoothing means and the target pitch specified by the target pitch specifying means. There is an effect that only the pitch can be converted without changing the formant of the input audio signal.

請求項５記載のピッチ変換装置は、請求項１から３のいずれかに記載のピッチ変換装置において、前ピッチ変換手段は、波形記憶手段に記憶されたオーディオ信号を前記ピッチ平滑手段により平滑化された平滑化ピッチ情報と前記目標ピッチ指定手段により指定された目標ピッチとに応じた速さで読み出すものであるので、入力されたオーディオ信号のフォルマントも同時に変換されるが構成が簡単であるという効果がある。 The pitch conversion device according to claim 5 is the pitch conversion device according to any one of claims 1 to 3, wherein the previous pitch conversion means smoothes the audio signal stored in the waveform storage means by the pitch smoothing means. The smoothed pitch information and the target pitch designated by the target pitch designation means are read out at a speed, so that the formant of the input audio signal is converted at the same time, but the configuration is simple. There is.

本発明の第１の実施形態は、図１（ａ）に示すように、アナログの楽音信号または音声信号等のオーディオ信号（可聴周波数信号）が入力される入力端子２を有し、この入力端子２に供給されたオーディオ信号が、Ａ／Ｄ変換器４によってデジタルオーディオ信号（サンプリングデータ）に変換される。このＡ／Ｄ変換器４と入力端子２との間には、オーディオ信号をＡ／Ｄ変換器４におけるサンプリング周波数（例えば４８ｋＨｚ）の１／２以下の周波数に制限して、エイリアスの発生を防止するために、ローパスフィルタ６が設けられている。 As shown in FIG. 1 (a), the first embodiment of the present invention has an input terminal 2 to which an audio signal (audible frequency signal) such as an analog musical tone signal or a voice signal is input. The audio signal supplied to 2 is converted into a digital audio signal (sampling data) by the A / D converter 4. Between the A / D converter 4 and the input terminal 2, the audio signal is limited to a frequency that is 1/2 or less of the sampling frequency (for example, 48 kHz) in the A / D converter 4 to prevent aliasing. For this purpose, a low-pass filter 6 is provided.

Ａ／Ｄ変換器４によりデジタル化されたサンプリングデータは、ＤＳＰ（デジタル信号処理装置）８に供給され、ＤＳＰ８からＲＡＭ１２に供給される。このＲＡＭ１２は、入力されたサンプリングデータを順次記憶するリングメモリとして使用されている。 The sampling data digitized by the A / D converter 4 is supplied to a DSP (digital signal processing device) 8 and is supplied from the DSP 8 to the RAM 12. The RAM 12 is used as a ring memory that sequentially stores input sampling data.

次に、ＤＳＰ８は、このリングメモリに一時的に記憶したサンプリングデータを読出して、処理を行い、Ｄ／Ａ変換器１４に供給する。Ｄ／Ａ変換器１４は、ＤＳＰ８で処理されたサンプリングデータをアナログのオーディオ信号に変換し、ローパスフィルタ１６に供給する。このアナログのオーディオ信号は、ローパスフィルタ１６により不要な信号成分が除去された後、出力端子１８から出力され、図示しないアンプにより増幅されてスピーカ等から放音される。 Next, the DSP 8 reads the sampling data temporarily stored in the ring memory, performs processing, and supplies it to the D / A converter 14. The D / A converter 14 converts the sampling data processed by the DSP 8 into an analog audio signal and supplies the analog audio signal to the low-pass filter 16. This analog audio signal is output from the output terminal 18 after unnecessary signal components are removed by the low-pass filter 16, amplified by an amplifier (not shown), and emitted from a speaker or the like.

ＤＳＰ８は、予め設定されているプログラムに従ってサンプリングデータを処理する。その処理には、例えば使用者が操作する操作子２８によって設定されたフォルマント変更係数ＦＯＲＭＡＮＴ−ＶＲや目標ピッチ情報ＳＰＩＴＣＨ等のパラメータが使用される。これらフォルマント変更係数ＦＯＲＭＡＮＴ−ＶＲや目標ピッチ情報ＳＰＩＴＣＨは、それぞれを設定するボリュームツマミが設けられており、それらのボリュームツマミにより設定されている値がＣＰＵ２２によって検出され、ＤＳＰ８に供給され、ＤＳＰ８内のレジスタに記憶される。なお、目標ピッチ情報ＳＰＩＴＣＨは、目標ピッチ（周波数）に対応する周期であり、目標とするピッチの逆数である。 The DSP 8 processes sampling data according to a preset program. For this process, parameters such as formant change coefficient FORMAT-VR and target pitch information SPITCH set by the operator 28 operated by the user are used. These formant change coefficient FORMANT-VR and target pitch information SPITCH are provided with volume knobs for setting each of them, and the values set by these volume knobs are detected by the CPU 22, supplied to the DSP 8, and stored in the DSP 8. Stored in a register. The target pitch information SPITCH is a period corresponding to the target pitch (frequency), and is the reciprocal of the target pitch.

ＣＰＵ２２には、プログラム等を記憶したＲＯＭ２４とＲＡＭ２６がバスを経由して接続され、ＲＯＭ２４に記憶されたプログラムを実行することによりピッチ変換装置の制御を行う。ＣＰＵ２２が実行する処理としては、上述の通り操作子２８により設定されている値を検出し、その設定に応じたパラメータをＤＳＰ８に供給するなどのＤＳＰ８の制御と、設定されたパラメータを表示器（図示なし）に表示を行うなどの処理がある。 A ROM 24 and a RAM 26 storing programs and the like are connected to the CPU 22 via a bus, and the pitch conversion device is controlled by executing the programs stored in the ROM 24. As the processing executed by the CPU 22, as described above, the value set by the operation element 28 is detected, and the DSP 8 is controlled such that the parameter corresponding to the setting is supplied to the DSP 8, and the set parameter is displayed on the display ( There is a process such as displaying (not shown).

図１（ｂ）は、ＤＳＰ８が実行する機能を機能ブロック図で示したもので、本発明の基本となる実施形態を示したものである。なお、この実施例では、入力端子２３に入力されるサンプリングデータをＲＡＭ１２のリングメモリに順次記憶する機能ブロックを有しているが、説明を容易にするため、図１（ｂ）ではその機能ブロックを省略している。 FIG. 1B is a functional block diagram showing the functions executed by the DSP 8, and shows a basic embodiment of the present invention. In this embodiment, there is a functional block for sequentially storing the sampling data input to the input terminal 23 in the ring memory of the RAM 12, but for the sake of easy explanation, the functional block is shown in FIG. Is omitted.

同図（ｂ）に示す基本実施形態では、入力端子２３と、ピッチ変換手段３６と、ピッチ検出手段３０と、ピッチ平滑手段３２と、制御手段３４と、出力端子２９とが、設けられている。 In the basic embodiment shown in FIG. 2B, an input terminal 23, a pitch conversion means 36, a pitch detection means 30, a pitch smoothing means 32, a control means 34, and an output terminal 29 are provided. .

ピッチ検出手段３０は、サンプリングデータからゼロクロス点を検出し、ゼロクロス点の間隔がほぼ同一となる周期を順次検出するものである。ピッチ平滑手段３２は、ピッチ検出手段３０が検出した周期を順次入力し平滑化を行うものである。入力されたオーディオ信号のピッチが揺らいでいる場合には、揺らぎが除かれ、平滑化（平均化）されたピッチ（周期）が得られる。 The pitch detection means 30 detects the zero cross points from the sampling data, and sequentially detects periods in which the intervals between the zero cross points are substantially the same. The pitch smoothing unit 32 performs smoothing by sequentially inputting the periods detected by the pitch detecting unit 30. When the pitch of the input audio signal is fluctuating, the fluctuation is removed, and a smoothed (averaged) pitch (period) is obtained.

制御手段３４は、ピッチ検出手段３０からのピッチ検出信号とピッチ平滑手段３２により平滑化された平滑化ピッチ情報ｌｐや、ＣＰＵ２２を介して操作子２８の操作によって設定されたフォルマント変更係数ＦＯＲＭＡＮＴ−ＶＲや目標ピッチ情報ＳＰＩＴＣＨ等のパラメータを用いてピッチ変換手段３６を制御するものである。 The control means 34 includes a pitch detection signal from the pitch detection means 30 and smoothed pitch information lp smoothed by the pitch smoothing means 32, and a formant change coefficient FORMAT-VR set by the operation of the operator 28 via the CPU 22. The pitch conversion means 36 is controlled using parameters such as target pitch information SPITCH.

ピッチ変換手段３６は、リングメモリに記憶されているサンプリングデータからピッチ検出手段３０により検出されたピッチに対応する周期に基づいて所望の区間を音素として切り出し、制御手段３４の制御に基づいて、切り出された区間を所望のフォルマントに対応する速度で読み出すと共に、再生ピッチ情報ＷＩＤＴＨに対応した周期で合成する。 The pitch converting means 36 cuts out a desired section as a phoneme based on the period corresponding to the pitch detected by the pitch detecting means 30 from the sampling data stored in the ring memory, and cuts out based on the control of the control means 34. The read section is read out at a speed corresponding to a desired formant and synthesized at a period corresponding to the reproduction pitch information WIDTH.

ピッチ平滑手段３２は、上述の通りピッチ検出手段３０により順次検出されたオーディオ信号のピッチに対応する周期を平滑化（平均化）するもので、ローパスフィルタ（低域通過フィルタ）を用いることができる。このローパスフィルタの一例として、今回検出されたピッチ（周期）をＰＩＴＣＨ、平滑率をα（０より大きく１．０より小さい所定の値の係数）とすると平滑化された平滑ピッチｌｐ（平滑化ピッチ情報）は、
ｌｐ＝（ＰＩＴＣＨ−ｌｐ）×α＋ｌｐ（１）
として求めることができる。 The pitch smoothing means 32 smoothes (averages) the period corresponding to the pitch of the audio signal sequentially detected by the pitch detecting means 30 as described above, and a low-pass filter (low-pass filter) can be used. . As an example of the low-pass filter, if the detected pitch (period) is PITCH and the smoothing rate is α (a coefficient having a predetermined value larger than 0 and smaller than 1.0), the smoothed smoothing pitch lp (smoothing pitch) Information)
lp = (PITCH−lp) × α + lp (1)
Can be obtained as

ここで、平滑率αの値を１．０とすれば、ｌｐは、平滑化されない検出されたピッチの値そのものとなり、平滑率αの値を０に近い値に設定すると、検出されたピッチの変化に追従しない平滑化ピッチ情報となる。この平滑率を適切な値に設定することにより、入力されたオーディオ信号のピッチが揺らいでいる場合には、その揺らぎが除かれたピッチ、言換えれば中心ピッチが求められる。 Here, if the value of the smoothing rate α is 1.0, lp becomes the value of the detected pitch that is not smoothed, and if the value of the smoothing rate α is set to a value close to 0, the value of the detected pitch The smoothed pitch information does not follow the change. By setting the smoothing rate to an appropriate value, when the pitch of the input audio signal fluctuates, the pitch from which the fluctuation is removed, in other words, the center pitch is obtained.

制御手段３４は、その揺らぎが除かれたピッチを指定された目標ピッチに変換するようにピッチ変換手段３６を制御する。その結果、ピッチ変換されたオーディオ信号に、入力されたオーディオ信号のピッチの揺らぎが保持されることになる。 The control means 34 controls the pitch converting means 36 so as to convert the pitch from which the fluctuation is removed into a designated target pitch. As a result, the pitch fluctuation of the input audio signal is retained in the pitch-converted audio signal.

なお、この基本実施形態では、上記音素の再生を、第１の波形を再生するための処理経路と、第２の波形を再生するための処理経路とを使用し、それぞれの処理経路では、再生しようとする周期の２倍の周期で音素を再生し、これらを合成するようにしている。 In this basic embodiment, the phoneme is reproduced by using a processing path for reproducing the first waveform and a processing path for reproducing the second waveform. The phonemes are reproduced with a period twice as long as the intended period, and these are synthesized.

この効果付加の動作を、図２、図３及び図４に示すフローチャートに基づいて説明する。ＤＳＰ８は、サンプリングデータの検出ピッチに対応する周期を記憶するレジスタＰＩＴＣＨ、リングメモリからサンプリングデータを切り出す（読出）アドレスを記憶するレジスタＳＡＤＲＳを備えている。更に、後述する再生ピッチ周期長に達したかをカウントするためのレジスタＰＨＡＳＥ、第１の波形の位相をカウントするためのレジスタＰＨ１、第２の波形の位相をカウントするためのレジスタＰＨ２も設けられている。 The effect adding operation will be described with reference to the flowcharts shown in FIGS. The DSP 8 includes a register PITCH that stores a period corresponding to the detected pitch of the sampling data, and a register SADRS that stores an address for reading (reading) the sampling data from the ring memory. Further, a register PHASE for counting whether a reproduction pitch period length described later has been reached, a register PH1 for counting the phase of the first waveform, and a register PH2 for counting the phase of the second waveform are also provided. ing.

また、ピッチ検出手段３０によって検出された周期ＰＩＴＣＨに目標ピッチ情報ＳＰＩＴＣＨを乗算し、その積を平滑化ピッチ情報ｌｐで除算することによって求めた再生ピッチ情報を記憶するためのレジスタＷＩＤＴＨ、検出した周期とフォルマント係数ＦＯＲＭＡＮＴ−ＶＲとから定めたエンベロープの長さを記憶するレジスタＬＥＮＧＴＨ、平滑化されたピッチである平滑化ピッチ情報を記憶するレジスタｌｐ、第１の波形のエンベロープを記憶するためのレジスタＥＮＶ１、及び第２の波形のエンベロープを記憶するためのレジスタＥＮＶ２が、設けられている。 A register WIDTH for storing reproduction pitch information obtained by multiplying the period PITCH detected by the pitch detection means 30 by the target pitch information SPITCH and dividing the product by the smoothed pitch information lp, the detected period And a register LENGTH for storing the length of the envelope determined from the formant coefficient FORMANT-VR, a register lp for storing smoothed pitch information which is a smoothed pitch, and a register ENV1 for storing the envelope of the first waveform And a register ENV2 for storing the envelope of the second waveform.

更に、第１の波形のエンベロープの形状を決定するためのレジスタＷＩＮＤＯＷ１、第２の波形のエンベロープの形状を決定するためのレジスタＷＩＮＤＯＷ２、ＬＥＮＧＴＨの値に基づいて定めたＷＩＮＤＯＷ１、ＷＩＮＤＯＷ２の歩進率を記憶するレジスタＷ−ＲＡＴＥ、第１の波形の切り出し開始アドレスを記憶するレジスタＳＡＤＲＳ１、第２の波形の切り出し開始アドレスを記憶するレジスタＳＡＤＲＳ２、第１及び第２の波形の切り出しの開始位置等の決定のために使用するフラグＦ等も設けられている。 Further, the step rate of WINDOW1 and WINDOW2 determined based on the values of the register WINDOW1 for determining the shape of the envelope of the first waveform, the registers WINDOW2 and LENGTH for determining the shape of the envelope of the second waveform, Register W-RATE to be stored, register SADRS1 to store the first waveform cut-out start address, register SADRS2 to store the second waveform cut-out start address, determination of the first and second waveform cut-out start positions, etc. A flag F used for the purpose is also provided.

これらは、電源の投入の際に、初期化が行われる。即ち、フラグＦは１に、他のものは０に、それぞれ設定される。なお、以下の説明では、各レジスタには、既に適当な値が記憶されているとして説明する。また、図２及び図３に示すフローチャートの各ステップは、ＤＳＰ８にＡ／Ｄ変換器４からサンプリングデータが入力されるごとに実行される。 These are initialized when the power is turned on. That is, the flag F is set to 1 and the others are set to 0. In the following description, it is assumed that appropriate values are already stored in each register. Each step of the flowcharts shown in FIGS. 2 and 3 is executed each time sampling data is input from the A / D converter 4 to the DSP 8.

図２において、サンプリングデータがＡ／Ｄ変換器４から供給されると、これをリングメモリに書き込む（Ｓ２）。次に、この入力されたサンプリングデータに基づいてピッチ検出処理を行う（Ｓ４）。このＳ４の処理がピッチ検出手段３０に相当する。このピッチ検出処理は、ピッチを検出したときにゼロクロス位置のアドレスであるＳＡＤＲＳと検出したピッチに対応する周期であるＰＩＴＣＨとを出力するものである。 In FIG. 2, when sampling data is supplied from the A / D converter 4, it is written to the ring memory (S2). Next, pitch detection processing is performed based on the input sampling data (S4). The process of S4 corresponds to the pitch detection means 30. This pitch detection process outputs SADRS, which is the address of the zero cross position, and PITCH, which is a cycle corresponding to the detected pitch, when the pitch is detected.

このピッチ検出処理は、例えば隣接するゼロクロス間の時間間隔を順次比較するものである。例えば、図５（ａ）は、入力されたオーディオ信号の波形を表すもので、横軸に時間を、縦軸に波形の振幅値を表示している。この図において、入力信号のゼロクロス間の時間間隔がａ０、ｂ０、ｃ０、ｄ０、ａ１、ｂ１、ｃ１、ｄ１であるとする。最初のゼロクロス間の時間間隔ａ０と次のゼロクロス間の時間間隔ｂ０とを比較し、両者が異なると、次にそれらを加算した時間間隔（ａ０＋ｂ０）と隣接する時間間隔（ｃ０＋ｄ０）とを比較する。この比較においても両者が異なると、時間間隔（ａ０＋ｂ０＋ｃ０）と隣接する時間間隔（ｄ０＋ａ１＋ｂ１）とを比較する。やはり両者が異なると、時間間隔（ａ０＋ｂ０＋ｃ０＋ｄ０）と隣接する時間間隔（ａ１＋ｂ１＋ｃ１＋ｄ１）とを比較する。この比較で両者がほぼ一致すると、時間間隔（ａ１＋ｂ１＋ｃ１＋ｄ１）をＰＩＴＣＨ、時刻ｔ０に対応するアドレスをゼロクロス位置のアドレスとして出力する。 This pitch detection process sequentially compares time intervals between adjacent zero crosses, for example. For example, FIG. 5A shows a waveform of an input audio signal, and the horizontal axis represents time and the vertical axis represents the amplitude value of the waveform. In this figure, it is assumed that the time intervals between the zero crosses of the input signal are a0, b0, c0, d0, a1, b1, c1, and d1. The time interval a0 between the first zero crosses is compared with the time interval b0 between the next zero crosses. If they are different, the time interval (a0 + b0) obtained by adding them is compared with the adjacent time interval (c0 + d0). . If both are different in this comparison, the time interval (a0 + b0 + c0) is compared with the adjacent time interval (d0 + a1 + b1). If they are different, the time interval (a0 + b0 + c0 + d0) is compared with the adjacent time interval (a1 + b1 + c1 + d1). When the two values are almost the same in this comparison, the time interval (a1 + b1 + c1 + d1) is output as PITCH, and the address corresponding to time t0 is output as the address at the zero cross position.

その他、例えば特開平３−２８８２００号公報に開示されているように、サンプリングデータのゼロクロス位置と、波形信号のピーク位置とを検出し、これらゼロクロス位置とピーク位置との時間間隔を、以前に検出したゼロクロス位置との時間間隔と比較することによって、ピッチを検出するものも使用することができる。 In addition, as disclosed in, for example, Japanese Patent Laid-Open No. 3-288200, the zero cross position of the sampling data and the peak position of the waveform signal are detected, and the time interval between the zero cross position and the peak position is previously detected. It is also possible to use one that detects the pitch by comparing the time interval with the zero cross position.

Ｓ４の処理においてピッチ検出が行われたか否かを判断し（Ｓ６）、ピッチ検出された場合（Ｓ６：Ｙｅｓ）は、Ｓ４の処理で検出されたゼロクロス位置を切り出しアドレスとし、これによってレジスタＳＡＤＲＳの値を更新し、かつ検出ピッチを１周期長として、これでＰＩＴＣＨの値を更新する（Ｓ８）。 It is determined whether or not pitch detection has been performed in the process of S4 (S6). If the pitch is detected (S6: Yes), the zero-cross position detected in the process of S4 is set as the cut-out address, and thereby the register SADRS The value is updated, and the detected pitch is set to one cycle length, thereby updating the value of PITCH (S8).

ピッチ検出が行われなかった場合（Ｓ６：Ｎｏ）、またはピッチ検出が行われ、上記の両レジスタの更新が行われた場合、レジスタＰＨＡＳＥ、ＰＨ１、ＰＨ２の値をそれぞれ１歩進させる（Ｓ１０）。 When the pitch detection is not performed (S6: No), or when the pitch detection is performed and both the above registers are updated, the values of the registers PHASE, PH1, and PH2 are respectively advanced by 1 (S10). .

次にレジスタＰＨＡＳＥが、ＷＩＤＴＨの値と等しいかまたは大きいかを判断する（Ｓ１２）。ＷＩＤＴＨは、後述するように再生するピッチに対応する１周期長を記憶しており、ＰＨＡＳＥの値がこのＷＩＤＴＨの値に達しているか否かを判断している。即ち、所定の再生ピッチにＰＨＡＳＥの値が達しているか否かを判断している。このＷＩＤＴＨの値が再生ピッチ情報に相当する。 Next, it is determined whether the register PHASE is equal to or greater than the value of WIDTH (S12). WIDTH stores one cycle length corresponding to the pitch to be reproduced as will be described later, and determines whether or not the value of PHASE has reached this WIDTH value. That is, it is determined whether or not the PHASE value has reached a predetermined reproduction pitch. The value of WIDTH corresponds to reproduction pitch information.

ＰＨＡＳＥの値がＷＩＤＴＨの値に達していないと（Ｓ１２：Ｎｏ）、後述する図３に記載のフローチャートのＳ３２の波形処理へ進む。ＰＨＡＳＥの値がＷＩＤＴＨの値に達していると（Ｓ１２：Ｙｅｓ）、レジスタＰＨＡＳＥの値を０とする（Ｓ１４）。 If the PHASE value does not reach the WIDTH value (S12: No), the process proceeds to the waveform processing in S32 of the flowchart shown in FIG. If the value of PHASE has reached the value of WIDTH (S12: Yes), the value of register PHASE is set to 0 (S14).

次に、ピッチ平滑化処理を行う。この処理は、レジスタＰＩＴＣＨの値からレジスタｌｐの値を引いた値に係数αを乗算し、その値にレジスタｌｐの値を加算した値を新たなｌｐとするものであり、この処理によりレジスタＰＩＴＣＨに順次記憶される値が平滑化される。この平滑化された値ｌｐをレジスタｌｐに記憶する（Ｓ１６）。 Next, a pitch smoothing process is performed. In this process, a value obtained by subtracting the value of the register lp from the value of the register PITCH is multiplied by the coefficient α, and a value obtained by adding the value of the register lp to the value is used as a new lp. The values sequentially stored in are smoothed. The smoothed value lp is stored in the register lp (S16).

そして新たなＷＩＤＴＨの値を決定するために、ＰＩＴＣＨの値と目標ピッチ情報ＳＰＩＴＣＨとを乗算し、その積を平滑化ピッチ情報ｌｐの値で除算する。これをレジスタＷＩＤＴＨに記憶させると共に、第１の波形のエンベロープ、第２の波形のエンベロープの周期を決定するために、ＰＩＴＣＨの値をフォルマント係数ＦＯＲＭＡＮＴ−ＶＲで除算し、レジスタＬＥＮＧＴＨに記憶させる（Ｓ１８）。 In order to determine a new WIDTH value, the PITCH value is multiplied by the target pitch information SPITCH, and the product is divided by the smoothed pitch information lp value. This is stored in the register WIDTH, and the PITCH value is divided by the formant coefficient FORMANT-VR and stored in the register LENGTH to determine the envelope periods of the first waveform and the second waveform (S18). ).

次に図３に記載するフローチャートに進み、レジスタＬＥＮＧＴＨの値がレジスタＷＩＤＴＨの値より大きいか否かを判断する（Ｓ２０）。ＬＥＮＧＴＨの値がＷＩＤＴＨの値よりも大きいと（Ｓ２０：Ｙｅｓ）、ＬＥＮＧＴＨの値をＷＩＤＴＨの値とする（Ｓ２２）。これは、ＬＥＮＧＴＨの値がＷＩＤＴＨの値を超えないようにするためである。即ち、後述するＳ３２の波形読み出し処理において波形を加算するための処理経路が２つしか用意していないため、２つ以上の波形が重ならないようにするためである。 Next, proceeding to the flowchart shown in FIG. 3, it is determined whether or not the value of the register LENGTH is larger than the value of the register WIDTH (S20). If the LENGTH value is larger than the WIDTH value (S20: Yes), the LENGTH value is set as the WIDTH value (S22). This is to prevent the LENGTH value from exceeding the WIDTH value. In other words, since only two processing paths for adding waveforms are prepared in the waveform readout process of S32 described later, two or more waveforms are prevented from overlapping.

なお、ＬＥＮＧＴＨの値がＷＩＤＴＨの値以下であると、Ｓ２２のような処理を行わずに、Ｓ２４の処理を行う。また、Ｓ２２の処理に続いて、Ｓ２４の処理も行われる。Ｓ２４の処理では、ＬＥＮＧＴＨの値の逆数を求め、レジスタＷ−ＲＡＴＥに記憶させる。このＷ−ＲＡＴＥの値は、後述するようにＷＩＮＤＯＷ１、ＷＩＮＤＯＷ２の値を歩進させるために使用する。また、Ｓ２４の処理では、フラグＦの値を今までの値と反転させることも行う。 If the LENGTH value is less than or equal to the WIDTH value, the process of S24 is performed without performing the process of S22. Further, following the process of S22, the process of S24 is also performed. In the process of S24, the reciprocal of the value of LENGTH is obtained and stored in the register W-RATE. This W-RATE value is used to step up the values of WINDOW1 and WINDOW2 as will be described later. Further, in the process of S24, the value of the flag F is also inverted from the current value.

次に、Ｓ２４の処理で反転させたフラグＦの値が１であるか、−１であるかを判断する（Ｓ２６）。フラグＦが１であると（Ｓ２６：Ｙｅｓ）、第１の波形の読出開始のため、レジスタＰＨ１、ＷＩＮＤＯＷ１をそれぞれ０とし、Ｓ８の処理で決定したＳＡＤＲＳの値（切り出しアドレス）を切り出し開始アドレスレジスタＳＡＤＲＳ１に記憶させる（Ｓ２８）。 Next, it is determined whether the value of the flag F inverted in the process of S24 is 1 or -1 (S26). When the flag F is 1 (S26: Yes), in order to start reading the first waveform, the registers PH1 and WINDOW1 are set to 0, and the value of SADRS (cutout address) determined in the process of S8 is set to the cutout start address register. The data is stored in SADRS1 (S28).

またＳ２４の処理で反転させたＦの値が−１であると（Ｓ２６：Ｎｏ）、第２の波形の読出開始のため、レジスタＰＨ２、ＷＩＮＤＯＷ２をそれぞれ０とし、Ｓ８で決定したレジスタＳＡＤＲＳの値（切り出しアドレス）を切り出し開始アドレスレジスタＳＡＤＲＳ２に記憶させる（Ｓ３０）。 If the value of F inverted in the process of S24 is -1 (S26: No), the value of the register SADRS determined in S8 is set to 0 by setting the registers PH2 and WINDOW2 to 0 in order to start reading the second waveform. The (cutout address) is stored in the cutout start address register SADRS2 (S30).

このようなステップＳ１２、Ｓ１４、Ｓ１６、Ｓ１８、Ｓ２０、Ｓ２２、Ｓ２４、Ｓ２６、Ｓ２８、Ｓ３０によって、２再生ピッチの周期長に相当する時間（ＷＩＤＴＨの値の２倍）経過ごとに、第１及び第２の波形データの切り出し開始アドレスが更新され、ＰＨＡＳＥの値がＷＩＤＴＨの値に達するごとにフラグＦの値が反転されることになる。 By such steps S12, S14, S16, S18, S20, S22, S24, S26, S28, and S30, the first and second time intervals corresponding to the period length of two playback pitches (twice the value of WIDTH) are passed. The cut start address of the second waveform data is updated, and the value of the flag F is inverted every time the PHASE value reaches the WIDTH value.

Ｓ２８またはＳ３０の処理に続いて、またはＳ１２の処理においてレジスタＰＨＡＳＥの値がレジスタＷＩＤＴＨの値に達していないと判断された場合、波形読出処理を行う（Ｓ３２）。このように波形読みだし処理は、ＰＨＡＳＥの値がＷＩＤＴＨに達するまでは、ＷＩＤＴＨの値、ＬＥＮＧＴＨの値（ひいてはＷ−ＲＡＴＥの値）を変更せずに行われる。 Following the processing of S28 or S30, or when it is determined in the processing of S12 that the value of the register PHASE has not reached the value of the register WIDTH, a waveform reading process is performed (S32). In this manner, the waveform reading process is performed without changing the WIDTH value and LENGTH value (and thus the W-RATE value) until the PHASE value reaches WIDTH.

後述する波形読み出し処理では、Ｗ−ＲＡＴＥの値で、第１及び第２の波形のエンベロープＥＮＶ１、ＥＮＶ２の値が制御される。 In the waveform readout process described later, the values of envelopes ENV1 and ENV2 of the first and second waveforms are controlled by the value of W-RATE.

そして、Ｗ−ＲＡＴＥ、ひいてはＬＥＮＧＴＨの値は、Ｓ１８において検出ピッチをフォルマント変更係数ＦＯＲＭＡＮＴ−ＶＲによって変更したものであるので、波形読出処理される波形のピッチは、フォルマント変更係数ＦＯＲＭＡＮＴ−ＶＲに応じたものとなる。 Since the value of W-RATE, and hence LENGTH, is obtained by changing the detected pitch by the formant change coefficient FORMAT-VR in S18, the pitch of the waveform to be subjected to the waveform reading process corresponds to the formant change coefficient FORMANT-VR. It will be a thing.

次に図４を参照して、この波形読出処理について説明する。まずカウンタＷＩＮＤＯＷ１の値をＷ−ＲＡＴＥの値だけ歩進させる（Ｓ３４）。そして、歩進させたＷＩＮＤＯＷ１の値が１より小さいか、１以上であって２より小さいか、２以上であるかを判定する（Ｓ３６）。 Next, the waveform reading process will be described with reference to FIG. First, the value of the counter WINDOW1 is incremented by the value of W-RATE (S34). Then, it is determined whether the value of the incremented WINDOW1 is less than 1, 1 or more, less than 2, or 2 or more (S36).

１より小さい場合、ＷＩＮＤＯＷ１の値をレジスタＥＮＶ１に記憶させ（Ｓ３８）、１以上であって２より小さいとき、２からＷＩＮＤＯＷ１の値を減算した値をレジスタＥＮＶ１に記憶させ（Ｓ４０）、２以上のとき、ＥＮＶ１の値を０とする（Ｓ４２）。 If it is smaller than 1, the value of WINDOW1 is stored in the register ENV1 (S38). If it is 1 or more and smaller than 2, the value obtained by subtracting the value of WINDOW1 from 2 is stored in the register ENV1 (S40). At this time, the value of ENV1 is set to 0 (S42).

Ｓ３４乃至Ｓ４０の処理は、Ｗ−ＲＡＴＥの値ずつ値が増加する鋸歯状波を作成し、これの値を１で折り返すことによって、ＥＮＶ１を作成している。但し、ＷＩＮＤＯＷ１の値が２を超えた場合には、Ｓ４２の処理によってＥＮＶ１を０としている。即ち、フォルマント係数ＦＯＲＭＡＮＴ−ＶＲと検出ピッチとに基づいて定めたＬＥＮＧＴＨの値の逆数であるＷ−ＲＡＴＥずつ１まで増加し、その後、Ｗ−ＲＡＴＥずつ０まで減少する三角波を第１の波形のエンベロープとして作成している。 In the processing of S34 to S40, a sawtooth wave whose value increases by the value of W-RATE is created, and this value is folded back to 1, thereby creating ENV1. However, when the value of WINDOW1 exceeds 2, ENV1 is set to 0 by the process of S42. That is, a triangular wave that increases to 1 by W-RATE, which is the reciprocal of the value of LENGTH determined based on the formant coefficient FORMANT-VR and the detection pitch, and then decreases to 0 by W-RATE is converted into an envelope of the first waveform. It is created as.

また、Ｓ３８、Ｓ４０またはＳ４２の処理に続いて、レジスタＰＨ１の値（切り出しアドレスの歩進値）にフォルマント係数ＦＯＲＭＡＮＴ−ＶＲを乗算した値を、第１の波形の切り出し開始アドレスを記憶しているレジスタＳＡＤＲＳ１の値と加算して、第１の波形の切り出しアドレスを記憶するレジスタＡＤＲＳ１に記憶させる（Ｓ４４）。これに続いて、リングメモリから切り出しアドレスＡＤＲＳ１で第１の波形の波形データＤＡＴＡ１を読み出す（Ｓ４６）。 Further, following the processing of S38, S40 or S42, a value obtained by multiplying the value of the register PH1 (stepping value of the cut-out address) by the formant coefficient FORMAT-VR is stored as the cut-out start address of the first waveform. The value is added to the value of the register SADRS1, and is stored in the register ADRS1 that stores the cut-out address of the first waveform (S44). Subsequently, the waveform data DATA1 of the first waveform is read from the ring memory with the cut-out address ADRS1 (S46).

このように読出アドレスはフォルマント係数ＦＯＲＭＡＮＴ−ＶＲによって変更されているので、結果的には波形データＤＡＴＡ１の読出速度が、フォルマント係数ＦＯＲＭＡＮＴ−ＶＲによって変更されている。 As described above, since the read address is changed by the formant coefficient FORMAT-VR, as a result, the read speed of the waveform data DATA1 is changed by the formant coefficient FORMAT-VR.

これに続いて、ＷＩＮＤＯＷ２の値をＷ−ＲＡＴＥだけ歩進させる（Ｓ４８）。歩進させたＷＩＮＤＯＷ２の値が１より小さいか、１以上で２未満であるか、または２以上であるか判断する（Ｓ５０）。そして、ＷＩＮＤＯＷ２の値が１より小さいと、ＷＩＮＤＯＷ２の値をレジスタＥＮＶ２に記憶させ（Ｓ５２）、ＷＩＮＤＯＷ２の値が１以上で２未満であると、２からＷＩＮＤＯＷ２の値を減算した値をレジスタＥＮＶ２の記憶させ（Ｓ５４）、ＷＩＮＤＯＷ２の値が２以上であると、ＥＮＶ２の値を０とする（Ｓ５６）。このようにして、第２の波形のエンベロープを準備する。 Following this, the value of WINDOW2 is incremented by W-RATE (S48). It is determined whether the value of the incremented WINDOW2 is less than 1, 1 or more and less than 2, or 2 or more (S50). If the value of WINDOW2 is smaller than 1, the value of WINDOW2 is stored in the register ENV2 (S52). If the value of WINDOW2 is 1 or more and less than 2, the value obtained by subtracting the value of WINDOW2 from 2 is stored in the register ENV2. If the value of WINDOW2 is 2 or more, the value of ENV2 is set to 0 (S56). In this way, the envelope of the second waveform is prepared.

Ｓ５２、Ｓ５４またはＳ５６の処理に続いて、レジスタＰＨ２の値にフォルマント係数ＦＯＲＭＡＮＴ−ＶＲを乗算した値と第２の波形データ用の切り出し開始アドレスＳＡＤＲＳ２の値とを加算した値を、第２の波形データ用の切り出しアドレス用のレジスタＡＤＲＳ２に記憶させる（Ｓ５８）。そして、リングメモリからアドレスＡＤＲＳ２で第２の波形の波形データ（ＤＡＴＡ２）を読み出す（Ｓ６０）。 Subsequent to the processing of S52, S54, or S56, the value obtained by multiplying the value of the register PH2 by the value of the formant coefficient FORMAT-VR and the value of the extraction start address SADRS2 for the second waveform data is added to the second waveform. It is stored in the register ADRS2 for the cut-out address for data (S58). Then, the waveform data (DATA2) of the second waveform is read from the ring memory at the address ADRS2 (S60).

このようにして読出されたＤＡＴＡ１にＥＮＶ１の値を乗算したものと、ＤＡＴＡ２にＥＮＶ２の値を乗算したものとを、加算したものを出力ＯＵＴとする（Ｓ６２）。このような波形読出処理が行われた後、図３に示すように出力ＯＵＴを送出する（Ｓ６４）。 An output OUT is obtained by adding the value of ENV1 multiplied by the value of DATA1 read out in this manner and the value of DATA2 multiplied by the value of ENV2 (S62). After such waveform reading processing is performed, output OUT is sent out as shown in FIG. 3 (S64).

図５は、目標ピッチ情報ＳＰＩＴＣＨに応じて設定されたＷＩＤＴＨが、検出された周期ＰＩＴＣＨより長く設定され、フォルマント変更係数ＦＯＲＭＡＮＴ−ＶＲが１に設定された場合の各部の波形を示すものである。すなわち、この図５では、入力されたオーディオ信号のピッチを低く変換し、フォルマント特性は元のままとする場合を示している。 FIG. 5 shows the waveform of each part when WIDTH set according to the target pitch information SPITCH is set longer than the detected period PITCH and the formant change coefficient FORMAT-VR is set to 1. That is, FIG. 5 shows a case where the pitch of the input audio signal is converted to a low value and the formant characteristics remain unchanged.

図５（ａ）は、入力波形であり、時刻ｔ０からｔ１までが１周期であってその周期をＰ０、次に時刻ｔ１からｔ２までが次の１周期であり、その周期をＰ１として示している。
以下同様である。
（ｂ）は、レジスタＰＨＡＳＥの値の変化であり、再生される楽音のピッチに対応する周期であるＷＩＤＴＨを周期とする鋸歯状波である。この周期毎の時刻をｔｗ０、ｔｗ１・・・とする。
（ｃ）は、ＰＨ１、（ｄ）は、ＰＨ２を示す図で、ＷＩＤＴＨの２倍の周期の鋸歯状波であって、ＰＨ１とＰＨ２とは、位相がＷＩＤＴＨだけ異なっている。 FIG. 5A shows an input waveform, which is one cycle from time t0 to t1, and that cycle is P0, and then from time t1 to t2 is the next one cycle, which is shown as P1. Yes.
The same applies hereinafter.
(B) is a change in the value of the register PHASE, and is a sawtooth wave having a period of WIDTH, which is a period corresponding to the pitch of the reproduced musical sound. The times for each cycle are tw0, tw1,.
(C) is a diagram showing PH1, and (d) is a diagram showing PH2, and is a sawtooth wave having a period twice that of WIDTH. PH1 and PH2 are different in phase by WIDTH.

（ｅ）は、ＥＮＶ１を表す図であり、ＬＥＮＧＴＨは、入力されたオーディオ信号のピッチに対応する周期に設定されている。このＬＥＮＧＴＨの期間に入力されたオーディオ信号の１周期が、ここでは入力されたサンプリング周期と同じ周期で読み出される。 (E) is a diagram showing ENV1, and LENGTH is set to a cycle corresponding to the pitch of the input audio signal. One period of the audio signal input during this LENGTH period is read out at the same period as the input sampling period.

時刻がｔｗ２の時、入力波形のＰ０の周期が確定されるので、この時点でＬＥＮＧＴＨの値が、Ｐ０に設定され、切り出しアドレスＳＡＤＲＳは、時刻ｔ０にリングメモリに記憶したアドレスに設定される。
（ｆ）は、ＥＮＶ２を表す図であり、ＥＮＶ１と同様にＬＥＮＧＴＨは、入力されたオーディオ信号のピッチに対応する周期に設定されている。ＥＮＶ１とＥＮＶ２とは、位相がＷＩＤＴＨだけ異なっており、読出された波形に乗算されるエンベロープ波形であり、ＥＮＶ１とＥＮＶ２とでエンベロープがクロスフェードされる。 When the time is tw2, the period of P0 of the input waveform is determined. At this time, the value of LENGTH is set to P0, and the cut-out address SADRS is set to the address stored in the ring memory at time t0.
(F) is a diagram showing ENV2, and LENGTH is set to a cycle corresponding to the pitch of the input audio signal as in ENV1. ENV1 and ENV2 are envelope waveforms that are different in phase by WIDTH and multiplied by the read waveform, and the envelopes are cross-faded between ENV1 and ENV2.

（ｇ）、（ｈ）は、それぞれのエンベロープＥＮＶ１およびＥＮＶ２が読出された音素に乗算された様子を示すもので、これらの波形が合成されて出力される。 (G) and (h) show how the envelopes ENV1 and ENV2 are multiplied by the read phonemes, and these waveforms are synthesized and output.

以上説明したように、この実施例では、出力される信号のピッチは、ピッチ検出手段により検出された周期であるＰＩＴＣＨと設定された目標ピッチ情報ＳＰＩＴＣＨとを乗算し、その積を平滑化ピッチ情報ｌｐにより除算した値であるＷＩＤＴＨの値により定められ、出力される信号の読出速度は、フォルマント変更係数ＦＯＲＭＡＮＴ−ＶＲによって定められる。 As described above, in this embodiment, the pitch of the output signal is multiplied by the pitch detected by the pitch detection means PITCH and the set target pitch information SPITCH, and the product is smoothed pitch information. It is determined by the value of WIDTH, which is a value divided by lp, and the read speed of the output signal is determined by the formant change coefficient FORMAT-VR.

したがって、入力されたオーディオ信号のピッチが揺らいでいる場合には、出力される信号のピッチは、その中心ピッチが目標ピッチに変換されつつも平滑率に対応した揺らぎが維持される。 Therefore, when the pitch of the input audio signal fluctuates, the fluctuation of the pitch of the output signal corresponding to the smoothing rate is maintained while the center pitch is converted to the target pitch.

図６は、本発明の第２の実施形態について説明するためのもので、本発明をコーラス効果装置に応用した実施例を示すブロック図である。図１（ｂ）と異なる点は、ピッチ変換手段は、４つのコーラスチャネル３６ａ、３６ｂ、３６ｃ、３６ｄを備えている点である。その他のピッチ検出手段３０、ピッチ平滑手段３２は、同じであり制御手段３４は、それぞれのコーラスチャネルに検出されたピッチ情報ＰＩＴＣＨとそれぞれの再生ピッチ情報ＷＩＤＴＨを供給する。 FIG. 6 is a block diagram illustrating an example in which the present invention is applied to a chorus effect device for explaining a second embodiment of the present invention. The difference from FIG. 1 (b) is that the pitch converting means includes four chorus channels 36a, 36b, 36c, and 36d. The other pitch detection means 30 and pitch smoothing means 32 are the same, and the control means 34 supplies the detected pitch information PITCH and the respective reproduction pitch information WIDTH to each chorus channel.

例えば、カラオケ装置にこのコーラス効果装置を応用する場合は、カラオケ装置に備えられたシーケンサから曲の進行に合わせて複数のハーモニーに対応する音高情報がＭＩＤＩ規格により規定されるノートナンバにより供給される。ＣＰＵ２２は、これらのノートナンバに対応する目標ピッチ情報ＳＰＩＴＣＨを求め、制御手段３４に供給する。制御手段３４は、それぞれの目標ピッチ情報ＳＰＩＴＣＨについて、検出されたピッチＰＩＴＣＨおよび平滑化ピッチｌｐに対応する再生ピッチ情報ＷＩＤＴＨを求め各コーラスチャネルに供給する。 For example, when this chorus effect device is applied to a karaoke device, pitch information corresponding to a plurality of harmonies is supplied from a sequencer provided in the karaoke device according to the progress of a song by a note number defined by the MIDI standard. The The CPU 22 obtains target pitch information SPITCH corresponding to these note numbers and supplies it to the control means 34. The control means 34 obtains reproduction pitch information WIDTH corresponding to the detected pitch PITCH and the smoothed pitch lp for each target pitch information SPITCH, and supplies it to each chorus channel.

また、複数のハーモニーのフォルマントをそれぞれ変更設定することにより、フォルマントを変更することができる。例えば、入力されるボーカルが男性の声であれば、ハーモニーのフォルマントを女性、または子供のフォルマントに設定すれば、あたかも男性のボーカルに女性、あるいは子供のハーモニーが付加されたような効果を得ることができる。 In addition, the formant can be changed by changing and setting the formants of a plurality of harmonies. For example, if the input vocal is a male voice, if the harmony formant is set to female or child formant, the effect is as if a female or child harmony was added to the male vocal. Can do.

図７は、本発明の第３の実施形態について説明するためのもので、本発明を電子楽器に応用した実施例のブロック図である。図１（ａ）と異なる点は、オーディオ信号が予め記憶手段であるストレージ３８に記憶されている点と、目標ピッチを指定する手段が鍵盤４０である点であり、その他の構成は、図１（ａ）と同じであり同じ部番を付している。図１（ａ）では、オーディオ信号が外部から入力され、そのオーディオ信号のピッチを変換する場合の実施例であるが、図７の場合は、例えばハードディスクなどの大容量のメモリに多数のサンプリングされた波形を記憶し、いずれかの波形を選択し、目標とするピッチを鍵盤４０に備えられた鍵を押下することにより指定するものである。 FIG. 7 is a block diagram of an example in which the present invention is applied to an electronic musical instrument for explaining a third embodiment of the present invention. The difference from FIG. 1A is that an audio signal is stored in advance in a storage 38 which is a storage means, and a means for designating a target pitch is a keyboard 40. The other configurations are as shown in FIG. It is the same as (a) and has the same part number. FIG. 1A shows an embodiment in which an audio signal is input from the outside and the pitch of the audio signal is converted. In the case of FIG. 7, a large number of samples are sampled in a large-capacity memory such as a hard disk. The waveform is stored, one of the waveforms is selected, and the target pitch is designated by pressing a key provided on the keyboard 40.

この場合には、ストレージ３８に記憶された波形の中から選択された波形を読出し、ピッチを順次検出してそのピッチを平滑化し、その平滑化された平滑化ピッチと、鍵盤４０により指定された目標ピッチとに基づいてピッチ変換する。このことにより、ストレージ３８に記憶されている波形のピッチの揺らぎを、ピッチ変換されて出力する波形のピッチに付与することができる。 In this case, the waveform selected from the waveforms stored in the storage 38 is read out, the pitches are sequentially detected and the pitches are smoothed, and the smoothed smoothed pitches and the keyboard 40 are designated. Pitch conversion is performed based on the target pitch. As a result, fluctuations in the pitch of the waveform stored in the storage 38 can be added to the pitch of the waveform that is output after being pitch-converted.

なお、以上の実施例では、ピッチ変換装置として、オーディオ信号の所定区間を切り出し、その切り出した区間を再生ピッチに対応する周期で合成する方式のものとしたが、メモリにオーディオ信号を所定のサンプリング周波数でサンプリングして記憶し、そのサンプリング周波数とは異なるサンプリング周波数で読み出すことによりピッチを変換する方式のものとしてもよい。例えば、特許第２５１９４４１号に開示されたコーラス効果装置は、この方式を採用したものである。この公報では、目標ピッチをｆ１、入力されたオーディオ信号のピッチをｆ２とし、ピッチ変換量Ｐをｆ１／ｆ２とし、入力されたオーディオ信号を所定のサンプリング周波数Ｆ１でメモリに記憶し、読み出す場合は、このピッチ変換量ＰをＦ１に乗算したサンプリング周波数Ｆ２で読出している。この方式に、本発明を適用するには、検出されたピッチｆ２をピッチ平滑手段３２により平滑化されたピッチに置き換えればよい。なお、ここでは、サンプリング周波数を変更するいわゆる可変サンプリング方式であるが、サンプリング周波数は一定で、歩進するアドレスの幅を、再生する速度に対応する幅とし、サンプル点がアドレスの少数点になる場合には、補間方法によりその小数点のアドレスに対応する振幅値を求める固定サンプリング方式としてもよい。 In the above embodiment, the pitch conversion device is a method of cutting out a predetermined section of the audio signal and synthesizing the cut out section with a period corresponding to the reproduction pitch. The pitch may be converted by sampling at a frequency and storing it, and reading at a sampling frequency different from the sampling frequency. For example, the chorus effect device disclosed in Japanese Patent No. 2519441 adopts this method. In this publication, when the target pitch is f1, the pitch of the input audio signal is f2, the pitch conversion amount P is f1 / f2, and the input audio signal is stored in the memory at a predetermined sampling frequency F1 and read out. The pitch conversion amount P is read at a sampling frequency F2 obtained by multiplying F1. In order to apply the present invention to this method, the detected pitch f2 may be replaced with a pitch smoothed by the pitch smoothing means 32. In this example, the so-called variable sampling method is used to change the sampling frequency. However, the sampling frequency is constant, the width of the address that advances is set to the width corresponding to the reproduction speed, and the sampling point is the decimal point of the address. In this case, a fixed sampling method may be used in which an amplitude value corresponding to the decimal point address is obtained by an interpolation method.

なお、請求項１記載の入力手段は、図１（ｂ）の入力端子２３が該当し、目標ピッチ指定手段は、図１（ａ）の操作子２８、または図６を参照して説明したＭＩＤＩ情報や図７の鍵盤４０が該当する。 The input means described in claim 1 corresponds to the input terminal 23 in FIG. 1B, and the target pitch designating means is the operation element 28 in FIG. 1A or the MIDI described with reference to FIG. This corresponds to the information and the keyboard 40 shown in FIG.

以上、実施例に基づき本発明を説明したが、本発明は上述した実施例に何ら限定されるものではなく、本発明の趣旨を逸脱しない範囲内で種々の改良変更が可能であることは容易に推察できるものである。 The present invention has been described above based on the embodiments. However, the present invention is not limited to the above-described embodiments, and various modifications and changes can be easily made without departing from the spirit of the present invention. Can be inferred.

例えば、上記実施例では、ピッチ検出手段３０をＤＳＰ８に設けたものとしたが、この処理をＣＰＵ２２により実行するようにしてもよい。その場合には、ＤＳＰ８からゼロクロス間の時間情報や波形のピークの時間などをＣＰＵ２２に送り、ＣＰＵ２２は、これらの値から自己相関を演算により求め、周期を抽出する。 For example, in the above embodiment, the pitch detection means 30 is provided in the DSP 8, but this processing may be executed by the CPU 22. In that case, the DSP 8 sends time information between zero crosses, the peak time of the waveform, and the like to the CPU 22, and the CPU 22 obtains autocorrelation from these values by calculation and extracts the period.

また、同様に上記実施例では、ピッチ平滑手段３２をＤＳＰ８に設けたものとしたが、この処理をＣＰＵ２２により実行するようにしてもよい。その場合には、ＤＳＰ８に設けたピッチ検出手段により検出されたピッチの値をＣＰＵ２２が読み取り、平滑化した平滑化ピッチ情報を制御手段３４に供給すればよい。また、同様に、制御手段３４により行われる処理をＣＰＵ２２により実行されるようにしてもよい。 Similarly, in the above embodiment, the pitch smoothing means 32 is provided in the DSP 8, but this processing may be executed by the CPU 22. In that case, the CPU 22 may read the value of the pitch detected by the pitch detection means provided in the DSP 8 and supply the smoothed smoothed pitch information to the control means 34. Similarly, the process performed by the control unit 34 may be executed by the CPU 22.

また、順次検出されるピッチの値を平滑化する低域通過フィルタの演算式（１）を例示したが、上記に代わる演算式として、
ｌp ＝ＰＩＴＣＨ×α ＋（１ーα）×ｌp
を用いてもよいし、その他の平滑化演算方法を用いてもよい。 Moreover, although the arithmetic expression (1) of the low-pass filter which smoothes the value of the pitch detected sequentially is illustrated, as an arithmetic expression instead of the above,
lp = PITCH × α + (1−α) × lp
Or other smoothing calculation methods may be used.

（ａ）は、本発明によるピッチ変換装置のブロック図、（ｂ）は、ＤＳＰにおける処理を説明するためのブロック図である。(A) is a block diagram of the pitch conversion apparatus by this invention, (b) is a block diagram for demonstrating the process in DSP. ＤＳＰが実行する処理の一部を示すフローチャートである。It is a flowchart which shows a part of process which DSP performs. 図２に示すフローチャートに続くフローチャートである。It is a flowchart following the flowchart shown in FIG. 図３のフローチャートにおける波形読出し処理の詳細を示すフローチャートである。It is a flowchart which shows the detail of the waveform read-out process in the flowchart of FIG. ＤＳＰにおいて実行される処理の部分毎の波形を示す図である。It is a figure which shows the waveform for every part of the process performed in DSP. 本発明を応用したコーラス効果装置のブロック図を示す図である。It is a figure which shows the block diagram of the chorus effect apparatus to which this invention is applied. 本発明を応用した電子楽器のブロック図を示す図である。It is a figure which shows the block diagram of the electronic musical instrument to which this invention is applied.

Explanation of symbols

８ＤＳＰ
２２ＣＰＵ
２３入力端子（入力手段）
３０ピッチ検出手段
３２ピッチ平滑手段
３４制御手段
３６ピッチ変換手段 8 DSP
22 CPU
23 Input terminal (input means)
30 Pitch detection means 32 Pitch smoothing means 34 Control means 36 Pitch conversion means

Claims

An input means for inputting an audio signal;
Pitch detection means for sequentially detecting the pitch of the audio signal input to the input means;
Pitch smoothing means for smoothing the pitch detected by the pitch detecting means and obtaining smoothed pitch information;
Target pitch designating means for designating the target pitch;
The pitch conversion means for converting the pitch of the audio signal input to the input means, the pitch conversion based on the smoothed pitch information smoothed by the pitch smoothing means and the target pitch specified by the target pitch specifying means. And a control means for controlling the means.

2. The pitch converting apparatus according to claim 1, wherein the pitch smoothing means performs low-pass filter processing on the pitches sequentially detected by the pitch detecting means.

The pitch smoothing means obtains smoothed pitch information by multiplying a difference value obtained by subtracting a previously smoothed value from the pitch value detected this time by the pitch detecting means by a predetermined coefficient. The pitch converter according to claim 1 or 2, characterized in that

The pitch converting means cuts out a predetermined section of the audio signal based on the pitch detected by the pitch detecting means, and smoothed pitch information smoothed by the pitch smoothing means and the target pitch. 4. The pitch conversion apparatus according to claim 1, wherein the pitch conversion apparatus is one that synthesizes at a period corresponding to the target pitch designated by the designation means.

The previous pitch conversion means reads the audio signal stored in the waveform storage means at a speed corresponding to the smoothed pitch information smoothed by the pitch smoothing means and the target pitch specified by the target pitch specifying means. The pitch converter according to any one of claims 1 to 3, wherein: