JP2004078232A

JP2004078232A - Method and device for restoring wide-band voice, voice transmission system, and voice transmission method

Info

Publication number: JP2004078232A
Application number: JP2003315485A
Authority: JP
Inventors: Hirohisa Tazaki; 田崎　裕久
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2003-09-08
Filing date: 2003-09-08
Publication date: 2004-03-11
Anticipated expiration: 2019-09-02
Also published as: JP3560964B2

Abstract

<P>PROBLEM TO BE SOLVED: To obtain a wide-band voice restoring device which restores a wide-band voice signal of high quality by estimating a stable wide-band sound source with a correct amplitude from narrow-band voices or narrow-band voice codes under less influence of differences of speakers and noise. <P>SOLUTION: The wide-band voice restoring device is provided with a narrow-band sound source decoding means which uses narrow-band voice codes to generate narrow-band synthesized sounds, a spectrum decoding means which uses narrow-band spectral codes separated from narrow-band voice codes to estimate wide-band spectral parameters, a wide-band sound source decoding means which uses narrow-band sound source codes separated from narrow-band voice codes to estimate a wide-band sound source signal, and a synthesizing means which generates a wide-band voice signal from the generated narrow-band synthesized sounds, the estimated wide-band spectral parameters, and the wide-band sound source signal. <P>COPYRIGHT: (C)2004,JPO

Description

　この発明は、帯域制限された狭帯域音声信号や、狭帯域音声信号を符号化した狭帯域音声符号から広帯域の音声信号を復元する広帯域音声復元装置に関するものである。 The present invention relates to a wideband speech restoration apparatus for restoring a wideband speech signal from a narrowband speech signal whose band is limited or a narrowband speech code obtained by encoding the narrowband speech signal.

　狭帯域音声信号の一例として、現在の電話音声がある。電話システムでは音声信号は約３００Ｈｚから３．４ＫＨｚの帯域に制限されて伝送されており、帯域制限がない場合に比べると、貧弱で籠った感じの音質となっている。高品質化するためには広帯域の音声信号を伝送できる電話システムを構築することが考えられるが、多くの時間と経費が必要である。 One example of a narrowband audio signal is the current telephone audio. In the telephone system, the audio signal is transmitted in a limited band of about 300 Hz to 3.4 KHz, and the sound quality is poor and crowded as compared with the case where there is no band limitation. To improve the quality, a telephone system capable of transmitting a wideband voice signal may be constructed, but much time and expense are required.

　電話帯域に制限された狭帯域音声から広帯域音声信号を復元する広帯域音声復元方法として考えられた従来のものに、特開平６−１１８９９５号がある。 Japanese Patent Laid-Open No. 6-118995 discloses a conventional method for restoring a wideband speech signal from a narrowband speech limited to a telephone band.

　特開平６−１１８９９５号は、狭帯域音声信号をＬＰＣ分析してスペクトルパラメータを算出し、このスペクトルパラメータを狭帯域符号帳を用いてベクトル量子化する。そして、狭帯域符号帳と対応づけて学習した広帯域符号帳を用いて広帯域のスペクトルパラメータを復号する。このスペクトルパラメータを用いてＬＰＣ合成処理を行い、仮の広帯域音声信号を得る。狭帯域音声信号をアップサンプリングしたものに、仮の広帯域音声信号から狭帯域音声信号以外の帯域成分を抽出して加算することで、最終的な広帯域音声信号を生成する。なお、広帯域のＬＰＣ合成処理を行う場合には、広帯域の音源信号が必要となるが、この音源信号の生成方法については具体的に開示されていない。 Japanese Patent Application Laid-Open No. 6-118995 discloses an LPC analysis of a narrowband speech signal to calculate a spectrum parameter, and vector-quantizes the spectrum parameter using a narrowband codebook. Then, broadband spectral parameters are decoded using the wideband codebook learned in association with the narrowband codebook. An LPC synthesis process is performed using the spectrum parameters to obtain a temporary wideband audio signal. A final wideband audio signal is generated by extracting and adding a band component other than the narrowband audio signal from the temporary wideband audio signal to an upsampled narrowband audio signal. Note that when performing wideband LPC synthesis processing, a wideband sound source signal is required, but a method for generating the sound source signal is not specifically disclosed.

　特開平６−１１８９９５号と同じ構成を持ち、広帯域の音源信号生成について開示されている文献として、文献１「コードブックマッピングによる狭帯域音声から広帯域音声の復元」電子情報通信学会、信学技報SP93-61 (1993-08) がある。 As a document having the same configuration as that of Japanese Patent Application Laid-Open No. 6-118995, and which discloses a wide-band sound source signal generation, reference 1 "Reconstruction of wide-band speech from narrow-band speech by codebook mapping" IEICE, IEICE There is SP93-61 (1993-08).

　この文献１では、広帯域の音源生成方法として２つの方法が開示されている。文献 In this document 1, two methods are disclosed as a wideband sound source generation method.

　第１の方法は、狭帯域音声を分析して得られたピッチとパワーを用いて、同業者間では一般的な方法によって音源生成を行う。すなわち、有声音ではピッチ周期で繰り返すインパルス列、無声音では白色雑音を生成し、パワーによってその振幅を決定する。 The first method uses a pitch and power obtained by analyzing a narrowband voice to generate a sound source by a general method among those skilled in the art. That is, an impulse train that repeats at a pitch cycle is generated for voiced sounds, and white noise is generated for unvoiced sounds, and the amplitude is determined by power.

　なお文献１では、音質改善のために幾つかの後処理を行っている。３００Ｈｚ以下の低域を復元する場合には、復元帯域のパワー不足を補うために低域復元音のパワーを低数倍する。３．４Ｈｚから７．３ＫＨｚの高域を復元する場合には、インパルス列を音源としたことによって発生するパルス的な音を軽減するためにパルスをつぶすようにｃｏｓｉｎｅ関数をかける。 In Reference 1, some post-processing is performed to improve sound quality. When restoring a low frequency of 300 Hz or less, the power of the low frequency restored sound is multiplied by a low multiple in order to compensate for the power shortage in the restored band. When restoring a high band from 3.4 Hz to 7.3 KHz, a cosine function is applied so as to crush the pulse in order to reduce a pulse-like sound generated by using an impulse train as a sound source.

　第２の方法は、狭帯域音声信号のスペクトルパラメータをベクトル量子化し、得られた符号に対応する狭帯域の代表波形素片と高域の代表波形素片を選択する。そして、この２つの波形素片に対して以下の処理を行う。波形素片の有声無声を判定し、有声音の場合には狭帯域音声信号を分析して得られたピッチに同期して前記波形素片を重ね合わせる。無声音の場合には、波形素片のランダムな位置から必要な長さの信号を切り出す。狭帯域波形素片から上記処理によって生成された信号と狭帯域スペクトルパラメータを用いて合成された合成音と狭帯域音声のパワー比を算出する。そして、高域波形素片から上記処理によって生成された信号と広帯域のスペクトルパラメータを用いて合成音を生成し、これに前記パワー比を乗ずることで高域の復元信号を得る。 {Circle around (2)} In the second method, the spectral parameters of the narrowband speech signal are vector-quantized, and a narrowband representative waveform element and a highband representative waveform element corresponding to the obtained code are selected. Then, the following processing is performed on these two waveform segments. The voiced and unvoiced waveform segments are determined, and in the case of voiced sounds, the waveform segments are superimposed in synchronization with a pitch obtained by analyzing a narrowband audio signal. In the case of an unvoiced sound, a signal of a required length is cut out from a random position of a waveform element. The power ratio between the synthesized sound and the narrow-band sound synthesized using the signal generated from the narrow-band waveform element by the above processing and the narrow-band spectral parameter is calculated. Then, a synthesized sound is generated from the high-band waveform element using the signal generated by the above processing and the broadband spectral parameters, and a high-frequency restored signal is obtained by multiplying the synthesized sound by the power ratio.

　利用分野が異なるが、音源信号の帯域を広げる別の方法として、文献２「 A 2.4Kbps High−Qaulity Speech Coder 」IEEE International Conference on Acoustics, Speech, and Signal Processing vol.1, S9.5, pp. 589-592 (1991.5) に開示されているものがある。 Although the field of use is different, as another method for expanding the band of the sound source signal, reference 2 `` A 2.4 Kbps High-Qaulity Speech Coder '' IEEE International Conference on Acoustics, Speech, and Signal Processing vol.1, S9.5, pp. 589-592 (1991.5).

　文献２は、電話帯域音声を高能率に符号化し、復号化する方式に関するもので、符号化する際の音源の情報量を削減するために、０Ｈｚから３．４ＫＨｚの音源信号を長周期予測分析し、長周期予測係数と長周期予測残差信号に分離する。０Ｈｚから３．４ＫＨｚの長周期予測残差信号を０Ｈｚから１ＫＨｚに帯域制限して符号化を行う。そして、復号化する際に帯域制限された長周期予測残差信号から３．４ＫＨｚまでの電話帯域の長周期予測残差信号を生成した後、長周期合成処理を行って音源信号を復元するものである。長周期予測残差信号の復元は、０Ｈｚから１ＫＨｚの成分を持つ信号を８ＫＨｚのサンプリング周波数にアップサンプリングした後、４サンプル間隔で残し、それ以外を零にすることで行っている。
特開平０６−１１８９９５号公報特開昭６３−０３４６００号公報特開平０５−２９７８９８号公報 Reference 2 relates to a method for encoding and decoding telephone band voice with high efficiency. In order to reduce the amount of information of a sound source at the time of encoding, a long-period prediction analysis of a sound source signal from 0 Hz to 3.4 KHz is performed. Then, the signal is separated into a long-period prediction coefficient and a long-period prediction residual signal. Encoding is performed by band-limiting the long-period prediction residual signal from 0 Hz to 3.4 kHz from 0 Hz to 1 kHz. Then, after decoding, a long-period prediction residual signal of the telephone band up to 3.4 KHz is generated from the band-limited long-period prediction residual signal, and a long-period synthesis process is performed to restore the sound source signal. It is. Restoration of a long-period prediction residual signal is performed by up-sampling a signal having a component of 0 Hz to 1 KHz to a sampling frequency of 8 KHz, leaving the signal at intervals of 4 samples, and setting the rest to zero.
JP-A-06-118995 JP-A-63-034600 JP 05-297988 A

　上記の従来法には、以下に述べる課題がある。従来 The above conventional methods have the following problems.

　特開平６−１１８９９５号と、別の文献ではあるが、その具体的実用例を開示している文献１では、大別して次の４つの課題、つまり音源振幅推定、音源生成方法、スペクトルパラメータ推定法、通信系への適用に関する課題がある。 Although it is a different document from Japanese Patent Application Laid-Open No. Hei 6-118995, Document 1 which discloses a concrete practical example thereof is roughly divided into the following four problems, namely, a sound source amplitude estimation, a sound source generation method, and a spectrum parameter estimation method. However, there is a problem regarding application to communication systems.

　まず、第１の音源振幅推定に関して説明する。 First, the first sound source amplitude estimation will be described.

　文献１の第１の音源生成方法を用いる場合、復元音の合成に用いるパワーについては、狭帯域音声を分析して得られたパワー値をそのまま、もしくは定数倍して用いているが、狭帯域のスペクトルパラメータと推定された広帯域のスペクトルパラメータでは合成フィルタの利得が異なるので、同一の音源振幅を与えても得られる合成音の振幅が異なって来る。この差異がフレーム毎に変化するため、音源振幅、つまりパワー値を定数倍する事では、正しい振幅を持った広帯域音声は復元されない課題がある。 When the first sound source generation method of Document 1 is used, the power value used for synthesizing the reconstructed sound uses the power value obtained by analyzing the narrow-band sound as it is or by multiplying it by a constant. Since the gain of the synthesis filter is different between the spectral parameter of (1) and the estimated broadband spectral parameter, the amplitude of the synthesized sound obtained differs even when the same sound source amplitude is given. Since this difference changes for each frame, there is a problem that broadband speech having a correct amplitude cannot be restored by multiplying the sound source amplitude, that is, the power value by a constant.

　また、文献１の第２の音源生成方法を用いる場合、狭帯域合成音を生成して狭帯域音声とのパワー比を算出して、高域合成音に乗じているが、２つの波形素片に対して複雑な処理を実行する事が必要となる課題がある。 Further, when the second sound source generation method of Document 1 is used, a narrow-band synthesized sound is generated, a power ratio with respect to the narrow-band sound is calculated, and the high-band synthesized sound is multiplied. There is a problem that it is necessary to perform complicated processing for

　つぎに、第２の音源生成方法に関して説明する。 Next, the second sound source generation method will be described.

　文献１の第１の音源生成方法を用いる場合、ピッチとパワーという僅かな情報だけで広帯域音源信号の生成を行うので、様々に変化する本来の広帯域音源を十分に推定する事はできない。この結果、ｃｏｓｉｎｅ関数によってパルス的な音の軽減を行っているが、完全にパルス的な音の抑圧はできず、音質が不自然となる課題がある。また、話者毎に大きく性質が異なる有声音源を１つの固定音源で表現する事に無理があるため、話者によって音質が劣化する課題がある。場合 When the first sound source generation method of Document 1 is used, a wide band sound source signal is generated using only a small amount of information, such as pitch and power, and thus it is not possible to sufficiently estimate an original wide band sound source that changes in various ways. As a result, although the pulse-like sound is reduced by the cosine function, the pulse-like sound cannot be completely suppressed, and there is a problem that the sound quality becomes unnatural. In addition, since it is impossible to express a voiced sound source having different characteristics for each speaker with one fixed sound source, there is a problem that sound quality is degraded by each speaker.

　文献１の第２の音源生成方法を用いる場合、スペクトルパラメータのベクトル量子化結果の符号に対応する代表波形素片を用いているが、本来スペクトルパラメータは声道の形状に依存し、音源波形は声帯の振動の仕方に依存するものであるので、両者の間に強い対応関係は無い。音源波形は、むしろ話者に依存する所が大きい。従って、適切な音源が選択されない課題がある。 When the second sound source generation method of Document 1 is used, a representative waveform segment corresponding to the sign of the vector quantization result of the spectrum parameter is used. However, the spectrum parameter originally depends on the shape of the vocal tract, and the sound source waveform is Since it depends on the manner of vocal cord vibration, there is no strong correspondence between the two. The sound source waveform depends largely on the speaker. Therefore, there is a problem that an appropriate sound source cannot be selected.

　文献１中に記載されている様に、この第２の音源生成法を用いた場合には、有声音であるにもかかわらず無声音の波形素片を選択したり、逆に無声音であるにもかかわらず有声音の波形素片を選択してしまう場合があり、そのまま合成を行うと品質劣化を起こす課題がある。この事を回避するために、その部分でのパワー比を強制的に０としているが、この結果、復元された高域の振幅が部分的に０となってしまい別の品質劣化を起こす課題がある。 As described in Document 1, when the second sound source generation method is used, a waveform unit of an unvoiced sound is selected in spite of being a voiced sound. Regardless, a voiced waveform segment may be selected, and if synthesized as it is, there is a problem that quality degradation occurs. In order to avoid this, the power ratio at that portion is forcibly set to 0. As a result, however, the restored high-frequency amplitude becomes partially 0, which causes another quality deterioration. is there.

　更に、どちらの音源生成法においても、有声無声判定、ピッチ抽出誤りが起こった場合の品質劣化が避けられないという課題がある。特に、雑音が重畳した狭帯域音声信号に対して適用した場合に、判定誤り、抽出誤りが増大し、大きな劣化が起こる課題がある。 Furthermore, in either of the sound source generation methods, there is a problem that quality degradation due to voiced / unvoiced determination and pitch extraction error is inevitable. In particular, when applied to a narrow-band audio signal on which noise is superimposed, there is a problem that determination errors and extraction errors increase, and large degradation occurs.

　また、有声音と無声音の２つのモードしかないため、中間的な性質を持つ音源が十分表現できず、有声音と無声音の境界部分において品質劣化が起こる課題がある。 Also, since there are only two modes, voiced and unvoiced, there is a problem that sound sources having intermediate properties cannot be sufficiently expressed, and quality degradation occurs at the boundary between voiced and unvoiced.

　つぎに第３のスペクトルパラメータ推定方法に関して説明する。 Next, the third spectral parameter estimation method will be described.

　特開平６−１１８９９５号と文献１では、２つの符号帳を利用したベクトル量子化と逆量子化を行っているが、符号帳を蓄積しておくメモリが必要である事、量子化処理のための多くの演算量が必要である事が課題である。 In Japanese Patent Application Laid-Open No. Hei 6-118995 and Reference 1, vector quantization and inverse quantization using two codebooks are performed, but a memory for storing codebooks is required, The problem is that a large amount of computation is required.

　また、雑音、無声音、有声音の区別はパワーによってしやすく、かつそれらの区別によって狭帯域のスペクトルパラメータと広帯域のスペクトルパラメータの対応関係は変化する。しかしながら、何れの場合も、スペクトルパラメータとパワーを独立に扱っているので、広帯域のスペクトルパラメータの推定にパワーに関する情報が反映されていない。このため、狭帯域のスペクトルの形状が類似していれば、パワーの大小に関係なく、同様な広帯域スペクトルが推定されてしまう課題がある。雑音 Also, noise, unvoiced sound, and voiced sound can be easily distinguished by power, and the correspondence between the narrow-band spectral parameter and the wide-band spectral parameter changes depending on the distinction. However, in each case, since the spectrum parameter and the power are treated independently, information on the power is not reflected in the estimation of the spectrum parameter in a wide band. For this reason, if the shapes of the narrow-band spectra are similar, there is a problem that a similar wide-band spectrum is estimated regardless of the magnitude of the power.

　最後に第４の通信系への適用に関して説明する。 Lastly, application to the fourth communication system will be described.

　特開平６−１１８９９５号と文献１の方法を通信系へ適用する場合、受信した音声符号から狭帯域合成音を復号した後、この狭帯域合成音を再分析して広帯域音声信号を復元する事となるが、スペクトルパラメータと音源情報が分離・符号化されて伝送されてくる場合には、その音声符号を直接利用して広帯域音声信号を復元する方が効率的と考えられる。つまり、特開平６−１１８９９５号と文献１の方法は再分析が必要である点で非効率である課題がある。また、合成と再分析を行って得られるパラメータには、合成時の補間や分析時の窓掛等による歪が重畳しており、広帯域音声の品質劣化もある。 When applying the method of Japanese Patent Application Laid-Open No. 6-118995 and Reference 1 to a communication system, it is necessary to decode a narrow-band synthesized sound from a received speech code and then re-analyze the narrow-band synthesized sound to restore a wide-band speech signal. However, when the spectrum parameters and the sound source information are transmitted after being separated / encoded, it is considered more efficient to directly use the speech code to restore the wideband speech signal. In other words, the method disclosed in Japanese Patent Application Laid-Open No. 6-118995 and Reference 1 is inefficient in that reanalysis is required. Also, distortions due to interpolation at the time of synthesis and windowing at the time of analysis are superimposed on parameters obtained by performing the synthesis and re-analysis, and there is a deterioration in quality of wideband speech.

　この他、特開平６−１１８９９５号と文献１では、一般に合成音の雑音感の低減や了解性の改善のために導入される信号加工処理を付加していないため、復元された広帯域音声信号の音質が不足する場合にその改善をする事ができない課題がある。 In addition, in Japanese Patent Application Laid-Open No. Hei 6-118995 and Reference 1, since a signal processing process which is generally introduced to reduce noise in synthesized sounds and improve intelligibility is not added, the restored wideband audio signal There is a problem that cannot be improved when the sound quality is insufficient.

　また、通信系へ適用する場合、狭帯域合成音に対して信号加工処理が適用されることがあり、加工された狭帯域音声信号と加工されていない広帯域音声信号を重畳するために、両者の音質の連続性が悪くなる課題がある。 In addition, when applied to a communication system, signal processing may be applied to a narrow-band synthesized sound, and in order to superimpose a processed narrow-band audio signal and an unprocessed wide-band audio signal, both of the signals are processed. There is a problem that the continuity of sound quality deteriorates.

　文献２の方法では、０Ｈｚから１ＫＨｚを狭帯域、０Ｈｚから３．４ＫＨｚを広帯域と考えれば、広帯域の音源信号推定を行っていることになるが、前記した通りこの方式は広帯域の音声信号を入力とし、これを分析して得たパラメータを符号化し、復号化して広帯域合成音を得るものであり、狭帯域の音声信号、または狭帯域の音声信号から抽出されたパラメータから広帯域の音声信号を復元する方法を開示したものではない。 In the method of Document 2, if the frequency band from 0 Hz to 1 KHz is considered to be a narrow band and the frequency band from 0 Hz to 3.4 KHz is considered to be a wide band, a wideband sound source signal estimation is performed. The parameters obtained by analyzing this are encoded and decoded to obtain a wideband synthesized sound, and a wideband speech signal is restored from a narrowband speech signal or a parameter extracted from a narrowband speech signal. It does not disclose a method for doing so.

　以下に述べる実施例は、かかる課題を解決するためになされたものであり、狭帯域音声からより正しい振幅を持った広帯域音声信号を復元する広帯域音声復元装置を実現する事を目的としている。 The embodiment described below has been made to solve such a problem, and has as its object to realize a wideband audio restoration apparatus for restoring a wideband audio signal having a more correct amplitude from narrowband audio.

　また、比較的簡単な処理の広帯域音源振幅の推定処理を持った広帯域音声復元装置を実現する事を目的としている。 It is another object of the present invention to realize a wideband sound restoration apparatus having a relatively simple process of estimating a wideband sound source amplitude.

　更に、話者に依存性が少なく、有声無声境界付近でも良好な広帯域音源を推定し、安定で自然な音質の広帯域音声を復元する広帯域音声復元装置を実現する事を目的としている。 Further, it aims at realizing a wideband speech restoration apparatus which has little dependence on a speaker, estimates a good wideband sound source even near a voiced / unvoiced boundary, and restores a wideband speech with stable and natural sound quality.

　また、雑音が重畳した狭帯域音声信号に対して起こりがちな有声無声判定誤りやピッチ抽出誤りの影響の少ない広帯域音声復元装置を実現する事を目的としている。 It is another object of the present invention to realize a wideband speech restoration apparatus which is less affected by voiced unvoiced decision errors and pitch extraction errors which are likely to occur in a narrowband speech signal on which noise is superimposed.

　更に、通信系へ適用した場合に、再分析を行わずに効率良く広帯域音声の復元を行う広帯域音声復元装置を実現する事を目的としている。 Furthermore, it is an object of the present invention to realize a wideband speech restoration apparatus that efficiently restores wideband speech without performing reanalysis when applied to a communication system.

　更に、復元された広帯域音声信号の音質が不足する場合にその改善を可能とし、狭帯域合成音に対して信号加工処理が適用される場合に、加工された狭帯域連続性が良い広帯域音声信号が得られる広帯域音声復元装置を実現する事を目的としている。 Furthermore, when the sound quality of the reconstructed wideband audio signal is insufficient, it can be improved, and when the signal processing is applied to the narrowband synthesized sound, the processed wideband audio signal having good narrowband continuity. The purpose of the present invention is to realize a wideband audio restoration apparatus that can obtain the above.

　この発明に係る広帯域音声復元装置は、狭帯域スペクトル符号から狭帯域スペクトルパラメータを復号し、この復号した狭帯域スペクトルパラメータのスペクトル包絡を周波数軸方向へ引き伸ばして、広帯域スペクトルパラメータとして出力するスペクトル復号手段と、
　この出力された広帯域スペクトルパラメータを用いて広帯域音声信号を生成する合成手段を備えたことを特徴とする。 A broadband speech restoration apparatus according to the present invention decodes a narrowband spectral parameter from a narrowband spectral code, extends a spectrum envelope of the decoded narrowband spectral parameter in the frequency axis direction, and outputs the result as a wideband spectral parameter. When,
It is characterized by comprising a synthesizing means for generating a wideband speech signal using the outputted wideband spectrum parameter.

　実施例１．
　本発明の一実施例を図に基づいて説明する。 Embodiment 1 FIG.
An embodiment of the present invention will be described with reference to the drawings.

　本実施例は、主として広帯域音源信号の生成をより正しい形で復元する構成と動作を説明するものである。 This embodiment mainly describes a configuration and an operation for restoring the generation of a broadband sound source signal in a more correct manner.

　図１は本発明の実施例１の広帯域音声復元装置の構成図である。図において、１は入力の狭帯域音声信号、２は分析手段、３はスペクトル分析手段、４は狭帯域スペクトルパラメータ、５は逆フィルタ、６は狭帯域音源信号、７は広帯域スペクトル推定手段、８はベクトル量子化手段、９は狭帯域スペクトル符号帳、１０はスペクトル符号、１１は逆量子化手段、１２は広帯域スペクトル符号帳、１３は広帯域スペクトルパラメータである。１４は本実施例での重要な新規構成要素である広帯域音源推定手段、１５はその具体例としての零詰手段、１６は広帯域音源信号、１７は合成手段としての合成フィルタ、１８は帯域フィルタ、１９はアップサンプリング手段、２０は広帯域音声信号である。 FIG. 1 is a configuration diagram of a wideband audio restoration apparatus according to Embodiment 1 of the present invention. In the figure, 1 is an input narrowband speech signal, 2 is an analyzing means, 3 is a spectrum analyzing means, 4 is a narrowband spectral parameter, 5 is an inverse filter, 6 is a narrowband sound source signal, 7 is a wideband spectral estimating means, 8 Is a vector quantization means, 9 is a narrowband spectrum codebook, 10 is a spectrum code, 11 is an inverse quantization means, 12 is a wideband spectrum codebook, and 13 is a wideband spectrum parameter. 14 is a broadband sound source estimating means which is an important new component in the present embodiment, 15 is a zero filling means as a specific example thereof, 16 is a wideband sound source signal, 17 is a synthesizing filter as a synthesizing means, 18 is a bandpass filter, 19 is an upsampling means, and 20 is a wideband audio signal.

　また、図２は、零詰手段１５の処理を説明する信号説明図である。 FIG. 2 is a signal explanatory diagram for explaining the process of the zero filling means 15.

　以下、図１と図２を用いて本発明の実施例１の動作について説明する。 The operation of the first embodiment of the present invention will be described below with reference to FIGS.

　まず、例えば８ＫＨｚでサンプリングされ、３００Ｈｚから３．４ＫＨｚの電話帯域に制限された狭帯域音声信号１が分析手段２とアップサンプリング手段１９に入力される。分析手段２内のスペクトル分析手段３は、狭帯域音声信号１を分析して狭帯域スペクトルパラメータ４を算出し、分析手段２内の逆フィルタ５と広帯域スペクトル推定手段７内に出力する。なお、狭帯域スペクトルパラメータ４としては、線形予測係数、ＬＳＰ、ＰＡＲＣＯＲ係数、ケプストラム等様々なものが適用可能である。逆フィルタ５は、狭帯域スペクトルパラメータ４を用いて狭帯域音声信号１を逆フィルタリングし、得られた狭帯域音源信号６を広帯域音源推定手段１４内に出力する。 First, a narrow-band audio signal 1 sampled at, for example, 8 KHz and limited to a telephone band of 300 Hz to 3.4 KHz is input to the analysis unit 2 and the up-sampling unit 19. The spectrum analysis means 3 in the analysis means 2 analyzes the narrowband speech signal 1 to calculate a narrowband spectrum parameter 4 and outputs the same to the inverse filter 5 and the wideband spectrum estimation means 7 in the analysis means 2. As the narrow-band spectrum parameter 4, various parameters such as a linear prediction coefficient, an LSP, a PARCOR coefficient, and a cepstrum can be applied. The inverse filter 5 inversely filters the narrowband audio signal 1 using the narrowband spectral parameter 4 and outputs the obtained narrowband sound source signal 6 to the wideband sound source estimating means 14.

　広帯域スペクトル推定手段７内のベクトル量子化手段８は、狭帯域スペクトル符号帳９を用いて前記狭帯域スペクトルパラメータ４をベクトル量子化し、得られたスペクトル符号１０を広帯域スペクトル推定手段７内の逆量子化手段１１に出力する。逆量子化手段１１は、広帯域スペクトル符号帳１２を用いてスペクトル符号１０を逆量子化し、得られた広帯域スペクトルパラメータ１３を合成フィルタ１７に出力する。 The vector quantization means 8 in the wideband spectrum estimating means 7 performs vector quantization of the narrowband spectral parameters 4 using the narrowband spectral codebook 9 and converts the obtained spectrum code 10 into the inverse quantization in the wideband spectrum estimating means 7. Output to the converting means 11. The inverse quantization means 11 inversely quantizes the spectrum code 10 using the wideband spectrum codebook 12 and outputs the obtained wideband spectrum parameters 13 to the synthesis filter 17.

　なお、この広帯域スペクトル推定手段７内の処理は、文献１と同様であり、狭帯域スペクトル符号帳９と広帯域スペクトル符号帳１２の生成法や、ベクトル量子化の方法に関する詳細な説明を省略する。 The processing in the wideband spectrum estimating means 7 is the same as that in the literature 1, and detailed description on the method of generating the narrowband spectral codebook 9 and the wideband spectral codebook 12 and the method of vector quantization will be omitted.

　本実施例の重要部分である広帯域音源推定手段１４内の零詰手段１５は、狭帯域音源信号６の各サンプル値間にＭ−１サンプルずつ零を挿入し、得られたＭ倍のサンプル数の信号を広帯域音源信号１６として合成フィルタ１７に出力する。ここで、Ｍは、復元する広帯域音声信号のサンプリング周波数を、狭帯域音声信号のサンプリング周波数で除した値であり、この実施例では、Ｍが２の場合について説明する。図２（ａ）は、Ｎサンプルの狭帯域音源信号６である。この信号に対して、零詰手段１５による零詰め処理を行うと、Ｍ−１サンプル、つまり１サンプルずつの零が各サンプル間に挿入されて、図２（ｂ）に示す２Ｎサンプルの広帯域音源信号１６が得られる。Ｍが２の零詰め処理を行うと、広帯域音声信号のサンプリング周波数の半分の周波数、つまり４ＫＨｚを中心にして、０Ｈｚから４ＫＨｚと対称のスペクトルが４ＫＨｚから８ＫＨｚに復元される。 The zero-filling means 15 in the wide-band sound source estimating means 14, which is an important part of the present embodiment, inserts M-1 samples between each sample value of the narrow-band sound source signal 6, and obtains M times the number of samples obtained. Is output to the synthesis filter 17 as the broadband sound source signal 16. Here, M is a value obtained by dividing the sampling frequency of the wideband audio signal to be restored by the sampling frequency of the narrowband audio signal. In this embodiment, the case where M is 2 will be described. FIG. 2A shows a narrowband excitation signal 6 of N samples. When this signal is subjected to zero padding processing by the zero padding means 15, M-1 samples, that is, zeros for each sample are inserted between each sample, and a 2N sample broadband sound source shown in FIG. A signal 16 is obtained. When the zero padding process is performed with M being 2, a spectrum symmetrical from 0 Hz to 4 KHz with half the frequency of the sampling frequency of the wideband audio signal, that is, 4 KHz, is restored from 4 KHz to 8 KHz.

　合成フィルタ１７は、広帯域スペクトルパラメータ１３を用いて広帯域音源信号１６に合成フィルタ処理を行い仮の広帯域音声信号を生成する。帯域フィルタ１８は、この仮の広帯域音声信号に対して、帯域通過フィルタ処理を行い、狭帯域音声の成分の存在する帯域以外の成分を抽出する。広帯域音声信号の帯域が０Ｈｚから７．３ＫＨｚの場合、０Ｈｚから３００Ｈｚと３．４ＫＨｚから７．３ＫＨｚの成分が抽出される。 The synthesis filter 17 performs synthesis filter processing on the broadband sound source signal 16 using the wideband spectral parameters 13 to generate a temporary wideband audio signal. The band-pass filter 18 performs band-pass filtering on the provisional wide-band audio signal, and extracts components other than the band in which the narrow-band audio component exists. When the band of the wideband audio signal is from 0 Hz to 7.3 kHz, components of 0 Hz to 300 Hz and components of 3.4 kHz to 7.3 kHz are extracted.

　アップサンプリング手段１９は、狭帯域音声信号１をＭ倍にアップサンプリングする。アップサンプリングによって生成される信号は、サンプリング周波数が広帯域音声信号２０と同じで、狭帯域音声信号１と同じ狭帯域成分を持つものである。そして、帯域フィルタ１８の出力とアップサンプリング手段１９の出力を加算して広帯域音声信号２０を生成する。 The up-sampling unit 19 up-samples the narrowband audio signal 1 by M times. The signal generated by the upsampling has the same sampling frequency as the wideband audio signal 20 and has the same narrowband component as the narrowband audio signal 1. Then, the output of the bandpass filter 18 and the output of the upsampling means 19 are added to generate a wideband audio signal 20.

　本来狭帯域音源信号と広帯域音源信号は、同一の発声器官から生成された音源信号の特徴を反映しているので、ピッチ周波数の高調波成分の強さ、高調波成分間の雑音的成分の強さ等の音源信号の特徴において相関がある。つまり、狭帯域音源信号がピッチ周波数の高調波成分が強い規則的な特徴を持っている場合には、広帯域音源信号も同様にピッチ周波数の高調波成分が強い規則的な特徴を持っているし、逆に狭帯域音源信号が雑音的な成分が強い特徴を持っている場合には、広帯域音源信号も同様に雑音的な成分が強い特徴を持っている。 Since the narrow-band sound source signal and the wide-band sound source signal originally reflect the characteristics of the sound source signal generated from the same vocal organ, the strength of the harmonic component of the pitch frequency and the strength of the noise component between the harmonic components are increased. There is a correlation in the characteristics of the sound source signal, such as In other words, if the narrowband sound source signal has a regular characteristic in which the harmonic component of the pitch frequency is strong, the broadband sound source signal also has a regular characteristic in which the harmonic component of the pitch frequency is strong. Conversely, when the narrowband sound source signal has a characteristic that the noise component is strong, the wideband sound source signal also has the characteristic that the noise component is strong.

　この実施例の様に広帯域音源推定手段を構成する事により、低域の０〜４ＫＨｚの狭帯域音源信号と同様の特徴を持つ０〜８ＫＨｚの広帯域音源信号を生成する事ができるので、話者に依存性が少なく、安定で自然な音質の広帯域音声を復元することができる効果がある。 By configuring the wideband sound source estimating means as in this embodiment, it is possible to generate a wideband sound source signal of 0 to 8 kHz having the same characteristics as the narrow band sound source signal of 0 to 4 kHz in the low frequency range. There is an effect that a wideband sound with stable and natural sound quality can be restored with little dependency on the sound.

　また、従来例のように有声無声判定やピッチ抽出が必要なく、本構成により自ずと中間的な性質の音源も表現できるので、雑音が重畳した狭帯域音声信号に対して起こりがちな有声無声判定誤りやピッチ抽出誤りの影響がなく、有声無声境界付近でも良好な広帯域音源を推定することができ、安定で自然な音質の広帯域音声を復元することができる効果がある。 Also, unlike the conventional example, voiced unvoiced judgment and pitch extraction are not required, and a sound source having an intermediate characteristic can be naturally expressed by this configuration. It is possible to estimate a good wideband sound source even near a voiced / unvoiced boundary without the influence of a pitch extraction error and a voiced unvoiced boundary.

　実施例２．
　図３は本発明の実施例２の広帯域音声復元装置における音源推定手段１４の構成図である。図において新規な部分は、２１の音源分析手段、２２の狭帯域適応符号帳、２３の歪最小化手段、２４の狭帯域駆動音源信号、２５の狭帯域適応ラグ長、２６の狭帯域適応ゲイン、２７の広帯域駆動音源推定手段、２８の零詰手段、２９の広帯域駆動音源信号、３０の広帯域適応音源推定手段、３１の広帯域適応音源符号帳、３２の広帯域適応音源信号、３３の広帯域適応ラグ長、３４の広帯域適応ゲインである。全体構成は、図１と同じであるので、構成の記載と図３以外の部分の動作の説明を省略する。 Embodiment 2. FIG.
FIG. 3 is a configuration diagram of the sound source estimating means 14 in the wideband audio restoration apparatus according to the second embodiment of the present invention. In the figure, the new parts are 21 sound source analysis means, 22 narrow band adaptive codebooks, 23 distortion minimizing means, 24 narrow band drive excitation signals, 25 narrow band adaptive lag lengths, 26 narrow band adaptive gains. , 27 broadband driving excitation estimating means, 28 zero filling means, 29 wideband driving excitation signal, 30 wideband adaptive excitation estimating means, 31 wideband adaptive excitation codebook, 32 wideband adaptive excitation signal, 33 wideband adaptive lag Long, a wideband adaptive gain of 34. Since the entire configuration is the same as in FIG. 1, the description of the configuration and the operation of the parts other than FIG. 3 will be omitted.

　本構成によれば、広帯域音源信号が更によりよく復元できる。 According to this configuration, the broadband sound source signal can be restored even better.

　以下、図３を用いて本発明の一実施例の動作について説明する。 Hereinafter, the operation of the embodiment of the present invention will be described with reference to FIG.

　狭帯域音源信号６が広帯域音源推定手段１４内の音源分析手段２１に入力される。音源分析手段２１内の狭帯域適応符号帳２２には、過去の狭帯域音源信号６が記憶されており、後述する歪最小化手段２３が順次出力するラグ長に従って、ラグ長が整数値である場合には記憶してある過去の狭帯域音源信号６をこのラグ長で繰り返して得られる信号を出力する。ラグ長が非整数値である場合には、文献３「 Pitch Predictors ｗith High Temporal Resolution 」IEEE International Conference on Acoustics, Speech, and Signal Processing vol.2, S12.6, pp.661-664 (1990.4) に記載されているようにポリフェイズフィルタ出力により信号を生成し、出力する。出力する信号の長さは、現在の狭帯域音源信号６と同じ長さである。 The narrow band sound source signal 6 is input to the sound source analyzing means 21 in the wide band sound source estimating means 14. The past narrowband excitation signal 6 is stored in the narrowband adaptive codebook 22 in the excitation analysis unit 21, and the lag length is an integer according to the lag length sequentially output by the distortion minimization unit 23 described later. In this case, a signal obtained by repeating the stored past narrowband sound source signal 6 with this lag length is output. If the lag length is a non-integer value, see Reference 3 “Pitch Predictors with High Temporal Resolution” IEEE International Conference on Acoustics, Speech, and Signal Processing vol.2, S12.6, pp.661-664 (1990.4) Generate and output a signal with the polyphase filter output as described. The length of the signal to be output is the same as that of the current narrowband sound source signal 6.

　図４に、狭帯域適応符号帳２２内に記憶されている過去の狭帯域音源信号６と、入力されたラグ長に従って出力される信号の例を示す。 FIG. 4 shows an example of the past narrowband excitation signal 6 stored in the narrowband adaptive codebook 22 and a signal output according to the input lag length.

　図において、横軸は時間で矢印方向に時間が経過することを示す。（Ａ１），（Ｂ１）は従って音源信号の時間的な長さを示し、（Ａ２），（Ｂ２）は２０〜１２８等、出力される時間に対して正規化されたラグ長を示し、（Ａ３），（Ｂ３）は出力される音源信号の例を示す。 In the figure, the horizontal axis indicates time and elapse of time in the direction of the arrow. (A1) and (B1) thus indicate the temporal length of the sound source signal, (A2) and (B2) indicate the lag length normalized to the output time, such as 20 to 128, A3) and (B3) show examples of output sound source signals.

　図４（ａ）は出力信号の長さがラグ長より短い場合を示し、その場合にはラグ長の最初から出力信号時間Ｔ１の長さの音源信号（Ａ３）を過去の音源信号に引続いて出力する。ラグ長が出力する信号の長さよりもＴ２のように短い時には、図４（ｂ）に示す様に複数回同じ音源信号（Ｂ３）を繰り返して過去の音源信号に続いて出力する。 FIG. 4A shows a case where the length of the output signal is shorter than the lag length. In this case, the sound source signal (A3) having the length of the output signal time T1 from the beginning of the lag length follows the past sound source signal. Output. When the lag length is shorter than the length of the output signal, such as T2, the same sound source signal (B3) is repeated a plurality of times as shown in FIG.

　歪最小化手段２３は、前記狭帯域適応符号帳２２に対して複数のラグ長の値を順次出力し、各ラグ長に対して狭帯域適応符号帳２２が出力した信号にゲインを乗じた信号と狭帯域音源信号６との歪が最小になるようにそのゲインを決定していく。そして、全てのラグ長に中で歪を最小にするものを選択し、狭帯域適応ラグ長２５として広帯域適応音源推定手段３０に出力する。また、その時のゲインの値を狭帯域適応ゲイン２６として広帯域適応音源推定手段３０に出力し、狭帯域適応符号帳２２が出力した信号に狭帯域適応ゲイン２６を乗じた信号と狭帯域音源信号６の誤差信号を狭帯域駆動音源信号２４として広帯域駆動音源推定手段２７に出力する。なお、歪最小化手段２３内でのゲインの決定方法としては、一般に知られているラグランジュの未定係数法を用いる事ができる。 The distortion minimizing means 23 sequentially outputs a plurality of lag length values to the narrowband adaptive codebook 22, and multiplies a signal output from the narrowband adaptive codebook 22 for each lag length by a gain. The gain is determined so that the distortion between the signal and the narrow-band sound source signal 6 is minimized. Then, a lag length that minimizes distortion among all lag lengths is selected and output to the wideband adaptive sound source estimating means 30 as a narrowband adaptive lag length 25. Further, the gain value at that time is output to the wideband adaptive excitation estimating means 30 as a narrowband adaptive gain 26, and a signal obtained by multiplying the signal output from the narrowband adaptive codebook 22 by the narrowband adaptive gain 26 and the narrowband excitation signal 6 Is output to the wide band drive sound source estimating means 27 as the narrow band drive sound source signal 24. As a method of determining the gain in the distortion minimizing means 23, a generally known Lagrange's undetermined coefficient method can be used.

　即ち歪最小化手段２３は、狭帯域音源信号６と狭帯域適応符号帳２２出力を入力とし、狭帯域適応音源符号である歪最小のラグ長２５とゲイン２６と、誤差信号の狭帯域駆動音源信号２４を出力する。 That is, the distortion minimizing means 23 receives the narrow-band excitation signal 6 and the output of the narrow-band adaptive codebook 22 as inputs, and provides a minimum distortion lag length 25 and a gain 26, which are narrow-band adaptive excitation codes, and a narrow-band driving excitation of the error signal. The signal 24 is output.

　広帯域駆動音源推定手段２７内の零詰手段２８は、狭帯域駆動音源信号２４の各サンプル値間にＭ−１サンプルずつ零を挿入し、得られたＭ倍のサンプル数の信号を広帯域駆動音源信号２９として出力する。ここで、Ｍは、復元する広帯域音声信号のサンプリング周波数を、狭帯域音声信号のサンプリング周波数で除した値であり、零を挿入する動作は前記零詰手段１５と同じである。 The zero-filling means 28 in the wide-band driving sound source estimating means 27 inserts M-1 samples of zero between each sample value of the narrow-band driving sound source signal 24, and converts the obtained signal having M times the number of samples into the wide-band driving sound source signal 24. Output as signal 29. Here, M is a value obtained by dividing the sampling frequency of the wideband audio signal to be restored by the sampling frequency of the narrowband audio signal, and the operation of inserting zero is the same as that of the zero-filling means 15.

　広帯域適応音源推定手段３０内では、まず狭帯域適応ラグ長２５をＭ倍して広帯域適応ラグ長３３とし、狭帯域適応ゲイン２６をｇ倍して広帯域適応ゲイン３４とする。ｇを１とすると最終的に得られる広帯域音源信号１６のピッチ周期性が狭帯域音源信号６と同等となり、１から小さくしていくにつれて狭帯域音源信号６に比べてピッチ周期性が弱くなっていく。実際の音声を観察すると、周波数が高い部分ほどピッチ周期性が弱くなっていく場合がおおいので、高域を復元する場合にｇを１より小さい値に設定するとより高品質な広帯域音声が復元できる。 In the wideband adaptive sound source estimating means 30, first, the narrowband adaptive lag length 25 is multiplied by M to obtain a wideband adaptive lag length 33, and the narrowband adaptive gain 26 is multiplied by g to obtain a wideband adaptive gain 34. Assuming that g is 1, the pitch periodicity of the finally obtained broadband excitation signal 16 is equivalent to that of the narrowband excitation signal 6, and the pitch periodicity becomes weaker as compared with the narrowband excitation signal 6 as it is reduced from 1. Go. When observing the actual sound, the pitch periodicity is likely to be weaker as the frequency becomes higher. Therefore, when restoring the high frequency, setting g to a value smaller than 1 enables a higher-quality wideband sound to be restored. .

　広帯域適応音源推定手段３０内の広帯域適応音源符号帳３１には、過去の広帯域音源信号１６が記憶されており、この信号を前記広帯域適応ラグ長３３で繰り返して得られる信号を出力する。そして広帯域適応音源推定手段３０内でこの信号を前記広帯域適応ゲイン３４で乗算して、広帯域適応音源信号３２として出力する。 The wideband adaptive excitation codebook 31 in the wideband adaptive excitation estimation means 30 stores the past wideband excitation signal 16, and outputs a signal obtained by repeating this signal with the wideband adaptive lag length 33. The signal is multiplied by the wideband adaptive gain 34 in the wideband adaptive sound source estimating means 30 and output as a wideband adaptive sound source signal 32.

　最後に広帯域駆動音源信号２９と広帯域適応音源信号３２を加算して、広帯域音源信号１６として出力する。 (4) Finally, the broadband drive excitation signal 29 and the broadband adaptive excitation signal 32 are added and output as the broadband excitation signal 16.

　この様に構成する事により、狭帯域音源信号の持つピッチ周期性の強さや変動に関する特徴が、狭帯域適応ラグ長２５と狭帯域適応ゲイン２６によって良好に表現され、広帯域音源信号に反映されるので、様々に変化する音源を十分に推定でき、パルス的な音もなく、良好な音質の広帯域音声を復元することができる効果がある。また、話者によらずに適切な音源が推定できる効果がある。 With such a configuration, the characteristics related to the strength and fluctuation of the pitch periodicity of the narrow-band excitation signal are well represented by the narrow-band adaptive lag length 25 and the narrow-band adaptive gain 26, and are reflected in the wide-band excitation signal. Therefore, there is an effect that it is possible to sufficiently estimate a sound source that changes in various ways and to restore a broadband sound having good sound quality without pulse-like sound. Further, there is an effect that an appropriate sound source can be estimated without depending on a speaker.

　広帯域適応音源信号３２において、広帯域適応ラグ長３３によって決まる基本周波数とその高調波成分の周波数が、正しく整数倍の位置に並ぶので、最終的に復元される広帯域音声信号２０での狭帯域成分と復元広帯域成分のつながりが良く、高品質な広帯域音声を復元できる効果がある。 In the wideband adaptive excitation signal 32, the fundamental frequency determined by the wideband adaptive lag length 33 and the frequency of its harmonic component are correctly aligned at integer multiple positions, so that the narrowband component in the finally recovered wideband audio signal 20 There is an effect that the connection between the restored broadband components is good and high-quality wideband speech can be restored.

　更に、周波数が高くなるにつれてピッチ周期性が弱くなっていく特徴を係数ｇによって導入する事ができるので、より自然な音質が得られる効果がある。 Furthermore, since a feature that the pitch periodicity becomes weaker as the frequency becomes higher can be introduced by the coefficient g, a more natural sound quality can be obtained.

　また、有声無声判定やピッチ抽出が必要なく、中間的な性質の音源も表現できるので、雑音が重畳した狭帯域音声信号に対して起こりがちな有声無声判定誤りやピッチ抽出誤りの影響がなく、有声無声境界付近でも良好な広帯域音源を推定することができ、安定で自然な音質の広帯域音声を復元することができる効果がある。 In addition, since voiced unvoiced judgment and pitch extraction are not required and a sound source of an intermediate property can be expressed, there is no influence of voiced unvoiced judgment error or pitch extraction error which is likely to occur on a narrowband audio signal on which noise is superimposed, A good wideband sound source can be estimated even near a voiced / unvoiced boundary, and there is an effect that a wideband sound with stable and natural sound quality can be restored.

　実施例３．
　図５は本発明の実施例３の広帯域音声復元装置における広帯域駆動音源推定手段２７の構成図である。図において新規な部分は、３５のパワー算出手段、３６の雑音生成手段である。その他の構成は図１および図３と同じであるので、対応部分の動作の説明を省略する。 Embodiment 3 FIG.
FIG. 5 is a block diagram of the wideband driving sound source estimating means 27 in the wideband audio restoration apparatus according to the third embodiment of the present invention. In the figure, new parts are a power calculating means 35 and a noise generating means 36. Other configurations are the same as those in FIG. 1 and FIG.

　以下、図５を用いて本発明の実施例３の図に示された部分の動作について説明する。 Hereinafter, the operation of the portion shown in the drawing of the third embodiment of the present invention will be described with reference to FIG.

　狭帯域駆動音源信号２４が広帯域駆動音源推定手段２７内のパワー算出手段３５に入力される。パワー算出手段３５は狭帯域駆動音源信号２４のパワーを算出し、出力する。雑音生成手段３６は、パワー正規化された白色雑音信号を生成し出力する。そして、広帯域駆動音源推定手段２７内で、前記白色雑音信号にパワー算出手段３５が出力したパワーを乗じ、得られた信号を広帯域駆動音源信号２９として出力する。 The narrow-band driving sound source signal 24 is input to the power calculating means 35 in the wide-band driving sound source estimating means 27. The power calculator 35 calculates and outputs the power of the narrow-band drive sound source signal 24. The noise generator 36 generates and outputs a power-normalized white noise signal. Then, the white noise signal is multiplied by the power output from the power calculator 35 in the broadband drive sound source estimator 27, and the obtained signal is output as the broadband drive sound source signal 29.

　ピッチ周期や周期性の強さは時々刻々変化している。狭帯域音源信号６におけるピッチ周期や周期性の強さの細かい変動分は狭帯域適応ラグ長２５と狭帯域適応ゲイン２６では表現できないため、その誤差が狭帯域駆動音源信号２４に含まれている。実施例２のようにこの誤差成分を含む狭帯域駆動音源信号２４を用いて広帯域駆動音源信号２９を生成すると、広帯域駆動音源信号２９に不必要な乱れが生じてしまう事があり、パワーが同じ白色雑音を生成して広帯域駆動音源信号２９として用いた方が良好な復元音が得られる場合がある事を実験的に確認している。 The pitch period and the strength of the periodicity change every moment. Since the fine fluctuation of the pitch period and the intensity of the periodicity in the narrow band excitation signal 6 cannot be expressed by the narrow band adaptive lag length 25 and the narrow band adaptive gain 26, the error is included in the narrow band driving excitation signal 24. . When the wide-band drive excitation signal 29 is generated using the narrow-band drive excitation signal 24 including this error component as in the second embodiment, unnecessary disturbance may occur in the wide-band drive excitation signal 29, and the power may be the same. It has been experimentally confirmed that a better restored sound may be obtained when white noise is generated and used as the broadband driving sound source signal 29.

　実施例３の様に構成する事により、狭帯域駆動音源信号２４とパワーが同じ白色雑音を生成して広帯域駆動音源信号２９として用いているので、実施例２が持つ効果に加えて、ピッチ周期や周期性の強さの変動分による乱れの少ない良好な復元音が得られる効果がある。 With the configuration as in the third embodiment, the white noise having the same power as that of the narrow-band drive excitation signal 24 is generated and used as the broad-band drive excitation signal 29. In addition to the effects of the second embodiment, the pitch period There is an effect that a good restored sound with little disturbance due to fluctuations in the intensity of the periodicity is obtained.

　また、零詰め処理を行うと４ＫＨｚを中心に対称なスペクトルが生成される。従って、この０Ｈｚから３００Ｈｚと３．４ＫＨｚから４．０ＫＨｚの成分がない狭帯域駆動音源信号２４に対して零詰めを行うと、０Ｈｚから３００Ｈｚ、３．４ＫＨｚから４．６ＫＨｚ、７．７ＫＨｚから８ＫＨｚの成分がない信号が得られてしまう。これに対し、白色雑音を用いるこの構成では、０Ｈｚから８ＫＨｚまで全ての成分を持つ広帯域駆動音源信号２９が得られるので、全域にわたって帯域がある良好な復元音が得られる効果がある。特に０Ｈｚから３００Ｈｚの復元を行う場合には効果が大きい。と Further, when the zero padding process is performed, a spectrum symmetric about 4 KHz is generated. Therefore, if zero-padding is performed on the narrow-band drive sound source signal 24 having no components of 0 Hz to 300 Hz and 3.4 kHz to 4.0 kHz, 0 Hz to 300 Hz, 3.4 kHz to 4.6 kHz, and 7.7 kHz to 8 kHz. A signal having no component is obtained. On the other hand, in this configuration using white noise, the broadband driving sound source signal 29 having all components from 0 Hz to 8 KHz is obtained, so that there is an effect that a good restored sound having a band over the entire region can be obtained. In particular, the effect is great when restoring from 0 Hz to 300 Hz.

　実施例４．
　図６は本発明の実施例４の広帯域音声復元装置における広帯域駆動音源推定手段２７の構成図である。図において、２８の零詰手段、３５のパワー算出手段、３６の雑音生成手段は実施例２および実施例３のものと同一である。その他の構成は図１および図３と同じであるので、図示以外の部分の動作の説明を省略する。 Embodiment 4. FIG.
FIG. 6 is a configuration diagram of the wideband driving sound source estimating means 27 in the wideband audio restoration apparatus according to the fourth embodiment of the present invention. In the figure, 28 zero-filling means, 35 power calculating means, and 36 noise generating means are the same as those of the second and third embodiments. Other configurations are the same as those in FIG. 1 and FIG. 3, and the description of the operation of the parts other than those illustrated is omitted.

　以下、図６を用いて本発明の一実施例の動作について説明する。 Hereinafter, the operation of the embodiment of the present invention will be described with reference to FIG.

　狭帯域駆動音源信号２４が広帯域駆動音源推定手段２７内の零詰手段２８とパワー算出手段３５に入力される。広帯域駆動音源推定手段２７内の零詰手段２８は、狭帯域駆動音源信号２４の各サンプル値間にＭ−１サンプルずつ零を挿入し、得られたＭ倍のサンプル数の信号を出力する。ここで、Ｍは、復元する広帯域音声信号のサンプリング周波数を、狭帯域音声信号のサンプリング周波数で除した値であり、零を挿入する動作は前記零詰手段１５と同じである。 The narrow-band driving sound source signal 24 is input to the zero-filling means 28 and the power calculating means 35 in the wide-band driving sound source estimating means 27. The zero-filling means 28 in the wide-band driving sound source estimating means 27 inserts zeros by M-1 samples between each sample value of the narrow-band driving sound source signal 24, and outputs a signal of M times the obtained number of samples. Here, M is a value obtained by dividing the sampling frequency of the wideband audio signal to be restored by the sampling frequency of the narrowband audio signal, and the operation of inserting zero is the same as that of the zero-filling means 15.

　パワー算出手段３５は狭帯域駆動音源信号２４のパワーを算出し、出力する。雑音生成手段３６は、パワー正規化された白色雑音信号を生成し出力する。そして、零詰手段２８が出力した信号にゲインｇｒ１を乗じた信号と、雑音生成手段３６が出力した白色雑音信号にパワー算出手段３５が出力したパワーを乗じ、さらにゲインｇｒ２を乗じた信号を加算して広帯域駆動音源信号２９として出力する。 The power calculation means 35 calculates and outputs the power of the narrow band drive excitation signal 24. The noise generator 36 generates and outputs a power-normalized white noise signal. Then, a signal obtained by multiplying the signal output by the zero-filling means 28 by the gain gr1 is added to a white noise signal output by the noise generating means 36 by the power output by the power calculating means 35, and further a signal obtained by multiplying the gain gr2. And outputs it as a broadband drive sound source signal 29.

　実施例２および実施例３による復元音が、それぞれ一長一短を有している場合、この様に構成し、ｇｒ１とｇｒ２を適切に設定することで、両者を上回る品質の広帯域音声が復元できる得られる効果がある。なお、実施例２と実施例３と同じ効果も持っている。 When the restored sounds according to the second and third embodiments have respective advantages and disadvantages, by configuring in this way and setting gr1 and gr2 appropriately, it is possible to restore a broadband sound having a quality exceeding both of them. effective. The second embodiment has the same effect as the second and third embodiments.

　実施例５．
　広帯域音源信号の良好な復元が出来る他の構成を説明する。 Embodiment 5 FIG.
Another configuration capable of favorably restoring a broadband sound source signal will be described.

　図７は本発明の実施例５の広帯域音声復元装置における広帯域音源推定手段１４の構成図である。図において新規な部分は、３７の狭帯域長周期予測分析手段、３８の狭帯域長周期遅延、３９の狭帯域長周期予測係数、４０の長周期逆フィルタ、４１の狭帯域長周期予測残差信号、４２の広帯域長周期予測残差推定手段、４３の零詰手段、４４の広帯域長周期予測パラメータ（符号）推定手段、４５の広帯域長周期遅延、４６の広帯域長周期予測係数、４７の長周期合成フィルタ、４８の広帯域長周期予測残差信号である。全体構成は、図１と同じであるので、説明を省略する。 FIG. 7 is a configuration diagram of the wideband sound source estimating means 14 in the wideband speech restoration apparatus according to the fifth embodiment of the present invention. In the figure, a new portion is a narrow-band long-period prediction analysis means 37, a narrow-band long-period delay 38, a narrow-band long-period prediction coefficient 39, a long-period inverse filter 40, and a narrow-band long-period prediction residual 41. Signal, wideband long-period prediction residual estimating means 42, zero-filling means 43, wideband long-period prediction parameter (code) estimating means 44, wideband long-period delay 45, wideband long-period prediction coefficient 46, length 47 This is a wideband long-period prediction residual signal of the period synthesis filter 48. Since the entire configuration is the same as that of FIG. 1, the description is omitted.

　以下、図７を用いて本発明の一実施例の動作について説明する。 Hereinafter, the operation of the embodiment of the present invention will be described with reference to FIG.

　狭帯域音源信号６が広帯域音源推定手段１４内の音源分析手段２１に入力される。音源分析手段２１内の狭帯域長周期予測分析手段３７は、狭帯域音源信号６に対して長周期予測分析を行い、狭帯域長周期予測符号である狭帯域長周期遅延３８と狭帯域長周期予測係数３９を出力する。なお、長周期予測分析については、ＣＥＬＰ系の符号化方式でしばしば用いられていた方法であるので説明を省略する。 The narrow band sound source signal 6 is input to the sound source analyzing means 21 in the wide band sound source estimating means 14. The narrow-band long-period prediction analysis unit 37 in the sound source analysis unit 21 performs a long-period prediction analysis on the narrow-band excitation signal 6, and outputs a narrow-band long-period delay 38, which is a narrow-band long-period prediction code, and a narrow-band long-period. The prediction coefficient 39 is output. Note that the long-period prediction analysis is a method often used in the CELP coding method, and thus the description is omitted.

　音源分析手段２１内の長周期逆フィルタ４０は、狭帯域長周期遅延３８と狭帯域長周期予測係数３９を用いて狭帯域音源信号６を逆フィルタリングし、得られた信号を狭帯域長周期予測残差信号４１として広帯域長周期予測残差推定手段４２に出力する。 The long-period inverse filter 40 in the sound source analyzer 21 performs inverse filtering of the narrow-band sound source signal 6 using the narrow-band long-period delay 38 and the narrow-band long-period prediction coefficient 39, and predicts the obtained signal in the narrow-band long-period prediction. The residual signal 41 is output to the wideband long-period prediction residual estimating means 42.

　広帯域長周期予測残差推定手段４２内の零詰手段４３は狭帯域長周期予測残差信号４１の各サンプル値間にＭ−１サンプルずつ零を挿入し、得られたＭ倍のサンプル数の信号を広帯域長周期予測残差信号４８として出力する。ここで、Ｍは、復元する広帯域音声信号のサンプリング周波数を、狭帯域音声信号のサンプリング周波数で除した値であり、零を挿入する動作は前記零詰手段１５と同じである。 The zero-filling means 43 in the wide-band long-period prediction residual estimating means 42 inserts zero by M-1 samples between each sample value of the narrow-band long-period prediction residual signal 41, and obtains M times the number of samples obtained. The signal is output as a wideband long-period prediction residual signal 48. Here, M is a value obtained by dividing the sampling frequency of the wideband audio signal to be restored by the sampling frequency of the narrowband audio signal, and the operation of inserting zero is the same as that of the zero-filling means 15.

　広帯域長周期予測パラメータ（符号）推定手段４４は、狭帯域長周期遅延３８をＭ倍して予測符号の１つである広帯域長周期遅延４５を出力し、また狭帯域長周期予測係数３９をｇ倍して他の予測符号である広帯域長周期予測係数４６を出力する。ｇを１とすると最終的に得られる広帯域音源信号１６のピッチ周期性が狭帯域音源信号６と同等となり、１から小さくしていくにつれて狭帯域音源信号６に比べてピッチ周期性が弱くなっていく。実施例２と同様に、高域を復元する場合にはｇを１より小さい値に設定した方が高品質となる。 The wide-band long-period prediction parameter (code) estimating means 44 multiplies the narrow-band long-period delay 38 by M to output a wide-band long-period delay 45, which is one of the prediction codes. The multiplication factor is multiplied to output a wideband long-period prediction coefficient 46 which is another prediction code. Assuming that g is 1, the pitch periodicity of the finally obtained broadband excitation signal 16 is equivalent to that of the narrowband excitation signal 6, and the pitch periodicity becomes weaker as compared with the narrowband excitation signal 6 as it is reduced from 1. Go. As in the second embodiment, when restoring a high frequency range, setting g to a value smaller than 1 results in higher quality.

　最後に、長周期合成フィルタ４７は、広帯域長周期遅延４５と広帯域長周期予測係数４６を用いて、広帯域長周期予測残差信号４８に対して長周期合成フィルタリングを行い、得られた信号を広帯域音源信号１６として出力する。 Finally, the long-period synthesis filter 47 performs long-period synthesis filtering on the wide-band long-period prediction residual signal 48 using the wide-band long-period delay 45 and the wide-band long-period prediction coefficient 46, and It is output as a sound source signal 16.

　この様に構成する事により、狭帯域音源信号の持つピッチ周期性の強さや変動に関する特徴が、狭帯域長周期遅延３８と狭帯域長周期予測係数３９によって良好に表現され、広帯域音源信号に反映されるので、様々に変化する音源を十分に推定でき、パルス的な音もなく、良好な音質の広帯域音声を復元することができる効果がある。また、話者によらずに適切な音源が推定できる効果がある。 With such a configuration, the characteristics related to the strength and fluctuation of the pitch periodicity of the narrow-band excitation signal are well represented by the narrow-band long-period delay 38 and the narrow-band long-period prediction coefficient 39, and are reflected in the wide-band excitation signal. Therefore, there is an effect that it is possible to sufficiently estimate a sound source that changes in various ways and to restore a broadband sound having good sound quality without pulse-like sound. Further, there is an effect that an appropriate sound source can be estimated without depending on a speaker.

　広帯域音源信号１６において、広帯域長周期遅延４５によって決まる基本周波数とその高調波成分の周波数が、正しく整数倍の位置に並ぶので、最終的に復元される広帯域音声信号２０での狭帯域成分と復元広帯域成分のつながりが良く、高品質な広帯域音声を復元できる効果がある。 In the broadband sound source signal 16, the fundamental frequency determined by the wideband long-period delay 45 and the frequency of its harmonic component are correctly aligned at integer multiple positions. The connection of the wideband components is good, and there is an effect that high-quality wideband speech can be restored.

　実施例６．
　図８は本発明の実施例６の広帯域音声復元装置における広帯域長周期予測残差推定手段４２の構成図である。図において、３５のパワー算出手段、３６の雑音生成手段は実施例３のものと同一である。その他の構成は図１および図７と同じであるので、説明を省略する。 Embodiment 6 FIG.
FIG. 8 is a configuration diagram of the wideband long-period prediction residual estimating means 42 in the wideband speech restoration apparatus according to the sixth embodiment of the present invention. In the figure, the power calculation means 35 and the noise generation means 36 are the same as those in the third embodiment. Other configurations are the same as those in FIG. 1 and FIG.

　以下、図８を用いて本発明の一実施例の動作について説明する。 Hereinafter, the operation of the embodiment of the present invention will be described with reference to FIG.

　狭帯域長周期予測残差信号４１が広帯域長周期予測残差推定手段４２内のパワー算出手段３５に入力される。パワー算出手段３５は狭帯域長周期予測残差信号４１のパワーを算出し、出力する。雑音生成手段３６は、パワー正規化された白色雑音信号を生成し出力する。そして、広帯域長周期予測残差推定手段４２内で、前記白色雑音信号にパワー算出手段３５が出力したパワーを乗じ、得られた信号を広帯域長周期予測残差信号４８として出力する。 The narrow-band long-period prediction residual signal 41 is input to the power calculator 35 in the wide-band long-period prediction residual estimator 42. The power calculator 35 calculates and outputs the power of the narrow band long cycle prediction residual signal 41. The noise generator 36 generates and outputs a power-normalized white noise signal. Then, the white noise signal is multiplied by the power output from the power calculation unit 35 in the wideband long-period prediction residual estimation unit 42, and the obtained signal is output as a wideband long-period prediction residual signal 48.

　実施例３での説明と同様に、狭帯域音源信号６におけるピッチ周期や周期性の強さの細かい変動分は狭帯域長周期遅延３８と狭帯域長周期予測係数３９では表現できないため、その誤差が狭帯域長周期予測残差信号４１に含まれている。実施例５のようにこの誤差成分を含む狭帯域長周期予測残差信号４１を用いて広帯域長周期予測残差信号４８を生成すると広帯域長周期予測残差信号４８に不必要な乱れが生じてしまう事があり、パワーが同じ白色雑音を生成して広帯域長周期予測残差信号４８として用いた方が良好な復元音が得られる場合がある。 As described in the third embodiment, fine fluctuations in the pitch period and the strength of the periodicity in the narrow-band excitation signal 6 cannot be expressed by the narrow-band long-period delay 38 and the narrow-band long-period prediction coefficient 39. Are included in the narrow-band long-period prediction residual signal 41. When the wideband long-period prediction residual signal 48 is generated using the narrowband long-period prediction residual signal 41 including the error component as in the fifth embodiment, unnecessary disturbance occurs in the wideband long-period prediction residual signal 48. In some cases, better restored sound may be obtained by generating white noise having the same power and using it as the wideband long-period prediction residual signal 48.

　実施例６の様に構成する事により、狭帯域長周期予測残差信号４１とパワーが同じ白色雑音を生成して広帯域長周期予測残差信号４８として用いているので、実施例５が持つ効果に加えて、ピッチ周期や周期性の強さの変動分による乱れの少ない良好な復元音が得られる効果がある。 By configuring as in the sixth embodiment, the white noise having the same power as the narrow-band long-period prediction residual signal 41 is generated and used as the wide-band long-period prediction residual signal 48. In addition to this, there is an effect that a good restored sound with little disturbance due to the fluctuation of the pitch period or the intensity of the periodicity can be obtained.

　また、零詰め処理を行うと４ＫＨｚを中心に対称なスペクトルが生成されるので、この０Ｈｚから３００Ｈｚと３．４ＫＨｚから４．０ＫＨｚの成分がない狭帯域長周期予測残差信号４１に対して行うと、０Ｈｚから３００Ｈｚ、３．４ＫＨｚから４．６、ＫＨｚ７．７ＫＨｚから８ＫＨｚの成分がない信号が得られてしまう。これに対し、白色雑音を用いるこの構成では、０Ｈｚから８ＫＨｚまで全ての成分を持つ広帯域長周期予測残差信号４８が得られるので、不足する帯域がない良好な復元音が得られる効果がある。特に０Ｈｚから３００Ｈｚの復元を行う場合には効果が大きい。 Further, when the zero padding processing is performed, a spectrum symmetrical about 4 KHz is generated. Therefore, the processing is performed on the narrow-band long-period prediction residual signal 41 having no components from 0 Hz to 300 Hz and 3.4 KHz to 4.0 KHz. Then, a signal having no components of 0 Hz to 300 Hz, 3.4 KHz to 4.6 KHz, and 7.7 KHz to 8 KHz is obtained. On the other hand, in this configuration using white noise, the wideband long-period prediction residual signal 48 having all components from 0 Hz to 8 KHz is obtained, so that there is an effect that a good restored sound without a lacking band is obtained. In particular, the effect is great when restoring from 0 Hz to 300 Hz.

　実施例７．
　図９は本発明の実施例７の広帯域音声復元装置における広帯域長周期予測残差推定手段４２の構成図である。図において、４３の零詰手段、３５のパワー算出手段、３６の雑音生成手段は実施例５および実施例６のものと同一である。その他の構成は図１および図７と同じであるので、説明を省略する。 Embodiment 7 FIG.
FIG. 9 is a block diagram of the wideband long-period prediction residual estimating means 42 in the wideband speech restoration apparatus according to the seventh embodiment of the present invention. In the figure, 43 zero filling means, 35 power calculation means, and 36 noise generation means are the same as those of the fifth and sixth embodiments. Other configurations are the same as those in FIG. 1 and FIG.

　以下、図９を用いて本発明の一実施例の動作について説明する。 Hereinafter, the operation of the embodiment of the present invention will be described with reference to FIG.

　狭帯域長周期予測残差信号４１が広帯域長周期予測残差推定手段４２内の零詰手段４３とパワー算出手段３５に入力される。広帯域長周期予測残差推定手段４２内の零詰手段４３は、狭帯域長周期予測残差信号４１の各サンプル値間にＭ−１サンプルずつ零を挿入し、得られたＭ倍のサンプル数の信号を出力する。ここで、Ｍは、復元する広帯域音声信号のサンプリング周波数を、狭帯域音声信号のサンプリング周波数で除した値であり、零を挿入する動作は前記零詰手段１５と同じである。 The narrow-band long-period prediction residual signal 41 is input to the zero-filling unit 43 and the power calculating unit 35 in the wide-band long-period prediction residual estimating unit 42. The zero-filling means 43 in the wide-band long-period prediction residual estimating means 42 inserts zeros by M-1 samples between each sample value of the narrow-band long-period prediction residual signal 41 and obtains M times the number of samples obtained. The signal of is output. Here, M is a value obtained by dividing the sampling frequency of the wideband audio signal to be restored by the sampling frequency of the narrowband audio signal, and the operation of inserting zero is the same as that of the zero-filling means 15.

　パワー算出手段３５は狭帯域長周期予測残差信号４１のパワーを算出し、出力する。雑音生成手段３６は、パワー正規化された白色雑音信号を生成し出力する。そして、零詰手段４３が出力した信号にゲインｇｒ１を乗じた信号と、雑音生成手段３６が出力した白色雑音信号にパワー算出手段３５が出力したパワーを乗じ、さらにゲインｇｒ２を乗じた信号を加算して広帯域長周期予測残差信号４８として出力する。 The power calculation means 35 calculates and outputs the power of the narrow band long cycle prediction residual signal 41. The noise generator 36 generates and outputs a power-normalized white noise signal. Then, a signal obtained by multiplying the signal output from the zero-filling means 43 by the gain gr1 is multiplied by the white noise signal output from the noise generating means 36 by the power output from the power calculating means 35, and further a signal obtained by multiplying the gain gr2. And outputs it as a wideband long-period prediction residual signal 48.

　実施例５および実施例６による復元音が、それぞれ一長一短を有している場合、この様に構成し、ｇｒ１とｇｒ２を適切に設定することで、両者を上回る品質の広帯域音声が復元できる得られる効果がある。なお、実施例５と実施例６と同じ効果も持っている。 In the case where the restored sounds according to the fifth and sixth embodiments have respective advantages and disadvantages, by configuring in this way and setting gr1 and gr2 appropriately, it is possible to restore a broadband sound having a quality exceeding both of them. effective. Note that the same effects as those of the fifth and sixth embodiments are obtained.

　実施例８．
　図１０は本発明の実施例８の広帯域音声復元装置における広帯域音源推定手段１４の構成図である。図において新規な部分は、４９のアップサンプリング手段、５０の零化手段である。全体構成は、図１と同じであるので、説明を省略する。 Embodiment 8 FIG.
FIG. 10 is a block diagram of the wideband sound source estimating means 14 in the wideband speech restoration apparatus according to the eighth embodiment of the present invention. The new parts in the figure are 49 upsampling means and 50 nulling means. Since the entire configuration is the same as that of FIG. 1, the description is omitted.

　以下、図１０を用いて本発明の一実施例の動作について説明する。 Hereinafter, the operation of the embodiment of the present invention will be described with reference to FIG.

　狭帯域音源信号６がアップサンプリング手段４９に入力される。アップサンプリング手段４９は、狭帯域音源信号６をＭ倍にアップサンプリングして、得られた信号を音源分析手段２１に出力する。 The narrow-band sound source signal 6 is input to the up-sampling means 49. The up-sampling unit 49 up-samples the narrow-band sound source signal 6 by M times and outputs the obtained signal to the sound source analyzing unit 21.

　音源分析手段２１内の狭帯域長周期予測分析手段３７は、アップサンプリング手段４９の出力信号に対して長周期予測分析を行い、狭帯域長周期遅延３８と狭帯域長周期予測係数３９を出力する。なお、長周期予測分析における遅延探索範囲が実施例５の場合のＭ倍になる。 The narrow-band long-period prediction analysis unit 37 in the sound source analysis unit 21 performs a long-period prediction analysis on the output signal of the up-sampling unit 49, and outputs a narrow-band long-period delay 38 and a narrow-band long-period prediction coefficient 39. . Note that the delay search range in the long-period prediction analysis is M times that in the fifth embodiment.

　音源分析手段２１内の長周期逆フィルタ４０は、狭帯域長周期遅延３８と狭帯域長周期予測係数３９を用いて、アップサンプリング手段４９の出力信号を逆フィルタリングし、得られた信号を狭帯域長周期予測残差信号４１として広帯域長周期予測残差推定手段４２に出力する。 The long-period inverse filter 40 in the sound source analyzing unit 21 performs inverse filtering on the output signal of the up-sampling unit 49 using the narrow-band long-period delay 38 and the narrow-band long-period prediction coefficient 39, and converts the obtained signal into a narrow-band signal. The long-period prediction residual signal 41 is output to the wide-band long-period prediction residual estimating means 42.

　広帯域長周期予測残差推定手段４２内の零化手段５０は、狭帯域長周期予測残差信号４１のＭサンプル置きの信号のみを残し、残りの信号の値を零とする。そして、得られた信号を広帯域長周期予測残差信号４８として出力する。 The zeroing means 50 in the wideband long-period prediction residual estimating means 42 leaves only the signal every M samples of the narrowband long-period prediction residual signal 41, and sets the value of the remaining signal to zero. Then, the obtained signal is output as a wideband long-period prediction residual signal 48.

　広帯域長周期予測パラメータ推定手段４４は、狭帯域長周期遅延３８をそのまま広帯域長周期遅延４５として出力し、狭帯域長周期予測係数３９をｇ倍して広帯域長周期予測係数４６として出力する。ｇについては実施例５と同様である。 The wideband long-period prediction parameter estimating means 44 outputs the narrow-band long-period delay 38 as it is as the wide-band long-period delay 45, multiplies the narrow-band long-period prediction coefficient 39 by g, and outputs it as the wide-band long-period prediction coefficient 46. g is the same as in Example 5.

　この様に構成する事により、高いサンプリング周波数の信号に対して長周期分析が行えるので、より精度の高い遅延が分析できるようになり、狭帯域音源信号の持つピッチ周期性の強さや変動に関する特徴をより細かく広帯域音源信号に反映することが可能となり、様々に変化する音源を十分に推定でき、良好な音質の広帯域音声を復元することができる効果がある。なお、実施例５と同じ効果も持っている。 With this configuration, long-period analysis can be performed on a signal with a high sampling frequency, so that a more accurate delay can be analyzed, and characteristics relating to the strength and fluctuation of the pitch periodicity of the narrow-band sound source signal. Can be reflected more finely in a wideband sound source signal, and there can be obtained an effect that a sound source that changes variously can be sufficiently estimated, and a wideband sound with good sound quality can be restored. Note that the same effect as in the fifth embodiment is also obtained.

　実施例９．
　図１１は本発明の実施例９の広帯域音声復元装置の構成図である。図において新規な部分は、５１の狭帯域パワー算出手段、５２の狭帯域音源パワー、５３の狭帯域パワー込みスペクトル符号帳である。その他は、前記したものと同じであるので、動作に若干の差異があるものだけ説明を行う。 Embodiment 9 FIG.
FIG. 11 is a configuration diagram of a wideband audio restoration apparatus according to Embodiment 9 of the present invention. The new parts in the figure are a narrow-band power calculating means 51, a narrow-band excitation power 52, and a narrow-band power-inclusive spectrum codebook 53. Others are the same as those described above, and therefore only those having a slight difference in operation will be described.

　以下、図１１を用いて本発明の一実施例の動作について説明する。 Hereinafter, the operation of the embodiment of the present invention will be described with reference to FIG.

　分析手段２内の狭帯域パワー算出手段５１は、狭帯域音源信号６の振幅情報に含まれるパワーを算出して狭帯域音源パワー５２として出力する。この他にスペクトルパラメータ４と、狭帯域音源信号６も出力する。 The narrow-band power calculating means 51 in the analyzing means 2 calculates the power included in the amplitude information of the narrow-band sound source signal 6 and outputs the calculated power as the narrow-band sound source power 52. In addition, it also outputs a spectrum parameter 4 and a narrow-band sound source signal 6.

　広帯域スペクトル推定手段７内のベクトル量子化手段８は、狭帯域パワー込みスペクトル符号帳５３を用いて、狭帯域スペクトルパラメータ４と狭帯域音源パワー５２を一括してベクトル量子化し、得られたスペクトル符号１０を広帯域スペクトル推定手段７内の逆量子化手段１１に出力する。 The vector quantization means 8 in the wideband spectrum estimating means 7 collectively vector-quantizes the narrowband spectral parameters 4 and the narrowband excitation power 52 using the narrowband power-added spectral codebook 53, and obtains the obtained spectral code. 10 is output to the inverse quantization means 11 in the wideband spectrum estimation means 7.

　ここで、狭帯域パワー込みスペクトル符号帳５３は、多くの狭帯域音声信号を分析して得られた狭帯域スペクトルパラメータと狭帯域音源パワーの対を学習データとして、文献１と同様な方法で作成する。狭帯域パワー込みスペクトル符号帳５３の学習時とベクトル量子化手段８における距離尺度としては、パワーの対数値のユークリッド距離をｗ倍したものとスペクトルパラメータのユークリッド距離を加算したものを用いることができる。 Here, the narrow-band power-added spectrum codebook 53 is created in the same manner as in Reference 1, using a pair of a narrow-band spectral parameter and a narrow-band sound source power obtained by analyzing many narrow-band speech signals as learning data. I do. As a distance measure at the time of learning the narrow-band power-added spectrum codebook 53 and at the vector quantization means 8, a value obtained by adding the Euclidean distance of the logarithmic value of power to w times and the Euclidean distance of the spectral parameter can be used. .

　なお、狭帯域パワー算出手段５１が狭帯域音源信号６ではなく、狭帯域音声信号１のパワーを算出して、これを上記狭帯域音源パワー５２の代わりに用いる事もできる。この場合には、狭帯域スペクトルパラメータと狭帯域音声信号のパワーの対を学習データとして、狭帯域パワー込みスペクトル符号帳５３の学習を行う。 Note that the narrow-band power calculating means 51 may calculate the power of the narrow-band sound signal 1 instead of the narrow-band sound source signal 6 and use the calculated power instead of the narrow-band sound source power 52. In this case, learning of the narrow-band power-added spectrum codebook 53 is performed using the pair of the narrow-band spectral parameter and the power of the narrow-band audio signal as learning data.

　この様に構成する事により、実施例１が持つ効果に加えて、広帯域のスペクトルパラメータの推定にパワーに関する情報が反映され、より安定に良好なスペクトルが推定できる効果がある。 With such a configuration, in addition to the effects of the first embodiment, information relating to power is reflected in estimation of broadband spectrum parameters, and there is an effect that a good spectrum can be more stably estimated.

　実施例１０．
　図１２は本発明の実施例１０の広帯域音声復元装置の構成図である。図において新規な部分は、５４の音源正規化手段、５５の狭帯域正規化音源信号、５６の広帯域正規化音源信号、５７の広帯域パワー符号帳、５８の広帯域音源パワー、広帯域スペクトル推定手段に含まれる５９の広帯域音源パワー推定手段である。その他は、前記したものと同じであるので、説明を省略する。 Embodiment 10 FIG.
FIG. 12 is a configuration diagram of a wideband audio restoration apparatus according to Embodiment 10 of the present invention. In the figure, the new parts are included in the sound source normalization means 54, the narrow band normalized excitation signal 55, the wideband normalized excitation signal 56, the wideband power codebook 57, the wideband excitation power 58, and the wideband spectrum estimation means. 59 broadband sound source power estimating means. Others are the same as those described above, and the description is omitted.

　以下、図１２を用いて本発明の一実施例の動作について説明する。 Hereinafter, the operation of the embodiment of the present invention will be described with reference to FIG.

　分析手段２内の音源正規化手段５４は、狭帯域音源信号６の振幅情報に含まれるパワーを算出して狭帯域音源パワー５２として広帯域音源パワー推定手段５９に出力するとともに、狭帯域音源信号６のパワーを正規化した信号を狭帯域正規化音源信号５５として広帯域音源推定手段１４に出力する。 The sound source normalizing means 54 in the analyzing means 2 calculates the power included in the amplitude information of the narrow band sound source signal 6 and outputs it as the narrow band sound source power 52 to the wide band sound source power estimating means 59. Is output to the broadband sound source estimating means 14 as a narrow band normalized sound source signal 55.

　実際には広帯域スペクトル推定手段７内にある広帯域音源パワー推定手段５９中のベクトル量子化手段８は、狭帯域パワー込みスペクトル符号帳５３を用いて、狭帯域スペクトルパラメータ４と狭帯域音源パワー５２を一括してベクトル量子化し、得られたスペクトル符号１０を広帯域音源パワー推定手段５９内の逆量子化手段１１に出力する。逆量子化手段１１は、広帯域パワー符号帳５７を用いてスペクトル符号１０を復号し、得られた広帯域音源パワー５８を出力する。 Actually, the vector quantization means 8 in the wideband excitation power estimation means 59 in the wideband spectrum estimation means 7 uses the narrowband power-inclusive spectrum codebook 53 to convert the narrowband spectral parameters 4 and the narrowband excitation power 52. The spectral code 10 obtained by the vector quantization is output to the inverse quantization means 11 in the wideband excitation power estimation means 59. The inverse quantization means 11 decodes the spectrum code 10 using the wideband power codebook 57 and outputs the obtained wideband excitation power 58.

　広帯域音源推定手段１４は、狭帯域正規化音源信号５４を用いて、広帯域正規化音源信号５６を推定する。なお、広帯域スペクトル推定手段７と広帯域音源推定手段１４における推定には、実施例１ないし実施例８と同様な方法を用いる事ができる。そして、この広帯域正規化音源信号５６に前記広帯域音源パワー５８を乗じて広帯域音源信号１６を生成する。 The wideband sound source estimating means 14 estimates the wideband normalized sound source signal 56 using the narrowband normalized sound source signal 54. The estimation in the broadband spectrum estimating means 7 and the wideband sound source estimating means 14 can be performed in the same manner as in the first to eighth embodiments. Then, the broadband normalized excitation signal 56 is multiplied by the broadband excitation power 58 to generate the broadband excitation signal 16.

　この様に構成する事により、実施例１が持つ効果に加えて、広帯域音源パワーの推定にスペクトルパラメータの違いを反映させる事ができるので、より正しい振幅を持った広帯域音声が復元できる効果がある。 With such a configuration, in addition to the effect of the first embodiment, the estimation of the broadband sound source power can reflect the difference in the spectrum parameter, and thus, there is an effect that a wideband voice having a more correct amplitude can be restored. .

　実施例１１．
　図１３は本発明の実施例１１の広帯域音声復元装置の構成図である。図において新規な部分は、６０の広帯域パワー込みスペクトル符号帳である。その他は、図１１および図１２と同じであるので、動作に若干の差異があるものだけ説明を行う。 Embodiment 11 FIG.
FIG. 13 is a configuration diagram of a wideband audio restoration apparatus according to Embodiment 11 of the present invention. The new part in the figure is the 60 wideband power-inclusive spectral codebook. The other parts are the same as those in FIGS. 11 and 12, and only those having a slight difference in the operation will be described.

　以下、図１３を用いて本発明の一実施例の動作について説明する。 Hereinafter, the operation of the embodiment of the present invention will be described with reference to FIG.

　広帯域スペクトル推定手段７内の逆量子化手段１１は、広帯域パワー込みスペクトル符号帳６０を用いてスペクトル符号１０を復号し、得られた広帯域スペクトルパラメータ１３と広帯域音源パワー５８を出力する。 The inverse quantization means 11 in the wideband spectrum estimating means 7 decodes the spectrum code 10 using the wideband power-added spectrum codebook 60, and outputs the obtained wideband spectrum parameters 13 and wideband excitation power 58.

　ここで、広帯域パワー込みスペクトル符号帳６０は、多くの広帯域音声信号を分析して得られた広帯域スペクトルパラメータと広帯域音源パワーの対を学習データとして、文献１と同様な方法で作成する。距離尺度には、狭帯域パワー込みスペクトル符号帳５３の作成に用いたものと同じものを用いる。 Here, the spectrum codebook 60 with wideband power is created by a method similar to that of Reference 1 using, as learning data, a pair of a wideband spectrum parameter and a wideband excitation power obtained by analyzing many wideband speech signals. As the distance scale, the same scale as that used to create the spectrum codebook 53 including the narrow band power is used.

　この様に構成する事により、実施例９と実施例１０が持つ効果を合わせ持つ事ができる。 With such a configuration, the effects of the ninth and tenth embodiments can be obtained.

　実施例１２．
　図１４は本発明の実施例１２の広帯域音声復元装置の構成図である。図において新規な部分は、６１のポストフィルタ手段である。その他は、実施例１ないし実施例１１と同じであり、説明を省略する。 Embodiment 12 FIG.
FIG. 14 is a configuration diagram of a wideband audio restoration apparatus according to Embodiment 12 of the present invention. The new part in the figure is 61 post-filter means. Other configurations are the same as those in the first to eleventh embodiments, and a description thereof will be omitted.

　以下、図１４を用いて本発明の一実施例の動作について説明する。 Hereinafter, the operation of the embodiment of the present invention will be described with reference to FIG.

　ポストフィルタ手段６１は、合成フィルタ１７が出力した仮の広帯域音声信号に対してポストフィルタリング処理を行い、得られた信号を帯域フィルタ１８に出力する。帯域フィルタ１８は、ポストフィルタ手段６１が出力した信号に対して、帯域通過フィルタ処理を行い、狭帯域音声の成分のある帯域以外の成分を抽出する。 The post-filter unit 61 performs post-filtering processing on the temporary wideband audio signal output from the synthesis filter 17, and outputs the obtained signal to the band filter 18. The band-pass filter 18 performs band-pass filter processing on the signal output from the post-filter unit 61, and extracts components other than the band including the narrow-band sound component.

　なお、ポストフィルタリング処理は、聴感的品質を改善する信号加工処理のことで、ピッチ周期性やスペクトルの極を強調したり、高域を強調して明瞭性を改善したり、伝送路を通す際に発生する歪が多い帯域を抑圧して歪感を低減するものである。 The post-filtering process is a signal processing process that improves perceived quality. It emphasizes pitch periodicity and spectral poles, enhances high frequencies to improve clarity, and is used when passing through a transmission path. In this case, a band in which a large amount of distortion occurs is suppressed to reduce the sense of distortion.

　ピッチ周期性の強調処理としては、ピッチ周期だけ前の仮の広帯域音声信号に１より小さい係数を乗じて現在の仮の広帯域音声信号に加算する方法が一般的である。 As a process of enhancing the pitch periodicity, a method is generally used in which a temporary wideband audio signal before the pitch period is multiplied by a coefficient smaller than 1 and added to the current temporary wideband audio signal.

　極強調処理としては、広帯域スペクトルパラメータ１３を変形して、広帯域スペクトルパラメータ１３の持つ極周波数近傍の周波数帯域に大きなゲインを持ち、広帯域スペクトルパラメータ１３の持つ極近傍以外の周波数帯域に小さいゲインを持つ極零型のフィルタのフィルタ係数を算出する方法が各種提案されており、このフィルタを仮の広帯域音声信号に掛けることで実現できる。また、伝送路を通す際に発生する歪は振幅の小さい周波数帯域、つまり極近傍以外の周波数帯域に多いので、この極強調処理により歪が多い帯域を抑圧する事もできる。 As the pole enhancement processing, the wideband spectrum parameter 13 is modified to have a large gain in a frequency band near the pole frequency of the wideband spectrum parameter 13 and a small gain in a frequency band other than the pole band of the wideband spectrum parameter 13. Various methods for calculating the filter coefficient of the pole-zero filter have been proposed, and can be realized by applying this filter to a temporary wideband audio signal. Further, since distortion generated when passing through the transmission path is large in a frequency band having a small amplitude, that is, in a frequency band other than a very close vicinity, a band with a large amount of distortion can be suppressed by this pole enhancement processing.

　高域強調処理としては、プリエンファシスと呼ばれる方法、すなわち１点前の仮の広帯域音声信号に１以下の係数を乗じて現在の仮の広帯域音声信号から減算する方法が一般的である。 As the high-frequency emphasis processing, a method called pre-emphasis, that is, a method of multiplying a temporary wideband audio signal one point before by a coefficient of 1 or less and subtracting it from the current temporary wideband audio signal is generally used.

　また、図１４において、ポストフィルタ手段６１と帯域フィルタ１８が逆の位置でも構わないし、広帯域音声信号２０に対してポストフィルタ手段６１をかける構成でも構わない。 In FIG. 14, the post filter means 61 and the band-pass filter 18 may be in opposite positions, or the post-filter means 61 may be applied to the wideband audio signal 20.

　この様に構成する事で、実施例１が持つ効果に加えて、復元された広帯域音声信号の音質が不足する場合に、広帯域音声信号のピッチ周期性やスペクトルの極を強調したり、高域を強調して明瞭性を改善したり、伝送路を通す際に発生する歪が多い帯域を抑圧して歪感を低減することができる効果がある。 With such a configuration, in addition to the effects of the first embodiment, when the sound quality of the restored wideband audio signal is insufficient, the pitch periodicity of the wideband audio signal or the pole of the spectrum is emphasized, Is emphasized to improve the clarity, and there is an effect that a band with much distortion generated when passing through a transmission path is suppressed to reduce a feeling of distortion.

　なお、図１４において逆フィルタ５と広帯域音源推定手段１４を外した構成も可能である。この構成は、文献１に本発明を適用したものに相当し、上記と同様の効果がある。 In addition, a configuration in which the inverse filter 5 and the wideband sound source estimating means 14 are removed in FIG. 14 is also possible. This configuration corresponds to the application of the present invention to Document 1, and has the same effects as described above.

　実施例１３．
　実施例１ないし実施例１２における広帯域スペクトル推定手段７が、狭帯域スペクトルパラメータ４をそのまま広帯域スペクトルパラメータ１３として出力する構成も可能である。 Embodiment 13 FIG.
A configuration is also possible in which the wideband spectrum estimating means 7 in the first to twelfth embodiments outputs the narrowband spectrum parameter 4 as it is as the wideband spectrum parameter 13.

　図１５は、この場合の狭帯域スペクトルと広帯域スペクトルの概形の関係を説明する説明図である。狭帯域スペクトルパラメータ４が表すスペクトル包絡が図１５（ａ）である場合、これをそのまま広帯域スペクトルパラメータ１３として用いると、結果的にその幅が伸張し、広帯域スペクトルパラメータ１３が表すスペクトル包絡は図１５（ａ）を周波数軸方向にＭ倍に引き伸ばした形で、Ｍが２の時には図１５（ｂ）のようになる。従って、狭帯域スペクトル包絡の２ＫＨｚから３．４ＫＨｚが高い場合には復元される３．４ＫＨｚ以上の高域も高くなり、逆に２ＫＨｚから３．４ＫＨｚが低い場合には高域も低くなり、この結果狭帯域スペクトル包絡のおおまかな傾斜がそのまま高域に反映される事となる。 FIG. 15 is an explanatory diagram for explaining the general relationship between the narrowband spectrum and the wideband spectrum in this case. In the case where the spectrum envelope represented by the narrowband spectrum parameter 4 is as shown in FIG. 15A, if this is used as it is as the broadband spectrum parameter 13, the width is consequently expanded, and the spectrum envelope represented by the wideband spectrum parameter 13 is as shown in FIG. FIG. 15B is a diagram in which FIG. 15A is expanded M times in the frequency axis direction, and when M is 2, FIG. Therefore, when the narrow band spectral envelope is higher from 2 KHz to 3.4 KHz, the restored high band above 3.4 KHz also becomes higher, and conversely, when the lower 2 KHz to 3.4 KHz is lower, the higher band becomes lower. As a result, the rough slope of the narrow band spectral envelope is directly reflected in the high band.

　この様に構成する事で、実施例１が持つ効果に加えて、おおまかではあるが、極めて簡単に広帯域スペクトルを復元できる効果がある。実施例１に比べて、符号帳を蓄積しておくメモリが不必要で、演算量が少なくなる効果がある。 With such a configuration, in addition to the effects of the first embodiment, there is an effect that the broadband spectrum can be restored extremely easily, though roughly. Compared with the first embodiment, there is no need for a memory for storing a codebook, and there is an effect that the amount of calculation is reduced.

　実施例１４．
　実施例１ないし実施例１２において、広帯域スペクトル推定手段７が、狭帯域スペクトルパラメータ４の最低次から所定次数までを広帯域スペクトルパラメータ１３として出力する構成も可能である。ただし、スペクトル分析手段３が出力する狭帯域スペクトルパラメータ４としては、ＰＡＲＣＯＲ係数や自己相関係数のように最低次から所定次数までを取り出したものを広帯域スペクトルパラメータ１３としてもちいても合成が常に安定なパラメータである場合に限られる。 Embodiment 14 FIG.
In the first to twelfth embodiments, a configuration is also possible in which the wideband spectrum estimating means 7 outputs from the lowest order to the predetermined order of the narrowband spectrum parameter 4 as the wideband spectrum parameter 13. However, the narrow band spectral parameter 4 output from the spectrum analyzing means 3 is always stable even when the one extracted from the lowest order to the predetermined order such as a PARCOR coefficient or an autocorrelation coefficient is used as the wide band spectral parameter 13. Only when the parameters are appropriate.

　図１６は、この場合の狭帯域スペクトルと広帯域スペクトルの概形の関係を説明する説明図である。狭帯域スペクトルパラメータ４が表すスペクトル包絡が図１５（ａ）である場合、これの最低次から所定次数までを広帯域スペクトルパラメータ１３として用いると、広帯域スペクトルパラメータ１３が表すスペクトル包絡は図１５（ａ）を周波数軸方向にＭ倍に引き伸ばして更に極構造をなめらかにした形となり、Ｍが２の時には図１５（ｂ）のようになる。この結果狭帯域スペクトル包絡のおおまかな傾斜がそのまま高域に反映され、かつ存在しない強い極が高域に生成され、不自然な復元音が発生することを抑えることができる。 FIG. 16 is an explanatory diagram for explaining the general relationship between the narrowband spectrum and the wideband spectrum in this case. In the case where the spectrum envelope represented by the narrowband spectrum parameter 4 is as shown in FIG. 15A, if the lowest to the predetermined order of the spectrum envelope is used as the wideband spectrum parameter 13, the spectrum envelope represented by the wideband spectrum parameter 13 is as shown in FIG. Is expanded M times in the frequency axis direction to further smooth the pole structure, and when M is 2, it becomes as shown in FIG. As a result, the rough slope of the narrow-band spectrum envelope is directly reflected in the high band, and a strong non-existent pole is generated in the high band, so that generation of an unnatural restoration sound can be suppressed.

　この様に構成する事で、実施例１３が持つ効果に加えて、実施例１３の場合にまれにおこる、存在しない強い極が高域に生成されて不自然な復元音の発生を抑える事ができる効果がある。 With this configuration, in addition to the effect of the thirteenth embodiment, it is possible to suppress the occurrence of an unnatural restoration sound, which is rarely generated in the thirteenth embodiment and has a strong non-existent pole in the high range. There is an effect that can be done.

　実施例１５．
　図１７は本発明の実施例１５の広帯域音声復元装置の広帯域スペクトル推定手段７の構成図である。図において新規な部分は、６２のスペクトルパラメータ変換手段、６３の次数低減手段、６４のスペクトルパラメータ逆変換手段である。その他は、実施例１ないし実施例１２と同じであり、説明を省略する。 Embodiment 15 FIG.
FIG. 17 is a configuration diagram of the wideband spectrum estimating means 7 of the wideband speech restoration apparatus according to Embodiment 15 of the present invention. In the figure, new parts are a spectrum parameter conversion means 62, an order reduction means 63, and a spectrum parameter inverse conversion means 64. Other configurations are the same as those of the first to twelfth embodiments, and a description thereof will be omitted.

　以下、図１７を用いて本発明の一実施例の動作について説明する。 Hereinafter, the operation of the embodiment of the present invention will be described with reference to FIG.

　広帯域スペクトル推定手段７内のスペクトルパラメータ変換手段６２は、狭帯域スペクトルパラメータ４を、ＰＡＲＣＯＲ係数や自己相関係数のように最低次から所定次数までを取り出した場合に合成が常に安定なパラメータに変換する。次数低減手段６３は、スペクトルパラメータ変換手段６２が出力したパラメータの最低次から所定次数までを取り出したものをスペクトルパラメータ逆変換手段６４に出力する。スペクトルパラメータ逆変換手段６４は、次数低減手段６３の出力したパラメータを狭帯域スペクトルパラメータ４と同じ領域に戻し、広帯域スペクトルパラメータ１３として出力する。 The spectrum parameter converting means 62 in the wide band spectrum estimating means 7 converts the narrow band spectral parameter 4 into a parameter whose synthesis is always stable when the narrow band spectral parameter 4 is extracted from the lowest order to a predetermined order such as a PARCOR coefficient or an autocorrelation coefficient. I do. The order reduction unit 63 outputs a parameter extracted from the lowest order to a predetermined order of the parameters output by the spectrum parameter conversion unit 62 to the spectrum parameter inverse conversion unit 64. The spectrum parameter inverse transform unit 64 returns the parameter output from the order reduction unit 63 to the same region as the narrowband spectral parameter 4 and outputs the same as the wideband spectral parameter 13.

　この様に構成する事で、狭帯域スペクトルパラメータ４が、最低次から所定次数までを取り出した場合に合成が不安定になるパラメータである場合でも、実施例１４と同じ効果が得られる。構成 With such a configuration, the same effect as in the fourteenth embodiment can be obtained even when the narrowband spectral parameter 4 is a parameter in which the synthesis becomes unstable when extracting from the lowest order to the predetermined order.

　実施例１６．
　実施例１４および実施例１５では、次数低減によって強い極を抑制したが、スペクトルパラメータとして自己相関係数を用いてこれにラグ窓をかける等、類似の効果を与える方法を用いる事ができる。 Embodiment 16 FIG.
In the fourteenth and fifteenth embodiments, the strong poles are suppressed by the order reduction. However, a method that gives a similar effect, such as applying a lag window to the autocorrelation coefficient as a spectral parameter, can be used.

　この様に構成する事で、実施例１４と同じ効果が別の手段で得られる効果がある。構成 With this configuration, the same effect as that of the fourteenth embodiment can be obtained by another means.

　なお、上記実施例１３ないし１６の広帯域スペクトル推定手段７を、文献１等の従来構成に適用する事も可能である。例えば文献１に適用する場合の全体構成は、図１４から逆フィルタ５、広帯域音源推定手段１４、ポストフィルタ手段６１を外したものとなる。この様に構成した場合には、実施例１３ないし１６にて新たに発生した効果を従来技術に付加する事ができる。 It is also possible to apply the wideband spectrum estimating means 7 of the thirteenth to sixteenth embodiments to a conventional configuration such as that of Reference 1. For example, the overall configuration when applied to Document 1 is such that the inverse filter 5, the broadband sound source estimating means 14, and the post-filtering means 61 are removed from FIG. In such a configuration, the effects newly generated in the thirteenth to sixteenth embodiments can be added to the conventional technology.

　実施例１７．
　以下の実施例では、伝送等による符号化情報を基に広帯域音声を復元する装置に対して本発明を適用する例を説明する。 Embodiment 17 FIG.
In the following embodiment, an example will be described in which the present invention is applied to an apparatus for restoring wideband speech based on coded information by transmission or the like.

　図１８は本発明の実施例１７の広帯域音声復元装置の構成図である。図において、１０１は狭帯域音声符号、１０２は分離手段、１０３は狭帯域スペクトル符号、１０４は狭帯域音源符号、１０５は広帯域スペクトル復号手段、１０６は広帯域音源復号手段、１０７は狭帯域スペクトル復号手段、１０８は狭帯域音源復号手段、１０９は狭帯域音声復号手段である。その他は、実施例１ないし実施例１６と同じであり、説明を省略する。 FIG. 18 is a configuration diagram of a wideband audio restoration apparatus according to Embodiment 17 of the present invention. In the figure, 101 is a narrowband speech code, 102 is a separating means, 103 is a narrowband spectral code, 104 is a narrowband excitation code, 105 is a wideband spectral decoding means, 106 is a wideband excitation decoding means, and 107 is a narrowband spectral decoding means , 108 are narrowband sound source decoding means, and 109 is a narrowband speech decoding means. Other configurations are the same as those in the first to sixteenth embodiments, and a description thereof will not be repeated.

　本実施例においても、再分析を行わずに良好な広帯域音源信号を得る構成となっている。において This embodiment is also configured to obtain a good wideband sound source signal without performing reanalysis.

　以下、図１８を用いて本発明の一実施例の動作について説明する。 Hereinafter, the operation of the embodiment of the present invention will be described with reference to FIG.

　まず、狭帯域音声符号１０１が、分離手段１０２と狭帯域音声復号手段１０９に入力される。この狭帯域音声符号１０１は、例えば８ＫＨｚでサンプリングされ、３００Ｈｚから３．４ＫＨｚの電話帯域に制限された狭帯域音声信号が別途符号化されたものであり、蓄積メディアや通信路から入力されて来るものである。 {First, the narrowband speech code 101 is input to the separating unit 102 and the narrowband speech decoding unit 109. The narrow-band speech code 101 is, for example, sampled at 8 KHz, is a separately encoded narrow-band speech signal limited to a telephone band of 300 Hz to 3.4 KHz, and is input from a storage medium or a communication path. Things.

　分離手段１０２では、狭帯域音声符号１０１を狭帯域スペクトル符号１０３と狭帯域音源符号１０４に分離して、狭帯域スペクトル符号１０３を広帯域スペクトル復号手段１０５に、狭帯域音源符号１０４を広帯域音源復号手段１０６に出力する。 The separation means 102 separates the narrowband speech code 101 into a narrowband spectrum code 103 and a narrowband excitation code 104, and converts the narrowband spectrum code 103 into a wideband spectrum decoding means 105 and the narrowband excitation code 104 into a wideband excitation codec. Output to 106.

　広帯域スペクトル復号手段１０５内の狭帯域スペクトル復号手段１０７は、狭帯域スペクトル符号１０３を復号して、得られた狭帯域スペクトルパラメータ４を出力する。なお、狭帯域スペクトル復号手段１０７は、狭帯域音声符号１０１が符号化された時に用いられた狭帯域スペクトルパラメータの符号化処理の逆の処理を行えば良い。狭 The narrow band spectrum decoding unit 107 in the wide band spectrum decoding unit 105 decodes the narrow band spectrum code 103 and outputs the obtained narrow band spectrum parameter 4. It should be noted that the narrowband spectrum decoding means 107 only needs to perform a process opposite to the process of encoding the narrowband spectrum parameters used when the narrowband speech code 101 was encoded.

　そして、広帯域スペクトル復号手段１０５内の広帯域スペクトル推定手段７が、前記狭帯域スペクトルパラメータ４を用いて広帯域スペクトルパラメータ１３を推定する。なお、広帯域スペクトル推定手段７としては、これまで説明を行った実施例に記載されている方法を用いる事ができる。 {Circle around (2)} The wideband spectrum estimating means 7 in the wideband spectrum decoding means 105 estimates the wideband spectrum parameters 13 using the narrowband spectrum parameters 4. As the broadband spectrum estimating means 7, the method described in the embodiment described above can be used.

　広帯域音源復号手段１０６内の狭帯域音源復号手段１０８は、前記狭帯域音源符号１０４を復号して、得られた狭帯域音源信号６を出力する。そして、広帯域音源復号手段１０６内の広帯域音源推定手段１４が、前記狭帯域音源信号６を用いて広帯域音源信号１６を推定する。狭 The narrow band excitation decoding unit 108 in the wide band excitation decoding unit 106 decodes the narrow band excitation code 104 and outputs the obtained narrow band excitation signal 6. Then, the wideband excitation estimating means 14 in the wideband excitation decoding means 106 estimates the wideband excitation signal 16 using the narrowband excitation signal 6.

　なお、広帯域音源推定手段１４には、零詰手段等を用いる事ができる。狭帯域音源復号手段１０８では、狭帯域音声符号１０１が符号化された時に用いられた狭帯域音源信号の符号化処理の逆の処理を行えば良い。 Note that the wideband sound source estimating means 14 may use a zero-filling means or the like. The narrowband excitation decoding means 108 may perform the reverse of the encoding of the narrowband excitation signal used when the narrowband speech code 101 was encoded.

　合成フィルタ１７は、広帯域スペクトルパラメータ１３を用いて広帯域音源信号１６に合成フィルタ処理を行い仮の広帯域音声信号を生成する。帯域フィルタ１８は、この仮の広帯域音声信号に対して、帯域通過フィルタ処理を行い、狭帯域音声の成分のある帯域以外の成分を抽出する。広帯域音声信号の帯域が０Ｈｚから７．３ＫＨｚの場合、０Ｈｚから３００Ｈｚと３．４ＫＨｚから７．３ＫＨｚの成分が抽出される。 The synthesis filter 17 performs synthesis filter processing on the broadband sound source signal 16 using the wideband spectral parameters 13 to generate a temporary wideband audio signal. The band-pass filter 18 performs a band-pass filter process on the provisional wide-band audio signal, and extracts components other than the band having the narrow-band audio component. When the band of the wideband audio signal is from 0 Hz to 7.3 kHz, components of 0 Hz to 300 Hz and components of 3.4 kHz to 7.3 kHz are extracted.

　一方、狭帯域音声復号手段１０９は、入力した狭帯域音声符号１０１を復号して、得られた狭帯域音声信号１をアップサンプリング手段１９に出力する。この復号処理は、狭帯域音声符号１０１が符号化された時に用いられた符号化処理の逆の処理を行えば良い。 On the other hand, the narrowband speech decoding means 109 decodes the input narrowband speech code 101 and outputs the obtained narrowband speech signal 1 to the upsampling means 19. This decoding process may be the reverse of the encoding process used when the narrowband speech code 101 was encoded.

　次に、アップサンプリング手段１９は、狭帯域音声信号１をＭ倍にアップサンプリングする。アップサンプリングによって生成される信号は、サンプリング周波数が広帯域音声信号２０と同じで、狭帯域音声信号１と同じ狭帯域成分を持つものである。そして、帯域フィルタ１８の出力とアップサンプリング手段１９の出力を加算して広帯域音声信号２０を生成する。 Next, the up-sampling means 19 up-samples the narrowband audio signal 1 by M times. The signal generated by the upsampling has the same sampling frequency as the wideband audio signal 20 and has the same narrowband component as the narrowband audio signal 1. Then, the output of the bandpass filter 18 and the output of the upsampling means 19 are added to generate a wideband audio signal 20.

　この様に構成する事により、蓄積メディアや通信路から狭帯域音声符号を受信した場合、狭帯域音声を再分析する必要がないので少ない処理量で復元ができる効果がある。また、合成時の補間や分析時の窓掛等による歪が重畳しないので、より良い品質の広帯域音声が復元できる効果がある。なお、実施例１と同じ効果も持っている。 With this configuration, when a narrowband speech code is received from a storage medium or a communication path, there is no need to re-analyze the narrowband speech, so that there is an effect that restoration can be performed with a small processing amount. In addition, since distortion due to interpolation at the time of synthesis or windowing at the time of analysis is not superimposed, there is an effect that better quality wideband speech can be restored. Note that the same effect as in the first embodiment is also obtained.

　なお、狭帯域音声復号手段１０９は、狭帯域スペクトルパラメータ４と狭帯域音源信号６を入力して、狭帯域音声信号１を合成する構成でも良いし、逆に狭帯域音声復号手段１０９内の復号過程の中間パラメータとして算出される狭帯域スペクトルパラメータ４と狭帯域音源信号６を広帯域スペクトル復号手段１０５と広帯域音源復号手段１０６に入力する構成も可能である。この場合、重複している処理を省く事ができ、更に少ない処理量で広帯域音声が復元できる効果がある。 Note that the narrow-band speech decoding means 109 may be configured to input the narrow-band spectrum parameter 4 and the narrow-band sound source signal 6 and synthesize the narrow-band speech signal 1, or conversely, to decode the narrow-band speech decoding means 109. A configuration is also possible in which the narrowband spectral parameter 4 and the narrowband excitation signal 6 calculated as intermediate parameters in the process are input to the wideband spectrum decoding means 105 and the wideband excitation decoding means 106. In this case, there is an effect that the overlapping processing can be omitted and the wideband sound can be restored with a smaller processing amount.

　また、狭帯域音声符号１０１から、ピッチ周期符号とパワー符号が分離できる場合には、これらの符号からピッチ周期とパワー情報を復号して、前記広帯域スペクトルパラメータ１３とこのピッチ周期とパワー情報を用いて文献１と同じ方法で仮の広帯域合成音を生成する構成も可能である。 When the pitch cycle code and the power code can be separated from the narrowband speech code 101, the pitch cycle and the power information are decoded from these codes, and the broadband spectrum parameter 13 and the pitch cycle and the power information are used. Thus, a configuration in which a temporary broadband synthesized sound is generated in the same manner as in Reference 1 is also possible.

　実施例１８．
　図１９は本発明の実施例１８の広帯域音声復元装置の広帯域音源復号手段１０６の構成図である。図において新規な部分は、１１０の狭帯域ピッチ符号、１１１の狭帯域パワー符号、１１２の広帯域ピッチ復号手段、１１３の広帯域ピッチ周期、１１４の広帯域パワー復号手段、１１５の広帯域パワー、１１６の音源生成手段である。その他は、実施例１７と同じであり、説明を省略する。 Embodiment 18 FIG.
FIG. 19 is a configuration diagram of the wideband sound source decoding means 106 of the wideband speech restoration apparatus according to the eighteenth embodiment of the present invention. In the figure, the novel parts are a narrow band pitch code 110, a narrow band power code 111, a wide band pitch decoding unit 112, a wide band pitch period 113, a wide band power decoding unit 114, a wide band power 115, and a sound source generation 116. Means. Other points are the same as those of the seventeenth embodiment, and the description is omitted.

　この実施例は、前記分離手段１０２にて簡単に狭帯域ピッチ符号１１０と狭帯域パワー符号１１１が分離できるような狭帯域音声符号１０１が入力される場合に限られる。この場合には図１９の構成が意味を持つ。 This embodiment is limited to the case where the narrow band speech code 101 is input so that the narrow band pitch code 110 and the narrow band power code 111 can be easily separated by the separating means 102. In this case, the configuration of FIG. 19 is significant.

　以下、図１９を用いて本発明の一実施例の動作について説明する。 Hereinafter, the operation of the embodiment of the present invention will be described with reference to FIG.

　狭帯域音源符号１０４として、狭帯域ピッチ符号１１０と狭帯域パワー符号１１１が広帯域音源復号手段１０６に入力される。 As the narrow band excitation code 104, the narrow band pitch code 110 and the narrow band power code 111 are input to the wide band excitation decoding means 106.

　広帯域音源復号手段１０６内の広帯域ピッチ復号手段１１２は、狭帯域ピッチ符号１１０を用いて広帯域ピッチ周期１１３を推定する。推定の方法としては、狭帯域ピッチ符号１１０から狭帯域ピッチ周期を復号してその値をＭ倍してもよいが、その結果をテーブルとして持っておいて狭帯域ピッチ符号１１０に対応するテーブル成分を読みだす事で求めてもよい。広帯域 The wideband pitch decoding unit 112 in the wideband excitation decoding unit 106 estimates the wideband pitch period 113 using the narrowband pitch code 110. As an estimation method, a narrow band pitch period may be decoded from the narrow band pitch code 110 and its value may be multiplied by M, but the result is stored as a table and a table component corresponding to the narrow band pitch code 110 is stored. You may ask by reading out.

　次に、広帯域音源復号手段１０６内の広帯域パワー復号手段１１４は、狭帯域パワー符号１１１を用いて広帯域パワー１１５を推定する。推定の方法としては、狭帯域パワー符号１１１から狭帯域パワーを復号してその値をｇ倍してもよいが、その結果をテーブルとして持っておいて狭帯域パワー符号１１１に対応するテーブル成分を読みだす事で求めてもよい。 Next, the wideband power decoding unit 114 in the wideband excitation decoding unit 106 estimates the wideband power 115 using the narrowband power code 111. As an estimation method, the narrow-band power code 111 may be decoded from the narrow-band power code 111 and its value may be multiplied by g, but the result is stored as a table and the table component corresponding to the narrow-band power code 111 is stored. You may ask for it by reading it out.

　音源生成手段１１６は、前記広帯域ピッチ周期１１３を繰り返し周期として、固定音源を並べ立てた信号を出力し、最後にこの音源生成手段１１６の出力信号に広帯域パワー１１５を乗じて、広帯域音源信号１６として出力する。 The sound source generating means 116 outputs a signal in which fixed sound sources are arranged side by side with the broadband pitch period 113 as a repetition period. Finally, the output signal of the sound source generating means 116 is multiplied by a wideband power 115 and output as a wideband sound source signal 16. I do.

　この様に構成する事により、実施例１７が持つ効果に加えて、狭帯域音源信号の復号を行わずに直接広帯域音源信号１６が生成されるので、少ない処理量で復元ができる効果がある。 With such a configuration, in addition to the effects of the seventeenth embodiment, the wideband excitation signal 16 is directly generated without decoding the narrowband excitation signal, so that there is an effect that restoration can be performed with a small amount of processing.

　実施例１９．
　図２０は本発明の実施例１９の広帯域音声復元装置の広帯域音源復号手段１０６の構成図である。図において新規な部分は、１１７の狭帯域適応音源符号、１１８の狭帯域駆動音源符号、１１９の広帯域適応音源復号手段、１２０の広帯域駆動音源復号手段、１２１の狭帯域適応音源復号手段、１２２の狭帯域駆動音源復号手段である。その他は、前記したものと同じであり、説明を省略する。 Embodiment 19 FIG.
FIG. 20 is a configuration diagram of the wideband sound source decoding means 106 of the wideband speech restoration apparatus according to the nineteenth embodiment of the present invention. In the figure, a new portion is a narrow band adaptive excitation code 117, a narrow band excitation code 118, a wide band adaptive excitation decoding device 119, a wide band driving excitation decoding device 120, a narrow band adaptive excitation decoding device 121 and a narrow band excitation code 122. This is a narrow-band drive sound source decoding means. Others are the same as those described above, and the description is omitted.

　この実施例は、前記分離手段１０２にて入力の狭帯域音声符号から簡単に狭帯域適応音源符号１１７と狭帯域駆動音源符号１１８が分離できるような狭帯域音声符号１０１が入力される場合に限られる。この場合には図２０の構成が意味を持つ。 This embodiment is limited to a case where a narrow-band speech code 101 is input so that the narrow-band adaptive excitation code 117 and the narrow-band excitation code 118 can be easily separated from the input narrow-band speech code by the separating means 102. Can be In this case, the configuration of FIG. 20 is significant.

　以下、図２０を用いて本発明の一実施例の動作について説明する。 Hereinafter, the operation of the embodiment of the present invention will be described with reference to FIG.

　狭帯域音源符号１０４として、狭帯域適応音源符号１１７と狭帯域駆動音源符号１１８が広帯域音源復号手段１０６に入力される。 As the narrow-band excitation code 104, the narrow-band adaptive excitation code 117 and the narrow-band excitation code 118 are input to the wide-band excitation decoding means 106.

　広帯域適応音源復号手段１１９内の狭帯域適応音源復号手段１２１は、前記狭帯域適応音源符号１１７を復号して、得られた狭帯域適応ラグ長２５と狭帯域適応ゲイン２６を出力する。広帯域適応音源復号手段１１９内の広帯域適応音源推定手段３０は、この狭帯域適応ラグ長２５と狭帯域適応ゲイン２６から、広帯域適応音源信号３２を生成し、出力する。広帯域適応音源推定手段３０の動作については、実施例２と同様である。 The narrow band adaptive excitation decoding unit 121 in the wide band adaptive excitation decoding unit 119 decodes the narrow band adaptive excitation code 117, and outputs the obtained narrow band adaptive lag length 25 and narrow band adaptive gain 26. The wideband adaptive excitation estimating means 30 in the wideband adaptive excitation decoding means 119 generates and outputs a wideband adaptive excitation signal 32 from the narrowband adaptive lag length 25 and the narrowband adaptive gain 26. The operation of the wideband adaptive sound source estimating means 30 is the same as in the second embodiment.

　広帯域駆動音源復号手段１２０内の狭帯域駆動音源復号手段１２２は、前記狭帯域駆動音源符号１１８を復号して、得られた狭帯域駆動音源信号２４を出力する。広帯域駆動音源復号手段１２０内の広帯域駆動音源推定手段２７は、この狭帯域駆動音源信号２４から広帯域駆動音源信号２９を推定し、出力する。広帯域駆動音源推定手段２７の動作は、実施例２ないし実施例４と同様である。狭 The narrow band drive excitation decoding unit 122 in the wide band drive excitation decoding unit 120 decodes the narrow band drive excitation code 118 and outputs the obtained narrow band drive excitation signal 24. The broadband drive excitation estimating means 27 in the wideband drive excitation decoding means 120 estimates the wideband drive excitation signal 29 from the narrowband drive excitation signal 24 and outputs it. The operation of the broadband driving sound source estimating means 27 is the same as in the second to fourth embodiments.

　最後に、広帯域適応音源信号３２と広帯域駆動音源信号２９を加算して、広帯域音源信号１６として出力する。 Finally, the broadband adaptive excitation signal 32 and the broadband drive excitation signal 29 are added and output as the broadband excitation signal 16.

　この様に構成する事により、実施例２ないし実施例４および実施例１７が持つ効果に加えて、狭帯域音源信号の復号を行わずに直接広帯域音源信号１６が生成されるので、少ない処理量で復元ができる効果がある。 With this configuration, in addition to the effects of the second to fourth embodiments and the seventeenth embodiment, since the wideband excitation signal 16 is directly generated without decoding the narrowband excitation signal, a small amount of processing is performed. Has the effect of being able to restore.

　更に、基本周波数とその高調波成分の周波数が正しく整数倍の位置に並ぶので、最終的に復元される広帯域音声信号での狭帯域成分と復元広帯域成分のつながりが良く、高品質な広帯域音声を復元できる効果がある。 Furthermore, since the fundamental frequency and the frequency of its harmonic component are correctly aligned at integer multiple positions, the connection between the narrowband component and the restored wideband component in the finally restored broadband audio signal is good, and high-quality wideband speech is reproduced. There is an effect that can be restored.

　また、有声無声情報やピッチ周期情報を用いないので、中間的な性質の音源も表現できるので、雑音が重畳した狭帯域音声信号に対して起こりがちな有声無声判定誤りやピッチ抽出誤りの影響がなく、有声無声境界付近でも良好な広帯域音源を推定することができ、安定で自然な音質の広帯域音声を復元することができる効果がある。 In addition, since voiced unvoiced information and pitch period information are not used, sound sources of intermediate characteristics can be expressed, so that the effects of voiced unvoiced decision errors and pitch extraction errors that tend to occur on narrowband speech signals with superimposed noise are reduced. In addition, a good wideband sound source can be estimated even near a voiced / unvoiced boundary, and there is an effect that a wideband sound with stable and natural sound quality can be restored.

　実施例２０．
　図２１は本発明の実施例２０の広帯域音声復元装置の広帯域音源復号手段１０６の構成図である。図において新規な部分は、１２３の狭帯域長周期予測符号、１２４の広帯域長周期予測パラメータ（符号）復号手段、１２５の狭帯域長周期予測パラメータ（符号）復号手段、１２６の狭帯域長周期予測残差符号、１２７の広帯域長周期予測残差復号手段、１２８の狭帯域長周期予測残差復号手段である。その他は、前記したものと同じであり、説明を省略する。 Embodiment 20 FIG.
FIG. 21 is a configuration diagram of the wideband sound source decoding means 106 of the wideband sound restoration apparatus according to the twentieth embodiment of the present invention. In the figure, new parts are a narrow band long cycle prediction code 123, a wide band long cycle prediction parameter (code) decoding unit 124, a narrow band long cycle prediction parameter (code) decoding unit 125, and a narrow band long cycle prediction 126. Residual codes, 127 are wideband long-period prediction residual decoding means, and 128 are narrowband long-period prediction residual decoding means. Others are the same as those described above, and the description is omitted.

　この実施例は、前記分離手段１０２にて入力の狭帯域音声符号から簡単に狭帯域長周期予測符号１２３と狭帯域長周期予測残差符号１２６が分離できるような狭帯域音声符号１０１が入力される場合に限られる。この場合には図２１の構成が意味を持つ。 In this embodiment, a narrow-band speech code 101 is input so that the separation means 102 can easily separate a narrow-band long-period prediction code 123 and a narrow-band long-period prediction residual code 126 from an input narrow-band speech code. Limited to In this case, the configuration of FIG. 21 is significant.

　以下、図２１を用いて本発明の一実施例の動作について説明する。 Hereinafter, the operation of the embodiment of the present invention will be described with reference to FIG.

　狭帯域音源符号１０４として、狭帯域長周期予測符号１２３と狭帯域長周期予測残差符号１２６が広帯域音源復号手段１０６に入力される。 As the narrow-band excitation code 104, the narrow-band long-period prediction code 123 and the narrow-band long-period prediction residual code 126 are input to the wideband excitation decoding means 106.

　広帯域長周期予測パラメータ（符号）復号手段１２４内の狭帯域長周期予測パラメータ復号手段１２５は、前記狭帯域長周期予測符号１２３を復号して、得られた予測符号の１つである狭帯域長周期遅延３８と、他の予測符号である狭帯域長周期予測係数３９を出力する。広帯域長周期予測パラメータ復号手段１２４内の広帯域長周期予測パラメータ推定手段４４は、この狭帯域長周期遅延３８と狭帯域長周期予測係数３９から、長周期予測符号の１つである広帯域長周期遅延４５と、他の長周期予測符号の１つである広帯域長周期予測係数４６を推定し、出力する。広帯域長周期予測パラメータ推定手段４４の動作については、実施例５と同様である。 A narrow-band long-period prediction parameter decoding unit 125 in the wide-band long-period prediction parameter (code) decoding unit 124 decodes the narrow-band long-period prediction code 123 and obtains a narrow-band length, which is one of the obtained prediction codes. A period delay 38 and a narrow band long period prediction coefficient 39 which is another prediction code are output. The wide-band long-period prediction parameter estimating unit 44 in the wide-band long-period prediction parameter decoding unit 124 calculates a wide-band long-period delay, which is one of the long-period prediction codes, from the narrow-band long-period delay 38 and the narrow-band long-period prediction coefficient 39. 45 and a wide-band long-period prediction coefficient 46, which is one of the other long-period prediction codes, are estimated and output. The operation of the wideband long-period prediction parameter estimating means 44 is the same as in the fifth embodiment.

　広帯域長周期予測残差復号手段１２７内の狭帯域長周期予測残差復号手段１２８は、前記狭帯域長周期予測残差符号１２６を復号して、得られた狭帯域長周期予測残差信号４１を出力する。広帯域長周期予測残差復号手段１２７内の広帯域長周期予測残差推定手段４２は、この狭帯域長周期予測残差信号４１から広帯域長周期予測残差信号４８を推定し、出力する。広帯域長周期予測残差推定手段４２の動作は、実施例５ないし実施例７と同様である。 The narrow-band long-period prediction residual decoding unit 128 in the wide-band long-period prediction residual decoding unit 127 decodes the narrow-band long-period prediction residual code 126 and obtains the obtained narrow-band long-period prediction residual signal 41. Is output. The wide-band long-period prediction residual estimating unit 42 in the wide-band long-period prediction residual decoding unit 127 estimates the wide-band long-period prediction residual signal 48 from the narrow-band long-period prediction residual signal 41 and outputs it. The operation of the wideband long-period prediction residual estimation unit 42 is the same as in the fifth to seventh embodiments.

　この様に構成する事により、実施例５ないし実施例７および実施例１７が持つ効果に加えて、狭帯域音源信号の復号を行わずに直接広帯域音源信号１６が生成されるので、少ない処理量で復元ができる効果がある。 With this configuration, in addition to the effects of the fifth to seventh and seventeenth embodiments, since the wideband excitation signal 16 is directly generated without decoding the narrowband excitation signal, a small amount of processing is required. Has the effect of being able to restore.

　実施例２１．
　実施例１７ないし実施例２０では、狭帯域スペクトル符号１０３から狭帯域スペクトルパラメータ４を復号した後に広帯域スペクトルパラメータ１３の推定を行っているが、狭帯域スペクトル符号１０３によって広帯域スペクトル符号帳を参照する事で直接広帯域スペクトルパラメータ１３を算出する構成も可能である。 Embodiment 21 FIG.
In the seventeenth to twentieth embodiments, the wideband spectral parameter 13 is estimated after decoding the narrowband spectral parameter 4 from the narrowband spectral code 103. However, the narrowband spectral code 103 refers to the wideband spectral codebook. It is also possible to calculate the broadband spectrum parameter 13 directly by using.

　この様に構成する事により、実施例１７ないし実施例２０が持つ効果に加えて、更に少ない処理量で復元ができる効果がある。 With this configuration, in addition to the effects of the seventeenth to twentieth embodiments, there is an effect that restoration can be performed with a smaller processing amount.

　実施例２２．
　図２２は本発明の一実施例である広帯域音声復元装置の構成図である。図において新規な部分は、１２９の狭帯域パワー復号手段、１３０の広帯域正規化音源復号手段である。 Embodiment 22 FIG.
FIG. 22 is a configuration diagram of a wideband audio restoration apparatus according to an embodiment of the present invention. The new parts in the figure are the narrowband power decoding means 129 and the wideband normalized excitation decoding means 130.

　広帯域スペクトル推定手段７は実施例１１と同じであり、その他は前記したものと同じであり、説明を省略する。 The wideband spectrum estimating means 7 is the same as that of the eleventh embodiment, and the other parts are the same as those described above.

　以下、図２２を用いて本発明の一実施例の動作について説明する。 The operation of the embodiment of the present invention will be described below with reference to FIG.

　狭帯域パワー復号手段１２９は、狭帯域音源符号１０４の中に含まれる狭帯域振幅情報からパワーに関する部分を復号して、得られた狭帯域音源パワー５２を広帯域スペクトル推定手段７に対して出力する。広帯域スペクトル推定手段７は、狭帯域スペクトルパラメータ４と狭帯域音源パワー５２を用いて、広帯域スペクトルパラメータ１３と広帯域音源パワー５８を推定する。 The narrow-band power decoding unit 129 decodes a portion related to power from the narrow-band amplitude information included in the narrow-band excitation code 104 and outputs the obtained narrow-band excitation power 52 to the wide-band spectrum estimation unit 7. . The wideband spectrum estimating means 7 estimates the wideband spectrum parameter 13 and the wideband excitation power 58 using the narrowband spectrum parameter 4 and the narrowband excitation power 52.

　広帯域正規化音源復号手段１３０は、狭帯域音源符号１０４の中に含まれる狭帯域パワーに関する部分以外を用いて、パワーが正規化された広帯域の音源信号を推定し、広帯域正規化音源信号５６として出力する。この広帯域正規化音源復号手段１３０における処理には、実施例１８ないし実施例２０と同様なものを用いる事ができる。そして、この広帯域正規化音源信号５６に前記広帯域音源パワー５８を乗じて広帯域音源信号１６を生成する。 The wideband normalized excitation decoding means 130 estimates a wideband excitation signal whose power has been normalized by using a portion other than the portion related to the narrowband power included in the narrowband excitation code 104. Output. For the processing in the wideband normalized excitation decoding means 130, the same processing as in the eighteenth to twentieth embodiments can be used. Then, the broadband normalized excitation signal 56 is multiplied by the broadband excitation power 58 to generate the broadband excitation signal 16.

　この様に構成する事により、実施例１１および実施例１８ないし実施例２０が持つ効果を合わせ持つ事ができる。なお、実施例９や実施例１０のように広帯域スペクトル推定手段７が広帯域スペクトルパラメータ１３もしくは広帯域音源パワー５８の一方だけを推定する構成も可能である。 With such a configuration, it is possible to combine the effects of the eleventh embodiment and the eighteenth and twentieth embodiments. Note that a configuration in which the wideband spectrum estimating means 7 estimates only one of the wideband spectrum parameter 13 and the wideband sound source power 58 as in the ninth and tenth embodiments is also possible.

　実施例２３．
　実施例１７ないし実施例２２において、合成フィルタ１７と帯域フィルタ１８の間にポストフィルタ手段６１を挿入した構成も可能である。また、ポストフィルタ手段６１と帯域フィルタ１８が逆の位置の構成も可能であるし、広帯域音声信号２０に対してポストフィルタ手段６１をかける構成も可能である。 Embodiment 23 FIG.
In the seventeenth to twenty-second embodiments, a configuration in which post filter means 61 is inserted between the synthesis filter 17 and the bandpass filter 18 is also possible. Further, a configuration in which the post-filter means 61 and the band-pass filter 18 are in opposite positions is possible, and a configuration in which the post-filter means 61 is applied to the wideband audio signal 20 is also possible.

　この様に構成する事により、狭帯域音声復号手段１０９内でポストフィルタ処理が行なわれる場合に、狭帯域部と復元した帯域の連続性を良くする事ができる。また、実施例１２および実施例１７ないし実施例２２が持つ効果を合わせ持つ事ができる。 With this configuration, when post-filter processing is performed in the narrow-band speech decoding unit 109, continuity between the narrow-band portion and the restored band can be improved. In addition, the effects of the twelfth embodiment and the seventeenth to twenty-second embodiments can be combined.

　実施例２４．
　図１８から広帯域音源復号手段１０６を外した構成において、合成フィルタ１７と帯域フィルタ１８の間にポストフィルタ手段６１を挿入した構成も可能である。また、ポストフィルタ手段６１と帯域フィルタ１８が逆の位置の構成も可能であるし、広帯域音声信号２０に対してポストフィルタ手段６１をかける構成も可能である。 Embodiment 24 FIG.
In a configuration in which the wideband excitation decoding means 106 is removed from FIG. 18, a configuration in which a post filter means 61 is inserted between the synthesis filter 17 and the bandpass filter 18 is also possible. Further, a configuration in which the post-filter means 61 and the band-pass filter 18 are in opposite positions is possible, and a configuration in which the post-filter means 61 is applied to the wideband audio signal 20 is also possible.

　この構成は、文献１に本発明の請求項９と請求項１５を適用したものに相当し、狭帯域音声復号手段１０９内でポストフィルタ処理が行なわれる場合に、狭帯域部と復元した帯域の連続性を良くする事ができる効果がある。 This configuration corresponds to the configuration in which claims 9 and 15 of the present invention are applied to Document 1, and when post-filter processing is performed in narrow-band speech decoding means 109, the narrow-band portion and the restored band There is an effect that continuity can be improved.

　以下に、各実施例の特徴をまとめて記載する。 (4) The features of each embodiment are described below.

　前述した広帯域音声復元装置は、狭帯域音声信号を分析して狭帯域スペクトルパラメータと狭帯域音源信号を得る分析手段と、この狭帯域スペクトルパラメータを用いて広帯域スペクトルパラメータを推定するスペクトル推定手段と、狭帯域音源信号を用い広帯域音源信号を推定する広帯域音源推定手段と、この推定された広帯域スペクトルパラメータと広帯域音源信号とから広帯域音声信号を生成する合成手段を備えた。 The above-described wideband audio restoration apparatus includes an analyzing unit that analyzes a narrowband audio signal to obtain a narrowband spectral parameter and a narrowband sound source signal; a spectrum estimating unit that estimates a wideband spectral parameter using the narrowband spectral parameter; Broadband sound source estimating means for estimating a wideband sound source signal using a narrowband sound source signal, and synthesizing means for generating a wideband speech signal from the estimated wideband spectral parameters and the wideband sound source signal are provided.

　また更に、広帯域音源推定手段として、入力の狭帯域音源信号の各サンプル間隔中に所定の零値を挿入する零詰手段を用いた。 {Circle around (4)} As the wideband sound source estimating means, a zero filling means for inserting a predetermined zero value into each sample interval of the input narrowband sound source signal is used.

　また、広帯域音源推定手段は、入力の狭帯域音源信号を分析して狭帯域適応音源符号と狭帯域駆動音源信号を得る音源分析手段と、この狭帯域適応音源符号を用いて広帯域適応音源信号を推定する適応音源推定手段と、狭帯域駆動音源信号を用いて広帯域駆動音源信号を推定する駆動音源推定手段と、この推定された広帯域適応音源信号と広帯域駆動音源信号とから広帯域音源信号を生成する加算手段とで構成した。 Further, the wideband excitation estimating means analyzes the input narrowband excitation signal to obtain a narrowband adaptive excitation code and a narrowband driving excitation signal, and a wideband adaptive excitation signal using the narrowband adaptive excitation code. Adaptive sound source estimating means for estimating, driving sound source estimating means for estimating a wide band driving sound source signal using a narrow band driving sound source signal, and generating a wide band sound source signal from the estimated wide band adaptive sound source signal and the wide band driving sound source signal. And an adding means.

　または、広帯域音源推定手段は、入力の狭帯域音源信号を分析して狭帯域長周期予測符号と狭帯域長周期予測残差信号を得る音源分析手段と、この狭帯域長周期予測残差信号を用いて広帯域長周期予測残差信号を推定する長周期予測残差推定手段と、狭帯域長周期予測符号を用いて広帯域長周期予測符号を推定する広帯域長周期予測符号推定手段と、これら推定された広帯域長周期予測残差信号と広帯域長周期予測符号とから広帯域音源信号を合成する長周期合成手段とで構成した。 Alternatively, the wideband sound source estimating means analyzes the input narrowband sound source signal to obtain a narrowband long-period prediction code and a narrowband long-period prediction residual signal. A long-period prediction residual estimator for estimating a wide-band long-period prediction residual signal using a wide-band long-period prediction code using a narrow-band long-period prediction code; And a long-period synthesizing means for synthesizing a wideband excitation signal from the wideband long-period prediction residual signal and the wideband long-period prediction code.

　他の広帯域音声復元装置は、狭帯域音声信号を分析して狭帯域スペクトルパラメータと狭帯域振幅情報とを得る分析手段と、この狭帯域スペクトルパラメータと狭帯域振幅情報を用いて少なくとも広帯域スペクトルパラメータまたは広帯域振幅情報を推定するスペクトル推定手段と、これら推定された広帯域スペクトルパラメータと広帯域振幅情報または広帯域音源信号とから広帯域音声信号を生成する合成手段を備えた。 Another wideband audio restoration apparatus is an analyzing means for analyzing a narrowband audio signal to obtain a narrowband spectral parameter and narrowband amplitude information, and using the narrowband spectral parameter and the narrowband amplitude information at least a wideband spectral parameter or A spectrum estimating means for estimating broadband amplitude information, and a synthesizing means for generating a wideband speech signal from the estimated wideband spectral parameters and the wideband amplitude information or the wideband excitation signal are provided.

　または、狭帯域音声信号を用いて広帯域音声信号を推定する広帯域推定手段と、推定された広帯域音声信号に対してポストフィルタリングを行うポストフィルタ手段を備えた。 Or a broadband estimating unit for estimating a wideband audio signal using a narrowband audio signal, and a post-filter unit for performing post-filtering on the estimated wideband audio signal.

　または、狭帯域音声信号を分析して狭帯域スペクトルパラメータを得る分析手段と、狭帯域スペクトルパラメータをそのまま広帯域スペクトルパラメータとして用いて広帯域スペクトルパラメータを出力するスペクトル推定手段と、この出力された広帯域スペクトルパラメータから広帯域音声信号を生成する合成手段を備えた。 An analyzing means for analyzing a narrowband speech signal to obtain a narrowband spectral parameter; a spectrum estimating means for outputting a wideband spectral parameter using the narrowband spectral parameter as it is as a wideband spectral parameter; And a synthesizing means for generating a wideband audio signal from the audio signal.

　または、狭帯域音声信号を分析して狭帯域スペクトルパラメータを得る分析手段と、狭帯域スペクトルパラメータを必要に応じて別領域に変換し、変形を行い、スペクトルパラメータの領域に逆変換して広帯域スペクトルパラメータを出力するスペクトル推定手段と、この出力された広帯域スペクトルパラメータから広帯域音声信号を生成する合成手段を備えた。 Or analyzing means for analyzing a narrowband speech signal to obtain a narrowband spectral parameter, and converting the narrowband spectral parameter to another region as necessary, performing deformation, and inversely converting to a spectral parameter region to obtain a wideband spectrum. A spectrum estimating means for outputting parameters and a synthesizing means for generating a wideband speech signal from the outputted wideband spectral parameters are provided.

　他の広帯域音声復元装置は、狭帯域音声符号から広帯域スペクトルパラメータを推定するスペクトル復号手段と、この推定された広帯域スペクトルパラメータから広帯域音声信号を生成する合成手段を備えた。広帯域 The other wideband speech restoration apparatus includes spectrum decoding means for estimating a wideband spectrum parameter from a narrowband speech code, and synthesis means for generating a wideband speech signal from the estimated wideband spectrum parameter.

　または、狭帯域音声符号から分離された狭帯域スペクトル符号を用いて広帯域スペクトルパラメータを推定するスペクトル復号手段と、狭帯域音声符号から分離された狭帯域音源符号を用いて広帯域音源信号を推定する広帯域音源復号手段と、この推定された広帯域スペクトルパラメータと広帯域音源信号とから広帯域音声信号を生成する合成手段を備えた。 Alternatively, spectrum decoding means for estimating a wideband spectral parameter using a narrowband spectral code separated from a narrowband speech code, and a wideband for estimating a wideband excitation signal using a narrowband excitation code separated from the narrowband speech code Excitation decoding means and synthesis means for generating a wideband speech signal from the estimated wideband spectrum parameters and the wideband excitation signal are provided.

　また更に、広帯域音源復号手段として、狭帯域音源符号から復元した狭帯域音源信号の各サンプル間隔中に所定の零値を挿入する零詰手段を用いた。 {Circle around (4)} Further, as the wideband excitation decoding means, zero filling means for inserting a predetermined zero value into each sample interval of the narrowband excitation signal restored from the narrowband excitation code is used.

　または、広帯域音源復号手段は、入力の狭帯域音声符号から分離した狭帯域適応音源符号を用いて広帯域適応音源信号を推定する広帯域適応音源復号手段と、入力の狭帯域音声符号から分離した狭帯域駆動音源符号を用いて広帯域駆動音源信号を推定する広帯域駆動音源復号手段と、これらの推定された広帯域適応音源信号と広帯域駆動音源信号とから広帯域音源信号を生成する加算手段とで構成した。 Alternatively, the wideband excitation decoding means comprises a wideband adaptive excitation decoding means for estimating a wideband adaptive excitation signal using a narrowband adaptive excitation code separated from the input narrowband speech code, and a narrowband excitation separation means separated from the input narrowband speech code. Broadband driving excitation signal decoding means for estimating a broadband driving excitation signal using a driving excitation code, and adding means for generating a wideband excitation signal from the estimated wideband adaptive excitation signal and the wideband driving excitation signal.

　または、広帯域音源復号手段は、入力の狭帯域音声符号から分離した狭帯域長周期予測符号を用いて広帯域長周期予測符号を推定する広帯域長周期予測符号復号手段と、入力の狭帯域音声符号から分離した狭帯域長周期予測残差符号を用いて広帯域長周期予測残差信号を推定する広帯域長周期予測残差復号手段と、これら推定された広帯域長周期予測符号と広帯域長周期予測残差信号とから広帯域音源信号を生成する加算手段とで構成した。 Alternatively, the wideband sound source decoding unit includes a wideband long period prediction code decoding unit that estimates a wideband long period prediction code using a narrowband long period prediction code separated from the input narrowband speech code, and Wideband long-period prediction residual decoding means for estimating a wide-band long-period prediction residual signal using the separated narrow-band long-period prediction residual code, and these estimated wideband long-period prediction residual signal and wideband long-period prediction residual signal And an adding means for generating a broadband sound source signal from the above.

　または、狭帯域音声符号から分離された狭帯域音源符号を用いて狭帯域振幅情報を推定する狭帯域振幅情報復号手段と、狭帯域音声符号から分離された狭帯域スペクトル符号と狭帯域振幅情報を用いて少なくとも広帯域スペクトルパラメータまたは広帯域振幅情報を推定するスペクトル復号手段と、この推定された広帯域スペクトルパラメータと必要に応じて広帯域振幅情報または広帯域音源信号とから広帯域音声信号を生成する合成手段を備えた。 Alternatively, a narrow-band amplitude information decoding means for estimating narrow-band amplitude information using a narrow-band excitation code separated from a narrow-band speech code, and a narrow-band spectrum code and narrow-band amplitude information separated from the narrow-band speech code. A spectrum decoding means for estimating at least a wideband spectral parameter or a wideband amplitude information by use thereof, and a synthesizing means for generating a wideband speech signal from the estimated wideband spectral parameter and the wideband amplitude information or the wideband excitation signal as required. .

　または、狭帯域音声符号を用いて広帯域音声信号を推定する広帯域音声復号手段と、この復号し推定された広帯域音声信号に対してポストフィルタリングを行うポストフィルタ手段を備えた。 Or a wideband speech decoding means for estimating a wideband speech signal using a narrowband speech code, and post-filter means for performing post-filtering on the decoded and estimated wideband speech signal.

　前述した広帯域音声復元装置は、狭帯域スペクトルパラメータを用いて推定した広帯域スペクトルパラメータと、狭帯域音源信号を用いて推定した広帯域音源信号とから広帯域音声信号が合成される。 {Circle around (4)} The wideband speech restoration apparatus described above synthesizes a wideband speech signal from the wideband spectrum parameters estimated using the narrowband spectrum parameters and the wideband excitation signal estimated using the narrowband excitation signal.

　また、狭帯域音源信号の各サンプル間に所定個ずつの零を挿入する事で広帯域音源信号が生成され、これと推定した広帯域スペクトルとを用いて広帯域音声信号が合成される。 {Circle around (2)} A wideband sound source signal is generated by inserting a predetermined number of zeros between each sample of the narrowband sound source signal, and a wideband speech signal is synthesized using this and the estimated wideband spectrum.

　また、広帯域音源信号の推定にあたっては、入力の狭帯域音源信号を分析して狭帯域適応音源符号と狭帯域駆動音源信号が算出され、この狭帯域適応音源符号を用いて推定した広帯域適応音源信号と、狭帯域駆動音源を用いて推定した広帯域駆動音源信号とを加算して広帯域音源信号とした。これと推定した広帯域スペクトルとを用いて広帯域音声信号が合成される。 In estimating the wideband excitation signal, the input narrowband excitation signal is analyzed to calculate a narrowband adaptive excitation code and a narrowband driving excitation signal, and the wideband adaptive excitation signal estimated using the narrowband adaptive excitation code is calculated. And the broadband driving sound source signal estimated using the narrow band driving sound source were added to obtain a wideband sound source signal. A wideband speech signal is synthesized using this and the estimated wideband spectrum.

　また、他の広帯域音源信号の推定のやり方として、入力狭帯域音源信号を分析して狭帯域長周期予測符号と狭帯域長周期残差信号が算出され、狭帯域長周期予測符号を用いて推定した広帯域長周期予測符号と、狭帯域長周期残差信号を用いて推定した広帯域長周期残差信号とを用いて広帯域音源信号とした。これと推定した広帯域スペクトルとを用いて広帯域音声信号が合成される。 As another method of estimating the wideband excitation signal, the input narrowband excitation signal is analyzed to calculate a narrowband long-period prediction code and a narrowband long-period residual signal, and the estimation is performed using the narrowband long-period prediction code. The wideband long-period prediction code and the wideband long-period residual signal estimated using the narrow-band long-period residual signal are used as a wideband excitation signal. A wideband speech signal is synthesized using this and the estimated wideband spectrum.

　また、他の広帯域音声復元装置は、狭帯域音声信号を分析して狭帯域スペクトルパラメータと狭帯域振幅情報と狭帯域音源信号が算出され、狭帯域スペクトルパラメータと狭帯域振幅情報を用いて広帯域スペクトルパラメータと広帯域振幅情報のいずれかまたはその両方が推定される。その後、これらの信号と狭帯域音源信号から推定された広帯域音源信号とで広帯域音声信号が合成される。 Further, other wideband speech restoration apparatuses analyze a narrowband speech signal to calculate a narrowband spectrum parameter, narrowband amplitude information, and a narrowband excitation signal, and use the narrowband spectrum parameter and the narrowband amplitude information to calculate a wideband spectrum. One or both of the parameters and the broadband amplitude information are estimated. Thereafter, a wideband audio signal is synthesized with these signals and a wideband excitation signal estimated from the narrowband excitation signal.

　また、他の広帯域音声復元装置は、狭帯域音声信号を用いて推定した広帯域音声信号にポストフィルタリングが行われ、主として、高域特性が加工される。 In another wideband audio restoration device, post-filtering is performed on a wideband audio signal estimated using a narrowband audio signal, and mainly high-frequency characteristics are processed.

　また、他の広帯域音声復元装置は、狭帯域スペクトルパラメータの特性を全域に伸張して広帯域スペクトルパラメータとして用いて広帯域音声信号が合成される。 Another wideband speech restoration apparatus extends a characteristic of a narrowband spectrum parameter to the whole band and synthesizes a wideband speech signal using the wideband spectrum parameter.

　また、他の広帯域音声復元装置は、狭帯域スペクトルパラメータの特定次数までを用い、これを対応するスペクトルパラメータに逆変換する事で広帯域スペクトルパラメータを得、これを用いて広帯域音声信号が合成される。 Further, other wideband speech restoration apparatuses use wideband spectrum parameters up to a specific order and inversely convert the narrowband spectrum parameters to corresponding spectrum parameters to obtain wideband spectrum parameters, which are used to synthesize a wideband speech signal. .

　また、他の広帯域音声復元装置は、狭帯域音声符号を用いて狭帯域合成音の生成と広帯域音声信号の推定を行い、狭帯域合成音をアップサンプリングした信号または狭帯域合成音に、前記広帯域音声信号の狭帯域合成音以外の主として高域の帯域の成分を抽出した信号を加算して広帯域音声信号が合成される。 Another wide-band speech restoration apparatus generates a narrow-band synthesized sound using a narrow-band speech code and estimates a wide-band speech signal, and converts the narrow-band synthesized sound into an up-sampled signal or a narrow-band synthesized sound. A signal obtained by extracting mainly high-band components other than the narrow-band synthesized sound of the audio signal is added to synthesize a wide-band audio signal.

　また、他の広帯域音声復元装置は、狭帯域スペクトル符号を用いて推定した広帯域スペクトルパラメータと、狭帯域音源符号を用いて推定した広帯域音源信号とを用いて広帯域音声信号が合成される。 {Circle around (2)} In another wideband speech restoration apparatus, a wideband speech signal is synthesized using a wideband spectrum parameter estimated using a narrowband spectrum code and a wideband excitation signal estimated using a narrowband excitation code.

　また、更に、広帯域音源復号手段により、狭帯域音源符号を用いて復号した狭帯域音源の各サンプル間に所定個ずつの零値を挿入する事で広帯域音源信号が生成され、これと推定した広帯域スペクトルとを用いて広帯域音声信号が合成される。 Furthermore, a wideband excitation signal is generated by inserting a predetermined number of zero values between each sample of the narrowband excitation decoded by using the narrowband excitation code by the wideband excitation decoding means, and the estimated wideband excitation signal is generated. A wideband audio signal is synthesized using the spectrum.

　また、他の広帯域音源復号手段により、狭帯域適応音源符号を用いて推定した広帯域適応音源信号と、狭帯域駆動音源信号から推定した広帯域駆動音源信号が加算されて広帯域音源信号が生成される。これと推定した広帯域スペクトルとを用いて広帯域音声信号が合成される。 {Circle around (4)} The wideband excitation signal estimated by using the narrowband adaptive excitation code and the wideband excitation signal estimated from the narrowband excitation signal are added by another wideband excitation decoding means to generate a wideband excitation signal. A wideband speech signal is synthesized using this and the estimated wideband spectrum.

　また、他の広帯域音源復号手段により、狭帯域音源符号を用いて推定した広帯域長周期予測符号と、狭帯域長周期予測残差信号から推定された広帯域長周期残差信号とから広帯域音源信号が合成される。これと推定した広帯域スペクトルとを用いて広帯域音声信号が合成される。 Further, a wideband excitation signal is obtained from another wideband excitation decoding means from a wideband long-period prediction code estimated using the narrowband excitation code and a wideband long-period residual signal estimated from the narrowband long-period prediction residual signal. Synthesized. A wideband speech signal is synthesized using this and the estimated wideband spectrum.

　また、他の広帯域音声復元装置は、狭帯域スペクトル符号と狭帯域振幅情報を用いて広帯域スペクトルパラメータと広帯域振幅情報のいずれかまたはその両方が推定される。その後、これらの情報と狭帯域音源信号から推定された広帯域音源信号とで広帯域音声信号が合成される。 Another wideband speech restoration apparatus estimates either or both of the wideband spectrum parameter and the wideband amplitude information using the narrowband spectrum code and the narrowband amplitude information. Thereafter, a wideband speech signal is synthesized with these pieces of information and the wideband sound source signal estimated from the narrowband sound source signal.

　また、他の広帯域音声復元装置は、狭帯域音声符号を用いて推定した広帯域音声信号にポストフィルタリングが行われ、主として高域特性が加工される。 In another wideband speech restoration apparatus, post-filtering is performed on a wideband speech signal estimated using a narrowband speech code, and mainly high-frequency characteristics are processed.

　以上説明したように、狭帯域音源信号を用いて広帯域音源信号の推定を行い、これを用いて広帯域音声信号を合成するようにしたので、狭帯域音源信号の特徴を良好に広帯域音源信号に与える事ができ、話者に依存性が少なく、安定で自然な音質の広帯域音声を復元することができる効果がある。 As described above, since the wideband sound source signal is estimated using the narrowband sound source signal and the wideband sound signal is synthesized using the narrowband sound source signal, the characteristics of the narrowband sound source signal are favorably given to the wideband sound source signal. Therefore, there is an effect that a wideband voice with stable and natural sound quality can be restored with little dependence on a speaker.

　また、広帯域音源推定手段として、狭帯域音源信号の各サンプル間に所定個ずつの零を挿入する零詰め手段を用いたので、有声無声判定やピッチ抽出が必要なく、有声無声判定誤りやピッチ抽出誤りの影響がない良好な広帯域音源を推定でき、安定で自然な音質の広帯域音声を復元することができる効果がある。 In addition, since the wide-band sound source estimating means uses zero padding means for inserting a predetermined number of zeros between each sample of the narrow-band sound source signal, voiced / unvoiced determination and pitch extraction are not required, and voiced / unvoiced determination error and pitch extraction are not required. There is an effect that it is possible to estimate a good broadband sound source without the influence of an error, and to restore a wideband sound with stable and natural sound quality.

　また、広帯域音源推定手段として、狭帯域適応音源符号と狭帯域駆動音源信号を用いて広帯域適応音源信号と広帯域駆動音源信号を推定するようにし、これから広帯域音源信号を生成するようにしたので、狭帯域音源信号の持つピッチ周期性の強さや変動に関する特徴が良好に広帯域音源信号に反映され、パルス的な音もなく、良好な音質の広帯域音声を復元することができる効果がある。 Also, as the wideband excitation estimation means, the narrowband adaptive excitation code and the narrowband excitation signal are used to estimate the wideband adaptive excitation signal and the broadband excitation signal, and the wideband excitation signal is generated from this. The characteristics related to the strength and fluctuation of the pitch periodicity of the band excitation signal are well reflected in the band excitation signal, and there is an effect that a broadband sound having good sound quality can be restored without pulse-like sound.

　更に、基本周波数とその高調波成分の周波数が正しく整数倍の位置に並ぶので、広帯域音声信号での狭帯域成分と復元広帯域成分のつながりが良く、またピッチ周期性の実際的な性質も復元でき、高品質な広帯域音声を復元できる効果がある。 Furthermore, since the fundamental frequency and the frequency of its harmonic component are correctly aligned at integer multiples, the connection between the narrowband component and the restored broadband component in the wideband audio signal is good, and the practical properties of pitch periodicity can be restored. This has the effect of restoring high quality wideband speech.

　また、広帯域音源推定手段として、狭帯域長周期予測符号と狭帯域長周期残差信号を用いて広帯域長周期予測符号と広帯域長周期残差信号を推定するようにし、これらを用いて広帯域音源信号を合成するようにしたので、狭帯域音源信号の持つピッチ周期性の強さや変動に関する特徴が良好に広帯域音源信号に反映され、パルス的な音もなく、良好な音質の広帯域音声を復元することができる効果がある。 Also, as the wideband excitation source estimating means, the wideband long-period prediction code and the wideband long-period residual signal are estimated using the narrow-band long-period prediction code and the narrow-band long-period residual signal. , The characteristics of the pitch periodicity of the narrow-band sound source signal, such as its strength and fluctuations, are well reflected in the wide-band sound source signal. There is an effect that can be.

　更に、基本周波数とその高調波成分の周波数が正しく整数倍の位置に並ぶので、最終的に復元される広帯域音声信号での狭帯域成分と復元広帯域成分のつながりが良く、実際のピッチ周期性の特性もとり入れることができ、高品質な広帯域音声を復元できる効果がある。 Furthermore, since the fundamental frequency and the frequency of its harmonic component are correctly aligned at integer multiple positions, the connection between the narrowband component and the restored wideband component in the finally restored wideband audio signal is good, and the actual pitch periodicity is improved. Characteristics can also be taken in, and there is an effect that high-quality wideband speech can be restored.

　また、狭帯域スペクトルパラメータと狭帯域振幅情報を用いて広帯域スペクトルパラメータと広帯域振幅情報のいずれか、または両方を推定するようにしたので、広帯域のスペクトルパラメータの推定に狭帯域振幅情報が反映され、より安定に良好なスペクトルが推定でき、より正しい振幅を持った広帯域音声が復元できる効果がある。 Also, because one or both of the wideband spectral parameter and the wideband amplitude information is estimated using the narrowband spectral parameter and the narrowband amplitude information, the narrowband amplitude information is reflected in the estimation of the wideband spectral parameter, There is an effect that a good spectrum can be more stably estimated, and a wideband voice having a more correct amplitude can be restored.

　また更に、狭帯域音声信号を用いて推定した広帯域音声信号にポストフィルタリングを行うようにしたので、復元された広帯域音声信号の音質が不足する場合に、ピッチ周期性の強調、スペクトル包絡の極の強調等の音質改善ができる効果がある。 Furthermore, since the post-filtering is performed on the wideband audio signal estimated using the narrowband audio signal, when the sound quality of the restored wideband audio signal is insufficient, the pitch periodicity is emphasized and the pole of the spectral envelope is reduced. This has the effect of improving sound quality such as emphasis.

　また更に、狭帯域スペクトルパラメータを伸張して広帯域スペクトルパラメータとして用いて広帯域音声信号を合成するようにしたので、極めて簡単におおまかな広帯域スペクトルを復元できる効果がある。また、符号帳を蓄積しておくメモリが不必要で、演算量が少なくなる効果がある。 {Circle around (4)} Since the narrowband spectrum parameter is expanded and used as a wideband spectrum parameter to synthesize a wideband speech signal, there is an effect that a rough broadband spectrum can be restored very easily. In addition, there is no need for a memory for storing the codebook, which has the effect of reducing the amount of calculation.

　また更に、狭帯域スペクトルパラメータの所定次数までを用いてこれをスペクトルパラメータに逆変換する事で広帯域スペクトルパラメータを得るようにしたので、極めて簡単におおまかな広帯域スペクトルを復元できる効果がある。また、符号帳を蓄積しておくメモリが不必要で、演算量が少なくなる効果がある。 {Circle around (4)} Since the wideband spectrum parameter is obtained by inverting the narrowband spectrum parameter up to a predetermined order and converting it into a spectrum parameter, the broadband spectrum can be restored very easily. In addition, there is no need for a memory for storing the codebook, which has the effect of reducing the amount of calculation.

　またこの発明によれば、狭帯域音声符号を用いて狭帯域合成音の生成と広帯域音声信号の推定を行い、狭帯域合成音をアップサンプリングした信号か狭帯域合成音に、広帯域音声信号の狭帯域合成音以外の帯域の成分を抽出して加算したので、符号化された狭帯域音声からでも広帯域音声の復元が可能となり、復号した狭帯域音声を再分析しないので、少ない処理量で復元ができる効果がある。 Further, according to the present invention, a narrow-band synthesized speech is generated using a narrow-band speech code and a wide-band speech signal is estimated, and the narrow-band synthesized speech is narrowed down to a signal obtained by up-sampling the narrow-band synthesized speech or a narrow-band synthesized speech. Since the components of the bands other than the band-synthesized sound are extracted and added, it is possible to restore the wide-band sound even from the encoded narrow-band sound, and the decoded narrow-band sound is not re-analyzed. There is an effect that can be done.

　または、狭帯域スペクトル符号を用いて推定した広帯域スペクトルパラメータと、狭帯域音源符号を用いて推定した広帯域音源信号とを用いて広帯域音声信号を合成するようにしたので、復号した狭帯域音声を再分析する必要がなく、少ない処理量で復元ができる効果がある。また、合成時の補間や分析時の窓掛等による歪が重畳しないので、より良い品質の広帯域音声が復元できる効果がある。 Alternatively, since the wideband speech signal is synthesized using the wideband spectrum parameters estimated using the narrowband spectrum code and the wideband excitation signal estimated using the narrowband excitation code, the decoded narrowband speech is reproduced. There is no need for analysis, and there is an effect that restoration can be performed with a small amount of processing. In addition, since distortion due to interpolation at the time of synthesis or windowing at the time of analysis is not superimposed, there is an effect that better quality wideband speech can be restored.

　また広帯域音源復号手段として、狭帯域音源符号を用いて復号した狭帯域音源の各サンプル間に所定個ずつの零を挿入する零詰め手段を用いたので、有声と無声の中間的な性質の音源も良好に復元でき、安定で自然な音質の広帯域音声を復元することができる効果がある。 In addition, as the wideband excitation decoding means, a zero padding means for inserting a predetermined number of zeros between each sample of the narrowband excitation decoded using the narrowband excitation code is used, so that a sound source having an intermediate characteristic between voiced and unvoiced is used. Can be satisfactorily restored, and there is an effect that a broadband sound with stable and natural sound quality can be restored.

　また、広帯域音源復号手段として、狭帯域音源符号を用いて推定した広帯域適応音源信号と広帯域駆動音源信号を推定するようにし、それを加算して広帯域音源信号としたので、狭帯域音源信号の復号を行わずに直接広帯域音源信号が生成され、少ない処理量で復元ができる効果がある。 Also, as the wideband excitation decoding means, the wideband adaptive excitation signal and the wideband driving excitation signal estimated using the narrowband excitation code are estimated, and the sum is added to obtain the wideband excitation signal. Therefore, a wideband sound source signal is directly generated without performing the above processing, and there is an effect that restoration can be performed with a small amount of processing.

　また、狭帯域音源符号が含んでいるピッチ周期性の強さや変動に関する特徴が良好に広帯域音源信号に反映されるので、良好な音質の広帯域音声を復元することができる効果がある。 (4) Since the characteristics related to the strength and fluctuation of the pitch periodicity included in the narrowband excitation code are well reflected in the wideband excitation signal, there is an effect that a wideband speech having good sound quality can be restored.

　また、広帯域音源復号手段として、狭帯域音源符号を用いて推定した広帯域長周期予測符号と広帯域長周期残差信号とを推定するようにし、これらを用いて広帯域音源信号を合成するようにしたので、狭帯域音源信号の復号を行わずに直接広帯域音源信号が生成され、少ない処理量で復元ができる効果がある。 Also, as the wideband excitation decoding means, the wideband long-period prediction code and the wideband long-period residual signal estimated using the narrowband excitation code are estimated, and the wideband excitation signal is synthesized using these. Thus, the wideband excitation signal is directly generated without decoding the narrowband excitation signal, and the effect is that the restoration can be performed with a small amount of processing.

　また、狭帯域スペクトル符号と狭帯域振幅情報を用いて広帯域スペクトルパラメータと広帯域振幅情報のいずれか、またはその両方を推定するようにしたので、広帯域のスペクトルパラメータの推定に狭帯域振幅情報が反映され、より安定に良好なスペクトルが推定でき、広帯域振幅情報の推定に狭帯域スペクトルパ符号の違いを反映させる事ができるので、より正しい振幅を持った広帯域音声が復元できる効果がある。 In addition, since the wideband spectral parameter and / or the wideband amplitude information are estimated using the narrowband spectral code and the narrowband amplitude information, the narrowband amplitude information is reflected in the estimation of the wideband spectral parameter. Since a good spectrum can be more stably estimated and the difference in the narrowband spectral code can be reflected in the estimation of the broadband amplitude information, there is an effect that a wideband speech having a more correct amplitude can be restored.

　また更に、狭帯域音声符号を用いて推定した広帯域音声信号にポストフィルタリングを行うようにしたので、狭帯域合成音に対してポストフィルタ処理が適用される場合に、狭帯域部と復元した帯域の連続性がよくなる効果がある。また、復元された広帯域音声信号の音質が不足する場合に、ピッチ周期性の強調、スペクトル包絡の極の強調等の音質改善ができる効果がある。 Furthermore, since the post-filtering is performed on the wideband speech signal estimated using the narrowband speech code, when the post-filter processing is applied to the narrowband synthesized sound, the narrowband portion and the restored band are used. This has the effect of improving continuity. Further, when the sound quality of the restored wideband audio signal is insufficient, there is an effect that sound quality can be improved such as enhancement of pitch periodicity and enhancement of poles of a spectrum envelope.

この発明の実施例１の広帯域音声復元装置の構成図である。BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 is a configuration diagram of a wideband audio restoration device according to a first embodiment of the present invention. この発明の実施例１における零詰手段の処理を説明する説明図である。FIG. 3 is an explanatory diagram illustrating a process of a zero filling unit according to the first embodiment of the present invention. この発明の実施例２の広帯域音声復元装置における広帯域音源推定手段の構成図である。FIG. 9 is a configuration diagram of a wideband sound source estimating unit in the wideband sound restoration apparatus according to the second embodiment of the present invention. この発明の実施例２における適応音源信号の一例を説明する説明図である。FIG. 9 is an explanatory diagram illustrating an example of an adaptive excitation signal according to the second embodiment of the present invention. この発明の実施例３の広帯域音声復元装置における広帯域駆動音源推定手段の構成図である。FIG. 11 is a configuration diagram of a wideband driving sound source estimating unit in the wideband audio restoration apparatus according to the third embodiment of the present invention. この発明の実施例４の広帯域音声復元装置における広帯域駆動音源推定手段の構成図である。FIG. 13 is a configuration diagram of a wideband driving sound source estimating unit in a wideband audio restoration apparatus according to a fourth embodiment of the present invention. この発明の実施例５の広帯域音声復元装置における広帯域音源推定手段の構成図である。FIG. 14 is a configuration diagram of a wideband sound source estimating unit in a wideband speech restoration apparatus according to a fifth embodiment of the present invention. この発明の実施例６の広帯域音声復元装置における広帯域駆動音源推定手段の構成図である。FIG. 14 is a configuration diagram of a wideband driving sound source estimating unit in the wideband audio restoration apparatus according to the sixth embodiment of the present invention. この発明の実施例７の広帯域音声復元装置における広帯域駆動音源推定手段の構成図である。FIG. 14 is a configuration diagram of a wideband driving sound source estimating means in the wideband audio restoration apparatus according to the seventh embodiment of the present invention. この発明の実施例８の広帯域音声復元装置における広帯域音源推定手段の構成図である。FIG. 18 is a configuration diagram of a wideband sound source estimating unit in the wideband speech restoration apparatus according to the eighth embodiment of the present invention. この発明の実施例９の広帯域音声復元装置の構成図である。FIG. 19 is a configuration diagram of a wideband audio restoration apparatus according to a ninth embodiment of the present invention. この発明の実施例１０の広帯域音声復元装置の構成図である。FIG. 21 is a configuration diagram of a wideband audio restoration apparatus according to Embodiment 10 of the present invention. この発明の実施例１１の広帯域音声復元装置の構成図である。FIG. 21 is a configuration diagram of a wideband audio restoration apparatus according to Embodiment 11 of the present invention. この発明の実施例１２の広帯域音声復元装置の構成図である。FIG. 21 is a configuration diagram of a wideband audio restoration apparatus according to Embodiment 12 of the present invention. この発明の実施例１３における狭帯域スペクトルと広帯域スペクトルの概形の関係を説明する説明図である。FIG. 21 is an explanatory diagram illustrating a general relationship between a narrowband spectrum and a wideband spectrum in Embodiment 13 of the present invention. この発明の実施例１４における狭帯域スペクトルと広帯域スペクトルの概形の関係を説明する説明図である。FIG. 19 is an explanatory diagram for explaining a general relationship between a narrowband spectrum and a wideband spectrum in Embodiment 14 of the present invention. この発明の実施例１５の広帯域音声復元装置における広帯域スペクトル推定手段の構成図である。FIG. 21 is a configuration diagram of a wideband spectrum estimating means in a wideband speech restoration apparatus according to Embodiment 15 of the present invention. この発明の実施例１７の広帯域音声復元装置の構成図である。FIG. 21 is a configuration diagram of a wideband audio restoration apparatus according to Embodiment 17 of the present invention. この発明の実施例１８の広帯域音声復元装置における広帯域音源復号手段の構成図である。FIG. 28 is a configuration diagram of a wideband sound source decoding means in the wideband audio restoration apparatus according to the eighteenth embodiment of the present invention. この発明の実施例１９の広帯域音声復元装置における広帯域音源復号手段の構成図である。FIG. 21 is a configuration diagram of a wideband sound source decoding means in a wideband sound restoration apparatus according to a nineteenth embodiment of the present invention. この発明の実施例２０の広帯域音声復元装置における広帯域音源復号手段の構成図である。FIG. 21 is a configuration diagram of a wideband sound source decoding means in a wideband speech restoration apparatus according to a twentieth embodiment of the present invention. この発明の実施例２２の広帯域音声復元装置の構成図である。FIG. 27 is a configuration diagram of a wideband audio restoration apparatus according to Embodiment 22 of the present invention.

Explanation of reference numerals

　１　狭帯域音声信号、２　分析手段、３　スペクトル分析手段、４　狭帯域スペクトルパラメータ、５　逆フィルタ、６　狭帯域音源信号、７　広帯域スペクトル推定手段、８　ベクトル量子化手段、９　狭帯域スペクトル符号帳、１０　スペクトル符号、１１　逆量子化手段、１２　広帯域スペクトル符号帳、１３　広帯域スペクトルパラメータ、１４　広帯域音源推定手段、１５　零詰手段、１６　広帯域音源信号、１７　合成フィルタ、１８　帯域フィルタ、１９　アップサンプリング手段、２０　広帯域音声信号、２１　音源分析手段、２２　狭帯域適応符号帳、２３　歪最小化手段、２４　狭帯域駆動音源信号、２５　狭帯域適応ラグ長、２６　狭帯域適応ゲイン、２７　広帯域駆動音源推定手段、２８　零詰手段、２９　広帯域駆動音源信号、３０　広帯域適応音源推定手段、３１　広帯域適応音源符号帳、３２　広帯域適応音源信号、３３　広帯域適応ラグ長、３４　広帯域適応ゲイン、３５　パワー算出手段、３６　雑音生成手段、３７　狭帯域長周期予測分析手段、３８　狭帯域長周期遅延、３９　狭帯域長周期予測係数、４０　長周期逆フィルタ、４１　狭帯域長周期予測残差信号、４２　広帯域長周期予測残差推定手段、４３　零詰手段、４４　広帯域長周期予測パラメータ推定手段、４５　広帯域長周期遅延、４６　広帯域長周期予測係数、４７　長周期合成フィルタ、４８　広帯域長周期予測残差信号、４９　アップサンプリング手段、５０　零化手段、５１　狭帯域パワー算出手段、５２　狭帯域音源パワー、５３　狭帯域パワー込みスペクトル符号、５４　音源正規化手段、５５　狭帯域正規化音源信号、５６　広帯域正規化音源信号、５７　広帯域パワー符号帳、５８　広帯域音源パワー、５９　広帯域音源パワー推定手段、６０　広帯域パワー込みスペクトル符号帳、６１　ポストフィルタ手段、６２　スペクトルパラメータ変換手段、６３　次数低減手段、６４　スペクトルパラメータ逆変換手段、１０１　狭帯域音声符号、１０２　分離手段、１０３　狭帯域スペクトル符号、１０４　狭帯域音源符号、１０５　広帯域スペクトル復号手段、１０６　広帯域音源復号手段、１０７　狭帯域スペクトル復号手段、１０８　狭帯域音源復号手段、１０９　狭帯域音声復号手段、１１０　狭帯域ピッチ符号、１１１　狭帯域パワー符号、１１２　広帯域ピッチ復号手段、１１３　広帯域ピッチ周期、１１４　広帯域パワー復号手段、１１５　広帯域パワー復号手段、１１６　音源生成手段、１１７　狭帯域適応音源符号、１１８　狭帯域駆動音源符号、１１９　広帯域適応音源復号手段、１２０　広帯域駆動音源復号手段、１２１　狭帯域適応音源復号手段、１２２　狭帯域駆動音源復号手段、１２３　狭帯域長周期予測符号、１２４　広帯域長周期予測パラメータ復号手段、１２５　狭帯域長周期予測パラメータ復号手段、１２６　狭帯域長周期予測残差符号、１２７　広帯域長周期予測残差復号手段、１２８　狭帯域長周期予測残差復号手段、１２９　狭帯域パワー復号手段、１３０　広帯域正規化音源復号手段。 1} narrowband speech signal, 2} analysis means, 3} spectrum analysis means, 4} narrowband spectrum parameter, 5} inverse filter, 6} narrowband excitation signal, 7} wideband spectrum estimation means, 8} vector quantization means, 9] narrowband spectrum codebook, 10 spectrum code, 11 inverse quantization means, 12 wideband spectrum codebook, 13 wideband spectrum parameters, 14 wideband excitation estimation means, 15 zero-filling means, 16 wideband excitation signal, 17 synthesis filter, 18 bandpass filter, 19 upsampling means, 20 wideband speech signal, 21 sound source analysis means, 22 narrowband adaptive codebook, 23 distortion minimization means, 24 narrowband drive excitation signal, 25 narrowband adaptive lag length, 26 narrowband adaptive gain, 27 wideband drive excitation estimation means, 28 zero filling means, 29 wide band Driving excitation signal, 30 、 wideband adaptive excitation estimating means, 31 wideband adaptive excitation codebook, 32 wideband adaptive excitation signal, 33 wideband adaptive lag length, 34 wideband adaptive gain, 35 power calculation means, 36 noise generation means, 37 narrowband long period Prediction analysis means, 38 narrow band long cycle delay, 39 narrow band long cycle prediction coefficient, 40 long cycle inverse filter, 41 narrow band long cycle prediction residual signal, 42 wide band long cycle prediction residual estimation means, 43 zero filling means, 44 wideband long-period prediction parameter estimation means, 45 wideband long-period delay coefficient, 46 wideband long-period prediction coefficient, 47 long-period synthesis filter, 48 wideband long-period prediction residual signal, 49 upsampling means, 50 zeroing means, 51 narrowband Power calculation means, 52 narrow band sound source power, 53 narrow band power included spectrum code , 54 excitation normalizing means, 55 narrowband normalized excitation signal, 56 wideband normalized excitation signal, 57 wideband power codebook, 58 wideband excitation power estimation, 59 wideband excitation power estimation means, 60 wideband power spectrum codebook, 61 post Filter means, 62 spectrum parameter conversion means, 63 degree reduction means, 64 spectrum parameter inverse conversion means, 101 narrowband speech code, 102 separation means, 103 narrowband spectrum code, 104 narrowband excitation code, 105 wideband spectrum decoding means, 106 Broadband excitation decoding means, 107 narrowband spectrum decoding means, 108 narrowband excitation decoding means, 109 narrowband speech decoding means, 110 narrowband pitch code, 111 narrowband power code, 112 wideband pitch decoding means, 113 wideband Pitch period, 114 {Wide band power decoding means, 115} Wide band power decoding means, 116} Excitation generation means, 117 # Narrow band driving excitation code, 118 # Narrow band adaptive excitation decoding means, 119 # Wide band driving excitation decoding means, 120 # Wide band driving excitation decoding means, 121 # Narrow Bandwidth adaptive excitation decoding means, 122 {narrowband driving excitation decoding means, 123} narrowband long-period prediction parameter decoding means, 124 wideband long-period prediction parameter decoding means, 125 {narrow-band long-period prediction parameter decoding means, 126} narrow-band long-period prediction residual code 127, wideband long-period prediction residual decoding means, 128, narrowband long-period prediction residual decoding means, 129, narrowband power decoding means, 130, wideband normalized excitation decoding means.

Claims

Spectrum decoding means for decoding a narrow-band spectral parameter from the narrow-band spectral code, extending the spectrum envelope of the decoded narrow-band spectral parameter in the frequency axis direction, and outputting as a wide-band spectral parameter;
A wideband speech restoration apparatus comprising a synthesizing means for generating a wideband speech signal using the outputted wideband spectrum parameters.

Comprising a wideband sound source estimating means for estimating a wideband sound source signal,
2. The wideband audio restoration apparatus according to claim 1, wherein the synthesis unit is configured to generate a wideband audio signal from the wideband sound source signal and a wideband spectrum parameter.

A spectrum decoding step of decoding a narrow-band spectrum parameter from the narrow-band spectrum code, extending a spectrum envelope of the decoded narrow-band spectrum parameter in the frequency axis direction, and outputting as a wide-band spectrum parameter;
A wide-band speech restoration method including a synthesis step of generating a wide-band speech signal using the outputted wide-band spectrum parameters.

(4) A wideband audio restoration apparatus that expands a spectrum envelope of a narrowband spectral parameter obtained by analyzing a narrowband audio signal in a frequency axis direction and generates a wideband audio signal using the wideband spectrum parameter.

Comprising a wideband sound source estimating means for estimating a wideband sound source signal,
The wideband audio restoration apparatus according to claim 4, wherein a wideband audio signal is generated from the generated wideband sound source signal and the wideband spectrum parameter.

(4) A wideband audio restoration method for generating a wideband audio signal by extending a spectrum envelope of a narrowband spectral parameter obtained by analyzing a narrowband audio signal in a frequency axis direction and using the spectrum envelope as a wideband spectral parameter.

In a speech transmission system including a speech encoding device that encodes a narrowband speech signal and outputs a narrowband speech code, and a wideband speech restoration device that receives the narrowband speech code and generates a wideband speech signal,
The wideband audio restoration device,
Spectral decoding for decoding a narrowband spectral parameter from the narrowband spectral code included in the received narrowband speech code, extending the spectrum envelope of the decoded narrowband spectral parameter in the frequency axis direction, and outputting as a wideband spectral parameter. Means,
An audio transmission system including a synthesizing unit that generates a wideband audio signal using the output wideband spectral parameters.

A voice encoding step of encoding a narrowband voice signal to output a narrowband voice code, and a wideband voice restoration step of receiving the narrowband voice code and generating a wideband voice signal,
The wideband audio restoration step includes:
Spectral decoding for decoding a narrowband spectral parameter from the narrowband spectral code included in the received narrowband speech code, extending the spectrum envelope of the decoded narrowband spectral parameter in the frequency axis direction, and outputting as a wideband spectral parameter. Steps and
A voice transmission method including a synthesis step of generating a wideband voice signal using the output wideband spectral parameters.

The said spectrum decoding means performs the deformation | transformation which suppresses the pole of the spectrum envelope of the decoded narrow-band spectrum parameter, and outputs the narrow-band spectrum parameter which performed the deformation | transformation which suppresses the said pole as a wide-band spectrum parameter. Item 2. The wideband audio restoration device according to Item 1.

The spectrum decoding means performs a deformation to smooth the pole structure of the spectral envelope of the decoded narrowband spectrum parameter, and outputs the narrowband spectrum parameter subjected to the deformation to smooth the pole structure as a wideband spectrum parameter. 2. The wideband audio restoration apparatus according to claim 1, wherein:

The spectrum decoding step performs a modification for suppressing a pole of a spectrum envelope of the decoded narrowband spectral parameter, and outputs the narrowband spectral parameter subjected to the modification for suppressing the pole as a wideband spectral parameter. Item 3. The wideband audio restoration method according to Item 3.

The spectrum decoding step performs a deformation to smooth the pole structure of the spectral envelope of the decoded narrowband spectrum parameter, and outputs the narrowband spectrum parameter that has been deformed to smooth the pole structure as a wideband spectrum parameter. The method of claim 3, wherein the wideband sound is restored.