JP3266920B2 - Audio encoding device, audio decoding device, and audio encoding / decoding device - Google Patents

Audio encoding device, audio decoding device, and audio encoding / decoding device

Info

Publication number
JP3266920B2
JP3266920B2 JP24566591A JP24566591A JP3266920B2 JP 3266920 B2 JP3266920 B2 JP 3266920B2 JP 24566591 A JP24566591 A JP 24566591A JP 24566591 A JP24566591 A JP 24566591A JP 3266920 B2 JP3266920 B2 JP 3266920B2
Authority
JP
Japan
Prior art keywords
harmonic
fourier transform
decoding
encoding
envelope
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
JP24566591A
Other languages
Japanese (ja)
Other versions
JPH0580800A (en
Inventor
純 石井
真哉 高橋
邦男 中島
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Mitsubishi Electric Corp
Original Assignee
Mitsubishi Electric Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mitsubishi Electric Corp filed Critical Mitsubishi Electric Corp
Priority to JP24566591A priority Critical patent/JP3266920B2/en
Publication of JPH0580800A publication Critical patent/JPH0580800A/en
Application granted granted Critical
Publication of JP3266920B2 publication Critical patent/JP3266920B2/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Description

【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION

【0001】[0001]

【産業上の利用分野】この発明は、例えば音声をディジ
タル伝送あるいは蓄積する場合に用いられる音声符号化
装置及び音声復号化装置並びに音声符号化復号化装置に
関するものである。
BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a speech encoding system used for, for example, digitally transmitting or storing speech.
The present invention relates to a device, a speech decoding device, and a speech encoding / decoding device.

【0002】[0002]

【従来の技術】この種の従来技術の基本的手法を記述し
たものに D.W.Griffin,J.S.Lim著”Multiband Excitati
on Vocoder”(IEEE trans.on ASSP,Vol.36,No.8,pp.12
23-1235,Aug 1988)(以下[文献1]と称する)があ
り、さらにこの方式の低ビットレート化を図ったものを
記述したものとしてP.C.Meuse著 ”A 2400 bpsMulti-ba
nd Excitation Vocoder”(ICASSP 90,pp.9-12,1990)
(以下[文献2]と称する)がある。図5は上記[文献
1]に示された従来の音声符号化装置の構成図である。
1は符号化部、2は復号化部、3は伝送路であり、4は
音声入力端、5は音声出力端である。6はピッチ・調波
成分分析手段、7は有声/無声判定手段、8は有声/無
声符号化手段、9はピッチ符号化手段、10は調波成分
符号化手段、11は有声/無声復号化手段、12はピッ
チ復号化手段。13は調波成分復号化手段、14は有声
合成手段、15は無声合成手段である。
2. Description of the Related Art A description of a basic technique of this kind in the prior art is given in "Multiband Excitati" by DWGriffin and JSLim.
on Vocoder ”(IEEE trans.on ASSP, Vol.36, No.8, pp.12
23-1235, Aug 1988) (hereinafter referred to as [Literature 1]), and further described by PCMeuse, "A 2400 bpsMulti-ba"
nd Excitation Vocoder ”(ICASSP 90, pp. 9-12, 1990)
(Hereinafter referred to as [Reference 2]). FIG. 5 is a configuration diagram of the conventional speech encoding device shown in the above [Reference 1].
1 is an encoding unit, 2 is a decoding unit, 3 is a transmission path, 4 is an audio input terminal, and 5 is an audio output terminal. 6 is pitch / harmonic component analysis means, 7 is voiced / unvoiced determination means, 8 is voiced / unvoiced coding means, 9 is pitch coding means, 10 is harmonic component coding means, 11 is voiced / unvoiced decoding. Means, 12 is pitch decoding means. 13 is a harmonic component decoding means, 14 is a voiced synthesis means, and 15 is an unvoiced synthesis means.

【0003】以下、従来の音声符号化復号化装置の動作
を図5において説明する。先ず符号化部1について説明
する。ピッチ・調波成分分析手段6は、一定長の分析フ
レーム単位に音声入力端4からの入力音声を分析して、
ピッチ及び周波数スペクトル上に現れるピッチ周波数間
隔の調波成分の振幅値Amと位相θm(mは調波番号)を
抽出して調波成分符号化手段10へ出力する。
[0003] The operation of the conventional speech encoding / decoding apparatus will be described below with reference to FIG. First, the encoding unit 1 will be described. The pitch / harmonic component analysis means 6 analyzes the input voice from the voice input terminal 4 in units of a fixed length analysis frame,
The amplitude value Am and the phase θm (m is a harmonic number) of the harmonic component at the pitch frequency interval appearing on the pitch and frequency spectrum are extracted and output to the harmonic component encoding means 10.

【0004】有声/無声判定手段7は周波数領域を1〜
3個の調波を含む複数の帯域に分割し、各帯域毎に有声
無声/判定を行い有声/無声情報を有声/無声判定符号
化手段8へ出力する。この有声/無声判定は、調波構造
を有する帯域は有声、調波構造が乱れている帯域は無声
と判定する。有声/無声判定符号化手段8は上記有声/
無声判定情報の符号化を行い、その符号を復号化部2に
伝送路3を介して出力する。ピッチ符号化手段9は上記
ピッチ周波数の符号化を行い、符号化データを復号化部
2に伝送路3を介して出力する。調波成分符号化手段1
0は上記振幅値Amとθmの符号化を行い、符号化データ
を復号化部2に伝送路3を介して出力する。
[0004] The voiced / unvoiced determination means 7 sets the frequency domain to 1
It divides into a plurality of bands including three harmonics, performs voiced / unvoiced / judgment for each band, and outputs voiced / unvoiced information to voiced / unvoiced judgment coding means 8. In the voiced / unvoiced determination, a band having a harmonic structure is determined to be voiced, and a band having a distorted harmonic structure is determined to be unvoiced. The voiced / unvoiced determination encoding means 8 performs the voiced / unvoiced
The unvoiced determination information is encoded, and the code is output to the decoding unit 2 via the transmission path 3. The pitch encoding unit 9 encodes the pitch frequency and outputs encoded data to the decoding unit 2 via the transmission path 3. Harmonic component encoding means 1
0 encodes the amplitude values Am and θm, and outputs the encoded data to the decoding unit 2 via the transmission path 3.

【0005】図6は上記調波成分符号化手段10の内部
構成を示す構成図であり、上記[文献2]に記述されて
いる2.4Kbpsにおける音声符号化復号化装置の調波成分
符号化手段である。以下この図によって、調波成分符号
化手段10の詳細を説明する。固定サンプル点スペクト
ル抽出手段32は調波成分の振幅Amの値を補間し、固
定点(48個)のサンプルで表現される周波数スペクトル
を抽出する。周波数帯域分割手段33は、このスペクト
ルを一部重複して低周波数帯域(0〜1.5kHz)の18サン
プルと、高周波数帯域(1〜4kHz)の36サンプルに分け
る。低周波数帯域の18サンプルはM1次元ベクトル量
子化手段34、また、高周波数帯域の36サンプルはM
2次元ベクトル量子化手段35によって、それぞれM1
(18)次元ベクトル量子化、及びM2次元(36)ベクト
ル量子化を行うことにより符号化する。このとき、ベク
トル量子化手段における量子化ビット数はそれぞれ10
ビットである。
FIG. 6 is a block diagram showing the internal structure of the harmonic component encoding means 10. The harmonic component encoding means of the 2.4 Kbps speech encoding / decoding apparatus described in [Reference 2] is described. It is. Hereinafter, the details of the harmonic component encoding means 10 will be described with reference to FIG. The fixed sample point spectrum extracting means 32 interpolates the value of the amplitude Am of the harmonic component and extracts a frequency spectrum represented by the samples of the fixed points (48). The frequency band dividing means 33 partially overlaps the spectrum and divides the spectrum into 18 samples in the low frequency band (0 to 1.5 kHz) and 36 samples in the high frequency band (1 to 4 kHz). The 18 samples in the low frequency band are M1-dimensional vector quantization means 34, and the 36 samples in the high frequency band are M
By the two-dimensional vector quantization means 35, M1
(18) Encode by performing dimensional vector quantization and M2 dimensional (36) vector quantization. At this time, the number of quantization bits in the vector quantization means is 10
Is a bit.

【0006】次に復号化部について説明する。有声/無
声復号化手段11は、伝送路3から入力された有声/無
声判定情報の符号化データを復号化し、有声/無声判定
を求め、有声合成手段14、及び無声合成手段15に出
力する。ピッチ復号化手段12は、伝送路3から入力さ
れたピッチ符号化データを復号化し、ピッチ周波数を求
め、有声合成手段14、及び無声合成手段15に出力す
る。
Next, the decoding section will be described. The voiced / unvoiced decoding unit 11 decodes the coded data of the voiced / unvoiced judgment information input from the transmission path 3, obtains a voiced / unvoiced judgment, and outputs it to the voiced synthesis unit 14 and the unvoiced synthesis unit 15. The pitch decoding unit 12 decodes the pitch-encoded data input from the transmission path 3, obtains a pitch frequency, and outputs the pitch frequency to the voiced synthesis unit 14 and the unvoiced synthesis unit 15.

【0007】調波成分復号化手段13は、伝送路3から
入力された調波成分符号化データを復号化し、調波成分
の振幅Amと位相θmを求め、有声合成手段14、及び無
声合成手段15に出力する。有声合成手段14は、有声
/無声復号化手段11によって求められた有声/無声判
定が有声である周波数帯域において、ピッチ復号化手段
12によって求められたピッチ周波数と調波成分復号化
手段13によって復号された振幅Amと位相θmから有声
部の復号音声を(1)式で表される余弦波の重ね合わせに
よって合成し、出力する。
The harmonic component decoding means 13 decodes the encoded harmonic component data inputted from the transmission line 3 to determine the amplitude Am and the phase θm of the harmonic component, and outputs a voiced synthesizing means 14 and an unvoiced synthesizing means. 15 is output. The voiced synthesizing unit 14 decodes the pitch frequency obtained by the pitch decoding unit 12 and the harmonic component decoding unit 13 in a frequency band in which the voiced / unvoiced determination obtained by the voiced / unvoiced decoding unit 11 is voiced. Based on the amplitude Am and the phase θm, a decoded voice of a voiced part is synthesized by superimposing a cosine wave represented by the equation (1) and output.

【0008】[0008]

【数1】 (Equation 1)

【0009】無声合成手段15は、有声/無声復号化手
段11によって求められた有声/無声判定が無声である
周波数帯域において、白色雑音に調波成分復号化手段1
3によって求められた振幅Amから求まるスペクトル振
幅を与えて復号音声を合成する。
The unvoiced synthesizing unit 15 converts the harmonic component decoding unit 1 into white noise in a frequency band in which the voiced / unvoiced determination determined by the voiced / unvoiced decoding unit 11 is unvoiced.
3. A decoded speech is synthesized by giving a spectrum amplitude obtained from the amplitude Am obtained in step 3.

【0010】[0010]

【発明が解決しようとする課題】従来の音声符号化復号
化装置の調波成分の振幅の符号化部は以上の様に構成さ
れているため、本来相関が強く、一括してベクトル量子
化すべき音声信号スペクトルの低周波数領域と高周波数
領域を分離し、別々に調波成分の振幅値をベクトル量子
化するので、量子化効率が悪く、しかもベクトル量子化
のときの次元数が大いので、少量のベクトル量子化ビッ
トで量子化する場合は量子化歪が大きくなり、合成音声
の品質が劣化する問題点があった。
Since the encoding section of the amplitude of the harmonic component of the conventional speech encoding / decoding apparatus is configured as described above, the correlation is inherently strong and vector quantization must be performed collectively. Since the low-frequency and high-frequency regions of the audio signal spectrum are separated and the amplitude values of the harmonic components are separately vector-quantized, the quantization efficiency is poor, and the number of dimensions in vector quantization is large. When quantizing with a small number of vector quantization bits, there is a problem that quantization distortion increases and the quality of synthesized speech deteriorates.

【0011】この発明は、上記のような課題を解消する
ためになされたものであり、調波成分の振幅値を少ない
ビット量でも、量子化歪が少なく効率的に量子化し、品
質の良い復号音声を合成することを目的としている。
SUMMARY OF THE INVENTION The present invention has been made to solve the above-described problem, and it is possible to efficiently quantize the amplitude value of the harmonic component with a small bit amount with little quantization distortion and to obtain high quality decoding. It is intended to synthesize speech.

【0012】[0012]

【課題を解決するための手段】この発明に係る音声符号
化装置は、一定時間長のフレーム毎に分析されたピッチ
周波数間隔の調波の振幅値を対数領域に変換した後、逆
フーリェ変換し、低次の逆フーリェ変換係数を出力する
調波振幅包絡分析手段と、この調波振幅包絡分析手段か
ら出力された上記低次の逆フーリェ変換係数を量子化し
て符号化する包絡成分量子化手段と、この包絡成分量子
化手段で量子化された上記低次の逆フーリェ変換係数を
逆量子化した後にフーリェ変換して、フーリェ変換係数
を求め、このフーリェ変換係数と上記ピッチ周波数間隔
の調波成分の振幅値との各調波上での差分値を量子化し
て符号化する差分量子化手段とを備える。また、この発
明に係る音声復号化装置は、符号化された調波成分の一
部である低次の逆フーリェ変換係数を復号化し、低次の
逆フーリェ変換係数を出力する包絡成分逆量子化手段
と、この低次の逆フーリェ変換係数をフーリェ変換し
て、フーリェ変換係数を求める調波振幅包絡復号手段
と、符号化された調波成分の一部である差分を復号する
差分逆量子化手段と、この復号化された差分と、上記調
波振幅包絡復号手段で求められたフーリェ変換係数を加
算することでピッチ周波数間隔の調波成分の振幅値を復
号する加算手段とを備える。さらに、この発明に係る音
声符号化復号化装置は、一定時間長のフレーム毎に分析
されたピッチ周波数間隔の調波の振幅値を対数領域に変
換した後、逆フーリェ変換し、低次の逆フーリェ変換係
数を出力する調波振幅包絡分析手段と、この調波振幅包
絡分析手段から出力された上記低次の逆フーリェ変換係
数を量子化して符号化する包絡成分量子化手段と、この
包絡成分量子化手段で量子化された上記低次の逆フーリ
ェ変換係数を逆量子化した後にフーリェ変換して、フー
リェ変換係数を求め、このフーリェ変換係数と上記ピッ
チ周波数間隔の調波成分の振幅値との各調波上での差分
値を量子化して符号化する差分量子化手段を符号化部に
備え、上記符号化部から送信された符号化された低次の
逆フーリェ変換係数を復号化し、低次の逆フーリェ変換
係数を出力する包絡成分逆量子化手段と、この低次逆フ
ーリェ変換係数をフーリェ変換して、フーリェ変換係数
を求める調波振幅包絡復号手段と、上記符号化部から送
信された符号化された差分を復号する差分 逆量子化手段
と、この復号化された差分と、上記調波振幅包絡復号手
段で求められたフーリェ変換係数を加算することで、ピ
ッチ周波数間隔の調波成分の振幅値を復号する加算手段
を復号化部に備える。
A speech code according to the present invention
The digitizer analyzes the pitch analyzed for each frame of a certain length of time.
After converting the amplitude value of the harmonic at the frequency interval to the logarithmic domain,
Fourier transform and output low-order inverse Fourier transform coefficients
Harmonic amplitude envelope analysis means and the harmonic amplitude envelope analysis means
Quantizes the low-order inverse Fourier transform coefficients output from
Component quantization means for encoding by means of
The low-order inverse Fourier transform coefficient quantized by the
Fourier transform after inverse quantization, and Fourier transform coefficients
And the Fourier transform coefficient and the pitch frequency interval
Quantizes the difference value of each harmonic with the amplitude value of the harmonic component of
Differential quantizing means for performing encoding. In addition,
The audio decoding device according to the present invention performs one of the encoded harmonic components.
Decode the low-order inverse Fourier transform coefficients that are the
Envelope component inverse quantization means for outputting inverse Fourier transform coefficients
And the Fourier transform of this low-order inverse Fourier transform coefficient.
And harmonic amplitude envelope decoding means for obtaining a Fourier transform coefficient
And decode the difference that is part of the encoded harmonic component
Difference inverse quantization means, the decoded difference,
Add the Fourier transform coefficients obtained by the wave amplitude envelope decoding means.
The amplitude value of the harmonic component at the pitch frequency interval.
And addition means for performing the addition. Furthermore, the sound according to the present invention
The voice encoding / decoding device analyzes each frame of a certain time length.
The amplitude value of the harmonic at the pitch frequency interval
After performing the inverse Fourier transform, the low-order inverse Fourier transform
Harmonic amplitude envelope analyzing means for outputting a number,
The low-order inverse Fourier transform section output from the fault analysis means
An envelope component quantization means for quantizing and encoding the number,
The low-order inverse Fourier quantized by the envelope component quantizing means.
Fourier transform after inverse quantization of the
The Fourier transform coefficient is obtained, and the Fourier transform coefficient
Difference on each harmonic from the amplitude value of the harmonic component at the first frequency interval
Differential quantization means for quantizing and encoding values
And the encoded low-order transmitted from the encoding unit.
Decode the inverse Fourier transform coefficients and perform a low-order inverse Fourier transform
Envelope component inverse quantization means for outputting coefficients, and
Fourier transform of the Fourier transform coefficients
From the harmonic amplitude envelope decoding means for obtaining
Difference inverse quantization means for decoding received encoded difference
And the decoded difference and the harmonic amplitude envelope decoding
By adding the Fourier transform coefficients obtained in the step,
Adding means for decoding the amplitude value of the harmonic component at the switch frequency interval
Is provided in the decoding unit.

【0013】[0013]

【作用】この発明における音声符号化装置及び音声符号
化復号化装置の調波振幅包絡分析手段は、一定長のフレ
ーム毎に分析されるピッチ周波数間隔の調波成分の振幅
値を対数領域に変換した後、逆フーリェ変換し、低次の
逆フーリェ変換係数を求め出力する。包絡成分量子化手
は調波振幅包絡分析手段から出力された上記低次の逆
フーリェ変換係数を量子化して符号化する。差分量子化
手段は包絡成分量子化手段で量子化された上記低次の逆
フーリェ変換係数を逆量子化した後にフーリェ変換し
て、フーリェ変換係数を求め、このフーリェ変換係数と
上記ピッチ周波数間隔の調波成分の振幅値との各調波上
での差分を量子化して符号化する。また、この発明にお
ける音声復号化装置及び音声符号化復号化装置の包絡成
分逆量子化手段は、符号化された調波成分の一部である
低次の逆フーリェ変換係数を逆量子化する。調波振幅包
絡復号手段は低次の逆フーリェ変換係数をフーリェ変換
して、フーリェ変換係数を求める。差分逆量子化手段
符号化された調波成分の一部である差分の逆量子化を行
う。加算手段は差分逆量子化手段から出力された差分
と、上記調波振幅包絡復号手段で求められたフーリェ変
換係数を加算することでピッチ周波数間隔の調波成分の
振幅値を復号する。
The harmonic amplitude envelope analyzing means of the speech coding apparatus and the speech coding / decoding apparatus according to the present invention converts the amplitude value of the harmonic component at the pitch frequency interval analyzed for each fixed length frame into a logarithmic domain. After that, inverse Fourier transform is performed to obtain and output a low-order inverse Fourier transform coefficient. Envelope component quantizing means you encode inverse Fourier transform coefficients outputted in the low order of harmonic amplitude envelope analysis means and quantization. Differential quantization
Means dequantizing the low-order inverse Fourier transform coefficient quantized by the envelope component quantizing means and then performing Fourier transform to obtain a Fourier transform coefficient; and a harmonic component between the Fourier transform coefficient and the pitch frequency interval. On each harmonic with the amplitude value of
Is quantized and encoded. In addition, the present invention
The envelope component inverse quantization means of the audio decoding device and the audio encoding / decoding device performs inverse quantization of a low-order inverse Fourier transform coefficient which is a part of the encoded harmonic component . The harmonic amplitude envelope decoding means Fourier-transforms the low-order inverse Fourier transform coefficients to obtain Fourier transform coefficients. Difference inverse quantization means
Inverse quantization of the difference that is a part of the encoded harmonic component is performed. The adding means adds the difference output from the difference dequantizing means and the Fourier transform coefficient obtained by the harmonic amplitude envelope decoding means to obtain the harmonic component of the pitch frequency interval.
Decode the amplitude value.

【0014】[0014]

【実施例】【Example】

実施例1.図1はこの発明の一実施例を適用した入力音
声の符号化復号化装置の構成図である。図1において図
5と同一の部分については同一の符号を付し、説明を省
略する。図1において、16は調波成分符号化手段、1
7は調波成分復号化手段である。また図2は、この発明
を実施した音声符号化復号化装置の、調波成分符号化手
段16の内部構成図、図3は調波成分復号化手段17の
内部構成図であり、図4は調波振幅と調波振幅包絡の例
を示した説明図である。
Embodiment 1 FIG. FIG. 1 is a configuration diagram of an input speech encoding / decoding apparatus to which an embodiment of the present invention is applied. In FIG. 1, the same portions as those in FIG. 5 are denoted by the same reference numerals, and description thereof will be omitted. In FIG. 1, reference numeral 16 denotes harmonic component encoding means,
7 is a harmonic component decoding means. FIG. 2 is an internal configuration diagram of the harmonic component encoding unit 16 of the audio encoding / decoding apparatus embodying the present invention, FIG. 3 is an internal configuration diagram of the harmonic component decoding unit 17, and FIG. It is explanatory drawing which showed the example of harmonic amplitude and harmonic amplitude envelope.

【0015】以下、この発明の一実施例の動作を説明す
る。先ず、図2を用いて調波成分符号化手段16の動作
を説明する。この調波成分符号化手段16中の調波振幅
包絡分析手段18は対数振幅手段19とM点離散逆フー
リエ変換手段20、及び低次成分抽出手段21よって構
成されており、フレーム毎に分析される調波成分の振幅
値Amを対数振幅手段19で対数振幅とした後に、M点
離散フーリエ変換手段20で離散逆フーリエ変換を行
い、低次成分抽出手段21で低次の逆フーリエ変換係数
(例えば20次)を調波の振幅値Amの包絡を表現する
パラメータとして出力する。このときの離散フーリエ変
換のポイント数Mは、フレーム内の全調波数の2倍とす
る。
The operation of one embodiment of the present invention will be described below. First, the operation of the harmonic component encoding means 16 will be described with reference to FIG. The harmonic amplitude envelope analyzing means 18 in the harmonic component coding means 16 is composed of a logarithmic amplitude means 19, an M-point discrete inverse Fourier transform means 20, and a low-order component extracting means 21, and is analyzed for each frame. After the amplitude value Am of the harmonic component is converted to a logarithmic amplitude by the logarithmic amplitude means 19, a discrete inverse Fourier transform is performed by the M-point discrete Fourier transform means 20, and a low-order inverse Fourier transform coefficient ( For example, 20th order) is output as a parameter expressing the envelope of the amplitude value Am of the harmonic. At this time, the number M of points of the discrete Fourier transform is twice the number of all harmonics in the frame.

【0016】次に、調波振幅包絡分析手段18から出力
される低次の逆フーリエ変換成分を包絡成分量子化手段
22によって例えば10ビットでベクトル量子化し符号
化して、伝送路に符号化データを出力する。
Next, the low-order inverse Fourier transform component output from the harmonic amplitude envelope analysis means 18 is vector-quantized and coded by, for example, 10 bits by the envelope component quantization means 22, and the coded data is transmitted to the transmission path. Output.

【0017】また、差分抽出手段23は、包絡成分逆量
子化手段24とN点離散フーリエ変換手段25、及び加
算器26から構成されており、上記包絡成分量子化手段
22から出力される量子化された低次の逆フーリエ変換
係数を包絡成分逆量子化手段24によって逆量子化し
て、低次の逆フーリエ変換係数を出力し、N点離散フー
リエ変換手段25によってフーリエ変換係数を復号し、
加算器26に出力する。この時のNの値はM以上の任意
の値とする。
The difference extracting means 23 comprises an envelope component inverse quantizing means 24, an N-point discrete Fourier transform means 25, and an adder 26, and the quantization output from the envelope component quantizing means 22 The obtained low-order inverse Fourier transform coefficient is inversely quantized by the envelope component inverse quantization means 24 to output a low-order inverse Fourier transform coefficient, and the N-point discrete Fourier transform means 25 decodes the Fourier transform coefficient.
Output to the adder 26. The value of N at this time is an arbitrary value of M or more.

【0018】加算器26は、復号されたフーリエ変換係
数をピッチ周波数間隔でサンプリングして得られる値の
符号反転を行ったものと、対数振幅手段19の出力であ
る対数調波振幅Amを入力し、差分を出力する。差分量
子化手段27は、差分抽出手段23によって出力された
調波振幅包絡差分を例えばスカラー量子化を行い、伝送
路に符号化データを出力する。
The adder 26 inputs a signal obtained by sampling the decoded Fourier transform coefficients at pitch frequency intervals, inverting the sign, and a logarithmic harmonic amplitude Am output from the logarithmic amplitude means 19. And output the difference. The difference quantization unit 27 performs, for example, scalar quantization on the harmonic amplitude envelope difference output by the difference extraction unit 23, and outputs encoded data to a transmission path.

【0019】上記調波振幅包絡分析手段18が行う調波
振幅包絡分析によって出力される低次の逆フーリエ変換
係数をフーリエ変換して得られる調波振幅包絡と、調波
成分の振幅値Amの関係の例を図4に示す。図4におい
て、ピッチ周波数と、その高調波位置を中心とした周波
数帯域に分割し、各帯域毎に分析される調波成分の振幅
値Amを、対応する帯域の中心周波数位置にポイントし
ている。
The harmonic amplitude envelope obtained by Fourier-transforming the low-order inverse Fourier transform coefficient output by the harmonic amplitude envelope analysis performed by the harmonic amplitude envelope analyzing means 18 and the amplitude value Am of the harmonic component. FIG. 4 shows an example of the relationship. In FIG. 4, the frequency is divided into a frequency band centered on the pitch frequency and its harmonic position, and the amplitude value Am of the harmonic component analyzed for each band is pointed to the center frequency position of the corresponding band. .

【0020】この例では調波数が26の場合である。調
波振幅Amに関して52点離散逆フーリエ変換を行い、
低次の逆フーリエ変換係数20次を量子化し、逆量子化
した後に256点(20次以外は0詰め)の離散フーリ
エ変換して得られた調波振幅包絡を実線で示してある。
調波振幅包絡をピッチ周波数間隔でサンプリングするこ
とで、調波振幅包絡から近似されるれる調波振幅Amの
値が得られる。調波振幅包絡差分は、調波振幅包絡から
得られるAmと、調波成分の振幅値Amの差分である。
In this example, the number of harmonics is 26. Performing a 52-point discrete inverse Fourier transform on the harmonic amplitude Am,
The solid line represents the harmonic amplitude envelope obtained by quantizing the low-order inverse Fourier transform coefficient 20th order, performing inverse quantization, and performing discrete Fourier transform of 256 points (0-order except for the 20th order).
By sampling the harmonic amplitude envelope at pitch frequency intervals, a value of the harmonic amplitude Am approximated from the harmonic amplitude envelope is obtained. The harmonic amplitude envelope difference is a difference between Am obtained from the harmonic amplitude envelope and the amplitude value Am of the harmonic component.

【0021】次に、図3を用いて調波成分復号化手段1
7の動作を説明する。包絡成分逆量子化手段28は、伝
送路から送信された包絡成分符号化データを逆量子化処
理によって復号化し、低次の逆フーリエ変換係数を出力
する。調波振幅包絡復号手段29は、包絡成分逆量子化
手段28から出力された低次の逆フーリエ変換係数をN
点のフーリエ変換を行い、フーリエ変換係数を復号し、
加算器31に出力する。ここでNは、符号化装置の差分
抽出手段で用いるN点フーリエ変換と同じ値とする。
Next, the harmonic component decoding means 1 will be described with reference to FIG.
7 will be described. The envelope component inverse quantization means 28 decodes the envelope component encoded data transmitted from the transmission path by an inverse quantization process, and outputs a low-order inverse Fourier transform coefficient. The harmonic amplitude envelope decoding means 29 converts the low-order inverse Fourier transform coefficient output from the envelope component inverse quantization means 28 into N
Perform a Fourier transform of the points, decode the Fourier transform coefficients,
Output to the adder 31. Here, N is the same value as the N-point Fourier transform used in the difference extracting means of the encoding device.

【0022】また、差分逆量子化手段30は、伝送路か
ら送信された差分符号化データを逆量子化し、調波包絡
差分を加算器31に出力する。加算器31は、調波振幅
包絡復号手段29から出力されるフーリエ変換係数をピ
ッチ周波数間隔でサンプリングした値と、差分逆量子化
手段30から出力される差分の加算を行い、対数振幅の
調波振幅値を出力する。
The differential inverse quantization means 30 inversely quantizes the differentially encoded data transmitted from the transmission path, and outputs the harmonic envelope difference to the adder 31. The adder 31 adds the value obtained by sampling the Fourier transform coefficient output from the harmonic amplitude envelope decoding means 29 at pitch frequency intervals and the difference output from the difference inverse quantization means 30 to obtain a logarithmic amplitude harmonic. Outputs the amplitude value.

【0023】なお、図2において、差分量子化手段27
での量子化法はスカラー量子化法を例として説明した
が、スカラー量子化ではなくベクトル量子化でも良い。
In FIG. 2, the difference quantization means 27
Has been described using the scalar quantization method as an example, but vector quantization may be used instead of scalar quantization.

【0024】[0024]

【発明の効果】以上のようにこの発明では、比較的少な
い次元数で入力音声の調波成分の振幅値の全周波数帯域
に渡る包絡を表現するパラメータを求め、それをベクト
ル量子化し、さらに、振幅値と包絡の差分を量子化する
ようにしたので、音声符号化装置では少ない量子化ビッ
トでも効率的に調波成分の振幅値の量子化が行え、音声
復号化装置では少ない量子化ビットで量子化された調波
成分の振幅値から品質の良い復号音声を合成でき、ま
た、音声符号化復号化装置では少ない量子化ビットでも
効率的に調波成分の振幅値の量子化が行え、品質の良い
復号音声を合成できるという効果がある。
As described above, according to the present invention, a parameter expressing the envelope of the amplitude value of the harmonic component of the input voice over the entire frequency band is obtained with a relatively small number of dimensions, and the parameter is vector-quantized. Since the difference between the amplitude value and the envelope is quantized, the audio encoding device has a small number of quantization bits.
Can efficiently quantize the amplitude value of the harmonic component
Harmonics quantized with fewer quantization bits in the decoder
High-quality decoded speech can be synthesized from the amplitude values of the components.
In addition, the speech encoding / decoding device has an effect that the amplitude value of the harmonic component can be efficiently quantized even with a small number of quantization bits, and a high-quality decoded speech can be synthesized.

【図面の簡単な説明】[Brief description of the drawings]

【図1】この発明の一実施例を示す符号化復号化装置の
構成図である。
FIG. 1 is a configuration diagram of an encoding / decoding device showing an embodiment of the present invention.

【図2】この発明の一実施例による調波成分符号化手段
の構成図である。
FIG. 2 is a configuration diagram of harmonic component encoding means according to one embodiment of the present invention.

【図3】この発明の一実施例による調波成分復号化手段
の構成図である。
FIG. 3 is a configuration diagram of harmonic component decoding means according to an embodiment of the present invention.

【図4】この発明の一実施例による符号化復号化装置に
おける調波振幅包絡特性の例をを示した説明図である。
FIG. 4 is an explanatory diagram showing an example of a harmonic amplitude envelope characteristic in the encoding / decoding device according to an embodiment of the present invention.

【図5】従来の音声符号化復号化装置の構成図である。FIG. 5 is a configuration diagram of a conventional speech encoding / decoding device.

【図6】従来の調波成分符号化手段装置の構成図であ
る。
FIG. 6 is a configuration diagram of a conventional harmonic component encoding means device.

【符号の説明】[Explanation of symbols]

1 符号化部 2 復号化部 3 伝送路 6 ピッチ・調波成分分析手段 9 ピッチ符号化手段 10 調波成分符号化手段 12 ピッチ復号化手段 13 調波成分復号化手段 14 有声合成手段 15 無声合成手段 17 調波成分復号化手段 18 調波振幅包絡分析手
段 22 包絡成分量子化手段 23 差分抽出手段 24 包絡成分逆量子化手段 27 差分量子化手段 28 包絡成分逆量子化手段 29 調波振幅包絡復号手段 30 差分逆量子化手段
DESCRIPTION OF SYMBOLS 1 Encoding part 2 Decoding part 3 Transmission line 6 Pitch / harmonic component analysis means 9 Pitch encoding means 10 Harmonic component encoding means 12 Pitch decoding means 13 Harmonic component decoding means 14 Voiced synthesis means 15 Unvoiced synthesis Means 17 Harmonic component decoding means 18 Harmonic amplitude envelope analysis means 22 Envelope component quantization means 23 Difference extraction means 24 Envelope component inverse quantization means 27 Difference quantization means 28 Envelope component inverse quantization means 29 Harmonic amplitude envelope decoding Means 30 Differential inverse quantization means

───────────────────────────────────────────────────── フロントページの続き (56)参考文献 特開 昭57−62096(JP,A) 特開 昭57−161795(JP,A) 特開 昭60−35799(JP,A) 特開 平2−134699(JP,A) (58)調査した分野(Int.Cl.7,DB名) G10L 19/00 - 19/14 H03M 7/30 ──────────────────────────────────────────────────続 き Continuation of the front page (56) References JP-A-57-62096 (JP, A) JP-A-57-161795 (JP, A) JP-A-60-35799 (JP, A) JP-A-2- 134699 (JP, A) (58) Field surveyed (Int. Cl. 7 , DB name) G10L 19/00-19/14 H03M 7/30

Claims (3)

(57)【特許請求の範囲】(57) [Claims] 【請求項1】 一定時間長の分析フレーム毎に、音声信
号のピッチ周波数と、ピッチ周波数間隔の調波の振幅値
と位相を求め、各々を符号化して出力する音声符号化装
置において、上記ピッチ周波数間隔の調波の振幅値を対
数領域に変換した後、逆フーリェ変換し、低次の逆フー
リェ変換係数を出力する調波振幅包絡分析手段と、この
調波振幅包絡分析手段から出力された上記低次の逆フー
リェ変換係数を量子化して符号化する包絡成分量子化手
段と、この包絡成分量子化手段で量子化された上記低次
の逆フーリェ変換係数を逆量子化した後にフーリェ変換
して、フーリェ変換係数を求め、このフーリェ変換係数
と上記ピッチ周波数間隔の調波成分の振幅値との各調波
上での差分値を量子化して符号化する差分量子化手段と
を備えることを特徴とする音声符号化装置。
An audio signal is transmitted for each analysis frame having a predetermined time length.
Signal's pitch frequency and amplitude value of harmonics at pitch frequency intervals
Audio encoding device that calculates and outputs
The amplitude value of the harmonic at the pitch frequency interval.
After transforming to the number domain, inverse Fourier transform
A harmonic amplitude envelope analyzing means for outputting a Rie transform coefficient,
The low-order inverse filter output from the harmonic amplitude envelope analysis means.
Envelope component quantizer that quantizes and encodes the Rie transform coefficients.
And the lower order quantized by the envelope component quantizing means.
Fourier transform after inverse quantization of inverse Fourier transform coefficients of
To obtain a Fourier transform coefficient, and this Fourier transform coefficient
And the amplitude value of the harmonic component of the pitch frequency interval
A difference quantization means for quantizing and encoding the difference value above
A speech encoding device comprising:
【請求項2】 一定時間長の分析フレーム毎に、符号化2. Encoding is performed for each analysis frame having a predetermined time length.
された音声信号のピッチ周波数、及び符号化されたピッPitch frequency of the encoded audio signal and encoded pitch
チ周波数間隔の調波成分を復号化し、各調波の周波数をAnd decodes the harmonic components at the
持つ余弦波を、復号されたその調波の振幅と位相を基にIs based on the amplitude and phase of the decoded harmonic
生成して重ね合わせることで復号音声を合成する音声復Speech decoding that synthesizes decoded speech by generating and superimposing
号化装置において、符号化された調波成分の一部であるIn the encoding device, it is a part of the encoded harmonic component
低次の逆フーリェ変換係数を復号化し、低次の逆フーリThe low-order inverse Fourier transform coefficients are decoded, and the low-order inverse Fourier transform
ェ変換係数を出力する包絡成分逆量子化手段と、この低An envelope component inverse quantization means for outputting a transform coefficient.
次の逆フーリェ変換係数をフーリェ変換して、フーリェFourier transform the following inverse Fourier transform coefficients
変換係数を求める調波振幅包絡復号手段と、符号化されA harmonic amplitude envelope decoding means for obtaining a transform coefficient;
た調波成分の一部である差分を復号する差分逆量子化手Inverse quantizer that decodes the difference that is part of the harmonic component
段と、この復号化された差分と、上記調波振幅包絡復号And the decoded difference and the harmonic amplitude envelope decoding
手段で求められたフーリェ変換係数を加算することでピBy adding the Fourier transform coefficients obtained by
ッチ周波数間隔の調波成分の振幅値を復号する加算手段Adding means for decoding the amplitude value of the harmonic component at the switch frequency interval
とを備えることを特徴とする音声復号化装置。A speech decoding device comprising:
【請求項3】 一定時間長の分析フレーム毎に、音声信3. An audio signal for each analysis frame having a predetermined time length.
号のピッチ周波数と、ピッチ周波数間隔の調波の振幅値Signal's pitch frequency and amplitude value of harmonics at pitch frequency intervals
と位相を求め、各々を符号化して出力する符号化部と、And an encoding unit that calculates and outputs a phase, and encodes and outputs each.
上記符号化されたピッチ周波数と、ピッチ周波数間隔のThe encoded pitch frequency and the pitch frequency interval
調波成分を復号化し、各調波の周波数を持つ余弦波を、The harmonic components are decoded, and the cosine wave having the frequency of each harmonic is
復号されたその調波の振幅と位相を基に生成して重ね合Generate and superimpose based on the amplitude and phase of the decoded harmonic
わせることで復号音声を合成する復号化部とで構成されAnd a decoding unit that synthesizes the decoded speech by
る音声符号化復号化装置において、In the audio encoding and decoding apparatus, 上記ピッチ周波数間隔の調波の振幅値を対数領域に変換Convert the amplitude value of harmonics at the above pitch frequency interval to logarithmic domain
した後、逆フーリェ変After the inverse Fourier transformation 換し、低次の逆フーリェ変換係数And the low-order inverse Fourier transform coefficients
を出力する調波振幅包絡分析手段と、この調波振幅包絡And a harmonic amplitude envelope analyzing means for outputting the harmonic amplitude envelope.
分析手段から出力された上記低次逆フーリェ変換係数をThe low-order inverse Fourier transform coefficient output from the analysis means is
量子化して符号化する包絡成分量子化手段と、この包絡Envelope component quantization means for quantizing and encoding, and this envelope
成分量子化手段で量子化された上記低次の逆フーリェ変The above low-order inverse Fourier transform quantized by the component quantization means
換係数を逆量子化した後にフーリェ変換して、フーリェFourier transform after inverse quantization of the permutation coefficient
変換係数を求め、このフーリェ変換係数と上記ピッチ周The transform coefficient is obtained, and the Fourier transform coefficient and the pitch
波数間隔の調波成分の振幅値との各調波上での差分値をThe difference value on each harmonic from the amplitude value of the harmonic component at the wave number interval
量子化して符号化する差分量子化手段とを符号化部に備The encoding unit is provided with a differential quantization means for quantizing and encoding.
え、上記符号化部から送信された符号化された低次の逆The encoded low-order inverse transmitted from the encoding unit.
フーリェ変換係数を復号化し、低次の逆フーリェ変換係Decodes Fourier transform coefficients and generates a low-order inverse Fourier transform
数を出力する包絡成分逆量子化手段と、この低次逆フーAn inverse component quantization means for outputting a number,
リェ変換係数をフーリェ変換して、フーリェ変換係数をFourier transforms the Fourier transform coefficients, and calculates the Fourier transform coefficients.
求める調波振幅包絡復号手段と、上記符号化部から送信The harmonic amplitude envelope decoding means to be sought and transmitted from the encoding unit
された符号化された差分を復号する差分逆量子化手段Inverse quantization means for decoding the encoded encoded difference
と、この復号化された差分と、上記調波振幅包絡復号手And the decoded difference and the harmonic amplitude envelope decoding
段で求められたフーリェ変換係数を加算することで、ピBy adding the Fourier transform coefficients obtained in the step,
ッチ周波数間隔の調波成分の振幅値を復号する加算手段Adding means for decoding the amplitude value of the harmonic component at the switch frequency interval
を復号化部に備えることを特徴とする音声符号化復号化Encoding / decoding characterized in that the decoding unit is provided with
装置。apparatus.
JP24566591A 1991-09-25 1991-09-25 Audio encoding device, audio decoding device, and audio encoding / decoding device Expired - Fee Related JP3266920B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP24566591A JP3266920B2 (en) 1991-09-25 1991-09-25 Audio encoding device, audio decoding device, and audio encoding / decoding device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP24566591A JP3266920B2 (en) 1991-09-25 1991-09-25 Audio encoding device, audio decoding device, and audio encoding / decoding device

Publications (2)

Publication Number Publication Date
JPH0580800A JPH0580800A (en) 1993-04-02
JP3266920B2 true JP3266920B2 (en) 2002-03-18

Family

ID=17136997

Family Applications (1)

Application Number Title Priority Date Filing Date
JP24566591A Expired - Fee Related JP3266920B2 (en) 1991-09-25 1991-09-25 Audio encoding device, audio decoding device, and audio encoding / decoding device

Country Status (1)

Country Link
JP (1) JP3266920B2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100462611B1 (en) * 2002-06-27 2004-12-20 삼성전자주식회사 Audio coding method with harmonic extraction and apparatus thereof.

Also Published As

Publication number Publication date
JPH0580800A (en) 1993-04-02

Similar Documents

Publication Publication Date Title
JP3483958B2 (en) Broadband audio restoration apparatus, wideband audio restoration method, audio transmission system, and audio transmission method
JP3881943B2 (en) Acoustic encoding apparatus and acoustic encoding method
US8326638B2 (en) Audio compression
US6678655B2 (en) Method and system for low bit rate speech coding with speech recognition features and pitch providing reconstruction of the spectral envelope
WO2003091989A1 (en) Coding device, decoding device, coding method, and decoding method
JP2003323199A (en) Device and method for encoding, device and method for decoding
JPH06118995A (en) Method for restoring wide-band speech signal
WO2007088853A1 (en) Audio encoding device, audio decoding device, audio encoding system, audio encoding method, and audio decoding method
JP2004102186A (en) Device and method for sound encoding
JPH11510274A (en) Method and apparatus for generating and encoding line spectral square root
JP2002372996A5 (en)
JPH11177434A (en) Voice code decoding system
WO1995021490A1 (en) Method and device for encoding information and method and device for decoding information
JP2003108197A (en) Audio signal decoding device and audio signal encoding device
JP3092653B2 (en) Broadband speech encoding apparatus, speech decoding apparatus, and speech encoding / decoding apparatus
JP3237178B2 (en) Encoding method and decoding method
JP2004302259A (en) Hierarchical encoding method and hierarchical decoding method for sound signal
JP3266920B2 (en) Audio encoding device, audio decoding device, and audio encoding / decoding device
JPH05297898A (en) Data quantity converting method
JPH09127987A (en) Signal coding method and device therefor
JP4287840B2 (en) Encoder
JP3576485B2 (en) Fixed excitation vector generation apparatus and speech encoding / decoding apparatus
JP3230790B2 (en) Wideband audio signal restoration method
JP3598111B2 (en) Broadband audio restoration device
JP3384523B2 (en) Sound signal processing method

Legal Events

Date Code Title Description
LAPS Cancellation because of no payment of annual fees