JP2002169595A

JP2002169595A - Fixed sound source code book and speech encoding/ decoding apparatus

Info

Publication number: JP2002169595A
Application number: JP2000366141A
Authority: JP
Inventors: Hiroyuki Ebara; 宏幸江原; Kazutoshi Yasunaga; 和敏安永; Kazunori Mano; 一則間野; Yuusuke Hiwazaki; 祐介日和▲崎▼
Original assignee: Nippon Telegraph and Telephone Corp; Matsushita Electric Industrial Co Ltd
Current assignee: Nippon Telegraph and Telephone Corp; Panasonic Holdings Corp
Priority date: 2000-11-30
Filing date: 2000-11-30
Publication date: 2002-06-14
Anticipated expiration: 2020-11-30
Also published as: JP3576485B2

Abstract

PROBLEM TO BE SOLVED: To deal with a low bit rate while assuring the number of pieces of sound source pulses. SOLUTION: The fixed sound source code book 207 includes a first sound source code book 401 which is relatively high in time resolution to partially express sound source vectors and a second sound source code book 402 which is relatively low in time resolution to express the entire part of the sound source vectors.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、音声信号を符号化
して伝送する移動通信システムなどにおける低ビットレ
ート音声符号化装置、特にパルス音源を駆動音源信号と
して用いるＣＥＬＰ（Code Excited Linear Predictio
n）型音声符号化装置などに関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a low bit rate speech coding apparatus in a mobile communication system for coding and transmitting a speech signal, and more particularly to a CELP (Code Excited Linear Predictio) using a pulsed sound source as a driving sound source signal.
The present invention relates to an n) type speech coding device.

【０００２】[0002]

【従来の技術】ディジタル移動通信や、インターネット
通信に代表されるパケット通信、あるいは音声蓄積など
の分野においては、電波などの伝送路容量や記憶媒体の
有効利用のために音声情報を圧縮し、高能率で符号化す
るための音声符号化装置が用いられている。中でもＣＥ
ＬＰ方式をベースにした方式が中・低ビットレートにお
いて広く実用化されている。ＣＥＬＰの技術について
は、M.R.Schroeder and b.s.Atal:"Code-ExcitedLinear
Prediction(CELP): High-quality Speech at VeryLow B
it Rates", Proc. ICASSP-85, 25.1.1, pp.937-940, 19
85" に示されている。2. Description of the Related Art In the field of digital mobile communication, packet communication typified by the Internet communication, and voice storage, voice information is compressed for effective use of transmission line capacity such as radio waves and storage media. A speech encoding device for encoding with efficiency is used. Above all CE
A system based on the LP system has been widely put into practical use at medium and low bit rates. For information on CELP technology, see MRSchroeder and bsAtal: "Code-ExcitedLinear
Prediction (CELP): High-quality Speech at VeryLow B
it Rates ", Proc. ICASSP-85, 25.1.1, pp.937-940, 19
85 ".

【０００３】ＣＥＬＰ型音声符号化方式は、ディジタル
化された音声信号を一定のフレーム長（５ms〜５０ms程
度）に区切り、フレーム毎に音声の線形予測を行い、フ
レーム毎の線形予測による予測残差（励振信号）を、既
知の波形からなる適応符号帳と雑音（固定）符号帳とを
用いて符号化するものである。[0003] In the CELP type speech coding system, a digitized speech signal is divided into a fixed frame length (about 5 ms to 50 ms), a speech is linearly predicted for each frame, and a prediction residual by the linear prediction for each frame. (Excitation signal) is encoded using an adaptive codebook having a known waveform and a noise (fixed) codebook.

【０００４】適応符号帳は、過去に生成した駆動音源信
号を格納しており、音声信号の周期成分を表現するため
に用いられる。固定符号帳は、予め用意された定められ
た数の定められた形状を有するベクトルを格納してお
り、適応符号帳では表現できない非周期的成分を主とし
て表現するために用いられる。固定符号帳に格納される
ベクトルには、ランダムな雑音系列から成るベクトル
や、何本かのパルスの組み合わせによって表現されるベ
クトルなどが用いられる。[0004] The adaptive codebook stores driving excitation signals generated in the past, and is used to represent a periodic component of an audio signal. The fixed codebook stores a predetermined number of vectors having a predetermined shape prepared in advance, and is used for mainly expressing aperiodic components that cannot be expressed by the adaptive codebook. As a vector stored in the fixed codebook, a vector composed of a random noise sequence, a vector expressed by a combination of several pulses, and the like are used.

【０００５】数本のパルスの組み合わせによって前記ベ
クトルを表現する固定符号帳の代表的なものの一つに代
数的固定符号帳がある。代数的固定符号帳については
「ITU-T勧告G.729」などに具体的内容が示されている。[0005] An algebraic fixed codebook is one of the typical fixed codebooks that express the above-mentioned vector by a combination of several pulses. Specific contents of the algebraic fixed codebook are shown in "ITU-T Recommendation G.729" and the like.

【０００６】従来の代数的固定符号帳を図１４を用いて
具体的に説明する。図１４は、代数的固定符号帳から固
定音源ベクトルが生成される様子を示した図である。図
１４では、３本の単位パルス（振幅値が１）が異なるト
ラックから生成され、極性付与部１４０１〜１４０３で
それぞれ適切な極性が付与された後に、加算部１４０４
で３本のパルスが足し合わされて固定音源ベクトルが生
成される。A conventional algebraic fixed codebook will be specifically described with reference to FIG. FIG. 14 is a diagram illustrating a manner in which a fixed excitation vector is generated from an algebraic fixed codebook. In FIG. 14, three unit pulses (having an amplitude value of 1) are generated from different tracks, and after being given appropriate polarities by the polarity applying units 1401 to 1403, the adding unit 1404
, The three pulses are added to generate a fixed sound source vector.

【０００７】各トラックはパルスを配置できる位置が異
なっており、図１４においては、第１トラックは｛0,3,
6,9,12,15,18,21｝の８箇所のうちのいずれかに、第２
トラックは｛1,4,7,10,13,16,19,22｝の８箇所のうちの
いずれかに、第３トラックは｛2,5,8,11,14,17,20,23｝
の８箇所のうちのいずれかに、それぞれ単位パルスを１
本ずつ立てることができる構成となっている。この例で
は、各パルスに対して位置が８通り、極性が正負の２通
り、であるので、位置情報３ビット、極性情報１ビッ
ト、が各音源パルスを表現するのに用いられる。したが
って、合計１２ビットの固定音源符号帳となる。Each track is different in the position where the pulse can be arranged. In FIG. 14, the first track is # 0,3,
6,9,12,15,18,21｝
The truck is located in one of the eight locations {1,4,7,10,13,16,19,22}, and the third is {2,5,8,11,14,17,20,23}
One of the eight units
The book can be set up one by one. In this example, since each pulse has eight positions and two positive and negative polarities, three bits of position information and one bit of polarity information are used to represent each sound source pulse. Therefore, it becomes a fixed excitation codebook of 12 bits in total.

【０００８】[0008]

【発明が解決しようとする課題】しかしながら、上記従
来の代数的固定符号帳を、４kbit/s以下のような低ビッ
トレート用の音声符号化装置に適用しようとした場合、
ビット数の不足からどのトラックにも含まれない位置
（パルスを立てない点）が多くなったり、極性情報をパ
ルス毎に割り当てられなくなったりするという状況が生
じ、急速に符号化音声品質が劣化するという問題があ
る。特に、４kbit/s以下のようなレートに適用するため
には、各トラック内の位置候補数の削減に加えて音源パ
ルスの本数も減らす必要が生じる。However, when the above-mentioned conventional algebraic fixed codebook is applied to a speech coding apparatus for a low bit rate such as 4 kbit / s or less,
Due to the lack of bits, the number of positions that are not included in any track (points where no pulse is formed) increases, or the polarity information cannot be assigned to each pulse, and the coded voice quality rapidly deteriorates. There is a problem. In particular, in order to apply to a rate of 4 kbit / s or less, it is necessary to reduce the number of sound source pulses in addition to the reduction of the number of position candidates in each track.

【０００９】音源パルス数が少ないほどパルス本数削減
による品質劣化も大きくなるので、できるだけ音源パル
ス数の本数を確保しつつ、多くの位置候補を各トラック
でカバーさせることが、代数的固定符号帳を用いた低ビ
ットレートＣＥＬＰ型音声符号化装置の高性能化におい
て重要な課題となる。[0009] The smaller the number of excitation pulses, the greater the quality degradation due to the reduction in the number of pulses. Therefore, it is necessary to ensure that the number of excitation pulses is as large as possible and to cover a large number of position candidates with each track. This is an important issue in improving the performance of the low bit rate CELP type speech coding apparatus used.

【００１０】本発明はかかる点に鑑みてなされたもので
あり、音源パルス数の本数を確保しつつ低ビットレート
に対応することができる固定音源符号帳及び音声符号化
／復号化装置を提供することを目的とする。The present invention has been made in view of the above points, and provides a fixed excitation codebook and a speech encoding / decoding apparatus capable of coping with a low bit rate while securing the number of excitation pulses. The purpose is to:

【００１１】[0011]

【課題を解決するための手段】本発明の固定音源符号帳
は、音源ベクトルを部分的に表現する比較的時間分解能
が高い第１の音源符号帳と、音源ベクトル全体を表現す
る比較的時間分解能の低い第２の音源符号帳と、を具備
する構成を採る。SUMMARY OF THE INVENTION A fixed excitation codebook according to the present invention comprises a first excitation codebook which partially expresses an excitation vector and has a relatively high time resolution, and a relatively time resolution which expresses the entire excitation vector. And a second excitation codebook having a low power consumption.

【００１２】この構成によれば、多くのビット数が必要
となる時間分解能の高い符号帳を限定的に使用すること
により必要となるビット数を少なく抑えることが可能で
ある。また、聴覚的に重要な部分は音源ベクトルの一部
分に集中することが多いので、このような部分的に時間
分解能が高い音源符号帳でも高品質を実現することがで
きる。さらに、全体をカバーする音源符号帳も備えてい
るので、聴覚的に重要な部分がベクトル全体に散らばっ
ている場合にもある程度の対応が可能である。According to this configuration, the number of required bits can be reduced by using a codebook with a high time resolution which requires a large number of bits. In addition, since an acoustically important part is often concentrated on a part of the excitation vector, high quality can be realized even in such an excitation codebook having a partially high temporal resolution. In addition, since an excitation codebook that covers the whole is also provided, it is possible to cope to some extent even when aurally important parts are scattered throughout the vector.

【００１３】本発明の固定音源符号帳は、数本のパルス
からなる音源ベクトルを生成する音源ベクトル生成装置
に用いる固定音源符号帳であって、音源ベクトル全体に
対して限定された範囲内において各パルスを細かく配置
してなる音源ベクトルを格納した第１の音源符号帳と、
音源ベクトル全体の広範囲に渡って各パルスを粗く配置
してなる音源ベクトルを格納した第２の音源符号帳と、
を具備する構成を採る。A fixed excitation codebook according to the present invention is a fixed excitation codebook used for an excitation vector generation device for generating an excitation vector composed of several pulses, and each excitation codebook is provided within a limited range with respect to the entire excitation vector. A first excitation codebook storing excitation vectors in which pulses are finely arranged;
A second excitation codebook storing excitation vectors obtained by coarsely arranging each pulse over a wide range of the entire excitation vector;
Is adopted.

【００１４】この構成によれば、少ないビット数におい
ても音源パルス数と音源パルスの配置可能な位置とを多
く取ることが可能となる。According to this configuration, even with a small number of bits, it is possible to increase the number of excitation pulses and the positions where the excitation pulses can be arranged.

【００１５】本発明の固定音源符号帳は、隣接する２サ
ンプルに対してどちらか一方に１本のパルスを配置する
音源符号帳であって、互いに異なる極性が両サンプルに
対して割り当てられている構成を採る。The fixed excitation codebook of the present invention is an excitation codebook in which one pulse is arranged in one of two adjacent samples, and different polarities are assigned to both samples. Take the configuration.

【００１６】この構成によれば、２つの位置に対して１
ビットの極性情報を割り当てるため、従来の１つの位置
に対して１ビットの極性を割り当てる場合に比べて必要
なビット数を半減できる。また、２つの位置は隣接する
ためまとめて取り扱うことによって生じる劣化を低く抑
えることが可能である。According to this configuration, 1 is set for two positions.
Since the bit polarity information is assigned, the number of necessary bits can be halved compared to the conventional case where one bit polarity is assigned to one position. Further, since the two positions are adjacent to each other, it is possible to suppress deterioration caused by handling the two positions collectively.

【００１７】本発明の固定音源符号帳は、複数のパルス
を代数的に配置する音源符号帳であって、少なくとも２
本のパルスは接近して配置されるような組み合わせのみ
を生成する音源符号帳を上記第１の音源符号帳として具
備し、上記音源符号帳を上記第２の音源符号帳として具
備する構成を採る。The fixed excitation codebook of the present invention is an excitation codebook in which a plurality of pulses are arranged algebraically,
The present pulse adopts a configuration in which an excitation codebook that generates only combinations that are arranged close to each other is provided as the first excitation codebook, and the excitation codebook is provided as the second excitation codebook. .

【００１８】この構成によれば、少ないビット数で音源
パルス数および各パルスの配置可能位置を多く確保でき
る代数的固定音源符号帳を実現することが可能となる。According to this configuration, it is possible to realize an algebraic fixed excitation codebook that can secure a large number of excitation pulses and a large possible position of each pulse with a small number of bits.

【００１９】本発明の固定音源符号帳は、上記音源符号
帳を第１及び第２の音源符号帳とし、白色雑音的な音源
ベクトルを生成する第３の音源符号帳と、を具備する構
成を採る。The fixed excitation codebook of the present invention comprises a first excitation codebook and a second excitation codebook, and a third excitation codebook for generating an excitation vector like white noise. take.

【００２０】この構成によれば、音源パルス数の少ない
代数的固定音源符号帳では表現し難い雑音的信号（摩擦
子音等）も良好に表現することが可能となる。According to this configuration, it is possible to satisfactorily express a noisy signal (such as a fricative consonant) which is difficult to express with an algebraic fixed excitation codebook having a small number of excitation pulses.

【００２１】本発明の固定音源符号帳は、パルス音源符
号帳と、雑音音源符号帳とを備えた音源ベクトル生成装
置に用いる固定音源符号帳であって、生成される音源ベ
クトルはパルス音源符号帳か雑音音源符号帳かのいずれ
か一方から生成されたものであり、生成した音源ベクト
ルを用いて入力信号を符号化する際、符号化歪みが大き
いほど雑音音源符号帳から生成される音源ベクトルが選
ばれ易くなるようにする重み付けがなされる構成を採
る。The fixed excitation codebook of the present invention is a fixed excitation codebook used for an excitation vector generation apparatus having a pulse excitation codebook and a noise excitation codebook, wherein the generated excitation vector is a pulse excitation codebook. Or the noise excitation codebook, and when encoding the input signal using the generated excitation vector, the excitation vector generated from the noise excitation codebook increases as the coding distortion increases. A configuration is adopted in which weighting is performed so as to facilitate selection.

【００２２】この構成によれば、うまく表現できない入
力信号に対してはパルス音源ではなく雑音音源を適用す
るようにすることにより、聴覚的に自然な符号化歪とな
るようにすることが可能となる。According to this configuration, by applying a noise sound source instead of a pulse sound source to an input signal that cannot be expressed well, it is possible to obtain an acoustically natural encoding distortion. Become.

【００２３】本発明の固定音源符号帳は、上記第１及び
第２の音源符号帳をパルス音源符号帳として具備し、上
記第３の音源符号帳を雑音音源符号帳として具備する構
成を採る。The fixed excitation codebook of the present invention has a configuration in which the first and second excitation codebooks are provided as pulse excitation codebooks and the third excitation codebook is provided as a noise excitation codebook.

【００２４】この構成によれば、雑音性信号に対する上
記固定音源符号帳の性能を大きく改善することが可能で
ある。According to this configuration, it is possible to greatly improve the performance of the fixed excitation codebook for a noisy signal.

【００２５】本発明の固定音源ベクトル生成方法は、各
パルスの取り得る位置が細かく設定されており、少なく
とも２本のパルスが接近するように制限されているパル
ス音源を生成する第１の音源生成工程と、各パルスの取
り得る位置が粗く設定されており、各パルスの組み合わ
せに何ら制限が加えられないパルス音源を生成する第２
の音源生成工程と、ランダムな雑音信号からなる音源を
生成する第３の音源生成工程と、符号化歪みが大きいほ
ど第３の音源生成工程で生成された音源ベクトルが選択
され易くなるように重み付けを行う重み付け工程と、を
備える。According to the fixed sound source vector generation method of the present invention, a first sound source generation for generating a pulse sound source in which positions where each pulse can be taken is set finely and at least two pulses are restricted so as to approach each other. The second step is to generate a pulse sound source in which the steps and possible positions of each pulse are roughly set, and no limitation is imposed on the combination of each pulse.
, A third excitation generation step for generating an excitation composed of random noise signals, and weighting such that the excitation vector generated in the third excitation generation step is more easily selected as the coding distortion increases. Performing a weighting step.

【００２６】この方法によれば、少ないビットで音源パ
ルス数と音源パルスを配置可能な位置を多くとることが
可能となり、雑音的な信号に対しても主観的品質を改善
することが可能となる。According to this method, the number of excitation pulses and the positions where the excitation pulses can be arranged can be increased with a small number of bits, and the subjective quality can be improved even for noise-like signals. .

【００２７】本発明の記憶媒体は、音源生成プログラム
を格納し、コンピュータにより読み取り可能な記録媒体
であって、前記音源生成プログラムは、各パルスの取り
得る位置が細かく設定されており、少なくとも２本のパ
ルスが接近するように制限されたパルス音源を生成する
第１の音源生成手順と、各パルスの取り得る位置が粗く
設定されており、各パルスの組み合わせには何ら制限が
加えられないパルス音源を生成する第２の音源生成手順
と、ランダムな雑音信号からなる音源を生成する第３の
音源生成手順と、符号化歪みが大きいほど第３の音源生
成手順で生成された音源ベクトルが選択され易くなるよ
うに重み付けを行う重み付け手順と、を有する構成を採
る。The storage medium according to the present invention is a computer-readable recording medium storing a sound source generation program, wherein the sound source generation program has at least two positions where each pulse can be taken. A first sound source generation procedure for generating a pulse sound source that is restricted so that the pulses approach each other, and a pulse sound source in which the possible positions of each pulse are roughly set and no limitation is applied to the combination of each pulse , A third excitation generation procedure for generating an excitation composed of random noise signals, and an excitation vector generated in the third excitation generation procedure is selected as the coding distortion increases. And a weighting procedure for performing weighting so as to make it easier.

【００２８】この構成によれば、記憶媒体において、上
記いずれかの作用効果を得ることができる。According to this configuration, any one of the above-described functions and effects can be obtained in the storage medium.

【００２９】本発明の音声符号化装置は、上記固定音源
符号帳と、入力音声信号の周期成分を表す適応音源符号
帳と、入力音声信号のスペクトル特性を表すパラメータ
を量子化・符号化する手段と、前記固定音源符号帳と前
記適応音源符号帳とから生成される音源ベクトルと前記
パラメータとを用いて合成音声信号を合成する手段と、
入力音声信号と前記合成音声信号との歪みが小さくなる
ように前記固定符号帳と前記適応符号帳からの出力を決
定する手段と、を具備する構成をとる。The speech coding apparatus of the present invention comprises the above fixed excitation codebook, an adaptive excitation codebook representing a periodic component of the input speech signal, and means for quantizing and encoding parameters representing the spectral characteristics of the input speech signal. Means for synthesizing a synthesized speech signal using the excitation vector and the parameter generated from the fixed excitation codebook and the adaptive excitation codebook,
Means for determining an output from the fixed codebook and the adaptive codebook such that distortion between the input speech signal and the synthesized speech signal is reduced.

【００３０】この構成によれば、上記いずれかの作用効
果を固定音源符号帳で得られるので、低ビットレートで
高品質な音声符号化を実現可能となる。According to this configuration, any one of the above-described functions and effects can be obtained by the fixed excitation codebook, so that high-quality speech coding at a low bit rate can be realized.

【００３１】本発明の音声復号化装置は、上記固定音源
符号帳と、合成音声信号の周期成分を表す適応音源符号
帳と、音声符号化装置によって符号化されたスペクトル
特性を表すパラメータを復号化する手段と、前記音声符
号化装置において決定された音源ベクトルを固定音源符
号帳と適応音源符号帳とから復号し、復号された音源ベ
クトルと前記パラメータとから合成音声信号を合成する
手段と、を具備する構成を採る。The speech decoding apparatus of the present invention decodes the fixed excitation codebook, the adaptive excitation codebook representing the periodic component of the synthesized speech signal, and the parameters representing the spectral characteristics encoded by the speech encoding apparatus. Means for decoding the excitation vector determined in the speech encoding apparatus from the fixed excitation codebook and the adaptive excitation codebook, and synthesizing a synthesized speech signal from the decoded excitation vector and the parameter. The configuration provided is adopted.

【００３２】この構成によれば、上記いずれかの作用効
果を固定音源符号帳で得られるので、低ビットレートで
高品質な音声信号を復号することが可能となる。According to this configuration, any one of the above-described functions and effects can be obtained with the fixed excitation codebook, so that a high-quality speech signal can be decoded at a low bit rate.

【００３３】本発明の音声信号送信装置は、上記構成の
音声符号化装置を備えたことを特徴とする。また、本発
明の音声信号受信装置は、上記構成の音声復号化装置を
備えたことを特徴とする。[0033] A speech signal transmitting apparatus according to the present invention includes the speech coding apparatus having the above-described configuration. Further, an audio signal receiving apparatus according to the present invention includes the audio decoding apparatus having the above configuration.

【００３４】本発明の基地局装置は、上記構成の音声信
号送信装置及び／又は音声信号受信装置を備えたことを
特徴とする。また、本発明の移動局装置は、上記構成の
音声信号送信装置及び／又は音声信号受信装置を備えた
ことを特徴とする。[0034] A base station apparatus according to the present invention includes the voice signal transmitting apparatus and / or the voice signal receiving apparatus having the above-described configuration. Further, a mobile station device according to the present invention includes the voice signal transmitting device and / or the voice signal receiving device configured as described above.

【００３５】これらの構成により、ディジタル無線通信
システムにおいて、低ビットレートであっても高性能化
を図ることが可能である。With these configurations, it is possible to improve the performance of the digital radio communication system even at a low bit rate.

【００３６】[0036]

【発明の実施の形態】以下、本発明の実施の形態につい
て、添付図面を参照して詳細に説明する。図１は、本発
明の実施の形態に係る音声符号化／復号化装置を備えた
送信装置及び受信装置の構成を示すブロック図である。Embodiments of the present invention will be described below in detail with reference to the accompanying drawings. FIG. 1 is a block diagram illustrating a configuration of a transmission device and a reception device including a speech encoding / decoding device according to an embodiment of the present invention.

【００３７】図１において、音声信号は、送信装置の入
力装置１０１、例えばマイクによって電気的信号に変換
され、Ａ／Ｄ変換装置１０２に出力される。Ａ／Ｄ変換
装置１０２は、入力装置１０１から出力された（アナロ
グ）信号をディジタル信号に変換し、このディジタル信
号を音声符号化装置１０３へ出力する。In FIG. 1, an audio signal is converted into an electric signal by an input device 101 of a transmitting device, for example, a microphone, and is output to an A / D converter 102. The A / D converter 102 converts an (analog) signal output from the input device 101 into a digital signal, and outputs the digital signal to the speech encoder 103.

【００３８】音声符号化装置１０３は、Ａ／Ｄ変換装置
１０２から出力されたディジタル信号を後述する音声符
号化方法を用いて符号化して、得られた音声符号化情報
をＲＦ変調装置１０４へ出力する。Speech coding apparatus 103 encodes the digital signal output from A / D conversion apparatus 102 by using a speech coding method described later, and outputs the obtained speech coded information to RF modulation apparatus 104. I do.

【００３９】ＲＦ変調装置１０４は、音声符号化装置１
０３から出力された音声符号化情報を電波などの伝播媒
体に載せて送出するための信号に変換し、その信号を送
信アンテナ１０５へ出力する。送信アンテナ１０５は、
ＲＦ変調装置１０４から出力された出力信号を電波（Ｒ
Ｆ信号）として送出する。The RF modulation device 104 is a voice coding device 1
The audio coded information output from the receiver 03 is converted into a signal to be transmitted on a propagation medium such as a radio wave, and the signal is output to the transmitting antenna 105. The transmitting antenna 105
The output signal output from the RF modulator 104 is converted into a radio wave (R
F signal).

【００４０】ＲＦ信号は、受信装置の受信アンテナ１０
６によって受信され、ＲＦ復調装置１０７へ出力され
る。ＲＦ復調装置１０７は、受信アンテナ１０６から出
力されたＲＦ信号から音声符号化情報を復調し、その音
声符号化情報を音声復号化装置１０８へ出力する。The RF signal is transmitted to the receiving antenna 10 of the receiving apparatus.
6 and output to the RF demodulator 107. RF demodulation device 107 demodulates audio encoded information from the RF signal output from receiving antenna 106 and outputs the audio encoded information to audio decoding device 108.

【００４１】音声復号化装置１０８は、ＲＦ復調装置１
０７から出力された音声符号化情報から後述する音声復
号化方法を用いて音声信号を復号し、復号化された音声
信号をＤ／Ａ変換装置１０９へ出力する。Ｄ／Ａ変換装
置１０９は、音声復号化装置１０８から出力されたディ
ジタル音声信号をアナログの電気的信号に変換し、この
電気的信号を出力装置１１０、例えばマイクへ出力す
る。出力装置１１０は、電気的信号を空気の振動に変換
し、音波として人間の耳に聴こえるように出力する。The audio decoding device 108 is an RF demodulator 1
The audio signal is decoded using the audio decoding method described later from the audio coded information output from 07, and the decoded audio signal is output to the D / A converter 109. The D / A converter 109 converts the digital audio signal output from the audio decoder 108 into an analog electric signal, and outputs the electric signal to an output device 110, for example, a microphone. The output device 110 converts the electric signal into vibration of air and outputs the sound as sound waves so as to be audible to human ears.

【００４２】上記のような構成の音声信号送信装置及び
受信装置の少なくとも一方を備えることにより、移動通
信システムにおける基地局装置及び移動端末装置を構成
することができる。By providing at least one of the voice signal transmitting device and the receiving device having the above configuration, a base station device and a mobile terminal device in a mobile communication system can be configured.

【００４３】音声信号の送信装置における音声符号化装
置１０３は、図２に示す構成を有する。図２は、本発明
の実施の形態に係る音声符号化装置の構成を示すブロッ
ク図である。Speech encoding apparatus 103 in the speech signal transmitting apparatus has the configuration shown in FIG. FIG. 2 is a block diagram showing a configuration of the speech coding apparatus according to the embodiment of the present invention.

【００４４】図２において、入力音声信号は、図１のＡ
／Ｄ変換装置１０２から出力される信号であり、前処理
部２００に入力される。前処理部２００では、ＤＣ成分
（直流成分）を取り除くハイパスフィルタ処理、後続す
る符号化処理の性能改善につながるような波形整形処
理、及び／又はプリエンファシス処理を行い、処理後の
信号（Ｘin）をＬＰＣ分析部２０１、加算器２０４、及
びパラメータ決定部２１２に出力する。In FIG. 2, the input audio signal is represented by A in FIG.
The signal is output from the / D conversion device 102 and is input to the preprocessing unit 200. The pre-processing unit 200 performs a high-pass filter process for removing a DC component (a DC component), a waveform shaping process that leads to an improvement in the performance of a subsequent encoding process, and / or a pre-emphasis process, and processes the signal (Xin). Is output to the LPC analysis unit 201, the adder 204, and the parameter determination unit 212.

【００４５】ＬＰＣ分析部２０１は、Ｘinを用いて線形
予測分析を行い、分析結果（線形予測係数）をＬＰＣ量
子化部２０２へ出力する。ＬＰＣ量子化部２０２は、Ｌ
ＰＣ分析部２０１から出力された線形予測係数（ＬＰ
Ｃ）の量子化処理を行い、量子化ＬＰＣを合成フィルタ
２０３へ出力すると共に、前記量子化ＬＰＣを表す符号
Ｌを多重化部２１３へ出力する。LPC analysis section 201 performs a linear prediction analysis using Xin, and outputs an analysis result (linear prediction coefficient) to LPC quantization section 202. LPC quantization section 202 calculates LPC
The linear prediction coefficient (LP) output from the PC analysis unit 201
C), and outputs the quantized LPC to the synthesis filter 203 and the code L representing the quantized LPC to the multiplexing unit 213.

【００４６】合成フィルタ２０３は、前記量子化ＬＰＣ
をフィルタ係数と加算器２１０から出力される駆動音源
とを用いてフィルタ合成を行い、合成信号を加算器２０
４へ出力する。加算器２０４は、前記Ｘinと前記合成信
号との誤差信号を算出し、聴覚重み付け部２１１へ出力
する。The synthesis filter 203 is the same as the quantization LPC
Is subjected to filter synthesis using the filter coefficients and the driving sound source output from the adder 210, and the synthesized signal is added to the adder 20.
Output to 4. The adder 204 calculates an error signal between the Xin and the synthesized signal, and outputs the error signal to the auditory weighting unit 211.

【００４７】聴覚重み付け部２１１は、加算器２０４か
ら出力された誤差信号に対して聴覚的な重み付けを行
い、聴覚重み付け領域での前記Ｘinと前記合成信号との
歪みを算出し、パラメータ決定部２１２へ出力する。The auditory weighting section 211 performs auditory weighting on the error signal output from the adder 204, calculates the distortion between the Xin and the synthesized signal in an auditory weighting area, and obtains a parameter determining section 212. Output to

【００４８】パラメータ決定部２１２は、聴覚重み付け
部２１１から出力された前記符号化歪みが最小となるよ
うに、適応音源符号帳２０５、固定音源符号帳２０７、
及び量子化利得生成部２０６から生成されるべき信号を
決定する。The parameter deciding section 212 controls the adaptive excitation codebook 205, the fixed excitation codebook 207, and the fixed excitation codebook 207 so that the coding distortion output from the auditory weighting section 211 is minimized.
And a signal to be generated from the quantization gain generation unit 206.

【００４９】なお、聴覚重み付け部２１１から出力され
る符号化歪みの最小化だけでなく、前記Ｘinを用いた別
の符号化歪みを併用して前記３つの処理部から生成され
るべき信号を決定することにより、さらに符号化性能を
改善することもできる。It is to be noted that not only the coding distortion output from the auditory weighting unit 211 is minimized, but also a signal to be generated from the three processing units is determined by using another coding distortion using the Xin. By doing so, the coding performance can be further improved.

【００５０】適応音源符号帳２０５は、過去に加算器２
１０によって出力された音源信号をバッファリングして
おり、パラメータ決定部２１２から出力された信号
（Ａ）によって特定される位置から適応音源ベクトルを
切り出して乗算器２０８へ出力する。The adaptive excitation codebook 205 has been
The sound source signal output by the buffer 10 is buffered, and an adaptive sound source vector is cut out from the position specified by the signal (A) output from the parameter determination unit 212 and output to the multiplier 208.

【００５１】固定音源符号帳２０７は、パラメータ決定
部２１２から出力された信号（Ｆ）によって特定される
形状を有するベクトルを乗算器２０９へ出力する。量子
化利得生成部２０６は、パラメータ決定部２１２から出
力された信号（Ｇ）によって特定される適応音源利得と
固定音源利得とをそれぞれ乗算器２０８と乗算器２０９
へ出力する。[0051] Fixed excitation codebook 207 outputs to multiplier 209 a vector having a shape specified by signal (F) output from parameter determination section 212. Quantization gain generating section 206 multiplies adaptive excitation gain and fixed excitation gain specified by signal (G) output from parameter determining section 212 by multipliers 208 and 209, respectively.
Output to

【００５２】乗算器２０８は、量子化利得生成部２０６
から出力された量子化適応音源利得を、適応音源符号帳
２０５から出力された適応音源ベクトルに乗じて、加算
器２１０へ出力する。乗算器２０９は、量子化利得生成
部２０６から出力された量子化固定音源利得を、固定音
源符号帳２０７から出力された固定音源ベクトルに乗じ
て、加算器２１０へ出力する。The multiplier 208 includes a quantization gain generation unit 206
Are multiplied by the adaptive excitation vector output from adaptive excitation codebook 205 and output to adder 210. Multiplier 209 multiplies the fixed excitation vector output from fixed excitation codebook 207 by the quantized fixed excitation gain output from quantization gain generating section 206, and outputs the result to adder 210.

【００５３】加算器２１０は、利得乗算後の適応音源ベ
クトルと固定音源ベクトルとをそれぞれ乗算器２０８と
乗算器２０９から入力し、ベクトル加算をして合成フィ
ルタ２０３及び適応音源符号帳２０５へ出力する。Adder 210 receives the adaptive excitation vector and the fixed excitation vector after gain multiplication from multipliers 208 and 209, respectively, adds the vectors, and outputs the result to synthesis filter 203 and adaptive excitation codebook 205. .

【００５４】最後に、多重化部２１３は、ＬＰＣ量子化
部２０２から量子化ＬＰＣを表す符号Ｌを入力し、パラ
メータ決定部２１２から適応音源ベクトルを表す符号
Ａ、固定音源ベクトルを表す符号Ｆ、及び量子化利得を
表す符号Ｇを入力し、これらの情報を多重化して符号化
情報として伝送路へ出力する。Finally, the multiplexing unit 213 receives the code L representing the quantized LPC from the LPC quantization unit 202, the code A representing the adaptive excitation vector, the code F representing the fixed excitation vector, And a code G representing a quantization gain, the information is multiplexed and output to the transmission path as coded information.

【００５５】上述した音声符号化装置は、固定音源符号
帳２０７の具体的構成とパラメータ決定部２１２にその
特徴を有する。図３及び図４は固定音源符号帳２０７の
構成を示すブロック図であり、図５はパラメータ決定部
２１２の構成を示すブロック図である。The above speech coding apparatus has a specific configuration of fixed excitation codebook 207 and features of parameter determination section 212. 3 and 4 are block diagrams showing a configuration of fixed excitation codebook 207, and FIG. 5 is a block diagram showing a configuration of parameter determining section 212.

【００５６】図３において、第１の音源符号帳３０１
は、限定された範囲内に細かい精度で音源パルスを配置
した音源ベクトルを生成する音源符号帳であり、第２の
音源符号帳３０２は、広い範囲に粗い精度で音源パルス
を配置した音源ベクトルを生成する音源符号帳であり、
切替スイッチ３０３は、第１の音源符号帳３０１から生
成される音源ベクトルと第２の音源符号帳３０２から生
成される音源ベクトルとのいずれか一方を選択するため
のスイッチである。In FIG. 3, first excitation codebook 301
Is an excitation codebook that generates an excitation vector in which excitation pulses are arranged within a limited range with fine precision, and a second excitation codebook 302 generates an excitation vector in which excitation pulses are arranged with coarse accuracy over a wide range. A sound source codebook to be generated,
The changeover switch 303 is a switch for selecting one of an excitation vector generated from the first excitation codebook 301 and an excitation vector generated from the second excitation codebook 302.

【００５７】この固定音源符号帳は、図２におけるパラ
メータ決定部２１２から入力される信号（Ｆ）で特定さ
れる固定音源ベクトルを、第１の音源符号帳３０１又は
第２の音源符号帳３０２により生成し、切替スイッチ３
０３を介して固定音源ベクトルとして出力する。In the fixed excitation codebook, a fixed excitation vector specified by signal (F) input from parameter determining section 212 in FIG. 2 is converted by first excitation codebook 301 or second excitation codebook 302. Generate and changeover switch 3
The signal is output as a fixed sound source vector via the signal line 03.

【００５８】図４において、第１の音源符号帳４０１と
第２の音源符号帳４０２は、図３における第１の音源符
号帳３０１と第２の音源符号帳３０２とにそれぞれ対応
し、同じ構成のものである。図４に示す固定音源符号帳
と図３に示す固定音源符号帳の違いは、第３の音源符号
帳４０３を具備することである。なお、図４において参
照符号４０４は切替スイッチを示す。In FIG. 4, first excitation codebook 401 and second excitation codebook 402 correspond to first excitation codebook 301 and second excitation codebook 302 in FIG. 3, respectively, and have the same configuration. belongs to. The difference between the fixed excitation codebook shown in FIG. 4 and the fixed excitation codebook shown in FIG. 3 is that a third excitation codebook 403 is provided. In FIG. 4, reference numeral 404 denotes a changeover switch.

【００５９】第１及び第２の音源符号帳４０１，４０２
が少ない本数（２〜４本程度）の音源パルスから成る固
定音源ベクトルを生成するのに対して、第３の音源符号
帳４０３は多数の音源パルスや乱数系列から成る固定音
源ベクトルを生成する。First and second excitation codebooks 401 and 402
Generates a fixed excitation vector composed of a small number (about 2 to 4) of excitation pulses, whereas the third excitation codebook 403 generates a fixed excitation vector composed of a large number of excitation pulses and a random number sequence.

【００６０】決められた種類の白色ガウス雑音ベクトル
を格納しておき、その中から適切なものを１つ選んで固
定音源ベクトルとして出力するものが最も基本的かつ一
般的なものである。この他に多数（少なくとも１０本程
度以上）音源パルスをランダムな極性をつけてランダム
に並べたものなども一般的である。このような第３の音
源符号帳を備えることにより、少数パルス音源では表現
できない雑音的な信号を表現することが可能となる。The most basic and general one is to store a predetermined type of white Gaussian noise vector, select an appropriate one from the stored white Gaussian noise vectors, and output it as a fixed sound source vector. In addition, it is also common to arrange a large number (at least about 10 or more) of excitation pulses at random with random polarity. By providing such a third excitation codebook, it is possible to represent a noise-like signal that cannot be represented by a small number of pulse excitations.

【００６１】図３及び図４における、第１の音源符号帳
及び第２の音源符号帳を、代数的固定符号帳を用いて構
成した例について図７、図８及び図９に示す。図７は、
３トラック（３本）のパルスから固定音源ベクトルを生
成する第１の音源符号帳（３０１，４０１）の例を示す
図であり、各トラックに立てることが可能なパルスの位
置と極性が示されている。図中の数字はパルスの位置を
示している。FIGS. 7, 8 and 9 show examples in which the first excitation codebook and the second excitation codebook in FIGS. 3 and 4 are configured using algebraic fixed codebooks. FIG.
It is a figure which shows the example of the 1st excitation codebook (301,401) which produces | generates a fixed excitation vector from the pulse of three tracks (three), and shows the position and polarity of the pulse which can be set in each track. ing. The numbers in the figure indicate the positions of the pulses.

【００６２】この代数的固定音源符号帳の特徴は、各ト
ラックが隣接する２サンプルのパルス位置候補点から成
っており、前記隣接する２サンプルに対して＋と−の極
性のパルスが別々に割り当てられていることである。２
サンプルの点に対して１本のパルスを立てる立て方は全
部で４通り存在するが、前記の２種類のパルスはこの４
通りの立て方のうちパルス位置・パルス極性ともに異な
るという意味から最も類似性の低い２通りの立て方を組
み合わせたものである。The feature of this algebraic fixed excitation codebook is that each track is composed of pulse position candidate points of two adjacent samples, and pulses of positive and negative polarities are separately assigned to the two adjacent samples. It is being done. 2
There are a total of four ways to raise one pulse for a sample point.
This is a combination of the two methods having the lowest similarity in the sense that the pulse position and the pulse polarity are different from each other.

【００６３】したがって、前記４通りの立て方を２通り
に削減する場合、前記のように隣接する２サンプルに対
して別々の極性を割り当てるようなやり方が最も冗長が
ないと言える。また、２サンプルが隣接しているので、
一方のサンプル点に必要な極性のパルスを（前記のよう
な位置と極性の制限のために）立てることができない場
合でも、他方のサンプル点に（位置は１サンプルずれて
しまうが）必要な極性のパルスを立てることができ、こ
のようなパルスで本来必要なパルスの代用が可能となる
確率が高くなる。Therefore, when the above four ways are reduced to two ways, it can be said that the method of assigning different polarities to two adjacent samples as described above has the least redundancy. Also, since two samples are adjacent,
If a pulse of the required polarity cannot be set at one sample point (due to the position and polarity restrictions described above), the required polarity (although the position is shifted by one sample) at the other sample point , And the probability that such a pulse can be substituted for the pulse originally required increases.

【００６４】なお、パルス位置を表すビット数が不足す
る場合は、トラック内の全てのパルス位置候補点が隣接
する２サンプルでなければならない訳ではなく、例えば
ベクトルの後半や末尾においては候補点間の距離が２サ
ンプル以上（候補点間に１つ以上のサンプル点が存在す
る）となるトラック構成でもよい。ただし、このように
隣接しない部分においては、一方のパルスで他方の位置
に必要なパルスを代用させるような前記効果は期待でき
なくなる。When the number of bits representing the pulse position is insufficient, not all the pulse position candidate points in the track need to be two adjacent samples. The track configuration may be such that the distance of the sample is two or more samples (one or more sample points exist between the candidate points). However, in such non-adjacent portions, the above-described effect of causing one pulse to substitute a necessary pulse at the other position cannot be expected.

【００６５】上記のように構成された３つのトラックか
ら１本ずつパルスが生成され、３本のパルスから成るベ
クトルとなる。最後に生成されたベクトルに極性を乗じ
たものがこの音源符号帳からの出力ベクトルとなる。な
お、ここでは音源パルスが３本の例を示したが、いかな
る本数でも上記の考え方は適用可能である。また、最後
に乗じるベクトル全体の極性を省いた構成でも有効性は
得られる。A pulse is generated one by one from the three tracks configured as described above, and becomes a vector composed of three pulses. The last generated vector multiplied by the polarity is the output vector from the excitation codebook. Here, an example in which three sound source pulses are used has been described, but the above concept can be applied to any number of sound source pulses. Further, the effectiveness can be obtained even in a configuration in which the polarity of the entire vector to be multiplied last is omitted.

【００６６】図８は、３トラック（３本）のパルスから
固定音源ベクトルを生成する第２の音源符号帳（３０
２，４０２）の例を示す図である。トラックの構成（パ
ルス位置および極性）は一般的な代数的固定符号帳と同
一である。異なる点は、３本のパルスの組み合わせ方が
限定されている点である。FIG. 8 shows a second excitation codebook (30) for generating a fixed excitation vector from three track (three) pulses.
2, 402). The configuration of the track (pulse position and polarity) is the same as that of a general algebraic fixed codebook. The difference is that the combination of three pulses is limited.

【００６７】図８では、３本とも近い組み合わせのみを
生成する例を示している。図中の各トラックに示された
破線はパルス位置の候補であるが、例えば１番目のトラ
ックでサンプル点が３であるパルスを選択した場合（図
では実線で示されている）、２番目のトラックのパルス
位置は４か７に、３番目のトラックのパルス位置は５か
８に、限定され、これらの位置候補の組み合わせでしか
音源ベクトルを生成できない。すなわち、先頭となるパ
ルスの直後から２つの位置候補だけを用いて音源ベクト
ルを生成する構成となっている。ここでは位置候補が２
箇所であるが、ビット数などに応じて位置候補が３箇所
や４箇所であっても良い。FIG. 8 shows an example in which only combinations that are close to all three are generated. The dashed line shown in each track in the figure is a candidate for a pulse position. For example, when a pulse whose sample point is 3 is selected in the first track (shown by a solid line in the figure), the second track is shown. The pulse position of the track is limited to 4 or 7, and the pulse position of the third track is limited to 5 or 8, and a sound source vector can be generated only by a combination of these position candidates. That is, the sound source vector is generated using only two position candidates immediately after the leading pulse. Here, the position candidate is 2
However, the number of position candidates may be three or four depending on the number of bits.

【００６８】図９も、３トラック（３本）のパルスから
固定音源ベクトルを生成する第２の音源符号帳（３０
２，４０２）の例を示す図である。図９に示す音源符号
帳と図８に示す音源符号帳が異なる点は、３本のパルス
の組み合わせ方の限定方法が異なる点である。FIG. 9 also shows a second excitation codebook (30) for generating a fixed excitation vector from three track (three) pulses.
2, 402). The difference between the excitation codebook shown in FIG. 9 and the excitation codebook shown in FIG. 8 is that the method of limiting the combination of three pulses is different.

【００６９】図９において、第１のパルス位置が３であ
る場合、第２のパルス位置は４に、第３のパルス位置は
１１に限定される。すなわち、先頭のパルスに対して１
本は直後の一箇所、もう１本は少し離れた１ヶ所、とい
う組み合わせのベクトルのみを生成する。In FIG. 9, when the first pulse position is 3, the second pulse position is limited to 4, and the third pulse position is limited to 11. That is, 1 for the first pulse
Only the vector of the combination of a book at one place immediately after it and another book at one place slightly away is generated.

【００７０】この音源符号帳は、前述の図８で示す音源
符号帳と組み合わせて使用することを想定しているた
め、最後の離れた１箇所に立てるパルスの位置は、図８
の音源符号帳では不可能な範囲（図８の構成で限定され
た範囲より後ろに離れた範囲（この範囲がベクトル長を
超える場合はフレーム先頭へ巡回させても良い））に設
定する。Since it is assumed that this excitation codebook is used in combination with the excitation codebook shown in FIG. 8, the position of the pulse set at the last distant location is as shown in FIG.
Of the excitation codebook (a range distant from the range limited by the configuration of FIG. 8 (if the range exceeds the vector length, it may be circulated to the beginning of the frame)).

【００７１】限定するパルス位置は、前記のように１箇
所とは限らず、利用可能なビット数に応じて、２箇所や
３箇所でもよく、先頭パルスに近い２番目のパルス位置
候補数と先頭パルスから離れた３番目のパルス位置候補
数は異なっていても良い。The number of pulse positions to be limited is not limited to one as described above, but may be two or three in accordance with the number of available bits. The number of third pulse position candidates distant from the pulse may be different.

【００７２】図５は、図２に示す音声符号化装置におけ
るパラメータ決定部２１２の構成を示すブロック図であ
る。図５において、まず、適応音源ベクトル選択部５０
１が、図２における聴覚重み付け部２１１からの出力が
最も小さくなるような適応音源ベクトルを適応音源符号
帳２０５から見つけ出し、この適応音源ベクトルに対応
する符号Ａを出力する。この段階では固定音源符号帳か
らは何も出力されず、適応音源符号帳のみで合成フィル
タ２０３を駆動する。また、適応音源ベクトルに乗じる
利得は計算により求められた理想的な利得を用いる。FIG. 5 is a block diagram showing a configuration of parameter determining section 212 in the speech coding apparatus shown in FIG. In FIG. 5, first, the adaptive excitation vector selection unit 50
1 finds an adaptive excitation vector that minimizes the output from the auditory weighting unit 211 in FIG. 2 from the adaptive excitation codebook 205, and outputs a code A corresponding to this adaptive excitation vector. At this stage, nothing is output from the fixed excitation codebook, and synthesis filter 203 is driven only by the adaptive excitation codebook. Further, as the gain multiplied by the adaptive excitation vector, an ideal gain obtained by calculation is used.

【００７３】次に、適応音源ベクトルは、前記適応音源
ベクトル選択部５０１で選択された適応音源ベクトルに
固定した上で、固定音源ベクトル選択部５０２が、聴覚
重み付け部２１１からの出力（重みつき誤差）が最も小
さくなるような固定音源ベクトルを固定音源符号帳２０
７から見つけ出し、この固定音源ベクトルに対応する符
号Ｆを出力する。この段階では既に選択されている適応
音源ベクトル及び新たに選択された固定音源ベクトルに
乗じる利得は計算により求められた理想的な利得を用い
る。また、前記重みつき誤差の最小化だけでなく、前処
理後の入力信号Ｘinも併用して固定音源ベクトルの選択
を行っても良い。Next, after the adaptive excitation vector is fixed to the adaptive excitation vector selected by the adaptive excitation vector selection section 501, the fixed excitation vector selection section 502 outputs the output (weighted error) from the auditory weighting section 211. ) Is fixed to the fixed excitation codebook 20.
7, and outputs a code F corresponding to the fixed excitation vector. At this stage, an ideal gain obtained by calculation is used as a gain for multiplying the already selected adaptive excitation vector and the newly selected fixed excitation vector. Further, the fixed excitation vector may be selected not only by minimizing the weighted error but also by using the input signal Xin after the preprocessing.

【００７４】次に、適応音源ベクトルと固定音源ベクト
ルを、前記のように選択されたものに固定した上で、両
ベクトルに乗じる利得の量子化を行う。音源利得量子化
部５０３は、前記重み付き誤差が最も小さくなるよう
に、前記量子化音源利得の量子化を行い、この量子化音
源利得に対応する符号Ｇを出力する。Next, the adaptive excitation vector and the fixed excitation vector are fixed to those selected as described above, and then the quantization of the gain multiplied by both vectors is performed. The excitation gain quantization unit 503 quantizes the quantized excitation gain so as to minimize the weighted error, and outputs a code G corresponding to the quantized excitation gain.

【００７５】図５に示すパラメータ決定部は、固定音源
ベクトル選択部５０２にその特徴を有する。図６は、固
定音源ベクトル選択部５０２の構成を示すブロック図で
ある。図６において、第１の固定音源ベクトル選択部６
０１は、重みつき誤差を最小とする第１の固定音源ベク
トルを第１の音源符号帳４０１から選択し、選択部６０
４へ出力する。第２の固定音源ベクトル選択部６０２
は、重みつき誤差を最小とする第２の固定音源ベクトル
を第２の音源符号帳４０２の中から選択し、選択部６０
４へ出力する。The parameter deciding section shown in FIG. 5 has the feature of fixed sound source vector selecting section 502. FIG. 6 is a block diagram illustrating a configuration of the fixed sound source vector selection unit 502. In FIG. 6, a first fixed sound source vector selecting unit 6
01 selects the first fixed excitation vector that minimizes the weighted error from the first excitation codebook 401,
Output to 4. Second fixed sound source vector selection unit 602
Selects the second fixed excitation vector that minimizes the weighted error from the second excitation codebook 402,
Output to 4.

【００７６】選択部６０４は、第１の固定音源ベクトル
と、第２の固定音源ベクトルと、で重みつき誤差を比較
し、重みつき誤差が小さくなる方の固定音源ベクトルを
選択し、これを重みつき選択部６０５へ出力する。The selecting unit 604 compares the weighted error between the first fixed sound source vector and the second fixed sound source vector, selects the fixed sound source vector with the smaller weighted error, and repeats this. The result is output to the attachment selector 605.

【００７７】第３の固定音源ベクトル選択部６０３は、
重みつき誤差を最小とする第３の固定音源ベクトルを第
３の音源符号帳４０３の中から選択し、これを重みつき
選択部６０５へ出力する。The third fixed sound source vector selection unit 603
A third fixed excitation vector that minimizes the weighted error is selected from third excitation codebook 403, and is output to weighted selection section 605.

【００７８】重みつき選択部６０５は、選択部６０４か
ら出力された第１又は第２の固定音源ベクトルと、前記
第３の固定音源ベクトルと、のそれぞれを用いて音声信
号を合成した場合のＷＳＮＲ（前処理後の入力信号Ｘin
をＳ、重みつき誤差をＮとするＳＮ比）を計算し、この
ＷＳＮＲの値に応じて２つの固定音源ベクトルのいずれ
か一方を選択し、その固定音源ベクトルに対応する符号
Ｆを出力する。重みつき選択部６０５の具体的な選択動
作については後述する。The weighted selection section 605 generates a WSNR when a speech signal is synthesized using each of the first or second fixed excitation vector output from the selection section 604 and the third fixed excitation vector. (Input signal Xin after preprocessing
Is calculated as S, and a weighted error is set as N). One of the two fixed excitation vectors is selected according to the value of the WSNR, and a code F corresponding to the fixed excitation vector is output. A specific selection operation of the weighted selection unit 605 will be described later.

【００７９】図１０は、重みつき選択部６０５の選択基
準を説明する図である。図１０において、横軸は第３の
固定音源ベクトル選択部６０３で選択された第３の固定
音源ベクトルを用いて合成した音声信号の前記ＷＳＮＲ
の値[ｄＢ]を示し、縦軸は選択部６０４で選択された第
１もしくは第２の固定音源ベクトルを用いて合成した音
声信号の前記ＷＳＮＲの値[ｄＢ]を示し、それぞれＳＮ
Ｒｎ、ＳＮＲｐとして示している。FIG. 10 is a view for explaining selection criteria of the weighted selection section 605. 10, the horizontal axis represents the WSNR of the audio signal synthesized using the third fixed sound source vector selected by the third fixed sound source vector selection unit 603.
The vertical axis indicates the WSNR value [dB] of the audio signal synthesized using the first or second fixed excitation vector selected by the selection unit 604, and the SN
They are shown as Rn and SNRp.

【００８０】重みつき距離のみの大小で最適固定音源ベ
クトルを選択する場合は、図１０中の直線ＳＮＲｎ＝Ｓ
ＮＲｐの上側にあるか下側にあるかで選択を行うのと等
価である。すなわち、図１０中の直線ＳＮＲｐ＝ＳＮＲ
ｎの下側の領域では、前記第３の固定音源ベクトルを用
いた方がＷＳＮＲが高くなるので、第３の固定音源ベク
トルが最終的な固定音源ベクトルとして選択され、直線
ＳＮＲｐ＝ＳＮＲｎの上側の領域では、前記第１もしく
は第２の固定音源ベクトルを用いた方がＷＳＮＲが高く
なるので、第１もしくは第２の固定音源ベクトルが最終
的な固定音源ベクトルとして選択される。When selecting the optimal fixed sound source vector based on only the weighted distance, the straight line SNRn = S in FIG.
This is equivalent to making a selection depending on whether it is above or below NRp. That is, the straight line SNRp = SNR in FIG.
In the region below n, the WSNR becomes higher when the third fixed sound source vector is used. Therefore, the third fixed sound source vector is selected as the final fixed sound source vector, and the upper side of the straight line SNRp = SNRn In the region, the WSNR becomes higher when the first or second fixed excitation vector is used, so that the first or second fixed excitation vector is selected as the final fixed excitation vector.

【００８１】しかしながら、前記２種類の固定音源ベク
トルのどちらを用いてもＷＳＮＲの絶対値が低い場合
は、理想的な固定音源ベクトルが白色雑音的であるよう
な場合が多い。一方で、このような白色雑音的な信号を
パルス音源（第１もしくは第２の固定音源符号帳）で符
号化すると、雑音的音源（第３の固定音源符号帳）で符
号化した場合に比べてＳＮ比は若干高くなる傾向がある
ものの、主観的にはジリジリしたような雑音となり品質
劣化の要因となることが知られている。However, when the absolute value of the WSNR is low regardless of which of the above two types of fixed sound source vectors is used, the ideal fixed sound source vector often looks like white noise. On the other hand, when such a white noise-like signal is encoded by a pulse excitation (first or second fixed excitation codebook), compared to the case of encoding by a noise-like excitation (third fixed excitation codebook). Although the SN ratio tends to be slightly higher, it is known that noise becomes subjectively jerky and causes quality deterioration.

【００８２】そこで、このような低ＳＮ比の領域では、
前記第３の固定音源ベクトルが最終的な固定音源ベクト
ルとして選択され易くなるように、判定の境界線として
直線ＳＮＲｐ＝ＳＮＲｎの他に直線ＳＮＲｐ＝（（Ａ−
Ｂ）／Ａ）＊ＳＮＲｎ＋Ｂを用意し、低ＳＮ（ＷＳＮ）
時には、この後者の直線を判定境界とするようにする。
ただし、音声の立ち上がり部などは低ＳＮ比になる場合
も多く、このような立ち上がり部においても判定境界を
前記後者の直線を判定境界とすることは望ましくない。
したがって、このような場合に適応するために、有声区
間かどうかを別途判定する手段を設け、有声区間でない
と判定された場合に上記のような重みつき選択処理を動
作させるのが望ましい。Thus, in such a low SN ratio region,
In order to make it easier for the third fixed sound source vector to be selected as the final fixed sound source vector, in addition to the straight line SNRp = SNRn, the straight line SNRp = ((A−
B) / A) * SNRn + B is prepared and low SN (WSN)
Sometimes, the latter straight line is used as a determination boundary.
However, the rising portion of the voice often has a low SN ratio, and it is not desirable to use the latter straight line as the determination boundary even in such a rising portion.
Therefore, in order to adapt to such a case, it is desirable to provide a means for separately determining whether or not the section is a voiced section, and to operate the above-described weighted selection processing when it is determined that the section is not a voiced section.

【００８３】なお、本実施の形態では、図７〜図９に示
す音源符号帳及びガウス雑音のような雑音音源符号帳を
組み合わせて用いる構成について説明したが、前記音源
符号帳のうちどれか１種類の音源符号帳のみを用いる構
成も可能であり、２種類以上の音源符号帳を組み合わせ
て用いる構成も可能である。Although the present embodiment has been described with respect to a configuration using the excitation codebook shown in FIGS. 7 to 9 and a noise excitation codebook such as Gaussian noise in combination, any one of the excitation codebooks is used. A configuration using only one type of excitation codebook is possible, and a configuration using two or more types of excitation codebooks in combination is also possible.

【００８４】図１１は、固定音源符号帳探索の処理手順
を示すフロー図であり、図１２は、重みつき選択の処理
手順を示すフロー図である。FIG. 11 is a flowchart showing a processing procedure of the fixed excitation codebook search, and FIG. 12 is a flowchart showing a processing procedure of the weighted selection.

【００８５】図１１において、まず、ステップ（以下、
ＳＴと省略する）１１０１で第１の音源符号帳探索が行
われ、第１の音源ベクトルが選択される。次に、ＳＴ１
１０２において、第２の音源符号帳探索が行われ、第２
の音源ベクトルが選択される。この時点で第１と第２の
いずれか一方（重みつき誤差が小さくなる方）がパルス
音源ベクトル候補として選択される。In FIG. 11, first, a step (hereinafter, referred to as a step)
At 1101, a first excitation codebook search is performed, and a first excitation vector is selected. Next, ST1
At 102, a second excitation codebook search is performed.
Are selected. At this point, one of the first and second (the one with a smaller weighted error) is selected as a pulse excitation vector candidate.

【００８６】次に、ＳＴ１１０３において、第３の音源
符号帳探索が行われ、第３の音源符号ベクトル（雑音音
源ベクトル候補）が選択される。最後に、ＳＴ１１０４
において、重みつき選択が行われ、前記パルス音源ベク
トル候補と雑音音源ベクトル候補のいずれか適切な方が
固定音源ベクトルとして選択される。Next, in ST1103, a third excitation codebook search is performed, and a third excitation code vector (noise excitation vector candidate) is selected. Finally, ST1104
In, weighted selection is performed, and an appropriate one of the pulse excitation vector candidate and the noise excitation vector candidate is selected as a fixed excitation vector.

【００８７】図１２において、ＳＴ１２０１において、
パルス音源ベクトル候補を用いた場合のＷＳＮＲ（＝Ｓ
ＮＲｐ）が下記式（１）によって算出される。なお、算
出においては、厳密に式（１）にしたがう必要はなく、
式（１）と等価なものや式（１）において定数項を取り
除いたものなどを用いてもよい。In FIG. 12, in ST 1201,
WSNR (= S
NRp) is calculated by the following equation (1). In the calculation, it is not necessary to strictly follow equation (1).
An equivalent to the equation (1) or an equation obtained by removing the constant term in the equation (1) may be used.

【００８８】ＳＮＲｐ＝１０＊ｌｏｇ１０（ＳＳin／ＮＮin）式（１）ただし、ＳＳin＝Σ（Ｘin）＊（Ｘin），ＮＮin＝Σ
（Ｘin−Ｓout）＊（Ｘin−Ｓout）ここで、Ｘinは前処理後の入力信号を示し、Ｓoutは合
成フィルタ出力信号を示し、Σはベクトル長のサンプル
数の総和を意味する。SNRp = 10 * log10 (SSin / NNin) Equation (1) where SSin = Σ (Xin) * (Xin), NNin = Σ
(Xin-Sout) * (Xin-Sout) Here, Xin indicates an input signal after preprocessing, Sout indicates a synthesis filter output signal, and Σ indicates the sum of the number of samples of the vector length.

【００８９】次に、ＳＴ１２０２において、雑音音源ベ
クトル候補を用いた場合のＷＳＮＲ（＝ＳＮＲｎ）がＳ
ＮＲｐと同様にして求められる。次に、ＳＴ１２０３に
おいて、ＳＮＲｎ＞Ａ、ＳＮＲｐ＞Ａ、又は有声区間か
どうか、がチェックされ、そうであれば雑音音源ベクト
ル候補を優先する必要はなく、聴覚重みつき誤差が最小
となる候補を最終的な固定音源ベクトルとして選択す
る。そうでない場合は、ＳＴ１２０４へ進む。Next, in ST1202, the WSNR (= SNRn) when the noise source vector candidate is used is S
It is determined in the same manner as NRp. Next, in ST1203, it is checked whether SNRn> A, SNRp> A, or whether it is a voiced section. Selected as a fixed fixed sound source vector. Otherwise, the mobile terminal proceeds to ST1204.

【００９０】ＳＴ１２０４では、ＳＮＲｐ＞ＳＮＲｎ＊
（Ａ−Ｂ）／Ａ＋Ｂを満たすかどうかの判定を行い、満
たせばパルス音源ベクトル候補を最終的な固定音源ベク
トルとして選択する。満たさなければ雑音音源ベクトル
候補を最終的な固定音源ベクトルとして選択する。In ST1204, SNRp> SNRn *
It is determined whether (AB) / A + B is satisfied, and if so, a pulse excitation vector candidate is selected as a final fixed excitation vector. If not, a candidate noise source vector is selected as the final fixed source vector.

【００９１】図１３は、図１中の音声復号化装置１０８
の構成を示すブロック図である。図１３において、ＲＦ
復調装置１０７から出力された符号化情報は、多重化分
離部１３０１によって多重化されている符号化情報を個
々の符号情報に分離される。分離されたＬＰＣ符号Ｌ
は、ＬＰＣ復号化部１３０２に出力され、分離された適
応音源ベクトル符号Ａは適応音源符号帳１３０５に出力
され、分離された音源利得符号Ｇは量子化利得生成部１
３０６に出力され、分離された固定音源ベクトル符号Ｆ
は固定音源符号帳１３０７へ出力される。FIG. 13 is a block diagram showing the speech decoding apparatus 108 shown in FIG.
FIG. 3 is a block diagram showing the configuration of FIG. In FIG. 13, RF
In the coded information output from the demodulation device 107, the coded information multiplexed by the demultiplexing unit 1301 is separated into individual code information. LPC code L separated
Is output to an LPC decoding section 1302, the separated adaptive excitation vector code A is output to an adaptive excitation codebook 1305, and the separated excitation gain code G is output to the quantization gain generation section 1
306 and the separated fixed excitation vector code F
Are output to fixed excitation codebook 1307.

【００９２】ＬＰＣ復号化部１３０２は、多重化分離部
１３０１から出力された符号ＬからＬＰＣを復号し、こ
れを合成フィルタ１３０３に出力する。適応音源符号帳
１３０５は、多重化分離部１３０１から出力された符号
Ａで指定される位置から適応音源ベクトルを取り出して
乗算器１３０８へ出力する。The LPC decoding section 1302 decodes the LPC from the code L output from the demultiplexing section 1301 and outputs this to the synthesis filter 1303. Adaptive excitation codebook 1305 extracts an adaptive excitation vector from the position specified by code A output from demultiplexing section 1301, and outputs the vector to multiplier 1308.

【００９３】固定音源符号帳１３０７は、多重化分離部
１３０１から出力された符号Ｆで指定される固定音源ベ
クトルを生成し、乗算器１３０９へ出力する。量子化利
得生成部１３０６は、多重化分離部１３０１から出力さ
れた音源利得符号Ｇで指定される適応音源ベクトル利得
と固定音源ベクトル利得とを復号し、これらを乗算器１
３０８，１３０９へそれぞれ出力する。Fixed excitation codebook 1307 generates a fixed excitation vector specified by code F output from multiplexing / demultiplexing section 1301, and outputs the generated fixed excitation vector to multiplier 1309. The quantization gain generation section 1306 decodes the adaptive excitation vector gain and the fixed excitation vector gain specified by the excitation gain code G output from the demultiplexing section 1301, and multiplies these by the multiplier 1
308 and 1309, respectively.

【００９４】乗算器１３０８は、前記適応符号ベクトル
に前記適応符号ベクトル利得を乗算して、加算器１３１
０へ出力する。乗算器１３０９は、前記固定符号ベクト
ルに前記固定符号ベクトル利得を乗算して、加算器１３
１０へ出力する。加算器１３１０は、加算器１３０８，
１３０９から出力された利得乗算後の適応音源ベクトル
と固定音源ベクトルの加算を行い、合成フィルタ１３０
３へ出力する。The multiplier 1308 multiplies the adaptive code vector by the adaptive code vector gain, and
Output to 0. A multiplier 1309 multiplies the fixed code vector by the fixed code vector gain, and
Output to 10 The adder 1310 includes an adder 1308,
The adaptive excitation vector after gain multiplication output from 1309 and the fixed excitation vector are added, and the synthesis filter 130
Output to 3.

【００９５】合成フィルタ１３０３は、加算器１３１０
から出力された音源ベクトルを駆動信号として、ＬＰＣ
復号化部１３０２によって復号されたフィルタ係数を用
いて、フィルタ合成を行い、合成した信号を後処理部１
３０４へ出力する。The synthesis filter 1303 includes an adder 1310
The sound source vector output from the
Filter synthesis is performed using the filter coefficients decoded by the decoding unit 1302, and the synthesized signal is processed by the post-processing unit 1.
Output to 304.

【００９６】後処理部１３０４は、ホルマント強調やピ
ッチ強調といったような音声の主観的な品質を改善する
処理や、定常雑音の主観的品質を改善する処理などを施
した上で、最終的な復号音声信号として出力する。The post-processing unit 1304 performs processing for improving the subjective quality of speech, such as formant emphasis and pitch emphasis, and processing for improving the subjective quality of stationary noise. Output as audio signal.

【００９７】また、上記音声符号化・復号化装置は、デ
ィジタル無線通信システムにおける基地局装置や移動局
のような通信端末装置に適用することができる。これに
より、ディジタル無線通信システムにおいて、低ビット
レートであっても高性能化を図ることが可能である。[0097] Further, the speech encoding / decoding device can be applied to a communication terminal device such as a base station device or a mobile station in a digital radio communication system. As a result, in a digital wireless communication system, high performance can be achieved even at a low bit rate.

【００９８】本発明は上記実施の形態に限定されず、種
々変更して実施することが可能である。例えば、上記実
施の形態に係る音源ベクトルの生成は、音声符号化装置
／音声復号化装置として説明しているが、これらの音源
ベクトルの生成をソフトウェアとして構成しても良い。
例えば、上記音源ベクトルの生成のプログラムをＲＯＭ
に格納し、そのプログラムにしたがってＣＰＵの指示に
より動作させるように構成しても良い。また、音源ベク
トル生成プログラムをコンピュータで読み取り可能な記
憶媒体に格納し、この記憶媒体の音源ベクトル生成プロ
グラムをコンピュータのＲＡＭに記録して、音源ベクト
ル生成プログラムにしたがって動作させるようにしても
良い。このような場合においても、上記実施の形態と同
様の作用、効果を呈する。The present invention is not limited to the above embodiment, but can be implemented with various modifications. For example, although the generation of the excitation vector according to the above embodiment has been described as a speech encoding device / speech decoding device, the generation of these excitation vectors may be configured as software.
For example, a program for generating the sound source vector is stored in a ROM
, And may be configured to operate according to an instruction of the CPU according to the program. Alternatively, the sound source vector generation program may be stored in a computer-readable storage medium, and the sound source vector generation program in the storage medium may be recorded in the RAM of the computer and operated according to the sound source vector generation program. In such a case, the same operation and effect as those of the above-described embodiment are exhibited.

【００９９】[0099]

【発明の効果】以上説明したように、本発明によれば、
少ないビット数で良好な符号化性能が得られる固定音源
符号帳を提供することができる。これにより、音源パル
ス数の本数を確保しつつ低ビットレートに対応すること
ができる。As described above, according to the present invention,
It is possible to provide a fixed excitation codebook in which good coding performance can be obtained with a small number of bits. This makes it possible to cope with a low bit rate while securing the number of sound source pulses.

【図面の簡単な説明】[Brief description of the drawings]

【図１】本発明の実施の形態に係る音声符号化／復号化
装置を備えた送信装置及び受信装置を示すブロック図FIG. 1 is a block diagram showing a transmitting device and a receiving device provided with a speech encoding / decoding device according to an embodiment of the present invention.

【図２】本発明の実施の形態に係る音声符号化装置の構
成を示すブロック図FIG. 2 is a block diagram showing a configuration of a speech coding apparatus according to an embodiment of the present invention.

【図３】本発明の実施の形態に係る固定音源符号帳を示
すブロック図FIG. 3 is a block diagram showing a fixed excitation codebook according to the embodiment of the present invention.

【図４】本発明の実施の形態に係る固定音源符号帳を示
すブロック図FIG. 4 is a block diagram showing a fixed excitation codebook according to the embodiment of the present invention.

【図５】本発明の実施の形態に係る音声符号化装置にお
けるパラメータ決定部を示すブロック図FIG. 5 is a block diagram showing a parameter determining unit in the speech coding apparatus according to the embodiment of the present invention.

【図６】図５に示す音源パラメータ決定部の固定音源ベ
クトル選択部の構成を示すブロック図FIG. 6 is a block diagram showing a configuration of a fixed sound source vector selecting unit of the sound source parameter determining unit shown in FIG.

【図７】本発明の実施の形態に係る固定音源符号帳の第
１の音源符号帳を示す図FIG. 7 is a diagram showing a first excitation codebook of the fixed excitation codebook according to the embodiment of the present invention.

【図８】本発明の実施の形態に係る固定音源符号帳の第
２の音源符号帳を示す図FIG. 8 is a diagram showing a second excitation codebook of the fixed excitation codebook according to the embodiment of the present invention.

【図９】本発明の実施の形態に係る固定音源符号帳の第
２の音源符号帳を示す図FIG. 9 is a diagram showing a second excitation codebook of the fixed excitation codebook according to the embodiment of the present invention.

【図１０】図５に示す音源パラメータ決定部における固
定音源ベクトル選択部の重みつき選択部の選択基準を説
明する図FIG. 10 is a view for explaining selection criteria of a weighted selection unit of a fixed excitation vector selection unit in the excitation parameter determination unit shown in FIG. 5;

【図１１】本発明の実施の形態に係る固定音源符号帳の
探索処理手順を示すフロー図FIG. 11 is a flowchart showing a fixed excitation codebook search processing procedure according to an embodiment of the present invention.

【図１２】図１０における重みつき選択部での重みつき
選択処理手順を示すフロー図FIG. 12 is a flowchart showing a weighted selection processing procedure in a weighted selection unit in FIG. 10;

【図１３】本発明の実施の形態に係る音声復号化装置の
構成を示すブロック図FIG. 13 is a block diagram showing a configuration of a speech decoding device according to an embodiment of the present invention.

【図１４】従来の代数的固定符号帳を示す図FIG. 14 shows a conventional algebraic fixed codebook.

[Explanation of symbols]

２００前処理部２０１ＬＰＣ分析部２０２ＬＰＣ量子化部２０３合成フィルタ２０５適応音源符号帳２０６量子化利得生成部２０７固定音源符号帳２１１聴覚重み付け部２１２パラメータ決定部２１３多重化部３０１，４０１第１の音源符号帳３０２，４０２第２の音源符号帳４０３第３の音源符号帳５０１適応音源ベクトル選択部５０２固定音源ベクトル選択部５０３音源利得量子化部６０１第１の固定音源ベクトル選択部６０２第２の固定音源ベクトル選択部６０３第３の固定音源ベクトル選択部６０４選択部６０５重み付き選択部 Reference Signs List 200 Preprocessing unit 201 LPC analysis unit 202 LPC quantization unit 203 Synthesis filter 205 Adaptive excitation codebook 206 Quantization gain generation unit 207 Fixed excitation codebook 211 Auditory weighting unit 212 Parameter determination unit 213 Multiplexing unit 301, 401 First Excitation codebooks 302, 402 Second excitation codebook 403 Third excitation codebook 501 Adaptive excitation vector selection unit 502 Fixed excitation vector selection unit 503 Excitation gain quantization unit 601 First fixed excitation vector selection unit 602 Second Fixed sound source vector selection unit 603 Third fixed sound source vector selection unit 604 Selection unit 605 Weighted selection unit

───────────────────────────────────────────────────── フロントページの続き (72)発明者安永和敏神奈川県川崎市多摩区東三田３丁目10番１号松下技研株式会社内 (72)発明者間野一則東京都千代田区大手町二丁目３番１号日本電信電話株式会社内 (72)発明者日和▲崎▼ 祐介東京都千代田区大手町二丁目３番１号日本電信電話株式会社内Ｆターム(参考） 5D045 CA01 DA11 5J064 AA02 BA13 BB12 BC09 BC11 BC16 BC26 BD02 ──────────────────────────────────────────────────続き Continuing on the front page (72) Kazutoshi Yasunaga 3-10-1, Higashi-Mita, Tama-ku, Kawasaki City, Kanagawa Prefecture Inside Matsushita Giken Co., Ltd. No. 1 Nippon Telegraph and Telephone Co., Ltd. (72) Inventor Yuwa Hitozaki Saki 2-1-3 Otemachi, Chiyoda-ku, Tokyo F-term within Nippon Telegraph and Telephone Co., Ltd. 5D045 CA01 DA11 5J064 AA02 BA13 BB12 BC09 BC11 BC16 BC26 BD02

Claims

[Claims]

1. A first excitation codebook having a relatively high temporal resolution that partially represents an excitation vector and a second excitation codebook that has a relatively low temporal resolution that expresses the entire excitation vector. A fixed excitation codebook, characterized in that:

2. A fixed excitation codebook for use in an excitation vector generation apparatus for generating an excitation vector composed of several pulses, wherein each pulse is finely arranged within a limited range with respect to the entire excitation vector. A first excitation codebook storing an excitation vector and a second excitation codebook storing an excitation vector obtained by roughly arranging each pulse over a wide range of the entire excitation vector are provided. Fixed sound source codebook.

3. An excitation codebook in which one pulse is arranged on one of two adjacent samples, wherein different polarities are assigned to both samples. Codebook.

4. The excitation codebook according to claim 2, wherein the excitation codebook arranges a plurality of pulses algebraically, and generates only a combination in which at least two pulses are arranged close to each other. 4. An excitation codebook comprising:
A fixed excitation codebook comprising the excitation codebook according to claim 2 as the second excitation codebook according to claim 2.

5. A fixed excitation codebook according to claim 4 as first and second excitation codebooks, and a third excitation codebook for generating an excitation vector like white noise. Fixed sound source codebook.

6. A fixed excitation codebook used for an excitation vector generation apparatus having a pulse excitation codebook and a noise excitation codebook, wherein the generated excitation vector is a pulse excitation codebook or a noise excitation codebook. When encoding the input signal using the generated excitation vector, which is generated from either one, the excitation vector generated from the noise excitation codebook is more easily selected as the coding distortion is larger. A fixed excitation codebook characterized by being weighted.

7. The apparatus according to claim 5, wherein the first and second excitation codebooks are provided as pulse excitation codebooks, and the third excitation codebook is provided as noise excitation codebooks. The fixed excitation codebook according to claim 6, wherein

8. A first sound source generating step of generating a pulse sound source in which the positions where each pulse can be taken are finely set, and at least two pulses are limited to approach each other, and the possible positions of each pulse. A second sound source generating step of generating a pulse sound source whose position is set roughly and which does not impose any restriction on the combination of each pulse; a third sound source generating step of generating a sound source composed of a random noise signal; A weighting step of performing weighting so that the excitation vector generated in the third excitation generation step is more easily selected as the coding distortion increases.

9. A recording medium which stores a sound source generating program and is readable by a computer, wherein the sound source generating program has finely set positions where each pulse can be taken, and at least two pulses approach each other. A first sound source generation procedure for generating a pulse sound source restricted as described above, and a second sound source generation step for generating a pulse sound source in which the positions where each pulse can be taken are roughly set and the combination of each pulse is not limited at all. , A third excitation generation procedure for generating an excitation composed of random noise signals, and weighting such that the excitation vector generated in the third excitation generation procedure is more easily selected as the coding distortion increases. And a weighting procedure for performing the following.

10. A fixed excitation codebook according to claim 1, an adaptive excitation codebook representing a periodic component of an input speech signal, and a parameter representing a spectrum characteristic of the input speech signal. Means for encoding, means for synthesizing a synthesized speech signal using the excitation vector and the parameter generated from the fixed excitation codebook and the adaptive excitation codebook, an input audio signal and the synthesized audio signal, And a means for determining an output from the fixed codebook and the adaptive codebook such that distortion of the speech is reduced.

11. A fixed excitation codebook according to any one of claims 1 to 7, an adaptive excitation codebook representing a periodic component of a synthesized speech signal, and a spectrum characteristic encoded by the speech encoding device. Means for decoding the parameters to be represented, and the excitation vector determined by the speech encoding apparatus is decoded from the fixed excitation codebook and the adaptive excitation codebook, and a synthesized speech signal is synthesized from the decoded excitation vector and the parameters. A speech decoding device.

12. An audio signal transmission device comprising the audio encoding device according to claim 10.

13. An audio signal receiving apparatus comprising the audio decoding apparatus according to claim 11.

14. A mobile station device comprising at least one of the voice signal transmitting device according to claim 12 and the voice signal receiving device according to claim 13, and performing wireless communication with a base station device.

15. A base station device comprising at least one of the voice signal transmitting device according to claim 12 and the voice signal receiving device according to claim 13, and performing wireless communication with a mobile station device.