JP3242353B2

JP3242353B2 - Audio signal quantization apparatus, audio signal inverse quantization apparatus, audio signal quantization method, and audio signal inverse quantization method

Info

Publication number: JP3242353B2
Application number: JP21221397A
Authority: JP
Inventors: 峰生津島; 武志則松; 良久中藤; 智一石川
Original assignee: Panasonic Corp; Matsushita Electric Industrial Co Ltd
Current assignee: Panasonic Corp; Panasonic Holdings Corp
Priority date: 1997-04-10
Filing date: 1997-08-06
Publication date: 2001-12-25
Anticipated expiration: 2017-08-06
Also published as: JPH10340100A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は音声信号や音楽信
号などのオーディオ信号を入力し、周波数変換したデー
タ（オーディオ特徴ベクトル）をサブデータ（サブベク
トル）の単位で符号化するオーディオ信号量子化装置及
びオーディオ信号量子化方法、並びにオーディオ信号量
子化装置から出力されるコードインデックスを入力し、
前記オーディオ信号を復号するオーディオ信号逆量子化
装置及びオーディオ信号逆量子化方法に関するものであ
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an audio signal such as an audio signal or a music signal ,
Data (audio feature vector) as sub data (sub vector
Audio signal quantizer for encoding in units of
And audio signal quantization method and audio signal amount
Input the code index output from the child device,
Audio signal inverse quantization for decoding the audio signal
The present invention relates to an apparatus and an audio signal inverse quantization method .

【０００２】[0002]

【従来の技術】音響信号を効率的に符号化する様々な手
法が提案されている。音楽信号など２０ｋＨｚほどの帯
域を有する信号は、特に近年、ＭＰＥＧオーディオ方式
などを用いて符号化されることがあげられる。ＭＰＥＧ
方式に代表される方式は、時間軸のディジタルオーディ
オ信号を直交変換を用いて周波数軸に変換し、その周波
数軸上の情報を、人間の聴覚的な感度特性を考慮して、
聴覚的に重要な情報から優先的に情報量を与えていく方
式である。原ディジタル信号の情報量に対して、かなり
少ない情報量で信号を表現しようとする場合、ＴＣＷＶ
Ｑ（Transform Coding for Weighted Vector Quantizat
ion ）などのベクトル量子化の手法を用いた符号化方式
がある。ＭＰＥＧオーディオ、およびＴＣＷＶＱはそれ
ぞれISO/IEC 標準IS-11172-3およびT.Moriya,H.Suga:"A
n 8 Kbits transform coder for noisy channels,"Pro
c.ICASSP'89,pp196-199などに述べられている。2. Description of the Related Art Various techniques have been proposed for efficiently encoding audio signals. A signal having a band of about 20 kHz, such as a music signal, has recently been coded using an MPEG audio system in recent years. MPEG
The method represented by the method converts the digital audio signal on the time axis to the frequency axis using orthogonal transform, and considers information on the frequency axis in consideration of human auditory sensitivity characteristics,
This is a method in which the amount of information is given priority from the auditoryly important information. To express a signal with a considerably smaller amount of information than the information amount of the original digital signal, TCWV
Q (Transform Coding for Weighted Vector Quantizat
ion), there is an encoding method using a vector quantization technique. MPEG audio and TCWVQ are based on ISO / IEC standards IS-11172-3 and T.Moriya, H.Suga: "A
n 8 Kbits transform coder for noisy channels, "Pro
c. Described in ICASSP'89, pp196-199.

【０００３】以下、図面を参照しながら従来のオーディ
オ信号量子化装置について説明を行う。図７において、
１０１はコードブック、１０２はオーディオコード選択
部である。このオーディオコード選択部１０２は、オー
ディオ特徴ベクトルを入力として、上記コードブック１
０１が持つ各コードと前記オーディオ特徴ベクトルとの
聴覚的な距離を順次算出し、その距離が最小となるコー
ドを選出し、該選出されたコードに対応するコードイン
デックスを出力するものである。そして受信側では、送
信されてきたコードインデックスを、コード復号器で復
号し、コードブック１０１と同一内容の受信機側コード
ブックを用いてオーディオ特徴ベクトルに変換する。Hereinafter, a conventional audio signal quantization apparatus will be described with reference to the drawings. In FIG.
101 is a code book, and 102 is an audio code selection unit. The audio code selection unit 102 receives the audio feature vector as input, and
Audible distances between each code of 01 and the audio feature vector are sequentially calculated, a code having the minimum distance is selected, and a code index corresponding to the selected code is output. Then, on the receiving side, the transmitted code index is decoded by a code decoder, and is converted into an audio feature vector using a receiver-side codebook having the same contents as the codebook 101.

【０００４】[0004]

【発明が解決しようとする課題】従来のオーディオ信号
量子化装置は以上のように構成されており、コードブッ
クが持つ各コードとオーディオ特徴ベクトルとの聴覚的
な距離が最小となるコードに対応したコードインデック
スを出力することにより量子化を行うものであるが、コ
ードブックが有するコードが多い場合、最適なコードを
検索する際に計算量が非常に多くなり、また、コードブ
ックのもつデータ量が多い場合は、ハードウエアで構成
する際に多くの量のメモリが必要となり不経済であると
いう問題点があった。また、受信側においても、コード
インデックスに対応するだけの検索、及びメモリ量を必
要とするという問題点があった。The conventional audio signal quantizing apparatus is configured as described above, and is adapted to a code which minimizes the auditory distance between each code of the codebook and the audio feature vector. Quantization is performed by outputting a code index.However, when there are many codes in a codebook, the amount of calculation becomes extremely large when searching for an optimum code, and the amount of data in the codebook is small. In the case of a large number, there is a problem that a large amount of memory is required when configuring with hardware, which is uneconomical. Further, the receiving side also has a problem that a search corresponding to the code index and a memory amount are required.

【０００５】この発明は上記のような問題点に鑑みてな
されたもので、コード検索数を削減し、コード数の少な
いコードブックで効率的にオーディオ信号を量子化する
ことのできるオーディオ信号量子化装置、及び逆量子化
することのできるオーディオ信号逆量子化装置を提供す
ることを目的とする。SUMMARY OF THE INVENTION The present invention has been made in view of the above-described problems, and has been made in consideration of the above-described problems. It is an object of the present invention to provide a device and an audio signal inverse quantization device capable of inverse quantization.

【０００６】[0006]

【課題を解決するための手段】この発明の請求項１に係
るオーディオ信号量子化装置は、オーディオ信号を入力
し周波数変換したデータ（オーディオ特徴ベクトル）を
サブデータ（サブベクトル）の単位で符号化するオーデ
ィオ信号量子化装置であって、前記周波数変換されたサ
ブデータのうちの所定の周波数帯域に属する要素データ
の位相情報を抽出する位相情報抽出部と、前記サブデー
タを代表値で表す複数個のオーディオコードであって、
前記所定の周波数帯域に属する要素データに対応する値
が位相成分を除いた絶対値であるオーディオコードを格
納するコードブックを有し、前記サブデータに対応する
前記オーディオコードを前記コードブックの中から検索
するとともに、検索したオーディオコードを識別する情
報と前記サブデータの中で位相成分が除かれた要素デー
タに対応する位相情報とを含むコードインデックスを出
力するオーディオコード選択部とを備えたものである。An audio signal quantization apparatus according to a first aspect of the present invention receives an audio signal.
Frequency-converted data (audio feature vector)
Audio to be encoded in units of sub data (sub vector)
A video signal quantizer, wherein the frequency-converted
Element data belonging to the specified frequency band of the
A phase information extracting unit for extracting phase information of
A plurality of audio codes representing data by a representative value,
A value corresponding to the element data belonging to the predetermined frequency band
Is the absolute value of the audio code excluding the phase component.
Has a codebook to store, corresponding to the sub data
Search for the audio code in the codebook
Information to identify the searched audio code.
Data and the element data from which the phase component has been
Output a code index containing the phase information corresponding to the
And an audio code selection unit for inputting the audio code.

【０００７】また、この発明の請求項２に係るオーディ
オ信号量子化装置は、請求項１記載のオーディオ信号量
子化装置において、前記位相情報抽出部が、前記周波数
変換されたサブデータに含まれる複数の要素データのう
ち、低周波帯域側に属する要素データの位相情報を抽出
するものである。[0007] The audio signal quantization apparatus according to claim 2 of the present invention, in the audio signal quantization apparatus of 請 Motomeko 1, wherein said phase information extraction unit, wherein the frequency
Data of multiple element data included in the converted sub data
Extract the phase information of the element data belonging to the low frequency band
Is what you do.

【０００８】また、この発明の請求項３に係るオーディ
オ信号量子化装置は、請求項１記載のオーディオ信号量
子化装置において、さらに、人間の聴覚心理モデルに基
づき各周波数における聴覚感度を表す聴覚感度テーブル
を備え、前記位相情報抽出部が、前記聴覚感度テーブル
に基づき、前記周波数変換されたサブデータに含まれる
複数の要素データのうち、高い聴覚感度にある周波数に
属する要素データの位相情報を抽出するものである。[0008] The audio signal quantization apparatus according to claim 3 of the present invention, in the audio signal quantization apparatus of 請 Motomeko 1, further groups to the human psychoacoustic model
Sensitivity table showing the sensitivity at each frequency
Wherein the phase information extraction unit comprises:
Is included in the frequency-converted sub-data based on
Of the multiple element data, select a frequency with high hearing sensitivity
This is to extract the phase information of the element data to which it belongs .

【０００９】また、この発明の請求項４に係るオーディ
オ信号量子化装置は、請求項１から請求項３のいずれか
に記載のオーディオ信号量子化装置において、さらに、
前記周波数変換されたデータに含まれる複数のサブデー
タに対し、周波数毎のレンジを縮小させ、それぞれの前
記サブデータを平滑化する平滑化部を備え、前記オーデ
ィオコード選択部が、平滑化されたサブデータの入力を
受け、対応するオーディオコードを検索するに際し、前
記平滑部からの情報に基づき、平滑化されたサブデータ
のスケーリングを調整したうえで、対応するオーディオ
コードを検索するものである。[0009] The audio signal quantization apparatus according to claim 4 of the present invention, any one of 請 Motomeko 1 of claim 3
In the audio signal quantization device according to the above, further,
A plurality of sub-data included in the frequency-converted data
The range for each frequency,
And a smoothing unit for smoothing the sub data.
The code selector selects the input of the smoothed sub data.
Before searching for the corresponding audio code
Sub data smoothed based on information from the smoothing unit
After adjusting the scaling of the
Search code .

【００１０】また、この発明の請求項５に係るオーディ
オ信号量子化装置は、請求項４記載のオーディオ信号量
子化装置において、さらに、前記オーディオコード選択
部に入力する複数の前記サブデータを並び替えるソート
部を備え、前記ソート部が、前記平滑部からの情報と前
記聴覚感度テーブルを参照結果とに基づき、オーディオ
データの物理的な特性と人間の聴覚特性との両者を加味
した重要度の高い順に複数の前記サブデータを並び替え
るものである。[0010] The audio signal quantization apparatus according to a fifth aspect of the invention, in the audio signal quantization apparatus of 請 Motomeko 4, further, the audio code selection
Sorting to sort a plurality of the sub data input to the section
Unit, wherein the sorting unit is configured to store information from the smoothing unit
Based on the hearing sensitivity table and the reference result, the audio
Taking into account both the physical characteristics of the data and the human auditory characteristics
Sort multiple sub-data in descending order of importance
It is those that.

【００１１】また、この発明の請求項６に係るオーディ
オ信号量子化装置は、請求項１から請求項５のいずれか
に記載のオーディオ信号量子化装置において、前記周波
数変換されたデータが、入力されたオーディオ信号をＭ
ＤＣＴ変換（変形離散コサイン変換）した係数を要素と
するものである。Further, the audio signal quantization apparatus according to claim 6 of the present invention, in the audio signal quantization apparatus according to claim 5 請 Motomeko 1, wherein the frequency
The number-converted data converts the input audio signal to M
DCT (Modified Discrete Cosine Transform) Coefficient
Is what you do.

【００１２】また、この発明の請求項７に係るオーディ
オ信号逆量子化装置は、オーディオ信号を入力し周波数
変換したデータ（オーディオ特徴ベクトル）をサブデー
タ（サブベクトル）の単位で符号化するオーディオ信号
量子化装置から出力されるコードインデックスを入力
し、前記オーディオ信号を復号するオーディオ信号逆量
子化装置であって、前記コードインデックスは、前記サ
ブデータを代表値で表すオーディオコードを識別する情
報と位相情報とを含む情報であって、前記オーディオコ
ードは所定の周波数帯域に属する要素データに対応する
値が位相成分を除いた絶対値であり、前記位相情報は対
応するオーディオコードの位相成分が除かれた要素デー
タの位相成分であり、前記オーディオ信号逆量子化装置
は、前記コードインデックスに含まれる位相情報を抽出
する位相情報抽出部と、前記サブデータを代表値で表す
複数個のオーディオコードであって、前記所定の周波数
帯域に属する要素データに対応する値が位相成分を除い
た絶対値であるオーディオコードを格納するコードブッ
クを有し、前記コードインデックスに含まれるオーディ
オコードを識別する情報に基づき、前記コードインデッ
クスに対応する前記オーディオコードを前記コードブッ
クの中から検索するとともに、前記コードインデックス
に含まれる位相情報を、検索したオーディオコードの中
で位相成分が除かれた要素データの位相成分として付加
し、これを出力信号として出力するオーディオコード選
択部とを備えたものである。[0012] The audio signal dequantization apparatus according to the seventh aspect of the invention, the frequency inputs an audio signal
Convert the converted data (audio feature vector)
Audio signal to be encoded in data (sub-vector) units
Input the code index output from the quantizer
And an audio signal inverse amount for decoding the audio signal.
A code generator, wherein the code index is
Information that identifies the audio code that
Information including the audio information and the phase information.
Mode corresponds to the element data belonging to the predetermined frequency band.
The value is an absolute value excluding the phase component, and the phase information is
Element data from which the phase component of the corresponding audio code has been removed
The audio signal inverse quantization device
Extracts the phase information included in the code index
Phase information extracting unit, and the sub-data is represented by a representative value.
A plurality of audio codes, wherein the predetermined frequency is
The value corresponding to the element data belonging to the band excludes the phase component
Code book that stores audio codes that are absolute values
Audio that is included in the code index
Code index based on the information identifying the
The audio code corresponding to the audio
And search the code index.
Phase information included in the searched audio code.
Added as phase component of element data with phase component removed
Audio code selection to output this as an output signal
And a selector .

【００１３】また、この発明の請求項８に係るオーディ
オ信号量子化方法は、オーディオ信号を入力し周波数変
換したデータ（オーディオ特徴ベクトル）をサブデータ
（サブベクトル）の単位で符号化するオーディオ信号量
子化装置のオーディオ信号量子化方法であって、前記オ
ーディオ信号量子化装置は、前記サブデータを代表値で
表す複数個のオーディオコードであって、所定の周波数
帯域に属する要素データに対応する値が位相成分を除い
た絶対値であるオーディオコードを格納するコードブッ
クを備え、前記オーディオ信号量子化方法は、前記周波
数変換されたサブデータのうちの所定の周波数帯域に属
する要素データの位相情報を抽出するステップと、前記
サブデータに対応する前記オーディオコードを前記コー
ドブックの中から検索するステップと、検索したオーデ
ィオコードを識別する情報と前記サブデータの中で位相
成分が除かれた要素データに対応する位相情報とを含む
コードインデックスを出力するステップとを含むもので
ある。Also, in the audio signal quantizing method according to claim 8 of the present invention , an audio signal is inputted and frequency conversion is performed.
Converted data (audio feature vector) to sub data
Audio signal amount to be encoded in (sub-vector) units
A method of quantizing an audio signal of a decoding device, comprising:
The audio signal quantizer converts the sub data into a representative value.
A plurality of audio codes representing a predetermined frequency
The value corresponding to the element data belonging to the band excludes the phase component
Code book that stores audio codes that are absolute values
Wherein the audio signal quantization method comprises:
Belonging to a predetermined frequency band of the converted sub data
Extracting phase information of the element data to be performed;
The audio code corresponding to the sub data is
Steps to search in the book
Information identifying the radio code and the phase in the sub-data.
Including phase information corresponding to the element data from which the component has been removed
Outputting the code index .

【００１４】また、この発明の請求項９に係るオーディ
オ信号逆量子化方法は、オーディオ信号を入力し周波数
変換したデータ（オーディオ特徴ベクトル）をサブデー
タ（サブベクトル）の単位で符号化するオーディオ信号
量子化装置から出力されるコードインデックスを入力
し、前記オーディオ信号を復号するオーディオ信号逆量
子化装置のオーディオ信号逆量子化方法であって、前記
コードインデックスは、前記サブデータを代表値で表す
オーディオコードを識別する情報と位相情報とを含む情
報であって、前記オーディオコードは所定の周波数帯域
に属する要素データに対応する値が位相成分を除いた絶
対値であり、前記位相情報は対応するオーディオコード
の位相成分が除かれた要素データの位相成分であり、前
記オーディオ信号逆量子化装置は、前記サブデータを代
表値で表す複数個のオーディオコードであって、前記所
定の周波数帯域に属する要素データに対応する値が位相
成分を除いた絶対値であるオーディオコードを格納する
コードブックを備え、前記オーディオ信号逆量子化方法
は、前記コードインデックスに含まれる位相情報を抽出
するステップと、前記コードインデックスに含まれるオ
ーディオコードを識別する情報に基づき、前記コードイ
ンデックスに対応する前記オーディオコードを前記コー
ドブックの中から検索するステップと、前記コードイン
デックスに含まれる位相情報を、検索したオーディオコ
ードの中で位相成分が除かれた要素データの位相成分と
して付加し、これを出力信号として出力するステップと
を含むものである。According to a ninth aspect of the present invention, in the audio signal dequantizing method , an audio signal is input and
Convert the converted data (audio feature vector)
Audio signal to be encoded in data (sub-vector) units
Input the code index output from the quantizer
And an audio signal inverse amount for decoding the audio signal.
An audio signal inverse quantization method for a child device,
The code index represents the sub data by a representative value
Information including audio code identification information and phase information
The audio code is in a predetermined frequency band.
The value corresponding to the element data belonging to
The phase information is a corresponding audio code.
Is the phase component of the element data with the phase component of
The audio signal dequantizer substitutes the sub data for
A plurality of audio codes represented by table values,
The value corresponding to the element data belonging to a certain frequency band is the phase
Store audio code that is absolute value excluding components
An audio signal inverse quantization method comprising a codebook
Extracts the phase information included in the code index
And the steps included in the code index.
Audio code based on the information identifying the audio code.
The audio code corresponding to the index
Searching in the book,
The phase information included in the dex is
The phase component of the element data from which the phase component has been
And outputting this as an output signal.
Is included .

【００１５】[0015]

BEST MODE FOR CARRYING OUT THE INVENTION

実施の形態１．以下、本発明の実施の形態１によるオー
ディオ信号量子化装置について、図面を参照しながら説
明する。図１は本発明のオーディオ信号量子化装置の概
略構成図であり、図２は実施の形態１におけるオーディ
オ信号量子化装置を構成する符号化部の詳細な構成を示
す図であり、これらの図において、１は符号化部、２は
復号化部、３はオーディオ信号の特徴量の代表値である
オーディオコードを複数有する送信側コードブック、１
０２はオーディオコード選択部、１０７は位相情報抽出
部である。また、５はコード復号部、６は受信側コード
ブックである。Embodiment 1 FIG. Hereinafter, an audio signal quantization device according to Embodiment 1 of the present invention will be described with reference to the drawings. FIG. 1 is a schematic configuration diagram of an audio signal quantization device according to the present invention, and FIG. 2 is a diagram illustrating a detailed configuration of an encoding unit included in the audio signal quantization device according to the first embodiment. In the figure, 1 is an encoding unit, 2 is a decoding unit, 3 is a transmission-side codebook having a plurality of audio codes, which are representative values of characteristic amounts of audio signals, and 1
02 is an audio code selection unit, and 107 is a phase information extraction unit. Reference numeral 5 denotes a code decoding unit, and reference numeral 6 denotes a reception-side codebook.

【００１６】以下、動作について説明する。時間軸上の
ディジタルオーディオ信号を量子化する際、時間軸上の
データを時間周波数変換と呼ばれる直交変換を用いて周
波数軸上のデータに変換する。直交変換としてはＤＦＦ
Ｔ（離散フーリエ変換）や、ＭＤＣＴ（変形離散コサイ
ン変換）などがあげられる。The operation will be described below. When quantizing a digital audio signal on the time axis, data on the time axis is converted into data on the frequency axis by using orthogonal transform called time-frequency conversion. DFF as orthogonal transformation
T (Discrete Fourier Transform), MDCT (Modified Discrete Cosine Transform) and the like.

【００１７】図３に示すように、周波数軸上のデータを
１つの音源ベクトルとみなした場合、その音源ベクトル
から、いくつかの要素を抜き出したサブベクトルを形成
し、これを図１の入力ベクトルとした場合、オーディオ
コード選択部１０２は、送信側コードブック３中の各コ
ードと前記入力ベクトルとの距離を算出して、その距離
が最小となるコードを選択して、その選択されたコード
の送信側コードブック３におけるコードインデックスを
復号化部２への入力とする。そして受信部である復号化
部２では、前記出力されたコードインデックスをコード
復号部５の入力として、受信側コードブック６中のコー
ドインデックスに対するコードをコード復号部５に出力
して、コード復号部５は出力ベクトルとして出力する。As shown in FIG. 3, when the data on the frequency axis is regarded as one sound source vector, sub-vectors are obtained by extracting some elements from the sound source vector. In this case, the audio code selection unit 102 calculates the distance between each code in the transmission-side codebook 3 and the input vector, selects the code having the minimum distance, and selects the code of the selected code. The code index in the transmission side codebook 3 is input to the decoding unit 2. The decoding unit 2 serving as a receiving unit receives the output code index as an input to the code decoding unit 5, outputs a code corresponding to the code index in the reception side codebook 6 to the code decoding unit 5, and outputs the code to the code decoding unit 5. 5 is output as an output vector.

【００１８】上記動作において、入力ベクトルを符号化
部１に入力して、符号化部１がコードインデックスを出
力するまでのプロセスが符号化であり、このコードイン
デックスを入力として出力ベクトルを出力するまでのプ
ロセスが復号化である。多くの場合、送信側コードブッ
ク３と受信側コードブック６とは同一内容であることが
多いが、本実施の形態１では、受信側コードブック６は
従来と同様に、送信されてくるコードインデックスと同
等の数のコードを有するものとして説明を行う。上記符
号化部１の詳細な動作を図２及び図４を用いて以下に説
明する。ここでは、コード化を、２０ＫＨｚを対象とす
るために１０ｂｉｔで行う場合を想定する。また、位相
情報抽出部１０７では、抽出する位相を周波数の低い方
から２番目までの要素に対して、すなわち２ビット分と
する。上記オーディオコード選択部１０２の入力は、オ
ーディオ特徴ベクトルであり、例えば、オーディオ信号
をＭＤＣＴ変換（Modified Discrete Cosine Transfor
m) して得た係数を１つのベクトルとした時に、そのベ
クトルを分割して幾つかの要素、例えば、２０程度の要
素を１つのサブベクトルとしたものである。その際、ベ
クトルはＸ₀〜Ｘ₁₉とし、Ｘの添え字の数字が小さいサ
ブベクトルの要素ほど、低い周波数成分を有するＭＤＣ
Ｔ係数に対応するものとする。ここで低い周波数成分は
人にとって聴覚的に重要な情報であり、従って、これら
の要素についてのコード化を優先的に行うことで、再生
時に人間にとっては音質の劣化を感じさせにくい。In the above operation, the process of inputting an input vector to the encoding unit 1 and outputting the code index by the encoding unit 1 is encoding. Until an output vector is output using the code index as an input. Is the decryption process. In many cases, the transmission-side codebook 3 and the reception-side codebook 6 often have the same contents. However, in the first embodiment, the reception-side codebook 6 stores the transmitted code index as in the related art. The description will be made assuming that the code has the same number of codes. The detailed operation of the encoding unit 1 will be described below with reference to FIGS. Here, it is assumed that coding is performed in 10 bits in order to target 20 KHz. Further, the phase information extraction unit 107 extracts the phase to be extracted from the lowest frequency element to the second element, that is, 2 bits. The input of the audio code selection unit 102 is an audio feature vector. For example, an audio signal is subjected to MDCT (Modified Discrete Cosine Transform).
m), when the coefficient obtained as a single vector is obtained, the vector is divided, and several elements, for example, about 20 elements, are converted into one subvector. At this time, the vectors are X _{0 to} X _19, and the sub-vector element having a smaller subscript of X has a lower MDC having a lower frequency component.
Let it correspond to the T coefficient. Here, the low frequency component is information that is perceptually important to humans, and therefore, by giving priority to coding of these elements, it is difficult for humans to feel deterioration in sound quality during reproduction.

【００１９】オーディオコード選択部１０２では、前記
オーディオ特徴ベクトルと、送信側コードブック３の各
々のコードとの距離を算出する。例えば、コードインデ
ックスをｉとした場合に、数１式でコードインデックス
ｉのコードにおける距離Diが算出できる。The audio code selection unit 102 calculates the distance between the audio feature vector and each code in the transmission side code book 3. For example, when the code index is i, the distance Di in the code of the code index i can be calculated by Expression 1.

【００２０】[0020]

【数１】 (Equation 1)

【００２１】ここで上記数１式において、Ｎは送信側コ
ードブック３中の全コードの個数であり、Cij はコード
インデックスI におけるｊ番目の要素の値である。M は
本実施の形態１の場合、１９以下の数、例えば、１など
である。P は距離計算におけるノルムであり、例えば、
２などである。また、abs( ) は絶対値演算を意味す
る。In the above equation (1), N is the number of all codes in the transmission side code book 3, and Cij is the value of the j-th element in the code index I. M is 19 or less, for example, 1 in the first embodiment. P is the norm in the distance calculation, for example,
2 and so on. Abs () means an absolute value operation.

【００２２】位相情報抽出部１０７では、最小の距離Di
を与えるコードインデックスｉとＭ個の位相情報Ph(j)
j=0 to Mを出力する。位相情報Ph(j) は数２式で示され
るように、In the phase information extracting section 107, the minimum distance Di
Index i and M pieces of phase information Ph (j)
Outputs j = 0 to M. The phase information Ph (j) is given by

【００２３】[0023]

【数２】 (Equation 2)

【００２４】オーディオ特徴ベクトルが音声信号をＭＤ
ＣＴ変換したベクトルのサブベクトルである場合などで
は、一般にXjの添え字j が小さいほど係数の聴覚的な重
要度が高いため、本構成により、各サブベクトルの低い
周波数成分の要素に対応する位相（正負）についてはコ
ード検索時にはこれらの情報については考慮されず、検
索後に別途付加されることになる。すなわち、図４
（ａ）に示されるように、サブベクトル化されたオーデ
ィオ特徴ベクトルは、低い周波数側の２ｂｉｔ分の要素
の正負符号を無視して、送信側コードブック３の有する
コードとパターン比較される。例えば、ここには低い周
波数側の２ｂｉｔ分の要素が共に正として格納された２
５６個のコードが格納されており、オーディオコード選
択部１０２では、入力されたサブベクトルと上記送信側
コードブック３の有する２５６個のコードとの検索を行
う。そして得られたコードに対して、位相情報抽出部１
０７で抽出された、図４（ｂ）に示される組み合わせの
いずれかのものが、当該サブベクトルの低い周波数側の
２ｂｉｔ分の正負符号として付加されて、合計１０ｂｉ
ｔのコードインデックスとして出力を行う。The audio feature vector converts the audio signal to the MD.
In the case where the vector is a sub-vector of a CT-transformed vector, the smaller the subscript j of Xj is, the higher the auditory importance of the coefficient is. Therefore, according to this configuration, the phase corresponding to the element of the low frequency component of each sub-vector Regarding (positive / negative), such information is not considered at the time of code search, and is added separately after the search. That is, FIG.
As shown in (a), the sub-vectorized audio feature vector is compared with the code of the transmission side code book 3 ignoring the sign of the two-bit element on the low frequency side. For example, here, the elements of 2 bits on the low frequency side are both stored as positive.
56 codes are stored, and the audio code selection unit 102 searches the input subvector and the 256 codes of the transmission side codebook 3. Then, for the obtained code, the phase information extraction unit 1
07, one of the combinations shown in FIG. 4 (b) is added as a plus / minus sign of 2 bits on the lower frequency side of the subvector, for a total of 10 bi.
Output as a code index of t.

【００２５】このようにすることで、符号化部１から復
号化部２に送るコードインデックスは、従来と同様に１
０ｂｉｔ（１０２４個）のままで、送信側コードブック
３に格納するコードは８ｂｉｔ（２５６個）とすること
ができ、位相情報との情報量の和を、数３式の距離計算
のコードインデックスの情報量と同じにした場合に、下
記の数３式で復号された合成音声と本構成の合成音声と
比べてみると、ほぼ同等の主観評価結果を得ることがで
きる。By doing so, the code index sent from the encoding unit 1 to the decoding unit 2 becomes 1 as in the prior art.
The code stored in the transmission side codebook 3 can be set to 8 bits (256) while keeping 0 bits (1024), and the sum of the amount of information with the phase information is calculated as When the amount of information is the same, a substantially equivalent subjective evaluation result can be obtained by comparing the synthesized speech decoded by the following Expression 3 with the synthesized speech of the present configuration.

【００２６】[0026]

【数３】 (Equation 3)

【００２７】ここで本構成と数３式を用いた場合の計算
量とメモリ量の関係を表１に示す。Table 1 shows the relationship between the amount of calculation and the amount of memory when this configuration and Equation 3 are used.

【００２８】[0028]

【表１】 [Table 1]

【００２９】本実施例の構成の方がコードブックは４分
の１で、計算量も従来では１０２４通りの検索処理が必
要であったのが、２５６通りの検索処理と、検索結果に
対して２符号を付加するという処理を行うだけでよく、
計算量ならびにメモリを大幅に削減できることが分か
る。In the configuration of the present embodiment, the codebook is one quarter and the amount of calculation has conventionally required 1024 types of search processing. All you have to do is add the two codes.
It can be seen that the amount of calculation and memory can be significantly reduced.

【００３０】このように本実施の形態１によれば、符号
化部１で符号化を行う際に、オーディオ特徴ベクトルを
分割して作成されたサブベクトルと送信側コードブック
３の中の各オーディオコードとの聴覚的な距離のうち最
小距離を有するオーディオコードを選択するにあたり、
聴覚的な重要度が高いサブベクトルの要素に相当する部
分については、オーディオコード選択部１０２でその位
相情報を示す正負符号を無視して取り扱って、送信側コ
ードブック３のオーディオコードとの比較検索を行い、
得られた結果に別途、位相情報抽出部１０７で抽出した
上記サブベクトルの要素部分に対応する位相情報を付加
してコードインデックスとして出力するようにしたの
で、体感的な音質の劣化を招くことなく、オーディオコ
ード選択部１０２における計算量を削減でき、また、コ
ードブック３，１０１に必要なコード数をも削減するこ
とができる。As described above, according to the first embodiment, when encoding is performed by the encoding unit 1, the sub-vector created by dividing the audio feature vector and each audio In selecting the audio code that has the minimum distance from the audible distance to the code,
For the portion corresponding to the sub-vector element having high auditory importance, the audio code selection unit 102 treats the portion ignoring the sign indicating the phase information, and compares it with the audio code in the transmission side code book 3. Do
Separately, the phase information corresponding to the element portion of the sub-vector extracted by the phase information extraction unit 107 is added to the obtained result and output as a code index, so that the perceived sound quality does not deteriorate. The amount of calculation in the audio code selection unit 102 can be reduced, and the number of codes required for the code books 3 and 101 can also be reduced.

【００３１】実施の形態２.以下、本発明の実施の形態
２によるオーディオ信号量子化装置について、図面を参
照しながら説明する。図５（ａ）は本実施の形態２にお
けるオーディオ信号量子化装置の構成を示す図であり、
図３において、１０３は人間の聴覚心理特性を考慮した
各周波数における相対的な聴覚心理量のテーブルを格納
する聴覚心理重みベクトルテーブルである。Embodiment 2 Hereinafter, an audio signal quantization apparatus according to Embodiment 2 of the present invention will be described with reference to the drawings. FIG. 5A is a diagram illustrating a configuration of an audio signal quantization device according to Embodiment 2.
In FIG. 3, reference numeral 103 denotes an psychoacoustic weight vector table for storing a table of relative psychological quantities at each frequency in consideration of psychoacoustic characteristics of a human.

【００３２】以下、動作について説明する。上記実施の
形態１との違いは、聴覚心理重みベクトルテーブル１０
３が新たに付加された構成であることである。聴覚心理
重みベクトルとは、人間の聴覚心理モデルに基づき、周
波数に対する聴覚の感度特性として定義された聴覚感度
テーブルなどから、本実施の形態の入力であるオーディ
オ特徴ベクトルの各要素に対する同じ周波数帯の要素を
集めてベクトルとしたものである。これは例えば、図５
（ｂ）に示されるように、周波数２．５ＫＨｚ程度にピ
ークを有しており、周波数の最も低い位置にあるものが
必ずしも人の聴覚的に重要でないものであることがわか
る。The operation will be described below. The difference from the first embodiment is that the psychoacoustic weight vector table 10
3 is a newly added configuration. The psychoacoustic weight vector is based on a human psychoacoustic model, and from an auditory sensitivity table defined as an auditory sensitivity characteristic with respect to frequency, the same frequency band for each element of the audio feature vector that is the input of the present embodiment. It is a collection of elements into a vector. This is illustrated in FIG.
As shown in (b), it has a peak at a frequency of about 2.5 KHz, and it can be seen that the one at the lowest frequency is not necessarily important to human hearing.

【００３３】すなわち、本実施の形態では、オーディオ
特徴ベクトルをオーディオコード選択部１０２への入力
として、聴覚心理重みベクトルテーブル１０３をコード
選択時の重みとして、送信側コードブック３中の各コー
ドとオーディオ特徴ベクトルとの聴覚的な距離を算出
し、最小の距離を与えるコードに対するコードインデッ
クスを出力する。コードインデックスをｉとした場合に
オーディオコード選択部１０２におけるコード選択時の
距離尺度Ｄi は、例えば、That is, in the present embodiment, the audio feature vector is input to the audio code selection unit 102, and the psychoacoustic weight vector table 103 is used as the weight at the time of code selection. The auditory distance from the feature vector is calculated, and the code index for the code that gives the minimum distance is output. When the code index is i, the distance scale Di at the time of code selection in the audio code selection unit 102 is, for example,

【００３４】[0034]

【数４】 (Equation 4)

【００３５】となる。ここで、Ｎは送信側コードブック
３中の全コードの個数であり、Cij はコードインデック
スｉにおけるｊ番目の要素の値である。M は本実施例の
場合、１９以下の数、例えば、１などである。P は距離
計算におけるノルムであり、例えば、２などである。Wj
は、聴覚心理重みベクトルテーブル１０３のj 番目の要
素である。また、abs( ) は絶対値演算を意味する。## EQU1 ## Here, N is the number of all codes in the transmission side codebook 3, and Cij is the value of the j-th element in the code index i. M is a number of 19 or less, for example, 1 in the present embodiment. P is a norm in the distance calculation, for example, 2 or the like. Wj
Is the j-th element of the psychoacoustic weight vector table 103. Abs () means an absolute value operation.

【００３６】位相情報抽出部１０７では、聴覚心理重み
ベクトルテーブル１０３から、どの周波数のオーディオ
特徴ベクトルに相当する要素の位相情報を抽出するかを
判断し、その範囲で最小のDiを与えるコードインデック
スI とＭ個の位相情報Ph(j)j=0 to Mを出力する。位相
情報Ph(j) は、数２式で同様に定義される。The phase information extraction unit 107 determines from the psychoacoustic weight vector table 103 which phase information of an element corresponding to an audio feature vector of which frequency is to be extracted, and a code index I that gives the minimum Di in the range. And M pieces of phase information Ph (j) j = 0 to M are output. The phase information Ph (j) is similarly defined by Expression 2.

【００３７】このように本実施の形態２によれば、符号
化部１で符号化を行う際に、オーディオ特徴ベクトルを
分割して作成されたサブベクトルと送信側コードブック
３の中の各オーディオコードとの聴覚的な距離のうち最
小距離を有するオーディオコードを選択するにあたり、
聴覚的な重要度が高いサブベクトルの要素に相当する部
分については、オーディオコード選択部１０２でその位
相情報を示す正負符号を無視して取り扱って、送信側コ
ードブック３のオーディオコードとの比較検索を行い、
得られた結果に別途、位相情報抽出部１０７で抽出した
上記サブベクトルの要素部分に対応する位相情報を付加
してコードインデックスとして出力するようにしたの
で、体感的な音質の劣化を招くことなく、オーディオコ
ード選択部１０２における計算量を削減でき、また、コ
ードブック３，１０１に必要なコード数をも削減するこ
とができる。As described above, according to the second embodiment, when encoding is performed by the encoding unit 1, the sub-vectors created by dividing the audio feature vector and each audio In selecting the audio code that has the minimum distance from the audible distance to the code,
For the portion corresponding to the sub-vector element having high auditory importance, the audio code selection unit 102 treats the portion ignoring the sign indicating the phase information, and compares it with the audio code in the transmission side code book 3. Do
Separately, the phase information corresponding to the element portion of the sub-vector extracted by the phase information extraction unit 107 is added to the obtained result and output as a code index, so that the perceived sound quality does not deteriorate. The amount of calculation in the audio code selection unit 102 can be reduced, and the number of codes required for the code books 3 and 101 can also be reduced.

【００３８】また、上記オーディオコード選択部１０２
でその位相情報を示す正負符号を無視して取り扱うオー
ディオ特徴ベクトルを、人間の聴覚心理特性を考慮した
各周波数における相対的な聴覚心理量のテーブルを格納
する聴覚心理重みベクトルテーブル１０３を用いて重み
付けして選択することにより、上記実施の形態１のよう
に単に低域から所定個数のオーディオ特徴ベクトルを選
択するものに比べて、より体感的に音質の優れた量子化
を行うことができる。The audio code selecting section 102
The audio feature vector handled by ignoring the sign indicating the phase information is weighted using the psychoacoustic weight vector table 103 that stores a table of the relative psychological quantity at each frequency in consideration of the psychoacoustic characteristics of a human. By doing so, it is possible to perform quantization with excellent sound quality more sensuously than in the case of simply selecting a predetermined number of audio feature vectors from the low band as in the first embodiment.

【００３９】実施の形態３．以下、本発明の実施の形態
３によるオーディオ信号量子化装置について、図面を参
照しながら説明する。図６（ａ）は本実施の形態３にお
けるオーディオ信号量子化装置の構成を示す図であり、
図において、１０４は平滑ベクトルテーブルであり、実
際には除算カーブ等のデータが格納されている。１０５
はオーディオ特徴ベクトルを平滑ベクトルテーブル１０
４に格納された平滑ベクトルを用いて、ベクトル要素同
士の除算により平滑化を行う平滑化部である。Embodiment 3 Hereinafter, an audio signal quantization device according to Embodiment 3 of the present invention will be described with reference to the drawings. FIG. 6A is a diagram illustrating a configuration of an audio signal quantization device according to the third embodiment.
In the figure, reference numeral 104 denotes a smooth vector table, which actually stores data such as a division curve. 105
Represents the audio feature vector as a smooth vector table 10.
4 is a smoothing unit that performs smoothing by dividing vector elements using the smoothed vector stored in No. 4.

【００４０】以下、動作について説明する。上記平滑化
部１０５に入力されるものは、上記実施の形態１および
実施の形態２におけるオーディオ信号量子化装置と同
様、オーディオ特徴ベクトルであり、この平滑化部１０
５では平滑ベクトルテーブル１０４に格納された平滑ベ
クトルである除算カーブを用いて、入力されたオーディ
オ特徴ベクトルに平滑演算を行う。この平滑演算は、例
えば、入力のオーディオ特徴ベクトルをＸとし、平滑ベ
クトル１０４をＦとし、平滑化部１０５の出力をＹとし
て、それぞれのベクトルのI 番目の要素をXi,Fi,Yiとし
た場合に、数５式で表される処理を行なう。The operation will be described below. What is input to the smoothing unit 105 is an audio feature vector, similarly to the audio signal quantizing devices in the first and second embodiments.
In step 5, a smoothing operation is performed on the input audio feature vector using a division curve that is a smooth vector stored in the smooth vector table 104. This smoothing operation is performed, for example, when the input audio feature vector is X, the smoothed vector 104 is F, the output of the smoothing unit 105 is Y, and the I-th element of each vector is Xi, Fi, Yi. Then, the processing represented by Expression 5 is performed.

【００４１】[0041]

【数５】 (Equation 5)

【００４２】平滑ベクトルテーブル１０４は、一般的に
オーディオ特徴ベクトルをＭＤＣＴ係数とした場合に
は、ＭＤＣＴ係数の分散を小さくするような値である。
図６（ｂ）は上記平滑化の処理を模式的に示したもので
あり、サブベクトル化された要素のうちの低域側から２
つ分の要素に対して除算処理を行うことにより、周波数
毎の情報量の格差（レンジ）を縮めることができる。In general, when the audio feature vector is an MDCT coefficient, the smoothed vector table 104 has a value that reduces the variance of the MDCT coefficient.
FIG. 6 (b) schematically shows the smoothing process, in which two of the sub-vectorized elements are counted from the low frequency side.
By performing the division process on one element, the difference (range) of the information amount for each frequency can be reduced.

【００４３】そして、平滑化部１０５の出力は、オーデ
ィオコード選択部１０２の入力となり、コード選択部１
０２では上述した実施の形態１と同様に、平滑化された
オーディオ特徴ベクトルは、位相情報抽出部１０７に
て、周波数の低い方から２番目までの要素に対して、そ
の位相情報が抽出され、一方、オーディオコード選択部
１０２では、送信側コードブック３に格納された２５６
個のコードとの検索を行う。このとき、そのまま得られ
た検索結果に対応するコードインデックス（８ｂｉｔ）
を出力すると正しい検索結果が得られないので、平滑ベ
クトルテーブル１０４より平滑化処理時の情報を受け取
り、スケーリングを調整した上で、該検索結果に対応す
るコードインデックス（８ｂｉｔ）の選択を行い、この
ようにして得られた結果に２ビット分の位相情報を付加
して１０ｂｉｔのコードインデックスI を出力する。The output of the smoothing unit 105 is input to the audio code selection unit 102, and the code selection unit 1
02, the phase information of the smoothed audio feature vector is extracted by the phase information extraction unit 107 for the elements from the lowest frequency to the second element, as in the first embodiment. On the other hand, in the audio code selection unit 102, the 256
Search with this code. At this time, a code index (8 bits) corresponding to the search result obtained as it is
Is output, the correct search result cannot be obtained. Therefore, the information at the time of the smoothing process is received from the smoothing vector table 104, the scaling is adjusted, and the code index (8 bits) corresponding to the search result is selected. The result obtained in this way is added with 2-bit phase information, and a 10-bit code index I is output.

【００４４】このときのオーディオベクトルとコードブ
ック３１に格納されたコードとの距離Diは、例えば、平
滑ベクトルテーブル１０４の各ｉ番目の要素をFiとし
て、数６式のように示される。At this time, the distance Di between the audio vector and the code stored in the codebook 31 is represented by, for example, Equation 6 where each i-th element of the smoothed vector table 104 is Fi.

【００４５】[0045]

【数６】 (Equation 6)

【００４６】ここで、Ｎは送信側コードブック３１中の
全コードの個数であり、Cij はコードインデックスｉに
おけるｊ番目の要素の値である。M は本実施の形態の場
合、１９以下の数、例えば、１などである。P は距離計
算におけるノルムであり、例えば、２などである。Wj
は、聴覚心理重みベクトルテーブル１０３のj 番目の要
素である。また、abs( ) は絶対値演算を意味する。位
相情報抽出部１０７では、最小のDiを与えるコードイン
デックスｉとＭ個の位相情報Ph(j) j=0 to Mを出力す
る。位相情報Ph(j) は、数２式で同様に定義される。Here, N is the number of all codes in the transmission side code book 31, and Cij is the value of the j-th element in the code index i. In the present embodiment, M is a number of 19 or less, for example, 1 or the like. P is a norm in the distance calculation, for example, 2 or the like. Wj
Is the j-th element of the psychoacoustic weight vector table 103. Abs () means an absolute value operation. The phase information extraction unit 107 outputs a code index i that gives the minimum Di and M pieces of phase information Ph (j) j = 0 to M. The phase information Ph (j) is similarly defined by Expression 2.

【００４７】このように本実施の形態３によれば、符号
化部１で符号化を行う際に、オーディオ特徴ベクトルを
分割して作成されたサブベクトルと送信側コードブック
３１の中の各オーディオコードとの聴覚的な距離のうち
最小距離を有するオーディオコードを選択するにあた
り、聴覚的な重要度が高いサブベクトルの要素に相当す
る部分については、オーディオコード選択部１０２でそ
の位相情報を示す正負符号を無視して取り扱って、送信
側コードブック３１のオーディオコードとの比較検索を
行い、得られた結果に別途、位相情報抽出部１０７で抽
出した上記サブベクトルの要素部分に対応する位相情報
を付加してコードインデックスとして出力するようにし
たので、体感的な音質の劣化を招くことなく、オーディ
オコード選択部１０２における計算量を削減でき、ま
た、コードブック３，１０１に必要なコード数をも削減
することができる。As described above, according to the third embodiment, when encoding is performed by the encoding unit 1, the sub-vectors created by dividing the audio feature vector and each audio In selecting the audio code having the minimum distance from the audible distance to the code, a portion corresponding to the element of the sub-vector having high audibility is positive or negative indicating the phase information by the audio code selection unit 102. The code is treated ignoring the code, a comparison search with the audio code of the transmission side code book 31 is performed, and phase information corresponding to the element part of the sub-vector extracted by the phase information extraction unit 107 is separately added to the obtained result. Since the audio code is output as a code index, the audio code selection unit 10 does not cause a perceptible deterioration in sound quality. In reduces the amount of calculation, also can be reduced even code number necessary codebook 3,101.

【００４８】また、入力されたオーディオ特徴ベクトル
を平滑化テーブル１０４，平滑化部１０５を用いて平滑
化するようにしたから、オーディオコード選択部１０２
で検索を行う際に参照される、送信側コードブック３１
に記憶しておくコードブックの周波数毎の情報量を全体
的に少ないものとすることができる。Further, since the input audio feature vector is smoothed using the smoothing table 104 and the smoothing unit 105, the audio code selecting unit 102
Sender codebook 31 referred to when searching by
The amount of information for each frequency of the codebook stored in the codebook can be reduced as a whole.

【００４９】実施の形態４．以下、本発明の実施の形態
４によるオーディオ信号量子化装置について、図面を参
照しながら説明する。図７は本発明の実施の形態４にお
けるオーディオ信号量子化装置の構成を示す図であり、
図において、図４に示した実施の形態３との違いは、オ
ーディオコード選択部１０２でコードを選択する際に、
平滑ベクトルテーブル１０４に加えて、実施の形態２で
用いた聴覚心理重みベクトルテーブル１０３をも使用す
るようにした点である。Embodiment 4 Hereinafter, an audio signal quantization device according to Embodiment 4 of the present invention will be described with reference to the drawings. FIG. 7 is a diagram showing a configuration of an audio signal quantization device according to Embodiment 4 of the present invention.
In the figure, the difference from the third embodiment shown in FIG. 4 is that when the audio code selecting unit 102 selects a code,
The point is that the psychoacoustic weight vector table 103 used in the second embodiment is used in addition to the smooth vector table 104.

【００５０】以下、動作について説明する。平滑化部１
０５に入力されるものは、上記各実施の形態と同様、オ
ーディオ特徴ベクトルであり、この平滑化部１０５の出
力は、オーディオコード選択部１０２の入力となり、オ
ーディオコード選択部１０２では、送信側コードブック
３１中の各コードと上記平滑化部１０５の出力との距離
を、平滑ベクトルテーブル１０４から出力される平滑処
理時の情報に基づいて、平滑処理時のスケーリングを考
慮しつつ聴覚心理重みベクトルテーブル１０３の聴覚心
理重みベクトルによる重み付けを加味して算出する。上
記実施の形態２，３と同様の表記を用いて、距離Diは、
例えば、数７式のように示される。The operation will be described below. Smoothing unit 1
Input to the audio code vector 05 is an audio feature vector, as in the above embodiments. The output of the smoothing unit 105 is input to the audio code selection unit 102, and the audio code selection unit 102 The distance between each code in the book 31 and the output of the smoothing unit 105 is determined based on the information at the time of the smoothing process output from the smoothing vector table 104 while taking into account the scaling at the time of the smoothing process. The calculation is made in consideration of the weighting by the psychoacoustic weight vector 103. Using the same notation as in Embodiments 2 and 3, distance Di is:
For example, it is shown as in equation (7).

【００５１】[0051]

【数７】 (Equation 7)

【００５２】ここで、Ｎは送信側コードブック３１中の
全コードの個数であり、Cij はコードインデックスｉに
おけるｊ番目の要素の値である。M は本実施例の場合、
１９以下の数、例えば、１などである。P は距離計算に
おけるノルムであり、例えば、２などである。Wjは、聴
覚心理重みベクトルテーブル１０３のj 番目の要素であ
る。また、abs( ) は絶対値演算を意味する。位相情報
抽出部１０７では、最小のDiを与えるコードインデック
スI とＭ個の位相情報Ph(j) j=0 to Mを出力する。位相
情報Ph(j) は、上記数２式で同様に定義される。Here, N is the number of all codes in the transmission side code book 31, and Cij is the value of the j-th element in the code index i. M is in the case of this embodiment,
The number is 19 or less, for example, 1 or the like. P is a norm in the distance calculation, for example, 2 or the like. Wj is the j-th element of the psychoacoustic weight vector table 103. Abs () means an absolute value operation. The phase information extraction unit 107 outputs a code index I that gives the minimum Di and M pieces of phase information Ph (j) j = 0 to M. The phase information Ph (j) is similarly defined by the above equation (2).

【００５３】このように本実施の形態４によれば、符号
化部１で符号化を行う際に、オーディオ特徴ベクトルを
分割して作成されたサブベクトルと送信側コードブック
３１の中の各オーディオコードとの聴覚的な距離のうち
最小距離を有するオーディオコードを選択するにあた
り、聴覚的な重要度が高いサブベクトルの要素に相当す
る部分については、オーディオコード選択部１０２でそ
の位相情報を示す正負符号を無視して取り扱って、送信
側コードブック３１のオーディオコードとの比較検索を
行い、得られた結果に別途、位相情報抽出部１０７で抽
出した上記サブベクトルの要素部分に対応する位相情報
を付加してコードインデックスとして出力するようにし
たので、体感的な音質の劣化を招くことなく、オーディ
オコード選択部１０２における計算量を削減でき、ま
た、送信側コードブック３１に必要なコード数も削減す
ることができる。As described above, according to the fourth embodiment, when encoding is performed by the encoding unit 1, the sub-vectors created by dividing the audio feature vector and each audio In selecting the audio code having the minimum distance from the audible distance to the code, a portion corresponding to the element of the sub-vector having high audibility is positive or negative indicating the phase information by the audio code selection unit 102. The code is treated ignoring the code, a comparison search with the audio code of the transmission side code book 31 is performed, and phase information corresponding to the element part of the sub-vector extracted by the phase information extraction unit 107 is separately added to the obtained result. Since the audio code is output as a code index, the audio code selection unit 10 does not cause a perceptible deterioration in sound quality. It reduces the calculation amount in, and can also reduce the number of codes required for transmitting side code book 31.

【００５４】また、上記オーディオコード選択部１０２
でその位相情報を示す正負符号を無視して取り扱うオー
ディオ特徴ベクトルを、人間の聴覚心理特性を考慮した
各周波数における相対的な聴覚心理量のテーブルを格納
する聴覚心理重みベクトルテーブル１０３を用いて重み
付けして選択することにより、上記実施の形態１のよう
に単に低域から所定個数のオーディオ特徴ベクトルを選
択するものに比べて、より体感的に音質の優れた量子化
を行うことができる。The audio code selection unit 102
The audio feature vector handled by ignoring the sign indicating the phase information is weighted using the psychoacoustic weight vector table 103 that stores a table of the relative psychological quantity at each frequency in consideration of the psychoacoustic characteristics of a human. By doing so, it is possible to perform quantization with excellent sound quality more sensuously than in the case of simply selecting a predetermined number of audio feature vectors from the low band as in the first embodiment.

【００５５】さらに、入力されたオーディオ特徴ベクト
ルを平滑化テーブル１０４，平滑化部１０５を用いて平
滑化するようにしたから、オーディオコード選択部１０
２で検索を行う際に参照される、送信側コードブック３
１に記憶しておくコードブックの周波数毎の情報量を全
体的に少ないものとすることができる。Further, the input audio feature vector is smoothed using the smoothing table 104 and the smoothing unit 105.
Sender-side codebook 3 that is referenced when performing a search in 2
1, the amount of information for each frequency of the codebook stored in the storage unit 1 can be reduced as a whole.

【００５６】実施の形態５．以下、本発明の実施の形態
５によるオーディオ信号量子化装置について、図面を参
照しながら説明する。図８は本発明の実施の形態５にお
けるオーディオ信号量子化装置の構成を示す図であり、
図において、１０６はソート部であり、聴覚心理重みベ
クトルテーブル１０３の出力と平滑ベクトルテーブル１
０４の出力とを受け、算出されたベクトルのうちのもっ
とも大きな要素を複数個選択してこれを出力する。Embodiment 5 Hereinafter, an audio signal quantization device according to Embodiment 5 of the present invention will be described with reference to the drawings. FIG. 8 is a diagram showing a configuration of an audio signal quantization device according to Embodiment 5 of the present invention.
In the figure, reference numeral 106 denotes a sort unit, which outputs the psychoacoustic weight vector table 103 and the smoothed vector table 1.
04, the plurality of the largest elements among the calculated vectors are selected and output.

【００５７】以下、その動作について説明する。本実施
の形態５と上記実施の形態４の構成上の違いは、ソート
部１０６が付加されたことと、オーディオコード選択部
１０２のコードインデックスを選択して出力する方法が
違う点にある。すなわち、ソート部１０６では、聴覚心
理重みベクトルテーブル１０３と平滑ベクトルテーブル
１０４の出力とを入力とし、例えば、ベクトルＷＦのｊ
番目の要素をＷＦｊと定義すると、以下の数８式で示さ
れる。The operation will be described below. The configuration difference between the fifth embodiment and the fourth embodiment is that the sorting unit 106 is added and the method of selecting and outputting the code index of the audio code selection unit 102 is different. That is, the sorting unit 106 receives the psychoacoustic weight vector table 103 and the output of the smooth vector table 104 as inputs, and
If the ith element is defined as WFj, it is expressed by the following equation (8).

【００５８】[0058]

【数８】 (Equation 8)

【００５９】そして、このソート部１０６では、ベクト
ルＷＦの各要素ＷＦｊの中から最も大きなＲ個の要素を
算出し、そのＲ個の要素番号をソート部１０６の出力と
する。オーディオコード選択部１０２では、上記各実施
の形態と同様、距離Diを算出する。距離Diは、例えば、
次の数９式で示される。The sorting unit 106 calculates the largest R elements from the respective elements WFj of the vector WF, and uses the R element numbers as the output of the sorting unit 106. The audio code selection unit 102 calculates the distance Di as in the above embodiments. The distance Di is, for example,
It is shown by the following equation (9).

【００６０】[0060]

【数９】 (Equation 9)

【００６１】ここで、Ｒj はソート部１０６によって、
出力された要素番号であるならRjは１であり、出力され
た要素番号でないならRjは０とする。Ｎは送信側コード
ブック３１中の全コードの個数であり、Cij はコードイ
ンデックスｉにおけるｊ番目の要素の値である。M は本
実施例の場合、１９以下の数、例えば、１などである。
P は距離計算におけるノルムであり、例えば、２などで
ある。Wjは、聴覚心理重みベクトルテーブル１０３のj
番目の要素である。また、abs( ) は絶対値演算を意味
する。位相情報抽出部１０７では、最小のDiを与えるコ
ードインデックスI とＭ個の位相情報Ph(j) j= 0toＲを
出力する。位相情報Ph(j) は、数１０式で定義される。Here, Rj is determined by the sorting unit 106.
If the element number is an output element number, Rj is 1, and if it is not an output element number, Rj is 0. N is the number of all codes in the transmission side codebook 31, and Cij is the value of the j-th element in the code index i. M is a number of 19 or less, for example, 1 in the present embodiment.
P is a norm in the distance calculation, for example, 2 or the like. Wj is j in the psychoacoustic weight vector table 103
Element. Abs () means an absolute value operation. The phase information extraction unit 107 outputs a code index I that gives the minimum Di and M pieces of phase information Ph (j) j = 0toR. The phase information Ph (j) is defined by Expression 10.

【００６２】[0062]

【数１０】 (Equation 10)

【００６３】ただし、Ph(j) は、ソート部１０６で出力
された要素番号に対応するもののみについて計算され
る。本例では（Ｒ＋１）個である。本実施の形態５の構
成を用いる場合には、図１の構成の復号化部２において
も、ソート部１０６を備えた構成とする必要がある。However, Ph (j) is calculated only for those corresponding to the element numbers output by the sorting unit 106. In this example, the number is (R + 1). In the case where the configuration of the fifth embodiment is used, it is necessary that the decoding unit 2 having the configuration of FIG.

【００６４】このように本実施の形態５によれば、上記
実施の形態４において、平滑ベクトルテーブル１０４の
出力と聴覚心理重みベクトルテーブル１０３の出力とを
受け、これらの出力結果から、ベクトルのうちのもっと
も大きな要素、すなわち、重み絶対値の大きな要素を複
数個選択してこれをオーディオコード選択部１０２に出
力するようにしたので、人間の聴覚特性にとって意味の
大きな要素と、物理的に重要な要素とを両方加味してコ
ードインデックスを算出することができ、より品質の高
いオーディオ信号圧縮化を行うことができる。なお、本
実施の形態５では、平滑ベクトル１０４と聴覚心理重み
ベクトル１０３の両者を考慮した重みの絶対値の大きい
ものから要素を選択する個数をＲ個としたが、これは、
上記実施の形態１〜４で使用したＭと同じ数値であって
もかまわない。As described above, according to the fifth embodiment, in the fourth embodiment, the output of the smoothed vector table 104 and the output of the psychoacoustic weight vector table 103 are received. Is selected, ie, a plurality of elements having a large absolute weight value are selected and output to the audio code selection unit 102. Therefore, elements that are significant for human auditory characteristics and elements that are physically important The code index can be calculated in consideration of both factors, and higher quality audio signal compression can be performed. In the fifth embodiment, the number of elements to be selected from those having the largest absolute value of the weight in consideration of both the smooth vector 104 and the psychoacoustic weight vector 103 is R.
The same numerical value as M used in the first to fourth embodiments may be used.

【００６５】実施の形態６．以下、本発明の実施の形態
６によるオーディオ信号逆量子化装置について、図面を
参照しながら説明する。図９は本発明の実施の形態６に
おけるオーディオ信号逆量子化装置の構成を示す図であ
り、図において、２１は図１の復号化部に相当するもの
であり、受信側コードブック６１とコード復号部５１と
から構成され、さらにコード復号化部５１は、オーディ
オコード選択部１０２と位相情報抽出部１０７とから構
成されたものとなっている。Embodiment 6 FIG. Hereinafter, an audio signal dequantization device according to Embodiment 6 of the present invention will be described with reference to the drawings. FIG. 9 is a diagram showing a configuration of an audio signal inverse quantization apparatus according to Embodiment 6 of the present invention. In FIG. 9, reference numeral 21 denotes the decoding unit of FIG. The code decoding unit 51 includes an audio code selection unit 102 and a phase information extraction unit 107.

【００６６】以下、その動作について説明する。本実施
の形態６では、コードインデックスを受信してデコード
する際に、上記実施の形態１〜５に示した量子化方式を
応用したものであり、すなわち、オーディオコード選択
部１０２では、例えば、受信した１０ｂｉｔのコードイ
ンデックスのうち人間の聴覚的に重要度の高い低域側か
ら２ｂｉｔ分の要素を除く、残り８ｂｉｔ分の要素につ
いて、受信側コードブック６１に記憶されたコードとの
比較検索を行い、上記除外した２ｂｉｔ分の要素の位相
情報については、これを位相情報抽出部１０７を用いて
抽出し、上記検索結果に対してこれを付加することで、
オーディオ特徴ベクトルを再生する、すなわち逆量子化
を行う。The operation will be described below. In the sixth embodiment, when the code index is received and decoded, the quantization scheme shown in the first to fifth embodiments is applied. A comparison search is performed on the remaining 8 bits of the 10-bit code index excluding the 2-bit elements from the low-frequency side of the human auditory significance, with the code stored in the reception-side codebook 61. By extracting the phase information of the 2 bits of the excluded elements using the phase information extracting unit 107 and adding the extracted information to the search result,
The audio feature vector is reproduced, that is, inverse quantization is performed.

【００６７】このようにすることで、上記受信側コード
ブックとしては、８ｂｉｔ分の要素に対応する２５６個
のコードを格納すればよく、受信側コードブック６１に
格納するデータ量を少ないものとすることができ、ま
た、オーディオコード選択部１０２での演算は２５６回
のコード検索と、検索結果に対して２符号を付加する処
理となり、大幅に演算量を削減することができる。In this way, the reception-side codebook only needs to store 256 codes corresponding to 8-bit elements, and the amount of data stored in the reception-side codebook 61 is small. In addition, the calculation in the audio code selection unit 102 is a process of 256 times of code search and a process of adding two codes to the search result, and the amount of calculation can be greatly reduced.

【００６８】なお、本実施の形態６では、実施の形態１
の構成を受信側の構成に応用したものを示したが、実施
の形態２〜５に示した構成のものを応用することも可能
であり、また、受信側に単独で用いるのではなく、上記
実施の形態１ないし５のいずれかの実施の形態と組み合
わせて用いることにより、オーディオ信号の圧縮、展開
をスムーズに行うことができるオーディオデータ送受信
システムを構築することができる。In the sixth embodiment, the first embodiment is used.
Is applied to the configuration on the receiving side, but the configuration shown in Embodiments 2 to 5 can also be applied, and instead of using the receiving side alone, When used in combination with any of the first to fifth embodiments, an audio data transmission / reception system that can smoothly compress and expand an audio signal can be constructed.

【００６９】[0069]

【発明の効果】以上のように、本発明の請求項１に係る
オーディオ信号量子化装置によれば、オーディオ信号を
入力し周波数変換したデータ（オーディオ特徴ベクト
ル）をサブデータ（サブベクトル）の単位で符号化する
オーディオ信号量子化装置であって、前記周波数変換さ
れたサブデータのうちの所定の周波数帯域に属する要素
データの位相情報を抽出する位相情報抽出部と、前記サ
ブデータを代表値で表す複数個のオーディオコードであ
って、前記所定の周波数帯域に属する要素データに対応
する値が位相成分を除いた絶対値であるオーディオコー
ドを格納するコードブックを有し、前記サブデータに対
応する前記オーディオコードを前記コードブックの中か
ら検索するとともに、検索したオーディオコードを識別
する情報と前記サブデータの中で位相成分が除かれた要
素データに対応する位相情報とを含むコードインデック
スを出力するオーディオコード選択部とを備えたので、
体感的な音質の劣化を招くことなく、オーディオコード
選択部における計算量を削減でき、また、コードブック
に記憶すべきコード数をも削減することができる効果が
ある。As described above, according to the audio signal quantization apparatus according to the first aspect of the present invention, the audio signal is
Input and frequency converted data (audio feature vector
) In sub-data (sub-vector) units
An audio signal quantizer, comprising:
Of the sub-data that belongs to the specified frequency band
A phase information extracting unit for extracting phase information of data;
Audio data representing the audio data with a representative value.
Corresponding to the element data belonging to the predetermined frequency band.
Audio code whose absolute value excludes the phase component
A code book for storing the sub-data,
Whether the corresponding audio code is in the codebook
Search and identify the searched audio code
And phase information in the sub-data
Code index including phase information corresponding to raw data
And an audio code selection unit that outputs
There is an effect that the amount of calculation in the audio code selection unit can be reduced and the number of codes to be stored in the code book can be reduced without causing a perceptible deterioration in sound quality.

【００７０】また、本発明の請求項３に係るオーディオ
信号量子化装置によれば、請求項１記載のオーディオ信
号量子化装置において、さらに、人間の聴覚心理モデル
に基づき各周波数における聴覚感度を表す聴覚感度テー
ブルを備え、前記位相情報抽出部が、前記聴覚感度テー
ブルに基づき、前記周波数変換されたサブデータに含ま
れる複数の要素データのうち、高い聴覚感度にある周波
数に属する要素データの位相情報を抽出するようにした
ので、より体感的に音質の優れた量子化を行うことがで
きる効果がある。[0070] Further, according to the audio signal quantization apparatus according to claim 3 of the present invention, in the audio signal quantization apparatus of 請 Motomeko 1, further comprising a human psychoacoustic model
Auditory sensitivity table representing the auditory sensitivity at each frequency based on
And the phase information extracting unit includes the audio information table.
Included in the frequency-converted sub-data based on the
Frequency of high sensitivity
Since the phase information of the element data belonging to the number is extracted , there is an effect that quantization with excellent sound quality can be performed more intuitively.

【００７１】また、本発明の請求項４に係るオーディオ
信号量子化装置によれば、請求項１から請求項３のいず
れかに記載のオーディオ信号量子化装置において、さら
に、前記周波数変換されたデータに含まれる複数のサブ
データに対し、周波数毎のレンジを縮小させ、それぞれ
の前記サブデータを平滑化する平滑化部を備え、前記オ
ーディオコード選択部が、平滑化されたサブデータの入
力を受け、対応するオーディオコードを検索するに際
し、前記平滑部からの情報に基づき、平滑化されたサブ
データのスケーリングを調整したうえで、対応するオー
ディオコードを検索するようにしたので、オーディオコ
ード選択部で検索を行う際に参照される、コードブック
に記憶しておくコードブックの周波数毎の情報量を全体
的に少ないものとすることができる効果がある。[0071] Further, according to the audio signal quantization apparatus according to claim 4 of the present invention, according to claim 3 noise from 請 Motomeko 1
In the audio signal quantization apparatus according Rekani, further
A plurality of subs contained in the frequency-converted data.
For data, reduce the range for each frequency,
A smoothing unit for smoothing the sub data of
The audio code selector inputs the smoothed sub data.
When searching for the corresponding audio code
Then, based on the information from the smoothing unit,
After adjusting the data scaling, the corresponding
Since the search for the audio code is performed, the amount of information for each frequency of the codebook stored in the codebook, which is referred to when searching in the audio code selection unit, can be reduced as a whole. effective.

【００７２】また、本発明の請求項５に係るオーディオ
信号量子化装置によれば、請求項４記載のオーディオ信
号量子化装置において、さらに、前記オーディオコード
選択部に入力する複数の前記サブデータを並び替えるソ
ート部を備え、前記ソート部が、前記平滑部からの情報
と前記聴覚感度テーブルを参照結果とに基づき、オーデ
ィオデータの物理的な特性と人間の聴覚特性との両者を
加味した重要度の高い順に複数の前記サブデータを並び
替えるものとしたので、人間の聴覚特性にとって意味の
大きな要素と、物理的に重要な要素とを両方加味してコ
ードインデックスを算出することができ、より品質の高
いオーディオ信号圧縮化を行うことができる効果があ
る。[0072] Further, according to the audio signal quantization apparatus according to claim 5 of the present invention, in the audio signal quantization apparatus of 請 Motomeko 4, further, the audio code
A source for rearranging a plurality of the sub-data input to the selection unit.
A sorting unit, wherein the sorting unit is configured to output information from the smoothing unit.
Based on the hearing sensitivity table and the reference result.
Both the physical characteristics of the audio data and the human auditory characteristics
Multiple sub-data items are arranged in descending order of importance
The code index can be calculated by taking into account both the factors that are significant for the human auditory characteristics and those that are physically important, thus achieving higher quality audio signal compression. There is an effect that can be done.

【００７３】また、本発明の請求項７に係るオーディオ
信号逆量子化装置によれば、オーディオ信号を入力し周
波数変換したデータ（オーディオ特徴ベクトル）をサブ
データ（サブベクトル）の単位で符号化するオーディオ
信号量子化装置から出力されるコードインデックスを入
力し、前記オーディオ信号を復号するオーディオ信号逆
量子化装置であって、前記コードインデックスは、前記
サブデータを代表値で表すオーディオコードを識別する
情報と位相情報とを含む情報であって、前記オーディオ
コードは所定の周波数帯域に属する要素データに対応す
る値が位相成分を除いた絶対値であり、前記位相情報は
対応するオーディオコードの位相成分が除かれた要素デ
ータの位相成分であり、前記オーディオ信号逆量子化装
置は、前記コードインデックスに含まれる位相情報を抽
出する位相情報抽出部と、前記サブデータを代表値で表
す複数個のオーディオコードであって、前記所定の周波
数帯域に属する要素データに対応する値が位相成分を除
いた絶対値であるオーディオコードを格納するコードブ
ックを有し、前記コードインデックスに含まれるオーデ
ィオコードを識別する情報に基づき、前記コードインデ
ックスに対応する前記オーディオコードを前記コードブ
ックの中から検索するとともに、前記コードインデック
スに含まれる位相情報を、検索したオーディオコードの
中で位相成分が除かれた要素データの位相成分として付
加し、これを出力信号として出力するオーディオコード
選択部とを備えたので、受信側で用いるコードブックに
格納するデータ量を少ないものとすることができ、ま
た、受信側での演算量を大幅に削減することができる効
果がある。Further, according to the audio signal dequantizing apparatus according to claim 7 of the present invention, an audio signal is input and
Wave number converted data (audio feature vector)
Audio to be encoded in units of data (subvector)
Enter the code index output from the signal quantizer.
Audio signal to decode and decode the audio signal
The quantization device, wherein the code index is:
Identify audio code that represents sub data with representative value
Information including information and phase information, wherein the audio
The code corresponds to the element data belonging to the predetermined frequency band.
Is the absolute value excluding the phase component, and the phase information is
Element data from which the phase component of the corresponding audio code has been removed
Phase component of the audio signal,
Extract the phase information included in the code index.
A phase information extraction unit to output the sub-data and a representative value.
A plurality of audio codes, wherein the predetermined frequency is
The values corresponding to the element data belonging to several bands exclude the phase component.
Code to store the absolute audio code
Audio code included in the code index
The code index based on the information identifying the
The audio code corresponding to the
And search the code index
Phase information contained in the audio code
Is added as the phase component of the element data with the phase component removed.
In addition, since an audio code selector is provided for outputting this as an output signal, the amount of data stored in the codebook used on the receiving side can be reduced, and the amount of calculation on the receiving side can be greatly increased. There is an effect that can be reduced.

[Brief description of the drawings]

【図１】この発明によるオーディオ信号量子化装置の
概念的な構成を示す図である。FIG. 1 is a diagram showing a conceptual configuration of an audio signal quantization device according to the present invention.

【図２】本発明の実施の形態１におけるオーディオ信
号量子化装置の符号化部の詳細な構成を示す図である。FIG. 2 is a diagram illustrating a detailed configuration of an encoding unit of the audio signal quantization device according to the first embodiment of the present invention.

【図３】上記実施の形態１におけるオーディオ信号量
子化装置のオーディオ特徴ベクトルの構成を説明するた
めの図である。FIG. 3 is a diagram illustrating a configuration of an audio feature vector of the audio signal quantization device according to the first embodiment.

【図４】上記実施の形態１におけるオーディオ信号量
子化装置の符号化処理を説明するための図である。FIG. 4 is a diagram for describing an encoding process of the audio signal quantization device according to the first embodiment.

【図５】本発明の実施の形態２におけるオーディオ信
号量子化装置の符号化部の詳細な構成を示す図、及び聴
覚心理重みベクトルテーブルの一例を示す図である。FIG. 5 is a diagram illustrating a detailed configuration of an encoding unit of the audio signal quantization device according to Embodiment 2 of the present invention, and is a diagram illustrating an example of an psychoacoustic weight vector table.

【図６】本発明の実施の形態３におけるオーディオ信
号量子化装置の符号化部の詳細な構成を示す図、及び平
滑化部における処理を説明するための図である。FIG. 6 is a diagram illustrating a detailed configuration of an encoding unit of an audio signal quantization device according to Embodiment 3 of the present invention, and a diagram for describing processing in a smoothing unit.

【図７】本発明の実施の形態４におけるオーディオ信
号量子化装置の符号化部の詳細な構成を示す図である。FIG. 7 is a diagram illustrating a detailed configuration of an encoding unit of an audio signal quantization device according to Embodiment 4 of the present invention.

【図８】本発明の実施の形態５におけるオーディオ信
号量子化装置の符号化部の詳細な構成を示す図である。FIG. 8 is a diagram illustrating a detailed configuration of an encoding unit of an audio signal quantization device according to a fifth embodiment of the present invention.

【図９】本発明の実施の形態６におけるオーディオ信
号逆量子化装置の構成を示す図である。FIG. 9 is a diagram showing a configuration of an audio signal inverse quantization device according to Embodiment 6 of the present invention.

【図１０】従来のオーディオ信号量子化装置の構成を
示す図である。FIG. 10 is a diagram showing a configuration of a conventional audio signal quantization device.

[Explanation of symbols]

１符号化部２，２１復号化部３，３１送信側コードブック４オーディオコード選択部５コード復号部６，６１受信側コードブック１０２オーディオコード選択部１０３聴覚心理重みベクトルテーブル１０４平滑ベクトルテーブル１０５平滑化部１０６ソート部１０７位相情報抽出部 DESCRIPTION OF SYMBOLS 1 Encoding part 2, 21 Decoding part 3, 31 Transmission codebook 4 Audio code selection part 5 Code decoding part 6, 61 Reception side codebook 102 Audio code selection part 103 Perceptual psychological weight vector table 104 Smooth vector table 105 Smoothing Conversion unit 106 sorting unit 107 phase information extraction unit

───────────────────────────────────────────────────── フロントページの続き (72)発明者石川智一大阪府門真市大字門真1006番地松下電器産業株式会社内 (56)参考文献特開平９−130260（ＪＰ，Ａ) 特開昭59−94797（ＪＰ，Ａ) 特開平７−248799（ＪＰ，Ａ) 特開平９−8667（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁷，ＤＢ名) G10L 19/00 - 19/14 H03M 7/30 H04B 14/04 ────────────────────────────────────────────────── ─── Continuation of the front page (72) Inventor Tomokazu Ishikawa 1006 Kazuma Kadoma, Kadoma, Osaka Prefecture Matsushita Electric Industrial Co., Ltd. (56) References JP-A-9-130260 (JP, A) JP-A-59- 94797 (JP, A) JP-A-7-248799 (JP, A) JP-A-9-8667 (JP, A) (58) Fields investigated (Int. Cl. ⁷ , DB name) G10L 19/00-19 / 14 H03M 7/30 H04B 14/04

Claims

(57) [Claims]

An audio signal is input and frequency-converted.
Data (audio feature vector)
Vector) audio signal quantization unit
A predetermined frequency of the frequency-converted sub-data.
Phase information for extracting phase information of element data belonging to band
An extraction unit, and a plurality of audio codes representing the sub-data as representative values;
Element data belonging to the predetermined frequency band.
Is the absolute value excluding the phase component.
And a code book for storing the audio code.
Search in the codebook and search for
Information identifying the audio code and the position in the sub data
Phase information corresponding to the element data with the phase component removed.
And an audio code selection unit for outputting a code index .

2. The method according to claim 1, wherein the phase information extracting unit is configured to perform the frequency conversion.
Of the multiple element data included in the sub data
Extract phase information of element data belonging to low frequency band
The audio signal quantization device according to claim 1, wherein:

3. The audio signal quantizing device further comprises:
Then, based on the human psychoacoustic model,
A hearing sensitivity table representing perceptual sensitivity, wherein the phase information extraction unit is configured to perform
A plurality of frequency-converted sub-data
Of the element data, belongs to the frequency with high auditory sensitivity
Claims: extracting phase information of element data.
Item 3. The audio signal quantization device according to Item 1 .

4. The audio signal quantizing device further comprises:
A plurality of subs contained in the frequency-converted data.
For data, reduce the range for each frequency,
Said comprising a smoothing unit for smoothing the sub data, the audio code selection unit was smoothed Sabude
Data input and search for the corresponding audio code
On the basis of the information from the smoothing unit,
After adjusting the sub data scaling,
Claim, characterized in that to search for audio code 1
An audio signal quantization device according to any one of claims 1 to 3 .

5. The audio signal quantizing device further comprises:
A plurality of the above-mentioned input to the audio code selection unit.
A sorter for rearranging the sub-data, wherein the sorter includes information from the smoother and the auditory sensitivity
Based on the table and the reference result, the audio data
Importance considering both physical characteristics and human auditory characteristics
Sorting the plurality of sub-data in descending order of
5. The audio signal quantization device according to claim 4, wherein:

6. The frequency-converted data is inputted.
Transform the transformed audio signal (modified discrete cosine)
2. The method according to claim 1, wherein the converted coefficient is used as an element.
An audio signal quantization device according to any one of claims 1 to 5 .

7. An audio signal is input and frequency-converted.
Data (audio feature vector)
Vector) audio signal quantization unit
Enter the code index output from the
Audio signal dequantizer that decodes audio signals
The code index indicates the sub data as a representative value.
Information identifying the audio code to be represented and phase information.
The audio code is at a predetermined frequency.
The value corresponding to the element data belonging to the band excludes the phase component
And the phase information is the corresponding audio code.
Is the phase component of the element data from which the phase component of the
The audio signal dequantizer extracts phase information included in the code index.
A phase information extracting unit, and a plurality of audio codes representing the sub-data as representative values;
Element data belonging to the predetermined frequency band.
Is the absolute value excluding the phase component.
An audio code included in the code index.
Corresponds to the code index based on the identifying information
The audio code from the codebook
Search and included in the code index
Phase information is searched for the phase component in the searched audio code.
Is added as the phase component of the element data from which
Audio code selector for outputting as an output signal
An audio signal inverse quantization device , characterized in that:

8. An audio signal is input and frequency-converted.
Data (audio feature vector)
Vector) audio signal quantization unit
An audio signal quantizing method, wherein the audio signal quantizing device substitutes the sub data.
A plurality of audio codes represented by table values,
The value corresponding to the element data belonging to the frequency band is the phase component
Code that stores the audio code that is the absolute value excluding
Comprising a codebook, said audio signal quantization method, the predetermined frequency of the frequency-converted sub data has been
Step of extracting phase information of element data belonging to a band
And the audio code corresponding to the sub-data is
Searching the code book; information identifying the searched audio code;
Position corresponding to the element data from which the phase component has been
Outputting a code index including phase information
Audio signal quantization direction, characterized in that including bets
Law .

9. An audio signal is input and frequency-converted.
Data (audio feature vector)
Vector) audio signal quantization unit
Enter the code index output from the
Audio signal dequantizer that decodes audio signals
An audio signal inverse quantization method, wherein the code index represents the sub-data as a representative value.
Information identifying the audio code to be represented and phase information.
The audio code is at a predetermined frequency.
The value corresponding to the element data belonging to the band excludes the phase component
And the phase information is the corresponding audio code.
Is the phase component of the element data from which the phase component of the
The audio signal dequantizer includes a plurality of audio codes representing the sub-data as representative values.
Element data belonging to the predetermined frequency band.
Is the absolute value excluding the phase component.
An audio codec, wherein the audio signal dequantization method extracts phase information included in the code index.
And an audio code included in the code index.
Corresponds to the code index based on the identifying information
The audio code from the codebook
Searching , and searching for phase information included in the code index.
Of the audio code with the phase component removed
Output as an output signal.
Audio signal inverse quantization wherein the comprising the steps of.