KR100300963B1

KR100300963B1 - Linked scalar quantizer

Info

Publication number: KR100300963B1
Application number: KR1019980037169A
Authority: KR
Inventors: 김무영; 조용덕
Original assignee: 윤종용; 삼성전자 주식회사
Priority date: 1998-09-09
Filing date: 1998-09-09
Publication date: 2001-09-22
Also published as: KR20000019195A

Abstract

본 발명은 스칼라 양자화기에 벡터 양자화기의 특성을 적용한 양자화기 및 음성부호화/복호화방법에 관한 것으로서, 본 발명에 의한 양자화기는 M개의 LSF 중 우선 임의의 N번째 LSF는 일반적인 SQ 방식에 의해 코드북을 형성하고, (N+K+1)번째 LSF(단, K는 0 이상이고, (M-N-1) 이하의 정수)는 각각 (N+K)번째 LSF의 분포에 근거한 코드북을 형성하고, (N-K-1)번째 LSF(단, K는 0 이상이고, (N-2) 이하인 정수)는 각각 (N-K)번째 LSF의 분포에 근거한 코드북을 형성하도록 함을 특징으로 한다.The present invention relates to a quantizer and a speech encoding / decoding method for applying a characteristic of a vector quantizer to a scalar quantizer, and a quantizer according to the present invention is characterized in that an arbitrary Nth LSF among M LSFs forms a codebook by a general SQ method (N + K) th LSF (where K is an integer greater than or equal to 0 and equal to or smaller than (MN-1)) forms a codebook based on the distribution of (N + K) th LSFs, 1) th LSF (where K is an integer equal to or greater than 0 and equal to or smaller than (N-2)) forms a codebook based on the distribution of the (NK) th LSFs.

본 발명에 의하면, ITU-T, SG 16, QN 21에 제안한 4kbps 음성 코더에 적용함으로써 기존의 방식에 비해 메모리와 비트수를 절약하면서 향상된 성능을 얻을 수 있었다.According to the present invention, by applying the present invention to the 4kbps voice coder proposed in ITU-T, SG 16, and QN 21, improved performance can be obtained while saving memory and bits compared to the conventional method.

Description

Linked scalar quantizer

본 발명은 음성부호화/복호화 장치에 사용되는 양자화기에 관한 것으로서, 특히 스칼라 양자화기에 벡터 양자화기의 특성을 적용한 양자화기 및 음성부호화/복호화방법에 관한 것이다.The present invention relates to a quantizer used in a speech encoding / decoding apparatus, and more particularly to a quantizer and a speech encoding / decoding method in which the characteristics of a vector quantizer are applied to a scalar quantizer.

최근들어, 세계적으로 음성 부호화기의 표준화가 진행되고 있다. 표준화에 참가한 음성부호화기들은 음성을 스펙트럼 포락선과 여기신호로 나누어 표현하고, 스펙트럼 포락선과 여기신호를 각각 양자화한 비트 스트림을 전송하는 방식을 채택하고 있다. 일반적으로, 스펙트럼 포락선을 나타내기 위해 선형 예측(linear predictive coding : 이하에서 LPC라 한다) 계수를 사용하고 있다. 하지만, LPC 계수는 직접 양자화하기에는 너무 변화가 심하므로 선스펙트럼 주파수(line spectrum frequencies : 이하에서 LSF라 한다)로 변환하여 양자화한다.In recent years, standardization of speech coder has been progressing globally. The speech coders participating in the standardization adopt a scheme of expressing the speech by the spectral envelope and the excitation signal, and transmitting the bitstream in which the spectral envelope and the excitation signal are respectively quantized. Generally, linear predictive coding (LPC) coefficients are used to represent the spectral envelope. However, since the LPC coefficients are too variable to be directly quantized, they are converted into line spectrum frequencies (LSFs) and quantized.

LSF의 양자화 방법은 크게 스칼라 양자화(Scalar Quantization)와 벡터 양자화(Vector Quantization)의 두가지 방법이 있다.There are two methods of LSF quantization: Scalar Quantization and Vector Quantization.

M차 LSF를 스칼라 양자화기(Scalar Quantizer : 이하에서 SQ라 한다)로 설계할 경우와 벡터 양자화기(Vector Quantizer : 이하에서 VQ라 한다)로 설계할 경우의 가장 큰 차이점은 각 차수간의 상관관계(Correlation)을 사용하는가 여부이다. 설명의 편의를 위해 M차 LSF 벡터를 아래와 같이 표현한다.The major difference between designing a M-order LSF with a Scalar Quantizer (SQ) and a Vector Quantizer (VQ) is that the correlation between the orders Correlation is used. For convenience of explanation, the M-order LSF vector is expressed as follows.

｛ omega_1 ,omega_2 ,omega_3 ,…,omega_M ｝{Omega_1, omega_2, omega_3, ... , omega_M}

M차 LSF를 직접 벡터양자화하여 얻어진 코드벡터는부터차까지의 모든 코드벡터가 상관관계를 가지도록 설계된다. 즉,은과 상관관계를 가지도록 설계되므로,이의 분포를 제한하는 효과를 가진다. 따라서, 동일 비트수로 양자화한 경우 VQ는 SQ에 비해 뛰어난 성능을 나타내지만, 메모리 요구량과 검색시간 컴플렉서티(Search Time Complexity)가 더 많이 요구된다. M = 16, 요구되는 비트수 = 32비트, LSF 하나를 저장하는데 필요한 메모리가 1바이트라고 가정하면,A code vector obtained by directly vector quantizing the M-th order LSF The from All code vectors up to the car are designed to have a correlation. In other words, silver And therefore, this The effect of limiting the distribution of Therefore, when quantized with the same number of bits, VQ shows superior performance to SQ, but requires more memory requirement and search time complexity. Assuming that M = 16, the number of required bits = 32 bits, the memory required to store one LSF is 1 byte,

16 × 2³²× 1바이트 = 64 기가바이트(Gbyte)16 x 2 ³² x 1 byte = 64 gigabytes (Gbyte)

의 메모리가 요구되므로 실제 시스템에서는 구현하기 불가능하다.It is impossible to implement it in an actual system.

M차 LSF를 스칼라 양자화하여 얻어진 코드벡터는부터차까지의 모든 코드벡터가 독립적으로 설계된다. 즉,은과 전혀 상관관계를 가지지 않도록 설계되므로, 존재하지 않는 코드벡터의 조합이 다수 존재할 수 있다. 반면에 M = 16, 요구되는 비트수 = 32비트, LSF 하나를 저장하는데 필요한 메모리가 1byte라고 가정하면 (각 차수에 2비트씩 할당되는 경우),The code vector obtained by scalar quantizing the M-th order LSF The from All code vectors to the car are designed independently. In other words, silver So that there are many combinations of code vectors that do not exist. On the other hand, assuming that M = 16, the number of required bits = 32 bits, and the memory required to store one LSF is 1 byte (two bits are assigned to each order)

16 × 2²× 1바이트 = 64 바이트(byte)16 × 2 ² × 1 byte = 64 bytes (byte)

의 메모리가 요구되므로 실제 시스템에서의 구현이 용이하다. 하지만, VQ에 비해서 급격한 성능 저하가 발생한다.Memory is required, so it is easy to implement in an actual system. However, there is a dramatic performance degradation compared to VQ.

결국, SQ는 메모리와 검색시간 관점에서는 만족할만한 성능을 나타내지만, 너무 많은 전송비트가 필요하므로 저전송 음성부호화기에 적용되기에는 문제가 있다. 반면에 VQ는 SQ에 비해서 급격하게 전송비트수를 줄일 수 있지만, 검색시간과 메모리 요구량이 너무 많아서 디지털신호처리기(Digital Signal Processor:DSP) 상에 구현하기에는 문제가 있다.As a result, although the SQ exhibits satisfactory performance in terms of memory and search time, it requires too many transmission bits, which is problematic to be applied to a low transmission speech coder. On the other hand, VQ can reduce the number of transmitted bits drastically compared to SQ, but there are too many search times and memory requirements to implement on a digital signal processor (DSP).

본 발명은 상기의 문제점을 해결하기 위하여 창작된 것으로서, 가능한 적은 비트로 만족할 만한 왜곡도를 가지고, 가능한 적은 메모리를 필요로 하며, 가능한 적은 검색시간 컴플렉서티를 필요로하는 양자화기, 그 양자화기를 학습하는 방법및 LSF의 부호화/복호화방법을 제공함을 그 목적으로 한다.SUMMARY OF THE INVENTION The present invention has been made in order to solve the above problems, and it is an object of the present invention to provide a quantizer which has a satisfactory degree of distortion with as few bits as possible, requires as little memory as possible and requires as little search time complexity as possible, And an encoding / decoding method of LSF.

도 1은 본 발명에 의한 연결 스칼라 양자화기의 구성도이다.1 is a configuration diagram of a connected scalar quantizer according to the present invention.

도 2는 도 1의 분류기 중 하나의 구조를 도시한 것이다.Figure 2 shows the structure of one of the classifiers of Figure 1;

도 3은 본 발명에 의한 연결 스칼라 양자화기의 학습 과정을 도시한 것이다.FIG. 3 illustrates a learning process of a connected scalar quantizer according to the present invention.

도 4는 본 발명에 의한 연결 스칼라 양자화기를 이용한 부호화 과정을 도시한 것이다.FIG. 4 illustrates a coding process using a connected scalar quantizer according to the present invention.

도 5는 본 발명에 의한 연결 스칼라 양자화기를 이용한 복호화 과정을 도시한 것이다.FIG. 5 illustrates a decoding process using a connected scalar quantizer according to the present invention.

상기의 목적을 달성하기 위하여, 본 발명에 의한 M차 선스펙트럼주파수들｛ omega_1 ,omega_2 ,omega_3 ,…,omega_M ｝ 을 양자화하는 양자화기는 M 가지 차수별로 다수가 할당된 코드북들; 및 상기 M차 선스펙트럼주파수들｛ omega_1 ,omega_2 ,omega_3 ,…,omega_M ｝ 을 각각 양자화하되, 각 선스펙트럼주파수에 할당된 다수의 코드북들 중 인접한 선스펙트럼주파수의 크기에 의해 하나의 코드북을 선택하고, 선택된 코드북을 참조하여 상기 각 선스펙트럼을 양자화하는 양자화부를 포함함을 특징으로 한다.In order to achieve the above object, the M-th order line spectrum frequencies {omega_1, omega_2, omega_3,. , omega_M} are codebooks to which a large number of codebooks are allocated in each of the M classes; And the M-th order line spectrum frequencies {omega_1, omega_2, omega_3, ... , and omega_M}, quantizing each of the line spectra by selecting one codebook based on the size of the adjacent line spectrum frequency among a plurality of codebooks assigned to the line spectrum frequencies, and referring to the selected codebook .

상기의 다른 목적을 달성하기 위하여, 다수의 M차 선스펙트럼주파수들 ｛｛ omega_1 ｝,…,｛omega_N ｝,…,｛omega_M ｝｝ 을 이용하여 연결 스칼라 양자화기를 학습하는 방법은 (a) 다수의 기준 선스펙트럼주파수들 {ω_N}을 학습하여 N차에 할당된 하나의 코드북을 생성하는 단계(여기에서, 1≤N≤M); (b) K를 0에서 (M-N-1)까지 변경하면서, (b.1) 다수의 선스펙트럼주파수들 {ω_N+K}의 분포공간을 L개로 나누는 소단계; (b.2) 다수의 선스펙트럼주파수 {ω_N+K+1}에 속하는 각 선스펙트럼주파수 ω_N+K+1을 같은 프레임 내의 선스펙트럼주파수 ω_N+K가 속하는 분포공간에 대응하는 L개의 부집합들로 나누는 소단계; 및 (b.3) 상기 (b.2) 소단계에서 L개의 부집합들로 나뉘어진 선스펙트럼주파수들을 각각 학습하여 (N+K+1)차에 할당된 L개의 코드북들을 생성하는 소단계를 반복 수행함에 의해 상위차수에 할당된 코드북들을생성하는 단계; 및 (c) K를 0에서 (N-2)까지 변경하면서, (c.1) 다수의 선스펙트럼주파수들 {ω_N-K}의 분포공간을 L개로 나누는 소단계; (c.2) 다수의 선스펙트럼주파수 {ω_N-K-1}에 속하는 각 선스펙트럼주파수 ω_N-K-1을 같은 프레임 내의 선스펙트럼주파수 ω_N-K가 속하는 분포공간에 대응하는 L개의 부집합들로 나누는 소단계; 및 (c.3) 상기 (c.2) 소단계에서 L개의 부집합들로 나뉘어진 선스펙트럼주파수들을 각각 학습하여 (N-K-1)차에 할당된 L개의 코드북들을 생성하는 소단계를 반복 수행함에 의해 하위차수에 할당된 코드북들을 생성하는 단계를 포함함을 특징으로 한다.In order to achieve the above-mentioned other object, a plurality of M-th order line spectrum frequencies {{omega_1}, ... , {Omega_N}, ... , {Omega_M}} comprises the steps of (a) learning a plurality of reference line spectral frequencies {ω _N } to generate one codebook assigned to the N-th order, where 1 ? N? M); (b) a small step of dividing the distribution space of a plurality of line spectrum frequencies {ω _{N + K} } into L by changing K from 0 to (MN-1); (b.2) Each line spectral frequency ω _{N + K + 1} belonging to a plurality of line spectrum frequencies {ω _{N + K + 1} } is divided into L L spectra corresponding to the distribution space to which the line spectrum frequency ω _{N +} Small steps to divide into subsets; And (b.3) a step of generating L codebooks allocated to the (N + K + 1) -th order by learning line spectral frequencies divided into L subsets in the step (b.2) Generating codebooks assigned to higher orders by repetition; (C) a small step of dividing the distribution space of a plurality of line spectrum frequencies {ω _NK } into L by changing K from 0 to (N-2); (c.2) Divide each line spectral frequency ω _NK-1 belonging to a plurality of line spectrum frequencies {ω _NK-1 } into L subsets corresponding to the distribution space to which the line spectrum frequency ω _NK in the same frame belongs step; And (c.3) repeating a small step of generating L codebooks allocated to the (NK-1) th by learning line spectral frequencies divided into L subsets in the (c.2) And generating codebooks assigned to the lower order by the codebooks.

상기의 또 다른 목적을 달성하기 위하여, 본 발명에 의한 M차 선스펙트럼주파수들｛ omega_1 ,…,omega_N ,…,omega_M ｝ 을 부호화하는 방법은 (a) N차에 할당된 코드북에 의해 기준 선스펙트럼주파수 ω_N을 양자화한 값과 부호화한 값 i_N을 구하는 단계(여기에서, 1≤N≤M); (b) K를 0에서 (M-N-1)까지 변경하면서, (b.1) 양자화된 선스펙트럼주파수를 그 크기에 따라 1 내지 L 중 하나의 값으로 분류하는 소단계; (b.2) (N+K+1)차에 할당된 L개의 코드북들 중 상기 (b.1) 소단계에서 분류된 값에 대응하는 코드북을 찾는 소단계; 및 (b.3) 상기 (b.2) 소단계에서 찾은 코드북에 의해 선스펙트럼주파수 ω_N+K+1을 양자화한 값과 부호화한 값 i_N+K+1을 구하는 소단계를 반복 수행함에 의해 상위차수 선스펙트럼주파수들을 부호화하는 단계; 및 (c) K를 0에서 (N-2)까지 변경하면서, (c.1) 양자화된선스펙트럼주파수를 그 크기에 따라 1 내지 L 중 하나의 값으로 분류하는 소단계; (c.2) (N-K-1)차에 할당된 L개의 코드북들 중 상기 (c.1) 소단계에서 분류된 값에 대응하는 코드북을 찾는 소단계; 및 (c.3) 상기 (c.2) 소단계에서 찾은 코드북에 의해 선스펙트럼주파수 ω_N-K-1을 양자화한 값과 부호화한 값 i_N-K-1을 구하는 소단계를 반복 수행함에 의해 하위차수 선스펙트럼주파수들을 부호화하는 단계를 포함함을 특징으로 한다.In order to achieve the above-mentioned other object, the M-th order line spectrum frequencies {omega_1,. , omega_N, ... , omega_M} is encoded by (a) a value obtained by quantizing the reference line spectrum frequency? _N by a codebook allocated to the Nth order And obtaining a coded value i _N (where 1? _N? M); (b) changing K from 0 to (MN-1), (b.1) changing the quantized line spectrum frequency Into a value of one of 1 to L according to its size; (b.2) a small step of finding a codebook corresponding to the value classified in the step (b.1) among the L codebooks allocated to the (N + K + 1) th order; And (b.3) a value obtained by quantizing the line spectrum frequency? _{N + K + 1} by the codebook found in the step (b.2) And a step of obtaining a coded value iN _{+ K + 1} by repeating the steps of: encoding the higher order line spectral frequencies; And (c) changing K from 0 to (N-2), (c. 1) changing the quantized line spectrum frequency Into a value of one of 1 to L according to its size; (c.2) searching a codebook corresponding to a value classified in the (c.1) small step among L codebooks allocated to the (NK-1) th order; And (c.3) a value obtained by quantizing the line spectrum frequency? _NK-1 by the codebook found in the step (c.2) And encoding the lower order line spectral frequencies by repeatedly performing a small step of obtaining the encoded value i _NK-1 .

상기의 또 다른 목적을 달성하기 위하여, 본 발명에 의한 M차 선스펙트럼주파수들의 인덱스들｛ i_1 ,…,i_N ,…,i_M ｝ 을 복호화하는 방법은 (a) N차에 할당된 코드북에서 기준 선스펙트럼주파수의 인덱스 i_N을 이용하여 양자화된 선스펙트럼주파수을 찾는 단계(여기에서, 1≤N≤M); (b) K를 0에서 (M-N-1)까지 변경하면서, (b.1) 양자화된 선스펙트럼주파수를 그 크기에 따라 1 내지 L 중 하나의 값으로 분류하는 소단계; (b.2) (N+K+1)차에 할당된 L개의 코드북들 중 상기 (b.1) 소단계에서 분류된 값에 대응하는 코드북을 찾는 소단계; 및 (b.3) 상기 (b.2) 소단계에서 찾은 코드북에서 인덱스 i_N+K+1을 이용하여 양자화된 선스펙트럼주파수을 찾는 소단계를 반복 수행함에 의해 상위차수 선스펙트럼주파수들의 인덱스들을 복호화하는 단계; 및 (c) K를 0에서 (N-2)까지 변경하면서, (c.1) 양자화된 선스펙트럼주파수를 그 크기에 따라 1 내지 L 중 하나의 값으로 분류하는 소단계; (c.2) (N-K-1)차에 할당된 L개의 코드북들 중 상기 (c.1) 소단계에서 분류된 값에 대응하는 코드북을 찾는 소단계; 및 (c.3) 상기 (c.2) 소단계에서 찾은 코드북에서 인덱스 i_N-K-1을 이용하여 양자화된 선스펙트럼주파수을 찾는 소단계를 반복 수행함에 의해 하위차수 선스펙트럼주파수들의 인덱스들을 복호화하는 단계를 포함함을 특징으로 한다.In order to achieve the above-mentioned another object, the indexes of the M-th order line spectrum frequencies {i_1, ..., , i_N, ... , i_M} are obtained by (a) using the index i _N of the reference line spectral frequency in the codebook allocated to the N-th order to obtain a quantized line spectrum frequency (Where 1 < = N < = M); (b) changing K from 0 to (MN-1), (b.1) changing the quantized line spectrum frequency Into a value of one of 1 to L according to its size; (b.2) a small step of finding a codebook corresponding to the value classified in the step (b.1) among the L codebooks allocated to the (N + K + 1) th order; And (b.3) quantizing the line spectrum frequency using the index iN _{+ K + 1} in the codebook found in the step (b.2) Decoding the indexes of the higher order line spectral frequencies by repeating a small step of finding the higher order line spectral frequencies; And (c) changing K from 0 to (N-2), (c. 1) changing the quantized line spectrum frequency Into a value of one of 1 to L according to its size; (c.2) searching a codebook corresponding to a value classified in the (c.1) small step among L codebooks allocated to the (NK-1) th order; And (c.3) quantizing the line spectrum frequency using the index iNK _-1 in the codebook found in the (c.2) And decoding the indexes of the lower order line spectral frequencies by repeating the steps of searching for the lower order line spectral frequencies.

이하에서 첨부된 도면을 참조하여 본 발명을 상세히 설명한다.DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The present invention will now be described in detail with reference to the accompanying drawings.

이하에서 본 발명에 의한 양자화기를 연결 스칼라 양자화기(Linked Scalar Quantizer)라 하고 LSQ라 표기한다.Hereinafter, the quantizer according to the present invention will be referred to as a Linked Scalar Quantizer (LSQ).

도 1에 의하면, M차 선스펙트럼주파수들｛ omega_1 ,omega_2 ,omega_3 ,…,omega_M ｝ 을 양자화하는 본 발명에 의한 LSQ는 각 차수별 코드북들(10), 각 차수별 분류기들(20) 및 양자화제어부(30)을 포함하여 구성된다.1, M-th order line spectrum frequencies {omega_1, omega_2, omega_3, ... , omega_M} according to the present invention includes the codebooks 10 for each order, the classifiers 20 for each order, and the quantization control unit 30. [

LSQ는 기준 선스펙트럼주파수의 차수인 N차에 할당된 하나의 코드북(C_N)과, N 이외의 각 차수에 각각 L개씩 할당된 코드북들을 구비한다(여기에서, 1≤N≤M).The LSQ includes one codebook (C _N ) assigned to the N-th order, which is the order of the reference line spectrum frequency, and codebooks allocated to each order other than N, where L is 1? N? M.

각 차수별 분류기들(20)은 각각 하나의 선스펙트럼주파수를 입력받아 그 값의 크기에 따라 1 내지 L 중 하나의 값을 결정한다.Each classifier (20) for each order receives one line spectrum frequency and determines one of the values from 1 to L according to the magnitude of the value.

그리고, 양자화제어기(30)은 N차에 대해서는 대응하는 코드북(C_N)에 의해 양자화하고, N보다 높은 차수의 선스펙트럼주파수에 대해서는 할당된 L개의 코드북들 중 한 차수 낮은 차수의 분류기에 의해 분류된 값에 대응하는 코드북에 의해 양자화하고, 상기 N보다 낮은 차수의 선스펙트럼주파수에 대해서는 할당된 L개의 코드북들 중 한 차수 높은 차수의 분류기에 의해 분류된 값에 대응하는 코드북에 의해양자화한다.Then, the quantization controller 30 quantizes the codebook by a corresponding codebook C _N for the N-th order, classifies it by one of the L low-order classifiers of the L codebooks allocated to the line spectrum frequency higher than N Quantized by a codebook corresponding to a value classified by a classifier of one degree higher order among the L codebooks allocated for a line spectrum frequency lower than N,

도 2는 제x분류기(Classifier)의 분류과정을 나타낸다(1≤x≤M). 참조번호 200은 제x분류기가 음성에서 추출한 LSF｛ omega_1 ,omega_2 ,omega_3 ,…,omega_x ,…,omega_M ｝ 의 x번째 데이터 ω_x를 입력하는 것을 표시한다. 참조번호 202는 입력값 ω_x를 L개의 영역 중 하나로 분리하는 과정을 나타내며, 참조번호 204는 제x분류기에 의해 결정된 인덱스 l_x를 나타낸다.Fig. 2 shows a classification process of the x-th classifier (1? X? M). Reference numeral 200 denotes LSF {omega_1, omega_2, omega_3, ...} extracted from speech by the xth classifier. , omega_x, ... and it displays the entry of the second data x ω _x of omega_M}. Reference numeral 202 denotes a process of separating the input value ω _x into one of L regions, and reference numeral 204 denotes an index l _x determined by the xth classifier.

도 3은 LSQ의 학습 과정을 나타낸다. 참조번호 300은 음성에서 추출한 M개의 LSF ｛ omega_1 ,omega_2 ,omega_3 ,…,omega_N ,…,omega_M ｝ 중 N번째 데이터 집합 {ω_N}을 LBG(Linda Buzo Gray) 알고리듬에 의해 학습하는 과정을 나타낸다.3 shows the learning process of the LSQ. Reference numeral 300 denotes M LSF {omega_1, omega_2, omega_3, ... , omega_N, ... a, omega_M N} th data set {ω _N} of illustrates a process of learning by the LBG (Linda Buzo Gray) algorithm.

참조번호 302와 304는 LSF 차수가 N을 초과하는 경우의 학습 과정을 나타낸다. 참조번호 302는 학습하려는 데이터 집합 {ω_N+K+1}을 자신의 차수(N+K+1)보다 1 적은 차수(N+K)의 LSF {ω_N+K}를 제(N+K)분류기에 통과시켜 L개의 부집합(subset)으로 분할하여 학습하는 과정을 나타내며, 참조번호 304는 이때의 (N+K+1)의 범위를 명시한다.Reference numerals 302 and 304 denote the learning process when the LSF degree exceeds N. FIG. Reference numeral 302 denotes a data set {omega _{N + K + 1} } to be learned as (N + K + 1) th LSF {omega _{N + K} } of degree (N + K) ) Classifier and divided into L subsets, and reference numeral 304 denotes a range of (N + K + 1) at this time.

참조번호 306와 308는 LSF 차수가 N미만인 경우의 학습 과정을 나타낸다. 참조번호 306는 학습하려는 데이터 집합 {ω_N-K-1}을 자신의 차수(N-K-1)보다 1 많은 차수(N-K)의 LSF {ω_N-K}를 제(N-K)분류기에 통과시켜 L개의 부집합으로 분할하여 학습하는 과정을 나타내며, 참조번호 308은 이때의 (N-K-1)의 범위를 명시한다.Reference numerals 306 and 308 indicate a learning process when the LSF degree is less than N. [ Reference numeral 306 denotes a set of L subsets that pass the data set {ω _NK-1 } to be studied through a (NK) classifier with an LSF {ω _NK } of order NK higher than its degree NK- , And reference numeral 308 denotes a range of (NK-1) at this time.

도 4는 LSQ의 부호화 과정을 나타낸다. 참조번호 400은 음성에서 추출한 M개의 LSF ｛ omega_1 ,omega_2 ,omega_3 ,…,omega_N ,…,omega_M ｝ 중 N번째 데이터 ω_N을 부호화하여 해당 인덱스 i_N을 전송하는 과정이다. 이때, ω_N의 양자화된 값을이라 표기한다.4 shows the encoding process of the LSQ. Reference numeral 400 denotes M LSF {omega_1, omega_2, omega_3, ... , omega_N, ... , encoding the N-th data of the omega_M ω _N} is the process of transmitting the index i _N. At this time, the quantized value of < RTI ID = _0.0 > .

참조번호 402와 404는 LSF 차수가 N을 초과하는 경우의 부호화 과정을 나타낸다. 참조번호 402는 N+K번째 LSF의 양자화된 값를 제(N+K)분류기에 통과시켜 ω_N+K+1을 양자화할 L개의 코드북들 중 l_N+K번째 코드북을 선택한 후, 이를 이용해 ω_N+K+1을 양자화하여 해당 인덱스 i_N+K+1을 전송하는 과정을 나타낸다. 이때,의 크기에 따라서 선택된 코드북을 사용하여 ω_N+K+1을로 양자화하되,을 초기값으로 하여 (N+K+1)의 값이 M이 될 때까지 K를 1씩 증가하면서 반복한다. 참조번호 404에는 루프의 (N+K+1) 범위를 명시한다.Reference numerals 402 and 404 denote the encoding process when the LSF degree exceeds N. FIG. Reference numeral 402 denotes a quantized value of the N + Kth LSF A second (N + K) and passed through a classifier to select ω _{N + K +} L of the codebook from among l _{N + K-th} codebook to quantize the _first and then, quantizes the ω _{N + K + 1} by using this, the index i _{N + K + 1 < / RTI} > At this time, And using the selected codebook according to the size of < _{RTI ID = 0.0 >} Lt; / RTI > (N + K + 1) is incremented by 1 until the value of M becomes equal to M. [ Reference numeral 404 denotes the (N + K + 1) range of the loop.

참조번호 406와 408는 LSF 차수가 N미만인 경우의 부호화 과정을 나타낸다. 참조번호 406는 (N-K)번째 LSF의 양자화된 값를 제(N-K)분류기에 통과시켜 ω_N-K-1을 양자화할 L개의 코드북들 중 l_N-K번째 코드북을 선택한 후, 이를 이용해 ω_N-K-1을 양자화하여 해당 인덱스 i_N-K-1을 전송하는 과정을 나타낸다. 이때,의 크기에 따라 선택된 코드북을 사용하여 ω_N-K-1을로 양자화하되,을 초기값으로 하여 (N-K-1)의 값이 1이 될 때까지 K를 1씩 감소하면서 반복한다. 참조번호 408에는 루프의 (N-K-1) 범위를 명시한다.Reference numerals 406 and 408 denote the encoding process when the LSF order is less than N. FIG. Reference numeral 406 denotes a quantized value of the (NK) th LSF To a first (NK) and then passed through a classifier to select the L codebook from among l _NK second codebook to quantize the ω _NK-1, with this quantizing ω _NK-1 shows a process of transmitting the index i _NK-1 . At this time, Using the codebook selected according to the size of the symbol _NK-1 Lt; / RTI > (NK-1) is decremented by 1 until the value of (NK-1) becomes 1, Reference numeral 408 denotes the (NK-1) range of the loop.

도 5는 LSQ의 복호화 과정을 나타낸다. 참조번호 500은 수신된 인덱스 i_N을 복호화하여 음성에서 추출한 M개의 LSF ｛ omega_1 ,omega_2 ,omega_3 ,…,omega_N ,…,omega_M ｝ 중 N번째 데이터 ω_N의 양자화된 값을 얻는 과정이다.5 shows a decoding process of the LSQ. Reference numeral 500 decodes the received index i extracted from the audio _N M number of LSF {omega_1, omega_2, omega_3, ... , omega_N, ... , omega_M}, the quantized value of the Nth data? _N .

참조번호 502와 504는 LSF 차수가 N을 초과하는 경우의 복호화 과정을 나타낸다. 참조번호 502는 (N+K)번째 LSF의 양자화된 값를 제(N+K)분류기에 통과시켜 ω_N+K+1을 양자화할 L개의 코드북들 중 l_N+K번째 코드북을 선택한 후, 이 코드북에서 수신한 인덱스 i_N+K+1을 이용해 ω_N+K+1을 양자화한을 얻는 과정을 나타낸다. 이때,의 크기에 따라 선택된 코드북에서 i_N+K+1을 이용하여를 찾되,을 초기값으로 하여 (N+K+1)의 값이 M이 될 때까지 K를 1씩 증가하면서 반복한다. 참조번호 504에는 루프의 (N+K+1) 범위를 명시한다.Reference numerals 502 and 504 denote the decoding process when the LSF degree exceeds N. FIG. Reference numeral 502 denotes a quantized value of the (N + K) -th LSF A second (N + K) and passed through a classifier to select l _{N + K-th} codebook in the L codebook to quantize the ω _{N + K + 1} Then, using an index i _{N + K + 1} received from the codebook ω _{N + K + 1} Is obtained. At this time, I _{+ N + K + 1} in the codebook selected according to the size of Lt; / RTI > (N + K + 1) is incremented by 1 until the value of M becomes equal to M. [ Reference numeral 504 denotes the (N + K + 1) range of the loop.

참조번호 506와 508는 LSF 차수가 N미만인 경우의 복호화 과정을 나타낸다. 참조번호 506는 (N-K)번째 LSF의 양자화된 값을 제(N-K)분류기에 통과시켜 을 양자화할 L개의 코드북들 중 l_N-K번째 코드북을 선택한 후, 이 코드북에서 수신한 인덱스 i_N-K-1을 이용해 ω_N-K-1을 양자화한을 얻는 과정을 나타낸다. 이때,의 크기에 따라 선택된 코드북에서 i_N-K-1을 이용하여를 찾되,을 초기값으로 하여 (N-K-1)의 값이 M이 될 때까지 K를 1씩 증가하면서 반복한다. 참조번호 508에는 루프의 (N-K-1) 범위를 명시한다.Reference numerals 506 and 508 denote decoding processes when the LSF degree is less than N. [ Reference numeral 506 denotes a quantized value of the (NK) th LSF A first (NK) to select the second codebook _NK l of the L codebook to quantize and passed through a classifier, with the index i _NK-1 received from the codebook a quantized ω _NK-1 Is obtained. At this time, I _NK-1 in the selected codebook according to the size of Lt; / RTI > (K) is incremented by 1 until the value of (NK-1) becomes M. Reference numeral 508 specifies the (NK-1) range of the loop.

이하에서 본 발명의 동작 원리를 상세히 설명한다.Hereinafter, the operation principle of the present invention will be described in detail.

우선, SQ에 VQ의 특성을 부여하기 위하여 SQ의 각 차수 LSF간에 연결고리(Link)를 형성해주는 방법을 제안한다. LSF는 각 차수간에 상관관계가 존재하며 기존의 실험에 의하면 바로 근처의 LSF 값과 가장 큰 상관관계를 가짐을 알 수 있었다. 따라서, M개의 LSF 중 우선 임의의 N번째 LSF는 일반적인 SQ 방식에 의해 코드북을 형성하고, (N+K+1)번째 LSF(단, K는 0 이상이고, (M-N-1) 이하의 정수)는 각각 (N+K)번째 LSF의 분포에 근거한 코드북을 형성하고, (N-K-1)번째 LSF(단, K는 0 이상이고, (N-2) 이하인 정수)는 각각 (N-K)번째 LSF의 분포에 근거한 코드북을 형성하도록 한다. 따라서, SQ방식으로 설계하였지만 각 차수간의 상관관계가 존재하도록 설계되므로, 메모리 요구량과 검색시간은 VQ에 비해 급격히 적으면서 성능은 SQ에 비해 탁월한 연결 스칼라 양자화기(LSQ)를 설계할 수 있다.First, we propose a method of forming a link between each order LSF of SQ to give the characteristics of VQ to SQ. LSF has a correlation between each order, and according to the existing experiment, it is found that LSF has the greatest correlation with the near LSF value. (N + K + 1) -th LSF (where K is an integer equal to or greater than 0 and equal to or smaller than (MN-1)), the Nth LSF of the M LSFs first forms a codebook by a general SQ scheme, (NK-1) th LSF (where K is an integer equal to or greater than 0 and equal to or smaller than (N-2)) forms a codebook based on the distribution of the (N + Thereby forming a codebook based on the distribution. Therefore, although it is designed in the SQ scheme, the connection scalar quantizer (LSQ), which has a memory requirement and a search time that is significantly less than VQ and whose performance is superior to that of SQ, can be designed.

(1) 분류기(Classifier)의 설계 및 적용(1) Design and application of Classifier

A번째 LSF ω_A를 이용하여 바로 옆의 B(B = (A+1) 또는 (A-1))번째 LSF ω_B를 L개의 구간으로 분류하는 분류기(Classifier)는 다음과 같이 설계한다.A second LSF ω _A right side of B using the (B = (A + 1) or (A-1)) classifier for classifying the second LSF ω _B of the L section (Classifier) are designed as follows.

1) ω_A의 전체 학습 데이터의 수가 S개라면, ω_A를 내림차순으로 정렬한 후 데이터가 S/L개씩 분포하도록 ω_A를 분류한다.1) If the number of S pieces of the whole learning data of _A ω, ω classifies _A Align the _A ω in descending order the data is distributed to S / L each.

2) 이때, 분류에 사용한 ω_A의 경계값을 이용하여 ω_B를 분류할 수 있다.즉, 분류기의 입력으로 들어온 ω_A가 정렬한 S개 데이터의 아래 영역에 존재한다면2) At this time, ω _B can be classified using the boundary value of ω _A used in classification, ie, if ω _A entered as the input of the classifier exists in the lower region of the sorted S data

ω_B도 일정구간으로 제한된다는 점을 이용한다. 여기서, L=S이면 ω_A와 ω_B를 벡터 양자화한 것과 일치하게 되며, L<S이면 필요한 메모리는 줄어들면서 성능도 저하되어 L=1인 경우는 스칼라 양자화와 일치하게 된다. 따라서, L의 값으로 1과 S사이의 임의의 값을 선택하면 메모리와 성능면에서 만족할 만한 양자화기를 설계할 수 있다.ω _B is limited to a certain period. Here, if L = S, it coincides with the vector quantization of? _A and? _B. If L < S, the necessary memory is reduced and the performance is degraded. Therefore, if an arbitrary value between 1 and S is selected as the value of L, a quantizer satisfactory in memory and performance can be designed.

3) 설계된 분류기는 LSQ의 학습, 부호화, 복호화의 모든 과정에 적용되며 특히 부호화와 복호화 시에는 분류기의 입력으로 양자화된 LSF값을 사용함으로써 별도의 비트 전송없이 연결고리 구조를 이용할 수 있다.3) The designed classifier is applied to all learning, coding and decoding processes of LSQ. In particular, LSF value quantized as input of classifier is used for coding and decoding, so that a link loop structure can be used without extra bit transmission.

(2) LSQ 학습 방법(2) LSQ learning method

학습 방법은 아래와 같다.The learning method is as follows.

1) M개의 LSF중 임의의 N번째 LSF ω_N을 선택하고, LSF ω_N은 일반적인 SQ방식으로 학습한다.1) Select an arbitrary Nth LSF ω _N among M LSFs, and LSF ω _N learns by a general SQ method.

2) (N+1)번째 LSF ω_N+1을 학습할 때는(단, (N+1)은 M이하의 정수) N번째 LSF ω_N의 분포를 이용한다.2) When learning the (N + 1) th LSF ω _{N + 1} (where (N + 1) is an integer equal to or less than M), the distribution of the _Nth LSF ω _N is used.

2.1) ω_N의 분포구간을 L개로 나누고,2.1) We divide the distribution of ω _N by L,

2.2) ω_N이 1,2,...,L_N구간 각각에 존재할 때의, ω_N+1값 각각을 따로학습하여 ω_N+1에 해당하는 코드북이 ω_N의 종류에 따라서 L_N가지로 존재하도록 한다.2.2) ω _N is 1, 2, ..., L _N of the presence of the interval, respectively, the codebook by learning ω _{N + 1} values separately for the _{N + 1} ω L _N different according to the type of _N ω Respectively.

3) (N+K)번째 LSF ω_N+K를 학습할 때는(단, (N+K)는 M이하이고, N을 초과하는 정수) (N+K-1)번째 LSF ω_N+K-1의 분포를 이용한다.3) (N + K) th LSF ω _{N +} When learning the _K (However, (N + K) is less than or equal to M, an integer exceeding N) (N + K-1 ) th LSF ω _{N + K- 1} is used.

4) (N-K)번째 LSF ω_N+K를 학습할 때는(단, (N-K)는 N미만이고, 1이상인 정수) (N-K+1)번째 LSF ω_N+K+1의 분포를 이용한다.4) (N-K + 1) -th LSF ω _{N + K + 1} when learning the (NK) th LSF ω _{N + K} (NK is less than N and an integer of 1 or more).

(3) LSQ 부호화 방법(3) LSQ encoding method

부호화 방법은 아래와 같다.The encoding method is as follows.

1) 우선, M개의 LSF중 N번째 LSF ω_N을 부호화한다. ω_N을 표현하는 코드북이 P_N개의 코드벡터로 구성되어 있다면, 부호화는 유클리디언 거리척도가 가장 적은 값을 가질 때의 인덱스를 전송하는 방식으로 한다.1) First, the Nth LSF ω _N among M LSFs is encoded. Codebook expressing ω _N Is composed of P _N code vectors, the encoding is performed by transmitting the index when the Euclidean distance measure has the smallest value.

2) (N+K)번째 LSF ω_N+K를 부호화할 때는(단, (N+K)는 M이하이고, N을 초과하는 정수) 이미 부호화한 (N+K-1)번째 LSF의 분포를 이용한다. 즉, 학습 과정에서 ω_N+K의 코드북은 ω_N+K-1의 범위에 따라 L_N+K-1가지의 종류가 존재하므로, 이미 양자화한 값인의 분포를 이용하여, ω_N+K의 코드북중에서 L'_N+K-1번째인을 이용하여 아래식과 같이 부호화한다.2) (N + K-1) -th LSF (N + K-1) th encoded LSF ω _{N + K} , where N + K is M or less and N is an integer greater than N . That is, the codebook of _{N + K} ω in the learning process, so that L _{N + K-1,} depending on the extent of different types of ω _{N + K-1} exists, the value has already been quantized _{N + K-1-} th codebook among the codebooks of < _{RTI ID =} _{0.0 ># N + K &} As shown in the following equation.

3) (N-K)번째 LSF ω_N-K를 부호화할 때는(단, N-K는 N미만이고, 1이상인 정수) 이미 부호화한 (N-K+1)번째 LSF의 분포를 이용한다. 즉, 학습 과정에서 ω_N-K의 코드북은 ω_N-K+1의 범위에 따라 L_N-K+1가지 종류가 존재하므로, 이미 양자화한 값인의 분포를 이용하여, ω_N-K의 코드북중에서 L'_N-K+1번째인을 이용하여 아래식과 같이 부호화한다.3) (N-K + 1) -th LSF (N-K + 1) -th encoded LSF ω _NK (NK is an integer smaller than N, . That is, the codebook of _NK ω in the learning process, so that L _{NK + 1} kinds, depending on the extent of the ω _{NK + 1} exists, has already been quantized value Using the distribution of the codebook from the _NK ω L _{'NK + 1} beonjjae As shown in the following equation.

(4) LSQ 복호화 방법(4) LSQ decoding method

복호화 방법은 아래와 같다.The decoding method is as follows.

1) 우선, M개의 LSF중 N번째 LSF ω_N에 해당하는 값을 아래와 같이 복호화한다. 전송된 인덱스가 P'_N일때, 복호화된 LSF는 아래식과 같다.1) First, among the M LSFs, the value corresponding to the N-th LSF ω _N Is decoded as follows. When the transmitted index is P ' _N , the decoded LSF is as follows.

2) (N+K)번째 LSF ω_N+K를 복호화할 때는(단, (N+K)는 M이하이고, N을 초과하는 정수) 이미 복호화된 (N+K-1)번째 LSF의 분포를 이용한다. 즉, 학습 과정에서 ω_N+K의 코드북은 ω_N+K-1의 범위에 따라 L_N+K-1가지 종류가 존재하므로, 이미 양자화한 값인의 분포를 이용하여, ω_N+K의 코드북 중에서 L'_N+K-1번째인을 이용하여 아래식과 같이 복호화한다. 2) (N + K-1) -th LSF (N + K-1) th decoded LSF ω _{N + K} , where N + K is an integer smaller than or equal to M, . That is, the codebook of _{N + K} ω in the learning process, so that L _{N + K-1} kinds, depending on the extent of the ω _{N + K-1} exists, the value has already been quantized _{N + K-1-} th codebook among the codebooks of < _{RTI ID =} _{0.0 ># N + K &} And decodes it as shown in the following equation.

3) (N-K)번째 LSF ω_N-K를 복호화할 때는(단, (N-K)는 N미만이고, 1이상인 정수) 이미 복호화된 (N-K+1)번째 LSF의 분포를 이용한다. 즉, 학습 과정에서 ω_N-K의 코드북은 ω_N-K+1의범위에 따라 L_N-K+1가지 종류가 존재하므로, 이미 양자화한 값인의 분포를 이용하여, ω_N-K의 코드북중에서 L'_N-K+1번째인을 이용하여 아래식과 같이 복호화한다. 3) (N-K + 1) -th LSF (N-K + 1) th decoded LSF ω _NK (NK is an integer less than N, . That is, in the learning process, the codebook of? _NK has one kind of LN _{-K +} according to? _{N-K + 1} , Using the distribution of the codebook from the _NK ω L _{'NK + 1} beonjjae And decodes it as shown in the following equation.

이하에서는 본 발명의 실험결과를 설명한다.Hereinafter, experimental results of the present invention will be described.

8000Hz로 샘플링된 음성 데이타는 20ms마다 자기상관함수(autocorrelation method)에 근거한 14차 LPC 분석을 거쳤으며, 다시 LSF로 변환되었다. LSF의 0~π 분포는 0~4000Hz에 대응한다. 본 실험에서 학습 데이타로 사용한 NATC 데이타베이스의 한국어 음성은 다음과 같이 구성되어 있다.The speech data sampled at 8000 Hz was subjected to 14th order LPC analysis based on the autocorrelation method every 20ms and converted back to LSF. The 0 to π distribution of the LSF corresponds to 0 to 4000 Hz. The Korean speech of the NATC database used as learning data in this experiment is composed as follows.

1) 4명의 남성, 4명의 여성 = (8명).1) 4 males and 4 females = (8).

2) 각 사람마다 서로 다른 12개 문장씩을 8초씩 발음함.2) Each person pronounces 12 different sentences in 8 seconds.

3) 각 문장에 대해서 clean, babble noised, car noised speech 환경 적용시킴.3) Clean, babble noised, car noised speech environment for each sentence.

(= 8명 * 12문장 * 8초 * 3가지 환경 = 2304초)(= 8 persons * 12 sentences * 8 seconds * 3 environments = 2304 seconds)

(1) 분류기(Classifier)의 설계(1) Design of Classifier

A번째 LSF ω_A를 이용하여 바로 옆의 B(B = A+1 or A-1)번째 LSF ω_B를 L=3개의 구간으로 분류하는 분류기의 실제 예이다.The right by using the A-th _{LSF ω A B (B = A} + 1 or A-1) is an actual example of a classifier for classifying the second LSF ω _B with L = 3 different intervals.

AA BB l=1{하한,상한}l = 1 {lower limit, upper limit} l=2{하한,상한}l = 2 {lower limit, upper limit} l=3{하한,상한}l = 3 {lower limit, upper limit} 22 1One {0.f, 0.323399f}{0.f, 0.323399f} {0.323399f, 0.393569f}{0.323399f, 0.393569f} {0.393569f, 4.f}{0.393569f, 4.f} 33 22 {0.f, 0.492342f}{0.f, 0.492342f} {0.492342f, 0.597209f}{0.492342f, 0.597209f} {0.597209f, 4.f}{0.597209f, 4.f} 33 {0.f, 4.f}{0.f, 4.f} {0.f, 4.f}{0.f, 4.f} {0.f, 4.f}{0.f, 4.f} 33 44 {0.f, 0.492342f}{0.f, 0.492342f} {0.492342f, 0.597209f}{0.492342f, 0.597209f} {0.597209f, 4.f}{0.597209f, 4.f} 44 55 {0.f, 0.670035f}{0.f, 0.670035f} {0.670035f, 0.740385f}{0.670035f, 0.740385f} {0.740385f, 4.f}{0.740385f, 4.f} 55 66 {0.f, 0.901243f}{0.f, 0.901243f} {0.901243f, 0.985694f}{0.901243f, 0.985694f} {0.985694f, 4.f}{0.985694f, 4.f} 66 77 {0.f, 1.089864f}{0.f, 1.089864f} {1.089864f, 1.182087f}{1.089864f, 1.182087f} {1.182087f, 4.f}{1.182087f, 4.f} 77 88 {0.f, 1.338076f}{0.f, 1.338076f} {1.338076f, 1.419179f}{1.338076f, 1.419179f} {1.419179f, 4.f}{1.419179f, 4.f} 88 99 {0.f, 1.564212f}{0.f, 1.564212f} {1.564212f, 1.632792f}{1.564212f, 1.632792f} {1.632792f, 4.f}{1.632792f, 4.f} 99 1010 {0.f, 1.783253f}{0.f, 1.783253f} {1.783253f, 1.835503f}{1.783253f, 1.835503f} {1.835503f, 4.f}{1.835503f, 4.f} 1010 1111 {0.f, 1.943743f}{0.f, 1.943743f} {1.943743f, 2.006788f}{1.943743f, 2.006788f} {2.006788f, 4.f}{2.006788f, 4.f} 1111 1212 {0.f, 2.155441f}{0.f, 2.155441f} {2.155441f, 2.225683f}{2.155441f, 2.225683f} {2.225683f, 4.f}{2.225683f, 4.f} 1212 1313 {0.f, 2.359377f}{0.f, 2.359377f} {2.359377f, 2.422795f}{2.359377f, 2.422795f} {2.422795f, 4.f}{2.422795f, 4.f} 1313 1414 {0.f, 2.591981f}{0.f, 2.591981f} {2.591981f, 2.642309f}{2.591981f, 2.642309f} {2.642309f, 4.f}{2.642309f, 4.f}

L = 6인 경우의 분류기 경계값은 아래와 같다.The classifier boundary value for L = 6 is as follows.

{{0.f, 0.274970f}, {0.274970f, 0.323399f}, {0.323399f, 0.355183f}, {0.355183f, 0.393569f}, {0.393569f, 0.456871f}, {0.456871f, 4.f} }, /* B=1 */{0.f, 0.274970f}, {0.274970f, 0.323399f}, {0.323399f, 0.355183f}, {0.355183f, 0.393569f}, {0.393569f, 0.456871f}, {0.456871f, 4.f} }, / * B = 1 * /

{ {0.f, 0.418122f}, {0.418122f, 0.492342f}, {0.492342f, 0.547863f}, {0.547863f, 0.597209f}, {0.597209f, 0.638697f}, {0.638697f, 4.f} }, /* B=2 */0.547863f, 0.597209f}, {0.597209f, 0.638697f}, {0.638697f, 4.f}, {0.f, 0.418122f}, {0.418122f, 0.492342f}, {0.492342f, 0.547863f} }, / * B = 2 * /

{ {0.f, 4.f}, {0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f}, {0.f, 4.f} }, /* B=3 */0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f}, {0.f, , {0.f, 4.f}}, / * B = 3 * /

{ {0.f, 0.418122f}, {0.418122f, 0.492342f}, {0.492342f, 0.547863f},{0.547863f, 0.597209f}, {0.597209f, 0.638697f}, {0.638697f, 4.f} }, /* B=4 */0.547863f, 0.597209f}, {0.597209f, 0.638697f}, {0.638697f, 4.f}, {0.f, 0.418122f}, {0.418122f, 0.492342f}, {0.492342f, 0.547863f} }, / * B = 4 * /

{ {0.f, 0.594666f}, {0.594666f, 0.670035f}, {0.670035f, 0.711242f}, {0.711242f, 0.740385f}, {0.740385f, 0.796516f}, {0.796516f, 4.f} }, /* B=5 */{0.f, 0.594666f}, {0.594666f, 0.670035f}, {0.670035f, 0.711242f}, {0.711242f, 0.740385f}, {0.740385f, 0.796516f}, {0.796516f, 4.f} }, / * B = 5 * /

{ {0.f, 0.853770f}, {0.853770f, 0.901243f}, {0.901243f, 0.939014f}, {0.939014f, 0.985694f}, {0.985694f, 1.054606f}, {1.054606f, 4.f} }, /* B=6 */{0.f, 0.853770f}, {0.853770f, 0.901243f}, {0.901243f, 0.939014f}, {0.939014f, 0.985694f}, {0.985694f, 1.054606f}, {1.054606f, 4.f} }, / * B = 6 * /

{ {0.f, 1.051015f}, {1.051015f, 1.089864f}, {1.089864f, 1.122802f}, {1.122802f, 1.182087f}, {1.182087f, 1.243816f}, {1.243816f, 4.f} }, /* B=7 */{1.0f1015f, 1.089864f}, {1.089864f, 1.122802f}, {1.122802f, 1.182087f}, {1.182087f, 1.243816f}, {1.243816f, 4.f} }, / * B = 7 * /

{ {0.f, 1.286068f}, {1.286068f, 1.338076f}, {1.338076f, 1.377647f}, {1.377647f, 1.419179f}, {1.419179f, 1.473773f}, {1.473773f, 4.f} }, /* B=8 */1.377647f, 1.377647f, 1.419179f}, {1.419179f, 1.473773f}, {1.473773f, 4.f}}, {1.2f6068f, 1.338076f}, {1.338076f, }, / * B = 8 * /

{ {0.f, 1.507915f}, {1.507915f, 1.564212f}, {1.564212f, 1.600643f}, {1.600643f, 1.632792f}, {1.632792f, 1.674107f}, {1.674107f, 4.f} }, /* B=9 */{1.50415f, 1.564212f}, {1.564212f, 1.600643f}, {1.600643f, 1.632792f}, {1.632792f, 1.674107f}, {1.674107f, 4.f} }, / * B = 9 * /

{ {0.f, 1.741556f}, {1.741556f, 1.783253f}, {1.783253f, 1.809551f}, {1.809551f, 1.835503f}, {1.835503f, 1.876451f}, {1.876451f, 4.f} },/* B=10 */1.781556f, 1.783253f}, {1.783253f, 1.809551f}, {1.809551f, 1.835503f}, {1.835503f, 1.876451f}, {1.876451f, 4.f} }, / * B = 10 * /

{ {0.f, 1.912272f}, {1.912272f, 1.943743f}, {1.943743f, 1.970872f}, {1.970872f, 2.006788f}, {2.006788f, 2.054019f}, {2.054019f, 4.f} },/* B=11 */1.912272f, 1.943743f}, {1.943743f, 1.970872f}, {1.970872f, 2.006788f}, {2.006788f, 2.054019f}, {2.054019f, 4.f} }, / * B = 11 * /

{ {0.f, 2.117337f}, {2.117337f, 2.155441f}, {2.155441f, 2.187412f}, {2.187412f, 2.225683f}, {2.225683f, 2.270792f}, {2.270792f, 4.f} },/* B=12 */{2.10444f, 2.187412f}, {2.187412f, 2.225683f}, {2.225683f, 2.270792f}, {2.270792f, 4.f} }, / * B = 12 * /

{ {0.f, 2.319844f}, {2.319844f, 2.359377f}, {2.359377f, 2.389847f}, {2.389847f, 2.422795f}, {2.422795f, 2.461599f}, {2.461599f, 4.f} },/* B=13 */2.38947f, 2.38937f, 2.389477f, 2.389847f, {2.389847f, 2.422795f}, {2.422795f, 2.461599f}, {2.461599f, 4.f} }, / * B = 13 * /

{ {0.f, 2.555961f}, {2.555961f, 2.591981f}, {2.591981f, 2.617327f},{2.617327f, 2.641309f}, {2.641309f, 2.672604f}, {2.672604f, 4.f} },/* B=14 */{2.5f961f, 2.591981f}, {2.591981f, 2.617327f}, {2.617327f, 2.641309f}, {2.641309f, 2.672604f}, {2.672604f, 4.f} }, / * B = 14 * /

L = 15인 경우의 분류기 경계값은 아래와 같다.The value of the classifier boundary for L = 15 is as follows.

{ {0.f, 0.235044f}, {0.235044f, 0.262255f}, {0.262255f, 0.286506f}, {0.286506f, 0.307375f}, {0.307375f, 0.323399f}, {0.323399f, 0.337399f}, {0.337399f, 0.349574f}, {0.349574f, 0.361919f}, {0.361919f, 0.376771f}, {0.376771f, 0.393569f}, {0.393569f, 0.417651f}, {0.417651f, 0.446975f}, {0.446975f, 0.463934f}, {0.463934f, 0.479502f}, {0.479502f, 4.f} }, /* B=1 */0.235044f, 0.262255f, {0.262255f, 0.286506f}, {0.286506f, 0.307375f}, {0.307375f, 0.323399f}, {0.323399f, 0.337399f}, { {0.337399f, 0.349574f}, {0.349574f, 0.361919f}, {0.361919f, 0.376771f}, {0.376771f, 0.393569f}, {0.393569f, 0.417651f}, {0.417651f, 0.446975f}, {0.446975 f, 0.463934f}, {0.463934f, 0.479502f}, {0.479502f, 4.f}}, / * B = 1 * /

{ {0.f, 0.373994f}, {0.373994f, 0.404027f}, {0.404027f, 0.433275f}, {0.433275f, 0.465560f}, {0.465560f, 0.492342f}, {0.492342f, 0.516309f}, {0.516309f, 0.537683f}, {0.537683f, 0.557508f}, {0.557508f, 0.578283f}, {0.578283f, 0.597209f}, {0.597209f, 0.613344f}, {0.613344f, 0.630257f}, {0.630257f, 0.647859f}, {0.647859f, 0.676275f}, {0.676275f, 4.f} }, /* B=2 */{0.f, 0.373994f}, {0.373994f, 0.404027f}, {0.404027f, 0.433275f}, {0.433275f, 0.465560f}, {0.465560f, 0.492342f}, {0.492342f, 0.516309f} {0.516309f, 0.537683f}, {0.537683f, 0.557508f}, {0.557508f, 0.578283f}, {0.578283f, 0.597209f}, {0.597209f, 0.613344f}, {0.613344f, 0.630257f}, {0.630257 f, 0.647859f}, {0.647859f, 0.676275f}, {0.676275f, 4.f}}, / * B = 2 * /

{ {0.f, 4.f}, {0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f} },/* B=3 */0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f}, {0.f, , {0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f} , {0.f, 0.f}, {0.f, 0.f}, {0.f, 0.f} }, / * B = 3 * /

{ {0.f, 0.373994f}, {0.373994f, 0.404027f}, {0.404027f, 0.433275f}, {0.433275f, 0.465560f}, {0.465560f, 0.492342f}, {0.492342f, 0.516309f}, {0.516309f, 0.537683f}, {0.537683f, 0.557508f}, {0.557508f, 0.578283f}, {0.578283f, 0.597209f}, {0.597209f, 0.613344f}, {0.613344f, 0.630257f},{0.630257f, 0.647859f}, {0.647859f, 0.676275f}, {0.676275f, 4.f} }, /* B=4 */{0.f, 0.373994f}, {0.373994f, 0.404027f}, {0.404027f, 0.433275f}, {0.433275f, 0.465560f}, {0.465560f, 0.492342f}, {0.492342f, 0.516309f} {0.516309f, 0.537683f}, {0.537683f, 0.557508f}, {0.557508f, 0.578283f}, {0.578283f, 0.597209f}, {0.597209f, 0.613344f}, {0.613344f, 0.630257f}, {0.630257 f, 0.647859f}, {0.647859f, 0.676275f}, {0.676275f, 4.f}}, / * B = 4 * /

{ {0.f, 0.510562f}, {0.510562f, 0.571390f}, {0.571390f, 0.614788f}, {0.614788f, 0.646839f}, {0.646839f, 0.670035f}, {0.670035f, 0.689459f}, {0.689459f, 0.705029f}, {0.705029f, 0.716975f}, {0.716975f, 0.728454f}, {0.728454f, 0.740385f}, {0.740385f, 0.757645f}, {0.757645f, 0.781948f}, {0.781948f, 0.811476f}, {0.811476f, 0.849480f}, {0.849480f, 4.f} }, /* B=5 */{0.f, 0.510562f}, {0.510562f, 0.571390f}, {0.571390f, 0.614788f}, {0.614788f, 0.646839f}, {0.646839f, 0.670035f}, {0.670035f, 0.689459f} {0.689459f, 0.705029f}, {0.705029f, 0.716975f}, {0.716975f, 0.728454f}, {0.728454f, 0.740385f}, {0.740385f, 0.757645f}, {0.757645f, 0.781948f}, {0.781948 0.811476f}, {0.811476f, 0.849480f}, {0.849480f, 4.f}}, / * B = 5 * /

{ {0.f, 0.794467f}, {0.794467f, 0.838568f}, {0.838568f, 0.866646f}, {0.866646f, 0.885520f}, {0.885520f, 0.901243f}, {0.901243f, 0.916409f}, {0.916409f, 0.931262f}, {0.931262f, 0.946566f}, {0.946566f, 0.964604f}, {0.964604f, 0.985694f}, {0.985694f, 1.013291f}, {1.013291f, 1.040804f}, {1.040804f, 1.069675f}, {1.069675f, 1.109874f}, {1.109874f, 4.f} }, /* B=6 */0.838568f, 0.866646f, {0.866646f, 0.885520f}, {0.885520f, 0.901243f}, {0.901243f, 0.916409f}, { {0.916409f, 0.931262f}, {0.931262f, 0.946566f}, {0.946566f, 0.964604f}, {0.964604f, 0.985694f}, {0.985694f, 1.013291f}, {1.013291f, 1.040804f}, {1.040804 f, 1.069675f}, {1.069675f, 1.109874f}, {1.109874f, 4.f}}, / * B = 6 * /

{ {0.f, 0.990472f}, {0.990472f, 1.038742f}, {1.038742f, 1.060728f}, {1.060728f, 1.076330f}, {1.076330f, 1.089864f}, {1.089864f, 1.101254f}, {1.101254f, 1.114450f}, {1.114450f, 1.132790f}, {1.132790f, 1.155356f}, {1.155356f, 1.182087f}, {1.182087f, 1.207519f}, {1.207519f, 1.231719f}, {1.231719f, 1.257682f}, {1.257682f, 1.303627f}, {1.303627f, 4.f} }, /* B=7 */1.038742f, 1.087242f, {1.060728f, 1.076330f}, {1.076330f, 1.089864f}, {1.089864f, 1.101254f}, { {1.101254f, 1.114450f}, {1.114450f, 1.132790f}, {1.132790f, 1.155356f}, {1.155356f, 1.182087f}, {1.182087f, 1.207519f}, {1.207519f, 1.231719f}, {1.231719 f, 1.257682f}, {1.257682f, 1.303627f}, {1.303627f, 4.f}}, / * B = 7 * /

{ {0.f, 1.224180f}, {1.224180f, 1.270616f}, {1.270616f, 1.299337f}, {1.299337f, 1.320930f}, {1.320930f, 1.338076f}, {1.338076f, 1.354144f}, {1.354144f, 1.369426f}, {1.369426f, 1.385734f}, {1.385734f, 1.401858f}, {1.401858f, 1.419179f}, {1.419179f, 1.438471f}, {1.438471f, 1.460843f},{1.460843f, 1.489174f}, {1.489174f, 1.539309f}, {1.539309f, 4.f} }, /* B=8 */1.224180f, 1.270616f, {1.270616f, 1.299337f}, {1.299337f, 1.320930f}, {1.320930f, 1.338076f}, {1.338076f, 1.354144f}, { {1.354144f, 1.369426f}, {1.369426f, 1.385734f}, {1.385734f, 1.401858f}, {1.401858f, 1.419179f}, {1.419179f, 1.438471f}, {1.438471f, 1.460843f}, {1.460843 f, 1.489174f}, {1.489174f, 1.539309f}, {1.539309f, 4.f}}, / * B = 8 * /

{ {0.f, 1.431250f}, {1.431250f, 1.489389f}, {1.489389f, 1.523346f}, {1.523346f, 1.546492f}, {1.546492f, 1.564212f}, {1.564212f, 1.579060f}, {1.579060f, 1.593888f}, {1.593888f, 1.607045f}, {1.607045f, 1.619846f}, {1.619846f, 1.632792f}, {1.632792f, 1.646761f}, {1.646761f, 1.663350f}, {1.663350f, 1.687100f}, {1.687100f, 1.733830f}, {1.733830f, 4.f} }, /* B=9 */{1.503346f, 1.546492f}, {1.546492f, 1.564212f}, {1.564212f, 1.579060f}, { {1.579060f, 1.593888f}, {1.593888f, 1.607045f}, {1.607045f, 1.619846f}, {1.619846f, 1.632792f}, {1.632792f, 1.646761f}, {1.646761f, 1.663350f}, {1.663350 f, 1.687100f}, {1.687100f, 1.733830f}, {1.733830f, 4.f}}, / * B = 9 * /

{ {0.f, 1.682544f}, {1.682544f, 1.728448f}, {1.728448f, 1.752468f}, {1.752468f, 1.769501f}, {1.769501f, 1.783253f}, {1.783253f, 1.793231f}, {1.793231f, 1.804241f}, {1.804241f, 1.814543f}, {1.814543f, 1.824683f}, {1.824683f, 1.835503f}, {1.835503f, 1.848488f}, {1.848488f, 1.865541f}, {1.865541f, 1.889830f}, {1.889830f, 1.932878f}, {1.932878f, 4.f} },/* B=10 */1.722448f, 1.752468f, 1.752468f, 1.769501f, {1.769501f, 1.783253f}, {1.783253f, 1.793231f}, { {1.793231f, 1.804241f}, {1.804241f, 1.814543f}, {1.814543f, 1.824683f}, {1.824683f, 1.835503f}, {1.835503f, 1.848488f}, {1.848488f, 1.865541f}, {1.865541 1.889830f}, {1.889830f, 1.932878f}, {1.932878f, 4.f}}, / * B = 10 * /

{ {0.f, 1.873944f}, {1.873944f, 1.903446f}, {1.903446f, 1.919255f}, {1.919255f, 1.932970f}, {1.932970f, 1.943743f}, {1.943743f, 1.954767f}, {1.954767f, 1.965228f}, {1.965228f, 1.977102f}, {1.977102f, 1.990802f}, {1.990802f, 2.006788f}, {2.006788f, 2.024141f}, {2.024141f, 2.043363f}, {2.043363f, 2.066415f}, {2.066415f, 2.098800f}, {2.098800f, 4.f} },/* B=11 */{1.919255f, 1.932970f}, {1.932970f, 1.943743f}, {1.943743f, 1.954767f}, { {1.954767f, 1.965228f}, {1.965228f, 1.977102f}, {1.977102f, 1.990802f}, {1.990802f, 2.006788f}, {2.006788f, 2.024141f}, {2.024141f, 2.043363f}, {2.043363 2.066415f}, {2.066415f, 2.098800f}, {2.098800f, 4.f}}, / * B = 11 * /

{ {0.f, 2.071948f}, {2.071948f, 2.106122f}, {2.106122f, 2.126233f}, {2.126233f, 2.141440f}, {2.141440f, 2.155441f}, {2.155441f, 2.169336f}, {2.169336f, 2.181376f}, {2.181376f, 2.194262f}, {2.194262f, 2.209642f}, {2.209642f, 2.225683f}, {2.225683f, 2.242341f}, {2.242341f, 2.260786f},{2.260786f, 2.281577f}, {2.281577f, 2.311329f}, {2.311329f, 4.f} },/* B=12 */2.10612f, 2.126233f, {2.126233f, 2.141440f}, {2.141440f, 2.155441f}, {2.155441f, 2.169336f}, { {2.169336f, 2.181376f}, {2.181376f, 2.194262f}, {2.194262f, 2.209642f}, {2.209642f, 2.225683f}, {2.225683f, 2.242341f}, {2.242341f, 2.260786f}, {2.260786 f, 2.281577f}, {2.281577f, 2.311329f}, {2.311329f, 4.f}}, / * B = 12 * /

{ {0.f, 2.267698f}, {2.267698f, 2.307557f}, {2.307557f, 2.329068f}, {2.329068f, 2.345140f}, {2.345140f, 2.359377f}, {2.359377f, 2.371829f}, {2.371829f, 2.383751f}, {2.383751f, 2.396125f}, {2.396125f, 2.408754f}, {2.408754f, 2.422795f}, {2.422795f, 2.437442f}, {2.437442f, 2.453004f}, {2.453004f, 2.471189f}, {2.471189f, 2.500069f}, {2.500069f, 4.f} },/* B=13 */{2.3f140f, 2.359377f}, {2.359377f, 2.371829f}, {2.329068f, 2.345140f}, { {2.371829f, 2.383751f}, {2.383751f, 2.396125f}, {2.396125f, 2.408754f}, {2.408754f, 2.422795f}, {2.422795f, 2.437442f}, {2.437442f, 2.453004f}, {2.453004 f, 2.471189f}, {2.471189f, 2.500069f}, {2.500069f, 4.f}}, / * B = 13 * /

{ {0.f, 2.505818f}, {2.505818f, 2.544425f}, {2.544425f, 2.565426f}, {2.565426f, 2.580842f}, {2.580842f, 2.591981f}, {2.591981f, 2.602545f}, {2.602545f, 2.612613f}, {2.612613f, 2.622403f}, {2.622403f, 2.632054f}, {2.632054f, 2.641309f}, {2.641309f, 2.651491f}, {2.651491f, 2.664897f}, {2.664897f, 2.681739f}, {2.681739f, 2.709239f}, {2.709239f, 4.f} },/* B=14 */2.565426f, 2.580842f}, {2.580842f, 2.591981f}, {2.591981f, 2.602545f}, {2.505818f, 2.544425f}, { {2.602545f, 2.612613f}, {2.612613f, 2.622403f}, {2.622403f, 2.632054f}, {2.632054f, 2.641309f}, {2.641309f, 2.651491f}, {2.651491f, 2.664897f}, {2.664897 2.681739f}, {2.681739f, 2.709239f}, {2.709239f, 4.f}}, / * B = 14 * /

(2) LSQ 학습 방법(2) LSQ learning method

학습 방법은 아래와 같다.The learning method is as follows.

1) 14차 LSF중 3번째 LSF ω₃는 LBG 방식으로 학습한다.1) The third LSF ω ₃ of the 14th LSF is learned by the LBG method.

2) ω₃의 값을 분류기에 통과시켜 L(L=3 또는 6 또는 15)개중 한개의 학습 집합에 4번째 LSF ω₄를 포함시키고, 각 학습 집합의 ω₄는 LBG 방식으로 학습한다.2) ω is passed through the _third value of the classifier L (L = 3 or 6 or 15) gaejung and include one of the fourth LSF ω ₄ in the training set, each of the training set ω ₄ learns the LBG method.

3) (N+K)번째 LSF ω_N+K를 학습할 때는(단, (N+K)는 14이하이고, N을 초과하는 정수) (N+K-1)번째 LSF ω_N+K-1의 분포를 이용한다.3) (N + K) When learning a second LSF ω _{N + K} (However, (N + K) is less than or equal to 14, an integer exceeding N) (N + K-1 ) th LSF ω _{N + K- 1} is used.

4) (N-K)번째 LSF ω_N-K를 학습할 때는(단, (N-K)는 N미만이고, 1이상인 정수) (N-K+1)번째 LSF ω_N-K+1의 분포를 이용한다.4) (N-K + 1) th LSF ω _{N-K + 1} when learning the (NK) th LSF ω _NK (NK is an integer less than N and equal to or greater than 1).

(4) 부호화, 복호화 방법(4) Encoding and decoding method

14차 LSF중 3번째 LSF ω₃은 SQ 방식으로 부호화/복호화하고, 나머지 LSF는 분류기를 이용하여 L(L=3 or 6 or 15)개중 한개의 학습된 코드북을 선택하여 부호화/복호화한다. 이때, 분류기의 입력은 학습 과정과는 다르게 양자화된 값들을 사용하므로 별도의 추가비트 전송이 필요없다.The third LSF? ₃ of the 14th LSF is encoded / decoded by the SQ scheme, and the remaining LSFs select one of the L (L = 3 or 6 or 15) codebooks to be encoded / decoded using a classifier. At this time, since the input of the classifier uses quantized values differently from the learning process, additional additional bit transmission is not required.

(5) 실험 결과(5) Experimental results

공정한 평가를 위하여 테스트 음성은 다음과 같이 NATC 데이타베이스의 영어 음성을 사용하였다. (학습 데이터는 NATC의 한국어 음성)For fair evaluation, test voice used English voice of NATC database as follows. (Learning data is NATC's Korean voice)

1) 4명의 남성, 4명의 여성 = (8명)1) 4 males and 4 females = (8)

3) 각 문장에 대해서 clean, babble noise 30dB, car noise 30dB, car noise 15dB3) For each sentence, clean, babble noise 30dB, car noise 30dB, car noise 15dB

(= 8명 * 12문장 * 8초 * 4가지 환경 = 3072초)(= 8 persons * 12 sentences * 8 seconds * 4 environments = 3072 seconds)

성능 평가는 스펙트럼 왜곡(SD:spectral distortion) 측정법을 사용하였다. i 번째 프레임의 SD는 다음과 같으며,Spectral distortion (SD) measurement was used for the performance evaluation. The SD of the i-th frame is as follows,

여기서 P_j는 원래 LSF의 파워 스펙트럼(power spectrum of the original LSF)을 나타내며,는 양자화된 LSF의 파워 스펙트럼(power spectrum of the quantized LSF)을 나타낸다. 사람 귀의 특성에 맞춰서 a는 125Hz, b는 3400Hz가 선택되었다.Where P _j represents the power spectrum of the original LSF (power spectrum of the original LSF), Represents the power spectrum of the quantized LSF. In accordance with the characteristics of the human ear, a was chosen to be 125 Hz and b was chosen to be 3400 Hz.

양자화기의 성능 평가표(SD)Performance evaluation table (SD) of quantizer 다양한환경별 SD 양자화기 유형Various environment-specific SD quantizer types 클린clean 배블 30dBBubble 30dB 자동차 30dB30dB for cars 자동차 15dB15dB for car 합계Sum 평균왜곡(dB)Average Distortion (dB) 2dB~4dB(%)2dB ~ 4dB (%) 4dB~(%)4dB to (%) 평균왜곡(dB)Average Distortion (dB) 2dB~4dB(%)2dB ~ 4dB (%) 4dB~(%)4dB to (%) 평균왜곡(dB)Average Distortion (dB) 2dB~4dB(%)2dB ~ 4dB (%) 4dB~(%)4dB to (%) 평균왜곡(dB)Average Distortion (dB) 2dB~4dB(%)2dB ~ 4dB (%) 4dB~(%)4dB to (%) 평균왜곡(dB)Average Distortion (dB) SQ^* SQ ^* 1.561.56 13.913.9 2.332.33 1.341.34 8.958.95 1.491.49 1.431.43 9.529.52 1.511.51 1.201.20 4.774.77 0.420.42 1.411.41 SVQ^** SVQ ^** 1.201.20 6.306.30 0.470.47 1.061.06 3.973.97 0.110.11 1.091.09 4.124.12 0.110.11 0.980.98 1.521.52 0.000.00 1.081.08 LSQ^***:3링크LSQ ^*** : 3 links 1.131.13 6.526.52 0.790.79 1.011.01 4.654.65 0.360.36 1.041.04 4.614.61 0.380.38 0.940.94 2.042.04 0.010.01 1.031.03 LSQ^***:6링크LSQ ^*** : 6 links 1.041.04 5.345.34 0.040.04 0.940.94 3.933.93 0.150.15 0.960.96 3.943.94 0.160.16 0.870.87 1.521.52 0.000.00 0.950.95 LSQ^***:15링크LSQ ^*** : 15 links 0.990.99 5.295.29 0.160.16 0.910.91 3.873.87 0.070.07 0.940.94 3.783.78 0.070.07 0.860.86 1.571.57 0.000.00 0.930.93

* SQ : 스칼라 양자화기* SQ: Scalar quantizer

** SVQ : 분할 벡터 양자화기(14차를 2-2-2-2-2-2-2로 분할)** SVQ: Split vector quantizer (14th divided into 2-2-2-2-2-2-2)

*** LSQ : 연결 스칼라 양자화기 : 3 또는 6 또는 15링크*** LSQ: Link Scalar Quantizer: 3 or 6 or 15 Link

양자화기의 성능 평가표(메모리와 검색시간 컴플렉서티)Quantizer Performance Table (Memory and Search Time Complexity) SQ*SQ * 메모리Memory 2⁴+ 7×2³+ 5×2²+ 2¹= 94 바이트2 ⁴ + 7 × 2 ³ + 5 × 2 ² + 2 ¹ = 94 bytes 시간time 0.038 MIPS0.038 MIPS SVQ**SVQ ** 메모리Memory 6×2×2⁵+ 2×2⁶= 512 바이트6 x 2 x 2 ⁵ + 2 x 2 ⁶ = 512 bytes 시간time 0.256 MIPS0.256 MIPS LSQ*** : 3링크LSQ ***: 3 links 메모리Memory 2⁴+ 7×3×2³+ 5×3×2²+ 3×2¹= 250 바이트2 ⁴ + 7 × 3 × 2 ³ + 5 × 3 × 2 ² + 3 × 2 ¹ = 250 bytes 시간time 0.038 MIPS0.038 MIPS LSQ*** : 6링크LSQ ***: 6 links 메모리Memory 2⁴+ 7×6×2³+ 5×6×2²+ 6×2¹= 484 바이트2 ⁴ + 7 × 6 × 2 ³ + 5 × 6 × 2 ² + 6 × 2 ¹ = 484 bytes 시간time 0.038 MIPS0.038 MIPS LSQ*** : 15링크LSQ ***: 15 links 메모리Memory 2⁴+ 7×15×2³+ 5×15×2²+ 15×2¹= 1186 바이트2 ⁴ + 7 × 15 × 2 ³ + 5 × 15 × 2 ² + 15 × 2 ¹ = 1186 bytes 시간time 0.038 MIPS0.038 MIPS

표 1에는 다양한 양자화기의 성능을 SD 방법에 의해 평가하였다. 평가 결과 LSQ의 연결고리(Link) 수를 3,6,15개로 증가시키면 그 성능이 증가함을 볼 수 있다. 또한 평균 스펙트럼 왜곡(Average Spectral Distortion) 관점에서 LSQ가 SQ와 SVQ보다 뛰어남을 알 수 있다. SD 성능 이외에 표 2의 메모리와 검색시간 컴플렉서티까지 고려하면, SQ는 메모리 관점에서는 우수하지만 SD 성능이 너무 뒤쳐져 있음을 알 수 있으며, SVQ는 SD 성능, 메모리, 검색시간 컴플렉서티 관점 모두에서 LSQ 6링크 구조에 비해 뒤처짐을 알 수 있다.Table 1 shows the performance of various quantizers by the SD method. As a result, it is seen that the performance is increased when the number of links of LSQ is increased to 3, 6, or 15. Also, LSQ is superior to SQ and SVQ in terms of Average Spectral Distortion. In addition to the SD performance, considering the memory and search time complexity in Table 2, we can see that SQ is superior in terms of memory, but SD performance is far behind, and SVQ can be used for both SD performance, memory, and search time complexity LSQ 6 link structure.

본 발명에 의하면, ITU-T, Study Group 16, Question Number 21에 제안한 4kbps 음성 코더에 적용함으로써 기존의 방식에 비해 메모리와 비트수를 절약하면서 향상된 성능을 얻을 수 있었다.According to the present invention, by applying the present invention to the 4kbps voice coder proposed in ITU-T, Study Group 16, and Question Number 21, improved performance can be obtained while saving memory and bits compared to the conventional method.

Claims

M-order line spectral frequencies {omega_1, omega_2, omega_3, ... , omega_M} in a quantizer,

Codebooks to which a plurality of codebooks are allocated according to M levels; And

The M-th order line spectrum frequencies {omega_1, omega_2, omega_3, ... , omega_M} are quantized, one codebook is selected based on the magnitude of the line spectrum frequency adjacent to the line spectrum frequency to be quantized among a plurality of codebooks allocated to each line spectrum frequency, and the quantization And a quantization unit for quantizing the line spectrum frequency to be quantized.

M-order line spectral frequencies {omega_1, ... , omega_N, ... , omega_M} in a quantizer,

Codebooks allocated to N-th order and codebooks assigned to L other than the N-th order (where 1? N? M);

A first classifier to an M classifier for classifying the line spectrum frequency inputted for each order into L kinds according to the magnitude; And

The reference spectrum frequency ωN is quantized by a codebook assigned to the N-th order, and for a line spectrum frequency higher than ωN, one of the L codebooks assigned to the order is assigned to a value classified by a low- Quantized by a codebook corresponding to a value classified by a class of one order higher than that of L codebooks allocated to the degree of linear spectrum frequency lower than the above-mentioned frequency N by a corresponding codebook, And a quantizer for quantizing the quantized signal.

3. The method of claim 2, wherein the line spectral frequencies input to the first to M < th &

Wherein each quantizer is a quantized linear spectral frequency of the same order as the classifier.

Multiple M-order line spectrum frequencies {{omega_1}, ... , {Omega_N}, ... , {Omega_M}} for learning a connected scalar quantizer,

(a) learning a plurality of reference line spectral frequencies {? N} to generate one codebook assigned to the N-th order (where 1? N? M);

(b) While changing K from 0 to (M-N-1)

(b.1) a small step of dividing the distribution space of a plurality of line spectrum frequencies {? N + K} into L;

(b.2) Each line spectral frequency? N + K + 1 belonging to a plurality of line spectrum frequencies {? N + K + 1} is divided into L subsets corresponding to the distribution space in which the line spectrum frequency? ; And

(b.3) repeating a small step of generating L codebooks allocated to (N + K + 1) -th order by learning line spectral frequencies divided into L subsets in the (b.2) Generating codebooks assigned to higher orders by performing; And

(c) While changing K from 0 to (N-2)

(c.1) a small step of dividing the distribution space of a plurality of line spectrum frequencies {? N-K} into L;

(c.2) Each line spectral frequency? N-K-1 belonging to a plurality of line spectral frequencies {? N-K-1} is divided into L subsets corresponding to the distribution space to which the line spectral frequency? ; And

(c.3) Repeating the small step of generating L codebooks allocated to the (NK-1) difference by learning each of the line spectrum frequencies divided into L subsets in the step (c.2) And generating codebooks assigned to the lower-order codebooks.

M-order line spectral frequencies {omega_1, ... , omega_N, ... , omega_M}, comprising:

(a) The value obtained by quantizing the reference line spectral frequency? N by the codebook allocated to the Nth order And obtaining a coded value iN (where 1? N? M);

(b) While changing K from 0 to (M-N-1)

(b.1) quantized line spectrum frequency Into a value of one of 1 to L according to its size;

(b.2) a small step of finding a codebook corresponding to the value classified in the step (b.1) among the L codebooks allocated to the (N + K + 1) th order; And

(b.3) a value obtained by quantizing the line spectrum frequency? N + K + 1 by the codebook found in the step (b.2) And a small step of obtaining a coded value iN + K + 1; And

(c) While changing K from 0 to (N-2)

(c.1) Quantized line spectrum frequency Into a value of one of 1 to L according to its size;

(c.2) searching a codebook corresponding to a value classified in the (c.1) small step among L codebooks allocated to the (N-K-1) th order; And

(c.3) a value obtained by quantizing the line spectrum frequency? N-K-1 by the codebook found in the (c.2) And encoding the lower-order line spectral frequencies by repeatedly performing a small step of obtaining a coded value iN-K-1.

6. The method of claim 5, wherein the quantized values in steps (a), (b.3), and (c.3) are quantized from values included in a corresponding codebook, And a Claryian distance scale is set to a smallest value.

The method of claim 5, wherein the values encoded in steps (a), (b.3), and (c.3) are each an index of a corresponding codebook for a corresponding quantized value, A method of coding a line spectral frequency.

The indexes of M-order line spectral frequencies {i_1, ... , i_N, ... , i_M}, comprising:

(a) the quantized line spectrum frequency using the index iN of the reference line spectral frequency in the codebook assigned to the Nth order (Where 1 < = N < = M);

(b) While changing K from 0 to (M-N-1)

(b.3) quantizing the line spectrum frequency using the index iN + K + 1 in the codebook found in the step (b.2) Decoding the indexes of the higher order line spectral frequencies by repeating a small step of finding the higher order line spectral frequencies; And

(c) While changing K from 0 to (N-2)

(c.3) The quantized line spectrum frequency (iN-K-1) using the index iN-K-1 in the codebook found in the (c.2) And decoding the indexes of the lower order line spectral frequencies by repeating the step of finding the lower frequency spectral frequencies.