KR100668300B1

KR100668300B1 - Bitrate scalable speech coding and decoding apparatus and method thereof

Info

Publication number: KR100668300B1
Application number: KR1020040040478A
Authority: KR
Inventors: 손창용; 이강은; 강상원
Original assignee: 삼성전자주식회사
Priority date: 2003-07-09
Filing date: 2004-06-03
Publication date: 2007-01-12
Also published as: JP5313967B2; KR20050007117A; JP2011008250A

Abstract

본 발명은 SNR 비트율 확장 음성 부호화 및 복호화 장치와 그 방법에 관한 것으로, 본 발명에 따른 부호화 장치는, 기본 계층, 음질 향상 계층 및 다중화기를 포함하고, 기본 계층은 선형 예측 부호화에 의해 입력 음성신호를 필터링하고, 고정 코드북 탐색 및 적응 코드북 탐색에 의해 필터링된 음성신호에 대응되는 여기 신호를 생성하고, 음질 향상 계층은 기본 계층에서의 고정 코드북 탐색에 따라 생성되는 매개 변수를 이용하여 고정 코드북을 탐색하거나 기본 계층의 고정 코드북 탐색 대상 신호에서 기본 계층의 고정 코드북의 기여도와 음질 향상 계층에서 이전의 고정 코드북을 합성 필터링 한 신호를 제거한 신호를 음질 향상 계층의 대상 신호로 하여 고정 코드북을 탐색하고, 다중화기는 기본 계층에서 생성되는 신호와 적어도 하나의 음질 향상 계층에서 생성되는 신호를 다중화함으로써, 본 발명에 따른 부호화 장치는 기존의 표준화된 음성 코덱과 호환이 가능하고, 연산량을 줄일 수 있으며, 보다 좋은 음질을 제공할 수 있다. The present invention relates to an SNR bit rate extended speech encoding and decoding apparatus and a method thereof, wherein the encoding apparatus includes a base layer, a sound quality enhancement layer, and a multiplexer, and the base layer includes an input speech signal by linear predictive encoding. Filter, generate an excitation signal corresponding to the speech signal filtered by the fixed codebook search and the adaptive codebook search, and the sound quality enhancement layer searches for the fixed codebook using the parameters generated according to the fixed codebook search in the base layer; Fixed codebook search of base layer The fixed codebook is searched by using the signal that removes the contribution of the fixed codebook of the base layer from the target signal and the signal obtained by synthesizing the previous fixed codebook in the sound quality enhancement layer as the target signal of the sound quality enhancement layer. Enhances the signal generated by the base layer and at least one sound quality By multiplexing the signal generated in the layer, the encoding apparatus according to the present invention is compatible with the existing standardized speech codec, can reduce the amount of calculation, and can provide better sound quality.

Description

Bitrate scalable speech coding and decoding apparatus and method

도 1은 본 발명의 바람직한 일 실시 예에 따른 비트율 확장 음성 부호화 장치의 블록도이다. 1 is a block diagram of a bit rate extended speech encoding apparatus according to an exemplary embodiment of the present invention.

도 2는 도 1에 도시된 기본 계층 고정 코드북 탐색부에 의해 탐색된 펄스의 위치와 음질 향상 계층 고정 코드북 탐색부에 의해 탐색된 펄스의 위치의 예시 도이다. FIG. 2 is an exemplary diagram of positions of pulses searched by the base layer fixed codebook search unit shown in FIG. 1 and positions of pulses searched by the sound quality enhancement layer fixed codebook search unit.

도 3은 본 발명의 바람직한 일 실시 예에 따른 비트율 확장 음성 복호화 장치의 블록도이다. 3 is a block diagram of a bit rate extended speech decoding apparatus according to an exemplary embodiment of the present invention.

도 4는 본 발명의 바람직한 일 실시 예에 따른 비트율 확장 음성 부호화 방법의 동작 흐름도이다.4 is an operation flowchart of a bit rate extended speech encoding method according to an embodiment of the present invention.

도 5는 본 발명의 바람직한 일 실시 예에 따른 비트율 확장 음성 복호화 방법의 동작 흐름도이다. 5 is a flowchart illustrating an operation of a bit rate extended speech decoding method according to an exemplary embodiment of the present invention.

도 6은 본 발명의 바람직한 다른 실시 예에 따른 비트율 확장 음성 부호화 장치의 블록도이다. 6 is a block diagram of a bit rate extended speech encoding apparatus according to another exemplary embodiment of the present invention.

도 7은 도 6에 도시된 음질 향상 계층의 이득 값 차 양자화기의 바람직한 실시 예를 나타낸 블록도이다.FIG. 7 is a block diagram illustrating an exemplary embodiment of a gain value difference quantizer of the sound quality enhancement layer illustrated in FIG. 6.

도 8은 본 발명의 바람직한 다른 실시 예에 따른 비트율 확장 음성 복호화 장치의 블록도이다. 8 is a block diagram of a bit rate extended speech decoding apparatus according to another exemplary embodiment of the present invention.

도 9는 도 8의 비트율 확장 음성 복호화 장치에서 기본 계층 고정 코드북 탐색에 의해 탐색된 펄스의 위치와 음질 향상 계층 고정 코드북 탐색에 의해 탐색된 펄스의 위치 예시 도이다. 9 is a diagram illustrating a position of a pulse searched by a base layer fixed codebook search and a position of a pulse searched by a sound quality enhancement layer fixed codebook search in the bit rate extension speech decoding apparatus of FIG. 8.

도 10은 본 발명의 바람직한 다른 실시 예에 따른 비트율 확장 음성 부호화 방법의 동작 흐름도이다.10 is a flowchart illustrating an operation of a bit rate extended speech encoding method according to another exemplary embodiment of the present invention.

도 11는 본 발명의 바람직한 다른 실시 예에 따른 비트율 확장 음성 복호화 방법의 동작 흐름도이다. 11 is an operation flowchart of a bit rate extended speech decoding method according to another preferred embodiment of the present invention.

본 발명은 켈프(Code Excited Linear Prediction, 이하 CELP라고 약함) 알고리즘을 사용하는 음성 코덱(codec)에 관한 것으로서, 특히, 음질을 향상시키기 위하여 SNR(Signal to Noise Ratio) 비트율을 확장하는 음성 부호화 및 복호화 장치와 그 방법에 관한 것이다.The present invention relates to a speech codec (codec) using a Kelp (Code Excited Linear Prediction, hereinafter referred to as CELP) algorithm, and in particular, speech coding and decoding for extending the SNR (Signal to Noise Ratio) bit rate to improve sound quality. It relates to an apparatus and a method thereof.

CELP 구조를 갖는 음성 코덱은 현재 이동 통신 시스템에서 가장 널리 사용되는 것으로, 선형 예측 부호화(Linear Prediction coding, 이하 LPC라고 약함)를 기본으로 한다. 이러한 CELP 구조를 갖는 음성 코덱은 서비스의 종류에 따라 요구되는 전송률 및 대역폭이 다르다. The speech codec having the CELP structure is the most widely used in the current mobile communication system, and is based on linear prediction coding (hereinafter, referred to as LPC). The voice codec having the CELP structure has a different data rate and bandwidth depending on the type of service.

그러나, 일반적인 음성 코덱은 전송률 및 대역폭이 부호화 장치에서 설정되므로 복호화장치에서 전송률 및 대역폭을 선택할 수 없다. 또한, 네트워크 상에서 하나의 송신단에서 여러 수신단으로 패킷 정보를 전송하는 멀티 캐스팅(multicasting)이 수행될 때, 송신단의 음성 코덱이 고정된 비트율을 가지면, 각기 다른 비트율을 요구하는 수신단으로 전송되는 패킷 정보의 질이 저하될 수 있다. However, in the general voice codec, since the bit rate and bandwidth are set in the encoding device, the bit rate and bandwidth cannot be selected in the decoding device. In addition, when multicasting is performed to transmit packet information from one transmitter to multiple receivers on the network, if the voice codec of the transmitter has a fixed bit rate, the packet information transmitted to the receivers requiring different bit rates may be used. The quality may deteriorate.

이를 개선하기 위하여 비트율 확장 음성 부호화 방식을 채택한 음성 코덱이 제안되었다. 이러한 음성 코덱은 기본 코덱(base codec)의 정보뿐만 아니라 복원할 신호를 더 정확하게 할 정보가 추가되도록 비트 스트림(bit stream)을 구성한다. In order to improve this, a speech codec employing a bit rate extended speech coding scheme has been proposed. This voice codec constitutes a bit stream so that not only the information of the base codec but also the information to more accurately correct the signal to be restored are added.

기존의 비트율 확장 음성 부호화 방식은 크게 SNR(Signal to Noise Ratio, 이하 SNR이라 약함) 비트율 확장방법과 대역폭 확장방법으로 분류할 수 있다. Conventional bit rate extension speech coding schemes can be broadly classified into signal to noise ratio (SNR) bit rate extension methods and bandwidth extension methods.

SNR 비트율 확장 방법에 의한 음성 부호화는 계층적(hierarchical) 코딩방식으로 음성신호를 부호화하고 복호화 한다. 즉, 음성신호를 기본 계층(base layer)과 음질 향상 계층(speech enhancement layer)으로 나누어 음성신호를 부호화한다. 기본 계층은 최소한의 음질을 복원할 수 있는 정보만을 전송한다. 음질 향상 계층에서는 음질을 향상시킬 수 있는 추가 정보를 전송한다. Speech encoding by the SNR bit rate extension method encodes and decodes a speech signal using a hierarchical coding scheme. That is, the speech signal is encoded by dividing the speech signal into a base layer and a speech enhancement layer. The base layer transmits only information that can restore the minimum sound quality. The sound quality enhancement layer transmits additional information to improve sound quality.

그러나, 기존에 제안된 SNR 비트율 확장 음성 부호화 장치는 기본 계층과 음질 향상 계층을 독립적으로 부호화하도록 구성되어 있다. 따라서 고정 코드북을 탐색할 때 요구되는 대상 신호(또는 타겟 벡터)와 임펄스 응답과의 상관도와 에너지를 검출하기 위한 연산이 기본 계층과 음질 향상 계층에서 각각 수행되므로 고정 코드북 탐색을 위한 매개 변수를 구하기 위해 많은 연산량이 요구된다. However, the proposed SNR bit rate extended speech encoding apparatus is configured to independently encode the base layer and the sound quality enhancement layer. Therefore, since the calculation of correlation and energy between the target signal (or target vector) and the impulse response required when searching the fixed codebook is performed in the base layer and the sound quality enhancement layer, the parameters for the fixed codebook search are obtained. A lot of computation is required.

그리고, 기존에 제안된 SNR 비트율 확장 음성 부호화장치는 상기 음질 향상 계층을 추가로 운영하기 위하여 기존의 표준화된 CELP 음성 부호화기의 구조를 변경하여 기존의 표준화된 CELP 음성 부호화기와 호환되지 않는 단점을 갖고 있다. In addition, the proposed SNR bit rate extended speech coder has a disadvantage in that it is not compatible with the existing standardized CELP speech coder by changing the structure of the existing standardized CELP speech coder to further operate the sound quality enhancement layer. .

본 발명이 이루고자 하는 기술적 과제는 기존의 표준화된 음성 코덱의 고정 코드북과 다층 구조를 이루는 고정 코드북을 포함하여 기존의 표준화된 음성 코덱과 호환성을 갖는 SNR 비트율을 확장하는 음성 부호화 및 복호화 장치와 그 방법을 제공하는데 있다. The present invention provides a speech encoding and decoding apparatus and method for extending an SNR bit rate compatible with existing standardized speech codecs, including a fixed codebook of a conventional standardized speech codec and a fixed codebook having a multi-layered structure. To provide.

본 발명이 이루고자 하는 다른 기술적 과제는 고정 코드북 탐색을 위한 매개 변수를 구하는 연산량이 감소된 SNR 비트율 확장 음성 부호화 및 복호화 장치와 그 방법을 제공하는데 있다. Another object of the present invention is to provide an SNR bit rate extended speech encoding and decoding apparatus and method for reducing the amount of computation for obtaining parameters for fixed codebook search.

본 발명이 이루고자 하는 또 다른 기술적 과제는 기본 계층에서 탐색된 고정 코드북의 기여도와 음질 향상 계층의 합성된 여기 신호(excitation signal)가 제거된 대상신호를 이용하여 음질 향상 계층의 고정 코드북을 탐색하는 SNR 비트율 확장 음성 부호화 및 복호화 장치와 그 방법을 제공하는데 있다.Another technical problem to be solved by the present invention is SNR for searching a fixed codebook of a sound quality enhancement layer by using a target signal from which contribution of the fixed codebook found in the base layer and a synthesized excitation signal of the sound quality enhancement layer are removed. Disclosed are a bit rate extended speech encoding and decoding apparatus and a method thereof.

본 발명이 이루고자 하는 또 다른 기술적 과제는 기본 계층에서 탐색된 펄스의 위치와 음질 향상 계층에서 탐색된 펄스의 위치가 중복되는 것을 허용함으로써, 대수 코드북의 한계를 극복할 수 있는 SNR 비트율 확장 음성 부호화 및 복호화 장치와 그 방법을 제공하는데 있다. Another technical problem to be solved by the present invention is to allow overlapping of the position of the pulse searched in the base layer and the position of the pulse searched in the sound quality enhancement layer, thereby overcoming the limitations of the algebraic codebook. The present invention provides a decoding apparatus and a method thereof.

본 발명이 이루고자 하는 또 다른 기술적 과제는 음질 향상 계층에서의 고정 코드북의 이득값에 대한 양자화 비트를 줄일 수 있는 SNR 비트율 확장 음성 부호화 및 복호화 장치와 그 방법을 제공하는데 있다. Another object of the present invention is to provide an apparatus and method for SNR bit rate extension speech encoding and decoding capable of reducing quantization bits for gain values of fixed codebooks in a sound quality enhancement layer.

상기 기술적 과제들을 달성하기 위하여 본 발명은, 선형 예측 코딩을 사용하여 입력 음성신호를 필터링하고, 고정 코드북 탐색 및 적응 코드북 탐색에 의해 상기 필터링된 음성신호의 여기 신호를 생성하는 기본 계층; 및 상기 기본 계층에서의 고정 코드북 탐색에 의해 얻어지는 매개 변수를 이용하여 고정 코드북을 탐색하는 음질 향상 계층을 적어도 하나 포함하고, 상기 기본 계층에서 생성되는 신호와 상기 음질 향상 계층에서 생성되는 신호를 다중화하고, 상기 다중화된 신호를 출력하는 다중화기를 포함하는 음성신호 부호화 장치를 제공한다. In order to achieve the above technical problem, the present invention provides a communication system comprising: a base layer for filtering an input speech signal using linear predictive coding and generating an excitation signal of the filtered speech signal by fixed codebook searching and adaptive codebook searching; And at least one sound quality enhancement layer for searching a fixed codebook using a parameter obtained by the fixed codebook search in the base layer, and multiplexing a signal generated in the base layer and a signal generated in the sound quality enhancement layer. And a multiplexer for outputting the multiplexed signal.

상기 기술적 과제들을 달성하기 위하여 본 발명은, 입력되는 음성신호를 선형 예측 부호화 필터링하고, 고정 코드북 탐색 및 적응 코드북 탐색에 의해 상기 필터링된 음성 신호에 대응되는 여기 신호를 생성하는 기본 계층; 및 상기 기본 계층에서의 고정 코드북 탐색에 따라 생성되는 매개 변수를 이용하여 고정 코드북을 탐색하는 고정 코드북 탐색부, 상기 기본 계층의 상기 고정 코드북 탐색에 의해 생성된 제 1 고정 코드북 이득값과 상기 고정 코드북 탐색부로부터 출력되는 제 2 고정 코드북 이득값간의 차를 검출하고, 검출된 차를 양자화 하는 이득값 차 양자화기를 포함하는 음질 향상 계층을 복수개 구비하고, 상기 기본 계층에서 생성되는 신호와 상기 음질 향상 계층에서 생성되는 신호를 다중화하는 다중화기를 포함하는 음성신호 부호화 장치를 제공한다. According to an aspect of the present invention, there is provided an apparatus including: a base layer configured to linearly predictively encode an input speech signal and generate an excitation signal corresponding to the filtered speech signal by fixed codebook searching and adaptive codebook searching; And a fixed codebook search unit for searching a fixed codebook using a parameter generated according to the fixed codebook search in the base layer, a first fixed codebook gain value generated by the fixed codebook search in the base layer, and the fixed codebook. A plurality of sound quality enhancement layers including a gain difference quantizer for detecting a difference between the second fixed codebook gain values output from the searcher and quantizing the detected difference, and a signal generated in the base layer and the sound quality enhancement layer Provided is a speech signal encoding apparatus including a multiplexer for multiplexing a signal generated by a.

상기 기술적 과제들을 달성하기 위하여 본 발명은, 기본 계층과 적어도 하나의 음질 향상 계층으로 나뉘어 부호화된 음성신호를 디코딩하기 위한 음성 신호 복호화 장치에 있어서, 부호화된 음성신호중에서 기본 계층에서의 부호화 정보를 디코드 하기 위한 제 1 복호화 유니트; 상기 음성 신호 복호화 장치의 동작 환경에 따라 상기 부호화된 음성신호중에서 음질 향상 계층에서의 부호화 정보를 복원하는 제 2 복호화 유니트; 상기 음성 신호 복호화 장치의 동작 환경에 따라 상기 제 1 복호화 유니트에서 복원된 신호와 상기 제 2 복호화 유니트에서 복원된 신호를 연산하는 연산 유니트; 상기 제 1 복호화 유니트에서 출력되는 선형 예측 부호화 계수를 이용하여 상기 연산 유니트에서 출력되는 신호를 합성하여 음성신호를 복원하는 음성신호 복원 유니트를 포함하는 음성 신호 복호화 장치를 제공한다.In order to achieve the above technical problem, the present invention provides a speech signal decoding apparatus for decoding a speech signal encoded by being divided into a base layer and at least one sound quality enhancement layer, and decodes the encoding information of the encoded speech signal in the base layer. A first decoding unit for performing; A second decoding unit for restoring encoding information in a sound quality enhancement layer among the encoded speech signals according to an operating environment of the speech signal decoding apparatus; A calculation unit configured to calculate a signal restored in the first decoding unit and a signal restored in the second decoding unit according to an operating environment of the speech signal decoding apparatus; Provided is a speech signal decoding apparatus comprising a speech signal recovery unit for recovering a speech signal by synthesizing a signal output from the calculation unit using the linear prediction coding coefficients output from the first decoding unit.

상기 제 1 복호화 유니트는, 상기 기본 계층에서의 부호화 정보에 포함되어 있는 선형 예측 부호화 계수 양자화 정보를 디코드 하는 선형 예측 부호화 계수 복호화부; 상기 기본 계층에서의 부호화 정보에 포함되어 있는 고정 코드북 인덱스를 디코드 하는 제 1 고정 코드북 복호화부; 상기 기본 계층에서의 부호화 정보에 포함되어 있는 적응 코드북 인덱스를 디코드 하는 적응 코드북 복호화부; 상기 기본 계층에서의 부호화 정보에 포함되어 있는 고정 코드북 이득값과 적응 코드북 이득값을 각각 디코드하는 이득값 복호화부를 포함하는 것이 바람직하다. The first decoding unit includes: a linear prediction coding coefficient decoder for decoding linear prediction coding coefficient quantization information included in the encoding information in the base layer; A first fixed codebook decoder which decodes a fixed codebook index included in the encoding information in the base layer; An adaptive codebook decoder which decodes an adaptive codebook index included in the encoding information in the base layer; Preferably, a gain value decoding unit for decoding the fixed codebook gain value and the adaptive codebook gain value included in the encoding information in the base layer, respectively.

상기 제 2 복호화 유니트는, 상기 음성 향상 계층에서의 부호화 정보에 포함되어 있는 고정 코드북 이득값간의 차의 양자화 정보를 디코드 하는 이득값 차 복 호화부; 상기 음질 향상 계층에서의 부호화 정보에 포함되어 있는 고정 코드북 인덱스를 디코드 하는 제 2 고정 코드북 복호화부를 포함하는 것이 바람직하다. The second decoding unit includes: a gain value difference decoder which decodes quantization information of a difference between fixed codebook gain values included in encoded information in the speech enhancement layer; Preferably, a second fixed codebook decoder for decoding the fixed codebook index included in the encoding information in the sound quality enhancement layer is included.

상기 제 2 복호화 유니트는, 상기 음질 향상 계층에서의 부호화 정보에 포함되어 있는 고정 코드북 로그스케일 이득값간의 차의 양자화 정보를 디코드 하는 이득값 차 복호화부; 상기 음질 향상 계층에서의 부호화 정보에 포함되어 있는 고정 코드북 인덱스를 디코드 하는 제 2 고정 코드북 복호화부를 포함하는 것이 바람직하다. The second decoding unit includes: a gain value difference decoding unit for decoding quantization information of the difference between the fixed codebook logscale gain values included in the encoding information in the sound quality enhancement layer; Preferably, a second fixed codebook decoder for decoding the fixed codebook index included in the encoding information in the sound quality enhancement layer is included.

상기 제 2 복호화 유니트는, 상기 음성 향상 계층에서의 부호화 정보에 포함되어 있는 고정 코드북 로그스케일 이득값간의 차의 양자화 정보를 디코드 하는 이득값 차 복호화부; 상기 음질 향상 계층에서의 부호화 정보에 포함되어 있는 고정 코드북 인덱스를 디코드 하는 고정 코드북 복호화부를 포함하는 것이 바람직하다. The second decoding unit includes: a gain value difference decoding unit for decoding the quantization information of the difference between the fixed codebook logscale gain values included in the encoding information in the speech enhancement layer; Preferably, the fixed codebook decoder includes a fixed codebook index included in the encoded information in the sound quality enhancement layer.

상기 기술적 과제들을 달성하기 위하여 본 발명은, 입력된 음성신호의 선형 예측 부호화 계수를 추출하고, 고정 코드북 탐색 및 적응 코드북 탐색에 의해 상기 입력된 음성신호에 대응하는 여기 신호를 생성하는 기본 계층 처리 단계; 상기 기본 계층 처리 단계에서 상기 고정 코드북 탐색에 따라 생성된 매개 변수를 이용하여 고정 코드북을 탐색하는 음질 향상 계층 처리 단계; 상기 기본 계층 처리 단계와 상기 음질 향상 계층 처리 단계에 의해 생성되는 신호를 다중화하는 단계를 포함하는 음성 신호 부호화 방법을 제공한다. According to an aspect of the present invention, there is provided a basic layer processing step of extracting a linear predictive coding coefficient of an input speech signal and generating an excitation signal corresponding to the input speech signal by a fixed codebook search and an adaptive codebook search. ; A sound quality enhancement layer processing step of searching for a fixed codebook using a parameter generated according to the fixed codebook search in the base layer processing step; It provides a speech signal encoding method comprising the step of multiplexing the signal generated by the base layer processing step and the sound quality enhancement layer processing step.

상기 음질 향상 계층 처리 단계는 복수 단계로 수행되는 것이 바람직하다. The sound quality enhancement layer processing step is preferably performed in a plurality of steps.

상기 기술적 과제들을 달성하기 위하여 본 발명은, 기본 계층과 적어도 하나 의 음질 향상 계층으로 부호화된 음성 신호를 복호화하기 위한 음성 신호 복호화 방법에 있어서, 상기 부호화된 음성신호를 복호화하는 단계; 상기 복호화 단계에서 복호화된 기본 계층에 대한 코드북과 음질 향상 계층에 대한 코드북을 상기 음성 신호 복호화의 동작 조건에 따라 선택적으로 전송하는 단계; 상기 선택적으로 전송되는 코드북과 상기 복호화 단계에서 복호화된 선형 예측 계수를 합성하여 복원된 음성신호를 생성하는 단계를 포함하는 음성 신호 복호화 방법을 제공한다. According to an aspect of the present invention, there is provided a speech signal decoding method for decoding a speech signal encoded by a base layer and at least one sound quality enhancement layer, the method comprising: decoding the encoded speech signal; Selectively transmitting the codebook for the base layer decoded in the decoding step and the codebook for the sound quality enhancement layer according to the operating condition of the speech signal decoding; And generating a reconstructed speech signal by combining the selectively transmitted codebook and the linear prediction coefficients decoded in the decoding step.

상기 기술적 과제들을 달성하기 위하여 본 발명은, 선형 예측 부호화를 사용하여 입력 음성신호를 필터링하고, 고정 코드북 탐색 및 적응 코드북 탐색에 의해 상기 필터링된 음성신호의 여기 신호를 생성하는 기본 계층; 상기 기본 계층의 고정 코드북 탐색 대상 신호에서 상기 기본 계층의 고정 코드북의 기여도를 제거한 신호를 대상 신호로 하여 고정 코드북을 탐색하는 음질 향상 계층을 적어도 하나 포함하고, 상기 기본 계층에서 생성되는 신호와 상기 음질 향상 계층에서 생성되는 신호를 다중화하고, 상기 다중화된 신호를 출력하는 다중화기를 포함하는 음성신호 부호화 장치를 제공한다. In order to achieve the above technical problem, the present invention includes a base layer for filtering an input speech signal using linear predictive coding, and generating an excitation signal of the filtered speech signal by fixed codebook search and adaptive codebook search; And a sound quality enhancement layer for searching for a fixed codebook by using a signal from which the contribution of the fixed codebook of the base layer is removed from the fixed codebook search target signal of the base layer, and the signal generated in the base layer and the sound quality. The present invention provides a speech signal encoding apparatus including a multiplexer for multiplexing a signal generated in an enhancement layer and outputting the multiplexed signal.

상기 기본 계층의 고정 코드북 기여도 y₂(n)은 상기 기본 계층의 고정 코드북의 양자화 이득값이 승산된 고정 코드북 c_G와 합성 필터의 임펄스 응답 h(n)을 이용한 하기 식에 기초하여 계산되는 것이 바람직하다. The fixed codebook contribution y ₂ (n) of the base layer is calculated based on the following equation using the impulse response h (n) of the synthesis filter and the fixed codebook c _G multiplied by the quantization gain value of the fixed codebook of the base layer. desirable.

상기 음질 향상 계층은 상기 선형 예측 부호화 계수를 이용하여 음질 향상 계층에서 생성된 고정 코드북 신호를 합성한 신호를 상기 기본 계층의 대상 신호로부터 더 제거하는 것이 바람직하다. The sound quality enhancement layer may further remove a signal obtained by synthesizing the fixed codebook signal generated in the sound quality enhancement layer by using the linear prediction coding coefficients from the target signal of the base layer.

상기 음질 향상 계층의 고정 코드북 탐색 시, 상기 기본 계층의 고정 코드북 탐색에 의해 얻어진 제 1 이득값의 로그 스케일 값과 상기 음질 향상 계층에서의 고정 코드북 탐색에 의해 얻어진 제 2 이득값의 로그 스케일 값간의 차를 양자화한 결과를 이용하여 음질 향상 계층의 양자화된 이득값을 구하고 양자화된 이득값을 상기 음질 향상 계층에서 고정 코드북 탐색에 의해 얻어진 고정 코드북 벡터에 승산하는 기능을 더 포함하는 것이 바람직하다. In the fixed codebook search of the sound quality enhancement layer, between the log scale value of the first gain value obtained by the fixed codebook search of the base layer and the log scale value of the second gain value obtained by the fixed codebook search in the sound quality enhancement layer. It is preferable to further include a function of obtaining a quantized gain value of the sound quality enhancement layer by using the result of quantizing the difference and multiplying the quantized gain value by the fixed codebook vector obtained by the fixed codebook search in the sound quality enhancement layer.

상기 음질 향상 계층은, 상기 대상신호를 인지 가중 필터링한 후, 상기 고정 코드북 탐색을 수행하는 것이 바람직하다. The sound quality enhancement layer may perform the fixed codebook search after the cognitive weighting filtering of the target signal.

상기 기술적 과제들을 달성하기 위하여 본 발명은, 입력되는 음성신호를 선형 예측 부호화 필터링하고, 고정 코드북 탐색 및 적응 코드북 탐색에 의해 상기 필터링 된 음성 신호에 대응되는 여기 신호를 생성하는 기본 계층; 상기 기본 계층의 고정 코드북 탐색 대상 신호에서 기본 계층의 고정 코드북 기여도를 제거한 신호를 음질 향상 계층의 고정 코드북 탐색 대상 신호로 하여, 고정 코드북을 탐색하는 탐색부, 상기 기본 계층의 상기 고정 코드북 탐색에 의해 생성된 제 1 고정 코드북의 로그 스케일 이득값과 상기 고정 코드북 탐색부로부터 출력되는 제 2 고정 코드북의 로그 스케일 이득값간의 차를 검출하고, 검출된 차를 양자화 하는 로그 스케일 이득값 차 양자화기를 포함하는 음질 향상 계층을 복수개 구비하고, 상기 기본 계층에서 생성되는 신호와 상기 음질 향상 계층에서 생성되는 신호를 다중화 하는 다중화기를 포함하고, 상기 음질 향상 계층은 상기 음질 향상 계층에서 선형 예측 부호화 계수를 이용하여 고정 코드북을 합성한 신호를 상기 음질 향상 계층의 고정 코드북 탐색 대상 신호로부터 더 제거하는 것을 특징으로 하는 음성신호 부호화 장치를 제공한다. According to an aspect of the present invention, there is provided an apparatus, including: a base layer configured to linearly predictively code an input speech signal and generate an excitation signal corresponding to the filtered speech signal by fixed codebook searching and adaptive codebook searching; A searcher for searching for a fixed codebook by using a signal from which the fixed codebook contribution of the base layer is removed from the fixed codebook search target signal of the base layer as a fixed codebook search target signal of a sound quality enhancement layer, by the fixed codebook search of the base layer And a log scale gain value difference quantizer for detecting a difference between the generated log scale gain value of the first fixed codebook and the log scale gain value of the second fixed codebook output from the fixed codebook search unit, and quantizing the detected difference. And a multiplexer for multiplexing a signal generated in the base layer and a signal generated in the sound quality enhancement layer, wherein the sound quality enhancement layer is fixed using a linear prediction coding coefficient in the sound quality enhancement layer. Codebook synthesized signal of the sound quality enhancement layer And it provides an audio signal encoding apparatus according to claim 1, further removed from the constant codebook search target signal.

상기 기술적 과제들을 달성하기 위하여 본 발명은, 입력된 음성신호의 선형 예측 계수를 추출하고, 고정 코드북 탐색 및 적응 코드북 탐색에 의해 상기 입력된 음성신호에 대응하는 여기 신호를 생성하는 기본 계층 처리 단계; 상기 기본 계층의 고정 코드북 탐색 대상 신호에서 기본 계층의 고정 코드북 기여도를 제거한 신호를 음질 향상 계층의 고정 코드북 탐색 대상 신호로 하여, 고정 코드북을 탐색하는 음질 향상 계층 처리 단계; 상기 기본 계층 처리 단계와 상기 음질 향상 계층 처리 단계에 의해 생성되는 신호를 다중화하는 단계를 포함하는 음성 신호 부호화 방법을 제공한다. According to an aspect of the present invention, there is provided a method, comprising: a base layer processing step of extracting a linear prediction coefficient of an input speech signal and generating an excitation signal corresponding to the input speech signal by a fixed codebook search and an adaptive codebook search; A sound quality enhancement layer processing step of searching for a fixed codebook by using a signal from which the fixed codebook contribution of the base layer is removed from the fixed codebook search target signal of the base layer as a fixed codebook search target signal of a sound quality enhancement layer; It provides a speech signal encoding method comprising the step of multiplexing the signal generated by the base layer processing step and the sound quality enhancement layer processing step.

이하 본 발명의 실시 예에 따른 비트율 확장 음성 부호화 및 복호화 장치와 그 방법을 살펴보면 다음과 같다. Hereinafter, a bit rate extended speech encoding and decoding apparatus and a method thereof according to an embodiment of the present invention will be described.

도 1은 본 발명의 바람직한 일 실시 예에 따른 비트율 확장 음성 부호화 장치의 기능 블록도이다. 도 1을 참조하면, 본 발명의 일 실시 예에 따른 음성 부호화 장치는 기본 계층(100)과 음질 향상 계층(130)을 포함하는 다층 고정 코드북 구조를 갖는다. 1 is a functional block diagram of a bit rate extended speech encoding apparatus according to an embodiment of the present invention. Referring to FIG. 1, a speech encoding apparatus according to an embodiment of the present invention has a multi-layer fixed codebook structure including a base layer 100 and a sound quality enhancement layer 130.

기본 계층(100)에서는 최소한의 음질을 복원할 수 있는 부호화 정보가 생성된다. 기본 계층(100)은 기존의 표준화된 CELP 음성 부호화기의 구성과 유사하다. 따라서, 기본 계층(100)은 입력 음성 신호를 선형 예측 부호화에 의해 필터링하여 입력 음성신호에 대응되는 여기 신호(excitation signal)를 생성한다. In the base layer 100, encoding information for restoring the minimum sound quality is generated. The base layer 100 is similar to the configuration of the existing standardized CELP speech coder. Accordingly, the base layer 100 filters the input speech signal by linear predictive coding to generate an excitation signal corresponding to the input speech signal.

기본 계층(100)은 전처리 유니트(102), LPC 계수 추출 및 벡터 양자화기(104), 합성 필터(106), 감산기(108), 인지 가중 필터(perceptual weighting filter)(110), 피치(pitch) 분석부(112), 피치 기여도(contribution) 제거부(115), 고정 코드북 탐색부(117), 고정 코드북(119), 제 1 승산기(121), 가산기(123), 적응 코드북(124), 제 2 승산기(126), 이득값 양자화기(129)로 구성된다. The base layer 100 includes a preprocessing unit 102, an LPC coefficient extraction and vector quantizer 104, a synthesis filter 106, a subtractor 108, a perceptual weighting filter 110, and a pitch. Analysis unit 112, pitch contribution removal unit 115, fixed codebook search unit 117, fixed codebook 119, first multiplier 121, adder 123, adaptive codebook 124, 2 multiplier 126, a gain value quantizer 129.

전처리 유니트(102)는 라인(101)을 통해 입력되는 음성 신호에서 DC성분을 제거한다. 즉, 전처리 유니트(102)는 하이패스 필터를 사용하여 입력 음성 신호를 필터링하여 입력 음성 신호의 저주파 대역의 노이즈 성분을 제거한다. 사용된 하이패스 필터 H_h1(n)은 수학식 1과 같은 전달 함수를 갖는다.The preprocessing unit 102 removes the DC component from the audio signal input through the line 101. That is, the preprocessing unit 102 filters the input voice signal using a high pass filter to remove noise components of the low frequency band of the input voice signal. The high pass filter H _h1 (n) used has a transfer function as shown in equation (1).

전처리 유니트(102)로부터 출력되는 신호는 라인(103)을 통해 LPC 계수 추출 및 벡터 양자화기(104)로 전송된다. The signal output from the preprocessing unit 102 is transmitted to the LPC coefficient extraction and vector quantizer 104 via line 103.

LPC 계수 추출 및 벡터 양자화기(104)는 상기 전처리 유니트(102)로부터 출력되는 신호의 LPC 계수를 추출한다. 추출된 LPC 계수는 LPC 계수 추출 및 벡터 양자화기(104)에 의해 벡터 양자화 된다. LPC 계수의 벡터 양자화 정보는 라인(105) 을 통해 합성 필터(106)와 다중화기(140)로 전송된다.The LPC coefficient extraction and vector quantizer 104 extracts the LPC coefficients of the signal output from the preprocessing unit 102. The extracted LPC coefficients are vector quantized by the LPC coefficient extraction and vector quantizer 104. Vector quantization information of the LPC coefficients is sent via line 105 to synthesis filter 106 and multiplexer 140.

합성 필터(synthesis filter)(106)는 상기 LPC 계수의 벡터 양자화 정보를 이용하여 라인(128)을 통해 입력되는 여기 신호(excitation signal)에 대응되는 합성된 신호를 출력한다. 상기 합성된 신호는 라인(107)을 통해 감산기(108)로 출력된다. A synthesis filter 106 outputs a synthesized signal corresponding to an excitation signal input through the line 128 using the vector quantization information of the LPC coefficients. The synthesized signal is output to subtractor 108 via line 107.

감산기(108)는 라인(103)을 통해 입력되는 전처리 유니트(102)로부터 출력되는 신호에서 라인(107)을 통해 입력되는 합성된 신호를 감산하여 차 신호를 생성한다. 상기 차 신호는 라인(109)을 통해 인지 가중 필터(110)로 전송된다.The subtractor 108 subtracts the synthesized signal input through the line 107 from the signal output from the preprocessing unit 102 input through the line 103 to generate a difference signal. The difference signal is transmitted via the line 109 to the cognitive weighting filter 110.

인지 가중 필터(110)는 인체 청각 구조의 마스킹(masking) 효과를 이용하기 위하여 양자화 잡음이 마스킹 임계치이하가 되도록 한다. 따라서 인지 가중 필터(110)는 상기 차 신호의 양자화 잡음이 최소화되도록 가중치를 포함하는 신호를 피치 분석부(112)로 출력한다. The cognitive weighting filter 110 allows the quantization noise to be less than or equal to the masking threshold in order to use the masking effect of the human auditory structure. Accordingly, the cognitive weighting filter 110 outputs a signal including a weight to the pitch analyzer 112 so that the quantization noise of the difference signal is minimized.

피치(pitch) 분석부(112)는 인지 가중 필터(110)로부터 출력되는 신호에 대해 개회로(open-loop) 피치와 폐회로(close-loop) 피치를 탐색한다. 즉, 피치 분석부(112)는 인지 가중 필터(110)로부터 출력되는 신호를 복수개의 서브프레임(subframe)으로 나누고, 상기 각 서브 프레임의 피치를 분석하여 적응 코드북의 인덱스와 이득값을 출력한다. 상기 적응 코드북의 인덱스는 라인(113)을 통해 피치 기여도 제거부(115)와 적응 코드북(124)으로 전송되면서 라인(114)을 통해 다중화기(140)로 전송된다. 또한, 상기 적응 코드북의 이득값은 이득값 양자화기(129)로 제공된다. The pitch analyzer 112 searches for an open-loop pitch and a close-loop pitch with respect to the signal output from the cognitive weight filter 110. That is, the pitch analyzer 112 divides the signal output from the cognitive weighting filter 110 into a plurality of subframes, analyzes the pitch of each subframe, and outputs an index and a gain value of the adaptive codebook. The index of the adaptive codebook is transmitted to the multiplexer 140 through the line 114 while being transmitted to the pitch contribution remover 115 and the adaptive codebook 124 through the line 113. The gain value of the adaptive codebook is also provided to a gain value quantizer 129.

피치 기여도 제거부(115)는 상기 적응 코드북의 인덱스를 토대로 인지 가중 필터(110)의 출력 신호로부터 고정 코드북 탐색을 위해 필요한 대상 신호(또는 타겟 벡터)를 검출한다. 그리고 피치 기여도 제거부(115)는 라인(111)에서 피치 기여도 y₁(n)을 감산하여 고정 코드북 탐색 대상 신호를 라인(116)을 통해 기본 계층(100)의 고정 코드북 탐색부(117)와 음질 향상 계층(130)의 고정 코드북 탐색부(131)로 출력한다. 피치 기여도 y₁(n)은 수학식 2에 의하여 구해진다. The pitch contribution remover 115 detects a target signal (or target vector) required for the fixed codebook search from the output signal of the cognitive weight filter 110 based on the index of the adaptive codebook. The pitch contribution remover 115 subtracts the pitch contribution y ₁ (n) from the line 111 to transmit the fixed codebook search target signal to the fixed codebook search unit 117 of the base layer 100 through the line 116. The fixed codebook search unit 131 of the sound quality enhancement layer 130 is output. The pitch contribution y ₁ (n) is obtained by equation (2).

수학식 2에서 AC_G(n)은 적응 코드북 이득값이 승산된 값이다. In Equation 2, AC _G (n) is a value multiplied by an adaptive codebook gain value.

고정 코드북 탐색부(117)는 라인(111)을 통해 입력된 대상 신호 x'(n)을 사용하여 대상신호와 임펄스 응답 h(n)과의 상관도 d(n)을 구한다.The fixed codebook search unit 117 calculates the correlation d (n) between the target signal and the impulse response h (n) using the target signal x '(n) input through the line 111.

예를 들어 부프레임의 크기가 40샘플이고 각 계층의 펄스 수가 4개라고 가정하면, 상기 상관도 d(n)은 수학식 3과 같이 정의될 수 있다. For example, assuming that the size of the subframe is 40 samples and the number of pulses in each layer is four, the correlation d (n) may be defined as shown in Equation 3 below.

수학식 3에서 h(i-n)은 임펄스 응답이고, x'(n)은 대상 신호이다. In Equation 3, h (i-n) is an impulse response, and x '(n) is a target signal.

상기 임펄스 응답 h(n)과 상관도 d(n)은 라인(118')을 통하여 음질향상 계층(130)의 고정 코드북 탐색부(131)로 제공된다.The impulse response h (n) and the correlation d (n) are provided to the fixed codebook search unit 131 of the sound quality enhancement layer 130 through the line 118 '.

상기 고정 코드북 탐색부(117)는 상기 임펄스 응답 h(n)과 상기 상관도 d(n)을 토대로 표 1의 예와 같이 구성된 대수 코드북(algebraic codebook) 형태의 고정 코드북을 탐색한다. The fixed codebook search unit 117 searches for a fixed codebook in the form of an algebraic codebook configured as shown in Table 1 based on the impulse response h (n) and the correlation d (n).

표 1을 참고하면, 고정 코드북 탐색부(117)에서 고정 코드북 벡터는 4개의 위치에서만 그 펄스의 크기가 0이 아니다. 따라서 상기 펄스의 부호 s와 상관도 d(n)를 이용하여 각 펄스의 상관도 d(n)의 크기의 합인 상관도 C는 수학식 2와 같이 정의될 수 있다. 고정 코드북 탐색부(117)는 수학식 4에 의해 상관도 C를 검출한다. Referring to Table 1, in the fixed codebook search unit 117, the magnitude of the pulse is not zero only in four positions. Therefore, the correlation C, which is the sum of the magnitudes of the correlation d (n) of each pulse using the symbol s of the pulse and the correlation d (n), may be defined as in Equation 2. The fixed codebook search unit 117 detects the correlation C by equation (4).

수학식 4에서 m_i는 i번째 펄스의 위치를 나타내고, s_i는 i번째 펄스의 부호를 나타낸다. 고정 코드북 검출부(117)는 합성 필터(106)의 임펄스 응답 h(n)의 에너지 E를 수학식 5에 의해 검출한다. In Equation 4, m _i represents the position of the i-th pulse, s _i represents the sign of the i-th pulse. The fixed codebook detector 117 detects the energy E of the impulse response h (n) of the synthesis filter 106 by the equation (5).

수학식 5에서 ??(m_i, m_j)는 i번째 펄스의 위치와 j번째 펄스의 위치에 대한 임펄스 응답신호 h(n)간의 상관도이고, s_i는 i번째 펄스의 부호이고, s_j는 j번째 펄스의 부호이다. ?? (m _i , m _j ) in Equation 5 is a correlation between the position of the i-th pulse and the impulse response signal h (n) with respect to the position of the j-th pulse, s _i is the sign of the i-th pulse, and s _j is the sign of the j th pulse.

상기 고정 코드북 탐색부(117)는 상기 상관도 C와 임펄스 응답 h(n)의 에너지 E를 저장한다. 상관도 C는 부호 sign[d(i)]와 그 절대값으로 나뉘어 저장된다. sign[d(i)]는 d(i)의 부호이다. 상기 에너지 E는 수학식 6와 같은 형태로 저장된다. The fixed codebook search unit 117 stores the correlation C and the energy E of the impulse response h (n). The correlation degree C is stored by dividing the sign [d (i)] and its absolute value. sign [d (i)] is the sign of d (i). The energy E is stored in the form as shown in Equation 6.

에너지 E에 대한 수학식 5은 수학식 7와 같이 재 정의될 수 있다. Equation 5 for energy E may be redefined as in Equation 7.

고정 코드북 탐색부(117)는 상기 검출된 상관도 C와 에너지 E를 라인(118")을 통해 음질 향상 계층(130)의 고정 코드북 탐색부(131)로 제공하면서, 검출된 상관도 C와 에너지 E를 이용하여 대수 코드북으로 구성된 고정 코드북을 탐색한다. 상기 고정 코드북 탐색에 의해 고정 코드북 인덱스와 이득 값이 얻어지면, 고정 코드북 탐색부(117)는 상기 고정 코드북 인덱스를 고정 코드북(119)과 다중화기(140)로 전송하고, 상기 이득 값을 이득값 양자화기(129)로 전송한다. The fixed codebook search unit 117 provides the detected correlation C and energy E to the fixed codebook search unit 131 of the sound quality enhancement layer 130 through the line 118 ", while detecting the detected correlation C and energy E. The fixed codebook consisting of algebraic codebooks is searched using E. When the fixed codebook index and the gain value are obtained by the fixed codebook search, the fixed codebook search unit 117 multiplies the fixed codebook index with the fixed codebook 119. And transmits the gain value to the gain value quantizer 129.

고정 코드북(119)은 라인(118)을 통해 입력된 인덱스를 토대로 기본 계층(100)의 고정 코드북 벡터를 출력한다. 고정 코드북(119)에서 출력되는 고정 코드북 벡터는 라인(120)을 통해 제 1 승산기(121)로 제공된다.The fixed codebook 119 outputs a fixed codebook vector of the base layer 100 based on the index input through the line 118. The fixed codebook vector output from the fixed codebook 119 is provided to the first multiplier 121 via the line 120.

제 1 승산기(121)는 이득값 양자화기(129)에서 제공되는 상기 고정 코드북의 이득 값에 대한 양자화 이득 값 G_c를 상기 펄스 위치와 부호 정보에 승산하고 그 결과를 라인(122)을 통해 출력한다. 라인(122)을 통해 출력되는 신호는 고정 코드북의 벡터이다. 상기 양자화 이득값 G_c는 이득값 양자화기(129)로부터 제공된다. The first multiplier 121 multiplies the pulse position and the sign information by the quantization gain value G _c for the gain value of the fixed codebook provided by the gain value quantizer 129 and outputs the result through the line 122. do. The signal output over line 122 is a vector of fixed codebooks. The quantization gain value G _c is provided from a gain value quantizer 129.

라인(113)을 통해 적응 코드북 인덱스가 인가되면, 적응 코드북(124)은 상기 적응 코드북 인덱스에 대응되는 펄스의 위치 정보와 부호 정보를 출력한다. 라인(125)을 통해 출력되는 적응 코드북 벡터는 제 2 승산기(126)로 제공된다. When the adaptive codebook index is applied through the line 113, the adaptive codebook 124 outputs position information and sign information of a pulse corresponding to the adaptive codebook index. The adaptive codebook vector output over line 125 is provided to a second multiplier 126.

제 2 승산기(126)는 적응 코드북의 이득값에 대한 양자화된 이득값 G_p를 상기 라인(125)을 통해 전송되는 적응 코드북 벡터에 승산하고, 그 결과를 라인(127)을 통해 출력한다. 상기 라인(127)을 통해 출력되는 신호는 이득값 G_p가 승산된 적응 코드북의 벡터이다. 상기 양자화된 이득값 G_p는 이득값 양자화기(129)로부터 제공된다. The second multiplier 126 multiplies the quantized gain value G _p for the gain value of the adaptive codebook by the adaptive codebook vector transmitted via the line 125 and outputs the result via line 127. The signal output through the line 127 is a vector of the adaptive codebook multiplied by the gain value G _p . The quantized gain value G _p is provided from a gain value quantizer 129.

가산기(123)는 라인(122)을 통해 입력되는 이득값 G_c가 승산된 고정 코드북 벡터와 라인(127)을 통해 입력되는 이득값 G_p가 승산된 적응 코드북 벡터를 가산하여 여기 신호를 얻는다. 상기 여기 신호는 라인(128)을 통해 합성 필터(106)로 출력된다. The adder 123 adds the fixed codebook vector multiplied by the gain value G _c inputted through the line 122 and the adaptive codebook vector multiplied by the gain value G _p inputted through the line 127 to obtain an excitation signal. The excitation signal is output to the synthesis filter 106 via line 128.

이득값 양자화기(129)는 고정 코드북 탐색부(117)로부터 출력되는 고정 코드북의 이득값과 피치 분석부(112)로부터 출력되는 적응 코드북의 이득값을 각각 양자화한다. 상기 고정 코드북의 이득값을 양자화한 이득값 G_c은 제 1 승산기(121)로 출력되고, 적응 코드북의 이득값을 양자화한 이득값 G_p는 제 2 승산기(126)로 출력된다. 상기 양자화한 이득값 G_c는 음질 향상 계층(130)에 포함되어 있는 이득값 차 양자화기(134)로도 제공된다. The gain value quantizer 129 quantizes the gain value of the fixed codebook output from the fixed codebook search unit 117 and the gain value of the adaptive codebook output from the pitch analyzer 112, respectively. The gain value G _c obtained by quantizing the gain value of the fixed codebook is output to the first multiplier 121, and the gain value G _p obtained by quantizing the gain value of the adaptive codebook is output to the second multiplier 126. The quantized gain value G _c is also provided to the gain value quantizer 134 included in the sound quality enhancement layer 130.

음질 향상 계층(130)은 복원되는 음질을 향상시키기 위하여 기본 계층(100)에서 제공되는 비트이외에 추가적인 비트를 더 제공하기 위한 것이다. 예를 들어 기본 계층(100)이 8kbps의 비트율을 제공할 때, 음질 향상 계층(130)이 4kbps의 추가 비트율을 제공할 수 있다. 도 1은 설명의 편의를 위하여 하나의 음성 향상 계층(130)이 기본 계층(100)에 연결된 구성을 도시하였으나, 복수개의 음성 향상 계층이 기본 계층(100)에 연결될 수 있다. The sound quality enhancement layer 130 is to provide additional bits in addition to the bits provided in the base layer 100 in order to improve the restored sound quality. For example, when the base layer 100 provides a bit rate of 8 kbps, the sound quality enhancement layer 130 may provide an additional bit rate of 4 kbps. 1 illustrates a configuration in which one voice enhancement layer 130 is connected to the base layer 100 for convenience of description, but a plurality of voice enhancement layers may be connected to the base layer 100.

음질 향상 계층(130)은 고정 코드북 탐색부(131)와 이득값 차 양자화기(134)로 구성된다. 고정 코드북 탐색부(131)는 라인(118')을 통해 제공되는 임펄스 응답 신호 h(n), 대상 신호와 임펄스 응답신호 h(n)의 상관도인 d(n), 펄스의 부호와 상기 상관도인 d(n)을 이용하여 검출된 d(n)의 크기 정보에 해당되는 상관도 C 및 임펄스 응답 신호 h(n)의 에너지 E를 이용하여 대수 코드북으로 구성된 고정 코드북을 탐색한다. The sound quality enhancement layer 130 includes a fixed codebook search unit 131 and a gain value quantizer 134. The fixed codebook search unit 131 has an impulse response signal h (n) provided through the line 118 ', d (n) which is a correlation between the target signal and the impulse response signal h (n), and the sign of the pulse. A fixed codebook consisting of logarithmic codebooks is searched using the correlation C corresponding to the magnitude information of d (n) detected using the diagram d (n) and the energy E of the impulse response signal h (n).

이와 같이 고정 코드북 탐색부(131)는 고정 코드북 탐색부(117)에서 탐색된 대상 신호와 동일한 대상 신호에 대한 고정 코드북 탐색을 수행한다. 고정 코드북 탐색부(131)는 대수 코드북을 사용한다. 고정 코드북 탐색부(131)는 대상 신호(타겟 벡터)의 MSE(Mean Square Error)를 최소화하고, 수학식 6을 최대화하는 벡터 c_k를 찾는다. 찾아진 벡터 c_k가 고정 코드북 벡터가 된다.As described above, the fixed codebook search unit 131 performs a fixed codebook search on the same target signal as the target signal searched by the fixed codebook search unit 117. The fixed codebook search unit 131 uses an algebraic codebook. The fixed codebook search unit 131 minimizes MSE (Mean Square Error) of the target signal (target vector) and finds a vector c _k that maximizes Equation 6. The found vector c _k becomes a fixed codebook vector.

수학식 8에서 Φ는 임펄스 응답 h(n)간의 상관도를 나타낸다. 상기 d(n)과 Φ는 기본 계층(100)에서 제공하는 값을 이용한다. 상기 Φ은 고정 코드북 탐색부(117)로부터 제공된다. 따라서, 고정 코드북 탐색부(131)는 고정 코드북 탐색 시 필요한 연산량을 줄일 수 있다. In Equation 8 Φ represents the correlation between the impulse response h (n). The d (n) and Φ use values provided by the base layer 100. Φ is provided from the fixed codebook search unit 117. Accordingly, the fixed codebook search unit 131 can reduce the amount of computation required when searching for the fixed codebook.

기본 계층(100)의 고정 코드북 벡터의 차수가 40이고, 기본 계층(100)과 음질 향상 계층(130)에서 크기가 0이 아닌 펄스를 각각 4개 찾는다고 가정하면, 기본 계층(100)의 고정 코드북(117)에서 먼저 4개의 펄스를 찾고 음질 향상 계층(130)의 고정 코드북 탐색부(131)에서 4개의 펄스를 찾기 때문에, 고정 코드북 탐색부(131) 는 기본 계층(100)에서 찾은 4개의 펄스의 영향도 고려한다. 따라서, 고정 코드북 탐색부(131)에서 얻어지는 상관도 C'는 수학식 9와 같이 정의될 수 있고, 에너지 E'는 수학식 10과 같이 정의될 수 있다. Assuming that the degree of the fixed codebook vector of the base layer 100 is 40 and that the base layer 100 and the sound quality enhancement layer 130 find four non-zero pulses, respectively, the base layer 100 is fixed. Since the codebook 117 first finds four pulses and the fixed codebook search unit 131 of the sound quality enhancement layer 130 finds four pulses, the fixed codebook search unit 131 finds four pulses found in the base layer 100. Consider the effects of the pulses. Therefore, the correlation C 'obtained from the fixed codebook search unit 131 may be defined as in Equation 9, and the energy E' may be defined as in Equation 10.

수학식 4에 정의된 상관도 C값을 이용하여 상기 수학식 9는 수학식 11와 같이 재 정의될 수 있다. Equation 9 may be redefined as in Equation 11 using the correlation C value defined in Equation 4.

고정 코드북 탐색부(131)는 탐색 과정의 복잡도를 줄이기 위하여 에너지 E 을 수학식 12과 같이 재 정의된 연산에 의해 검출할 수 있다. The fixed codebook search unit 131 may detect the energy E by a redefined operation as shown in Equation 12 in order to reduce the complexity of the search process.

수학식 12는 수학식 7에 정의되어 있는 에너지 E를 이용하면, 수학식 13과 같이 재 정의될 수 있다. Equation 12 may be redefined as in Equation 13 using the energy E defined in Equation 7.

상관도 C'와 에너지 E'는 음질 향상 계층(130)에서의 고정 코드북 탐색 이전에 저장되어 고정 코드북 탐색 과정을 간소화 할 수 있다. The correlation C 'and the energy E' may be stored before the fixed codebook search in the sound quality enhancement layer 130 to simplify the fixed codebook search process.

상술한 상관도 C', 에너지 E'를 이용하여 음질 향상 계층(130)의 펄스의 부호 정보와 위치 정보를 얻기 위한 고정 코드북 탐색부(131)의 과정은 기본 계층(100)의 고정 코드북 탐색부(117)에서 수행되는 방식과 동일하게 이루어진다. 이 때, 기본 계층(100)에서 탐색된 펄스의 위치 정보와 음질 향상 계층에서 탐색된 펄스의 위치 정보는 동일할 수 있다.Of the fixed codebook search unit 131 for obtaining the sign information and the position information of the pulse of the sound quality enhancement layer 130 using the above-described correlations C 'and energy E'. The process is the same as that performed by the fixed codebook search unit 117 of the base layer 100. In this case, the position information of the pulse searched in the base layer 100 and the position information of the pulse searched in the sound quality enhancement layer may be the same.

도 2는 도 1의 비트율 확장 음성 부호화 장치에 있어서 고정 코드북 탐색부(117)에 의해 탐색된 펄스의 위치와 고정 코드북 탐색부(131)에 의해 탐색된 펄스의 위치를 설명하기 위한 도면이다. FIG. 2 is a diagram for describing a position of a pulse searched by the fixed codebook search unit 117 and a position of a pulse searched by the fixed codebook search unit 131 in the bit rate extended speech encoding apparatus of FIG. 1.

도 2를 참조하면, 고정 코드북 탐색(201)에서 탐색된 펄스의 위치는 음질 향상 계층 고정 코드북 탐색(202)에서 탐색된 펄스의 위치와 같을 수 있다. 따라서, 최종 고정 코드북의 펄스의 크기는 기본 계층(100)과 음질 향상 계층(130)의 고정 코드북 펄스의 크기를 포함한 다중 크기를 갖는다. 따라서, 대수 코드북의 펄스의 크기는 +1 또는 -1만 갖지 않는다. Referring to FIG. 2, the position of the pulse searched in the fixed codebook search 201 may be the same as the position of the pulse searched in the sound quality enhancement layer fixed codebook search 202. Accordingly, the magnitude of the pulse of the final fixed codebook has multiple magnitudes including the magnitude of the fixed codebook pulses of the base layer 100 and the sound quality enhancement layer 130. Thus, the magnitude of the pulses of algebraic codebooks does not have only +1 or -1.

고정 코드북 탐색부(131)는 탐색 결과에 따라 얻어진 고정 코드북 벡터는 다중화기(140)로 제공하고, 고정 코드북의 이득값을 이득값 차 양자화기(134)로 제공한다. 상기 음질 향상 계층(130)에서의 상기 고정 코드북 인덱스는 펄스 부호 정보와 펄스의 위치 정보로 구성 될 수 있다.The fixed codebook search unit 131 provides the fixed codebook vector obtained according to the search result to the multiplexer 140 and provides the gain value of the fixed codebook to the gain value quantizer 134. The fixed codebook index in the sound quality enhancement layer 130 may be composed of pulse code information and pulse position information.

이와 같이 음질 향상 계층(130)에서 탐색된 고정 코드북 인덱스는 다음 프레임을 위하여 저장되지 않아 기본 계층(100)의 동작에 영향을 주지 않는다. As such, the fixed codebook index found in the sound quality enhancement layer 130 is not stored for the next frame and thus does not affect the operation of the base layer 100.

이득값 차 양자화기(134)는 고정 코드북 탐색부(131)에서 구한 고정 코드북의 이득값(132)과 기본 계층(100)에서 양자화된 고정 코드북의 이득값(G_c)간의 차를 구하고, 상기 차를 양자화 한다. 이에 따라 이득값 차 양자화 정보(G_diff)가 이득값 차 양자화기(134)로부터 라인(135)을 통해 다중화기(140)로 전송되므로, 음질 향상 계층(130)은 고정 코드북의 이득값에 대한 양자화 비트를 줄일 수 있다. The gain difference quantizer 134 obtains a difference between the gain value 132 of the fixed codebook obtained by the fixed codebook search unit 131 and the gain value G _c of the fixed codebook quantized in the base layer 100, Quantize the car. Accordingly, since the gain difference quantization information G _diff is transmitted from the gain difference quantizer 134 to the multiplexer 140 through the line 135, the sound quality enhancement layer 130 is applied to the gain value of the fixed codebook. Quantization bits can be reduced.

다중화기(140)는 기본 계층(100)으로부터 제공되는 LPC 계수 양자화 정보, 고정 코드북 인덱스, 적응 코드북 인덱스, 이득값 양자화 정보와 음질 향상 계층(130)으로부터 제공되는 음질 향상 계층의 고정 코드북 인덱스, 이득값 차 양자화 정보를 비트 스트림으로 출력한다. The multiplexer 140 may include the LPC coefficient quantization information, the fixed codebook index, the adaptive codebook index, the gain value quantization information provided from the base layer 100, and the fixed codebook index of the sound quality enhancement layer provided from the sound quality enhancement layer 130. The value difference quantization information is output as a bit stream.

기본 계층(100)과 음질 향상 계층(130)의 비트 스트림은 구분하여 전송한다.즉, 도 1에 도시된 바와 같이 음질 향상 계층(130)의 비트 스트림은 기본 계층(100)의 비트 스트림 뒤에 전송된다. 이에 따라 상기 비트 스트림은 네트워크 트래픽 상태에 따라 복호화 장치에 필요한 비트율로 쉽게 분리될 수 있다. 예를 들어 복호화 장치측의 채널 특성이 열악하여 기본 계층의 비트 스트림만 수신할 수 있는 경우에, 상기 복호화 장치는 도 1의 비트율 확장 음성 부호화 장치가 송출하는 비트 스트림에서 기본 계층의 비트 스트림만 수신할 수 있다. The bit streams of the base layer 100 and the sound quality enhancement layer 130 are separately transmitted. That is, as shown in FIG. 1, the bit streams of the sound quality enhancement layer 130 are transmitted after the bit streams of the base layer 100. do. Accordingly, the bit stream can be easily separated at the bit rate required for the decoding apparatus according to the network traffic conditions. For example, when the channel characteristic of the decoding apparatus is poor and only the bit stream of the base layer can be received, the decoding apparatus receives only the bit stream of the base layer from the bit stream transmitted by the bit rate extension speech encoding apparatus of FIG. 1. can do.

도 3을 참조하면, 상기 비트율 확장 음성 복호화 장치는 역다중화기(301), LPC 계수 복호화부(302), 이득값 복호화부(303), 제 1 고정 코드북 복호화부(304), 적응 코드북 복호화부(305), 이득값 차 복호화부(306), 제 2 고정 코드북 복호화부(307), 제 1 가산기(308), 제 2 가산기(309), 제 1 선택 스위치(310), 제 2 선택 스위치(311), 제 1 승산기(312), 제 2 승산기(313), 제 3 가산기(314), 합성 필터(315), 및 후처리부(316)로 구성된다. Referring to FIG. 3, the apparatus for decoding a bit rate extension speech includes a demultiplexer 301, an LPC coefficient decoder 302, a gain value decoder 303, a first fixed codebook decoder 304, and an adaptive codebook decoder. 305, gain difference decoder 306, second fixed codebook decoder 307, first adder 308, second adder 309, first select switch 310, second select switch 311 ), A first multiplier 312, a second multiplier 313, a third adder 314, a synthesis filter 315, and a post processor 316.

상기 비트율 확장 음성 복호화 장치는 비트율 확장 음성 부호화장치로부터 전송되는 비트 스트림을 선택적으로 수신할 수 있다. 즉, 비트 스트림에서 기본 계층에 대한 비트 스트림만 수신하면, 기본 계층의 음질을 복원할 수 있고, 기본 계층 및 음질 향상 계층에 대한 비트 스트림을 모두 수신하면, 좀더 향상된 음질을 제공할 수 있다. The bit rate extended speech decoding apparatus may selectively receive a bit stream transmitted from the bit rate extended speech encoding apparatus. That is, if only the bit stream for the base layer is received in the bit stream, the sound quality of the base layer can be restored, and if both the bit streams for the base layer and the sound quality enhancement layer are received, a more improved sound quality can be provided.

역다중화기(301)는 수신되는 비트 스트림을 각 모듈의 정보로 역다중화하여 출력한다. 즉, 역다중화기(301)는 LPC 계수 양자화 정보를 LPC 계수 복호화부(302)로, 이득값 양자화 정보는 이득값 복호화부(303)로, 이득값 차 양자화 정보는 이득값 차 복호화부(306)로, 음질 향상 계층의 고정 코드북 인덱스는 제 2 고정 코드북 복호화부(307)로, 고정 코드북 인덱스는 제 1 고정 코드북 복호화부(304)로, 적응 코드북 인덱스는 적응 코드북 복호화부(305)로 각각 제공한다. The demultiplexer 301 demultiplexes the received bit stream with information of each module and outputs the demultiplexer. That is, the demultiplexer 301 converts the LPC coefficient quantization information into the LPC coefficient decoder 302, the gain value quantization information into the gain value decoder 303, and the gain difference quantization information into the gain value difference decoder 306. The fixed codebook index of the sound quality enhancement layer is provided to the second fixed codebook decoder 307, the fixed codebook index to the first fixed codebook decoder 304, and the adaptive codebook index to the adaptive codebook decoder 305, respectively. do.

LPC 계수 복호화부(302)의 구조는 부호화 장치측의 LPC 계수 추출 및 벡터 양자화기(104)에 의해 결정되고, 입력되는 LPC 계수 양자화 정보로부터 LPC 계수를 복원한다. 복원된 LPC 계수는 합성 필터(315)와 후처리부(316)로 제공된다. The structure of the LPC coefficient decoder 302 is determined by the LPC coefficient extraction and vector quantizer 104 on the encoding apparatus side, and restores the LPC coefficients from the input LPC coefficient quantization information. The restored LPC coefficients are provided to the synthesis filter 315 and the post processor 316.

이득값 복호화부(303)의 구조는 부호화 장치측의 이득값 양자화기(129)에 의해 결정된다. 이득값 복호화부(303)는 입력되는 이득값 양자화 정보를 디코딩한다. 상기 이득값 양자화 정보는 적응 코드북 이득값과 고정 코드북 이득값을 포함한다. 따라서, 이득값 복호화부(303)로부터 기본 계층(100)에서의 적응 코드북 이득값 g_p와 고정 코드북 이득값 g_c가 각각 출력된다. The structure of the gain value decoder 303 is determined by the gain value quantizer 129 on the encoding device side. The gain value decoder 303 decodes the input gain value quantization information. The gain value quantization information includes an adaptive codebook gain value and a fixed codebook gain value. Therefore, the adaptive codebook gain value g _p and the fixed codebook gain value g _c in the base layer 100 are output from the gain value decoding unit 303, respectively.

제 1 고정 코드북 복호화부(304)는 입력되는 기본 계층(100)의 고정 코드북 인덱스를 디코딩하여 기본 계층(100)의 고정 코드북을 출력한다. 고정 코드북 복호 방식은 부호화장치의 고정 코드북 탐색부(117)에서의 탐색방식에 의해 결정된다. 적응 코드북 복호화부(305)는 입력되는 적응 코드북 인덱스를 디코딩하여 기본 계층(100)의 적응 코드북을 출력한다. The first fixed codebook decoder 304 decodes the fixed codebook index of the input base layer 100 and outputs the fixed codebook of the base layer 100. The fixed codebook decoding method is determined by the search method in the fixed codebook search unit 117 of the encoding apparatus. The adaptive codebook decoder 305 decodes the input adaptive codebook index and outputs the adaptive codebook of the base layer 100.

상술한 LPC 계수 복호화부(302), 이득값 복호화부(303), 제 1 고정 코드북 복호화부(304), 및 적응 코드북 복호화부(305)는 역다중화기(301)로부터 전송되는 기본 계층(100)에서의 부호화 정보를 디코딩하는 제 1 복호화 유니트로 정의될 수 있다. The LPC coefficient decoder 302, the gain value decoder 303, the first fixed codebook decoder 304, and the adaptive codebook decoder 305 described above are the base layer 100 transmitted from the demultiplexer 301. It may be defined as a first decoding unit for decoding the encoding information in the.

이득값 차 복호화부(306)와 제 2 고정 코드북 복호화부(307)의 동작은 네트워크 트랙픽 상태나 수신 단말의 처리 용량에 의존한다. The operation of the gain value difference decoder 306 and the second fixed codebook decoder 307 depends on the network traffic state or the processing capacity of the receiving terminal.

만약 이득값 차 복호화부(306)와 제 2 고정 코드북 복호화부(307)가 동작되는 것으로 결정되면, 이득값 차 복호화부(306)는 입력되는 이득값 차 양자화 정보를 디코딩한다. 제 2 고정 코드북 복호화부(307)는 입력되는 음질 향상 계층의 고정 코드북 인덱스를 디코딩한다. 이득값 차 복호화 방식은 부호화 장치측의 이득값 차 양자화기(134)에 의해 결정된다. 제 2 고정 코드북 복호화부(307)에서의 디코딩 방식은 부호화장치측의 제 2 고정 코드북 탐색부(131)에 의해 결정된다. If it is determined that the gain value difference decoder 306 and the second fixed codebook decoder 307 are operated, the gain value difference decoder 306 decodes the input gain value difference quantization information. The second fixed codebook decoder 307 decodes the fixed codebook index of the input sound quality enhancement layer. The gain value difference decoding method is determined by the gain value difference quantizer 134 on the encoding apparatus side. The decoding method of the second fixed codebook decoding unit 307 is determined by the second fixed codebook searching unit 131 on the encoding apparatus side.

이득값 차 복호화부(306)와 제 2 고정 코드북 복호화부(307)는 역다중화기(301)로부터 전송되는 음질 향상 계층(130)에서의 부호화 정보를 디코딩하는 제 2 복호화 유니트로 간주될 수 있다. The gain value difference decoder 306 and the second fixed codebook decoder 307 may be regarded as a second decoding unit for decoding the encoding information in the sound quality enhancement layer 130 transmitted from the demultiplexer 301.

제 1 가산기(308)는 이득값 복호화부(303)로부터 출력되는 디코딩된 고정 코 드북의 이득값 g_c와 이득값 차 복호화부(306)로부터 출력되는 디코딩된 이득값 차 g_diff를 가산한다. 제 1 가산기(308)의 출력은 복호화시 음질 향상 계층의 이득값이다. The first adder 308 adds the gain value g _c of the decoded fixed codebook output from the gain value decoding unit 303 and the decoded gain value difference g _diff output from the gain value difference decoding unit 306. The output of the first adder 308 is the gain value of the sound quality enhancement layer upon decoding.

제 2 가산기(309)는 제 2 고정 코드북 복호화부(307)에서 디코딩된 음질 향상 계층(130)의 고정 코드북과 제 1 고정 코드북 복호화부(304)에서 디코딩된 기본 계층(100)의 고정 코드북을 가산한다. 따라서, 제 2 가산기(309)로부터 출력되는 신호는 수학식 13와 같이 정의할 수 있다. The second adder 309 selects the fixed codebook of the sound quality enhancement layer 130 decoded by the second fixed codebook decoder 307 and the fixed codebook of the base layer 100 decoded by the first fixed codebook decoder 304. Add. Therefore, the signal output from the second adder 309 may be defined as shown in Equation (13).

수학식 14에서 c(n)은 기본 계층에서의 고정 코드북이고, c'(n)은 음질 향상 계층에서의 고정 코드북이다. In Equation 14, c (n) is a fixed codebook in the base layer, and c '(n) is a fixed codebook in the sound quality enhancement layer.

이에 따라 복호화 장치에서의 고정 코드북 펄스는 기본 계층과 음질 향상 계층의 대수 코드북을 누적시켜 다중 크기를 갖는 대수 코드북 펄스 구조를 갖는다. 상기 대수 코드북을 누적시키는 것은 모든 펄스의 크기가 같은 크기를 갖는 기존의 고정 코드북 구조에서 발생되는 단점을 보완하기 위한 것이다. 따라서 누적시킨 대수 코드북의 펄스 부호는 대상 신호에 적합한 부호를 갖는다. Accordingly, the fixed codebook pulse in the decoding apparatus has an algebraic codebook pulse structure having multiple magnitudes by accumulating algebraic codebooks of the base layer and the sound quality enhancement layer. Accumulating the algebraic codebooks is to compensate for the disadvantages of the existing fixed codebook structure in which all pulses have the same magnitude. Therefore, the accumulated pulse code of the algebraic codebook has a code suitable for the target signal.

제 1 선택 스위치(310)는 이득값 복호화부(303)에서 디코딩된 고정 코드북 이득값 g_c와 제 1 가산기(308)에서 출력되는 신호를 선택적으로 전송한다. 즉, 복호화 장치가 기본 계층으로 동작하면, 제 1 선택 스위치(310)는 이득값 복호화부(303)로부터 출력되는 고정 코드북 이득값 g_c를 전송하고, 해당되는 복호화 장치가 음질 향상 계층으로 동작하면, 제 1 선택 스위치(310)는 가산기(308)로부터 출력되는 이득값을 전송한다. The first selection switch 310 selectively transmits the fixed codebook gain value g _c decoded by the gain value decoder 303 and the signal output from the first adder 308. That is, when the decoding apparatus operates as the base layer, the first selection switch 310 transmits the fixed codebook gain value g _c output from the gain value decoding unit 303, and when the corresponding decoding apparatus operates as the sound quality enhancement layer. The first selection switch 310 transmits a gain value output from the adder 308.

제 2 선택 스위치(311)는 제 2 가산기(309)로부터 출력되는 신호와 제 1 고정 코드북 복호화부(304)에서 출력되는 기본 계층(100)에서의 고정 코드북을 선택적으로 전송한다. 즉, 상기 복호화 장치가 음질 향상 계층에서 동작되지 않을 경우에, 제 2 선택 스위치(311)는 제 1 고정 코드북 복호화부(304)에서 출력되는 신호를 전송하고, 상기 복호화 장치가 음질향상 계층에서 동작할 경우에, 제 2 선택 스위치(311)는 제 2 가산기(309)로부터 출력되는 신호를 전송한다. The second selection switch 311 selectively transmits the signal output from the second adder 309 and the fixed codebook in the base layer 100 output from the first fixed codebook decoder 304. That is, when the decoding apparatus is not operated in the sound quality enhancement layer, the second selection switch 311 transmits a signal output from the first fixed codebook decoder 304 and the decoding apparatus operates in the sound quality enhancement layer. In this case, the second selection switch 311 transmits a signal output from the second adder 309.

제 1 승산기(312)는 제 2 선택 스위치(311)로부터 출력되는 고정 코드북에 제 1 선택 스위치(310)에서 출력되는 이득값을 승산하여 출력한다. The first multiplier 312 multiplies and outputs a gain value output from the first selection switch 310 to the fixed codebook output from the second selection switch 311.

제 2 승산기(313)는 적응 코드북 복호화부(305)로부터 출력되는 디코딩된 적응 코드북에 이득값 복호화부(303)로부터 출력되는 적응 코드북의 이득값 g_p를 승산하여 출력한다. The second multiplier 313 multiplies the decoded adaptive codebook output from the adaptive codebook decoder 305 by the gain value g _p of the adaptive codebook output from the gain value decoder 303.

제 3 가산기(314)는 제 1 승산기(312)로부터 출력되는 고정 코드북에 대한 정보와 제 2 승산기(313)로부터 출력되는 적응 코드북에 대한 정보를 가산하여 복원된 여기 신호를 발생한다. The third adder 314 adds the information about the fixed codebook output from the first multiplier 312 and the information about the adaptive codebook output from the second multiplier 313 to generate a restored excitation signal.

상술한 제 1 가산기(308), 제 2 가산기(309), 제 3 가산기(314), 제 1 승산기(312), 제 2 승산기(313), 제 1 선택 스위치(310) 및 제 2 선택 스위치(311)는 상술한 제 1 복호화 유니트와 제 2 복호화 유니트에서 각각 디코딩된 신호를 상기 복호화 장치의 동작환경에 따라 연산하는 연산 유니트로 정의될 수 있다. The first adder 308, the second adder 309, the third adder 314, the first multiplier 312, the second multiplier 313, the first selector switch 310, and the second selector switch ( 311 may be defined as a calculation unit that calculates the signals decoded by the first decoding unit and the second decoding unit according to an operating environment of the decoding apparatus.

합성 필터(315)는 LPC 계수 복호화부(302)로부터 제공되는 복원된 LPC 계수를 이용하여 가산기(314)로부터 제공되는 여기 신호를 합성하여 음성신호를 복원한다. The synthesis filter 315 synthesizes the excitation signal provided from the adder 314 using the reconstructed LPC coefficients provided from the LPC coefficient decoder 302 to restore the speech signal.

후처리부(316)는 합성 필터(315)로부터 전송되는 음성신호의 음질을 향상시키는 역할을 한다. 즉, 후처리부(316)는 음성 신호의 음질을 향상시키기 위하여, LPC 계수 복호화부(302)로부터 제공되는 LPC 계수를 이용하여 합성 필터(315)로부터 출력되는 신호를 필터링 하기 위한 하이패스 필터(High Pass Filtering)를 사용한다. The post processor 316 serves to improve the sound quality of the voice signal transmitted from the synthesis filter 315. That is, the post processor 316 is a high pass filter for filtering the signal output from the synthesis filter 315 using the LPC coefficients provided from the LPC coefficient decoder 302 in order to improve the sound quality of the voice signal. Pass Filtering).

상술한 합성 필터(315)와 후처리부(316)는 상기 연산 유니트로부터 출력되는 신호를 LPC 계수 복호화부(302)로부터 출력되는 LPC 계수와 합성하여 음성신호를 복원하는 복원 유니트로 정의될 수 있다. The synthesis filter 315 and the post processor 316 described above may be defined as a reconstruction unit for reconstructing a voice signal by combining the signal output from the operation unit with the LPC coefficient output from the LPC coefficient decoder 302.

도 4는 본 발명의 일 실시 예에 따른 비트율 확장 음성 부호화 방법의 동작 흐름도이다.4 is a flowchart illustrating an operation of a bit rate extended speech encoding method according to an embodiment of the present invention.

제 401 단계에서 음성신호 부호화 장치는 도 1의 전처리 유니트(102)와 같이 입력된 음성 신호를 전처리한다. 제 402 단계에서 음성신호 부호화 장치는 전처리된 음성 신호에서 LPC 계수를 추출하고, 추출된 LPC 계수의 양자화 정보를 생성한다.In operation 401, the apparatus for encoding a speech signal preprocesses an input speech signal as in the preprocessing unit 102 of FIG. 1. In operation 402, the apparatus for encoding a speech signal extracts LPC coefficients from a preprocessed speech signal and generates quantization information of the extracted LPC coefficients.

제 403 단계에서 음성 신호 부호화 장치는 생성된 LPC 계수의 양자화 정보를 이용하여 여기 신호를 도 1의 합성 필터(106)에서와 같이 합성한다. 제 404 단계에서 음성신호 부호화 장치는 상기 전처리된 신호에서 상기 합성된 신호를 감산하여 LPC 잔차 신호를 검출한다. 제 405 단계에서 음성 신호 부호화 장치는 검출된 LPC 잔차 신호를 도 1의 인지 가중 필터(110)에서와 같이 필터링하여 인지 가중된 신호를 출력한다. In operation 403, the apparatus for encoding a speech signal synthesizes the excitation signal as in the synthesis filter 106 of FIG. 1 using the generated quantization information of the LPC coefficients. In operation 404, the apparatus for encoding a speech signal detects an LPC residual signal by subtracting the synthesized signal from the preprocessed signal. In operation 405, the apparatus for encoding a speech signal filters the detected LPC residual signal as in the cognitive weighting filter 110 of FIG. 1 and outputs a cognitive weighted signal.

제 406 단계에서 음성 신호 부호화 장치는 인지 가중된 신호의 피치를 도 1의 피치 분석부(112)와 같이 분석하여 적응 코드북의 인덱스와 이득값을 얻는다. 그리고 도 1의 피치 기여도 제거부(115)와 같이 적응 코드북의 인덱스를 토대로 인지 가중된 신호에서 피치 기여도를 제거하여 고정 코드북 탐색을 위해 필요한 대상 신호를 검출한다. In operation 406, the apparatus for encoding a speech signal analyzes the pitch of the perceptually weighted signal as in the pitch analyzer 112 of FIG. 1 to obtain an index and a gain value of the adaptive codebook. Like the pitch contribution remover 115 of FIG. 1, the pitch contribution is removed from the cognitively weighted signal based on the index of the adaptive codebook to detect the target signal required for the fixed codebook search.

제 407 단계에서 음성 신호 부호화 장치는 도 1의 제 1 고정 코드북 탐색부(117)에서와 같이 기본 계층 고정 코드북을 탐색하여 고정 코드북 이득값과 고정 코드북 인덱스를 생성한다. 제 408 단계에서 음성 신호 부호화 장치는 도 1의 이득값 양자화기(129)에서와 같이 상기 검출된 고정 코드북 이득값과 상기 검출된 적응 코드북 이득값을 양자화 한다. In operation 407, the apparatus for encoding a speech signal searches for a base layer fixed codebook as in the first fixed codebook search unit 117 of FIG. 1 to generate a fixed codebook gain value and a fixed codebook index. In operation 408, the apparatus for encoding a speech signal quantizes the detected fixed codebook gain value and the detected adaptive codebook gain value as in the gain value quantizer 129 of FIG.

제 409 단계에서 음성 신호 부호화 장치는 기본 계층에서의 상관도들 C 및 d(n), 에너지 E와 같은 매개 변수를 이용하여 음질 향상 계층 고정 코드북을 탐색한다. 음질 향상 계층 고정 코드북 탐색에 의해 음질 향상 계층 고정 코드북의 이득값과 음질 향상 계층 고정 코드북의 인덱스가 각각 생성된다. In operation 409, the apparatus for encoding a speech signal searches for a sound quality enhancement layer fixed codebook using parameters such as correlations C and d (n) and energy E in the base layer. The gain of the sound quality enhancement layer fixed codebook and the index of the sound quality enhancement layer fixed codebook are generated by searching the sound quality enhancement layer fixed codebook.

제 410 단계에서 음성 신호 부호화 장치는 기본 계층 고정 코드북의 이득값 과 음질 향상 계층의 고정 코드북의 이득값 간의 차를 양자화 한다. 상술한 음질 향상 계층에서의 고정 코드북 탐색 및 이득값 양자화 과정은 도 1에서 설명한 바와 같이 복수 개로 나누어 수행될 수 있다. 음질 향상 계층의 처리가 복수개로 나누어 수행되면, 그만큼 복원되는 음성 신호의 질이 향상될 수 있다. In operation 410, the apparatus for encoding a speech signal quantizes a difference between a gain value of the base layer fixed codebook and a gain value of the fixed codebook of the sound quality enhancement layer. The fixed codebook search and gain value quantization processes in the sound quality enhancement layer may be divided into a plurality of processes as described with reference to FIG. 1. When the processing of the sound quality enhancement layer is divided into a plurality of steps, the quality of the speech signal restored by that can be improved.

제 411 단계에서 음성 신호 부호화 장치는 상술한 단계들을 통해 얻은 LPC 계수 양자화 정보, 기본 계층의 고정 코드북 인덱스, 기본 계층의 적응 코드북 인덱스, 기본 계층의 고정 코드북의 이득값, 기본 계층의 적응 코드북의 이득값, 음질 향상 계층의 고정 코드북 인덱스 및 상기 이득값 차 양자화 정보를 비트 스트림 형태로 다중화하여 음성신호 복호화 장치측으로 송출한다. In operation 411, the apparatus for encoding a speech signal may include LPC coefficient quantization information obtained through the above steps, a fixed codebook index of a base layer, an adaptive codebook index of a base layer, a gain value of a fixed codebook of a base layer, and a gain of an adaptive codebook of a base layer. A value, a fixed codebook index of the sound quality enhancement layer, and the gain value quantization information are multiplexed in a bit stream to be transmitted to the speech signal decoding apparatus.

제 501 단계에서 음성 신호 복호화 장치는 도 3의 역다중화기(301)와 같이 수신되는 비트 스트림을 각 구성의 정보로 역다중화한다. In operation 501, the apparatus for decoding a speech signal demultiplexes a received bit stream with information of each component, such as the demultiplexer 301 of FIG. 3.

제 502 단계에서 음성 신호 복호화 장치는 상기 역다중화된 신호를 디코딩한다. 즉, 도 3의 LPC 계수 복호화부(302), 이득값 복호화부(303), 제 1 고정 코드북 복호화부(304), 적응 코드북 복호화부(305), 이득값 차 복호화부(306), 제 2 고정 코드북 복호화부(307)와 같이 상기 역다중화 된 신호를 디코딩한다. In operation 502, the voice signal decoding apparatus decodes the demultiplexed signal. That is, the LPC coefficient decoder 302, the gain value decoder 303, the first fixed codebook decoder 304, the adaptive codebook decoder 305, the gain value difference decoder 306, and the second of FIG. Like the fixed codebook decoder 307, the demultiplexed signal is decoded.

제 503 단계에서 음성 신호 복호화 장치는 음질 향상 계층 고정 코드북 이득값을 소정 연산처리에 의해 복원한다. 상기 음성 신호 복호화 장치는 복호화된 고정 코드북 이득값과 음질 향상 계층의 고정 코드북 이득값의 양자화 정보로 수신된 이득값 차를 가산하여 음질 향상 계층의 고정 코드북 이득값을 복원한다.In operation 503, the speech signal decoding apparatus restores the sound quality enhancement layer fixed codebook gain value by a predetermined operation. The speech signal decoding apparatus reconstructs the fixed codebook gain value of the sound quality enhancement layer by adding a difference between the decoded fixed codebook gain value and the received gain value as quantization information of the fixed codebook gain value of the sound quality enhancement layer.

제 504 단계에서 음성 신호 복호화 장치는 음성신호 복호화 장치의 동작 조건에 따라 음질 향상 계층의 고정 코드북과 기본 계층의 고정 코드북을 선택적으로 전송하고, 이득값도 선택적으로 전송된다. 즉, 음성 신호 복호화 장치가 음질 향상 계층에서 동작되면, 복원된 음질 향상 계층의 고정 코드북의 이득값이 승산된 음질 향상 계층의 고정 코드북을 전송시킨다. 반면에 음성 신호 부호화 장치가 음질 향상 계층에서 동작되지 않으면, 복호화된 기본 계층의 고정 코드북에 기본 계층의 고정 코드북의 이득값을 승산한 고정 코드북을 전송시킨다. In operation 504, the voice signal decoding apparatus selectively transmits the fixed codebook of the sound quality enhancement layer and the fixed codebook of the base layer according to the operating conditions of the voice signal decoding apparatus, and optionally the gain value. That is, when the audio signal decoding apparatus is operated in the sound quality enhancement layer, the fixed codebook of the sound quality enhancement layer is multiplied by the gain value of the fixed codebook of the reconstructed sound quality enhancement layer. On the other hand, if the speech signal encoding apparatus does not operate in the sound quality enhancement layer, the fixed codebook multiplying the gain value of the fixed codebook of the base layer is transmitted to the decoded fixed codebook of the base layer.

제 505 단계에서 음성 신호 복호화 장치는 제 502 단계에서 복호화된 LPC 계수를 이용하여 제 504 단계에서 선택적으로 전송된 고정 코드북을 합성한다. In operation 505, the apparatus for decoding speech signals synthesizes the fixed codebook selectively transmitted in operation 504 using the LPC coefficients decoded in operation 502.

제 506 단계에서 음성 신호 복호화 장치는 후처리부(316)와 같이 후처리하여 복원된 음성 신호를 생성한다.In operation 506, the voice signal decoding apparatus post-processes the post-processing unit 316 to generate a reconstructed voice signal.

도 6은 본 발명의 바람직한 다른 실시 예에 따른 비트율 확장 음성 부호화 장치의 기능 블록도이다. 도 6을 참조하면, 상기 비트율 확장 음성 부호화 장치는 기본 계층(600)과 음질 향상 계층(630)을 포함하는 다층 고정 코드북 구조를 갖는다.6 is a functional block diagram of a bit rate extended speech encoding apparatus according to another exemplary embodiment of the present invention. Referring to FIG. 6, the apparatus for encoding a bit rate extension speech codec has a multilayer fixed codebook structure including a base layer 600 and a sound quality enhancement layer 630.

기본 계층(600)에서는 최소한의 음질을 복원할 수 있는 부호화 정보가 생성된다. 기본 계층(600)은 기존의 표준화된 CELP 음성 부호화기의 구성과 유사하다. 따라서, 기본 계층(600)은 입력 음성 신호를 선형 예측 부호화에 의해 필터링하고 상기 필터링된 음성 신호에 대응되는 여기 신호(excitation signal)를 생성한다. 여기 신호는 고정 코드북 탐색과 적응 코드북 탐색에 의해 생성된다.In the base layer 600, encoding information for restoring the minimum sound quality is generated. The base layer 600 is similar to the configuration of the existing standardized CELP speech coder. Accordingly, the base layer 600 filters the input speech signal by linear predictive coding and generates an excitation signal corresponding to the filtered speech signal. The excitation signal is generated by fixed codebook search and adaptive codebook search.

기본 계층(600)은 전처리 유니트(602), LPC 계수 추출 및 벡터 양자화기(604), 합성 필터(606), 감산기(608), 인지 가중 필터(perceptual weighting filter)(610), 피치(pitch) 분석부(612), 피치 기여도(contribution) 제거부(615), 고정 코드북 탐색부(617), 고정 코드북(619), 제 1 승산기(621), 가산기(623), 적응 코드북(624), 제 2 승산기(626), 이득값 양자화기(629)로 구성된다. Base layer 600 includes preprocessing unit 602, LPC coefficient extraction and vector quantizer 604, synthesis filter 606, subtractor 608, perceptual weighting filter 610, pitch Analysis unit 612, pitch contribution removal unit 615, fixed codebook search unit 617, fixed codebook 619, first multiplier 621, adder 623, adaptive codebook 624, Two multipliers 626 and a gain value quantizer 629.

전처리 유니트(602)는 라인(601)을 통해 입력되는 음성 신호에서 DC성분을 제거한다. 즉, 전처리 유니트(602)는 하이패스 필터를 사용하여 입력 음성 신호를 필터링하여 입력 음성 신호의 저주파 대역의 노이즈 성분을 제거한다. 사용된 하이패스 필터는 본 발명의 일 실시 예에서 기본 계층(100)의 전처리 유니트(102)의 하이패스 필터와 동일하다. 전처리 유니트(602)로부터 출력되는 신호는 라인(603)을 통해 LPC 계수 추출 및 벡터 양자화기(604)로 전송된다. The preprocessing unit 602 removes the DC component from the voice signal input through the line 601. That is, the preprocessing unit 602 filters the input voice signal using a high pass filter to remove noise components of the low frequency band of the input voice signal. The high pass filter used is the same as the high pass filter of the preprocessing unit 102 of the base layer 100 in one embodiment of the invention. The signal output from preprocessing unit 602 is sent via line 603 to LPC coefficient extraction and vector quantizer 604.

LPC 계수 추출 및 벡터 양자화기(604)는 상기 전처리 유니트(602)로부터 출력되는 신호의 LPC 계수를 추출한다. 추출된 LPC 계수는 LPC 계수 추출 및 벡터 양자화기(604)에 의해 벡터 양자화 된다. LPC 계수의 벡터 양자화 정보는 라인(605)을 통해 합성 필터(606)와 다중화기(650)로 전송된다. The LPC coefficient extraction and vector quantizer 604 extracts the LPC coefficients of the signal output from the preprocessing unit 602. The extracted LPC coefficients are vector quantized by LPC coefficient extraction and vector quantizer 604. Vector quantization information of the LPC coefficients is sent via line 605 to synthesis filter 606 and multiplexer 650.

합성 필터(synthesis filter)(606)는 상기 LPC 계수의 벡터 양자화 정보를 이용하여 라인(628)을 통해 입력되는 여기 신호(excitation signal)에 대응되는 합성된 신호를 출력한다. 상기 합성된 신호는 라인(607)을 통해 감산기(608)로 출력된다.A synthesis filter 606 outputs a synthesized signal corresponding to an excitation signal input through the line 628 using the vector quantization information of the LPC coefficients. The synthesized signal is output to subtractor 608 via line 607.

감산기(608)는 라인(603)을 통해 입력되는 전처리 유니트(602)로부터 출력되는 신호에서 라인(607)을 통해 입력되는 합성된 신호를 감산하여 LPC 잔차 신호를 생성한다. 상기 LPC 잔차 신호는 라인(609)을 통해 인지 가중 필터(610)로 전송된다.The subtractor 608 subtracts the synthesized signal input through the line 607 from the signal output from the preprocessing unit 602 input through the line 603 to generate the LPC residual signal. The LPC residual signal is transmitted via line 609 to the cognitive weight filter 610.

인지 가중 필터(610)는 인체 청각 구조의 마스킹(masking) 효과를 이용하기 위하여 양자화 잡음이 마스킹 임계치이하가 되도록 한다. 따라서 인지 가중 필터(610)는 상기 LPC 잔차 신호의 양자화 잡음이 최소화되도록 가중치를 포함하는 신호를 피치 분석부(612)로 출력한다. The cognitive weighting filter 610 allows the quantization noise to be less than or equal to the masking threshold in order to use the masking effect of the human auditory structure. Accordingly, the cognitive weight filter 610 outputs a signal including a weight to the pitch analyzer 612 so that the quantization noise of the LPC residual signal is minimized.

피치(pitch) 분석부(612)는 인지 가중 필터(610)로부터 출력되는 신호에 대해 개회로(open-loop) 피치와 폐회로(close-loop) 피치를 탐색한다. 즉, 피치 분석부(612)는 인지 가중 필터(610)로부터 출력되는 신호를 복수개의 피치 서브프레임(subframe)으로 나누고, 상기 표준화된 CLEP 음성 부호화장치에서와 같이 각 서브 프레임의 피치를 분석하여 적응 코드북의 인덱스와 이득값을 출력한다. The pitch analyzer 612 searches for an open-loop pitch and a close-loop pitch with respect to the signal output from the cognitive weight filter 610. That is, the pitch analyzer 612 divides the signal output from the cognitive weight filter 610 into a plurality of pitch subframes, and analyzes and adapts the pitch of each subframe as in the standardized CLEP speech encoder. Output the codebook index and gain values.

상기 적응 코드북의 인덱스는 라인(613)을 통해 피치 기여도 제거부(615)와 적응 코드북(624)으로 전송되면서 라인(614)을 통해 다중화기(650)로 전송된다. 또한, 상기 적응 코드북의 이득값은 이득값 양자화기(629)로 제공된다. The index of the adaptive codebook is transmitted to the multiplexer 650 through the line 614 while being transmitted to the pitch contribution remover 615 and the adaptive codebook 624 through the line 613. The gain value of the adaptive codebook is also provided to a gain value quantizer 629.

피치 기여도 제거부(615)는 상기 적응 코드북의 인덱스를 토대로 인지 가중 필터(610)의 출력 신호로부터 고정 코드북 탐색을 위해 필요한 대상 신호를 출력한다. 그리고 피치 기여도 제거부(615)는 라인(611)에서 피치 기여도 y₁(n)을 감산하 여 고정 코드북 탐색 대상 신호를 라인(616)을 통해 기본 계층(600)의 고정 코드북 탐색부(617)로 출력한다. 피치 기여도 y₁(n)은 수학식 2에 의하여 구해진다. The pitch contribution remover 615 outputs a target signal required for fixed codebook search from the output signal of the cognitive weight filter 610 based on the index of the adaptive codebook. The pitch contribution remover 615 subtracts the pitch contribution y ₁ (n) from the line 611 to subtract the fixed codebook search target signal through the line 616 to the fixed codebook search unit 617 of the base layer 600. Will output The pitch contribution y ₁ (n) is obtained by equation (2).

고정 코드북 탐색부(617)은 라인(611)을 통해 입력된 대상 신호 x'(n)을 사용하여 대상 신호와 임펄스 응답 h(n)과의 상관도 d(n)을 구한다. The fixed codebook search unit 617 calculates a correlation d (n) between the target signal and the impulse response h (n) using the target signal x '(n) input through the line 611.

예를 들어 부프레임의 크기가 40샘플이고 각 계층의 펄스 수가 4개라고 가정하면, 상기 상관도 d(n)은 수학식 1과 같이 정의될 수 있다. For example, assuming that the size of the subframe is 40 samples and the number of pulses in each layer is 4, the correlation d (n) may be defined as in Equation 1.

상기 고정 코드북 탐색부(617)는 상기 임펄스 응답 h(n)과 상기 상관도 d(n)을 토대로 상기 표 1의 예와 같이 구성된 대수 코드북(algebraic codebook) 형태의 고정 코드북을 탐색한다. 표 1을 참고하면, 고정 코드북 탐색부(617)에서 고정 코드북 벡터는 4개의 위치에서만 그 펄스의 크기가 0이 아니다. 따라서 상기 펄스의 부호 s와 상관도 d(n)을 이용한 상관도 d(n)의 크기에 대응되는 상관도 C는 수학식 2와 같이 정의될 수 있다. 고정 코드북 탐색부(617)는 수학식 2에 의해 상관도 C를 검출한다. 고정 코드북 검출부(617)는 임펄스 응답 에너지 E를 수학식 3에 의해 검출한다. The fixed codebook search unit 617 searches a fixed codebook in the form of an algebraic codebook configured as in the example of Table 1 based on the impulse response h (n) and the correlation d (n). Referring to Table 1, the fixed codebook vector in the fixed codebook search unit 617 has a pulse size other than zero only at four positions. Accordingly, the correlation C corresponding to the magnitude of the correlation d (n) using the sign s of the pulse and the correlation d (n) may be defined as in Equation 2. The fixed codebook search unit 617 detects the correlation C by equation (2). The fixed codebook detector 617 detects the impulse response energy E by the equation (3).

상기 고정 코드북 탐색부(617)는 상기 상관도 C와 에너지 E를 저장한다. 상관도 C는 부호 sign[d(i)]와 그 절대값으로 나뉘어 저장된다. sign[d(i)]는 d(i)의 부호이다. 상기 에너지 E는 수학식 4와 같은 형태로 저장된다. 에너지 E에 대한 수학식 3은 수학식 5와 같이 재 정의될 수 있다. The fixed codebook search unit 617 stores the correlation C and the energy E. The correlation degree C is stored by dividing the sign [d (i)] and its absolute value. sign [d (i)] is the sign of d (i). The energy E is stored in the form as shown in Equation 4. Equation 3 for energy E may be redefined as in Equation 5.

상기 탐색에 의해 고정 코드북 인덱스와 이득값이 얻어지면, 고정 코드북 탐 색부(617)는 상기 고정 코드북 인덱스를 고정 코드북(619)과 다중화기(650)로 전송하고, 상기 이득값을 이득값 양자화기(629)로 전송한다. When the fixed codebook index and the gain value are obtained by the search, the fixed codebook search unit 617 transmits the fixed codebook index to the fixed codebook 619 and the multiplexer 650, and transmits the gain value to the gain value quantizer. To 629.

고정 코드북(619)은 라인(618)을 통해 입력된 인덱스를 토대로 기본 계층(600)의 고정 코드북 벡터를 출력한다. 고정 코드북 벡터는 펄스 위치 정보(m)와 부호 정보(s)를 바탕으로 구성된다. 고정 코드북(619)에서 출력되는 고정 코드북 벡터는 라인(620)을 통해 제 1 승산기(621)로 제공된다.The fixed codebook 619 outputs a fixed codebook vector of the base layer 600 based on the index input through the line 618. The fixed codebook vector is constructed based on pulse position information m and sign information s. The fixed codebook vector output from the fixed codebook 619 is provided to the first multiplier 621 via line 620.

제 1 승산기(621)는 이득값 양자화기(629)에서 제공되는 상기 고정 코드북의 이득값에 대한 양자화 이득값 G_c를 상기 고정 코드북 벡터에 승산하고 그 결과를 라인(622)을 통해 출력한다. 라인(622)을 통해 출력되는 신호는 기본 계층(600)의 고정 코드북 벡터에 양자화 이득값 G_c를 승산한 고정 코드북 c_G(n)으로 정의할 수 있다. 상기 양자화 이득값 G_c는 이득값 양자화기(629)로부터 제공된다. The first multiplier 621 multiplies the fixed codebook vector by the quantization gain value G _c for the gain value of the fixed codebook provided by gain value quantizer 629 and outputs the result via line 622. The signal output through the line 622 may be defined as the fixed codebook c _G (n) multiplied by the quantization gain value G _c of the fixed codebook vector of the base layer 600. The quantization gain value G _c is provided from a gain value quantizer 629.

라인(613)을 통해 적응 코드북 인덱스가 인가되면, 적응 코드북(624)은 상기 적응 코드북 인덱스에 대응되는 적응 코드북 벡터를 출력한다. 라인(625)을 통해 상기 적응 코드북 벡터는 제 2 승산기(626)로 제공된다. When an adaptive codebook index is applied through line 613, the adaptive codebook 624 outputs an adaptive codebook vector corresponding to the adaptive codebook index. Via line 625 the adaptive codebook vector is provided to a second multiplier 626.

제 2 승산기(626)는 적응 코드북의 이득값에 대한 양자화된 이득값 G_p를 상기 라인(625)을 통해 전송되는 적응 코드북 벡터에 승산하고, 그 결과를 라인(627)을 통해 출력한다. 상기 양자화된 이득값 G_p는 이득값 양자화기(629)로부터 제공된다. The second multiplier 626 multiplies the quantized gain value G _p for the gain value of the adaptive codebook by the adaptive codebook vector transmitted via the line 625 and outputs the result via line 627. The quantized gain value G _p is provided from a gain value quantizer 629.

가산기(623)는 라인(622)을 통해 입력되는 고정 코드북 벡터와 라인(627)을 통해 입력되는 적응 코드북 벡터를 가산하여 여기 신호를 얻는다. 상기 여기 신호는 라인(628)을 통해 합성 필터(606)로 출력된다. The adder 623 adds the fixed codebook vector input through the line 622 and the adaptive codebook vector input through the line 627 to obtain an excitation signal. The excitation signal is output to the synthesis filter 606 via line 628.

이득값 양자화기(629)는 고정 코드북 탐색부(617)로부터 출력되는 고정 코드북의 이득값과 피치 분석부(612)로부터 출력되는 적응 코드북의 이득값을 각각 양자화한다. 상기 고정 코드북의 이득값을 양자화한 이득값 G_c은 제 1 승산기(621)로 출력되고, 적응 코드북의 이득값을 양자화한 이득값 G_p는 제 2 승산기(626)로 출력된다. 상기 양자화한 이득값 G_c는 음질 향상 계층(630)에 포함되어 있는 이득값 차 양자화기(643)로도 제공된다. The gain value quantizer 629 quantizes the gain value of the fixed codebook output from the fixed codebook search unit 617 and the gain value of the adaptive codebook output from the pitch analyzer 612, respectively. The gain value G _c obtained by quantizing the gain value of the fixed codebook is output to the first multiplier 621, and the gain value G _p obtained by quantizing the gain value of the adaptive codebook is output to the second multiplier 626. The quantized gain value G _c is also provided to a gain value quantizer 643 included in the sound quality enhancement layer 630.

음질 향상 계층(630)은 도 1의 음질 향상 계층(130)과 같이 복원되는 음질을 향상시키기 위하여 기본 계층(600)에서 제공되는 비트이외에 추가적인 비트를 더 제공하기 위한 것이다. 도 6은 설명의 편의를 위하여 하나의 음성 향상 계층(630)이 기본 계층(600)에 연결된 구성을 도시하였으나, 복수개의 음성 향상 계층이 기본 계층(600)에 연결될 수 있다. The sound quality enhancement layer 630 is to provide additional bits in addition to the bits provided in the base layer 600 to improve sound quality restored as in the sound quality enhancement layer 130 of FIG. 1. 6 illustrates a configuration in which one voice enhancement layer 630 is connected to the base layer 600 for convenience of description, but a plurality of voice enhancement layers may be connected to the base layer 600.

음질 향상 계층(630)은 고정 코드북 기여도 계산부(631), 제 3 가산기(633), 합성 필터(634), 인지 가중 필터(637), 고정 코드북 탐색부(639), 고정 코드북(641), 이득값 차 양자화기(643), 및 제 3 승산기(644)로 구성된다. The sound quality enhancement layer 630 includes a fixed codebook contribution calculator 631, a third adder 633, a synthesis filter 634, a cognitive weight filter 637, a fixed codebook search unit 639, a fixed codebook 641, A gain value difference quantizer 643 and a third multiplier 644.

기본 계층(600)의 제 1 승산기(621)로부터 고정 코드북의 벡터에 양자화 이득값 G_c가 승산된 고정 코드북 c_G(n)이 수신되면, 고정 코드북 기여도 계산부(631) 는 수학식 15에 의해 고정 코드북 기여도 y₂(n)을 계산한다. When the fixed codebook c _G (n) obtained by multiplying the vector of the fixed codebook by the quantization gain value G _c from the first multiplier 621 of the base layer 600 is received, the fixed codebook contribution calculator 631 is expressed by Equation 15: The fixed codebook contribution y ₂ (n) is calculated.

수학식 15에서 N은 부프레임의 크기를 구성하는 샘플수에 따라 결정된다. 따라서, 피치 기여도 제거부(615)에서 설명한 바와 같이 부프레임의 크기가 40샘플인 경우에 N은 40이다. h(n)은 합성 필터의 임펄스 응답이다. 고정 코드북 기여도 계산부(631)에서 계산된 고정 코드북 기여도는 라인(632)을 통해 제 3 가산기(633)로 제공된다.In Equation 15, N is determined according to the number of samples constituting the size of the subframe. Therefore, as described in the pitch contribution removing unit 615, N is 40 when the size of the subframe is 40 samples. h (n) is the impulse response of the synthesis filter. The fixed codebook contribution calculated by the fixed codebook contribution calculator 631 is provided to the third adder 633 through the line 632.

제 3 가산기(633)는 라인(616)을 통해 제공되는 기본 계층(600)의 고정 코드북 탐색을 위해 요구되는 대상 신호에서 라인(632)을 통해 제공되는 고정 코드북 기여도와 라인(635)을 통해 합성 필터(634)로부터 제공되는 합성 신호를 제거한 신호를 출력한다. The third adder 633 synthesizes via line 635 the fixed codebook contribution provided through line 632 in the target signal required for fixed codebook search of base layer 600 provided through line 616. A signal from which the synthesized signal provided from the filter 634 is removed is output.

합성 필터(634)는 라인(647)을 통해 고정 코드북의 벡터에 양자화된 음질 향상 계층(630)의 양자화된 고정 코드북 이득값

가 승산된 고정 코드북이 입력되면, LPC 계수 추출 및 벡터 양자화기(604)에서 추출된 양자화된 LPC 계수를 사용하여 상기 입력되는 고정 코드북 신호를 합성한 신호를 출력한다. Synthesis filter 634 is a quantized fixed codebook gain value of the sound quality enhancement layer 630 quantized to a vector of fixed codebooks via line 647.

When a multiplying fixed codebook is input, a signal obtained by synthesizing the input fixed codebook signal is output using the quantized LPC coefficient extracted by the LPC coefficient extraction and the vector quantizer 604.

인지 가중 필터(637)는 라인(636)을 통해 입력되는 신호를 인지 가중 필터(610)와 같이 인지 가중 필터링하여 음질 향상 계층(630)에서 고정 코드북 탐색을 위해 요구되는 대상 신호를 출력한다. 대상신호는 라인(638)을 통해 고정 코 드북 탐색부(639)로 전송된다. The cognitive weighting filter 637 performs cognitive weighting filtering on the signal input through the line 636 like the cognitive weighting filter 610, and outputs a target signal required for fixed codebook search in the sound quality enhancement layer 630. The target signal is transmitted through the line 638 to the fixed codebook search unit 639.

고정 코드북 탐색부(639)는 기본 계층(600)의 고정 코드북 탐색부(617)와 같이 입력되는 대상 신호를 토대로 고정 코드북을 탐색하여 고정 코드북의 인덱스와 이득값을 얻는다. 얻어진 고정 코드북의 인덱스는 라인(640)을 통해 다중화기(650)로 전송되면서 고정 코드북(641)으로 전송된다. 상기 고정 코드북의 이득값 G_CE는 라인(642)을 통해 이득값 차 양자화기(643)로 전송된다. The fixed codebook search unit 639 searches the fixed codebook based on the input signal, such as the fixed codebook search unit 617 of the base layer 600, to obtain an index and a gain value of the fixed codebook. The index of the obtained fixed codebook is transmitted to the multiplexer 650 through the line 640 and to the fixed codebook 641. The gain value G _CE of the fixed codebook is transmitted to a gain value quantizer 643 via line 642.

고정 코드북(641)은 입력된 고정 코드북 인덱스를 토대로 음질 향상 계층(630)의 고정 코드북 벡터를 출력한다. 고정 코드북 벡터는 펄스의 위치 정보(m)와 부호 정보(s)를 사용하여 구성할 수 있다. 고정 코드북(641)에서 출력되는 고정 코드북 벡터는 제 3 승산기(644)로 제공된다. 기본 계층(600)의 고정 코드북(619)에서 출력되는 고정 코드북 벡터의 펄스의 위치와 음질 향상 계층(630)의 고정 코드북(641)에서 출력되는 고정 코드북 벡터의 펄스의 위치는 동일할 수 있다. The fixed codebook 641 outputs a fixed codebook vector of the sound quality enhancement layer 630 based on the input fixed codebook index. The fixed codebook vector can be constructed using the position information m of the pulse and the sign information s. The fixed codebook vector output from the fixed codebook 641 is provided to the third multiplier 644. The position of the pulse of the fixed codebook vector output from the fixed codebook 619 of the base layer 600 and the position of the pulse of the fixed codebook vector output from the fixed codebook 641 of the sound quality enhancement layer 630 may be the same.

이득값 차 양자화기(643)는 기본 계층(600)의 이득값 양자화기(629)로부터 출력되는 고정 코드북의 이득값을 양자화한 이득값 G_C와 음질 향상 계층(630)의 고정 코드북 탐색부(639)로부터 출력되는 고정 코드북의 양자화되지 않은 이득 값 G_CE간의 로그 스케일 차 값을 이용하여 음질 향상 계층(630)의 고정 코드북 이득값 G_CE를 양자화하며, 양자화된 이득값

를 출력한다. The gain difference quantizer 643 may include a gain value G _C obtained by quantizing a gain value of the fixed codebook output from the gain value quantizer 629 of the base layer 600 and a fixed codebook search unit of the sound quality enhancement layer 630 ( 639) using a log scale difference between the gain value G _CE of the unquantized fixed codebook output and the quantized fixed codebook gain value G _CE of improved quality layer 630 from the quantized gain value

Outputs

도 7은 이득값 차 양자화기(643)의 바람직한 실시 예를 나타낸 블록도이다. 이득값 양자화기(643)는 제 1 로그 스케일 변환부(702), 제 2 로그 스케일 변환부(706), 제 4 및 제 5 승산기(708, 711) 및 제 4 가산기(704)를 포함한다. 7 is a block diagram illustrating a preferred embodiment of a gain difference quantizer 643. The gain value quantizer 643 includes a first log scale converter 702, a second log scale converter 706, fourth and fifth multipliers 708 and 711, and a fourth adder 704.

기본 계층(600)의 이득값 양자화기(629)에 의해 제공되는 양자화된 고정 코드북 이득값(G_C)이 라인(701)을 통해 입력되면, 제 1 로그 스케일 변환부(702)는 고정 코드북 이득값(Gc)에 대응되는 로그 스케일 변환된 고정 코드북 이득 값을 라인(703)을 통해 출력한다. When the quantized fixed codebook gain value G _C provided by the gain value quantizer 629 of the base layer 600 is input through the line 701, the first log scale converter 702 may use the fixed codebook gain. A log scale converted fixed codebook gain value corresponding to the value Gc is output through the line 703.

음질 향상 계층(630)의 고정 코드북 탐색부(639)로부터 출력되는 양자화 되지 않은 이득 값(G_CE)이 라인(705)을 통해 입력되면, 제 2 로드 스케일 변환부(706)에 의하여 로그 스케일 변환된 고정 코드북 이득 값을 라인(707)을 통해 출력한다.When the non-quantized gain value G _CE output from the fixed codebook search unit 639 of the sound quality enhancement layer 630 is input through the line 705, the log scale conversion is performed by the second load scale converter 706. The fixed codebook gain value is output via the line 707.

제 4 승산기(708)는 라인(707)을 통해 입력되는 고정 코드북 이득값에 이득값 차 조정값

를 승산하고, 승산된 결과를 라인(708)을 통해 출력한다. The fourth multiplier 708 adjusts the gain difference adjustment value to the fixed codebook gain value input through the line 707.

Multiply by and output the multiplied result via line 708.

제 4 가산기(704)는 라인(703)을 통해 입력되는 고정 코드북 이득 값과 라인(708)을 통해 입력되는 고정 코드북 이득 값을 간의 차이값을 라인(710)을 통해 출력한다. The fourth adder 704 outputs a difference value between the fixed codebook gain value input through the line 703 and the fixed codebook gain value input through the line 708 via the line 710.

제 5 승산기(711)는 입력되는 이득값 차에 스케일 확장 요소(10)를 승산하여 로그 스케일 이득 값 차(G_DIFF)(712)를 생성한다. The fifth multiplier 711 multiplies the scale difference element 10 by the input gain value difference to generate a log scale gain value difference G _DIFF 712.

상술한 이득값 차 양자화(643)의 동작 과정은 수학식 16과 같이 정의할 수 있다. The above-described operation of gain difference quantization 643 may be defined as in Equation 16 below.

수학식 16에서 G_c는 이득값 양자화기(629)에 의하여 양자화된 고정 코드북의 이득값이고, G_CE는 고정 코드북 탐색부(639)로부터 출력되는 양자화되지 않은 이득값이다. 또한, 이득값 차 조정 값

는 로그 스케일 이득값간 차이값의 동적 범위가 최소가 되도록 하는 조정 값이다. 이득값 차 조정 값은 음성 부호화기의 종류에 따라 어떠한 값이 될 수도 있으며 실 예로 0.987이 사용된다. In Equation 16, G _c is a gain value of the fixed codebook quantized by the gain value quantizer 629, and G _CE is an unquantized gain value output from the fixed codebook search unit 639. In addition, the gain difference adjustment value

Is an adjustment value such that the dynamic range of the difference between logarithmic scale gain values is minimized. The gain difference adjustment value may be any value depending on the type of speech coder. For example, 0.987 is used.

수학식 16와 같은 과정을 거쳐 생성된 로그 스케일 이득값 차(712)는 아날로그 신호이므로 3비트 스칼라 양자화기에 의하여 양자화된다. 3비트 스칼라 양자화기에 의해 양자화 된 결과를 이용하여 양자화된 음질 향상 계층(630)의 고정 코드북 이득값

를 출력한다. 상기 양자화된 이득값

는 라인(645)을 통해 제 3 승산기(644)로 출력되면서 라인(646)을 통해 다중화기(650)로 출력된다. The log scale gain difference 712 generated through the process as shown in Equation 16 is an analog signal and thus is quantized by a 3-bit scalar quantizer. Fixed codebook gain value of quantized sound quality enhancement layer 630 using the result quantized by 3-bit scalar quantizer

Outputs The quantized gain value

Is output to the multiplexer 650 via line 646 while being output to third multiplier 644 via line 645.

제 3 승산기(644)는 고정 코드북(641)으로부터 제공되는 고정 코드북 벡터에 이득값 차 양자화기(643)로부터 제공되는 양자화된 음질 향상 계층(6300의 고정 코드북 이득값

를 승산하고, 승산 결과를 라인(647)을 합성 필터(634)로 제공한다. The third multiplier 644 is a fixed codebook gain value of the quantized sound quality enhancement layer 6300 provided from the gain value difference quantizer 643 to a fixed codebook vector provided from the fixed codebook 641.

Multiply by < RTI ID = 0.0 > and < / RTI > provide a line 647 to the synthesis filter 634.

다중화기(650)는 기본 계층(600)으로부터 제공되는 LPC 계수 양자화 정보, 고정 코드북 인덱스, 적응 코드북 인덱스, 이득값 양자화 정보와 음질 향상 계층(630)으로부터 제공되는 음질 향상 계층의 고정 코드북 인덱스, 이득값 차 양자화 정보를 비트 스트림으로 출력한다. The multiplexer 650 is a fixed codebook index, a gain of the LPC coefficient quantization information, a fixed codebook index, an adaptive codebook index, a gain value quantization information provided from the base layer 600 and a sound quality enhancement layer provided from the sound quality enhancement layer 630. The value difference quantization information is output as a bit stream.

기본 계층(600)과 음질 향상 계층(630)의 비트 스트림은 구분하여 전송된다. 즉, 도 6에 도시된 바와 같이 음질 향상 계층(630)의 비트 스트림은 기본 계층(600)의 비트 스트림 뒤에 전송된다. 이에 따라 상기 비트 스트림은 네트워크 트래픽 상태에 따라 복호화 장치에 필요한 비트율로 쉽게 분리될 수 있다. 예를 들어 복호화 장치측의 채널 특성이 열악하여 기본 계층의 비트 스트림만 수신할 수 있는 경우에, 상기 복호화 장치는 도 6의 비트율 확장 음성 부호화 장치가 송출하는 비트 스트림에서 기본 계층의 비트 스트림만 수신할 수 있다.The bit streams of the base layer 600 and the sound quality enhancement layer 630 are transmitted separately. That is, as shown in FIG. 6, the bit stream of the sound quality enhancement layer 630 is transmitted after the bit stream of the base layer 600. Accordingly, the bit stream can be easily separated at the bit rate required for the decoding apparatus according to the network traffic conditions. For example, when the channel characteristic of the decoding apparatus is poor and only the bit stream of the base layer can be received, the decoding apparatus receives only the bit stream of the base layer from the bit stream transmitted by the bit rate extension speech encoding apparatus of FIG. 6. can do.

도 8은 본 발명의 바람직한 다른 실시 예에 따른 비트율 확장 음성 복호화 장치의 블록도이다. 도 8을 참조하면, 상기 비트율 확장 음성 복호화 장치는 역다중화기(802), LPC 계수 복호화부(803), 이득값 복호화부(804), 제 1 고정 코드북 복호화부(805), 적응 코드북 복호화부(806), 이득값 차 복호화부(807), 제 2 고정 코드북 복호화부(808), 승산기들(809, 810, 813), 가산기들(811, 814), 선택 스위치(812), 합성 필터(815), 및 후처리부(816)를 포함한다. 8 is a block diagram of a bit rate extended speech decoding apparatus according to another exemplary embodiment of the present invention. Referring to FIG. 8, the apparatus for decoding a bit rate extension speech includes a demultiplexer 802, an LPC coefficient decoder 803, a gain value decoder 804, a first fixed codebook decoder 805, and an adaptive codebook decoder ( 806, gain difference decoder 807, second fixed codebook decoder 808, multipliers 809, 810, 813, adders 811, 814, selection switch 812, synthesis filter 815 And a post-processing unit 816.

역다중화기(802)는 수신되는 비트 스트림(801)을 각 구성 요소(element)의 정보로 역다중화하여 출력한다. 즉, 역다중화기(802)는 LPC 계수 양자화 정보를 LPC 계수 복호화부(803)로, 이득값 양자화 정보는 이득값 복호화부(804)로, 이득값 차 양자화 정보는 이득값 차 복호화부(807)로, 음질 향상 계층(630)의 고정 코드북 인덱스는 제 2 고정 코드북 복호화부(808), 기본 계층(600)의 고정 코드북 인덱스는 제 1 고정 코드북 복호화부(805)로, 적응 코드북 인덱스는 적응 코드북 복호화부(806)로 각각 제공한다. The demultiplexer 802 demultiplexes the received bit stream 801 with information of each element and outputs the demultiplexer. That is, the demultiplexer 802 converts the LPC coefficient quantization information to the LPC coefficient decoder 803, the gain value quantization information to the gain value decoder 804, and the gain difference quantization information to the gain value difference decoder 807. The fixed codebook index of the sound quality enhancement layer 630 is the second fixed codebook decoder 808, the fixed codebook index of the base layer 600 is the first fixed codebook decoder 805, and the adaptive codebook index is the adaptive codebook. Each is provided to the decoding unit 806.

LPC 계수 복호화부(803)의 구조는 부호화 장치측의 LPC 계수 추출 및 벡터 양자화기(604)에 의해 결정되고, 입력되는 LPC 계수 양자화 정보로부터 LPC 계수를 복원한다. 복원된 LPC 계수는 합성 필터(815)와 후처리부(816)로 제공된다. The structure of the LPC coefficient decoder 803 is determined by the LPC coefficient extraction and vector quantizer 604 on the encoding apparatus side, and restores the LPC coefficients from the input LPC coefficient quantization information. The restored LPC coefficients are provided to the synthesis filter 815 and the post processor 816.

이득값 복호화부(804)의 구조는 부호화 장치측의 이득값 양자화기(629)에 의해 결정된다. 이득값 복호화부(804)는 입력되는 이득값 양자화 정보를 디코딩한다. 상기 이득값 양자화 정보는 적응 코드북 이득값과 고정 코드북 이득값을 포함한다. 따라서, 이득값 복호화부(804)로부터 기본 계층(600)에서의 적응 코드북 이득값 G_P와 고정 코드북 이득값 G_C가 각각 출력된다. The structure of the gain value decoder 804 is determined by a gain value quantizer 629 on the encoding device side. The gain value decoder 804 decodes the input gain value quantization information. The gain value quantization information includes an adaptive codebook gain value and a fixed codebook gain value. Accordingly, the adaptive codebook gain value G _P and the fixed codebook gain value G _C in the base layer 600 are output from the gain value decoding unit 804, respectively.

제 1 고정 코드북 복호화부(805)는 입력되는 제 1 고정 코드북 인덱스를 디코딩하여 제 1 고정 코드북을 출력한다. 고정 코드북 복호 방식은 부호화장치의 고정 코드북 탐색부(617)에서의 탐색방식에 의해 결정된다. The first fixed codebook decoder 805 decodes an input first fixed codebook index and outputs a first fixed codebook. The fixed codebook decoding method is determined by the search method in the fixed codebook search unit 617 of the encoding apparatus.

적응 코드북 복호화부(806)는 입력되는 적응 코드북 인덱스를 디코딩하여 적응 코드북을 출력한다. The adaptive codebook decoder 806 decodes an input adaptive codebook index and outputs an adaptive codebook.

상술한 LPC 계수 복호화부(803), 이득값 복호화부(804), 고정 코드북 복호화 부(805), 및 적응 코드북 복호화부(806)는 역다중화기(802)로부터 전송되는 기본 계층(600)에서의 부호화 정보를 디코딩하는 복호화 유니트로 정의될 수 있다. The LPC coefficient decoder 803, the gain value decoder 804, the fixed codebook decoder 805, and the adaptive codebook decoder 806 are described in the base layer 600 transmitted from the demultiplexer 802. It may be defined as a decoding unit for decoding the encoding information.

이득값 차 복호화부(807)와 제 2 고정 코드북 복호화부(808)의 동작은 네트워크 트랙픽 상태나 수신 단말의 처리 용량에 의존한다. The operation of the gain value difference decoder 807 and the second fixed codebook decoder 808 depends on the network traffic state or the processing capacity of the receiving terminal.

만약 이득값 차 복호화부(807)와 제 2 고정 코드북 복호화부(808)가 동작되는 것으로 결정되면, 이득값 차 복호화부(807)는 입력되는 이득값 차 양자화 정보를 디코딩한다. 제 2 고정 코드북 복호화부(808)는 입력되는 제 2 고정 코드북 인덱스를 디코딩한다. 이득값 차 복호화 방식은 부호화 장치측의 이득값 차 양자화기(643)에 의해 결정된다. If it is determined that the gain value difference decoder 807 and the second fixed codebook decoder 808 are operated, the gain value difference decoder 807 decodes the input gain value difference quantization information. The second fixed codebook decoder 808 decodes the input second fixed codebook index. The gain value difference decoding method is determined by the gain value difference quantizer 643 on the encoding device side.

제 2 고정 코드북 복호화부(808)에서의 디코딩 방식은 부호화장치측의 제 2 고정 코드북 탐색부(631)에 의해 결정된다. 이득값 차 복호화부(807)와 제 2 고정 코드북 복호화부(808)는 역다중화기(902)로부터 전송되는 음질 향상 계층(630)에서의 부호화 정보를 디코딩하는 복호화 유니트로 간주될 수 있다. The decoding method of the second fixed codebook decoder 808 is determined by the second fixed codebook search unit 631 on the encoding apparatus side. The gain value difference decoder 807 and the second fixed codebook decoder 808 may be regarded as a decoding unit for decoding the encoding information in the sound quality enhancement layer 630 transmitted from the demultiplexer 902.

승산기(809)는 이득값 복호화부(804)에 의하여 복원된 기본 계층(600)의 고정 코드북 이득값 Gc을 제 1 고정 코드북 복호화부(805)에 의하여 출력된 기본 계층의 고정 코드북에 승산하여 기본 계층의 고정 코드북 벡터를 출력한다.The multiplier 809 multiplies the fixed codebook gain value Gc of the base layer 600 restored by the gain value decoding unit 804 to the fixed codebook of the base layer output by the first fixed codebook decoding unit 805. Output a fixed codebook vector of the hierarchy.

승산기(810)는 이득값 차 복호화부(807)에 의하여 복원된 음질 향상 계층(630)에서의 고정 코드북 이득값

를 제 2 고정 코드북 복호화부(808)에 의하여 출력된 음질 향상 계층의 고정 코드북에 승산하여 음질 향상 계층의 고정 코드북 벡터를 출력한다.The multiplier 810 is a fixed codebook gain value in the sound quality enhancement layer 630 reconstructed by the gain value difference decoder 807.

Is multiplied by the fixed codebook of the sound quality enhancement layer output by the second fixed codebook decoder 808 to output a fixed codebook vector of the sound quality enhancement layer.

가산기(811)는 승산기(809)로부터 출력되는 기본 계층의 고정 코드북 벡터와 승산기(810)로부터 출력되는 음질 향상 계층의 고정 코드북 벡터를 가산한다. 이에 따라 복호화 장치에서의 고정 코드북 펄스는 기본 계층과 음질 향상 계층의 대수 코드북을 누적시켜 다중 크기를 갖는 대수 코드북 펄스 구조를 갖는다. 상기 대수 코드북을 누적시키는 것은 고정 코드북의 모든 펄스의 크기가 같은 크기를 갖는 기존의 고정 코드북 구조에서 발생되는 단점을 보완하기 위한 것이다. The adder 811 adds the fixed codebook vector of the base layer output from the multiplier 809 and the fixed codebook vector of the sound quality enhancement layer output from the multiplier 810. Accordingly, the fixed codebook pulse in the decoding apparatus has an algebraic codebook pulse structure having multiple magnitudes by accumulating algebraic codebooks of the base layer and the sound quality enhancement layer. Accumulating the algebraic codebooks is to compensate for the disadvantages that occur in the existing fixed codebook structure in which all pulses of the fixed codebook have the same magnitude.

선택 스위치(812)는 가산기(811)로부터 출력되는 신호와 승산기(809)로부터 출력되는 기본 계층의 고정 코드북 벡터를 선택적으로 전송한다. 즉, 상기 복호화 장치가 음질 향상 계층에서 동작되지 않을 경우에, 선택 스위치(812)는 승산기(809)로부터 출력되는 기본 계층의 고정 코드북 벡터를 선택하여 전송한다. 상기 부호화 장치가 음질향상 계층에서 동작할 경우에, 선택 스위치(812)는 가산기(811)로부터 출력되는 신호를 전송한다. The selection switch 812 selectively transmits the signal output from the adder 811 and the fixed codebook vector of the base layer output from the multiplier 809. That is, when the decoding apparatus is not operated in the sound quality enhancement layer, the selection switch 812 selects and transmits the fixed codebook vector of the base layer output from the multiplier 809. When the encoder operates in the sound quality enhancement layer, the selection switch 812 transmits a signal output from the adder 811.

승산기(813)는 적응 코드북 복호화부(806)로부터 출력되는 디코딩된 적응 코드북에 이득값 복호화부(804)로부터 출력되는 적응 코드북의 이득값 G_p를 승산하여 적응 코드북 벡터를 출력한다. The multiplier 813 multiplies the decoded adaptive codebook output from the adaptive codebook decoding unit 806 by the gain value G _p of the adaptive codebook output from the gain value decoding unit 804 to output an adaptive codebook vector.

가산기(814)는 선택 스위치(812)에 의해 선택된 고정 코드북 벡터와 승산기(813)로부터 출력되는 적응 코드북 벡터를 가산하여 복원된 여기 신호를 발생한다. The adder 814 adds the fixed codebook vector selected by the selection switch 812 and the adaptive codebook vector output from the multiplier 813 to generate a reconstructed excitation signal.

상술한 승산기(810), 가산기(811) 및 선택 스위치(812)는 상술한 기본 계층 의 부호화 정보를 복호화하는 유니트와 음질 향상 계층의 부호화 정보를 복호화하는 유니트에서 각각 디코딩된 신호를 상기 복호화 장치의 동작환경에 따라 연산하는 연산 유니트로 정의될 수 있다. The multiplier 810, the adder 811, and the selection switch 812 may decode the decoded signal in the unit for decoding the encoding information of the base layer and the unit for decoding the encoding information of the sound quality enhancement layer. It can be defined as a calculation unit that operates according to the operating environment.

합성 필터(815)는 LPC 계수 복호화부(803)로부터 제공되는 복원된 LPC를 이용하여 가산기(814)로부터 제공되는 여기 신호를 합성하여 음성신호를 복원한다. The synthesis filter 815 synthesizes the excitation signal provided from the adder 814 using the reconstructed LPC provided from the LPC coefficient decoder 803 to reconstruct the speech signal.

후처리부(816)는 합성 필터(815)로부터 전송되는 음성신호를 복원한다. 즉, 후처리부(816)는 음성 신호를 복원하기 위하여, LPC 계수 복호화부(803)로부터 제공되는 LPC를 이용하여 합성 필터(815)로부터 출력되는 신호를 필터링 하기 위한 하이패스 필터(High Pass Filtering)를 사용한다. The post processor 816 restores the voice signal transmitted from the synthesis filter 815. That is, the post processor 816 uses a LPC provided from the LPC coefficient decoder 803 to recover the speech signal, and then uses a high pass filter for filtering the signal output from the synthesis filter 815. Use

상술한 합성 필터(815)와 후처리부(816)는 상기 연산 유니트로부터 출력되는 신호를 LPC 계수 복호화부(803)로부터 출력되는 LPC와 합성하여 음성신호를 복원하는 복원 유니트로 정의될 수 있다. The synthesis filter 815 and the post processor 816 described above may be defined as a reconstruction unit for reconstructing a voice signal by synthesizing the signal output from the operation unit with the LPC output from the LPC coefficient decoder 803.

도 9는 도 6의 음성 신호 부호화 장치에서 기본 계층의 고정 코드북 탐색(901)에 의해 탐색된 펄스의 위치와 음질 향상 계층의 고정 코드북 탐색(905)에 의해 탐색된 펄스의 위치에 기초한 고정 코드북 벡터를 이용하여 도 8의 음성 신호 복호화 장치에서 복원되는 펄스의 크기를 설명하기 위한 도면이다. 9 is a fixed codebook vector based on the position of the pulse searched by the fixed codebook search 901 of the base layer and the position of the pulse searched by the fixed codebook search 905 of the sound quality enhancement layer in the speech signal encoding apparatus of FIG. 8 is a view for explaining the magnitude of a pulse restored in the audio signal decoding apparatus of FIG.

도 9를 참조하면, 제 1 고정 코드북 복호화부(805)에서 제공되는 고정 코드북 벡터(902)에 이득값 복호화부(804)에서 제공되는 고정 코드북 이득값(G_c)이 승산기(809)에 의하여 승산되어 이득값이 승산된 기본 계층 고정 코드북 벡터(904)가 생성된다. 9, a fixed codebook gain value G _c provided by the gain value decoding unit 804 is provided by a multiplier 809 to a fixed codebook vector 902 provided by the first fixed codebook decoding unit 805. A base layer fixed codebook vector 904 is generated that is multiplied and multiplied by a gain value.

제 2 고정 코드북 복호화부(808)에서 제공되는 고정 코드북 벡터(906)에 이득값 차 복호화부(807)에서 제공되는 이득값(G_CE)이 승산기(810)에 의하여 승산되어 이득값이 승산된 음질 향상 계층 고정 코드북 벡터(908)가 생성된다. 가산기(811)는 음질 향상 계층 고정 코드북 벡터(908)와 기본 계층 고정 코드북 벡터(904)를 가산한 고정 코드북 벡터(910)를 생성한다. The gain value G _CE provided by the gain value difference decoder 807 is multiplied by the multiplier 810 to the fixed codebook vector 906 provided by the second fixed codebook decoder 808 to multiply the gain value. A sound quality enhancement layer fixed codebook vector 908 is generated. The adder 811 generates a fixed codebook vector 910 obtained by adding the sound quality enhancement layer fixed codebook vector 908 and the base layer fixed codebook vector 904.

도 9에서 생성되는 펄스의 구조를 토대로 알 수 있는 바와 같이 기본 계층 고정 코드북 벡터(904)와 음질 향상 계층 고정 코드북 벡터(908)는 가산기(811)로 입력되어 두 벡터가 가산된 최종 음질 향상 계층 고정 코드북(910)을 생성한다. 최종 음질 향상 계층 고정 코드북(910)은 이득 값이 다른 두 개의 고정 코드북 벡터가 더해져 구성되었기 때문에 다중 크기를 갖는 고정 코드북을 형성할 수 있어 보다 좋은 음질을 제공할 수 있다. As can be seen based on the structure of the pulse generated in FIG. 9, the base layer fixed codebook vector 904 and the sound quality enhancement layer fixed codebook vector 908 are input to the adder 811 to add the final sound quality enhancement layer to which the two vectors are added. Generate a fixed codebook 910. Since the final sound quality enhancement layer fixed codebook 910 is formed by adding two fixed codebook vectors having different gain values, it is possible to form a fixed codebook having multiple sizes, thereby providing better sound quality.

제 1001 단계에서 음성신호 부호화 장치는 도 6의 전처리 유니트(602)와 같이 입력된 음성 신호를 전 처리한다. 제 1002 단계에서 음성신호 부호화 장치는 전처리 된 음성 신호에서 LPC 계수를 추출하고, 추출된 LPC 계수의 양자화 정보를 생성한다.In operation 1001, the apparatus for encoding a speech signal preprocesses an input speech signal as in the preprocessing unit 602 of FIG. 6. In operation 1002, the apparatus for encoding a speech signal extracts LPC coefficients from a preprocessed speech signal and generates quantization information of the extracted LPC coefficients.

제 1003 단계에서 음성신호 부호화 장치는 상기 전 처리된 신호에서 합성 필 터(606)를 거쳐 LPC 계수의 잔 차 신호(residual signal)를 검출한다. 제 1004 단계에서 음성 신호 부호화 장치는 검출된 잔차 신호를 도 6의 인지 가중 필터(610)에서와 같이 필터링하여 인지 가중된 신호를 출력한다. In operation 1003, the apparatus for encoding an audio signal detects a residual signal of the LPC coefficients through the synthesis filter 606 in the preprocessed signal. In operation 1004, the apparatus for encoding a speech signal filters the detected residual signal as in the cognitive weighting filter 610 of FIG. 6 to output a cognitive weighted signal.

제 1005 단계에서 음성 신호 부호화 장치는 인지 가중된 신호의 피치를 도 6의 피치 분석부(612)와 같이 분석하고, 분석된 결과를 이용하여 상기 인지 가중된 신호에서 피치 기여도를 도 6의 피치 기여도 제거부(615)와 같이 제거하여 적응 코드북 이득값과 적응 코드북 인덱스를 생성한다. In operation 1005, the apparatus for encoding a speech signal analyzes the pitch of the cognitive weighted signal as in the pitch analyzer 612 of FIG. 6, and calculates the pitch contribution of the cognitive weighted signal using the analyzed result. The removal unit 615 removes the adaptive codebook gain value and the adaptive codebook index.

제 1006 단계에서 음성 신호 부호화 장치는 도 6의 기본 계층(600)의 고정 코드북 탐색부(617)에서와 같이 기본 계층 고정 코드북을 탐색하여 고정 코드북 이득값과 고정 코드북 인덱스를 생성한다. In operation 1006, the apparatus for encoding a speech signal searches for a base layer fixed codebook as in the fixed codebook search unit 617 of the base layer 600 of FIG. 6 to generate a fixed codebook gain value and a fixed codebook index.

제 1007 단계에서 음성 신호 부호화 장치는 도 6의 이득값 양자화기(629)에서와 같이 상기 검출된 고정 코드북 이득값과 상기 검출된 적응 코드북 이득값을 양자화 한다. In operation 1007, the apparatus for encoding a speech signal quantizes the detected fixed codebook gain value and the detected adaptive codebook gain value as in the gain value quantizer 629 of FIG. 6.

제 1008 단계에서 음성 신호 부호화 장치는 벡터 양자화된 LPC계수를 이용하여 기본 계층(600)에서 생성된 고정 코드북 벡터와 적응 코드북 벡터의 여기 신호(excitation signal)를 도 6의 합성 필터(606)에서와 같이 합성한다.In operation 1008, the apparatus for encoding a speech signal may extract the excitation signals of the fixed codebook vector and the adaptive codebook vector generated in the base layer 600 using the vector quantized LPC coefficients from the synthesis filter 606 of FIG. 6. Synthesize together.

제 1009 단계에서 음성 신호 부호화 장치는 기본 계층(600)에서의 고정 코드북 탐색을 위한 대상 신호의 영향과 음질 향상 계층(630)의 이전의 LPC 합성 신호를 제거함으로써 도 6의 고정 코드북 탐색부(639)에서와 같은 고정 코드북 탐색을 위한 대상 신호를 생성한다. 즉, 기본 계층(600)에서 검출된 대상 신호에서 기본 계층의 고정 코드북 기여도와 음질 향상 계층(630)에서 검출된 이전의 LPC 합성 신호를 제거한 신호를 음질 향상 계층에서의 대상 신호로 한다. In operation 1009, the apparatus for encoding a speech signal removes the influence of the target signal for the fixed codebook search in the base layer 600 and the previous LPC synthesis signal of the sound quality enhancement layer 630. Generate a target signal for fixed codebook search as in < RTI ID = 0.0 > That is, a signal obtained by removing the fixed codebook contribution of the base layer and the previous LPC synthesis signal detected by the sound quality enhancement layer 630 from the target signal detected by the base layer 600 is used as the target signal in the sound quality enhancement layer.

제 1010 단계에서 음성 신호 부호화 장치는 제 1009 단계에서 검출된 대상 신호를 이용하여 음질 향상 계층(630)의 고정 코드북 탐색을 수행하여 음질 향상 계층의 고정 코드북 이득값과 음질 향상 계층의 고정 코드북 인덱스를 각각 생성한다. In operation 1010, the apparatus for encoding a speech signal performs fixed codebook search of the sound quality enhancement layer 630 using the target signal detected in operation 1009 to obtain a fixed codebook gain value of the sound quality enhancement layer and a fixed codebook index of the sound quality enhancement layer. Create each.

제 1011 단계에서 음성 신호 부호화 장치는 기본 계층의 양자화 된 고정 코드북의 이득값과 음질 향상 계층의 양자화 되지 않은 고정 코드북 이득값 간의 로그 스케일 차(log scale difference)를 양자화 한다. 상술한 음질 향상 계층에서의 고정 코드북 탐색 및 이득값 양자화 과정은 복수개의 음질 향상 계층이 구비됨에 따라 복수 회 수행될 수 있다. 음질 향상 계층 처리가 복수 회 수행되면, 그만큼 복원되는 음성 신호의 질이 향상될 수 있다.In operation 1011, the apparatus for encoding a speech signal quantizes a log scale difference between a gain value of the quantized fixed codebook of the base layer and an unquantized fixed codebook gain value of the sound quality enhancement layer. The fixed codebook search and gain value quantization processes in the above-described sound quality enhancement layer may be performed a plurality of times as a plurality of sound quality enhancement layers are provided. When the sound quality enhancement layer processing is performed a plurality of times, the quality of the speech signal restored by that can be improved.

제 1012 단계에서 음성 신호 부호화 장치는 음질 향상 계층에서 생성된 고정 코드북 벡터(또는 여기 신호)를 도 6의 합성 필터(634)에 통과시켜 합성된 신호를 출력한다.In operation 1012, the apparatus for encoding a speech signal passes the fixed codebook vector (or excitation signal) generated in the sound quality enhancement layer and passes through the synthesis filter 634 of FIG. 6 to output the synthesized signal.

제 1013 단계에서 음성 신호 부호화 장치는 상술한 단계들을 통해 얻은 선형 예측 계수 양자화 정보, 기본 계층의 고정 코드북 인덱스, 기본 계층의 적응 코드북 인덱스, 기본 계층의 고정 코드북의 이득값, 기본 계층의 적응 코드북의 이득값, 음질 향상 계층의 고정 코드북 인덱스 및 상기 이득값 차 양자화 정보를 비트 스트림 형태로 다중화하여 음성신호 복호화 장치측으로 송출한다. In operation 1013, the apparatus for encoding a speech signal includes linear prediction coefficient quantization information obtained through the above-described steps, a fixed codebook index of a base layer, an adaptive codebook index of a base layer, a gain value of a fixed codebook of a base layer, and an adaptive codebook of a base layer. The gain value, the fixed codebook index of the sound quality enhancement layer, and the gain value quantization information are multiplexed in the form of a bit stream and sent to the voice signal decoding apparatus.

도 11은 본 발명의 바람직한 다른 실시 예에 따른 비트율 확장 음성 복호화 방법의 동작 흐름도이다. 11 is a flowchart illustrating an operation of a bit rate extended speech decoding method according to another exemplary embodiment of the present invention.

제 1101 단계에서 음성 신호 복호화 장치는 도 8의 역다중화기(802)와 같이 수신되는 비트 스트림을 각 구성의 정보로 역다중화한다. In operation 1101, the apparatus for decoding a speech signal demultiplexes the received bit stream into information of each component, such as the demultiplexer 802 of FIG. 8.

제 1102 단계에서 음성 신호 복호화 장치는 상기 역다중화된 신호를 디코딩한다. 즉, 도 8의 LPC 계수 복호화부(803), 이득값 복호화부(804), 제 1 고정 코드북 복호화부(805), 적응 코드북 복호화부(806), 이득값 차 복호화부(807), 제 2 고정 코드북 복호화부(808)와 같이 상기 역다중화 된 신호를 디코딩한다. In operation 1102, the voice signal decoding apparatus decodes the demultiplexed signal. That is, the LPC coefficient decoder 803, the gain value decoder 804, the first fixed codebook decoder 805, the adaptive codebook decoder 806, the gain value difference decoder 807, and the second of FIG. Like the fixed codebook decoder 808, the demultiplexed signal is decoded.

제 1103 단계에서 음성 신호 복호화 장치는 음성신호 복호화 장치의 동작 조건에 따라 음질 향상 계층의 고정 코드북과 기본 계층의 고정 코드북을 선택적으로 전송하고, 이득값도 선택적으로 전송된다. 즉, 음성 신호 복호화 장치가 음질 향상 계층에서 동작되면, 복원된 음질 향상 계층의 고정 코드북의 이득값이 승산된 음질 향상 계층의 고정 코드북과 기본 계층의 고정 코드북에 기본 계층의 고정 코드북의 이득값이 승산된 고정 코드북을 가산하여 전송시킨다. 반면에 음성 신호 부호화 장치가 음질 향상 계층에서 동작되지 않으면, 복호화된 기본 계층의 고정 코드북에 기본 계층의 고정 코드북의 이득값을 승산한 고정 코드북을 전송시킨다. In operation 1103, the voice signal decoding apparatus selectively transmits the fixed codebook of the sound quality enhancement layer and the fixed codebook of the base layer according to the operating conditions of the voice signal decoding apparatus, and optionally the gain value. That is, when the speech signal decoding apparatus is operated in the sound quality enhancement layer, the gain value of the fixed codebook of the base layer is added to the fixed codebook of the sound quality enhancement layer and the base layer fixed codebook multiplied by the gain value of the fixed codebook of the restored sound quality enhancement layer. The multiplied fixed codebook is added and transmitted. On the other hand, if the speech signal encoding apparatus does not operate in the sound quality enhancement layer, the fixed codebook multiplying the gain value of the fixed codebook of the base layer is transmitted to the decoded fixed codebook of the base layer.

제 1104 단계에서 음성 신호 복호화 장치는 제 1102 단계에서 복호화된 LPC 계수를 이용하여 제 1103 단계에서 선택적으로 전송된 코드북을 합성한다. In operation 1104, the apparatus for decoding speech signals synthesizes a codebook selectively transmitted in operation 1103 using the LPC coefficients decoded in operation 1102.

제 1105 단계에서 음성 신호 복호화 장치는 후처리 유니트(816)와 같이 후처리하여 복원된 음성 신호를 생성한다.In operation 1105, the voice signal decoding apparatus post-processes the post-processing unit 816 to generate a reconstructed voice signal.

이제까지 본 발명에 대하여 그 바람직한 실시 예들을 중심으로 살펴보았다. 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자는 본 발명이 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 변형된 형태로 구현될 수 있음을 이해할 수 있을 것이다. 그러므로 개시된 실시 예들은 한정적인 관점이 아니라 설명적인 관점에서 고려되어야 한다. 본 발명의 범위는 전술한 설명이 아니라 특허청구범위에 나타나 있으며, 그와 동등한 범위 내에 있는 모든 차이점은 본 발명에 포함된 것으로 해석되어야 할 것이다.So far I looked at the center of the preferred embodiment for the present invention. Those skilled in the art will appreciate that the present invention can be implemented in a modified form without departing from the essential features of the present invention. Therefore, the disclosed embodiments should be considered in descriptive sense only and not for purposes of limitation. The scope of the present invention is shown in the claims rather than the foregoing description, and all differences within the scope will be construed as being included in the present invention.

상술한 본 발명에 따르면, 상술한 본 발명에 따르면, 기존의 표준화된 CLEP 음성 부호화 구조를 변경하지 않고 비트율을 확장할 수 있는 구조를 제시함에 따라 기존의 표준화된 CLEP 음성 부호화 장치를 구비한 시스템과 호환이 가능하다. According to the present invention described above, according to the present invention described above, by presenting a structure capable of extending the bit rate without changing the existing standardized CLEP speech coding structure and the system having a conventional standardized CLEP speech coding apparatus and Compatible

또한, 상술한 본원 발명의 일 실시 예에 따르면, 기본 계층의 고정 코드북 탐색 대상 신호와 음질 향상 계층의 고정 코드북 탐색 대상 신호를 같게 함으로써, 음질 향상 계층에서 탐색된 코드북은 다음 프레임을 위해 저장되지 않아 기본 계층의 동작에 영향을 주지 않는다. In addition, according to the embodiment of the present invention described above, by fixing the fixed codebook search target signal of the base layer and the fixed codebook search target signal of the sound quality enhancement layer, the codebook searched in the sound quality enhancement layer is not stored for the next frame. It does not affect the behavior of the base layer.

그리고, 음질 향상 계층의 고정 코드북 탐색시 기본 계층의 고정 코드북 탐색 시 구한 매개 변수 값을 사용함으로써 음질 향상 계층이 고정 코드북 탐색에 요구되는 연산량을 줄일 수 있다. When the fixed codebook search of the sound quality enhancement layer is used, the parameter value obtained when the fixed codebook search of the base layer is used may reduce the amount of computation required for the fixed codebook search.

또한, 상술한 본원 발명의 다른 실시 예에 따르면, 음질 향상 계층의 고정 코드북 탐색을 위해 요구되는 대상 신호는 기본 계층의 고정 코드북 대상신호에서 기본 계층의 고정 코드북 기여도와 음질 향상 계층의 합성 필터를 통해 제공되는 이전의 음질 향상 계층의 고정 코드북의 합성 신호를 제거하여 줌으로써, 음질 향상 계층 전용의 대상 신호를 이용한 고정 코드북 탐색이 수행됨에 따라 좀더 정확한 고정 코드북 탐색을 기대할 수 있다. In addition, according to another embodiment of the present invention, the target signal required for the fixed codebook search of the sound quality enhancement layer is a fixed codebook target signal of the base layer through the fixed codebook contribution of the base layer and the synthesis filter of the sound quality enhancement layer By removing the synthesized signal of the fixed codebook of the previous sound quality enhancement layer provided, a more accurate fixed codebook search can be expected as the fixed codebook search using the target signal dedicated to the sound quality enhancement layer is performed.

더욱이, 음질 향상 계층에서 탐색된 펄스의 위치와 기본 계층에서 탐색된 펄스의 위치가 같게 될 수 있어, 대수 코드북의 각 펄스의 크기가 동일한 크기를 갖는 한계점을 극복하고 최종 고정 코드북의 펄스가 다중 크기를 가지므로 복원되는 음성신호의 음질을 개선할 수 있다. Moreover, the position of the searched pulse in the sound quality enhancement layer and the position of the searched pulse in the base layer can be the same, overcoming the limitation that the magnitude of each pulse in the algebraic codebook has the same magnitude, and the pulses of the final fixed codebook are Since it can improve the sound quality of the restored voice signal.

그리고, 음질 향상 계층의 이득값은 기본 계층의 양자화된 이득값과 음질 향상 계층의 이득값간의 차를 양자화하여 상대적으로 동적 범위가 작은 이득값 차를 양자화한 값을 전송함으로써, 음질 향상 계층에서 이득값 양자화에 필요한 비트를 절약할 수 있다. The gain value of the sound quality enhancement layer quantizes a difference between the quantized gain value of the base layer and the gain value of the sound quality enhancement layer, and transmits a quantized value of a gain difference having a relatively small dynamic range, thereby obtaining a gain in the sound quality enhancement layer. The bits required for value quantization can be saved.

Claims

delete

In the audio signal encoding apparatus,

A base layer for linear prediction encoding filtering the input speech signal and generating an excitation signal corresponding to the filtered speech signal by fixed codebook searching and adaptive codebook searching; And

A fixed codebook search unit for searching a fixed codebook using a parameter generated according to the fixed codebook search in the base layer;

A gain value quantizer for detecting a difference between a first fixed codebook gain value generated by the fixed codebook search of the base layer and a second fixed codebook gain value output from the fixed codebook search unit, and quantizing the detected difference; It includes a plurality of sound quality enhancement layer,

And a multiplexer for multiplexing the signal generated in the base layer and the signal generated in the sound quality enhancement layer.

An audio signal decoding apparatus for decoding an encoded speech signal divided into a base layer and at least one sound quality enhancement layer,

A first decoding unit for decoding encoded information in a base layer among the encoded speech signals;

A second decoding unit for restoring encoding information in a sound quality enhancement layer among the encoded speech signals according to an operating environment of the speech signal decoding apparatus;

A calculation unit configured to calculate a signal restored in the first decoding unit and a signal restored in the second decoding unit according to an operating environment of the speech signal decoding apparatus;

And a speech signal reconstruction unit for reconstructing the speech signal by synthesizing the signal output from the calculation unit by using the linear prediction coding coefficients output from the first decoding unit.

The method of claim 8, wherein the first decoding unit,

A linear prediction coding coefficient decoder for decoding the linear prediction coding coefficient quantization information included in the encoding information of the base layer;

A first fixed codebook decoder which decodes a fixed codebook index included in the encoding information in the base layer;

An adaptive codebook decoder which decodes an adaptive codebook index included in the encoding information in the base layer;

And a gain value decoder which decodes the fixed codebook gain value and the adaptive codebook gain value included in the encoding information in the base layer, respectively.

The method of claim 9, wherein the second decoding unit,

A gain value difference decoder for decoding quantization information of the difference between the fixed codebook gain values included in the encoded information in the speech enhancement layer;

And a second fixed codebook decoder for decoding the fixed codebook index included in the encoding information in the sound quality enhancement layer.

The method of claim 10, wherein the calculation unit,

A first adder for adding the decoded fixed codebook gain value output from the gain value decoder and the decoded gain value difference output from the gain value difference decoder;

A first selection switch for transmitting the decoded fixed codebook gain value output from the gain value decoding unit or the gain value output from the first adder according to an operating condition of the speech signal decoding apparatus;

A second adder for adding a fixed codebook of a decoded sound quality enhancement layer output from the second fixed codebook decoder and a fixed codebook of a decoded base layer output from the first fixed codebook decoder;

A second selection switch for transmitting a signal output from the second adder or the decoded fixed codebook output from the first fixed codebook decoder according to an operating condition of the voice signal decoding apparatus;

A first multiplier for multiplying a gain value of the decoded adaptive codebook output from the adaptive codebook decoder and the decoded adaptive codebook output from the gain value decoder;

A second multiplier that multiplies the signal output from the first selection switch with the signal output from the second selection switch;

And a third adder configured to add a signal output from the first multiplier and a signal output from the second multiplier.

The method of claim 11, wherein the audio signal recovery unit,

A synthesis filter for synthesizing a signal output from the third adder using the linear prediction coding coefficients;

And a post-processing unit for obtaining the reconstructed speech signal using the linear prediction coefficients and the signal output from the synthesis filter.

The method of claim 8, wherein the second decoding unit,

And a fixed codebook decoder for decoding the fixed codebook index included in the encoding information in the sound quality enhancement layer.

The method of claim 9, wherein the second decoding unit,

A gain value difference decoding unit for decoding the quantization information of the difference between the fixed codebook logscale gain values included in the encoding information in the sound quality enhancement layer;

The method of claim 14, wherein the calculation unit,

A first adder for adding a fixed codebook of a decoded sound quality enhancement layer output from the second fixed codebook decoder and a fixed codebook of a decoded base layer output from the first fixed codebook decoder;

A selection switch for selectively transmitting a signal output from the first adder or a fixed codebook of the decoded base layer output from the first fixed codebook decoder according to an operating condition of the speech signal decoding apparatus;

And a second adder for adding the signal output from the selection switch and the adaptive codebook of the decoded base layer output from the adaptive codebook decoder.

The method of claim 15, wherein the audio signal recovery unit,

A synthesis filter for synthesizing a signal output from the second adder using the linear prediction coding coefficients;

The method of claim 8, wherein the second decoding unit,

A gain value difference decoder which decodes quantization information of the difference between the fixed codebook logscale gain values included in the encoded information in the speech enhancement layer;

delete

A speech signal decoding method for decoding a speech signal encoded with a base layer and at least one sound quality enhancement layer,

Decoding the encoded speech signal;

Selectively transmitting the codebook for the base layer decoded in the decoding step and the codebook for the sound quality enhancement layer according to the operating condition of the speech signal decoding;

And generating a reconstructed speech signal by combining the selectively transmitted codebook and the linear prediction coefficients decoded in the decoding step.

The method of claim 22, wherein the decoding step,

And demultiplexing the encoded speech signal into encoding information of a base layer and encoding information of a sound quality enhancement layer, and decoding the demultiplexed encoding information.

The method of claim 23, wherein the voice signal decoding method,

Restoring the gain value of the fixed codebook in the sound quality enhancement layer by adding a difference between the gain value of the fixed codebook in the decoded base layer and the gain value of the fixed codebook included in the decoded sound quality enhancement layer in the decoding step. Voice signal decoding method further comprising.

In an audio signal encoding apparatus.

A base layer for filtering an input speech signal using linear predictive coding and generating an excitation signal of the filtered speech signal by fixed codebook searching and adaptive codebook searching;

At least one sound quality enhancement layer for searching the fixed codebook by using a signal obtained by removing the contribution of the fixed codebook of the base layer from the fixed codebook search target signal of the base layer,

And a multiplexer for multiplexing the signal generated in the base layer and the signal generated in the sound quality enhancement layer and outputting the multiplexed signal.

The fixed codebook contribution y ₂ (n) of the base layer is detected by calculating the impulse response of the synthesis code and the fixed codebook c _G multiplied by the quantization gain value of the fixed codebook of the base layer as shown in the following equation. Voice signal encoding apparatus, characterized in that.

The audio signal encoding of claim 25, wherein the sound quality enhancement layer further removes a signal obtained by synthesizing a fixed codebook signal generated in the sound quality enhancement layer using the linear prediction coding coefficients from a target signal of the base layer. Device.

26. The method of claim 25, wherein, in the fixed codebook search of the sound quality enhancement layer, the log scale value of the first gain value obtained by the fixed codebook search of the base layer and the second gain obtained by the fixed codebook search in the sound quality enhancement layer. And a function for multiplying the quantized gain value of the sound quality enhancement layer obtained by quantizing the difference between logarithmic scale values and the fixed codebook vector obtained by the fixed codebook search in the sound quality enhancement layer. .

The method of claim 25, wherein if there are a plurality of sound quality enhancement layers,

And the multiplexer multiplexes quantized information on a difference between a fixed codebook index output from a plurality of sound quality enhancement layers and a log scale gain value of the fixed codebook.

The apparatus of claim 25 or 27, wherein the sound quality enhancement layer performs the fixed codebook search after the cognitive weighted filtering of the target signal.

In the audio signal encoding apparatus,

A base layer for linearly predicting and encoding the input speech signal and generating an excitation signal corresponding to the filtered speech signal by fixed codebook search and adaptive codebook search;

A search unit for searching for a fixed codebook by using a signal from which the fixed codebook contribution of the base layer is removed from the fixed codebook search target signal of the base layer as a fixed codebook search target signal of a sound quality enhancement layer;

Detecting a difference between a log scale gain value of a first fixed codebook generated by the fixed codebook search of the base layer and a log scale gain value of a second fixed codebook output from the fixed codebook search unit, and quantizing the detected difference A plurality of sound quality enhancement layers including a log scale gain value difference quantizer,

A multiplexer for multiplexing the signal generated in the base layer and the signal generated in the sound quality enhancement layer,

And the sound quality enhancement layer further removes a signal obtained by synthesizing the fixed codebook using the linear predictive coding coefficients from the fixed codebook search target signal of the sound quality enhancement layer.

The voice signal coding method is

A base layer processing step of extracting a linear prediction coefficient of the input speech signal and generating an excitation signal corresponding to the input speech signal by fixed codebook searching and adaptive codebook searching;

A sound quality enhancement layer processing step of searching for a fixed codebook by using a signal from which the fixed codebook contribution of the base layer is removed from the fixed codebook search target signal of the base layer as a fixed codebook search target signal of a sound quality enhancement layer;

And multiplexing signals generated by the base layer processing step and the sound quality enhancement layer processing step.

33. The method of claim 32, wherein the fixed codebook search target signal of the sound quality enhancement layer further removes a signal obtained by synthesizing the fixed codebook using linear prediction coding coefficients in the sound quality enhancement layer from the target signal of the base layer. Speech signal coding method.

33. The method of claim 32, wherein the sound quality enhancement layer processing step includes a load scale gain value of a fixed codebook obtained by the fixed codebook search in the base layer processing step and a log scale of a gain value obtained by the fixed codebook search of the sound quality enhancement layer. And quantizing the difference between the gain values.