KR100322703B1

KR100322703B1 - Audio encoder/decoder

Info

Publication number: KR100322703B1
Application number: KR1019950023514A
Authority: KR
Inventors: 김상욱; 김연배; 서양석
Original assignee: 윤종용; 삼성전자 주식회사
Priority date: 1995-07-31
Filing date: 1995-07-31
Publication date: 2002-06-20

Abstract

PURPOSE: An audio encoder/decoder is provided to reduce the number of registers used to store coefficients by realizing a filter using properties of analysis window and mixing window coefficients and a compensation coefficient. CONSTITUTION: An audio encoder comprises an analysis sub-band filter for dividing an input audio signal into N sub-bands according to bands of a predetermined process unit, a sound mentality modeling part for considering a recognition ability of a human according to a frequency property of the audio signal, a quantization part for assigning and quantizing bits into each sub-band by outputs of the analysis sub-band and the sound mentality modeling part, and a packing part for forming a bit stream using quantized values and header information. An audio decoder comprises a header detecting part for unpacking the bit stream from the audio encoder, and a composition sub-band filter for restoring an original audio signal from the unpacked bit stream.

Description

Audio Encoder / Decoder

본 발명은 오디오 부호화기/복호화기에 관한 것으로서, 특히 오디오 부호화기/복호화기를 하나의 칩으로 구현하는 경우 폴리페이즈 필터를 사용함으로써 분석 윈도우 계수와 합성 윈도우 계수를 저장하는데 사용되는 레지스터를 256개로 감소시키는 오디오 부호화기/복호화기에 관한 것이다.BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an audio encoder / decoder. In particular, when the audio encoder / decoder is implemented in one chip, an audio encoder which reduces the registers used to store the analysis window coefficients and the synthesis window coefficients to 256 by using a polyphase filter. It is about a decoder.

반도체 기술과 컴퓨터 기술의 급속한 발전은 우리가 취급하는 정보 가운데 많은 부분을 디지탈로 처리할 수 있게 하였고, 고속처리가 가능하도록 하였다. 이에 따라 과거에 구현이 불가능하다고 여겨지던 제품들이 서서이 시장에 등장하고 있다. 그 중 디지탈 오디오/비디오 처리장치에서 핵심으로 사용하게 될 기술이 데이타 압축/복원 기술이다. 최근, 오디오와 비디오 신호들의 압축 및 저장, 테스트 등에 관련된 국제 표준안 제정작업이 추진되고 있는데, 이러한 작업은 MPEG(Moving Picture Experts Group)이라 불리우는 그룹에 의해 진행되고 있다.The rapid development of semiconductor and computer technologies has enabled us to process much of the information we handle digitally and to enable high-speed processing. As a result, products that have been considered impossible to implement in the past are now appearing in this market. Among them, data compression / restore technology is the core technology of digital audio / video processing device. Recently, international standardization work related to compression, storage and testing of audio and video signals has been promoted. This work is being carried out by a group called Moving Picture Experts Group (MPEG).

MPEG에서 제안된 MPEG-1의 목적은 데이타 표현에 사용되는 비트수를 줄여 1.5Mbit/s의 속도로 데이타를 읽고 쓰는 방식으로서, CD, DAT, 자기디스크 등에 저장해 표현할 수 있는 정보의 양을 늘리고자 하는 것으로 현재 많은 가라오케 시스템, CD 비젼, CD-FMV(Full Motion Video) 등에 사용되고 있다.The purpose of MPEG-1 proposed in MPEG is to reduce the number of bits used to represent data, and to read and write data at a speed of 1.5 Mbit / s, and to increase the amount of information that can be stored and represented on CDs, DATs, and magnetic disks. It is currently used in many karaoke systems, CD vision, and CD-FMV (Full Motion Video).

한편, MPEG-2는 현재 응용분야가 위성 또는 지상 텔레비전 방송, HDTV, 디지탈 오디오방송(DAB)외의 CATV, ENG(Electronic News Gathering), IPC(Interpersonal Communication), ISM(Interactive Storage Media), NDB(NetworkDatabase Service), DSM(Digital Storge Media), EC(Electronic Cinema), HTT(Home Television Theatre), ISDN 등과 같이 다양한 응용분야를 갖고 있는 것으로서, 특히 방송 및 이기종간의 정보 교환 및 활용에 적합하게 되어 있다.MPEG-2 is currently applied to CATV, Electronic News Gathering (ENG), IPC (Interpersonal Communication), ISM (Interactive Storage Media), and NDB (Network Database) in addition to satellite or terrestrial television broadcasting, HDTV, and digital audio broadcasting (DAB). It has various application fields such as Service, Digital Storge Media (DSM), Electronic Cinema (EC), Home Television Theater (HTT), and ISDN, and is particularly suitable for the exchange and utilization of information between broadcast and heterogeneous types.

이러한 MPEG-1과 MPEG-2의 압축방식들 가운데 오디오 신호들에 대한 새로운 부호화방식들은 공간적인(spectral), 시간적인(temporal) 마스킹 효과들에 의한 인간의 소리에 대한 인식성질들을 이용한다. 마스킹 효과를 이용한 양자화기의 효율적인 사용으로 처리단위 대역에서 발생하는 오차신호가 인간이 느끼지 못하는 한계값 이내가 되도록 만들어주어 청취자가 음질의 차이를 인식할 수 없는 상태를 만족시켜 주면서 데이타 압축효과를 얻는다.Among the MPEG-1 and MPEG-2 compression schemes, new coding schemes for audio signals use recognition properties of human sounds by spatial and temporal masking effects. Efficient use of the quantizer using the masking effect ensures that the error signal generated in the processing unit band is within the limits that humans do not feel, and satisfies the condition where the listener cannot recognize the difference in sound quality, thereby obtaining the data compression effect. .

한편, 인간의 음향심리를 이용한 오디오 부호화기/복호화기의 기본 구조들은 제1도에 도시된 바와 같다. 여기서 오디오 부호화기(a)의 구조는 시간/주파수 맵핑부(11), 인간의 음향심리 모델링부(12), 양자화부(13)와 비트패킹부(14)로 구성되고, 오디오 복호화기(b)는 프레임 언팩킹부(15), 복원부(16)와 역맵핑부(17)로 구성된다.Meanwhile, the basic structures of the audio encoder / decoder using human psychoacoustics are shown in FIG. 1. Here, the structure of the audio encoder (a) is composed of a time / frequency mapping unit 11, a human psychoacoustic modeling unit 12, a quantization unit 13, and a bitpacking unit 14, and an audio decoder (b). Is composed of a frame unpacking unit 15, a restoring unit 16, and a reverse mapping unit 17.

오디오 부호화기에 있어서, 시간/주파수 맵핑부(11)에서는 인간의 음향심리를 고려하기 위해, 인간이 느끼는 처리단위 대역별로 신호들을 나누고, 인간의 음향심리 모델링부(12)에서는 QMF(Quadrature Mirror Filter) 등에 의한 대역필터링방식, 또는 DFT 또는 MDCT 변환에 의한 방식들이 처리에 사용되고, 입력신호에 인간의 음향심리를 고려하여 들어도 느끼지 못하는 한계값들인 마스킹된 문턱치를 공지된 법칙들을 이용하여 계산한다. 시간/주파수 맵핑부(11)와 인간의 음향심리 모델링부(12)의 출력값을 이용하여 각 처리단위 대역별로 중요도에 따라 비트들을 할당하고, 결정된 비트수에 따라 양자화부(13)에서는 값을 표시해 주고, 이 양자화된 값들은 비트스트립 구성을 위해 비트팩킹부(14)에서 팩킹된다.In the audio encoder, the time / frequency mapping unit 11 divides the signals according to the processing unit bands that the human feels in order to consider the human psychology, and the human psychoacoustic modeling unit 12 QMF (Quadrature Mirror Filter) The band filtering method by the method or the method by the DFT or MDCT conversion is used for processing, and the masked thresholds which are threshold values which are not felt even if it considers a human psychoacoustic to an input signal are calculated using well-known laws. By using the output values of the time / frequency mapping unit 11 and the human psychoacoustic modeling unit 12, the bits are allocated according to the importance of each processing unit band, and the quantization unit 13 displays the values according to the determined number of bits. These quantized values are then packed in the bit packing section 14 for bit strip configuration.

MPEG-1과 MPEG-2 오디오의 시간/주파수 맵핑부에서는 인간의 음향심리를 고려해 주기 위해 인간이 느끼는 처리단위 대역별로 신호들을 나누어 줄때 레이어 1, 2의 경우에는 QMF의 효율적인 구현을 위해 폴리페이즈(polyphase) 구조를 채택하였고, 구현시 발생되는 얼라이어싱의 효과를 줄여주기 위하여 계수들을 조절해 주었다. 이때, 부호화하고 복호화하는데 사용되는 분석 윈도우와 합성 윈도우의 필터계수가 다르기 때문에 그 구현에 있어서 각 계수들을 모두 저장하고 있어야 한다. 이러한 경우, 필터계수가 대칭성을 갖도록 설계되어 각각 512개씩인 필터계수는 256개의 계수들로 표현이 가능하다. 필터계수의 례가 제14도에 도시되어 있는데, 여기서 1에서 511은 필터계수의 인덱스를 나타낸다. 인덱스가 0인 경우에는 계수가 0으로, 연산의 결과에 영향을 미치지 않기 때문에 사용하지 않는다. 인덱스가 255인 경우의 계수는 257인 경우와 같고, 2인 경우는 510인 경우와 같다.In the time / frequency mapping section of MPEG-1 and MPEG-2 audio, signals are divided according to the processing unit bands that the human feels in order to consider human acoustic psychology. A polyphase structure was adopted and the coefficients were adjusted to reduce the effect of aliasing in the implementation. At this time, since the filter coefficients of the analysis window and the synthesis window used for encoding and decoding are different, each coefficient must be stored in the implementation. In this case, the filter coefficients are designed to have symmetry, and each of the 512 filter coefficients can be represented by 256 coefficients. An example of the filter coefficient is shown in FIG. 14, where 1 to 511 represent the index of the filter coefficient. If the index is 0, the coefficient is 0 and is not used because it does not affect the result of the operation. The coefficient when the index is 255 is the same as the case of 257, and the case of 2 is the same as the case of 510.

한편, MPEC 오디오의 부호화/복호화과정은 제2도에 도시된 바와 같다. 먼저 부호화과정(a)을 살펴보면, 각 입력신호들은 필터링시킨 후, FFT된 신호들과 인간의 음향심리 법칙들에 의해 계산되는 마스킹 문턱치들을 이용하여 비트들을 할당해 주고, 고정된 비트율에 맞추어 전달해 주는 데이타를 정하고, 양자화 스텝사이즈를 결정한 후 각 대역의 샘플들을 양자화한다. 복호화과정(b)을 살펴보면, 부호화된 비트스트림을 받아들인 후, 헤드 정보에 속하는 비트할당수와 각 대역의 스케일 패터들을 가지고 각 대역의 샘플들을 복원한 다음, 합성 서브밴드 필터를 통과시켜 원하는 신호를 얻는다.Meanwhile, the encoding / decoding process of MPEC audio is shown in FIG. 2. First, in the encoding process (a), each input signal is filtered, and then bits are allocated using masking thresholds calculated by FFT signals and human psychoacoustic laws, and transmitted according to a fixed bit rate. After the data is determined, the quantization step size is determined, and the samples of each band are quantized. Decoding process (b) shows that after receiving the encoded bitstream, reconstructing the samples of each band with the bit allocations belonging to the head information and the scale patterns of each band, and then passing the synthesized subband filter to Get signal.

여기서, 부호화시 분석 서브밴드 필터와 복호화시 합성 서브밴드 필터는 제3도에 도시된 바와 같다. 제3도의 각 단계들 가운데 512 샘플들에 의해 형성되는 Zi와 Wi를 구할때 사용되는 Ci와 Di가 각각 분석 윈도우 계수와 합성 윈도우 계수들이다.Here, the analysis subband filter at the time of encoding and the synthesis subband filter at the time of decoding are as shown in FIG. Ci and Di, which are used to obtain Zi and Wi formed by 512 samples of each step in FIG. 3, are analysis window coefficients and composite window coefficients, respectively.

인간의 음향심리모델은 대역별로 나누어진 각 처리대역에 있어서 느낄 수 있는 잡음의 값을 결정하는데 사용되는 최소 마스킹 문턱치를 계산한다. 최대 신호 크기와 최소 마스킹 문턱치 사이의 차는 비트나 잡음을 할당해 각 블럭의 각 처리대역들의 실제 양자화기의 양자화 스텝사이즈를 결정하는데 사용한다. 두 종류의 모델이 MPEG 표준안에서 다루어지고 있고, 각 모델은 모든 레이어에 사용가능하나, 레이어 1, 2에서는 모델1이, 레이어 3에서는 모델 2가 사용된다. 각 음향심리모델에서 최종 출력은 레이어 1,2의 경우 각 처리대역의 SMR(Signal to Masked threshold Ratio)이고, 레이어 3에서는 대역들 그룹의 SMR이다.The psychoacoustic model of human being calculates the minimum masking threshold used to determine the noise value that can be felt in each processing band divided by band. The difference between the maximum signal magnitude and the minimum masking threshold is assigned bits or noise to determine the quantization step size of the actual quantizer of each processing band of each block. Two types of models are covered in the MPEG standard, each model available for all layers, but model 1 for layers 1 and 2 and model 2 for layer 3. In each psychoacoustic model, the final output is the signal to masked threshold ratio (SMR) of each processing band in the case of layers 1 and 2, and the SMR of the group of bands in layer 3.

이러한 방식들은 보다 낮은 비트율에서 부호화 효율을 높이기 위해 조인트 스테레오 코딩(Joint Stereo Coding) 방식 등이 개발되었고, 많은 연구가 진행되고 있다.These methods have been developed such as joint stereo coding (Joint Stereo Coding) method to improve the coding efficiency at a lower bit rate, and a lot of research is being conducted.

한편, 부호기만을 구현할 경우에는 분석(analysis) 윈도우를 필요로 하고, 복호기만을 구현할 경우에는 합성(synthesis) 윈도우를 필요로 하기 때문에 각각 256개의 레지스터에 각 계수들을 저장해 두면 되지만, 부호화기와 복호화기를 하나의 ASIC이나 소프트웨어로 구현할 경우에는 각각의 계수 값들을 저장할 때 모두 512개의 레지스터가 사용되는 문제가 있다. 일예로, 캠코더와 같이 실시간에 영상과 음성신호들을 저장하고, 경우에 따라서는 TV에 연결하여 출력시키는 경우, 부호화기와 복호화기가 모두 필요한 경우가 된다. 이러한 경우 대칭인 점만을 고려하여 분석 윈도우 계수와 합성 윈도우 계수를 저장하면 512개의 레지스터가 사용된다.On the other hand, if only the encoder is implemented, an analysis window is required. If only the decoder is implemented, a synthesis window is required. Therefore, each coefficient may be stored in 256 registers. If implemented in ASIC or software, there is a problem that all 512 registers are used to store each coefficient value. For example, when video and audio signals are stored in real time, such as a camcorder, and in some cases connected to a TV and outputted, both an encoder and a decoder are required. In this case, 512 registers are used to store the analysis window coefficients and the composite window coefficients considering only symmetrical points.

따라서, 본 발명의 목적은 상술한 문제점을 해결하기 위하여 오디오 부호화기와 복호화기를 하나의 칩으로 구현하는 경우, 분석 윈도우 계수와 합성 윈도우 계수의 성질 및 보상계수를 이용하여 필터를 구현함으로써 계수들을 저장하는데 사용되는 레지스터를 256개로 줄이기 위한 오디오 부호화기/복호화기를 제공하는데 있다.Accordingly, an object of the present invention is to store the coefficients by implementing a filter using the properties of the analysis window coefficients and the synthesis window coefficients and the compensation coefficients when the audio encoder and the decoder are implemented in one chip to solve the above problems. An audio encoder / decoder is provided to reduce the registers used to 256.

본 발명의 다른 목적은 보상계수, 전달함수 특성을 개선한 MPEG 오디오에 사용될 수 있는 변형된 합성 윈도우 계수, 합성 윈도우와 분석 윈도우 사이의 보상계수 연산을 쉽게 하기 위한 변형된 분석 윈도우계수, 변형된 합성 윈도우를 가지고 변형된 분석 윈도우의 연산결과를 구하기 위한 오디오 부호화기/복호화기를 제공하는데 있다.Another object of the present invention is a modified synthesis window coefficient that can be used in MPEG audio with improved compensation coefficient, transfer function characteristics, modified analysis window coefficient for modified calculation coefficient between synthesis window and analysis window, modified synthesis An audio encoder / decoder for calculating the operation result of a modified analysis window with a window is provided.

상기 목적들을 달성하기 위하여 본 발명에 의한 오디오 부호화기 /복호화기에서는In order to achieve the above objects, in the audio encoder / decoder according to the present invention,

오디오 부호화기는Audio encoder

입력되는 오디오신호에 대하여 소정의 처리단위 대역별로 N개의 서브밴드로 나누어주는 분석 서브밴드 필터:An analysis subband filter dividing the N audio subbands into N subbands according to a predetermined processing unit band with respect to an input audio signal:

상기 오디오신호의 주파수 특성에 따른 인간의 인지능력을 고려하기 위한 음향심리 모델링부:A psychoacoustic modeling unit for considering a human cognitive ability according to the frequency characteristics of the audio signal:

상기 분석 서브밴드 필터와 상기 음향심리 모델링부의 출력에 의해 비트들을 각 서브랜드에 할당하고 양자화하는 양자화부; 및A quantizer for allocating and quantizing bits to each subland by the output of the analysis subband filter and the psychoacoustic modeling unit; And

양자화된 값들을 헤더정보와 함께 비트스트림으로 형성하는 패킹부로 구성되고,It consists of a packing unit for forming the quantized values in the bitstream with the header information,

오디오 복호화기는Audio decoder

상기 오디오 부호화기에서 출력되는 비트스트림을 풀기 위한 헤더 검출부: 및A header detector for solving a bitstream output from the audio encoder; and

풀려진 비트스트림으로 부터 원래의 오디오신호를 복원하기 위한 합성 서브밴드 필터로 구성되는 것을 특징으로 한다.And a synthesized subband filter for recovering the original audio signal from the unpacked bitstream.

그러면 본 발명에 대하여 첨부된 도면을 참조하여 상세히 설명하기로 한다.Next, the present invention will be described in detail with reference to the accompanying drawings.

본 발명에 의한 오디오 부호화기/복호화기에 사용되는 필터는 인간의 청각심리를 고려해 주기 위해 32대역으로 나누어 처리하는데, 이는 제4도에서와 같이 QMF 필터들에 의해 구현될 수 있다. 필터탭이 L인 경우, 32대역으로 나누어 처리할 때 L*(1+2+4+8+16)=31*L=(대역수-1)*L의 연산회수를 필요로 한다. 이러한 필터계수들에 의한 연산량을 줄이기 위해 대역제한된 신호들이 갖는 샘플링 한계인 나이퀴스트 샘플링을 고려해 데시메이션을 행해 주는데, 이 결과를 고려해 준 방법이 폴리페이즈 구현방법이다. 제5도에 도시된 바와 같이 폴리페이즈에 의한 구현시 연산의 회수를 L로 줄일 수 있다.The filter used in the audio encoder / decoder according to the present invention is processed into 32 bands to consider human hearing psychology, which can be implemented by QMF filters as shown in FIG. In the case where the filter tap is L, it is required to calculate L * (1 + 2 + 4 + 8 + 16) = 31 * L = (number of bands-1) * L when processing by dividing into 32 bands. In order to reduce the calculation amount due to these filter coefficients, decimation is performed considering the Nyquist sampling, which is a sampling limit of the band-limited signals. As shown in FIG. 5, the number of operations when implemented by polyphase can be reduced to L. FIG.

QMF또는 폴리페이즈와 같이 샘플링 한계를 처리에 고려해 준 경우, 데시메이션에 의해 얼라이어싱이 발생하는 문제가 있다. 이러한 문제는 필터계수를 변형시킴으로써 해결가능하다. MPEG에서 사용하는 분석 윈도우와 합성 윈도우는 얼라이어싱을 고려한 폴리페이즈에 의한 필터계수를 사용하고 있다.If the sampling limit is considered in processing, such as QMF or polyphase, there is a problem that aliasing occurs due to decimation. This problem can be solved by modifying the filter coefficients. The analysis window and synthesis window used in MPEG use the filter coefficients by polyphase considering the aliasing.

제6도와 제7도는 MPEG에서 사용하는 얼라이어싱을 고려한 폴리페이즈 필터의 프로토타입인 경우의 전달함수와 그 필터계수들을 나타낸 것이다. 본 발명에서는 제7도의 프로토타입 필터계수를 이용하여 얼라이어싱을 제거한 제8도와 제9도와 같은 분석 윈도우 계수와 합성 윈도우 계수를 구해 MPEG 오디오에 사용한다.6 and 7 show the transfer function and its filter coefficients in the case of the prototype of the polyphase filter considering the aliasing used in MPEG. In the present invention, the analysis window coefficients and the synthesis window coefficients as shown in Figs. 8 and 9, which have been de-aliased using the prototype filter coefficients of Fig. 7, are obtained and used for MPEG audio.

MPEG 오디오에서 사용한 프로토타입 필터계수의 합은 라운딩 등에 의해 1이 되지 않는다. 본 발명에서는 보다 좋은 전달함수 특성을 얻기위해 MPEG에서 사용되는 프로토타입 필터계수의 합이 1이 되도록 변형해 주었다. 그 결과 얻어진 프로토타입 필터의 전달함수와 필터계수는 제10도와 제11도에 도시된 바와 같다. 제6도와 제10도를 비교하면, 전달함수에 있어서는 원으로 표시된 경우들 이외에는 큰 차이가 없음을 볼 수 있다. 구해진 프로토타입 필터계수를 이용하여 얼라이어싱을 줄인 분석 윈도우와 합성 윈도우를 구한 것은 제12도와 제13도에 도시된 바와 같다.The sum of the prototype filter coefficients used in MPEG audio is not one due to rounding or the like. In the present invention, the prototype filter coefficients used in MPEG are modified to be 1 to obtain better transfer function characteristics. The transfer function and filter coefficient of the resulting prototype filter are as shown in FIG. 10 and FIG. Comparing FIG. 6 and FIG. 10, it can be seen that there is no significant difference in the transfer function except in the cases indicated by circles. The analysis window and the synthesis window which reduced the aliasing using the obtained prototype filter coefficients are as shown in FIG. 12 and FIG.

본 발명에서는 연산의 결과를 계산하는데 보다 쉽게 하기 위해 분석 윈도우 계수와 합성 윈도우 계수가 2의 제곱의 계수 차이를 갖도록 함으로써 연산결과값의 상위 k비트만을 결과로 취해 연산의 결과를 구하는 것이 가능하도록 하였다.In the present invention, in order to make it easier to calculate the result of the calculation, the analysis window coefficient and the composite window coefficient have a coefficient difference of two squares so that it is possible to obtain the result of the operation by taking only the upper k bits of the operation result as a result. .

즉, y=x₁+x₂+x₃일때, x₁+kc₁, x₂= kc₂, x₃=kc₃인 경우,That is, when y = x ₁ + x ₂ + x _3, when x ₁ + kc ₁ , x ₂ = kc ₂ , x ₃ = kc ₃ ,

y=k(c₁+c₂+c₃) 또는 y₁=c₁+c₂=c₃. y=ky₁으로 구하는 경우와 같다. 이때, k가 2의 지수배이기 때문에 예를 들어, k가 32이면 2의 5제곱이므로 y₁연산결과를 5비트 이동함으로써 32배의 효과를 얻을 수 있다.y = k (c ₁ + c ₂ + c ₃ ) or y ₁ = c ₁ + c ₂ = c ₃ . Same as obtaining y = ky ₁ . In this case, since k is an exponential multiple of 2, for example, if k is 32, a power of 2 is increased by 5 bits so that a 32 times effect can be obtained by shifting the y ₁ operation result by 5 bits.

본 발명에 적용되는 폴리페이즈 필터의 일실시예에 대하여 설명하면 다음과 같다.Hereinafter, an embodiment of the polyphase filter applied to the present invention will be described.

필터는으로 표현되며, 여기서 필터탭은 N이 된다. 이것을 풀어쓰면The filter Where the filter tab is N. If you write this

이 된다.Becomes

본 발명에 적용되는 폴리페이즈 필터는Polyphase filter applied to the present invention

과, and,

이 있을때, g_O(i)와 h_O(i) 사이의 관계가 g_O=kh_O(i) 인 경우, g₀(i) 또는 h₀(i)와 k를 가지고서, y₁(n)과 y₂(n)을 구할 수 있음을 보였다. 즉, h₀(i) 와 k만을 아는 경우. y₁(i)을 먼저 구한 다음, y₂(i)=ky₁(i)에 의해 y₂(i)를 구할 수 있고, g₀(i) 와 k만을 아는 경우, y₂(i)을 먼저 구한 다음,에 의해 y₁(i)를 구할 수 있다. 즉, g₀(i)와 h₀(i)를 동시에 가지고 있을 필요가 없으므로 사용되는 레지스터의 수를 줄일 수 있다. 여기서 분석 윈도우 계수와 합성 윈도우 계수가 상수배를 갖는 경우 두 값들을 모두 저장할 필요가 없음을 보였다. In this case, when g _O (i) and h _O (i) is g _O = kh _O (i), with g ₀ (i) or h ₀ (i) and k, y ₁ (n) And y ₂ (n) was found. That is, if only h ₀ (i) and k are known. If the y ₁ (i) first determined, and then, y ₂ (i) = can be obtained for y ₂ (i) by a _{_{ky 1 (i), g 0}} (i) and know only k, y a ₂ (i) First, then Y ₁ (i) can be obtained. That is, since g ₀ (i) and h ₀ (i) need not be held at the same time, the number of registers used can be reduced. Here, we show that when the analysis window coefficient and the composite window coefficient have constant constants, it is not necessary to store both values.

본 발명에 적용되는 폴리페이즈 필터는 MPEG 오디오 방식을 채용한 오디오 기기, HDTV, 캠코더 등에 적용할 수 있다.The polyphase filter applied to the present invention can be applied to an audio device, an HDTV, a camcorder or the like employing the MPEG audio system.

상술한 바와 같이 본 발명에 의한 오디오 부호화기/복호화기는 MPEG 오디오와 호환성을 갖는 부호화기와 복호화기를 함께 구현하는 코덱에 있어서, 분석 윈도우 계수와 합성 윈도우 계수의 저장에 사용되는 레지스터의 수를 512에서 256개로 줄여 하드웨어의 복잡도를 감소시킨 이점이 있다. 또한, 하드웨어 수현을 보다 용이하게 하기 위해 분석 윈도우와 합성 윈도우의 계수비를 2의 지수배가 되도록 함으로써 분석 윈도우 또는 합성 윈도우를 사용하여 계산할때 쉬프터를 이용해 비트이동을 하거나, 결과의 특정비트를 취함으로써 연산의 결과를 얻을 수 있다.As described above, the audio encoder / decoder according to the present invention is a codec that implements an encoder and a decoder that are compatible with MPEG audio, and has a number of registers used for storing analysis window coefficients and synthesized window coefficients from 512 to 256. This reduces the complexity of the hardware. In addition, by making the coefficient ratio of the analysis window and the synthesis window more than two exponents for easier hardware implementation, by shifting bits using the shifter or taking specific bits of the result when calculating using the analysis window or synthesis window. The result of the operation can be obtained.

제1도는 인간의 청각심리를 고려한 오디오 코덱을 나타낸 도면.1 is a diagram illustrating an audio codec considering human hearing psychology.

제2도는 MPEG 오디오 부호화기/복호화기를 나타낸 도면.2 shows an MPEG audio encoder / decoder.

제3도는 MPEG 오디오의 합성/분석 필터를 나타낸 도면.3 shows a synthesis / analysis filter of MPEG audio.

제4도는 QMF에 의해 32대역 필터를 구현하는 방법을 설명하기 위한 도면.4 is a diagram for explaining a method for implementing a 32-band filter by QMF.

제5도는 폴리페이즈에 의해 32대역 필터를 구현하는 방법을 설명하기 위한 도면.5 is a diagram for explaining a method of implementing a 32-band filter by polyphase.

제6도는 MPEG에서 사용된 프로토타입 필터의 전달함수.6 is a transfer function of a prototype filter used in MPEG.

제7도는 MPEG에서 사용된 프로토타입 필터의 계수.7 shows the coefficients of the prototype filter used in MPEG.

제8도는 MPEG에서 사용된 분석 윈도우 계수.8 shows analysis window coefficients used in MPEG.

제9도는 MPEG에서 사용된 합성 윈도우 계수.9 is the synthesis window coefficients used in MPEG.

제10도는 본 발명에서 사용하는 프로토타입 필터의 전달함수.10 is a transfer function of a prototype filter used in the present invention.

제11도는 본 발명에서 사용하는 프로토타입 필터의 계수.11 is the coefficient of the prototype filter used in the present invention.

제12도는 본 발명에서 사용하는 분석 윈도우 계수.12 is an analysis window coefficient used in the present invention.

제13도는 본 발명에서 사용하는 합성 윈도우 계수.13 is a synthesis window coefficient used in the present invention.

제14도는 필터계수의 대칭성을 사용한 레지스터를 나타낸 도면.14 shows registers using symmetry of filter coefficients.

Claims

Audio encoder

An analysis subband filter for dividing the input audio signal into N subbands for each predetermined processing unit band;

Psychoacoustic modeling unit for considering human cognitive ability according to frequency characteristics of the audio signal;

A quantizer for allocating and quantizing bits to each subband by the output of the analysis subband filter and the psychoacoustic modeling unit; And

It consists of a packing unit for forming the quantized values in the bitstream with the header information,

Audio decoder

A header detector for solving a bitstream output from the audio encoder; and

An audio encoder / decoder comprising a composite subband filter for reconstructing the original audio signal from the unpacked bitstream.

The audio encoder / decoder of claim 1, wherein values of analysis window coefficients of the analysis subband filter and synthesis window coefficients of the synthesis subband filter have a constant k times.

3. The audio encoder / decoder of claim 2, wherein the constant k is represented by N powers of two.

4. The audio encoder / decoder according to claim 3, wherein analysis window coefficients are obtained by dividing the synthesis window coefficients by k.

4. The audio encoder / decoder of claim 3, wherein a composite window coefficient is obtained by multiplying the analysis window coefficient by k.

The audio encoder / decoder according to claim 1, wherein the analysis window of the analysis subband filter is symmetrically designed to represent all analysis window coefficients using only N of 2N coefficients.

The audio coder / decoder according to claim 1, wherein the synthesis window of the synthesis subband filter is symmetrically designed to represent all synthesis window coefficients using only N out of 2N coefficients.

The compound window according to claim 1, wherein when the analysis window coefficient and the synthesis window coefficient are designed with a constant k times, the calculation result by the analysis window coefficient is shifted to the right by N bits when the calculation is performed using the analysis window coefficient and the constant k. An audio encoder / decoder, which obtains processing results.

The analysis window according to claim 1, wherein when the analysis window coefficient and the synthesis window coefficient are designed with a constant k times, when the calculation is performed using the synthesis window coefficient and the constant k, the calculation result by the synthesis window coefficient is shifted to the left by N bits. An audio encoder / decoder, characterized by obtaining processing results.

2. The audio encoder / decoding according to claim 1, wherein when the analysis window coefficient and the synthesis window coefficient are designed with a constant k times, the analysis window processing result is a value excluding the lower N bits from the operation result of the synthesis window coefficient. group.