KR20130047630A - Apparatus and method for coding signal in a communication system - Google Patents

Apparatus and method for coding signal in a communication system Download PDF

Info

Publication number
KR20130047630A
KR20130047630A KR1020120119933A KR20120119933A KR20130047630A KR 20130047630 A KR20130047630 A KR 20130047630A KR 1020120119933 A KR1020120119933 A KR 1020120119933A KR 20120119933 A KR20120119933 A KR 20120119933A KR 20130047630 A KR20130047630 A KR 20130047630A
Authority
KR
South Korea
Prior art keywords
residual signal
vector
frequency
signal
order
Prior art date
Application number
KR1020120119933A
Other languages
Korean (ko)
Inventor
김현우
Original Assignee
한국전자통신연구원
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 한국전자통신연구원 filed Critical 한국전자통신연구원
Priority to US13/661,945 priority Critical patent/US8924203B2/en
Publication of KR20130047630A publication Critical patent/KR20130047630A/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0016Codebook for LPC parameters

Abstract

The present invention relates to an apparatus and method for encoding a speech and audio signal using a Code Excited Linear Prediction (CELP) encoding scheme in a communication system. Calculate a residual signal for the speech and audio signal, convert the residual signal into a frequency domain, calculate frequency energy of the residual signal using frequency coefficients of the residual signal, and calculate the frequency energy of the residual signal from the The energy concentration rate for each vector order of the residual signal is calculated, and the target vector order of the residual signal is determined by comparing the energy concentration rates for each vector order.

Figure P1020120119933

Description

Apparatus and method for coding signal in a communication system

BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a communication system, and more particularly, to an apparatus and method for encoding a speech and audio signal using a code excited linear prediction (CELP) coding scheme in a communication system. It is about.

In a communication system, active research is being conducted to provide users with services of various quality of service (QoS: QoS) having a high transmission speed. In such a communication system, methods for rapidly transmitting data having various types of QoS through limited resources have been proposed, and in recent years, as network development and user demand for high-quality services increase, voice and audio signals are compressed in a network. As a scheme for transmitting, a scheme for compressing and restoring a pulse code modulation (PCM) signal has been proposed, and many voice / audio codecs have been proposed for compressing and restoring such a PCM signal. (codec) were developed.

On the other hand, as an example of the voice / audio codec, recent codecs such as ITU-T G.729.1 and G.718 support multiple bit rates in an embedded structure, and voice and audio signals at low bit rates of the multiple bit rates. A high compression ratio is achieved based on the CELP technique modeling the generation of the signal, and at the high bit rate of the multiple bit rate, a residual discrete cosine transform (MDCT) of a speech and audio signal is transformed in the time domain. It is referred to as 'MDCT') or a Discrete Fourier Transform (DFT: Discrete Fourier Transform, hereinafter referred to as 'DFT') to quantize by transforming into a frequency domain.

Here, the CELP technology is designed to be more suitable for voice than audio in voice and audio signals, and the characteristics of the residual signal, which is a difference between the original sound and the synthesized signal encoded by the CELP technology, are different. In other words, in the case of voice, CELP technology can properly express formants and pitches having a large frequency size, but in the case of music, it can not express formants and pitches properly, and thus large frequency components in the residual signal There is a lot left. That is, in the CELP technology, even if the same voice is used, not only a signal in which the frequency distribution is evenly distributed by accurately expressing the formant and the pitch, but also a coefficient having a large frequency because the coefficient and the pitch cannot be accurately expressed as described above. It may appear in.

However, in the current communication system, when the voice and audio signals are encoded using the CELP technology, that is, the CELP coding scheme, as described above, a specific method for normally processing the residual signal for the voice and audio signals is proposed. In particular, since the residual signal is not normally processed, the encoding performance of the voice and audio signals through the CELP encoding method is degraded, thereby providing a high quality service to users.

Accordingly, in order to provide high quality voice and audio services in a communication system, a method of encoding voice and audio signals using a CELP encoding method is required.

Accordingly, an object of the present invention is to provide an apparatus and method for encoding a signal in a communication system.

Another object of the present invention is to provide an apparatus and method for encoding a speech and audio signal using a Code Excited Linear Prediction (CELP) coding scheme in a communication system.

Another object of the present invention is to determine a quantization vector dimension according to a distribution of frequency coefficients of a residual signal for a speech and audio signal when encoding a speech and audio signal using a CELP encoding scheme in a communication system. The present invention provides a signal encoding apparatus and a method for processing the residual signal normally.

In addition, another object of the present invention, when encoding a speech and audio signal using the CELP coding scheme in a communication system, through the frequency characteristic analysis of the residual signal for the speech and audio signal, the quantization vector order according to the energy concentration The present invention provides a signal encoding apparatus and method for improving the voice and audio quality of service by determining a and processing the residual signal normally.

According to an aspect of the present invention, there is provided a signal encoding apparatus in a communication system, the apparatus comprising: an encoding unit encoding a speech and an audio signal by a code excited linear prediction (CELP) encoding scheme; A residual signal calculator configured to calculate a residual signal for the voice and audio signals; A frequency converter converting the residual signal into a frequency domain; An energy calculator configured to calculate frequency energy of the residual signal using frequency coefficients of the residual signal; An energy concentration ratio calculator for calculating an energy concentration ratio for each vector order of the residual signal from the frequency energy of the residual signal; And a vector order determiner which determines an object vector order of the residual signal by comparing energy concentration rates with respect to the respective vector orders.

In accordance with another aspect of the present invention, there is provided a method of encoding a signal in a communication system, the method comprising: encoding a speech and an audio signal by a code excited linear prediction (CELP) encoding scheme; Calculating a residual signal for the speech and audio signal; Converting the residual signal into a frequency domain; Calculating frequency energy of the residual signal using the frequency coefficients of the residual signal; Calculating an energy concentration rate for each vector order of the residual signal from the frequency energy of the residual signal; And comparing the energy concentration rates for the respective vector orders to determine a target vector order of the residual signal.

According to the present invention, in encoding a speech and audio signal using a CELP coding scheme in a communication system, a quantization vector dimension is determined according to a distribution of frequency coefficients of a residual signal for the speech and audio signal, in particular the residual. By analyzing the frequency characteristics of the signal, by determining the quantization vector order according to the energy concentration, the residual signal for the speech and audio signals are processed normally, thereby improving the encoding performance of the speech and audio signals using the CELP encoding method. High quality voice and audio services can be provided.

1 is a view schematically showing a structure of a signal encoding apparatus in a communication system according to an embodiment of the present invention.
2 is a diagram schematically illustrating an encoding process of a signal encoding apparatus in a communication system according to an embodiment of the present invention.

Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings. It should be noted that in the following description, only parts necessary for understanding the operation according to the present invention will be described, and descriptions of other parts will be omitted so as not to distract from the gist of the present invention.

The present invention proposes a signal encoding apparatus and method in a communication system. Here, according to an embodiment of the present invention, a voice and audio signal for encoding services of various quality of service (QoS: hereinafter referred to as 'QoS') in a communication system, for example, voice and audio service, are encoded. Although an apparatus and a method are described as an example, the signal encoding scheme proposed by the present invention may be equally applied to the case of encoding signals corresponding to other services.

In addition, an embodiment of the present invention proposes an apparatus and method for encoding a speech and audio signal using a Code Excited Linear Prediction (CELP) coding scheme in a communication system. do. Here, in an embodiment of the present invention, when encoding a voice and audio signal using a CELP coding scheme in a communication system, a quantization vector dimension is determined according to the distribution of frequency coefficients of the residual signal for the voice and audio signal. Thus, the residual signal is processed normally, thereby improving the encoding performance of the speech and audio signals. In the embodiment of the present invention, as described above, when encoding the voice and audio signals using the CELP encoding method, the quantization vector according to the energy concentration is analyzed through frequency characteristic analysis of the residual signal for the voice and audio signals. The order is determined, and thus the residual signal for the voice and audio signals is normally processed to provide high quality voice and audio services.

Here, in the communication system according to the embodiment of the present invention, the frequency characteristics of the residual signal for the speech signal are uniformly distributed during the encoding of the speech and audio signals using the CELP encoding scheme, but for the music signal, that is, the audio signal. Since the peak component of the residual signal appears strongly, the quantization vector order is determined in consideration of the distribution of frequency characteristics of the residual signal for the speech and audio signals, thereby efficiently processing the residual signal by performing efficient quantization with limited bits. That is, in an embodiment of the present invention, when CELP coding is applied to speech and audio signals, and the residual signal for the speech and audio signals is encoded, the target vector order is determined according to the distribution of frequency coefficients of the residual signals. By processing the residual signal normally, a high quality voice and audio service is provided.

According to an embodiment of the present invention, an spectral distribution of a residual signal for speech and audio signals that are structurally generated according to the CELP coding scheme in a speech / audio codec using multiple bit rates is analyzed and coded. By adjusting the dimension, the residual signal for the voice and audio signals is normally processed to provide high quality voice and audio services. In the embodiment of the present invention, in the spectral distribution of the residual signal, when the residual signal has a certain number of large frequencies, the order of the target vector to be coded is reduced, so that the large frequency of the residual signal is more precisely defined by a limited bit. The residual signal is processed more normally to improve the quality of speech and audio services, and in the spectral distribution of the residual signal, if the frequencies of the residual signal are evenly distributed, the order of the target vector to be coded is large. To perform quantization.

That is, in an embodiment of the present invention, the frequency distribution of the residual signal for the speech and audio signals is analyzed to determine the target vector order, thereby reducing the quantization error, thereby improving the sound quality of the speech / audio codec. Here, in the embodiment of the present invention, the energy concentration ratio of the residual signal is calculated, and the frequency distribution of the residual signal is analyzed. In this case, when the energy concentration ratio is high, when the target vector order is made small and the energy concentration ratio is low, The order is determined in such a way as to increase the target vector order, and the quantization of more important frequency coefficients is elaborated, thereby improving the quality of voice and audio services.

In other words, in the existing speech / audio codec, the target vector order of the residual signal is fixed to be equal to the number of frequency coefficients, so that several large frequencies, that is, tones, such as the residual signal for the music signal, that is, the audio signal, are fixed. If the component contains a strong frequency, the quantization of the large frequency coefficients is not precisely performed, and thus, the degradation of the voice and audio quality of service occurs when the existing voice and audio signals are encoded. In an embodiment of the present invention, when encoding a voice and audio signal using a CELP encoding scheme, the target vector order is determined according to the frequency distribution of the residual signal for the voice and audio signal, thereby determining the voice and audio service, in particular, music. More precisely quantizes audio signals, such as signals, And audio services. Next, a device for encoding voice and audio signals in a communication system according to an exemplary embodiment of the present invention will be described in more detail with reference to FIG. 1.

1 is a diagram schematically illustrating a structure of a signal encoding apparatus in a communication system according to an embodiment of the present invention.

Referring to FIG. 1, the apparatus for encoding a signal includes a CELP encoder 102 for encoding a speech and an audio signal using a CELP encoding scheme, and a residual signal calculator 104 for calculating a residual signal for the speech and audio signals. ), A frequency converter 106 for converting the residual signal into a signal in a frequency domain, a residual signal weighting unit 108 for obtaining a weighted signal from the frequency coefficients of the residual signal, and using the frequency coefficients of the residual signal. A frequency energy calculation unit 110 that calculates frequency energy, an energy concentration rate calculator 120 that calculates an energy concentration rate for each vector order of the frequency energy, and compares the energy concentration rate to determine a vector order. The vector order determination unit 122 is included. Here, in the communication system according to the embodiment of the present invention, the signal encoding apparatus, as described above, the CELP encoding unit 102, the residual signal calculation unit 104, the frequency converter 106, the residual Through the signal weighting unit 108, the frequency energy calculator 110, and the energy concentration ratio calculator 120, the residual signal frequency distribution according to the CELP codec is analyzed and the vector order determiner 122 is used. Determine the objective vector order.

The signal encoding apparatus includes a vector position determiner 124 that determines the position of the object vector by the object vector order, a vector position quantizer 126 that quantizes positions of frequency coefficients allocated to the object vector, and the object vector. A gain quantizer 128 that quantizes the gain of the signal, a vector normalizer 130 that normalizes the object vector with the quantized gain, and a shape quantizer 132 that shapes and quantizes the normalized object vector. Code quantizer 134 for quantizing the sign of the shape quantized vector position, quantized parameters of the quantizers 126, 128, 132, 134, and multiplexer 136 for multiplexing the determined target vector order. .

In more detail, the CELP encoder 102 receives a voice and audio signal as a pulse code modulation (PCM) signal, and receives the voice and audio signal. Encode with CELP codec. Here, as an example of the voice / audio codec described above, in the G.729.1 codec, after down-sampling the voice and audio signal at 8 kHz, encoding is performed using the CELP codec, and the G.718 codec is used. In the down-sampling of the speech and audio signals at 12.8 kHz, encoding is performed using the CELP codec.

The residual signal calculator 104 calculates a difference between a voice and an audio signal of the PCM signal, that is, an original voice and audio signal, and a signal resynthesized by the CELP codec through the CELP encoder 102. In other words, the residual signal for the voice and audio signals is calculated. For example, in the G.718 codec, after encoding the voice and audio signals with a CELP codec, up-sampling at 16 kHz is performed, and the difference between the original audio and audio signals Is calculated to generate the residual signal.

The frequency converter 106 converts the residual signal from the time domain into the frequency domain, that is, a modified discrete cosine transform (MDCT) or a discrete Fourier transform (DFT). : Convert to frequency domain through Discrete Fourier Transform (hereinafter referred to as 'DFT').

The residual signal weighting unit 108 receives the frequency coefficients of the residual signal transformed into the frequency domain as described above, and applies a perceptual weighting filter to the frequency coefficients of the residual signal, thereby providing the residual. Acquire a weighted signal of the signal. Here, the residual signal weighting unit 108 applies the cognitive weighting filter in a manner of emphasizing the formant in the above-described CELP technique, or a masking effect in the Moving Picture Experts Group (MPEG) technique. Is applied to obtain a weighted signal of the residual signal from the frequency coefficients of the residual signal.

The frequency energy calculator 110 calculates frequency energy of the residual signal by using frequency coefficients of the residual signal. Here, when the frequency coefficient of the residual signal is an MDCT coefficient, the frequency energy calculator 110 calculates the frequency energy of the residual signal using the coefficient passed through the cognitive weighting filter as it is, and calculates the residual energy of the residual signal. When the frequency coefficient is a DFT coefficient, the square sum of the real component and the image component of the DFT coefficient is calculated, and then the cognitive weighting filter is applied to the square sum to obtain the frequency energy of the residual signal. Calculate In addition, the frequency energy calculation unit 110, in calculating the frequency energy of the residual signal, groups the frequencies of the residual signal into arbitrary subbands in order to reduce the amount of calculation, and thus, as the frequency energy of the residual signal. The energy in the subbands of the residual signal is calculated.

For example, the frequency energy calculation unit 110, when there are 320 MDCT coefficients as the frequency coefficients of the residual signal, using the 320 MDCT coefficients in total, 320 frequency energy of the residual signal. Or calculate the subband energy for the four MDCT coefficients by grouping four MDCT coefficients into one subband, and subband energy for the four MDCT coefficients, ie, 80 subbands. A band energy is calculated, and the subband energy can be expressed by Equation 1 below.

Figure pat00001

In Equation 1, e (n) denotes subband energy for the four MDCT coefficients, and n denotes a subband index.

The energy concentration rate calculator 120 receives the frequency energies of the residual signal and calculates an energy concentration rate for each vector order of the residual signal from the frequency energies of the residual signal. For example, as described above, when the frequency energy of the residual signal calculated by the frequency energy calculation unit 110 is 320, and the target vector orders are 16, 32, and 48, the energy concentration ratio calculation unit 120 In order to calculate the energy concentration rate of each vector order, the 320 frequency energies are arranged in order of energy magnitude.

In addition, the energy concentration ratio calculator 120 may arrange the sorted 320 frequency energies in order of energy magnitude, with 16 frequency energies, 32 frequency energies, 48 frequency energies, and 320 frequencies. After summing the energies, 16 total summing frequency energy (ener16), 32 total summing frequency energy (ener32), 48 total summing frequency energy (ener48), and 320 total summing frequency energy (ener320), respectively do. In addition, the energy concentration ratio calculation unit 120 calculates an energy concentration ratio of each vector order using the total sum frequency energies, that is, an energy concentration ratio of 16 vector orders (EC16 = ener16 / ener320), 32 Calculate the energy concentration rate (EC32 = ener32 / ener320) of the vector order and the energy concentration ratio (EC48 = ener48 / ener320) of the 48 vector order, respectively.

The vector order determination unit 122 determines the target vector order by comparing the energy concentration rates of each vector order calculated by the energy concentration rate calculation unit 120. Here, the vector order determination unit 122 determines the vector order having the maximum value from the energy concentration rates of each vector order calculated by the energy concentration rate calculation unit 120 as the target vector order. For example, the energy concentration rate (EC16) of the 16 vector order, the energy concentration rate (EC32) of the 32 vector order, and the energy concentration rate (EC48) of the 48 vector order, namely EC16, γ1 × EC32, and γ2 × EC48). If the maximum value is EC16, 16 vector orders are determined as the target vector order, and if the maximum value is greater than γ1 × EC32, then the 32 vector orders are determined as the target vector order, and if the maximum value is greater than γ2 × EC48, then 48 vectors The order is determined as the target vector order, and if the maximum value is smaller than β, the 320 vector order is determined as the target vector order. Here, γ1 and γ2 are arbitrary setting values having 0.8 to 1.2, and β means any setting value having 0.5 to 0.8.

The vector position determiner 124 assigns frequency coefficients to the object vector in the order of the absolute values of the frequency coefficients in the order of the object vector determined by the vector order determiner 122. The position of the object vector is stored, i.e., the position of the object vector is determined.

The vector position quantization unit 126 calculates the position of the object vector to which the frequency coefficients are assigned and then quantizes the position of the object vector. For example, when the 16 vector order is the object vector order and 16 frequency coefficients are assigned to the object vector, the vector position quantization unit 126 may determine the position of the frequency coefficients assigned to the object vector, that is, the frequency coefficients. After calculating the position (pi) of the assigned object vector, the frequency coefficients are quantized to the position (pi) of the assigned object vector, and the position quantization of the object vector can be represented by binary bits as shown in Equation 2 below. .

Figure pat00002

The gain quantization unit 128 obtains the gain (

Figure pat00003
) Is quantized. Here, the gain quantization unit 128 quantizes the gain of the target vector to a value closest to a codebook previously generated by using training data.

The vector normalizer 130 normalizes the object vector by the gain of the object vector quantized by the gain quantizer 128.

The shape quantization unit 132 quantizes the object vector normalized by the vector normalization unit 130. The shape quantization unit 132 quantizes the normalized object vector by applying an Algebraic vector quantization, or, as described above, the shape quantization unit 132 to a value closest to a codebook previously generated using training data. Quantize the normalized objective vector.

The code quantization unit 134 quantizes the position code of the object vector quantized by the shape quantization unit 132. That is, the code quantization unit 134 quantizes the code of the position of the quantized object vector.

As described above, the multiplexer 136 multiplexes the quantized object vector, the normalized object vector, the position and gain of the quantized object vector, and the vector order of the object vector. Next, the signal encoding operation in the communication system according to the embodiment of the present invention will be described in more detail with reference to FIG. 2.

2 is a diagram schematically illustrating an encoding process of a signal encoding apparatus in a communication system according to an embodiment of the present invention.

Referring to FIG. 2, in step 210, the signal encoding apparatus receives a voice and an audio signal as a PCM signal, encodes the voice and audio signal with a CELP codec, and then regenerates the voice and audio signal of the PCM signal. In other words, the difference between the original speech and audio signals and the signal resynthesized by the CELP codec is calculated, that is, the residual signal for the speech and audio signals is calculated.

In operation 220, the residual signal is converted from the time domain into the frequency domain, that is, the MDCT or DFT is converted into the frequency domain, and then a perceptual weighting filter is applied to the frequency coefficients of the residual signal. Thus, the weighted signal of the residual signal is obtained.

Next, in step 230, after calculating the frequency energy of the residual signal using the frequency coefficients of the residual signal, the energy concentration ratio for each vector order of the residual signal is calculated from the respective frequency energies of the residual signal. do.

Then, in step 240, a target vector order is determined by comparing the calculated energy concentration rates of the respective vector orders, and the frequency is applied to the target vector in the order of the absolute values of the frequency coefficients as large as the determined order vector. After assigning the coefficients, the position of the object vector to which the frequency coefficients are assigned is stored, i.e., the position of the object vector is determined.

In operation 250, the position of the object vector to which the frequency coefficients are assigned is calculated to quantize the position of the object vector, and the gain of the object vector is quantized.

Next, in step 260, after normalizing the object vector with the gain of the quantized object vector, in step 270, the normalized object vector is quantized and the position code of the quantized object vector is quantized.

Thus, in the communication system according to the embodiment of the present invention, the spectral distribution of the residual signal for the speech and audio signals that are structurally generated according to the CELP coding scheme in the voice / audio codec using multiple bit rates, that is, the residual By analyzing the frequency distribution of the residual signal through the energy concentration ratio of the signal, the target vector dimension to be coded is determined. In this case, in the frequency distribution of the residual signal, the residual signal has some specific frequencies. In this case, the order of the target vector to be coded is reduced, so that the large frequency of the residual signal is more quantized more precisely with a limited bit to process the residual signal more normally, thereby improving the quality of voice and audio service, and also in the frequency distribution of the residual signal. If the frequencies of the residual signal are evenly distributed, Greatly to have a degree of ever performs vector quantization. Here, in the communication system according to the embodiment of the present invention, as described above, the frequency distribution of the residual signal is analyzed by calculating the energy concentration ratio of the residual signal and the like, and when the energy concentration ratio is high, the target vector order is made small. When the energy concentration rate is low, the order is determined by increasing the target vector order, and the quantization of more important frequency coefficients is elaborated, thereby improving the quality of voice and audio services.

While the invention has been shown and described with reference to certain preferred embodiments thereof, it will be understood by those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the invention. Therefore, the scope of the present invention should not be limited by the described embodiments, but should be determined by the scope of the appended claims, as well as the appended claims.

Claims (20)

In the signal encoding apparatus in a communication system,
An encoder which encodes a voice and an audio signal by a code excited linear prediction (CELP) coding method;
A residual signal calculator configured to calculate a residual signal for the voice and audio signals;
A frequency converter converting the residual signal into a frequency domain;
An energy calculator configured to calculate frequency energy of the residual signal using frequency coefficients of the residual signal;
An energy concentration ratio calculator for calculating an energy concentration ratio for each vector order of the residual signal from the frequency energy of the residual signal; And
And a vector order determiner for comparing an energy concentration rate with respect to each vector order to determine a target vector order of the residual signal.
The method of claim 1,
And the vector order determiner determines a vector order having a maximum value as the target vector order at an energy concentration rate for each vector order.
The method of claim 1,
And the residual signal calculator calculates a difference between the speech and audio signals and a signal resynthesized by a code excitation linear prediction codec through the code excitation linear prediction coding scheme.
The method of claim 1,
The frequency converter is a signal encoding, characterized in that for transforming the residual signal into the frequency domain through a modified Discrete Cosine Transform (MDCT) or Discrete Fourier Transform (DFT) in the time domain Device.
The method of claim 1,
And a residual signal weighting unit for obtaining a weighted signal of the residual signal by applying a perceptual weighting filter to the frequency coefficients of the residual signal.
The method of claim 1,
And the energy calculator calculates energy in a subband of the residual signal by using frequency coefficients of the residual signal.
The method of claim 1,
The energy concentration ratio calculation unit arranges the frequency energy of the residual signal in the order of magnitude, calculates the frequency energy for each vector order according to the magnitude order, and calculates the energy concentration rate for each vector order. And a signal encoding apparatus.
The method of claim 1,
A position determiner which stores the positions of the object vectors to which the frequency coefficients are allocated by assigning the frequency coefficients to the object vector of the residual signal in the order of the highest absolute value of the frequency coefficients by the object vector order;
And a position quantizer configured to calculate a position of frequency coefficients allocated to the object vector and quantize the position of the object vector.
9. The method of claim 8,
A gain quantizer for quantizing the gain of the target vector;
A normalizer for normalizing the object vector with a gain of the quantized object vector;
A shape quantizer for quantizing the normalized object vector;
And a code quantizer for quantizing the position code of the object vector.
10. The method of claim 9,
The gain quantization unit quantizes the gain of the target vector to a value closest to a codebook previously generated using training data;
The shape quantization unit quantizes the normalized object vector by applying an Algebraic vector quantization, or quantizes the normalized object vector to a value closest to the codebook.
In the signal encoding method in a communication system,
Encoding a speech and an audio signal using a Code Excited Linear Prediction (CELP) coding scheme;
Calculating a residual signal for the speech and audio signal;
Converting the residual signal into a frequency domain;
Calculating frequency energy of the residual signal using the frequency coefficients of the residual signal;
Calculating an energy concentration rate for each vector order of the residual signal from the frequency energy of the residual signal; And
And determining an object vector order of the residual signal by comparing energy concentration rates with respect to each vector order.
The method of claim 11,
And a vector order having a maximum value as the target vector order in the energy concentration rate for each vector order for determining the target vector order.
The method of claim 11,
The calculating of the residual signal may include calculating a difference between the speech and audio signals and a signal resynthesized by a code excitation linear prediction codec through the code excitation linear prediction coding scheme. Way.
The method of claim 11,
The transforming into the frequency domain may include converting the residual signal into the frequency domain through a modified discrete cosine transform (MDCT) or a discrete fourier transform (DFT) in the time domain. A signal encoding method.
The method of claim 11,
And applying a perceptual weighting filter to the frequency coefficients of the residual signal to obtain a weighted signal of the residual signal.
The method of claim 11,
The calculating of the frequency energy may include calculating energy in a subband of the residual signal by using frequency coefficients of the residual signal.
The method of claim 11,
The calculating of the energy concentration rate may include: aligning the frequency energy of the residual signal in the order of magnitude, and then calculating the frequency energy for each of the vector orders according to the magnitude order, thereby concentrating the energy for each vector order. And a method for calculating the rate.
The method of claim 11,
Allocating the frequency coefficients to the object vector of the residual signal in order of increasing absolute value of the frequency coefficients by the object vector order, and storing the position of the object vector to which the frequency coefficients are assigned;
And calculating the positions of the frequency coefficients assigned to the object vector, and quantizing the position of the object vector.
19. The method of claim 18,
Quantizing the gain of the target vector;
Normalizing the object vector with the gain of the quantized object vector;
Shape quantizing the normalized object vector;
And quantizing the position code of the object vector.
20. The method of claim 19,
Quantizing the gain comprises: quantizing gain of the object vector to a value closest to a codebook previously generated using training data;
The shape quantization may include performing quantization by applying an Algebraic vector quantization to the normalized object vector, or quantizing the normalized object vector to a value closest to the codebook.
KR1020120119933A 2011-10-28 2012-10-26 Apparatus and method for coding signal in a communication system KR20130047630A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/661,945 US8924203B2 (en) 2011-10-28 2012-10-26 Apparatus and method for coding signal in a communication system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR1020110111464 2011-10-28
KR20110111464 2011-10-28

Publications (1)

Publication Number Publication Date
KR20130047630A true KR20130047630A (en) 2013-05-08

Family

ID=48659026

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020120119933A KR20130047630A (en) 2011-10-28 2012-10-26 Apparatus and method for coding signal in a communication system

Country Status (1)

Country Link
KR (1) KR20130047630A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI642053B (en) * 2016-04-12 2018-11-21 弗勞恩霍夫爾協會 Audio encoder for encoding an audio signal, method for encoding an audio signal and computer program under consideration of a detected peak spectral region in an upper frequency band

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI642053B (en) * 2016-04-12 2018-11-21 弗勞恩霍夫爾協會 Audio encoder for encoding an audio signal, method for encoding an audio signal and computer program under consideration of a detected peak spectral region in an upper frequency band
US10825461B2 (en) 2016-04-12 2020-11-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder for encoding an audio signal, method for encoding an audio signal and computer program under consideration of a detected peak spectral region in an upper frequency band
US11682409B2 (en) 2016-04-12 2023-06-20 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder for encoding an audio signal, method for encoding an audio signal and computer program under consideration of a detected peak spectral region in an upper frequency band

Similar Documents

Publication Publication Date Title
JP6980871B2 (en) Signal coding method and its device, and signal decoding method and its device
US8099275B2 (en) Sound encoder and sound encoding method for generating a second layer decoded signal based on a degree of variation in a first layer decoded signal
TWI576832B (en) Apparatus and method for generating bandwidth extended signal
KR20090122142A (en) A method and apparatus for processing an audio signal
JP6600054B2 (en) Method, encoder, decoder, and mobile device
EP2814028B1 (en) Audio and speech coding device, audio and speech decoding device, method for coding audio and speech, and method for decoding audio and speech
KR102386738B1 (en) Signal encoding method and apparatus, and signal decoding method and apparatus
US9424857B2 (en) Encoding method and apparatus, and decoding method and apparatus
US9830919B2 (en) Acoustic signal coding apparatus, acoustic signal decoding apparatus, terminal apparatus, base station apparatus, acoustic signal coding method, and acoustic signal decoding method
US8924203B2 (en) Apparatus and method for coding signal in a communication system
CN108701462B (en) Adaptive quantization of weighting matrix coefficients
WO2011045926A1 (en) Encoding device, decoding device, and methods therefor
US8706509B2 (en) Method and a decoder for attenuation of signal regions reconstructed with low accuracy
KR20130047630A (en) Apparatus and method for coding signal in a communication system
KR20160098597A (en) Apparatus and method for codec signal in a communication system

Legal Events

Date Code Title Description
WITN Withdrawal due to no request for examination