TWI488176B - Encoding and decoding of pulse positions of tracks of an audio signal - Google Patents

Encoding and decoding of pulse positions of tracks of an audio signal Download PDF

Info

Publication number
TWI488176B
TWI488176B TW101104350A TW101104350A TWI488176B TW I488176 B TWI488176 B TW I488176B TW 101104350 A TW101104350 A TW 101104350A TW 101104350 A TW101104350 A TW 101104350A TW I488176 B TWI488176 B TW I488176B
Authority
TW
Taiwan
Prior art keywords
pulse
track
positions
tracks
states
Prior art date
Application number
TW101104350A
Other languages
Chinese (zh)
Other versions
TW201248617A (en
Inventor
Guillaume Fuchs
Tom Backstrom
Original Assignee
Fraunhofer Ges Forschung
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Ges Forschung filed Critical Fraunhofer Ges Forschung
Publication of TW201248617A publication Critical patent/TW201248617A/en
Application granted granted Critical
Publication of TWI488176B publication Critical patent/TWI488176B/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005Correction of errors induced by the transmission channel, if related to the coding algorithm
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/10Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/028Noise substitution, i.e. substituting non-tonal spectral components by noisy source
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/012Comfort noise or silence coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/022Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/022Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
    • G10L19/025Detection of transients or attacks for time/frequency resolution switching
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/03Spectral prediction for preventing pre-echo; Temporary noise shaping [TNS], e.g. in MPEG2 or MPEG4
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • G10L19/07Line spectrum pair [LSP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/10Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
    • G10L19/107Sparse pulse excitation, e.g. by using algebraic codebook
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • G10L19/13Residual excited linear prediction [RELP]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/22Mode decision, i.e. based on audio signal content versus external parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/06Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Quality & Reliability (AREA)
  • Algebra (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Mathematical Physics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Description

音訊信號音軌脈衝位置之編碼與解碼技術Encoding and decoding technology for audio signal track position

本發明係有關於音訊處理及音訊編碼領域,更明確言之,係有關於音訊信號中音軌脈衝位置之編碼與解碼技術。The present invention relates to the field of audio processing and audio coding, and more specifically to encoding and decoding techniques for the position of a track pulse in an audio signal.

音訊處理及/或音訊編碼已經多所進展。於音訊編碼中,線性預測編碼器扮演要角。當編碼音訊信號例如包含語音的音訊信號時,線性預測編碼器通道編碼音訊信號之頻譜波封之表示型態。為了達成該項目的,線性預測編碼器可決定預測濾波器係數來以編碼形式表示該聲音之頻譜波封。然後,濾波器係數可由線性預測解碼器用來藉使用該等線性預測編碼器產生合成音訊信號而解碼該編碼音訊信號。Audio processing and/or audio coding has progressed. In audio coding, a linear predictive encoder plays an important role. When encoding an audio signal, such as an audio signal containing speech, the linear predictive encoder channel encodes the representation of the spectral envelope of the audio signal. To achieve this, the linear predictive encoder can determine the predictive filter coefficients to represent the spectral envelope of the sound in encoded form. The filter coefficients can then be used by a linear predictive decoder to decode the encoded audio signal by using the linear predictive encoder to generate a composite audio signal.

線性預測編碼器之主要實例為ACELP編碼器(ACELP=代數代碼激勵線性預測編碼器)。ACELP編碼器廣為人使用,例如用於USAC(USAC=統一語音與音訊編碼)且可具有額外應用領域例如用於LD-USAC(低延遲統一語音與音訊編碼)。A prime example of a linear predictive coder is the ACELP coder (ACELP = Algebraic Code Excited Linear Predictive Encoder). ACELP encoders are widely used, for example for USAC (USAC = Unified Voice and Audio Coding) and may have additional fields of application such as for LD-USAC (Low Delay Unified Voice and Audio Coding).

ACELP編碼器通道係藉決定預測濾波器係數而編碼音訊信號。為了達成更佳編碼,ACELP編碼器基於欲編碼之音訊信號,及基於已經決定的預測濾波器係數而決定殘差信號,又稱目標信號。殘差信號例如可以是差信號,表示欲編碼之音訊信號與,藉預測濾波器係數及可能地藉音準(pitch)分析所得的適應性濾波器係數編碼之該信號部分間 之差值。然後ACELP編碼器針對編碼該殘差信號。為了達成該項目的,編碼器編碼代數碼簿參數,該等代數碼簿參數係用來表示該殘差信號。The ACELP encoder channel encodes the audio signal by determining the prediction filter coefficients. In order to achieve better coding, the ACELP encoder determines the residual signal, also known as the target signal, based on the audio signal to be encoded and based on the determined prediction filter coefficients. The residual signal may be, for example, a difference signal indicating the audio signal to be encoded and the portion of the signal encoded by the predictive filter coefficients and possibly the adaptive filter coefficients obtained by pitch analysis. The difference. The ACELP encoder then encodes the residual signal. To achieve this item, the encoder encodes the algebraic book parameters, which are used to represent the residual signal.

為了編碼該殘差信號,使用代數碼簿。通常代數碼簿包含多個音軌,例如四個音軌各自包含16音軌位置。於此種組態中,藉個別代數碼簿可表示共計4x16=64樣本位置,相對應於欲編碼的音訊信號之一子訊框的樣本數目。In order to encode the residual signal, a codebook is used. A typical digital book contains multiple tracks, for example four tracks each containing 16 track positions. In this configuration, a single generation digital book can represent a total of 4x16=64 sample positions, corresponding to the number of samples of one of the sub-frames of the audio signal to be encoded.

碼簿之該等音軌可以交插,使得碼簿之音軌0可表示該子訊框之樣本0、4、8、...、60,使得碼簿之音軌1可表示該子訊框之樣本1、5、9、...、61,使得碼簿之音軌2可表示該子訊框之樣本2、6、10、...、62,及使得碼簿之音軌3可表示該子訊框之樣本3、7、11、...、63。各個音軌可有固定數目之脈衝。或者每個音軌之脈衝數目可改變,例如取決於其它狀況而改變。脈衝例如可以是正或負,例如可以+1(正脈衝)或0(負脈衝)表示。The tracks of the codebook can be interleaved so that the track 0 of the codebook can represent the samples 0, 4, 8, ..., 60 of the subframe, so that the track 1 of the codebook can represent the message. The samples 1, 5, 9, ..., 61 of the frame enable the track 2 of the codebook to represent the samples 2, 6, 10, ..., 62 of the subframe, and the track 3 of the code book Samples 3, 7, 11, ..., 63 of the subframe can be represented. Each track can have a fixed number of pulses. Or the number of pulses per track can vary, for example depending on other conditions. The pulse can be, for example, positive or negative, for example +1 (positive pulse) or 0 (negative pulse).

為了編碼該殘差信號,當編碼時,碼簿組態可經選擇使得最佳表示該殘差信號之其餘信號。為了達成該項目的,可用脈衝可位在最佳地反映出欲編碼的信號位置之適當音軌位置。此外,可載明相對應脈衝為正或為負。To encode the residual signal, when encoded, the codebook configuration can be selected such that the remaining signals of the residual signal are optimally represented. In order to achieve this, the available pulses can be positioned to best reflect the appropriate track position of the signal position to be encoded. In addition, it can be stated that the corresponding pulse is positive or negative.

於解碼器端,ACELP解碼器將至少解碼代數碼簿參數。ACELP解碼器也可解碼適應性碼簿參數。為了決定代數碼簿參數,ACELP解碼器可針對代數碼簿之各個音軌決定多個脈衝位置。此外,ACELP解碼器也可解碼在一音軌位置的脈衝是正或負脈衝。又復,ACELP解碼器也可解碼 適應性碼簿參數。基於此項資訊,ACELP解碼器通常產生激勵信號。然後,ACELP解碼器施加該等預測濾波器係數於該激勵信號來產生合成音訊信號而獲得解碼音訊信號。At the decoder side, the ACELP decoder will decode at least the generation code parameters. The ACELP decoder can also decode adaptive codebook parameters. To determine the generation of digital book parameters, the ACELP decoder can determine multiple pulse positions for each track of the digital book. In addition, the ACELP decoder can also decode pulses at a track position that are positive or negative pulses. Again, the ACELP decoder can also decode Adaptive codebook parameters. Based on this information, ACELP decoders typically generate an excitation signal. The ACELP decoder then applies the predictive filter coefficients to the excitation signal to produce a composite audio signal to obtain a decoded audio signal.

於ACELP中,音軌上的脈衝通常係編碼如下。若該音軌具有長度16,及若此一音軌上的脈衝數目為1,則可藉其位置(4位元)及符號(1位元)共計5位元編碼脈衝位置。若該音軌具有長度16及脈衝數目為2,則第一脈衝係藉其位置(4位元)及符號(1位元)編碼。至於第二脈衝,則只需編碼位置(4位元),原因在於若該第二脈衝係在第一脈衝左方則可選擇第二脈衝之符號為正,若係在第一脈衝右方則可選擇第二脈衝之符號為正,及若係在第一脈衝之相同位置則可選擇第二脈衝為與第一脈衝相同符號。因此總計,共需9位元來編碼兩個脈衝。比起分開地各藉5位元編碼脈衝位置,因而針對每一對脈衝節省1位元。In ACELP, the pulses on the track are usually encoded as follows. If the track has a length of 16, and if the number of pulses on the track is 1, the pulse position can be encoded by a total of 5 bits by its position (4 bits) and the symbol (1 bit). If the track has a length of 16 and the number of pulses is 2, the first pulse is encoded by its position (4 bits) and the symbol (1 bit). As for the second pulse, only the coding position (4 bits) is needed, because if the second pulse is to the left of the first pulse, the sign of the second pulse can be selected to be positive, if it is to the right of the first pulse. The sign of the second pulse can be selected to be positive, and if it is at the same position of the first pulse, the second pulse can be selected to be the same symbol as the first pulse. Therefore, in total, a total of 9 bits are required to encode two pulses. The 5-bit encoded pulse position is borrowed separately from each other, thus saving 1 bit for each pair of pulses.

編碼比2更大的脈衝數目,可成對地編碼脈衝,及若脈衝數為奇數,則分開地編碼最末脈衝。如此則例如針對5脈衝的音軌,需要9+9+5=23位元。若有4音軌,則要求4 x 23=92位元來編碼具有4音軌及每個音軌5脈衝的長度64之子訊框。但若位元數可以更進一步減少則更有價值。The number of pulses larger than 2 can be encoded in pairs, and if the number of pulses is odd, the last pulse is encoded separately. Thus, for example, for a 5-pulse track, 9+9+5=23 bits are required. If there are 4 tracks, 4 x 23 = 92 bits are required to encode a sub-frame with a length of 64 tracks of 4 tracks and 5 pulses per track. But it is more valuable if the number of bits can be further reduced.

若能提供具有改良編碼或解碼構思的編碼裝置及個別的解碼裝置將極有價值,該裝置具有針對脈衝資訊表示型態使用較少位元而以改良方式來編碼或解碼脈衝資訊的手段,原因在於如此將例如減低傳輸個別地編碼音訊信號之傳輸速率,及又復,如此將例如減少儲存個別地編碼音訊 信號需要的儲存空間。It would be extremely valuable to provide an encoding device with an improved coding or decoding concept and an individual decoding device that has means for encoding or decoding pulse information in an improved manner using fewer bits for the pulsed information representation type, In this way, for example, the transmission rate of the individually encoded audio signal is reduced, and again, which will, for example, reduce the storage of individually encoded audio. The storage space required for the signal.

因此本發明之一目的係提供音訊信號之音軌脈衝的編碼及解碼之改良構思。本發明之目的可藉如申請專利範圍第1項之解碼裝置、如申請專利範圍第9項之編碼裝置、如申請專利範圍第13項之解碼方法、如申請專利範圍第14項之編碼方法、及如申請專利範圍第15項之電腦程式達成。It is therefore an object of the present invention to provide an improved concept for encoding and decoding audio track pulses of an audio signal. The object of the present invention is the decoding device of claim 1 of the patent application, the encoding device of claim 9 of the patent application, the decoding method of claim 13 of the patent application, the encoding method of claim 14 of the patent scope, And the computer program as claimed in item 15 of the patent application is reached.

依據實施例,假設一個狀態數目可供解碼裝置使用。又復假設指示與該編碼音訊信號相聯結的至少一個音軌之音軌位置總數的音軌位置數目、及指示至少一個音軌之脈衝數目的總脈衝數目可供本發明之解碼裝置使用。較佳地,該音軌位置數目及該總脈衝數目係可供與該編碼音訊信號相聯結的各個音軌使用。According to an embodiment, it is assumed that a number of states is available to the decoding device. Still further hypothesized that the number of track positions indicating the total number of track positions of at least one track associated with the encoded audio signal, and the total number of pulses indicating the number of pulses of at least one track are available for use by the decoding apparatus of the present invention. Preferably, the number of track positions and the total number of pulses are available for individual tracks associated with the encoded audio signal.

舉例言之,具有4音軌有5脈衝,各自可達成粗略地6.6 x 10^21狀態,依據實施例,可藉73位元編碼,比較前述最先進編碼器使用92位元編碼,更有效約21%。For example, there are 5 pulses with 4 tracks, each of which can achieve a coarse 6.6 x 10^21 state. According to an embodiment, 73-bit encoding can be used, and the most advanced encoder is compared with 92-bit encoding, which is more effective. twenty one%.

首先,提供一種構思,如何以有效方式編碼一音訊信號之一音軌的多個脈衝位置。後文中,該構思擴延成允許不僅編碼一音軌的脈衝位置,同時也允許編碼該脈衝是正或負。此外,然後該構思擴延成允許以有效方式針對多個音軌編碼脈衝資訊。此等構思同等適用於解碼器端。First, an idea is provided on how to encode a plurality of pulse positions of a track of an audio signal in an efficient manner. In the following, the concept is extended to allow not only the pulse position of a track to be encoded, but also to allow the pulse to be positive or negative. Furthermore, the idea is then extended to allow pulse information to be encoded for a plurality of tracks in an efficient manner. These concepts are equally applicable to the decoder side.

此外,實施例又更基於發現若編碼策略使用預定位元數目,使得在各個音軌上有相同脈衝數目的任一種組態皆要求相等位元數目。若可用位元數目為固定,則可能直接 地選定使用該給定位元量可編碼多少個脈衝,如此允許以預定品質編碼。此外,使用此種辦法,無需嘗試不等量脈衝直到達成期望的位元率,反而可以直接地選擇正確脈衝量,因而減低複雜度。Moreover, the embodiments are further based on the finding that if the encoding strategy uses a predetermined number of bits, any configuration having the same number of pulses on each track requires an equal number of bits. If the number of available bits is fixed, it may be direct The number of pulses that can be encoded using the given positioning element is selected so as to allow encoding with a predetermined quality. Moreover, with this approach, there is no need to try unequal pulses until the desired bit rate is achieved, instead the direct pulse amount can be directly selected, thus reducing complexity.

基於前述假設,可編碼及/或解碼一音訊信號訊框之一音軌的多個脈衝位置。Based on the foregoing assumptions, a plurality of pulse positions of one of the tracks of an audio signal frame can be encoded and/or decoded.

雖然本發明可採用來編碼或解碼任一種音訊信號,例如語音信號或樂音信號,本發明特別可用於編碼或解碼語音信號。Although the present invention can be used to encode or decode any type of audio signal, such as a voice signal or a tone signal, the present invention is particularly useful for encoding or decoding a voice signal.

於另一實施例中,脈衝資訊解碼器又更適用以使用音軌位置數目、總脈衝數目、及狀態數目來解碼多個脈衝符號,其中該等脈衝符號中之各一者指示多個脈衝中之一者的符號。信號解碼器可適用以又更使用多個脈衝符號,藉產生合成音訊信號而解碼該編碼音訊信號。In another embodiment, the pulse information decoder is further adapted to decode a plurality of pulse symbols using the number of track positions, the total number of pulses, and the number of states, wherein each of the pulse symbols indicates a plurality of pulses One of the symbols. The signal decoder can be adapted to use a plurality of pulse symbols to decode the encoded audio signal by generating a composite audio signal.

依據又一實施例,其中該等一或多個音軌可包含至少一個最末音軌及一或多個其它音軌,該脈衝資訊解碼器可適用以從該狀態數目產生一第一子狀態數目及一第二子狀態數目。該脈衝資訊解碼器可經組配來基於該第一子狀態數目而解碼該等脈衝位置之一第一群組,及該脈衝資訊解碼器更可經組配來基於該第二子狀態數目而解碼該等脈衝位置之一第二群組。該等脈衝位置之該第二群組可只包含指示該最末音軌之音軌位置的脈衝位置。該等脈衝位置之該第一群組可只包含指示該等一或多個其它音軌之音軌位置的脈衝位置。In accordance with yet another embodiment, wherein the one or more tracks can include at least one last track and one or more other tracks, the pulse information decoder can be adapted to generate a first sub-state from the number of states The number and the number of second sub-states. The pulse information decoder can be configured to decode a first group of the pulse positions based on the first number of sub-states, and the pulse information decoder can be further configured to be based on the second number of sub-states A second group of one of the pulse positions is decoded. The second group of the pulse positions may only include a pulse position indicating the position of the track of the last track. The first group of the pulse positions may only include pulse positions indicative of the track positions of the one or more other tracks.

依據另一實施例,脈衝資訊解碼器可經組配來藉將該狀態數目除以f(pk ,N)而獲得一整數部分及一餘數作為除法結果來產生該第一子狀態數目及該第二子狀態數目,其中該整數部分為該第一子狀態數目,及其中該餘數為該第二子狀態數目,其中pk 指示針對該等一或多個音軌中之各一者的脈衝數目,及其中N指示針對該等一或多個音軌中之各一者的音軌位置數目。此處,f(pk ,N)乃返回於長度N具有pk 脈衝的音軌中可達成的狀態數目之函式。According to another embodiment, the pulse information decoder may be configured to obtain an integer part and a remainder as a division result by dividing the number of states by f(p k , N) to generate the first sub-state number and the a second number of sub-states, wherein the integer portion is the first sub-state number, and wherein the remainder is the second sub-state number, wherein pk indicates a pulse for each of the one or more tracks The number, and N thereof, indicates the number of track positions for each of the one or more tracks. Here, f(p k , N) is a function that returns the number of states achievable in the track of length N having a p k pulse.

於另一實施例中,該脈衝資訊解碼器可適用以進行一測試比較該狀態數目或一已更新狀態數目與一臨界值。In another embodiment, the pulse information decoder is adapted to perform a test to compare the number of states or an updated number of states with a threshold.

該脈衝資訊解碼器可適用以藉比較該狀態數目或一已更新狀態數目是否大於、大於或等於、小於、或小於或等於該臨界值而進行該測試,及其中該脈衝資訊解碼器又更適用以取決於該測試結果而更新該狀態數目或一已更新狀態數目。The pulse information decoder can be adapted to perform the test by comparing whether the number of states or an updated number of states is greater than, greater than or equal to, less than, or less than or equal to the threshold, and wherein the pulse information decoder is more suitable The number of states or an updated number of states is updated depending on the test result.

於一實施例中,該脈衝資訊解碼器可經組配來針對該等多個音軌中之一者的各個音軌位置,比較該狀態數目或該已更新狀態數目與該臨界值。In one embodiment, the pulse information decoder can be configured to compare the number of states or the number of updated states to the threshold for each track position of one of the plurality of tracks.

依據一實施例,該脈衝資訊解碼器可經組配來劃分該等音軌中之一者成為包含該等多個音軌位置中之至少兩個音軌位置之一第一音軌區劃,及成為包含該等多個音軌位置中之至少另兩個音軌位置之一第二音軌區劃。該脈衝資訊解碼器可經組配來基於該狀態數目而產生一第一子狀態數目及一第二子狀態數目。此外,該脈衝資訊解碼器可經 組配來基於該第一子狀態數目,解碼與該第一音軌區劃相聯結的脈衝位置之一第一群組。又復,該脈衝資訊解碼器可經組配來基於該第二子狀態數目,解碼與該第二音軌區劃相聯結的脈衝位置之一第二群組。According to an embodiment, the pulse information decoder may be configured to divide one of the audio tracks into a first track region including one of the at least two track positions of the plurality of track positions, and A second track zone that includes one of the at least two other track positions of the plurality of track positions. The pulse information decoder can be configured to generate a first number of sub-states and a second sub-state number based on the number of states. In addition, the pulse information decoder can be And assigning to decode a first group of pulse positions associated with the first track zone based on the first number of substates. Still further, the pulse information decoder can be configured to decode a second group of one of the pulse locations associated with the second track region based on the second number of substates.

依據一實施例,提出一種編碼一音訊信號之裝置。該裝置包含一信號處理器適用以決定與該音訊信號相聯結的多個預測濾波器係數,用以基於該音訊信號及該等多個預測濾波器係數而產生一殘差信號。此外,該裝置包含一脈衝資訊編碼器適用以編碼與一或多個音軌相關的多個脈衝位置來編碼該音訊信號,該等一或多個音軌係與該殘差信號相聯結。該等音軌中之各一者具有多個音軌位置及多個脈衝。該等脈衝位置中之各一者指示該等音軌中之一者的該等音軌位置中之一者來指示該音軌之該等脈衝中之一者的位置。該脈衝資訊編碼器係經組配來藉產生一狀態數目而編碼該等多個脈衝位置,使得該等脈衝位置唯有基於該狀態數目、指示該等音軌中之至少一者的音軌位置總數之一音軌位置數目、及指示該等音軌中之至少一者的脈衝總數之一總脈衝數目才能被解碼。According to an embodiment, an apparatus for encoding an audio signal is presented. The apparatus includes a signal processor adapted to determine a plurality of prediction filter coefficients coupled to the audio signal for generating a residual signal based on the audio signal and the plurality of prediction filter coefficients. Additionally, the apparatus includes a pulse information encoder adapted to encode the plurality of pulse positions associated with one or more tracks to encode the audio signal, the one or more tracks being associated with the residual signal. Each of the tracks has a plurality of track positions and a plurality of pulses. Each of the pulse positions indicates one of the track positions of one of the tracks to indicate the position of one of the pulses of the track. The pulse information encoder is configured to encode the plurality of pulse positions by generating a number of states such that the pulse positions are only based on the number of states, indicating a track position of at least one of the tracks The number of track positions, one of the total number, and the total number of pulses indicating the total number of pulses of at least one of the tracks can be decoded.

依據另一實施例,該脈衝資訊編碼器可適用以編碼多個脈衝符號,其中該等脈衝符號中之各一者指示該等多個脈衝中之一者之一符號。該脈衝資訊編碼器更可經組配來藉產生該狀態數目而編碼該等多個脈衝符號,使得該等脈衝符號唯有基於該狀態數目、指示該等音軌中之至少一者的音軌位置總數之該音軌位置數目、及該總脈衝數目才能 被解碼。In accordance with another embodiment, the pulse information encoder is operative to encode a plurality of pulse symbols, wherein each of the pulse symbols indicates one of the plurality of pulses. The pulse information encoder may be further configured to encode the plurality of pulse symbols by generating the number of states, such that the pulse symbols are only based on the number of states, indicating a track of at least one of the tracks The total number of positions of the track and the total number of pulses Is decoded.

於一實施例中,該脈衝資訊編碼器係經組配來針對該等音軌中之一者的各個音軌位置,將一整數值加至針對於一音軌位置的各個脈衝之一中間數目來獲得該狀態數目。In one embodiment, the pulse information encoder is configured to add an integer value to an intermediate number of each of the pulses for a track position for each track position of one of the tracks. To get the number of states.

依據另一實施例,該脈衝資訊編碼器可經組配來劃分該等音軌中之一者成為包含該等多個音軌位置中之至少兩個音軌位置之一第一音軌區劃,及成為包含該等多個音軌位置中之至少另兩個音軌位置之一第二音軌區劃。此外,該脈衝資訊編碼器可經組配來編碼與該第一區劃相聯結的之一第一子狀態數目。又復,該脈衝資訊編碼器可經組配來編碼與該第二區劃相聯結的之一第二子狀態數目。又,該脈衝資訊編碼器可經組配來組合該第一子狀態數目與該第二子狀態數目而獲得該狀態數目。In accordance with another embodiment, the pulse information encoder can be configured to divide one of the audio tracks into a first track zone that includes one of the at least two of the plurality of track positions. And becoming a second track zone comprising one of the at least two other track positions of the plurality of track positions. Additionally, the pulse information encoder can be assembled to encode a first number of sub-states associated with the first zone. Further, the pulse information encoder can be assembled to encode a second sub-state number associated with the second zone. Moreover, the pulse information encoder can be assembled to combine the first sub-state number and the second sub-state number to obtain the state number.

圖式簡單說明Simple illustration

後文中,將就附圖以進一步細節描述本發明之實施例,附圖中:第1圖顯示依據一實施例用以解碼一編碼音訊信號之裝置,第2圖顯示依據一實施例用以編碼一音訊信號之裝置,第3圖顯示針對具有兩個無符號脈衝及三個音軌位置之一音軌的全部可能組態,第4圖顯示針對具有一個有符號脈衝及兩個音軌位置之一音軌的全部可能組態,第5圖顯示針對具有兩個有符號脈衝及兩個音軌位置 之一音軌的全部可能組態,第6圖為例示說明一實施例之流程圖,闡釋依據一實施例藉脈衝資訊解碼器所進行的處理步驟,及第7圖為例示說明一實施例之流程圖,該流程圖闡釋依據一實施例藉脈衝資訊編碼器所進行的處理步驟。In the following, embodiments of the invention will be described in further detail with reference to the accompanying drawings in which: FIG. 1 shows an apparatus for decoding an encoded audio signal in accordance with an embodiment, and FIG. 2 shows an encoding for use in accordance with an embodiment. An apparatus for an audio signal, Figure 3 shows all possible configurations for a track with two unsigned pulses and three track positions, and Figure 4 shows a position with one signed pulse and two tracks. All possible configurations of one track, Figure 5 shows for two signed pulses and two track positions All of the possible configurations of one of the tracks, FIG. 6 is a flow chart illustrating an embodiment, illustrating the processing steps performed by the pulse information decoder in accordance with an embodiment, and FIG. 7 is an illustration of an embodiment. Flowchart, which illustrates the processing steps performed by the pulse information encoder in accordance with an embodiment.

第1圖例示說明解碼一編碼音訊信號之裝置,其中一或多個音軌係與該編碼音訊信號相聯結,該等音軌中之各一者具有多個音軌位置及多個脈衝。Figure 1 illustrates an apparatus for decoding an encoded audio signal, wherein one or more audio tracks are associated with the encoded audio signal, each of the audio tracks having a plurality of track positions and a plurality of pulses.

該裝置包含一脈衝資訊解碼器110及一信號解碼器120。該脈衝資訊解碼器110係適用以解碼多個脈衝位置。該等脈衝位置中之各一者指示該等音軌中之一者的該等音軌位置中之一者來指示該音軌之該等脈衝中之一者的位置。The device includes a pulse information decoder 110 and a signal decoder 120. The pulse information decoder 110 is adapted to decode a plurality of pulse positions. Each of the pulse positions indicates one of the track positions of one of the tracks to indicate the position of one of the pulses of the track.

該脈衝資訊解碼器110係經組配來藉由使用指示該等音軌中之至少一者的音軌位置總數之一音軌位置數目、指示該等音軌中之至少一者的脈衝總數之一總脈衝數目、及一個狀態數目而解碼該等多個脈衝位置。The pulse information decoder 110 is configured to indicate the total number of pulses of at least one of the tracks by using a number of track positions indicating the total number of track positions of at least one of the tracks. The plurality of pulse positions are decoded by a total number of pulses and a number of states.

該信號解碼器120係適用於藉由使用該等多個脈衝位置及與該編碼音訊信號相聯結的多個預測濾波器係數而產生一合成音訊信號來解碼該編碼音訊信號。The signal decoder 120 is adapted to decode the encoded audio signal by generating a composite audio signal using the plurality of pulse positions and a plurality of prediction filter coefficients coupled to the encoded audio signal.

狀態數目為已經依據後述實施例藉編碼器編碼之數目。狀態數目例如包含於一精簡表示型態中有關多個脈衝位置之資訊,該表示型態例如為要求少數位元的表示型 態,及當有關該音軌位置數目及總脈衝數目之資訊在該解碼器為可利用時可被解碼的表示型態。The number of states is the number of codes that have been encoded by the encoder in accordance with the embodiments described later. The number of states is, for example, included in a reduced representation of information about a plurality of pulse positions, such as a representation requiring a small number of bits. State, and a representation that can be decoded when the information about the number of track positions and the total number of pulses is available to the decoder.

於一實施例中,該音訊信號之一個音軌或各個音軌之該音軌位置數目及/或總脈衝數目在該解碼器為可資利用,原因在於該音軌位置數目及/或總脈衝數目為不變的靜態數值且為接收器所已知。舉例言之,針對各個音軌,該音軌位置數目可以經常性為16,及總脈衝數目可以經常性為4。In one embodiment, the number of track positions and/or the total number of pulses of one track or each track of the audio signal is available at the decoder due to the number of track positions and/or total pulses. The number is a constant static value and is known to the receiver. For example, for each track, the number of track positions can be often 16 and the total number of pulses can be 4 frequently.

於另一實施例中,該音訊信號之一個音軌或各個音軌之該音軌位置數目及/或總脈衝數目例如可藉編碼裝置而明確地傳輸至該解碼裝置。In another embodiment, the number of track positions and/or the total number of pulses of one track or each track of the audio signal can be explicitly transmitted to the decoding device by, for example, an encoding device.

於又一實施例中,解碼器可決定該音訊信號之一個音軌或各個音軌之該音軌位置數目及/或總脈衝數目,該決定方式係藉分析並不明確地陳述該音軌位置數目及/或總脈衝數目的其它參數,反而從該等其它參數可推衍出該音軌位置數目及/或總脈衝數目。In still another embodiment, the decoder may determine the number of track positions and/or the total number of pulses of one track or each track of the audio signal, and the decision manner does not explicitly state the track position by analysis. The number and/or other parameters of the total number of pulses may instead derive from the other parameters the number of track positions and/or the total number of pulses.

於其它實施例中,解碼器可分析可利用來推衍出該音訊信號之一個音軌或各個音軌之該音軌位置數目及/或總脈衝數目之其它資料。In other embodiments, the decoder may analyze other data that may be utilized to derive a track of the audio signal or the number of track positions and/or total number of pulses for each track.

於更一實施例中,該脈衝資訊解碼器可適用以也解碼一脈衝為正脈衝或負脈衝。In a further embodiment, the pulse information decoder is adapted to also decode a pulse as a positive or negative pulse.

於另一實施例中,該脈衝資訊解碼器又更可適用以解碼脈衝資訊,包含有關多個音軌之脈衝的資訊。脈衝資訊例如可以是有關於一音軌中的脈衝位置資訊及/或一脈衝 為正脈衝或負脈衝之資訊。In another embodiment, the pulse information decoder is further adapted to decode pulse information, including information about pulses of a plurality of tracks. The pulse information may be, for example, related to pulse position information and/or a pulse in a track. Information for positive or negative pulses.

第2圖例示說明編碼音訊信號之裝置,包含信號處理器210及脈衝資訊編碼器220。FIG. 2 illustrates an apparatus for encoding an audio signal, including a signal processor 210 and a pulse information encoder 220.

信號處理器210係適用以決定與該音訊信號相聯結的多個預測濾波器係數,用以基於該音訊信號及該等多個預測濾波器係數而產生一殘差信號。The signal processor 210 is adapted to determine a plurality of prediction filter coefficients associated with the audio signal for generating a residual signal based on the audio signal and the plurality of prediction filter coefficients.

脈衝資訊編碼器220係適用以編碼與一或多個音軌相關的多個脈衝位置來編碼該音訊信號。該等一或多個音軌係與由信號處理器210所產生的該殘差信號相聯結。該等音軌中之各一者具有多個音軌位置及多個脈衝。此外,該等脈衝位置中之各一者指示該等音軌中之一者的該等音軌位置中之一者來指示該音軌之該等脈衝中之一者的位置。The pulse information encoder 220 is adapted to encode the plurality of pulse positions associated with one or more tracks to encode the audio signal. The one or more tracks are coupled to the residual signal generated by signal processor 210. Each of the tracks has a plurality of track positions and a plurality of pulses. Additionally, each of the pulse positions indicates one of the track positions of one of the tracks to indicate the position of one of the pulses of the track.

該脈衝資訊編碼器220係經組配來藉產生一狀態數目而編碼該等多個脈衝位置,使得該等脈衝位置唯有基於該狀態數目、指示該等音軌中之至少一者的音軌位置總數之一音軌位置數目、及指示該等音軌中之至少一者的脈衝總數之一總脈衝數目才能被解碼。The pulse information encoder 220 is configured to encode the plurality of pulse positions by generating a number of states such that the pulse positions are only based on the number of states, indicating a track of at least one of the tracks The number of track positions, one of the total number of positions, and the total number of pulses indicating the total number of pulses of at least one of the tracks can be decoded.

後文中,呈示有關藉產生一狀態數目而編碼脈衝位置及可能地編碼脈衝符號(正脈衝或負脈衝)之本發明之實施例的基本構思。In the following, the basic idea of an embodiment of the invention for encoding a pulse position and possibly encoding a pulse symbol (positive pulse or negative pulse) by generating a number of states is presented.

本發明之實施例的編碼原理係植基於下述發現,若考慮於具有n個音軌位置之一音軌中的k個脈衝之全部可能組態之狀態列舉,則係足以編碼一音軌之脈衝的實際狀態。藉儘可能少數位元編碼此種狀態,提供期望的精簡編碼。 藉此,呈示狀態列舉之構思,其中脈衝位置及可能也脈衝符號的各叢表示一個狀態,及各個狀態係獨特地列舉。The coding principle of an embodiment of the present invention is based on the finding that it is sufficient to encode a track if one considers a state list of all possible configurations of k pulses in one of the n track positions. The actual state of the pulse. The desired reduced coding is provided by encoding such a state with as few bits as possible. Thereby, the concept of the presentation state is listed, wherein the pulse positions and the clusters which may also be pulse symbols represent one state, and each state is uniquely enumerated.

第3圖針對簡單情況例示說明此點,於該處闡釋全部可能的組態,此時考慮具有2脈衝及3音軌位置之一音軌。2脈衝可位在相同音軌位置。於第3圖之實例中,不考慮脈衝的符號(例如脈衝是正或負),例如於此一實例中,全部脈衝皆可視為正脈衝。Figure 3 illustrates this point for a simple case where all possible configurations are explained, in which case one of the 2 pulse and 3 track positions is considered. 2 pulses can be placed at the same track position. In the example of Figure 3, the sign of the pulse (e.g., the pulse is positive or negative) is not considered. For example, in this example, all pulses can be considered as positive pulses.

於第3圖中,例示說明針對位在具有3音軌位置(第3圖中:音軌位置1、2及3)之一音軌中兩個無方向性脈衝的全部可能組態。只有六個不同的可能狀態(第3圖中標示為0至5),描述脈衝如何分布在該音軌。藉此,使用0至5之範圍的狀態數目來描述所呈示的實際組態即足。舉例言之,若第3圖實例的狀態數目具有數值(4),且若解碼器知曉編碼方案,則解碼器可獲得結論狀態數目=4,表示該音軌具有一個脈衝在音軌位置0,及另一個脈衝在音軌位置2。於是於第3圖之實例中,3位元即足以編碼狀態數目來識別第3圖之實例的六個不同狀態中之一者。In Fig. 3, all possible configurations for two non-directional pulses in a track having one of three track positions (Fig. 3: track positions 1, 2 and 3) are illustrated. There are only six different possible states (labeled 0 to 5 in Figure 3) that describe how the pulses are distributed over the track. Thereby, the number of states in the range of 0 to 5 is used to describe the actual configuration presented. For example, if the number of states of the example of FIG. 3 has a value (4), and if the decoder knows the encoding scheme, the decoder can obtain the number of conclusion states = 4, indicating that the track has a pulse at the track position 0, And another pulse at track position 2. Thus, in the example of Figure 3, the 3-bit is sufficient to encode the number of states to identify one of the six different states of the example of Figure 3.

第4圖例示說明針對位在具有2音軌位置(第4圖中:音軌位置1及2)之一音軌中一個方向性脈衝的全部可能組態。第4圖中考慮脈衝之符號(例如脈衝是正或負)。有四個不同的可能狀態(第4圖中標示為0至3),描述脈衝如何分布在該音軌,及也描述其符號(正或負)。使用0至3之範圍的狀態數目來描述所呈示的實際組態即足。舉例言之,若第4圖實例的狀態數目具有數值(2),且若解碼器知曉編碼方案, 則解碼器可獲得結論狀態數目=2,表示該音軌具有一個脈衝在音軌位置1,及該脈衝為正脈衝。Figure 4 illustrates all possible configurations for a directional pulse in a track having a position of 2 tracks (in Figure 4: track positions 1 and 2). The sign of the pulse is considered in Figure 4 (for example, the pulse is positive or negative). There are four different possible states (labeled 0 to 3 in Figure 4) that describe how the pulse is distributed over the track and also describe its sign (positive or negative). The number of states in the range 0 to 3 is used to describe the actual configuration presented. For example, if the number of states of the example of FIG. 4 has a value (2), and if the decoder knows the encoding scheme, Then the decoder can get the number of conclusion states = 2, indicating that the track has a pulse at the track position 1, and the pulse is a positive pulse.

第5圖例示說明又另一情況,於該處闡釋當考慮具有2脈衝及2音軌位置的一音軌時的全部可能組態。脈衝可位在相同音軌位置。於第5圖所示實例中,考慮脈衝之符號(例如脈衝是正或負)。假設在同一個音軌位置的脈衝具有相同符號(例如在同一個音軌位置的音軌脈衝乃全正或全負)。Figure 5 illustrates yet another scenario where all possible configurations when considering a track with 2 pulses and 2 track positions are explained. Pulses can be placed at the same track position. In the example shown in Figure 5, consider the sign of the pulse (for example, the pulse is positive or negative). It is assumed that the pulses at the same track position have the same sign (for example, the track pulses at the same track position are all positive or negative).

第5圖中例示說明位在具有2音軌位置(第5圖中:音軌位置1及2)之一音軌中兩個有符號脈衝(例如脈衝是正或負)的全部可能組態。只有八個不同的可能狀態(第5圖中標示為0至7),描述脈衝如何分布在該音軌。藉此,使用0至7之範圍的狀態數目來描述所呈示的實際組態即足。舉例言之,若第5圖實例的狀態數目具有數值(3),且若解碼器知曉編碼方案,則解碼器可獲得結論狀態數目=3,表示該音軌具有一個脈衝在音軌位置0,及另一個脈衝在音軌位置1而該脈衝為負。於是於第5圖之實例中,3位元即足以編碼狀態數目來識別第5圖之實例的八個不同狀態中之一者。Figure 5 illustrates all possible configurations of two signed pulses (e.g., positive or negative pulses) in a track having one of two track positions (Fig. 5: track positions 1 and 2). There are only eight different possible states (labeled 0 to 7 in Figure 5) that describe how the pulses are distributed over the track. Thereby, the number of states in the range of 0 to 7 is used to describe the actual configuration presented. For example, if the number of states of the example of FIG. 5 has a value (3), and if the decoder knows the encoding scheme, the decoder can obtain the number of conclusion states = 3, indicating that the track has a pulse at the track position 0, And another pulse is at track position 1 and the pulse is negative. Thus, in the example of Figure 5, the 3-bit is sufficient to encode the number of states to identify one of the eight different states of the example of Figure 5.

於ACELP中,殘差信號可藉固定數目的有符號脈衝編碼。如前述,脈衝例如可分布在四個交插音軌,使得音軌0含有位置mod(n,4)==0,音軌=1含有位置mod(n,4)==1,等等。各個音軌可有經預先界定的有符號單位脈衝數目,脈衝可重疊,但重疊時脈衝有相同符號。In ACELP, the residual signal can be encoded by a fixed number of signed pulses. As mentioned above, the pulses can be distributed, for example, over four interlaced tracks such that track 0 contains position mod(n, 4) = = 0, track = 1 contains position mod(n, 4) = = 1, and so on. Each track may have a pre-defined number of signed unit pulses, the pulses may overlap, but the pulses have the same sign when overlapped.

藉編碼脈衝,須達成從脈衝位置及其符號對映至使用最小可能的位元數量之表示型態。此外,脈衝編碼須具有 固定的位元耗用量,亦即任何脈衝叢具有相等數目之位元。By means of the coded pulse, a representation from the pulse position and its sign to the smallest possible number of bits must be achieved. In addition, the pulse code must have The fixed bit consumption, that is, any pulse bundle has an equal number of bits.

各音軌係首先獨立地編碼,及然後各個音軌之狀態被組合成一個數目,表示整個子訊框之狀態。此一辦法給予數學上最佳的位元耗用,給定全部狀態有相等機率,及位元耗用量乃固定。Each track is first independently encoded, and then the states of the individual tracks are combined into a single number representing the state of the entire sub-frame. This method gives the mathematically optimal bit consumption, giving all states a equal probability, and the bit consumption is fixed.

狀態列舉構思可運用不同狀態叢之精簡表示型態說明:設欲編碼之殘差信號為xn 。假設考慮例如代數碼簿之四個交插音軌,則第一音軌具有樣本x0 、x4 、x8 、...xN-4 ,第二音軌具有樣本x1 、x5 、x9 、...xN-3 等。假設第一音軌係使用一個有符號單位脈衝量化及T=8,故音軌長度為2(T=欲編碼之殘差信號長度(樣本)。若t=8,及若4音軌係用來編碼殘差信號,則4音軌中之各一者具有2音軌位置。舉例言之,第一音軌可考慮具有兩個音軌位置x0及x4。則第一音軌之脈衝出現在下列中之任一者: The state enumeration concept can use a reduced representation of different state clusters: the residual signal to be encoded is x n . Assuming that four interleaved tracks, such as a generational digital book, are considered, the first track has samples x 0 , x 4 , x 8 , ... x N-4 , and the second track has samples x 1 , x 5 , x 9 ,...x N-3, etc. Assuming that the first track is quantized using a signed unit pulse and T=8, the track length is 2 (T = the residual signal length (sample) to be encoded. If t=8, and if 4 tracks are used To encode the residual signal, each of the 4 tracks has 2 track positions. For example, the first track can be considered to have two track positions x0 and x4. Then the pulse of the first track appears in Any of the following:

此一組態有四個不同狀態。This configuration has four different states.

同理,若第一音軌有兩個脈衝,第一音軌具有兩個音軌位置x0及x4。則脈衝可分配在下列脈衝叢: Similarly, if the first track has two pulses, the first track has two track positions x0 and x4. Then the pulse can be assigned to the following pulse bundles:

因而此一組態有8狀態。Thus this configuration has 8 states.

若殘差信號之長度擴延至T=12,則4音軌各自有3音軌位置。第一音軌獲得多一個樣本,現在有音軌位置x0、x4及x8,使得具有: If the length of the residual signal is extended to T=12, then the 4 tracks each have 3 track positions. The first track gets one more sample, and now has track positions x0, x4, and x8, which have:

上表表示若x8=0(x8不具脈衝)則針對x0及x4有8不同狀態;若x8=1(x8具正脈衝)則針對x0及x4有不同狀態;若x8=-1(x8具負脈衝)則針對x0及x4有不同狀態;若x8=2(x8具兩個正脈衝)則針對x0及x4有一個狀態;及若x8=-2(x8具兩個負脈衝)則針對x0及x4有一個狀態。The above table shows that if x8=0 (x8 does not have a pulse), there are 8 different states for x0 and x4; if x8=1 (x8 has a positive pulse), there are different states for x0 and x4; if x8=-1 (x8 is negative) Pulse) has different states for x0 and x4; if x8=2 (x8 has two positive pulses) then there is one state for x0 and x4; and if x8=-2 (x8 has two negative pulses) then for x0 and X4 has a status.

此處,從先前二表獲得第一列之狀態數目。藉將狀態數目加至第一列,發現此一組態共有18狀態。Here, the number of states of the first column is obtained from the previous two tables. By adding the number of states to the first column, it is found that this configuration has a total of 18 states.

於T=12實例中,5位元足夠編碼全部18不同的可能狀態。然後編碼器例如從範圍[0,...,17]選定狀態數目來載明18組態中之一者。若解碼器知曉編碼方案,例如若解碼器知曉哪個狀態數目表示哪個組態,則可針對一音軌解碼脈衝位置及脈衝符號。In the T=12 example, 5 bits are sufficient to encode all 18 different possible states. The encoder then sets one of the 18 configurations, for example, from the range [0,...,17] selected state number. If the decoder is aware of the encoding scheme, for example, if the decoder knows which state number represents which configuration, the pulse position and pulse symbol can be decoded for a track.

後文中,將提供依據實施例之適當編碼方法及相對應解碼方法。依據實施例提供編碼裝置,其係經組配來執行後文呈示之編碼方法中之一者。此外,依據額外實施例提供解碼裝置,其係經組配來執行後文呈示之解碼方法中之一者。Hereinafter, an appropriate encoding method and a corresponding decoding method according to the embodiments will be provided. An encoding apparatus is provided in accordance with an embodiment that is configured to perform one of the encoding methods presented later. Moreover, in accordance with additional embodiments, a decoding apparatus is provided that is configured to perform one of the decoding methods of the subsequent rendering.

於實施例中,為了產生狀態數目或解碼狀態數目,可計算具有p脈衝之N個音軌位置的可能組態數目。In an embodiment, to generate a number of states or a number of decoding states, a number of possible configurations of N track positions with p-pulses can be calculated.

脈衝可加符號,及可採用遞歸公式,計算針對具有N個音軌位置及p個有符號脈衝(脈衝可以是正或負,但在相同音軌位置的脈衝具有相同符號)的一音軌之狀態數目f (p ,N ),其中遞歸公式f (p ,N )定義為: The pulse can be signed, and a recursive formula can be used to calculate the state of a track for N track positions and p signed pulses (pulses can be positive or negative, but pulses at the same track position have the same sign) The number f ( p , N ), where the recursive formula f ( p , N ) is defined as:

初始條件為 The initial condition is

因具有一或多個脈衝之單一位置要求一個位元(2狀態)來用於符號。遞歸公式係用於全部不同叢之摘要。A single bit (2 state) is required for a symbol because of a single position with one or more pulses. The recursive formula is used for summaries of all the different bundles.

亦即,給定p脈衝,目前位置可具有qN =0至p脈衝,故其餘N-1位置有p-qN 脈衝。於目前位置及其餘N-1位置的狀態數目相乘來獲得具此等脈衝組合之狀態數目,組合經加總來獲得狀態總數。That is, given a p pulse, the current position can have a q N =0 to p pulse, so the remaining N-1 positions have a pq N pulse. The number of states at the current position and the remaining N-1 positions are multiplied to obtain the number of states with such pulse combinations, and the combinations are summed to obtain the total number of states.

於實施例中,遞歸函式可藉迭代重複演算法計算,其中遞歸係以迭代重複置換。In an embodiment, the recursive function can be computed by an iterative iterative algorithm, where the recursion is repeated by iterative repetition.

f (p ,N )之評估就即時應用而言於數值上相當複雜,依據若干實施例,可採用詢查表來計算f (p ,N )。依據若干實施例,該表可已經離線計算。Since the evaluation of f ( p , N ) is quite complex in terms of immediate application, according to several embodiments, an inquiry table can be used to calculate f ( p , N ). According to several embodiments, the table may have been calculated offline.

後文中,提出額外構思用於狀態數目之編碼與解碼: 設f (p ,N )表示具有p個有符號脈衝之N個音軌位置的可能組態數目。In the following, additional ideas are proposed for the encoding and decoding of the number of states: Let f ( p , N ) denote the number of possible configurations of N track positions with p signed pulses.

脈衝資訊編碼器現在可分析音軌:若在該音軌的第一位置不具脈衝,則其餘N-1位置具有p個有符號脈衝,為了描述此叢,只需要f (p ,N -1)狀態。The pulse information encoder can now analyze the audio track: if there is no pulse at the first position of the track, the remaining N-1 positions have p signed pulses. To describe this bundle, only f ( p , N -1) is needed. status.

否則,若第一位置具有一或多個脈衝,則脈衝資訊編碼器可定義總狀態係大於f (p ,N -1)。Otherwise, if the first location has one or more pulses, the pulse information encoder can define that the total state is greater than f ( p , N -1).

然後,在脈衝資訊解碼器,脈衝資訊解碼器例如可始於最末位置,及比較該狀態與一臨界值例如f (p ,N -1)。若該狀態為較大,則脈衝資訊解碼器可決定最末位置具有至少一個脈衝。然後脈衝資訊解碼器可從該狀態扣除f (p ,N -1)來獲得已更新狀態數目及將剩餘脈衝數目減1來更新狀態。Then, in the pulse information decoder, the pulse information decoder can, for example, start at the last position and compare the state with a threshold value such as f ( p , N -1). If the state is large, the pulse information decoder can determine that the last position has at least one pulse. The pulse information decoder can then deduct f ( p , N -1) from this state to obtain the number of updated states and decrement the number of remaining pulses by one to update the state.

否則,若最末位置沒有脈衝,則脈衝資訊解碼器可將剩餘位置數目減1。重複此一程序直到沒有脈衝剩下提供無符號的脈衝位置。Otherwise, if there is no pulse at the last position, the pulse information decoder can decrement the number of remaining positions by one. This procedure is repeated until no pulse is left to provide an unsigned pulse position.

為了也將脈衝符號列入考慮,脈衝資訊編碼器可以最低狀態位元編碼脈衝。於另一實施例中,脈衝資訊編碼器可以最高剩餘狀態位元編碼該符號。但以最低位元編碼脈衝符號為佳,原因在於就整數計算而言更容易處理。In order to also take into account the pulse symbol, the pulse information encoder can encode the pulse with the lowest state bit. In another embodiment, the pulse information encoder can encode the symbol with the highest remaining status bit. However, the lowest bit encoded pulse symbol is preferred because it is easier to handle in terms of integer calculations.

在脈衝資訊解碼器中,若找到給定位置之第一脈衝,則脈衝符號係由最末位元決定。然後,剩餘狀態向右移位一步驟來獲得已更新狀態數目。In the pulse information decoder, if the first pulse of a given position is found, the pulse symbol is determined by the last bit. Then, the remaining state is shifted to the right by one step to obtain the updated number of states.

於一實施例中,脈衝資訊解碼器係經組配來應用下列解碼演算法。於此解碼演算法中,於逐一步驟辦法中,針 對各個音軌位置例如前後相續,狀態數目或已更新狀態數目係與臨界值例如與f(p,k-1)比較。In one embodiment, the pulse information decoder is assembled to apply the following decoding algorithms. In this decoding algorithm, in the step by step method, the needle For each track position, for example, before and after, the number of states or the number of updated states is compared with a threshold value, for example, with f(p, k-1).

依據一實施例,提供脈衝資訊解碼器演算法: According to an embodiment, a pulse information decoder algorithm is provided:

依據一實施例,有關脈衝資訊,脈衝資訊編碼器係經組配來施加下列編碼演算法。脈衝資訊編碼器進行與脈衝資訊解碼器之相同步驟但順序相反。According to an embodiment, with respect to pulse information, the pulse information encoder is assembled to apply the following coding algorithm. The pulse information encoder performs the same steps as the pulse information decoder but in the reverse order.

依據一實施例,提供脈衝資訊編碼器演算法: According to an embodiment, a pulse information encoder algorithm is provided:

藉使用此一演算法編碼狀態數目,針對音軌中之一者的各個音軌位置在一音軌位置的各個脈衝,脈衝資訊編碼器將一整數值加至中間數目(例如中間狀態數目),例如演算法完成前的狀態數目,來獲得狀態數目(之數值)。By using this algorithm to encode the number of states, the pulse information encoder adds an integer value to the middle number (eg, the number of intermediate states) for each pulse of each track position of one of the tracks at a track position. For example, the number of states before the algorithm is completed to obtain the number of states (the value).

脈衝資訊之編碼及解碼辦法,例如脈衝位置及脈衝符號可稱作為「逐一步驟編碼」及「逐一步驟解碼」,原因在於音軌位置由編碼及解碼方法被視為前後相續,亦即逐一步驟。The encoding and decoding methods of pulse information, such as pulse position and pulse symbol, can be referred to as "step-by-step coding" and "step-by-step decoding", because the track position is regarded as the same by the encoding and decoding methods, that is, step by step. .

第6圖為流程圖例示說明一實施例,闡釋依據一實施例由脈衝資訊解碼器所進行的處理步驟。Figure 6 is a flow chart illustrating an embodiment illustrating the processing steps performed by the pulse information decoder in accordance with an embodiment.

於步驟610,目前音軌位置k係設定為N。此處,N表示一音軌之音軌位置數目,其中音軌位置係編碼從1至N。At step 610, the current track position k is set to N. Here, N represents the number of track positions of a track in which the track position is encoded from 1 to N.

於步驟620,測試k是否大於或等於1,亦即是否有任何剩餘音軌位置尚未考慮。若k不大於或等於1,則全部音軌位置皆已考慮及結束處理。In step 620, it is tested whether k is greater than or equal to 1, that is, whether any remaining track positions have not been considered. If k is not greater than or equal to 1, all track positions have been considered and the processing is ended.

否則於步驟630,測試狀態是否大於或等於f(p,k-1)。若為是,則位置k至少存在有一個脈衝。若為否,則於音軌位置k不存在有(額外)脈衝,及處理繼續步驟640,於該處k減1,使得將考慮下個音軌位置。Otherwise, in step 630, the test state is greater than or equal to f(p, k-1). If so, there is at least one pulse at position k. If not, there is no (extra) pulse at track position k, and processing continues with step 640, where k is decremented by 1, so that the next track position will be considered.

但若狀態係大於或等於f(p,k-1),處理繼續步驟642,一脈衝置於音軌位置k,及然後於步驟644,狀態係藉將該狀態減f(p,k-1)更新。然後於步驟650,測試目前脈衝是否為在音軌位置k第一個發現的脈衝。若為否,則於步驟680,剩餘脈衝數目減1,及處理繼續步驟630。However, if the state is greater than or equal to f(p, k-1), processing continues with step 642, a pulse is placed at track position k, and then at step 644, the state subtracts the state by f(p, k-1) ) Update. Then at step 650, it is tested whether the current pulse is the first pulse found at track position k. If not, then at step 680, the number of remaining pulses is decremented by one, and processing continues with step 630.

但若目前脈衝是在音軌位置k第一個發現的脈衝,處理繼續步驟660,於該處測試s的最低位元是否經設定。若為是,則在此音軌位置的脈衝符號係設定為負(步驟662),否則在此音軌位置的脈衝符號係設定為正(步驟664)。兩種情 況下,於步驟670然後狀態向右移位一個步驟(s:=s/2)。然後,剩餘脈衝數目也減1(步驟680),及處理繼續步驟630。However, if the current pulse is the first pulse found at the track position k, processing continues to step 660 where the lowest bit of the test s is set. If so, the pulse symbol at the track position is set to negative (step 662), otherwise the pulse symbol at the track position is set to positive (step 664). Two kinds of love In the case, in step 670 the state is then shifted one step to the right (s:=s/2). Then, the number of remaining pulses is also decremented by one (step 680), and processing continues with step 630.

第7圖為流程圖例示說明一實施例,該流程圖闡釋依據一實施例由脈衝資訊編碼器所進行的處理步驟。Figure 7 is a flow chart illustrating an embodiment illustrating the processing steps performed by a pulse information encoder in accordance with an embodiment.

於步驟710,所找到的脈衝p之數目係設定為0,狀態s係設定為0,及所考慮之音軌位置k係設定為1。In step 710, the number of pulses p found is set to zero, the state s is set to zero, and the track position k considered is set to one.

於步驟720,測試k是否小於或等於N,亦即是否仍有音軌位置尚未考慮(此處,N表示:一音軌之音軌位置數目)。若k係不小於或等於N,則全部音軌位置皆已經考慮及結束處理。In step 720, it is tested whether k is less than or equal to N, that is, whether the track position is still not considered (here, N represents: the number of track positions of a track). If k is not less than or equal to N, then all track positions have been considered and the processing is ended.

否則於步驟730,測試至少一個脈衝是否存在於位置k。若否,處理繼續步驟740,於該處k增加1,使得將考慮下個音軌位置。Otherwise, in step 730, it is tested whether at least one pulse is present at position k. If not, processing continues to step 740 where k is incremented by 1, such that the next track position will be considered.

但若至少一個脈衝係存在於音軌位置k,則於步驟750測試目前考慮的脈衝是否為音軌位置k的最末脈衝。若否,則於步驟770,狀態s係藉f(p,k-1)加至狀態s更新,所找到的脈衝p數目加1,及處理繼續步驟780。However, if at least one pulse is present at track position k, then at step 750 it is tested whether the currently considered pulse is the last pulse of track position k. If not, then in step 770, state s is incremented by f(p, k-1) to state s, the number of pulses p found is incremented by one, and processing continues with step 780.

若目前考慮的脈衝是音軌位置k的最末脈衝,則於步驟750之後,處理繼續步驟755,及狀態向左移位一個步驟(s:=s 2)。然後於步驟760,測試脈衝符號是否為負。若是,則s的最低位元係設定為1(步驟762);否則s的最低位元係設定為0(或不變)(步驟764)。然後於兩種情況下,進行步驟770,於該處狀態s係藉f(p,k-1)加至狀態s更新,所找到的脈衝p數目加1,及處理繼續步驟780。If the currently considered pulse is the last pulse of track position k, then after step 750, processing continues with step 755 and the state is shifted one step to the left (s:=s * 2). Then at step 760, the test pulse symbol is negative. If so, the lowest bit of s is set to 1 (step 762); otherwise the lowest bit of s is set to 0 (or unchanged) (step 764). Then, in either case, step 770 is performed where the state s is incremented by f(p, k-1) to state s, the number of pulses p found is incremented by one, and processing continues with step 780.

於步驟780,測試在位置k是否有另一脈衝。若是,則處理繼續步驟750;否則,處理繼續步驟740。At step 780, a test is made to see if there is another pulse at position k. If so, then processing continues with step 750; otherwise, processing continues with step 740.

後文中,提供編碼多個音軌之狀態的聯合狀態數目之構思。In the following, the concept of encoding the number of joint states of the states of a plurality of tracks is provided.

不幸,於多種情況下,單音軌的可能狀態範圍並非2的倍數,因而各狀態的二進制表示型態無效。舉例言之,若可能的狀態數目為5,則需3位元來以二進制數目表示。但若有4音軌各有5狀態,則整個子訊框有5x5x5x5=625狀態,可以10位元(而非4x3=12位元)表示。如此相當於每個音軌2.5位元而非3位元,如此每音軌節省0.5位元,或相當於每個子訊框節省2位元(占總位元耗用量的20%)。因此重要地係組合各音軌狀態成為一個聯合狀態,原因在於藉此可減少二進制表示型態的無效率。注意相同辦法可用於傳輸的任何數目。舉例言之,因各個子訊框可有一狀態表示脈衝位置,及各訊框可有例如4子訊框,此等狀態可組合成為一個聯合狀態。Unfortunately, in many cases, the possible state range of a single track is not a multiple of 2, and the binary representation of each state is invalid. For example, if the number of possible states is 5, then 3 bits are needed to represent in binary numbers. However, if there are 5 states for each of the 4 tracks, the entire sub-frame has a 5x5x5x5=625 state, which can be represented by 10 bits instead of 4x3=12 bits. This is equivalent to 2.5 bits per track instead of 3 bits, thus saving 0.5 bits per track, or equivalent to saving 2 bits per sub-frame (20% of the total bit consumption). Therefore, it is important to combine the states of the respective tracks into a joint state, because the inefficiency of the binary representation can be reduced by this. Note that the same method can be used for any number of transfers. For example, since each sub-frame can have a state indicating a pulse position, and each frame can have, for example, four sub-frames, the states can be combined into a joint state.

給定子訊框例如有4音軌,則藉聯合編碼各音軌狀態,可減少位元耗用而改良效率。舉例言之,給定各音軌有pk 脈衝,及各音軌具有長度N,例如有N個音軌位置,則各音軌狀態係在0至f (p k ,N )-1之範圍。然後各音軌之狀態sk 可經組合成為子訊框之聯合狀態s,具公式(假設每個子訊框有4音軌)公式2:s =[[s 0 f (p 0 ,N )+s 1 ]f (p 1 ,N )+s 2 ]f (p 2 ,N )+s 3 For example, if there are 4 tracks in the given sub-frame, the state of each track can be jointly coded, which can reduce the bit consumption and improve the efficiency. For example, given that each track has a p k pulse, and each track has a length N, for example, there are N track positions, each track state is in the range of 0 to f ( p k , N )-1 . Then the state s k of each track can be combined into the joint state s of the sub-frames, with the formula (assuming each sub-frame has 4 tracks). Equation 2: s =[[ s 0 f ( p 0 , N )+ s 1 ] f ( p 1 , N )+ s 2 ] f ( p 2 , N )+ s 3

然後各音軌之狀態可於解碼器決定,藉將聯合狀態除以f (p k ,N ),藉此餘數為最末音軌狀態,及整數部分為其餘音軌之聯合狀態。若音軌數目非為4,則方便於上式中適當地加或減項數。The state of each track can then be determined by the decoder by dividing the joint state by f ( p k , N ), whereby the remainder is the last track state, and the integer portion is the joint state of the remaining tracks. If the number of tracks is not 4, it is convenient to add or subtract items as appropriate in the above formula.

注意,當每個音軌之脈衝數目大時,則可能的狀態數目變大。舉例言之,有4音軌,每個音軌有6脈衝,及音軌長度N=16,則狀態為83-位元數目,超過常規中央處理單元(CPU)上的最大二進制數目長度。接著須採取若干額外步驟來使用標準方法以極長整數評估上式。Note that when the number of pulses per track is large, the number of possible states becomes large. For example, there are 4 tracks, each with 6 pulses, and the track length N=16, the state is 83-bit number, which exceeds the maximum binary number length on a conventional central processing unit (CPU). There are then a few additional steps that must be taken to evaluate the above equation with very long integers using standard methods.

當狀態機率假設為相等時,也觀察此一辦法等於音軌狀態之算術編碼。When the state probability is assumed to be equal, this method is also observed to be equal to the arithmetic coding of the track state.

前文已經呈示逐一步驟辦法用以編碼及解碼一音軌之脈衝資訊,例如一音軌之脈衝位置及可能脈衝符號。其它實施例提供另一辦法,稱作為「分裂與征服」辦法。The foregoing has been presented in a step-by-step manner for encoding and decoding pulse information for a track, such as the pulse position of a track and possible pulse symbols. Other embodiments provide another approach, referred to as the "split and conquer" approach.

脈衝資訊編碼器係經組配來施加分裂與征服辦法,將一音軌劃分為兩個音軌區劃x1 及x2 ,可考慮為兩個向量,其中x=[x1 x2 ]。基本構想係分開編碼二向量x1 及x2 ,及然後以下式組合二者 Pulse information encoders are assembled to apply splitting and conquering methods, dividing a track into two track zones x 1 and x 2 , which can be considered as two vectors, where x = [x 1 x 2 ]. The basic idea is to separately encode the two vectors x 1 and x 2 , and then combine the two

上式中,須注意當脈衝數目為已知時,換言之,當向量分別地具有p1 及p2 =p-p1 脈衝時,s(x1 )及s(x2 )為向量x1 及x2 之狀態。為了將向量x1 中具有0至p1 -1脈衝的全部狀態列入 考慮,須將加總項加至上式。In the above formula, it should be noted that when the number of pulses is known, in other words, when the vectors have p 1 and p 2 = pp 1 pulses, respectively, s(x 1 ) and s(x 2 ) are vectors x 1 and x 2 . State. In order to take into account the full state of the vector x 1 with 0 to p 1 -1 pulses, the summation term must be added to the above equation.

如上演算法/公式可施加來藉施加以下兩個前處理步驟來編碼交插音軌脈衝。首先,設向量xtrack k 包含音軌k上的全部樣本,藉定義x=[xtrack 1, xtrack 2, xtrack 3, xtrack 4 ]而合併此等向量。觀察如此只是樣本的重新排序,使得來自音軌1的全部樣本係置於第一組等等。The above algorithm/formula can be applied to encode the interlaced track pulses by applying the following two pre-processing steps. First, let the vector x track k contain all the samples on the track k, and merge these vectors by defining x=[x track 1, x track 2, x track 3, x track 4 ]. Observing this is just a reordering of the samples so that all samples from Track 1 are placed in the first group and so on.

其次,注意每個音軌之脈衝數目通常為固定數目。接著若音軌1經常性有p1 脈衝,則針對全部值k≠p1 ,音軌1上的狀態數目為f (k ,N 1 )=0。此乃陳述音軌1沒有任何狀態不具p1 脈衝的另一個方式。正式地然後將狀態數目公式定義為:公式4:針對具有pk 脈衝的完整音軌xtrack k ,狀態數目為(N=Ntrack k ) Second, note that the number of pulses per track is usually a fixed number. Then, if track 1 has a p 1 pulse, the number of states on track 1 is f ( k , N 1 )=0 for all values k ≠ p 1 . This is a statement of the track does not have any state without another embodiment p 1 pulse. Formally then define the state number formula as: Equation 4: For a complete track x track k with p k pulses, the number of states is (N=N track k )

否則,針對N>1 Otherwise, for N>1

以及針對N=1: And for N=1:

藉樣本之重新排序及使用如上針對狀態數目之定義(公式4),可藉公式3計算全部音軌之聯合狀態。注意因狀態數目大半含有零,當合併音軌狀態時,公式3之加總為零。 因此合併二音軌係與公式2相同。同理,方便顯示採用兩個辦法,合併全部4音軌(或5)也獲得相同結果。By reordering the samples and using the above definition of the number of states (Equation 4), Equation 3 can be used to calculate the joint state of all the tracks. Note that because the number of states contains more than half, when the track state is merged, the sum of Equation 3 is always zero. Therefore, the merged two-track system is the same as Equation 2. For the same reason, it is convenient to display two methods, and all 4 tracks (or 5) are combined to obtain the same result.

依據一實施例,重新排序可用作為編碼器之前處理步驟。於另一實施例中,重新排序可整合入編碼器。同理依據一實施例,重新排序可用作為解碼器之後處理步驟。於另一實施例中,重新排序可整合入解碼器。According to an embodiment, reordering can be used as a pre-encoder step. In another embodiment, reordering can be integrated into the encoder. By the same token, according to an embodiment, reordering can be used as a post-decoder step. In another embodiment, reordering can be integrated into the decoder.

若一音軌上的脈衝數目為不固定,則方便適度修改狀態數目公式,而仍然使用相同編碼演算法。If the number of pulses on a track is not fixed, it is convenient to modify the state number formula moderately while still using the same coding algorithm.

若適當選擇合併音軌順序,則觀察於章節「組合音軌資料」呈示的辦法及前述方法獲得相等結果。同理,逐一步驟辦法及分裂與征服辦法獲得相等結果。因此,依據哪個辦法體現上更實用或哪個辦法最佳匹配平台的運算限制,獨立地選擇哪個辦法用在解碼器及編碼器。If the merged track order is properly selected, observe the method of presenting the chapter "Combined Track Data" and the above method to obtain equal results. By the same token, the one-step approach and the split and conquer approach yield equal results. Therefore, depending on which method is more practical or which method best matches the computational constraints of the platform, independently choose which method to use in the decoder and encoder.

依據一實施例,提供脈衝資訊編碼器演算法,可以假代碼描述 According to an embodiment, a pulse information encoder algorithm is provided, which can be described by a pseudo code

依據一實施例,採用此種編碼演算法,脈衝資訊編碼器係經組配來將音軌中之一者劃分成一第一音軌區劃及一第二音軌區劃。脈衝資訊編碼器係經組配來編碼與第一區劃相聯結的第一子狀態數目。此外,脈衝資訊編碼器係經組配來編碼與第二區劃相聯結的第二子狀態數目。又,脈衝資訊編碼器係經組配來組合第一子狀態數目及第二子狀態數目而獲得該狀態數目。According to an embodiment, with such a coding algorithm, the pulse information encoder is configured to divide one of the tracks into a first track zone and a second track zone. The pulse information encoder is configured to encode the first number of substates associated with the first zone. In addition, the pulse information encoder is configured to encode the number of second sub-states associated with the second zone. Moreover, the pulse information encoder is configured to combine the first sub-state number and the second sub-state number to obtain the number of states.

同理,依據一實施例,提供脈衝資訊解碼器演算法,可以假代碼描述 Similarly, according to an embodiment, a pulse information decoder algorithm is provided, which can be described by a pseudo code.

於實現該分裂與征服辦法之一實施例中,脈衝資訊解碼器係經組配來基於該狀態數目而產生第一子狀態數目及第二子狀態數目。脈衝資訊解碼器係經組配來基於該第一子狀態數目而解碼該等音軌中之一者之第一區劃的脈衝位置之第一群組。此外,脈衝資訊解碼器係經組配來基於該第二子狀態數目而解碼該等音軌中之一者之第二區劃的脈衝位置之第二群組。In one embodiment of implementing the splitting and conquering method, the pulse information decoder is configured to generate a first number of sub-states and a second sub-state number based on the number of states. The pulse information decoder is configured to decode a first group of pulse positions of a first zone of one of the audio tracks based on the first number of sub-states. Additionally, the pulse information decoder is configured to decode a second group of pulse positions of the second region of one of the audio tracks based on the second number of sub-states.

雖然已經以裝置脈絡描述若干構面,但顯然此等構面也表示相對應方法的描述,於該處一方塊或一裝置係相對應於一方法步驟或一方法步驟之特徵。同理,以方法步驟之脈絡描述的構面也表示相對應裝置之相對應方塊或項或特徵結構之描述。Although a number of facets have been described in the context of the device, it is apparent that such facets also represent a description of the corresponding method, where a block or device corresponds to a method step or a method step. Similarly, a facet described by the context of a method step also represents a description of the corresponding block or item or feature structure of the corresponding device.

取決於某些體現要求,本發明之實施例可於硬體或於軟體體現。體現可使用數位儲存媒體執行,例如軟碟、DVD、CD、ROM、PROM、EPROM、EEPROM或快閃記憶 體,具有可電子讀取控制信號儲存於其上,該等信號與(或可與)可程式規劃電腦系統協作,因而執行個別方法。Embodiments of the invention may be embodied in hardware or in software, depending on certain embodiments. The implementation can be performed using digital storage media, such as floppy disk, DVD, CD, ROM, PROM, EPROM, EEPROM or flash memory The body has an electronically readable control signal stored thereon that cooperates with (or can be) a programmable computer system to perform an individual method.

依據本發明之若干實施例包含具有可電子式讀取控制信號的資料載體,該等控制信號可與可程式規劃電腦系統協作,因而執行此處所述方法中之一者。Several embodiments in accordance with the present invention comprise a data carrier having an electronically readable control signal that can cooperate with a programmable computer system to perform one of the methods described herein.

大致言之,本發明之實施例可體現為具有程式代碼的電腦程式產品,該程式代碼係當電腦程式產品在電腦上跑時可執行該等方法中之一者。該程式代碼例如可儲存在機器可讀取載體上。Broadly speaking, embodiments of the present invention can be embodied as a computer program product having a program code that can perform one of the methods when the computer program product runs on a computer. The program code can be stored, for example, on a machine readable carrier.

其它實施例包含儲存在機器可讀取載體或非過渡儲存媒體上的用以執行此處所述方法中之一者的電腦程式。Other embodiments include a computer program stored on a machine readable carrier or non-transitional storage medium for performing one of the methods described herein.

換言之,因此,本發明方法之實施例為一種具有一程式代碼之電腦程式,該程式代碼係當該電腦程式於一電腦上跑時用以執行此處所述方法中之一者。In other words, therefore, an embodiment of the method of the present invention is a computer program having a program code for performing one of the methods described herein when the computer program runs on a computer.

因此,本發明方法之又一實施例為資料載體(或數位儲存媒體或電腦可讀取媒體)包含用以執行此處所述方法中之一者的電腦程式記錄於其上。Thus, yet another embodiment of the method of the present invention is a data carrier (or digital storage medium or computer readable medium) having a computer program for performing one of the methods described herein recorded thereon.

因此,本發明方法之又一實施例為表示用以執行此處所述方法中之一者的電腦程式的資料串流或信號序列。資料串流或信號序列例如可經組配來透過資料通訊連結,例如透過網際網路轉移。Thus, yet another embodiment of the method of the present invention is a data stream or signal sequence representing a computer program for performing one of the methods described herein. The data stream or signal sequence can, for example, be configured to be linked via a data communication, such as over the Internet.

又一實施例包含處理構件例如電腦或可程式規劃邏輯裝置,其係經組配來或適用於執行此處所述方法中之一者。Yet another embodiment includes a processing component, such as a computer or programmable logic device, that is assembled or adapted to perform one of the methods described herein.

又一實施例包含一電腦,其上安裝有用以執行此處所 述方法中之一者的電腦程式。Yet another embodiment includes a computer on which is installed to perform the execution herein A computer program of one of the methods described.

於若干實施例中,可程式規劃邏輯裝置(例如可現場程式規劃閘陣列)可用來執行此處描述之方法的部分或全部功能。於若干實施例中,可現場程式規劃閘陣列可與微處理器協作來執行此處所述方法中之一者。大致上該等方法較佳係藉任何硬體裝置執行。In some embodiments, programmable logic devices, such as field programmable gate arrays, can be used to perform some or all of the functions of the methods described herein. In some embodiments, the field programmable gate array can cooperate with a microprocessor to perform one of the methods described herein. Generally, such methods are preferably performed by any hardware device.

前述實施例係僅供舉例說明本發明之原理。須瞭解此處所述配置及細節之修改及變化將為熟諳技藝人士顯然易知。因此,意圖僅受審查中之專利申請範圍所限而非受藉以描述及解說此處實施例所呈示之特定細節所限。The foregoing embodiments are merely illustrative of the principles of the invention. It will be apparent to those skilled in the art that modifications and variations of the configuration and details described herein will be readily apparent. Therefore, the intention is to be limited only by the scope of the patent application under review and not by the specific details of the embodiments presented herein.

110‧‧‧脈衝資訊解碼器110‧‧‧pulse information decoder

120‧‧‧信號解碼器120‧‧‧Signal decoder

210‧‧‧信號處理器210‧‧‧Signal Processor

220‧‧‧脈衝資訊編碼器220‧‧‧pulse information encoder

610-680、710-780‧‧‧處理步驟610-680, 710-780‧‧‧ Processing steps

第1圖顯示依據一實施例用以解碼一編碼音訊信號之裝置,第2圖顯示依據一實施例用以編碼一音訊信號之裝置,第3圖顯示針對具有兩個無符號脈衝及三個音軌位置之一音軌的全部可能組態,第4圖顯示針對具有一個有符號脈衝及兩個音軌位置之一音軌的全部可能組態,第5圖顯示針對具有兩個有符號脈衝及兩個音軌位置之一音軌的全部可能組態,第6圖為例示說明一實施例之流程圖,闡釋依據一實施例藉脈衝資訊解碼器所進行的處理步驟,及第7圖為例示說明一實施例之流程圖,該流程圖闡釋依據一實施例藉脈衝資訊編碼器所進行的處理步驟。1 shows an apparatus for decoding an encoded audio signal according to an embodiment, and FIG. 2 shows an apparatus for encoding an audio signal according to an embodiment, and FIG. 3 shows for having two unsigned pulses and three tones. All possible configurations of one of the track positions, Figure 4 shows all possible configurations for a track with one signed pulse and two track positions, Figure 5 shows for two signed pulses and All possible configurations of one of the two track positions, FIG. 6 is a flow chart illustrating an embodiment, illustrating processing steps performed by the pulse information decoder according to an embodiment, and FIG. 7 is an illustration A flowchart of an embodiment is illustrated that illustrates the processing steps performed by a pulse information encoder in accordance with an embodiment.

110‧‧‧脈衝資訊解碼器110‧‧‧pulse information decoder

120‧‧‧信號解碼器120‧‧‧Signal decoder

Claims (15)

一種用以解碼一編碼音訊信號之裝置,其中一或多個音軌係與該編碼音訊信號相關聯,該等音軌中之各者具有多個音軌位置及多個脈衝,其中該裝置係包含:一脈衝資訊解碼器,係適用以解碼多個脈衝位置,其中該等脈衝位置中之各者指示該等音軌中之一者的該等音軌位置中之一者來指示該音軌之該等脈衝中之一者的位置,及其中該脈衝資訊解碼器係經組配來藉由使用指示該等音軌中之至少一者的音軌位置總數之一音軌位置數目、指示該等音軌中之至少一者的脈衝總數之一總脈衝數目、及一個狀態數目而解碼該等多個脈衝位置;及一信號解碼器,其係用於藉由使用該等多個脈衝位置及與該編碼音訊信號相關聯的多個預測濾波器係數而產生合成音訊信號來解碼該編碼音訊信號。 An apparatus for decoding an encoded audio signal, wherein one or more audio tracks are associated with the encoded audio signal, each of the audio tracks having a plurality of track positions and a plurality of pulses, wherein the device is The method includes: a pulse information decoder adapted to decode a plurality of pulse positions, wherein each of the pulse positions indicates one of the track positions of one of the tracks to indicate the track a position of one of the pulses, and wherein the pulse information decoder is configured to indicate the number of track positions by using one of a total number of track positions indicating at least one of the tracks One of a total number of pulses of at least one of the equal tracks, and a number of states to decode the plurality of pulse positions; and a signal decoder for using the plurality of pulse positions and Generating a synthesized audio signal to decode the encoded audio signal by a plurality of prediction filter coefficients associated with the encoded audio signal. 如申請專利範圍第1項之裝置,其中該脈衝資訊解碼器進一步適用於使用該音軌位置數目、該總脈衝數目及該狀態數目而解碼多個脈衝符號,其中該等脈衝符號中之各者指示該等多個脈衝中之一者之一符號,及其中該信號解碼器係適用以更進一步使用該等多個脈衝符號而產生合成音訊信號來解碼該編碼音訊信號。 The apparatus of claim 1, wherein the pulse information decoder is further adapted to decode a plurality of pulse symbols using the number of track positions, the total number of pulses, and the number of states, wherein each of the pulse symbols ??? indicating one of the plurality of pulses, and wherein the signal decoder is adapted to further generate the synthesized audio signal to decode the encoded audio signal by using the plurality of pulse symbols. 如申請專利範圍第1或2項之裝置,其中該等一或多個音 軌係包含至少一個最末音軌及一或多個其它音軌,及其中該脈衝資訊解碼器係適用以從該狀態數目產生一第一子狀態數目及一第二子狀態數目,其中該脈衝資訊解碼器係經組配來基於該第一子狀態數目而解碼該等脈衝位置之一第一群組,及其中該脈衝資訊解碼器係經組配來基於該第二子狀態數目而解碼該等脈衝位置之一第二群組,其中該等脈衝位置之該第二群組只包含指示該最末音軌之音軌位置的脈衝位置,及其中該等脈衝位置之該第一群組只包含指示該等一或多個其它音軌之音軌位置的脈衝位置。 Such as the device of claim 1 or 2, wherein the one or more sounds The track system includes at least one last track and one or more other tracks, and wherein the pulse information decoder is adapted to generate a first sub-state number and a second sub-state number from the number of states, wherein the pulse The information decoder is configured to decode a first group of the pulse positions based on the first number of sub-states, and wherein the pulse information decoder is configured to decode the second sub-state number based on the number a second group of one of the pulse positions, wherein the second group of the pulse positions only includes a pulse position indicating a position of the track of the last track, and wherein the first group of the pulse positions is only A pulse position indicating the position of the track of the one or more other tracks is included. 如申請專利範圍第3項之裝置,其中該脈衝資訊解碼器係經組配來藉由將該狀態數目除以f(pk ,N)而獲得一整數部分及一餘數作為除法結果來產生該第一子狀態數目及該第二子狀態數目,其中該整數部分為該第一子狀態數目,及其中該餘數為該第二子狀態數目,其中pk 指示針對該等一或多個音軌中之各者的脈衝數目,及其中N指示針對該等一或多個音軌中之各者的音軌位置數目。The apparatus of claim 3, wherein the pulse information decoder is configured to generate an integer part and a remainder as a division result by dividing the number of states by f(p k , N) a first number of sub-states and a number of the second sub-states, wherein the integer portion is the first sub-state number, and wherein the remainder is the second sub-state number, wherein p k indicates for the one or more audio tracks The number of pulses for each of them, and where N indicates the number of track positions for each of the one or more tracks. 如申請專利範圍第1項之裝置,其中該脈衝資訊解碼器係適用以進行將該狀態數目或一已更新狀態數目與一臨界值相比較之一測試。 The apparatus of claim 1, wherein the pulse information decoder is adapted to perform a test comparing the number of states or an updated state with a threshold. 如申請專利範圍第5項之裝置,其中該脈衝資訊解碼器係適用以藉由比較該狀態數目或已更新狀態數目是否 大於、大於或等於、小於、小於或等於該臨界值而進行該測試,及其中該脈衝資訊解碼器係進一步適用以取決於該測試之結果而更新該狀態數目或已更新狀態數目。 The apparatus of claim 5, wherein the pulse information decoder is adapted to compare whether the number of states or the number of updated states is The test is performed by greater than, greater than or equal to, less than, less than, or equal to the threshold, and wherein the pulse information decoder is further adapted to update the number of states or the number of updated states depending on the result of the test. 如申請專利範圍第6項之裝置,其中該脈衝資訊解碼器係經組配來針對該等多個音軌中之一者的各個音軌位置,比較該狀態數目或該已更新狀態數目與該臨界值。 The apparatus of claim 6, wherein the pulse information decoder is configured to compare the number of states or the number of updated states with respect to respective track positions of one of the plurality of tracks Threshold value. 如申請專利範圍第1項之裝置,其中該脈衝資訊解碼器係經組配來劃分該等音軌中之一者成為包含該等多個音軌位置中之至少兩個音軌位置之一第一音軌分區,及成為包含該等多個音軌位置中之至少兩個其他音軌位置之一第二音軌分區,其中該脈衝資訊解碼器係經組配來基於該狀態數目而產生一第一子狀態數目及一第二子狀態數目,其中該脈衝資訊解碼器係經組配來基於該第一子狀態數目,解碼與該第一音軌分區相關聯的脈衝位置之一第一群組,及其中該脈衝資訊解碼器係經組配來基於該第二子狀態數目,解碼與該第二音軌分區相關聯的脈衝位置之一第二群組。 The apparatus of claim 1, wherein the pulse information decoder is configured to divide one of the audio tracks to include one of at least two of the plurality of audio track positions. a track partition, and a second track partition comprising one of at least two other track positions of the plurality of track positions, wherein the pulse information decoder is configured to generate a number based on the number of states a first sub-state number and a second sub-state number, wherein the pulse information decoder is configured to decode one of the first pulse positions associated with the first track segment based on the first sub-state number The group, and the pulse information decoder thereof, are configured to decode a second group of one of the pulse locations associated with the second track partition based on the second number of substates. 一種用以編碼一音訊信號之裝置,該裝置係包含:一信號處理器,用以決定與該音訊信號相關聯的多個預測濾波器係數,用以基於該音訊信號及該等多個預測濾波器係數而產生一殘差信號;及一脈衝資訊編碼器,用以編碼與一或多個音軌相關 的多個脈衝位置來編碼該音訊信號,該等一或多個音軌係與該殘差信號相關聯,該等音軌中之各者具有多個音軌位置及多個脈衝,其中該等脈衝位置中之各者指示該等音軌中之一者的該等音軌位置中之一者來指示該音軌之該等脈衝中之一者的位置,其中該脈衝資訊編碼器係經組配來藉由產生一狀態數目而編碼該等多個脈衝位置,使得該等脈衝位置唯有基於該狀態數目、指示該等音軌中之至少一者的音軌位置總數之一音軌位置數目、及指示該等音軌中之至少一者的脈衝總數之一總脈衝數目才能被解碼。 An apparatus for encoding an audio signal, the apparatus comprising: a signal processor for determining a plurality of prediction filter coefficients associated with the audio signal for filtering based on the audio signal and the plurality of prediction filters Generating a residual signal; and a pulse information encoder for encoding associated with one or more tracks The plurality of pulse positions are encoded to encode the audio signal, the one or more tracks being associated with the residual signal, each of the tracks having a plurality of track positions and a plurality of pulses, wherein the plurality of tracks Each of the pulse positions indicating one of the track positions of one of the tracks to indicate the position of one of the pulses of the track, wherein the pulse information encoder is a group Arranging to encode the plurality of pulse positions by generating a number of states such that the pulse positions are only based on the number of states, indicating the number of track positions of one of the total number of track positions of at least one of the tracks And a total number of pulses indicating the total number of pulses of at least one of the tracks can be decoded. 如申請專利範圍第9項之裝置,其中該脈衝資訊編碼器係適用以編碼多個脈衝符號,其中該等脈衝符號中之各者指示該等多個脈衝中之一者之一符號,其中該脈衝資訊編碼器係經組配來藉由產生該狀態數目而編碼該等多個脈衝符號,使得該等脈衝符號唯有基於該狀態數目、指示該等音軌中之至少一者的音軌位置總數之該音軌位置數目、及該總脈衝數目才能被解碼。 The apparatus of claim 9, wherein the pulse information encoder is adapted to encode a plurality of pulse symbols, wherein each of the pulse symbols indicates one of the plurality of pulses, wherein the The pulse information encoder is configured to encode the plurality of pulse symbols by generating the number of states such that the pulse symbols are only based on the number of states, indicating a track position of at least one of the tracks The total number of track positions and the total number of pulses can be decoded. 如申請專利範圍第9或10項之裝置,其中該脈衝資訊編碼器係經組配來針對該等音軌中之一者的各個音軌位置,將一整數值加至針對於一音軌位置的各個脈衝之一中間數目來獲得該狀態數目。 The apparatus of claim 9 or 10, wherein the pulse information encoder is configured to add an integer value to a track position for each track position of one of the tracks The middle number of each of the individual pulses is used to obtain the number of states. 如申請專利範圍第9或10項之裝置,其中該脈衝資訊編碼器係經組配來劃分該等音軌中之一者成為包含該等多個音軌位置中之至少兩個音 軌位置之一第一音軌分區,及成為包含該等多個音軌位置中之至少兩個其他音軌位置之一第二音軌分區,其中該脈衝資訊編碼器係經組配來編碼與該第一分區相關聯的一第一子狀態數目,其中該脈衝資訊編碼器係經組配來編碼與該第二分區相關聯的一第二子狀態數目,及其中該脈衝資訊編碼器係經組配來組合該第一子狀態數目與該第二子狀態數目而獲得該狀態數目。 The apparatus of claim 9 or 10, wherein the pulse information encoder is configured to divide one of the audio tracks to include at least two of the plurality of audio track positions One of the track positions, the first track segment, and the second track segment comprising one of the at least two other track positions of the plurality of track positions, wherein the pulse information encoder is assembled to encode a first sub-state number associated with the first partition, wherein the pulse information encoder is configured to encode a second sub-state number associated with the second partition, and wherein the pulse information encoder is The number of the first sub-states and the second sub-states are combined to obtain the number of states. 一種用以解碼一編碼音訊信號之方法,其中一或多個音軌係與該編碼音訊信號相關聯,該等音軌中之各者具有多個音軌位置及多個脈衝,其中該方法係包含:解碼多個脈衝位置,其中該等脈衝位置中之各者指示該等音軌中之一者的該等音軌位置中之一者來指示該音軌之該等脈衝中之一者的位置,及其中該等多個脈衝位置係藉由使用指示該等音軌中之至少一者的音軌位置總數之一音軌位置數目、指示該等音軌中之至少一者的脈衝總數之一總脈衝數目、及一個狀態數目而予解碼;及藉由使用該等多個脈衝位置及與該編碼音訊信號相關聯的多個預測濾波器係數而產生一合成音訊信號來解碼該編碼音訊信號。 A method for decoding an encoded audio signal, wherein one or more audio tracks are associated with the encoded audio signal, each of the audio tracks having a plurality of track positions and a plurality of pulses, wherein the method is The method includes: decoding a plurality of pulse positions, wherein each of the pulse positions indicates one of the track positions of one of the tracks to indicate one of the pulses of the track Position, and wherein the plurality of pulse positions are indicative of a total number of tracks indicating at least one of the tracks by using a number of track positions indicating a total number of track positions of at least one of the tracks Decoding a total number of pulses and a number of states; and decoding a coded audio signal by generating a composite audio signal using the plurality of pulse positions and a plurality of prediction filter coefficients associated with the encoded audio signal . 一種用以編碼一音訊信號之方法,該方法係包含:決定與該音訊信號相關聯的多個預測濾波器係數,用以基於該音訊信號及該等多個預測濾波器係數而 產生一殘差信號;及編碼與一或多個音軌相關的多個脈衝位置來編碼該音訊信號,該等一或多個音軌係與該殘差信號相關聯,該等音軌中之各者具有多個音軌位置及多個脈衝,其中該等脈衝位置中之各者指示該等音軌中之一者的該等音軌位置中之一者來指示該音軌之該等脈衝中之一者的位置,其中該等多個脈衝位置係藉由產生一狀態數目而予編碼,使得該等脈衝位置唯有基於該狀態數目、指示該等音軌中之至少一者的音軌位置總數之一音軌位置數目、及指示該等音軌中之至少一者的脈衝總數之一總脈衝數目才能被解碼。 A method for encoding an audio signal, the method comprising: determining a plurality of prediction filter coefficients associated with the audio signal for based on the audio signal and the plurality of prediction filter coefficients Generating a residual signal; and encoding a plurality of pulse positions associated with one or more tracks to encode the audio signal, the one or more tracks being associated with the residual signal, wherein the tracks are Each having a plurality of track positions and a plurality of pulses, wherein each of the pulse positions indicates one of the track positions of one of the tracks to indicate the pulses of the track The position of one of the plurality of pulse positions, wherein the plurality of pulse positions are encoded by generating a number of states such that the pulse positions are only based on the number of states, indicating a track of at least one of the tracks The number of track positions, one of the total number of positions, and the total number of pulses indicating the total number of pulses of at least one of the tracks can be decoded. 一種電腦程式,該電腦程式當在一電腦或信號處理器上執行時體現如申請專利範圍第13或14項之方法。A computer program embodying the method of claim 13 or 14 when executed on a computer or signal processor.
TW101104350A 2011-02-14 2012-02-10 Encoding and decoding of pulse positions of tracks of an audio signal TWI488176B (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201161442632P 2011-02-14 2011-02-14
PCT/EP2012/052461 WO2012110480A1 (en) 2011-02-14 2012-02-14 Audio codec supporting time-domain and frequency-domain coding modes

Publications (2)

Publication Number Publication Date
TW201248617A TW201248617A (en) 2012-12-01
TWI488176B true TWI488176B (en) 2015-06-11

Family

ID=71943598

Family Applications (2)

Application Number Title Priority Date Filing Date
TW101104350A TWI488176B (en) 2011-02-14 2012-02-10 Encoding and decoding of pulse positions of tracks of an audio signal
TW101104676A TWI484480B (en) 2011-02-14 2012-02-14 Audio codec supporting time-domain and frequency-domain coding modes

Family Applications After (1)

Application Number Title Priority Date Filing Date
TW101104676A TWI484480B (en) 2011-02-14 2012-02-14 Audio codec supporting time-domain and frequency-domain coding modes

Country Status (19)

Country Link
US (1) US9037457B2 (en)
EP (1) EP2676269B1 (en)
JP (1) JP5851525B2 (en)
KR (2) KR101648133B1 (en)
CN (1) CN103548078B (en)
AR (1) AR085223A1 (en)
AU (2) AU2012217160B2 (en)
BR (1) BR112013020589B1 (en)
CA (1) CA2827296C (en)
ES (1) ES2562189T3 (en)
HK (1) HK1192793A1 (en)
MX (1) MX2013009302A (en)
MY (2) MY159444A (en)
PL (1) PL2676269T3 (en)
RU (1) RU2547241C1 (en)
SG (1) SG192715A1 (en)
TW (2) TWI488176B (en)
WO (1) WO2012110480A1 (en)
ZA (1) ZA201306872B (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI480857B (en) 2011-02-14 2015-04-11 Fraunhofer Ges Forschung Audio codec using noise synthesis during inactive phases
US9589570B2 (en) 2012-09-18 2017-03-07 Huawei Technologies Co., Ltd. Audio classification based on perceptual quality for low or medium bit rates
EP2830052A1 (en) 2013-07-22 2015-01-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio decoder, audio encoder, method for providing at least four audio channel signals on the basis of an encoded representation, method for providing an encoded representation on the basis of at least four audio channel signals and computer program using a bandwidth extension
EP3069338B1 (en) * 2013-11-13 2018-12-19 Fraunhofer Gesellschaft zur Förderung der Angewand Encoder for encoding an audio signal, audio transmission system and method for determining correction values
EP2980790A1 (en) * 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for comfort noise generation mode selection
US10699723B2 (en) * 2017-04-25 2020-06-30 Dts, Inc. Encoding and decoding of digital audio signals using variable alphabet size
US10699721B2 (en) * 2017-04-25 2020-06-30 Dts, Inc. Encoding and decoding of digital audio signals using difference data
KR102632136B1 (en) * 2017-04-28 2024-01-31 디티에스, 인코포레이티드 Audio Coder window size and time-frequency conversion
JP6962445B2 (en) * 2018-03-02 2021-11-05 日本電信電話株式会社 Encoding device, coding method, program, and recording medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI253057B (en) * 2004-12-27 2006-04-11 Quanta Comp Inc Search system and method thereof for searching code-vector of speech signal in speech encoder
TW201027517A (en) * 2008-09-30 2010-07-16 Dolby Lab Licensing Corp Transcoding of audio metadata
WO2010093224A2 (en) * 2009-02-16 2010-08-19 한국전자통신연구원 Encoding/decoding method for audio signals using adaptive sine wave pulse coding and apparatus thereof

Family Cites Families (124)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ATE477571T1 (en) 1991-06-11 2010-08-15 Qualcomm Inc VOCODER WITH VARIABLE BITRATE
US5408580A (en) * 1992-09-21 1995-04-18 Aware, Inc. Audio compression system employing multi-rate signal analysis
BE1007617A3 (en) 1993-10-11 1995-08-22 Philips Electronics Nv Transmission system using different codeerprincipes.
US5784532A (en) 1994-02-16 1998-07-21 Qualcomm Incorporated Application specific integrated circuit (ASIC) for performing rapid speech compression in a mobile telephone system
KR100419545B1 (en) 1994-10-06 2004-06-04 코닌클리케 필립스 일렉트로닉스 엔.브이. Transmission system using different coding principles
EP0720316B1 (en) 1994-12-30 1999-12-08 Daewoo Electronics Co., Ltd Adaptive digital audio encoding apparatus and a bit allocation method thereof
SE506379C3 (en) * 1995-03-22 1998-01-19 Ericsson Telefon Ab L M Lpc speech encoder with combined excitation
US5754733A (en) 1995-08-01 1998-05-19 Qualcomm Incorporated Method and apparatus for generating and encoding line spectral square roots
US5848391A (en) * 1996-07-11 1998-12-08 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Method subband of coding and decoding audio signals using variable length windows
JP3259759B2 (en) 1996-07-22 2002-02-25 日本電気株式会社 Audio signal transmission method and audio code decoding system
JPH10124092A (en) 1996-10-23 1998-05-15 Sony Corp Method and device for encoding speech and method and device for encoding audible signal
US5960389A (en) 1996-11-15 1999-09-28 Nokia Mobile Phones Limited Methods for generating comfort noise during discontinuous transmission
JPH10214100A (en) 1997-01-31 1998-08-11 Sony Corp Voice synthesizing method
US6134518A (en) 1997-03-04 2000-10-17 International Business Machines Corporation Digital audio signal coding using a CELP coder and a transform coder
JP3223966B2 (en) 1997-07-25 2001-10-29 日本電気株式会社 Audio encoding / decoding device
US6070137A (en) * 1998-01-07 2000-05-30 Ericsson Inc. Integrated frequency-domain voice coding using an adaptive spectral enhancement filter
ES2247741T3 (en) * 1998-01-22 2006-03-01 Deutsche Telekom Ag SIGNAL CONTROLLED SWITCHING METHOD BETWEEN AUDIO CODING SCHEMES.
GB9811019D0 (en) 1998-05-21 1998-07-22 Univ Surrey Speech coders
US6317117B1 (en) 1998-09-23 2001-11-13 Eugene Goff User interface for the control of an audio spectrum filter processor
US7272556B1 (en) 1998-09-23 2007-09-18 Lucent Technologies Inc. Scalable and embedded codec for speech and audio signals
US7124079B1 (en) 1998-11-23 2006-10-17 Telefonaktiebolaget Lm Ericsson (Publ) Speech coding with comfort noise variability feature for increased fidelity
JP4024427B2 (en) 1999-05-24 2007-12-19 株式会社リコー Linear prediction coefficient extraction apparatus, linear prediction coefficient extraction method, and computer-readable recording medium recording a program for causing a computer to execute the method
CN1145928C (en) 1999-06-07 2004-04-14 艾利森公司 Methods and apparatus for generating comfort noise using parametric noise model statistics
JP4464484B2 (en) 1999-06-15 2010-05-19 パナソニック株式会社 Noise signal encoding apparatus and speech signal encoding apparatus
US6236960B1 (en) 1999-08-06 2001-05-22 Motorola, Inc. Factorial packing method and apparatus for information coding
ATE341074T1 (en) 2000-02-29 2006-10-15 Qualcomm Inc MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER
US6757654B1 (en) 2000-05-11 2004-06-29 Telefonaktiebolaget Lm Ericsson Forward error correction in speech coding
JP2002118517A (en) 2000-07-31 2002-04-19 Sony Corp Apparatus and method for orthogonal transformation, apparatus and method for inverse orthogonal transformation, apparatus and method for transformation encoding as well as apparatus and method for decoding
US6847929B2 (en) 2000-10-12 2005-01-25 Texas Instruments Incorporated Algebraic codebook system and method
CA2327041A1 (en) 2000-11-22 2002-05-22 Voiceage Corporation A method for indexing pulse positions and signs in algebraic codebooks for efficient coding of wideband signals
US6701772B2 (en) 2000-12-22 2004-03-09 Honeywell International Inc. Chemical or biological attack detection and mitigation system
US20050130321A1 (en) 2001-04-23 2005-06-16 Nicholson Jeremy K. Methods for analysis of spectral data and their applications
US20020184009A1 (en) 2001-05-31 2002-12-05 Heikkinen Ari P. Method and apparatus for improved voicing determination in speech signals containing high levels of jitter
US20030120484A1 (en) 2001-06-12 2003-06-26 David Wong Method and system for generating colored comfort noise in the absence of silence insertion description packets
US6941263B2 (en) 2001-06-29 2005-09-06 Microsoft Corporation Frequency domain postfiltering for quality enhancement of coded speech
US6879955B2 (en) 2001-06-29 2005-04-12 Microsoft Corporation Signal modification based on continuous time warping for low bit rate CELP coding
KR100438175B1 (en) 2001-10-23 2004-07-01 엘지전자 주식회사 Search method for codebook
CA2388439A1 (en) 2002-05-31 2003-11-30 Voiceage Corporation A method and device for efficient frame erasure concealment in linear predictive based speech codecs
JP3646939B1 (en) 2002-09-19 2005-05-11 松下電器産業株式会社 Audio decoding apparatus and audio decoding method
US7343283B2 (en) 2002-10-23 2008-03-11 Motorola, Inc. Method and apparatus for coding a noise-suppressed audio signal
US7363218B2 (en) 2002-10-25 2008-04-22 Dilithium Networks Pty. Ltd. Method and apparatus for fast CELP parameter mapping
KR100465316B1 (en) 2002-11-18 2005-01-13 한국전자통신연구원 Speech encoder and speech encoding method thereof
US7318035B2 (en) 2003-05-08 2008-01-08 Dolby Laboratories Licensing Corporation Audio coding systems and methods using spectral component coupling and spectral component regeneration
US20050091044A1 (en) 2003-10-23 2005-04-28 Nokia Corporation Method and system for pitch contour quantization in audio coding
CN1875402B (en) 2003-10-30 2012-03-21 皇家飞利浦电子股份有限公司 Audio signal encoding or decoding
CA2457988A1 (en) 2004-02-18 2005-08-18 Voiceage Corporation Methods and devices for audio compression based on acelp/tcx coding and multi-rate lattice vector quantization
FI118835B (en) 2004-02-23 2008-03-31 Nokia Corp Select end of a coding model
WO2005096274A1 (en) 2004-04-01 2005-10-13 Beijing Media Works Co., Ltd An enhanced audio encoding/decoding device and method
GB0408856D0 (en) 2004-04-21 2004-05-26 Nokia Corp Signal encoding
MXPA06012617A (en) 2004-05-17 2006-12-15 Nokia Corp Audio encoding with different coding frame lengths.
US7649988B2 (en) 2004-06-15 2010-01-19 Acoustic Technologies, Inc. Comfort noise generator using modified Doblinger noise estimate
US8160274B2 (en) 2006-02-07 2012-04-17 Bongiovi Acoustics Llc. System and method for digital signal processing
US9047860B2 (en) 2005-01-31 2015-06-02 Skype Method for concatenating frames in communication system
US7519535B2 (en) 2005-01-31 2009-04-14 Qualcomm Incorporated Frame erasure concealment in voice communications
US20070147518A1 (en) 2005-02-18 2007-06-28 Bruno Bessette Methods and devices for low-frequency emphasis during audio compression based on ACELP/TCX
US8155965B2 (en) 2005-03-11 2012-04-10 Qualcomm Incorporated Time warping frames inside the vocoder by modifying the residual
WO2006107838A1 (en) 2005-04-01 2006-10-12 Qualcomm Incorporated Systems, methods, and apparatus for highband time warping
WO2006126843A2 (en) 2005-05-26 2006-11-30 Lg Electronics Inc. Method and apparatus for decoding audio signal
US7707034B2 (en) 2005-05-31 2010-04-27 Microsoft Corporation Audio codec post-filter
PL1897085T3 (en) 2005-06-18 2017-10-31 Nokia Technologies Oy System and method for adaptive transmission of comfort noise parameters during discontinuous speech transmission
KR100851970B1 (en) 2005-07-15 2008-08-12 삼성전자주식회사 Method and apparatus for extracting ISCImportant Spectral Component of audio signal, and method and appartus for encoding/decoding audio signal with low bitrate using it
US7610197B2 (en) 2005-08-31 2009-10-27 Motorola, Inc. Method and apparatus for comfort noise generation in speech communication systems
US7720677B2 (en) 2005-11-03 2010-05-18 Coding Technologies Ab Time warped modified transform coding of audio signals
US7536299B2 (en) 2005-12-19 2009-05-19 Dolby Laboratories Licensing Corporation Correlating and decorrelating transforms for multiple description coding systems
US8255207B2 (en) 2005-12-28 2012-08-28 Voiceage Corporation Method and device for efficient frame erasure concealment in speech codecs
CN101371297A (en) * 2006-01-18 2009-02-18 Lg电子株式会社 Apparatus and method for encoding and decoding signal
US20090281812A1 (en) 2006-01-18 2009-11-12 Lg Electronics Inc. Apparatus and Method for Encoding and Decoding Signal
US8032369B2 (en) 2006-01-20 2011-10-04 Qualcomm Incorporated Arbitrary average data rates for variable rate coders
FR2897733A1 (en) 2006-02-20 2007-08-24 France Telecom Echo discriminating and attenuating method for hierarchical coder-decoder, involves attenuating echoes based on initial processing in discriminated low energy zone, and inhibiting attenuation of echoes in false alarm zone
US20070253577A1 (en) 2006-05-01 2007-11-01 Himax Technologies Limited Equalizer bank with interference reduction
WO2007138511A1 (en) 2006-05-30 2007-12-06 Koninklijke Philips Electronics N.V. Linear predictive coding of an audio signal
US7873511B2 (en) 2006-06-30 2011-01-18 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder, audio decoder and audio processor having a dynamically variable warping characteristic
JP4810335B2 (en) 2006-07-06 2011-11-09 株式会社東芝 Wideband audio signal encoding apparatus and wideband audio signal decoding apparatus
US7933770B2 (en) 2006-07-14 2011-04-26 Siemens Audiologische Technik Gmbh Method and device for coding audio data based on vector quantisation
EP2549440B1 (en) 2006-07-24 2017-01-11 Sony Corporation A hair motion compositor system and optimization techniques for use in a hair/fur graphics pipeline
US7987089B2 (en) 2006-07-31 2011-07-26 Qualcomm Incorporated Systems and methods for modifying a zero pad region of a windowed frame of an audio signal
DE102006049154B4 (en) 2006-10-18 2009-07-09 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Coding of an information signal
US20080147518A1 (en) 2006-10-18 2008-06-19 Siemens Aktiengesellschaft Method and apparatus for pharmacy inventory management and trend detection
CN102395033B (en) * 2006-12-12 2014-08-27 弗劳恩霍夫应用研究促进协会 Encoder, decoder and methods for encoding and decoding data segments representing a time-domain data stream
FR2911228A1 (en) 2007-01-05 2008-07-11 France Telecom TRANSFORMED CODING USING WINDOW WEATHER WINDOWS.
KR101379263B1 (en) * 2007-01-12 2014-03-28 삼성전자주식회사 Method and apparatus for decoding bandwidth extension
FR2911426A1 (en) 2007-01-15 2008-07-18 France Telecom MODIFICATION OF A SPEECH SIGNAL
JP4708446B2 (en) 2007-03-02 2011-06-22 パナソニック株式会社 Encoding device, decoding device and methods thereof
JP2008261904A (en) 2007-04-10 2008-10-30 Matsushita Electric Ind Co Ltd Encoding device, decoding device, encoding method and decoding method
US8630863B2 (en) 2007-04-24 2014-01-14 Samsung Electronics Co., Ltd. Method and apparatus for encoding and decoding audio/speech signal
CN101388210B (en) 2007-09-15 2012-03-07 华为技术有限公司 Coding and decoding method, coder and decoder
KR101513028B1 (en) 2007-07-02 2015-04-17 엘지전자 주식회사 broadcasting receiver and method of processing broadcast signal
US8185381B2 (en) 2007-07-19 2012-05-22 Qualcomm Incorporated Unified filter bank for performing signal conversions
CN101110214B (en) 2007-08-10 2011-08-17 北京理工大学 Speech coding method based on multiple description lattice type vector quantization technology
WO2009029032A2 (en) 2007-08-27 2009-03-05 Telefonaktiebolaget Lm Ericsson (Publ) Low-complexity spectral analysis/synthesis using selectable time resolution
US8566106B2 (en) 2007-09-11 2013-10-22 Voiceage Corporation Method and device for fast algebraic codebook search in speech and audio coding
CN101425292B (en) * 2007-11-02 2013-01-02 华为技术有限公司 Decoding method and device for audio signal
DE102007055830A1 (en) 2007-12-17 2009-06-18 Zf Friedrichshafen Ag Method and device for operating a hybrid drive of a vehicle
CN101483043A (en) 2008-01-07 2009-07-15 中兴通讯股份有限公司 Code book index encoding method based on classification, permutation and combination
CN101488344B (en) 2008-01-16 2011-09-21 华为技术有限公司 Quantitative noise leakage control method and apparatus
US8000487B2 (en) 2008-03-06 2011-08-16 Starkey Laboratories, Inc. Frequency translation by high-frequency spectral envelope warping in hearing assistance devices
EP2107556A1 (en) 2008-04-04 2009-10-07 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio transform coding using pitch correction
US8879643B2 (en) 2008-04-15 2014-11-04 Qualcomm Incorporated Data substitution scheme for oversampled data
US8768690B2 (en) 2008-06-20 2014-07-01 Qualcomm Incorporated Coding scheme selection for low-bit-rate applications
ES2654433T3 (en) 2008-07-11 2018-02-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio signal encoder, method for encoding an audio signal and computer program
EP2346030B1 (en) * 2008-07-11 2014-10-01 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder, method for encoding an audio signal and computer program
CN102105930B (en) 2008-07-11 2012-10-03 弗朗霍夫应用科学研究促进协会 Audio encoder and decoder for encoding frames of sampled audio signals
ES2683077T3 (en) 2008-07-11 2018-09-24 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder and decoder for encoding and decoding frames of a sampled audio signal
MY154452A (en) 2008-07-11 2015-06-15 Fraunhofer Ges Forschung An apparatus and a method for decoding an encoded audio signal
ES2401487T3 (en) 2008-07-11 2013-04-22 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and procedure for encoding / decoding an audio signal using a foreign signal generation switching scheme
JP5551695B2 (en) 2008-07-11 2014-07-16 フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ Speech encoder, speech decoder, speech encoding method, speech decoding method, and computer program
US8352279B2 (en) 2008-09-06 2013-01-08 Huawei Technologies Co., Ltd. Efficient temporal envelope coding approach by prediction between low band signal and high band signal
WO2010031049A1 (en) 2008-09-15 2010-03-18 GH Innovation, Inc. Improving celp post-processing for music signals
EP3640941A1 (en) * 2008-10-08 2020-04-22 Fraunhofer Gesellschaft zur Förderung der Angewand Multi-resolution switched audio encoding/decoding scheme
KR101315617B1 (en) * 2008-11-26 2013-10-08 광운대학교 산학협력단 Unified speech/audio coder(usac) processing windows sequence based mode switching
CN101770775B (en) 2008-12-31 2011-06-22 华为技术有限公司 Signal processing method and device
MX2011007925A (en) 2009-01-28 2011-08-17 Dten Forschung E V Fraunhofer Ges Zur Foeerderung Der Angewan Audio coding.
US8457975B2 (en) 2009-01-28 2013-06-04 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio decoder, audio encoder, methods for decoding and encoding an audio signal and computer program
EP2214165A3 (en) 2009-01-30 2010-09-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus, method and computer program for manipulating an audio signal comprising a transient event
PL2234103T3 (en) 2009-03-26 2012-02-29 Fraunhofer Ges Forschung Device and method for manipulating an audio signal
US8725503B2 (en) 2009-06-23 2014-05-13 Voiceage Corporation Forward time-domain aliasing cancellation with application in weighted or original signal domain
CN101958119B (en) * 2009-07-16 2012-02-29 中兴通讯股份有限公司 Audio-frequency drop-frame compensator and compensation method for modified discrete cosine transform domain
MY162251A (en) 2009-10-20 2017-05-31 Fraunhofer Ges Forschung Audio signal encoder,audio signal decoder,method for providing an encoded representation of an audio content,method for providing a decoded representation of an audio content and computer program for use in low delay applications
MY167980A (en) 2009-10-20 2018-10-09 Fraunhofer Ges Forschung Multi- mode audio codec and celp coding adapted therefore
CN102081927B (en) 2009-11-27 2012-07-18 中兴通讯股份有限公司 Layering audio coding and decoding method and system
US8423355B2 (en) 2010-03-05 2013-04-16 Motorola Mobility Llc Encoder for audio signal including generic audio and speech frames
US8428936B2 (en) 2010-03-05 2013-04-23 Motorola Mobility Llc Decoder for audio signal including generic audio and speech frames
WO2011147950A1 (en) * 2010-05-28 2011-12-01 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Low-delay unified speech and audio codec
TWI480857B (en) 2011-02-14 2015-04-11 Fraunhofer Ges Forschung Audio codec using noise synthesis during inactive phases

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI253057B (en) * 2004-12-27 2006-04-11 Quanta Comp Inc Search system and method thereof for searching code-vector of speech signal in speech encoder
TW201027517A (en) * 2008-09-30 2010-07-16 Dolby Lab Licensing Corp Transcoding of audio metadata
WO2010093224A2 (en) * 2009-02-16 2010-08-19 한국전자통신연구원 Encoding/decoding method for audio signals using adaptive sine wave pulse coding and apparatus thereof

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
ASHLEY J P ET AL: "Wideband coding of speech using a scalable pulse codebook", IEEE WORKSHOP ON SPEECH CODING, 2000, SEPTEMBER 17-20, pages 148-150 *
FISCHER T R ET AL: "ENUMERATION ENCODING AND DECODING ALGORITHMS FOR PYRAMID CUBIC LATTICE AND TRELLIS CODES", IEEE TRANSACTIONS ON INFORMATION THEORY, vol. 41, no. 6, November 1995, pages 2056-2061. *
Timothy B. Terriberry ET AL: "A Multiply-Free Enumeration of Combinations With Replacement and Sign", , 1 December 2007, http://people.xiph.org/~tterribe/pubs/cwrs.pdf *

Also Published As

Publication number Publication date
AU2012217160A1 (en) 2013-10-10
RU2013141935A (en) 2015-03-27
KR101648133B1 (en) 2016-08-23
KR20160060161A (en) 2016-05-27
AU2012217160B2 (en) 2016-02-18
HK1192793A1 (en) 2014-08-29
PL2676269T3 (en) 2016-06-30
TW201248617A (en) 2012-12-01
TWI484480B (en) 2015-05-11
KR101751354B1 (en) 2017-06-27
KR20140000322A (en) 2014-01-02
US9037457B2 (en) 2015-05-19
EP2676269B1 (en) 2015-12-16
AR085223A1 (en) 2013-09-18
JP5851525B2 (en) 2016-02-03
CA2827296C (en) 2016-08-30
BR112013020589B1 (en) 2021-09-21
EP2676269A1 (en) 2013-12-25
WO2012110480A1 (en) 2012-08-23
TW201241823A (en) 2012-10-16
AU2016200351B2 (en) 2017-11-30
MX2013009302A (en) 2013-09-13
ES2562189T3 (en) 2016-03-02
CN103548078A (en) 2014-01-29
BR112013020589A2 (en) 2018-07-10
MY160264A (en) 2017-02-28
SG192715A1 (en) 2013-09-30
JP2014507016A (en) 2014-03-20
CN103548078B (en) 2015-12-23
RU2547241C1 (en) 2015-04-10
AU2016200351A1 (en) 2016-02-11
US20130332174A1 (en) 2013-12-12
ZA201306872B (en) 2014-05-28
CA2827296A1 (en) 2012-08-23
MY159444A (en) 2017-01-13

Similar Documents

Publication Publication Date Title
TWI488176B (en) Encoding and decoding of pulse positions of tracks of an audio signal
CA2827156C (en) Encoding and decoding of pulse positions of tracks of an audio signal
KR20080110757A (en) Improved coding/decoding of a digital audio signal, in celp technique
JP4550176B2 (en) Speech coding method
JPH0844399A (en) Acoustic signal transformation encoding method and decoding method
JPH1097294A (en) Voice coding device
JP6001451B2 (en) Encoding apparatus and encoding method
JP6859379B2 (en) Equipment and methods for comfortable noise generation mode selection
JP4191502B2 (en) Signal encoding method, apparatus thereof, and signal encoding program
EP3285253B1 (en) Method for coding a speech/sound signal
JP3954050B2 (en) Speech coding apparatus and speech coding method
JP2003029798A (en) Methods, devices, programs and recording media for encoding and decoding acoustic signal
ES2821725T3 (en) Encoding and decoding of pulse positions of tracks of an audio signal
JP3576805B2 (en) Voice encoding method and system, and voice decoding method and system
JPH0981191A (en) Voice coding/decoding device and voice decoding device
JP2007179071A (en) Device and method for speech encoding
JP2007206706A (en) Speech encoding device and method