TWI488176B - Encoding and decoding of pulse positions of tracks of an audio signal - Google Patents
Encoding and decoding of pulse positions of tracks of an audio signal Download PDFInfo
- Publication number
- TWI488176B TWI488176B TW101104350A TW101104350A TWI488176B TW I488176 B TWI488176 B TW I488176B TW 101104350 A TW101104350 A TW 101104350A TW 101104350 A TW101104350 A TW 101104350A TW I488176 B TWI488176 B TW I488176B
- Authority
- TW
- Taiwan
- Prior art keywords
- pulse
- track
- positions
- tracks
- states
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 title claims description 63
- 238000000034 method Methods 0.000 claims description 48
- 238000004590 computer program Methods 0.000 claims description 10
- 238000012360 testing method Methods 0.000 claims description 10
- 239000002131 composite material Substances 0.000 claims description 5
- 238000005192 partition Methods 0.000 claims 5
- 238000001914 filtration Methods 0.000 claims 1
- 238000012545 processing Methods 0.000 description 24
- 238000004422 calculation algorithm Methods 0.000 description 13
- 239000013598 vector Substances 0.000 description 7
- 238000013459 approach Methods 0.000 description 5
- 230000003044 adaptive effect Effects 0.000 description 3
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 230000005284 excitation Effects 0.000 description 2
- 230000003595 spectral effect Effects 0.000 description 2
- 238000003491 array Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/005—Correction of errors induced by the transmission channel, if related to the coding algorithm
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/10—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/028—Noise substitution, i.e. substituting non-tonal spectral components by noisy source
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10K—SOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
- G10K11/00—Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
- G10K11/16—Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/012—Comfort noise or silence coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/022—Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/022—Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
- G10L19/025—Detection of transients or attacks for time/frequency resolution switching
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/03—Spectral prediction for preventing pre-echo; Temporary noise shaping [TNS], e.g. in MPEG2 or MPEG4
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
- G10L19/07—Line spectrum pair [LSP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/10—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
- G10L19/107—Sparse pulse excitation, e.g. by using algebraic codebook
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
- G10L19/13—Residual excited linear prediction [RELP]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/22—Mode decision, i.e. based on audio signal content versus external parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/06—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Quality & Reliability (AREA)
- Algebra (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Mathematical Physics (AREA)
- Pure & Applied Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Description
本發明係有關於音訊處理及音訊編碼領域,更明確言之,係有關於音訊信號中音軌脈衝位置之編碼與解碼技術。The present invention relates to the field of audio processing and audio coding, and more specifically to encoding and decoding techniques for the position of a track pulse in an audio signal.
音訊處理及/或音訊編碼已經多所進展。於音訊編碼中,線性預測編碼器扮演要角。當編碼音訊信號例如包含語音的音訊信號時,線性預測編碼器通道編碼音訊信號之頻譜波封之表示型態。為了達成該項目的,線性預測編碼器可決定預測濾波器係數來以編碼形式表示該聲音之頻譜波封。然後,濾波器係數可由線性預測解碼器用來藉使用該等線性預測編碼器產生合成音訊信號而解碼該編碼音訊信號。Audio processing and/or audio coding has progressed. In audio coding, a linear predictive encoder plays an important role. When encoding an audio signal, such as an audio signal containing speech, the linear predictive encoder channel encodes the representation of the spectral envelope of the audio signal. To achieve this, the linear predictive encoder can determine the predictive filter coefficients to represent the spectral envelope of the sound in encoded form. The filter coefficients can then be used by a linear predictive decoder to decode the encoded audio signal by using the linear predictive encoder to generate a composite audio signal.
線性預測編碼器之主要實例為ACELP編碼器(ACELP=代數代碼激勵線性預測編碼器)。ACELP編碼器廣為人使用,例如用於USAC(USAC=統一語音與音訊編碼)且可具有額外應用領域例如用於LD-USAC(低延遲統一語音與音訊編碼)。A prime example of a linear predictive coder is the ACELP coder (ACELP = Algebraic Code Excited Linear Predictive Encoder). ACELP encoders are widely used, for example for USAC (USAC = Unified Voice and Audio Coding) and may have additional fields of application such as for LD-USAC (Low Delay Unified Voice and Audio Coding).
ACELP編碼器通道係藉決定預測濾波器係數而編碼音訊信號。為了達成更佳編碼,ACELP編碼器基於欲編碼之音訊信號,及基於已經決定的預測濾波器係數而決定殘差信號,又稱目標信號。殘差信號例如可以是差信號,表示欲編碼之音訊信號與,藉預測濾波器係數及可能地藉音準(pitch)分析所得的適應性濾波器係數編碼之該信號部分間 之差值。然後ACELP編碼器針對編碼該殘差信號。為了達成該項目的,編碼器編碼代數碼簿參數,該等代數碼簿參數係用來表示該殘差信號。The ACELP encoder channel encodes the audio signal by determining the prediction filter coefficients. In order to achieve better coding, the ACELP encoder determines the residual signal, also known as the target signal, based on the audio signal to be encoded and based on the determined prediction filter coefficients. The residual signal may be, for example, a difference signal indicating the audio signal to be encoded and the portion of the signal encoded by the predictive filter coefficients and possibly the adaptive filter coefficients obtained by pitch analysis. The difference. The ACELP encoder then encodes the residual signal. To achieve this item, the encoder encodes the algebraic book parameters, which are used to represent the residual signal.
為了編碼該殘差信號,使用代數碼簿。通常代數碼簿包含多個音軌,例如四個音軌各自包含16音軌位置。於此種組態中,藉個別代數碼簿可表示共計4x16=64樣本位置,相對應於欲編碼的音訊信號之一子訊框的樣本數目。In order to encode the residual signal, a codebook is used. A typical digital book contains multiple tracks, for example four tracks each containing 16 track positions. In this configuration, a single generation digital book can represent a total of 4x16=64 sample positions, corresponding to the number of samples of one of the sub-frames of the audio signal to be encoded.
碼簿之該等音軌可以交插,使得碼簿之音軌0可表示該子訊框之樣本0、4、8、...、60,使得碼簿之音軌1可表示該子訊框之樣本1、5、9、...、61,使得碼簿之音軌2可表示該子訊框之樣本2、6、10、...、62,及使得碼簿之音軌3可表示該子訊框之樣本3、7、11、...、63。各個音軌可有固定數目之脈衝。或者每個音軌之脈衝數目可改變,例如取決於其它狀況而改變。脈衝例如可以是正或負,例如可以+1(正脈衝)或0(負脈衝)表示。The tracks of the codebook can be interleaved so that the track 0 of the codebook can represent the samples 0, 4, 8, ..., 60 of the subframe, so that the track 1 of the codebook can represent the message. The samples 1, 5, 9, ..., 61 of the frame enable the track 2 of the codebook to represent the samples 2, 6, 10, ..., 62 of the subframe, and the track 3 of the code book Samples 3, 7, 11, ..., 63 of the subframe can be represented. Each track can have a fixed number of pulses. Or the number of pulses per track can vary, for example depending on other conditions. The pulse can be, for example, positive or negative, for example +1 (positive pulse) or 0 (negative pulse).
為了編碼該殘差信號,當編碼時,碼簿組態可經選擇使得最佳表示該殘差信號之其餘信號。為了達成該項目的,可用脈衝可位在最佳地反映出欲編碼的信號位置之適當音軌位置。此外,可載明相對應脈衝為正或為負。To encode the residual signal, when encoded, the codebook configuration can be selected such that the remaining signals of the residual signal are optimally represented. In order to achieve this, the available pulses can be positioned to best reflect the appropriate track position of the signal position to be encoded. In addition, it can be stated that the corresponding pulse is positive or negative.
於解碼器端,ACELP解碼器將至少解碼代數碼簿參數。ACELP解碼器也可解碼適應性碼簿參數。為了決定代數碼簿參數,ACELP解碼器可針對代數碼簿之各個音軌決定多個脈衝位置。此外,ACELP解碼器也可解碼在一音軌位置的脈衝是正或負脈衝。又復,ACELP解碼器也可解碼 適應性碼簿參數。基於此項資訊,ACELP解碼器通常產生激勵信號。然後,ACELP解碼器施加該等預測濾波器係數於該激勵信號來產生合成音訊信號而獲得解碼音訊信號。At the decoder side, the ACELP decoder will decode at least the generation code parameters. The ACELP decoder can also decode adaptive codebook parameters. To determine the generation of digital book parameters, the ACELP decoder can determine multiple pulse positions for each track of the digital book. In addition, the ACELP decoder can also decode pulses at a track position that are positive or negative pulses. Again, the ACELP decoder can also decode Adaptive codebook parameters. Based on this information, ACELP decoders typically generate an excitation signal. The ACELP decoder then applies the predictive filter coefficients to the excitation signal to produce a composite audio signal to obtain a decoded audio signal.
於ACELP中,音軌上的脈衝通常係編碼如下。若該音軌具有長度16,及若此一音軌上的脈衝數目為1,則可藉其位置(4位元)及符號(1位元)共計5位元編碼脈衝位置。若該音軌具有長度16及脈衝數目為2,則第一脈衝係藉其位置(4位元)及符號(1位元)編碼。至於第二脈衝,則只需編碼位置(4位元),原因在於若該第二脈衝係在第一脈衝左方則可選擇第二脈衝之符號為正,若係在第一脈衝右方則可選擇第二脈衝之符號為正,及若係在第一脈衝之相同位置則可選擇第二脈衝為與第一脈衝相同符號。因此總計,共需9位元來編碼兩個脈衝。比起分開地各藉5位元編碼脈衝位置,因而針對每一對脈衝節省1位元。In ACELP, the pulses on the track are usually encoded as follows. If the track has a length of 16, and if the number of pulses on the track is 1, the pulse position can be encoded by a total of 5 bits by its position (4 bits) and the symbol (1 bit). If the track has a length of 16 and the number of pulses is 2, the first pulse is encoded by its position (4 bits) and the symbol (1 bit). As for the second pulse, only the coding position (4 bits) is needed, because if the second pulse is to the left of the first pulse, the sign of the second pulse can be selected to be positive, if it is to the right of the first pulse. The sign of the second pulse can be selected to be positive, and if it is at the same position of the first pulse, the second pulse can be selected to be the same symbol as the first pulse. Therefore, in total, a total of 9 bits are required to encode two pulses. The 5-bit encoded pulse position is borrowed separately from each other, thus saving 1 bit for each pair of pulses.
編碼比2更大的脈衝數目,可成對地編碼脈衝,及若脈衝數為奇數,則分開地編碼最末脈衝。如此則例如針對5脈衝的音軌,需要9+9+5=23位元。若有4音軌,則要求4 x 23=92位元來編碼具有4音軌及每個音軌5脈衝的長度64之子訊框。但若位元數可以更進一步減少則更有價值。The number of pulses larger than 2 can be encoded in pairs, and if the number of pulses is odd, the last pulse is encoded separately. Thus, for example, for a 5-pulse track, 9+9+5=23 bits are required. If there are 4 tracks, 4 x 23 = 92 bits are required to encode a sub-frame with a length of 64 tracks of 4 tracks and 5 pulses per track. But it is more valuable if the number of bits can be further reduced.
若能提供具有改良編碼或解碼構思的編碼裝置及個別的解碼裝置將極有價值,該裝置具有針對脈衝資訊表示型態使用較少位元而以改良方式來編碼或解碼脈衝資訊的手段,原因在於如此將例如減低傳輸個別地編碼音訊信號之傳輸速率,及又復,如此將例如減少儲存個別地編碼音訊 信號需要的儲存空間。It would be extremely valuable to provide an encoding device with an improved coding or decoding concept and an individual decoding device that has means for encoding or decoding pulse information in an improved manner using fewer bits for the pulsed information representation type, In this way, for example, the transmission rate of the individually encoded audio signal is reduced, and again, which will, for example, reduce the storage of individually encoded audio. The storage space required for the signal.
因此本發明之一目的係提供音訊信號之音軌脈衝的編碼及解碼之改良構思。本發明之目的可藉如申請專利範圍第1項之解碼裝置、如申請專利範圍第9項之編碼裝置、如申請專利範圍第13項之解碼方法、如申請專利範圍第14項之編碼方法、及如申請專利範圍第15項之電腦程式達成。It is therefore an object of the present invention to provide an improved concept for encoding and decoding audio track pulses of an audio signal. The object of the present invention is the decoding device of claim 1 of the patent application, the encoding device of claim 9 of the patent application, the decoding method of claim 13 of the patent application, the encoding method of claim 14 of the patent scope, And the computer program as claimed in item 15 of the patent application is reached.
依據實施例,假設一個狀態數目可供解碼裝置使用。又復假設指示與該編碼音訊信號相聯結的至少一個音軌之音軌位置總數的音軌位置數目、及指示至少一個音軌之脈衝數目的總脈衝數目可供本發明之解碼裝置使用。較佳地,該音軌位置數目及該總脈衝數目係可供與該編碼音訊信號相聯結的各個音軌使用。According to an embodiment, it is assumed that a number of states is available to the decoding device. Still further hypothesized that the number of track positions indicating the total number of track positions of at least one track associated with the encoded audio signal, and the total number of pulses indicating the number of pulses of at least one track are available for use by the decoding apparatus of the present invention. Preferably, the number of track positions and the total number of pulses are available for individual tracks associated with the encoded audio signal.
舉例言之,具有4音軌有5脈衝,各自可達成粗略地6.6 x 10^21狀態,依據實施例,可藉73位元編碼,比較前述最先進編碼器使用92位元編碼,更有效約21%。For example, there are 5 pulses with 4 tracks, each of which can achieve a coarse 6.6 x 10^21 state. According to an embodiment, 73-bit encoding can be used, and the most advanced encoder is compared with 92-bit encoding, which is more effective. twenty one%.
首先,提供一種構思,如何以有效方式編碼一音訊信號之一音軌的多個脈衝位置。後文中,該構思擴延成允許不僅編碼一音軌的脈衝位置,同時也允許編碼該脈衝是正或負。此外,然後該構思擴延成允許以有效方式針對多個音軌編碼脈衝資訊。此等構思同等適用於解碼器端。First, an idea is provided on how to encode a plurality of pulse positions of a track of an audio signal in an efficient manner. In the following, the concept is extended to allow not only the pulse position of a track to be encoded, but also to allow the pulse to be positive or negative. Furthermore, the idea is then extended to allow pulse information to be encoded for a plurality of tracks in an efficient manner. These concepts are equally applicable to the decoder side.
此外,實施例又更基於發現若編碼策略使用預定位元數目,使得在各個音軌上有相同脈衝數目的任一種組態皆要求相等位元數目。若可用位元數目為固定,則可能直接 地選定使用該給定位元量可編碼多少個脈衝,如此允許以預定品質編碼。此外,使用此種辦法,無需嘗試不等量脈衝直到達成期望的位元率,反而可以直接地選擇正確脈衝量,因而減低複雜度。Moreover, the embodiments are further based on the finding that if the encoding strategy uses a predetermined number of bits, any configuration having the same number of pulses on each track requires an equal number of bits. If the number of available bits is fixed, it may be direct The number of pulses that can be encoded using the given positioning element is selected so as to allow encoding with a predetermined quality. Moreover, with this approach, there is no need to try unequal pulses until the desired bit rate is achieved, instead the direct pulse amount can be directly selected, thus reducing complexity.
基於前述假設,可編碼及/或解碼一音訊信號訊框之一音軌的多個脈衝位置。Based on the foregoing assumptions, a plurality of pulse positions of one of the tracks of an audio signal frame can be encoded and/or decoded.
雖然本發明可採用來編碼或解碼任一種音訊信號,例如語音信號或樂音信號,本發明特別可用於編碼或解碼語音信號。Although the present invention can be used to encode or decode any type of audio signal, such as a voice signal or a tone signal, the present invention is particularly useful for encoding or decoding a voice signal.
於另一實施例中,脈衝資訊解碼器又更適用以使用音軌位置數目、總脈衝數目、及狀態數目來解碼多個脈衝符號,其中該等脈衝符號中之各一者指示多個脈衝中之一者的符號。信號解碼器可適用以又更使用多個脈衝符號,藉產生合成音訊信號而解碼該編碼音訊信號。In another embodiment, the pulse information decoder is further adapted to decode a plurality of pulse symbols using the number of track positions, the total number of pulses, and the number of states, wherein each of the pulse symbols indicates a plurality of pulses One of the symbols. The signal decoder can be adapted to use a plurality of pulse symbols to decode the encoded audio signal by generating a composite audio signal.
依據又一實施例,其中該等一或多個音軌可包含至少一個最末音軌及一或多個其它音軌,該脈衝資訊解碼器可適用以從該狀態數目產生一第一子狀態數目及一第二子狀態數目。該脈衝資訊解碼器可經組配來基於該第一子狀態數目而解碼該等脈衝位置之一第一群組,及該脈衝資訊解碼器更可經組配來基於該第二子狀態數目而解碼該等脈衝位置之一第二群組。該等脈衝位置之該第二群組可只包含指示該最末音軌之音軌位置的脈衝位置。該等脈衝位置之該第一群組可只包含指示該等一或多個其它音軌之音軌位置的脈衝位置。In accordance with yet another embodiment, wherein the one or more tracks can include at least one last track and one or more other tracks, the pulse information decoder can be adapted to generate a first sub-state from the number of states The number and the number of second sub-states. The pulse information decoder can be configured to decode a first group of the pulse positions based on the first number of sub-states, and the pulse information decoder can be further configured to be based on the second number of sub-states A second group of one of the pulse positions is decoded. The second group of the pulse positions may only include a pulse position indicating the position of the track of the last track. The first group of the pulse positions may only include pulse positions indicative of the track positions of the one or more other tracks.
依據另一實施例,脈衝資訊解碼器可經組配來藉將該狀態數目除以f(pk ,N)而獲得一整數部分及一餘數作為除法結果來產生該第一子狀態數目及該第二子狀態數目,其中該整數部分為該第一子狀態數目,及其中該餘數為該第二子狀態數目,其中pk 指示針對該等一或多個音軌中之各一者的脈衝數目,及其中N指示針對該等一或多個音軌中之各一者的音軌位置數目。此處,f(pk ,N)乃返回於長度N具有pk 脈衝的音軌中可達成的狀態數目之函式。According to another embodiment, the pulse information decoder may be configured to obtain an integer part and a remainder as a division result by dividing the number of states by f(p k , N) to generate the first sub-state number and the a second number of sub-states, wherein the integer portion is the first sub-state number, and wherein the remainder is the second sub-state number, wherein pk indicates a pulse for each of the one or more tracks The number, and N thereof, indicates the number of track positions for each of the one or more tracks. Here, f(p k , N) is a function that returns the number of states achievable in the track of length N having a p k pulse.
於另一實施例中,該脈衝資訊解碼器可適用以進行一測試比較該狀態數目或一已更新狀態數目與一臨界值。In another embodiment, the pulse information decoder is adapted to perform a test to compare the number of states or an updated number of states with a threshold.
該脈衝資訊解碼器可適用以藉比較該狀態數目或一已更新狀態數目是否大於、大於或等於、小於、或小於或等於該臨界值而進行該測試,及其中該脈衝資訊解碼器又更適用以取決於該測試結果而更新該狀態數目或一已更新狀態數目。The pulse information decoder can be adapted to perform the test by comparing whether the number of states or an updated number of states is greater than, greater than or equal to, less than, or less than or equal to the threshold, and wherein the pulse information decoder is more suitable The number of states or an updated number of states is updated depending on the test result.
於一實施例中,該脈衝資訊解碼器可經組配來針對該等多個音軌中之一者的各個音軌位置,比較該狀態數目或該已更新狀態數目與該臨界值。In one embodiment, the pulse information decoder can be configured to compare the number of states or the number of updated states to the threshold for each track position of one of the plurality of tracks.
依據一實施例,該脈衝資訊解碼器可經組配來劃分該等音軌中之一者成為包含該等多個音軌位置中之至少兩個音軌位置之一第一音軌區劃,及成為包含該等多個音軌位置中之至少另兩個音軌位置之一第二音軌區劃。該脈衝資訊解碼器可經組配來基於該狀態數目而產生一第一子狀態數目及一第二子狀態數目。此外,該脈衝資訊解碼器可經 組配來基於該第一子狀態數目,解碼與該第一音軌區劃相聯結的脈衝位置之一第一群組。又復,該脈衝資訊解碼器可經組配來基於該第二子狀態數目,解碼與該第二音軌區劃相聯結的脈衝位置之一第二群組。According to an embodiment, the pulse information decoder may be configured to divide one of the audio tracks into a first track region including one of the at least two track positions of the plurality of track positions, and A second track zone that includes one of the at least two other track positions of the plurality of track positions. The pulse information decoder can be configured to generate a first number of sub-states and a second sub-state number based on the number of states. In addition, the pulse information decoder can be And assigning to decode a first group of pulse positions associated with the first track zone based on the first number of substates. Still further, the pulse information decoder can be configured to decode a second group of one of the pulse locations associated with the second track region based on the second number of substates.
依據一實施例,提出一種編碼一音訊信號之裝置。該裝置包含一信號處理器適用以決定與該音訊信號相聯結的多個預測濾波器係數,用以基於該音訊信號及該等多個預測濾波器係數而產生一殘差信號。此外,該裝置包含一脈衝資訊編碼器適用以編碼與一或多個音軌相關的多個脈衝位置來編碼該音訊信號,該等一或多個音軌係與該殘差信號相聯結。該等音軌中之各一者具有多個音軌位置及多個脈衝。該等脈衝位置中之各一者指示該等音軌中之一者的該等音軌位置中之一者來指示該音軌之該等脈衝中之一者的位置。該脈衝資訊編碼器係經組配來藉產生一狀態數目而編碼該等多個脈衝位置,使得該等脈衝位置唯有基於該狀態數目、指示該等音軌中之至少一者的音軌位置總數之一音軌位置數目、及指示該等音軌中之至少一者的脈衝總數之一總脈衝數目才能被解碼。According to an embodiment, an apparatus for encoding an audio signal is presented. The apparatus includes a signal processor adapted to determine a plurality of prediction filter coefficients coupled to the audio signal for generating a residual signal based on the audio signal and the plurality of prediction filter coefficients. Additionally, the apparatus includes a pulse information encoder adapted to encode the plurality of pulse positions associated with one or more tracks to encode the audio signal, the one or more tracks being associated with the residual signal. Each of the tracks has a plurality of track positions and a plurality of pulses. Each of the pulse positions indicates one of the track positions of one of the tracks to indicate the position of one of the pulses of the track. The pulse information encoder is configured to encode the plurality of pulse positions by generating a number of states such that the pulse positions are only based on the number of states, indicating a track position of at least one of the tracks The number of track positions, one of the total number, and the total number of pulses indicating the total number of pulses of at least one of the tracks can be decoded.
依據另一實施例,該脈衝資訊編碼器可適用以編碼多個脈衝符號,其中該等脈衝符號中之各一者指示該等多個脈衝中之一者之一符號。該脈衝資訊編碼器更可經組配來藉產生該狀態數目而編碼該等多個脈衝符號,使得該等脈衝符號唯有基於該狀態數目、指示該等音軌中之至少一者的音軌位置總數之該音軌位置數目、及該總脈衝數目才能 被解碼。In accordance with another embodiment, the pulse information encoder is operative to encode a plurality of pulse symbols, wherein each of the pulse symbols indicates one of the plurality of pulses. The pulse information encoder may be further configured to encode the plurality of pulse symbols by generating the number of states, such that the pulse symbols are only based on the number of states, indicating a track of at least one of the tracks The total number of positions of the track and the total number of pulses Is decoded.
於一實施例中,該脈衝資訊編碼器係經組配來針對該等音軌中之一者的各個音軌位置,將一整數值加至針對於一音軌位置的各個脈衝之一中間數目來獲得該狀態數目。In one embodiment, the pulse information encoder is configured to add an integer value to an intermediate number of each of the pulses for a track position for each track position of one of the tracks. To get the number of states.
依據另一實施例,該脈衝資訊編碼器可經組配來劃分該等音軌中之一者成為包含該等多個音軌位置中之至少兩個音軌位置之一第一音軌區劃,及成為包含該等多個音軌位置中之至少另兩個音軌位置之一第二音軌區劃。此外,該脈衝資訊編碼器可經組配來編碼與該第一區劃相聯結的之一第一子狀態數目。又復,該脈衝資訊編碼器可經組配來編碼與該第二區劃相聯結的之一第二子狀態數目。又,該脈衝資訊編碼器可經組配來組合該第一子狀態數目與該第二子狀態數目而獲得該狀態數目。In accordance with another embodiment, the pulse information encoder can be configured to divide one of the audio tracks into a first track zone that includes one of the at least two of the plurality of track positions. And becoming a second track zone comprising one of the at least two other track positions of the plurality of track positions. Additionally, the pulse information encoder can be assembled to encode a first number of sub-states associated with the first zone. Further, the pulse information encoder can be assembled to encode a second sub-state number associated with the second zone. Moreover, the pulse information encoder can be assembled to combine the first sub-state number and the second sub-state number to obtain the state number.
後文中,將就附圖以進一步細節描述本發明之實施例,附圖中:第1圖顯示依據一實施例用以解碼一編碼音訊信號之裝置,第2圖顯示依據一實施例用以編碼一音訊信號之裝置,第3圖顯示針對具有兩個無符號脈衝及三個音軌位置之一音軌的全部可能組態,第4圖顯示針對具有一個有符號脈衝及兩個音軌位置之一音軌的全部可能組態,第5圖顯示針對具有兩個有符號脈衝及兩個音軌位置 之一音軌的全部可能組態,第6圖為例示說明一實施例之流程圖,闡釋依據一實施例藉脈衝資訊解碼器所進行的處理步驟,及第7圖為例示說明一實施例之流程圖,該流程圖闡釋依據一實施例藉脈衝資訊編碼器所進行的處理步驟。In the following, embodiments of the invention will be described in further detail with reference to the accompanying drawings in which: FIG. 1 shows an apparatus for decoding an encoded audio signal in accordance with an embodiment, and FIG. 2 shows an encoding for use in accordance with an embodiment. An apparatus for an audio signal, Figure 3 shows all possible configurations for a track with two unsigned pulses and three track positions, and Figure 4 shows a position with one signed pulse and two tracks. All possible configurations of one track, Figure 5 shows for two signed pulses and two track positions All of the possible configurations of one of the tracks, FIG. 6 is a flow chart illustrating an embodiment, illustrating the processing steps performed by the pulse information decoder in accordance with an embodiment, and FIG. 7 is an illustration of an embodiment. Flowchart, which illustrates the processing steps performed by the pulse information encoder in accordance with an embodiment.
第1圖例示說明解碼一編碼音訊信號之裝置,其中一或多個音軌係與該編碼音訊信號相聯結,該等音軌中之各一者具有多個音軌位置及多個脈衝。Figure 1 illustrates an apparatus for decoding an encoded audio signal, wherein one or more audio tracks are associated with the encoded audio signal, each of the audio tracks having a plurality of track positions and a plurality of pulses.
該裝置包含一脈衝資訊解碼器110及一信號解碼器120。該脈衝資訊解碼器110係適用以解碼多個脈衝位置。該等脈衝位置中之各一者指示該等音軌中之一者的該等音軌位置中之一者來指示該音軌之該等脈衝中之一者的位置。The device includes a pulse information decoder 110 and a signal decoder 120. The pulse information decoder 110 is adapted to decode a plurality of pulse positions. Each of the pulse positions indicates one of the track positions of one of the tracks to indicate the position of one of the pulses of the track.
該脈衝資訊解碼器110係經組配來藉由使用指示該等音軌中之至少一者的音軌位置總數之一音軌位置數目、指示該等音軌中之至少一者的脈衝總數之一總脈衝數目、及一個狀態數目而解碼該等多個脈衝位置。The pulse information decoder 110 is configured to indicate the total number of pulses of at least one of the tracks by using a number of track positions indicating the total number of track positions of at least one of the tracks. The plurality of pulse positions are decoded by a total number of pulses and a number of states.
該信號解碼器120係適用於藉由使用該等多個脈衝位置及與該編碼音訊信號相聯結的多個預測濾波器係數而產生一合成音訊信號來解碼該編碼音訊信號。The signal decoder 120 is adapted to decode the encoded audio signal by generating a composite audio signal using the plurality of pulse positions and a plurality of prediction filter coefficients coupled to the encoded audio signal.
狀態數目為已經依據後述實施例藉編碼器編碼之數目。狀態數目例如包含於一精簡表示型態中有關多個脈衝位置之資訊,該表示型態例如為要求少數位元的表示型 態,及當有關該音軌位置數目及總脈衝數目之資訊在該解碼器為可利用時可被解碼的表示型態。The number of states is the number of codes that have been encoded by the encoder in accordance with the embodiments described later. The number of states is, for example, included in a reduced representation of information about a plurality of pulse positions, such as a representation requiring a small number of bits. State, and a representation that can be decoded when the information about the number of track positions and the total number of pulses is available to the decoder.
於一實施例中,該音訊信號之一個音軌或各個音軌之該音軌位置數目及/或總脈衝數目在該解碼器為可資利用,原因在於該音軌位置數目及/或總脈衝數目為不變的靜態數值且為接收器所已知。舉例言之,針對各個音軌,該音軌位置數目可以經常性為16,及總脈衝數目可以經常性為4。In one embodiment, the number of track positions and/or the total number of pulses of one track or each track of the audio signal is available at the decoder due to the number of track positions and/or total pulses. The number is a constant static value and is known to the receiver. For example, for each track, the number of track positions can be often 16 and the total number of pulses can be 4 frequently.
於另一實施例中,該音訊信號之一個音軌或各個音軌之該音軌位置數目及/或總脈衝數目例如可藉編碼裝置而明確地傳輸至該解碼裝置。In another embodiment, the number of track positions and/or the total number of pulses of one track or each track of the audio signal can be explicitly transmitted to the decoding device by, for example, an encoding device.
於又一實施例中,解碼器可決定該音訊信號之一個音軌或各個音軌之該音軌位置數目及/或總脈衝數目,該決定方式係藉分析並不明確地陳述該音軌位置數目及/或總脈衝數目的其它參數,反而從該等其它參數可推衍出該音軌位置數目及/或總脈衝數目。In still another embodiment, the decoder may determine the number of track positions and/or the total number of pulses of one track or each track of the audio signal, and the decision manner does not explicitly state the track position by analysis. The number and/or other parameters of the total number of pulses may instead derive from the other parameters the number of track positions and/or the total number of pulses.
於其它實施例中,解碼器可分析可利用來推衍出該音訊信號之一個音軌或各個音軌之該音軌位置數目及/或總脈衝數目之其它資料。In other embodiments, the decoder may analyze other data that may be utilized to derive a track of the audio signal or the number of track positions and/or total number of pulses for each track.
於更一實施例中,該脈衝資訊解碼器可適用以也解碼一脈衝為正脈衝或負脈衝。In a further embodiment, the pulse information decoder is adapted to also decode a pulse as a positive or negative pulse.
於另一實施例中,該脈衝資訊解碼器又更可適用以解碼脈衝資訊,包含有關多個音軌之脈衝的資訊。脈衝資訊例如可以是有關於一音軌中的脈衝位置資訊及/或一脈衝 為正脈衝或負脈衝之資訊。In another embodiment, the pulse information decoder is further adapted to decode pulse information, including information about pulses of a plurality of tracks. The pulse information may be, for example, related to pulse position information and/or a pulse in a track. Information for positive or negative pulses.
第2圖例示說明編碼音訊信號之裝置,包含信號處理器210及脈衝資訊編碼器220。FIG. 2 illustrates an apparatus for encoding an audio signal, including a signal processor 210 and a pulse information encoder 220.
信號處理器210係適用以決定與該音訊信號相聯結的多個預測濾波器係數,用以基於該音訊信號及該等多個預測濾波器係數而產生一殘差信號。The signal processor 210 is adapted to determine a plurality of prediction filter coefficients associated with the audio signal for generating a residual signal based on the audio signal and the plurality of prediction filter coefficients.
脈衝資訊編碼器220係適用以編碼與一或多個音軌相關的多個脈衝位置來編碼該音訊信號。該等一或多個音軌係與由信號處理器210所產生的該殘差信號相聯結。該等音軌中之各一者具有多個音軌位置及多個脈衝。此外,該等脈衝位置中之各一者指示該等音軌中之一者的該等音軌位置中之一者來指示該音軌之該等脈衝中之一者的位置。The pulse information encoder 220 is adapted to encode the plurality of pulse positions associated with one or more tracks to encode the audio signal. The one or more tracks are coupled to the residual signal generated by signal processor 210. Each of the tracks has a plurality of track positions and a plurality of pulses. Additionally, each of the pulse positions indicates one of the track positions of one of the tracks to indicate the position of one of the pulses of the track.
該脈衝資訊編碼器220係經組配來藉產生一狀態數目而編碼該等多個脈衝位置,使得該等脈衝位置唯有基於該狀態數目、指示該等音軌中之至少一者的音軌位置總數之一音軌位置數目、及指示該等音軌中之至少一者的脈衝總數之一總脈衝數目才能被解碼。The pulse information encoder 220 is configured to encode the plurality of pulse positions by generating a number of states such that the pulse positions are only based on the number of states, indicating a track of at least one of the tracks The number of track positions, one of the total number of positions, and the total number of pulses indicating the total number of pulses of at least one of the tracks can be decoded.
後文中,呈示有關藉產生一狀態數目而編碼脈衝位置及可能地編碼脈衝符號(正脈衝或負脈衝)之本發明之實施例的基本構思。In the following, the basic idea of an embodiment of the invention for encoding a pulse position and possibly encoding a pulse symbol (positive pulse or negative pulse) by generating a number of states is presented.
本發明之實施例的編碼原理係植基於下述發現,若考慮於具有n個音軌位置之一音軌中的k個脈衝之全部可能組態之狀態列舉,則係足以編碼一音軌之脈衝的實際狀態。藉儘可能少數位元編碼此種狀態,提供期望的精簡編碼。 藉此,呈示狀態列舉之構思,其中脈衝位置及可能也脈衝符號的各叢表示一個狀態,及各個狀態係獨特地列舉。The coding principle of an embodiment of the present invention is based on the finding that it is sufficient to encode a track if one considers a state list of all possible configurations of k pulses in one of the n track positions. The actual state of the pulse. The desired reduced coding is provided by encoding such a state with as few bits as possible. Thereby, the concept of the presentation state is listed, wherein the pulse positions and the clusters which may also be pulse symbols represent one state, and each state is uniquely enumerated.
第3圖針對簡單情況例示說明此點,於該處闡釋全部可能的組態,此時考慮具有2脈衝及3音軌位置之一音軌。2脈衝可位在相同音軌位置。於第3圖之實例中,不考慮脈衝的符號(例如脈衝是正或負),例如於此一實例中,全部脈衝皆可視為正脈衝。Figure 3 illustrates this point for a simple case where all possible configurations are explained, in which case one of the 2 pulse and 3 track positions is considered. 2 pulses can be placed at the same track position. In the example of Figure 3, the sign of the pulse (e.g., the pulse is positive or negative) is not considered. For example, in this example, all pulses can be considered as positive pulses.
於第3圖中,例示說明針對位在具有3音軌位置(第3圖中:音軌位置1、2及3)之一音軌中兩個無方向性脈衝的全部可能組態。只有六個不同的可能狀態(第3圖中標示為0至5),描述脈衝如何分布在該音軌。藉此,使用0至5之範圍的狀態數目來描述所呈示的實際組態即足。舉例言之,若第3圖實例的狀態數目具有數值(4),且若解碼器知曉編碼方案,則解碼器可獲得結論狀態數目=4,表示該音軌具有一個脈衝在音軌位置0,及另一個脈衝在音軌位置2。於是於第3圖之實例中,3位元即足以編碼狀態數目來識別第3圖之實例的六個不同狀態中之一者。In Fig. 3, all possible configurations for two non-directional pulses in a track having one of three track positions (Fig. 3: track positions 1, 2 and 3) are illustrated. There are only six different possible states (labeled 0 to 5 in Figure 3) that describe how the pulses are distributed over the track. Thereby, the number of states in the range of 0 to 5 is used to describe the actual configuration presented. For example, if the number of states of the example of FIG. 3 has a value (4), and if the decoder knows the encoding scheme, the decoder can obtain the number of conclusion states = 4, indicating that the track has a pulse at the track position 0, And another pulse at track position 2. Thus, in the example of Figure 3, the 3-bit is sufficient to encode the number of states to identify one of the six different states of the example of Figure 3.
第4圖例示說明針對位在具有2音軌位置(第4圖中:音軌位置1及2)之一音軌中一個方向性脈衝的全部可能組態。第4圖中考慮脈衝之符號(例如脈衝是正或負)。有四個不同的可能狀態(第4圖中標示為0至3),描述脈衝如何分布在該音軌,及也描述其符號(正或負)。使用0至3之範圍的狀態數目來描述所呈示的實際組態即足。舉例言之,若第4圖實例的狀態數目具有數值(2),且若解碼器知曉編碼方案, 則解碼器可獲得結論狀態數目=2,表示該音軌具有一個脈衝在音軌位置1,及該脈衝為正脈衝。Figure 4 illustrates all possible configurations for a directional pulse in a track having a position of 2 tracks (in Figure 4: track positions 1 and 2). The sign of the pulse is considered in Figure 4 (for example, the pulse is positive or negative). There are four different possible states (labeled 0 to 3 in Figure 4) that describe how the pulse is distributed over the track and also describe its sign (positive or negative). The number of states in the range 0 to 3 is used to describe the actual configuration presented. For example, if the number of states of the example of FIG. 4 has a value (2), and if the decoder knows the encoding scheme, Then the decoder can get the number of conclusion states = 2, indicating that the track has a pulse at the track position 1, and the pulse is a positive pulse.
第5圖例示說明又另一情況,於該處闡釋當考慮具有2脈衝及2音軌位置的一音軌時的全部可能組態。脈衝可位在相同音軌位置。於第5圖所示實例中,考慮脈衝之符號(例如脈衝是正或負)。假設在同一個音軌位置的脈衝具有相同符號(例如在同一個音軌位置的音軌脈衝乃全正或全負)。Figure 5 illustrates yet another scenario where all possible configurations when considering a track with 2 pulses and 2 track positions are explained. Pulses can be placed at the same track position. In the example shown in Figure 5, consider the sign of the pulse (for example, the pulse is positive or negative). It is assumed that the pulses at the same track position have the same sign (for example, the track pulses at the same track position are all positive or negative).
第5圖中例示說明位在具有2音軌位置(第5圖中:音軌位置1及2)之一音軌中兩個有符號脈衝(例如脈衝是正或負)的全部可能組態。只有八個不同的可能狀態(第5圖中標示為0至7),描述脈衝如何分布在該音軌。藉此,使用0至7之範圍的狀態數目來描述所呈示的實際組態即足。舉例言之,若第5圖實例的狀態數目具有數值(3),且若解碼器知曉編碼方案,則解碼器可獲得結論狀態數目=3,表示該音軌具有一個脈衝在音軌位置0,及另一個脈衝在音軌位置1而該脈衝為負。於是於第5圖之實例中,3位元即足以編碼狀態數目來識別第5圖之實例的八個不同狀態中之一者。Figure 5 illustrates all possible configurations of two signed pulses (e.g., positive or negative pulses) in a track having one of two track positions (Fig. 5: track positions 1 and 2). There are only eight different possible states (labeled 0 to 7 in Figure 5) that describe how the pulses are distributed over the track. Thereby, the number of states in the range of 0 to 7 is used to describe the actual configuration presented. For example, if the number of states of the example of FIG. 5 has a value (3), and if the decoder knows the encoding scheme, the decoder can obtain the number of conclusion states = 3, indicating that the track has a pulse at the track position 0, And another pulse is at track position 1 and the pulse is negative. Thus, in the example of Figure 5, the 3-bit is sufficient to encode the number of states to identify one of the eight different states of the example of Figure 5.
於ACELP中,殘差信號可藉固定數目的有符號脈衝編碼。如前述,脈衝例如可分布在四個交插音軌,使得音軌0含有位置mod(n,4)==0,音軌=1含有位置mod(n,4)==1,等等。各個音軌可有經預先界定的有符號單位脈衝數目,脈衝可重疊,但重疊時脈衝有相同符號。In ACELP, the residual signal can be encoded by a fixed number of signed pulses. As mentioned above, the pulses can be distributed, for example, over four interlaced tracks such that track 0 contains position mod(n, 4) = = 0, track = 1 contains position mod(n, 4) = = 1, and so on. Each track may have a pre-defined number of signed unit pulses, the pulses may overlap, but the pulses have the same sign when overlapped.
藉編碼脈衝,須達成從脈衝位置及其符號對映至使用最小可能的位元數量之表示型態。此外,脈衝編碼須具有 固定的位元耗用量,亦即任何脈衝叢具有相等數目之位元。By means of the coded pulse, a representation from the pulse position and its sign to the smallest possible number of bits must be achieved. In addition, the pulse code must have The fixed bit consumption, that is, any pulse bundle has an equal number of bits.
各音軌係首先獨立地編碼,及然後各個音軌之狀態被組合成一個數目,表示整個子訊框之狀態。此一辦法給予數學上最佳的位元耗用,給定全部狀態有相等機率,及位元耗用量乃固定。Each track is first independently encoded, and then the states of the individual tracks are combined into a single number representing the state of the entire sub-frame. This method gives the mathematically optimal bit consumption, giving all states a equal probability, and the bit consumption is fixed.
狀態列舉構思可運用不同狀態叢之精簡表示型態說明:設欲編碼之殘差信號為xn
。假設考慮例如代數碼簿之四個交插音軌,則第一音軌具有樣本x0
、x4
、x8
、...xN-4
,第二音軌具有樣本x1
、x5
、x9
、...xN-3
等。假設第一音軌係使用一個有符號單位脈衝量化及T=8,故音軌長度為2(T=欲編碼之殘差信號長度(樣本)。若t=8,及若4音軌係用來編碼殘差信號,則4音軌中之各一者具有2音軌位置。舉例言之,第一音軌可考慮具有兩個音軌位置x0及x4。則第一音軌之脈衝出現在下列中之任一者:
此一組態有四個不同狀態。This configuration has four different states.
同理,若第一音軌有兩個脈衝,第一音軌具有兩個音軌位置x0及x4。則脈衝可分配在下列脈衝叢:
因而此一組態有8狀態。Thus this configuration has 8 states.
若殘差信號之長度擴延至T=12,則4音軌各自有3音軌位置。第一音軌獲得多一個樣本,現在有音軌位置x0、x4及x8,使得具有:
上表表示若x8=0(x8不具脈衝)則針對x0及x4有8不同狀態;若x8=1(x8具正脈衝)則針對x0及x4有不同狀態;若x8=-1(x8具負脈衝)則針對x0及x4有不同狀態;若x8=2(x8具兩個正脈衝)則針對x0及x4有一個狀態;及若x8=-2(x8具兩個負脈衝)則針對x0及x4有一個狀態。The above table shows that if x8=0 (x8 does not have a pulse), there are 8 different states for x0 and x4; if x8=1 (x8 has a positive pulse), there are different states for x0 and x4; if x8=-1 (x8 is negative) Pulse) has different states for x0 and x4; if x8=2 (x8 has two positive pulses) then there is one state for x0 and x4; and if x8=-2 (x8 has two negative pulses) then for x0 and X4 has a status.
此處,從先前二表獲得第一列之狀態數目。藉將狀態數目加至第一列,發現此一組態共有18狀態。Here, the number of states of the first column is obtained from the previous two tables. By adding the number of states to the first column, it is found that this configuration has a total of 18 states.
於T=12實例中,5位元足夠編碼全部18不同的可能狀態。然後編碼器例如從範圍[0,...,17]選定狀態數目來載明18組態中之一者。若解碼器知曉編碼方案,例如若解碼器知曉哪個狀態數目表示哪個組態,則可針對一音軌解碼脈衝位置及脈衝符號。In the T=12 example, 5 bits are sufficient to encode all 18 different possible states. The encoder then sets one of the 18 configurations, for example, from the range [0,...,17] selected state number. If the decoder is aware of the encoding scheme, for example, if the decoder knows which state number represents which configuration, the pulse position and pulse symbol can be decoded for a track.
後文中,將提供依據實施例之適當編碼方法及相對應解碼方法。依據實施例提供編碼裝置,其係經組配來執行後文呈示之編碼方法中之一者。此外,依據額外實施例提供解碼裝置,其係經組配來執行後文呈示之解碼方法中之一者。Hereinafter, an appropriate encoding method and a corresponding decoding method according to the embodiments will be provided. An encoding apparatus is provided in accordance with an embodiment that is configured to perform one of the encoding methods presented later. Moreover, in accordance with additional embodiments, a decoding apparatus is provided that is configured to perform one of the decoding methods of the subsequent rendering.
於實施例中,為了產生狀態數目或解碼狀態數目,可計算具有p脈衝之N個音軌位置的可能組態數目。In an embodiment, to generate a number of states or a number of decoding states, a number of possible configurations of N track positions with p-pulses can be calculated.
脈衝可加符號,及可採用遞歸公式,計算針對具有N個音軌位置及p個有符號脈衝(脈衝可以是正或負,但在相同音軌位置的脈衝具有相同符號)的一音軌之狀態數目f
(p
,N
),其中遞歸公式f
(p
,N
)定義為:
初始條件為
因具有一或多個脈衝之單一位置要求一個位元(2狀態)來用於符號。遞歸公式係用於全部不同叢之摘要。A single bit (2 state) is required for a symbol because of a single position with one or more pulses. The recursive formula is used for summaries of all the different bundles.
亦即,給定p脈衝,目前位置可具有qN =0至p脈衝,故其餘N-1位置有p-qN 脈衝。於目前位置及其餘N-1位置的狀態數目相乘來獲得具此等脈衝組合之狀態數目,組合經加總來獲得狀態總數。That is, given a p pulse, the current position can have a q N =0 to p pulse, so the remaining N-1 positions have a pq N pulse. The number of states at the current position and the remaining N-1 positions are multiplied to obtain the number of states with such pulse combinations, and the combinations are summed to obtain the total number of states.
於實施例中,遞歸函式可藉迭代重複演算法計算,其中遞歸係以迭代重複置換。In an embodiment, the recursive function can be computed by an iterative iterative algorithm, where the recursion is repeated by iterative repetition.
因f (p ,N )之評估就即時應用而言於數值上相當複雜,依據若干實施例,可採用詢查表來計算f (p ,N )。依據若干實施例,該表可已經離線計算。Since the evaluation of f ( p , N ) is quite complex in terms of immediate application, according to several embodiments, an inquiry table can be used to calculate f ( p , N ). According to several embodiments, the table may have been calculated offline.
後文中,提出額外構思用於狀態數目之編碼與解碼: 設f (p ,N )表示具有p個有符號脈衝之N個音軌位置的可能組態數目。In the following, additional ideas are proposed for the encoding and decoding of the number of states: Let f ( p , N ) denote the number of possible configurations of N track positions with p signed pulses.
脈衝資訊編碼器現在可分析音軌:若在該音軌的第一位置不具脈衝,則其餘N-1位置具有p個有符號脈衝,為了描述此叢,只需要f (p ,N -1)狀態。The pulse information encoder can now analyze the audio track: if there is no pulse at the first position of the track, the remaining N-1 positions have p signed pulses. To describe this bundle, only f ( p , N -1) is needed. status.
否則,若第一位置具有一或多個脈衝,則脈衝資訊編碼器可定義總狀態係大於f (p ,N -1)。Otherwise, if the first location has one or more pulses, the pulse information encoder can define that the total state is greater than f ( p , N -1).
然後,在脈衝資訊解碼器,脈衝資訊解碼器例如可始於最末位置,及比較該狀態與一臨界值例如f (p ,N -1)。若該狀態為較大,則脈衝資訊解碼器可決定最末位置具有至少一個脈衝。然後脈衝資訊解碼器可從該狀態扣除f (p ,N -1)來獲得已更新狀態數目及將剩餘脈衝數目減1來更新狀態。Then, in the pulse information decoder, the pulse information decoder can, for example, start at the last position and compare the state with a threshold value such as f ( p , N -1). If the state is large, the pulse information decoder can determine that the last position has at least one pulse. The pulse information decoder can then deduct f ( p , N -1) from this state to obtain the number of updated states and decrement the number of remaining pulses by one to update the state.
否則,若最末位置沒有脈衝,則脈衝資訊解碼器可將剩餘位置數目減1。重複此一程序直到沒有脈衝剩下提供無符號的脈衝位置。Otherwise, if there is no pulse at the last position, the pulse information decoder can decrement the number of remaining positions by one. This procedure is repeated until no pulse is left to provide an unsigned pulse position.
為了也將脈衝符號列入考慮,脈衝資訊編碼器可以最低狀態位元編碼脈衝。於另一實施例中,脈衝資訊編碼器可以最高剩餘狀態位元編碼該符號。但以最低位元編碼脈衝符號為佳,原因在於就整數計算而言更容易處理。In order to also take into account the pulse symbol, the pulse information encoder can encode the pulse with the lowest state bit. In another embodiment, the pulse information encoder can encode the symbol with the highest remaining status bit. However, the lowest bit encoded pulse symbol is preferred because it is easier to handle in terms of integer calculations.
在脈衝資訊解碼器中,若找到給定位置之第一脈衝,則脈衝符號係由最末位元決定。然後,剩餘狀態向右移位一步驟來獲得已更新狀態數目。In the pulse information decoder, if the first pulse of a given position is found, the pulse symbol is determined by the last bit. Then, the remaining state is shifted to the right by one step to obtain the updated number of states.
於一實施例中,脈衝資訊解碼器係經組配來應用下列解碼演算法。於此解碼演算法中,於逐一步驟辦法中,針 對各個音軌位置例如前後相續,狀態數目或已更新狀態數目係與臨界值例如與f(p,k-1)比較。In one embodiment, the pulse information decoder is assembled to apply the following decoding algorithms. In this decoding algorithm, in the step by step method, the needle For each track position, for example, before and after, the number of states or the number of updated states is compared with a threshold value, for example, with f(p, k-1).
依據一實施例,提供脈衝資訊解碼器演算法: According to an embodiment, a pulse information decoder algorithm is provided:
依據一實施例,有關脈衝資訊,脈衝資訊編碼器係經組配來施加下列編碼演算法。脈衝資訊編碼器進行與脈衝資訊解碼器之相同步驟但順序相反。According to an embodiment, with respect to pulse information, the pulse information encoder is assembled to apply the following coding algorithm. The pulse information encoder performs the same steps as the pulse information decoder but in the reverse order.
依據一實施例,提供脈衝資訊編碼器演算法: According to an embodiment, a pulse information encoder algorithm is provided:
藉使用此一演算法編碼狀態數目,針對音軌中之一者的各個音軌位置在一音軌位置的各個脈衝,脈衝資訊編碼器將一整數值加至中間數目(例如中間狀態數目),例如演算法完成前的狀態數目,來獲得狀態數目(之數值)。By using this algorithm to encode the number of states, the pulse information encoder adds an integer value to the middle number (eg, the number of intermediate states) for each pulse of each track position of one of the tracks at a track position. For example, the number of states before the algorithm is completed to obtain the number of states (the value).
脈衝資訊之編碼及解碼辦法,例如脈衝位置及脈衝符號可稱作為「逐一步驟編碼」及「逐一步驟解碼」,原因在於音軌位置由編碼及解碼方法被視為前後相續,亦即逐一步驟。The encoding and decoding methods of pulse information, such as pulse position and pulse symbol, can be referred to as "step-by-step coding" and "step-by-step decoding", because the track position is regarded as the same by the encoding and decoding methods, that is, step by step. .
第6圖為流程圖例示說明一實施例,闡釋依據一實施例由脈衝資訊解碼器所進行的處理步驟。Figure 6 is a flow chart illustrating an embodiment illustrating the processing steps performed by the pulse information decoder in accordance with an embodiment.
於步驟610,目前音軌位置k係設定為N。此處,N表示一音軌之音軌位置數目,其中音軌位置係編碼從1至N。At step 610, the current track position k is set to N. Here, N represents the number of track positions of a track in which the track position is encoded from 1 to N.
於步驟620,測試k是否大於或等於1,亦即是否有任何剩餘音軌位置尚未考慮。若k不大於或等於1,則全部音軌位置皆已考慮及結束處理。In step 620, it is tested whether k is greater than or equal to 1, that is, whether any remaining track positions have not been considered. If k is not greater than or equal to 1, all track positions have been considered and the processing is ended.
否則於步驟630,測試狀態是否大於或等於f(p,k-1)。若為是,則位置k至少存在有一個脈衝。若為否,則於音軌位置k不存在有(額外)脈衝,及處理繼續步驟640,於該處k減1,使得將考慮下個音軌位置。Otherwise, in step 630, the test state is greater than or equal to f(p, k-1). If so, there is at least one pulse at position k. If not, there is no (extra) pulse at track position k, and processing continues with step 640, where k is decremented by 1, so that the next track position will be considered.
但若狀態係大於或等於f(p,k-1),處理繼續步驟642,一脈衝置於音軌位置k,及然後於步驟644,狀態係藉將該狀態減f(p,k-1)更新。然後於步驟650,測試目前脈衝是否為在音軌位置k第一個發現的脈衝。若為否,則於步驟680,剩餘脈衝數目減1,及處理繼續步驟630。However, if the state is greater than or equal to f(p, k-1), processing continues with step 642, a pulse is placed at track position k, and then at step 644, the state subtracts the state by f(p, k-1) ) Update. Then at step 650, it is tested whether the current pulse is the first pulse found at track position k. If not, then at step 680, the number of remaining pulses is decremented by one, and processing continues with step 630.
但若目前脈衝是在音軌位置k第一個發現的脈衝,處理繼續步驟660,於該處測試s的最低位元是否經設定。若為是,則在此音軌位置的脈衝符號係設定為負(步驟662),否則在此音軌位置的脈衝符號係設定為正(步驟664)。兩種情 況下,於步驟670然後狀態向右移位一個步驟(s:=s/2)。然後,剩餘脈衝數目也減1(步驟680),及處理繼續步驟630。However, if the current pulse is the first pulse found at the track position k, processing continues to step 660 where the lowest bit of the test s is set. If so, the pulse symbol at the track position is set to negative (step 662), otherwise the pulse symbol at the track position is set to positive (step 664). Two kinds of love In the case, in step 670 the state is then shifted one step to the right (s:=s/2). Then, the number of remaining pulses is also decremented by one (step 680), and processing continues with step 630.
第7圖為流程圖例示說明一實施例,該流程圖闡釋依據一實施例由脈衝資訊編碼器所進行的處理步驟。Figure 7 is a flow chart illustrating an embodiment illustrating the processing steps performed by a pulse information encoder in accordance with an embodiment.
於步驟710,所找到的脈衝p之數目係設定為0,狀態s係設定為0,及所考慮之音軌位置k係設定為1。In step 710, the number of pulses p found is set to zero, the state s is set to zero, and the track position k considered is set to one.
於步驟720,測試k是否小於或等於N,亦即是否仍有音軌位置尚未考慮(此處,N表示:一音軌之音軌位置數目)。若k係不小於或等於N,則全部音軌位置皆已經考慮及結束處理。In step 720, it is tested whether k is less than or equal to N, that is, whether the track position is still not considered (here, N represents: the number of track positions of a track). If k is not less than or equal to N, then all track positions have been considered and the processing is ended.
否則於步驟730,測試至少一個脈衝是否存在於位置k。若否,處理繼續步驟740,於該處k增加1,使得將考慮下個音軌位置。Otherwise, in step 730, it is tested whether at least one pulse is present at position k. If not, processing continues to step 740 where k is incremented by 1, such that the next track position will be considered.
但若至少一個脈衝係存在於音軌位置k,則於步驟750測試目前考慮的脈衝是否為音軌位置k的最末脈衝。若否,則於步驟770,狀態s係藉f(p,k-1)加至狀態s更新,所找到的脈衝p數目加1,及處理繼續步驟780。However, if at least one pulse is present at track position k, then at step 750 it is tested whether the currently considered pulse is the last pulse of track position k. If not, then in step 770, state s is incremented by f(p, k-1) to state s, the number of pulses p found is incremented by one, and processing continues with step 780.
若目前考慮的脈衝是音軌位置k的最末脈衝,則於步驟750之後,處理繼續步驟755,及狀態向左移位一個步驟(s:=s* 2)。然後於步驟760,測試脈衝符號是否為負。若是,則s的最低位元係設定為1(步驟762);否則s的最低位元係設定為0(或不變)(步驟764)。然後於兩種情況下,進行步驟770,於該處狀態s係藉f(p,k-1)加至狀態s更新,所找到的脈衝p數目加1,及處理繼續步驟780。If the currently considered pulse is the last pulse of track position k, then after step 750, processing continues with step 755 and the state is shifted one step to the left (s:=s * 2). Then at step 760, the test pulse symbol is negative. If so, the lowest bit of s is set to 1 (step 762); otherwise the lowest bit of s is set to 0 (or unchanged) (step 764). Then, in either case, step 770 is performed where the state s is incremented by f(p, k-1) to state s, the number of pulses p found is incremented by one, and processing continues with step 780.
於步驟780,測試在位置k是否有另一脈衝。若是,則處理繼續步驟750;否則,處理繼續步驟740。At step 780, a test is made to see if there is another pulse at position k. If so, then processing continues with step 750; otherwise, processing continues with step 740.
後文中,提供編碼多個音軌之狀態的聯合狀態數目之構思。In the following, the concept of encoding the number of joint states of the states of a plurality of tracks is provided.
不幸,於多種情況下,單音軌的可能狀態範圍並非2的倍數,因而各狀態的二進制表示型態無效。舉例言之,若可能的狀態數目為5,則需3位元來以二進制數目表示。但若有4音軌各有5狀態,則整個子訊框有5x5x5x5=625狀態,可以10位元(而非4x3=12位元)表示。如此相當於每個音軌2.5位元而非3位元,如此每音軌節省0.5位元,或相當於每個子訊框節省2位元(占總位元耗用量的20%)。因此重要地係組合各音軌狀態成為一個聯合狀態,原因在於藉此可減少二進制表示型態的無效率。注意相同辦法可用於傳輸的任何數目。舉例言之,因各個子訊框可有一狀態表示脈衝位置,及各訊框可有例如4子訊框,此等狀態可組合成為一個聯合狀態。Unfortunately, in many cases, the possible state range of a single track is not a multiple of 2, and the binary representation of each state is invalid. For example, if the number of possible states is 5, then 3 bits are needed to represent in binary numbers. However, if there are 5 states for each of the 4 tracks, the entire sub-frame has a 5x5x5x5=625 state, which can be represented by 10 bits instead of 4x3=12 bits. This is equivalent to 2.5 bits per track instead of 3 bits, thus saving 0.5 bits per track, or equivalent to saving 2 bits per sub-frame (20% of the total bit consumption). Therefore, it is important to combine the states of the respective tracks into a joint state, because the inefficiency of the binary representation can be reduced by this. Note that the same method can be used for any number of transfers. For example, since each sub-frame can have a state indicating a pulse position, and each frame can have, for example, four sub-frames, the states can be combined into a joint state.
給定子訊框例如有4音軌,則藉聯合編碼各音軌狀態,可減少位元耗用而改良效率。舉例言之,給定各音軌有pk 脈衝,及各音軌具有長度N,例如有N個音軌位置,則各音軌狀態係在0至f (p k ,N )-1之範圍。然後各音軌之狀態sk 可經組合成為子訊框之聯合狀態s,具公式(假設每個子訊框有4音軌)公式2:s =[[s 0 f (p 0 ,N )+s 1 ]f (p 1 ,N )+s 2 ]f (p 2 ,N )+s 3 For example, if there are 4 tracks in the given sub-frame, the state of each track can be jointly coded, which can reduce the bit consumption and improve the efficiency. For example, given that each track has a p k pulse, and each track has a length N, for example, there are N track positions, each track state is in the range of 0 to f ( p k , N )-1 . Then the state s k of each track can be combined into the joint state s of the sub-frames, with the formula (assuming each sub-frame has 4 tracks). Equation 2: s =[[ s 0 f ( p 0 , N )+ s 1 ] f ( p 1 , N )+ s 2 ] f ( p 2 , N )+ s 3
然後各音軌之狀態可於解碼器決定,藉將聯合狀態除以f (p k ,N ),藉此餘數為最末音軌狀態,及整數部分為其餘音軌之聯合狀態。若音軌數目非為4,則方便於上式中適當地加或減項數。The state of each track can then be determined by the decoder by dividing the joint state by f ( p k , N ), whereby the remainder is the last track state, and the integer portion is the joint state of the remaining tracks. If the number of tracks is not 4, it is convenient to add or subtract items as appropriate in the above formula.
注意,當每個音軌之脈衝數目大時,則可能的狀態數目變大。舉例言之,有4音軌,每個音軌有6脈衝,及音軌長度N=16,則狀態為83-位元數目,超過常規中央處理單元(CPU)上的最大二進制數目長度。接著須採取若干額外步驟來使用標準方法以極長整數評估上式。Note that when the number of pulses per track is large, the number of possible states becomes large. For example, there are 4 tracks, each with 6 pulses, and the track length N=16, the state is 83-bit number, which exceeds the maximum binary number length on a conventional central processing unit (CPU). There are then a few additional steps that must be taken to evaluate the above equation with very long integers using standard methods.
當狀態機率假設為相等時,也觀察此一辦法等於音軌狀態之算術編碼。When the state probability is assumed to be equal, this method is also observed to be equal to the arithmetic coding of the track state.
前文已經呈示逐一步驟辦法用以編碼及解碼一音軌之脈衝資訊,例如一音軌之脈衝位置及可能脈衝符號。其它實施例提供另一辦法,稱作為「分裂與征服」辦法。The foregoing has been presented in a step-by-step manner for encoding and decoding pulse information for a track, such as the pulse position of a track and possible pulse symbols. Other embodiments provide another approach, referred to as the "split and conquer" approach.
脈衝資訊編碼器係經組配來施加分裂與征服辦法,將一音軌劃分為兩個音軌區劃x1
及x2
,可考慮為兩個向量,其中x=[x1
x2
]。基本構想係分開編碼二向量x1
及x2
,及然後以下式組合二者
上式中,須注意當脈衝數目為已知時,換言之,當向量分別地具有p1 及p2 =p-p1 脈衝時,s(x1 )及s(x2 )為向量x1 及x2 之狀態。為了將向量x1 中具有0至p1 -1脈衝的全部狀態列入 考慮,須將加總項加至上式。In the above formula, it should be noted that when the number of pulses is known, in other words, when the vectors have p 1 and p 2 = pp 1 pulses, respectively, s(x 1 ) and s(x 2 ) are vectors x 1 and x 2 . State. In order to take into account the full state of the vector x 1 with 0 to p 1 -1 pulses, the summation term must be added to the above equation.
如上演算法/公式可施加來藉施加以下兩個前處理步驟來編碼交插音軌脈衝。首先,設向量xtrack k 包含音軌k上的全部樣本,藉定義x=[xtrack 1, xtrack 2, xtrack 3, xtrack 4 ]而合併此等向量。觀察如此只是樣本的重新排序,使得來自音軌1的全部樣本係置於第一組等等。The above algorithm/formula can be applied to encode the interlaced track pulses by applying the following two pre-processing steps. First, let the vector x track k contain all the samples on the track k, and merge these vectors by defining x=[x track 1, x track 2, x track 3, x track 4 ]. Observing this is just a reordering of the samples so that all samples from Track 1 are placed in the first group and so on.
其次,注意每個音軌之脈衝數目通常為固定數目。接著若音軌1經常性有p1
脈衝,則針對全部值k≠p1
,音軌1上的狀態數目為f
(k
,N 1
)=0。此乃陳述音軌1沒有任何狀態不具p1
脈衝的另一個方式。正式地然後將狀態數目公式定義為:公式4:針對具有pk
脈衝的完整音軌xtrack k
,狀態數目為(N=Ntrack k
)
否則,針對N>1
以及針對N=1:
藉樣本之重新排序及使用如上針對狀態數目之定義(公式4),可藉公式3計算全部音軌之聯合狀態。注意因狀態數目大半含有零,當合併音軌狀態時,公式3之加總為零。 因此合併二音軌係與公式2相同。同理,方便顯示採用兩個辦法,合併全部4音軌(或5)也獲得相同結果。By reordering the samples and using the above definition of the number of states (Equation 4), Equation 3 can be used to calculate the joint state of all the tracks. Note that because the number of states contains more than half, when the track state is merged, the sum of Equation 3 is always zero. Therefore, the merged two-track system is the same as Equation 2. For the same reason, it is convenient to display two methods, and all 4 tracks (or 5) are combined to obtain the same result.
依據一實施例,重新排序可用作為編碼器之前處理步驟。於另一實施例中,重新排序可整合入編碼器。同理依據一實施例,重新排序可用作為解碼器之後處理步驟。於另一實施例中,重新排序可整合入解碼器。According to an embodiment, reordering can be used as a pre-encoder step. In another embodiment, reordering can be integrated into the encoder. By the same token, according to an embodiment, reordering can be used as a post-decoder step. In another embodiment, reordering can be integrated into the decoder.
若一音軌上的脈衝數目為不固定,則方便適度修改狀態數目公式,而仍然使用相同編碼演算法。If the number of pulses on a track is not fixed, it is convenient to modify the state number formula moderately while still using the same coding algorithm.
若適當選擇合併音軌順序,則觀察於章節「組合音軌資料」呈示的辦法及前述方法獲得相等結果。同理,逐一步驟辦法及分裂與征服辦法獲得相等結果。因此,依據哪個辦法體現上更實用或哪個辦法最佳匹配平台的運算限制,獨立地選擇哪個辦法用在解碼器及編碼器。If the merged track order is properly selected, observe the method of presenting the chapter "Combined Track Data" and the above method to obtain equal results. By the same token, the one-step approach and the split and conquer approach yield equal results. Therefore, depending on which method is more practical or which method best matches the computational constraints of the platform, independently choose which method to use in the decoder and encoder.
依據一實施例,提供脈衝資訊編碼器演算法,可以假代碼描述 According to an embodiment, a pulse information encoder algorithm is provided, which can be described by a pseudo code
依據一實施例,採用此種編碼演算法,脈衝資訊編碼器係經組配來將音軌中之一者劃分成一第一音軌區劃及一第二音軌區劃。脈衝資訊編碼器係經組配來編碼與第一區劃相聯結的第一子狀態數目。此外,脈衝資訊編碼器係經組配來編碼與第二區劃相聯結的第二子狀態數目。又,脈衝資訊編碼器係經組配來組合第一子狀態數目及第二子狀態數目而獲得該狀態數目。According to an embodiment, with such a coding algorithm, the pulse information encoder is configured to divide one of the tracks into a first track zone and a second track zone. The pulse information encoder is configured to encode the first number of substates associated with the first zone. In addition, the pulse information encoder is configured to encode the number of second sub-states associated with the second zone. Moreover, the pulse information encoder is configured to combine the first sub-state number and the second sub-state number to obtain the number of states.
同理,依據一實施例,提供脈衝資訊解碼器演算法,可以假代碼描述 Similarly, according to an embodiment, a pulse information decoder algorithm is provided, which can be described by a pseudo code.
於實現該分裂與征服辦法之一實施例中,脈衝資訊解碼器係經組配來基於該狀態數目而產生第一子狀態數目及第二子狀態數目。脈衝資訊解碼器係經組配來基於該第一子狀態數目而解碼該等音軌中之一者之第一區劃的脈衝位置之第一群組。此外,脈衝資訊解碼器係經組配來基於該第二子狀態數目而解碼該等音軌中之一者之第二區劃的脈衝位置之第二群組。In one embodiment of implementing the splitting and conquering method, the pulse information decoder is configured to generate a first number of sub-states and a second sub-state number based on the number of states. The pulse information decoder is configured to decode a first group of pulse positions of a first zone of one of the audio tracks based on the first number of sub-states. Additionally, the pulse information decoder is configured to decode a second group of pulse positions of the second region of one of the audio tracks based on the second number of sub-states.
雖然已經以裝置脈絡描述若干構面,但顯然此等構面也表示相對應方法的描述,於該處一方塊或一裝置係相對應於一方法步驟或一方法步驟之特徵。同理,以方法步驟之脈絡描述的構面也表示相對應裝置之相對應方塊或項或特徵結構之描述。Although a number of facets have been described in the context of the device, it is apparent that such facets also represent a description of the corresponding method, where a block or device corresponds to a method step or a method step. Similarly, a facet described by the context of a method step also represents a description of the corresponding block or item or feature structure of the corresponding device.
取決於某些體現要求,本發明之實施例可於硬體或於軟體體現。體現可使用數位儲存媒體執行,例如軟碟、DVD、CD、ROM、PROM、EPROM、EEPROM或快閃記憶 體,具有可電子讀取控制信號儲存於其上,該等信號與(或可與)可程式規劃電腦系統協作,因而執行個別方法。Embodiments of the invention may be embodied in hardware or in software, depending on certain embodiments. The implementation can be performed using digital storage media, such as floppy disk, DVD, CD, ROM, PROM, EPROM, EEPROM or flash memory The body has an electronically readable control signal stored thereon that cooperates with (or can be) a programmable computer system to perform an individual method.
依據本發明之若干實施例包含具有可電子式讀取控制信號的資料載體,該等控制信號可與可程式規劃電腦系統協作,因而執行此處所述方法中之一者。Several embodiments in accordance with the present invention comprise a data carrier having an electronically readable control signal that can cooperate with a programmable computer system to perform one of the methods described herein.
大致言之,本發明之實施例可體現為具有程式代碼的電腦程式產品,該程式代碼係當電腦程式產品在電腦上跑時可執行該等方法中之一者。該程式代碼例如可儲存在機器可讀取載體上。Broadly speaking, embodiments of the present invention can be embodied as a computer program product having a program code that can perform one of the methods when the computer program product runs on a computer. The program code can be stored, for example, on a machine readable carrier.
其它實施例包含儲存在機器可讀取載體或非過渡儲存媒體上的用以執行此處所述方法中之一者的電腦程式。Other embodiments include a computer program stored on a machine readable carrier or non-transitional storage medium for performing one of the methods described herein.
換言之,因此,本發明方法之實施例為一種具有一程式代碼之電腦程式,該程式代碼係當該電腦程式於一電腦上跑時用以執行此處所述方法中之一者。In other words, therefore, an embodiment of the method of the present invention is a computer program having a program code for performing one of the methods described herein when the computer program runs on a computer.
因此,本發明方法之又一實施例為資料載體(或數位儲存媒體或電腦可讀取媒體)包含用以執行此處所述方法中之一者的電腦程式記錄於其上。Thus, yet another embodiment of the method of the present invention is a data carrier (or digital storage medium or computer readable medium) having a computer program for performing one of the methods described herein recorded thereon.
因此,本發明方法之又一實施例為表示用以執行此處所述方法中之一者的電腦程式的資料串流或信號序列。資料串流或信號序列例如可經組配來透過資料通訊連結,例如透過網際網路轉移。Thus, yet another embodiment of the method of the present invention is a data stream or signal sequence representing a computer program for performing one of the methods described herein. The data stream or signal sequence can, for example, be configured to be linked via a data communication, such as over the Internet.
又一實施例包含處理構件例如電腦或可程式規劃邏輯裝置,其係經組配來或適用於執行此處所述方法中之一者。Yet another embodiment includes a processing component, such as a computer or programmable logic device, that is assembled or adapted to perform one of the methods described herein.
又一實施例包含一電腦,其上安裝有用以執行此處所 述方法中之一者的電腦程式。Yet another embodiment includes a computer on which is installed to perform the execution herein A computer program of one of the methods described.
於若干實施例中,可程式規劃邏輯裝置(例如可現場程式規劃閘陣列)可用來執行此處描述之方法的部分或全部功能。於若干實施例中,可現場程式規劃閘陣列可與微處理器協作來執行此處所述方法中之一者。大致上該等方法較佳係藉任何硬體裝置執行。In some embodiments, programmable logic devices, such as field programmable gate arrays, can be used to perform some or all of the functions of the methods described herein. In some embodiments, the field programmable gate array can cooperate with a microprocessor to perform one of the methods described herein. Generally, such methods are preferably performed by any hardware device.
前述實施例係僅供舉例說明本發明之原理。須瞭解此處所述配置及細節之修改及變化將為熟諳技藝人士顯然易知。因此,意圖僅受審查中之專利申請範圍所限而非受藉以描述及解說此處實施例所呈示之特定細節所限。The foregoing embodiments are merely illustrative of the principles of the invention. It will be apparent to those skilled in the art that modifications and variations of the configuration and details described herein will be readily apparent. Therefore, the intention is to be limited only by the scope of the patent application under review and not by the specific details of the embodiments presented herein.
110‧‧‧脈衝資訊解碼器110‧‧‧pulse information decoder
120‧‧‧信號解碼器120‧‧‧Signal decoder
210‧‧‧信號處理器210‧‧‧Signal Processor
220‧‧‧脈衝資訊編碼器220‧‧‧pulse information encoder
610-680、710-780‧‧‧處理步驟610-680, 710-780‧‧‧ Processing steps
第1圖顯示依據一實施例用以解碼一編碼音訊信號之裝置,第2圖顯示依據一實施例用以編碼一音訊信號之裝置,第3圖顯示針對具有兩個無符號脈衝及三個音軌位置之一音軌的全部可能組態,第4圖顯示針對具有一個有符號脈衝及兩個音軌位置之一音軌的全部可能組態,第5圖顯示針對具有兩個有符號脈衝及兩個音軌位置之一音軌的全部可能組態,第6圖為例示說明一實施例之流程圖,闡釋依據一實施例藉脈衝資訊解碼器所進行的處理步驟,及第7圖為例示說明一實施例之流程圖,該流程圖闡釋依據一實施例藉脈衝資訊編碼器所進行的處理步驟。1 shows an apparatus for decoding an encoded audio signal according to an embodiment, and FIG. 2 shows an apparatus for encoding an audio signal according to an embodiment, and FIG. 3 shows for having two unsigned pulses and three tones. All possible configurations of one of the track positions, Figure 4 shows all possible configurations for a track with one signed pulse and two track positions, Figure 5 shows for two signed pulses and All possible configurations of one of the two track positions, FIG. 6 is a flow chart illustrating an embodiment, illustrating processing steps performed by the pulse information decoder according to an embodiment, and FIG. 7 is an illustration A flowchart of an embodiment is illustrated that illustrates the processing steps performed by a pulse information encoder in accordance with an embodiment.
110‧‧‧脈衝資訊解碼器110‧‧‧pulse information decoder
120‧‧‧信號解碼器120‧‧‧Signal decoder
Claims (15)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201161442632P | 2011-02-14 | 2011-02-14 | |
PCT/EP2012/052461 WO2012110480A1 (en) | 2011-02-14 | 2012-02-14 | Audio codec supporting time-domain and frequency-domain coding modes |
Publications (2)
Publication Number | Publication Date |
---|---|
TW201248617A TW201248617A (en) | 2012-12-01 |
TWI488176B true TWI488176B (en) | 2015-06-11 |
Family
ID=71943598
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW101104350A TWI488176B (en) | 2011-02-14 | 2012-02-10 | Encoding and decoding of pulse positions of tracks of an audio signal |
TW101104676A TWI484480B (en) | 2011-02-14 | 2012-02-14 | Audio codec supporting time-domain and frequency-domain coding modes |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW101104676A TWI484480B (en) | 2011-02-14 | 2012-02-14 | Audio codec supporting time-domain and frequency-domain coding modes |
Country Status (19)
Country | Link |
---|---|
US (1) | US9037457B2 (en) |
EP (1) | EP2676269B1 (en) |
JP (1) | JP5851525B2 (en) |
KR (2) | KR101648133B1 (en) |
CN (1) | CN103548078B (en) |
AR (1) | AR085223A1 (en) |
AU (2) | AU2012217160B2 (en) |
BR (1) | BR112013020589B1 (en) |
CA (1) | CA2827296C (en) |
ES (1) | ES2562189T3 (en) |
HK (1) | HK1192793A1 (en) |
MX (1) | MX2013009302A (en) |
MY (2) | MY159444A (en) |
PL (1) | PL2676269T3 (en) |
RU (1) | RU2547241C1 (en) |
SG (1) | SG192715A1 (en) |
TW (2) | TWI488176B (en) |
WO (1) | WO2012110480A1 (en) |
ZA (1) | ZA201306872B (en) |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI480857B (en) | 2011-02-14 | 2015-04-11 | Fraunhofer Ges Forschung | Audio codec using noise synthesis during inactive phases |
US9589570B2 (en) | 2012-09-18 | 2017-03-07 | Huawei Technologies Co., Ltd. | Audio classification based on perceptual quality for low or medium bit rates |
EP2830052A1 (en) | 2013-07-22 | 2015-01-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio decoder, audio encoder, method for providing at least four audio channel signals on the basis of an encoded representation, method for providing an encoded representation on the basis of at least four audio channel signals and computer program using a bandwidth extension |
EP3069338B1 (en) * | 2013-11-13 | 2018-12-19 | Fraunhofer Gesellschaft zur Förderung der Angewand | Encoder for encoding an audio signal, audio transmission system and method for determining correction values |
EP2980790A1 (en) * | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for comfort noise generation mode selection |
US10699723B2 (en) * | 2017-04-25 | 2020-06-30 | Dts, Inc. | Encoding and decoding of digital audio signals using variable alphabet size |
US10699721B2 (en) * | 2017-04-25 | 2020-06-30 | Dts, Inc. | Encoding and decoding of digital audio signals using difference data |
KR102632136B1 (en) * | 2017-04-28 | 2024-01-31 | 디티에스, 인코포레이티드 | Audio Coder window size and time-frequency conversion |
JP6962445B2 (en) * | 2018-03-02 | 2021-11-05 | 日本電信電話株式会社 | Encoding device, coding method, program, and recording medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI253057B (en) * | 2004-12-27 | 2006-04-11 | Quanta Comp Inc | Search system and method thereof for searching code-vector of speech signal in speech encoder |
TW201027517A (en) * | 2008-09-30 | 2010-07-16 | Dolby Lab Licensing Corp | Transcoding of audio metadata |
WO2010093224A2 (en) * | 2009-02-16 | 2010-08-19 | 한국전자통신연구원 | Encoding/decoding method for audio signals using adaptive sine wave pulse coding and apparatus thereof |
Family Cites Families (124)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ATE477571T1 (en) | 1991-06-11 | 2010-08-15 | Qualcomm Inc | VOCODER WITH VARIABLE BITRATE |
US5408580A (en) * | 1992-09-21 | 1995-04-18 | Aware, Inc. | Audio compression system employing multi-rate signal analysis |
BE1007617A3 (en) | 1993-10-11 | 1995-08-22 | Philips Electronics Nv | Transmission system using different codeerprincipes. |
US5784532A (en) | 1994-02-16 | 1998-07-21 | Qualcomm Incorporated | Application specific integrated circuit (ASIC) for performing rapid speech compression in a mobile telephone system |
KR100419545B1 (en) | 1994-10-06 | 2004-06-04 | 코닌클리케 필립스 일렉트로닉스 엔.브이. | Transmission system using different coding principles |
EP0720316B1 (en) | 1994-12-30 | 1999-12-08 | Daewoo Electronics Co., Ltd | Adaptive digital audio encoding apparatus and a bit allocation method thereof |
SE506379C3 (en) * | 1995-03-22 | 1998-01-19 | Ericsson Telefon Ab L M | Lpc speech encoder with combined excitation |
US5754733A (en) | 1995-08-01 | 1998-05-19 | Qualcomm Incorporated | Method and apparatus for generating and encoding line spectral square roots |
US5848391A (en) * | 1996-07-11 | 1998-12-08 | Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. | Method subband of coding and decoding audio signals using variable length windows |
JP3259759B2 (en) | 1996-07-22 | 2002-02-25 | 日本電気株式会社 | Audio signal transmission method and audio code decoding system |
JPH10124092A (en) | 1996-10-23 | 1998-05-15 | Sony Corp | Method and device for encoding speech and method and device for encoding audible signal |
US5960389A (en) | 1996-11-15 | 1999-09-28 | Nokia Mobile Phones Limited | Methods for generating comfort noise during discontinuous transmission |
JPH10214100A (en) | 1997-01-31 | 1998-08-11 | Sony Corp | Voice synthesizing method |
US6134518A (en) | 1997-03-04 | 2000-10-17 | International Business Machines Corporation | Digital audio signal coding using a CELP coder and a transform coder |
JP3223966B2 (en) | 1997-07-25 | 2001-10-29 | 日本電気株式会社 | Audio encoding / decoding device |
US6070137A (en) * | 1998-01-07 | 2000-05-30 | Ericsson Inc. | Integrated frequency-domain voice coding using an adaptive spectral enhancement filter |
ES2247741T3 (en) * | 1998-01-22 | 2006-03-01 | Deutsche Telekom Ag | SIGNAL CONTROLLED SWITCHING METHOD BETWEEN AUDIO CODING SCHEMES. |
GB9811019D0 (en) | 1998-05-21 | 1998-07-22 | Univ Surrey | Speech coders |
US6317117B1 (en) | 1998-09-23 | 2001-11-13 | Eugene Goff | User interface for the control of an audio spectrum filter processor |
US7272556B1 (en) | 1998-09-23 | 2007-09-18 | Lucent Technologies Inc. | Scalable and embedded codec for speech and audio signals |
US7124079B1 (en) | 1998-11-23 | 2006-10-17 | Telefonaktiebolaget Lm Ericsson (Publ) | Speech coding with comfort noise variability feature for increased fidelity |
JP4024427B2 (en) | 1999-05-24 | 2007-12-19 | 株式会社リコー | Linear prediction coefficient extraction apparatus, linear prediction coefficient extraction method, and computer-readable recording medium recording a program for causing a computer to execute the method |
CN1145928C (en) | 1999-06-07 | 2004-04-14 | 艾利森公司 | Methods and apparatus for generating comfort noise using parametric noise model statistics |
JP4464484B2 (en) | 1999-06-15 | 2010-05-19 | パナソニック株式会社 | Noise signal encoding apparatus and speech signal encoding apparatus |
US6236960B1 (en) | 1999-08-06 | 2001-05-22 | Motorola, Inc. | Factorial packing method and apparatus for information coding |
ATE341074T1 (en) | 2000-02-29 | 2006-10-15 | Qualcomm Inc | MULTIMODAL MIXED RANGE CLOSED LOOP VOICE ENCODER |
US6757654B1 (en) | 2000-05-11 | 2004-06-29 | Telefonaktiebolaget Lm Ericsson | Forward error correction in speech coding |
JP2002118517A (en) | 2000-07-31 | 2002-04-19 | Sony Corp | Apparatus and method for orthogonal transformation, apparatus and method for inverse orthogonal transformation, apparatus and method for transformation encoding as well as apparatus and method for decoding |
US6847929B2 (en) | 2000-10-12 | 2005-01-25 | Texas Instruments Incorporated | Algebraic codebook system and method |
CA2327041A1 (en) | 2000-11-22 | 2002-05-22 | Voiceage Corporation | A method for indexing pulse positions and signs in algebraic codebooks for efficient coding of wideband signals |
US6701772B2 (en) | 2000-12-22 | 2004-03-09 | Honeywell International Inc. | Chemical or biological attack detection and mitigation system |
US20050130321A1 (en) | 2001-04-23 | 2005-06-16 | Nicholson Jeremy K. | Methods for analysis of spectral data and their applications |
US20020184009A1 (en) | 2001-05-31 | 2002-12-05 | Heikkinen Ari P. | Method and apparatus for improved voicing determination in speech signals containing high levels of jitter |
US20030120484A1 (en) | 2001-06-12 | 2003-06-26 | David Wong | Method and system for generating colored comfort noise in the absence of silence insertion description packets |
US6941263B2 (en) | 2001-06-29 | 2005-09-06 | Microsoft Corporation | Frequency domain postfiltering for quality enhancement of coded speech |
US6879955B2 (en) | 2001-06-29 | 2005-04-12 | Microsoft Corporation | Signal modification based on continuous time warping for low bit rate CELP coding |
KR100438175B1 (en) | 2001-10-23 | 2004-07-01 | 엘지전자 주식회사 | Search method for codebook |
CA2388439A1 (en) | 2002-05-31 | 2003-11-30 | Voiceage Corporation | A method and device for efficient frame erasure concealment in linear predictive based speech codecs |
JP3646939B1 (en) | 2002-09-19 | 2005-05-11 | 松下電器産業株式会社 | Audio decoding apparatus and audio decoding method |
US7343283B2 (en) | 2002-10-23 | 2008-03-11 | Motorola, Inc. | Method and apparatus for coding a noise-suppressed audio signal |
US7363218B2 (en) | 2002-10-25 | 2008-04-22 | Dilithium Networks Pty. Ltd. | Method and apparatus for fast CELP parameter mapping |
KR100465316B1 (en) | 2002-11-18 | 2005-01-13 | 한국전자통신연구원 | Speech encoder and speech encoding method thereof |
US7318035B2 (en) | 2003-05-08 | 2008-01-08 | Dolby Laboratories Licensing Corporation | Audio coding systems and methods using spectral component coupling and spectral component regeneration |
US20050091044A1 (en) | 2003-10-23 | 2005-04-28 | Nokia Corporation | Method and system for pitch contour quantization in audio coding |
CN1875402B (en) | 2003-10-30 | 2012-03-21 | 皇家飞利浦电子股份有限公司 | Audio signal encoding or decoding |
CA2457988A1 (en) | 2004-02-18 | 2005-08-18 | Voiceage Corporation | Methods and devices for audio compression based on acelp/tcx coding and multi-rate lattice vector quantization |
FI118835B (en) | 2004-02-23 | 2008-03-31 | Nokia Corp | Select end of a coding model |
WO2005096274A1 (en) | 2004-04-01 | 2005-10-13 | Beijing Media Works Co., Ltd | An enhanced audio encoding/decoding device and method |
GB0408856D0 (en) | 2004-04-21 | 2004-05-26 | Nokia Corp | Signal encoding |
MXPA06012617A (en) | 2004-05-17 | 2006-12-15 | Nokia Corp | Audio encoding with different coding frame lengths. |
US7649988B2 (en) | 2004-06-15 | 2010-01-19 | Acoustic Technologies, Inc. | Comfort noise generator using modified Doblinger noise estimate |
US8160274B2 (en) | 2006-02-07 | 2012-04-17 | Bongiovi Acoustics Llc. | System and method for digital signal processing |
US9047860B2 (en) | 2005-01-31 | 2015-06-02 | Skype | Method for concatenating frames in communication system |
US7519535B2 (en) | 2005-01-31 | 2009-04-14 | Qualcomm Incorporated | Frame erasure concealment in voice communications |
US20070147518A1 (en) | 2005-02-18 | 2007-06-28 | Bruno Bessette | Methods and devices for low-frequency emphasis during audio compression based on ACELP/TCX |
US8155965B2 (en) | 2005-03-11 | 2012-04-10 | Qualcomm Incorporated | Time warping frames inside the vocoder by modifying the residual |
WO2006107838A1 (en) | 2005-04-01 | 2006-10-12 | Qualcomm Incorporated | Systems, methods, and apparatus for highband time warping |
WO2006126843A2 (en) | 2005-05-26 | 2006-11-30 | Lg Electronics Inc. | Method and apparatus for decoding audio signal |
US7707034B2 (en) | 2005-05-31 | 2010-04-27 | Microsoft Corporation | Audio codec post-filter |
PL1897085T3 (en) | 2005-06-18 | 2017-10-31 | Nokia Technologies Oy | System and method for adaptive transmission of comfort noise parameters during discontinuous speech transmission |
KR100851970B1 (en) | 2005-07-15 | 2008-08-12 | 삼성전자주식회사 | Method and apparatus for extracting ISCImportant Spectral Component of audio signal, and method and appartus for encoding/decoding audio signal with low bitrate using it |
US7610197B2 (en) | 2005-08-31 | 2009-10-27 | Motorola, Inc. | Method and apparatus for comfort noise generation in speech communication systems |
US7720677B2 (en) | 2005-11-03 | 2010-05-18 | Coding Technologies Ab | Time warped modified transform coding of audio signals |
US7536299B2 (en) | 2005-12-19 | 2009-05-19 | Dolby Laboratories Licensing Corporation | Correlating and decorrelating transforms for multiple description coding systems |
US8255207B2 (en) | 2005-12-28 | 2012-08-28 | Voiceage Corporation | Method and device for efficient frame erasure concealment in speech codecs |
CN101371297A (en) * | 2006-01-18 | 2009-02-18 | Lg电子株式会社 | Apparatus and method for encoding and decoding signal |
US20090281812A1 (en) | 2006-01-18 | 2009-11-12 | Lg Electronics Inc. | Apparatus and Method for Encoding and Decoding Signal |
US8032369B2 (en) | 2006-01-20 | 2011-10-04 | Qualcomm Incorporated | Arbitrary average data rates for variable rate coders |
FR2897733A1 (en) | 2006-02-20 | 2007-08-24 | France Telecom | Echo discriminating and attenuating method for hierarchical coder-decoder, involves attenuating echoes based on initial processing in discriminated low energy zone, and inhibiting attenuation of echoes in false alarm zone |
US20070253577A1 (en) | 2006-05-01 | 2007-11-01 | Himax Technologies Limited | Equalizer bank with interference reduction |
WO2007138511A1 (en) | 2006-05-30 | 2007-12-06 | Koninklijke Philips Electronics N.V. | Linear predictive coding of an audio signal |
US7873511B2 (en) | 2006-06-30 | 2011-01-18 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder, audio decoder and audio processor having a dynamically variable warping characteristic |
JP4810335B2 (en) | 2006-07-06 | 2011-11-09 | 株式会社東芝 | Wideband audio signal encoding apparatus and wideband audio signal decoding apparatus |
US7933770B2 (en) | 2006-07-14 | 2011-04-26 | Siemens Audiologische Technik Gmbh | Method and device for coding audio data based on vector quantisation |
EP2549440B1 (en) | 2006-07-24 | 2017-01-11 | Sony Corporation | A hair motion compositor system and optimization techniques for use in a hair/fur graphics pipeline |
US7987089B2 (en) | 2006-07-31 | 2011-07-26 | Qualcomm Incorporated | Systems and methods for modifying a zero pad region of a windowed frame of an audio signal |
DE102006049154B4 (en) | 2006-10-18 | 2009-07-09 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Coding of an information signal |
US20080147518A1 (en) | 2006-10-18 | 2008-06-19 | Siemens Aktiengesellschaft | Method and apparatus for pharmacy inventory management and trend detection |
CN102395033B (en) * | 2006-12-12 | 2014-08-27 | 弗劳恩霍夫应用研究促进协会 | Encoder, decoder and methods for encoding and decoding data segments representing a time-domain data stream |
FR2911228A1 (en) | 2007-01-05 | 2008-07-11 | France Telecom | TRANSFORMED CODING USING WINDOW WEATHER WINDOWS. |
KR101379263B1 (en) * | 2007-01-12 | 2014-03-28 | 삼성전자주식회사 | Method and apparatus for decoding bandwidth extension |
FR2911426A1 (en) | 2007-01-15 | 2008-07-18 | France Telecom | MODIFICATION OF A SPEECH SIGNAL |
JP4708446B2 (en) | 2007-03-02 | 2011-06-22 | パナソニック株式会社 | Encoding device, decoding device and methods thereof |
JP2008261904A (en) | 2007-04-10 | 2008-10-30 | Matsushita Electric Ind Co Ltd | Encoding device, decoding device, encoding method and decoding method |
US8630863B2 (en) | 2007-04-24 | 2014-01-14 | Samsung Electronics Co., Ltd. | Method and apparatus for encoding and decoding audio/speech signal |
CN101388210B (en) | 2007-09-15 | 2012-03-07 | 华为技术有限公司 | Coding and decoding method, coder and decoder |
KR101513028B1 (en) | 2007-07-02 | 2015-04-17 | 엘지전자 주식회사 | broadcasting receiver and method of processing broadcast signal |
US8185381B2 (en) | 2007-07-19 | 2012-05-22 | Qualcomm Incorporated | Unified filter bank for performing signal conversions |
CN101110214B (en) | 2007-08-10 | 2011-08-17 | 北京理工大学 | Speech coding method based on multiple description lattice type vector quantization technology |
WO2009029032A2 (en) | 2007-08-27 | 2009-03-05 | Telefonaktiebolaget Lm Ericsson (Publ) | Low-complexity spectral analysis/synthesis using selectable time resolution |
US8566106B2 (en) | 2007-09-11 | 2013-10-22 | Voiceage Corporation | Method and device for fast algebraic codebook search in speech and audio coding |
CN101425292B (en) * | 2007-11-02 | 2013-01-02 | 华为技术有限公司 | Decoding method and device for audio signal |
DE102007055830A1 (en) | 2007-12-17 | 2009-06-18 | Zf Friedrichshafen Ag | Method and device for operating a hybrid drive of a vehicle |
CN101483043A (en) | 2008-01-07 | 2009-07-15 | 中兴通讯股份有限公司 | Code book index encoding method based on classification, permutation and combination |
CN101488344B (en) | 2008-01-16 | 2011-09-21 | 华为技术有限公司 | Quantitative noise leakage control method and apparatus |
US8000487B2 (en) | 2008-03-06 | 2011-08-16 | Starkey Laboratories, Inc. | Frequency translation by high-frequency spectral envelope warping in hearing assistance devices |
EP2107556A1 (en) | 2008-04-04 | 2009-10-07 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio transform coding using pitch correction |
US8879643B2 (en) | 2008-04-15 | 2014-11-04 | Qualcomm Incorporated | Data substitution scheme for oversampled data |
US8768690B2 (en) | 2008-06-20 | 2014-07-01 | Qualcomm Incorporated | Coding scheme selection for low-bit-rate applications |
ES2654433T3 (en) | 2008-07-11 | 2018-02-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio signal encoder, method for encoding an audio signal and computer program |
EP2346030B1 (en) * | 2008-07-11 | 2014-10-01 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder, method for encoding an audio signal and computer program |
CN102105930B (en) | 2008-07-11 | 2012-10-03 | 弗朗霍夫应用科学研究促进协会 | Audio encoder and decoder for encoding frames of sampled audio signals |
ES2683077T3 (en) | 2008-07-11 | 2018-09-24 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder and decoder for encoding and decoding frames of a sampled audio signal |
MY154452A (en) | 2008-07-11 | 2015-06-15 | Fraunhofer Ges Forschung | An apparatus and a method for decoding an encoded audio signal |
ES2401487T3 (en) | 2008-07-11 | 2013-04-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and procedure for encoding / decoding an audio signal using a foreign signal generation switching scheme |
JP5551695B2 (en) | 2008-07-11 | 2014-07-16 | フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ | Speech encoder, speech decoder, speech encoding method, speech decoding method, and computer program |
US8352279B2 (en) | 2008-09-06 | 2013-01-08 | Huawei Technologies Co., Ltd. | Efficient temporal envelope coding approach by prediction between low band signal and high band signal |
WO2010031049A1 (en) | 2008-09-15 | 2010-03-18 | GH Innovation, Inc. | Improving celp post-processing for music signals |
EP3640941A1 (en) * | 2008-10-08 | 2020-04-22 | Fraunhofer Gesellschaft zur Förderung der Angewand | Multi-resolution switched audio encoding/decoding scheme |
KR101315617B1 (en) * | 2008-11-26 | 2013-10-08 | 광운대학교 산학협력단 | Unified speech/audio coder(usac) processing windows sequence based mode switching |
CN101770775B (en) | 2008-12-31 | 2011-06-22 | 华为技术有限公司 | Signal processing method and device |
MX2011007925A (en) | 2009-01-28 | 2011-08-17 | Dten Forschung E V Fraunhofer Ges Zur Foeerderung Der Angewan | Audio coding. |
US8457975B2 (en) | 2009-01-28 | 2013-06-04 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio decoder, audio encoder, methods for decoding and encoding an audio signal and computer program |
EP2214165A3 (en) | 2009-01-30 | 2010-09-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus, method and computer program for manipulating an audio signal comprising a transient event |
PL2234103T3 (en) | 2009-03-26 | 2012-02-29 | Fraunhofer Ges Forschung | Device and method for manipulating an audio signal |
US8725503B2 (en) | 2009-06-23 | 2014-05-13 | Voiceage Corporation | Forward time-domain aliasing cancellation with application in weighted or original signal domain |
CN101958119B (en) * | 2009-07-16 | 2012-02-29 | 中兴通讯股份有限公司 | Audio-frequency drop-frame compensator and compensation method for modified discrete cosine transform domain |
MY162251A (en) | 2009-10-20 | 2017-05-31 | Fraunhofer Ges Forschung | Audio signal encoder,audio signal decoder,method for providing an encoded representation of an audio content,method for providing a decoded representation of an audio content and computer program for use in low delay applications |
MY167980A (en) | 2009-10-20 | 2018-10-09 | Fraunhofer Ges Forschung | Multi- mode audio codec and celp coding adapted therefore |
CN102081927B (en) | 2009-11-27 | 2012-07-18 | 中兴通讯股份有限公司 | Layering audio coding and decoding method and system |
US8423355B2 (en) | 2010-03-05 | 2013-04-16 | Motorola Mobility Llc | Encoder for audio signal including generic audio and speech frames |
US8428936B2 (en) | 2010-03-05 | 2013-04-23 | Motorola Mobility Llc | Decoder for audio signal including generic audio and speech frames |
WO2011147950A1 (en) * | 2010-05-28 | 2011-12-01 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Low-delay unified speech and audio codec |
TWI480857B (en) | 2011-02-14 | 2015-04-11 | Fraunhofer Ges Forschung | Audio codec using noise synthesis during inactive phases |
-
2012
- 2012-02-10 TW TW101104350A patent/TWI488176B/en active
- 2012-02-10 MY MYPI2013002980A patent/MY159444A/en unknown
- 2012-02-14 RU RU2013141935/08A patent/RU2547241C1/en active
- 2012-02-14 KR KR1020137024070A patent/KR101648133B1/en active IP Right Grant
- 2012-02-14 TW TW101104676A patent/TWI484480B/en active
- 2012-02-14 AU AU2012217160A patent/AU2012217160B2/en active Active
- 2012-02-14 SG SG2013060926A patent/SG192715A1/en unknown
- 2012-02-14 MY MYPI2013701415A patent/MY160264A/en unknown
- 2012-02-14 EP EP12706001.0A patent/EP2676269B1/en active Active
- 2012-02-14 PL PL12706001T patent/PL2676269T3/en unknown
- 2012-02-14 AR ARP120100478A patent/AR085223A1/en active IP Right Grant
- 2012-02-14 CA CA2827296A patent/CA2827296C/en active Active
- 2012-02-14 KR KR1020167012861A patent/KR101751354B1/en active IP Right Grant
- 2012-02-14 JP JP2013553902A patent/JP5851525B2/en active Active
- 2012-02-14 BR BR112013020589-0A patent/BR112013020589B1/en active IP Right Grant
- 2012-02-14 CN CN201280018224.4A patent/CN103548078B/en active Active
- 2012-02-14 ES ES12706001.0T patent/ES2562189T3/en active Active
- 2012-02-14 WO PCT/EP2012/052461 patent/WO2012110480A1/en active Application Filing
- 2012-02-14 MX MX2013009302A patent/MX2013009302A/en active IP Right Grant
-
2013
- 2013-08-13 US US13/966,048 patent/US9037457B2/en active Active
- 2013-09-12 ZA ZA2013/06872A patent/ZA201306872B/en unknown
-
2014
- 2014-06-24 HK HK14105996.7A patent/HK1192793A1/en unknown
-
2016
- 2016-01-21 AU AU2016200351A patent/AU2016200351B2/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI253057B (en) * | 2004-12-27 | 2006-04-11 | Quanta Comp Inc | Search system and method thereof for searching code-vector of speech signal in speech encoder |
TW201027517A (en) * | 2008-09-30 | 2010-07-16 | Dolby Lab Licensing Corp | Transcoding of audio metadata |
WO2010093224A2 (en) * | 2009-02-16 | 2010-08-19 | 한국전자통신연구원 | Encoding/decoding method for audio signals using adaptive sine wave pulse coding and apparatus thereof |
Non-Patent Citations (3)
Title |
---|
ASHLEY J P ET AL: "Wideband coding of speech using a scalable pulse codebook", IEEE WORKSHOP ON SPEECH CODING, 2000, SEPTEMBER 17-20, pages 148-150 * |
FISCHER T R ET AL: "ENUMERATION ENCODING AND DECODING ALGORITHMS FOR PYRAMID CUBIC LATTICE AND TRELLIS CODES", IEEE TRANSACTIONS ON INFORMATION THEORY, vol. 41, no. 6, November 1995, pages 2056-2061. * |
Timothy B. Terriberry ET AL: "A Multiply-Free Enumeration of Combinations With Replacement and Sign", , 1 December 2007, http://people.xiph.org/~tterribe/pubs/cwrs.pdf * |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
TWI488176B (en) | Encoding and decoding of pulse positions of tracks of an audio signal | |
CA2827156C (en) | Encoding and decoding of pulse positions of tracks of an audio signal | |
KR20080110757A (en) | Improved coding/decoding of a digital audio signal, in celp technique | |
JP4550176B2 (en) | Speech coding method | |
JPH0844399A (en) | Acoustic signal transformation encoding method and decoding method | |
JPH1097294A (en) | Voice coding device | |
JP6001451B2 (en) | Encoding apparatus and encoding method | |
JP6859379B2 (en) | Equipment and methods for comfortable noise generation mode selection | |
JP4191502B2 (en) | Signal encoding method, apparatus thereof, and signal encoding program | |
EP3285253B1 (en) | Method for coding a speech/sound signal | |
JP3954050B2 (en) | Speech coding apparatus and speech coding method | |
JP2003029798A (en) | Methods, devices, programs and recording media for encoding and decoding acoustic signal | |
ES2821725T3 (en) | Encoding and decoding of pulse positions of tracks of an audio signal | |
JP3576805B2 (en) | Voice encoding method and system, and voice decoding method and system | |
JPH0981191A (en) | Voice coding/decoding device and voice decoding device | |
JP2007179071A (en) | Device and method for speech encoding | |
JP2007206706A (en) | Speech encoding device and method |