CN101159136A - Low bit rate music signal coding method - Google Patents

Low bit rate music signal coding method Download PDF

Info

Publication number
CN101159136A
CN101159136A CNA2007101772860A CN200710177286A CN101159136A CN 101159136 A CN101159136 A CN 101159136A CN A2007101772860 A CNA2007101772860 A CN A2007101772860A CN 200710177286 A CN200710177286 A CN 200710177286A CN 101159136 A CN101159136 A CN 101159136A
Authority
CN
China
Prior art keywords
parameter
value
harmonic
distribution
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2007101772860A
Other languages
Chinese (zh)
Inventor
王晖
颜靖华
李传珍
蔡娟娟
张勤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Communication University of China
Original Assignee
Communication University of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Communication University of China filed Critical Communication University of China
Priority to CNA2007101772860A priority Critical patent/CN101159136A/en
Publication of CN101159136A publication Critical patent/CN101159136A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The invention discloses a method for coding a music signal with low bit rate, which comprises following steps: (1) dividing frame of the music data, adding windows to each frame signal, constructing a harmonic wave model, and selecting each parameter representing the model; (2) acquiring posterior distribution of each parameter according to prior probability distribution and initial value of the parameter in combination with conditional probability distribution of the model and by using the Bayes estimation method; (3) carrying out cyclical iteration according to the posterior distribution and by using reversible jump monte carlo sampling algorithm until the algorithm is convergent, and acquiring the estimation value of each parameter; and (4) transmitting the estimation value of each parameter into a coder to achieve low-bit-rate coding of the audio signal. The method can well fit the music signal, acquire the parameters representing the music signal, and achieve low-bit-rate parameter coding.

Description

A kind of low bit rate music signal coding method
Technical field
The present invention relates to a kind of low bit rate music signal coding method, relate in particular to the low bit rate music signal coding that adopts Bayesian Estimation.
Background technology
The world is just striding forward the information age of digitizing, networking, global integration.Infotech just progressively is penetrated into the every aspect of human society, and people have higher requirement to providing with transmission quality of information.Because the restriction of medium capacity and transmission bandwidth must use various compress techniques to transmit data.Sound is a kind of important transmission information medium, and the audio coding technology of digital audio field mainly contains at present:
1) lossless compress
The development of audio compression techniques is at first from lossless compress, and typical compression method is PCM (pulse code modulation (PCM)).Though PCM can reduce tonequality preferably, the code efficiency of its instantaneous companding technology and piece companding (Block-Compressing) technology is lower, and code check is too high.Along with the development of Digital Signal Processing, the perceptual coding algorithm a series of Frequency Domain Coding algorithms appearred subsequently, i.e..
2) Audio Perceptual Coding:
The target of Audio Perceptual Coding is the perceptual distortion that reduces as much as possible between audio frequency original signal and the encoding and decoding signal, reduces bit rate as much as possible simultaneously and encodes.For music signal, the coefficient that concentrates on mostly the time-frequency representation signal at present carries out transition coding.AAC (the Advanced Audio Coder) coding of typical transition coding such as MPEG-4 can provide gem-pure quality to about 64kbit/s tone signal, but may produce some culture noises under low bit rate.
3) parameter coding:
With respect to perceptual coding, the thinking of parametric audio coding is that sound signal is decomposed according to more adaptive mode.Different compositions represents that with different information source models each model all defines the parameter set of oneself to describe different attributes.Decompose finish after, parameter need be quantized respectively, encodes and be sent to decoding end.For example the SSC of MPEG-4 (Sinusoidal Coding) parameter coding just is based on this mode, and coding quality is better than AAC when 24kbit/s, but it is not suitable for lower code check.
In many instances, under the situation that for example network bandwidth is lower or the capacity of storage medium is very low, the code check of above-mentioned coding method is too high.This just needs one can guarantee better tonequality, and the very low technology of code check.
Summary of the invention
The technical problem to be solved in the present invention provides a kind of music encoding method of low bit rate, and this method is the match music signal preferably, obtains to characterize the parameter of music signal, then parameter is carried out Low Bit-rate Coding.
In order to address the above problem, the invention provides a kind of low bit rate music signal coding method, may further comprise the steps:
1) divides frame with music signal, and, set up harmonic-model, selected each parameter that characterizes this model every frame signal windowing;
2) according to the prior probability distribution and the initial value of described parameter, and in conjunction with the conditional probability distribution of this model, use the Bayesian Estimation method, the posteriority that obtains described each parameter distributes;
3) distribute according to the posteriority that obtains, adopt reversible jump Monte Carlo sampling algorithm, loop iteration reaches convergence until algorithm, obtains the estimated value of each parameter;
Each estimates of parameters that 4) will obtain is sent into scrambler, finishes the Low Bit-rate Coding to sound signal;
Further, method of the present invention, wherein, in the step (1), the described harmonic-model of setting up comprises: with each frame music signal, be decomposed into by different each time multifrequency sinusoid of amplitude and the stack of white Gaussian noise;
Further, method of the present invention, wherein, in the step (1), the parameter of described sign harmonic-model comprises: the amplitude of harmonic number, fundamental frequency, each harmonic and noise variance;
Further, method of the present invention, wherein, in the step (2), the prior probability distribution of described parameter is set to respectively: noise variance is obeyed the distribution of falling the gamma, and harmonic number is obeyed Poisson distribution, and fundamental frequency is obeyed distribution of mean value, whose amplitude obeys Gaussian distribution;
Further, method of the present invention wherein, in the described step (2), comprising: the posterior probability of using resulting each parameter of Bayesian Estimation method distributes, and is proportional to the product of the likelihood function of its prior probability distribution and its conditional probability distribution of description;
Further, method of the present invention, wherein, in the step (3), described reversible jump Monte Carlo sampling algorithm comprises:
I) determine the initial value of each parameter, and select the mode of motion type with certain probability;
Ii) produce recommended value by prior distribution, and begin by 1 to L cycle calculations allowance rate k, greater than 1, each then new parameter prediction value is exactly its recommended value as if allowance rate k; If allowance rate k less than 1, then composes recommended value to new predicted value with the probability of allowance rate k, otherwise new predicted value is a last iteration value constantly;
Iii) loop iteration reaches convergence until algorithm, returns L predicted value;
Further, method of the present invention, wherein, in the step (i), described mode of motion type comprises: harmonic number increase, minimizing, constant three types;
Further, method of the present invention, wherein, step (ii) in, if allowance rate k is less than 1, then, comprising with the recommended value of each parameter of probability calculation of allowance rate k:
For the type that harmonic number increases,, calculate allowance rate g according to reversible jump Monte Carlo sampling algorithm Birth, judge the value of the predicted value of harmonic number and fundamental frequency, and produce the recommended value of all the other parameters;
For the type that harmonic number reduces,, calculate allowance rate g according to reversible jump Monte Carlo sampling algorithm Death, judge the value of the predicted value of harmonic number and fundamental frequency, and produce the recommended value of all the other parameters;
For the type that harmonic number reduces,, calculate allowance rate g according to reversible jump Monte Carlo sampling algorithm Update, judge the value of the predicted value of fundamental frequency, and produce the recommended value of all the other parameters;
Method of the present invention, wherein, in the described step (3), further comprise: distribute according to the posteriority that obtains, adopt reversible jump Monte Carlo sampling algorithm, loop iteration is after algorithm reaches convergence, each parameter all obtains a predicted value ordered series of numbers that contains a L element, select estimation criterion then, respectively the predicted value ordered series of numbers of each parameter is handled, promptly obtained an estimated value of each parameter;
Further, method of the present invention, wherein, described estimation criterion comprises: maximum a posteriori probability is estimated, Posterior Mean is estimated and least mean-square error is estimated;
Further, method of the present invention, wherein, in the step (3), described scrambler adopts following method to encode, and comprising: Differential video coding method, huffman coding method.
Compared with prior art, the method for the invention, match music signal preferably obtains to characterize the parameter of music signal, then parameter is carried out Low Bit-rate Coding.
Description of drawings
Fig. 1 is the low bit rate music signal coding method process flow diagram that adopts the Bayesian Estimation method in the embodiment of the invention;
Fig. 2 is the low bit rate music signal coding system construction drawing that adopts the Bayesian Estimation method in the embodiment of the invention;
Fig. 3 is a reversible jump Monte Carlo sampling algorithm flow chart in the embodiment of the invention;
Fig. 4 is an operation chart level and smooth to the multiframe signal windowing in the embodiment of the invention.
Embodiment
The present invention is in order to solve the drawback that conventional solution exists, further set forth a kind of low bit rate music signal coding method of the present invention by following specific embodiment, below embodiment is described in detail, but not as a limitation of the invention.
Key of the present invention is on the music signal model based, obtains to characterize the parameter of music signal.Music signal has tangible design feature at frequency domain: on frequency domain, music signal is made of two-layer model: at first, each note is that syllable and each harmonic (partials) are formed by fundamental frequency (fundamental frequency), the time luffing degree of harmonic wave has reflected the tone color of music signal, and their frequency is approximately the integer multiple of fundamental frequency; Then, each note is played simultaneously and has been formed chord, i.e. multi-tone signal (polyphony).At these characteristics, the present invention has adopted a kind of harmonic-model, and it regards music signal the stack of a series of multifrequency sinusoids and white Gaussian noise as, and then, the present invention adopts the method for Bayesian Estimation to obtain these parameters.The Bayesian Estimation method provides a class rigorous solution framework for the estimation problem of dynamic system, it utilizes known information to set up the probability density function of system, it is prior distribution, distribute according to condition again, the posteriority that then can obtain known variables or parameter distributes, and distributes the true distribution of approximating parameter with posteriority.The present invention is on the harmonic-model basis, according to the Bayesian Estimation method, distribute the true distribution of each parameter of approximate model with posteriority, adopt reversible jump Monte Carlo sampling (RJMCMC then, Reverse-Jump Markov Chain Monte Carlo) algorithm, loop iteration reaches convergence up to algorithm, has obtained estimates of parameters.At last, estimates of parameters is encoded, promptly realized the low rate encoding of music signal.
As shown in Figure 1, be the low bit rate music signal coding method process flow diagram that adopts the Bayesian Estimation method in the embodiment of the invention, may further comprise the steps:
Step 100 is divided into short time frame with music signal;
Step 101 is represented every frame signal with harmonic-model, promptly signal is regarded as by different each time multifrequency sinusoid of amplitude and the stack of white Gaussian noise; Be that the available harmonic number, harmonic amplitude, fundamental frequency, the isoparametric mathematic(al) representation of noise variance of containing characterizes any music signal thus;
Step 102, suppose that the prior distribution of each parameter is as follows:
Noise variance is obeyed the distribution of falling the gamma; Harmonic number is obeyed Poisson distribution; Fundamental frequency is obeyed distribution of mean value; The whose amplitude obeys Gaussian distribution;
Step 103 according to the prior distribution of each parameter, in conjunction with the conditional probability of this model, is used the Bayesian Estimation method, and the posteriority that obtains each parameter distributes;
Step 104 in conjunction with the posteriority distribution and expression formula of each parameter, adopts reversible jump Monte Carlo sampling (RJMCMC) algorithm, loop iteration, up to convergence, and handle the predicted value ordered series of numbers of each parameter that obtains by sampling, obtain the estimated value of each parameter according to estimation criterion;
Step 105 is sent the estimated value of each parameter into scrambler, carries out differential coding and Huffman encoding, the low Bit Rate Audio Coding of final realization.
As shown in Figure 2, be the low bit rate music signal coding system construction drawing that adopts the Bayesian Estimation method in the embodiment of the invention, as can be seen from the figure this is an analysis-synthesis system.In transmitting terminal 1, at first obtain to characterize the parameter of music signal harmonic-model by analysis module 10, encode by 11 pairs of described parameters of scrambler again; Code stream transmits the back and arrives receiving end 2 in channel, decoded by the demoder in the receiving end 2 20, obtains decoded parameter, brings decoded parameter into harmonic-model by synthesis module 21 again and synthesizes, and finishes the reduction to music signal thus.To introduce method of the present invention according to the operating process of various piece correspondence in the system construction drawing below.
(I) at first, in low bit rate music signal coding method, want earlier the music signal that transmits at first to be analyzed.
In analysis process, be divided into following process: divide frame, set up harmonic-model, make up model probability framework and estimation model parameter.
1) divide frame:
Be actually a time varying signal through digitized voice signal, suppose that generally it can regard stably in the short time that at 10ms~30ms common analysis all is the signal analysis at the short time as.In order to obtain voice signal in short-term, voice signal is divided into short time frame, next analyze at the music signal data of each short time frame exactly.
2) set up harmonic-model:
Generally speaking, be periodic by the music signal of instrument playing, can be regarded as by a series of sine-wave superimposed and form.Each note is made of M harmonic wave, and one of them is a fundamental frequency, and remaining M-1 is harmonic wave, is approximately the integer multiple of fundamental frequency.Consider the music signal of one section short time frame, and in this segment signal, do not have the sudden change of note, be expressed as y[t], t=1,2 ..., N, fundamental frequency are ω, sampling rate is ω s, when being, the amplitude of each harmonic becomes, be respectively a m[t], b m[t], noise are υ [t].Model can be expressed as follows:
y [ t ] = Σ m = 1 M { a m [ t ] cos ( 2 πmω ω s t ) + b m [ t ] sin ( 2 πmω ω s t ) } + υ [ t ] - - - ( 1 )
a m[t], b m[t] can regard as by a series of basis function φ iConstitute, i=0 ..., I is expressed as:
a m [ t ] = Σ i = 0 I a m , i φ i [ t ] , b m [ t ] = Σ i = 0 I b m , i φ i [ t . Wherein φ [t] is basis function (basis function), can be any type of non-oscillatory function, as Hanning window.Select Hanning window for use in this present invention, rolloff-factor is 0.5.The initialization function is φ [τ], τ=-N+1 ...., N-1, so, other basis function is respectively:
φ i[t]=φ[t-iΔt]i=0,...,I,t=0,...,N-1 (2)
Interval of delta t=(N-1)/I wherein.So simplified model can be expressed as follows:
y [ t ] = Σ m = 1 M Σ i = 0 I φ [ t - iΔt ] { a m , i cos ( 2 πmω ω s t ) + b m , i sin ( 2 πmω ω s t ) } + υ [ t ] - - - ( 3 )
If magnitude matrix is β, length is 2M (I+1), m=1 ..., M, i=0 ..., I, concrete element is:
β(2(Mi+m)-1)=a m,i (4)
β(2(Mi+m))=b m,i (5)
Nuclear matrix is D, is N * 2M (I+1) dimension matrix, m=1 ..., M, i=0 ..., I, t=1 ..., the concrete element of N is:
D(t,2(Ni+m)-1)=φ(t-iΔt)cos(mωt) (6)
D(t,2(Mi+m))=φ(t-iΔt)sin(mωt) (7)
So this model can be reduced to:
y=Dβ+v (8)
Be noise vector, be white noise, variance is σ υ 2
3) make up probabilistic framework:
The likelihood function of this model is:
p ( y | β , ω , M , σ υ 2 ) = 1 ( 2 π σ υ 2 ) N / 2 exp [ - 1 2 σ υ 2 | | y - Dβ | | 2 ] - - - ( 9 )
The prior probability of model can be decomposed into:
p ( β , σ υ 2 , ω , M ) = p ( β | σ υ 2 , ω , M ) p ( ω | M ) p ( M ) p ( σ υ 2 ) - - - ( 10 )
The prior probability of β is the zero-mean Gaussian distribution, and covariance matrix is σ υ 2/ ξ 2Σ I, I are unit matrix, ξ 2Be super parameter (hyper parameter) the expression signal to noise ratio (S/N ratio).The probability distribution of β represents that formula is as follows:
p ( β | σ υ 2 , ω , M ) = N ( β ; 0 2 M ( I + 1 ) , σ υ 2 ξ 2 ΣI ) - - - ( 11 )
ξ 2Obey the distribution of falling the gamma (inverted gamma), be shown below:
p ( ξ 2 ) = IG ( α ξ , β ξ ) ∝ e - β ξ / ξ 2 ξ 2 ( α ξ + 1 ) - - - ( 12 )
The prior probability of ω is obeyed distribution of mean value, is expressed as follows:
p(ω|M)=u(ω m;[0,ω s/2M]) (13)
Noise variance σ υ 2Obey the distribution of falling the gamma, be expressed as follows:
p ( σ υ 2 ) = IG ( σ υ 2 ; v 0 , γ 0 ) - - - ( 14 )
The prior probability of M adopts the Poisson distribution model, is expressed as follows:
p ( M = m | Λ ) = e - Λ Λ m m ! - - - ( 15 )
Λ is the expectation value of harmonic number, obeys gamma (gamma) and distributes, and expression formula is as follows:
p(Λ)=G(1/2+α Λ,β Λ) (16)
In fact M is limited in certain scope, and minimum value is M Min, maximal value is M Max, promptly distributed model is the brachymemma Poisson distribution in fact.So
p ( M = m ) = ( Λ m / m ! ) / ( Σ m ′ = M min M max Λ m ′ / m ′ ! ) - - - ( 17 )
The Bayesian Estimation method shows: posterior probability distributes and to be proportional to the product of prior probability distribution and likelihood function, and in conjunction with prior probability distribution of having given and likelihood function, posterior probability that can this model is expressed as follows:
p ( β , ω , M , σ υ 2 | y ) ∝ p ( y | β , ω , M , σ υ 2 ) p ( β , ω , σ υ 2 , M ) - - - ( 18 )
Provide the expression of the posterior probability of each parameter below:
p ( ω , M | y ) ∝ ( γ 0 + y t Py ) - ( N + v 0 ) 2 det ( S ) 1 2 p ( ω | M ) Λ M M ! - - - ( 19 )
P=I-DSD wherein t, S = [ D t D + 1 ξ 2 I ] - 1 , Noise variance σ υ 2Posterior probability be:
p ( σ υ 2 | ω , M , y ) = IG ( N + v 0 2 , γ 0 + y t Py 2 ) ∝ exp ( - γ 0 + y t Py 2 σ υ 2 ) σ υ - N - v 0 - 2 - - - ( 20 )
The posterior probability Gaussian distributed of β, expression is:
p ( β | σ υ 2 , ω , M , y ) = N ( μ , σ υ 2 S ) - - - ( 21 )
μ=SD wherein tY.
ξ 2Posterior probability be:
p ( ξ 2 | β , ω , M , σ υ 2 ) ∝ IG ( α ξ + M ( I + 1 ) , β ξ + β ′ D ′ Dβ 2 σ υ 2 ) - - - ( 22 )
The posterior probability of Λ is:
p(Λ|M)∝(M+1/2+α Λ,1+β Λ) (23)
4) estimation model parameter:
In Bayes's filtering problem, distribute for general multivariate, can not obtain integral result, therefore must be similar to integration.Monte Carlo method (MCMC) is a kind of statistical test method, and its basic thought is at first to set up a probability model or stochastic process; Statistical nature by the observation or the sampling test of model or process are calculated the parameter of asking then, and with arithmetic mean as the approximate value of being found the solution.Generally, the dimension of variable to be estimated changes, and harmonic number to be estimated among the present invention for example is so the dimension of D and β is all changing.RJMCMC allows to make up the Markov chain of a redirect dimension, and it has defined a series of mode of motion, as renewal, generation, extinction, merging, separation etc., any one mode of motion of picked at random in the process of iteration.
Consider a transition kernel function q (z, z '), be also referred to as the suggestion probability function, it is the function that produces suggestion sample value z ' according to currency z.Destination probability is distributed as π (z), is the posteriority for the treatment of estimated parameter and distributes the allowance rate k ( z , z ′ ) = min ( 1 , π ( z ′ ) q ( z ′ , z ) π ( z ) q ( z , z ′ ) ) , To determine the value of predicted value according to the size of allowance rate.
The RJMCMC arthmetic statement is as follows:
A) determine initial value (be generally and estimate last sample value in the ordered series of numbers);
B) from possible mode of motion, select the mode of motion type with certain probability;
C) produce recommended value by suggestion probability distribution (probability density function), begin to carry out circulation (by l to L):
Calculate allowance rate k, greater than 1, then new predicted value is recommended value as if the allowance rate, if allowance rate k less than 1, then composes the recommended value that calculates to new predicted value with the probability of allowance rate k, otherwise new predicted value is a last iteration value constantly;
D) L loop iteration finishes, and returns L predicted value.
As shown in Figure 3, be RJMCMC sampling algorithm process flow diagram in reversible jump Monte Carlo in the embodiment of the invention, may further comprise the steps:
Step 300, the initialization model parameter comprises: number of samples N, iterations L, window function number I etc.;
Step 301 begins to carry out reversible jump Monte Carlo RJMCMC sampling iterative algorithm, selects the mode of motion type, i.e. harmonic number M increase, minimizing, constant three types.
Step 302, according to prior distribution, the parameter of each probability distribution function of initialization, nuclear matrix and amplitude vector;
Step 303, the relation of detection loop control variable l and iterations L, if smaller or equal to, then execution in step 304; If greater than, then execution in step 306;
Step 304, in iterative process, difference corresponding harmonic number increase, minimizing and constant type of sports: carry out harmonic number and increase algorithm, harmonic number minimizing algorithm and harmonic parameters update algorithm; RJMCMC has also all been used in these three kinds of calculations, can carry out detailed explanation below;
Wherein, the probability of the type of sports that harmonic number increases is a, and the probability of the type of sports that harmonic number reduces is b, and the probability of the type of sports that harmonic number is constant is 1-a-b, and probability a and b computing formula are as follows:
a = c min { 1 , p ( M + 1 | Λ ) p ( M | Λ ) }
b = c min { 1 , p ( M | Λ ) p ( M + 1 | Λ ) }
C=0.15 wherein, p (M| Λ) is the prior probability of M;
Step 305 is through the iteration renewal respectively of above-mentioned three kinds of situations, the predicted value of undated parameter, and record; And return execution in step 303;
Step 306, each parameter tends towards stability, and each parameter all obtains a predicted value ordered series of numbers that contains L element, selects estimation criterion, respectively the predicted value ordered series of numbers of each parameter is handled according to certain estimation criterion, promptly obtained an estimated value of each parameter;
Estimation criterion comprises: maximum a posteriori probability estimation, Posterior Mean estimation and least mean-square error estimation etc.
Need indicate, the algorithm that the harmonic number in the step 304 increases, reduces, upgrades has all used RJMCMC, subscript
Figure S2007101772860D00103
With *Represent the recommended value of Monte Carlo particle and generation respectively.Elaborate their step below:
Increase harmonic number algorithm (Partials birth)
● recommended value M * = M ~ ( l - 1 ) + 1 .
● make P *For containing M *Subharmonic P, follow above-mentioned computing formula about allowance rate a, calculate following formula:
g birth = [ γ 0 + y T Py γ 0 + y T P * y ] N + v 0 2 M * M ~ ( l - 1 ) 1 ( 1 + ξ 2 ) ( I + 1 )
● with probability min (1, g Birth), receive recommended value: order M ~ ( l ) = M * , ω ~ ( l ) = ω ~ ( l - 1 ) .
● the posteriority according to all the other parameters distributes, and produces the recommended value of all the other parameters.
Reduce harmonic number algorithm (Partials death)
● recommended value M * = M ~ ( l - 1 ) - 1 .
● remove the
Figure S2007101772860D00115
Subharmonic.
● make P *For removing
Figure S2007101772860D00116
The P of subharmonic follows above-mentioned computing formula about allowance rate a, calculates following formula:
g death = [ γ 0 + y T Py γ 0 + y T P * y ] N + v 0 2 M * M ~ ( l - 1 ) ( 1 + ξ 2 ) ( I + 1 )
● with probability min (1, g Death), receive recommended value: order M ~ ( l ) = M * , ω ~ ( l ) = ω ~ ( l - 1 ) .
● the posteriority according to all the other parameters distributes, and produces the recommended value of all the other parameters.
Harmonic parameters is upgraded (algorithm for partials parameter updates)
● sampling u, obey distribution of mean value, i.e. u=U [0,1]
● as u<λ, adopt suggestion distribution q Init(ω | y), and with probability min (1, g Update) the reception recommended value.
● otherwise adopt suggestion to distribute
Figure S2007101772860D001110
And with probability min (1, g Update) the reception recommended value.
● the posteriority according to all the other parameters distributes, and produces the recommended value of all the other parameters.
In above-mentioned harmonic parameters update algorithm, two kinds of suggestion distribution functions are arranged.
When u<λ, the suggestion distribution function is q Init(ω | y), it is the distribution that the Fourier transform according to signal y produces, and is expressed as:
q init ( ω | y ) ∝ Σ l = 0 N - 1 p l II ( lπ / N , [ l + 1 ) π / N ]
p lBe the Fourier transform amplitude of signal at l π/N place.
When u>λ, the suggestion distribution function is Be expressed as:
q RW ( ω * | ω ~ ( l - 1 ) ) = N ( ω ( l - 1 ) ; σ RW 2 )
For with last sample value
Figure S2007101772860D00123
Be average, σ RW 2Be the Gaussian distribution of variance, i.e. recommended value ω *Be
Figure S2007101772860D00124
Contiguous place samples.
Allowance rate corresponding to top two formulas is g update = [ γ 0 + y T Py γ 0 + y T P * y ] N + v 0 2 , P wherein *For using ω *Matrix P after the renewal.
(II) according to the estimated value of analyzing each parameter of the harmonic-model that obtains in the previous step, encode.
Analysis part has obtained to describe the parameter of music signal, i.e. harmonic wave number M, harmonic amplitude β, fundamental frequency ω, noise variance σ υ 2The design of scrambler is such: to the harmonic wave number M, and fundamental frequency ω, noise variance σ υ 2Quantize huffman coding.Wherein harmonic amplitude β first ties up the amplitude energy maximum of corresponding fundamental tone, and the amplitude of all the other harmonic waves is used difference value, also carries out Huffman encoding then.
So far, low bit rate music signal coding flow process of the present invention is finished.Reduction is through the music signal of the method for the invention coding below.
(III) obtain each parameter of harmonic-model by decoding, and bring decoded parameter into harmonic-model and synthesize.
The work of composite part is simple relatively, as shown in Figure 4, is operation chart level and smooth to the multiframe signal windowing in the embodiment of the invention.
Parameter substitution harmonic-model with decoding obtains restores music signal, then again each frame is carried out the windowing smooth operation and gets final product.For the processing of multiframe signal, be not directly a plurality of single frames music signals to be stitched together among the present invention, because every like this frame signal adjacent can produce sudden change.The present invention selects Hamming window for use, has effectively realized level and smooth.
Certainly; the present invention also can have other various embodiments; under the situation that does not deviate from spirit of the present invention and essence thereof; those of ordinary skill in the art can make various corresponding changes and distortion according to the present invention, but these corresponding changes and distortion all should belong to the protection domain of the appended claim of the present invention.

Claims (11)

1. a low bit rate music signal coding method is characterized in that, may further comprise the steps:
1) divides frame with music signal, and, set up harmonic-model, selected each parameter that characterizes this model every frame signal windowing;
2) according to the prior probability distribution and the initial value of described parameter, and in conjunction with the conditional probability distribution of this model, use the Bayesian Estimation method, the posteriority that obtains described each parameter distributes;
3) distribute according to the posteriority that obtains, adopt reversible jump Monte Carlo sampling algorithm, loop iteration reaches convergence until algorithm, obtains the estimated value of each parameter;
Each estimates of parameters that 4) will obtain is sent into scrambler, finishes the Low Bit-rate Coding to sound signal.
2. the method for claim 1 is characterized in that, in the step (1), the described harmonic-model of setting up comprises:
With each frame music signal, be decomposed into by different each time multifrequency sinusoid of amplitude and the stack of white Gaussian noise.
3. the method for claim 1 is characterized in that, in the step (1), the parameter of described sign harmonic-model comprises:
The amplitude of harmonic number, fundamental frequency, each harmonic and noise variance.
4. method as claimed in claim 3 is characterized in that, in the step (2), the prior probability distribution of described parameter is set to respectively:
Noise variance is obeyed the distribution of falling the gamma, and harmonic number is obeyed Poisson distribution, and fundamental frequency is obeyed distribution of mean value, whose amplitude obeys Gaussian distribution.
5. the method for claim 1 is characterized in that, in the described step (2), comprising:
The posterior probability of using resulting each parameter of Bayesian Estimation method distributes, and is proportional to the product of the likelihood function of its prior probability distribution and its conditional probability distribution of description.
6. the method for claim 1 is characterized in that, in the step (3), described reversible jump Monte Carlo sampling algorithm comprises:
I) determine the initial value of each parameter, and select the mode of motion type with certain probability;
Ii) produce recommended value by prior distribution, and begin by 1 to L cycle calculations allowance rate k, greater than 1, each then new parameter prediction value is its recommended value as if allowance rate k; If allowance rate k is less than 1,, and composes and give new predicted value then with the recommended value of each parameter of probability calculation of allowance rate a;
Iii) loop iteration reaches convergence until algorithm, returns L predicted value.
7. method as claimed in claim 6 is characterized in that, in the step (i), described mode of motion type comprises: harmonic number increase, minimizing, constant three types.
8. method as claimed in claim 7 is characterized in that, step (ii) in, if allowance rate k is less than 1, then recommended value is composed to new predicted value with the probability of allowance rate a, comprising:
For the type that harmonic number increases,, calculate allowance rate g according to reversible jump Monte Carlo sampling algorithm Birth, judge the value of the predicted value of harmonic number and fundamental frequency, and produce the recommended value of all the other parameters;
For the type that harmonic number reduces,, calculate allowance rate g according to reversible jump Monte Carlo sampling algorithm Death, judge the value of the predicted value of harmonic number and fundamental frequency, and produce the recommended value of all the other parameters;
For the type that harmonic number reduces,, calculate allowance rate g according to reversible jump Monte Carlo sampling algorithm Update, judge the value of the predicted value of fundamental frequency, and produce the recommended value of all the other parameters.
9. method as claimed in claim 6 is characterized in that, in the described step (3), further comprises:
Distribute according to the posteriority that obtains, adopt reversible jump Monte Carlo sampling algorithm, loop iteration is after algorithm reaches convergence, each parameter all obtains a predicted value ordered series of numbers that contains a L element, select estimation criterion then, respectively the predicted value ordered series of numbers of each parameter is handled, promptly obtained an estimated value of each parameter.
10. method as claimed in claim 9 is characterized in that, described estimation criterion comprises: maximum a posteriori probability is estimated, Posterior Mean is estimated and least mean-square error is estimated.
11. the method for claim 1 is characterized in that, in the step (3), described scrambler adopts following method to encode, and comprising: Differential video coding method, huffman coding method.
CNA2007101772860A 2007-11-13 2007-11-13 Low bit rate music signal coding method Pending CN101159136A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2007101772860A CN101159136A (en) 2007-11-13 2007-11-13 Low bit rate music signal coding method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2007101772860A CN101159136A (en) 2007-11-13 2007-11-13 Low bit rate music signal coding method

Publications (1)

Publication Number Publication Date
CN101159136A true CN101159136A (en) 2008-04-09

Family

ID=39307199

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2007101772860A Pending CN101159136A (en) 2007-11-13 2007-11-13 Low bit rate music signal coding method

Country Status (1)

Country Link
CN (1) CN101159136A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101582262B (en) * 2009-06-16 2011-12-28 武汉大学 Space audio parameter interframe prediction coding and decoding method
CN109524023A (en) * 2016-01-22 2019-03-26 大连民族大学 A kind of method of pair of fundamental frequency estimation experimental verification
CN109586728A (en) * 2018-12-11 2019-04-05 哈尔滨工业大学 Signal blind reconstructing method under modulation wide-band transducer frame based on sparse Bayesian
CN110602494A (en) * 2019-08-01 2019-12-20 杭州皮克皮克科技有限公司 Image coding and decoding system and method based on deep learning
CN112970063A (en) * 2018-10-29 2021-06-15 杜比国际公司 Method and apparatus for rate quality scalable coding with generative models

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101582262B (en) * 2009-06-16 2011-12-28 武汉大学 Space audio parameter interframe prediction coding and decoding method
CN109524023A (en) * 2016-01-22 2019-03-26 大连民族大学 A kind of method of pair of fundamental frequency estimation experimental verification
CN112970063A (en) * 2018-10-29 2021-06-15 杜比国际公司 Method and apparatus for rate quality scalable coding with generative models
CN109586728A (en) * 2018-12-11 2019-04-05 哈尔滨工业大学 Signal blind reconstructing method under modulation wide-band transducer frame based on sparse Bayesian
CN110602494A (en) * 2019-08-01 2019-12-20 杭州皮克皮克科技有限公司 Image coding and decoding system and method based on deep learning

Similar Documents

Publication Publication Date Title
EP2085965A1 (en) Variable rate speech coding
EP0927988A2 (en) Encoding speech
CN101371295B (en) Apparatus and method for encoding and decoding signal
WO2007083933A1 (en) Apparatus and method for encoding and decoding signal
JP2001222297A (en) Multi-band harmonic transform coder
TW200935402A (en) Scalable speech and audio encoding using combinatorial encoding of MDCT spectrum
JPH03211599A (en) Voice coder/decoder with 4.8 bps information transmitting speed
CN103262161A (en) Apparatus and method for determining weighting function having low complexity for linear predictive coding (LPC) coefficients quantization
US20080040104A1 (en) Speech coding apparatus, speech decoding apparatus, speech coding method, speech decoding method, and computer readable recording medium
CN101159136A (en) Low bit rate music signal coding method
JP2003050600A (en) Method and system for generating and encoding line spectrum square root
US6223151B1 (en) Method and apparatus for pre-processing speech signals prior to coding by transform-based speech coders
Giacobello et al. Stable 1-norm error minimization based linear predictors for speech modeling
Gottesmann Dispersion phase vector quantization for enhancement of waveform interpolative coder
JP2000514207A (en) Speech synthesis system
KR100768090B1 (en) Apparatus and method for waveform interpolation speech coding for complexity reduction
Aloui et al. Optimized speech compression algorithm based on wavelets techniques and its real time implementation on DSP
Xydeas et al. Theory and Real Time Implementation of a CELP Coder at 4.8 and 6.0 kbits/second Using Ternary Code Excitation
Sooraj et al. Performance analysis of CELP codec for Gaussian and fixed codebooks
Sisman et al. A new speech coding algorithm using zero cross and phoneme based SYMPES
Lopukhova et al. A Codec Simulation for Low-rate Speech Coding with Radial Neural Networks
Cornuz et al. Object coding of harmonic sounds using sparse and structured representations
CN114556470A (en) Method and system for waveform coding of audio signals using generative models
Ilk Low Bit Rate DCT Prototype Interpolation Speech Coding
Chilton Factors affecting the quality of linear predictive coding of speech at low bit-rates

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Open date: 20080409