EP0481895A2 - Verfahren und Einrichtung zur Übertragung mit niedriger Bitrate einer Sprachsignals mittels CELP-Codierung - Google Patents
Verfahren und Einrichtung zur Übertragung mit niedriger Bitrate einer Sprachsignals mittels CELP-Codierung Download PDFInfo
- Publication number
- EP0481895A2 EP0481895A2 EP91402774A EP91402774A EP0481895A2 EP 0481895 A2 EP0481895 A2 EP 0481895A2 EP 91402774 A EP91402774 A EP 91402774A EP 91402774 A EP91402774 A EP 91402774A EP 0481895 A2 EP0481895 A2 EP 0481895A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- vector
- vectors
- dictionary
- values
- value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 85
- 230000005540 biological transmission Effects 0.000 title claims abstract description 30
- 239000013598 vector Substances 0.000 claims abstract description 270
- 238000004364 calculation method Methods 0.000 claims abstract description 28
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 23
- 238000003786 synthesis reaction Methods 0.000 claims abstract description 23
- 230000005284 excitation Effects 0.000 claims abstract description 19
- 238000009826 distribution Methods 0.000 claims abstract description 7
- 230000008569 process Effects 0.000 claims description 53
- 230000004044 response Effects 0.000 claims description 18
- 239000011159 matrix material Substances 0.000 claims description 13
- 230000003044 adaptive effect Effects 0.000 claims description 12
- 238000012937 correction Methods 0.000 claims description 9
- 238000001914 filtration Methods 0.000 claims description 8
- 238000000354 decomposition reaction Methods 0.000 claims description 3
- 238000011144 upstream manufacturing Methods 0.000 claims 1
- 230000006870 function Effects 0.000 description 8
- 238000012545 processing Methods 0.000 description 6
- 241000135309 Processus Species 0.000 description 5
- 238000010586 diagram Methods 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 230000006978 adaptation Effects 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 238000012935 Averaging Methods 0.000 description 1
- 206010021403 Illusion Diseases 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 150000004678 hydrides Chemical class 0.000 description 1
- 238000012804 iterative process Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000008054 signal transmission Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/083—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being an excitation gain
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0004—Design or structure of the codebook
- G10L2019/0005—Multi-stage vector quantisation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0007—Codebook element generation
Definitions
- the invention relates to a method of transmission, at low speed, by CELP coding of a speech signal and to the corresponding system.
- CELP Code Excited Linear Prediction
- This technique for coding digital samples representing the speech signal is a hydride coding technique in which the speech signal is modeled by linear prediction filters and residues of this prediction.
- CELP elbows as shown schematically in FIGS. 1a and 1b, exhaustively test all the elements of a list of waveforms. The waveform providing the best synthesis of the signal is retained, and its index, or characteristic address, is transmitted to the decoder. This method is called synthetic analysis.
- the list of waveforms stored in the encoder and the decoder is called a dictionary.
- CELP coder The quality of a CELP coder depends closely on the dictionary chosen, on the method of determination-modeling of the linear prediction filters used, these two parameters constituting two degrees of freedom, not independent, making it possible to adapt a particular CELP coder to the needs of 'a specific application.
- Such a CELP coding technique is suitable for low bit rate coding applications (between 4 to 24 kbit / s).
- bit rate coding applications between 4 to 24 kbit / s.
- each block comprising L values is considered to be a vector of a vector space of dimension L.
- the current excitation signal constituted by a vector v, read in the dictionary of waveforms, must minimize a criterion of perceptual distortion of the form: min ⁇ x - Hv ⁇ 2, in which x denotes a target signal originating from the original signal O to be transmitted after perceptual weighting and H denotes a matrix of dimension LxL of impulse response derived from the product of the transfer functions of the synthesis filter and the perceptual weighting.
- each reference vector vi is associated with an adaptive gain value gk taken from a dictionary of gain values G, which allows, following application of the gain gk to the vector vi to form a vector vk, i, to satisfy the aforementioned minimum distortion criterion.
- Such an operating mode therefore does not make it possible to take into account, as a reference vector, the totality of the possibilities of combinations of the ternary values of the components of the reference vectors, the minimization of the distortion criterion not being able in all cases to be optimal.
- the object of the present invention is to remedy the aforementioned drawbacks, in order in particular to bring about a simplification of the calculations by the introduction as reference vector, in the dictionary of reference vectors, or directions, of almost all the combinations of n-ary values of the components of the vectors, n being an odd number.
- Another object of the present invention is the implementation, prior to the conventional process of applying an adaptive gain to each of the reference vectors, of a correction process by application of a scale factor, introducing the division of the energy of the excitation signal as a function of the frequency spectrum thereof, in order to take into account the non-uniformity of the energy distribution of the signal in the frequency domain.
- Another object of the present invention is finally the implementation of a low-speed transmission method of a speech signal in which each reference vector, constituting the excitation signal, can be regenerated at the level of a decoder based solely on index or address values of the optimal reference vector satisfying the minimum distortion criterion at the level of the coder, which has the effect of considerably simplifying and reducing the manufacturing costs of the aforementioned decoders.
- the method for transmitting a low bit rate speech signal comprises a process for coding digital speech samples by code-excited linear prediction to generate a code signal, a process for transmitting the code signal and a process of decoding the received code signal.
- the coding process corresponds to a process in which a waveform represented by a block of samples comprising L sample values and constituting an initial vector (o) of dimension L is represented, from a synthesis filter , by a reference waveform selected from a dictionary of reference waveforms each forming a reference vector (v) on criterion of minimum quadratic deviation of the initial vector (o) with respect to the form wave or reference vector (v), min ⁇ x - Hv ⁇ 2, where x represents a target vector obtained by perceptual weighting of the initial vector (o) and H a matrix of dimension LxL of impulse response resulting from the product of the filter synthesis and linear perceptual weighting.
- the value n / 2 corresponds to the integer division of n by 2.
- H. ⁇ i.yi> and of all the perceptual energies ⁇ Hy ⁇ 2, which makes it possible to attribute to the initial vector (o) the corresponding optimal reference vector vk *, i *, with vk *, i * gk * . ⁇ i * .yi *, this optimal reference vector being represented by the only values of index k *, i * satisfying the criterion min ⁇ x - gk.H. ⁇ i.yi ⁇ 2.
- the process of transmission at low speed of a speech signal consists in transmitting, as code signal, the only values of the indices k *, i * representative of each optimal reference vector vk *, i * .
- the process of decoding a coded speech signal transmitted at low bit rate according to a code signal is remarkable in that, in order to ensure the decoding of the code signal, this process consists in discriminating the values of the indices k *, i * constituting the code signal, in decomposing the value of the index i *, representative of the optimal reference vector, in base n to regenerate the corresponding base vector yi *, carry out, from the value of the index i *, the scale factor ⁇ i * and the corresponding adaptive gain gk *, a correction of the corresponding regenerated base vector to constitute the regenerated reference vector vk *, i *.
- a synthesis filtering operation is performed on the regenerated reference vector vk *, i * to generate the reconstructed speech signal.
- the method which is the subject of the invention comprises a process for coding digital speech samples by linear prediction excited by codes. This process generates a code signal.
- the method further includes a process for transmitting the code signal and a process for decoding the received code signal.
- the process of coding corresponds to a process in which a waveform represented by a block of samples comprising L values of samples, or frames, constitutes an initial vector denoted o of dimension L this vector being represented, and the corresponding waveform , from a synthesis filter by a reference waveform, denoted v, selected from a dictionary of reference waveforms each forming an aforementioned reference vector.
- the selection is made on the criterion of minimum quadratic deviation of the initial vector o vis-à-vis the waveform or reference vector v, this criterion written: min ⁇ x -Hv ⁇ 2
- x represents a target vector obtained by perceptual weighting of the initial vector o and H represents a matrix of dimension LxL of impulse response resulting from the product of the synthesis filter and from the aforementioned linear perceptual weighting.
- the coding process is such that the selection criterion consists in establishing a dictionary factored into the product of a first dictionary Y of basic vectors denoted yi.
- Each basic vector is a basic vector of n-ary form, that is to say that the components aj of these basic vectors, with j ⁇ [0, L-1], can take n different discrete values.
- each value of the components aj can take a value included in the group [-n / 2, ... 0, ... n / 2] in increments of 1, n being odd, n / 2 representing the integer division of n by 2.
- each basic vector yi is corrected by a scale factor ⁇ i taking into account the distribution of the excitation energy in the frequency domain of the signal.
- the scale factors ⁇ i are determined from a database, experimentally, the database being constituted by recording significant speech samples over several hours for example and for several speakers of the same language of expression or of several languages distinct, experience showing that the diversity of languages of expression only intervenes at the second level in the determination of the aforementioned ⁇ i scale factors.
- the scale factors ⁇ i are determined for each corresponding basic vector yi by a process of identification of each basic vector ⁇ i on a delocalized sequence of L successive recursive speech samples from the database , sorting the weakest adaptation coefficients and averaging a number u of identification or adaptation coefficients to obtain the corresponding scale factor ⁇ i associated with the aforementioned basic vector yi.
- the factorized dictionary previously mentioned also consists of a second dictionary constituting the aforementioned product, this second dictionary being denoted G (y) and being formed by a gain dictionary gk.
- this optimal reference vector is represented by the only values of the parameters of indices k *, i * satisfying the aforementioned criterion: min ⁇ x-gk.H. ⁇ i. yi ⁇ 2.
- the minimum value of the quadratic difference min ⁇ x-gk.H. ⁇ i. yi ⁇ 2 is evaluated by selecting the corresponding gain element gk from the second dictionary G (y) making it possible to minimize the difference
- where g checks the relation: g ⁇ x
- the dictionary Y of basic vectors yi of n-ary form [-n / 2, ..., 0, ... n / 2] of dimension L includes all the basic vectors whose L components have for value the aforementioned n-areas values, with the exception of the null vector.
- the index i of the basic vectors is taken equal to the value in base n of each base vector after transcoding the values ⁇ -n / 2 ..., 0 ... n / 2 ⁇ into corresponding values (0,1,2 ... n). It will thus be understood that the basic vectors yi of n-ary form are arranged as a function of their index i, this index i having for value the value in base n of each vector.
- the set of basic vectors yi constituting the dictionary Y is defined from n / 2 ⁇ L impulse vectors of which only one component aj of order j, with j ⁇ [0, L-1], is equal to -1, -2, ... -n / 2.
- FIGS. 3a and 3b operator cells have been shown respectively making it possible to generate, from the previously defined pulse vectors and from sub-dictionaries constituted by the pulse vector considered and the related vectors corresponding to each pulse vector, the complete dictionary comprising the collection of all of the sub-dictionaries.
- Each operator as represented in FIG. 3a comprises an operator called delay operator R whose transfer function is denoted Z+1, according to the classical notation of transform into Z, a symmetrizer operator denoted Sy which has the function of multiplying the components of all vectors presented at its entry by the value +1, by the value 0 then by the value -1 and a summator, noted S, receiving the output of the delay operator R and of the symmetrizer Sy.
- the summator S receives the output of the delay operator R via a switch 1, in position F, or the zero vector [0,0,0,0,0] of dimension L in position O.
- the operators represented in FIG. 3a are constituted by a single operator represented in 1), 2) and 3) at different stages of the processing process to generate the basic vectors yi of the abovementioned dictionary Y.
- the initial pulse or pulse vector ⁇ L-1 is present at the input of the delay operator R.
- Le symmetrizer Sy is then supplied by a noted sub-dictionary DO ⁇ which is initially formed by the aforementioned pulse vector ⁇ L-1.
- the pulse vector ⁇ L-2 is associated with the sub-dictionary D1 formed by the related vectors y1, y2, y3 with the pulse vector ⁇ L-2 and by the pulse vector initial L-1 initial forming the basic vector y0, as well as the null vector.
- the pulse vector initial L-1 initial forming the basic vector y0, as well as the null vector.
- the operator making it possible to generate the basic vectors yi is such that the latter receives the pulse vector at the level of the delay operator R ⁇ Lm, at the level of the symmetrizor Sy, the dictionary denoted D m-1 formed recursively like the dictionary D1, the summator S as represented in point 2) of the same figure 3a then delivering from the impulse vector ⁇ Lm-1 above delivered by the delay operator R or the zero vector and by the sub-dictionary D m-1 ⁇ the sub-dictionary D m.
- the * represented at the level of the aj components for the level m processing process correspond to values 0, -1 or +1 when the vectors are ternary vectors.
- the * represent values between -n / 2 and + n / 2, under the conditions previously mentioned.
- the total ternary dictionary, sum or union of all the sub-dictionaries of intermediate level m, up to L can be obtained for the only positive or negative values of the components aj, the total dictionary then being able to be obtained by symmetrization by through a symmetrization operator such as Sy.
- this operator is such that the impulse responses of the system H at the relative time 0, 1, 2, L-1, ie the values h0, h1, hL -2, hL-1 are applied to the above operator.
- the symmetrization operator Sy multiplies the elements of S L-1 (Dm-1) by +1, 0, -1 and realizes, as described above, the union of the distinct elements obtained.
- the elementary detripling cell is represented in FIG. 5b from the pulse vectors denoted ⁇ -1, ⁇ 0 and ⁇ 1. It will be noted that the summation of the pulse vectors ⁇ 1, ⁇ 0, ⁇ -1 amounts to replacing the last coordinate of the incident base vector by the component values +1, 0 or -1.
- FIGS. 5a and 5b the architecture as represented in FIGS. 5a and 5b is that of a linear structure of ternary graphs. For an n-ary structure we get an n-ary graph.
- the global graph for obtaining the energies is traversed from right to left, the initial energy E (0) being at SL-1 (0) 2.
- the elementary cell constituting the graph represented in FIG. 5c is represented in FIG. 5d.
- each reference vector vk *, i * can advantageously be weighted by a predicted level factor, noted ⁇ .
- This predicted level factor ⁇ is representative of the average energy of the excitation signal estimated on at least three successive previous excitation vectors.
- the previous expression is then calculated by filtering the expression 2 x ⁇ by the matrix transposed from the matrix H or t H.
- the calculation process as represented by the operator in FIG. 6 allows, in a manner analogous to the calculation of the partial responses S L-1 (yi) previously described, to obtain the quantities x′0, x′Lm-1, x′L-2 and therefore the abovementioned scalar products, the zero vector being replaced by the zero value.
- each scale factor ⁇ i can be determined from a plurality N of frames, from a speech signal database, the scale factor ⁇ i for each base vector yi being chosen so as to make the filter residue of the aforementioned frames minimum for the frame considered. It will be recalled that several processes for determining each scale factor ⁇ i can be considered.
- the transmission of speech at low speed is carried out by the sole transmission, as a code signal, of the values of the indices k * and i * representative of each reference vector vk *, i *.
- the transmission can be carried out using conventional transmission protocols in which a redundancy of the information transmitted is introduced in order to ensure transmission at a rate substantially zero error.
- the value i * can be transmitted either in direct numbering either in retrograde numbering, or according to a translated numbering whose translation table is known to both the coder and the decoder.
- the decoding process consists in discriminating in 1000 the values of the indices k * and i * constituting the code signal and then in decomposing into 1001 the value of the index i * representative of the optimal reference vector in base n in order to regenerate the corresponding base vector yi *.
- the decoding process consists in carrying out a synthesis filtering operation 1003 of the reference vector to generate the reconstructed speech signal.
- each reference vector vk *, i * before the synthesis filtering is weighted by a level factor predicts ⁇ which is estimated on at least three successive previous excitation vectors.
- the determination of the predicted level a will not be described in detail since it corresponds to the level of the decoding process with operations normally known to those skilled in the art.
- the coding circuit comprises a generator 1 of a first dictionary Y of basic vectors yi of n-ary shape of dimension L, the components of these vectors, as mentioned previously, being able to take the values between -n / 2 to n / 2.
- the generator of the dictionary Y can advantageously be constituted by calculating means comprising the operators as described in FIGS. 3a, 3b for example and / or a storage circuit which can be constituted by a random access memory associated with this circuit computer or by ROM.
- the read only memory is associated with a fast sequencer which makes it possible to carry out a successive reading of the basic vectors yi according to the indices in direct or retrograde numbering as described previously.
- the coding circuit as represented in FIG. 8 comprises a circuit 2 correcting the basic vectors yi by a scale factor ⁇ i.
- a fast multiplexer denoted MUX makes it possible successively to read the corresponding values of the corrected base vector yi ⁇ and to deliver this corresponding value to a circuit 3 generator of a second adaptive gain dictionary gk.
- the generator circuit 3 of the second dictionary G (y) may advantageously include an amplifier circuit, denoted 30, connected to a table of values gk constituting the aforementioned second dictionary.
- the coding circuit object of the present invention also comprises an amplifier circuit 4 which makes it possible to apply to each reference vector vk, i the level prediction coefficient ⁇ as it has been defined previously in the description.
- the coding circuit which is the subject of the present invention then comprises in cascade the synthesis filter denoted 5 and the perceptual weighting filter denoted 6 by transmission H as described previously in the description.
- a summator 7 makes it possible to receive on the one hand the original signal via the same perceptual weighting filter 6 after inversion the difference of the signals delivered by the algebraic summator 7, allowing the application to the signal thus obtained from the minimum distortion criterion.
- the coding circuit which is the subject of the present invention comprises a circuit for calculating the minimum distortion 8 which comprises a first circuit 80 for calculating the product.
- a circuit for calculating the minimum distortion 8 which comprises a first circuit 80 for calculating the product.
- the first calculator circuit 80 delivers a first calculation result r1.
- a second calculating circuit 81 makes it possible to calculate the energy of the reconstructed vector and perceptually weighted this energy being of the form gk2 ⁇ H. ⁇ i.yi ⁇ 2.
- the computer circuits 80 and 81 can be constituted by program modules whose calculation graphs have been explained respectively in FIGS. 4 and 5 a) to d) respectively.
- the second calculation circuit 81 delivers a second result of calculation noted r2.
- a comparator 83 makes it possible to compare the value of the calculation results r1 and r2 which makes it possible to determine by discrimination of the values of the indices i and k, the indices i * and k * for which the criterion of minimum of the quadratic difference is satisfied .
- the discrimination of the indices i * and k * is carried out for example by a sorting program noted 84 in FIG. 8.
- the values of the indices k * and i * are then delivered, these indices being representative of the corresponding reference vector vk *, i *.
- FIG. 8 also shows the transmission circuit according to the subject of the present invention, this transmission circuit making it possible to deliver as a code signal representative of the speech signal the only values of the indices k * and i * .
- This transmission circuit does not have any particular characteristic insofar as it can in fact be constituted by a transmission system of conventional type used in the devices for transmitting speech signals by coding of CELP type of the prior art.
- FIG. 9 A more detailed description of a decoding circuit allowing the implementation of the method which is the subject of the invention is shown in FIG. 9.
- the decoding circuit comprises a module 10 for discriminating the values of the indices i *, k * of the code signal received, the code signal being of course transmitted according to a particular protocol which does not enter into the object of the present invention.
- the discrimination circuit 10 thus performing a parallel series transformation of the information relating to the indices i *, k *
- the decoding circuit comprises a decomposition circuit in base n of the value of the index i *.
- the decoding circuit as shown in FIG. 9 includes a table of adaptive gain values gk denoted 11 which on reception of the value of the index k * makes it possible to deliver the corresponding adaptive gain value gk *.
- This circuit 11 can advantageously consist of a read-only memory in which the adaptive gain values gk are stored.
- a generator circuit 12 of the scale factor ⁇ i * is provided.
- This circuit can consist of a read only memory forming a look-up table, which with the value i * matches the value ⁇ i *.
- the decoding circuit comprises a circuit 13 generating the regenerated base vector by decomposition into base n of the value of the index i *.
- a circuit 14 corresponds to the value i * by transcoding the components in base n of the value of index i *, the value ⁇ -n / 2, ..., 0, ... n / 2 ⁇ , which makes it possible to generate a regenerated reference vector vk *, i * of the product of the vector regenerated base and product A.
- a synthesis filter 15 allows starting from the regenerated reference vector to generate the reconstructed speech signal.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR9012980A FR2668288B1 (fr) | 1990-10-19 | 1990-10-19 | Procede de transmission, a bas debit, par codage celp d'un signal de parole et systeme correspondant. |
FR9012980 | 1990-10-19 |
Publications (3)
Publication Number | Publication Date |
---|---|
EP0481895A2 true EP0481895A2 (de) | 1992-04-22 |
EP0481895A3 EP0481895A3 (en) | 1992-08-12 |
EP0481895B1 EP0481895B1 (de) | 1997-12-10 |
Family
ID=9401407
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP91402774A Expired - Lifetime EP0481895B1 (de) | 1990-10-19 | 1991-10-17 | Verfahren und Einrichtung zur Übertragung mit niedriger Bitrate eines Sprachsignals mittels CELP-Codierung |
Country Status (5)
Country | Link |
---|---|
US (1) | US5226085A (de) |
EP (1) | EP0481895B1 (de) |
JP (1) | JP3130348B2 (de) |
DE (1) | DE69128407T2 (de) |
FR (1) | FR2668288B1 (de) |
Families Citing this family (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2658794B2 (ja) * | 1993-01-22 | 1997-09-30 | 日本電気株式会社 | 音声符号化方式 |
AU675322B2 (en) * | 1993-04-29 | 1997-01-30 | Unisearch Limited | Use of an auditory model to improve quality or lower the bit rate of speech synthesis systems |
FR2722631B1 (fr) * | 1994-07-13 | 1996-09-20 | France Telecom Etablissement P | Procede et systeme de filtrage adaptatif par egalisation aveugle d'un signal telephonique numerique et leurs applications |
JP3058028B2 (ja) * | 1994-10-31 | 2000-07-04 | 三菱電機株式会社 | 画像符号化データ再符号化装置 |
SE504010C2 (sv) * | 1995-02-08 | 1996-10-14 | Ericsson Telefon Ab L M | Förfarande och anordning för prediktiv kodning av tal- och datasignaler |
DE19516106C2 (de) * | 1995-05-05 | 2003-04-03 | Philips Corp Intellectual Pty | Verfahren zum Bestimmen von Referenzwerten |
US6477496B1 (en) | 1996-12-20 | 2002-11-05 | Eliot M. Case | Signal synthesis by decoding subband scale factors from one audio signal and subband samples from different one |
US6782365B1 (en) | 1996-12-20 | 2004-08-24 | Qwest Communications International Inc. | Graphic interface system and product for editing encoded audio data |
US5864813A (en) * | 1996-12-20 | 1999-01-26 | U S West, Inc. | Method, system and product for harmonic enhancement of encoded audio signals |
US6463405B1 (en) | 1996-12-20 | 2002-10-08 | Eliot M. Case | Audiophile encoding of digital audio data using 2-bit polarity/magnitude indicator and 8-bit scale factor for each subband |
US6516299B1 (en) | 1996-12-20 | 2003-02-04 | Qwest Communication International, Inc. | Method, system and product for modifying the dynamic range of encoded audio signals |
US5845251A (en) * | 1996-12-20 | 1998-12-01 | U S West, Inc. | Method, system and product for modifying the bandwidth of subband encoded audio data |
US5864820A (en) * | 1996-12-20 | 1999-01-26 | U S West, Inc. | Method, system and product for mixing of encoded audio signals |
FI113571B (fi) | 1998-03-09 | 2004-05-14 | Nokia Corp | Puheenkoodaus |
US7536298B2 (en) * | 2004-03-15 | 2009-05-19 | Intel Corporation | Method of comfort noise generation for speech communication |
JP4993676B2 (ja) * | 2006-09-01 | 2012-08-08 | キヤノン株式会社 | 画像符号化装置及び画像符号化方法 |
CN101430879B (zh) * | 2007-11-05 | 2011-08-10 | 华为技术有限公司 | 一种多速率语音频编码的方法 |
WO2011074233A1 (ja) * | 2009-12-14 | 2011-06-23 | パナソニック株式会社 | ベクトル量子化装置、音声符号化装置、ベクトル量子化方法、及び音声符号化方法 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0379296A2 (de) * | 1989-01-17 | 1990-07-25 | AT&T Corp. | Linearer Prädiktivkodierer mit Code-Anregung für Sprach- oder Audiosignale mit niedriger Verzögerung |
WO1991013432A1 (en) * | 1990-02-23 | 1991-09-05 | Universite De Sherbrooke | Dynamic codebook for efficient speech coding based on algebraic codes |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
NL8302985A (nl) * | 1983-08-26 | 1985-03-18 | Philips Nv | Multipulse excitatie lineair predictieve spraakcodeerder. |
NL8500843A (nl) * | 1985-03-22 | 1986-10-16 | Koninkl Philips Electronics Nv | Multipuls-excitatie lineair-predictieve spraakcoder. |
US4944013A (en) * | 1985-04-03 | 1990-07-24 | British Telecommunications Public Limited Company | Multi-pulse speech coder |
IT1195350B (it) * | 1986-10-21 | 1988-10-12 | Cselt Centro Studi Lab Telecom | Procedimento e dispositivo per la codifica e decodifica del segnale vocale mediante estrazione di para metri e tecniche di quantizzazione vettoriale |
US4868867A (en) * | 1987-04-06 | 1989-09-19 | Voicecraft Inc. | Vector excitation speech or audio coder for transmission or storage |
US4910781A (en) * | 1987-06-26 | 1990-03-20 | At&T Bell Laboratories | Code excited linear predictive vocoder using virtual searching |
US4899385A (en) * | 1987-06-26 | 1990-02-06 | American Telephone And Telegraph Company | Code excited linear predictive vocoder |
US5091946A (en) * | 1988-12-23 | 1992-02-25 | Nec Corporation | Communication system capable of improving a speech quality by effectively calculating excitation multipulses |
US4980916A (en) * | 1989-10-26 | 1990-12-25 | General Electric Company | Method for improving speech quality in code excited linear predictive speech coding |
-
1990
- 1990-10-19 FR FR9012980A patent/FR2668288B1/fr not_active Expired - Fee Related
-
1991
- 1991-10-17 DE DE69128407T patent/DE69128407T2/de not_active Expired - Fee Related
- 1991-10-17 EP EP91402774A patent/EP0481895B1/de not_active Expired - Lifetime
- 1991-10-18 JP JP03298096A patent/JP3130348B2/ja not_active Expired - Fee Related
- 1991-10-18 US US07/779,310 patent/US5226085A/en not_active Expired - Lifetime
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0379296A2 (de) * | 1989-01-17 | 1990-07-25 | AT&T Corp. | Linearer Prädiktivkodierer mit Code-Anregung für Sprach- oder Audiosignale mit niedriger Verzögerung |
WO1991013432A1 (en) * | 1990-02-23 | 1991-09-05 | Universite De Sherbrooke | Dynamic codebook for efficient speech coding based on algebraic codes |
Non-Patent Citations (2)
Title |
---|
Academic Publishers, Dordrecht, NL; R.A. SALAMI: "Binary pulse excitation: a novel approach to low complexity CELP coding" * |
ADVANCED IN SPEECH CODING (IEEE WORKSHOP ON SPEECH CODING FOR TELECOMMUNICATIONS, Vancouver, 5 - 8 septembre 1989), pages 145-156, Kluwer Academic Publishers, Dordrecht, NL; R.A. SALAMI: "Binary pulse excitation: a novel approach to low complexity CELP coding" * |
Also Published As
Publication number | Publication date |
---|---|
EP0481895B1 (de) | 1997-12-10 |
FR2668288B1 (fr) | 1993-01-15 |
DE69128407T2 (de) | 1998-06-04 |
JPH04264500A (ja) | 1992-09-21 |
JP3130348B2 (ja) | 2001-01-31 |
FR2668288A1 (fr) | 1992-04-24 |
US5226085A (en) | 1993-07-06 |
DE69128407D1 (de) | 1998-01-22 |
EP0481895A3 (en) | 1992-08-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0481895B1 (de) | Verfahren und Einrichtung zur Übertragung mit niedriger Bitrate eines Sprachsignals mittels CELP-Codierung | |
EP0608174B1 (de) | System zur prädiktiven Kodierung/Dekodierung eines digitalen Sprachsignals mittels einer adaptiven Transformation mit eingebetteten Kodes | |
CA2266654C (fr) | Procede et dispositif d'egalisation aveugle des effets d'un canal de transmission sur un signal de parole numerique | |
EP0511095B1 (de) | Verfahren und Vorrichtung zur Kodierung und Dekodierung eines numerischen Signals | |
EP1994531A2 (de) | Verbesserte verschlüsselung/entschlüsselung eines digitalen tonsignals im celp-verfahren | |
EP0428445B1 (de) | Verfahren und Einrichtung zur Codierung von Prädiktionsfiltern in Vocodern mit sehr niedriger Datenrate | |
FR2690551A1 (fr) | Procédé de quantification d'un filtre prédicteur pour vocodeur à très faible débit. | |
EP0195441B1 (de) | Verfahren zur Sprachcodierung mit niedriger Bitrate unter Verwendung eines Mehrimpulsanregungssignals | |
FR2702075A1 (fr) | Procédé de génération d'un filtre de pondération spectrale du bruit dans un codeur de la parole. | |
EP0692883A1 (de) | Verfahren zur blinden Entzerrung, und dessen Anwendung zur Spracherkennung | |
EP0673113A1 (de) | Signalquellencharakterisiersystem | |
CA2108663C (fr) | Procede et dispositif de filtrage pour la reduction des preechos d'un signal audio-numerique | |
EP0574288B1 (de) | Verfahren und Einrichtung zur Übertragungsfehlerverdeckung von mittels einer Frequenztransformation kodierten digitalen Audiosignalen | |
EP0341129A1 (de) | Verfahren und Einrichtung zur Kodierung von Sprachsignalenergie in Vokodern mit sehr niedriger Bitrate | |
EP0347307B1 (de) | Kodierungsverfahren und linearer Prädiktionssprachkodierer | |
EP1192619B1 (de) | Audio-kodierung, dekodierung zur interpolation | |
EP1194923B1 (de) | Verfahren und system für audio analyse und synthese | |
FR2751776A1 (fr) | Procede d'extraction de la frequence fondamentale d'un signal de parole | |
EP3934282A1 (de) | Verfahren zur umwandlung eines ersten satzes repräsentativer signale eines schallfelds in einen zweiten satz von signalen und entsprechende elektronische vorrichtung | |
EP1192618B1 (de) | Audiokodierung mit adaptiver lifterung | |
WO2010076412A2 (fr) | Procede et un dispositif d'estimation de signaux de source issus d'un signal de melange | |
EP1605440A1 (de) | Verfahren zur Quellentrennung eines Signalgemisches | |
EP0812070A1 (de) | Verfahren und Gerät zur Kompressionskodierung eines digitalen Signals | |
WO2002029786A1 (fr) | Procede et dispositif de codage segmental d'un signal audio | |
EP0696110A1 (de) | Verfahren und Gerät zur Tonkodierung und Tonkodierung durch Frequenzkompression, insbesondere zur Anwendung in einem Massentonspeicher |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): DE GB |
|
PUAL | Search report despatched |
Free format text: ORIGINAL CODE: 0009013 |
|
AK | Designated contracting states |
Kind code of ref document: A3 Designated state(s): DE GB |
|
17P | Request for examination filed |
Effective date: 19920910 |
|
17Q | First examination report despatched |
Effective date: 19950606 |
|
GRAG | Despatch of communication of intention to grant |
Free format text: ORIGINAL CODE: EPIDOS AGRA |
|
GRAH | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOS IGRA |
|
GRAH | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOS IGRA |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): DE GB |
|
REF | Corresponds to: |
Ref document number: 69128407 Country of ref document: DE Date of ref document: 19980122 |
|
GBT | Gb: translation of ep patent filed (gb section 77(6)(a)/1977) |
Effective date: 19980209 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed | ||
REG | Reference to a national code |
Ref country code: GB Ref legal event code: IF02 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20080926 Year of fee payment: 18 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20081030 Year of fee payment: 18 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20100501 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20091017 |