WO1997014139A1 - Methode et dispositif de prediction de signal pour un codeur de parole - Google Patents
Methode et dispositif de prediction de signal pour un codeur de parole Download PDFInfo
- Publication number
- WO1997014139A1 WO1997014139A1 PCT/FR1996/001596 FR9601596W WO9714139A1 WO 1997014139 A1 WO1997014139 A1 WO 1997014139A1 FR 9601596 W FR9601596 W FR 9601596W WO 9714139 A1 WO9714139 A1 WO 9714139A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- vector
- signal
- vectors
- optimal
- filtered
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 16
- 230000005284 excitation Effects 0.000 claims abstract description 23
- 230000003044 adaptive effect Effects 0.000 claims abstract description 15
- 239000013598 vector Substances 0.000 claims description 151
- 238000004458 analytical method Methods 0.000 claims description 23
- 238000003786 synthesis reaction Methods 0.000 claims description 10
- 230000015572 biosynthetic process Effects 0.000 claims description 9
- 238000011002 quantification Methods 0.000 claims description 9
- 238000001914 filtration Methods 0.000 claims description 8
- 238000013139 quantization Methods 0.000 claims description 8
- 238000004519 manufacturing process Methods 0.000 claims description 6
- 230000004044 response Effects 0.000 claims description 4
- 230000002596 correlated effect Effects 0.000 abstract 1
- 238000004364 calculation method Methods 0.000 description 8
- 230000007774 longterm Effects 0.000 description 6
- 230000001755 vocal effect Effects 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 3
- 238000001228 spectrum Methods 0.000 description 3
- 238000004891 communication Methods 0.000 description 2
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 230000009471 action Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000006866 deterioration Effects 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000010355 oscillation Effects 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 210000002345 respiratory system Anatomy 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000010183 spectrum analysis Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0011—Long term prediction filters, i.e. pitch estimation
Definitions
- the present invention relates to a method for predicting, in a so-called CELP speech coder, the residual vector signal, or residual vector, of the short-term analysis, said signal containing the periodicity information present in an initial speech signal to be coded. broken down into successive subframes, said prediction being carried out on the basis of optimal excitations predicted for the preceding subframe.
- CELP speech coder comprising on the one hand a short-term analysis filter, which receives an initial voice signal to be coded broken down into successive sub-frames and delivers a residual vector signal defining the information of periodicity present in the initial voice signal, and on the other hand a device for predicting this residual signal and a circuit for estimating a prediction error by difference between this residual vector signal and the predicted vector signal, more particularly said communication device prediction.
- the words emitted by a phonation organ constitute a vocal signal which presents two types of properties: on the one hand those related to the mechanism of the perception of this signal by the human auditory system (finite bandwidth, finite resolution in frequency, sensitivity at resonance frequencies, insensitivity to the phase of the frequency components of the signal, etc.), and on the other hand those linked to the operating mechanism of the phonation organ (pseudo-periodicity of the sounds, resonant structure of the signal, ).
- the voice message itself can be considered as the combination of content information and additional so-called expression information, which reflects individual variations in the acoustic presentation of the message. It is obvious that an effective transmission of such a message would undoubtedly imply defining a criterion of loyalty. It is however more realistic, in general, to be content with defining a perceptual criterion, which makes it possible to recognize the absence of discernible differences between a message sent and the corresponding received signal.
- the voice signal is, in fact, constituted by variations in air pressure, generated by the vocal tract under the action of the respiratory system which supplies the energy necessary for the production of speech.
- the air flow out of the lungs is modulated at a so-called fundamental frequency F with which the production of vowels is associated.
- This frequency which varies from 70 to 150 hertz approximately for men and from 150 to * ⁇ 00 hertz approximately for women, characterizes so-called voiced sounds (an example of representation of the amplitude A of a voiced sound as a function of time t is given in Figure 1).
- the air flow then excites in forced oscillations of the cavities of the vocal tract, to the shape of which correspond natural frequencies F. ,, F 2 , F- ,, etc ... called formants.
- the voice signal also includes signals which do not have the coherence of voiced sounds, but which are similar to noise, emitted by a source without natural frequency, and which do not excite the natural frequencies of the vocal tract. (these sounds are linked to the production of most consonants).
- the characteristics of the voice signal can be highlighted by a spectral analysis, which shows that the emitted spectrum comprises on the one hand a spectrum of lines (periodic excitations) for the production of the voiced sounds and on the other hand a continuous spectrum (excitations inconsistent) for producing unvoiced sounds.
- a further analysis of a voice signal ultimately shows that its processing with a view to a faithful transmission of the bandwidth which it represents leads to acoustically manipulating a considerable data stream.
- This coder is based on a principle called analysis by synthesis (in English: linear prediction analysis-by-synthesis coding), which comprises on the one hand an analysis step, to determine the coefficients of the synthesis filter, and on the other hand, a step of analysis by synthesis, which consists in finding or calculating a sequence of excitations minimizing a determined error criterion (we often use the criterion of least squares).
- analysis by synthesis in English: linear prediction analysis-by-synthesis coding
- CELP coding a term which will be adopted in the following description
- CELP coding is based on a simplified model of the speech production mode, according to which, as a first approximation, the voice signal can be modeled by a short-term (voice path) and long-term (voice source) correlation filter having a signal as input excitation.
- CELP CELP
- the draft recommendation G.723 cited above also uses so-called harmonic filtering.
- Short-term analysis is based on a predictive method in which the basic idea implemented is, knowing an input voice signal or observed signal s (n)
- the index n denotes the rank of the sample
- the speech coder described in the document G.723 cited above receives a signal consisting of blocks, or frames, comprising 240 samples, with a sampling frequency of 8 kHz, and each frame is supposed to be divided into four sub-frames. of 60 samples each.
- a filter 1 / A (z) called a synthesis filter, which, applied to an excitation signal x (n), makes it possible to obtain a signal p (n) as close as possible to the sampling signal s (n), it is equivalent to searching for a filter A (z), said to be of analysis, whose coefficients are such that the output signal of the filter is as bleached as possible when this filter is attacked by the signal to be transmitted (in an ideal coder, the output signal would be real white noise).
- the first performs a decorrelation of adjacent samples: its purpose is to define the coefficients of the input filter most suitable for obtaining after filtering the signal known, of a residual signal as close as possible to white noise.
- t (.) and p (.) denote respectively, the sample concerned in the signal to be modeled and the predicted sample
- ⁇ is a gain value
- OLP in English, Open Loop Pitch
- the determination of the period OLP and that of the gain ⁇ suffices to implement the prediction represented by the expression (3) -
- this determination has the form of a direct dependence: to within a gain factor, the sample to be predicted would be equal to one of the samples already occurring. In fact, the principle adopted is even more general.
- each of these groups of five vectors is called "V-vector".
- the first component of the first vector of each V-vector is obtained by shifting from
- the operation corresponding to expression (4) is therefore a prediction with linear combination of samples, during which the search for the solution vector is refined by varying the gain ⁇ , for example, in the case of the G project. 723 cited, by giving it five distinct values ( ⁇ is the gain vector formed by these five values), and also by adjusting the OLP quantity by a small value.
- the selection, during this research, of the best possible vector solution is made by including in the course of the determination process a minimization step, in the least squares sense, the difference between the vector t (n) of the analysis filter output (whose coefficients will be transmitted) and the solution vectors resulting from the implementation of expression (4).
- the analysis filter which, having received the voice signal to be transmitted, only delivers a residual signal constituting the periodicity information of this initial voice signal, it is this signal depleted t (n) for which the prediction explained below will be implemented with reference to FIG. 2.
- the adaptive dictionary therefore contains the excitation vectors candidates for the construction of an approximation of this vector t (n) .
- FIG. 2 shows, in the case of the G.723 project. an example of a prediction device making it possible to implement the principles of determination which have just been described.
- This device firstly comprises a circuit 20 for storing excitation vectors (this is the adaptive dictionary mentioned above), constructed from the optimal excitation of the preceding sub-frame, that is to say -to say excitations selected during a previous implementation of the same prediction method for previous samples.
- excitation vectors this is the adaptive dictionary mentioned above
- OLP + ⁇ CLP (in English, Closed Loop Pitch)
- circuit 20 is followed, in the case where one is in an even sub-frame and where ⁇ takes for example the three values - 1, 0 and + 1 (this is the case shown in FIG.
- channels 30 here three identical channels 30a, 30b, 30c (circuit 20 is followed by four identical channels, referenced in a similar way, in the case where one is in an odd sub-frame, ⁇ then taking the four values - 1, 0, + 1, + 2).
- Each of these channels processes the V-vector of the adaptive dictionary which corresponds to the step ⁇ of the channel considered, and includes in this firstly a series effect filter 31. having as response the impulse response of the synthesis filter (defined above).
- a circuit 32 also receiving the target vector to be modeled t (n) is then provided for the calculation of a vector with twenty components V ( ⁇ ) consisting of five correlation terms between the filtered vectors and the residual vector (in English: cross-products), given by the scalar products of the five filtered vectors of the adaptive dictionary by the target vector t (n), of five energy terms , given by the scalar products of the five filtered vectors of the adaptive dictionary by themselves, and ten two-by-two correlation terms between the filtered vectors. From these correlations, it is possible to determine to what extent the residual vector or target vector t (n) can be modeled from the V-vectors of the adaptive dictionary.
- the gains which are quantized, are provided by a memory * t0, or quantization table, which contains the possible values for the different gains (170 to 5.3 kbits / s., And 85 or 170 to 6.3 kbits / s., the 170 vectors of the table used in one or the other case then being the same).
- the information relating to the gains is given, in this quantification table, in the form of vectors each having twenty terms (as previously) defined as follows: five gain values, five values equal to the square of these gains, ten values corresponding to the ten two by two of these five gain values.
- the optimal gain vector ⁇ in the sense of expression (5) is the one which cancels the derivative of ⁇ with respect to each of the components of the vector, and which, by there, maximizes the scalar product of V ( ⁇ ) by a vector from the quantification table.
- a circuit 50 then allows the selection of the maximum scalar product among the three or four scalar products available at the output of these three (or four) circuits, maximum scalar product to which an optimal value of the step correspondent (stored in a memory 110) and an optimal value of the gain vector j3.
- the optimal value of ⁇ is of course one of the three (or four) values used in the three (or four) channels, and the value thus selected makes it possible to control a switch 60, comprising as many inputs as there are of tracks (three, or four). This switch, provided at the output of the filters 31. makes it possible to select the filtered V-vector constituting the best representative for the desired solution vector.
- This selected filtered vector is then presented at the input of an amplifier 70 whose gain vector j3, delivered by the selection circuit 50, had been stored in a memory 80 present at the output of this circuit 50.
- a first aim of the invention is therefore to propose a simpler prediction method, with practically equal quality, than that previously described.
- the invention relates to a prediction method comprising for each of said subframes the following steps: (1) for different values of a step ⁇ said of determining said periodicity information, and with a view to selecting, from said previously predicted optimal excitations, an optimal gain vector p opt and the corresponding value of not optimal, a step of carrying out, in series and for each value of ⁇ , the following substeps:
- (c) a preselection sub-step, for determining an initial gain vector ⁇ ( ⁇ ) init , squares of the components of said vector, and of the products of these same components taken two by two, delivering a second vector P ⁇ ⁇ i n i t •
- Another object of the invention is to propose a speech coder similar to that which has just been described, but with reduced complexity and while retaining practically equivalent quality.
- the invention relates, in an encoder as defined in the preamble to the description, a prediction device comprising:
- (A) a vector storage circuit, called an adaptive dictionary, containing the optimal excitations predicted for the previous sub-frame;
- (D) a memory, called a quantization table, which contains the components of the candidate gain vectors, as well as their squares and their products in pairs;
- (E) a selector switch, in connection with said optimal value of the step, of the corresponding optimal vector V ( ⁇ ) composed of correlation terms, given by the scalar products of the vectors filtered by the residual vector of energy terms, given by the dot products of the vectors filtered by themselves, and of correlation terms, given by the dot products two by two of the filtered vectors; (F) a circuit for selecting, in said quantization table, the candidate gain vectors;
- (I) a subtractor for estimating said prediction error by difference between said residual signal t (n) and the predicted signal p (n) delivered by said amplifier.
- the solution according to the invention makes it possible to make a much lower number of courses (four in the case encoder according to this document), with practically insensible quality degradation.
- the basic idea of the structure thus proposed is indeed the following: by making the simplifying assumption that the predictors are decorrelated, or, which is equivalent, that the correlations between filtered vectors taken two by two, previously defined, are zero, we can, for each sub-frame and for each channel corresponding to a step ⁇ , define an initial gain vector P ( ( 5) init , without costly matrix inversion calculation. The components of this vector are then, for the channel considered, the successive ratios of the correlation terms between the filtered vectors and the target vector and the energy terms of the vector V ( ⁇ ) defined above.
- an initial vector gain ⁇ ( ⁇ J-init llows then determine an initial ⁇ value (and suboptimal) route requiring only the quantization table per subframe
- the optimal initial step determined for each subframe by the calculation subset provided at the output of the channels, is then that which maximizes one of the three, or four depending on whether the subframe is even or odd , scalar products of ⁇ ⁇ ) ⁇ n it ⁇ ar V ( ⁇ ), determined by the calculation circuit provided in each channel after the calculation of V ( ⁇ ) (we recall here that V ( ⁇ ) was defined previously, while that ⁇ (° " ) i n it is a vector which includes the same number of components as V ( ⁇ ), namely, in this case: the five
- the vector of the optimal gain for each sub-frame is then obtained using the circuit provided at the output of the quantization table and which performs the search for the maximum value of the scalar product of the vector V ( ⁇ ) corresponding to the optimal value the step ⁇ which has just been calculated by each of the vectors (here 170 or 85 depending on the flow) of said table.
- This determination of the optimal gain vector requires only one scan of the quantization table per subframe, ie four per frame (instead of three or four per subframe depending on whether the subframe is even or odd. , or fourteen per frame), which results in a significant reduction in complexity.
- FIG. 1 shows an example of representation of the amplitude of a voiced sound as a function of time
- FIG. 2 and 3 show the structure of a closed-loop voicing period prediction device, respectively in the case of the cited G723 document and in the case of the present invention.
- the prediction device presents, with that of FIG. 2, common elements, namely the circuit 20 for storing the candidate excitation vectors (or adaptive dictionary), the filters 31. the circuits 32 for calculating the correlation and energy terms, the memory 40 (or quantification table), the switch 60, the amplifier 70, the memory 80, the subtractor 90, and the memory 110.
- the search for the optimal gain vector is then carried out by the selection circuit 150, and the vector thus selected is kept in memory 80.
- the switch 60 provided at the output of the filters 31 and whose position is controlled by the value of ⁇ issue from memory 110, sends the selected filtered V-vector to amplifier 70.
- the optimal filtered V-vector thus amplified is the prediction vector p (n), sent, as in the case of FIG. 2, to the subtractor 90.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP9514782A JPH11500837A (ja) | 1995-10-11 | 1996-10-11 | スピーチコーダ用信号予測方法及び装置 |
DE69609592T DE69609592T2 (de) | 1995-10-11 | 1996-10-11 | Verfahren und vorrichtung zur signalprädiktion für einen sprachkodierer |
EP96934902A EP0796490B1 (fr) | 1995-10-11 | 1996-10-11 | Methode et dispositif de prediction de signal pour un codeur de parole |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR9511937A FR2739964A1 (fr) | 1995-10-11 | 1995-10-11 | Dispositif de prediction de periode de voisement pour codeur de parole |
FR95/11937 | 1996-09-25 | ||
EP96402030.9 | 1996-09-25 | ||
EP96402030 | 1996-09-25 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1997014139A1 true WO1997014139A1 (fr) | 1997-04-17 |
Family
ID=26144072
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/FR1996/001596 WO1997014139A1 (fr) | 1995-10-11 | 1996-10-11 | Methode et dispositif de prediction de signal pour un codeur de parole |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP0796490B1 (ja) |
JP (1) | JPH11500837A (ja) |
DE (1) | DE69609592T2 (ja) |
WO (1) | WO1997014139A1 (ja) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0903729A2 (en) * | 1997-09-20 | 1999-03-24 | Matsushita Graphic Communication Systems, Inc. | Speech coding apparatus and pitch prediction method of input speech signal |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0296763A1 (en) * | 1987-06-26 | 1988-12-28 | AT&T Corp. | Code excited linear predictive vocoder and method of operation |
US5138661A (en) * | 1990-11-13 | 1992-08-11 | General Electric Company | Linear predictive codeword excited speech synthesizer |
-
1996
- 1996-10-11 EP EP96934902A patent/EP0796490B1/fr not_active Expired - Lifetime
- 1996-10-11 WO PCT/FR1996/001596 patent/WO1997014139A1/fr active IP Right Grant
- 1996-10-11 DE DE69609592T patent/DE69609592T2/de not_active Expired - Fee Related
- 1996-10-11 JP JP9514782A patent/JPH11500837A/ja active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0296763A1 (en) * | 1987-06-26 | 1988-12-28 | AT&T Corp. | Code excited linear predictive vocoder and method of operation |
US5138661A (en) * | 1990-11-13 | 1992-08-11 | General Electric Company | Linear predictive codeword excited speech synthesizer |
Non-Patent Citations (4)
Title |
---|
CHEN ET AL.: "Comparison of pitch prediction and adaptation algorithms in forward and backward adaptive CELP systems", IEE PROCEEDINGS I (COMMUNICATIONS, SPEECH AND VISION), vol. 140, no. 4, August 1993 (1993-08-01), STEVENAGE, GB, pages 240 - 245, XP000389911 * |
CUPERMAN: "Low delay speech coding", PROCEEDINGS OF THE ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, vol. 2, 4 November 1991 (1991-11-04) - 6 November 1991 (1991-11-06), PACIFIC GROVE, CA, US, pages 935 - 939, XP000314435 * |
GERSON ET AL.: "Techniques for improving the performance of CELP-type speech coders", IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, vol. 10, no. 5, 1 June 1992 (1992-06-01), NEW YORK, US, pages 858 - 865, XP000274720 * |
VEENEMAN ET AL.: "Efficient multi-tap pitch prediction for stochastic coding", SPEECH AND AUDIO CODING FOR WIRELESS AND NETWORK APPLICATIONS, 1 January 1993 (1993-01-01), BOSTON-DORDRECHT-LONDON, pages 225 - 229, XP000470445 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0903729A2 (en) * | 1997-09-20 | 1999-03-24 | Matsushita Graphic Communication Systems, Inc. | Speech coding apparatus and pitch prediction method of input speech signal |
EP0903729A3 (en) * | 1997-09-20 | 1999-12-29 | Matsushita Graphic Communication Systems, Inc. | Speech coding apparatus and pitch prediction method of input speech signal |
US6243673B1 (en) | 1997-09-20 | 2001-06-05 | Matsushita Graphic Communication Systems, Inc. | Speech coding apparatus and pitch prediction method of input speech signal |
Also Published As
Publication number | Publication date |
---|---|
EP0796490B1 (fr) | 2000-08-02 |
DE69609592T2 (de) | 2001-03-29 |
JPH11500837A (ja) | 1999-01-19 |
EP0796490A1 (fr) | 1997-09-24 |
DE69609592D1 (de) | 2000-09-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0749626B1 (fr) | Procede de codage de parole a prediction lineaire et excitation par codes algebriques | |
EP0768770B1 (fr) | Procédé et dispositif de création d'un bruit de confort dans un système de transmission numérique de parole | |
FR2742568A1 (fr) | Procede d'analyse par prediction lineaire d'un signal audiofrequence, et procedes de codage et de decodage d'un signal audiofrequence en comportant application | |
FR2731548A1 (fr) | Recherche profondeur d'abord dans un repertoire algebrique pour un encodage rapide de la paroie | |
EP0608174A1 (fr) | Systeme de codage-décodage prédictif d'un signal numérique de parole par transformée adaptative à codes imbriqués | |
WO2005066938A1 (fr) | Procede de codage multiple optimise | |
EP0428445B1 (fr) | Procédé et dispositif de codage de filtres prédicteurs de vocodeurs très bas débit | |
EP3580755A1 (fr) | Methode et appareil de modification dynamique du timbre de la voix par decalage en fréquence des formants d'une enveloppe spectrale | |
EP0481895B1 (fr) | Procédé de transmission, à bas débit, par codage CELP d'un signal de parole et système correspondant | |
EP1692687B1 (fr) | Transcodage entre indices de dictionnaires multi-impulsionnels utilises en codage en compression de signaux numeriques | |
EP1836699B1 (fr) | Procédé et dispositif de codage audio optimisé entre deux modèles de prediction à long terme | |
JPH1063297A (ja) | 音声符号化方法および装置 | |
FR2702075A1 (fr) | Procédé de génération d'un filtre de pondération spectrale du bruit dans un codeur de la parole. | |
EP0685833A1 (fr) | Procédé de codage de parole à prédiction linéaire | |
EP0796490B1 (fr) | Methode et dispositif de prediction de signal pour un codeur de parole | |
EP0573358B1 (fr) | Procédé et dispositif de synthèse vocale à vitesse variable | |
EP0616315A1 (fr) | Dispositif de codage et de décodage numérique de la parole, procédé d'exploration d'un dictionnaire pseudo-logarithmique de délais LTP, et procédé d'analyse LTP | |
JP3462464B2 (ja) | 音声符号化方法、音声復号化方法及び電子装置 | |
JPH0738116B2 (ja) | マルチパルス符号化装置 | |
WO2011144863A1 (fr) | Codage avec mise en forme du bruit dans un codeur hierarchique | |
EP1192619B1 (fr) | Codage et decodage audio par interpolation | |
EP0734013B1 (fr) | Determination d'un vecteur d'excitation dans un codeur CELP | |
EP1605440A1 (fr) | Procédé de séparation de signaux sources à partir d'un signal issu du mélange | |
EP0454552A2 (fr) | ProcédÀ© et dispositif de codage bas débit de la parole | |
FR2760285A1 (fr) | Procede et dispositif de generation d'un signal de bruit pour la sortie non vocale d'un signal decode de la parole |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): JP US |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1996934902 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref country code: JP Ref document number: 1997 514782 Kind code of ref document: A Format of ref document f/p: F Ref country code: US Ref document number: 1997 849847 Date of ref document: 19970609 Kind code of ref document: A Format of ref document f/p: F |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWP | Wipo information: published in national office |
Ref document number: 1996934902 Country of ref document: EP |
|
WWG | Wipo information: grant in national office |
Ref document number: 1996934902 Country of ref document: EP |