WO1994010682A1 - Method for speech coding (Verfahren zur Sprachcodierung) - Google Patents
- Publication number
- WO1994010682A1 (PCT/DE1993/000999)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- speech
- quantized
- lsp
- frame
- coefficients
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
- G10L19/07—Line spectrum pair [LSP] vocoders
Definitions
- The invention is based on a method for speech coding using the analysis-by-synthesis method according to the preamble of claims 1 and 2, respectively.
- Speech coding methods are known, for example from German Patent 38 34 871.
- Speech coding methods have one thing in common: in the encoder, the speech signal is divided into frames of a certain duration, e.g. 20-30 ms. Each speech frame is subjected to a linear prediction analysis in the encoder, which removes linear dependencies in the speech signal.
- The linear prediction is carried out with the help of FIR (Finite Impulse Response) filters.
- The filter coefficients are redetermined for each frame, i.e. these are adaptive filters.
- Today's speech coders, which operate at bit rates between 4 and 16 kbit/s, generally use the analysis-by-synthesis method, in which the filter coefficients described above and an associated excitation are determined in the transmitter so that the energy of the weighted error e(n) between the original speech and the synthesized speech is as small as possible.
- The filter coefficients a_i have a large dynamic range and are therefore poorly suited for quantization and transmission. Besides, there is no easy …
- The zeros z_0i of F_1 and F_2 have the following properties: all zeros lie on the unit circle, so they are adequately described by specifying a phase ω_i; all zeros are simple.
- The polynomials F_1 and F_2 are determined by specifying P values ω_i.
- A common method is the scalar quantization of each individual LSP. For example, in the 4.8 kbit/s CELP speech codec according to Federal Standard 1016 of the US Department of Defense, the line spectrum parameters are scalar quantized with a total of 34 bits.
- … quantizer no longer permissible for ω_i. This means that some of the bits available for quantizing the LSP parameters are not fully used. According to FIG. 3 there are 8 possible steps for ω_(i+1) …
- Another disadvantage of this method is that adaptation to different input spectra of the speech signal is not possible. If the quantizer is to be made usable for this purpose, the range of values for individual line spectrum parameters increases, which leads to an increase in the bit rate.
- References [5] and [6] suggest reducing the bit rate for the transmission of the line spectrum parameters by quantizing their differences.
- The first LSP is scalar quantized as above.
- The present invention was based on the object of achieving a reduced sensitivity of the speech codec to speech signals with different input characteristics. The circuit complexity required for this should not be too high.
- The method according to the invention reduces the sensitivity of the speech codec to speech signals with very different input spectra. Another advantage is that a transmission error in one LSP affects at most two further LSP values.
- The invention is based on the idea of neither scalar quantizing all LSP parameters nor scalar quantizing only a single one of the P parameters, but rather scalar quantizing only every nth of the P parameters and the ones in between …
- Every second LSP is scalar quantized.
- Every third LSP is scalar quantized.
- Mapping functions for the parameters in between are, for example …
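The idea of scalar quantizing only every nth LSP can be illustrated with a minimal Python sketch. It is not the patent's actual scheme: the mapping function for the parameters in between is cut off in the text above, so the sketch assumes one plausible choice, namely normalizing each in-between LSP to the interval spanned by its two decoded neighbouring anchors (0 and π at the edges) and quantizing that normalized value uniformly. The function names, the uniform grids, and the bit allocations (4 bits per anchor, 3 bits per in-between value) are all hypothetical.

```python
import math

def quantize(x, lo, hi, bits):
    """Index of x on a uniform grid with 2**bits levels spanning [lo, hi]."""
    levels = 2 ** bits
    if hi <= lo:                          # degenerate interval: single value
        return 0
    idx = round((x - lo) / (hi - lo) * (levels - 1))
    return min(max(idx, 0), levels - 1)   # clip values outside [lo, hi]

def dequantize(idx, lo, hi, bits):
    """Reconstruction value belonging to a grid index."""
    return min(hi, lo + (hi - lo) * idx / (2 ** bits - 1))

def bounds(i, n, P, decoded):
    """Decoded anchors enclosing in-between parameter i (0 and pi at the edges)."""
    below = (i + 1) // n * n - 1          # nearest anchor index below i
    above = below + n                     # nearest anchor index above i
    lo = decoded[below] if below >= 0 else 0.0
    hi = decoded[above] if above < P else math.pi
    return lo, hi

def decode_lsp(indices, n=2, anchor_bits=4, between_bits=3):
    """Rebuild LSP angles from indices; every nth value is an absolute anchor."""
    P = len(indices)
    decoded = [0.0] * P
    for i in range(n - 1, P, n):          # anchors: absolute grid on [0, pi]
        decoded[i] = dequantize(indices[i], 0.0, math.pi, anchor_bits)
    for i in range(P):
        if (i + 1) % n:                   # in between: relative to the anchors
            lo, hi = bounds(i, n, P, decoded)
            decoded[i] = dequantize(indices[i], lo, hi, between_bits)
    return decoded

def encode_lsp(lsp, n=2, anchor_bits=4, between_bits=3):
    """Quantize sorted LSP angles (radians in [0, pi]) to grid indices."""
    P = len(lsp)
    indices = [0] * P
    for i in range(n - 1, P, n):
        indices[i] = quantize(lsp[i], 0.0, math.pi, anchor_bits)
    # Use the anchors exactly as the decoder will see them.
    decoded = decode_lsp(indices, n, anchor_bits, between_bits)
    for i in range(P):
        if (i + 1) % n:
            lo, hi = bounds(i, n, P, decoded)
            indices[i] = quantize(lsp[i], lo, hi, between_bits)
    return indices

# Illustrative (made-up) LSP vector with P = 10 values.
lsp = [0.25, 0.42, 0.68, 1.02, 1.31, 1.60, 1.95, 2.25, 2.55, 2.85]
indices = encode_lsp(lsp, n=2)
print(indices)
print([round(v, 3) for v in decode_lsp(indices, n=2)])
```

In this sketch each in-between value depends only on its two enclosing anchors, so a corrupted anchor index disturbs at most the in-between values on either side of it, and the decoded values stay in ascending order because every in-between value is confined to the interval between its anchors.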
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU51742/93A AU5174293A (en) | 1992-10-28 | 1993-10-20 | Method of encoding speech |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DEP4236315.2 | 1992-10-28 | ||
DE19924236315 DE4236315C1 (de) | 1992-10-28 | 1992-10-28 | Method for speech coding |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1994010682A1 (de) | 1994-05-11 |
Family
ID=6471507
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/DE1993/000999 WO1994010682A1 (de) | Method for speech coding | 1992-10-28 | 1993-10-20 |
Country Status (3)
Country | Link |
---|---|
AU (1) | AU5174293A (de) |
DE (1) | DE4236315C1 (de) |
WO (1) | WO1994010682A1 (de) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7003454B2 (en) * | 2001-05-16 | 2006-02-21 | Nokia Corporation | Method and system for line spectral frequency vector quantization in speech codec |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4975956A (en) * | 1989-07-26 | 1990-12-04 | Itt Corporation | Low-bit-rate speech coder using LPC data reduction processing |
US5012518A (en) * | 1989-07-26 | 1991-04-30 | Itt Corporation | Low-bit-rate speech coder using LPC data reduction processing |
GB2240013A (en) * | 1989-12-22 | 1991-07-17 | Ericsson Ge Mobile Communicat | Error protection for multi-pulse speech coders |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE3834871C1 (en) * | 1988-10-13 | 1989-12-14 | Ant Nachrichtentechnik Gmbh, 7150 Backnang, De | Method for encoding speech |
CA2054849C (en) * | 1990-11-02 | 1996-03-12 | Kazunori Ozawa | Speech parameter encoding method capable of transmitting a spectrum parameter at a reduced number of bits |
1992
- 1992-10-28: DE application DE19924236315, patent DE4236315C1 (de), not active (Expired - Fee Related)
1993
- 1993-10-20: WO application PCT/DE1993/000999, patent WO1994010682A1 (de), active (Application Filing)
- 1993-10-20: AU application AU51742/93A, patent AU5174293A (en), not active (Abandoned)
Also Published As
Publication number | Publication date |
---|---|
DE4236315C1 (de) | 1994-02-10 |
AU5174293A (en) | 1994-05-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1979901B1 (de) | | Methods and arrangements for audio signal encoding |
DE3639753C2 (de) | | |
EP0193143B1 (de) | | Method for transmitting an audio signal |
DE69915400T2 (de) | | Device for encoding and decoding audio signals |
EP2022043B1 (de) | | Information signal coding |
DE60012198T2 (de) | | Encoding the spectral envelope using variable time/frequency resolution |
DE60117471T2 (de) | | Wideband signal transmission system |
DE60319590T2 (de) | | Method for variable-rate audio encoding and decoding |
DE60121592T2 (de) | | Encoding and decoding of a digital signal |
DE60012760T2 (de) | | Multimodal speech coder |
EP0954909A1 (de) | | Method for encoding an audio signal |
EP0978172B1 (de) | | Method for concealing errors in an audio data stream |
DE19811039A1 (de) | | Methods and devices for encoding and decoding audio signals |
WO2006114368A1 (de) | | Method and device for noise suppression |
DE69820362T2 (de) | | Nonlinear filter for noise suppression in linear prediction speech coding devices |
DE69828709T2 (de) | | Increasing the density of coded speech signals |
DE60124079T2 (de) | | Speech processing |
EP0962015A1 (de) | | Methods and devices for encoding discrete signals and for decoding encoded discrete signals |
EP1023777B1 (de) | | Method and device for generating a bit-rate-scalable audio data stream |
EP0464534B1 (de) | | Transform coder with adaptive window function |
DE602004004445T2 (de) | | Devices for compressing and decompressing speech and methods for providing scalable bandwidth structures |
DE69820515T2 (de) | | Device for speech coding using a multipulse excitation signal |
DE2303497C2 (de) | | Method for transmitting speech signals |
WO1994010682A1 (de) | | Method for speech coding |
DE19742201C1 (de) | | Method and device for encoding audio signals |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | AK | Designated states | Kind code of ref document: A1; Designated state(s): AU CA FI JP US |
 | AL | Designated countries for regional patents | Kind code of ref document: A1; Designated state(s): AT BE CH DE DK ES FR GB GR IE IT LU MC NL PT SE |
 | DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed before 20040101) | |
 | 121 | EP: the EPO has been informed by WIPO that EP was designated in this application | |
 | WWE | WIPO information: entry into national phase | Ref document number: 1993922885; Country of ref document: EP |
 | ENP | Entry into the national phase | Ref country code: US; Ref document number: 1995 424446; Date of ref document: 19950428; Kind code of ref document: A; Format of ref document f/p: F |
 | WWW | WIPO information: withdrawn in national office | Ref document number: 1993922885; Country of ref document: EP |
 | NENP | Non-entry into the national phase | Ref country code: CA |
 | 122 | EP: PCT application non-entry in European phase | |