EP1141944A2 - Method and arrangement for finding an optimal reconstruction point - Google Patents
Method and arrangement for finding an optimal reconstruction pointInfo
- Publication number
- EP1141944A2 EP1141944A2 EP99964900A EP99964900A EP1141944A2 EP 1141944 A2 EP1141944 A2 EP 1141944A2 EP 99964900 A EP99964900 A EP 99964900A EP 99964900 A EP99964900 A EP 99964900A EP 1141944 A2 EP1141944 A2 EP 1141944A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- reconstruction
- point
- distance
- points
- reconstruction points
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000000034 method Methods 0.000 title claims description 9
- 230000001131 transforming effect Effects 0.000 claims description 2
- 238000013139 quantization Methods 0.000 abstract description 9
- 230000002596 correlated effect Effects 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- 230000001419 dependent effect Effects 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
Definitions
- the invention relates generally to vector quantization and, more specifically, to a method and an arrangement for finding an optimal reconstruction point for an N- dimensional point to be quantized.
- Vector quantization is computationally complex in the sense that a search for the optimal vector has to be carried out.
- the optimal vector is usually defined as the vector giving the least residual error. Solutions with a "nearest neighbor" list or similar approaches to find the reconstruction points will reduce the complexity of the search.
- One technical field where vector quantization is used is speech coding, where several correlated gain parameters are jointly quantized.
- All CELP (Code Excited Linear Prediction) coders of today minimize the mean square error in the weighted speech domain when quantizing many of the parameters.
- the correlated gain parameters should also be quantized with regard to the distortion in the weighted speech domain and not to the traditional distortion in the gain domain. This makes it impossible to create a "nearest neighbor" list since this list with this distortion measure, is signal dependent, i.e. depends on other values than the ones to be quantized.
- the distortion measure is signal dependent, all reconstruction points have to be searched in order to find the best reconstruction point.
- D-AMPS Digital American Mobile Phone System
- the complexity of the search will be high since all reconstruction points have to be searched. It will be especially complex for the above mentioned CELP coders, since the distortion calculation itself is complex.
- the object of the invention is to reduce the complexity of the current methods and apparatuses for finding reconstruction points in connection with vector quantization.
- Fig. 1 illustrates an example of reconstruction points for a 2-dimensional vector quantizer, where the values in the two dimensions are strongly correlated
- Fig. 1 illustrates a relationship between predicted values and sorted reconstruction points for the reconstruction points illustrated in Fig. 1,
- FIG. 3 illustrates another example of reconstruction points for a 2-dimensional vector quantizer, where the values in the two dimensions are evenly distributed, and
- - Fig. 4 illustrates a further example of reconstruction points for a 2-dimensional vector quantizer, where the values in the two dimensions are grouped along the border of a circle.
- Fig. 1 illustrates an example of reconstruction points for transformed gain parameters resulting from a speech codec.
- the transformation has been done to reduce the variance of the gain. This reduced variance makes it possible to construct a quantizer that introduces less distortion. As apparent from Fig. 1, the two different gains are strongly correlated.
- the reconstruction points in Fig. 1 are sorted in advance by ordering them by their distance from a reference point in an N- dimensional space.
- the reconstruction points for correlated data are grouped around a line or curve in the N-dimensional space, it is possible to create a cross-reference list that makes it possible to access a set of reconstruction points surrounding an arbitrarily predicted reconstruction point.
- the reconstruction points are grouped around a line in the N-dimensional space as in Fig. 1, it is possible to create the cross- reference list by sorting the reconstruction points by their distance to the reference point at one end of the line.
- a transfer function for transforming the prediction value, in this example the "distance to reference point", into “sorted reconstruction point” is shown.
- the transfer function in Fig. 2 which is created in advance, is used to predict a reconstruction point, called “sorted VQ index” in Fig. 2, from the "distance to reference point” as will be described below.
- the transfer function is created by calculating the distance to the reference point, as described below, for each sorted reconstruction point.
- the transfer function in Fig. 2 can very well be represented by two or three linearized first degree equations, giving a low complexity transformation function from "distance to reference point” into “sorted reconstruction point” .
- finding of an optimal reconstruction point for an N-dimensional point to be quantized, where the N-dimensional point corresponds to N parameters extracted from an input signal is done in the following way:
- N is the dimension of the space
- X t is the unquantized value in the i:th dimension
- Refi is the reference point in the i:th dimension.
- the size of the selected set of reconstruction points should be chosen large enough so that the optimal reconstruction point always exists within the selected set.
- the size of the selected set of reconstruction points may in fact be predetermined.
- the distortion value represents the difference between an original signal, e.g. a speech signal coming in to a speech encoder, and a reconstructed signal, e.g. an output speech signal from the speech encoder.
- the search complexity will be reduced by approximately 80% since only a small part of the total number of reconstruction points has to be searched.
- the complexity can be further reduced if it can be accepted that the optimal reconstruction point is not always used. This further reduction of complexity can be achieved by reducing the selected set of reconstruction points.
- the only cost for the prediction is a small memory increase required for storing the cross reference list used to access the reconstruction points sorted according to the distance measure.
- This method is called index assignment. If index assignment is unnecessary for the reconstruction points, the cross reference list is not needed since the reconstruction points can be ordered in this way from the beginning. Thus, no extra memory is required.
- the reconstruction points are grouped almost evenly in a square in a two dimensional space.
- the reference point is therefore placed in a smaller dimension where it is possible to find correlation. In this example, this is the first dimensional space.
- the reference point may be placed at the x-mark to the right, and only the distance in the horizontal plane would be used. This will lead to that the selected set of reconstruction points will be larger than it would have been, had there been a correlation between the values in the different dimensions. However, the saving in complexity will still be considerable.
- the reconstruction points are approximately placed along the border of a circle.
- the "distance" can be the angular distance, i.e. the angle, to the predicted reconstruction point. If the predicted reconstruction point in this case has an angle close to 0 or 2 ⁇ radians, the selected set of reconstruction points must include reconstruction points that have an angle close to both 0 and 2 ⁇ . This will lead to a low complexity search also when the reconstruction points are placed in this manner.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SE9804455A SE9804455L (en) | 1998-12-21 | 1998-12-21 | Procedure and arrangement for finding an optimal reconstruction point |
SE9804455 | 1998-12-21 | ||
PCT/SE1999/002404 WO2000038176A2 (en) | 1998-12-21 | 1999-12-17 | Method and arrangement for finding an optimal reconstruction point |
Publications (1)
Publication Number | Publication Date |
---|---|
EP1141944A2 true EP1141944A2 (en) | 2001-10-10 |
Family
ID=20413781
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP99964900A Withdrawn EP1141944A2 (en) | 1998-12-21 | 1999-12-17 | Method and arrangement for finding an optimal reconstruction point |
Country Status (6)
Country | Link |
---|---|
EP (1) | EP1141944A2 (en) |
JP (1) | JP2002533962A (en) |
AU (1) | AU3093000A (en) |
CA (1) | CA2356152A1 (en) |
SE (1) | SE9804455L (en) |
WO (1) | WO2000038176A2 (en) |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4670851A (en) * | 1984-01-09 | 1987-06-02 | Mitsubishi Denki Kabushiki Kaisha | Vector quantizer |
US5077798A (en) * | 1988-09-28 | 1991-12-31 | Hitachi, Ltd. | Method and system for voice coding based on vector quantization |
JP3151874B2 (en) * | 1991-02-26 | 2001-04-03 | 日本電気株式会社 | Voice parameter coding method and apparatus |
-
1998
- 1998-12-21 SE SE9804455A patent/SE9804455L/en not_active Application Discontinuation
-
1999
- 1999-12-17 AU AU30930/00A patent/AU3093000A/en not_active Abandoned
- 1999-12-17 WO PCT/SE1999/002404 patent/WO2000038176A2/en not_active Application Discontinuation
- 1999-12-17 JP JP2000590161A patent/JP2002533962A/en active Pending
- 1999-12-17 EP EP99964900A patent/EP1141944A2/en not_active Withdrawn
- 1999-12-17 CA CA002356152A patent/CA2356152A1/en not_active Abandoned
Non-Patent Citations (1)
Title |
---|
See references of WO0038176A3 * |
Also Published As
Publication number | Publication date |
---|---|
AU3093000A (en) | 2000-07-12 |
CA2356152A1 (en) | 2000-06-29 |
WO2000038176A2 (en) | 2000-06-29 |
WO2000038176A3 (en) | 2000-10-19 |
JP2002533962A (en) | 2002-10-08 |
SE9804455L (en) | 2000-06-22 |
SE9804455D0 (en) | 1998-12-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6885988B2 (en) | Bit error concealment methods for speech coding | |
US7325023B2 (en) | Method of making a window type decision based on MDCT data in audio encoding | |
EP2915166B1 (en) | A method and apparatus for resilient vector quantization | |
EP2441049A2 (en) | Method for encoding/decoding a 3d mesh model that comprises one or more components | |
CN109313908A (en) | Audio coder for being encoded to audio signal, the method for being encoded to audio signal and the computer program for considering the spike spectral regions detected in upper frequency band | |
JP2004526213A (en) | Method and system for line spectral frequency vector quantization in speech codecs | |
US5487087A (en) | Signal quantizer with reduced output fluctuation | |
US7283968B2 (en) | Method for grouping short windows in audio encoding | |
US5748839A (en) | Quantization of input vectors and without rearrangement of vector elements of a candidate vector | |
US7739119B2 (en) | Technique for implementing Huffman decoding | |
Slyz et al. | A nonlinear VQ-based predictive lossless image coder | |
EP1141944A2 (en) | Method and arrangement for finding an optimal reconstruction point | |
KR950003557B1 (en) | Encoding method of voice sample and signal sample | |
US20110112841A1 (en) | Apparatus | |
Ho et al. | A variable-rate image coding scheme with vector quantization and clustering interpolation | |
RU2461079C2 (en) | Simplified vector indexing and deindexing | |
JPH03171828A (en) | Compression encoder | |
KR20180026528A (en) | A bit error detector for an audio signal decoder | |
JP2890522B2 (en) | Method and apparatus for adaptive transform coding | |
Bruhn et al. | The FlexCode speech and audio coding approach | |
JP2638209B2 (en) | Method and apparatus for adaptive transform coding | |
JP2923996B2 (en) | Method and apparatus for adaptive transform coding | |
Torres et al. | Improvements on stochastic vector quantization of images | |
Merazka et al. | Robust split vector quantization of LSP parameters at low bit rates | |
Fonollosa et al. | Adaptive multistage vector quantization |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20010613 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
|
AX | Request for extension of the european patent |
Free format text: AL;LT;LV;MK;RO;SI |
|
RIN1 | Information on inventor provided before grant (corrected) |
Inventor name: LINDQVIST, MORGAN |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL) |
|
RBV | Designated contracting states (corrected) |
Designated state(s): DE GB |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20040701 |