WO2005067150A1

WO2005067150A1 - Method for compressing audio signals without time constraint

Info

Publication number: WO2005067150A1
Application number: PCT/EP2004/053397
Authority: WO
Inventors: François CAPMAN; Bertrand Ravera
Original assignee: Thales
Priority date: 2003-12-12
Filing date: 2004-12-10
Publication date: 2005-07-21
Also published as: FR2863792A1; FR2863792B1

Abstract

The invention relates to a method for compressing/decompressing a signal or message comprising at least one generic parameter, in a compression/decompression system comprising a generic dictionary. Said method consists of the following steps: an original dictionary adapted to the message is determined from the totality or quasi-totality of the signal; an indexing table is created between the original dictionary adapted to the message and the generic dictionary of the compression/decompression system in order to obtain the dictionary adapted to the message; the parameters of the signal are quantified with the dictionary adapted to the message in order to obtain the indexes of the signal on the dictionary adapted to the message; the indexing table is transmitted to a decompression stage and the indexes of the signal are then transmitted to the dictionary adapted to the message; and the parameters of the signal are determined from the dictionary adapted to the reconstituted message and indexes of the signal transmitted to the decompression part. The invention can be applied to the compression/decompression of an audio signal.

Description

METHOD FOR COMPRESSING AUDIO SIGNALS WITHOUT DELAY CONSTRAINTS

The invention relates to a method for compressing audio signals without delay constraint. It applies for example for the compression of audio signals in a messaging system.

The standard compression scheme for audio signals is based on short-term modeling of this signal. This modeling makes it possible to obtain a set of optimal parameters which will be used during the decompression phase. The time constraint is always associated with this type of model. It does not allow to take into account, during the modeling, the strong correlations which exist between different zones of the audio signal. These areas are generally not located over a time interval shorter than the modeling horizon, the latter being itself always less than the characteristic delay of the compression system. The method according to the invention proposes a new method of compressing audio signals which in particular makes it possible to overcome the time constraint, or in other words, the limitation of the modeling horizon.

The invention relates to a compression / decompression method of a signal or message comprising one or more generic parameters, in a compression / decompression system comprising a generic dictionary characterized in that it comprises at least the following steps:

• Determine an original dictionary adapted to the message, DOAM, from the whole signal or almost all of it, • Create an indexing table between the DOAM and the generic dictionary DG, in order to obtain the dictionary adapted to the DAM message, • Quantify the parameters of the signal with the dictionary adapted to the message, in order to obtain the indices of the signal on the dictionary adapted to

5 message, • Transmit the indexing table to a decompression step and after transmitting the signal indices on the dictionary suitable for the message, • Determine the dictionary suitable for the signal from the generic dictionary and the indexing table for the decompression step and to use this DAM dictionary to quantify the signal indices in order to obtain the signal indexes on the generic dictionary, • Determine the signal parameters from the indexes on the generic dictionary. The method according to the invention has the following advantages in particular. It does not assume any strong limitation on the type of audio signal or message compression / decompression system in which it is implemented. It can thus be used in conjunction with most ^'existing systems that have a with0 dictionary quantization stage of part or all of the model parameters, excluding the prediction quantification methods. This or these dictionaries are qualified in the description of "generic dictionaries". 5 Other characteristics and advantages of the method according to the invention will appear better on reading the example given by way of indication and in no way limiting, appended to the figures which represent: • Figure 1 an example of a generation diagram of the adapted original dictionary to an audio signal, 0 • Figure 2 the diagram of the dictionary adapted to the audio signal, • Figure 3 an associated compression / decompression diagram. The example given below concerns the compression / decompression of an audio signal. The compression / decompression system notably has a set of generic parameters, for example short-term envelope filters, pitch, energy, etc. and a generic dictionary which is, in this case, the basic coder of the compression / decompression system. The idea implemented in the method according to the invention, consists in particular, in transmitting from the compression part of the system to the decompression part of the system, a first stream which includes the information necessary for coding and decoding the audio signal and a second stream which includes the parameters of the audio signal. FIG. 1 shows an example of steps prior to the execution of the steps of the method according to the invention. They consist, for example, in coding (step 1) the audio signal by the generic coder of the compression system, in order to obtain the parameters representative of the signal (step 2). These parameters are then transmitted, for example, to a segmentation step known to those skilled in the art. The implementation of the method according to the invention comprises, in relation to FIGS. 1, 2 and 3, at least the following steps:

1. Creation of the original dictionary adapted to the message. Compression part of the system (figure 1). From the audio signal, determine the parameters of the short-term model, taking into account all or almost all of the audio signal. This step 3 is carried out, for example, by quantifying the parameters, using a conventional method known to those skilled in the art. The method then has a set of parameters representative of the audio signal. The long-term correlation of all the parameters makes it possible to create a dictionary of parameters (step 4) representative of the audio signal. This reduced-size dictionary is designated by the expression “original dictionary adapted to the message” or DOAM. 2. Obtaining the dictionary adapted to the message (Figure 2). The following step 5 consists, for example, in representing the original dictionary adapted to the message in the space of the generic parameters of the compression / decompression system. For this, the method creates an indexing table 6 between the original dictionary adapted to the message 5 and the generic dictionary 7 or DG. This indexing table makes it possible to obtain the dictionary qualified as “dictionary adapted to the message” 8. In this example of application for an audio signal, the vectors s (i) dιco_quant of the dictionary adapted to the message 8 are obtained by searching for the best representatives of the vectors s (o) _of ιco among the vectors s (j) _C smell of generic dictionary 7. to do this, a distortion criterion may be used. Only the vector indexes of the generic dictionary are saved in the dictionary suitable for the message. The method has the dictionary adapted to the message which it then uses to quantify the parameters of the audio signal as shown in FIG. 3. 3. Implementation of a specific compression scheme i The following step 6 consists then quantifying the parameters of the audio signal with the dictionary adapted to the message 8 in order to obtain a set of indexes 9 (or indices) on the dictionary adapted to the message. The indexes on the dictionary adapted to the signal are transmitted to the decompression part of the system. In order to be able to apply the reverse operation during the decompression step, the indexing table or index table is transmitted to the decompression device before the indexes relating to the parameters (FIG. 3). At the compression part of the system, the dictionary adapted to the message is obtained from the index table itself resulting from the generic dictionary and from the original dictionary adapted to the message. For the decompression part of the system, it is from the index table and the generic dictionary that the dictionary adapted to the message is reconstructed.

4. Implementation of a specific decompression scheme At the decompression level, the first step of the process is to transform the indexes of the audio signal on the dictionary adapted to the message into index 10 on the generic dictionary of the system. This is achieved, for example, by reverse quantization using the dictionary adapted to the message reconstructed from the transmitted indexing table and the generic dictionary. These indexes on the generic dictionary are then used to produce generic parameters 1 1 necessary for the decompression system, for example by using a quantification method known to those skilled in the art. These generic parameters are then used to obtain the original audio signal.

The example which follows illustrates the implementation of the method for a signal resulting from a recording of a radio program broadcasting news. The typology of the audio signal is not unique. We find undegraded speech (taken from his studio), degraded speech (telephone interview or disturbing sound environment), music (jingle) and a mixture of speech and music (sound marker). The generic compression / decompression system is a proprietary system with a throughput of 3600 bits / s. The audio test material consists of 15 minutes of recording. The audio signal is sampled at 16 KHz. The generic parameters retained for this example of implementation are the LSF (English abbreviation of Une Spectral Frequency). The coding rate for these generic parameters is 2666bits / s (60 bits per 22.5 ms frame). The original dictionary adapted to the message is obtained by applying vector quantization processing to LSF vectors calculated every 22.5 ms and has 1024 vectors (for 10-bit quantization). The dictionary adapted to the message is constructed by looking for the best representatives of the original vectors adapted to the audio signal in the generic dictionary. At the end of this step, the indexing table is determined and known to the compression system. The LSF parameters are then coded with the dictionary adapted to the message. The method transmits to the decompression system, the indexing table and then the indexes on the dictionary adapted to the message (included in this case of application between 0 and 1023). The other parameters of the audio signal are calculated according to the method specific to the generic system. The decompression system receives the indexing table and the indexes on the dictionary adapted to the message. By inverse quantization, the indexes on the dictionary adapted to the message are transmitted as an index on the generic dictionary of the compression / decompression system.

, • 4. The method according to the invention also applies to MMS message type signals, etc.

Claims

1- A method of compressing / decompressing an audio signal comprising one or more generic parameters, in a compression / decompression system comprising a generic dictionary characterized in that it comprises at least the following steps: • Determining an original dictionary adapted to the message (5) from all or almost all of the signal,

• Create an indexing table between the original dictionary adapted to the message (5) and the generic dictionary (7) of the compression / decompression system, in order to obtain the dictionary adapted to the message (8),

• Quantify the signal parameters with the dictionary adapted to the message (8), in order to obtain the indices (9) of the signal on the dictionary adapted to the message,

• Transmit the index table (6) to a decompression step and then transmit the indices (9) of the signal on the dictionary adapted to the message,

• Determine the parameters of the signal from the dictionary adapted to the reconstructed message and the indexes of the signal transmitted to the decompression part.

2 - Compression / decompression method according to claim 1 characterized in that the determination of the parameters at the decompression level comprises at least the following steps:

• Determine the dictionary suitable for the message (5) from the generic dictionary (7) and the indexing table (6) for the decompression step and use this dictionary adapted to the message for quantify the signal indexes in order to obtain the signal indexes on the generic dictionary, • Determine the signal parameters from the indexes on the generic dictionary.

3 - A compression / decompression method according to claim 1 characterized in that it uses a speech signal where the generic parameters are: short-term envelope filters, pitch, energy, etc.

4 - Method of compression / decompression according to claim 1 characterized in that the generic parameters are the LSF and the quantification step is carried out by vector quantization processing of the LSF vectors.

5 - Method of compression / decompression according to claim 1 characterized in that the dictionary adapted to the message (5) is constructed by seeking the best representatives of the original vectors adapted to the audio signal in the generic dictionary.

6 - A compression / decompression method according to claim 1 characterized in that the message is an MMS type message.

7 - A compression / decompression method according to claim 1 characterized in that the indexing table (6) and the signal indices (9) are transmitted via an MMS (Multimedia Messaging Service) type protocol.