EP1794748A1 - Data processing method by passage between different subband domains - Google Patents

Data processing method by passage between different subband domains

Info

Publication number
EP1794748A1
EP1794748A1 EP05798240A EP05798240A EP1794748A1 EP 1794748 A1 EP1794748 A1 EP 1794748A1 EP 05798240 A EP05798240 A EP 05798240A EP 05798240 A EP05798240 A EP 05798240A EP 1794748 A1 EP1794748 A1 EP 1794748A1
Authority
EP
European Patent Office
Prior art keywords
vector
matrix
filters
synthesis
analysis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP05798240A
Other languages
English (en)
French (fr)
Other versions
EP1794748B1 (de)
Inventor
Abdellatif Benjelloun Touimi
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Orange SA
Original Assignee
France Telecom SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by France Telecom SA
Publication of EP1794748A1
Application granted
Publication of EP1794748B1
Not-in-force
Anticipated expiration

Links

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/173 Transcoding, i.e. converting between two coded representations avoiding cascaded coding-decoding

Definitions

  • the present invention relates to data processing by passing between different subband domains, including, but not exclusively, for transcoding between two types of compression coding / decoding.
  • One of the main problems due to the heterogeneity of terminals concerns the diversity of coding formats that they are able to interpret.
  • One possible solution would be to recover the capabilities of the terminal before delivering the content in a compatible format. This solution may be more or less effective depending on the delivery scenario of the multimedia content considered
  • transcoding or changing the encoding format
  • This operation can take place at different levels of the transmission chain. It can intervene at the server to change the format of content previously stored for example in a database, or intervene in a gateway in the network, or other.
  • a common and straightforward method of transcoding is to decode the content and recode it to obtain a representation in the new encoding format.
  • This method generally has the drawbacks of using a large computing power, of increasing the algorithmic delay due to the processing and sometimes of adding further degradation of the perceptual quality of the multimedia signal. These settings are very important in multimedia applications. Their improvement
  • This type of transcoding consists in performing a partial decoding, as minimal as possible, of the initial coding format to extract the parameters allowing the reconstruction of the new coding format.
  • When this process succeeds, it can reduce algorithmic complexity and delay and maintain or even improve perceptual quality.
  • audio transcoding is a definition of audio transcoding and the main problems that arise after a brief review of the principles of perceptual audio coding in subbands.
  • audio coders can be manufacturer-specific (or "proprietary"), or standardized by decision of international organizations. In addition, they all have a common basic structure and are based on the same principles.
  • the basic principle of perceptual frequency audio coding is to reduce the flow of information by exploiting the properties of the human hearing system.
  • the irrelevant components of the audio signal are eliminated.
  • This operation uses the phenomenon called "masking". Since the description of this masking effect is mainly in the frequency domain, the representation of the signal is carried out in the frequency domain.
  • FIG. 2a the input digital audio signal Se is first decomposed by an analysis filter bank 20.
  • the resulting spectral components are then quantized and then coded by the module 22.
  • Quantization uses the result of a perceptual model 24 so that the noise that results from the treatment is inaudible.
  • a multiplexing of the different coded parameters is performed by the module 26 and an audio frame Sc is thus constructed.
  • the decoding is done in a dual manner. After demultiplexing the audio frame by the module 21, the various parameters are decoded and the spectral components of the signal are de-quantized by the module 23.
  • the temporal audio signal is reconstituted by the synthesis filterbank 25.
  • the first stage of any perceptual audio coding system therefore consists of an analysis filter bank 20 used for the time / frequency transformation.
  • filter banks and transforms have been developed and exploited in audio coders.
  • pseudo-QMF filter banks, hybrid filter banks, and MDCT transform banks can be mentioned.
  • the MDCT transform is currently proving to be the most effective in this context. It is the basis of the latest and most advanced audio coding algorithms, such as those used for MPEG-4 AAC, TwinVQ and BSAC, Dolby AC-3, the TDAC encoder / decoder (for "Time Domain Aliasing Cancellation") of France Telecom, and ITU-T G.722.1.
  • Modulated cosine filter banks orthogonal transforms (or "LOT" for "Lapped Orthogonal Transform ") and more generally for the banks of maximum decimation filters, that is to say to critical sampling.
  • Critical sampling property for a filter bank is that the subsampling / oversampling factor is equal to the number of subbands.
  • FIGS. 3a and 3b respectively illustrate the conventional transcoding and intelligent transcoding schemes in a communication chain, between a coder CO1 according to a first coding format and a decoder DEC2 according to a second coding format.
  • In conventional transcoding, a complete decoding operation is carried out by the decoder module DEC1 according to the first format (FIG. 3a), followed by recoding by the encoder module CO2 according to the second format, to finally arrive at the second coding format.
  • FIG. 4 shows the details of the operations that are merged by the implementation of intelligent transcoding. It mainly involves integrating the functional blocks of the synthesis filter banks BS1 and the BA2 analysis filter banks of the conventional transcoding into a system for direct conversion between subband domains in the module 31.
  • Table 1 gives a summary of the types of filter banks used in the best-known transform audio coders, as well as their characteristics. As can be seen, in addition to the MDCT transform, which is the most used, there are pseudo-QMF banks. Moreover, they all belong to the family of maximally decimated, cosine-modulated banks satisfying the perfect reconstruction property exactly or almost exactly.
  • Table 1 The most used filter banks in audio coding and their characteristics.
  • Table 2 below shows some types of subband coding in Table 1, detailing some of their applications.
  • Table 2 Examples of subband coders for audio and speech signals and some examples of their main applications.
  • US-6,134,523 discloses a rate reduction method in the coded domain for audio signals coded in MPEG-1 Layer I or II. Although this method is similar to audio transcoding methods, it does not make any change between coding formats, and the subband signals remain in the representation of the same transformed domain, namely that of the pseudo-QMF filter bank. Here, the signals are simply re-quantized according to a new bit allocation. Moreover, in US-2003/0149559, a method is proposed for reducing the complexity of the psychoacoustic model during a transcoding operation.
  • the new system uses values stored in a database of distortion templates. Even if this method deals with a transcoding problem, it remains far from the objectives relating to the passage between filter bank domains.
  • This technique of the prior art can be applied only to this particular case of transcoding.
  • This technique does not really perform a conversion to a different new subband domain. It merely cascades a new analysis filter bank, which makes it possible to increase the frequency resolution.
  • TDF Transform Domain Filtering
  • TDRT Transform-Domain Resolution Translation
  • DCT for "Discrete Cosine Transform"
  • MLT Modulated Lapped Transform
  • This publication discloses an efficient structure for implementing a system consisting of an L-subband synthesis filter bank followed by an M-subband analysis filter bank, where M and L are multiples of one another.
  • This structure is effective for implementation in VLSI ("Very Large Scale Integration") or FPGA ("Field Programmable Gate Array”) or parallel processors. It requires fewer logic blocks, low power consumption and allows the degree of parallelism to be extended.
  • the proposed method is applicable in situations where subband-based processing follows another subband treatment and where the synthesized intermediate signal is not needed.
  • TDM to FDM trans-multiplexing for "Time Domain Multiplexing” to "Frequency Domain Multiplexing"
  • a synthesis filter bank is used to reconstruct the interleaved time signals (that is to say perform the operation of reverse multiplexing from FDM to TDM).
  • an analysis filter bank is used to reconstruct the interleaved time signals (that is to say perform the operation of reverse multiplexing from FDM to TDM).
  • the structure of the TDM->FDM->TDM system thus amounts to a cascade of a synthesis filter bank and an analysis filter bank, which corresponds to what is also used in a conventional transcoding system.
  • the problem generally posed in these trans-multiplexing systems is to reconstruct the original signals without distortions after the TDM->FDM-> TDM operation.
  • the present invention improves the situation with respect to the state of the art presented above.
  • the present invention proposes in particular, but not exclusively, as will be seen below, a transcoding of a first type of coding, any, to a second type of coding, any.
  • the respective numbers M and L of subbands are any natural numbers and, in the most general case, are not necessarily linked by a proportionality relation.
  • the method in the sense of the invention can advantageously be applied to the transcoding of a first type of encoding / decoding in compression to at least a second type of encoding / decoding in compression.
  • This application typically consists in compacting in the same treatment the following steps:
  • the present invention also relates to a computer program product, intended to be stored in a memory of a device in a communication network, such as a server, a gateway, or a terminal, and including instructions for the implementation of all or part of the method according to the invention.
  • the present invention also relates to equipment such as a server, a gateway, or a terminal for a communication network, and comprising computer resources for implementing the method according to the invention.
  • FIGS. 2a and 2b showing the block diagrams of a perceptual frequency audio compression system, respectively to coding and decoding
  • FIGS. 3a and 3b schematically illustrating communication chains using conventional transcoding and intelligent transcoding, respectively, and
  • FIG. 4 represents the block diagrams illustrating the conventional transcoding (upper part of the figure) and the intelligent transcoding (lower part of the figure), described above,
  • FIGS. 5a and 5b schematically represent the block diagrams defining the equivalence between the synthesis of the temporal signal and then the analysis with a new bank of filters (FIG. 5a) and the direct conversion between two domains of the subbands (FIG. 5b).
  • FIG. 6 illustrates a representation in multirate blocks of the conventional conversion between subband domains
  • FIG. 7 is a multirate representation of the subband domain conversion system, in the sense of the invention
  • FIG. 8 schematically summarizes the filtering method in a conversion system, within the meaning of the invention
  • FIG. 14 is a representation of the conversion system in the case M = pL as an LPTV system, with an input rate different from the output rate
  • FIG. 15 is a representation of the conversion system within the meaning of the invention, as an LPTV system, in the general case where M and L are not linked by a particular relationship of proportionality,
  • FIG. 18 illustrates the conversion system within the meaning of the invention in an embodiment corresponding to a transform followed by overlap-add (OLA), for an efficient implementation allowing on-the-fly processing, in the particular case M = pL
  • FIG. illustrates the conversion system within the meaning of the invention in an embodiment corresponding to a transform followed by overlap-add (OLA), for an efficient implementation allowing on-the-fly processing, in a particular case
  • FIGS. 20a and 20b respectively illustrate the combination of a filtering with a conversion between subband domains, and an equivalent overall system, within the meaning of the invention
  • FIGS. 21a and 21b illustrate the combination of a sampling frequency change (or "resampling") with a conversion between subband domains, conventional and in the sense of the invention, respectively
  • FIG. is a representation in multirate blocks of the conversion system within the meaning of the invention between subband domains combined with re-sampling
  • FIG. 23 represents the system within the meaning of the invention as an LPTV system applied to a combined conversion with a re-sampling
  • FIG. 24 represents a preferred embodiment corresponding to a transform followed by overlap-add (OLA), for an efficient implementation allowing on-the-fly processing, of the conversion system of FIG. 23
  • FIG. 25 represents a transcoding occurring in a gateway GW of a communication network, for a possible application of the present invention
  • FIG. 27 is a table showing the parameters of the conversion system within the meaning of the invention for particular cases of encoding formats.
  • the method of converting between subband domains is described below in a general discussion of the invention.
  • Consider the L-band synthesis filter bank used by a first compression coding system and defined by its filters, denoted $F_k(z)$, $0 \le k \le L-1$, and the M-band analysis filter bank used by a second compression system and defined by its filters, denoted $H_n(z)$, $0 \le n \le M-1$.
  • the signal vectors of the subbands representing the signal respectively in the domains of the first and second filter banks.
  • The principle of conversion between subband domains is illustrated by FIGS. 5a and 5b. The aim is to find a conversion system 51 (FIG. 5b) between the vectors of subband signals, $X(z)$ and $Y(z)$, equivalent to the cascade of the synthesis bank BS1 and the analysis bank BA2 (FIG. 5a).
  • the objective is to merge certain mathematical calculation operations between these two banks of filters to reduce the algorithmic complexity (that is to say the number of calculation operations and the required memory). Another objective is therefore to minimize the algorithmic delay introduced by this transformation.
  • Using multirate blocks, the diagram of FIG. 5a can be represented by that of FIG. 6, in which an analysis filter bank follows a synthesis filter bank.
  • The L-subband synthesis filter bank conventionally comprises, in each subband k, $0 \le k \le L-1$, an upsampling operation by a factor L followed by filtering by the synthesis filter $F_k(z)$.
  • The subband signal corresponding to the k-th component of the input vector $X(z)$ is therefore first upsampled and then filtered by the filter $F_k(z)$.
  • The time signal synthesized at the output of this synthesis bank is then obtained by summing the results of these filterings for $0 \le k \le L-1$.
  • This time signal then constitutes the input of the M-subband analysis bank. In each subband n, $0 \le n \le M-1$, it undergoes filtering by the analysis filter $H_n(z)$, followed by a downsampling operation by a factor M. The output of this analysis bank is a vector of subband signals, of size M, denoted $Y(z)$ in the z-transform domain.
  • The synthesis of a time signal is therefore generally necessary in this conventional conversion system, unlike in the conversion system within the meaning of the invention, which is described below.
  • $V(z) = T(z)\,U(z)$ (4)
  • The conversion matrix $T(z)$ is of size $K \times K$. Its expression is given by: (5) where $v(z)$ is the matrix of size $p_1 \times p_2$ whose elements are defined as follows:
  • The operation $\otimes$ designates the Kronecker product such that:
  • $\downarrow K$ denotes the decimation by a factor K, corresponding to a subsampling where only one sample is selected out of every K samples.
  • The conversion system can be schematized as shown in FIG. 7, which shows that the system is advantageously a so-called "linear periodically time-varying" (LPTV) system, as will be seen later.
  • LPTV linear periodically time-varying
  • The input block 71, consisting of the advance $z^{p_2-1}$ and the delay chain, followed by the decimators 72_(p2-1) to 72_0 by a factor $p_2$, can be interpreted as a mechanism that blocks each succession of $p_2$ input vectors, denoted $X[n]$, into a single vector $U[k]$, of size K.
  • This latter vector $U[k]$ is then applied to the filtering matrix $T(z)$ (module 74), and the result is a vector $V[k]$, of the same size as the vector $U[k]$.
  • The notation $X(z)$ simply refers to the expression of the vector X in the z-transform domain, while the notation $X[n]$ refers to the expression of the vector X in the time domain, as is conventional for the skilled person.
  • The last block 73_(p1-1) to 73_0 of FIG. 7 finally makes it possible to serialize the $p_1$ successive sub-vectors, each of size M, of the vector $V[k]$ to give the vectors $Y[n]$ as output.
  • FIG. 7 The input and output blocks of FIG. 7 are finally little different from the blocking mechanisms 81 and then series-linking mechanisms 82, respectively, of FIG. 8 which summarizes the main steps of the method within the meaning of the invention.
  • the conversion system within the meaning of the invention is minimal delay.
  • the element filters of the matrix T (z), are all causal if and only if: e mm ⁇ K- ⁇ , - (11)
  • Conversion systems within the meaning of the invention can therefore be constructed with different delays and by making different choices on the parameters a and b, but provided that the inequality (12) is preferentially satisfied.
  • the parameters a and b can therefore be seen as setting parameters for acting on the algorithmic delay introduced by the conversion system between subband domains.
  • v (z) is the matrix whose elements are defined as follows: (17)
  • the relation (16) is therefore the general formula of the conversion matrix T (z), which makes it possible to minimize the algorithmic delay introduced by the conversion system within the meaning of the invention.
  • The polyphase components considered in relation (18) correspond to a type 1 decomposition of order K, as described for example in the aforementioned reference:
  • The polyphase components $G^r_{nk}(z)$ (with $0 \le r \le K-1$) can be determined directly if the synthesis filters and the analysis filters have finite impulse responses (or "FIR"). In the case where one or both filter banks use recursive filters (with infinite impulse responses, or "IIR"), the product filters $G_{nk}(z)$ also have infinite impulse responses.
  • the general procedure for such a decomposition is given in Annex A, "Polyphase decomposition of recursive filters", reference:
  • the conversion matrix in this case is of size MxM and is written as follows:
  • This matrix is therefore the row vector consisting respectively of the polyphase components of general index $(p-k)L-1$ (where $0 \le k \le p-1$), following a type 1 decomposition of order M, of the matrix $g(z)$ of products of the synthesis and analysis filters.
  • The notation $G^r_{nk}(z)$ (with $0 \le r \le M-1$) refers to the polyphase component of general index r of the filter $G_{nk}(z)$, resulting from a decomposition of order M.
  • The conversion matrix in this case is of size $L \times L$ and is written as follows:
  • This matrix is therefore the column vector consisting respectively of the polyphase components of general index $(k+1)M-1$ (with $0 \le k \le p-1$), following a type 1 decomposition of order L, of the matrix $g(z)$ of products of the synthesis and analysis filters.
  • The notation $G^r_{nk}(z)$ (with $0 \le r \le L-1$) indicates the polyphase component of general index r of the filter $G_{nk}(z)$, resulting from a decomposition of order L.
  • The diagram of the conversion system is given in this case in FIG. 11 in multirate representation and in FIG. 12, which illustrates the main steps of the filtering method in this particular case where L = pM.
  • This conversion system can be seen as a system
  • the input rate of this system is f -pf s and the output rate is f ⁇ .
  • the transfer matrices A k (z) operate at the sampling frequency / et and the system operates globally as if a switch 140 (FIG. 14), at the input of the system, was flipping in a circular manner to this same frequency f s , from one input of a matrix block A A (z) to the other.
  • The output of the conversion system $Y[n]$ at the instant $nT_s$ is equal to the sum of the outputs of the $A_k(z)$ (with $0 \le k \le p-1$), each fed by, at the respective moments:
  • the two switches 151 and 152 shown respectively at the input and the output of the structure of FIG. 15 operate with a frequency - which is also
  • the input rate of this system is / ⁇ and the output rate is f s , allowing processing of the input data, on the fly, by the conversion system within the meaning of the invention.
  • the conversion matrix T (z) is expressed as follows:
  • P n are matrices of size KxK, and N corresponds to the maximum of the lengths of the filters T ffl / (z), elements of T (z).
  • The system can therefore be constructed as a matrix transform P, followed by an overlap-add operation.
  • This implementation is similar to the synthesis part of a lapped transform ("LT"), as described in particular in: "Signal Processing with Lapped Transforms", H. S. Malvar, Artech House, Inc., 1992.
  • The method presented below provides parallelism in the processing and an efficient use of computing resources (software or hardware) for the implementation of the process. It is therefore a presently preferred embodiment, at least in the case of finite impulse response filter banks.
  • each transfer matrix A , (z) contains filters of identical lengths and which depend on the value of e ij , then the corresponding matrix B (J) also depends on e ij.
  • the matrices B / contain zero sub-matrices. and their forms are given as follows: o If 0 ⁇ e ij ⁇ r 0 -l then:
  • The null blocks of the matrices $B_{ij}$ allow a reduction of computation when an input vector is transformed by this matrix.
  • The overlap-add of step 2.c is performed on vectors of length NM with an overlap of (N-1)M elements.
  • The output $Y[n]$ of the conversion system corresponds to the result of the overlap-add on the sum vectors resulting from step 3.
  • This matrix has the following form:
  • Each new input vector $X[n]$ is directed to the memory common to all the subsystems characterized by the transfer matrices $A_i(z)$, with $0 \le i \le p-1$.
  • the filter bank is characterized by the fact that the analysis and synthesis filters are obtained by a cosine modulation of a low-pass prototype filter.
  • Equations (57), (58) and the above conditions make it possible to fully characterize a modulated cosine filter bank with perfect reconstruction.
  • modulated cosine filter banks with perfect reconstruction are the basis of all the filter banks of the current audio coders. Even the pseudo-QMF filter bank of the MPEG-1/2 layer I & II coders can be associated with this category, it being understood that the prototype filter is sufficiently well designed to consider that the perfect reconstruction is satisfied.
  • the latter can be considered as an MLT transform (for "Modulated Lapped Transform"), also known as MDCT (for "Modified DCT").
  • MDCT Modified DCT
  • This transform is used in most current frequency-domain audio coders (MPEG-2/4 AAC, PAC, MSAudio, TDAC, etc.).
  • the window must check the condition of symmetry: and complementarity in power: .
  • This window choice is used in TDAC and G.722.1 encoders.
  • Another choice is to take a window derived from the Kaiser-Bessel window (or "KBD") as in the case of MPEG-4 AAC, BSAC, Twin VQ and AC-3 encoders.
  • the values provided in the MPEG-1 Audio Layer I-II standard correspond to the window $(-1)^l h(2lM + j)$, with $0 \le j \le 2M-1$ and $0 \le l \le m-1$.
  • HRTF filters Head-Related Transfer Functions
  • FIG. 5a With respect to the block diagram of FIG. 5a, it is a matter of introducing a filter S (z) between the two banks of synthesis and analysis filters and of finding an equivalent system.
  • the block diagrams are shown in FIGS. 20a and 20b.
  • the conversion system combined with the filtering can be modeled by the same type of scheme as that shown in FIG. 5b. However, it is characterized by the new filter matrix T (z) defined by:
  • The filter $S_{PB}(z)$ is a low-pass filter of normalized cutoff frequency and passband gain Q.
  • The conversion system combined with the sampling rate change can be modeled by the scheme of FIG. 22. It is characterized by the filter matrix $T(z)$ of size $q_1 M \times q_2 L$, defined as follows:
  • g (z) is the matrix of size MxL whose elements are given by: (69) and v (z) is the matrix whose elements are defined as follows: (70) also respecting the following relation: (71)
  • $G_{nk}(z)$ is interpreted as the result of the convolution of the filter $H_n(z)$ upsampled by a factor R, the filter $S_{PB}(z)$ and the filter $F_k(z)$ upsampled by a factor Q.
  • the system according to FIG. 23 operates with the matrices $A_{ij}(z)$ such that:
  • matrices B 011 are of size MxL
  • the following definition of matrices B 1 as shown in FIG. 24 can be given as follows:
  • the present invention provides a generic solution for converting a representation of a signal from one subband (or transform) domain to another.
  • the method is preferably applied in the context where the banks of filters used by the two compression systems are maximum decimation, as has been seen above.
  • the described embodiments may be provided for all transform or subband coders of multimedia signals, especially those used in video, picture, speech coding, or other.
  • These embodiments can also be implemented in any device having a cascade of a synthesis bank and an analysis bank, in particular in the following examples:
  • Improvement of speech quality in subbands followed by echo cancellation in subbands, and vice versa.
  • transcoding can occur at different points in the transmission chain. In the following, we distinguish some possible cases.
  • the transcoding mechanism TRANS is advantageously located in a gateway GW in the network RES for transmission of the audio content coming from a server SER and destined for a first terminal TER1, equipped with a decoder DEC1, and another terminal TER2, equipped with another decoder DEC2, as shown in FIG. 25.
  • transcoding TRANS FIG. 26
  • Terminal capacity information was previously received and analyzed by the SER server.
  • the audio content is stored in a given encoding format. It is transcoded in real time to be compatible with the terminal at every request of a user before being downloaded.
  • the terminals involved may have different capabilities in terms of coders / decoders.
  • transcoding can occur at the bridge.
  • Table 3 below now shows some possible transcoding, advantageous, between audio coding formats according to the fields of application.
  • Table 3 Examples of some interesting types of transcodings and their areas of application.
  • FIG. 27 then indicates the parameters of the conversion system within the meaning of the invention for these particular cases of coding formats.
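
The type 1 polyphase decomposition of the product filters mentioned in the description can be sketched as follows for the FIR case only. This is a minimal illustration under stated assumptions, not the procedure of the patent: the function names `convolve`, `polyphase_type1` and `from_polyphase_type1` are hypothetical, the product filter is taken to be the plain convolution of one analysis filter with one synthesis filter, and component r of a decomposition of order K is taken to hold the coefficients of index nK + r.

```python
def convolve(a, b):
    """Coefficients of the product of two FIR filters (linear convolution)."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out


def polyphase_type1(g, K):
    """Type 1 polyphase components of order K: component r keeps g[r], g[r+K], ..."""
    return [g[r::K] for r in range(K)]


def from_polyphase_type1(components, K):
    """Rebuild the filter coefficients from its K type 1 polyphase components."""
    length = sum(len(c) for c in components)
    g = [0.0] * length
    for r, comp in enumerate(components):
        for n, c in enumerate(comp):
            g[n * K + r] = c
    return g


# Hypothetical analysis filter h and synthesis filter f, chosen only for illustration.
h = [1.0, 2.0, 3.0]
f = [1.0, -1.0]
g = convolve(h, f)                 # product filter coefficients
components = polyphase_type1(g, 3)
```

Interleaving the components again with `from_polyphase_type1(components, 3)` returns the original coefficients, which is a quick way to check a decomposition before the components are placed in a conversion matrix.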
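
The blocking of input vectors and the serialization of output vectors described for FIG. 7 and FIG. 8 can be sketched as follows. This is an illustrative sketch only. It assumes that K is the least common multiple of L and M, which is consistent with the sizes given in the description (p2 vectors of size L and p1 vectors of size M each fill a vector of size K), and it assumes that the sub-vectors are simply concatenated in arrival order; the ordering imposed by the delay chain of FIG. 7 may differ. The function names are hypothetical.

```python
from math import gcd


def conversion_sizes(L, M):
    """Return (K, p1, p2), assuming K = lcm(L, M), p1 = K // M and p2 = K // L."""
    K = L * M // gcd(L, M)
    return K, K // M, K // L


def block_inputs(x_vectors, L, M):
    """Group each run of p2 input vectors of size L into one vector of size K."""
    _, _, p2 = conversion_sizes(L, M)
    usable = len(x_vectors) - len(x_vectors) % p2  # ignore an incomplete last block
    blocks = []
    for start in range(0, usable, p2):
        block = [s for vec in x_vectors[start:start + p2] for s in vec]
        blocks.append(block)
    return blocks


def serialize_outputs(v_vectors, L, M):
    """Split each vector of size K into p1 consecutive vectors of size M."""
    _, p1, _ = conversion_sizes(L, M)
    out = []
    for v in v_vectors:
        out.extend(v[i * M:(i + 1) * M] for i in range(p1))
    return out
```

With L = 4 and M = 6, for instance, `conversion_sizes` gives K = 12, so three input vectors are blocked together and each filtered vector is split into two output vectors.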
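
The construction of the system as a matrix transform followed by an overlap-add operation can be sketched as follows. This is a hedged illustration: it assumes that the conversion matrix has the FIR form $T(z) = \sum_{n=0}^{N-1} P_n z^{-n}$ with matrices $P_n$ of size K x K, which matches the description of the $P_n$ but is not quoted from the source, and the function names are hypothetical. The sketch compares the overlap-add form with a direct matrix convolution.

```python
def mat_vec(P, u):
    """Product of a matrix (list of rows) with a vector."""
    return [sum(a * b for a, b in zip(row, u)) for row in P]


def filter_direct(P_list, u_seq):
    """Direct form: V[k] = sum over n of P_n applied to U[k - n]."""
    K = len(P_list[0])
    out = []
    for k in range(len(u_seq)):
        acc = [0.0] * K
        for n, P in enumerate(P_list):
            if k - n >= 0:
                term = mat_vec(P, u_seq[k - n])
                acc = [a + t for a, t in zip(acc, term)]
        out.append(acc)
    return out


def filter_overlap_add(P_list, u_seq):
    """Same filter: one stacked (N*K) x K transform per input, then overlap-add."""
    K = len(P_list[0])
    N = len(P_list)
    stacked = [row for P in P_list for row in P]  # P_0 on top, P_{N-1} at the bottom
    buf = [0.0] * (N * K)                         # running overlap-add buffer
    out = []
    for u in u_seq:
        w = mat_vec(stacked, u)                   # transformed vector of length N*K
        buf = [b + x for b, x in zip(buf, w)]
        out.append(buf[:K])                       # first K entries are complete
        buf = buf[K:] + [0.0] * K                 # advance by K, overlap of (N-1)*K
    return out
```

Each input vector is processed as soon as it arrives and only a buffer of N*K values is kept between calls, which is what makes this form suitable for on-the-fly processing.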
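
The symmetry and power-complementarity conditions required of the MDCT window can be checked numerically. The sketch below assumes the commonly used sinusoidal window of length 2M, $h(n) = \sin(\pi (n + 1/2) / (2M))$, and the usual form of the two conditions, $h(2M-1-n) = h(n)$ and $h^2(n) + h^2(n+M) = 1$. Both the window and the normalization are assumptions made for illustration, since the corresponding equations are not reproduced in this text, and the function names are hypothetical.

```python
import math


def sine_window(M):
    """Sinusoidal window of length 2M (assumed form, see the text above)."""
    return [math.sin(math.pi * (n + 0.5) / (2 * M)) for n in range(2 * M)]


def is_symmetric(h, tol=1e-12):
    """Check h(2M - 1 - n) == h(n) for every n."""
    return all(abs(h[n] - h[len(h) - 1 - n]) < tol for n in range(len(h)))


def is_power_complementary(h, tol=1e-12):
    """Check h(n)^2 + h(n + M)^2 == 1 for 0 <= n < M."""
    M = len(h) // 2
    return all(abs(h[n] ** 2 + h[n + M] ** 2 - 1.0) < tol for n in range(M))


window = sine_window(8)
conditions_hold = is_symmetric(window) and is_power_complementary(window)
```

A rectangular window of the same length is symmetric but fails the second check, which illustrates why the window choice matters for perfect reconstruction.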

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Image Processing (AREA)
EP05798240A 2004-09-16 2005-08-23 Datenverarbeitungsverfahren durch Übergang zwischen verschiedenen Subband-domänen Not-in-force EP1794748B1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FR0409820A FR2875351A1 (fr) 2004-09-16 2004-09-16 Procede de traitement de donnees par passage entre domaines differents de sous-bandes
PCT/FR2005/002127 WO2006032740A1 (fr) 2004-09-16 2005-08-23 Procede de traitement de donnees par passage entre domaines differents de sous-bandes

Publications (2)

Publication Number Publication Date
EP1794748A1 (de) 2007-06-13
EP1794748B1 (de) 2010-02-17

Family

ID=34951876

Family Applications (1)

Application Number Title Priority Date Filing Date
EP05798240A Not-in-force EP1794748B1 (de) 2004-09-16 2005-08-23 Data processing method by passage between different sub-band domains

Country Status (8)

Country Link
US (1) US8639735B2 (de)
EP (1) EP1794748B1 (de)
JP (1) JP4850837B2 (de)
CN (1) CN101069233B (de)
AT (1) ATE458242T1 (de)
DE (1) DE602005019431D1 (de)
FR (1) FR2875351A1 (de)
WO (1) WO2006032740A1 (de)

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7422840B2 (en) * 2004-11-12 2008-09-09 E.I. Du Pont De Nemours And Company Apparatus and process for forming a printing form having a cylindrical support
US7916711B2 (en) * 2005-03-24 2011-03-29 Siport, Inc. Systems and methods for saving power in a digital broadcast receiver
GB2439685B (en) 2005-03-24 2010-04-28 Siport Inc Low power digital media broadcast receiver with time division
US7945233B2 (en) * 2005-06-16 2011-05-17 Siport, Inc. Systems and methods for dynamically controlling a tuner
US8335484B1 (en) 2005-07-29 2012-12-18 Siport, Inc. Systems and methods for dynamically controlling an analog-to-digital converter
US8599841B1 (en) 2006-03-28 2013-12-03 Nvidia Corporation Multi-format bitstream decoding engine
US8593469B2 (en) * 2006-03-29 2013-11-26 Nvidia Corporation Method and circuit for efficient caching of reference video data
FR2901433A1 (fr) * 2006-05-19 2007-11-23 France Telecom Conversion entre representations en domaines de sous-bandes pour des bancs de filtres variant dans le temps
US8700387B2 (en) * 2006-09-14 2014-04-15 Nvidia Corporation Method and system for efficient transcoding of audio data
US8015368B2 (en) * 2007-04-20 2011-09-06 Siport, Inc. Processor extensions for accelerating spectral band replication
US8199769B2 (en) 2007-05-25 2012-06-12 Siport, Inc. Timeslot scheduling in digital audio and hybrid audio radio systems
FR2918228A1 (fr) * 2007-06-29 2009-01-02 France Telecom Conversion entre domaines de sous-bandes pour bancs de filtres modules.
EP2099027A1 (de) * 2008-03-05 2009-09-09 Deutsche Thomson OHG Verfahren und Vorrichtung zur Umwandlung zwischen verschiedenen Filterbankdomänen
US8335080B2 (en) * 2009-04-20 2012-12-18 Infortrend Technology, Inc. Sliding rail having anti-tilting mechanism
US8320823B2 (en) * 2009-05-04 2012-11-27 Siport, Inc. Digital radio broadcast transmission using a table of contents
PL3998606T3 (pl) * 2009-10-21 2023-03-06 Dolby International Ab Nadrpóbkowanie w połączonym banku filtrów modułu transpozycji
US8958510B1 (en) * 2010-06-10 2015-02-17 Fredric J. Harris Selectable bandwidth filter
MY156027A (en) 2010-08-12 2015-12-31 Fraunhofer Ges Forschung Resampling output signals of qmf based audio codecs
US8489053B2 (en) 2011-01-16 2013-07-16 Siport, Inc. Compensation of local oscillator phase jitter
DE102013212840B4 (de) * 2013-07-02 2022-07-07 Robert Bosch Gmbh Modellberechnungseinheit und Steuergerät zur Berechnung eines datenbasierten Funktionsmodells mit Daten in verschiedenen Zahlenformaten
CN104656494A (zh) * 2013-11-19 2015-05-27 北大方正集团有限公司 一种信号实时性处理装置
US9898286B2 (en) * 2015-05-05 2018-02-20 Intel Corporation Packed finite impulse response (FIR) filter processors, methods, systems, and instructions
EP3182411A1 (de) * 2015-12-14 2017-06-21 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und verfahren zur verarbeitung eines codierten audiosignals
EP3206353B1 (de) * 2016-02-09 2020-02-05 Technische Universität München Filterbänke und verfahren zum betreiben von filterbänken
CN111968658B (zh) * 2020-06-30 2024-02-06 北京百度网讯科技有限公司 语音信号的增强方法、装置、电子设备和存储介质

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5508949A (en) * 1993-12-29 1996-04-16 Hewlett-Packard Company Fast subband filtering in digital signal coding
JP2985675B2 (ja) * 1994-09-01 1999-12-06 日本電気株式会社 帯域分割適応フィルタによる未知システム同定の方法及び装置
US5610942A (en) * 1995-03-07 1997-03-11 Chen; Keping Digital signal transcoder and method of transcoding a digital signal
US5950151A (en) * 1996-02-12 1999-09-07 Lucent Technologies Inc. Methods for implementing non-uniform filters
JP3283200B2 (ja) 1996-12-19 2002-05-20 ケイディーディーアイ株式会社 符号化音声データの符号化レート変換方法および装置
US6947509B1 (en) * 1999-11-30 2005-09-20 Verance Corporation Oversampled filter bank for subband processing
GB0003954D0 (en) 2000-02-18 2000-04-12 Radioscape Ltd Method of and apparatus for converting a signal between data compression formats
FR2807897B1 (fr) * 2000-04-18 2003-07-18 France Telecom Methode et dispositif d'enrichissement spectral
US6963842B2 (en) * 2001-09-05 2005-11-08 Creative Technology Ltd. Efficient system and method for converting between different transform-domain signal representations
US7020603B2 (en) 2002-02-07 2006-03-28 Intel Corporation Audio coding and transcoding using perceptual distortion templates
KR100499047B1 (ko) * 2002-11-25 2005-07-04 한국전자통신연구원 서로 다른 대역폭을 갖는 켈프 방식 코덱들 간의 상호부호화 장치 및 그 방법
US20050018796A1 (en) * 2003-07-07 2005-01-27 Sande Ravindra Kumar Method of combining an analysis filter bank following a synthesis filter bank and structure therefor
FR2901433A1 (fr) * 2006-05-19 2007-11-23 France Telecom Conversion entre representations en domaines de sous-bandes pour des bancs de filtres variant dans le temps
EP2099027A1 (de) * 2008-03-05 2009-09-09 Deutsche Thomson OHG Verfahren und Vorrichtung zur Umwandlung zwischen verschiedenen Filterbankdomänen

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO2006032740A1 *

Also Published As

Publication number Publication date
FR2875351A1 (fr) 2006-03-17
JP2008514071A (ja) 2008-05-01
CN101069233A (zh) 2007-11-07
JP4850837B2 (ja) 2012-01-11
US8639735B2 (en) 2014-01-28
EP1794748B1 (de) 2010-02-17
ATE458242T1 (de) 2010-03-15
WO2006032740A1 (fr) 2006-03-30
CN101069233B (zh) 2011-05-18
DE602005019431D1 (de) 2010-04-01
US20090198753A1 (en) 2009-08-06

Similar Documents

Publication Publication Date Title
EP1794748B1 (de) Data processing method by passage between different sub-band domains
EP1600042B1 (de) Verfahren zum bearbeiten komprimierter audiodaten zur räumlichen wiedergabe
EP1905010B1 (de) Hierarchischen Audio-kodierung/-dekodierung
EP2005420B1 (de) Einrichtung und verfahren zur codierung durch hauptkomponentenanalyse eines mehrkanaligen audiosignals
CA2512179C (fr) Procede de codage et de decodage audio a debit variable
FR2761801A1 (fr) Methode de codage audio et dispositif
EP2304721B1 (de) Raumsynthese mehrkanaliger tonsignale
WO2007104883A1 (fr) Dispositif et procede de codage gradue d'un signal audio multi-canal selon une analyse en composante principale
EP2732448B1 (de) Anpassungen von analyse- oder synthesegewichtungsfenstern zur transformationskodierung oder -dekodierung
WO2007093726A2 (fr) Dispositif de ponderation perceptuelle en codage/decodage audio
EP2374123A1 (de) Verbesserte codierung von mehrkanaligen digitalen audiosignalen
EP2979266A1 (de) Optimiertes partielles mischen von audioströmen mit subband-codierung
WO2007135319A1 (fr) Conversion entre representations en domaines de sous-bandes pour des bancs de filtres variant dans le temps
WO2023165946A1 (fr) Codage et décodage optimisé d'un signal audio utilisant un auto-encodeur à base de réseau de neurones
EP2126905B1 (de) Verfahren und Vorrichtung zur Kodierung und Dekodierung von Audiosignalen, kodiertes Audiosignal
EP0285495B1 (de) Digitales Signalverarbeitungssystem mit einer Filterbank
EP2126904B1 (de) Tonkodierverfahren und vorrichtung
EP1362344A1 (de) Verfahren und vorrichtung zur rekonstruktion des spektrums von mehrkanaligen signalen
FR2759510A1 (fr) Methode de codage audio et dispositif
WO2022136760A1 (fr) Codage optimise de matrices de rotations pour le codage d'un signal audio multicanal
FR2853804A1 (fr) Procede de decodage d'un signal permettant de reconstituer une scene sonore et dispositif de decodage correspondant

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20070316

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

DAX Request for extension of the european patent (deleted)
17Q First examination report despatched

Effective date: 20090421

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

RTI1 Title (correction)

Free format text: DATA PROCESSING METHOD BY PASSAGE BETWEEN DIFFERENT SUB-BAND DOMAINS

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

Free format text: NOT ENGLISH

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

Free format text: LANGUAGE OF EP DOCUMENT: FRENCH

REF Corresponds to:

Ref document number: 602005019431

Country of ref document: DE

Date of ref document: 20100401

Kind code of ref document: P

REG Reference to a national code

Ref country code: NL

Ref legal event code: VDEP

Effective date: 20100217

LTIE Lt: invalidation of european patent or patent extension

Effective date: 20100217

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100617

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100528

Ref country code: LT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100217

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100617

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100217

Ref country code: AT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100217

Ref country code: LV

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100217

Ref country code: PL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100217

Ref country code: SI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100217

REG Reference to a national code

Ref country code: IE

Ref legal event code: FD4D

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: EE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100217

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100217

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100217

Ref country code: IE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100217

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100518

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100217

Ref country code: RO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100217

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100217

Ref country code: CZ

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100217

Ref country code: BG

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100517

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed

Effective date: 20101118

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100217

BERE Be: lapsed

Owner name: FRANCE TELECOM

Effective date: 20100831

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100831

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100217

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100831

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100831

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100831

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: HU

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100818

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100823

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100217

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 11

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 12

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 13

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20170720

Year of fee payment: 13

Ref country code: GB

Payment date: 20170719

Year of fee payment: 13

Ref country code: DE

Payment date: 20170719

Year of fee payment: 13

REG Reference to a national code

Ref country code: DE

Ref legal event code: R119

Ref document number: 602005019431

Country of ref document: DE

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20180823

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20190301

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20180831

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20180823