EP2319037B1 - Reconstruction de données audio multicanal - Google Patents
Reconstruction de données audio multicanal Download PDFInfo
- Publication number
- EP2319037B1 EP2319037B1 EP09802568A EP09802568A EP2319037B1 EP 2319037 B1 EP2319037 B1 EP 2319037B1 EP 09802568 A EP09802568 A EP 09802568A EP 09802568 A EP09802568 A EP 09802568A EP 2319037 B1 EP2319037 B1 EP 2319037B1
- Authority
- EP
- European Patent Office
- Prior art keywords
- spatialization
- data
- value
- model
- predicted
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000002950 deficient Effects 0.000 claims abstract description 29
- 238000000034 method Methods 0.000 claims abstract description 25
- 238000012360 testing method Methods 0.000 claims abstract description 21
- 230000015654 memory Effects 0.000 claims description 17
- 238000004590 computer program Methods 0.000 claims description 4
- 230000005540 biological transmission Effects 0.000 description 11
- 238000013459 approach Methods 0.000 description 5
- 230000007547 defect Effects 0.000 description 5
- 230000008859 change Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 3
- 230000015556 catabolic process Effects 0.000 description 3
- 238000006731 degradation reaction Methods 0.000 description 3
- 230000010363 phase shift Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000003044 adaptive effect Effects 0.000 description 2
- 230000002238 attenuated effect Effects 0.000 description 2
- 230000001934 delay Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000009499 grossing Methods 0.000 description 2
- 238000013139 quantization Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 208000029523 Interstitial Lung disease Diseases 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 235000019800 disodium phosphate Nutrition 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 210000005069 ears Anatomy 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000007654 immersion Methods 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000035807 sensation Effects 0.000 description 1
- 230000005236 sound signal Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000011282 treatment Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/005—Correction of errors induced by the transmission channel, if related to the coding algorithm
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/02—Systems employing more than two channels, e.g. quadraphonic of the matrix type, i.e. in which input signals are combined algebraically, e.g. after having been phase shifted with respect to each other
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2420/00—Details of connection covered by H04R, not provided for in its groups
- H04R2420/03—Connection circuits to selectively connect loudspeakers or headphones to amplifiers
Definitions
- the invention relates to the concealment of defective spatialisation data for the reconstruction of multichannel audio data.
- the multichannel audio data is typically reconstructed from at least spatialization data and audio data over a restricted number of channels, eg, single channel data.
- the multichannel audio data is typically for a plurality of respective audio tracks.
- Several different sound sources can be used to help give the listener the illusion of sound immersion.
- the multichannel audio data may for example comprise stereo data on two channels, or else 5.1 data on six channels, in particular for home theater applications.
- the invention can also find an application in the field of spatialized audio conferencing, where the data corresponding to a speaker undergo a spatialization process in order to give the listener the illusion that the speaker's voice comes from a particular position from space.
- Spatialization data is used to obtain multichannel data from data on a smaller number of channels, for example, single channel data.
- These spatialization data may for example comprise differences in inter-channel level or ILD (Interchannel Level Difference), inter-channel correlations or ICC (Interchannel Cross Correlation), delays between channels or ITD (Interchannel Time Difference), differences in phase between channels or IPD (of the English "Interchannel Phase Difference”), or other.
- received audio data including at least the single channel data and the spatialization data, is defective, that is, some data is missing or erroneous.
- the detection of this defective transmission can be carried out by means of a CRC type code (of the English “Cyclic Redundancy Check”).
- prediction models are known. For example, an arbitrary value, a previous value or a value determined from the previously received audio data, for example linear prediction or other methods, are chosen as predicted value.
- the feeling of returning to a single-channel sound can be disturbing for the listener, especially in the case of binaural signals.
- the binaural signals that is to say, allowing a faithful reproduction of the 3D space at the level of the ears, often correspond to relatively fixed virtual sound sources in the space.
- each predicted spatialization value is compared with an estimated value from the spatialization data received.
- spatialisation data considered valid are used to choose from among a plurality of prediction models a prediction model to adopt in case of reception of spatialization data considered as defective.
- Such an adaptive method according to the content makes it possible to overcome the defects of the spatialization data more satisfactorily than in the prior art where a single prediction model is used.
- a restricted number of channels is meant a number of channels less than the number of channels of the multichannel data.
- data on a restricted number of channels may include single channel data.
- the spatialization data and more generally the audio data received, can come from a transmission channel.
- this data can be received over the Internet.
- the received audio data can be read on a storage medium, for example a DVD (Digital Versatile Disk), or other.
- the invention is in no way limited by the origin of the audio data received.
- the received audio data may comprise a coded signal, a demultiplexed and / or decoded signal, digital values, or the like.
- the steps a / and b / can be performed systematically following the reception of a frame considered valid.
- the treatments are thus distributed over time.
- steps a / and b / are performed for each valid frame, it is possible to write an identifier of the prediction model chosen in memory in order to be able, in the event of subsequent reception of defective spatialization data. , quickly find the model of prediction to apply.
- steps a / and / or b / may be subject to the fulfillment of certain conditions, which may make it possible to avoid performing unnecessary calculations.
- the spatialization data is stored in a memory, at least temporarily.
- the steps a / and b / are performed (from the data thus stored), only in the event of subsequent reception of spatialisation data considered to be defective. This avoids, in particular, making the predictions of step a / when this is not necessary.
- step a / it is possible to make the predictions of step a / systematically following the reception of a frame considered valid, while step b / is performed (from the spatialization data of the previous frame (s), stored in memory) in case of reception of a defective frame.
- the estimated value may be one of the spatialization data, for example the estimated value may include an ILD. In this case, it is possible during step b to compare the predicted spatialisation values directly with received spatialization data.
- the estimated value can derive only spatialization data.
- the estimated value may include a gain from the ILDs for a given frame and frequency band, a delay, or the like. In this case, it is possible during step b to compare the predicted spatialization values with values obtained from received spatialization data.
- the previously predicted spatialisation values are compared with corresponding estimated values.
- the choice of the prediction model most in line with the content can be made more accurately.
- the spatialization data received over several frames can be used, and the predicted values and the estimated values can be compared for several frames.
- the resemblance value can be calculated from a part of this sequence of predicted spatialization values, and from a sequence of values estimated from the data of the frame sequence.
- one will abstain from using defective spatialization data during the step of choosing the prediction model, in order to avoid falsifying this choice.
- the data may be defective due to degradations introduced during the transmission, or degradations of a data storage medium.
- the invention is not limited to this origin of defects.
- data may be missing. among the spatialization data received.
- the defectiveness of the spatialization data can be detected according to known methods, for example by means of a CRC type code.
- the invention is in no way limited by the form of the writing in memory of the identifier of the prediction model chosen. For example, it is possible to copy in a program memory all the instructions of a program corresponding to this model, or simply to memorize a model name in a possibly volatile memory.
- the prediction of the spatialization value is performed according to a prediction model, that is to say in particular that the data used for the prediction can vary according to the model. For example, for a model that assigns an arbitrary value to the spatialization value, no data is needed for the prediction. For a model that consists of taking a previous spatialization value, and / or weighting a previous spatialization value, this previous spatialization value is used during the prediction.
- step a / is performed for spatialization data corresponding to a given frequency band.
- several predictions can be carried out in parallel, in different frequency bands.
- the choice of the most accurate prediction model can be linked to the frequency: according to the frequency band considered, it may be necessary to choose different prediction models.
- the subject of the invention is a computer program comprising instructions for implementing the method explained above, when these instructions are executed by a processor.
- the invention has the aspect of a device for concealing defective spatialization data as defined in claim 10.
- This device comprises a memory unit, which may comprise one or more memories, for storing a plurality of instruction sets, each set of instructions corresponding to a prediction model.
- This device further comprises receiving means for receiving spatialization data.
- a test module makes it possible to test the validity of the spatialization data received by the reception means.
- an estimation module makes it possible, by instruction set stored in the memory unit, to execute this set of instructions so as to predict a spatialization value.
- a selection module makes it possible to choose a prediction model, based on the spatialization values predicted by the estimation module and on the spatialization data received by the reception means.
- the concealment device further comprises a prediction module arranged for, in the event of reception of spatialization data considered as defective by the detection module, to predict, according to the model chosen by the selection module, a spatialization value.
- the subject of the invention is a device for reconstructing multichannel audio data.
- This apparatus comprises multichannel reconstruction means, for reconstructing multichannel audio data from at least data on a restricted number of channels, for example single channel data.
- This apparatus further comprises the concealment device described above.
- the prediction module is arranged, in the event of reception of spatialization data considered as defective by the detection module, to provide the predicted spatialization value to the multichannel reconstruction means for the reconstruction of the multichannel audio data.
- the multi-channel audio data reconstruction apparatus may be integrated into a processor, or else comprise a computer-type device, hi-fi system, or the like.
- the various components of the reconstruction apparatus for example the reconstruction means, the concealment device, the detection module, or the like, may be distinct or merged.
- the number of channels of the multichannel audio data is exactly two, but of course there may be more.
- the multichannel audio data may for example comprise 5.1 data on six channels.
- the invention can also find an application in the field of spatialized audio conferencing.
- the audio data is grouped by frames or packets, indexed n.
- the figure 1 shows an example of an encoder, for which stereo information is transmitted in frequency bands and is applied in the frequency domain.
- the encoder integrates time-frequency transformation means 10, for example a DSP (of the "Digital Signal Processor") capable of producing a transform, for example a discrete Fourier transform or DFT (of the English “Discrete Fourier Transform"), a modified Modified Discrete Cosine Transform (MDCT), an MCLT (Modulated Complex Lapped Transform) transform.
- DSP Digital Signal Processor
- DFT discrete Fourier transform
- MDCT Modified Discrete Cosine Transform
- MCLT Modulated Complex Lapped Transform
- a stamping is then applied to the signals of the left channel S L (k) and the right S R (k), by means of stamping 11.
- the single-channel signal M (k) is typically the half-sum of the left signals S L (k) and right S R (k).
- the residual signal E (k) may be equal to half the difference between the left signals S L (k) and the right S R (k).
- Matrices may be adaptive so that the single channel signal M (k) carries more information.
- the method implemented by the stamping means 11 can change over time, so as to avoid the cancellation of components that would be in phase opposition between the left and right channels.
- Spatialization data estimation means 12 make it possible to estimate spatialization data, for example stereo parameters, from the single-channel signal M (k) and the residual signal E (k). These stereo parameters can be known to those skilled in the art, and understand for example, inter-channel level differences (ILD), inter-channel correlations (ICC) and inter-channel delays or phase differences (IPD / ITD).
- ILD inter-channel level differences
- ICC inter-channel correlations
- IPD / ITD inter-channel delays or phase differences
- These stereo parameters ILD (b) can be determined by frequency bands, indexed by the variable b. These bands can be constituted according to a frequency scale close to human perception. For example, one can use between 8 and 20 frequency bands, according to the desired precision and the richness of the considered spectrum.
- Quantization, coding and multiplexing means 13 make it possible to quantify and code the stereo parameters ILD (b) in order to allow transmission at a reduced rate.
- the single-channel signal M (k) is also quantified and coded by the means 13, in the transformed domain as presented on the figure 1 , or alternatively in the time domain.
- Standardized algorithms may be used to process this single-channel signal M (k), for example an ITU G.729.1 or G.718 type speech encoder. It may also be a generic audio encoder type MPEG-4 AAC or HE-AAC.
- the residual signal E (k) is optionally transmitted, also using a standard coding or a transmission technique specific to this signal in the frequency or time domain.
- the encoded signal S enc obtained at the output of the quantization, coding and multiplexing means 13 is transmitted, for example by radio.
- the encoder leads to obtaining data on more than one monophonic channel, provided that the number of channels of the data obtained at the output of the encoder is less than the number of channels of the input data of the encoder.
- the figure 2 shows an example of a decoder capable of receiving a signal S ' enc corresponding to the signal S enc transmitted.
- Means for decoding and demultiplexing 29 can extract the signal S enc received single-channel data M '(k), spatialization data ILD (b), and optionally the residual data E' (k).
- the decoder further comprises a reconstruction apparatus 26 for reconstructing multichannel audio data S ' L (k), S' R (k), from the single channel data M '(k), spatialization data ILD' (b) , and any residual data E '(k).
- the figure 3 shows an algorithm executable by the reconstruction apparatus 26 of the figure 2 . These two figures will be commented simultaneously.
- the reconstruction apparatus 26 includes a concealment device 20 for providing replacement values in case of defective ILD ' (b) spatialization data, and multichannel reconstruction means 27 for the actual reconstruction.
- M R k Where k denotes the frequency index considered, b is the band affected by the transmitted stereo parameters, M L (k), a signal in the frequency domain, obtained during a step 301 from the single-channel data M '(k), by applying in a manner known to those skilled in the art a phase shift or a delay corresponding to the left channel, this phase shift or delay being obtained from spatialization data not shown, and M R (k), a signal in the frequency domain, obtained equivalently in step 301, for the right channel.
- E ' L is a signal specific to the left channel, issued in a manner known to those skilled in the art from the residual data E' (k) optionally transmitted
- E ' R a signal specific to the right channel, issued in a manner known to those skilled in the art residual data E' (k) optionally transmitted.
- the step of obtaining data E ' L , E ' R is not represented on the figure 3 .
- W L and W R are the gains from spatialisation data ILD '(b, n) for the band b considered and the frame n .
- ILD '(b, n) is the spatialization data ILD' (b) received for the frame n.
- W R b not 2 - W R b not
- the concealment device 20 makes it possible to prevent possible losses of data ILD '(b, n), so that data W R and W L can nevertheless be determined.
- the concealment device 20 comprises unrepresented receiving means for receiving, during a step 305, the spatialization data ILD '(b, n), and possibly the single-channel data M' (k), and the residual data E '. (k).
- These receiving means may for example comprise an input port, pins input, or other.
- a test module 22 connected to these reception means makes it possible to test, during a step 306, the validity of the spatialization data ILD ' (b) .
- This test module can implement a verification of a CRC-type encoding, to verify for example that the transmission has not resulted in degradation of the spatialization data.
- the test module 22 can also read certain values (not shown) extracted from the signal S enc received, these values indicating possible transmitted data layers deletions. Indeed, it can be expected that some elements of the transmission network abstain from transmitting, particularly in the event of congestion of the network, or reduction of the bandwidth of the transmission channel, such or such a set of data. Non-transmitted data sets may be sound details, for example. When the test module 22 reads a value indicating a deletion of certain data, these data are considered as missing.
- the concealment device 20 comprises a memory unit 21 storing several sets of instructions, each set of instructions corresponding to a prediction model.
- the corresponding instructions then consist in copying the values W R (b, n-1), W L (b, n-1) obtained for the previous frame.
- W The 2 b not ⁇ + 1 - ⁇ .
- W The ⁇ b , not - 1 , and W R 2 b not ⁇ + 1 - ⁇ .
- W R ⁇ b , not - 1 with ⁇ between 0 and 1.
- W The 4 b not 1 2 .
- W The ⁇ b , not - 1 + 1 2 ⁇ W The ⁇ b , not - 2 , and W R 4 b not 1 2 .
- W L ( b, ni ) and W R ( b, ni ) instead of W L ( b, ni ) and W R ( b, ni ) respectively, attenuated values, for example 0.9, will be used.
- W R ( b, n - i ) It can be expected to keep in the memory unit these attenuated values, for use directly by applying one of the models described above.
- model examples lead to predict values of W L and W R.
- the models can be used to predict values of the variables ILD '(b, n), W' L and W ' R , or other.
- ILD' (b, n) LTD (b, n-1).
- the corresponding instruction then consists in copying this value ILD '(b, n-1) obtained for the previous frame.
- An estimation module 23 makes it possible to execute the instructions of the different instruction sets. This module 23 is activated for example for each frame such that the corresponding spatialization data ILD '(b, n) are considered valid by the test module 22, or even only for the frames considered valid and which precede a frame considered defective.
- this module 23 When this module 23 is activated, all the stored instruction sets are executed, during repeated steps 307 in a loop traversing the instruction sets, with the conventional steps of initialization, testing and incrementation, so that to get a set of values W The m W R m , m indicating the model used.
- a selection module 24 makes it possible to choose one of these models by comparing the predicted spatialisation values.
- ⁇ The , m 2 , ⁇ R , m 2 from predicted values W The m b not , W R m b not and from estimated values W L ( b , n ), W R ( b , n ).
- a sequence of N received frames is used to determine N values W The m b not and compare them with N estimated values W L ( b, n).
- W The m , W R m vis-à-vis the data W L , W R obtained from the values actually received.
- P m The P W The m b not / W The b not
- P m R P W R m b not / W R b not .
- the prediction model for which the resemblance value indicates a greater adequacy between predicted values and estimated values. For example, we determine the index m * of the model giving the best dissimulation: it will be the index which will minimize ⁇ m 2 or will maximize P m in another embodiment.
- This value m * constitutes an identifier of the prediction model chosen and is stored in the memory unit 21 during a step 309.
- steps 307 can be executed before steps 302, 304, or else in parallel.
- Each step 308 here puts into play values obtained during step 304, and is therefore executed after this step 304.
- the concealment device 20 further comprises a prediction module 25, for, in the event of reception of spatialization data considered to be defective, to predict in a step 310 according to the model identified by the value m * spatialization values W The m * b not and W R m * b not .
- This value is supplied to the multichannel reconstruction means 27, which are then able to reconstruct in step 300 the multichannel data S ' L (k), S' R (k) despite the defects of the spatialization data.
- Frequency-time transformation means 28 make it possible to retrieve audio time data S ' L (n), S' R (n) from the multichannel data S ' L (k), S' R ( k) reconstructed.
- W L (b, n) for the second sub-frequency band
- the values of W L (1, n) are for the most part equal to 1, which corresponds to a relatively monophonic sound signal.
- the values of W L (1, n) correspond to a signal located on the left, while for the C portion, the values of W L (1, n) correspond to a signal located on the right.
- the values of W L (1, n) correspond to a plurality of sound sources located at various locations.
- the best prediction model chosen may vary depending on the type of gain variation.
- the model consisting of repeating the value obtained for the previous frame would lead to erroneously repeat the peaks of values of W L (1, n).
- a more judicious model would be to choose an arbitrary value corresponding to a single channel signal, or to weight the gain obtained for the previous frame so as to approach gradually a gain of 1.
- the most judicious approach may be to repeat the gain value obtained for the previous frame.
- the most judicious model can change according to the type of variations of the gain from one frame to another.
- the process of figure 3 allows to select, without human intervention, the most appropriate prediction model.
- the figure 5 shows a computer including a screen 502, a keyboard, and a central unit.
- This central unit has a memory 500 for storing a computer program comprising instructions corresponding to the steps of the method described above.
- This central unit further comprises a processor 501 connected to the memory 500, to execute these instructions.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Algebra (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Stereophonic System (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR0855249 | 2008-07-30 | ||
PCT/FR2009/051304 WO2010012927A1 (fr) | 2008-07-30 | 2009-07-03 | Reconstruction de donnees audio multicanal |
Publications (2)
Publication Number | Publication Date |
---|---|
EP2319037A1 EP2319037A1 (fr) | 2011-05-11 |
EP2319037B1 true EP2319037B1 (fr) | 2012-05-09 |
Family
ID=40276118
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP09802568A Active EP2319037B1 (fr) | 2008-07-30 | 2009-07-03 | Reconstruction de données audio multicanal |
Country Status (8)
Country | Link |
---|---|
US (1) | US8867752B2 (zh) |
EP (1) | EP2319037B1 (zh) |
JP (1) | JP5421367B2 (zh) |
KR (1) | KR101590919B1 (zh) |
CN (1) | CN102138177B (zh) |
AT (1) | ATE557387T1 (zh) |
ES (1) | ES2387869T3 (zh) |
WO (1) | WO2010012927A1 (zh) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5333257B2 (ja) * | 2010-01-20 | 2013-11-06 | 富士通株式会社 | 符号化装置、符号化システムおよび符号化方法 |
EP2609592B1 (en) * | 2010-08-24 | 2014-11-05 | Dolby International AB | Concealment of intermittent mono reception of fm stereo radio receivers |
US9552818B2 (en) | 2012-06-14 | 2017-01-24 | Dolby International Ab | Smooth configuration switching for multichannel audio rendering based on a variable number of received channels |
US9460723B2 (en) | 2012-06-14 | 2016-10-04 | Dolby International Ab | Error concealment strategy in a decoding system |
EP2959479B1 (en) | 2013-02-21 | 2019-07-03 | Dolby International AB | Methods for parametric multi-channel encoding |
CN104282309A (zh) | 2013-07-05 | 2015-01-14 | 杜比实验室特许公司 | 丢包掩蔽装置和方法以及音频处理系统 |
CN107886960B (zh) * | 2016-09-30 | 2020-12-01 | 华为技术有限公司 | 一种音频信号重建方法及装置 |
US10043523B1 (en) | 2017-06-16 | 2018-08-07 | Cypress Semiconductor Corporation | Advanced packet-based sample audio concealment |
KR102654181B1 (ko) * | 2019-03-29 | 2024-04-02 | 텔레폰악티에볼라겟엘엠에릭슨(펍) | 예측 코딩에서 저비용 에러 복구를 위한 방법 및 장치 |
CN112740708B (zh) * | 2020-05-21 | 2022-07-22 | 华为技术有限公司 | 一种音频数据传输方法及相关装置 |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE4111131C2 (de) * | 1991-04-06 | 2001-08-23 | Inst Rundfunktechnik Gmbh | Verfahren zum Übertragen digitalisierter Tonsignale |
US6006173A (en) * | 1991-04-06 | 1999-12-21 | Starguide Digital Networks, Inc. | Method of transmitting and storing digitized audio signals over interference affected channels |
DE19526366A1 (de) * | 1995-07-20 | 1997-01-23 | Bosch Gmbh Robert | Verfahren zur Redundanzreduktion bei der Codierung von mehrkanaligen Signalen und Vorrichtung zur Dekodierung von redundanzreduzierten, mehrkanaligen Signalen |
US6181800B1 (en) * | 1997-03-10 | 2001-01-30 | Advanced Micro Devices, Inc. | System and method for interactive approximation of a head transfer function |
US6154452A (en) * | 1999-05-26 | 2000-11-28 | Xm Satellite Radio Inc. | Method and apparatus for continuous cross-channel interleaving |
US6970506B2 (en) * | 2001-03-05 | 2005-11-29 | Intervideo, Inc. | Systems and methods for reducing frame rates in a video data stream |
US7835916B2 (en) * | 2003-12-19 | 2010-11-16 | Telefonaktiebolaget Lm Ericsson (Publ) | Channel signal concealment in multi-channel audio systems |
SE527866C2 (sv) * | 2003-12-19 | 2006-06-27 | Ericsson Telefon Ab L M | Kanalsignalmaskering i multikanalsaudiosystem |
US7394903B2 (en) * | 2004-01-20 | 2008-07-01 | Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. | Apparatus and method for constructing a multi-channel output signal or for generating a downmix signal |
SE0402651D0 (sv) * | 2004-11-02 | 2004-11-02 | Coding Tech Ab | Advanced methods for interpolation and parameter signalling |
US20090281798A1 (en) * | 2005-05-25 | 2009-11-12 | Koninklijke Philips Electronics, N.V. | Predictive encoding of a multi channel signal |
-
2009
- 2009-07-03 KR KR1020117004404A patent/KR101590919B1/ko active IP Right Grant
- 2009-07-03 CN CN200980134855.0A patent/CN102138177B/zh active Active
- 2009-07-03 US US13/056,169 patent/US8867752B2/en active Active
- 2009-07-03 AT AT09802568T patent/ATE557387T1/de active
- 2009-07-03 WO PCT/FR2009/051304 patent/WO2010012927A1/fr active Application Filing
- 2009-07-03 JP JP2011520560A patent/JP5421367B2/ja active Active
- 2009-07-03 ES ES09802568T patent/ES2387869T3/es active Active
- 2009-07-03 EP EP09802568A patent/EP2319037B1/fr active Active
Also Published As
Publication number | Publication date |
---|---|
ES2387869T3 (es) | 2012-10-03 |
ATE557387T1 (de) | 2012-05-15 |
US20110129092A1 (en) | 2011-06-02 |
WO2010012927A1 (fr) | 2010-02-04 |
US8867752B2 (en) | 2014-10-21 |
KR20110065447A (ko) | 2011-06-15 |
KR101590919B1 (ko) | 2016-02-02 |
CN102138177B (zh) | 2014-05-28 |
JP5421367B2 (ja) | 2014-02-19 |
EP2319037A1 (fr) | 2011-05-11 |
CN102138177A (zh) | 2011-07-27 |
JP2011529579A (ja) | 2011-12-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP2319037B1 (fr) | Reconstruction de données audio multicanal | |
EP1600042B1 (fr) | Procede de traitement de donnees sonores compressees, pour spatialisation | |
EP2374123B1 (fr) | Codage perfectionne de signaux audionumeriques multicanaux | |
EP2002424B1 (fr) | Dispositif et procede de codage scalable d'un signal audio multi-canal selon une analyse en composante principale | |
EP2005420B1 (fr) | Dispositif et procede de codage par analyse en composante principale d'un signal audio multi-canal | |
EP2374124B1 (fr) | Codage perfectionne de signaux audionumériques multicanaux | |
EP1992198B1 (fr) | Optimisation d'une spatialisation sonore binaurale a partir d'un encodage multicanal | |
EP2898707B1 (fr) | Calibration optimisee d'un systeme de restitution sonore multi haut-parleurs | |
EP2691952B1 (fr) | Allocation par sous-bandes de bits de quantification de paramètres d'information spatiale pour un codage paramétrique | |
EP3427260B1 (fr) | Codage et décodage optimisé d'informations de spatialisation pour le codage et le décodage paramétrique d'un signal audio multicanal | |
EP2304721B1 (fr) | Synthese spatiale de signaux audio multicanaux | |
FR2966634A1 (fr) | Codage/decodage parametrique stereo ameliore pour les canaux en opposition de phase | |
EP2489039A1 (fr) | Codage/décodage paramétrique bas débit optimisé | |
FR2899424A1 (fr) | Procede de synthese binaurale prenant en compte un effet de salle | |
FR2875351A1 (fr) | Procede de traitement de donnees par passage entre domaines differents de sous-bandes | |
EP3391370A1 (fr) | Traitement de réduction de canaux adaptatif pour le codage d'un signal audio multicanal | |
WO2015011359A1 (fr) | Spatialisation sonore avec effet de salle | |
WO2009081002A1 (fr) | Traitement d'un flux audio 3d en fonction d'un niveau de presence de composantes spatiales | |
FR2845543A1 (fr) | Systeme d'embrouillage securise de flux audio |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20110215 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK SM TR |
|
AX | Request for extension of the european patent |
Extension state: AL BA RS |
|
17Q | First examination report despatched |
Effective date: 20110622 |
|
DAX | Request for extension of the european patent (deleted) | ||
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
RIN1 | Information on inventor provided before grant (corrected) |
Inventor name: PHILIPPE, PIERRICK Inventor name: VIRETTE, DAVID |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK SM TR |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D Free format text: NOT ENGLISH |
|
REG | Reference to a national code |
Ref country code: AT Ref legal event code: REF Ref document number: 557387 Country of ref document: AT Kind code of ref document: T Effective date: 20120515 Ref country code: CH Ref legal event code: EP |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: FG4D Free format text: LANGUAGE OF EP DOCUMENT: FRENCH |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R096 Ref document number: 602009006897 Country of ref document: DE Effective date: 20120712 |
|
REG | Reference to a national code |
Ref country code: NL Ref legal event code: VDEP Effective date: 20120509 |
|
REG | Reference to a national code |
Ref country code: ES Ref legal event code: FG2A Ref document number: 2387869 Country of ref document: ES Kind code of ref document: T3 Effective date: 20121003 |
|
REG | Reference to a national code |
Ref country code: LT Ref legal event code: MG4D Effective date: 20120509 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: FI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: IS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120909 Ref country code: PL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: LT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: CY Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: NO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120809 Ref country code: SE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 |
|
REG | Reference to a national code |
Ref country code: AT Ref legal event code: MK05 Ref document number: 557387 Country of ref document: AT Kind code of ref document: T Effective date: 20120509 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: PT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120910 Ref country code: GR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120810 Ref country code: HR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: SI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: LV Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 |
|
BERE | Be: lapsed |
Owner name: FRANCE TELECOM Effective date: 20120731 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: AT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: RO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: EE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: DK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: CZ Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: NL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: MC Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20120731 Ref country code: MK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed |
Effective date: 20130212 |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: MM4A |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: BE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20120731 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R097 Ref document number: 602009006897 Country of ref document: DE Effective date: 20130212 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: BG Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120809 Ref country code: MT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 Ref country code: IE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20120703 |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: PL |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LI Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20130731 Ref country code: CH Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20130731 Ref country code: TR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LU Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20120703 Ref country code: SM Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120509 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: HU Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20090703 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 7 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 8 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 9 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 10 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: IT Payment date: 20230620 Year of fee payment: 15 Ref country code: FR Payment date: 20230621 Year of fee payment: 15 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20230620 Year of fee payment: 15 Ref country code: ES Payment date: 20230801 Year of fee payment: 15 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20230620 Year of fee payment: 15 |