WO2008040250A1 - Procédé, dispositif et système destinés au masquage d'erreurs d'un flux de données audio - Google Patents

Procédé, dispositif et système destinés au masquage d'erreurs d'un flux de données audio Download PDF

Info

Publication number
WO2008040250A1
WO2008040250A1 PCT/CN2007/070772 CN2007070772W WO2008040250A1 WO 2008040250 A1 WO2008040250 A1 WO 2008040250A1 CN 2007070772 W CN2007070772 W CN 2007070772W WO 2008040250 A1 WO2008040250 A1 WO 2008040250A1
Authority
WO
WIPO (PCT)
Prior art keywords
frame
audio frame
audio
type information
module
Prior art date
Application number
PCT/CN2007/070772
Other languages
English (en)
Chinese (zh)
Inventor
Hualin Wan
Zhe Wang
Jun Zhang
Original Assignee
Huawei Technologies Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co., Ltd. filed Critical Huawei Technologies Co., Ltd.
Publication of WO2008040250A1 publication Critical patent/WO2008040250A1/fr

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00Arrangements for detecting or preventing errors in the information received
    • H04L1/004Arrangements for detecting or preventing errors in the information received by using forward error control
    • H04L1/0045Arrangements at the receiver end
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005Correction of errors induced by the transmission channel, if related to the coding algorithm

Definitions

  • the present invention relates to real-time audio communication technology, and more particularly to a method, apparatus and system for audio stream error concealment. Background of the invention
  • Audio classification and classification are different for different application scenarios. For example, 1. In the noise suppression of advanced audio equipment, the FM signal is usually classified by FM analysis or Bayesian classifier. 2. In order to better index and retrieve audio resources on the Internet, content-based audio classification and retrieval studies have been conducted. The more representative content-based audio classification work analyzes the distinctive features of audio, including loudness, pitch, harmonicity, etc., and the audio classifier is designed. 3. Audio classification Another application is a voice activation detector (VAD) that serves audio, especially a speech coder. The purpose is to detect whether voice is present during voice communication. The voice and non-voice are different respectively. Encoding method to save the channel resources without reducing the call quality.
  • VAD voice activation detector
  • Figure 1 is a block diagram of audio stream error concealment.
  • the compressed audio signal passes through the IP.
  • the received audio data packet is usually stored in a jitter buffer, which is used to solve functions such as late packet and early packet reordering, and then performs packet loss and error packet detection. If there is a packet loss or a wrong packet, the system will start error concealment for packet loss compensation, otherwise the audio packet decoding output will be correctly received.
  • the packet loss recovery technology in audio real-time transmission can be divided into two categories according to the processing stage: sender-based repair and receiver-based repair.
  • the packet loss recovery based on the sender is initiated by the sender and needs to be coordinated between the sender and the receiver.
  • Common methods include increased redundancy, forward error correction, priority setting, and classification processing.
  • FEC Forward Error Correction
  • Priority setting method This technology requires network support and transmits packets according to priority. Otherwise, it cannot be implemented, and it can only improve the packet loss probability caused by network congestion.
  • the transmitting end can classify according to the characteristics of the speech signal, for example, 3GPP2 VMR-WB and ITU-T G.729.1 further the speech frame.
  • the description is voiced, unvoiced, voiced transition, unvoiced transition, onset, etc., and after decoding the terminal, using the voice frame type of the previous frame and the next frame, the type of the dropped frame can be inferred, and the decoder loses the frame. After the type, the information of the lost frame can be recovered better.
  • Receiver error concealment technology that does not require the sender to participate, essentially the number received Estimating lost data through a series of methods, and optimizing according to human physiological characteristics, is basically a passive patching, which is usually easier to implement and does not increase bandwidth requirements. Error-based hiding methods based on the receiving end can be divided into three categories:
  • Insertion-based strategy This type of technology includes methods such as splicing, muting, and noise substitution.
  • the stitching technique will disturb the timing of the media stream, and the effect is not good.
  • the range of mute substitution filling the frame loss position with a mute frame) is very limited, when the packet loss frequency is 4 low
  • the insertion method is independent of the speech coding and is not related to the encoding of the packet, but only the speech lost after decoding.
  • Interpolation-based strategy Compared with the insertion technique, the interpolation technique enables the processed sound to give a relatively better subjective feeling.
  • Re-generation based strategy Extract the decoding status from the information around the lost packet and generate a replacement packet for the lost packet. The implementation of this method is more complicated, but it will achieve better results.
  • the sender error concealment In general, based on the sender error concealment will increase the network bandwidth and computational complexity, the effect is better than the receiver-based, but if the sender error concealment is independent of the receiver, that is, regardless of the media content, then it will not be based on
  • the characteristics of the frame loss adopt the corresponding error concealment strategy for example, the stable speech frame is very similar to the previous frame, and the frame copy strategy can achieve a good hiding effect, and the transition frame needs to consider the state of the previous and subsequent frames to determine the hidden strategy.
  • the technology of the receiving end can also achieve a certain hidden effect, but if the hidden strategy is independent of the audio encoding, that is, the content characteristics of the currently lost frame and surrounding audio frames are not analyzed, so that a targeted error concealment strategy is adopted. Can use error concealment strategy will be very Limited.
  • the encoder analyzes the audio frame characteristics before formal coding, and uses different coding methods for audio frames of different characteristics. For example, AMR-WB+ uses ACELP and TCX encoding for signal frames based on audio frame content to form 26 superframes (each superframe consists of one superframe) encoding mode.
  • the coding mode information is used for error concealment. In the case where a frame is lost, the receiving end infers or estimates the coding mode of the super frame according to the coding type of the remaining three frames of the superframe, thereby realizing a certain error concealment function.
  • the voice frames are classified into voiced, unvoiced, voiced transition, unvoiced transition, onset, etc. according to the characteristics of the pitch, spectrum, and the like of the voice frame.
  • the encoder divides the voice frame into voiced, unvoiced, voiced transition, unvoiced transition, onset (VMR-WB also divides the voice frame according to the frame content and its characteristics. For these 5 classes, the type is indicated by 2 bits in layer 2.
  • G.729.1 also calculates the phase and energy of the frame, which are transmitted in layers 3 and 4 of the next frame, respectively.
  • the decoder will attempt to recover the frame identifier of the dropped frame from the known category identifier (including the category identifier of the previous frame), thereby reconstructing the audio waveform based on the category pattern of the dropped frame, combined with its phase and energy information.
  • encoding mode information is represented by 2, 2, 4, and 8 bits, respectively, and encoding is used in error concealment.
  • Mode information inferring or estimating the coding mode of the superframe (composed of 4 frames and 1024 sample points), thus realizing certain error concealment functions, but only the coding mode of audio coding is indicated, and cannot be based on the content of the audio frame.
  • the strategy is used for frame loss reconstruction, so efficient error concealment cannot be achieved.
  • this type of error concealment technology is designed for speech frames and does not work well when dealing with other types of audio frames. For the classification detection of music and natural sounds, especially how they reconstruct packet loss information in the case of packet loss, audio communication can also tolerate a high packet loss rate. There is currently no effective method.
  • the embodiment of the present invention provides a method for error concealment of an audio stream, which can implement efficient error concealment of an audio stream.
  • Embodiments of the present invention also provide an apparatus and system for error concealment of an audio stream, and the apparatus and system can be used to implement efficient error concealment of an audio stream.
  • a method for transmitting an audio stream error concealment including:
  • a method for receiving an audio stream error concealment comprising:
  • the corresponding error recovery strategy is used for audio frame reconstruction.
  • a method for hiding audio stream errors including:
  • the receiving end determines the type information of the audio frame obtained when it is classified according to the content;
  • the corresponding error recovery strategy is used for audio frame reconstruction.
  • a transmitter for error concealment of an audio stream comprising an audio encoder module, a frame encapsulation module and an audio frame classifier module;
  • the audio frame classifier module is configured to classify the transmitted audio frame according to content, obtain type information of the audio frame, and send the type information to the frame encapsulation module; the frame encapsulation module is configured to receive The type information of the audio frame sent by the audio frame classifier module and the encoding result of the audio frame sent by the audio encoder module are packaged and sent out by the type information of the audio frame and the encoding result of the audio frame.
  • a receiver for error concealment of an audio stream comprising a frame type discriminating module and an error concealing module
  • the frame type discriminating module is configured to determine type information of the audio frame obtained when the lost audio frame is classified according to the content, and send the type information to the error concealment module;
  • the error concealing module is configured to perform audio frame reconstruction by using a corresponding error recovery strategy according to the type information of the received lost audio frame.
  • An audio stream error concealment system comprising: a transmitter and a receiver;
  • the transmitter is configured to classify the transmitted audio frame according to content, obtain type information of the audio frame, and package and send the type information of the audio frame and the encoded result of the audio frame to the receiver;
  • the receiver is configured to determine, when a frame loss occurs, type information obtained when the lost audio frame is classified according to the content, and according to the type information, perform an audio frame reconstruction by using a corresponding error recovery strategy.
  • the embodiment of the present invention classifies the content of the audio frame according to the content of the audio frame, and sends the type information of the audio frame together with the encoded result of the audio frame.
  • the packet loss occurs, according to the loss.
  • the audio frames are classified according to the content, and the corresponding error concealment strategy is used to reconstruct the audio signal.
  • the error concealment method in the embodiment of the present invention makes the reconstruction of the lost frame more targeted, and can adaptively reconstruct the audio frame to achieve better compensation effect, and bring more benefits to the receiving end user.
  • a good subjective auditory experience, while improving the resolution of the audio frame signal, enables audio communication to tolerate higher packet loss rates.
  • Figure 1 is a block diagram of audio stream error concealment
  • FIG. 2 is a general flowchart of a method for error concealment of an audio stream according to an embodiment of the present invention
  • FIG. 3 is a schematic diagram of a system for error concealment of an audio stream according to an embodiment of the present invention
  • FIG. 5 is a general structural diagram of a receiver for error concealment of an audio stream according to an embodiment of the present invention
  • FIG. 6 is a specific flowchart of a method for transmitting error concealment of an audio stream according to an embodiment of the present invention
  • Schematic diagram of classification of audio frames
  • FIG. 8 is a specific flowchart of a method for receiving an audio stream error concealment according to an embodiment of the present invention
  • FIG. 9 is a specific structural diagram of a transmitter for error concealment of an audio stream according to an embodiment of the present invention
  • FIG. 10 is an audio stream error concealment according to an embodiment of the present invention
  • FIG. 2 is a general flow chart of a method for error concealment of an audio stream according to an embodiment of the present invention.
  • Figure 2 As shown, the method includes:
  • Step 201 classify the transmitted audio frame according to content, and obtain type information of the audio frame.
  • Step 202 Encapsulate and send out the type information of the audio frame and the encoding result of the audio frame.
  • Step 203 When a frame loss occurs, determine, for the lost audio frame, type information of the audio frame obtained when the content is classified according to the content;
  • Step 204 Perform audio frame reconstruction according to the type information of the lost audio frame by using a corresponding error recovery policy.
  • Steps 201 to 202 constitute an overall flow of the transmission method of the audio stream error concealment; steps 203 to 204 constitute an overall flow of the reception method of the audio stream error concealment.
  • FIG. 3 is a schematic structural diagram of a system for error concealment of an audio stream according to an embodiment of the present invention.
  • the system includes a transmitter 301 and a receiver 302.
  • the transmitter 301 is configured to classify the transmitted audio frame according to content, obtain type information of the audio frame, and package and transmit the type information of the audio frame and the encoding result of the audio frame to the receiver.
  • the receiver 302 is configured to determine, according to the type information, the type information obtained when the lost audio frame is classified according to the content, and perform the audio frame reconstruction according to the type information.
  • the transmitter and receiver in the system can employ the specific configurations of the transmitter 400 and the receiver 500 shown in Figs. 4 and 5, respectively.
  • the transmitter 400 includes an audio encoder module 410, an audio frame classifier module 420, and a frame encapsulation module 430.
  • the audio encoder module 410 is configured to encode the transmitted audio frame and send the encoded result to the frame encapsulation module 430.
  • An audio frame classifier configured to classify the sent audio frame according to content, obtain type information of the audio frame, and The type information is sent to the frame encapsulation module 430.
  • the frame encapsulating module 430 is configured to receive the encoding result of the audio frame sent by the audio encoder module 410 and the type information of the audio frame sent by the audio frame classifier module 420, and package the type information of the audio frame and the encoding result of the audio frame. Send it out.
  • the audio frame may be encoded differently according to the type information of the audio frame sent by the audio frame classifier, or the same coding mode may be directly applied to all the encoded frames.
  • FIG. 5 is a general structural diagram of a receiver for audio stream error concealment in the present invention.
  • the receiver includes: a frame type discrimination module 510 and an error concealment module 520.
  • the frame type discriminating module 510 is configured to use the type information of the audio frame obtained when the lost audio frame is classified according to the content, and send the type information to the error concealment module 520.
  • the error concealing module 520 is configured to perform audio frame reconstruction according to the type information of the received lost audio frame by using a corresponding error recovery strategy.
  • the present invention classifies the audio frame by the content at the transmitting end to obtain the type information of the audio frame, and sends the type information of the audio frame to the receiving end.
  • the receiving end adopts different error recovery strategies according to the type information of the lost audio frame. Perform audio frame reconstruction to efficiently hide errors.
  • FIG. 6 is a specific flowchart of a method for error concealment of an audio stream according to an embodiment of the present invention. As shown in Figure 6, the method includes:
  • Step 601 Divide the audio signal into equally spaced audio frames.
  • the frame length of the audio frame is determined according to the encoding protocol.
  • Step 602 Analyze the content and characteristics of the audio frame to obtain type information of the audio frame.
  • the audio frame is divided into a voice signal frame, a noise signal frame, a mute signal frame, a tone signal frame, and the like, and then further subdivided for each type, for example, the voice signal can be further divided into Voiced, unvoiced, voiced transition, unvoiced For transition, onset, etc., the tone signal frame can be divided into stable tone frames, transitions, and the like according to the stable characteristics of the signal.
  • Step 603 Perform encoding compression on the transmitted audio frame.
  • the same encoding method may be used for the entire audio signal, or different encoding methods may be used depending on the type of the audio frame.
  • Step 604 package the type of the audio frame and the result of the encoding compression, and send it out.
  • the type information of the audio frame may be identified in the frame header of the current frame or the next frame.
  • the method shown in FIG. 7 can be used.
  • the VAD is first used to detect whether the audio frame is a noise signal frame. If it is a noise signal frame, spectrum energy analysis is performed on the audio frame, and if it is a non-noise signal frame, spectrum stability analysis is performed on the audio frame.
  • the audio frame is divided into a mute signal frame and a noise signal frame, and then the mute signal frame or the noise signal frame can be further classified, and the type information of the audio frame is obtained.
  • the audio frame is divided into a speech signal frame and a tone signal frame, and then the speech signal frame or the tone signal frame can be further classified, and the port can be used for the voice signal.
  • the tone signal frame can be refined into a stable tone frame, a transitional tone frame, and the like.
  • the classification of the audio frame in the above transmission method is performed at the receiving end, in the embodiment, using the method shown in FIG. As shown in FIG. 8, the receiving method includes:
  • Step 801 Perform frame loss detection on the audio signal. If frame loss occurs, perform step 804 and subsequent steps. Otherwise, perform step 802 and subsequent steps. In this step, it is determined whether the loss of the audio frame occurs according to the frame number carried in the audio frame.
  • Step 802 Detect and record the type of the audio frame.
  • the type of the audio frame recorded in this step can be used to determine the type information of the lost audio frame.
  • Step 803 Decode the audio frame, and output the decoding result, and end the process.
  • the corresponding decoding method is used for decoding.
  • Step 804 determining type information obtained when the lost audio frame is classified according to the content.
  • the receiving end extracts the historical data, and infers the type of the currently lost frame according to the type information of the correctly received frame; if the type information of the audio frame is carried in other If the audio frame is correctly received, the receiving end directly extracts the type information of the currently lost frame in the corresponding correctly received audio frame.
  • Step 805 Reconfigure the audio frame by using the corresponding error recovery strategy adaptively according to the type of the lost audio frame, and output the reconstructed result, and end the process.
  • the audio frame can be reconstructed according to the type of the lost audio frame and the most suitable error recovery strategy for the type.
  • a stable speech frame is very similar to its previous frame.
  • a frame copy strategy can achieve a good hiding effect.
  • a transition frame needs to consider the state of the previous and succeeding frames to determine a hidden strategy.
  • the audio frame is classified by using the method shown in FIG. 7 , and other content-based audio frame classification methods may be used, as long as the audio frame is classified according to the content.
  • the purpose can be.
  • the classification of the audio frame can be utilized to efficiently realize error concealment, and the tolerance of the real-time audio communication to the packet loss rate is greatly improved.
  • the foregoing is a specific implementation manner of the method for transmitting and receiving audio stream error concealment provided in the embodiment.
  • the two embodiments cooperate with each other to form a specific embodiment of the method for error concealment of the audio stream in the present invention.
  • this embodiment also provides a specific implementation of the corresponding audio stream error concealed transmitter and receiver.
  • FIG. 9 is a specific structural diagram of a transmitter for error concealment of an audio stream according to an embodiment of the present invention. As shown
  • the transmitter 900 includes: an audio encoder module 910, an audio frame classifier module
  • Frame encapsulation module 930 and audio frame division module 940 Frame encapsulation module 930 and audio frame division module 940.
  • an audio frame dividing module 940 is configured to divide the audio signal into equally spaced audio frames according to different encoding protocols, and send the audio frames to the audio encoder module 910 and the audio frame classifier module 920. .
  • the audio encoder module 910 is configured to encode the audio frame and send the encoded result to the frame encapsulation module 930.
  • the audio frame classifier module 920 is configured to classify the audio frames according to the content, and the specific classification manner may be the manner shown in FIG. 7, and the type information of the audio frames is sent to the frame encapsulation module 930.
  • the frame encapsulation module 930 is configured to receive the audio frame coding result sent by the audio encoder module 910 and the audio frame type information sent by the audio frame classifier module 920, and package and transmit the type information and the encoded result of the audio frame.
  • the type information of the audio frame may be encapsulated in the audio frame or the next audio frame, and may be located in a part of the frame header.
  • the audio frame may be encoded differently according to the type information of the audio frame sent by the audio frame classifier, or the same encoding mode may be directly applied to all the encoded frames.
  • FIG. 10 is a specific structural diagram of a receiver for error concealment of an audio stream according to an embodiment of the present invention.
  • the receiver 1000 includes a frame type discrimination module 1010, an error concealment module 1020, an error detection module 1030, and an audio decoder module 1040.
  • the module 1010 includes a discriminating sub-module 1011 and a storage sub-module 1012.
  • the error concealing module 1020 includes a policy decision sub-module 1021 and an error concealment sub-module 1022.
  • the error detection module 1030 is configured to receive an audio frame from the channel, send the received audio frame to the discriminant sub-module 1011 in the frame type discriminating module 1010, and detect whether a frame loss occurs. When the frame is dropped, the discriminating sub-module 1011 in the frame type discriminating module 1010 is notified.
  • the frame type discriminating module 1010 when determining the type in which the audio frame is classified according to the content, if the type information of the audio frame is carried in the correctly received audio frame, the type information is directly extracted and stored in the storage submodule. In 1012, if the type information of the audio frame is carried in the lost audio frame, the type information obtained when the lost audio frame is classified according to the content is inferred according to the type of the preceding and succeeding frames.
  • the policy decision submodule 1021 is configured to receive the type information of the lost frame sent by the discriminating submodule 1011, and determine the adopted error recovery strategy according to the type information, and send the result to the error concealment submodule. 1022.
  • the error concealment sub-module 1022 is configured to reconstruct the lost audio frame according to the error recovery policy decision result sent by the policy decision sub-module 1021.
  • the audio frame decoder module 1040 is configured to decode the received audio frame and output the decoded result.
  • the audio frame classifier module 920 uses the manner of FIG. 7 to classify audio frames.
  • the frame type decision module 1010 is refined into a decision sub-module 1011 and a storage sub-module 1012, respectively performing frame type decision and storage;
  • the error concealment module 1020 is refined into a policy decision sub-module 1021 and an error concealer.
  • Module 1022 respectively, performs policy decisions and error concealment.
  • the error concealment sub-module 1022 can be further divided into a plurality of different types of error concealment units, such as a noise error concealment list. Meta, voice error concealment unit, etc., used to handle error concealment of different types of audio frames.
  • An embodiment of the audio stream error concealment system of the present invention may be: using the above FIG. 9 and
  • the transmitter 900 and the receiver 1000 shown in FIG. 10 are specific embodiments of the transmitter and the receiver in the audio stream error concealment system, and the audio frame outputted by the frame encapsulation module 930 in the transmitter 900 is transmitted to the receiver 1000. Error detection module 1030.
  • an embodiment of the audio stream error concealment system of the present invention can be constructed.
  • the technical solution of the present invention makes the reconstruction of the lost frame more targeted, and can adaptively reconstruct the audio frame to achieve better.
  • the compensation effect brings a better subjective hearing experience to the receiving end user, and at the same time improves the resolution of the audio frame signal, so that the audio communication can tolerate a higher packet loss rate.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Detection And Prevention Of Errors In Transmission (AREA)

Abstract

L'invention concerne un procédé destiné au masquage d'une erreur dans un flux de données audio. Ce procédé consiste: à classer la trame audio transmise sur la base de ses contenus et à obtenir des informations sur le type de la trame audio; à conditionner les informations sur le type de la trame audio avec les résultats de codage de la trame audio et à les transmettre; lors de la perte d'une trame, à déterminer, pour la trame audio perdue, les informations sur le type de la trame audio obtenue par classement sur la base de ses contenus; et, selon les informations sur le type de la trame audio perdue, à reconstruire la trame audio à l'aide de la stratégie de reprise sur erreur correspondante. Ledit mode de masquage d'erreur a une meilleure pertinence pour la reconstruction de la trame perdue et permet de reconstruire la trame audio perdue de façon auto-adaptative et d'obtenir, ainsi, un meilleur effet de compensation. L'invention concerne également un procédé d'émission et de réception destiné au masquage d'erreurs d'un flux de données audio. L'invention concerne également un émetteur, un récepteur et un système destinés au masquage d'erreurs d'un flux de données audio.
PCT/CN2007/070772 2006-10-01 2007-09-25 Procédé, dispositif et système destinés au masquage d'erreurs d'un flux de données audio WO2008040250A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN 200610159697 CN101155140A (zh) 2006-10-01 2006-10-01 音频流错误隐藏的方法、装置和系统
CN200610159697.2 2006-10-01

Publications (1)

Publication Number Publication Date
WO2008040250A1 true WO2008040250A1 (fr) 2008-04-10

Family

ID=39256584

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2007/070772 WO2008040250A1 (fr) 2006-10-01 2007-09-25 Procédé, dispositif et système destinés au masquage d'erreurs d'un flux de données audio

Country Status (2)

Country Link
CN (1) CN101155140A (fr)
WO (1) WO2008040250A1 (fr)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102244825A (zh) * 2011-06-10 2011-11-16 中兴通讯股份有限公司 一种多媒体流的播放方法及装置
CN109155134A (zh) * 2016-03-07 2019-01-04 弗劳恩霍夫应用研究促进协会 使用正确解码的音频帧的解码表示的特性的错误隐藏单元、音频解码器和相关方法以及计算机程序
CN109313905A (zh) * 2016-03-07 2019-02-05 弗劳恩霍夫应用研究促进协会 对不同的频带根据不同的阻尼因子淡出隐藏的音频帧的错误隐藏单元、音频解码器及相关方法和计算机程序
DE102018200258A1 (de) 2018-01-10 2019-07-11 Robert Bosch Gmbh Verfahren zur verbesserten Übertragung einer Datensequenz und Verfahren zur Fehlerkorrektur bei einer Übertragung einer Datensequenz
US10607614B2 (en) 2013-06-21 2020-03-31 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method realizing a fading of an MDCT spectrum to white noise prior to FDNS application

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102008042579B4 (de) * 2008-10-02 2020-07-23 Robert Bosch Gmbh Verfahren zur Fehlerverdeckung bei fehlerhafter Übertragung von Sprachdaten
US8428938B2 (en) * 2009-06-04 2013-04-23 Qualcomm Incorporated Systems and methods for reconstructing an erased speech frame
CN101894558A (zh) * 2010-08-04 2010-11-24 华为技术有限公司 丢帧恢复方法、设备以及语音增强方法、设备和系统
ES2960089T3 (es) 2012-06-08 2024-02-29 Samsung Electronics Co Ltd Procedimiento y aparato para la ocultación de errores de trama y procedimiento y aparato para la decodificación de audio
US9312990B2 (en) * 2012-09-13 2016-04-12 International Business Machines Corporation Packet loss recovery on a wireless link in a transmission layer protocol session
JP6434411B2 (ja) 2012-09-24 2018-12-05 サムスン エレクトロニクス カンパニー リミテッド フレームエラー隠匿方法及びその装置、並びにオーディオ復号化方法及びその装置
CN104282309A (zh) * 2013-07-05 2015-01-14 杜比实验室特许公司 丢包掩蔽装置和方法以及音频处理系统
CN104301064B (zh) 2013-07-16 2018-05-04 华为技术有限公司 处理丢失帧的方法和解码器
CN103646647B (zh) * 2013-12-13 2016-03-16 武汉大学 混合音频解码器中帧差错隐藏的谱参数代替方法及系统
CN103714820B (zh) * 2013-12-27 2017-01-11 广州华多网络科技有限公司 参数域的丢包隐藏方法及装置
NO2780522T3 (fr) * 2014-05-15 2018-06-09
CN106683681B (zh) 2014-06-25 2020-09-25 华为技术有限公司 处理丢失帧的方法和装置
CN112309352A (zh) * 2020-01-15 2021-02-02 北京字节跳动网络技术有限公司 音频信息处理方法、装置、设备和介质
CN111883171B (zh) * 2020-04-08 2023-09-22 珠海市杰理科技股份有限公司 音频信号的处理方法及系统、音频处理芯片、蓝牙设备
CN113035208B (zh) * 2021-03-04 2023-03-28 北京百瑞互联技术有限公司 一种音频解码器的分级错误隐藏方法、装置及存储介质
CN113259063B (zh) * 2021-06-10 2022-02-08 腾讯科技(深圳)有限公司 数据处理方法、装置、计算机设备和计算机可读存储介质

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6754188B1 (en) * 2001-09-28 2004-06-22 Meshnetworks, Inc. System and method for enabling a node in an ad-hoc packet-switched wireless communications network to route packets based on packet content
CN1589550A (zh) * 2001-11-15 2005-03-02 松下电器产业株式会社 错误隐蔽装置和方法
US20050154584A1 (en) * 2002-05-31 2005-07-14 Milan Jelinek Method and device for efficient frame erasure concealment in linear predictive based speech codecs

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6754188B1 (en) * 2001-09-28 2004-06-22 Meshnetworks, Inc. System and method for enabling a node in an ad-hoc packet-switched wireless communications network to route packets based on packet content
CN1589550A (zh) * 2001-11-15 2005-03-02 松下电器产业株式会社 错误隐蔽装置和方法
US20050154584A1 (en) * 2002-05-31 2005-07-14 Milan Jelinek Method and device for efficient frame erasure concealment in linear predictive based speech codecs

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102244825A (zh) * 2011-06-10 2011-11-16 中兴通讯股份有限公司 一种多媒体流的播放方法及装置
US11462221B2 (en) 2013-06-21 2022-10-04 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating an adaptive spectral shape of comfort noise
US10867613B2 (en) 2013-06-21 2020-12-15 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for improved signal fade out in different domains during error concealment
US11869514B2 (en) 2013-06-21 2024-01-09 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for improved signal fade out for switched audio coding systems during error concealment
US10607614B2 (en) 2013-06-21 2020-03-31 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method realizing a fading of an MDCT spectrum to white noise prior to FDNS application
US10672404B2 (en) 2013-06-21 2020-06-02 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for generating an adaptive spectral shape of comfort noise
US10679632B2 (en) 2013-06-21 2020-06-09 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for improved signal fade out for switched audio coding systems during error concealment
US10854208B2 (en) 2013-06-21 2020-12-01 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method realizing improved concepts for TCX LTP
US11776551B2 (en) 2013-06-21 2023-10-03 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for improved signal fade out in different domains during error concealment
US11501783B2 (en) 2013-06-21 2022-11-15 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method realizing a fading of an MDCT spectrum to white noise prior to FDNS application
CN109155134A (zh) * 2016-03-07 2019-01-04 弗劳恩霍夫应用研究促进协会 使用正确解码的音频帧的解码表示的特性的错误隐藏单元、音频解码器和相关方法以及计算机程序
CN109313905B (zh) * 2016-03-07 2023-05-23 弗劳恩霍夫应用研究促进协会 隐藏音频帧丢失的错误隐藏单元、音频解码器及相关方法
CN109155134B (zh) * 2016-03-07 2023-05-23 弗劳恩霍夫应用研究促进协会 隐藏音频帧丢失的错误隐藏单元、音频解码器和相关方法
CN109313905A (zh) * 2016-03-07 2019-02-05 弗劳恩霍夫应用研究促进协会 对不同的频带根据不同的阻尼因子淡出隐藏的音频帧的错误隐藏单元、音频解码器及相关方法和计算机程序
DE102018200258A1 (de) 2018-01-10 2019-07-11 Robert Bosch Gmbh Verfahren zur verbesserten Übertragung einer Datensequenz und Verfahren zur Fehlerkorrektur bei einer Übertragung einer Datensequenz

Also Published As

Publication number Publication date
CN101155140A (zh) 2008-04-02

Similar Documents

Publication Publication Date Title
WO2008040250A1 (fr) Procédé, dispositif et système destinés au masquage d'erreurs d'un flux de données audio
JP6546897B2 (ja) マルチレート・スピーチ/オーディオ・コーデックのためのフレーム損失隠匿について符号化を実行する方法
JP6533285B2 (ja) 符号化器、復号器ならびに隠蔽を増強するためのパラメータを使用してオーディオ内容を符号化および復号するための方法
CN101305417B (zh) 移动电信网络中的方法和装置
EP2647241B1 (fr) Agregation de trames adaptive au signal source
JP2003241799A (ja) 音響符号化方法、復号化方法、符号化装置、復号化装置及び符号化プログラム、復号化プログラム
Johansson et al. Bandwidth efficient AMR operation for VoIP
US7853450B2 (en) Digital voice enhancement
Hoene et al. Voice over IP: improving the quality over wireless LAN by adopting a booster mechanism: an experimental approach
Sanneck et al. Selective packet prioritization for wireless Voice over IP
JP4060317B2 (ja) 双方向通信システム、通信機、および通信制御方法
Dorogov et al. Overview of Technologies for Transmitting Audio Streams over Low-Speed and Unstable Communication Channels
Bhute et al. Error concealment schemes for speech packet transmission over IP network
Serizawa et al. A packet loss recovery method using packets arrived behind the playout time for CELP decoding
Antoszkiewicz Voice Over Internet Protocol (VolP) Packet Loss Concealment (PLC) by Redundant Transmission of Speech Information
SIVASELVAN AUDIO STREAMING USING INTERLEAVED FORWARD ERROR CORRECTION

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07816963

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 07816963

Country of ref document: EP

Kind code of ref document: A1