MX2021009635A - Estimacion de la forma espectral a partir de coeficientes de mdct. - Google Patents

Estimacion de la forma espectral a partir de coeficientes de mdct.

Info

Publication number
MX2021009635A
MX2021009635A MX2021009635A MX2021009635A MX2021009635A MX 2021009635 A MX2021009635 A MX 2021009635A MX 2021009635 A MX2021009635 A MX 2021009635A MX 2021009635 A MX2021009635 A MX 2021009635A MX 2021009635 A MX2021009635 A MX 2021009635A
Authority
MX
Mexico
Prior art keywords
audio frame
decoded
spectral
frame
values
Prior art date
Application number
MX2021009635A
Other languages
English (en)
Inventor
Jonas Svedberg
Martin Sehlstedt
Original Assignee
Ericsson Telefon Ab L M
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ericsson Telefon Ab L M filed Critical Ericsson Telefon Ab L M
Publication of MX2021009635A publication Critical patent/MX2021009635A/es

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005Correction of errors induced by the transmission channel, if related to the coding algorithm
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/14Fourier, Walsh or analogous domain transformations, e.g. Laplace, Hilbert, Karhunen-Loeve, transforms
    • G06F17/141Discrete Fourier transforms
    • G06F17/142Fast Fourier transforms, e.g. using a Cooley-Tukey type algorithm
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/45Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of analysis window
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/75Media network packet handling
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/80Responding to QoS
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Mathematical Physics (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Pure & Applied Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Algebra (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Discrete Mathematics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Communication Control (AREA)

Abstract

Se proporciona un método, decodificador y código de programa para controlar un método de ocultación de una trama de audio perdida. Una primera trama de audio y una segunda trama de audio de la señal de audio recibida se decodifican para obtener coeficientes de transformada de coseno discreta modificada (MDCT). Se determinan valores de una primera forma espectral con base en coeficientes de MDCT decodificados de la primera trama de audio decodificada y valores de una segunda forma espectral con base en coeficientes de MDCT decodificados de la segunda trama de audio decodificada, cada una de las formas espectrales comprende un número de subbandas. Los valores de las formas espectrales y energías de trama de la primera trama de audio y la segunda trama de audio se transforman en representaciones de análisis espectrales a base de FFT. Se detecta una condición transitoria en función de las representaciones de las FFTs. En respuesta a la detección de la condición transitoria, el método de ocultación se modifica ajustando selectivamente la magnitud de espectro de un espectro de trama de sustitución.
MX2021009635A 2019-02-21 2020-02-20 Estimacion de la forma espectral a partir de coeficientes de mdct. MX2021009635A (es)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201962808600P 2019-02-21 2019-02-21
US201962808587P 2019-02-21 2019-02-21
US201962808610P 2019-02-21 2019-02-21
PCT/EP2020/054523 WO2020169757A1 (en) 2019-02-21 2020-02-20 Spectral shape estimation from mdct coefficients

Publications (1)

Publication Number Publication Date
MX2021009635A true MX2021009635A (es) 2021-09-08

Family

ID=69701173

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2021009635A MX2021009635A (es) 2019-02-21 2020-02-20 Estimacion de la forma espectral a partir de coeficientes de mdct.

Country Status (9)

Country Link
US (5) US20220172733A1 (es)
EP (3) EP3928313A1 (es)
JP (6) JP7307805B2 (es)
KR (1) KR20210130743A (es)
CN (2) CN113454714B (es)
BR (1) BR112021014477A2 (es)
CO (2) CO2021010587A2 (es)
MX (1) MX2021009635A (es)
WO (3) WO2020169754A1 (es)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111984920B (zh) * 2020-08-31 2022-03-18 广东电网有限责任公司广州供电局 次/超同步谐波参数识别方法、装置、设备和介质

Family Cites Families (47)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5884253A (en) * 1992-04-09 1999-03-16 Lucent Technologies, Inc. Prototype waveform speech coding with interpolation of pitch, pitch-period waveforms, and synthesis filter
KR970011728B1 (ko) * 1994-12-21 1997-07-14 김광호 음향신호의 에러은닉방법 및 그 장치
US5848391A (en) * 1996-07-11 1998-12-08 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Method subband of coding and decoding audio signals using variable length windows
US7117156B1 (en) * 1999-04-19 2006-10-03 At&T Corp. Method and apparatus for performing packet loss or frame erasure concealment
US6775649B1 (en) * 1999-09-01 2004-08-10 Texas Instruments Incorporated Concealment of frame erasures for speech transmission and storage system and method
WO2001033411A1 (en) * 1999-10-30 2001-05-10 Stmicroelectronics Asia Pacific Pte. Ltd. Fast modified discrete cosine transform method
CN1311424C (zh) 2001-03-06 2007-04-18 株式会社Ntt都科摩 音频数据内插、关联信息制作、内插信息发送装置和方法
US7324444B1 (en) * 2002-03-05 2008-01-29 The Board Of Trustees Of The Leland Stanford Junior University Adaptive playout scheduling for multimedia communication
KR100467617B1 (ko) * 2002-10-30 2005-01-24 삼성전자주식회사 개선된 심리 음향 모델을 이용한 디지털 오디오 부호화방법과그 장치
KR100477701B1 (ko) * 2002-11-07 2005-03-18 삼성전자주식회사 Mpeg 오디오 인코딩 방법 및 mpeg 오디오 인코딩장치
US7325023B2 (en) * 2003-09-29 2008-01-29 Sony Corporation Method of making a window type decision based on MDCT data in audio encoding
US7831421B2 (en) * 2005-05-31 2010-11-09 Microsoft Corporation Robust decoder
US7526351B2 (en) * 2005-06-01 2009-04-28 Microsoft Corporation Variable speed playback of digital audio
US8473298B2 (en) * 2005-11-01 2013-06-25 Apple Inc. Pre-resampling to achieve continuously variable analysis time/frequency resolution
US8798172B2 (en) * 2006-05-16 2014-08-05 Samsung Electronics Co., Ltd. Method and apparatus to conceal error in decoded audio signal
PT2102619T (pt) * 2006-10-24 2017-05-25 Voiceage Corp Método e dispositivo para codificação de tramas de transição em sinais de voz
JP5103880B2 (ja) 2006-11-24 2012-12-19 富士通株式会社 復号化装置および復号化方法
KR101292771B1 (ko) * 2006-11-24 2013-08-16 삼성전자주식회사 오디오 신호의 오류은폐방법 및 장치
CN101207468B (zh) * 2006-12-19 2010-07-21 华为技术有限公司 丢帧隐藏方法、系统和装置
US8165872B2 (en) * 2007-02-01 2012-04-24 Broadcom Corporation Method and system for improving speech quality
EP2153436B1 (en) * 2007-05-14 2014-07-09 Freescale Semiconductor, Inc. Generating a frame of audio data
WO2008151408A1 (en) * 2007-06-14 2008-12-18 Voiceage Corporation Device and method for frame erasure concealment in a pcm codec interoperable with the itu-t recommendation g.711
US8185388B2 (en) * 2007-07-30 2012-05-22 Huawei Technologies Co., Ltd. Apparatus for improving packet loss, frame erasure, or jitter concealment
TW200912892A (en) * 2007-09-04 2009-03-16 Univ Nat Central Method and apparatus of low-complexity psychoacoustic model applicable for advanced audio coding encoders
US20100324911A1 (en) * 2008-04-07 2010-12-23 Broadcom Corporation Cvsd decoder state update after packet loss
CN101588341B (zh) * 2008-05-22 2012-07-04 华为技术有限公司 一种丢帧隐藏的方法及装置
US9076439B2 (en) * 2009-10-23 2015-07-07 Broadcom Corporation Bit error management and mitigation for sub-band coding
US20110196673A1 (en) * 2010-02-11 2011-08-11 Qualcomm Incorporated Concealing lost packets in a sub-band coding decoder
JP5973582B2 (ja) 2011-10-21 2016-08-23 サムスン エレクトロニクス カンパニー リミテッド フレームエラー隠匿方法及びその装置、並びにオーディオ復号化方法及びその装置
CN103714821A (zh) * 2012-09-28 2014-04-09 杜比实验室特许公司 基于位置的混合域数据包丢失隐藏
US9325544B2 (en) * 2012-10-31 2016-04-26 Csr Technology Inc. Packet-loss concealment for a degraded frame using replacement data from a non-degraded frame
FR3001593A1 (fr) * 2013-01-31 2014-08-01 France Telecom Correction perfectionnee de perte de trame au decodage d'un signal.
MX344550B (es) 2013-02-05 2016-12-20 Ericsson Telefon Ab L M Metodo y aparato para controlar ocultacion de perdida de trama de audio.
KR102037691B1 (ko) 2013-02-05 2019-10-29 텔레폰악티에볼라겟엘엠에릭슨(펍) 오디오 프레임 손실 은폐
WO2014123469A1 (en) 2013-02-05 2014-08-14 Telefonaktiebolaget L M Ericsson (Publ) Enhanced audio frame loss concealment
FR3004876A1 (fr) * 2013-04-18 2014-10-24 France Telecom Correction de perte de trame par injection de bruit pondere.
MY169132A (en) * 2013-06-21 2019-02-18 Fraunhofer Ges Forschung Method and apparatus for obtaining spectrum coefficients for a replacement frame of an audio signal, audio decoder, audio receiver and system for transmitting audio signals
CN104282309A (zh) * 2013-07-05 2015-01-14 杜比实验室特许公司 丢包掩蔽装置和方法以及音频处理系统
EP4336493A3 (en) 2014-07-28 2024-06-12 Samsung Electronics Co., Ltd. Method and apparatus for packet loss concealment, and decoding method and apparatus employing same
FR3024582A1 (fr) * 2014-07-29 2016-02-05 Orange Gestion de la perte de trame dans un contexte de transition fd/lpd
JP2016038435A (ja) * 2014-08-06 2016-03-22 ソニー株式会社 符号化装置および方法、復号装置および方法、並びにプログラム
EP3230980B1 (en) * 2014-12-09 2018-11-28 Dolby International AB Mdct-domain error concealment
US9978400B2 (en) * 2015-06-11 2018-05-22 Zte Corporation Method and apparatus for frame loss concealment in transform domain
US20170178648A1 (en) * 2015-12-18 2017-06-22 Dolby International Ab Enhanced Block Switching and Bit Allocation for Improved Transform Audio Coding
JP6718516B2 (ja) * 2016-03-07 2020-07-08 フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ ハイブリッドコンシールメント方法:オーディオコーデックにおける周波数および時間ドメインパケットロスの組み合わせ
US10770082B2 (en) * 2016-06-22 2020-09-08 Dolby International Ab Audio decoder and method for transforming a digital audio signal from a first to a second frequency domain
EP3616196A4 (en) * 2017-04-28 2021-01-20 DTS, Inc. AUDIO ENCODER WINDOW AND TRANSFORMATION IMPLEMENTATIONS

Also Published As

Publication number Publication date
CO2021010587A2 (es) 2021-08-30
JP2023029834A (ja) 2023-03-07
US11705136B2 (en) 2023-07-18
JP2022521188A (ja) 2022-04-06
CN113439302A (zh) 2021-09-24
US20220189490A1 (en) 2022-06-16
WO2020169757A1 (en) 2020-08-27
US20220148602A1 (en) 2022-05-12
KR20210130743A (ko) 2021-11-01
EP3928312A1 (en) 2021-12-29
US20230298597A1 (en) 2023-09-21
JP7471375B2 (ja) 2024-04-19
CO2021012223A2 (es) 2021-09-30
JP2023166423A (ja) 2023-11-21
WO2020169754A1 (en) 2020-08-27
CN113454714B (zh) 2024-05-14
US20240135936A1 (en) 2024-04-25
JP2023138988A (ja) 2023-10-03
JP7307805B2 (ja) 2023-07-12
WO2020169756A1 (en) 2020-08-27
EP3928313A1 (en) 2021-12-29
US11862180B2 (en) 2024-01-02
JP2022521494A (ja) 2022-04-08
JP7178506B2 (ja) 2022-11-25
US20220172733A1 (en) 2022-06-02
EP3928314A1 (en) 2021-12-29
US12002477B2 (en) 2024-06-04
CN113454714A (zh) 2021-09-28
JP2022521077A (ja) 2022-04-05
CN113454713A (zh) 2021-09-28
JP7335968B2 (ja) 2023-08-30
BR112021014477A2 (pt) 2021-09-28

Similar Documents

Publication Publication Date Title
JP7383067B2 (ja) 高度なスペクトラム拡張を使用して量子化ノイズを低減するための圧縮伸張装置および方法
TWI669705B (zh) 用以使用側邊增益及殘餘增益編碼或解碼多通道信號之設備及方法
KR100958144B1 (ko) 오디오 압축
CN105723452B (zh) 音频信号的频谱的频谱系数的解码方法及解码器
CA2604796C (en) Economical loudness measurement of coded audio
ATE535904T1 (de) Verbesserte transformationskodierung von sprach- und audiosignalen
PT1334484E (pt) Melhorar o desempenho de sistemas de codificacao que utilizam metodos de reconstrucao a altas frequencias
EP2722845B1 (en) Method and apparatus for generating downmix signal
US20150317991A1 (en) Voice audio encoding device, voice audio decoding device, voice audio encoding method, and voice audio decoding method
RU2010109206A (ru) Устройство и способ расчета параметров расширения полосы пропускания посредством управления фреймами наклона спектра
NO339114B1 (no) Prosessering av et multikanalsignal
PL1825461T3 (pl) Sposób i urządzenie do sztucznego rozszerzania szerokości pasma sygnałów mowy
JPWO2006075563A1 (ja) オーディオ符号化装置、オーディオ符号化方法およびオーディオ符号化プログラム
ATE394774T1 (de) Kodierungs-, dekodierungsvorrichtung und methode dafür
CA2898677C (en) Low-frequency emphasis for lpc-based coding in frequency domain
KR102426029B1 (ko) 오디오 신호 디코더에서의 개선된 주파수 대역 확장
KR20200077574A (ko) 다운샘플링 또는 스케일 파라미터의 보간을 사용하여 오디오 신호를 인코딩 및 디코딩하기 위한 장치 및 방법
MX2021009635A (es) Estimacion de la forma espectral a partir de coeficientes de mdct.
US20160104499A1 (en) Signal processing device and signal processing method
EP3783607B1 (en) Method and apparatus for encoding stereophonic signal
MX359502B (es) Metodos y dispositivos de codificacion y decodificacion de señal.
KR20100035128A (ko) 오디오 신호 처리 방법 및 장치
US20160196826A1 (en) Method and apparatus for encoding and decoding audio signal
US20140214412A1 (en) Apparatus and method for processing voice signal
Beack et al. Acoustic data transmission by extension on the time domain approach