MX2021004636A - Codificacion de audio perceptual con division en mosaicos de tiempo/frecuencia, no uniforme, adaptativa que utiliza fusion de subbandas y reduccion de solapamiento de dominio de tiempo. - Google Patents

Codificacion de audio perceptual con division en mosaicos de tiempo/frecuencia, no uniforme, adaptativa que utiliza fusion de subbandas y reduccion de solapamiento de dominio de tiempo.

Info

Publication number
MX2021004636A
MX2021004636A MX2021004636A MX2021004636A MX2021004636A MX 2021004636 A MX2021004636 A MX 2021004636A MX 2021004636 A MX2021004636 A MX 2021004636A MX 2021004636 A MX2021004636 A MX 2021004636A MX 2021004636 A MX2021004636 A MX 2021004636A
Authority
MX
Mexico
Prior art keywords
samples
block
audio signal
bins
basis
Prior art date
Application number
MX2021004636A
Other languages
English (en)
Inventor
Bernd Edler
Sascha Disch
Nils Werner
Original Assignee
Fraunhofer Ges Forschung
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Ges Forschung filed Critical Fraunhofer Ges Forschung
Publication of MX2021004636A publication Critical patent/MX2021004636A/es

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/14Fourier, Walsh or analogous domain transformations, e.g. Laplace, Hilbert, Karhunen-Loeve, transforms
    • G06F17/147Discrete orthonormal transforms, e.g. discrete cosine transform, discrete sine transform, and variations therefrom, e.g. modified discrete cosine transform, integer transforms approximating the discrete cosine transform
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/022Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Discrete Mathematics (AREA)
  • Algebra (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

Modalidades proporcionan un procesador de audio para procesar una señal de audio para obtener una representación de subbanda de la señal de audio. El procesador de audio se configura para realizar una transformada muestreada críticamente traslapada en cascada en al menos dos bloques parcialmente superpuestos de muestras de la señal de audio, para obtener un conjunto de muestras de subbanda con base en un primer bloque de muestras de la señal de audio, y para obtener un conjunto correspondiente de muestras de subbanda con base en un segundo bloque de muestras de la señal de audio. Además, el procesador de audio se configura para realizar una combinación ponderada de dos conjuntos correspondientes de muestras de subbanda, una obtenida con base en el primer bloque de muestras de la señal de audio y otra obtenida con base en el segundo bloque de muestras de la señal de audio, para obtener una representación de subbanda de solapamiento reducido de la señal de audio; en donde la realización de una transformada muestreada críticamente traslapada en cascada comprende segmentar un conjunto de tramos obtenidos con base en el primer bloque de muestras usando al menos dos funciones de ventana, y para obtener al menos dos conjuntos segmentados de tramos con base en el conjunto segmentado de tramos correspondiente al primer bloque de muestras; en donde la realización de una transformada muestreada críticamente traslapada en cascada comprende segmentar un conjunto de tramos obtenidos con base en el segundo bloque de muestras usando las al menos dos funciones de ventana, y para obtener al menos dos conjuntos de tramos con base en el conjunto segmentado de tramos correspondiente al segundo bloque de muestras; y en donde los conjuntos de tramos se procesan usando una segunda transformada muestreada críticamente traslapada de la transformada muestreada críticamente traslapada en cascada, en donde la segunda transformada muestreada críticamente traslapada comprende realizar transformadas muestreadas críticamente traslapadas que tienen la misma longitud de cuadro para al menos un conjunto de tramos.
MX2021004636A 2018-10-26 2019-10-16 Codificacion de audio perceptual con division en mosaicos de tiempo/frecuencia, no uniforme, adaptativa que utiliza fusion de subbandas y reduccion de solapamiento de dominio de tiempo. MX2021004636A (es)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP18202927 2018-10-26
EP19169635.0A EP3644313A1 (en) 2018-10-26 2019-04-16 Perceptual audio coding with adaptive non-uniform time/frequency tiling using subband merging and time domain aliasing reduction
PCT/EP2019/078112 WO2020083727A1 (en) 2018-10-26 2019-10-16 Perceptual audio coding with adaptive non-uniform time/frequency tiling using subband merging and the time domain aliasing reduction

Publications (1)

Publication Number Publication Date
MX2021004636A true MX2021004636A (es) 2021-05-28

Family

ID=64316263

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2021004636A MX2021004636A (es) 2018-10-26 2019-10-16 Codificacion de audio perceptual con division en mosaicos de tiempo/frecuencia, no uniforme, adaptativa que utiliza fusion de subbandas y reduccion de solapamiento de dominio de tiempo.

Country Status (9)

Country Link
US (1) US11688408B2 (es)
EP (2) EP3644313A1 (es)
JP (1) JP7279160B2 (es)
KR (1) KR102630922B1 (es)
CN (1) CN113330515B (es)
BR (1) BR112021007516A2 (es)
CA (1) CA3118121C (es)
MX (1) MX2021004636A (es)
WO (1) WO2020083727A1 (es)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3786948A1 (en) * 2019-08-28 2021-03-03 Fraunhofer Gesellschaft zur Förderung der Angewand Time-varying time-frequency tilings using non-uniform orthogonal filterbanks based on mdct analysis/synthesis and tdar

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5687243A (en) * 1995-09-29 1997-11-11 Motorola, Inc. Noise suppression apparatus and method
US7516064B2 (en) 2004-02-19 2009-04-07 Dolby Laboratories Licensing Corporation Adaptive hybrid transform for signal analysis and synthesis
US7548853B2 (en) * 2005-06-17 2009-06-16 Shmunk Dmitry V Scalable compressed audio bit stream and codec using a hierarchical filterbank and multichannel joint coding
US7548727B2 (en) * 2005-10-26 2009-06-16 Broadcom Corporation Method and system for an efficient implementation of the Bluetooth® subband codec (SBC)
KR100647336B1 (ko) 2005-11-08 2006-11-23 삼성전자주식회사 적응적 시간/주파수 기반 오디오 부호화/복호화 장치 및방법
EP2015293A1 (en) * 2007-06-14 2009-01-14 Deutsche Thomson OHG Method and apparatus for encoding and decoding an audio signal using adaptively switched temporal resolution in the spectral domain
MX2011000375A (es) * 2008-07-11 2011-05-19 Fraunhofer Ges Forschung Codificador y decodificador de audio para codificar y decodificar tramas de una señal de audio muestreada.
EP2144171B1 (en) * 2008-07-11 2018-05-16 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder and decoder for encoding and decoding frames of a sampled audio signal
AU2009267518B2 (en) * 2008-07-11 2012-08-16 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus and method for encoding/decoding an audio signal using an aliasing switch scheme
GB2473267A (en) 2009-09-07 2011-03-09 Nokia Corp Processing audio signals to reduce noise
EP2524374B1 (en) * 2010-01-13 2018-10-31 Voiceage Corporation Audio decoding with forward time-domain aliasing cancellation using linear-predictive filtering
US8886523B2 (en) 2010-04-14 2014-11-11 Huawei Technologies Co., Ltd. Audio decoding based on audio class with control code for post-processing modes
SG185519A1 (en) * 2011-02-14 2012-12-28 Fraunhofer Ges Forschung Information signal representation using lapped transform
EP3276620A1 (en) 2016-07-29 2018-01-31 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Time domain aliasing reduction for non-uniform filterbanks which use spectral analysis followed by partial synthesis
US10020930B2 (en) * 2016-11-04 2018-07-10 Commissariat A L'energie Atomique Et Aux Energies Alternatives Method of non-uniform wavelet bandpass sampling
EP3786948A1 (en) * 2019-08-28 2021-03-03 Fraunhofer Gesellschaft zur Förderung der Angewand Time-varying time-frequency tilings using non-uniform orthogonal filterbanks based on mdct analysis/synthesis and tdar

Also Published As

Publication number Publication date
US20210233544A1 (en) 2021-07-29
CN113330515A (zh) 2021-08-31
KR20210076134A (ko) 2021-06-23
JP2022505789A (ja) 2022-01-14
JP7279160B2 (ja) 2023-05-22
EP3871215A1 (en) 2021-09-01
CN113330515B (zh) 2024-05-24
WO2020083727A1 (en) 2020-04-30
EP3871215B1 (en) 2023-09-13
US11688408B2 (en) 2023-06-27
KR102630922B1 (ko) 2024-01-30
BR112021007516A2 (pt) 2021-07-27
CA3118121A1 (en) 2020-04-30
CA3118121C (en) 2023-10-03
EP3871215C0 (en) 2023-09-13
EP3644313A1 (en) 2020-04-29

Similar Documents

Publication Publication Date Title
US11373666B2 (en) Apparatus for post-processing an audio signal using a transient location detection
KR102088153B1 (ko) 향상된 스펙트럼 확장을 사용하여 양자화 잡음을 감소시키기 위한 압신 장치 및 방법
RU2658892C2 (ru) Устройство и способ для расширения диапазона частот для акустических сигналов
US11568883B2 (en) Low-frequency emphasis for LPC-based coding in frequency domain
US11908484B2 (en) Apparatus and method for generating an enhanced signal using independent noise-filling at random values and scaling thereupon
EP4362423A3 (en) System and method for processing audio data
TR201908841T4 (tr) Ses sinyali işleme.
MX2019001023A (es) Reduccion de solapamiento en dominio de tiempo para bancos de filtros no uniformes que usan analisis espectral seguido por sintesis parcial.
MX2021004636A (es) Codificacion de audio perceptual con division en mosaicos de tiempo/frecuencia, no uniforme, adaptativa que utiliza fusion de subbandas y reduccion de solapamiento de dominio de tiempo.
KR102104561B1 (ko) 오디오 신호를 처리하기 위한 방법 및 장치
CN109102823B (zh) 一种基于子带谱熵的语音增强方法
US20200013421A1 (en) Apparatus and method for post-processing an audio signal using prediction based shaping
MX2018009145A (es) Aparato y método para mejorar una transición desde una porción de señal de audio oculta hasta una porción de señal de audio subsiguiente de una señal de audio.
Derrien et al. A quasi-orthogonal, invertible, and perceptually relevant time-frequency transform for audio coding
Füg et al. Temporal noise shaping on MDCT subband signals for transform audio coding
US9911423B2 (en) Multi-channel audio signal classifier
Schasse et al. Efficient implementation of single-channel noise reduction for hearing aids using a cascaded filter-bank
US20240194209A1 (en) Apparatus and method for removing undesired auditory roughness
US11610597B2 (en) Anti-causal filter for audio signal processing
Xing et al. An Improvement Based on Wavelet Transform Speech Denoising Threshold Function
WO2012134851A1 (en) Reduced complexity transform for a low-frequency-effects channel