MX2021004636A - Codificacion de audio perceptual con division en mosaicos de tiempo/frecuencia, no uniforme, adaptativa que utiliza fusion de subbandas y reduccion de solapamiento de dominio de tiempo. - Google Patents
Codificacion de audio perceptual con division en mosaicos de tiempo/frecuencia, no uniforme, adaptativa que utiliza fusion de subbandas y reduccion de solapamiento de dominio de tiempo.Info
- Publication number
- MX2021004636A MX2021004636A MX2021004636A MX2021004636A MX2021004636A MX 2021004636 A MX2021004636 A MX 2021004636A MX 2021004636 A MX2021004636 A MX 2021004636A MX 2021004636 A MX2021004636 A MX 2021004636A MX 2021004636 A MX2021004636 A MX 2021004636A
- Authority
- MX
- Mexico
- Prior art keywords
- samples
- block
- audio signal
- bins
- basis
- Prior art date
Links
- 230000003044 adaptive effect Effects 0.000 title 1
- 230000005236 sound signal Effects 0.000 abstract 8
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/14—Fourier, Walsh or analogous domain transformations, e.g. Laplace, Hilbert, Karhunen-Loeve, transforms
- G06F17/147—Discrete orthonormal transforms, e.g. discrete cosine transform, discrete sine transform, and variations therefrom, e.g. modified discrete cosine transform, integer transforms approximating the discrete cosine transform
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/022—Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Computational Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Pure & Applied Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- Discrete Mathematics (AREA)
- Algebra (AREA)
- Databases & Information Systems (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Modalidades proporcionan un procesador de audio para procesar una señal de audio para obtener una representación de subbanda de la señal de audio. El procesador de audio se configura para realizar una transformada muestreada críticamente traslapada en cascada en al menos dos bloques parcialmente superpuestos de muestras de la señal de audio, para obtener un conjunto de muestras de subbanda con base en un primer bloque de muestras de la señal de audio, y para obtener un conjunto correspondiente de muestras de subbanda con base en un segundo bloque de muestras de la señal de audio. Además, el procesador de audio se configura para realizar una combinación ponderada de dos conjuntos correspondientes de muestras de subbanda, una obtenida con base en el primer bloque de muestras de la señal de audio y otra obtenida con base en el segundo bloque de muestras de la señal de audio, para obtener una representación de subbanda de solapamiento reducido de la señal de audio; en donde la realización de una transformada muestreada críticamente traslapada en cascada comprende segmentar un conjunto de tramos obtenidos con base en el primer bloque de muestras usando al menos dos funciones de ventana, y para obtener al menos dos conjuntos segmentados de tramos con base en el conjunto segmentado de tramos correspondiente al primer bloque de muestras; en donde la realización de una transformada muestreada críticamente traslapada en cascada comprende segmentar un conjunto de tramos obtenidos con base en el segundo bloque de muestras usando las al menos dos funciones de ventana, y para obtener al menos dos conjuntos de tramos con base en el conjunto segmentado de tramos correspondiente al segundo bloque de muestras; y en donde los conjuntos de tramos se procesan usando una segunda transformada muestreada críticamente traslapada de la transformada muestreada críticamente traslapada en cascada, en donde la segunda transformada muestreada críticamente traslapada comprende realizar transformadas muestreadas críticamente traslapadas que tienen la misma longitud de cuadro para al menos un conjunto de tramos.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP18202927 | 2018-10-26 | ||
EP19169635.0A EP3644313A1 (en) | 2018-10-26 | 2019-04-16 | Perceptual audio coding with adaptive non-uniform time/frequency tiling using subband merging and time domain aliasing reduction |
PCT/EP2019/078112 WO2020083727A1 (en) | 2018-10-26 | 2019-10-16 | Perceptual audio coding with adaptive non-uniform time/frequency tiling using subband merging and the time domain aliasing reduction |
Publications (1)
Publication Number | Publication Date |
---|---|
MX2021004636A true MX2021004636A (es) | 2021-05-28 |
Family
ID=64316263
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
MX2021004636A MX2021004636A (es) | 2018-10-26 | 2019-10-16 | Codificacion de audio perceptual con division en mosaicos de tiempo/frecuencia, no uniforme, adaptativa que utiliza fusion de subbandas y reduccion de solapamiento de dominio de tiempo. |
Country Status (9)
Country | Link |
---|---|
US (1) | US11688408B2 (es) |
EP (2) | EP3644313A1 (es) |
JP (1) | JP7279160B2 (es) |
KR (1) | KR102630922B1 (es) |
CN (1) | CN113330515B (es) |
BR (1) | BR112021007516A2 (es) |
CA (1) | CA3118121C (es) |
MX (1) | MX2021004636A (es) |
WO (1) | WO2020083727A1 (es) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3786948A1 (en) * | 2019-08-28 | 2021-03-03 | Fraunhofer Gesellschaft zur Förderung der Angewand | Time-varying time-frequency tilings using non-uniform orthogonal filterbanks based on mdct analysis/synthesis and tdar |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5687243A (en) * | 1995-09-29 | 1997-11-11 | Motorola, Inc. | Noise suppression apparatus and method |
US7516064B2 (en) | 2004-02-19 | 2009-04-07 | Dolby Laboratories Licensing Corporation | Adaptive hybrid transform for signal analysis and synthesis |
US7548853B2 (en) * | 2005-06-17 | 2009-06-16 | Shmunk Dmitry V | Scalable compressed audio bit stream and codec using a hierarchical filterbank and multichannel joint coding |
US7548727B2 (en) * | 2005-10-26 | 2009-06-16 | Broadcom Corporation | Method and system for an efficient implementation of the Bluetooth® subband codec (SBC) |
KR100647336B1 (ko) | 2005-11-08 | 2006-11-23 | 삼성전자주식회사 | 적응적 시간/주파수 기반 오디오 부호화/복호화 장치 및방법 |
EP2015293A1 (en) * | 2007-06-14 | 2009-01-14 | Deutsche Thomson OHG | Method and apparatus for encoding and decoding an audio signal using adaptively switched temporal resolution in the spectral domain |
MX2011000375A (es) * | 2008-07-11 | 2011-05-19 | Fraunhofer Ges Forschung | Codificador y decodificador de audio para codificar y decodificar tramas de una señal de audio muestreada. |
EP2144171B1 (en) * | 2008-07-11 | 2018-05-16 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder and decoder for encoding and decoding frames of a sampled audio signal |
AU2009267518B2 (en) * | 2008-07-11 | 2012-08-16 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for encoding/decoding an audio signal using an aliasing switch scheme |
GB2473267A (en) | 2009-09-07 | 2011-03-09 | Nokia Corp | Processing audio signals to reduce noise |
EP2524374B1 (en) * | 2010-01-13 | 2018-10-31 | Voiceage Corporation | Audio decoding with forward time-domain aliasing cancellation using linear-predictive filtering |
US8886523B2 (en) | 2010-04-14 | 2014-11-11 | Huawei Technologies Co., Ltd. | Audio decoding based on audio class with control code for post-processing modes |
SG185519A1 (en) * | 2011-02-14 | 2012-12-28 | Fraunhofer Ges Forschung | Information signal representation using lapped transform |
EP3276620A1 (en) | 2016-07-29 | 2018-01-31 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Time domain aliasing reduction for non-uniform filterbanks which use spectral analysis followed by partial synthesis |
US10020930B2 (en) * | 2016-11-04 | 2018-07-10 | Commissariat A L'energie Atomique Et Aux Energies Alternatives | Method of non-uniform wavelet bandpass sampling |
EP3786948A1 (en) * | 2019-08-28 | 2021-03-03 | Fraunhofer Gesellschaft zur Förderung der Angewand | Time-varying time-frequency tilings using non-uniform orthogonal filterbanks based on mdct analysis/synthesis and tdar |
-
2019
- 2019-04-16 EP EP19169635.0A patent/EP3644313A1/en not_active Withdrawn
- 2019-10-16 BR BR112021007516-0A patent/BR112021007516A2/pt unknown
- 2019-10-16 JP JP2021522453A patent/JP7279160B2/ja active Active
- 2019-10-16 WO PCT/EP2019/078112 patent/WO2020083727A1/en unknown
- 2019-10-16 MX MX2021004636A patent/MX2021004636A/es unknown
- 2019-10-16 CN CN201980087032.0A patent/CN113330515B/zh active Active
- 2019-10-16 EP EP19784111.7A patent/EP3871215B1/en active Active
- 2019-10-16 KR KR1020217015408A patent/KR102630922B1/ko active IP Right Grant
- 2019-10-16 CA CA3118121A patent/CA3118121C/en active Active
-
2021
- 2021-04-15 US US17/301,813 patent/US11688408B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
US20210233544A1 (en) | 2021-07-29 |
CN113330515A (zh) | 2021-08-31 |
KR20210076134A (ko) | 2021-06-23 |
JP2022505789A (ja) | 2022-01-14 |
JP7279160B2 (ja) | 2023-05-22 |
EP3871215A1 (en) | 2021-09-01 |
CN113330515B (zh) | 2024-05-24 |
WO2020083727A1 (en) | 2020-04-30 |
EP3871215B1 (en) | 2023-09-13 |
US11688408B2 (en) | 2023-06-27 |
KR102630922B1 (ko) | 2024-01-30 |
BR112021007516A2 (pt) | 2021-07-27 |
CA3118121A1 (en) | 2020-04-30 |
CA3118121C (en) | 2023-10-03 |
EP3871215C0 (en) | 2023-09-13 |
EP3644313A1 (en) | 2020-04-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11373666B2 (en) | Apparatus for post-processing an audio signal using a transient location detection | |
KR102088153B1 (ko) | 향상된 스펙트럼 확장을 사용하여 양자화 잡음을 감소시키기 위한 압신 장치 및 방법 | |
RU2658892C2 (ru) | Устройство и способ для расширения диапазона частот для акустических сигналов | |
US11568883B2 (en) | Low-frequency emphasis for LPC-based coding in frequency domain | |
US11908484B2 (en) | Apparatus and method for generating an enhanced signal using independent noise-filling at random values and scaling thereupon | |
EP4362423A3 (en) | System and method for processing audio data | |
TR201908841T4 (tr) | Ses sinyali işleme. | |
MX2019001023A (es) | Reduccion de solapamiento en dominio de tiempo para bancos de filtros no uniformes que usan analisis espectral seguido por sintesis parcial. | |
MX2021004636A (es) | Codificacion de audio perceptual con division en mosaicos de tiempo/frecuencia, no uniforme, adaptativa que utiliza fusion de subbandas y reduccion de solapamiento de dominio de tiempo. | |
KR102104561B1 (ko) | 오디오 신호를 처리하기 위한 방법 및 장치 | |
CN109102823B (zh) | 一种基于子带谱熵的语音增强方法 | |
US20200013421A1 (en) | Apparatus and method for post-processing an audio signal using prediction based shaping | |
MX2018009145A (es) | Aparato y método para mejorar una transición desde una porción de señal de audio oculta hasta una porción de señal de audio subsiguiente de una señal de audio. | |
Derrien et al. | A quasi-orthogonal, invertible, and perceptually relevant time-frequency transform for audio coding | |
Füg et al. | Temporal noise shaping on MDCT subband signals for transform audio coding | |
US9911423B2 (en) | Multi-channel audio signal classifier | |
Schasse et al. | Efficient implementation of single-channel noise reduction for hearing aids using a cascaded filter-bank | |
US20240194209A1 (en) | Apparatus and method for removing undesired auditory roughness | |
US11610597B2 (en) | Anti-causal filter for audio signal processing | |
Xing et al. | An Improvement Based on Wavelet Transform Speech Denoising Threshold Function | |
WO2012134851A1 (en) | Reduced complexity transform for a low-frequency-effects channel |