MX2023008074A - Metodo y dispositivo para codificacion unificada de dominio de tiempo / dominio de frecuencia en una se?al sonora. - Google Patents
Metodo y dispositivo para codificacion unificada de dominio de tiempo / dominio de frecuencia en una se?al sonora.Info
- Publication number
- MX2023008074A MX2023008074A MX2023008074A MX2023008074A MX2023008074A MX 2023008074 A MX2023008074 A MX 2023008074A MX 2023008074 A MX2023008074 A MX 2023008074A MX 2023008074 A MX2023008074 A MX 2023008074A MX 2023008074 A MX2023008074 A MX 2023008074A
- Authority
- MX
- Mexico
- Prior art keywords
- sound signal
- domain
- coding
- input sound
- frequency
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 title abstract 9
- 238000000034 method Methods 0.000 title abstract 3
- 238000013139 quantization Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/022—Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
- G10L19/025—Detection of transients or attacks for time/frequency resolution switching
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/002—Dynamic bit allocation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/81—Detection of presence or absence of voice signals for discriminating voice from music
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
Abstract
Un método de codificación del dominio de tiempo/dominio de frecuencia unificados y un dispositivo para codificar una señal de sonido de entrada comprenden un clasificador de la señal de sonido de entrada en una de una pluralidad de categorías de señales de sonido que comprenden una categoría de tipo de señal poco clara que muestra que la naturaleza de la señal de sonido de entrada es poco clara. Uno de entre una pluralidad de submodos de codificación se selecciona para codificar la señal de sonido de entrada si la señal de sonido de entrada está clasificada en la categoría del tipo de señal poco clara. Un codificador mixto de dominio de tiempo/dominio de frecuencia codifica la señal de sonido de entrada usando el submodo de codificación seleccionado. El codificador mixto de dominio de tiempo/dominio de frecuencia comprende un selector de bandas de frecuencia y un asignador de bits para seleccionar bandas de frecuencia para cuantificar y para distribuir un presupuesto de bits disponible para cuantificación entre las bandas de frecuencia seleccionadas. También se proporcionan el decodificador y el método de decodificación de señales de sonido correspondientes.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202163135171P | 2021-01-08 | 2021-01-08 | |
PCT/CA2022/050006 WO2022147615A1 (en) | 2021-01-08 | 2022-01-05 | Method and device for unified time-domain / frequency domain coding of a sound signal |
Publications (1)
Publication Number | Publication Date |
---|---|
MX2023008074A true MX2023008074A (es) | 2023-07-18 |
Family
ID=82357063
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
MX2023008074A MX2023008074A (es) | 2021-01-08 | 2022-01-05 | Metodo y dispositivo para codificacion unificada de dominio de tiempo / dominio de frecuencia en una se?al sonora. |
Country Status (7)
Country | Link |
---|---|
EP (1) | EP4275204A1 (es) |
JP (1) | JP2024503392A (es) |
KR (1) | KR20230128541A (es) |
CN (1) | CN117178322A (es) |
CA (1) | CA3202969A1 (es) |
MX (1) | MX2023008074A (es) |
WO (1) | WO2022147615A1 (es) |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2009118044A1 (en) * | 2008-03-26 | 2009-10-01 | Nokia Corporation | An audio signal classifier |
US8428949B2 (en) * | 2008-06-30 | 2013-04-23 | Waves Audio Ltd. | Apparatus and method for classification and segmentation of audio content, based on the audio signal |
PL2301011T3 (pl) * | 2008-07-11 | 2019-03-29 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Sposób i dyskryminator do klasyfikacji różnych segmentów sygnału audio zawierającego segmenty mowy i muzyki |
US9401153B2 (en) * | 2012-10-15 | 2016-07-26 | Digimarc Corporation | Multi-mode audio recognition and auxiliary data encoding and decoding |
MX2020002972A (es) * | 2017-09-20 | 2020-07-22 | Voiceage Corp | Metodo y dispositivo para asignar un presupuesto de bits entre subtramas en un codec celp. |
-
2022
- 2022-01-05 EP EP22736474.2A patent/EP4275204A1/en active Pending
- 2022-01-05 CA CA3202969A patent/CA3202969A1/en active Pending
- 2022-01-05 KR KR1020237026813A patent/KR20230128541A/ko unknown
- 2022-01-05 MX MX2023008074A patent/MX2023008074A/es unknown
- 2022-01-05 CN CN202280009268.4A patent/CN117178322A/zh active Pending
- 2022-01-05 WO PCT/CA2022/050006 patent/WO2022147615A1/en active Application Filing
- 2022-01-05 JP JP2023541804A patent/JP2024503392A/ja active Pending
Also Published As
Publication number | Publication date |
---|---|
CN117178322A (zh) | 2023-12-05 |
JP2024503392A (ja) | 2024-01-25 |
EP4275204A1 (en) | 2023-11-15 |
KR20230128541A (ko) | 2023-09-05 |
WO2022147615A1 (en) | 2022-07-14 |
CA3202969A1 (en) | 2022-07-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
RU2224302C2 (ru) | Способ и устройство для масштабируемого кодирования/декодирования аудиосигналов | |
RU97122037A (ru) | Способ и устройство для масштабируемого кодирования/декодирования аудиосигнала | |
US7774205B2 (en) | Coding of sparse digital media spectral data | |
EP1395980B1 (en) | Audio coding | |
US7761290B2 (en) | Flexible frequency and time partitioning in perceptual transform coding of audio | |
CN100367348C (zh) | 低比特速率音频编码 | |
RU2012119783A (ru) | Способ и устройство иерархического кодирования/декодирования аудио | |
CN110895945A (zh) | 频谱包络的样本值的基于上下文的熵编码 | |
US8447591B2 (en) | Factorization of overlapping tranforms into two block transforms | |
RU2012141241A (ru) | Аудиокодер, аудиодекодер, способ кодирования и декодирования аудиоинформации и компьютерная программа, определяющая значение поддиапазона контекста на основе нормы ранее декодированных спектральных значений | |
CN1897467A (zh) | 信号编码、信号解码装置和方法、程序以及记录介质 | |
US8831959B2 (en) | Transform audio codec and methods for encoding and decoding a time segment of an audio signal | |
KR102165403B1 (ko) | 음향 신호 부호화 장치, 음향 신호 복호 장치, 음향 신호 부호화 방법 및 음향 신호 복호 방법 | |
CN101223570A (zh) | 获得用于数字媒体的高效编码的频带的频率分段 | |
MX2011000557A (es) | Metodo y aparato de codificacion y decodificacion de señal de audio/voz. | |
CN102306494A (zh) | 对音频信号编码和解码的方法和设备 | |
KR102400016B1 (ko) | 고대역 부호화방법 및 장치와 고대역 복호화 방법 및 장치 | |
TW200507467A (en) | Sacle factor based bit shifting in fine granularity scalability audio coding | |
EP3621071B1 (en) | Signal processing method and apparatus | |
CN102483924A (zh) | 使用通道间及时间冗余减少的音频信号编码 | |
MX2023008074A (es) | Metodo y dispositivo para codificacion unificada de dominio de tiempo / dominio de frecuencia en una se?al sonora. | |
KR20100073139A (ko) | 스펙트럼 계수의 서브대역 할당 방법 및 장치 | |
JP2019070823A (ja) | 音響信号符号化装置、音響信号復号装置、音響信号符号化方法および音響信号復号方法 | |
KR20210133551A (ko) | 적응형 주파수 복원 기법 기반 오디오 부호화 방법 | |
CN101833953B (zh) | 降低多描述编解码冗余度的方法和装置 |