MX2023008074A - Metodo y dispositivo para codificacion unificada de dominio de tiempo / dominio de frecuencia en una se?al sonora. - Google Patents

Metodo y dispositivo para codificacion unificada de dominio de tiempo / dominio de frecuencia en una se?al sonora.

Info

Publication number
MX2023008074A
MX2023008074A MX2023008074A MX2023008074A MX2023008074A MX 2023008074 A MX2023008074 A MX 2023008074A MX 2023008074 A MX2023008074 A MX 2023008074A MX 2023008074 A MX2023008074 A MX 2023008074A MX 2023008074 A MX2023008074 A MX 2023008074A
Authority
MX
Mexico
Prior art keywords
sound signal
domain
coding
input sound
frequency
Prior art date
Application number
MX2023008074A
Other languages
English (en)
Inventor
Tommy Vaillancourt
Vladimir Malenovsky
Original Assignee
Voiceage Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Voiceage Corp filed Critical Voiceage Corp
Publication of MX2023008074A publication Critical patent/MX2023008074A/es

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/022Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
    • G10L19/025Detection of transients or attacks for time/frequency resolution switching
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/002Dynamic bit allocation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/81Detection of presence or absence of voice signals for discriminating voice from music

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)

Abstract

Un método de codificación del dominio de tiempo/dominio de frecuencia unificados y un dispositivo para codificar una señal de sonido de entrada comprenden un clasificador de la señal de sonido de entrada en una de una pluralidad de categorías de señales de sonido que comprenden una categoría de tipo de señal poco clara que muestra que la naturaleza de la señal de sonido de entrada es poco clara. Uno de entre una pluralidad de submodos de codificación se selecciona para codificar la señal de sonido de entrada si la señal de sonido de entrada está clasificada en la categoría del tipo de señal poco clara. Un codificador mixto de dominio de tiempo/dominio de frecuencia codifica la señal de sonido de entrada usando el submodo de codificación seleccionado. El codificador mixto de dominio de tiempo/dominio de frecuencia comprende un selector de bandas de frecuencia y un asignador de bits para seleccionar bandas de frecuencia para cuantificar y para distribuir un presupuesto de bits disponible para cuantificación entre las bandas de frecuencia seleccionadas. También se proporcionan el decodificador y el método de decodificación de señales de sonido correspondientes.
MX2023008074A 2021-01-08 2022-01-05 Metodo y dispositivo para codificacion unificada de dominio de tiempo / dominio de frecuencia en una se?al sonora. MX2023008074A (es)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202163135171P 2021-01-08 2021-01-08
PCT/CA2022/050006 WO2022147615A1 (en) 2021-01-08 2022-01-05 Method and device for unified time-domain / frequency domain coding of a sound signal

Publications (1)

Publication Number Publication Date
MX2023008074A true MX2023008074A (es) 2023-07-18

Family

ID=82357063

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2023008074A MX2023008074A (es) 2021-01-08 2022-01-05 Metodo y dispositivo para codificacion unificada de dominio de tiempo / dominio de frecuencia en una se?al sonora.

Country Status (7)

Country Link
EP (1) EP4275204A1 (es)
JP (1) JP2024503392A (es)
KR (1) KR20230128541A (es)
CN (1) CN117178322A (es)
CA (1) CA3202969A1 (es)
MX (1) MX2023008074A (es)
WO (1) WO2022147615A1 (es)

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009118044A1 (en) * 2008-03-26 2009-10-01 Nokia Corporation An audio signal classifier
US8428949B2 (en) * 2008-06-30 2013-04-23 Waves Audio Ltd. Apparatus and method for classification and segmentation of audio content, based on the audio signal
PL2301011T3 (pl) * 2008-07-11 2019-03-29 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Sposób i dyskryminator do klasyfikacji różnych segmentów sygnału audio zawierającego segmenty mowy i muzyki
US9401153B2 (en) * 2012-10-15 2016-07-26 Digimarc Corporation Multi-mode audio recognition and auxiliary data encoding and decoding
MX2020002972A (es) * 2017-09-20 2020-07-22 Voiceage Corp Metodo y dispositivo para asignar un presupuesto de bits entre subtramas en un codec celp.

Also Published As

Publication number Publication date
CN117178322A (zh) 2023-12-05
JP2024503392A (ja) 2024-01-25
EP4275204A1 (en) 2023-11-15
KR20230128541A (ko) 2023-09-05
WO2022147615A1 (en) 2022-07-14
CA3202969A1 (en) 2022-07-14

Similar Documents

Publication Publication Date Title
RU2224302C2 (ru) Способ и устройство для масштабируемого кодирования/декодирования аудиосигналов
RU97122037A (ru) Способ и устройство для масштабируемого кодирования/декодирования аудиосигнала
US7774205B2 (en) Coding of sparse digital media spectral data
EP1395980B1 (en) Audio coding
US7761290B2 (en) Flexible frequency and time partitioning in perceptual transform coding of audio
CN100367348C (zh) 低比特速率音频编码
RU2012119783A (ru) Способ и устройство иерархического кодирования/декодирования аудио
CN110895945A (zh) 频谱包络的样本值的基于上下文的熵编码
US8447591B2 (en) Factorization of overlapping tranforms into two block transforms
RU2012141241A (ru) Аудиокодер, аудиодекодер, способ кодирования и декодирования аудиоинформации и компьютерная программа, определяющая значение поддиапазона контекста на основе нормы ранее декодированных спектральных значений
CN1897467A (zh) 信号编码、信号解码装置和方法、程序以及记录介质
US8831959B2 (en) Transform audio codec and methods for encoding and decoding a time segment of an audio signal
KR102165403B1 (ko) 음향 신호 부호화 장치, 음향 신호 복호 장치, 음향 신호 부호화 방법 및 음향 신호 복호 방법
CN101223570A (zh) 获得用于数字媒体的高效编码的频带的频率分段
MX2011000557A (es) Metodo y aparato de codificacion y decodificacion de señal de audio/voz.
CN102306494A (zh) 对音频信号编码和解码的方法和设备
KR102400016B1 (ko) 고대역 부호화방법 및 장치와 고대역 복호화 방법 및 장치
TW200507467A (en) Sacle factor based bit shifting in fine granularity scalability audio coding
EP3621071B1 (en) Signal processing method and apparatus
CN102483924A (zh) 使用通道间及时间冗余减少的音频信号编码
MX2023008074A (es) Metodo y dispositivo para codificacion unificada de dominio de tiempo / dominio de frecuencia en una se?al sonora.
KR20100073139A (ko) 스펙트럼 계수의 서브대역 할당 방법 및 장치
JP2019070823A (ja) 音響信号符号化装置、音響信号復号装置、音響信号符号化方法および音響信号復号方法
KR20210133551A (ko) 적응형 주파수 복원 기법 기반 오디오 부호화 방법
CN101833953B (zh) 降低多描述编解码冗余度的方法和装置