BR112023000850A2 - Método e aparelho de estimativa de atraso de sinal de áudio estéreo, aparelho de codificação de áudio e meio de armazenamento legível por computador - Google Patents
Método e aparelho de estimativa de atraso de sinal de áudio estéreo, aparelho de codificação de áudio e meio de armazenamento legível por computadorInfo
- Publication number
- BR112023000850A2 BR112023000850A2 BR112023000850A BR112023000850A BR112023000850A2 BR 112023000850 A2 BR112023000850 A2 BR 112023000850A2 BR 112023000850 A BR112023000850 A BR 112023000850A BR 112023000850 A BR112023000850 A BR 112023000850A BR 112023000850 A2 BR112023000850 A2 BR 112023000850A2
- Authority
- BR
- Brazil
- Prior art keywords
- current frame
- audio signal
- stereo audio
- signal
- weighting function
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 title abstract 8
- 238000000034 method Methods 0.000 title abstract 4
- 238000001228 spectrum Methods 0.000 abstract 2
- 230000001427 coherent effect Effects 0.000 abstract 1
- 238000010276 construction Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/84—Detection of presence or absence of voice signals for discriminating voice from noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/87—Detection of discrete points within a voice signal
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/307—Frequency adjustment, e.g. tone control
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Quality & Reliability (AREA)
- Stereophonic System (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
MÉTODO E APARELHO DE ESTIMATIVA DE ATRASO DE SINAL DE ÁUDIO ESTÉREO, APARELHO DE CODIFICAÇÃO DE ÁUDIO E MEIO DE ARMAZENAMENTO LEGÍVEL POR COMPUTADOR. São divulgados um método e aparelho de estimativa de atraso de sinal de áudio estéreo. O método pode incluir: obter um quadro atual de um sinal de áudio estéreo (S401), onde o quadro atual inclui um primeiro sinal de áudio de canal e um segundo sinal de áudio de canal; e se um tipo de sinal de um sinal de ruído incluído no quadro atual for um tipo de sinal de ruído coerente, estimar uma diferença de tempo intercanal do quadro atual usando um primeiro algoritmo (S403); ou se um tipo de sinal de um sinal de ruído incluído no quadro atual for um tipo de sinal de ruído difuso, estimar uma diferença de tempo intercanal do quadro atual usando um segundo algoritmo (S403). O primeiro algoritmo inclui ponderação de um espectro de potência cruzada de domínio de frequência do quadro atual com base em uma primeira função de ponderação, o segundo algoritmo inclui ponderação de um espectro de potência cruzada de domínio de frequência do quadro atual com base em uma segunda função de ponderação, e um fator de construção da primeira função de ponderação é diferente daquele da segunda função de ponderação. Diferentes algoritmos de estimativa de ITD são usados para sinais de áudio estéreo, incluindo diferentes tipos de ruído, aprimorando a precisão de estimativa de ITD do sinal de áudio estéreo.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010700806.7A CN113948098A (zh) | 2020-07-17 | 2020-07-17 | 一种立体声音频信号时延估计方法及装置 |
PCT/CN2021/106515 WO2022012629A1 (zh) | 2020-07-17 | 2021-07-15 | 一种立体声音频信号时延估计方法及装置 |
Publications (1)
Publication Number | Publication Date |
---|---|
BR112023000850A2 true BR112023000850A2 (pt) | 2023-04-04 |
Family
ID=79326926
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
BR112023000850A BR112023000850A2 (pt) | 2020-07-17 | 2021-07-15 | Método e aparelho de estimativa de atraso de sinal de áudio estéreo, aparelho de codificação de áudio e meio de armazenamento legível por computador |
Country Status (8)
Country | Link |
---|---|
US (1) | US20230154483A1 (pt) |
EP (1) | EP4170653A4 (pt) |
JP (1) | JP2023533364A (pt) |
KR (1) | KR20230035387A (pt) |
CN (1) | CN113948098A (pt) |
BR (1) | BR112023000850A2 (pt) |
CA (1) | CA3189232A1 (pt) |
WO (1) | WO2022012629A1 (pt) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115691515A (zh) * | 2022-07-12 | 2023-02-03 | 南京拓灵智能科技有限公司 | 一种音频编解码方法及装置 |
WO2024053353A1 (ja) * | 2022-09-08 | 2024-03-14 | パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ | 信号処理装置、及び、信号処理方法 |
CN116032901A (zh) * | 2022-12-30 | 2023-04-28 | 北京天兵科技有限公司 | 多路音频数据信号采编方法、装置、系统、介质和设备 |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2004002192A1 (en) * | 2002-06-21 | 2003-12-31 | University Of Southern California | System and method for automatic room acoustic correction |
CN101848412B (zh) * | 2009-03-25 | 2012-03-21 | 华为技术有限公司 | 通道间延迟估计的方法及其装置和编码器 |
CN107479030B (zh) * | 2017-07-14 | 2020-11-17 | 重庆邮电大学 | 基于分频和改进的广义互相关双耳时延估计方法 |
CN107393549A (zh) * | 2017-07-21 | 2017-11-24 | 北京华捷艾米科技有限公司 | 时延估计方法及装置 |
RU2762302C1 (ru) * | 2018-04-05 | 2021-12-17 | Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. | Устройство, способ или компьютерная программа для оценки разности во времени между каналами |
CN110082725B (zh) * | 2019-03-12 | 2023-02-28 | 西安电子科技大学 | 基于麦克风阵列的声源定位时延估计方法、声源定位系统 |
CN109901114B (zh) * | 2019-03-28 | 2020-10-27 | 广州大学 | 一种适用于声源定位的时延估计方法 |
CN111239686B (zh) * | 2020-02-18 | 2021-12-21 | 中国科学院声学研究所 | 一种基于深度学习的双通道声源定位方法 |
-
2020
- 2020-07-17 CN CN202010700806.7A patent/CN113948098A/zh active Pending
-
2021
- 2021-07-15 KR KR1020237004478A patent/KR20230035387A/ko active Search and Examination
- 2021-07-15 CA CA3189232A patent/CA3189232A1/en active Pending
- 2021-07-15 WO PCT/CN2021/106515 patent/WO2022012629A1/zh unknown
- 2021-07-15 EP EP21842542.9A patent/EP4170653A4/en active Pending
- 2021-07-15 BR BR112023000850A patent/BR112023000850A2/pt unknown
- 2021-07-15 JP JP2023502886A patent/JP2023533364A/ja active Pending
-
2023
- 2023-01-13 US US18/154,549 patent/US20230154483A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
EP4170653A1 (en) | 2023-04-26 |
WO2022012629A1 (zh) | 2022-01-20 |
EP4170653A4 (en) | 2023-11-29 |
KR20230035387A (ko) | 2023-03-13 |
CN113948098A (zh) | 2022-01-18 |
US20230154483A1 (en) | 2023-05-18 |
CA3189232A1 (en) | 2022-01-20 |
JP2023533364A (ja) | 2023-08-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
BR112023000850A2 (pt) | Método e aparelho de estimativa de atraso de sinal de áudio estéreo, aparelho de codificação de áudio e meio de armazenamento legível por computador | |
JP7091411B2 (ja) | マルチチャネル信号の符号化方法およびエンコーダ | |
US10311881B2 (en) | Determining the inter-channel time difference of a multi-channel audio signal | |
ES2773794T3 (es) | Aparato y procedimiento para estimar una diferencia de tiempos entre canales | |
CN1748247B (zh) | 音频编码 | |
KR101670313B1 (ko) | 음원 분리를 위해 자동적으로 문턱치를 선택하는 신호 분리 시스템 및 방법 | |
BRPI0506533A (pt) | equipamento e método para a construção de um sinal de saìda multicanais ou para a geração de um sinal downmix | |
Hines et al. | Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA | |
EP3057095B1 (en) | Method and device for encoding stereo phase parameter | |
EP4220639A1 (en) | Directional loudness map based audio processing | |
AR117567A1 (es) | Aparato, método o programa de computación para estimar la diferencia de tiempo entre canales | |
JP4790318B2 (ja) | 2つの調波信号の共通源の判定方法 | |
Zirn et al. | Perception of interaural phase differences with envelope and fine structure coding strategies in bilateral cochlear implant users | |
BR112019009952A2 (pt) | aparelho e método para decompor um sinal de áudio e programa de computador | |
KR20140074918A (ko) | 직접-산란 분해 | |
BR112017018600A2 (pt) | método e aparelho para determinar parâmetro de diferença de tempo intercanal | |
JP5288148B2 (ja) | 背景雑音キャンセリング装置および方法 | |
Delgado et al. | Objective assessment of spatial audio quality using directional loudness maps | |
EP2413598B1 (en) | Method for estimating inter-channel delay and apparatus and encoder thereof | |
ES2435673T3 (es) | Modelo de calidad de audio paramétrico para servicios IPTV | |
JP2002315089A (ja) | 話者方向検出回路 | |
Schimmel et al. | On the influence of interaural differences on temporal perception of masked noise bursts | |
Ghimire | Speech intelligibility measurement on the basis of ITU-T Recommendation P. 863 | |
Seo et al. | An improved method for objective quality assessment of multichannel audio codecs | |
He et al. | Ambient Spectrum Estimation-Based Primary Ambient Extraction |