BR112023000850A2 - Método e aparelho de estimativa de atraso de sinal de áudio estéreo, aparelho de codificação de áudio e meio de armazenamento legível por computador - Google Patents

Método e aparelho de estimativa de atraso de sinal de áudio estéreo, aparelho de codificação de áudio e meio de armazenamento legível por computador

Info

Publication number
BR112023000850A2
BR112023000850A2 BR112023000850A BR112023000850A BR112023000850A2 BR 112023000850 A2 BR112023000850 A2 BR 112023000850A2 BR 112023000850 A BR112023000850 A BR 112023000850A BR 112023000850 A BR112023000850 A BR 112023000850A BR 112023000850 A2 BR112023000850 A2 BR 112023000850A2
Authority
BR
Brazil
Prior art keywords
current frame
audio signal
stereo audio
signal
weighting function
Prior art date
Application number
BR112023000850A
Other languages
English (en)
Inventor
Ding Jiance
Wang Zhe
Wang Bin
Xia Bingyin
Original Assignee
Huawei Tech Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Tech Co Ltd filed Critical Huawei Tech Co Ltd
Publication of BR112023000850A2 publication Critical patent/BR112023000850A2/pt

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L21/0232Processing in the frequency domain
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/84Detection of presence or absence of voice signals for discriminating voice from noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/87Detection of discrete points within a voice signal
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/307Frequency adjustment, e.g. tone control

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Quality & Reliability (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

MÉTODO E APARELHO DE ESTIMATIVA DE ATRASO DE SINAL DE ÁUDIO ESTÉREO, APARELHO DE CODIFICAÇÃO DE ÁUDIO E MEIO DE ARMAZENAMENTO LEGÍVEL POR COMPUTADOR. São divulgados um método e aparelho de estimativa de atraso de sinal de áudio estéreo. O método pode incluir: obter um quadro atual de um sinal de áudio estéreo (S401), onde o quadro atual inclui um primeiro sinal de áudio de canal e um segundo sinal de áudio de canal; e se um tipo de sinal de um sinal de ruído incluído no quadro atual for um tipo de sinal de ruído coerente, estimar uma diferença de tempo intercanal do quadro atual usando um primeiro algoritmo (S403); ou se um tipo de sinal de um sinal de ruído incluído no quadro atual for um tipo de sinal de ruído difuso, estimar uma diferença de tempo intercanal do quadro atual usando um segundo algoritmo (S403). O primeiro algoritmo inclui ponderação de um espectro de potência cruzada de domínio de frequência do quadro atual com base em uma primeira função de ponderação, o segundo algoritmo inclui ponderação de um espectro de potência cruzada de domínio de frequência do quadro atual com base em uma segunda função de ponderação, e um fator de construção da primeira função de ponderação é diferente daquele da segunda função de ponderação. Diferentes algoritmos de estimativa de ITD são usados para sinais de áudio estéreo, incluindo diferentes tipos de ruído, aprimorando a precisão de estimativa de ITD do sinal de áudio estéreo.
BR112023000850A 2020-07-17 2021-07-15 Método e aparelho de estimativa de atraso de sinal de áudio estéreo, aparelho de codificação de áudio e meio de armazenamento legível por computador BR112023000850A2 (pt)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010700806.7A CN113948098A (zh) 2020-07-17 2020-07-17 一种立体声音频信号时延估计方法及装置
PCT/CN2021/106515 WO2022012629A1 (zh) 2020-07-17 2021-07-15 一种立体声音频信号时延估计方法及装置

Publications (1)

Publication Number Publication Date
BR112023000850A2 true BR112023000850A2 (pt) 2023-04-04

Family

ID=79326926

Family Applications (1)

Application Number Title Priority Date Filing Date
BR112023000850A BR112023000850A2 (pt) 2020-07-17 2021-07-15 Método e aparelho de estimativa de atraso de sinal de áudio estéreo, aparelho de codificação de áudio e meio de armazenamento legível por computador

Country Status (8)

Country Link
US (1) US20230154483A1 (pt)
EP (1) EP4170653A4 (pt)
JP (1) JP2023533364A (pt)
KR (1) KR20230035387A (pt)
CN (1) CN113948098A (pt)
BR (1) BR112023000850A2 (pt)
CA (1) CA3189232A1 (pt)
WO (1) WO2022012629A1 (pt)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115691515A (zh) * 2022-07-12 2023-02-03 南京拓灵智能科技有限公司 一种音频编解码方法及装置
WO2024053353A1 (ja) * 2022-09-08 2024-03-14 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ 信号処理装置、及び、信号処理方法
CN116032901A (zh) * 2022-12-30 2023-04-28 北京天兵科技有限公司 多路音频数据信号采编方法、装置、系统、介质和设备

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004002192A1 (en) * 2002-06-21 2003-12-31 University Of Southern California System and method for automatic room acoustic correction
CN101848412B (zh) * 2009-03-25 2012-03-21 华为技术有限公司 通道间延迟估计的方法及其装置和编码器
CN107479030B (zh) * 2017-07-14 2020-11-17 重庆邮电大学 基于分频和改进的广义互相关双耳时延估计方法
CN107393549A (zh) * 2017-07-21 2017-11-24 北京华捷艾米科技有限公司 时延估计方法及装置
RU2762302C1 (ru) * 2018-04-05 2021-12-17 Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. Устройство, способ или компьютерная программа для оценки разности во времени между каналами
CN110082725B (zh) * 2019-03-12 2023-02-28 西安电子科技大学 基于麦克风阵列的声源定位时延估计方法、声源定位系统
CN109901114B (zh) * 2019-03-28 2020-10-27 广州大学 一种适用于声源定位的时延估计方法
CN111239686B (zh) * 2020-02-18 2021-12-21 中国科学院声学研究所 一种基于深度学习的双通道声源定位方法

Also Published As

Publication number Publication date
EP4170653A1 (en) 2023-04-26
WO2022012629A1 (zh) 2022-01-20
EP4170653A4 (en) 2023-11-29
KR20230035387A (ko) 2023-03-13
CN113948098A (zh) 2022-01-18
US20230154483A1 (en) 2023-05-18
CA3189232A1 (en) 2022-01-20
JP2023533364A (ja) 2023-08-02

Similar Documents

Publication Publication Date Title
BR112023000850A2 (pt) Método e aparelho de estimativa de atraso de sinal de áudio estéreo, aparelho de codificação de áudio e meio de armazenamento legível por computador
JP7091411B2 (ja) マルチチャネル信号の符号化方法およびエンコーダ
US10311881B2 (en) Determining the inter-channel time difference of a multi-channel audio signal
ES2773794T3 (es) Aparato y procedimiento para estimar una diferencia de tiempos entre canales
CN1748247B (zh) 音频编码
KR101670313B1 (ko) 음원 분리를 위해 자동적으로 문턱치를 선택하는 신호 분리 시스템 및 방법
BRPI0506533A (pt) equipamento e método para a construção de um sinal de saìda multicanais ou para a geração de um sinal downmix
Hines et al. Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA
EP3057095B1 (en) Method and device for encoding stereo phase parameter
EP4220639A1 (en) Directional loudness map based audio processing
AR117567A1 (es) Aparato, método o programa de computación para estimar la diferencia de tiempo entre canales
JP4790318B2 (ja) 2つの調波信号の共通源の判定方法
Zirn et al. Perception of interaural phase differences with envelope and fine structure coding strategies in bilateral cochlear implant users
BR112019009952A2 (pt) aparelho e método para decompor um sinal de áudio e programa de computador
KR20140074918A (ko) 직접-산란 분해
BR112017018600A2 (pt) método e aparelho para determinar parâmetro de diferença de tempo intercanal
JP5288148B2 (ja) 背景雑音キャンセリング装置および方法
Delgado et al. Objective assessment of spatial audio quality using directional loudness maps
EP2413598B1 (en) Method for estimating inter-channel delay and apparatus and encoder thereof
ES2435673T3 (es) Modelo de calidad de audio paramétrico para servicios IPTV
JP2002315089A (ja) 話者方向検出回路
Schimmel et al. On the influence of interaural differences on temporal perception of masked noise bursts
Ghimire Speech intelligibility measurement on the basis of ITU-T Recommendation P. 863
Seo et al. An improved method for objective quality assessment of multichannel audio codecs
He et al. Ambient Spectrum Estimation-Based Primary Ambient Extraction