CN115881146A - Method and system for dynamic voice enhancement - Google Patents

Method and system for dynamic voice enhancement

Info

Publication number
CN115881146A
Authority
CN
China
Prior art keywords
source input
channel
gain control
control parameter
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110895493.XA
Other languages
English (en)
Chinese (zh)
Inventor
S-F.施
郑剑文
肖宜
焦其金
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Harman International Industries Inc
Original Assignee
Harman International Industries Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2021-08-05
Filing date
2021-08-05
Publication date
2023-03-31
Application filed by Harman International Industries Inc
Priority to CN202110895493.XA (CN115881146A)
Priority to JP2022110199A (JP2023024295A)
Priority to EP22184919.3A (EP4131265B1)
Priority to KR1020220088509A (KR20230021580A)
Priority to US17/879,561 (US20230040743A1)
Publication of CN115881146A
Legal status: Pending

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316 Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364 Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316 Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0324 Details of processing therefor
    • G10L21/034 Automatic adjustment
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03G CONTROL OF AMPLIFICATION
    • H03G3/00 Gain control in amplifiers or frequency changers
    • H03G3/20 Automatic control
    • H03G3/30 Automatic control in amplifiers having semiconductor devices
    • H03G3/3005 Automatic control in amplifiers having semiconductor devices in amplifiers suitable for low-frequencies, e.g. audio amplifiers
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S1/00 Two-channel systems
    • H04S1/007 Two-channel systems in which the audio signals are in digital form
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S3/00 Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008 Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R2430/00 Signal processing covered by H04R, not provided for in its groups
    • H04R2430/01 Aspects of volume control, not necessarily automatic, in sound systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/01 Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/05 Generation or adaptation of centre channel in multi-channel audio systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/13 Aspects of volume control, not necessarily automatic, in stereophonic sound systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Quality & Reliability (AREA)
  • Tone Control, Compression And Expansion, Limiting Amplitude (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Stereophonic System (AREA)
CN202110895493.XA 2021-08-05 2021-08-05 Method and system for dynamic voice enhancement Pending CN115881146A (zh)

Priority Applications (5)

Application Number Publication Number Priority Date Filing Date Title
CN202110895493.XA CN115881146A (zh) 2021-08-05 2021-08-05 Method and system for dynamic voice enhancement
JP2022110199A JP2023024295A (ja) 2021-08-05 2022-07-08 Method and system for dynamic voice enhancement
EP22184919.3A EP4131265B1 (en) 2021-08-05 2022-07-14 Method and system for dynamic voice enhancement
KR1020220088509A KR20230021580A (ko) 2021-08-05 2022-07-18 Method and system for dynamic voice enhancement
US17/879,561 US20230040743A1 (en) 2021-08-05 2022-08-02 Method and system for dynamic voice enhancement

Applications Claiming Priority (1)

Application Number Publication Number Priority Date Filing Date Title
CN202110895493.XA CN115881146A (zh) 2021-08-05 2021-08-05 Method and system for dynamic voice enhancement

Publications (1)

Publication Number Publication Date
CN115881146A (zh) 2023-03-31

Family

ID=82608415

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110895493.XA Pending CN115881146A (zh) Method and system for dynamic voice enhancement 2021-08-05 2021-08-05

Country Status (5)

Country Link
US (1) US20230040743A1
EP (1) EP4131265B1
JP (1) JP2023024295A
KR (1) KR20230021580A
CN (1) CN115881146A

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
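CN116701921A (zh) * 2023-08-08 2023-09-05 University of Electronic Science and Technology of China Time-frequency feature extraction circuit and adaptive noise suppression circuit for multi-channel time-series signals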

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN119889331A (zh) * 2023-10-24 2025-04-25 Harman International Industries Inc Method and system of intelligent dynamic voice enhancement

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001237920A (ja) * 2000-02-23 2001-08-31 Hitachi Kokusai Electric Inc Input level adjustment circuit
FI20045315L (fi) * 2004-08-30 2006-03-01 Nokia Corp Detection of voice activity in an audio signal
US7464029B2 (en) * 2005-07-22 2008-12-09 Qualcomm Incorporated Robust separation of speech signals in a noisy environment
JP2010539792A (ja) * 2007-09-12 2010-12-16 Dolby Laboratories Licensing Corporation Speech enhancement
JP5094427B2 (ja) * 2008-01-09 2012-12-12 Alpine Electronics Inc Audio reproduction method and multi-process system
US8856049B2 (en) * 2008-03-26 2014-10-07 Nokia Corporation Audio signal classification by shape parameter estimation for a plurality of audio signal samples
EP2107553B1 (en) * 2008-03-31 2011-05-18 Harman Becker Automotive Systems GmbH Method for determining barge-in
MY159890A (en) * 2008-04-18 2017-02-15 Dolby Laboratories Licensing Corp Method and apparatus for maintaining speech audibility in multi-channel audio with minimal impact on surround experience
US8831936B2 (en) * 2008-05-29 2014-09-09 Qualcomm Incorporated Systems, methods, apparatus, and computer program products for speech signal processing using spectral contrast enhancement
US8503694B2 (en) * 2008-06-24 2013-08-06 Microsoft Corporation Sound capture system for devices with two microphones
US20110058676A1 (en) * 2009-09-07 2011-03-10 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for dereverberation of multichannel signal
US9324337B2 (en) * 2009-11-17 2016-04-26 Dolby Laboratories Licensing Corporation Method and system for dialog enhancement
TWI459828B (zh) * 2010-03-08 2014-11-01 Dolby Lab Licensing Corp Method and system for determining the volume reduction ratio of speech-related channels in multi-channel audio
US8989403B2 (en) * 2010-03-09 2015-03-24 Mitsubishi Electric Corporation Noise suppression device
US8744091B2 (en) * 2010-11-12 2014-06-03 Apple Inc. Intelligibility control using ambient noise detection
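JP5604275B2 (ja) * 2010-12-02 2014-10-08 Fujitsu Ten Ltd Correlation reduction method, audio signal conversion device, and sound reproduction device
JP5762549B2 (ja) * 2011-09-15 2015-08-12 Mitsubishi Electric Corporation Dynamic range control device
WO2013118192A1 (ja) * 2012-02-10 2013-08-15 Mitsubishi Electric Corporation Noise suppression device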
WO2013184520A1 (en) * 2012-06-04 2013-12-12 Stone Troy Christopher Methods and systems for identifying content types
WO2014043024A1 (en) * 2012-09-17 2014-03-20 Dolby Laboratories Licensing Corporation Long term monitoring of transmission and voice activity patterns for regulating gain control
US10546593B2 (en) * 2017-12-04 2020-01-28 Apple Inc. Deep learning driven multi-channel filtering for speech enhancement
US11164592B1 (en) * 2019-05-09 2021-11-02 Amazon Technologies, Inc. Responsive automatic gain control

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
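CN116701921A (zh) * 2023-08-08 2023-09-05 University of Electronic Science and Technology of China Time-frequency feature extraction circuit and adaptive noise suppression circuit for multi-channel time-series signals
CN116701921B (zh) * 2023-08-08 2023-10-20 University of Electronic Science and Technology of China Adaptive noise suppression circuit for multi-channel time-series signals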

Also Published As

Publication number Publication date
KR20230021580A (ko) 2023-02-14
EP4131265A3 (en) 2023-04-19
EP4131265A2 (en) 2023-02-08
US20230040743A1 (en) 2023-02-09
JP2023024295A (ja) 2023-02-16
EP4131265B1 (en) 2025-06-11

Similar Documents

Publication Publication Date Title
US20240205629A1 (en) Processing object-based audio signals
US9311923B2 (en) Adaptive audio processing based on forensic detection of media processing history
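CN110890101B (zh) Method and apparatus for decoding based on speech enhancement metadata
BRPI0923669A2 (pt) Method, apparatus and computer program for improving speech audibility in a multi-channel audio signal
CN105284133B (zh) Apparatus and method for center signal scaling and stereophonic enhancement based on a signal-to-downmix ratio
TW202205259A (zh) Method and apparatus for compressing, and method and apparatus for decompressing, a higher-order ambisonics signal representation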
US20230040743A1 (en) Method and system for dynamic voice enhancement
US20250365552A1 (en) Binaural signal post-processing
US20240357304A1 (en) Sound Field Related Rendering
CN107771346B (zh) Internal channel processing method and apparatus for low-complexity format conversion
US9928842B1 (en) Ambience extraction from stereo signals based on least-squares approach
EP3997700B1 (en) Presentation independent mastering of audio content
CN108028988B (zh) Apparatus and method for processing internal channels for low-complexity format conversion
US20240381025A1 (en) Beamforming for a microphone array based on a steered response power transformation of audio data
US20120020483A1 (en) System and method for robust audio spatialization using frequency separation
EP4662657A1 (en) Dialog intelligibility enhancement method and system
US20250131939A1 (en) Method and System of Intelligent Dynamic Voice Enhancement
CN109036456B (zh) Source component and ambience component extraction method for stereo
US20250279106A1 (en) Audio Signal Upmixer
US20240161762A1 (en) Full-band audio signal reconstruction enabled by output from a machine learning model
US20260059259A1 (en) Ambiance Expansion System For A Vehicle
CN120690218A (zh) Echo cancellation method, apparatus, electronic device, storage medium, and vehicle
Lee et al. On-Line Monaural Ambience Extraction Algorithm for Multichannel Audio Upmixing System Based on Nonnegative Matrix Factorization

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination