TWI466107B - 多麥克風之穩固雜訊抑制 - Google Patents
多麥克風之穩固雜訊抑制 Download PDFInfo
- Publication number
- TWI466107B TWI466107B TW100115214A TW100115214A TWI466107B TW I466107 B TWI466107 B TW I466107B TW 100115214 A TW100115214 A TW 100115214A TW 100115214 A TW100115214 A TW 100115214A TW I466107 B TWI466107 B TW I466107B
- Authority
- TW
- Taiwan
- Prior art keywords
- noise
- sub
- module
- signal
- band
- Prior art date
Links
- 230000001629 suppression Effects 0.000 title claims description 29
- 238000000034 method Methods 0.000 claims description 31
- 238000012545 processing Methods 0.000 claims description 28
- 239000003607 modifier Substances 0.000 claims description 24
- 230000009467 reduction Effects 0.000 claims description 24
- 230000015654 memory Effects 0.000 claims description 13
- 230000005236 sound signal Effects 0.000 claims description 12
- 201000007201 aphasia Diseases 0.000 claims description 6
- 238000000605 extraction Methods 0.000 claims description 4
- 230000006978 adaptation Effects 0.000 claims 2
- 238000010276 construction Methods 0.000 claims 1
- 238000005516 engineering process Methods 0.000 description 8
- 230000008569 process Effects 0.000 description 6
- 238000010586 diagram Methods 0.000 description 5
- 238000012805 post-processing Methods 0.000 description 5
- 230000003068 static effect Effects 0.000 description 5
- 238000004891 communication Methods 0.000 description 4
- 238000009826 distribution Methods 0.000 description 4
- 230000003595 spectral effect Effects 0.000 description 4
- 238000001228 spectrum Methods 0.000 description 4
- 241000237858 Gastropoda Species 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 238000005457 optimization Methods 0.000 description 3
- 238000007781 pre-processing Methods 0.000 description 3
- 230000003044 adaptive effect Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 230000002238 attenuated effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000007667 floating Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000000873 masking effect Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/002—Damping circuit arrangements for transducers, e.g. motional feedback circuits
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B3/00—Line transmission systems
- H04B3/02—Details
- H04B3/20—Reducing echo effects or singing; Opening or closing transmitting path; Conditioning for transmission in one direction or the other
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02082—Noise filtering the noise being echo, reverberation of the speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Health & Medical Sciences (AREA)
- Quality & Reliability (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Computer Networks & Wireless Communication (AREA)
- Circuit For Audible Band Transducer (AREA)
- Soundproofing, Sound Blocking, And Sound Damping (AREA)
Description
本發明大體上係關於音訊處理,且更特定言之,係關於音訊信號之雜訊抑制處理。
本申請案主張2010年4月29日申請之名為「Multi-Microphone Noise Suppression」之美國臨時申請案序號61/329,322的優先權利。本申請案係與2010年7月8日申請之名為「Method for Jointly Optimizing Noise Reduction and Voice Quality in a Mono or Multi-Microphone System」之美國專利申請案第xx/xxx,xxx號(代理人案號PA5218US)相關。前述申請案之揭示內容以引用的方式併入本文中。
當前,存在用於減少不利音訊環境中之背景雜訊之許多方法。靜止雜訊抑制系統將靜止雜訊抑制達固定或變化數目個dB。固定抑制系統將靜止或非靜止雜訊抑制達固定數目個dB。靜止雜訊抑制器之缺點在於將不抑制非靜止雜訊,而固定抑制系統之缺點在於其必須將雜訊抑制達一保守位準,以便避免在低SNR下之話語失真。
另一形式之雜訊抑制為動態雜訊抑制。常見類型之動態雜訊抑制系統係基於信雜比(SNR)。SNR可用以判定抑制值。遺憾地,歸因於在音訊環境中存在不同雜訊類型,因而SNR單獨地不為話語失真之極好預測因子。通常,遍及給定時段之話語能量將包括字、暫停、字、暫停,等等。另外,在音訊環境中可存在靜止雜訊及動態雜訊。SNR將所有此等靜止及非靜止話語以及雜訊分量進行平均化。不考慮雜訊信號之特性之SNR之判定(僅考慮雜訊之總位準)。
為了克服先前技術之缺點,需要一種用於處理音訊信號之改良型雜訊抑制系統。
本技術提供一種穩固雜訊抑制系統,該穩固雜訊抑制系統可同時地減少一聲學信號中之雜訊分量及回音分量,同時限制話語失真之位準。該系統可自呈一近講型、手持型或其他組態之兩個或兩個以上麥克風接收聲學信號。將該等經接收聲學信號變換成蝸域副頻帶信號,且可自該等副頻帶信號減去回音分量及雜訊分量。識別該等聲學副頻帶信號中之特徵且使用該等特徵以產生一乘法遮罩。將該乘法遮罩應用於該等減去雜訊之副頻帶信號,且在時域中重新建構該等副頻帶信號。
一實施例包括一種用於執行一音訊信號中之雜訊減少之系統,該系統可包括一記憶體。儲存於該記憶體中且藉由一處理器執行之一頻率分析模組可自時域聲學信號而在一蝸域中產生副頻帶信號。儲存於該記憶體中且藉由一處理器執行之一雜訊消除模組可消除該等副頻帶信號之至少一部分。儲存於該記憶體中且藉由一處理器執行之一修改器模組可抑制該等經修改副頻帶信號中之一雜訊分量或一回音分量。儲存於該記憶體中且藉由一處理器執行之一重新建構器模組可自藉由該修改器模組提供之該等抑制分量之副頻帶信號重新建構一經修改時域信號。
亦可將雜訊減少執行為藉由具有一處理器及記憶體之一機器執行之一程序。另外,可實施一種電腦可讀儲存媒體,在該電腦可讀儲存媒體中體現有一程式,該程式可藉由一處理器執行以執行用於減少一音訊信號中之雜訊之一方法。
本技術提供一種穩固雜訊抑制系統,該穩固雜訊抑制系統可同時地減少一聲學信號中之雜訊分量及回音分量,同時限制話語失真之位準。該系統可自呈一近講型、手持型或其他組態之兩個或兩個以上麥克風接收聲學信號。將該等經接收聲學信號變換成蝸域副頻帶信號,且可自該等副頻帶信號減去回音分量及雜訊分量。識別該等聲學副頻帶信號中之特徵且使用該等特徵以產生一乘法遮罩。將該乘法遮罩應用於該等減去雜訊之副頻帶信號,且在時域中重新建構該等副頻帶信號。本技術為一動態雜訊抑制系統及非靜止雜訊抑制系統,且基於雜訊之特性及使用狀況而提供「感知上最佳」量之雜訊抑制。
經由雜訊消除與雜訊抑制之組合而執行雜訊(及回音)減少會允許音訊裝置設計之靈活性。詳言之,減法階段與乘法階段之組合係有利的,此係因為其允許在音訊裝置上麥克風置放之靈活性及使用狀況(例如,近講型/遠講型)之靈活性兩者,同時最佳化語音品質相對於雜訊抑制之總取捨。麥克風針對「近麥克風」組態可經定位成在彼此之四公分內,或針對「擴展麥克風」組態或具有兩個以上麥克風之組態的組合可經定位成相隔四公分以上。
圖1為可供使用本技術之實施例之環境的說明。使用者可充當至音訊裝置104之音訊(話語)源102。例示性音訊裝置104包括兩個麥克風:與音訊源102有關之主麥克風106,及經定位成與主麥克風106相隔一距離之副麥克風108。或者,音訊裝置104可包括單一麥克風。在又其他實施例中,音訊裝置104可包括兩個以上麥克風,諸如,三個、四個、五個、六個、七個、八個、九個、十個或甚至更多麥克風。
主麥克風106及副麥克風108可為全向麥克風。或者,實施例可利用其他形式之麥克風或聲學感測器,諸如,定向麥克風。
在麥克風106及108自音訊源102接收聲音(亦即,聲學信號)的同時,麥克風106及108亦拾取雜訊112。雖然在圖1中將雜訊110展示為來自單一位置,但雜訊110可包括來自不同於音訊源102之位置之一或多個位置的任何聲音,且可包括混響及回音。雜訊110可為靜止雜訊、非靜止雜訊,及/或靜止雜訊與非靜止雜訊之組合。
一些實施例可利用藉由兩個麥克風106及108接收之聲學信號之間的位準差(例如,能量差)。因為在近講型使用狀況下主麥克風106比副麥克風108更接近音訊源102,所以主麥克風106之強度位準較高,從而導致在(例如)話語/語音片段期間藉由主麥克風106接收之較大能量位準。
可接著使用位準差以鑑別時頻域中之話語及雜訊。另外實施例可使用能量位準差與時間延遲之組合以鑑別話語。基於雙耳提示編碼,可執行話語信號擷取或話語增強。
圖2為例示性音訊裝置104的方塊圖。在所說明實施例中,音訊裝置104包括接收器200、處理器202、主麥克風106、可選副麥克風108、音訊處理系統210,及輸出裝置206。音訊裝置104可包括音訊裝置104操作所必要之另外或其他組件。類似地,音訊裝置104可包括執行類似於或等效於圖2所描繪之功能之功能的較少組件。
處理器202可執行儲存於音訊裝置104中之記憶體(圖2中未說明)中的指令及模組,以執行本文中所描述之功能性,包括聲學信號之雜訊減少。處理器202可包括實施為處理單元之硬體及軟體,處理單元可處理針對處理器202之浮點操作及其他操作。
例示性接收器200為經組態以自通信網路接收信號之聲學感測器。在一些實施例中,接收器200可包括天線裝置。可接著將信號轉遞至音訊處理系統210,以使用本文中所描述之技術來減少雜訊,且將音訊信號提供至輸出裝置206。可在音訊裝置104之傳輸路徑及接收路徑中之一者或兩者中使用本技術。
音訊處理系統210經組態以經由主麥克風106及副麥克風108而自聲學源接收聲學信號,且處理聲學信號。處理可包括執行聲學信號內之雜訊減少。下文更詳細地論述音訊處理系統210。主麥克風106與副麥克風108可間隔開一距離,以便允許偵測主麥克風106與副麥克風108之間的能量位準差、時間差或相位差。可將藉由主麥克風106及副麥克風108接收之聲學信號轉換成電信號(亦即,主電信號及副電信號)。根據一些實施例,可藉由類比至數位轉換器(未圖示)將電信號自身轉換成數位信號以供處理。為了出於清晰目的而區別聲學信號,在本文中將藉由主麥克風106接收之聲學信號稱作主聲學信號,而在本文中將藉由副麥克風108接收之聲學信號稱作副聲學信號。可藉由音訊處理系統210處理主聲學信號及副聲學信號以產生具有改良型信雜比之信號。應注意,可僅利用主麥克風106來實踐本文中所描述之技術之實施例。
輸出裝置206為將音訊輸出提供至使用者之任何裝置。舉例而言,輸出裝置206可包括揚聲器、頭戴式耳機或手機之聽筒,或會議裝置上之揚聲器。
在各種實施例中,在主麥克風與副麥克風為緊密間隔(例如,相隔1 cm至2 cm)之全向麥克風時,可使用波束成形技術以模擬面向前及面向後之定向麥克風。可使用位準差以鑑別時頻域中之話語及雜訊,可在雜訊減少中使用位準差。
圖3為用於執行如本文中所描述之雜訊減少之例示性音訊處理系統210的方塊圖。在例示性實施例中,音訊處理系統210體現於音訊裝置104內之記憶體裝置內。音訊處理系統210可包括頻率分析模組302、特徵擷取模組304、源推斷引擎模組306、遮罩產生器模組308、雜訊消除器模組310、修改器模組312,及重新建構器模組314。音訊處理系統210可包括比圖3所說明之組件更多或更少的組件,且可將模組之功能性組合或擴充至較少或額外模組中。在圖3之各種模組之間及在本文中之其他圖中說明例示性通信線路。通信線路既不意欲限制哪些模組與其他模組以通信方式耦接,通信線路亦不意欲限制在模組之間所傳達之信號之數目及類型。
在操作中,將自主麥克風106及副麥克風108所接收之聲學信號轉換成電信號,且經由頻率分析模組302處理電信號。在藉由頻率分析模組302處理聲學信號之前,可在時域中預處理聲學信號。時域預處理可包括應用輸入限制器增益、話語時間延伸,及使用FIR或IIR濾波器進行濾波。
頻率分析模組302獲取聲學信號,且模仿藉由濾波器組模擬的蝸(例如,蝸域)之頻率分析。頻率分析模組302將主聲學信號及副聲學信號中之每一者分離成兩個或兩個以上頻率副頻帶信號。副頻帶信號為對輸入信號之濾波操作之結果,其中濾波器之頻寬窄於藉由頻率分析模組302接收之信號之頻寬。可藉由一系列級聯式複值一階IIR濾波器來實施濾波器組。或者,可將諸如短時傅立葉變換(STFT)、副頻帶濾波器組、調變式複數重疊變換、蝸模組、小波等等之其他濾波器用於頻率分析及合成。可將頻率副頻帶信號之樣本依序地分組成若干時間訊框(例如,遍及預定時段)。舉例而言,一訊框之長度可為4毫秒、8毫秒或某其他時間長度。在一些實施例中,可能根本不存在訊框。結果可包括在快速蝸變換(FCT)域中之副頻帶信號。
將副頻帶訊框信號自頻率分析模組302提供至分析路徑子系統320及信號路徑子系統330。分析路徑子系統320可處理信號以識別信號特徵、區分副頻帶信號之話語分量與雜訊分量,且產生信號修改器。信號路徑子系統330負責藉由減少主聲學信號之副頻帶信號中之雜訊而修改副頻帶信號。雜訊減少可包括應用修改器(諸如,在分析路徑子系統320中所產生之乘法增益遮罩),或藉由自副頻帶信號減去分量。雜訊減少可減少雜訊且保留副頻帶信號中之所要話語分量。
信號路徑子系統330包括雜訊消除器模組310及修改器模組312。雜訊消除器模組310自頻率分析模組302接收副頻帶訊框信號。雜訊消除器模組310可自主聲學信號之一或多個副頻帶信號減去(例如,消除)雜訊分量。因而,雜訊消除器模組310可輸出主信號中之雜訊分量之副頻帶估計及呈減去雜訊之副頻帶信號之形式的話語分量之副頻帶估計。
雜訊消除器模組310可基於源位置而藉由減法演算法來提供(例如)在具有雙麥克風組態之系統中之雜訊消除。雜訊消除器模組310亦可提供回音消除,且對揚聲器及Rx路徑非線性固有地穩固。藉由在具有很少語音品質降級或無語音品質降級的情況下執行雜訊及回音消除(例如,自主信號副頻帶減去分量),雜訊消除器模組310可增加自頻率分析模組302所接收且提供至修改器模組312及後濾波模組之副頻帶信號中之話語對雜訊比(SNR)。所執行之雜訊消除之量可取決於雜訊源之擴散性及麥克風之間的距離,雜訊源之擴散性及麥克風之間的距離兩者皆有助於麥克風之間的雜訊之相干性,其中較大相干性導致較好消除。
可以多種方式實施雜訊消除器模組310。在一些實施例中,可用單一NPNS模組實施雜訊消除器模組310。或者,雜訊消除器模組310可包括兩個或兩個以上NPNS模組,該等NPNS模組可(例如)以級聯方式予以配置。
在一些實施例中藉由雜訊消除器模組310執行之雜訊消除之一實例被揭示於2008年6月30日申請之名為「System and Method for Providing Noise Suppression Utilizing Null Processing Noise Subtraction」之美國專利申請案第12/215,980號、2009年4月13日申請之名為「Adaptive Noise Cancellation」之美國申請案第12/422,917號及2010年1月26日申請之名為「Adaptive Noise Reduction Using Level Cues」之美國申請案第12/693,998號中,該等申請案之揭示內容各自以引用的方式併入本文中。
分析路徑子系統320之特徵擷取模組304接收自藉由頻率分析模組302提供之主聲學信號及副聲學信號所導出的副頻帶訊框信號,以及NPNS模組310之輸出。特徵擷取模組304計算如下各項:副頻帶信號之訊框能量估計;主聲學信號與副聲學信號之間的麥克風間位準差(ILD)、麥克風間時間差(ITD)及麥克風間相位差(IPD);主麥克風及副麥克風之自雜訊估計;以及可藉由其他模組利用之其他單耳或雙耳特徵,諸如,麥克風信號之間的間距估計及交叉相關。特徵擷取模組304可提供至NPNS模組310之輸入且處理來自NPNS模組310之輸出。
特徵擷取模組304可產生空處理麥克風間位準差(null-processing inter-microphone level difference;NP-ILD)。可在本系統中與原始ILD互換地使用NP-ILD。可藉由特徵擷取模組304內之ILD模組來判定主麥克風與副麥克風之間的原始ILD。可藉由如下方程式以算術方式表示在一實施例中藉由ILD模組計算之ILD:
其中E1及E2分別為主麥克風106及副麥克風108之能量輸出,該等能量輸出係遍及非重疊時間間隔(「訊框」)而在每一副頻帶信號中予以計算。此方程式描述被正規化達c倍且限於範圍[-1,+1]之dB ILD。因此,當音訊源102對於E1而言接近主麥克風106且不存在雜訊時,ILD=1,但隨著添加更多雜訊,ILD將減少。
在一些狀況下,在麥克風之間的距離相對於在主麥克風與嘴之間的距離較小時,原始ILD可能不會有用於將源與擾亂項(distracter)加以鑑別,此係因為源及擾亂項兩者皆可能具有約略相等的原始ILD。為了避免關於用以將源與擾亂項加以鑑別之原始ILD的限制,可使用雜訊消除模組330之輸出以導出針對話語信號具有正值且針對雜訊分量具有小值或負值之ILD,此係因為此等者將在雜訊消除模組310之輸出處顯著地衰減。自雜訊消除模組330之輸出所導出的ILD可為空處理麥克風間位準差(NP-ILD),且藉由如下方程式以算術方式表示該ILD:
其中ENP
為NPNS之輸出能量。NP-ILD之使用允許在音訊裝置內麥克風之置放的較大靈活性。舉例而言,NP-ILD可允許以前後組態置放麥克風,該組態具有在2 cm至15 cm之間的分離距離,且在總抑制位準方面具有幾個dB之效能變化。
NPNS模組可將消除雜訊之副頻帶信號提供至特徵擷取模組304中之ILD區塊。因為可將ILD判定為NPNS輸出信號能量對副麥克風能量之比率,所以ILD常常可與空處理麥克風間位準差(NP-ILD)互換。可使用「原始ILD」以將自「原始」主麥克風信號及副麥克風信號計算ILD之狀況進行歧義消除。
判定能量位準估計及麥克風間位準差被更詳細地論述於名為「System and Method for Utilizing Inter-Microphone Level Differences for Speech Enhancement」之美國專利申請案第11/343,524號中,該申請案以引用的方式併入本文中。
源推斷引擎模組306可處理藉由特徵擷取模組304提供之訊框能量估計以計算雜訊估計且導出副頻帶信號中之雜訊及話語之模型。源推斷引擎模組306調適性地估計聲學源之屬性,諸如,NPNS模組310之輸出信號的聲學源之能譜。可利用能譜屬性以在遮罩產生器模組308中產生乘法遮罩。
源推斷引擎模組306可自特徵擷取模組304接收NP-ILD,且追蹤目標音訊源102、背景雜訊及(視情況)回音之NP-ILD機率分佈或「叢集」。
接著,連同其他聽覺提示使用此資訊,以在源與雜訊類別之間界定分類邊界。歸因於改變環境條件、音訊裝置104之移動、使用者之手及/或臉之位置、與音訊裝置104有關之其他物件,及其他因素,話語、雜訊及回音之NP-ILD分佈可隨著時間推移而變化。叢集追蹤器調適於話語或雜訊源之時變NP-ILD。
當忽略回音時,在無任何一般性損失之情況下,當源及雜訊ILD分佈非重疊時,有可能在該兩個分佈之間指定分類邊界或顯性臨限值,使得在SNR為足夠正時將信號分類為話語,或在SNR為足夠負時將信號分類為雜訊。可按照副頻帶及時間訊框將此分類判定為顯性遮罩(dominance mask),且藉由叢集追蹤器模組將此分類輸出至源推斷引擎模組306內之雜訊估計器模組。
叢集追蹤器可至少部分地基於自聲學信號所導出之聲學特徵而判定聲學特徵之全域概述,以及基於聲學特徵之全域執行估計及全域概述而判定瞬時全域分類。可更新該等全域執行估計,且基於至少該一或多個聲學特徵而導出瞬時局域分類。可接著至少部分地基於該瞬時局域分類及該一或多個聲學特徵而判定譜能量分類。
在一些實施例中,叢集追蹤器模組基於此等局域叢集及觀測而將能譜中之點分類為話語或雜訊。因而,將能譜中之每一點之局域二進位遮罩識別為話語或雜訊。
叢集追蹤器模組可按照副頻帶產生雜訊/話語分類信號且將分類提供至NPNS模組310。在一些實施例中,該分類為指示在雜訊與話語之間的區別的控制信號。雜訊消除器模組310可利用分類信號以估計經接收麥克風信號中之雜訊。在一些實施例中,可將叢集追蹤器模組之結果轉遞至源推斷引擎模組306內之雜訊估計模組。換言之,提供當前雜訊估計連同能譜中可經定位有雜訊之位置以用於處理音訊處理系統210內之雜訊信號。
藉由叢集追蹤器模組來追蹤叢集之一實例被揭示於2007年12月21日申請之名為「System and method for Adaptive Classification of Audio Sources」之美國專利申請案第12/004,897號中,該申請案之揭示內容以引用的方式併入本文中。
源推斷引擎模組306可包括一雜訊估計模組,該雜訊估計模組可自叢集追蹤器模組及雜訊消除器模組310之輸出接收雜訊/話語分類控制信號以估計雜訊N(t,w),其中t為時間點,且W表示頻率或副頻帶。將藉由雜訊估計模組判定之雜訊估計提供至遮罩產生器模組308。在一些實施例中,遮罩產生器模組308接收雜訊消除器模組310之雜訊估計輸出及叢集追蹤器模組之輸出。
源推斷引擎模組306中之雜訊估計模組可包括NP-ILD雜訊估計器及靜止雜訊估計器。可將雜訊估計(諸如)與max()運算進行組合,使得由組合式雜訊估計導致之雜訊抑制效能至少為個別雜訊估計之雜訊抑制效能。
可自顯性遮罩及雜訊消除器模組310之輸出信號能量導出NP-ILD雜訊估計。當在特定副頻帶中顯性遮罩為1(指示話語)時,使雜訊估計凍結,且當在特定副頻帶中顯性遮罩為0(指示雜訊)時,將雜訊估計設定成等於NPNS輸出信號能量。靜止雜訊估計追蹤變化得比話語通常變化得更緩慢的NPNS輸出信號之分量,且至此模組之主要輸入為NPNS輸出能量。
遮罩產生器模組308接收如藉由源推斷引擎模組306估計的副頻帶話語分量及雜訊分量之模型,且產生乘法遮罩。將乘法遮罩應用於藉由NPNS 310提供至修改器312的所估計之減去雜訊之副頻帶信號。修改器模組312使增益遮罩與藉由NPNS模組310輸出的主聲學信號之減去雜訊之副頻帶信號相乘。應用該遮罩會減少主聲學信號之副頻帶信號中雜訊分量之能量位準,且會導致雜訊減少。
藉由溫納濾波器(Wiener filter)及語音品質最佳化抑制系統來界定乘法遮罩。溫納濾波器估計可基於雜訊之功率譜密度及主聲學信號之功率譜密度。溫納濾波器基於雜訊估計而導出增益。考慮到有雜訊信號,使用所導出之增益以產生清潔話語信號之理論MMSE的估計。為了限制由於遮罩應用而導致之話語失真之量,可使用感知上導出之增益下限而在下端處限制溫納增益。
自遮罩產生器模組308所輸出之增益遮罩之值係時間及副頻帶信號相依的,且以每副頻帶為基礎而最佳化雜訊減少。雜訊減少可經受話語損失失真遵守容許臨限極限之約束。臨限極限可基於許多因素,諸如,語音品質最佳化抑制(VQOS)位準。VQOS位準為副頻帶信號中藉由雜訊減少引入之話語損失失真的估計最大臨限位準。VQOS係可調諧的且考量副頻帶信號之性質,且向系統及聲學設計者提供充分設計靈活性。將在副頻帶信號中所執行之雜訊減少之量的下限判定為經受VQOS臨限值,藉此限制副頻帶信號之話語損失失真之量。結果,當可能時可在副頻帶信號中執行大量雜訊減少,且當諸如不可接受高之話語損失失真的條件不允許大量雜訊減少時雜訊減少可較小。
在實施例中,可將副頻帶信號中之雜訊分量之能量位準減少至不小於殘餘雜訊目標位準,殘餘雜訊目標位準可為固定的或緩慢時變的。在一些實施例中,殘餘雜訊目標位準針對每一副頻帶信號係相同的;在其他實施例中,殘餘雜訊目標位準可橫越諸副頻帶而變化。此目標位準可為雜訊分量不再成為可聽到或可感知時之位準、低於用以俘獲主聲學信號之麥克風之自雜訊位準的位準,或低於在實施雜訊減少技術之系統內基頻晶片上之分量或內部雜訊門限(noise gate)之分量之雜訊門限的位準。
修改器模組312自雜訊消除器模組310接收信號路徑蝸樣本,且將自遮罩產生器308所接收之增益遮罩應用於經接收樣本。信號路徑蝸樣本可包括主聲學信號之減去雜訊之副頻帶信號。藉由溫納濾波器估計提供之遮罩可快速地變化(諸如,自訊框至訊框),且雜訊及話語估計可在諸訊框之間變化。為了幫助處理該變化,可藉由修改器312將遮罩之向上及向下時間跳越速率(slew rate)約束成在合理極限內。可使用簡單線性內插將遮罩自訊框速率內插至樣本速率,且藉由乘法雜訊抑制將遮罩應用於副頻帶信號。修改器模組312可輸出經遮罩之頻率副頻帶信號。
重新建構器模組314可將經遮罩之頻率副頻帶信號自蝸域轉換回成時域。該轉換可包括添加經遮罩之頻率副頻帶信號及相移信號。或者,該轉換可包括使經遮罩之頻率副頻帶信號與蝸頻道之反頻率相乘。一旦完成至時域之轉換,隨即可經由輸出裝置206將經合成聲學信號輸出至使用者及/或將經合成聲學信號提供至編解碼器以供編碼。
在一些實施例中,可執行經合成時域聲學信號之額外後處理。舉例而言,可在將經合成聲學信號提供至使用者之前將藉由舒適雜訊產生器產生之舒適雜訊添加至該信號。舒適雜訊可為通常不能為收聽者所辨別的均一恆定雜訊(例如,粉紅雜訊(pink noise))。可將此舒適雜訊添加至經合成聲學信號以加強可聽度之臨限值且以遮罩低位準非靜止輸出雜訊分量。在一些實施例中,可將舒適雜訊位準選擇成恰好高於可聽度之臨限值,且可由使用者設定。在一些實施例中,遮罩產生器模組308可以使用舒適雜訊之位準,以便產生將會將雜訊抑制成處於或低於舒適雜訊之位準的增益遮罩。
圖3之系統可處理藉由音訊裝置接收的若干類型之信號。可經由一或多個麥克風將該系統應用於聲學信號。該系統亦可處理經由天線或其他連接所接收的信號,諸如,數位Rx信號。
圖4及圖5包括用於執行本技術之例示性方法的流程圖。可以任何次序執行圖4及圖5之每一步驟,且圖4及圖5之方法可各自包括相較於所說明之步驟為額外的步驟或比所說明之步驟更少的步驟。
圖4為用於執行聲學信號之雜訊減少之例示性方法的流程圖。在步驟405處,可接收麥克風聲學信號。藉由麥克風106及108接收之聲學信號可各自包括話語及雜訊之至少一部分。在步驟410處,可對聲學信號執行預處理。預處理可包括將增益、等化及其他信號處理應用於聲學信號。
在步驟415處,在蝸域中產生副頻帶信號。可使用複數濾波器之級聯而自時域信號產生副頻帶信號。
在步驟420處,執行特徵擷取。特徵擷取可自用以消除雜訊分量、推斷副頻帶是否具有雜訊或回音且產生遮罩之副頻帶信號擷取特徵。執行特徵擷取係關於圖5予以更詳細地論述。
在步驟425處,執行雜訊消除。可藉由NPNS模組330對自頻率分析模組302所接收之一或多個副頻帶信號執行雜訊消除。雜訊消除可包括自主聲學信號副頻帶減去雜訊分量。在一些實施例中,可自主聲學信號副頻帶消除回音分量。可將消除雜訊(或消除回音)之信號提供至特徵擷取模組304以判定雜訊分量能量估計且將該信號提供至源推斷引擎306。
在步驟430處,可判定副頻帶之雜訊估計、回音估計及話語估計。可判定聲學信號中之每一副頻帶的每一估計及聲學音訊信號中之每一訊框的每一估計。可至少部分地自藉由源推斷引擎306接收之Rx信號判定回音。將關於特定時間訊框內之副頻帶是被判定為雜訊、話語或是回音之推斷提供至遮罩產生器模組308。
在步驟435處,產生遮罩。可藉由遮罩產生器308產生遮罩。可產生遮罩,且在每一訊框期間基於關於特定副頻帶是被判定為雜訊、話語或是回音之判定而將遮罩應用於每一副頻帶。可基於語音品質最佳化抑制(經判定為針對特定語音失真位準而最佳化的抑制位準)而產生遮罩。在步驟440處,可接著將遮罩應用於副頻帶。可藉由修改器312將遮罩應用於藉由NPNS 310輸出之副頻帶信號。可藉由修改器312將遮罩自訊框速率內插至樣本速率。
在步驟445處,自副頻帶信號重新建構時域信號。可藉由重新建構器模組314將一系列延遲及複數乘運算應用於副頻帶信號而重新建構時間頻帶信號。在步驟450處,可接著對經重新建構時域信號執行後處理。可藉由後處理器執行後處理,且後處理可包括將輸出限制器應用於經重新建構信號、應用自動增益控制,及其他後處理。在步驟455處,可接著輸出經重新建構輸出信號。
圖5為用於自音訊信號擷取特徵之例示性方法的流程圖。圖5之方法可提供針對圖4之方法之步驟420的更多細節。在步驟505處,接收副頻帶信號。特徵擷取模組304可自頻率分析模組302接收副頻帶信號且自雜訊消除器模組310接收輸出信號。在步驟510處,判定二階統計,諸如,副頻帶能量位準。可判定每一訊框之每一副頻帶的能量副頻帶位準。在步驟515處,可計算麥克風之間的交叉相關及麥克風信號之自相關。在步驟520處,判定麥克風間位準差(ILD)。在步驟525處,判定空處理麥克風間位準差(NP-ILD)。至少部分地自副頻帶信號能量及雜訊估計能量判定ILD及NP-ILD兩者。接著藉由音訊處理系統利用經擷取特徵以減少副頻帶信號中之雜訊。
上述模組(包括關於圖3所論述之模組)可包括儲存於諸如機器可讀媒體(例如,電腦可讀媒體)之儲存媒體中的指令。可藉由處理器202擷取及執行此等指令以執行本文中所論述之功能性。指令之一些實例包括軟體、程式碼及韌體。儲存媒體之一些實例包括記憶體裝置及積體電路。
雖然參考上文所詳述之較佳實施例及實例來揭示本發明,但應理解,此等實例意欲呈說明性而非限制性意義。應預期,熟習此項技術者將容易地想到修改及組合,該等修改及組合將在本發明之精神內及在以下申請專利範圍之範疇內。
102...目標音訊源
104...音訊裝置
106...主麥克風
108...副麥克風
112...雜訊
200...接收器
202...處理器
206...輸出裝置
210...音訊處理系統
302...頻率分析模組
304...特徵擷取模組
306...源推斷引擎模組
308...遮罩產生器模組
310...雜訊消除器模組/雜訊消除模組/NPNS模組
312...修改器模組
314...重新建構器模組
320...分析路徑子系統
330...信號路徑子系統
圖1為可供使用本技術之實施例之環境的說明。
圖2為例示性音訊裝置的方塊圖。
圖3為例示性音訊處理系統的方塊圖。
圖4為用於執行聲學信號之雜訊減少之例示性方法的流程圖。
圖5為用於自音訊信號擷取特徵之例示性方法的流程圖。
106...主麥克風
108...副麥克風
210...音訊處理系統
302...頻率分析模組
304...特徵擷取模組
306...源推斷引擎模組
308...遮罩產生器模組
310...雜訊消除器模組/雜訊消除模組/NPNS模組
312...修改器模組
314...重新建構器模組
320...分析路徑子系統
330...信號路徑子系統
Claims (15)
- 一種用於執行一音訊(audio)信號中之雜訊減少之系統,該系統包含:一記憶體;一頻率分析模組,該頻率分析模組儲存於該記憶體中且藉由一處理器執行以自時域聲學信號(time domain acoustic signals)而在一頻域(frequency domain)中產生若干副頻帶信號(sub-band signals);一特徵擷取器(feature extractor)模組,其儲存在記憶體中並由處理器執行,以判定該等副頻帶信號之一或多個特徵,該一或多個特徵係針對該等聲學信號之一系列訊框中之每一訊框而判定;一雜訊消除(noise cancellation)模組,該雜訊消除模組儲存於該記憶體中且藉由處理器執行以消除該等副頻帶信號之至少一部分,並產生若干雜訊消除的副頻帶信號(noise-cancelled sub-band signals);一遮罩產生器模組,其儲存於記憶體中且藉由該處理器執行以產生一遮罩,該遮罩之判定係至少部分地根據由該特徵擷取模組所判定的該一或多個特徵,及該遮罩經組態以藉由一修改器(modifier)模組應用於該等雜訊消除的副頻帶信號;該修改器模組,該修改器模組儲存於該記憶體中且藉由處理器執行以抑制該等雜訊消除的副頻帶信號中之一雜訊分量(noise component)及一回音(echo)分量中至少之 一者,以產生若干經修改的副頻帶信號;及一重新建構器(reconstructor)模組,該重新建構器模組儲存於該記憶體中且藉由處理器執行以自該等經修改的副頻帶信號重新建構一經修改時域(time domain)信號。
- 如請求項1之系統,其中該等時域聲學信號係自一音訊裝置上之一或多個麥克風信號予以接收。
- 如請求項1之系統,該特徵擷取模組經組態以控制該雜訊消除模組及該修改器模組中至少之一者之調適(adaptation)。
- 如請求項1之系統,該雜訊消除模組藉由自該等副頻帶信號減去(subtracting)一雜訊分量及一回音分量中至少之一者而消除該等副頻帶信號之至少一部分。
- 如請求項4之系統,該一或多個特徵係一空處理麥克風間位準差(null-processing inter-microphone level difference),其係在該特徵擷取模組中自該雜訊消除模組之輸出及自該等經接收副頻帶信號導出(derived)。
- 如請求項1之系統,其中至少部分地基於話語損失失真之一臨限位準、雜訊或回音抑制之一所要位準或該主聲學信號之每一副頻帶中之一估計信雜比而判定該遮罩。
- 如請求項3之系統,其中該一或多個特徵包含在一主聲學信號及一第二、第三或其他聲學信號之間的麥克風間位準差、麥克風間時間差及麥克風間相位差中至少之一者。
- 一種用於執行一音訊信號中之雜訊減少之方法,該方法 包含:藉由一處理器執行一儲存頻率分析模組以自時域聲學信號而在一頻域中產生若干副頻帶信號;藉由處理器執行一特徵擷取器模組,以判定該等副頻帶信號的一或多個特徵,該一或多個特徵係針對該等聲學信號之一系列訊框中之每一訊框而判定;藉由處理器執行一雜訊消除模組以消除該等副頻帶信號之至少一部分,並產生若干雜訊消除的副頻帶信號;執行一遮罩產生器模組,以產生一遮罩,該遮罩之判定係至少部分地根據由該特徵擷取模組所判定的該一或多個特徵,及該遮罩經組態以藉由一修改器模組應用於該等雜訊消除的副頻帶信號;藉由處理器執行該修改器模組以抑制該等雜訊消除的副頻帶信號中之一雜訊分量及一回音分量中至少之一者,以產生若干經修改的副頻帶信號;及藉由處理器執行一重新建構器模組以自該等經修改的副頻帶信號重新建構一經修改時域信號。
- 如請求項8之方法,其進一步包含自一音訊裝置上之一或多個麥克風信號接收時域聲學信號。
- 如請求項8之方法,其進一步包含控制該雜訊消除模組及該修改器模組中至少之一者之調適。
- 如請求項8之方法,其進一步包含藉由減去一雜訊分量及一回音分量中至少之一者而消除該等副頻帶信號之至少一部分。
- 如請求項11之方法,該一或多個特徵係在該特徵擷取模組中自該雜訊消除模組之輸出及自該等經接收副頻帶信號導出。
- 如請求項8之方法,其中至少部分地基於話語損失失真之一臨限位準、雜訊或回音抑制之一所要位準或該主聲學信號之每一副頻帶中之一估計信雜比而判定該遮罩。
- 如請求項10之方法,其中該一或多個特徵包含在一主聲學信號及一第二、第三或其他聲學信號之間的麥克風間位準差、麥克風間時間差及麥克風間相位差中至少之一者。
- 一種非暫態電腦可讀儲存媒體,在該電腦可讀儲存媒體上體現有(embodied)一程式,該程式可藉由一處理器執行以執行用於減少一音訊信號中之雜訊之一方法,該方法包含:自時域聲學信號而在一頻域中產生若干副頻帶信號;判定該等副頻帶信號之一或多個特徵,該一或多個特徵係針對該等聲學信號之一系列訊框中之每一訊框而判定;消除該等副頻帶信號之至少一部分,以產生雜訊消除的副頻帶信號;產生一遮罩,該遮罩之判定係至少部分地根據由該特徵擷取模組所判定的該一或多個特徵,及該遮罩經組態以藉由一修改器模組應用於由該雜訊消除模組輸出的副頻帶信號; 抑制該等雜訊消除的副頻帶信號中之一雜訊分量及一回音分量中至少之一者,以產生若干經修改的副頻帶信號;及自該等經修改的副頻帶信號重新建構一經修改時域信號。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US32932210P | 2010-04-29 | 2010-04-29 | |
US12/832,920 US8538035B2 (en) | 2010-04-29 | 2010-07-08 | Multi-microphone robust noise suppression |
Publications (2)
Publication Number | Publication Date |
---|---|
TW201205560A TW201205560A (en) | 2012-02-01 |
TWI466107B true TWI466107B (zh) | 2014-12-21 |
Family
ID=44861918
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
TW100115214A TWI466107B (zh) | 2010-04-29 | 2011-04-29 | 多麥克風之穩固雜訊抑制 |
Country Status (5)
Country | Link |
---|---|
US (2) | US8538035B2 (zh) |
JP (1) | JP2013527493A (zh) |
KR (1) | KR20130108063A (zh) |
TW (1) | TWI466107B (zh) |
WO (1) | WO2011137258A1 (zh) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9143857B2 (en) | 2010-04-19 | 2015-09-22 | Audience, Inc. | Adaptively reducing noise while limiting speech loss distortion |
US9343056B1 (en) | 2010-04-27 | 2016-05-17 | Knowles Electronics, Llc | Wind noise detection and suppression |
US9431023B2 (en) | 2010-07-12 | 2016-08-30 | Knowles Electronics, Llc | Monaural noise suppression based on computational auditory scene analysis |
US9438992B2 (en) | 2010-04-29 | 2016-09-06 | Knowles Electronics, Llc | Multi-microphone robust noise suppression |
US10045140B2 (en) | 2015-01-07 | 2018-08-07 | Knowles Electronics, Llc | Utilizing digital microphones for low power keyword detection and noise suppression |
Families Citing this family (75)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8949120B1 (en) | 2006-05-25 | 2015-02-03 | Audience, Inc. | Adaptive noise cancelation |
US9558755B1 (en) | 2010-05-20 | 2017-01-31 | Knowles Electronics, Llc | Noise suppression assisted automatic speech recognition |
KR101702561B1 (ko) * | 2010-08-30 | 2017-02-03 | 삼성전자 주식회사 | 음원출력장치 및 이를 제어하는 방법 |
US8682006B1 (en) | 2010-10-20 | 2014-03-25 | Audience, Inc. | Noise suppression based on null coherence |
WO2012107561A1 (en) * | 2011-02-10 | 2012-08-16 | Dolby International Ab | Spatial adaptation in multi-microphone sound capture |
US10418047B2 (en) * | 2011-03-14 | 2019-09-17 | Cochlear Limited | Sound processing with increased noise suppression |
US8724823B2 (en) | 2011-05-20 | 2014-05-13 | Google Inc. | Method and apparatus for reducing noise pumping due to noise suppression and echo control interaction |
US9881616B2 (en) * | 2012-06-06 | 2018-01-30 | Qualcomm Incorporated | Method and systems having improved speech recognition |
US8884150B2 (en) * | 2012-08-03 | 2014-11-11 | The Penn State Research Foundation | Microphone array transducer for acoustical musical instrument |
US9264524B2 (en) | 2012-08-03 | 2016-02-16 | The Penn State Research Foundation | Microphone array transducer for acoustic musical instrument |
CN102801861B (zh) * | 2012-08-07 | 2015-08-19 | 歌尔声学股份有限公司 | 一种应用于手机的语音增强方法和装置 |
US9640194B1 (en) | 2012-10-04 | 2017-05-02 | Knowles Electronics, Llc | Noise suppression for speech processing based on machine-learning mask estimation |
US9100466B2 (en) * | 2013-05-13 | 2015-08-04 | Intel IP Corporation | Method for processing an audio signal and audio receiving circuit |
US20180317019A1 (en) | 2013-05-23 | 2018-11-01 | Knowles Electronics, Llc | Acoustic activity detecting microphone |
US9508345B1 (en) | 2013-09-24 | 2016-11-29 | Knowles Electronics, Llc | Continuous voice sensing |
US9953634B1 (en) | 2013-12-17 | 2018-04-24 | Knowles Electronics, Llc | Passive training for automatic speech recognition |
CN103915102B (zh) * | 2014-03-12 | 2017-01-18 | 哈尔滨工程大学 | 一种lfm水声多途信号的噪声抑制方法 |
US9437188B1 (en) | 2014-03-28 | 2016-09-06 | Knowles Electronics, Llc | Buffered reprocessing for multi-microphone automatic speech recognition assist |
WO2016033364A1 (en) | 2014-08-28 | 2016-03-03 | Audience, Inc. | Multi-sourced noise suppression |
WO2016050780A1 (en) * | 2014-10-02 | 2016-04-07 | Sony Corporation | Method, apparatus and system |
US9311928B1 (en) | 2014-11-06 | 2016-04-12 | Vocalzoom Systems Ltd. | Method and system for noise reduction and speech enhancement |
US9648419B2 (en) | 2014-11-12 | 2017-05-09 | Motorola Solutions, Inc. | Apparatus and method for coordinating use of different microphones in a communication device |
US9712915B2 (en) | 2014-11-25 | 2017-07-18 | Knowles Electronics, Llc | Reference microphone for non-linear and time variant echo cancellation |
DE112016000545B4 (de) * | 2015-01-30 | 2019-08-22 | Knowles Electronics, Llc | Kontextabhängiges schalten von mikrofonen |
US10186276B2 (en) * | 2015-09-25 | 2019-01-22 | Qualcomm Incorporated | Adaptive noise suppression for super wideband music |
US10403259B2 (en) | 2015-12-04 | 2019-09-03 | Knowles Electronics, Llc | Multi-microphone feedforward active noise cancellation |
WO2017123814A1 (en) * | 2016-01-14 | 2017-07-20 | Knowles Electronics, Llc | Systems and methods for assisting automatic speech recognition |
US9756421B2 (en) * | 2016-01-22 | 2017-09-05 | Mediatek Inc. | Audio refocusing methods and electronic devices utilizing the same |
US10509626B2 (en) | 2016-02-22 | 2019-12-17 | Sonos, Inc | Handling of loss of pairing between networked devices |
US10264030B2 (en) | 2016-02-22 | 2019-04-16 | Sonos, Inc. | Networked microphone device control |
US10095470B2 (en) | 2016-02-22 | 2018-10-09 | Sonos, Inc. | Audio response playback |
US9826306B2 (en) | 2016-02-22 | 2017-11-21 | Sonos, Inc. | Default playback device designation |
US9838737B2 (en) * | 2016-05-05 | 2017-12-05 | Google Inc. | Filtering wind noises in video content |
US10134399B2 (en) | 2016-07-15 | 2018-11-20 | Sonos, Inc. | Contextualization of voice inputs |
US10115400B2 (en) | 2016-08-05 | 2018-10-30 | Sonos, Inc. | Multiple voice services |
US10181323B2 (en) | 2016-10-19 | 2019-01-15 | Sonos, Inc. | Arbitration-based voice recognition |
EP3542548A1 (en) * | 2016-11-21 | 2019-09-25 | Harman Becker Automotive Systems GmbH | Beamsteering |
WO2018148095A1 (en) | 2017-02-13 | 2018-08-16 | Knowles Electronics, Llc | Soft-talk audio capture for mobile devices |
US10468020B2 (en) * | 2017-06-06 | 2019-11-05 | Cypress Semiconductor Corporation | Systems and methods for removing interference for audio pattern recognition |
US10475449B2 (en) | 2017-08-07 | 2019-11-12 | Sonos, Inc. | Wake-word detection suppression |
US10048930B1 (en) | 2017-09-08 | 2018-08-14 | Sonos, Inc. | Dynamic computation of system response volume |
US10446165B2 (en) * | 2017-09-27 | 2019-10-15 | Sonos, Inc. | Robust short-time fourier transform acoustic echo cancellation during audio playback |
US10482868B2 (en) | 2017-09-28 | 2019-11-19 | Sonos, Inc. | Multi-channel acoustic echo cancellation |
US10621981B2 (en) | 2017-09-28 | 2020-04-14 | Sonos, Inc. | Tone interference cancellation |
US10466962B2 (en) | 2017-09-29 | 2019-11-05 | Sonos, Inc. | Media playback system with voice assistance |
WO2019143759A1 (en) | 2018-01-18 | 2019-07-25 | Knowles Electronics, Llc | Data driven echo cancellation and suppression |
KR102088222B1 (ko) * | 2018-01-25 | 2020-03-16 | 서강대학교 산학협력단 | 분산도 마스크를 이용한 음원 국지화 방법 및 음원 국지화 장치 |
US10755728B1 (en) * | 2018-02-27 | 2020-08-25 | Amazon Technologies, Inc. | Multichannel noise cancellation using frequency domain spectrum masking |
CN108564963B (zh) * | 2018-04-23 | 2019-10-18 | 百度在线网络技术(北京)有限公司 | 用于增强语音的方法和装置 |
US11175880B2 (en) | 2018-05-10 | 2021-11-16 | Sonos, Inc. | Systems and methods for voice-assisted media content selection |
US10959029B2 (en) | 2018-05-25 | 2021-03-23 | Sonos, Inc. | Determining and adapting to changes in microphone performance of playback devices |
US11076035B2 (en) | 2018-08-28 | 2021-07-27 | Sonos, Inc. | Do not disturb feature for audio notifications |
US10587430B1 (en) | 2018-09-14 | 2020-03-10 | Sonos, Inc. | Networked devices, systems, and methods for associating playback devices based on sound codes |
US11024331B2 (en) | 2018-09-21 | 2021-06-01 | Sonos, Inc. | Voice detection optimization using sound metadata |
US11100923B2 (en) | 2018-09-28 | 2021-08-24 | Sonos, Inc. | Systems and methods for selective wake word detection using neural network models |
US11899519B2 (en) | 2018-10-23 | 2024-02-13 | Sonos, Inc. | Multiple stage network microphone device with reduced power consumption and processing load |
US11183183B2 (en) | 2018-12-07 | 2021-11-23 | Sonos, Inc. | Systems and methods of operating media playback systems having multiple voice assistant services |
US11132989B2 (en) | 2018-12-13 | 2021-09-28 | Sonos, Inc. | Networked microphone devices, systems, and methods of localized arbitration |
US10867604B2 (en) | 2019-02-08 | 2020-12-15 | Sonos, Inc. | Devices, systems, and methods for distributed voice processing |
US10964314B2 (en) * | 2019-03-22 | 2021-03-30 | Cirrus Logic, Inc. | System and method for optimized noise reduction in the presence of speech distortion using adaptive microphone array |
US11120794B2 (en) | 2019-05-03 | 2021-09-14 | Sonos, Inc. | Voice assistant persistence across multiple network microphone devices |
US11200894B2 (en) | 2019-06-12 | 2021-12-14 | Sonos, Inc. | Network microphone device with command keyword eventing |
GB2585086A (en) * | 2019-06-28 | 2020-12-30 | Nokia Technologies Oy | Pre-processing for automatic speech recognition |
US10871943B1 (en) | 2019-07-31 | 2020-12-22 | Sonos, Inc. | Noise classification for event detection |
US10764699B1 (en) | 2019-08-09 | 2020-09-01 | Bose Corporation | Managing characteristics of earpieces using controlled calibration |
CN110648679B (zh) * | 2019-09-25 | 2023-07-14 | 腾讯科技(深圳)有限公司 | 回声抑制参数的确定方法和装置、存储介质及电子装置 |
US11189286B2 (en) | 2019-10-22 | 2021-11-30 | Sonos, Inc. | VAS toggle based on device orientation |
US11200900B2 (en) | 2019-12-20 | 2021-12-14 | Sonos, Inc. | Offline voice control |
US11562740B2 (en) | 2020-01-07 | 2023-01-24 | Sonos, Inc. | Voice verification for media playback |
US11308958B2 (en) | 2020-02-07 | 2022-04-19 | Sonos, Inc. | Localized wakeword verification |
DE102020202206A1 (de) * | 2020-02-20 | 2021-08-26 | Sivantos Pte. Ltd. | Verfahren zur Unterdrückung eines Eigenrauschens einer Mikrofonanordnung |
CN115552920A (zh) * | 2020-05-08 | 2022-12-30 | 纽奥斯通讯有限公司 | 用于多麦克风信号处理的数据增强的系统和方法 |
US11482224B2 (en) | 2020-05-20 | 2022-10-25 | Sonos, Inc. | Command keywords with input detection windowing |
US11984123B2 (en) | 2020-11-12 | 2024-05-14 | Sonos, Inc. | Network device interaction by range |
US11610598B2 (en) | 2021-04-14 | 2023-03-21 | Harris Global Communications, Inc. | Voice enhancement in presence of noise |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080019548A1 (en) * | 2006-01-30 | 2008-01-24 | Audience, Inc. | System and method for utilizing omni-directional microphones for speech enhancement |
US20090067642A1 (en) * | 2007-08-13 | 2009-03-12 | Markus Buck | Noise reduction through spatial selectivity and filtering |
TW200933609A (en) * | 2008-01-28 | 2009-08-01 | Qualcomm Inc | Systems, methods, and apparatus for context processing using multiple microphones |
TW201009817A (en) * | 2008-06-30 | 2010-03-01 | Audience Inc | System and method for providing noise suppression utilizing null processing noise subtraction |
Family Cites Families (218)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3581122A (en) | 1967-10-26 | 1971-05-25 | Bell Telephone Labor Inc | All-pass filter circuit having negative resistance shunting resonant circuit |
US3989897A (en) | 1974-10-25 | 1976-11-02 | Carver R W | Method and apparatus for reducing noise content in audio signals |
US4811404A (en) | 1987-10-01 | 1989-03-07 | Motorola, Inc. | Noise suppression system |
US4910779A (en) | 1987-10-15 | 1990-03-20 | Cooper Duane H | Head diffraction compensated stereo system with optimal equalization |
IL84948A0 (en) | 1987-12-25 | 1988-06-30 | D S P Group Israel Ltd | Noise reduction system |
US5027306A (en) | 1989-05-12 | 1991-06-25 | Dattorro Jon C | Decimation filter as for a sigma-delta analog-to-digital converter |
US5050217A (en) | 1990-02-16 | 1991-09-17 | Akg Acoustics, Inc. | Dynamic noise reduction and spectral restoration system |
US5103229A (en) | 1990-04-23 | 1992-04-07 | General Electric Company | Plural-order sigma-delta analog-to-digital converters using both single-bit and multiple-bit quantization |
JPH0566795A (ja) | 1991-09-06 | 1993-03-19 | Gijutsu Kenkyu Kumiai Iryo Fukushi Kiki Kenkyusho | 雑音抑圧装置とその調整装置 |
JP3279612B2 (ja) | 1991-12-06 | 2002-04-30 | ソニー株式会社 | 雑音低減装置 |
JP3176474B2 (ja) | 1992-06-03 | 2001-06-18 | 沖電気工業株式会社 | 適応ノイズキャンセラ装置 |
US5408235A (en) | 1994-03-07 | 1995-04-18 | Intel Corporation | Second order Sigma-Delta based analog to digital converter having superior analog components and having a programmable comb filter coupled to the digital signal processor |
JP3307138B2 (ja) | 1995-02-27 | 2002-07-24 | ソニー株式会社 | 信号符号化方法及び装置、並びに信号復号化方法及び装置 |
US5828997A (en) | 1995-06-07 | 1998-10-27 | Sensimetrics Corporation | Content analyzer mixing inverse-direction-probability-weighted noise to input signal |
US5687104A (en) | 1995-11-17 | 1997-11-11 | Motorola, Inc. | Method and apparatus for generating decoupled filter parameters and implementing a band decoupled filter |
US5774562A (en) | 1996-03-25 | 1998-06-30 | Nippon Telegraph And Telephone Corp. | Method and apparatus for dereverberation |
JP3325770B2 (ja) | 1996-04-26 | 2002-09-17 | 三菱電機株式会社 | ノイズ低減回路及びノイズ低減装置及びノイズ低減方法 |
US5701350A (en) | 1996-06-03 | 1997-12-23 | Digisonix, Inc. | Active acoustic control in remote regions |
US5825898A (en) | 1996-06-27 | 1998-10-20 | Lamar Signal Processing Ltd. | System and method for adaptive interference cancelling |
US5806025A (en) | 1996-08-07 | 1998-09-08 | U S West, Inc. | Method and system for adaptive filtering of speech signals using signal-to-noise ratio to choose subband filter bank |
JPH10124088A (ja) | 1996-10-24 | 1998-05-15 | Sony Corp | 音声帯域幅拡張装置及び方法 |
US5963651A (en) | 1997-01-16 | 1999-10-05 | Digisonix, Inc. | Adaptive acoustic attenuation system having distributed processing and shared state nodal architecture |
JP3328532B2 (ja) | 1997-01-22 | 2002-09-24 | シャープ株式会社 | デジタルデータの符号化方法 |
US6104993A (en) | 1997-02-26 | 2000-08-15 | Motorola, Inc. | Apparatus and method for rate determination in a communication system |
JP4132154B2 (ja) | 1997-10-23 | 2008-08-13 | ソニー株式会社 | 音声合成方法及び装置、並びに帯域幅拡張方法及び装置 |
US6343267B1 (en) | 1998-04-30 | 2002-01-29 | Matsushita Electric Industrial Co., Ltd. | Dimensionality reduction for speaker normalization and speaker and environment adaptation using eigenvoice techniques |
US6160265A (en) | 1998-07-13 | 2000-12-12 | Kensington Laboratories, Inc. | SMIF box cover hold down latch and box door latch actuating mechanism |
US6240386B1 (en) | 1998-08-24 | 2001-05-29 | Conexant Systems, Inc. | Speech codec employing noise classification for noise compensation |
US6539355B1 (en) | 1998-10-15 | 2003-03-25 | Sony Corporation | Signal band expanding method and apparatus and signal synthesis method and apparatus |
US6011501A (en) | 1998-12-31 | 2000-01-04 | Cirrus Logic, Inc. | Circuits, systems and methods for processing data in a one-bit format |
US6453287B1 (en) | 1999-02-04 | 2002-09-17 | Georgia-Tech Research Corporation | Apparatus and quality enhancement algorithm for mixed excitation linear predictive (MELP) and other speech coders |
US6381570B2 (en) | 1999-02-12 | 2002-04-30 | Telogy Networks, Inc. | Adaptive two-threshold method for discriminating noise from speech in a communication signal |
US6377915B1 (en) | 1999-03-17 | 2002-04-23 | Yrp Advanced Mobile Communication Systems Research Laboratories Co., Ltd. | Speech decoding using mix ratio table |
US6490556B2 (en) | 1999-05-28 | 2002-12-03 | Intel Corporation | Audio classifier for half duplex communication |
US20010044719A1 (en) | 1999-07-02 | 2001-11-22 | Mitsubishi Electric Research Laboratories, Inc. | Method and system for recognizing, indexing, and searching acoustic signals |
US6453284B1 (en) | 1999-07-26 | 2002-09-17 | Texas Tech University Health Sciences Center | Multiple voice tracking system and method |
US6480610B1 (en) | 1999-09-21 | 2002-11-12 | Sonic Innovations, Inc. | Subband acoustic feedback cancellation in hearing aids |
US7054809B1 (en) | 1999-09-22 | 2006-05-30 | Mindspeed Technologies, Inc. | Rate selection method for selectable mode vocoder |
US6326912B1 (en) | 1999-09-24 | 2001-12-04 | Akm Semiconductor, Inc. | Analog-to-digital conversion using a multi-bit analog delta-sigma modulator combined with a one-bit digital delta-sigma modulator |
US6594367B1 (en) | 1999-10-25 | 2003-07-15 | Andrea Electronics Corporation | Super directional beamforming design and implementation |
US6757395B1 (en) | 2000-01-12 | 2004-06-29 | Sonic Innovations, Inc. | Noise reduction apparatus and method |
US20010046304A1 (en) | 2000-04-24 | 2001-11-29 | Rast Rodger H. | System and method for selective control of acoustic isolation in headsets |
JP2001318694A (ja) | 2000-05-10 | 2001-11-16 | Toshiba Corp | 信号処理装置、信号処理方法および記録媒体 |
US7346176B1 (en) | 2000-05-11 | 2008-03-18 | Plantronics, Inc. | Auto-adjust noise canceling microphone with position sensor |
US6377637B1 (en) | 2000-07-12 | 2002-04-23 | Andrea Electronics Corporation | Sub-band exponential smoothing noise canceling system |
US6782253B1 (en) | 2000-08-10 | 2004-08-24 | Koninklijke Philips Electronics N.V. | Mobile micro portal |
CN100337405C (zh) | 2000-08-11 | 2007-09-12 | 皇家菲利浦电子有限公司 | 用于同步∑△调制器的方法和装置 |
JP3566197B2 (ja) * | 2000-08-31 | 2004-09-15 | 松下電器産業株式会社 | 雑音抑圧装置及び雑音抑圧方法 |
US7472059B2 (en) | 2000-12-08 | 2008-12-30 | Qualcomm Incorporated | Method and apparatus for robust speech classification |
US20020128839A1 (en) | 2001-01-12 | 2002-09-12 | Ulf Lindgren | Speech bandwidth extension |
US20020097884A1 (en) | 2001-01-25 | 2002-07-25 | Cairns Douglas A. | Variable noise reduction algorithm based on vehicle conditions |
DE50104998D1 (de) | 2001-05-11 | 2005-02-03 | Siemens Ag | Verfahren zur erweiterung der bandbreite eines schmalbandig gefilterten sprachsignals, insbesondere eines von einem telekommunikationsgerät gesendeten sprachsignals |
US6675164B2 (en) | 2001-06-08 | 2004-01-06 | The Regents Of The University Of California | Parallel object-oriented data mining system |
CN1326415C (zh) | 2001-06-26 | 2007-07-11 | 诺基亚公司 | 对音频信号进行代码变换的方法、码变换器、网元、无线通信网和通信系统 |
US6876859B2 (en) | 2001-07-18 | 2005-04-05 | Trueposition, Inc. | Method for estimating TDOA and FDOA in a wireless location system |
CA2354808A1 (en) | 2001-08-07 | 2003-02-07 | King Tam | Sub-band adaptive signal processing in an oversampled filterbank |
US6988066B2 (en) | 2001-10-04 | 2006-01-17 | At&T Corp. | Method of bandwidth extension for narrow-band speech |
US6895375B2 (en) | 2001-10-04 | 2005-05-17 | At&T Corp. | System for bandwidth extension of Narrow-band speech |
JP3870193B2 (ja) | 2001-11-29 | 2007-01-17 | コーディング テクノロジーズ アクチボラゲット | 高周波再構成に用いる符号器、復号器、方法及びコンピュータプログラム |
US8098844B2 (en) | 2002-02-05 | 2012-01-17 | Mh Acoustics, Llc | Dual-microphone spatial noise suppression |
US7050783B2 (en) | 2002-02-22 | 2006-05-23 | Kyocera Wireless Corp. | Accessory detection system |
WO2003084103A1 (en) | 2002-03-22 | 2003-10-09 | Georgia Tech Research Corporation | Analog audio enhancement system using a noise suppression algorithm |
GB2387008A (en) | 2002-03-28 | 2003-10-01 | Qinetiq Ltd | Signal Processing System |
US7072834B2 (en) | 2002-04-05 | 2006-07-04 | Intel Corporation | Adapting to adverse acoustic environment in speech processing using playback training data |
US7065486B1 (en) | 2002-04-11 | 2006-06-20 | Mindspeed Technologies, Inc. | Linear prediction based noise suppression |
DK1359787T3 (en) | 2002-04-25 | 2015-04-20 | Gn Resound As | Fitting method and hearing prosthesis which is based on signal to noise ratio loss of data |
US7319959B1 (en) | 2002-05-14 | 2008-01-15 | Audience, Inc. | Multi-source phoneme classification for noise-robust automatic speech recognition |
US7257231B1 (en) | 2002-06-04 | 2007-08-14 | Creative Technology Ltd. | Stream segregation for stereo signals |
CA2493105A1 (en) | 2002-07-19 | 2004-01-29 | British Telecommunications Public Limited Company | Method and system for classification of semantic content of audio/video data |
AU2003252143A1 (en) | 2002-08-29 | 2004-03-19 | Bae Systems Information And Electronic Systems Integration, Inc. | Method for separating interferering signals and computing arrival angles |
US7574352B2 (en) | 2002-09-06 | 2009-08-11 | Massachusetts Institute Of Technology | 2-D processing of speech |
US7283956B2 (en) | 2002-09-18 | 2007-10-16 | Motorola, Inc. | Noise suppression |
US7657427B2 (en) | 2002-10-11 | 2010-02-02 | Nokia Corporation | Methods and devices for source controlled variable bit-rate wideband speech coding |
KR100477699B1 (ko) | 2003-01-15 | 2005-03-18 | 삼성전자주식회사 | 양자화 잡음 분포 조절 방법 및 장치 |
US7895036B2 (en) | 2003-02-21 | 2011-02-22 | Qnx Software Systems Co. | System for suppressing wind noise |
EP1604354A4 (en) | 2003-03-15 | 2008-04-02 | Mindspeed Tech Inc | VOICE INDEX CONTROLS FOR CELP LANGUAGE CODING |
GB2401744B (en) | 2003-05-14 | 2006-02-15 | Ultra Electronics Ltd | An adaptive control unit with feedback compensation |
WO2005004113A1 (ja) | 2003-06-30 | 2005-01-13 | Fujitsu Limited | オーディオ符号化装置 |
US7245767B2 (en) | 2003-08-21 | 2007-07-17 | Hewlett-Packard Development Company, L.P. | Method and apparatus for object identification, classification or verification |
US7516067B2 (en) | 2003-08-25 | 2009-04-07 | Microsoft Corporation | Method and apparatus using harmonic-model-based front end for robust speech recognition |
CA2452945C (en) | 2003-09-23 | 2016-05-10 | Mcmaster University | Binaural adaptive hearing system |
US20050075866A1 (en) | 2003-10-06 | 2005-04-07 | Bernard Widrow | Speech enhancement in the presence of background noise |
US7461003B1 (en) | 2003-10-22 | 2008-12-02 | Tellabs Operations, Inc. | Methods and apparatus for improving the quality of speech signals |
AU2003274864A1 (en) | 2003-10-24 | 2005-05-11 | Nokia Corpration | Noise-dependent postfiltering |
US7672693B2 (en) | 2003-11-10 | 2010-03-02 | Nokia Corporation | Controlling method, secondary unit and radio terminal equipment |
US7725314B2 (en) | 2004-02-16 | 2010-05-25 | Microsoft Corporation | Method and apparatus for constructing a speech filter using estimates of clean speech and noise |
US8438026B2 (en) | 2004-02-18 | 2013-05-07 | Nuance Communications, Inc. | Method and system for generating training data for an automatic speech recognizer |
DE602004004242T2 (de) | 2004-03-19 | 2008-06-05 | Harman Becker Automotive Systems Gmbh | System und Verfahren zur Verbesserung eines Audiosignals |
KR101149571B1 (ko) | 2004-04-28 | 2012-05-29 | 코닌클리케 필립스 일렉트로닉스 엔.브이. | 적응성 빔 형성기, 사이드로브 소거기, 핸즈프리 음성 통신 디바이스 |
US8712768B2 (en) | 2004-05-25 | 2014-04-29 | Nokia Corporation | System and method for enhanced artificial bandwidth expansion |
US7254535B2 (en) | 2004-06-30 | 2007-08-07 | Motorola, Inc. | Method and apparatus for equalizing a speech signal generated within a pressurized air delivery system |
US20060089836A1 (en) | 2004-10-21 | 2006-04-27 | Motorola, Inc. | System and method of signal pre-conditioning with adaptive spectral tilt compensation for audio equalization |
US7469155B2 (en) | 2004-11-29 | 2008-12-23 | Cisco Technology, Inc. | Handheld communications device with automatic alert mode selection |
GB2422237A (en) | 2004-12-21 | 2006-07-19 | Fluency Voice Technology Ltd | Dynamic coefficients determined from temporally adjacent speech frames |
US8170221B2 (en) | 2005-03-21 | 2012-05-01 | Harman Becker Automotive Systems Gmbh | Audio enhancement system and method |
CA2603255C (en) | 2005-04-01 | 2015-06-23 | Qualcomm Incorporated | Systems, methods, and apparatus for wideband speech coding |
US8249861B2 (en) | 2005-04-20 | 2012-08-21 | Qnx Software Systems Limited | High frequency compression integration |
US7813931B2 (en) | 2005-04-20 | 2010-10-12 | QNX Software Systems, Co. | System for improving speech quality and intelligibility with bandwidth compression/expansion |
US8280730B2 (en) | 2005-05-25 | 2012-10-02 | Motorola Mobility Llc | Method and apparatus of increasing speech intelligibility in noisy environments |
US20070005351A1 (en) | 2005-06-30 | 2007-01-04 | Sathyendra Harsha M | Method and system for bandwidth expansion for voice communications |
US8112272B2 (en) | 2005-08-11 | 2012-02-07 | Asashi Kasei Kabushiki Kaisha | Sound source separation device, speech recognition device, mobile telephone, sound source separation method, and program |
KR101116363B1 (ko) | 2005-08-11 | 2012-03-09 | 삼성전자주식회사 | 음성신호 분류방법 및 장치, 및 이를 이용한 음성신호부호화방법 및 장치 |
US20070041589A1 (en) | 2005-08-17 | 2007-02-22 | Gennum Corporation | System and method for providing environmental specific noise reduction algorithms |
US8326614B2 (en) | 2005-09-02 | 2012-12-04 | Qnx Software Systems Limited | Speech enhancement system |
EP1760696B1 (en) | 2005-09-03 | 2016-02-03 | GN ReSound A/S | Method and apparatus for improved estimation of non-stationary noise for speech enhancement |
US20070053522A1 (en) | 2005-09-08 | 2007-03-08 | Murray Daniel J | Method and apparatus for directional enhancement of speech elements in noisy environments |
WO2007028250A2 (en) | 2005-09-09 | 2007-03-15 | Mcmaster University | Method and device for binaural signal enhancement |
JP4742226B2 (ja) | 2005-09-28 | 2011-08-10 | 国立大学法人九州大学 | 能動消音制御装置及び方法 |
EP1772855B1 (en) | 2005-10-07 | 2013-09-18 | Nuance Communications, Inc. | Method for extending the spectral bandwidth of a speech signal |
US7813923B2 (en) | 2005-10-14 | 2010-10-12 | Microsoft Corporation | Calibration based beamforming, non-linear adaptive filtering, and multi-sensor headset |
JP4702372B2 (ja) * | 2005-10-26 | 2011-06-15 | 日本電気株式会社 | エコー抑圧方法及び装置 |
US7546237B2 (en) | 2005-12-23 | 2009-06-09 | Qnx Software Systems (Wavemakers), Inc. | Bandwidth extension of narrowband speech |
US8345890B2 (en) * | 2006-01-05 | 2013-01-01 | Audience, Inc. | System and method for utilizing inter-microphone level differences for speech enhancement |
US8032369B2 (en) | 2006-01-20 | 2011-10-04 | Qualcomm Incorporated | Arbitrary average data rates for variable rate coders |
US8744844B2 (en) | 2007-07-06 | 2014-06-03 | Audience, Inc. | System and method for adaptive intelligent noise suppression |
US8271277B2 (en) | 2006-03-03 | 2012-09-18 | Nippon Telegraph And Telephone Corporation | Dereverberation apparatus, dereverberation method, dereverberation program, and recording medium |
EP1994788B1 (en) | 2006-03-10 | 2014-05-07 | MH Acoustics, LLC | Noise-reducing directional microphone array |
US8180067B2 (en) | 2006-04-28 | 2012-05-15 | Harman International Industries, Incorporated | System for selectively extracting components of an audio input signal |
US8204253B1 (en) | 2008-06-30 | 2012-06-19 | Audience, Inc. | Self calibration of audio device |
US20070299655A1 (en) | 2006-06-22 | 2007-12-27 | Nokia Corporation | Method, Apparatus and Computer Program Product for Providing Low Frequency Expansion of Speech |
ATE450987T1 (de) | 2006-06-23 | 2009-12-15 | Gn Resound As | Hörinstrument mit adaptiver richtsignalverarbeitung |
JP4836720B2 (ja) | 2006-09-07 | 2011-12-14 | 株式会社東芝 | ノイズサプレス装置 |
ATE487339T1 (de) | 2006-09-14 | 2010-11-15 | Lg Electronics Inc | Dialogerweiterungsverfahren |
DE102006051071B4 (de) | 2006-10-30 | 2010-12-16 | Siemens Audiologische Technik Gmbh | Pegelabhängige Geräuschreduktion |
EP1933303B1 (en) | 2006-12-14 | 2008-08-06 | Harman/Becker Automotive Systems GmbH | Speech dialog control based on signal pre-processing |
US7986794B2 (en) | 2007-01-11 | 2011-07-26 | Fortemedia, Inc. | Small array microphone apparatus and beam forming method thereof |
JP4882773B2 (ja) | 2007-02-05 | 2012-02-22 | ソニー株式会社 | 信号処理装置、信号処理方法 |
JP5401760B2 (ja) | 2007-02-05 | 2014-01-29 | ソニー株式会社 | ヘッドフォン装置、音声再生システム、音声再生方法 |
US8060363B2 (en) | 2007-02-13 | 2011-11-15 | Nokia Corporation | Audio signal encoding |
EP2118885B1 (en) | 2007-02-26 | 2012-07-11 | Dolby Laboratories Licensing Corporation | Speech enhancement in entertainment audio |
US20080208575A1 (en) | 2007-02-27 | 2008-08-28 | Nokia Corporation | Split-band encoding and decoding of an audio signal |
US7925502B2 (en) | 2007-03-01 | 2011-04-12 | Microsoft Corporation | Pitch model for noise estimation |
KR100905585B1 (ko) | 2007-03-02 | 2009-07-02 | 삼성전자주식회사 | 음성신호의 대역폭 확장 제어 방법 및 장치 |
EP1970900A1 (en) | 2007-03-14 | 2008-09-17 | Harman Becker Automotive Systems GmbH | Method and apparatus for providing a codebook for bandwidth extension of an acoustic signal |
CN101266797B (zh) | 2007-03-16 | 2011-06-01 | 展讯通信(上海)有限公司 | 语音信号后处理滤波方法 |
TWI421856B (zh) | 2007-03-19 | 2014-01-01 | Dolby Lab Licensing Corp | 使用感知模型之語音增強技術 |
US8005238B2 (en) | 2007-03-22 | 2011-08-23 | Microsoft Corporation | Robust adaptive beamforming with enhanced noise suppression |
US7873114B2 (en) | 2007-03-29 | 2011-01-18 | Motorola Mobility, Inc. | Method and apparatus for quickly detecting a presence of abrupt noise and updating a noise estimate |
US8180062B2 (en) | 2007-05-30 | 2012-05-15 | Nokia Corporation | Spatial sound zooming |
JP4455614B2 (ja) | 2007-06-13 | 2010-04-21 | 株式会社東芝 | 音響信号処理方法及び装置 |
US8428275B2 (en) | 2007-06-22 | 2013-04-23 | Sanyo Electric Co., Ltd. | Wind noise reduction device |
US8140331B2 (en) | 2007-07-06 | 2012-03-20 | Xia Lou | Feature extraction for identification and classification of audio signals |
US7817808B2 (en) | 2007-07-19 | 2010-10-19 | Alon Konchitsky | Dual adaptive structure for speech enhancement |
US7856353B2 (en) | 2007-08-07 | 2010-12-21 | Nuance Communications, Inc. | Method for processing speech signal data with reverberation filtering |
US20090043577A1 (en) | 2007-08-10 | 2009-02-12 | Ditech Networks, Inc. | Signal presence detection using bi-directional communication data |
EP2191466B1 (en) | 2007-09-12 | 2013-05-22 | Dolby Laboratories Licensing Corporation | Speech enhancement with voice clarity |
WO2009035613A1 (en) | 2007-09-12 | 2009-03-19 | Dolby Laboratories Licensing Corporation | Speech enhancement with noise level estimation adjustment |
EP2045801B1 (en) | 2007-10-01 | 2010-08-11 | Harman Becker Automotive Systems GmbH | Efficient audio signal processing in the sub-band regime, method, system and associated computer program |
JP4339929B2 (ja) | 2007-10-01 | 2009-10-07 | パナソニック株式会社 | 音源方向検知装置 |
US8107631B2 (en) | 2007-10-04 | 2012-01-31 | Creative Technology Ltd | Correlation-based method for ambience extraction from two-channel audio signals |
US20090095804A1 (en) | 2007-10-12 | 2009-04-16 | Sony Ericsson Mobile Communications Ab | Rfid for connected accessory identification and method |
US8046219B2 (en) | 2007-10-18 | 2011-10-25 | Motorola Mobility, Inc. | Robust two microphone noise suppression system |
JPWO2009051197A1 (ja) * | 2007-10-19 | 2011-03-03 | 日本電気株式会社 | エコー抑圧方法及び装置 |
US8606566B2 (en) | 2007-10-24 | 2013-12-10 | Qnx Software Systems Limited | Speech enhancement through partial speech reconstruction |
ATE456130T1 (de) | 2007-10-29 | 2010-02-15 | Harman Becker Automotive Sys | Partielle sprachrekonstruktion |
EP2058804B1 (en) | 2007-10-31 | 2016-12-14 | Nuance Communications, Inc. | Method for dereverberation of an acoustic signal and system thereof |
DE602007014382D1 (de) | 2007-11-12 | 2011-06-16 | Harman Becker Automotive Sys | Unterscheidung zwischen Vordergrundsprache und Hintergrundgeräuschen |
KR101444100B1 (ko) | 2007-11-15 | 2014-09-26 | 삼성전자주식회사 | 혼합 사운드로부터 잡음을 제거하는 방법 및 장치 |
US20090150144A1 (en) | 2007-12-10 | 2009-06-11 | Qnx Software Systems (Wavemakers), Inc. | Robust voice detector for receive-side automatic gain control |
US8175291B2 (en) | 2007-12-19 | 2012-05-08 | Qualcomm Incorporated | Systems, methods, and apparatus for multi-microphone based speech enhancement |
WO2009082302A1 (en) | 2007-12-20 | 2009-07-02 | Telefonaktiebolaget L M Ericsson (Publ) | Noise suppression method and apparatus |
US8223988B2 (en) | 2008-01-29 | 2012-07-17 | Qualcomm Incorporated | Enhanced blind source separation algorithm for highly correlated mixtures |
US8194882B2 (en) | 2008-02-29 | 2012-06-05 | Audience, Inc. | System and method for providing single microphone noise suppression fallback |
US8355511B2 (en) | 2008-03-18 | 2013-01-15 | Audience, Inc. | System and method for envelope-based acoustic echo cancellation |
US8374854B2 (en) | 2008-03-28 | 2013-02-12 | Southern Methodist University | Spatio-temporal speech enhancement technique based on generalized eigenvalue decomposition |
US9197181B2 (en) | 2008-05-12 | 2015-11-24 | Broadcom Corporation | Loudness enhancement system and method |
US8831936B2 (en) | 2008-05-29 | 2014-09-09 | Qualcomm Incorporated | Systems, methods, apparatus, and computer program products for speech signal processing using spectral contrast enhancement |
US20090315708A1 (en) | 2008-06-19 | 2009-12-24 | John Walley | Method and system for limiting audio output in audio headsets |
US9253568B2 (en) | 2008-07-25 | 2016-02-02 | Broadcom Corporation | Single-microphone wind noise suppression |
TR201810466T4 (tr) | 2008-08-05 | 2018-08-27 | Fraunhofer Ges Forschung | Özellik çıkarımı kullanılarak konuşmanın iyileştirilmesi için bir ses sinyalinin işlenmesine yönelik aparat ve yöntem. |
AU2009287421B2 (en) | 2008-08-29 | 2015-09-17 | Biamp Systems, LLC | A microphone array system and method for sound acquisition |
US8392181B2 (en) | 2008-09-10 | 2013-03-05 | Texas Instruments Incorporated | Subtraction of a shaped component of a noise reduction spectrum from a combined signal |
DK2164066T3 (da) | 2008-09-15 | 2016-06-13 | Oticon As | Støjspektrumsporing i støjende akustiske signaler |
CN102160358B (zh) | 2008-09-19 | 2015-03-11 | 杜比实验室特许公司 | 小蜂窝无线网络中的客户设备的上游信号处理 |
US8583048B2 (en) | 2008-09-25 | 2013-11-12 | Skyphy Networks Limited | Multi-hop wireless systems having noise reduction and bandwidth expansion capabilities and the methods of the same |
US20100082339A1 (en) | 2008-09-30 | 2010-04-01 | Alon Konchitsky | Wind Noise Reduction |
US20100094622A1 (en) | 2008-10-10 | 2010-04-15 | Nexidia Inc. | Feature normalization for speech and audio processing |
US8218397B2 (en) | 2008-10-24 | 2012-07-10 | Qualcomm Incorporated | Audio source proximity estimation using sensor array for noise reduction |
US8724829B2 (en) | 2008-10-24 | 2014-05-13 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for coherence detection |
US8111843B2 (en) | 2008-11-11 | 2012-02-07 | Motorola Solutions, Inc. | Compensation for nonuniform delayed group communications |
US8243952B2 (en) | 2008-12-22 | 2012-08-14 | Conexant Systems, Inc. | Microphone array calibration method and apparatus |
EP2211339B1 (en) | 2009-01-23 | 2017-05-31 | Oticon A/s | Listening system |
JP4892021B2 (ja) | 2009-02-26 | 2012-03-07 | 株式会社東芝 | 信号帯域拡張装置 |
US8359195B2 (en) | 2009-03-26 | 2013-01-22 | LI Creative Technologies, Inc. | Method and apparatus for processing audio and speech signals |
US8611553B2 (en) | 2010-03-30 | 2013-12-17 | Bose Corporation | ANR instability detection |
US8144890B2 (en) | 2009-04-28 | 2012-03-27 | Bose Corporation | ANR settings boot loading |
US8184822B2 (en) | 2009-04-28 | 2012-05-22 | Bose Corporation | ANR signal processing topology |
US8071869B2 (en) | 2009-05-06 | 2011-12-06 | Gracenote, Inc. | Apparatus and method for determining a prominent tempo of an audio work |
US8160265B2 (en) | 2009-05-18 | 2012-04-17 | Sony Computer Entertainment Inc. | Method and apparatus for enhancing the generation of three-dimensional sound in headphone devices |
US8737636B2 (en) | 2009-07-10 | 2014-05-27 | Qualcomm Incorporated | Systems, methods, apparatus, and computer-readable media for adaptive active noise cancellation |
US7769187B1 (en) | 2009-07-14 | 2010-08-03 | Apple Inc. | Communications circuits for electronic devices and accessories |
US8571231B2 (en) | 2009-10-01 | 2013-10-29 | Qualcomm Incorporated | Suppressing noise in an audio signal |
US20110099010A1 (en) | 2009-10-22 | 2011-04-28 | Broadcom Corporation | Multi-channel noise suppression system |
US8244927B2 (en) | 2009-10-27 | 2012-08-14 | Fairchild Semiconductor Corporation | Method of detecting accessories on an audio jack |
US8848935B1 (en) | 2009-12-14 | 2014-09-30 | Audience, Inc. | Low latency active noise cancellation system |
US8526628B1 (en) | 2009-12-14 | 2013-09-03 | Audience, Inc. | Low latency active noise cancellation system |
US8385559B2 (en) | 2009-12-30 | 2013-02-26 | Robert Bosch Gmbh | Adaptive digital noise canceller |
US9008329B1 (en) | 2010-01-26 | 2015-04-14 | Audience, Inc. | Noise reduction using multi-feature cluster tracker |
US8700391B1 (en) | 2010-04-01 | 2014-04-15 | Audience, Inc. | Low complexity bandwidth expansion of speech |
TWI562137B (en) | 2010-04-09 | 2016-12-11 | Dts Inc | Adaptive environmental noise compensation for audio playback |
US8606571B1 (en) | 2010-04-19 | 2013-12-10 | Audience, Inc. | Spatial selectivity noise reduction tradeoff for multi-microphone systems |
US8538035B2 (en) | 2010-04-29 | 2013-09-17 | Audience, Inc. | Multi-microphone robust noise suppression |
US8958572B1 (en) | 2010-04-19 | 2015-02-17 | Audience, Inc. | Adaptive noise cancellation for multi-microphone systems |
US8473287B2 (en) | 2010-04-19 | 2013-06-25 | Audience, Inc. | Method for jointly optimizing noise reduction and voice quality in a mono or multi-microphone system |
US8781137B1 (en) | 2010-04-27 | 2014-07-15 | Audience, Inc. | Wind noise detection and suppression |
US8447595B2 (en) | 2010-06-03 | 2013-05-21 | Apple Inc. | Echo-related decisions on automatic gain control of uplink speech signal in a communications device |
US8515089B2 (en) | 2010-06-04 | 2013-08-20 | Apple Inc. | Active noise cancellation decisions in a portable audio device |
US8447596B2 (en) | 2010-07-12 | 2013-05-21 | Audience, Inc. | Monaural noise suppression based on computational auditory scene analysis |
US8719475B2 (en) | 2010-07-13 | 2014-05-06 | Broadcom Corporation | Method and system for utilizing low power superspeed inter-chip (LP-SSIC) communications |
US8761410B1 (en) | 2010-08-12 | 2014-06-24 | Audience, Inc. | Systems and methods for multi-channel dereverberation |
US8611552B1 (en) | 2010-08-25 | 2013-12-17 | Audience, Inc. | Direction-aware active noise cancellation system |
US8447045B1 (en) | 2010-09-07 | 2013-05-21 | Audience, Inc. | Multi-microphone active noise cancellation system |
US9049532B2 (en) | 2010-10-19 | 2015-06-02 | Electronics And Telecommunications Research Instittute | Apparatus and method for separating sound source |
US8682006B1 (en) | 2010-10-20 | 2014-03-25 | Audience, Inc. | Noise suppression based on null coherence |
US8311817B2 (en) | 2010-11-04 | 2012-11-13 | Audience, Inc. | Systems and methods for enhancing voice quality in mobile device |
CN102486920A (zh) | 2010-12-06 | 2012-06-06 | 索尼公司 | 音频事件检测方法和装置 |
US9229833B2 (en) | 2011-01-28 | 2016-01-05 | Fairchild Semiconductor Corporation | Successive approximation resistor detection |
JP5817366B2 (ja) | 2011-09-12 | 2015-11-18 | 沖電気工業株式会社 | 音声信号処理装置、方法及びプログラム |
-
2010
- 2010-07-08 US US12/832,920 patent/US8538035B2/en not_active Expired - Fee Related
-
2011
- 2011-04-28 KR KR1020127027868A patent/KR20130108063A/ko not_active IP Right Cessation
- 2011-04-28 JP JP2013508256A patent/JP2013527493A/ja active Pending
- 2011-04-28 WO PCT/US2011/034373 patent/WO2011137258A1/en active Application Filing
- 2011-04-29 TW TW100115214A patent/TWI466107B/zh not_active IP Right Cessation
-
2013
- 2013-08-05 US US13/959,457 patent/US9438992B2/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080019548A1 (en) * | 2006-01-30 | 2008-01-24 | Audience, Inc. | System and method for utilizing omni-directional microphones for speech enhancement |
US20090067642A1 (en) * | 2007-08-13 | 2009-03-12 | Markus Buck | Noise reduction through spatial selectivity and filtering |
TW200933609A (en) * | 2008-01-28 | 2009-08-01 | Qualcomm Inc | Systems, methods, and apparatus for context processing using multiple microphones |
TW201009817A (en) * | 2008-06-30 | 2010-03-01 | Audience Inc | System and method for providing noise suppression utilizing null processing noise subtraction |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9143857B2 (en) | 2010-04-19 | 2015-09-22 | Audience, Inc. | Adaptively reducing noise while limiting speech loss distortion |
US9502048B2 (en) | 2010-04-19 | 2016-11-22 | Knowles Electronics, Llc | Adaptively reducing noise to limit speech distortion |
US9343056B1 (en) | 2010-04-27 | 2016-05-17 | Knowles Electronics, Llc | Wind noise detection and suppression |
US9438992B2 (en) | 2010-04-29 | 2016-09-06 | Knowles Electronics, Llc | Multi-microphone robust noise suppression |
US9431023B2 (en) | 2010-07-12 | 2016-08-30 | Knowles Electronics, Llc | Monaural noise suppression based on computational auditory scene analysis |
US10045140B2 (en) | 2015-01-07 | 2018-08-07 | Knowles Electronics, Llc | Utilizing digital microphones for low power keyword detection and noise suppression |
Also Published As
Publication number | Publication date |
---|---|
US20130322643A1 (en) | 2013-12-05 |
US20120027218A1 (en) | 2012-02-02 |
JP2013527493A (ja) | 2013-06-27 |
WO2011137258A1 (en) | 2011-11-03 |
KR20130108063A (ko) | 2013-10-02 |
TW201205560A (en) | 2012-02-01 |
US9438992B2 (en) | 2016-09-06 |
US8538035B2 (en) | 2013-09-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
TWI466107B (zh) | 多麥克風之穩固雜訊抑制 | |
US9558755B1 (en) | Noise suppression assisted automatic speech recognition | |
US9343056B1 (en) | Wind noise detection and suppression | |
US9502048B2 (en) | Adaptively reducing noise to limit speech distortion | |
JP5762956B2 (ja) | ヌル処理雑音除去を利用した雑音抑制を提供するシステム及び方法 | |
US8447596B2 (en) | Monaural noise suppression based on computational auditory scene analysis | |
US8682006B1 (en) | Noise suppression based on null coherence | |
US8718290B2 (en) | Adaptive noise reduction using level cues | |
US8606571B1 (en) | Spatial selectivity noise reduction tradeoff for multi-microphone systems | |
US8761410B1 (en) | Systems and methods for multi-channel dereverberation | |
US8712069B1 (en) | Selection of system parameters based on non-acoustic sensor information | |
CN109727604A (zh) | 用于语音识别前端的频域回声消除方法及计算机储存介质 | |
US9343073B1 (en) | Robust noise suppression system in adverse echo conditions | |
WO2009117084A2 (en) | System and method for envelope-based acoustic echo cancellation | |
Hegner et al. | A high performance low complexity noise suppression algorithm |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
MM4A | Annulment or lapse of patent due to non-payment of fees |