JP2016511594A - Method and apparatus for generating a speech signal
- Publication number
- JP2016511594A
- Authority
- JP
- Japan
- Prior art keywords
- microphone
- signal
- speech
- audio
- similarity measure
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/02—Casings; Cabinets ; Supports therefor; Mountings therein
- H04R1/025—Arrangements for fixing loudspeaker transducers, e.g. in a box, furniture
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/20—Arrangements for obtaining desired frequency or directional characteristics
- H04R1/32—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
- H04R1/40—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/20—Arrangements for obtaining desired frequency or directional characteristics
- H04R1/32—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
- H04R1/40—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
- H04R1/406—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers microphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02082—Noise filtering the noise being echo, reverberation of the speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2201/00—Details of transducers, loudspeakers or microphones covered by H04R1/00 but not provided for in any of its subgroups
- H04R2201/02—Details casings, cabinets or mounting therein for transducers covered by H04R1/02 but not provided for in any of its subgroups
- H04R2201/023—Transducers incorporated in garment, rucksacks or the like
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2420/00—Details of connection covered by H04R, not provided for in its groups
- H04R2420/07—Applications of wireless loudspeakers or wireless microphones
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Otolaryngology (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- General Health & Medical Sciences (AREA)
- Circuit For Audible Band Transducer (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Abstract
Description
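$$y_k(n) = h_k(n) * s(n) + w_k(n)$$
where $s(n)$ is the speech signal at the user's mouth, $h_k(n)$ is the acoustic transfer function between the position corresponding to the user's mouth and the position of the $k$-th microphone, and $w_k(n)$ is the noise signal, which includes both ambient noise and the microphone's own noise. Assuming that the speech and noise signals are independent, an equivalent frequency-domain representation in terms of the power spectral densities (PSDs) of the corresponding signals is given by:
$$P_{y_k}(\omega) = P_s(\omega)\,|H_k(\omega)|^2 + P_{w_k}(\omega)$$
where $P_{y_k}(\omega)$, $P_s(\omega)$ and $P_{w_k}(\omega)$ denote the PSDs of $y_k(n)$, $s(n)$ and $w_k(n)$, and $H_k(\omega)$ is the frequency response of $h_k(n)$.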
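The microphone signal model above can be sketched numerically. This is a minimal illustration, not the patent's implementation: the function names `microphone_signal` and `predicted_psd` are hypothetical, the convolution is truncated to the length of the speech segment, and the PSD helper simply applies the relation that follows from the model when speech and noise are independent.

```python
import numpy as np

def microphone_signal(s, h, w):
    """y_k(n) = (h_k * s)(n) + w_k(n), truncated to the length of s.

    s: speech at the mouth, h: impulse response to microphone k,
    w: noise at microphone k (same length as s).
    """
    s = np.asarray(s, dtype=float)
    h = np.asarray(h, dtype=float)
    w = np.asarray(w, dtype=float)
    return np.convolve(s, h)[: len(s)] + w

def predicted_psd(P_s, H, P_w):
    """PSD at microphone k for independent speech and noise:
    speech PSD shaped by |H_k|^2, plus the noise PSD."""
    return np.abs(np.asarray(H)) ** 2 * np.asarray(P_s) + np.asarray(P_w)
```

A microphone far from the mouth typically has a longer, more reverberant `h` and a smaller direct-path gain, so its signal departs further from clean speech.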
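where $y_k = [y_k(0), y_k(1), \ldots, y_k(N-1)]^T$, $a = [1, a_1, \ldots, a_M]^T$ is the given vector of LP coefficients, $M$ is the LP model order, $N$ is the number of samples in a short-time segment, $R_w$ is the autocorrelation matrix of the noise signal at the $k$-th microphone, and $R_x = g(A^T A)^{-1}$, where $A$ is the $N \times N$ lower-triangular Toeplitz matrix with $[1, a_1, a_2, \ldots, a_M, 0, \ldots, 0]^T$ as its first column, and $g$ is a gain term that compensates for the level difference between the normalized codebook spectra and the observed spectra.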
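The matrix definitions above can be sketched in code. The construction of $A$ and $R_x = g(A^T A)^{-1}$ follows the text directly. The likelihood itself is an assumption: the sketch uses a zero-mean Gaussian with covariance equal to the noise autocorrelation matrix plus $R_x$, which is consistent with the quantities defined here but is not reproduced in this text. All function names are hypothetical.

```python
import numpy as np

def lp_toeplitz(a, n):
    """N x N lower-triangular Toeplitz matrix whose first column is
    [1, a_1, ..., a_M, 0, ..., 0]^T (requires len(a) <= n)."""
    col = np.zeros(n)
    col[: len(a)] = a
    A = np.zeros((n, n))
    for j in range(n):
        A[j:, j] = col[: n - j]
    return A

def speech_covariance(a, n, g):
    """R_x = g (A^T A)^{-1} for LP coefficient vector a and gain g."""
    A = lp_toeplitz(a, n)
    return g * np.linalg.inv(A.T @ A)

def gaussian_log_likelihood(y, a, g, R_w):
    """Assumed form: log of a zero-mean Gaussian density with
    covariance R_w + R_x, evaluated at the segment y."""
    y = np.asarray(y, dtype=float)
    n = len(y)
    R = np.asarray(R_w, dtype=float) + speech_covariance(a, n, g)
    _, logdet = np.linalg.slogdet(R)
    quad = y @ np.linalg.solve(R, y)
    return -0.5 * (n * np.log(2.0 * np.pi) + logdet + quad)
```

Because $A$ has a unit diagonal it is always invertible, so $R_x$ is well defined for any coefficient vector; `np.linalg.solve` is used instead of an explicit inverse of the full covariance for numerical stability.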
where $C$ captures the signal-independent constant terms and $A_i(\omega)$ is the spectrum of the $i$-th vector from the codebook, given by:
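where negative values in the numerator, which may arise from erroneous estimates of the noise PSD, are set to zero. Note that all quantities in this equation are available: the noisy PSD and the noise PSD can be estimated from the microphone signal, and $A_i(\omega)$ is specified by the $i$-th codebook vector.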
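As an illustration of how $A_i(\omega)$ follows from a codebook vector, the sketch below evaluates the LP polynomial on the unit circle with a zero-padded FFT and forms the corresponding model PSD $g / |A_i(\omega)|^2$. The polynomial form $A_i(\omega) = \sum_m a^i_m e^{-j\omega m}$ is the usual convention for LP coefficient vectors and is assumed here. The function names and the `n_fft` parameter are hypothetical.

```python
import numpy as np

def lp_spectrum(a, n_fft=512):
    """A(w) = sum_m a[m] * exp(-j*w*m), sampled at n_fft//2 + 1
    frequencies from 0 to pi (zero-padded real FFT)."""
    return np.fft.rfft(np.asarray(a, dtype=float), n=n_fft)

def model_psd(a, g, n_fft=512):
    """All-pole speech model PSD g / |A(w)|^2 for one codebook vector.

    Assumes A(w) has no zeros on the unit circle, as for a stable
    LP model."""
    return g / np.abs(lp_spectrum(a, n_fft)) ** 2
```

For example, `model_psd([1.0, -0.9], 1.0)` concentrates energy at low frequencies, as expected for a single real pole near $z = 0.9$.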
where $I$ is the number of vectors in the speech codebook. This maximum-likelihood value is then used as the similarity indication for the particular microphone signal.
Claims (15)
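- An apparatus for generating a speech signal, the apparatus comprising:
a microphone receiver for receiving microphone signals from a plurality of microphones;
a comparator for determining, for each microphone signal, a speech similarity indication indicative of a similarity between the microphone signal and non-reverberant speech, the comparator determining the speech similarity indication in response to a comparison of at least one property derived from the microphone signal to at least one reference property for non-reverberant speech; and
a generator for generating the speech signal by combining the microphone signals in response to the speech similarity indications.
- The apparatus of claim 1, comprising a plurality of separate devices, each device comprising a microphone receiver for receiving at least one microphone signal of the plurality of microphone signals.
- The apparatus of claim 2, wherein at least a first device of the plurality of separate devices comprises a local comparator for determining a first speech similarity indication for at least one microphone signal of the first device.
- The apparatus of claim 3, wherein the generator is implemented in a generator device separate from at least the first device, and the first device comprises a transmitter for transmitting the first speech similarity indication to the generator device.
- The apparatus of claim 4, wherein the generator device receives the speech similarity indications from each of the plurality of separate devices, and the generator generates the speech signal using a subset of the microphone signals from the plurality of separate devices, the subset being determined in response to the speech similarity indications received from the plurality of separate devices.
- The apparatus of claim 5, wherein at least one device of the plurality of separate devices transmits its at least one microphone signal to the generator device only if that at least one microphone signal is included in the subset of microphone signals.
- The apparatus of claim 5, wherein the generator device comprises a selector for determining the subset of microphone signals and a transmitter for transmitting an indication of the subset to at least one of the plurality of separate devices.
- The apparatus of claim 1, wherein the comparator determines the speech similarity indication for a first microphone signal in response to a comparison of at least one property derived from the microphone signal to reference properties for speech samples of a set of non-reverberant speech samples.
- The apparatus of claim 8, wherein the speech samples of the set of non-reverberant speech samples are represented by parameters for a non-reverberant speech model.
- The apparatus of claim 9, wherein the comparator determines a first reference property for a first speech sample of the set of non-reverberant speech samples from a speech sample signal generated by evaluating the non-reverberant speech model using the parameters for the first speech sample, and determines the speech similarity indication for a first microphone signal of the plurality of microphone signals in response to a comparison of a property derived from the first microphone signal to the first reference property.
- The apparatus of claim 1, wherein the comparator decomposes a first microphone signal of the plurality of microphone signals into a set of basis signal vectors and determines the speech similarity indication in response to a property of the set of basis signal vectors.
- The apparatus of claim 1, wherein the comparator determines the speech similarity indication for each segment of a plurality of segments of the speech signal, and the generator determines combination parameters for the combining for each segment.
- The apparatus of claim 11, wherein the generator determines combination parameters for one segment in response to the speech similarity indications of at least one previous segment.
- The apparatus of claim 1, wherein the generator selects a subset of the microphone signals to combine in response to the speech similarity indications.
- A method of generating a speech signal, the method comprising:
receiving microphone signals from a plurality of microphones;
determining, for each microphone signal, a speech similarity indication indicative of a similarity between the microphone signal and non-reverberant speech, the speech similarity indication being determined in response to a comparison of at least one property derived from the microphone signal to at least one reference property for non-reverberant speech; and
generating the speech signal by combining the microphone signals in response to the speech similarity indications.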
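One possible reading of the combining step in the claims can be sketched as below. This is an illustrative assumption, not the claimed implementation: the claims only require that the combination respond to the speech similarity indications. Here each segment selects the `top_m` microphones with the highest (optionally smoothed) similarity and mixes them with softmax weights. The function name, the softmax weighting, and the `smooth` recursion over previous segments are all hypothetical choices.

```python
import numpy as np

def combine_microphones(signals, similarity, seg_len, top_m=1, smooth=0.0):
    """Segment-wise combination of K microphone signals.

    signals:    array (K, N) of time-aligned microphone signals
    similarity: array (K, S) of similarity indications per segment
    seg_len:    samples per segment
    top_m:      size of the microphone subset used in each segment
    smooth:     weight in [0, 1) given to the previous segment's scores
    """
    signals = np.asarray(signals, dtype=float)
    similarity = np.asarray(similarity, dtype=float)
    n_samples = signals.shape[1]
    out = np.zeros(n_samples)
    prev = None
    for seg in range(similarity.shape[1]):
        score = similarity[:, seg]
        if prev is not None:
            # Let earlier segments influence the current combination.
            score = smooth * prev + (1.0 - smooth) * score
        prev = score
        chosen = np.argsort(score)[-top_m:]          # subset selection
        w = np.exp(score[chosen] - score[chosen].max())
        w /= w.sum()                                 # weights sum to one
        lo, hi = seg * seg_len, min((seg + 1) * seg_len, n_samples)
        out[lo:hi] = w @ signals[chosen, lo:hi]
    return out
```

With `top_m=1` this reduces to pure selection of the best microphone per segment. In a distributed setup, only the devices whose signals are in the selected subset would need to transmit audio.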
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201361769236P | 2013-02-26 | 2013-02-26 | |
US61/769,236 | 2013-02-26 | ||
PCT/IB2014/059057 WO2014132167A1 (en) | 2013-02-26 | 2014-02-18 | Method and apparatus for generating a speech signal |
Publications (3)
Publication Number | Publication Date |
---|---|
JP2016511594A (ja) | 2016-04-14 |
JP2016511594A5 (ja) | 2017-03-23 |
JP6519877B2 (ja) | 2019-05-29 |
Family
ID=50190513
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2015558579A, Active, JP6519877B2 (ja) | Method and apparatus for generating a speech signal | 2013-02-26 | 2014-02-18 |
Country Status (7)
Country | Link |
---|---|
US (1) | US10032461B2 (ja) |
EP (1) | EP2962300B1 (ja) |
JP (1) | JP6519877B2 (ja) |
CN (1) | CN105308681B (ja) |
BR (1) | BR112015020150B1 (ja) |
RU (1) | RU2648604C2 (ja) |
WO (1) | WO2014132167A1 (ja) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020218094A1 (ja) * | 2019-04-26 | 2020-10-29 | 株式会社ソニー・インタラクティブエンタテインメント | Information processing system, information processing apparatus, control method for information processing apparatus, and program |
US11880633B2 (en) | 2019-04-26 | 2024-01-23 | Sony Interactive Entertainment Inc. | Information processing system, information processing apparatus, control method for information processing apparatus, and program |
Families Citing this family (81)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170287505A1 (en) * | 2014-09-03 | 2017-10-05 | Samsung Electronics Co., Ltd. | Method and apparatus for learning and recognizing audio signal |
US9922643B2 (en) * | 2014-12-23 | 2018-03-20 | Nice Ltd. | User-aided adaptation of a phonetic dictionary |
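KR102387567B1 (ko) * | 2015-01-19 | 2022-04-18 | 삼성전자주식회사 | Speech recognition method and speech recognition apparatus |
JP6631010B2 (ja) * | 2015-02-04 | 2020-01-15 | ヤマハ株式会社 | Microphone selection device, microphone system, and microphone selection method |
CN105185371B (zh) | 2015-06-25 | 2017-07-11 | 京东方科技集团股份有限公司 | Speech synthesis device, speech synthesis method, bone conduction helmet, and hearing aid |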
US9820039B2 (en) | 2016-02-22 | 2017-11-14 | Sonos, Inc. | Default playback devices |
US10097939B2 (en) | 2016-02-22 | 2018-10-09 | Sonos, Inc. | Compensation for speaker nonlinearities |
US10264030B2 (en) | 2016-02-22 | 2019-04-16 | Sonos, Inc. | Networked microphone device control |
US9965247B2 (en) | 2016-02-22 | 2018-05-08 | Sonos, Inc. | Voice controlled media playback system based on user profile |
US9947316B2 (en) | 2016-02-22 | 2018-04-17 | Sonos, Inc. | Voice control of a media playback system |
US9811314B2 (en) | 2016-02-22 | 2017-11-07 | Sonos, Inc. | Metadata exchange involving a networked playback system and a networked microphone system |
US10095470B2 (en) | 2016-02-22 | 2018-10-09 | Sonos, Inc. | Audio response playback |
DK3217399T3 (en) * | 2016-03-11 | 2019-02-25 | Gn Hearing As | Kalman filtering based speech enhancement using a codebook based approach |
US9978390B2 (en) * | 2016-06-09 | 2018-05-22 | Sonos, Inc. | Dynamic player selection for audio signal processing |
US10134399B2 (en) | 2016-07-15 | 2018-11-20 | Sonos, Inc. | Contextualization of voice inputs |
US10152969B2 (en) | 2016-07-15 | 2018-12-11 | Sonos, Inc. | Voice detection by multiple devices |
US9693164B1 (en) | 2016-08-05 | 2017-06-27 | Sonos, Inc. | Determining direction of networked microphone device relative to audio playback device |
US10115400B2 (en) | 2016-08-05 | 2018-10-30 | Sonos, Inc. | Multiple voice services |
GB201615538D0 (en) * | 2016-09-13 | 2016-10-26 | Nokia Technologies Oy | A method , apparatus and computer program for processing audio signals |
US9794720B1 (en) | 2016-09-22 | 2017-10-17 | Sonos, Inc. | Acoustic position measurement |
US9942678B1 (en) | 2016-09-27 | 2018-04-10 | Sonos, Inc. | Audio playback settings for voice interaction |
US9743204B1 (en) | 2016-09-30 | 2017-08-22 | Sonos, Inc. | Multi-orientation playback device microphones |
US10181323B2 (en) | 2016-10-19 | 2019-01-15 | Sonos, Inc. | Arbitration-based voice recognition |
US10621980B2 (en) * | 2017-03-21 | 2020-04-14 | Harman International Industries, Inc. | Execution of voice commands in a multi-device system |
US11183181B2 (en) | 2017-03-27 | 2021-11-23 | Sonos, Inc. | Systems and methods of multiple voice services |
GB2563857A (en) * | 2017-06-27 | 2019-01-02 | Nokia Technologies Oy | Recording and rendering sound spaces |
US10475449B2 (en) | 2017-08-07 | 2019-11-12 | Sonos, Inc. | Wake-word detection suppression |
US10048930B1 (en) | 2017-09-08 | 2018-08-14 | Sonos, Inc. | Dynamic computation of system response volume |
US10446165B2 (en) | 2017-09-27 | 2019-10-15 | Sonos, Inc. | Robust short-time fourier transform acoustic echo cancellation during audio playback |
US10621981B2 (en) | 2017-09-28 | 2020-04-14 | Sonos, Inc. | Tone interference cancellation |
US10482868B2 (en) | 2017-09-28 | 2019-11-19 | Sonos, Inc. | Multi-channel acoustic echo cancellation |
US10051366B1 (en) | 2017-09-28 | 2018-08-14 | Sonos, Inc. | Three-dimensional beam forming with a microphone array |
US10466962B2 (en) | 2017-09-29 | 2019-11-05 | Sonos, Inc. | Media playback system with voice assistance |
AU2018353008B2 (en) | 2017-10-17 | 2023-04-20 | Magic Leap, Inc. | Mixed reality spatial audio |
US10880650B2 (en) | 2017-12-10 | 2020-12-29 | Sonos, Inc. | Network microphone devices with automatic do not disturb actuation capabilities |
US10818290B2 (en) | 2017-12-11 | 2020-10-27 | Sonos, Inc. | Home graph |
CN108174138B (zh) * | 2018-01-02 | 2021-02-19 | 上海闻泰电子科技有限公司 | Video shooting method, voice acquisition device, and video shooting system |
WO2019152722A1 (en) | 2018-01-31 | 2019-08-08 | Sonos, Inc. | Device designation of playback and network microphone device arrangements |
JP2021514081A (ja) | 2018-02-15 | 2021-06-03 | Magic Leap, Inc. | Mixed reality virtual reverberation |
US11175880B2 (en) | 2018-05-10 | 2021-11-16 | Sonos, Inc. | Systems and methods for voice-assisted media content selection |
US10847178B2 (en) | 2018-05-18 | 2020-11-24 | Sonos, Inc. | Linear filtering for noise-suppressed speech detection |
US10959029B2 (en) | 2018-05-25 | 2021-03-23 | Sonos, Inc. | Determining and adapting to changes in microphone performance of playback devices |
US10681460B2 (en) | 2018-06-28 | 2020-06-09 | Sonos, Inc. | Systems and methods for associating playback devices with voice assistant services |
US11076035B2 (en) | 2018-08-28 | 2021-07-27 | Sonos, Inc. | Do not disturb feature for audio notifications |
US10461710B1 (en) | 2018-08-28 | 2019-10-29 | Sonos, Inc. | Media playback system with maximum volume setting |
CN112470496B (zh) * | 2018-09-13 | 2023-09-29 | 科利耳有限公司 | Hearing performance and habilitation and/or rehabilitation enhancement using normal things |
US10878811B2 (en) | 2018-09-14 | 2020-12-29 | Sonos, Inc. | Networked devices, systems, and methods for intelligently deactivating wake-word engines |
US10587430B1 (en) | 2018-09-14 | 2020-03-10 | Sonos, Inc. | Networked devices, systems, and methods for associating playback devices based on sound codes |
US11024331B2 (en) | 2018-09-21 | 2021-06-01 | Sonos, Inc. | Voice detection optimization using sound metadata |
US10811015B2 (en) | 2018-09-25 | 2020-10-20 | Sonos, Inc. | Voice detection optimization based on selected voice assistant service |
US11100923B2 (en) | 2018-09-28 | 2021-08-24 | Sonos, Inc. | Systems and methods for selective wake word detection using neural network models |
US10692518B2 (en) | 2018-09-29 | 2020-06-23 | Sonos, Inc. | Linear filtering for noise-suppressed speech detection via multiple network microphone devices |
US11899519B2 (en) | 2018-10-23 | 2024-02-13 | Sonos, Inc. | Multiple stage network microphone device with reduced power consumption and processing load |
EP3654249A1 (en) | 2018-11-15 | 2020-05-20 | Snips | Dilated convolutions and gating for efficient keyword spotting |
US11183183B2 (en) | 2018-12-07 | 2021-11-23 | Sonos, Inc. | Systems and methods of operating media playback systems having multiple voice assistant services |
US11132989B2 (en) | 2018-12-13 | 2021-09-28 | Sonos, Inc. | Networked microphone devices, systems, and methods of localized arbitration |
US10602268B1 (en) | 2018-12-20 | 2020-03-24 | Sonos, Inc. | Optimization of network microphone devices using noise classification |
US11315556B2 (en) | 2019-02-08 | 2022-04-26 | Sonos, Inc. | Devices, systems, and methods for distributed voice processing by transmitting sound data associated with a wake word to an appropriate device for identification |
US10867604B2 (en) | 2019-02-08 | 2020-12-15 | Sonos, Inc. | Devices, systems, and methods for distributed voice processing |
EP3951777A4 (en) * | 2019-03-27 | 2022-05-18 | Sony Group Corporation | SIGNAL PROCESSING DEVICE, METHOD AND PROGRAM |
US11120794B2 (en) | 2019-05-03 | 2021-09-14 | Sonos, Inc. | Voice assistant persistence across multiple network microphone devices |
US10586540B1 (en) | 2019-06-12 | 2020-03-10 | Sonos, Inc. | Network microphone device with command keyword conditioning |
US11361756B2 (en) | 2019-06-12 | 2022-06-14 | Sonos, Inc. | Conditional wake word eventing based on environment |
US11200894B2 (en) | 2019-06-12 | 2021-12-14 | Sonos, Inc. | Network microphone device with command keyword eventing |
JP7362320B2 (ja) * | 2019-07-04 | 2023-10-17 | フォルシアクラリオン・エレクトロニクス株式会社 | Audio signal processing device, audio signal processing method, and audio signal processing program |
US10871943B1 (en) | 2019-07-31 | 2020-12-22 | Sonos, Inc. | Noise classification for event detection |
US11138975B2 (en) | 2019-07-31 | 2021-10-05 | Sonos, Inc. | Locally distributed keyword detection |
US11138969B2 (en) | 2019-07-31 | 2021-10-05 | Sonos, Inc. | Locally distributed keyword detection |
US11189286B2 (en) | 2019-10-22 | 2021-11-30 | Sonos, Inc. | VAS toggle based on device orientation |
CN114586382A (zh) | 2019-10-25 | 2022-06-03 | 奇跃公司 | Reverberation fingerprint estimation |
US11217235B1 (en) * | 2019-11-18 | 2022-01-04 | Amazon Technologies, Inc. | Autonomously motile device with audio reflection detection |
US11200900B2 (en) | 2019-12-20 | 2021-12-14 | Sonos, Inc. | Offline voice control |
US11562740B2 (en) | 2020-01-07 | 2023-01-24 | Sonos, Inc. | Voice verification for media playback |
US11556307B2 (en) | 2020-01-31 | 2023-01-17 | Sonos, Inc. | Local voice data processing |
US11308958B2 (en) | 2020-02-07 | 2022-04-19 | Sonos, Inc. | Localized wakeword verification |
US11482224B2 (en) | 2020-05-20 | 2022-10-25 | Sonos, Inc. | Command keywords with input detection windowing |
US11727919B2 (en) | 2020-05-20 | 2023-08-15 | Sonos, Inc. | Memory allocation for keyword spotting engines |
US11308962B2 (en) | 2020-05-20 | 2022-04-19 | Sonos, Inc. | Input detection windowing |
US11698771B2 (en) | 2020-08-25 | 2023-07-11 | Sonos, Inc. | Vocal guidance engines for playback devices |
US11984123B2 (en) | 2020-11-12 | 2024-05-14 | Sonos, Inc. | Network device interaction by range |
US11551700B2 (en) | 2021-01-25 | 2023-01-10 | Sonos, Inc. | Systems and methods for power-efficient keyword detection |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
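JP2009528802A (ja) * | 2006-03-03 | 2009-08-06 | ジーエヌ リザウンド エー/エス | Automatic switching between omnidirectional and directional microphone modes in a hearing aid |
JP2011511571A (ja) * | 2008-01-29 | 2011-04-07 | クゥアルコム・インコーポレイテッド | Improving sound quality by intelligently selecting between signals from a plurality of microphones |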
Family Cites Families (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3814856A (en) * | 1973-02-22 | 1974-06-04 | D Dugan | Control apparatus for sound reinforcement systems |
US5561737A (en) * | 1994-05-09 | 1996-10-01 | Lucent Technologies Inc. | Voice actuated switching system |
US5638487A (en) * | 1994-12-30 | 1997-06-10 | Purespeech, Inc. | Automatic speech recognition |
JP3541339B2 (ja) | 1997-06-26 | 2004-07-07 | 富士通株式会社 | Microphone array device |
US6684185B1 (en) * | 1998-09-04 | 2004-01-27 | Matsushita Electric Industrial Co., Ltd. | Small footprint language and vocabulary independent word recognizer using registration by word spelling |
US6243322B1 (en) * | 1999-11-05 | 2001-06-05 | Wavemakers Research, Inc. | Method for estimating the distance of an acoustic signal |
GB0120450D0 (en) * | 2001-08-22 | 2001-10-17 | Mitel Knowledge Corp | Robust talker localization in reverberant environment |
EP1468550B1 (en) | 2002-01-18 | 2012-03-28 | Polycom, Inc. | Digital linking of multiple microphone systems |
ATE324763T1 (de) * | 2003-08-21 | 2006-05-15 | Bernafon Ag | Method for processing audio signals |
CA2537977A1 (en) * | 2003-09-05 | 2005-03-17 | Stephen D. Grody | Methods and apparatus for providing services using speech recognition |
CN1808571A (zh) | 2005-01-19 | 2006-07-26 | 松下电器产业株式会社 | Sound signal separation system and method |
US7260491B2 (en) * | 2005-10-27 | 2007-08-21 | International Business Machines Corporation | Duty cycle measurement apparatus and method |
JP4311402B2 (ja) | 2005-12-21 | 2009-08-12 | ヤマハ株式会社 | Sound reinforcement system |
US8233353B2 (en) | 2007-01-26 | 2012-07-31 | Microsoft Corporation | Multi-sensor sound source localization |
WO2010091077A1 (en) * | 2009-02-03 | 2010-08-12 | University Of Ottawa | Method and system for a multi-microphone noise reduction |
US8867754B2 (en) * | 2009-02-13 | 2014-10-21 | Honda Motor Co., Ltd. | Dereverberation apparatus and dereverberation method |
US8644517B2 (en) * | 2009-08-17 | 2014-02-04 | Broadcom Corporation | System and method for automatic disabling and enabling of an acoustic beamformer |
US8589166B2 (en) * | 2009-10-22 | 2013-11-19 | Broadcom Corporation | Speech content based packet loss concealment |
EP2375779A3 (en) * | 2010-03-31 | 2012-01-18 | Fraunhofer-Gesellschaft zur Förderung der Angewandten Forschung e.V. | Apparatus and method for measuring a plurality of loudspeakers and microphone array |
EP2572499B1 (en) * | 2010-05-18 | 2018-07-11 | Telefonaktiebolaget LM Ericsson (publ) | Encoder adaption in teleconferencing system |
US8908874B2 (en) * | 2010-09-08 | 2014-12-09 | Dts, Inc. | Spatial audio encoding and reproduction |
RU2596584C2 (ru) * | 2010-10-25 | 2016-09-10 | Войсэйдж Корпорейшн | Coding of generic audio signals at low bit rates and with low delay |
EP2458586A1 (en) * | 2010-11-24 | 2012-05-30 | Koninklijke Philips Electronics N.V. | System and method for producing an audio signal |
SE536046C2 (sv) | 2011-01-19 | 2013-04-16 | Limes Audio Ab | Method and device for microphone selection |
US9336780B2 (en) * | 2011-06-20 | 2016-05-10 | Agnitio, S.L. | Identification of a local speaker |
US8340975B1 (en) * | 2011-10-04 | 2012-12-25 | Theodore Alfred Rosenberger | Interactive speech recognition device and system for hands-free building control |
US8731911B2 (en) * | 2011-12-09 | 2014-05-20 | Microsoft Corporation | Harmonicity-based single-channel speech quality estimation |
US9058806B2 (en) * | 2012-09-10 | 2015-06-16 | Cisco Technology, Inc. | Speaker segmentation and recognition based on list of speakers |
US20140170979A1 (en) * | 2012-12-17 | 2014-06-19 | Qualcomm Incorporated | Contextual power saving in bluetooth audio |
-
2014
- 2014-02-18 BR BR112015020150-4A patent/BR112015020150B1/pt active IP Right Grant
- 2014-02-18 US US14/766,567 patent/US10032461B2/en active Active
- 2014-02-18 EP EP14707461.1A patent/EP2962300B1/en active Active
- 2014-02-18 WO PCT/IB2014/059057 patent/WO2014132167A1/en active Application Filing
- 2014-02-18 JP JP2015558579A patent/JP6519877B2/ja active Active
- 2014-02-18 CN CN201480010600.4A patent/CN105308681B/zh active Active
- 2014-02-18 RU RU2015140965A patent/RU2648604C2/ru active
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
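WO2020218094A1 (ja) * | 2019-04-26 | 2020-10-29 | 株式会社ソニー・インタラクティブエンタテインメント | Information processing system, information processing apparatus, control method for information processing apparatus, and program |
JPWO2020218094A1 (ja) * | 2019-04-26 | 2021-11-11 | 株式会社ソニー・インタラクティブエンタテインメント | Information processing system, information processing apparatus, control method for information processing apparatus, and program |
JP7170851B2 (ja) | 2019-04-26 | 2022-11-14 | 株式会社ソニー・インタラクティブエンタテインメント | Information processing system, information processing apparatus, control method for information processing apparatus, and program |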
US11880633B2 (en) | 2019-04-26 | 2024-01-23 | Sony Interactive Entertainment Inc. | Information processing system, information processing apparatus, control method for information processing apparatus, and program |
Also Published As
Publication number | Publication date |
---|---|
BR112015020150B1 (pt) | 2021-08-17 |
CN105308681B (zh) | 2019-02-12 |
WO2014132167A1 (en) | 2014-09-04 |
RU2648604C2 (ru) | 2018-03-26 |
EP2962300A1 (en) | 2016-01-06 |
US10032461B2 (en) | 2018-07-24 |
EP2962300B1 (en) | 2017-01-25 |
US20150380010A1 (en) | 2015-12-31 |
JP6519877B2 (ja) | 2019-05-29 |
BR112015020150A2 (pt) | 2017-07-18 |
CN105308681A (zh) | 2016-02-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6519877B2 (ja) | Method and apparatus for generating a speech signal | |
Parchami et al. | Recent developments in speech enhancement in the short-time Fourier transform domain | |
JP4796309B2 (ja) | Method and apparatus for multi-sensor speech enhancement on a mobile device | |
US10403300B2 (en) | Spectral estimation of room acoustic parameters | |
US20090018826A1 (en) | Methods, Systems and Devices for Speech Transduction | |
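JP2011511571A (ja) | Improving sound quality by intelligently selecting between signals from a plurality of microphones | |
JP6545419B2 (ja) | Acoustic signal processing device, acoustic signal processing method, and hands-free communication device | |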
Potamitis et al. | An integrated system for smart-home control of appliances based on remote speech interaction. | |
WO2009086017A1 (en) | Systems, methods, and apparatus for multi-microphone based speech enhancement | |
JP2014502468A (ja) | Audio signal generation system and method | |
US9378755B2 (en) | Detecting a user's voice activity using dynamic probabilistic models of speech features | |
Habets et al. | Joint dereverberation and residual echo suppression of speech signals in noisy environments | |
JP2015018015A (ja) | Speech processing device, speech processing method, and speech processing program | |
JP2020115206A (ja) | System and method | |
US8423357B2 (en) | System and method for biometric acoustic noise reduction | |
CN108810778B (zh) | 用于运行听力设备的方法和听力设备 | |
JP6265903B2 (ja) | Signal noise attenuation | |
Gamper et al. | Predicting word error rate for reverberant speech | |
Srinivasan | Using a remote wireless microphone for speech enhancement in non-stationary noise | |
Fukui et al. | Acoustic echo and noise canceller for personal hands-free video IP phone | |
Lee et al. | Channel prediction-based noise reduction algorithm for dual-microphone mobile phones | |
GB2580655A (en) | Reducing a noise level of an audio signal of a hearing system | |
Potamitis et al. | Speech activity detection and enhancement of a moving speaker based on the wideband generalized likelihood ratio and microphone arrays | |
Aalburg et al. | Single-and Two-Channel Noise Reduction for Robust Speech Recognition | |
Pacheco et al. | Spectral subtraction for reverberation reduction applied to automatic speech recognition |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
2017-02-16 | A521 | Request for written amendment filed | JAPANESE INTERMEDIATE CODE: A523 |
2017-02-16 | A621 | Written request for application examination | JAPANESE INTERMEDIATE CODE: A621 |
2018-05-07 | A131 | Notification of reasons for refusal | JAPANESE INTERMEDIATE CODE: A131 |
2018-08-01 | A521 | Request for written amendment filed | JAPANESE INTERMEDIATE CODE: A523 |
2018-10-16 | A02 | Decision of refusal | JAPANESE INTERMEDIATE CODE: A02 |
2019-01-29 | A521 | Request for written amendment filed | JAPANESE INTERMEDIATE CODE: A523 |
2019-02-06 | A911 | Transfer to examiner for re-examination before appeal (zenchi) | JAPANESE INTERMEDIATE CODE: A911 |
 | TRDD | Decision of grant or rejection written | |
2019-03-19 | A01 | Written decision to grant a patent or to grant a registration (utility model) | JAPANESE INTERMEDIATE CODE: A01 |
2019-03-29 | A711 | Notification of change in applicant | JAPANESE INTERMEDIATE CODE: A711 |
2019-03-29 | RD02 | Notification of acceptance of power of attorney | JAPANESE INTERMEDIATE CODE: A7422 |
2019-04-11 | A61 | First payment of annual fees (during grant procedure) | JAPANESE INTERMEDIATE CODE: A61 |
 | R150 | Certificate of patent or registration of utility model | Ref document number: 6519877; Country of ref document: JP; JAPANESE INTERMEDIATE CODE: R150 |
 | R250 | Receipt of annual fees | JAPANESE INTERMEDIATE CODE: R250 |
 | R250 | Receipt of annual fees | JAPANESE INTERMEDIATE CODE: R250 |
 | R250 | Receipt of annual fees | JAPANESE INTERMEDIATE CODE: R250 |