JP7564117B2 - キューのクラスター化を使用した音声強化 - Google Patents
キューのクラスター化を使用した音声強化 Download PDFInfo
- Publication number
- JP7564117B2 JP7564117B2 JP2021553756A JP2021553756A JP7564117B2 JP 7564117 B2 JP7564117 B2 JP 7564117B2 JP 2021553756 A JP2021553756 A JP 2021553756A JP 2021553756 A JP2021553756 A JP 2021553756A JP 7564117 B2 JP7564117 B2 JP 7564117B2
- Authority
- JP
- Japan
- Prior art keywords
- frequency
- sound
- acoustic
- samples
- pitch
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0264—Noise filtering characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- G10L21/028—Voice signal separating using properties of sound source
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- G10L21/0308—Voice signal separating characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02087—Noise filtering the noise being separate speech, e.g. cocktail party
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
- G10L2025/906—Pitch tracking
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Quality & Reliability (AREA)
- Circuit For Audible Band Transducer (AREA)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2024167615A JP2025000790A (ja) | 2019-03-10 | 2024-09-26 | キューのクラスター化を使用した音声強化 |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/IB2019/051933 WO2020183219A1 (en) | 2019-03-10 | 2019-03-10 | Speech enhancement using clustering of cues |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2024167615A Division JP2025000790A (ja) | 2019-03-10 | 2024-09-26 | キューのクラスター化を使用した音声強化 |
Publications (4)
| Publication Number | Publication Date |
|---|---|
| JP2022533300A JP2022533300A (ja) | 2022-07-22 |
| JPWO2020183219A5 JPWO2020183219A5 (https=) | 2024-05-17 |
| JP2022533300A5 JP2022533300A5 (https=) | 2024-05-17 |
| JP7564117B2 true JP7564117B2 (ja) | 2024-10-08 |
Family
ID=72427785
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2021553756A Active JP7564117B2 (ja) | 2019-03-10 | 2019-03-10 | キューのクラスター化を使用した音声強化 |
| JP2024167615A Pending JP2025000790A (ja) | 2019-03-10 | 2024-09-26 | キューのクラスター化を使用した音声強化 |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2024167615A Pending JP2025000790A (ja) | 2019-03-10 | 2024-09-26 | キューのクラスター化を使用した音声強化 |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US12148441B2 (https=) |
| EP (1) | EP3939035A4 (https=) |
| JP (2) | JP7564117B2 (https=) |
| KR (2) | KR102789155B1 (https=) |
| CN (2) | CN120089153A (https=) |
| WO (1) | WO2020183219A1 (https=) |
Families Citing this family (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP7564117B2 (ja) | 2019-03-10 | 2024-10-08 | カードーム テクノロジー リミテッド | キューのクラスター化を使用した音声強化 |
| JP7298702B2 (ja) * | 2019-09-27 | 2023-06-27 | ヤマハ株式会社 | 音響信号解析方法、音響信号解析システムおよびプログラム |
| CN110600051B (zh) * | 2019-11-12 | 2020-03-31 | 乐鑫信息科技(上海)股份有限公司 | 用于选择麦克风阵列的输出波束的方法 |
| CN113473373B (zh) * | 2021-06-08 | 2022-11-01 | 华侨大学 | 一种uwb室内定位方法 |
| US12380910B2 (en) * | 2021-06-30 | 2025-08-05 | Ringcentral, Inc. | Systems and methods for virtual meeting speaker separation |
| CN115910047B (zh) * | 2023-01-06 | 2023-05-19 | 阿里巴巴达摩院(杭州)科技有限公司 | 数据处理方法、模型训练方法、关键词检测方法及设备 |
| US12347449B2 (en) * | 2023-01-26 | 2025-07-01 | Synaptics Incorporated | Spatio-temporal beamformer |
| CN117668499B (zh) * | 2024-01-31 | 2024-05-14 | 平潭综合实验区智慧岛投资发展有限公司 | 一种基于机器学习的海洋公益诉讼线索研判方法、系统、设备及介质 |
Citations (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2006059806A1 (ja) | 2004-12-03 | 2006-06-08 | Honda Motor Co., Ltd. | 音声認識装置 |
| JP2008064892A (ja) | 2006-09-05 | 2008-03-21 | National Institute Of Advanced Industrial & Technology | 音声認識方法およびそれを用いた音声認識装置 |
| JP2008203474A (ja) | 2007-02-20 | 2008-09-04 | Nippon Telegr & Teleph Corp <Ntt> | 多信号強調装置、方法、プログラム及びその記録媒体 |
| US20110182436A1 (en) | 2010-01-26 | 2011-07-28 | Carlo Murgia | Adaptive Noise Reduction Using Level Cues |
| US20130024194A1 (en) | 2010-11-25 | 2013-01-24 | Goertek Inc. | Speech enhancing method and device, and nenoising communication headphone enhancing method and device, and denoising communication headphones |
| JP2013201525A (ja) | 2012-03-23 | 2013-10-03 | Mitsubishi Electric Corp | ビームフォーミング処理装置 |
| WO2015157458A1 (en) | 2014-04-09 | 2015-10-15 | Kaonyx Labs, LLC | Methods and systems for improved measurement, entity and parameter estimation, and path propagation effect measurement and mitigation in source signal separation |
| US20150304766A1 (en) | 2012-11-30 | 2015-10-22 | Aalto-Kaorkeakoullusaatio | Method for spatial filtering of at least one sound signal, computer readable storage medium and spatial filtering system based on cross-pattern coherence |
| US20170208415A1 (en) | 2014-07-23 | 2017-07-20 | Pcms Holdings, Inc. | System and method for determining audio context in augmented-reality applications |
| WO2018022222A1 (en) | 2016-07-29 | 2018-02-01 | Qualcomm Incorporated | Far-field audio processing |
Family Cites Families (87)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| FI97758C (fi) * | 1992-11-20 | 1997-02-10 | Nokia Deutschland Gmbh | Järjestelmä audiosignaalin käsittelemiseksi |
| US5647834A (en) | 1995-06-30 | 1997-07-15 | Ron; Samuel | Speech-based biofeedback method and system |
| US5774837A (en) | 1995-09-13 | 1998-06-30 | Voxware, Inc. | Speech coding system and method using voicing probability determination |
| US6593956B1 (en) | 1998-05-15 | 2003-07-15 | Polycom, Inc. | Locating an audio source |
| US7222070B1 (en) | 1999-09-22 | 2007-05-22 | Texas Instruments Incorporated | Hybrid speech coding and system |
| US7076433B2 (en) | 2001-01-24 | 2006-07-11 | Honda Giken Kogyo Kabushiki Kaisha | Apparatus and program for separating a desired sound from a mixed input sound |
| US7130446B2 (en) | 2001-12-03 | 2006-10-31 | Microsoft Corporation | Automatic detection and tracking of multiple individuals using multiple cues |
| US7197456B2 (en) * | 2002-04-30 | 2007-03-27 | Nokia Corporation | On-line parametric histogram normalization for noise robust speech recognition |
| US7574352B2 (en) | 2002-09-06 | 2009-08-11 | Massachusetts Institute Of Technology | 2-D processing of speech |
| US8271279B2 (en) * | 2003-02-21 | 2012-09-18 | Qnx Software Systems Limited | Signature noise removal |
| US7394907B2 (en) | 2003-06-16 | 2008-07-01 | Microsoft Corporation | System and process for sound source localization using microphone array beamsteering |
| US7565282B2 (en) * | 2005-04-14 | 2009-07-21 | Dictaphone Corporation | System and method for adaptive automatic error correction |
| CA2621940C (en) * | 2005-09-09 | 2014-07-29 | Mcmaster University | Method and device for binaural signal enhancement |
| US8949120B1 (en) * | 2006-05-25 | 2015-02-03 | Audience, Inc. | Adaptive noise cancelation |
| JP4897519B2 (ja) | 2007-03-05 | 2012-03-14 | 株式会社神戸製鋼所 | 音源分離装置,音源分離プログラム及び音源分離方法 |
| US8239052B2 (en) | 2007-04-13 | 2012-08-07 | National Institute Of Advanced Industrial Science And Technology | Sound source separation system, sound source separation method, and computer program for sound source separation |
| WO2008144784A1 (en) | 2007-06-01 | 2008-12-04 | Technische Universität Graz | Joint position-pitch estimation of acoustic sources for their tracking and separation |
| GB0720473D0 (en) | 2007-10-19 | 2007-11-28 | Univ Surrey | Accoustic source separation |
| US8213598B2 (en) * | 2008-02-26 | 2012-07-03 | Microsoft Corporation | Harmonic distortion residual echo suppression |
| US8290141B2 (en) * | 2008-04-18 | 2012-10-16 | Freescale Semiconductor, Inc. | Techniques for comfort noise generation in a communication system |
| ES2988414T3 (es) * | 2008-07-11 | 2024-11-20 | Fraunhofer Ges Zur Foerderungder Angewandten Forschung E V | Decodificador de audio |
| US8914282B2 (en) * | 2008-09-30 | 2014-12-16 | Alon Konchitsky | Wind noise reduction |
| US20100145205A1 (en) | 2008-12-05 | 2010-06-10 | Cambridge Heart, Inc. | Analyzing alternans from measurements of an ambulatory electrocardiography device |
| US8750491B2 (en) * | 2009-03-24 | 2014-06-10 | Microsoft Corporation | Mitigation of echo in voice communication using echo detection and adaptive non-linear processor |
| US8923844B2 (en) | 2009-08-14 | 2014-12-30 | Futurewei Technologies, Inc. | Coordinated beam forming and multi-user MIMO |
| WO2011029048A2 (en) * | 2009-09-04 | 2011-03-10 | Massachusetts Institute Of Technology | Method and apparatus for audio source separation |
| JP2011107603A (ja) * | 2009-11-20 | 2011-06-02 | Sony Corp | 音声認識装置、および音声認識方法、並びにプログラム |
| US8798992B2 (en) * | 2010-05-19 | 2014-08-05 | Disney Enterprises, Inc. | Audio noise modification for event broadcasting |
| US8583428B2 (en) | 2010-06-15 | 2013-11-12 | Microsoft Corporation | Sound source separation using spatial filtering and regularization phases |
| WO2012036305A1 (ja) | 2010-09-17 | 2012-03-22 | 日本電気株式会社 | 音声認識装置、音声認識方法、及びプログラム |
| SG192718A1 (en) * | 2011-02-14 | 2013-09-30 | Fraunhofer Ges Forschung | Audio codec using noise synthesis during inactive phases |
| JP5613781B2 (ja) | 2011-02-16 | 2014-10-29 | 日本電信電話株式会社 | 符号化方法、復号方法、符号化装置、復号装置、プログラム及び記録媒体 |
| US9088328B2 (en) * | 2011-05-16 | 2015-07-21 | Intel Mobile Communications GmbH | Receiver of a mobile communication device |
| EP2737480A4 (en) | 2011-07-25 | 2015-03-18 | Incorporated Thotra | SYSTEM AND METHOD FOR ACOUSTIC TRANSFORMATION |
| EP2551846B1 (en) * | 2011-07-26 | 2022-01-19 | AKG Acoustics GmbH | Noise reducing sound reproduction |
| GB2495278A (en) * | 2011-09-30 | 2013-04-10 | Skype | Processing received signals from a range of receiving angles to reduce interference |
| KR101449551B1 (ko) | 2011-10-19 | 2014-10-14 | 한국전자통신연구원 | 유사문장 검색 장치 및 방법, 유사문장 검색 방법을 실행시키기 위한 프로그램이 기록된 기록매체 |
| US9197974B1 (en) * | 2012-01-06 | 2015-11-24 | Audience, Inc. | Directional audio capture adaptation based on alternative sensory input |
| US8880395B2 (en) | 2012-05-04 | 2014-11-04 | Sony Computer Entertainment Inc. | Source separation by independent component analysis in conjunction with source direction information |
| DK2890159T3 (da) | 2012-05-09 | 2017-01-02 | Oticon As | Anordning til behandling af audiosignaler |
| US9560446B1 (en) | 2012-06-27 | 2017-01-31 | Amazon Technologies, Inc. | Sound source locator with distributed microphone array |
| WO2014021318A1 (ja) * | 2012-08-01 | 2014-02-06 | 独立行政法人産業技術総合研究所 | 音声分析合成のためのスペクトル包絡及び群遅延の推定システム及び音声信号の合成システム |
| US9554203B1 (en) | 2012-09-26 | 2017-01-24 | Foundation for Research and Technolgy—Hellas (FORTH) Institute of Computer Science (ICS) | Sound source characterization apparatuses, methods and systems |
| EP2923502A4 (en) | 2012-11-20 | 2016-06-15 | Nokia Technologies Oy | DEVICE FOR ROOM ENHANCEMENT |
| US20140214676A1 (en) * | 2013-01-29 | 2014-07-31 | Dror Bukai | Automatic Learning Fraud Prevention (LFP) System |
| US9460732B2 (en) | 2013-02-13 | 2016-10-04 | Analog Devices, Inc. | Signal source separation |
| US9202463B2 (en) * | 2013-04-01 | 2015-12-01 | Zanavox | Voice-activated precision timing |
| US9640179B1 (en) * | 2013-06-27 | 2017-05-02 | Amazon Technologies, Inc. | Tailoring beamforming techniques to environments |
| US9959886B2 (en) * | 2013-12-06 | 2018-05-01 | Malaspina Labs (Barbados), Inc. | Spectral comb voice activity detection |
| US9324320B1 (en) * | 2014-10-02 | 2016-04-26 | Microsoft Technology Licensing, Llc | Neural network-based speech processing |
| US9583088B1 (en) | 2014-11-25 | 2017-02-28 | Audio Sprockets LLC | Frequency domain training to compensate acoustic instrument pickup signals |
| US10134425B1 (en) * | 2015-06-29 | 2018-11-20 | Amazon Technologies, Inc. | Direction-based speech endpointing |
| WO2017084704A1 (en) * | 2015-11-18 | 2017-05-26 | Huawei Technologies Co., Ltd. | A sound signal processing apparatus and method for enhancing a sound signal |
| US9659555B1 (en) * | 2016-02-09 | 2017-05-23 | Amazon Technologies, Inc. | Multichannel acoustic echo cancellation |
| US9653060B1 (en) * | 2016-02-09 | 2017-05-16 | Amazon Technologies, Inc. | Hybrid reference signal for acoustic echo cancellation |
| US9792897B1 (en) * | 2016-04-13 | 2017-10-17 | Malaspina Labs (Barbados), Inc. | Phoneme-expert assisted speech recognition and re-synthesis |
| US9818425B1 (en) * | 2016-06-17 | 2017-11-14 | Amazon Technologies, Inc. | Parallel output paths for acoustic echo cancellation |
| US10043521B2 (en) | 2016-07-01 | 2018-08-07 | Intel IP Corporation | User defined key phrase detection by user dependent sequence modeling |
| JP6517760B2 (ja) * | 2016-08-18 | 2019-05-22 | 日本電信電話株式会社 | マスク推定用パラメータ推定装置、マスク推定用パラメータ推定方法およびマスク推定用パラメータ推定プログラム |
| US10056091B2 (en) * | 2017-01-06 | 2018-08-21 | Bose Corporation | Microphone array beamforming |
| JP6711765B2 (ja) * | 2017-02-06 | 2020-06-17 | 日本電信電話株式会社 | 形成装置、形成方法および形成プログラム |
| US10360892B2 (en) * | 2017-06-07 | 2019-07-23 | Bose Corporation | Spectral optimization of audio masking waveforms |
| JP2019020640A (ja) * | 2017-07-20 | 2019-02-07 | パイオニア株式会社 | 指向性制御装置、指向性制御方法、及び指向性制御プログラム |
| US10446165B2 (en) * | 2017-09-27 | 2019-10-15 | Sonos, Inc. | Robust short-time fourier transform acoustic echo cancellation during audio playback |
| EP3467819B1 (en) * | 2017-10-05 | 2024-06-12 | Harman Becker Automotive Systems GmbH | Apparatus and method using multiple voice command devices |
| US10192567B1 (en) * | 2017-10-18 | 2019-01-29 | Motorola Mobility Llc | Echo cancellation and suppression in electronic device |
| US10535361B2 (en) | 2017-10-19 | 2020-01-14 | Kardome Technology Ltd. | Speech enhancement using clustering of cues |
| CN107888792B (zh) * | 2017-10-19 | 2019-09-17 | 浙江大华技术股份有限公司 | 一种回声消除方法、装置及系统 |
| CN107731223B (zh) * | 2017-11-22 | 2022-07-26 | 腾讯科技(深圳)有限公司 | 语音活性检测方法、相关装置和设备 |
| EP3514792B1 (en) * | 2018-01-17 | 2023-10-18 | Oticon A/s | A method of optimizing a speech enhancement algorithm with a speech intelligibility prediction algorithm |
| US10885907B2 (en) * | 2018-02-14 | 2021-01-05 | Cirrus Logic, Inc. | Noise reduction system and method for audio device with multiple microphones |
| US10957337B2 (en) * | 2018-04-11 | 2021-03-23 | Microsoft Technology Licensing, Llc | Multi-microphone speech separation |
| US10811000B2 (en) * | 2018-04-13 | 2020-10-20 | Mitsubishi Electric Research Laboratories, Inc. | Methods and systems for recognizing simultaneous speech by multiple speakers |
| JP7564117B2 (ja) | 2019-03-10 | 2024-10-08 | カードーム テクノロジー リミテッド | キューのクラスター化を使用した音声強化 |
| EP3726529A1 (en) * | 2019-04-16 | 2020-10-21 | Fraunhofer Gesellschaft zur Förderung der Angewand | Method and apparatus for determining a deep filter |
| CN110120217B (zh) * | 2019-05-10 | 2023-11-24 | 腾讯科技(深圳)有限公司 | 一种音频数据处理方法及装置 |
| EP3980994B1 (en) * | 2019-06-05 | 2025-11-19 | Harman International Industries, Incorporated | Sound modification based on frequency composition |
| CN120932662A (zh) * | 2019-08-01 | 2025-11-11 | 杜比实验室特许公司 | 用于增强劣化音频信号的系统和方法 |
| US11227586B2 (en) * | 2019-09-11 | 2022-01-18 | Massachusetts Institute Of Technology | Systems and methods for improving model-based speech enhancement with neural networks |
| US11551670B1 (en) * | 2019-09-26 | 2023-01-10 | Sonos, Inc. | Systems and methods for generating labeled data to facilitate configuration of network microphone devices |
| US20230058427A1 (en) * | 2020-02-03 | 2023-02-23 | Huawei Technologies Co., Ltd. | Wireless headset with hearable functions |
| CN111341341B (zh) * | 2020-02-11 | 2021-08-17 | 腾讯科技(深圳)有限公司 | 音频分离网络的训练方法、音频分离方法、装置及介质 |
| US11443760B2 (en) * | 2020-05-08 | 2022-09-13 | DTEN, Inc. | Active sound control |
| CN114073106B (zh) * | 2020-06-04 | 2023-08-04 | 西北工业大学 | 双耳波束形成麦克风阵列 |
| CN112116920B (zh) * | 2020-08-10 | 2022-08-05 | 北京大学 | 一种说话人数未知的多通道语音分离方法 |
| US11617044B2 (en) * | 2021-03-04 | 2023-03-28 | Iyo Inc. | Ear-mount able listening device with voice direction discovery for rotational correction of microphone array outputs |
| GB2607434B (en) * | 2022-04-12 | 2023-06-28 | Biopixs Ltd | Methods for implementing standardised time domain diffuse optical spectroscopy in wearables/portables |
-
2019
- 2019-03-10 JP JP2021553756A patent/JP7564117B2/ja active Active
- 2019-03-10 KR KR1020217032319A patent/KR102789155B1/ko active Active
- 2019-03-10 EP EP19918690.9A patent/EP3939035A4/en active Pending
- 2019-03-10 KR KR1020257009801A patent/KR20250044808A/ko active Pending
- 2019-03-10 WO PCT/IB2019/051933 patent/WO2020183219A1/en not_active Ceased
- 2019-03-10 US US17/437,748 patent/US12148441B2/en active Active
- 2019-03-10 CN CN202510264269.9A patent/CN120089153A/zh active Pending
- 2019-03-10 CN CN201980096208.9A patent/CN113795881B/zh active Active
-
2024
- 2024-09-26 JP JP2024167615A patent/JP2025000790A/ja active Pending
Patent Citations (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2006059806A1 (ja) | 2004-12-03 | 2006-06-08 | Honda Motor Co., Ltd. | 音声認識装置 |
| JP2008064892A (ja) | 2006-09-05 | 2008-03-21 | National Institute Of Advanced Industrial & Technology | 音声認識方法およびそれを用いた音声認識装置 |
| JP2008203474A (ja) | 2007-02-20 | 2008-09-04 | Nippon Telegr & Teleph Corp <Ntt> | 多信号強調装置、方法、プログラム及びその記録媒体 |
| US20110182436A1 (en) | 2010-01-26 | 2011-07-28 | Carlo Murgia | Adaptive Noise Reduction Using Level Cues |
| US20130024194A1 (en) | 2010-11-25 | 2013-01-24 | Goertek Inc. | Speech enhancing method and device, and nenoising communication headphone enhancing method and device, and denoising communication headphones |
| JP2013201525A (ja) | 2012-03-23 | 2013-10-03 | Mitsubishi Electric Corp | ビームフォーミング処理装置 |
| US20150304766A1 (en) | 2012-11-30 | 2015-10-22 | Aalto-Kaorkeakoullusaatio | Method for spatial filtering of at least one sound signal, computer readable storage medium and spatial filtering system based on cross-pattern coherence |
| WO2015157458A1 (en) | 2014-04-09 | 2015-10-15 | Kaonyx Labs, LLC | Methods and systems for improved measurement, entity and parameter estimation, and path propagation effect measurement and mitigation in source signal separation |
| US20170208415A1 (en) | 2014-07-23 | 2017-07-20 | Pcms Holdings, Inc. | System and method for determining audio context in augmented-reality applications |
| WO2018022222A1 (en) | 2016-07-29 | 2018-02-01 | Qualcomm Incorporated | Far-field audio processing |
Non-Patent Citations (1)
| Title |
|---|
| Sharon GANNOT et al.,Signal Enhancement Using Beamforming and Nonstationarity with Applications to Speech,IEEE Transactions on Signal Processing, [online],2001年08月,Volume 49, Issue 8,pp. 1614-1626,[2023年11月1日検索], <URL: https://ieeexplore.ieee.org/document/934132> |
Also Published As
| Publication number | Publication date |
|---|---|
| US20220148611A1 (en) | 2022-05-12 |
| CN113795881B (zh) | 2025-03-14 |
| CN113795881A (zh) | 2021-12-14 |
| KR20250044808A (ko) | 2025-04-01 |
| WO2020183219A1 (en) | 2020-09-17 |
| EP3939035A4 (en) | 2022-11-02 |
| CN120089153A (zh) | 2025-06-03 |
| EP3939035A1 (en) | 2022-01-19 |
| JP2022533300A (ja) | 2022-07-22 |
| KR102789155B1 (ko) | 2025-04-01 |
| US12148441B2 (en) | 2024-11-19 |
| KR20210137146A (ko) | 2021-11-17 |
| JP2025000790A (ja) | 2025-01-07 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP7564117B2 (ja) | キューのクラスター化を使用した音声強化 | |
| US11694710B2 (en) | Multi-stream target-speech detection and channel fusion | |
| US10535361B2 (en) | Speech enhancement using clustering of cues | |
| Chazan et al. | Multi-microphone speaker separation based on deep DOA estimation | |
| Erdogan et al. | Improved MVDR beamforming using single-channel mask prediction networks. | |
| CN109597022B (zh) | 声源方位角运算、定位目标音频的方法、装置和设备 | |
| Liu et al. | Neural network based time-frequency masking and steering vector estimation for two-channel MVDR beamforming | |
| CN110178178A (zh) | 具有环境自动语音识别(asr)的麦克风选择和多个讲话者分割 | |
| JP2018169473A (ja) | 音声処理装置、音声処理方法及びプログラム | |
| Martinez et al. | DNN-based performance measures for predicting error rates in automatic speech recognition and optimizing hearing aid parameters | |
| Chakraborty et al. | Sound-model-based acoustic source localization using distributed microphone arrays | |
| Martín-Doñas et al. | Dual-channel DNN-based speech enhancement for smartphones | |
| CN114616483A (zh) | 声源定位设备、声源定位方法和程序 | |
| EP2745293B1 (en) | Signal noise attenuation | |
| JPWO2020183219A5 (https=) | ||
| WO2020064089A1 (en) | Determining a room response of a desired source in a reverberant environment | |
| CN121054020A (zh) | 一种音频数据处理方法、装置及电子设备 | |
| Giannoulis et al. | Room-localized speech activity detection in multi-microphone smart homes | |
| Venkatesan et al. | Deep recurrent neural networks based binaural speech segregation for the selection of closest target of interest | |
| JP2003076393A (ja) | 騒音環境下における音声推定方法および音声認識方法 | |
| Venkatesan et al. | Analysis of monaural and binaural statistical properties for the estimation of distance of a target speaker | |
| Malek et al. | Speaker extraction using LCMV beamformer with DNN-based SPP and RTF identification scheme | |
| CN115497495A (zh) | 基于神经网络的音频处理中的空间相关特征提取 | |
| Chiu et al. | A micro-control device of soundscape collection for mixed frog call recognition | |
| JP2010072164A (ja) | 目的信号区間推定装置、目的信号区間推定方法、目的信号区間推定プログラム及び記録媒体 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20220309 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20220309 |
|
| A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20230207 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20230214 |
|
| A601 | Written request for extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A601 Effective date: 20230512 |
|
| A601 | Written request for extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A601 Effective date: 20230714 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20230814 |
|
| A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20231107 |
|
| A601 | Written request for extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A601 Effective date: 20240205 |
|
| A601 | Written request for extension of time |
Free format text: JAPANESE INTERMEDIATE CODE: A601 Effective date: 20240405 |
|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20240507 |
|
| A524 | Written submission of copy of amendment under article 19 pct |
Free format text: JAPANESE INTERMEDIATE CODE: A524 Effective date: 20240507 |
|
| TRDD | Decision of grant or rejection written | ||
| A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20240828 |
|
| A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20240926 |
|
| R150 | Certificate of patent or registration of utility model |
Ref document number: 7564117 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |