CN112530452A - 一种后置滤波补偿方法、装置和系统 - Google Patents
一种后置滤波补偿方法、装置和系统 Download PDFInfo
- Publication number
- CN112530452A CN112530452A CN202011320330.0A CN202011320330A CN112530452A CN 112530452 A CN112530452 A CN 112530452A CN 202011320330 A CN202011320330 A CN 202011320330A CN 112530452 A CN112530452 A CN 112530452A
- Authority
- CN
- China
- Prior art keywords
- audio
- user
- leaked
- post
- leakage
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 42
- 238000001914 filtration Methods 0.000 title claims abstract description 22
- 230000005236 sound signal Effects 0.000 claims abstract description 49
- 238000000926 separation method Methods 0.000 claims abstract description 44
- 230000008030 elimination Effects 0.000 claims abstract description 17
- 238000003379 elimination reaction Methods 0.000 claims abstract description 17
- 238000012545 processing Methods 0.000 claims abstract description 16
- 238000004422 calculation algorithm Methods 0.000 claims description 21
- 230000008569 process Effects 0.000 claims description 8
- 238000004590 computer program Methods 0.000 description 13
- 230000000694 effects Effects 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 238000004891 communication Methods 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 230000006870 function Effects 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 230000004075 alteration Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 239000013307 optical fiber Substances 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- G10L21/028—Voice signal separating using properties of sound source
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L2021/02082—Noise filtering the noise being echo, reverberation of the speech
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011320330.0A CN112530452B (zh) | 2020-11-23 | 一种后置滤波补偿方法、装置和系统 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011320330.0A CN112530452B (zh) | 2020-11-23 | 一种后置滤波补偿方法、装置和系统 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112530452A true CN112530452A (zh) | 2021-03-19 |
CN112530452B CN112530452B (zh) | 2024-06-28 |
Family
ID=
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113470689A (zh) * | 2021-08-23 | 2021-10-01 | 杭州国芯科技股份有限公司 | 一种语音分离方法 |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102164328A (zh) * | 2010-12-29 | 2011-08-24 | 中国科学院声学研究所 | 一种用于家庭环境的基于传声器阵列的音频输入系统 |
US20130266148A1 (en) * | 2011-05-13 | 2013-10-10 | Peter Isberg | Electronic Devices for Reducing Acoustic Leakage Effects and Related Methods and Computer Program Products |
CN105280183A (zh) * | 2015-09-10 | 2016-01-27 | 百度在线网络技术(北京)有限公司 | 语音交互方法和系统 |
CN105827862A (zh) * | 2016-05-12 | 2016-08-03 | Tcl移动通信科技(宁波)有限公司 | 一种自动调节听筒声音的方法及移动终端 |
US20200058293A1 (en) * | 2017-10-23 | 2020-02-20 | Tencent Technology (Shenzhen) Company Limited | Object recognition method, computer device, and computer-readable storage medium |
CN110970049A (zh) * | 2019-12-06 | 2020-04-07 | 广州国音智能科技有限公司 | 多人声识别方法、装置、设备及可读存储介质 |
CN111883135A (zh) * | 2020-07-28 | 2020-11-03 | 北京声智科技有限公司 | 语音转写方法、装置和电子设备 |
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102164328A (zh) * | 2010-12-29 | 2011-08-24 | 中国科学院声学研究所 | 一种用于家庭环境的基于传声器阵列的音频输入系统 |
US20130266148A1 (en) * | 2011-05-13 | 2013-10-10 | Peter Isberg | Electronic Devices for Reducing Acoustic Leakage Effects and Related Methods and Computer Program Products |
CN105280183A (zh) * | 2015-09-10 | 2016-01-27 | 百度在线网络技术(北京)有限公司 | 语音交互方法和系统 |
CN105827862A (zh) * | 2016-05-12 | 2016-08-03 | Tcl移动通信科技(宁波)有限公司 | 一种自动调节听筒声音的方法及移动终端 |
US20200058293A1 (en) * | 2017-10-23 | 2020-02-20 | Tencent Technology (Shenzhen) Company Limited | Object recognition method, computer device, and computer-readable storage medium |
CN110970049A (zh) * | 2019-12-06 | 2020-04-07 | 广州国音智能科技有限公司 | 多人声识别方法、装置、设备及可读存储介质 |
CN111883135A (zh) * | 2020-07-28 | 2020-11-03 | 北京声智科技有限公司 | 语音转写方法、装置和电子设备 |
Non-Patent Citations (1)
Title |
---|
刘红梅;: "基于FastICA盲源分离算法的语音增强系统", 计算机与数字工程, vol. 45, no. 03, pages 4 - 5 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113470689A (zh) * | 2021-08-23 | 2021-10-01 | 杭州国芯科技股份有限公司 | 一种语音分离方法 |
CN113470689B (zh) * | 2021-08-23 | 2024-01-30 | 杭州国芯科技股份有限公司 | 一种语音分离方法 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102339594B1 (ko) | 객체 인식 방법, 컴퓨터 디바이스 및 컴퓨터 판독 가능 저장 매체 | |
Wang et al. | Multi-microphone complex spectral mapping for utterance-wise and continuous speech separation | |
Xiao et al. | Microsoft speaker diarization system for the voxceleb speaker recognition challenge 2020 | |
Taherian et al. | Robust speaker recognition based on single-channel and multi-channel speech enhancement | |
CN107910011B (zh) | 一种语音降噪方法、装置、服务器及存储介质 | |
US9626970B2 (en) | Speaker identification using spatial information | |
CN109410956B (zh) | 一种音频数据的对象识别方法、装置、设备及存储介质 | |
KR20140135349A (ko) | 복수의 마이크로폰을 이용한 비동기 음성인식 장치 및 방법 | |
JP2014145838A (ja) | 音響処理装置及び音響処理方法 | |
Kinoshita et al. | Tackling real noisy reverberant meetings with all-neural source separation, counting, and diarization system | |
Yu et al. | Audio-visual multi-channel integration and recognition of overlapped speech | |
Mun et al. | The sound of my voice: Speaker representation loss for target voice separation | |
Yamamoto et al. | Making a robot recognize three simultaneous sentences in real-time | |
JP5180928B2 (ja) | 音声認識装置及び音声認識装置のマスク生成方法 | |
Kamo et al. | Target speech extraction with conditional diffusion model | |
KR101122591B1 (ko) | 핵심어 인식에 의한 음성 인식 장치 및 방법 | |
Chen et al. | End-to-end multi-modal speech recognition with air and bone conducted speech | |
Sivasankaran et al. | Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition | |
JP3798530B2 (ja) | 音声認識装置及び音声認識方法 | |
JP3163109B2 (ja) | 多方向同時収音式音声認識方法 | |
CN112530452B (zh) | 一种后置滤波补偿方法、装置和系统 | |
CN112530452A (zh) | 一种后置滤波补偿方法、装置和系统 | |
Gammal et al. | Combating reverberation in speaker verification | |
US20230116052A1 (en) | Array geometry agnostic multi-channel personalized speech enhancement | |
Kundegorski et al. | Two-Microphone dereverberation for automatic speech recognition of Polish |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20220112 Address after: 310024 floor 5, zone 2, building 3, Hangzhou cloud computing Industrial Park, Zhuantang street, Xihu District, Hangzhou City, Zhejiang Province Applicant after: Hangzhou suddenly Cognitive Technology Co.,Ltd. Address before: 100083 gate 3, block a, 768 Creative Industry Park, Zhongguancun, No.5 Xueyuan Road, Haidian District, Beijing Applicant before: BEIJING MORAN COGNITIVE TECHNOLOGY Co.,Ltd. |
|
TA01 | Transfer of patent application right |
Effective date of registration: 20240528 Address after: Room 1101, 11th Floor, Pacific International Building, No.106 Zhichun Road, Haidian District, Beijing, 100086 Applicant after: Beijing Haiyunjiexun Technology Co.,Ltd. Country or region after: China Address before: 310024 floor 5, zone 2, building 3, Hangzhou cloud computing Industrial Park, Zhuantang street, Xihu District, Hangzhou City, Zhejiang Province Applicant before: Hangzhou suddenly Cognitive Technology Co.,Ltd. Country or region before: China |
|
GR01 | Patent grant |