CN110495185A - Voice signal processing method and apparatus - Google Patents
Voice signal processing method and apparatus
- Publication number
- CN110495185A (application number CN201880000268.1A)
- Authority
- CN
- China
- Prior art keywords
- voice signal
- voice
- kalman filtering
- tracking
- microphone array
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Circuit For Audible Band Transducer (AREA)
- Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)
Abstract
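Embodiments of the present invention provide a voice signal processing method and apparatus. The voice signal processing method includes: obtaining the angular position of a voice signal relative to a microphone array, where the angular position includes the azimuth angle and the elevation angle of the voice signal relative to the microphone array; determining, from the angular position, a direction vector for the sound source direction of the voice signal; performing Kalman filtering on the voice signal according to the direction vector; and tracking the voice signal according to the result of the Kalman filtering. When applied to fast processing of voice signals in mobile scenarios, the voice signal processing scheme provided by the embodiments can achieve a relatively good processing effect.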
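The steps named in the abstract (angles, direction vector, Kalman filtering, tracking) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the method claimed in the patent: the angle convention, the random-walk state model, the noise variances, and the names `direction_vector` and `DirectionKalman` are all hypothetical and do not come from the source.

```python
import numpy as np


def direction_vector(azimuth_rad, elevation_rad):
    """Unit vector pointing from the array toward the source.

    Assumed convention (not from the patent): azimuth is measured in the
    x-y plane from the +x axis, elevation is measured up from the x-y plane.
    """
    ce = np.cos(elevation_rad)
    return np.array([
        ce * np.cos(azimuth_rad),
        ce * np.sin(azimuth_rad),
        np.sin(elevation_rad),
    ])


class DirectionKalman:
    """Kalman filter over a 3-D direction vector.

    Assumed model: the direction follows a random walk (identity state
    transition) and is observed directly (identity measurement matrix).
    """

    def __init__(self, initial, process_var=1e-3, meas_var=1e-2):
        self.x = np.asarray(initial, dtype=float)  # state estimate
        self.P = np.eye(3)                         # estimate covariance
        self.Q = process_var * np.eye(3)           # process noise covariance
        self.R = meas_var * np.eye(3)              # measurement noise covariance

    def step(self, z):
        """Run one predict/update cycle with measured direction z."""
        # Predict: the state is unchanged, uncertainty grows by Q.
        P_pred = self.P + self.Q
        # Update: blend prediction and measurement using the Kalman gain.
        K = P_pred @ np.linalg.inv(P_pred + self.R)
        self.x = self.x + K @ (np.asarray(z, dtype=float) - self.x)
        self.P = (np.eye(3) - K) @ P_pred
        # Keep the estimate on the unit sphere so it stays a direction.
        norm = np.linalg.norm(self.x)
        if norm > 0.0:
            self.x = self.x / norm
        return self.x


# Usage: track a source that is measured at azimuth 90 degrees, elevation 0.
tracker = DirectionKalman(direction_vector(0.0, 0.0))
for _ in range(200):
    estimate = tracker.step(direction_vector(np.pi / 2, 0.0))
```

The filtered direction returned by `step` is what a tracking stage could consume, for example to steer a beamformer; how the patent actually uses the filter output is not stated in this record.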
Description
PCT application in the national phase; the description has been published.
Claims (22)
- PCT application in the national phase; the claims have been published.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2018/078505 WO2019169616A1 (zh) | 2018-03-09 | 2018-03-09 | Voice signal processing method and apparatus |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110495185A (zh) | 2019-11-22 |
CN110495185B (zh) | 2022-07-01 |
Family
ID=67845832
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201880000268.1A Active CN110495185B (zh) | Voice signal processing method and apparatus | 2018-03-09 | 2018-03-09 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN110495185B (zh) |
WO (1) | WO2019169616A1 (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
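CN111402873A (zh) * | 2020-02-25 | 2020-07-10 | Beijing SoundAI Technology Co., Ltd. | Voice signal processing method, apparatus, device and storage medium |
CN113225478A (zh) * | 2021-04-28 | 2021-08-06 | Vivo Mobile Communication (Hangzhou) Co., Ltd. | Shooting method and apparatus |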
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
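CN111785290B (zh) * | 2020-05-18 | 2023-12-26 | Shenzhen Dongwei Intelligent Technology Co., Ltd. | Microphone array voice signal processing method, apparatus, device and storage medium |
CN111696570B (zh) * | 2020-08-17 | 2020-11-24 | Beijing SoundAI Technology Co., Ltd. | Voice signal processing method, apparatus, device and storage medium |
CN111798869B (zh) * | 2020-09-10 | 2020-11-17 | Chengdu Chipintelli Technology Co., Ltd. | Sound source localization method based on a dual-microphone array |
CN113053376A (zh) * | 2021-03-17 | 2021-06-29 | Automotive Research & Testing Center | Speech recognition device |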
Citations (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1308505A (zh) * | 1998-04-17 | 2001-08-15 | Massachusetts Institute of Technology | Motion tracking system |
EP1349419A2 (en) * | 2002-03-27 | 2003-10-01 | Samsung Electronics Co., Ltd. | Orthogonal circular microphone array system and method for detecting three-dimensional direction of sound source using the same |
US20040252845A1 (en) * | 2003-06-16 | 2004-12-16 | Ivan Tashev | System and process for sound source localization using microphone array beamsteering |
EP1691344A1 (en) * | 2003-11-12 | 2006-08-16 | HONDA MOTOR CO., Ltd. | Speech recognition device |
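CN102831898A (zh) * | 2012-08-31 | 2012-12-19 | Xiamen University | Microphone array speech enhancement device with sound source direction tracking function, and method thereof |
CN102934159A (zh) * | 2010-06-30 | 2013-02-13 | Intel Corporation | Speech audio processing |
CN103544959A (zh) * | 2013-10-25 | 2014-01-29 | South China University of Technology | Call system and method based on speech enhancement with a wirelessly positioned microphone array |
CN104200813A (zh) * | 2014-07-01 | 2014-12-10 | Northeastern University | Dynamic blind signal separation method based on real-time prediction and tracking of sound source direction |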
US20160014506A1 (en) * | 2014-07-14 | 2016-01-14 | Panasonic Intellectual Property Management Co., Ltd. | Microphone array control apparatus and microphone array system |
CN105807273A (zh) * | 2016-04-20 | 2016-07-27 | Beijing Baidu Netcom Science and Technology Co., Ltd. | Sound source tracking method and apparatus |
US20160255446A1 (en) * | 2015-02-27 | 2016-09-01 | Giuliano BERNARDI | Methods, Systems, and Devices for Adaptively Filtering Audio Signals |
US20160275964A1 (en) * | 2015-03-20 | 2016-09-22 | Electronics And Telecommunications Research Institute | Feature compensation apparatus and method for speech recogntion in noisy environment |
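CN106251877A (zh) * | 2016-08-11 | 2016-12-21 | Allwinner Technology Co., Ltd. (Zhuhai) | Voice sound source direction estimation method and apparatus |
CN106842128A (zh) * | 2017-02-11 | 2017-06-13 | Chen Zhaonan | Acoustic tracking method and apparatus for moving targets |
CN106970356A (zh) * | 2016-01-14 | 2017-07-21 | Yutou Technology (Hangzhou) Co., Ltd. | Sound source localization and tracking method for complex environments |
CN107534725A (zh) * | 2015-05-19 | 2018-01-02 | Huawei Technologies Co., Ltd. | Voice signal processing method and apparatus |
CN107621266A (zh) * | 2017-08-14 | 2018-01-23 | Shanghai Aerospace System Engineering Institute | Relative navigation method for space non-cooperative targets based on feature point tracking |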
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
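DE102013215131A1 (de) * | 2013-08-01 | 2015-02-05 | Siemens Medical Instruments Pte. Ltd. | Method for tracking a sound source |
CN104330768B (zh) * | 2013-12-04 | 2017-01-04 | Henan University of Science and Technology | Maneuvering sound source bearing estimation method based on an acoustic vector sensor |
CN107507623A (zh) * | 2017-10-09 | 2017-12-22 | Weituo Intelligent Technology (Shenzhen) Co., Ltd. | Self-service terminal based on microphone array voice interaction |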
2018
- 2018-03-09 WO PCT/CN2018/078505 patent/WO2019169616A1/zh active Application Filing
- 2018-03-09 CN CN201880000268.1A patent/CN110495185B/zh active Active
Patent Citations (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1308505A (zh) * | 1998-04-17 | 2001-08-15 | Massachusetts Institute of Technology | Motion tracking system |
EP1349419A2 (en) * | 2002-03-27 | 2003-10-01 | Samsung Electronics Co., Ltd. | Orthogonal circular microphone array system and method for detecting three-dimensional direction of sound source using the same |
US20040252845A1 (en) * | 2003-06-16 | 2004-12-16 | Ivan Tashev | System and process for sound source localization using microphone array beamsteering |
EP1691344A1 (en) * | 2003-11-12 | 2006-08-16 | HONDA MOTOR CO., Ltd. | Speech recognition device |
US20090018828A1 (en) * | 2003-11-12 | 2009-01-15 | Honda Motor Co., Ltd. | Automatic Speech Recognition System |
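CN102934159A (zh) * | 2010-06-30 | 2013-02-13 | Intel Corporation | Speech audio processing |
CN102831898A (zh) * | 2012-08-31 | 2012-12-19 | Xiamen University | Microphone array speech enhancement device with sound source direction tracking function, and method thereof |
CN103544959A (zh) * | 2013-10-25 | 2014-01-29 | South China University of Technology | Call system and method based on speech enhancement with a wirelessly positioned microphone array |
CN104200813A (zh) * | 2014-07-01 | 2014-12-10 | Northeastern University | Dynamic blind signal separation method based on real-time prediction and tracking of sound source direction |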
US20160014506A1 (en) * | 2014-07-14 | 2016-01-14 | Panasonic Intellectual Property Management Co., Ltd. | Microphone array control apparatus and microphone array system |
US20160255446A1 (en) * | 2015-02-27 | 2016-09-01 | Giuliano BERNARDI | Methods, Systems, and Devices for Adaptively Filtering Audio Signals |
US20160275964A1 (en) * | 2015-03-20 | 2016-09-22 | Electronics And Telecommunications Research Institute | Feature compensation apparatus and method for speech recogntion in noisy environment |
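CN107534725A (zh) * | 2015-05-19 | 2018-01-02 | Huawei Technologies Co., Ltd. | Voice signal processing method and apparatus |
CN106970356A (zh) * | 2016-01-14 | 2017-07-21 | Yutou Technology (Hangzhou) Co., Ltd. | Sound source localization and tracking method for complex environments |
CN105807273A (zh) * | 2016-04-20 | 2016-07-27 | Beijing Baidu Netcom Science and Technology Co., Ltd. | Sound source tracking method and apparatus |
CN106251877A (zh) * | 2016-08-11 | 2016-12-21 | Allwinner Technology Co., Ltd. (Zhuhai) | Voice sound source direction estimation method and apparatus |
CN106842128A (zh) * | 2017-02-11 | 2017-06-13 | Chen Zhaonan | Acoustic tracking method and apparatus for moving targets |
CN107621266A (zh) * | 2017-08-14 | 2018-01-23 | Shanghai Aerospace System Engineering Institute | Relative navigation method for space non-cooperative targets based on feature point tracking |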
Non-Patent Citations (3)
Title |
---|
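BENEDIKT T: "Unsupervised Classification of Voiced Speech and Pitch Tracking Using Forward-Backward Kalman Filtering", Speech Communication * |
Zhou Feng: "Steerable-beam tracking of multiple sound sources based on Kalman filtering and prediction", Microelectronics & Computer * |
Wang Bing: "Matrix Kalman filtering algorithm for attitude determination using combined GPS/gyroscope measurements", Journal of Geomatics Science and Technology * |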
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
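CN111402873A (zh) * | 2020-02-25 | 2020-07-10 | Beijing SoundAI Technology Co., Ltd. | Voice signal processing method, apparatus, device and storage medium |
CN111402873B (zh) * | 2020-02-25 | 2023-10-20 | Beijing SoundAI Technology Co., Ltd. | Voice signal processing method, apparatus, device and storage medium |
CN113225478A (zh) * | 2021-04-28 | 2021-08-06 | Vivo Mobile Communication (Hangzhou) Co., Ltd. | Shooting method and apparatus |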
Also Published As
Publication number | Publication date |
---|---|
WO2019169616A1 (zh) | 2019-09-12 |
CN110495185B (zh) | 2022-07-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110495185B (zh) | Voice signal processing method and apparatus | |
US9837099B1 (en) | Method and system for beam selection in microphone array beamformers | |
US10979805B2 (en) | Microphone array auto-directive adaptive wideband beamforming using orientation information from MEMS sensors | |
US9479885B1 (en) | Methods and apparatuses for performing null steering of adaptive microphone array | |
US7206255B2 (en) | Signal processing apparatus and signal processing method | |
US9549255B2 (en) | Sound pickup apparatus and method for picking up sound | |
US8818002B2 (en) | Robust adaptive beamforming with enhanced noise suppression | |
CN106093864B (zh) | Real-time spatial sound source localization method using a microphone array | |
US8005237B2 (en) | Sensor array beamformer post-processor | |
CN111025233A (zh) | Sound source direction localization method and apparatus, voice device and system | |
US10887691B2 (en) | Audio capture using beamforming | |
JP2004507767A (ja) | System and method for processing a signal emitted from a target signal source into a noisy environment | |
US10638224B2 (en) | Audio capture using beamforming | |
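CN113113034A (zh) | Multi-source tracking and voice activity detection for planar microphone arrays | |
CN110534126B (zh) | Sound source localization and speech enhancement method and system based on fixed beamforming | |
WO2015106401A1 (zh) | Speech processing method and speech processing apparatus | |
JP6977448B2 (ja) | Device control apparatus, device control program, device control method, dialogue apparatus, and communication system | |
TW202147862A (zh) | Robust speaker localization system and method in the presence of strong noise interference | |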
Griebel et al. | Microphone array source localization using realizable delay vectors | |
Huang et al. | Time delay estimation and source localization | |
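CN112859000B (zh) | Sound source localization method and apparatus | |
CN111933182B (zh) | Sound source tracking method, apparatus, device and storage medium | |
CN109239665B (zh) | Continuous localization method and apparatus for multiple sound sources based on a signal subspace similarity spectrum and a particle filter | |
CN113744752A (zh) | Speech processing method and apparatus | |
CN113740803A (zh) | Speaker localization and tracking method and apparatus based on audio and video features |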
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||