CN106782618B - Target direction speech detection method based on second order cone programming - Google Patents
- Publication number: CN106782618B (application CN201611202064.5A)
- Authority: CN (China)
- Prior art keywords: signal, noise, power, target, ratio
- Prior art date
- Legal status: Active (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Links
Concept ID | Name | Type | Score | Sections | Count |
---|---|---|---|---|---|
238000001514 | detection method | Methods | 0.000 | title, claims, abstract, description | 37 |
238000009499 | grossing | Methods | 0.000 | claims, abstract, description | 12 |
238000000034 | method | Methods | 0.000 | claims, description | 25 |
238000005070 | sampling | Methods | 0.000 | claims, description | 10 |
230000004044 | response | Effects | 0.000 | claims, description | 9 |
238000000354 | decomposition reaction | Methods | 0.000 | claims, description | 4 |
239000011159 | matrix material | Substances | 0.000 | claims, description | 3 |
238000004364 | calculation method | Methods | 0.000 | abstract, description | 3 |
238000012935 | Averaging | Methods | 0.000 | description | 3 |
230000000694 | effects | Effects | 0.000 | description | 3 |
238000001228 | spectrum | Methods | 0.000 | description | 2 |
230000003044 | adaptive effect | Effects | 0.000 | description | 1 |
230000009286 | beneficial effect | Effects | 0.000 | description | 1 |
230000001427 | coherent effect | Effects | 0.000 | description | 1 |
238000007796 | conventional method | Methods | 0.000 | description | 1 |
230000007547 | defect | Effects | 0.000 | description | 1 |
230000003993 | interaction | Effects | 0.000 | description | 1 |
238000012986 | modification | Methods | 0.000 | description | 1 |
230000004048 | modification | Effects | 0.000 | description | 1 |
230000008569 | process | Effects | 0.000 | description | 1 |
238000012545 | processing | Methods | 0.000 | description | 1 |
230000009467 | reduction | Effects | 0.000 | description | 1 |
238000013179 | statistical model | Methods | 0.000 | description | 1 |
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/21—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/60—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for measuring the quality of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/84—Detection of presence or absence of voice signals for discriminating voice from noise
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Circuit For Audible Band Transducer (AREA)
- Measurement Of Velocity Or Position Using Acoustic Or Ultrasonic Waves (AREA)
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611202064.5A (CN106782618B) | 2016-12-23 | 2016-12-23 | Target direction speech detection method based on second order cone programming |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611202064.5A (CN106782618B) | 2016-12-23 | 2016-12-23 | Target direction speech detection method based on second order cone programming |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106782618A (zh) | 2017-05-31 |
CN106782618B (zh) | 2020-07-31 |
Family
ID=58897475
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611202064.5A (Active, CN106782618B) | 2016-12-23 | 2016-12-23 | Target direction speech detection method based on second order cone programming |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106782618B (zh) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
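CN107785029B (zh) * | 2017-10-23 | 2021-01-29 | 科大讯飞股份有限公司 | Target speech detection method and device |
CN111261178B (zh) * | 2018-11-30 | 2024-09-20 | 北京京东尚科信息技术有限公司 | Beamforming method and device |
CN109831709B (zh) * | 2019-02-15 | 2020-10-09 | 杭州嘉楠耘智信息科技有限公司 | Sound source orientation method and device, and computer-readable storage medium |
CN111381210A (zh) * | 2020-03-04 | 2020-07-07 | 哈尔滨工程大学 | Ship radiated noise suppression method based on second order cone programming |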
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
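CN101976565A (zh) * | 2010-07-09 | 2011-02-16 | 瑞声声学科技(深圳)有限公司 | Dual-microphone-based speech enhancement device and method |
CN104768100B (zh) * | 2014-01-02 | 2018-03-23 | 中国科学院声学研究所 | Time-domain broadband harmonic-domain beamformer and beamforming method for circular arrays |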
- 2016-12-23: CN application CN201611202064.5A filed; patent CN106782618B (zh), status Active
Also Published As
Publication number | Publication date |
---|---|
CN106782618A (zh) | 2017-05-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110082725B (zh) | | Microphone-array-based time delay estimation method for sound source localization, and sound source localization system |
CN106782618B (zh) | | Target direction speech detection method based on second order cone programming |
US11081123B2 (en) | | Microphone array-based target voice acquisition method and device |
CN108122563B (zh) | | Method for improving voice wake-up rate and correcting DOA |
US7626889B2 (en) | | Sensor array post-filter for tracking spatial distributions of signals and noise |
US9633651B2 (en) | | Apparatus and method for providing an informed multichannel speech presence probability estimation |
JP4937622B2 (ja) | | Computer-implemented method for building a location model |
EP3047483B1 (en) | | Adaptive phase difference based noise reduction for automatic speech recognition (asr) |
US20170140771A1 (en) | | Information processing apparatus, information processing method, and computer program product |
EP3566461B1 (en) | | Method and apparatus for audio capture using beamforming |
CN110133596A (zh) | | Array sound source localization method based on frequency-bin signal-to-noise ratio and biased soft decision |
US20110274289A1 (en) | | Sensor array beamformer post-processor |
Niwa et al. | | Post-filter design for speech enhancement in various noisy environments |
US10887691B2 (en) | | Audio capture using beamforming |
CN103165137B (zh) | | Speech enhancement method for microphone arrays in non-stationary noise environments |
CN111025273B (zh) | | Line spectrum feature enhancement method and system for distorted towed arrays |
CN108538306B (zh) | | Method and device for improving DOA estimation of voice devices |
CN109188362A (zh) | | Signal processing method for microphone array sound source localization |
JP4422662B2 (ja) | | Sound source position and sound receiving position estimation method, device therefor, program therefor, and recording medium therefor |
CN106683685B (zh) | | Target direction speech detection method based on the least squares method |
Ince et al. | | Assessment of general applicability of ego noise estimation |
CN108549052A (zh) | | Circular-harmonic-domain pseudo-intensity sound source localization method with joint time-frequency and spatial weighting |
US11900920B2 (en) | | Sound pickup device, sound pickup method, and non-transitory computer readable recording medium storing sound pickup program |
CN111798869A (zh) | | Sound source localization method based on a dual-microphone array |
EP3566228B1 (en) | | Audio capture using beamforming |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | PB01 | Publication | |
 | SE01 | Entry into force of request for substantive examination | |
 | TA01 | Transfer of patent application right | Effective date of registration: 20170929; Address after: 200233 Shanghai City, Xuhui District Guangxi 65 No. 1 Jinglu room 702 unit 03; Applicant after: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Address before: 200233 Shanghai, Qinzhou, North Road, No. 82, building 2, layer 1198; Applicant before: SHANGHAI YUZHIYI INFORMATION TECHNOLOGY Co.,Ltd. |
 | GR01 | Patent grant | |
 | EE01 | Entry into force of recordation of patent licensing contract | Application publication date: 20170531; Assignee: Xiamen yunzhixin Intelligent Technology Co.,Ltd.; Assignor: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Contract record no.: X2021310000020; Denomination of invention: Target direction speech detection method based on second order cone programming; Granted publication date: 20200731; License type: Common License; Record date: 20210408 |
 | EC01 | Cancellation of recordation of patent licensing contract | Assignee: Xiamen yunzhixin Intelligent Technology Co.,Ltd.; Assignor: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Contract record no.: X2021310000020; Date of cancellation: 20221111 |