WO2018095035A1 - 耳机及其语音识别方法 - Google Patents
耳机及其语音识别方法 Download PDFInfo
- Publication number
- WO2018095035A1 WO2018095035A1 PCT/CN2017/090776 CN2017090776W WO2018095035A1 WO 2018095035 A1 WO2018095035 A1 WO 2018095035A1 CN 2017090776 W CN2017090776 W CN 2017090776W WO 2018095035 A1 WO2018095035 A1 WO 2018095035A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signal
- ambient sound
- sound signal
- ambient
- earphone
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 45
- 230000005236 sound signal Effects 0.000 claims abstract description 132
- 230000007613 environmental effect Effects 0.000 claims description 39
- 230000002459 sustained effect Effects 0.000 claims description 26
- 230000002085 persistent effect Effects 0.000 claims description 24
- 230000003252 repetitive effect Effects 0.000 claims description 8
- 230000000694 effects Effects 0.000 description 3
- 238000004891 communication Methods 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L25/84—Detection of presence or absence of voice signals for discriminating voice from noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Definitions
- the present invention relates to the field of acoustic technology, and more particularly to an earphone and a voice recognition method thereof.
- some earphone products receive and remind the useful voice signals in the corresponding situations by storing some keywords; however, the main disadvantages of this solution are: need to set storage devices such as storage modules or storage libraries to key Words are saved by default; in addition, the storage of keywords is limited, which can not meet the actual application requirements, and the algorithm is complex and costly.
- an object of the present invention is to provide an earphone and a voice recognition method thereof, so as to solve the problem that the current user cannot receive the sound signal in the environment during the process of wearing the earphone, resulting in poor communication and even accidents.
- a method for voice recognition of a headset comprises: the signal acquisition module collecting the ambient sound signal, and transmitting the collected ambient sound signal to the signal processing module for determining and identifying; the signal processing module receiving the ambient sound signal and determining the ambient sound Whether the signal is a non-sustained environmental noise signal, when the ambient sound signal is a non-sustained environmental noise signal, further determining whether the ambient sound signal is a target signal, and when the ambient sound signal is a target signal, the signal processing module controls the current play signal to be turned off.
- the non-sustained ambient noise signal means that the signal characteristic suddenly appearing in the continuous environmental noise signal is completely different from the current persistent environmental noise signal; or, and Current persistent environmental noise signals have relatively different sound signals; signal characteristics include frequency, amplitude, and energy.
- a preferred solution is: in the process of determining whether the ambient sound signal is a non-sustained ambient noise signal, when the ambient sound signal is a continuous environmental noise signal, the early warning processing is not performed and the voice to the ambient sound signal is ended. Identification.
- the warning processing is not performed and the voice recognition of the environmental sound signal is ended.
- the target signal is a discontinuous repetitive signal
- the signal processing module identifies the environment when the environmental sound signal is received for the first time. The sound signal, when receiving the repeated signal of the first received ambient sound signal again, the signal processing module recognizes the repeated signal as the target signal.
- a preferred solution is that the signal processing module simultaneously turns off the noise reduction function of the earphone while the signal processing module controls the current playback signal to be turned off.
- the signal processing module is a CPU, an MPU or a DSP chip.
- the signal acquisition module is a single microphone or an array of microphones.
- an earphone comprising: a signal acquisition module, configured to collect an ambient sound signal in a surrounding environment; and a signal processing module, configured to receive an ambient sound signal and determine the ambient sound Whether the signal is a non-sustained ambient noise signal, and when the ambient sound signal is a non-sustained ambient noise signal, further determining whether the ambient sound signal is a target signal; and when the ambient sound signal is a target signal, controlling the current play signal to be turned off, Amplifying the target signal and transmitting it through the sounding unit of the earphone, wherein the non-sustained ambient noise signal means that the signal characteristic suddenly appearing in the continuous environmental noise signal is completely different from the current continuous environmental noise.
- the signal processing module determines whether the ambient sound signal is a non-sustained environmental noise signal, when the ambient sound signal is a continuous environmental noise signal, no early warning processing is performed. End speech recognition of the ambient sound signal.
- the process of the signal processing module determining whether the ambient sound signal is a target signal when the ambient sound signal is not a target signal, the early warning process is not performed and the voice recognition of the ambient sound signal is ended.
- the target signal is a discontinuous repetitive signal
- the signal processing module determines whether the ambient sound signal is a target signal, when the environmental sound signal is received for the first time, The signal processing module identifies the ambient sound signal, and when the repeated signal of the first received ambient sound signal is received again, the signal processing module recognizes the repeated signal as the target signal.
- the earphone and the voice recognition method thereof By using the earphone and the voice recognition method thereof according to the present invention, during the normal use of the earphone, the sound signal in the external environment can be simultaneously collected, and the environmental sound signal is processed in real time to obtain the target signal; then, the current content of the earphone is turned off.
- the target signal that meets the requirements is amplified and sent out through the earphone, so that the user can clearly hear the external signal without removing the earphone.
- FIG. 1 is a flow chart 1 of a method for voice recognition of a headset according to an embodiment of the present invention
- FIG. 2 is a second flowchart of a method for voice recognition of a headset according to an embodiment of the invention.
- FIG. 1 shows a flow of a method for recognizing a headphone voice according to an embodiment of the present invention.
- a signal acquisition module for picking up a sound signal is first set in the earphone, and the earphone voice recognition method includes:
- the signal acquisition module collects ambient sound signals in the environment, and transmits the collected ambient sound signals to the signal processing module for judgment and identification.
- the signal processing module receives the ambient sound signal and determines whether the ambient sound signal is a non-persistent environmental noise signal.
- step S130 When the signal processing module determines that the ambient sound signal is a non-persistent ambient noise signal, step S130 is performed: otherwise, step S150 is performed.
- step S130 After confirming that the ambient sound signal is a non-sustained ambient noise signal, further determining whether the ambient sound signal is a target signal; when the ambient sound signal is the target signal, performing step S140; otherwise, executing step S150.
- the signal processing module controls the current headphone playing signal to be turned off, and the target signal is amplified, and the amplified target signal is sent out through the sound emitting unit of the earphone, and is picked up by the user.
- ending speech recognition of the current ambient sound signal does not mean that speech recognition is stopped as such. In fact, the speech recognition of the ambient sound signal should be continuous or spaced.
- the target signal in the embodiment of the present invention is a non-sustained ambient noise signal, thereby distinguishing the persistent noise signal in the environment from the target signal, and confirming that the final target signal is transmitted to the human ear.
- the non-sustained environmental noise signal refers to a signal characteristic that suddenly appears in the persistent environmental noise signal is completely different from the current persistent environmental noise signal; or a sound signal that is largely different from the current persistent environmental noise signal, The signal characteristics include frequency, amplitude or energy.
- the waveform characteristics of the environmental noise signal whether it is a persistent environmental noise signal or a non-sustained environmental noise signal; wherein the waveform characteristic parameters include: amplitude, phase, Frequency spectrum, energy spectrum, duration, rise time, etc.
- the target signal needs to be transmitted to the user's ear, so that the user can receive the sound signal in the external environment without removing the earphone. Therefore, the signal processing module will turn off the current playing signal of the earphone.
- the earphone is equipped with functions such as noise reduction, the noise reduction function and the like are simultaneously turned off to ensure that the target signal can be clearly and accurately picked up by the user.
- FIG. 2 shows a detailed flow of a method for recognizing a headphone voice according to an embodiment of the present invention.
- the method for voice recognition of a headset according to an embodiment of the present invention includes the following steps:
- S210 Acquire a sound signal in a user's surroundings as an ambient sound signal through a microphone built in the earphone.
- S220 transmitting the ambient sound signal collected by the microphone to the signal processing module, and the signal processing module identifies and judges the ambient sound signal.
- step S230 After the signal processing module recognizes that the ambient sound signal is a persistent noise signal in the environment, step S270 is performed, the early warning processing is not performed, and the voice recognition of the current ambient sound signal is ended.
- the ambient sound signal is a non-sustained ambient noise signal, but at the same time is a non-repetitive signal, and the current ambient sound signal is not the target signal, and step S270 is performed, the early warning processing is not performed, and the voice recognition of the current ambient sound signal is ended. .
- the ambient sound signal is a non-sustained ambient noise signal, and at the same time is a repeated signal, confirming that the current ambient sound signal is the target signal, and executing step S280.
- the signal processing module turns off the playing signal of the current earphone, amplifies the target signal and sends out through the sounding unit of the earphone, ensuring that the user picks up an intermittent repetitive signal from the outside.
- the target signal in the embodiment of the present invention is a discontinuous repetitive signal.
- the signal processing module identifies the environment.
- the sound signal when receiving the repeated signal of the first received ambient sound signal again, the signal processing module recognizes the repeated signal as the target signal.
- the target signal may be an externally repeated stress vocal (eg, a shout signal, a name signal), an alarm signal (eg, a horn signal, a whistle signal), a dangerous animal sound signal, and the like.
- an externally repeated stress vocal eg, a shout signal, a name signal
- an alarm signal eg, a horn signal, a whistle signal
- a dangerous animal sound signal e.g., a dangerous animal sound signal, and the like.
- a signal that has a target characteristic to the outside world such as a duplicate name, appears to the outside world.
- the first name signal is transmitted to the signal processing module for processing and recognition.
- the sound signal having the target characteristic can be identified.
- the signal processing module will The sound signal originally played is turned off, and the name signal received by the microphone is amplified and sent to the human ear through the pronunciation unit.
- the present invention further provides an earphone device, which utilizes a voice recognition method for earphones to timely pick up a target signal in an external environment and issue a reminder to ensure that the user can capture the outside while listening to the earphone.
- the earphone includes a signal acquisition module for collecting an ambient sound signal in a surrounding environment, and a signal processing module, configured to receive an ambient sound signal and determine whether the ambient sound signal is a non-persistent environmental noise signal, when the environment When the sound signal is a non-sustained ambient noise signal, further determining whether the ambient sound signal is a target signal; when the ambient sound signal is a target signal, controlling the current play signal to be turned off, and amplifying the target signal and transmitting it through the sound emitting unit of the earphone, wherein
- the non-sustained ambient noise signal refers to a signal characteristic that suddenly appears in the persistent environmental noise signal is completely different from the current persistent environmental noise signal; or a sound that is significantly different from the current persistent environmental noise signal.
- Signal the signal characteristics include frequency, amplitude, and energy.
- the signal processing module determines whether the ambient sound signal is a non-sustained ambient noise signal, when the ambient sound signal is a persistent environmental noise signal, the early warning processing is not performed and the ambient sound signal is ended. Speech recognition. Further, when the signal processing module determines whether the environmental sound signal is a target signal, when the environmental sound signal is not the target signal, the warning processing is not performed and the voice recognition of the environmental sound signal is ended. After confirming the target signal, the signal processing module controls the current headphone playback signal to be turned off, and the target signal is amplified, and the amplified target signal is sent to the human ear through the sound emitting unit of the earphone.
- the target signal in the embodiment of the present invention is a discontinuous repetitive signal.
- the signal processing module identifies the environment.
- the sound signal when receiving the repeated signal of the first received ambient sound signal again, the signal processing module recognizes the repeated signal as the target signal.
- the signal collection module may be a single microphone or two or more microphone arrays;
- the signal processing module may be a CPU (Central Processing Unit), an MPU (Microprocessor Unit). , microprocessor), DSP (Digital Signal Processor) chip or control chip with storage function, etc.
- the algorithm of the sound recognition can exist in the control chip, and when the user wears the earphone, the repeated signal of the intermediate interruption can be clearly captured by the human ear.
- the earphone and the voice recognition method provided by the invention can recognize the environment signal with the target characteristic and amplify the output, can simplify the algorithm of the voice recognition, capture the signal in the environment in time, and remind or Early warning users get useful information or avoid dangers to meet the diversity needs of users.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Headphones And Earphones (AREA)
Abstract
一种耳机及其语音识别方法,其中的耳机语音识别方法包括:信号采集模块采集环境中的环境声音信号,并将所采集到的环境声音信号传递至信号处理模块进行判断识别(S110);信号处理模块接收环境声音信号并判断环境声音信号是否为非持续性环境噪声信号(S120),当环境声音信号为非持续性环境噪声信号时,进一步判断环境声音信号是否为目标信号(S130),当环境声音信号为目标信号时,信号处理模块控制当前播放信号关闭,并将目标信号放大经耳机的发声单元发出(S140)。能够在用户佩戴耳机听音的同时,获取外界环境中有用的声音信号。
Description
本发明涉及声学技术领域,更为具体地,涉及一种耳机及其语音识别方法。
随着高品质智能手机、Pad等便携式电子产品的广泛应用,人们对可与其配合应用的耳机的要求也越来越高,在要求其体形小巧的同时,还要求其具备能够逼真再现各种音效的高保真音质性能。此外,无论上述何种电子产品,为了让使用者在不干扰旁人的状况下聆听电子产品所提供的声音信号,耳机已经成为各电子产品的必要配件,此外耳机也提供了聆听者较佳的声音传输,使聆听者能清楚的听到及了解声音内容。
但是,大部分用户都有一边佩戴耳机听音乐,一边工作、运动或者进行其他活动的习惯。由于耳机的主动降噪或者被动降噪效果较好,当有用的外部语音信号出现时,例如同事呼喊耳机佩戴者,或者运动时周围路人提醒安全信息等,耳机佩戴者往往不能马上听到这些信息,从而导致信息沟通不畅,甚或由于接听不到关键信息而导致意外事故发生。
现有技术中,部分耳机产品会通过存储部分关键词对相应情况下的有用语音信号进行接收并提醒;但是,这种方案存在的主要缺点有:需要设置储存模块或者存储库等存储装置对关键词进行预设保存;另外,关键词的存储有限,无法满足实际的应用需求,算法复杂、成本较高。
发明内容
鉴于上述问题,本发明的目的是提供一种耳机及其语音识别方法,以解决目前用户在佩戴耳机过程中,无法接收环境中的声音信号,造成沟通不畅,甚至意外事故的发生等问题。
根据本发明的一个方面,提供一种耳机语音识别方法,在耳机内设置有
拾取声音信号的信号采集模块,语音识别方法包括:信号采集模块采集环境声音信号,并将所采集到的环境声音信号传递至信号处理模块进行判断识别;信号处理模块接收环境声音信号并判断环境声音信号是否为非持续性环境噪声信号,当环境声音信号为非持续性环境噪声信号时,进一步判断环境声音信号是否为目标信号,当环境声音信号为目标信号时,信号处理模块控制当前播放信号关闭,并将目标信号放大经耳机的发声单元发出;其中,非持续性环境噪声信号指的是,在持续性环境噪声信号中突然出现的信号特征完全不同于当前持续性环境噪声信号;或者,和当前持续性环境噪声信号有较大区别的声音信号;信号特征包括频率、幅度和能量。
此外,优选的方案是,在判断所述环境声音信号是否为非持续性环境噪声信号的过程中,当环境声音信号为持续性环境噪声信号时,不进行预警处理并结束对环境声音信号的语音识别。
此外,优选的方案是,在判断环境声音信号是否为目标信号的过程中,当确认环境声音信号不是目标信号时,不进行预警处理并结束对环境声音信号的语音识别。
此外,优选的方案是,目标信号为间断性的重复信号,在判断所述环境声音信号是否为目标信号的过程中,当第一次接收到环境声音信号时,所述信号处理模块识别该环境声音信号,当再次接收到第一次接收的环境声音信号的重复信号时,所述信号处理模块识别出重复信号作为目标信号。
此外,优选的方案是,在信号处理模块控制当前播放信号关闭的过程中,信号处理模块同时关闭耳机的降噪功能。
此外,优选的方案是,信号处理模块为CPU、MPU或者DSP芯片。
此外,优选的方案是,信号采集模块为单颗麦克风或者麦克风阵列。
根据本发明的另一方面,提供一种耳机,其特征在于,包括:信号采集模块,用于采集周围环境中的环境声音信号;信号处理模块,用于接收环境声音信号并判断所述环境声音信号是否为非持续性环境噪声信号,当所述环境声音信号为非持续性环境噪声信号时,进一步判断环境声音信号是否为目标信号;当环境声音信号为目标信号时,控制当前播放信号关闭,将目标信号放大并经耳机的发声单元发出,其中,所述非持续性环境噪声信号指的是,在持续性环境噪声信号中突然出现的信号特征完全不同于当前持续性环境噪
声信号;或者,和当前持续性环境噪声信号有较大区别的声音信号;所述信号特征包括频率、幅度和能量。
此外,优选的方案是,在所述信号处理模块判断所述环境声音信号是否为非持续性环境噪声信号的过程中,当所述环境声音信号为持续性环境噪声信号时,不进行预警处理并结束对环境声音信号的语音识别。并且,在所述信号处理模块判断所述环境声音信号是否为目标信号的过程中,当所述环境声音信号不是目标信号时,不进行预警处理并结束对环境声音信号的语音识别。
此外,优选的方案是,所述目标信号为间断性的重复信号,在所述信号处理模块判断所述环境声音信号是否为目标信号的过程中,当第一次接收到环境声音信号时,所述信号处理模块识别该环境声音信号,当再次接收到第一次接收的环境声音信号的重复信号时,所述信号处理模块识别出重复信号作为目标信号。
利用上述根据本发明的耳机及其语音识别方法,在耳机正常使用过程中,能够同时采集外界环境中的声音信号,并对环境声音信号进行实时处理,获取目标信号;然后,关闭耳机当前播放内容,将符合要求的目标信号放大并经耳机发出,使用户无需摘除耳机即可清晰听到外界信号。
为了实现上述以及相关目的,本发明的一个或多个方面包括后面将详细说明的特征。下面的说明以及附图详细说明了本发明的某些示例性方面。然而,这些方面指示的仅仅是可使用本发明的原理的各种方式中的一些方式。此外,本发明旨在包括所有这些方面以及它们的等同物。
通过参考以下结合附图的说明,并且随着对本发明的更全面理解,本发明的其它目的及结果将更加明白及易于理解。在附图中:
图1为根据本发明实施例的耳机语音识别方法流程图一;
图2为根据本发明实施例的耳机语音识别方法流程图二。
在所有附图中相同的标号指示相似或相应的特征或功能。
在下面的描述中,出于说明的目的,为了提供对一个或多个实施例的全面理解,阐述了许多具体细节。然而,很明显,也可以在没有这些具体细节的情况下实现这些实施例。在其它例子中,为了便于描述一个或多个实施例,公知的结构和设备以方框图的形式示出。
为详细描述本发明实施例的耳机及其语音识别方法,以下将结合附图对本发明的具体实施例进行详细描述。
图1示出了根据本发明实施例的耳机语音识别方法流程。
如图1所示,本发明实施例的耳机语音识别方法,首先要在耳机内设置用于拾取声音信号的信号采集模块,耳机语音识别方法包括:
S110:信号采集模块采集环境中的环境声音信号,并将所采集到的环境声音信号传递至信号处理模块进行判断识别。
S120:信号处理模块接收环境声音信号并判断环境声音信号是否为非持续性环境噪声信号。
其中,当信号处理模块判断环境声音信号为非持续性环境噪声信号时,执行步骤S130:否则,执行步骤S150。
S130:确认环境声音信号为非持续性环境噪声信号后,进一步判断该环境声音信号是否为目标信号;当环境声音信号为目标信号时,执行步骤S140;否则,执行步骤S150。
S140:信号处理模块控制当前耳机播放信号关闭,并将目标信号进行放大,放大后的目标信号经耳机的发声单元发出,并被用户拾取。
S150:不进行预警处理,并结束对当前环境声音信号的语音识别。
在S150中,结束对当前环境声音信号的语音识别并不是指就此停止语音识别。实际上,环境声音信号的语音识别应持续或间隔进行。
可知,本发明实施例的目标信号是非持续性环境噪声信号,从而将环境中持续性噪声信号与目标信号进行区分,确认最终的目标信号再传输至人耳内。其中,非持续性环境噪声信号指的是在持续性环境噪声信号中突然出现的信号特征完全不同于当前持续性环境噪声信号;或者,和当前持续性环境噪声信号有较大区别的声音信号,其中的信号特征包括频率、幅度或者能量等。此外,也可根据环境噪声信号的波形特征区别其是否为持续性环境噪声信号或者非持续性环境噪声信号;其中,波形特征参数包括:幅值、相位、
频率谱、能量谱、持续时间、上升时间等。
在本发明的一个具体实施方式中,信号处理模块确认最终的目标信号后,需要将目标信号传输至用户人耳,使用户在免摘除耳机的情况下,接收外界环境中的声音信号。因此,信号处理模块会关闭耳机当前播放信号,当耳机具备降噪等功能时,会同时关闭该降噪功能等,以确保目标信号能够清晰、准确的被用户拾取。
具体地,图2示出了根据本发明实施例的耳机语音识别方法的详细流程。如图2所示,本发明实施例的耳机语音识别方法,包括如下所示步骤:
S210:通过耳机内置的麦克风采集用户周围环境中的声音信号作为环境声音信号。
S220:将麦克风采集到的环境声音信号传输至信号处理模块,信号处理模块对环境声音信号进行识别判断。
S230:信号处理模块识别出环境声音信号为环境中的持续性噪声信号后,执行步骤S270,不进行预警处理,并结束对当前环境声音信号的语音识别。
S240:信号处理模块识别出环境声音信号为非持续性环境噪声信号后,需进一步对环境声音信号进行是否为目标信号进行判断。
S250:环境声音信号为非持续性环境噪声信号,但同时为非重复信号,可确认当前环境声音信号不是目标信号,并执行步骤S270,不进行预警处理,并结束对当前环境声音信号的语音识别。
S260:环境声音信号为非持续性环境噪声信号,同时为重复信号,则确认当前环境声音信号为目标信号,并执行步骤S280。
S280:信号处理模块关闭当前耳机的播放信号,将目标信号放大并经过耳机的发声单元发出,确保用户拾取到外界间断性的重复信号。
可知,本发明实施例的目标信号为间断性的重复信号,在判断所述环境声音信号是否为目标信号的过程中,当第一次接收到环境声音信号时,所述信号处理模块识别该环境声音信号,当再次接收到第一次接收的环境声音信号的重复信号时,所述信号处理模块识别出重复信号作为目标信号。
作为示例,目标信号可以为外界重复的应激人声(例如,呼喊信号、名字信号)、报警信号(例如,喇叭信号、鸣笛信号)、危险动物声音信号等等。
具体地,对外界具备目标特性的信号,比如重复的名字,外界出现某人
名字时,其第一声名字信号就传递至信号处理模块进行处理识别,在第二声传递至信号处理模块后,即可识别出是具有目标特性的声信号,此时,信号处理模块会将原先在播放的声音信号关闭,同时将麦克风接收到的名字信号放大后经发音单元发送到人耳。
与上述耳机语音识别方法相对应,本发明还提供一种耳机装置,利用耳机语音识别方法对外界环境中的目标信号进行及时拾取并发出提醒,确保用户能够在佩戴耳机听音的同时,捕捉外界中的有用目标信息。
具体地,耳机包括信号采集模块,用于采集周围环境中的环境声音信号;信号处理模块,用于接收环境声音信号并判断所述环境声音信号是否为非持续性环境噪声信号,当所述环境声音信号为非持续性环境噪声信号时,进一步判断环境声音信号是否为目标信号;当环境声音信号为目标信号时,控制当前播放信号关闭,并将目标信号放大并经耳机的发声单元发出,其中,所述非持续性环境噪声信号指的是,在持续性环境噪声信号中突然出现的信号特征完全不同于当前持续性环境噪声信号;或者,和当前持续性环境噪声信号有较大区别的声音信号;所述信号特征包括频率、幅度和能量。
其中,在所述信号处理模块判断所述环境声音信号是否为非持续性环境噪声信号的过程中,当所述环境声音信号为持续性环境噪声信号时,不进行预警处理并结束对环境声音信号的语音识别。并且,在所述信号处理模块判断环境声音信号是否为目标信号时,当环境声音信号不是目标信号时,不进行预警处理并结束对环境声音信号的语音识别。确认目标信号后,信号处理模块会控制当前耳机播放信号关闭,并将目标信号进行放大,放大后的目标信号经耳机的发声单元发送至人耳。
可知,本发明实施例的目标信号为间断性的重复信号,在判断所述环境声音信号是否为目标信号的过程中,当第一次接收到环境声音信号时,所述信号处理模块识别该环境声音信号,当再次接收到第一次接收的环境声音信号的重复信号时,所述信号处理模块识别出重复信号作为目标信号。另外,在本发明实施例的耳机语音识别方法中,信号采集模块可以为单颗麦克风或者两颗以上的麦克风阵列;信号处理模块可以为CPU(Central Processing Unit,中央处理器)、MPU(Microprocessor Unit,微处理器)、DSP(Digital signal processor,数字信号处理)芯片或者具有存储功能的控制芯片等,用于实现语
音识别的算法可以存在控制芯片内,用户在佩戴耳机时,保证环境中间断性的重复信号可被人耳清晰的捕捉到。
通过上述实施方式可以看出,本发明提供的耳机及其语音识别方法,能够识别出具有目标特性的环境信号并放大输出,可简化语音识别的算法,及时对环境中的信号进行捕捉,提醒或者预警用户获取有用信息或者规避危险,满足用户多样性的耳机需求。
如上参照附图以示例的方式描述根据本发明的耳机及其语音识别方法。但是,本领域技术人员应当理解,对于上述本发明所提出的耳机及其语音识别方法,还可以在不脱离本发明内容的基础上做出各种改进。因此,本发明的保护范围应当由所附的权利要求书的内容确定。
Claims (10)
- 一种耳机语音识别方法,其特征在于,在所述耳机内设置有拾取声音信号的信号采集模块,所述语音识别方法包括:所述信号采集模块采集环境中的环境声音信号,并将所采集到的环境声音信号传递至信号处理模块进行判断识别;所述信号处理模块接收所述环境声音信号并判断所述环境声音信号是否为非持续性环境噪声信号,当所述环境声音信号为非持续性环境噪声信号时,进一步判断所述环境声音信号是否为目标信号,当所述环境声音信号为目标信号时,所述信号处理模块控制当前播放信号关闭,并将所述目标信号放大经耳机的发声单元发出;其中,所述非持续性环境噪声信号指的是,在持续性环境噪声信号中突然出现的信号特征完全不同于当前持续性环境噪声信号;或者,和当前持续性环境噪声信号有较大区别的声音信号;所述信号特征包括频率、幅度和能量。
- 如权利要求1所述的耳机语音识别方法,其特征在于,在判断所述环境声音信号是否为非持续性环境噪声信号的过程中,当所述环境声音信号为持续性环境噪声信号时,不进行预警处理并结束对所述环境声音信号的语音识别。
- 如权利要求1所述的耳机语音识别方法,其特征在于,在判断所述环境声音信号是否为目标信号的过程中,当所述环境声音信号不是目标信号时,不进行预警处理并结束对所述环境声音信号的语音识别。
- 如权利要求1所述的耳机语音识别方法,其特征在于,所述目标信号为间断性的重复信号,在判断所述环境声音信号是否为目标信号的过程中,当第一次接收到环境声音信号时,所述信号处理模块识别该环境声音信号,当再次接收到第一次接收的环境声音信号的重复信号时,所述信号处理模块识别出重复信号作为目标信号。
- 如权利要求1所述的耳机语音识别方法,其特征在于,在所述信号处理模块控制当前播放信号关闭的过程中,所述信号处理模块同时关闭所述耳机的降噪功能。
- 一种耳机,其特征在于,包括:信号采集模块,用于采集周围环境中的环境声音信号;信号处理模块,用于接收所述环境声音信号并判断所述环境声音信号是否为非持续性环境噪声信号,当所述环境声音信号为非持续性环境噪声信号时,进一步判断所述环境声音信号是否为目标信号;当所述环境声音信号为目标信号时,控制当前播放信号关闭,将所述目标信号放大并经耳机的发声单元发出,其中,所述非持续性环境噪声信号指的是,在持续性环境噪声信号中突然出现的信号特征完全不同于当前持续性环境噪声信号;或者,和当前持续性环境噪声信号有较大区别的声音信号;所述信号特征包括频率、幅度和能量。
- 如权利要求6所述的耳机,其特征在于,在所述信号处理模块判断所述环境声音信号是否为非持续性环境噪声信号的过程中,当所述环境声音信号为持续性环境噪声信号时,不进行预警处理并结束对所述环境声音信号的语音识别,并且,在所述信号处理模块判断所述环境声音信号是否为目标信号的过程中,当所述环境声音信号不是目标信号时,不进行预警处理并结束对所述环境声音信号的语音识别。
- 如权利要求6所述的耳机,其特征在于,所述目标信号为间断性的重复信号,在所述信号处理模块判断所述环境声音信号是否为目标信号的过程中,当第一次接收到环境声音信号时,所述信号处理模块识别该环境声音信号,当再次接收到第一次接收的环境声音信号的重复信号时,所述信号处理模块识别出重复信号作为目标信号。
- 如权利要求6所述的耳机,其特征在于,所述信号处理模块为CPU、MPU或者DSP芯片。
- 如权利要求6所述的耳机,其特征在于,所述信号采集模块为单颗麦克风或者麦克风阵列。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611067886.7A CN106601272B (zh) | 2016-11-24 | 2016-11-24 | 耳机及其语音识别方法 |
CN201611067886.7 | 2016-11-24 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2018095035A1 true WO2018095035A1 (zh) | 2018-05-31 |
Family
ID=58595464
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2017/090776 WO2018095035A1 (zh) | 2016-11-24 | 2017-06-29 | 耳机及其语音识别方法 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN106601272B (zh) |
WO (1) | WO2018095035A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113194383A (zh) * | 2021-04-29 | 2021-07-30 | 歌尔科技有限公司 | 一种声音播放方法、装置、电子设备及可读存储介质 |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106601272B (zh) * | 2016-11-24 | 2019-09-17 | 歌尔股份有限公司 | 耳机及其语音识别方法 |
CN109791775A (zh) * | 2017-06-16 | 2019-05-21 | 深圳市柔宇科技有限公司 | 耳机控制方法及系统 |
CN107333199A (zh) | 2017-07-21 | 2017-11-07 | 京东方科技集团股份有限公司 | 耳机控制装置、耳机及耳机控制方法 |
CN107564523B (zh) * | 2017-09-28 | 2021-06-22 | 歌尔科技有限公司 | 一种耳机接听方法、装置以及耳机 |
CN109671444B (zh) * | 2017-10-16 | 2020-08-14 | 腾讯科技(深圳)有限公司 | 一种语音处理方法及装置 |
CN108600893A (zh) * | 2018-07-10 | 2018-09-28 | 武汉轻工大学 | 军事环境音频分类系统、方法及军用降噪耳机 |
CN110232919A (zh) * | 2019-06-19 | 2019-09-13 | 北京智合大方科技有限公司 | 实时语音流提取与语音识别系统及方法 |
CN110475170A (zh) * | 2019-07-10 | 2019-11-19 | 深圳壹账通智能科技有限公司 | 耳机播放状态的控制方法、装置、移动终端及存储介质 |
CN110619890A (zh) * | 2019-08-23 | 2019-12-27 | 苏宁智能终端有限公司 | 一种佩戴耳机状态下识别外界声音的方法和系统 |
WO2021159369A1 (zh) | 2020-02-13 | 2021-08-19 | 深圳市汇顶科技股份有限公司 | 一种用于降噪的助听方法、装置、芯片、耳机及存储介质 |
CN111326159B (zh) * | 2020-03-10 | 2023-07-25 | 苏宁云计算有限公司 | 一种语音识别方法、装置、系统 |
CN112102829A (zh) * | 2020-09-04 | 2020-12-18 | 刘耀 | 一种基于语音识别的播放器控制系统及其方法 |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2004260779A (ja) * | 2003-02-25 | 2004-09-16 | Hiroshi Kawanami | ヘッドオーディオ |
CN202475722U (zh) * | 2012-03-07 | 2012-10-03 | 金哲 | 一种自动感知汽车鸣笛的智能耳机 |
CN202524542U (zh) * | 2012-04-25 | 2012-11-07 | 无锡商业职业技术学院 | 一种智能耳机 |
CN105100367A (zh) * | 2014-04-18 | 2015-11-25 | 深圳富泰宏精密工业有限公司 | 交通鸣笛安全警示系统与方法 |
CN105205955A (zh) * | 2015-09-25 | 2015-12-30 | 小米科技有限责任公司 | 一种发出提示信号的方法和装置 |
CN205123960U (zh) * | 2015-11-10 | 2016-03-30 | 山东科技大学 | 一种智能头戴式蓝牙耳机 |
CN106601272A (zh) * | 2016-11-24 | 2017-04-26 | 歌尔股份有限公司 | 耳机及其语音识别方法 |
-
2016
- 2016-11-24 CN CN201611067886.7A patent/CN106601272B/zh active Active
-
2017
- 2017-06-29 WO PCT/CN2017/090776 patent/WO2018095035A1/zh active Application Filing
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2004260779A (ja) * | 2003-02-25 | 2004-09-16 | Hiroshi Kawanami | ヘッドオーディオ |
CN202475722U (zh) * | 2012-03-07 | 2012-10-03 | 金哲 | 一种自动感知汽车鸣笛的智能耳机 |
CN202524542U (zh) * | 2012-04-25 | 2012-11-07 | 无锡商业职业技术学院 | 一种智能耳机 |
CN105100367A (zh) * | 2014-04-18 | 2015-11-25 | 深圳富泰宏精密工业有限公司 | 交通鸣笛安全警示系统与方法 |
CN105205955A (zh) * | 2015-09-25 | 2015-12-30 | 小米科技有限责任公司 | 一种发出提示信号的方法和装置 |
CN205123960U (zh) * | 2015-11-10 | 2016-03-30 | 山东科技大学 | 一种智能头戴式蓝牙耳机 |
CN106601272A (zh) * | 2016-11-24 | 2017-04-26 | 歌尔股份有限公司 | 耳机及其语音识别方法 |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113194383A (zh) * | 2021-04-29 | 2021-07-30 | 歌尔科技有限公司 | 一种声音播放方法、装置、电子设备及可读存储介质 |
Also Published As
Publication number | Publication date |
---|---|
CN106601272A (zh) | 2017-04-26 |
CN106601272B (zh) | 2019-09-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2018095035A1 (zh) | 耳机及其语音识别方法 | |
US11569789B2 (en) | Compensation for ambient sound signals to facilitate adjustment of an audio volume | |
JP6747538B2 (ja) | 情報処理装置 | |
CN102124758B (zh) | 助听器、助听系统、步行检测方法和助听方法 | |
US20180336901A1 (en) | Ear-borne Audio Device Conversation Recording and Compressed Data Transmission | |
US10224019B2 (en) | Wearable audio device | |
US20170345408A1 (en) | Active Noise Reduction Headset Device with Hearing Aid Features | |
US9818423B2 (en) | Method of improving sound quality and headset thereof | |
WO2020019821A1 (zh) | 麦克风堵孔检测方法及相关产品 | |
JP2020500492A (ja) | 空間的アンビエントアウェア型の個人用オーディオ供給デバイス | |
US8654998B2 (en) | Hearing aid apparatus | |
CN110602594A (zh) | 具体环境声音提醒模式的耳机装置 | |
WO2015139642A1 (zh) | 一种实现蓝牙耳机降噪的方法、装置和系统 | |
US10129633B1 (en) | Automated awareness for ANR systems | |
CN106170108B (zh) | 具有分贝提醒模式的耳机装置 | |
CN103581786A (zh) | 用于耳机的安全设备和方法 | |
CN113949956B (zh) | 降噪处理方法、装置、电子设备、耳机及存储介质 | |
CN111683319A (zh) | 一种通话拾音降噪方法及耳机、存储介质 | |
CN113949955B (zh) | 降噪处理方法、装置、电子设备、耳机及存储介质 | |
CN116324969A (zh) | 具有定位反馈的听力增强和可穿戴系统 | |
CN108235165A (zh) | 一种麦克风颈环耳机 | |
CN113194383A (zh) | 一种声音播放方法、装置、电子设备及可读存储介质 | |
WO2021255415A1 (en) | Wear detection | |
CN106302974B (zh) | 一种信息处理的方法及电子设备 | |
CN111028838A (zh) | 一种语音唤醒方法、设备及计算机可读存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 17874403 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 17874403 Country of ref document: EP Kind code of ref document: A1 |