WO2017190219A1 - Device and method for improving the quality of in- ear microphone signals in noisy environments - Google Patents

Device and method for improving the quality of in- ear microphone signals in noisy environments Download PDF

Info

Publication number
WO2017190219A1
WO2017190219A1 PCT/CA2017/000115 CA2017000115W WO2017190219A1 WO 2017190219 A1 WO2017190219 A1 WO 2017190219A1 CA 2017000115 W CA2017000115 W CA 2017000115W WO 2017190219 A1 WO2017190219 A1 WO 2017190219A1
Authority
WO
WIPO (PCT)
Prior art keywords
signal
ear microphone
ear
intra
user
Prior art date
Application number
PCT/CA2017/000115
Other languages
French (fr)
Inventor
Rachel E. BOUSERHAL
Jérémie VOIX
Tiago H. FALK
Original Assignee
Eers Global Technologies Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=60202532&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=WO2017190219(A1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Eers Global Technologies Inc. filed Critical Eers Global Technologies Inc.
Priority to EP17792310.9A priority Critical patent/EP3453189B1/en
Priority to DK17792310.9T priority patent/DK3453189T3/en
Priority to US16/099,274 priority patent/US10783904B2/en
Priority to PL17792310T priority patent/PL3453189T3/en
Priority to CA3074050A priority patent/CA3074050A1/en
Publication of WO2017190219A1 publication Critical patent/WO2017190219A1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/84Detection of presence or absence of voice signals for discriminating voice from noise
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1016Earpieces of the intra-aural type
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1083Reduction of ambient noise
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/005Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61FFILTERS IMPLANTABLE INTO BLOOD VESSELS; PROSTHESES; DEVICES PROVIDING PATENCY TO, OR PREVENTING COLLAPSING OF, TUBULAR STRUCTURES OF THE BODY, e.g. STENTS; ORTHOPAEDIC, NURSING OR CONTRACEPTIVE DEVICES; FOMENTATION; TREATMENT OR PROTECTION OF EYES OR EARS; BANDAGES, DRESSINGS OR ABSORBENT PADS; FIRST-AID KITS
    • A61F11/00Methods or devices for treatment of the ears or hearing sense; Non-electric hearing aids; Methods or devices for enabling ear patients to achieve auditory perception through physiological senses other than hearing sense; Protective devices for the ears, carried on the body or in the hand
    • A61F11/06Protective devices for the ears
    • A61F11/14Protective devices for the ears external, e.g. earcaps or earmuffs
    • A61F11/145Protective devices for the ears external, e.g. earcaps or earmuffs electric, e.g. for active noise reduction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2410/00Microphones
    • H04R2410/01Noise reduction using microphones having different directional characteristics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2460/00Details of hearing devices, i.e. of ear- or headphones covered by H04R1/10 or H04R5/033 but not provided for in any of their subgroups, or of hearing aids covered by H04R25/00 but not provided for in any of its subgroups
    • H04R2460/01Hearing devices using active noise cancellation

Definitions

  • the present disclosure relates to a device and method for improving the quality of in-ear microphone signals such as speech and blosignals including breath, heartbeat, etc. in noisy environments. More specifically, the present disclosure relates to an intra-aural device and method for improving the quality of in-ear microphone signals via adaptive filtering and bandwidth extension.
  • HPD Hearing Protection Devices
  • Bone conduction sensors can be placed in various locations and can provide a relatively high SNR speech signal [10].
  • the elevated SNR comes at a price of very limited frequency bandwidth of the picked-up signal, typically less than 2 kHz [11].
  • the enhancement of bone and tissue conducted speech is a topic of great Interest.
  • Many different techniques have been developed for the bandwidth extension of BTC speech [6] [12] [13] [14]. Even though these techniques oan enhance the quality of bone and tissue conducted speech, they are either computationally complex or require a substantial amount of training from the user [1 1], thus limiting their widespread use in practical settings.
  • An effective compromise between the two extremes of noisy air conducted speech and bandlimited BTC speech captured by bone conduction sensors is speech captured from inside an occluded ear using an in-ear microphone.
  • Occluding the ear canal with an HPD, or more generally an Intra-aural device causes bone and tissue conducted vibrations originating from the cranium to resonate inside the ear canal leading the wearer to hear an amplified version of their voice, this is called the occlusion effect [15].
  • a speech signal is available inside the ear and can be captured using an in-ear microphone.
  • occluding the ear canal with a highly isolating intra-aural device equipped with an in-ear microphone allows for the capturing of a speech signal that is not greatly affected by the background noise because of the passive attenuation of the intra-aural device.
  • Another advantage of using an in-ear microphone instead of a bone conduction microphone is that the speech is still captured acoustically and can share a significant amount of Information with clean speech, such as the one captured -in silence- in front of the mouth in the 0 to 2 kHz range [16].
  • a bandwidth extension technique that utilizes non-linear characteristics should extend the bandwidth of the in-ear microphone signal and add the high frequency harmonics [17].
  • a device and method for enhancing speech generated from bone and tissue conduction captured using an in-ear microphone using adaptive filtering and a non-linear bandwidth extension process are provided.
  • a method for detecting speech of a user of an intra-aural device in a noisy environment comprising the steps of:
  • [0012] acquiring a signal provided by the outer-ear microphone; [0013] applying an adaptive filter on the in-ear microphone signal, using the outer-ear microphone signal as a reference for the ambient noise; the adaptive filter being initialized by an estimated transfer function of the intra- aural device between the outer-ear microphone signal and the In-ear microphone signal; the adaptive filter being represented as a vector of filter weights over a series of time indexes; [0014] upon the computation of an increase in the filter weights of two consequent time indexes greater than a triggering threshold, detecting speech produced by the user.
  • a method for enhancing speech generated from bone and tissue conduction of a user of an in-ear device in a noisy environment the intra-aural device having an in-ear microphone adapted to be in fluid communication with the ear canal of the user and an outer-ear microphone adapted to be In fluid communication with the environment outside the ear, the method comprising the steps of:
  • a device for enhancing speech generated from bone and tissue conduction of a user of an intra-aural device in a noisy environment comprising: [0022] an intra-aural device adapted to be located into the ear canal of the user, the intra-aural device having an in-ear microphone adapted to be in fluid communication with the ear canal of the user and an outer ear microphone adapted to be in fluid communication with the environment outside the ear, and
  • a processing unit operatively connected to the in-ear microphone to receive an internal signal therefrom, to the outer-ear microphone to receive an external signal therefrom and to send a resulting signal to an interlocutor, the processing unit being configured so as to: [0024] execute the method for enhancing speech generated from bone and tissue conduction of a user of an intra-aural device in a noisy environment.
  • Figure 1 Is a perspective view of an intra-aural device for improving the quality of in-ear microphone signals in noisy environments in accordance with an illustrative embodiment of the present disclosure
  • Figure 2 is a cross-section of the intra-aural device of Figure 1 ;
  • Figure 3 is a schematic architecture diagram representation of the intra- aural device of Figure 1;
  • Figure 4A is a schematic representation of two users communicating (only one way presented) in a noisy environment using intra-aural the device of Figure 1 ;
  • Figure 4B is a block diagram representing the interconnections between in- ear microphones, outer-ear microphones and internal speakers of the intra-aural device used by the two users communicating in Figure 4A;
  • Figures 5A and 5B are block diagrams representing the normalized least mean squared (NLMS) adaptive filtering stage when the adaptation is ON ( Figure 5A) and when it is OFF ( Figure 5B);
  • NLMS normalized least mean squared
  • Figure 6 is a schematic plot diagram of an example of a test signal for the in-ear microphone (IEM) to optimize speech detection criteria
  • Figure 7 is a flow diagram of the adaptation prooess for the adaptive filter use to denoise the in-ear microphone (IEM) signals;
  • FIG 8 is a schematic plot diagram of an example of the linear predictive coding (LPC) spectral envelope of the phoneme / 1 / recorded with the reference microphone (REF), the outer-ear microphone (OEM) and the in-ear microphone (IEM) simultaneously;
  • Figure 9 is a block diagram of the bandwidth extension process;
  • Figures 10A, 10B and 10C are block diagrams of the various input-outputs of the intra-aural device of Figure 1 , including the filtered, denoised and enhanced in-ear speech signal (Figure 10A), the filtered and denoised biosignals (Figure 10B) and the Voice Activity Detector (VAD) output state ( Figure 10C);
  • Figure 11 is a schematic representation of an example of use of the intra- aural device of Figure 1 as a portable application for biosignal monitoring;
  • Figure 12 is a block diagram of an intra-aural system for communicating in noisy environments in accordance with another Illustrative embodiment of the present disclosure.
  • the non-limitative illustrative embodiments of the present disclosure provide a device and method for improving the quality of in-ear microphone signals, such as speech, and biosignals, including breath, heartbeat, etc., in noisy environments. It is to be understood that although the present disclosure relates mainly to a device and method for Improving the quality of in-ear microphone speech signals, the technique disclosed can improve the quality of any of the aforementioned signals via adaptive filtering and bandwidth extension.
  • Bone and tissue conducted speech has been used to provide a relatively high Signal-to-Noise Ratio (SNR) in noisy environments. However, the limited bandwidth of bone and tissue conducted speech degrades the quality of the speech signal.
  • SNR Signal-to-Noise Ratio
  • the disclosed device and method use an adaptive filtering approach to denoise the bone and tissue conducted speech signal and, once the signal is denoised, extended its bandwidth by creating odd harmonics in order to recreate the high frequency harmonics,
  • the in-ear microphone picks up speech generated from bone and tissue conduction and generates a speech signal to which an adaptive filter is applied in order to denoise using the signal from the outer-ear microphone.
  • a voice activity detection criteria using the filter coefficients of the adaptive filter is used to ensure that only noise is reduced while the speech content of the speech signal from the in- ear microphone remains unaffected.
  • the described method provides a simple, speaker independent, non- computationally exhaustive method to enhance the quality of speech picked up using an in-ear microphone.
  • POLQA Perceptual Objective Listening Quality Assessment
  • MOS-LQO Objective Listening Quality - Mean Opinion Score
  • MUSHRA Multiple Stimuli with Hidden Reference and Anchor
  • the Intra-aural device for improving the quality of In- ear microphone signals in noisy enviroriments 10 takes the form of an intra-aural unit 12 generally conforming to the ear canal of a user, which may be inflatable, compressible, custom molded, etc., for passive attenuation of ambient noise and a communication link 14, for example a wireless or Bluetooth communication link.
  • the intra-aural device 10 generally includes an in- ear microphone (IEM) 16, a miniature loudspeaker 18, a receiver 20, an outer-ear microphone (OEM) 22 located flush on the outer face of the intra-aural unit 12, transmitter 24, all of which, along with the wireless communication link 14, are operatively connected to a digital signal processing (DSP) unit 26 having an associated memory comprising instructions stored thereon that, when executed on the processor of the DSP unit 26, perform the steps of the various processes which will be further described below. It is to be understood that In alternative embodiments some or all of the receiver 20, transmitter 24 and DSP 26 may be located outside the intra-aural unit 12, for example in an external unit worn by the user of the Intra-aural device 10.
  • IEM in- ear microphone
  • OEM outer-ear microphone
  • FIG. 4A and 4B there is shown two users 1 , 2 wearing intra- aural devices 10-1 and 10-2, respectively, communicating in a noisy environment 30 having a variety of noise sources 32.
  • user 1 Is the speaker and user 2 the listener.
  • the IEM 16 of device 10-1 picks up speech generated from bone and tissue conduction of user 1 and generates a speech signal to which, using the DSP unit 26, the adaptive filter is applied in order to denoise the speech signal using the signal from the OEM 22.
  • the voice activity detection criteria which uses the filter coefficients of the adaptive filter, ensures that only the noise 32 is reduced while the speech content of the speech signal from the IEM 16 remains unaffected.
  • the speech signal is denoised, its bandwidth is extended by exploiting the nonlinear characteristics of a cubic operator.
  • the resulting improved speech signal 34 is then transmitted 36, using the transmitter 24, from device 10-1 to device 10-2 via wireless communication link 14, which provides, when received by receiver 20, the improved speech signal 34 to the loudspeaker 18 of intra-aural device 10-2 and hence user 2. It is to be noted that all of the described steps are performed in real time,
  • the improved speech signal 34 maybe transmitted to another device, for example a smart phone or other such device.
  • a smart phone for example a smart phone or other such device.
  • the presence of the OEM 22 and IEM 16 allows the determination of the relationship between the sound outside the ear and inside the ear, i.e. the transfer function of the intra-aural device 10. This provides insight about the "in-ear" noise and enables denoising through adaptive filtering. Once the IEM 16 speech signal is denoised, bandwidth extension can than be performed to further Improve quality.
  • the Intra-aural device 10 transfer function is estimated, as it varies from user to user. This is accomplished by exposing a worn device for Improving the quality of in-ear microphone speech in noisy environments 10 to white noise at 85 dB (SPL) using a loudspeaker outside the ear for at least 2 seconds.
  • the OEM 22 and IEM 16 simultaneously capture the signals outside and inside the ear respectively and the transfer function of the intra-aural device 12, estimated as
  • the IEM 16 speech signal can be denoised using normalized least mean squared (NLMS) adaptive filtering.
  • NLMS normalized least mean squared
  • the adaptation process must be frozen (OFF) when the user is speaking and active (ON) when the user is not speaking. This ensures that the adaptive filter cancels only the noise and does not interfere with any speech produced by the user.
  • the two states of the adaptive filter are shown in Figures 5A and 5B.
  • the adaptation is ON ( Figure 5A) the structure of the proposed adaptive filter follows the well-known structure commonly described in the literature [19]; the only exception being that the signal of interest is the error signal, e(n).
  • H(z) is the true transfer function of the intra-aural device 12 while is the estimated intra-aural device 12 transfer function.
  • the adaptive filter of order 160 is defined as follows:
  • n is the current time index
  • is the adaptation step size
  • w(ri) Is the vector of filter weights at time index n
  • e is a very small number to avoid division by zero.
  • the signal x(n) picked up by the OEM 22 is then filtered using the #f» and the output, x(n), is fed to the input of the NLMS adaptive filter.
  • the output of the adaptive filter, y(n) is then subtracted from d(n).
  • the adaptive filter brings the difference between the residual noise, n t (n), and the estimated residual noise, fi ⁇ n) to zero. Since the OEM 22 speech signal is almost entirely masked by the noise, the effect of s r (n) and ⁇ r (n) Is negligible. Therefore, the resulting difference between the output of the adaptive filter and the signal captured by the IEM 16 is the speech signal originating from bone and tissue conduction, st(n), with minimal effects of noise.
  • the adaptation process is a function of whether or not the user is speaking.
  • the adaptive filter To denoise the user's speech, the adaptive filter must only adapt when the user is not speaking. This ensures that the filter is adapting to the intra-aural device 12 transfer function (I.e. H(z)) and thus the noise and only the noise is subtracted from the signal and not any relevant speech information.
  • voice activity detection inside the ear Is achieved by monitoring the value of the coefficients of the adaptive filter.
  • the vector of filter weights over the entire index of time, w is used to detect if the user is speaking.
  • test signals can be used, for example the first 10 lists of the recorded Harvard phonetically balanced sentences, for both the OEM 22 and the IEM 16, each test signal starting with at least 2 seconds of noise followed by 8 to 10 seconds of speech either by the user or by an external competing speaker. Exterior speech can be added to simulate a case where the user is not speaking but loud enough that some residual speech exists after the passive attenuation of the intra-aural device 12, The residual speech should not trigger the speech activity of the adaptation process.
  • the residual speech can be simulated by passing the speech through /?(z). The location of the user's speech and the residual speech is randomized to avoid any trends in the adaptation process.
  • Figure 6 shown an example of a randomly chosen IEM 1 ⁇ test signal with both user speech and external speech segments.
  • T g The value for T g has to be selected such that it is not particular to a specific speaker. This can per performed using recorded conversations, through the IEM 16 and the OEM 22, from different speakers (varying gender, age, etc.), for which is analyzed the effect of using different triggering thresholds.
  • the bandwidth of the denoised signals for the various speakers resulting from the sweep is extended using a bandwidth extension (BWE) process, which will be further detailed below.
  • BWE bandwidth extension
  • T a The quality of these signals is measured before and after the BWE to see the effect of the different values for the triggering criteria.
  • the change in filter weights is triggered at the onset of speech but not the end.
  • a € at the onset of speech Is also measured and monitored, per sample, i.e. A e (n).
  • the adaptation is disabled for at least one second and as long as A f is maintained.
  • the filter weights of the adaptive filter are updated with those from the previous second, w(n - fs). This is to ensure that the filter weights are those from when no speech is produced by the user.
  • a e (n) ⁇ ⁇ the adaptation starts again.
  • the process of monitoring the change in gives a non-ad-hoc way to turn ON the adaptation once the user is no longer speaking.
  • the adaptation process is illustrated by the flow diagram in Figure 7.
  • the adaptive filtering denoises the IEM 16 signal by utilizing the information about the noise captured by the OEM 22. Once the IEM 16 is denoised its quality can be enhanced by extending its bandwidth in the high frequencies using the BWE.
  • the triggering threshold, ⁇ # can be set a priori at the time of manufacturing or, In an alternative embodiment, may be set using a calibration process such that T g is specific to the user and/or the intra-aural device 12.
  • the upsampied signal is filtered by a whitening filter using the coefficients of a linear predictive coding (LPC) analysis [20],
  • LPC linear predictive coding
  • the whitening filter is a finite infinite response filter whose coefficients are those of an 18th order LPC filter at that time frame. Cubing the excitation reproduces the odd harmonics along the entire bandwidth including the high band, in this scenario from 1.8 kHz to 4 kHz. Since the high frequencies are the only region of Interest and to eliminate any overlap, the excitation signal is high passed at 1.8 kHz with a third order filter.
  • the upsampied IEM 16 signal is low passed at 1.8 kHz with a third order filter because it contains no relevant frequency information above 1 .8 kHz (see Figure 8).
  • the high pass filter used for the excitation signal and the low pass filter used for the upsampied IEM 16 signal are designed to be power complementary for perfect reconstruction.
  • the sum of the two filtered signals is then band passed with a fourth order Linkwitz-Riley filter at 160 Hz and 3.5 kHz by cascading a second order low pass Butterworth filter and a second order high pass Butterworth filter. This is done to eliminate the boomy effect coming from the bone and tissue conduction as well as any ringing caused by the odd harmonics of the cubed excitation signal.
  • the overall output is then downaampled by a factor of 2 to go back to an 8 kHz sampling frequency. It is important to note that this bandwidth extension technique adds missing harmonics In the high frequencies. However, missing formants and frlcatlon noise are not recovered.
  • the adaptation process was described in the context of denoising a user's speech signal, the determination of whether or not the user is speaking may also be used in an alternative embodiment in order to Interrupt another process when the user is speaking, for example the recording of some biological process (i.e. heart rate, respiration, etc.),
  • the adaptation process may be adapted to detect sounds inside a device or space enclosed in a noisy environment, i.e. the in-ear microphone takes the form of an in-devlce/space microphone and the outer-ear microphone takes the form of an outer-device/space microphone.
  • FIGS. 10A to 10C there are shown various alternative use of the intra-aural device 10, which include the providing of a filtered, denoised and enhanced (using enhancer 28) in-ear speech signal (Figure 10A), filtered and denoised biosignals (Figure 10B) and a Voice Activity Detector (VAD) output state, i.e. a signal indicating the presence or not of voice activity of the user, which can be used for automatic activation of a personal communication system, such as voice activation, voice operated switch or Voice Operated Exchange (VOX) in a two-way radiocommunication device (Figure 10C).
  • a Voice Activity Detector VAD output state
  • a personal communication system such as voice activation, voice operated switch or Voice Operated Exchange (VOX) in a two-way radiocommunication device (Figure 10C).
  • VOX Voice Operated Exchange
  • FIG. 11 there is shown an example of use of the intra- aural device 10 as a portable application for biosignal monitoring.
  • the intra-aural device 10 transmits biosignals of a user via wireless communication link 14, using transmitter 24, to a smart phone 40 on which runs a biosignal monitoring application 42 that can analyze and/or display the biosignals.
  • the application 42 may also warn the user of some specific condition if detected.
  • FIG 12 there is shown an intra-aural system for communicating in noisy environments 10' in accordance with another illustrative embodiment of the present disclosure, which takes the form of a pair of Intra-aural units 12' and a main unit 13.
  • Each intra-aural unit 12 ⁇ includes an in-ear microphone 1 ⁇ , an outer-ear microphone 22 and a pair of miniature loudspeakers 16a, 16b.
  • the receiver 20, transmitter 24 and processing unit 26 are externally located inside a main unit 13 operatively connected to each of the intra-aural units 12',
  • the main unit 13 includes audio interfaces 15 for communication with the intra- aural units 12', a power manager 17 and battery 19 for providing power to the components of the intra-aural 12' and main 13 units, a processing unit 26 in the form of a central processing unit with associated memory (RAM, FLASH memory), a receiver 22 and transmitter 24 In the form of blue tooth and short range radio modems for providing a communication link 14 to remote components, an inertia! measurement unit 21 , a USB port 1 1 for accessing the associated memory and configuring the central processing unit 26, and a plurality of buttons and LEDs 23 for providing various functionalities to a user of the intra-aural communication system 10',

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Soundproofing, Sound Blocking, And Sound Damping (AREA)
  • Headphones And Earphones (AREA)

Abstract

A method, and device, for enhancing speech generated from bone and tissue conduction of a user of an In-ear device in a noisy environment, the Intra-aural device having an in-ear microphone adapted to be in fluid communication with the ear canal of the user and an outer-ear microphone adapted to be in fluid communication with the environment outside the ear. The method comprises applying an adaptive filter on the in-ear microphone signal, using the outer-ear microphone signal as a reference for the ambient noise and interrupting the application of the adaptive filter to the In-ear microphone signal upon detecting speech by the user.

Description

DEVICE AND METHOD FOR IMPROVING THE QUALITY OF IN- EAR MICROPHONE SIGNALS IN NOISY ENVIRONMENTS
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims the benefits of U.S. provisional patent applications Nos. 62/332,861 and 62/460,682, filed on May 6, 2016, and February 2, 2017, respectively, which are herein incorporated by reference.
TECHNICAL FIELD
[0002] The present disclosure relates to a device and method for improving the quality of in-ear microphone signals such as speech and blosignals including breath, heartbeat, etc. in noisy environments. More specifically, the present disclosure relates to an intra-aural device and method for improving the quality of in-ear microphone signals via adaptive filtering and bandwidth extension.
BACKGROUND
[0003] Traditionally, communication headsets use a boom microphone, placed in front of the mouth, to capture speech in noisy settings. Although directional, these microphones often suffer from a low slgnal-to-noise ratio (SNR) in excessively noisy environments and require noise cancelation for enhancement [1]. Alternatively, speech captured through bone and tissue vibrations has been used to provide a signal with a higher SNR [2]. Bone conduction speech can be captured either by microphones placed inside an occluded ear [3] [4] or through bone conduction sensors placed somewhere on the cranium [5]. Although speech generated from bone and tissue conduction can have a relatively high SNR, it suffers from a limited frequency bandwidth (less than 2 kHz), thus reducing signal quality and intelligibility [6]. For applications in which quality and intelligibility are important (e.g. command and control), bone and tissue conduction speech can be a limiting factor. Therefore, to this day, communicating in noise is a difficult task to achieve as the communication signal either suffers from noise interference, in case of airborne speech, or from limited bandwidth, in case of bone and tissue conducted (BTC) speech. [0004] Moreover, in excessively noisy industrial environments where workers are exposed to high level of noise -typically greater than 90 dB(A) for 8 hours-, the Occupational Safety and Health Administration enforces the use of Hearing Protection Devices (HPD) [7], When worn correctly, HPDs can be very effective in preventing noise induced hearing loss [8], However, limited communication remains the number one complaint of workers equipped with HPDs [9].
[0005] Communication headsets are a great way of combining good hearing protection and communication features. Most commonly, headsets made up of circumaural HPDs equipped with a directional boom microphone placed in front of the mouth are used. Circumaural HPDs can generally provide better attenuation than intra-aural HPDs, because they are easier to wear properly [8]. The disadvantages of these types of communication headsets Is two-fold. First, the boom microphone is exposed to the background noise and can still capture unwanted noise, air conducted, that can mask the speech signal of the wearer. Second, circumaural HPDs with boom microphones are not compatible with most other personal protection equipment. The use of other personal protection equipment alongside HPDs Is common in noisy environments. For example, the use of helmets is required for construction workers as are gas masks for firefighters. Using bone and tissue conduction microphones to capture speech is a convenient way to eliminate both of those problems, Bone conduction sensors can be placed in various locations and can provide a relatively high SNR speech signal [10]. As mentioned previously, however, the elevated SNR comes at a price of very limited frequency bandwidth of the picked-up signal, typically less than 2 kHz [11]. As a consequence, the enhancement of bone and tissue conducted speech is a topic of great Interest. Many different techniques have been developed for the bandwidth extension of BTC speech [6] [12] [13] [14]. Even though these techniques oan enhance the quality of bone and tissue conducted speech, they are either computationally complex or require a substantial amount of training from the user [1 1], thus limiting their widespread use in practical settings.
[0006] An effective compromise between the two extremes of noisy air conducted speech and bandlimited BTC speech captured by bone conduction sensors is speech captured from inside an occluded ear using an in-ear microphone. Occluding the ear canal with an HPD, or more generally an Intra-aural device, causes bone and tissue conducted vibrations originating from the cranium to resonate inside the ear canal leading the wearer to hear an amplified version of their voice, this is called the occlusion effect [15]. By way of this occlusion effect, as a consequence of wearing an intra-aural device, a speech signal is available inside the ear and can be captured using an in-ear microphone. Therefore, occluding the ear canal with a highly isolating intra-aural device equipped with an in-ear microphone allows for the capturing of a speech signal that is not greatly affected by the background noise because of the passive attenuation of the intra-aural device. Another advantage of using an in-ear microphone instead of a bone conduction microphone is that the speech is still captured acoustically and can share a significant amount of Information with clean speech, such as the one captured -in silence- in front of the mouth in the 0 to 2 kHz range [16]. A bandwidth extension technique that utilizes non-linear characteristics should extend the bandwidth of the in-ear microphone signal and add the high frequency harmonics [17].
[0007] However, in extremely noisy situations, some residual noise can exist inside the occluded ear canal and capturing speech through air-conduction can result in a reduced SNR. In these noisy conditions extending the bandwidth of the bandlimited in-ear microphone speech becomes a difficult task because depending on the spectrum of the noise, simple bandwidth extension techniques may actually amplify the noise in the signal and decrease the SNR. Bandwidth extension techniques for noisy speech are rare and are typically computationally complex [12] [18]. Since the SNR of the in-ear microphone speech is relatively high, denoising the speech signal becomes an easier task if the noise information insido the ear canal is known. In such extremely noisy conditions that the in-ear microphone signal becomes noisy, speech captured through air-conduction outside the ear has a very low SNR and is almost completely masked by the noise. [0008] Accordingly, there is a need for a system and method for removing the residual noise extending the frequency bandwidth of signals captured by an in-ear microphone in noisy environments.
SUMMARY
[0009] It is therefore a general object of the present disclosure to provide a device and method for removing the residual noise and extending the bandwidth of the signals, such as speech, and biosignals, including breath, heartbeat, etc., captured with an in-ear microphones, for example in an intra-aural device, in noisy environments.
[0010] According to an aspect of the present disclosure there is provided a device and method for enhancing speech generated from bone and tissue conduction captured using an in-ear microphone using adaptive filtering and a non-linear bandwidth extension process.
[0011] According to an aspect of the present disclosure there is provided a method for detecting speech of a user of an intra-aural device in a noisy environment, the intra-aural device having an in-ear microphone adapted to be in fluid communication with the ear canal of the user and an external microphone, dubbed outer-ear mic, adapted to be In fluid communication with the environment outside the ear, the method comprising the steps of:
[0012] acquiring a signal provided by the outer-ear microphone; [0013] applying an adaptive filter on the in-ear microphone signal, using the outer-ear microphone signal as a reference for the ambient noise; the adaptive filter being initialized by an estimated transfer function of the intra- aural device between the outer-ear microphone signal and the In-ear microphone signal; the adaptive filter being represented as a vector of filter weights over a series of time indexes; [0014] upon the computation of an increase in the filter weights of two consequent time indexes greater than a triggering threshold, detecting speech produced by the user.
[0015] According to another aspect of the present disclosure there Is provided a method for enhancing speech generated from bone and tissue conduction of a user of an in-ear device in a noisy environment, the intra-aural device having an in-ear microphone adapted to be in fluid communication with the ear canal of the user and an outer-ear microphone adapted to be In fluid communication with the environment outside the ear, the method comprising the steps of:
[0016] executing the method for detecting speech produced by the user of an intra-aural device in a noisy environment;
[0017] interrupting the application of the adaptive filter to the in-ear microphone signal upon detecting speech by the user;
[0018] providing the filtered and denoised signal.
[0019] According to a further aspect of the present disclosure there is provided a method as described above, further comprising the step of:
[0020] extending the bandwidth of the filtered and denoised signal in the high frequencies using a non-linear bandwidth extension process previous to providing the filtered signal to an interlocutor.
[0021] There is also provided a device for enhancing speech generated from bone and tissue conduction of a user of an intra-aural device in a noisy environment, the device comprising: [0022] an intra-aural device adapted to be located into the ear canal of the user, the intra-aural device having an in-ear microphone adapted to be in fluid communication with the ear canal of the user and an outer ear microphone adapted to be in fluid communication with the environment outside the ear, and
[0023] a processing unit operatively connected to the in-ear microphone to receive an internal signal therefrom, to the outer-ear microphone to receive an external signal therefrom and to send a resulting signal to an interlocutor, the processing unit being configured so as to: [0024] execute the method for enhancing speech generated from bone and tissue conduction of a user of an intra-aural device in a noisy environment.
[0025] There is also provided a device and method for picking-up, with the in-ear microphone of an intra-aural device occluding the ear canal of the user, the physiological noises that are present in the occluded ear canal and to further filter and denoise these biosignals for monitoring applications.
[0026] Other objects and advantages of the present disclosure will become apparent from a careful reading of the detailed description provided herein, with appropriate reference to the accompanying drawings. BRIEF DESCRIPTION OF THE DRAWINGS
[0027] Embodiments of the disclosure will be described by way of examples only with reference to the accompanying Figures, in which:
[0028] Figure 1 Is a perspective view of an intra-aural device for improving the quality of in-ear microphone signals in noisy environments in accordance with an illustrative embodiment of the present disclosure; [0029] Figure 2 is a cross-section of the intra-aural device of Figure 1 ;
[0030] Figure 3 is a schematic architecture diagram representation of the intra- aural device of Figure 1;
[0031] Figure 4A is a schematic representation of two users communicating (only one way presented) in a noisy environment using intra-aural the device of Figure 1 ;
[0032] Figure 4B is a block diagram representing the interconnections between in- ear microphones, outer-ear microphones and internal speakers of the intra-aural device used by the two users communicating in Figure 4A;
[0033] Figures 5A and 5B are block diagrams representing the normalized least mean squared (NLMS) adaptive filtering stage when the adaptation is ON (Figure 5A) and when it is OFF (Figure 5B);
[0034] Figure 6 is a schematic plot diagram of an example of a test signal for the in-ear microphone (IEM) to optimize speech detection criteria;
[0035] Figure 7 is a flow diagram of the adaptation prooess for the adaptive filter use to denoise the in-ear microphone (IEM) signals;
[0036] Figure 8 is a schematic plot diagram of an example of the linear predictive coding (LPC) spectral envelope of the phoneme / 1 / recorded with the reference microphone (REF), the outer-ear microphone (OEM) and the in-ear microphone (IEM) simultaneously; [0037] Figure 9 is a block diagram of the bandwidth extension process;
[0038] Figures 10A, 10B and 10C are block diagrams of the various input-outputs of the intra-aural device of Figure 1 , including the filtered, denoised and enhanced in-ear speech signal (Figure 10A), the filtered and denoised biosignals (Figure 10B) and the Voice Activity Detector (VAD) output state (Figure 10C); [0039] Figure 11 is a schematic representation of an example of use of the intra- aural device of Figure 1 as a portable application for biosignal monitoring; and
[0040] Figure 12 is a block diagram of an intra-aural system for communicating in noisy environments in accordance with another Illustrative embodiment of the present disclosure.
[0041] Similar references used in different Figures denote similar components. DETAILED DESCRIPTION
[0042] Generally stated, the non-limitative illustrative embodiments of the present disclosure provide a device and method for improving the quality of in-ear microphone signals, such as speech, and biosignals, including breath, heartbeat, etc., in noisy environments. It is to be understood that although the present disclosure relates mainly to a device and method for Improving the quality of in-ear microphone speech signals, the technique disclosed can improve the quality of any of the aforementioned signals via adaptive filtering and bandwidth extension. [0043] Bone and tissue conducted speech has been used to provide a relatively high Signal-to-Noise Ratio (SNR) in noisy environments. However, the limited bandwidth of bone and tissue conducted speech degrades the quality of the speech signal. In very noisy conditions, bandwidth of the bone and tissue conducted speech becomes problematic. The disclosed device and method use an adaptive filtering approach to denoise the bone and tissue conducted speech signal and, once the signal is denoised, extended its bandwidth by creating odd harmonics in order to recreate the high frequency harmonics,
[0044] More specifically, this is performed, in real time, using an in-ear and an outer-ear microphones, the in-ear microphone picks up speech generated from bone and tissue conduction and generates a speech signal to which an adaptive filter is applied in order to denoise using the signal from the outer-ear microphone. A voice activity detection criteria using the filter coefficients of the adaptive filter is used to ensure that only noise is reduced while the speech content of the speech signal from the in- ear microphone remains unaffected. Once the speech signal is denoised, its bandwidth is extended by exploiting the nonlinear characteristics of a cubic operator. [0045] The bandwidth extension of the denoised In-ear microphone speech signal signifioantly enhances Its quality. For noisy environments, for example a factory, the described method provides a simple, speaker independent, non- computationally exhaustive method to enhance the quality of speech picked up using an in-ear microphone. Overall, gains of 123 (out of 4.5) in Perceptual Objective Listening Quality Assessment (POLQA) Objective Listening Quality - Mean Opinion Score (MOS-LQO) scores and 45 (out of 100) In Multiple Stimuli with Hidden Reference and Anchor (MUSHRA) scores have been observed, which show the benefits of the proposed speech enhancement solution.
[0046] Referring to Figure 1 , the Intra-aural device for improving the quality of In- ear microphone signals in noisy enviroriments 10, in accordance with an illustrative embodiment of the present disclosure, takes the form of an intra-aural unit 12 generally conforming to the ear canal of a user, which may be inflatable, compressible, custom molded, etc., for passive attenuation of ambient noise and a communication link 14, for example a wireless or Bluetooth communication link. Referring now to Figures 2 and 3, the intra-aural device 10 generally includes an in- ear microphone (IEM) 16, a miniature loudspeaker 18, a receiver 20, an outer-ear microphone (OEM) 22 located flush on the outer face of the intra-aural unit 12, transmitter 24, all of which, along with the wireless communication link 14, are operatively connected to a digital signal processing (DSP) unit 26 having an associated memory comprising instructions stored thereon that, when executed on the processor of the DSP unit 26, perform the steps of the various processes which will be further described below. It is to be understood that In alternative embodiments some or all of the receiver 20, transmitter 24 and DSP 26 may be located outside the intra-aural unit 12, for example in an external unit worn by the user of the Intra-aural device 10. [0047] Referring to Figures 4A and 4B, there is shown two users 1 , 2 wearing intra- aural devices 10-1 and 10-2, respectively, communicating in a noisy environment 30 having a variety of noise sources 32. In the illustrated scenario, user 1 Is the speaker and user 2 the listener. When user 1 speaks, the IEM 16 of device 10-1 picks up speech generated from bone and tissue conduction of user 1 and generates a speech signal to which, using the DSP unit 26, the adaptive filter is applied in order to denoise the speech signal using the signal from the OEM 22. The voice activity detection criteria, which uses the filter coefficients of the adaptive filter, ensures that only the noise 32 is reduced while the speech content of the speech signal from the IEM 16 remains unaffected. Onca the speech signal is denoised, its bandwidth is extended by exploiting the nonlinear characteristics of a cubic operator. The resulting improved speech signal 34 is then transmitted 36, using the transmitter 24, from device 10-1 to device 10-2 via wireless communication link 14, which provides, when received by receiver 20, the improved speech signal 34 to the loudspeaker 18 of intra-aural device 10-2 and hence user 2. It is to be noted that all of the described steps are performed in real time,
[0040] is to be understood that in an alternative embodiment the improved speech signal 34 maybe transmitted to another device, for example a smart phone or other such device. [0049] The presence of the OEM 22 and IEM 16 allows the determination of the relationship between the sound outside the ear and inside the ear, i.e. the transfer function of the intra-aural device 10. This provides insight about the "in-ear" noise and enables denoising through adaptive filtering. Once the IEM 16 speech signal is denoised, bandwidth extension can than be performed to further Improve quality. Intra-Aural Device Transfer Function Identification
[0050] The Intra-aural device 10 transfer function is estimated, as it varies from user to user. This is accomplished by exposing a worn device for Improving the quality of in-ear microphone speech in noisy environments 10 to white noise at 85 dB (SPL) using a loudspeaker outside the ear for at least 2 seconds. The OEM 22 and IEM 16 simultaneously capture the signals outside and inside the ear respectively and the transfer function of the intra-aural device 12, estimated as
Figure imgf000013_0001
In-Ear Microphone Noise Reduction
[0051] Once the noise level is high enough that the OEM 22 speech signal is almost completely masked (i.e. SNR < -5 dB), the IEM 16 speech signal can be denoised using normalized least mean squared (NLMS) adaptive filtering. The choice of adaptive filtering comes from a need to create an algorithm that assumes no properties about the noise and is, thus, robust to various types of noise. Therefore, using adaptive filtering is beneficial for the user by enhancing the received communication signal.
[0052] To properly denoise the IEM 16 speech signal produced by the user without affecting the speech content, the adaptation process must be frozen (OFF) when the user is speaking and active (ON) when the user is not speaking. This ensures that the adaptive filter cancels only the noise and does not interfere with any speech produced by the user. The two states of the adaptive filter are shown in Figures 5A and 5B. When the adaptation is ON (Figure 5A) the structure of the proposed adaptive filter follows the well-known structure commonly described in the literature [19]; the only exception being that the signal of interest is the error signal, e(n). Here, H(z) is the true transfer function of the intra-aural device 12 while is the estimated intra-aural device 12 transfer function. When the adaptation is ON (Figure 5A), the user is not speaking, The OEM 22 captures the noise outside the ear, n0(n), while the IEM 16 captures the residual noise inside the ear nr(n), colored by H(z), The signal captured by the IEM 16 is defined as the desired signal, rf(n). The input, x(n), to the adaptive filter is the signal captured by the OEM 22 filtered with the adaptive filter which is initialized by the estimated transfer function of the intra-aural device 12 /?(z). The output of the adaptive filter, y(n), is thus a close estimate of the residual noise inside the ear and the difference between d(ri) and y(n) should approach 0. The adaptive filter of order 160 is defined as follows:
Figure imgf000014_0001
[0053] where n is the current time index, μ is the adaptation step size, w(ri) Is the vector of filter weights at time index n, and e is a very small number to avoid division by zero.
[0054] When the adaptation is OFF (Figure 5B), let sQ(n) and n0(n) be the speech signal produced by the user and noise signal outside the ear, respectively. Therefore, the OEM 22 picks up the sum of these two signals, x(n). Meanwhile, the IEM 16 picks up the residual noise signal after the attenuation of the intra-aural device 12, nr(n), and the residual speech signal sr(n). The speech signal originating from bone and tissue conduction, ^(n), is also picked up by the IEM 16. The sum of all three signals picked up by the IEM 16 is the desired signal d(n). The signal x(n) picked up by the OEM 22 is then filtered using the #f» and the output, x(n), is fed to the input of the NLMS adaptive filter. The output of the adaptive filter, y(n) is then subtracted from d(n). The adaptive filter brings the difference between the residual noise, nt(n), and the estimated residual noise, fi^n) to zero. Since the OEM 22 speech signal is almost entirely masked by the noise, the effect of sr(n) and §r(n) Is negligible. Therefore, the resulting difference between the output of the adaptive filter and the signal captured by the IEM 16 is the speech signal originating from bone and tissue conduction, st(n), with minimal effects of noise.
Adaptation Process [0055] To achieve denolsing without affecting the speech content, the adaptation process is a function of whether or not the user is speaking. To denoise the user's speech, the adaptive filter must only adapt when the user is not speaking. This ensures that the filter is adapting to the intra-aural device 12 transfer function (I.e. H(z)) and thus the noise and only the noise is subtracted from the signal and not any relevant speech information. To guarantee robustness of the speech detection process, voice activity detection inside the ear Is achieved by monitoring the value of the coefficients of the adaptive filter. After completion of the two second identification stages, the vector of filter weights over the entire index of time, w, is used to detect if the user is speaking. To decide what criteria can be used to detect speech inside the ear using filter weights, test signals can be used, for example the first 10 lists of the recorded Harvard phonetically balanced sentences, for both the OEM 22 and the IEM 16, each test signal starting with at least 2 seconds of noise followed by 8 to 10 seconds of speech either by the user or by an external competing speaker. Exterior speech can be added to simulate a case where the user is not speaking but loud enough that some residual speech exists after the passive attenuation of the intra-aural device 12, The residual speech should not trigger the speech activity of the adaptation process. For the IEM 16 signal, the residual speech can be simulated by passing the speech through /?(z). The location of the user's speech and the residual speech is randomized to avoid any trends in the adaptation process. Figure 6 shown an example of a randomly chosen IEM 1 Θ test signal with both user speech and external speech segments.
[0056] Through analysis of the changes in the filter weights for the test signals, it was concluded that the maximum valued filter weight can be chosen as a good triggering criteria. Once the maximum filter weight increases more than a triggering threshold, Tg, from one time index to the other, it is predicted that the user is speaking. Therefore once max (w(n)) _
max (w(fi-l)) ^ 8>
[0057] speech by the user is detected and the adaptation is turned OFF (Figure 5B). Tg Value Selection
[0058] The value for Tg has to be selected such that it is not particular to a specific speaker. This can per performed using recorded conversations, through the IEM 16 and the OEM 22, from different speakers (varying gender, age, etc.), for which is analyzed the effect of using different triggering thresholds. A sweep of the voice activity detection triggering threshold, Tg, is then performed, for example a sweep from Tg = 1.01 to T3 - 1.20 with a step size of 0.01, during the adaptation process, The bandwidth of the denoised signals for the various speakers resulting from the sweep is extended using a bandwidth extension (BWE) process, which will be further detailed below. The quality of these signals is measured before and after the BWE to see the effect of the different values for the triggering criteria, The choice of Ta is then made as the triggering percentage value that produces the optimal objective quality over the various speakers, In the illustrative example, a peek was observed at around Tg = 1.06 - 1.07, suggesting a triggering threshold of Tg— 1.06 to detect speech activity inside the ear.
[0059] The change in filter weights is triggered at the onset of speech but not the end. To ensure that the adaptive process starts back once speech inside the ear is no longer present the overall change in energy, A, at the onset of speech Is also measured and monitored, per sample, i.e. Ae(n). Once triggered by the user's speech, the adaptation is disabled for at least one second and as long as Af is maintained. When the adaptation is OFF the filter weights of the adaptive filter are updated with those from the previous second, w(n - fs). This is to ensure that the filter weights are those from when no speech is produced by the user. Once the change in energy is less than the onset change, Ae(n) < Δ^, the adaptation starts again. The process of monitoring the change in gives a non-ad-hoc way to turn ON the adaptation once the user is no longer speaking.
[0060] The adaptation process is illustrated by the flow diagram in Figure 7. [0061] The adaptive filtering denoises the IEM 16 signal by utilizing the information about the noise captured by the OEM 22. Once the IEM 16 is denoised its quality can be enhanced by extending its bandwidth in the high frequencies using the BWE. [0062] It Is to be understood that the triggering threshold, Τ#, can be set a priori at the time of manufacturing or, In an alternative embodiment, may be set using a calibration process such that Tg is specific to the user and/or the intra-aural device 12.
Bandwidth Extension Process [0063] Artificially extending the bandwidth of a clean bandlimited signal has been very well studied, With reference to Figure 8, since the IEM 16 signal shares mutual information with the reference speech signal, i.e. picked up using a reference microphone (REF) placed In front of the mouth, between 0-2 kHz [16], it is only necessary to extend the bandwidth in the high frequency range, 2-4 kHz. As described by [17], a simple yet effective way of extending the bandwidth is through the application of the signal's nonlinear characteristics. Figure 9 shows a block diagram of the bandwidth extension process. First, the signal is upsampied by a factor of 2 to provoke spectral folding. To reach an excitation signal similar to that extracted from a wideband speech signal, the upsampied signal is filtered by a whitening filter using the coefficients of a linear predictive coding (LPC) analysis [20], The whitening filter is a finite infinite response filter whose coefficients are those of an 18th order LPC filter at that time frame. Cubing the excitation reproduces the odd harmonics along the entire bandwidth including the high band, in this scenario from 1.8 kHz to 4 kHz. Since the high frequencies are the only region of Interest and to eliminate any overlap, the excitation signal is high passed at 1.8 kHz with a third order filter. Meanwhile, the upsampied IEM 16 signal is low passed at 1.8 kHz with a third order filter because it contains no relevant frequency information above 1 .8 kHz (see Figure 8). The high pass filter used for the excitation signal and the low pass filter used for the upsampied IEM 16 signal are designed to be power complementary for perfect reconstruction. The sum of the two filtered signals is then band passed with a fourth order Linkwitz-Riley filter at 160 Hz and 3.5 kHz by cascading a second order low pass Butterworth filter and a second order high pass Butterworth filter. This is done to eliminate the boomy effect coming from the bone and tissue conduction as well as any ringing caused by the odd harmonics of the cubed excitation signal. The overall output is then downaampled by a factor of 2 to go back to an 8 kHz sampling frequency. It is important to note that this bandwidth extension technique adds missing harmonics In the high frequencies. However, missing formants and frlcatlon noise are not recovered.
[0064] Although the adaptation process was described in the context of denoising a user's speech signal, the determination of whether or not the user is speaking may also be used In an alternative embodiment in order to Interrupt another process when the user is speaking, for example the recording of some biological process (i.e. heart rate, respiration, etc.), In a further alternative embodiment, the adaptation process may be adapted to detect sounds inside a device or space enclosed in a noisy environment, i.e. the in-ear microphone takes the form of an in-devlce/space microphone and the outer-ear microphone takes the form of an outer-device/space microphone. [00Θ5] Referring to Figures 10A to 10C, there are shown various alternative use of the intra-aural device 10, which include the providing of a filtered, denoised and enhanced (using enhancer 28) in-ear speech signal (Figure 10A), filtered and denoised biosignals (Figure 10B) and a Voice Activity Detector (VAD) output state, i.e. a signal indicating the presence or not of voice activity of the user, which can be used for automatic activation of a personal communication system, such as voice activation, voice operated switch or Voice Operated Exchange (VOX) in a two-way radiocommunication device (Figure 10C).
[0066] Referring now to Figure 11 , there is shown an example of use of the intra- aural device 10 as a portable application for biosignal monitoring. The intra-aural device 10 transmits biosignals of a user via wireless communication link 14, using transmitter 24, to a smart phone 40 on which runs a biosignal monitoring application 42 that can analyze and/or display the biosignals. The application 42 may also warn the user of some specific condition if detected. [0067] Referring to Figure 12, there is shown an intra-aural system for communicating in noisy environments 10' in accordance with another illustrative embodiment of the present disclosure, which takes the form of a pair of Intra-aural units 12' and a main unit 13. Each intra-aural unit 12\ includes an in-ear microphone 1 Θ, an outer-ear microphone 22 and a pair of miniature loudspeakers 16a, 16b. The receiver 20, transmitter 24 and processing unit 26 are externally located inside a main unit 13 operatively connected to each of the intra-aural units 12', The main unit 13 includes audio interfaces 15 for communication with the intra- aural units 12', a power manager 17 and battery 19 for providing power to the components of the intra-aural 12' and main 13 units, a processing unit 26 in the form of a central processing unit with associated memory (RAM, FLASH memory), a receiver 22 and transmitter 24 In the form of blue tooth and short range radio modems for providing a communication link 14 to remote components, an inertia! measurement unit 21 , a USB port 1 1 for accessing the associated memory and configuring the central processing unit 26, and a plurality of buttons and LEDs 23 for providing various functionalities to a user of the intra-aural communication system 10',
[0068] Although the present disclosure has been described with a certain degree of particularity and by way of illustrative embodiments and examples thereof, it is to be understood that the present disclosure is not limited to the features of the embodiments described and illustrated herein, but includes all variations and modifications within the scope and spirit of the disclosure as hereinafter claimed. LIST OF REFERENCES
[1] Gan, W. and Kuo, S. "Integrated active noise control communication headsets." Proceedings of International Symposium on Circuits and Systems., 4:IV-353- IV-356 (2003). [2] Casali, J. and Berger, E. "Technology advancements in hearing protection circa 1995: Active noise reduction, frequency/amplitude-sensitivity, and uniform attenuation." American Industrial Hygiene Association, 57(2):175-185 (1996).
[3] Bou Serhal, R., Falk, T., and Voix, J. "Integration of a distance sensitive wireless communication protocol to hearing protectors equipped with in-ear microphones." In Proceedings of Meetings on Acoustics, volume 19, 040013. Acoustical Society of America (2013).
[4] Kondo, K., Fujita, T„ and Nakagawa, K. "On equalization of bone conducted speech for improved speech quality." Sixth IEEE International Symposium on Signal Processing and Information Technology, ISSPIT, 426-431 (2007). [5] Zheng, Y., Liu, Z., Zhang, Z., Sinclair, M., Droppo, J., Deng, L, Acero, A, and Huang, X. "Air- and bone-conductive integrated microphones for robust speech detection and enhancement," 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. NO.03EX721 ), 3-8 (2003).
[6] Turan, T. and Erzin, E. "Enhancement of throat microphone recordings by learning phone-dependent mappings of speech spectra." In IEEE International Conference on Acoustics, Speech and Signal Processing, 7049-7053. IEEE (2013).
[7] OSHA, U. S. Occupational Noise Exposure: Hearing Conservation Amendment, Final Rule. Federal Register (19Θ3). [Θ] Berger, E. The Noise Manual. AIHA (2003). [9] NIOSH. "Advanced Hearing Protector Study." Technical report, Flint, Ml (2005).
[10] McBride, M., Tran, P., Letowski, T, and Patrick, R. "The effect of bone conduction microphone locations on speech intelligibility and sound quality." Applied ergonomics, 42(3):495-502 (2011).
[11] Shin, H. S., Kang, H., and Fingscheidt, T. "Survey of speech enhancement supported by a bone conduction microphone." In Speech Communication; 10. ITG Symposium; Proceedings of , 1-4, VDE (2012).
[12] Li, M., Cohen, I., and Mousazadeh, S. "Multisonsory speech enhancement jn noisy environments using bone-conducted and air-conducted microphones." In Signal and Information Processing (ChinaSIP), 2014 IEEE China Summit & International Conference on, 1-5. IEEE (2014).
[13] Dekene, T. and Verhelst, W. "Body Conducted Speech Enhancement by Equalization and Signal Fusion," (2013). [14] Rahman, M. and Shimamura, T. "Intelligibility enhancement of bone conducted speech by an analysis-synthesis method." 2011 IEEE 54th International Midwest Symposium on Circuits and Systems (MWSCAS), 1-4 (201 1).
[15] Bernier, A. and Voix, J. "An active hearing protection device for musicians." In Proceedings of Meetings on Acoustics, volume 19, 040015. Acoustical Society of America (2013).
[16] Bouserhal, R., Falk, T., and Voix, J. "On the potential for Artificial Bandwidth Extension of Bone and Tissue Conducted Speech: A Mutual Information Study." In International Conference on Acoustics, Speech, and Signal Processing, 2015., volume 1 , 665-668. IEEE (2015). [17] Iser, B. and Schmidt, G. "Bandwidth extension of telephony speech." I n Speech and Audio Processing in Adverse Environments, chapter 5, 135-164 (2008).
[18] Seltzer, M„ Acero, A., and Droppo, J. "Robust Bandwidth Extension of Noisecorrupted Narrowband Speech." Interspeech 2005 , 1509-1512 (2005).
[19] Manolakis, D., Ingle, V., and Kogon, S. "Statistical and adaptive signal processing: spectral estimation, signal modeling, adaptive filtering, and array processing", volume 46. Artech House Norwood (2005).
[20] Valin, J.-M. and Lefebvre, R. "Bandwidth extension of narrowband speech for low bit-rate wideband coding", in Speech Coding, 2000. Proceedings, 2000 IEEE Workshop on, pages 130-132, IEEE. Delavan, Wisconsin, USA.

Claims

CLAIMS We claim:
1. A method for enhancing speech generated from bone and tissue conduction of a user of an intra-aural device in a noisy environment, the intra-aural device having an in-ear microphone adapted to be in fluid communication with an outer ear canal of the user and an outer-ear microphone adapted to be in fluid communication with an environment outside the ear, the method comprising the steps of:
acquiring a signal from the in-ear microphone;
acquiring a signal from the outer-ear microphone;
applying an adaptive filter to the acquired in-ear microphone signal to produce a denoised signal, the adaptive filter:
being initialized by an estimated transfer function of the intra-aural device based on the outer-ear microphone signal and the in-ear microphone signal;
having an adaptation process continuously adjusting the estimated transfer function using the acquired in-ear microphone signal and outer ear microphone signal;
detecting speech from the user;
interrupting application of the adaptation process upon detecting speech by the user;
restarting the application of the adaptation process once speech is no longer detected;
providing the denoised signal.
2. The method of claim 1 , wherein the step of interrupting application of the adaptation process includes updating filter weights of the adaptive filter to values previous to the detection of speech by the user.
3. The method of claim 1 , wherein the step of detecting speech from the user includes the sub-steps of:
computing filter weights of the adaptive filter;
upon detecting an increase in the filter weights for two consequent time indexes greater than a triggering threshold, providing an indication of detection of speech by the user.
4. The method of claim 3, wherein the triggering threshold is between 1 and 20 percent.
5. The method of claim 3, wherein the triggering threshold is between 6 and 7 percent.
6. The method of any of claims 1 to 5, wherein the estimated transfer function of the intra-aural device is estimated, while the user is wearing the intra-aural device, by:
generating white noise outside the ear of the user for at least two seconds; simultaneously acquiring the in-ear microphone signal and the outer-ear microphone signal;
computing the estimated transfer function of the intra-aural device based on the simultaneously acquired in-ear microphone signal and outer-ear microphone signal.
7. The method of claim 6, wherein the white noise is at least 85 dB.
8. The method of any of claims 1 to 7, wherein the adaptive filter is a normalized least mean square adaptive filter.
9. The method of any of claims 1 to 8, further comprising the step of:
extending the bandwidth of the denoised signal in high frequencies using a nonlinear bandwidth extension process previous to providing the denoised signal.
10. The method of claim 9, wherein the bandwidth is extended in the range from 1.8kHz to 4kHz.
11. The method of claim 9, wherein extending the bandwidth of the denoised signal includes the sub-steps of:
upsampling the denoised signal by a factor of two;
applying a whitening filter to the upsampled denoised signal using linear predictive coding coefficients;
cubing the filtered upsampled denoised signal;
applying a high pass third order filter to the cubed filtered upsampled denoised signal;
applying a low pass third order filter to the upsampled denoised signal;
summing the high passed signal and the low passed signal;
applying a band pass fourth order filter to the summed signals;
downsampling the band passed signal by a factor of two.
12. The method of claim 10, wherein high pass and low pass third order filters are at 1.8kHz.
13. The method of either of claims 10 or 11 , wherein the band pass fourth order filter is a Linkwitz-Riley filter at 160Hz and 3.5kHz.
14. A device for enhancing speech generated from bone and tissue conduction of a user in a noisy environment, the device comprising:
an intra-aural unit adapted to be positioned into an ear of the user, the intra- aural unit having an in-ear microphone adapted to be in fluid communication with an outer ear canal of the ear and an outer ear microphone adapted to be in fluid communication with an environment outside the ear;
a transmitter;
a processing unit operatively connected to the in-ear microphone to receive an internal signal therefrom, to the outer-ear microphone to receive an external signal therefrom and to the transmitter, the processing unit having an associated memory comprising instructions stored thereon, that when executed on the processor perform the steps of:
acquiring the internal signal from the in-ear microphone;
acquiring the external signal from the outer-ear microphone; applying an adaptive filter to the acquired in-ear microphone signal to produce a denoised signal, the adaptive filter:
being initialized by an estimated transfer function of the intra- aural device based on the outer-ear microphone signal and the in-ear microphone signal;
having an adaptation process continuously adjusting the estimated transfer function using the acquired in-ear microphone signal and outer ear microphone signal;
detecting speech from the user;
interrupting application of the adaptation process upon detecting speech by the user;
restarting the application of the adaptation process once speech is no longer detected; and
providing the denoised signal via the transmitter.
15. The device of claim 14, wherein the intra-aural unit is inflatable, compressible or custom molded to the ear of the user.
16. The device of either of claims 14 or 15, wherein at least one of the transmitter and the processing unit is located inside the intra-aural unit.
17. The device of any of claims 14 to 16, further comprising a receiver and wherein the intra-aural unit further includes a loudspeaker.
18. The device of claim 17, wherein the receiver is located inside the intra-aural unit.
19. The device of claim 14, wherein when the processor performs the step of interrupting application of the adaptation process, the processor further performs the sub-steps of updating filter weights of the adaptive filter to values previous to the detection of speech by the user.
20. The device of any of claims 14 to 19, wherein when the processor performs the step of detecting speech from the user, the processor further performs the sub- steps of:
computing filter weights of the adaptive filter;
upon detecting an increase in the filter weights for two consequent time indexes greater than a triggering threshold, providing an indication of detection of speech by the user.
21. The device of any of claims 14 to 20, wherein the triggering threshold is between 1 and 20 percent.
22. The device of any of claims 14 to 20, wherein the triggering threshold is between 6 and 7 percent.
23. The device of any of claims 14 to 22, wherein the estimated transfer function of the intra-aural device is estimated, while the user is wearing the intra- aural device, by:
generating white noise outside the ear of the user for at least two seconds; simultaneously acquiring the in-ear microphone signal and the outer-ear microphone signal;
computing the estimated transfer function of the intra-aural device based on the simultaneously acquired in-ear microphone signal and outer-ear microphone signal.
24. The device of claim 23, wherein the white noise is at least 85 dB.
25. The device of any of claims 14 to 24, wherein the adaptive filter is a normalized least mean square adaptive filter.
26. The device of any of claims 14 to 25, wherein the processor further performs the steps of:
extending the bandwidth of the denoised signal in high frequencies using a nonlinear bandwidth extension process previous to providing the denoised signal.
27. The device of claim 26, wherein the bandwidth is extended in the range from 1.8kHz to 4kHz.
28. The device of claim 26, wherein when the processor performs the step of extending the bandwidth of the denoised signal, the processor further performs the sub-steps of:
upsampling the denoised signal by a factor of two;
applying a whitening filter to the upsampled denoised signal using linear predictive coding coefficients;
cubing the filtered upsampled denoised signal;
applying a high pass third order filter to the cubed filtered upsampled denoised signal;
applying a low pass third order filter to the upsampled denoised signal;
summing the high passed signal and the low passed signal;
applying a band pass fourth order filter to the summed signals;
downsampling the band passed signal by a factor of two.
29. The device of claim 28, wherein high pass and low pass third order filters are at 1.8kHz.
30. The device of either of claims 28 or 29, wherein the band pass fourth order filter is a Linkwitz-Riley filter at 160Hz and 3.5kHz.
31. A method for detecting speech of a user of an intra-aural device in a noisy environment, the intra-aural device having an in-ear microphone adapted to be in fluid communication with an outer-ear ear canal of the user and an outer-ear microphone adapted to be in fluid communication with an environment outside the ear, the method comprising the steps of:
acquiring a signal from the in-ear microphone;
acquiring a signal from the outer-ear microphone;
applying an adaptive filter to the acquired in-ear microphone signal, the adaptive filter being initialized by an estimated transfer function of the intra- aural device based on the outer-ear microphone signal and the in-ear microphone signal;
computing filter weights of the adaptive filter;
upon detecting an increase in the filter weights for two consequent time indexes greater than a triggering threshold, providing an indication of detection of speech by the user.
32. The method of claim 31 , wherein the adaptive filter is a normalized least mean square adaptive filter.
33. The method of either of claims 31 or 32, wherein the triggering threshold is between 1 and 20 percent.
34. The method of either of claims 31 or 32, wherein the triggering threshold is between 6 and 7 percent.
35. The method of any of claims 31 to 34, wherein the estimated transfer function of the intra-aural device is estimated, while the user is wearing the intra- aural device, by:
generating white noise outside the ear of the user for at least two seconds; simultaneously acquiring the in-ear microphone signal and the outer-ear microphone signal;
computing the estimated transfer function of the intra-aural device based on the simultaneously acquired in-ear microphone signal and outer-ear microphone signal.
36. A device for detecting speech of a user of an intra-aural device in a noisy environment,, the device comprising:
an intra-aural unit adapted to be positioned into an ear of the user, the intra- aural unit having an in-ear microphone adapted to be in fluid communication with an outer ear canal of the ear and an outer ear microphone adapted to be in fluid communication with an environment outside the ear;
a transmitter;
a processing unit operatively connected to the in-ear microphone to receive an internal signal therefrom, to the outer-ear microphone to receive an external signal therefrom and to the transmitter, the processing unit having an associated memory comprising instructions stored thereon, that when executed on the processor perform the steps of:
acquiring a signal from the in-ear microphone;
acquiring a signal from the outer-ear microphone;
applying an adaptive filter to the acquired in-ear microphone signal, the adaptive filter being initialized by an estimated transfer function of the intra- aural device based on the outer-ear microphone signal and the in-ear microphone signal;
computing filter weights of the adaptive filter;
upon detecting an increase in the filter weights for two consequent time indexes greater than a triggering threshold, providing an indication of detection of speech by the user via the transmitter.
37. The device of claim 36, wherein the adaptive filter is a normalized least mean square adaptive filter.
38. The device of either of claims 36 or 37, wherein the triggering threshold is between 1 and 20 percent.
39. The device of either of claims 36 or 37, wherein the triggering threshold is between 6 and 7 percent.
40. The device of any of claims 36 to 39, wherein the estimated transfer function of the intra-aural device is estimated, while the user is wearing the intra- aural device, by:
generating white noise outside the ear of the user for at least two seconds; simultaneously acquiring the in-ear microphone signal and the outer-ear microphone signal;
computing the estimated transfer function of the intra-aural device based on the simultaneously acquired in-ear microphone signal and outer-ear microphone signal.
PCT/CA2017/000115 2016-05-06 2017-05-10 Device and method for improving the quality of in- ear microphone signals in noisy environments WO2017190219A1 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
EP17792310.9A EP3453189B1 (en) 2016-05-06 2017-05-10 Device and method for improving the quality of in- ear microphone signals in noisy environments
DK17792310.9T DK3453189T3 (en) 2016-05-06 2017-05-10 DEVICE AND PROCEDURE FOR IMPROVING THE QUALITY OF IN-EAR MICROPHONE SIGNALS IN NOISING ENVIRONMENTS
US16/099,274 US10783904B2 (en) 2016-05-06 2017-05-10 Device and method for improving the quality of in-ear microphone signals in noisy environments
PL17792310T PL3453189T3 (en) 2016-05-06 2017-05-10 Device and method for improving the quality of in- ear microphone signals in noisy environments
CA3074050A CA3074050A1 (en) 2016-05-06 2017-05-10 Device and method for improving the quality of in-ear microphone signals in noisy environments

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201662332861P 2016-05-06 2016-05-06
US62/332,861 2016-05-06
US201762460682P 2017-02-17 2017-02-17
US62/460,682 2017-02-17

Publications (1)

Publication Number Publication Date
WO2017190219A1 true WO2017190219A1 (en) 2017-11-09

Family

ID=60202532

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CA2017/000115 WO2017190219A1 (en) 2016-05-06 2017-05-10 Device and method for improving the quality of in- ear microphone signals in noisy environments

Country Status (5)

Country Link
US (1) US10783904B2 (en)
EP (1) EP3453189B1 (en)
DK (1) DK3453189T3 (en)
PL (1) PL3453189T3 (en)
WO (1) WO2017190219A1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108520754A (en) * 2018-04-09 2018-09-11 广东思派康电子科技有限公司 A kind of noise reduction meeting machine
EP3696814A1 (en) * 2019-02-15 2020-08-19 Shenzhen Goodix Technology Co., Ltd. Speech enhancement method and apparatus, device and storage medium
CN112005557A (en) * 2018-02-08 2020-11-27 脸谱科技有限责任公司 Listening device for mitigating variations between ambient and internal sounds caused by a listening device blocking the ear canal of a user
GB2593435A (en) * 2020-02-11 2021-09-29 Breatheox Ltd Respiratory monitoring device
WO2022041167A1 (en) * 2020-08-29 2022-03-03 深圳市韶音科技有限公司 Method and system for obtaining vibration transfer function
EP4070310A4 (en) * 2019-12-03 2023-12-06 EERS Global Technologies Inc. User voice detector device and method using in-ear microphone signal of occluded ear

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10043535B2 (en) * 2013-01-15 2018-08-07 Staton Techiya, Llc Method and device for spectral expansion for an audio signal
EP3453189B1 (en) * 2016-05-06 2021-04-14 Eers Global Technologies Inc. Device and method for improving the quality of in- ear microphone signals in noisy environments
EP3646615A4 (en) * 2017-06-26 2021-04-21 Ecole de Technologie Supérieure System, device and method for assessing a fit quality of an earpiece
US10595151B1 (en) * 2019-03-18 2020-03-17 Cirrus Logic, Inc. Compensation of own voice occlusion
US11044961B1 (en) * 2019-10-09 2021-06-29 Jessel Craig Safety helmet
CN110619886B (en) * 2019-10-11 2022-03-22 北京工商大学 End-to-end voice enhancement method for low-resource Tujia language
US20220230659A1 (en) * 2021-01-15 2022-07-21 Facebook Technologies, Llc System for non-verbal hands-free user input
CN114420140B (en) * 2022-03-30 2022-06-21 北京百瑞互联技术有限公司 Frequency band expansion method, encoding and decoding method and system based on generation countermeasure network
US11978468B2 (en) * 2022-04-06 2024-05-07 Analog Devices International Unlimited Company Audio signal processing method and system for noise mitigation of a voice signal measured by a bone conduction sensor, a feedback sensor and a feedforward sensor
WO2023197203A1 (en) * 2022-04-13 2023-10-19 Harman International Industries, Incorporated Method and system for reconstructing speech signals

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070160243A1 (en) 2005-12-23 2007-07-12 Phonak Ag System and method for separation of a user's voice from ambient sound
EP2555189A1 (en) 2010-11-25 2013-02-06 Goertek Inc. Method and device for speech enhancement, and communication headphones with noise reduction
US8675884B2 (en) * 2008-05-22 2014-03-18 DSP Group Method and a system for processing signals
US8682010B2 (en) * 2009-12-17 2014-03-25 Nxp B.V. Automatic environmental acoustics identification
EP2843915A1 (en) 2013-05-22 2015-03-04 Goertek Inc. Headset communication method under loud-noise environment and headset

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5251263A (en) 1992-05-22 1993-10-05 Andrea Electronics Corporation Adaptive noise cancellation and speech enhancement system and apparatus therefor
US7590538B2 (en) * 1999-08-31 2009-09-15 Accenture Llp Voice recognition system for navigating on the internet
US6275806B1 (en) * 1999-08-31 2001-08-14 Andersen Consulting, Llp System method and article of manufacture for detecting emotion in voice signals by utilizing statistics for voice signal parameters
CN1282156C (en) 2001-11-23 2006-10-25 皇家飞利浦电子股份有限公司 Audio signal bandwidth extension
WO2006033104A1 (en) 2004-09-22 2006-03-30 Shalon Ventures Research, Llc Systems and methods for monitoring and modifying behavior
KR100708121B1 (en) 2005-01-22 2007-04-16 삼성전자주식회사 Method and apparatus for bandwidth extension of speech
US8526645B2 (en) 2007-05-04 2013-09-03 Personics Holdings Inc. Method and device for in ear canal echo suppression
EP2356826A4 (en) * 2008-11-10 2014-01-29 Bone Tone Comm Ltd An earpiece and a method for playing a stereo and a mono signal
US9219964B2 (en) 2009-04-01 2015-12-22 Starkey Laboratories, Inc. Hearing assistance system with own voice detection
US8477973B2 (en) 2009-04-01 2013-07-02 Starkey Laboratories, Inc. Hearing assistance system with own voice detection
US20110293109A1 (en) 2010-05-27 2011-12-01 Sony Ericsson Mobile Communications Ab Hands-Free Unit with Noise Tolerant Audio Sensor
US9053697B2 (en) 2010-06-01 2015-06-09 Qualcomm Incorporated Systems, methods, devices, apparatus, and computer program products for audio equalization
US9037458B2 (en) 2011-02-23 2015-05-19 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for spatially selective audio augmentation
KR102003520B1 (en) 2012-09-21 2019-07-24 삼성전자주식회사 Signal processing apparatus and method thereof
US20150170633A1 (en) * 2013-12-17 2015-06-18 Kabushiki Kaisha Toshiba Bone-conduction noise cancelling headphones
US10515152B2 (en) * 2015-08-28 2019-12-24 Freedom Solutions Group, Llc Mitigation of conflicts between content matchers in automated document analysis
EP3453189B1 (en) * 2016-05-06 2021-04-14 Eers Global Technologies Inc. Device and method for improving the quality of in- ear microphone signals in noisy environments

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070160243A1 (en) 2005-12-23 2007-07-12 Phonak Ag System and method for separation of a user's voice from ambient sound
US8675884B2 (en) * 2008-05-22 2014-03-18 DSP Group Method and a system for processing signals
US8682010B2 (en) * 2009-12-17 2014-03-25 Nxp B.V. Automatic environmental acoustics identification
EP2555189A1 (en) 2010-11-25 2013-02-06 Goertek Inc. Method and device for speech enhancement, and communication headphones with noise reduction
EP2843915A1 (en) 2013-05-22 2015-03-04 Goertek Inc. Headset communication method under loud-noise environment and headset

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3453189A4

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112005557A (en) * 2018-02-08 2020-11-27 脸谱科技有限责任公司 Listening device for mitigating variations between ambient and internal sounds caused by a listening device blocking the ear canal of a user
CN112005557B (en) * 2018-02-08 2022-02-25 脸谱科技有限责任公司 Listening device for mitigating variations between ambient and internal sounds caused by a listening device blocking the ear canal of a user
CN108520754A (en) * 2018-04-09 2018-09-11 广东思派康电子科技有限公司 A kind of noise reduction meeting machine
EP3696814A1 (en) * 2019-02-15 2020-08-19 Shenzhen Goodix Technology Co., Ltd. Speech enhancement method and apparatus, device and storage medium
US11056130B2 (en) 2019-02-15 2021-07-06 Shenzhen GOODIX Technology Co., Ltd. Speech enhancement method and apparatus, device and storage medium
EP4070310A4 (en) * 2019-12-03 2023-12-06 EERS Global Technologies Inc. User voice detector device and method using in-ear microphone signal of occluded ear
GB2593435A (en) * 2020-02-11 2021-09-29 Breatheox Ltd Respiratory monitoring device
WO2022041167A1 (en) * 2020-08-29 2022-03-03 深圳市韶音科技有限公司 Method and system for obtaining vibration transfer function
JP7426512B2 (en) 2020-08-29 2024-02-01 シェンツェン・ショックス・カンパニー・リミテッド Method and system for obtaining vibration transfer function

Also Published As

Publication number Publication date
US20190214038A1 (en) 2019-07-11
US10783904B2 (en) 2020-09-22
EP3453189A4 (en) 2019-05-29
DK3453189T3 (en) 2021-07-26
EP3453189B1 (en) 2021-04-14
PL3453189T3 (en) 2021-11-02
EP3453189A1 (en) 2019-03-13

Similar Documents

Publication Publication Date Title
EP3453189B1 (en) Device and method for improving the quality of in- ear microphone signals in noisy environments
US8606572B2 (en) Noise cancellation device for communications in high noise environments
US8675884B2 (en) Method and a system for processing signals
US20180359564A1 (en) Method And Device For Voice Operated Control
US9418675B2 (en) Wearable communication system with noise cancellation
US8611560B2 (en) Method and device for voice operated control
JP5635182B2 (en) Speech enhancement method, apparatus and noise reduction communication headphones
US8781137B1 (en) Wind noise detection and suppression
KR20110107833A (en) Acoustic in-ear detection for earpiece
Bouserhal et al. In-ear microphone speech quality enhancement via adaptive filtering and artificial bandwidth extension
EP3213527B1 (en) Self-voice occlusion mitigation in headsets
CN111935584A (en) Wind noise processing method and device for wireless earphone assembly and earphone
US11551704B2 (en) Method and device for spectral expansion for an audio signal
Shankar et al. Influence of MVDR beamformer on a speech enhancement based smartphone application for hearing aids
CN109788420A (en) Hearing protection system and correlation technique with own voices estimation
CN112055278A (en) Deep learning noise reduction method and device integrating in-ear microphone and out-of-ear microphone
CN115866474A (en) Transparent transmission noise reduction control method and system of wireless earphone and wireless earphone
US11671767B2 (en) Hearing aid comprising a feedback control system
CA3074050A1 (en) Device and method for improving the quality of in-ear microphone signals in noisy environments
Bouserhal et al. Improving the quality of in-ear microphone speech via adaptive filtering and artificial bandwidth extension
Bispo et al. A cepstral method to estimate the stable optimal solution for feedforward occlusion cancellation in hearing aids
Brodersen et al. Signal enhancement for communication systems used by fire fighters
US20230012052A1 (en) User voice detector device and method using in-ear microphone signal of occluded ear
US11615801B1 (en) System and method of enhancing intelligibility of audio playback
AU2022360083A1 (en) Joint far-end and near-end speech intelligibility enhancement

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17792310

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2017792310

Country of ref document: EP

Effective date: 20181206

ENP Entry into the national phase

Ref document number: 3074050

Country of ref document: CA