EP4266705A1 - Audio processing method for a wearable auto device - Google Patents

Audio processing method for a wearable auto device Download PDF

Info

Publication number
EP4266705A1
EP4266705A1 EP22386020.6A EP22386020A EP4266705A1 EP 4266705 A1 EP4266705 A1 EP 4266705A1 EP 22386020 A EP22386020 A EP 22386020A EP 4266705 A1 EP4266705 A1 EP 4266705A1
Authority
EP
European Patent Office
Prior art keywords
signal
accelerometer
user
microphone
audio device
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP22386020.6A
Other languages
German (de)
French (fr)
Inventor
Johannes LIJZENGA
Ilias Athanasios ANTONOPOULOS
Aernout Steven Ferdinand ARENDS
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Absolute Audio Labs BV
Original Assignee
Absolute Audio Labs BV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Absolute Audio Labs BV filed Critical Absolute Audio Labs BV
Priority to EP22386020.6A priority Critical patent/EP4266705A1/en
Publication of EP4266705A1 publication Critical patent/EP4266705A1/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R25/00Deaf-aid sets, i.e. electro-acoustic or electro-mechanical hearing aids; Electric tinnitus maskers providing an auditory perception
    • H04R25/45Prevention of acoustic reaction, i.e. acoustic oscillatory feedback
    • H04R25/453Prevention of acoustic reaction, i.e. acoustic oscillatory feedback electronically
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • G10K11/1781Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase characterised by the analysis of input or output signals, e.g. frequency range, modes, transfer functions
    • G10K11/17813Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase characterised by the analysis of input or output signals, e.g. frequency range, modes, transfer functions characterised by the analysis of the acoustic paths, e.g. estimating, calibrating or testing of transfer functions or cross-terms
    • G10K11/17819Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase characterised by the analysis of input or output signals, e.g. frequency range, modes, transfer functions characterised by the analysis of the acoustic paths, e.g. estimating, calibrating or testing of transfer functions or cross-terms between the output signals and the reference signals, e.g. to prevent howling
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • G10K11/1783Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase handling or detecting of non-standard events or conditions, e.g. changing operating modes under specific operating conditions
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • G10K11/1787General system configurations
    • G10K11/17873General system configurations using a reference signal without an error signal, e.g. pure feedforward
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1016Earpieces of the intra-aural type
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1083Reduction of ambient noise
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1091Details not provided for in groups H04R1/1008 - H04R1/1083
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R25/00Deaf-aid sets, i.e. electro-acoustic or electro-mechanical hearing aids; Electric tinnitus maskers providing an auditory perception
    • H04R25/50Customised settings for obtaining desired overall acoustical characteristics
    • H04R25/505Customised settings for obtaining desired overall acoustical characteristics using digital signal processing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/005Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K2210/00Details of active noise control [ANC] covered by G10K11/178 but not provided for in any of its subgroups
    • G10K2210/10Applications
    • G10K2210/108Communication systems, e.g. where useful sound is kept and noise is cancelled
    • G10K2210/1081Earphones, e.g. for telephones, ear protectors or headsets
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K2210/00Details of active noise control [ANC] covered by G10K11/178 but not provided for in any of its subgroups
    • G10K2210/10Applications
    • G10K2210/129Vibration, e.g. instead of, or in addition to, acoustic noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K2210/00Details of active noise control [ANC] covered by G10K11/178 but not provided for in any of its subgroups
    • G10K2210/30Means
    • G10K2210/301Computational
    • G10K2210/3025Determination of spectrum characteristics, e.g. FFT
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K2210/00Details of active noise control [ANC] covered by G10K11/178 but not provided for in any of its subgroups
    • G10K2210/50Miscellaneous
    • G10K2210/501Acceleration, e.g. for accelerometers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2460/00Details of hearing devices, i.e. of ear- or headphones covered by H04R1/10 or H04R5/033 but not provided for in any of their subgroups, or of hearing aids covered by H04R25/00 but not provided for in any of its subgroups
    • H04R2460/05Electronic compensation of the occlusion effect
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2460/00Details of hearing devices, i.e. of ear- or headphones covered by H04R1/10 or H04R5/033 but not provided for in any of their subgroups, or of hearing aids covered by H04R25/00 but not provided for in any of its subgroups
    • H04R2460/13Hearing devices using bone conduction transducers

Definitions

  • the invention relates to a wearable audio device, such as a hearing aid or earphone having an in-ear component, e.g. an earbud, for being inserted into the ear canal of a user and an audio signal processing method for such wearable audio device.
  • a wearable audio device such as a hearing aid or earphone having an in-ear component, e.g. an earbud, for being inserted into the ear canal of a user and an audio signal processing method for such wearable audio device.
  • Wearable audio devices such as hearing aids or earphones, can be used for providing an input audio signal to the wearer.
  • Such input audio signal can be obtained from a microphone of the wearable audio device, or e.g. transmitted to the wearable audio device, e.g. wiredly or wirelessly.
  • the wearable audio device can also be used for providing an output audio signal, e.g. for telephone conversation.
  • Hearing aids are developed to alleviate the effects of hearing loss in individuals.
  • Modern hearing aids include a microphone for recording audio signals from the environment, and dedicated processing means for enhancing the audio signals so as to output an audio signal that is audible for the user.
  • a variety of processing algorithms are employed in modern hearing aids, for example for noise reduction, audio compression, etc.
  • Tissue conduction voice pickup is achieved by sensing the vibrations that the voice transmits through the user's body, predominantly through the skull, to the housing or other mechanical part of a wearable audio device (such as a hearing aid or earphone having an in-ear component, e.g. an earbud, for being inserted into the ear canal of a user) by an accelerometer of the wearable audio device.
  • the tissue conduction can include bone conduction and/or soft tissue conduction.
  • the term accelerometer is used to refer to any inertial sensor configured to detect vibration of the user's vocal chords, mouth or throat, based on vibrations in bones and tissue of the user's head.
  • the accelerometer can e.g. be configured for determining acceleration along one or more, e.g.
  • the accelerometer can e.g. be a three-axis accelerometer.
  • the wearable audio device typically also contains an additional input for obtaining an input audio signal.
  • the input can e.g. be one or more microphones of the wearable audio device and/or a receiver for wiredly or wirelessly receiving an external input audio signal from an audio signal source, such as a mobile communications device.
  • the wearable audio device also includes an output speaker for outputting sound representative of an input audio signal.
  • a speaker signal can be provided to the speaker to be output by the speaker to the user.
  • the speaker signal can be based on the input audio signal.
  • the speaker may also transmit vibrations via mechanical parts of the wearable audio device, such as the housing, to the accelerometer.
  • the accelerometer signal recorded by the accelerometer, may not only include a tissue-conducted speech signal component originating from the user's own voice, but also a crosstalk signal component originating from the output speaker. This unwanted crosstalk component corrupts the accelerometer signal and can trouble the voice detection using the accelerometer signal.
  • speech and speech signal are used interchangeably with voice and voice signal, respectively, and include speech, singing, humming and other sounds created using the user's mouth and/or throat.
  • a method of reducing crosstalk in tissue-conducted voice pickup in a wearable audio device having at least an accelerometer for voice pickup and an output speaker.
  • the wearable audio device may for example be a hearing aid, or earphone having an in-ear component, e.g. an earbud, for being inserted into the ear canal of a user.
  • the method comprises determining a characteristic crosstalk transfer function for audio signals being transferred from the output speaker to the accelerometer; determining, on the basis of a speaker signal to be output by the speaker and the crosstalk transfer function, an estimated crosstalk signal to be sensed by the accelerometer.
  • the method comprises obtaining from the accelerometer an accelerometer signal representative of voice pickup, and determining a modified accelerometer signal by subtracting the estimated cross-talk signal from the accelerometer signal.
  • the accelerometer signal, recorded by the accelerometer may for example include a crosstalk signal component and a tissue-conducted speech signal component, wherein an estimation of the tissue-conducted speech signal component may be estimated by subtracting the estimated crosstalk signal component from the recorded accelerometer signal.
  • the crosstalk transfer function represents an acoustic transfer characteristic of vibrations propagating from the output speaker to the accelerometer, particularly propagating through hardware components, such as a housing, of the wearable audio device.
  • the accelerometer signal can be modified using the crosstalk transfer function to suppress a crosstalk signal originating from the output speaker being picked up by the accelerometer.
  • the accelerometer can be filtered, e.g. in the time-domain or the frequency domain, using the cross-talk transfer function.
  • the crosstalk transfer function may for example be represented as an impulse response function or a frequency response function.
  • the crosstalk transfer function may be determined a priori, i.e. before the wearable audio device is being used by a user and/or while in use by the user.
  • a predetermined common crosstalk transfer function is used for plurality of wearable audio devices, such as hearing aids or earphones, wherein the common crosstalk transfer function is used for cancelling a crosstalk component from the accelerometer signal in each wearable audio device.
  • an average crosstalk transfer function is determined over a plurality of wearable audio devices, wherein the average crosstalk transfer function is used for cancelling a crosstalk component from the accelerometer signal in each wearable audio device.
  • the average crosstalk transfer function may for example be adaptively updated while in use by a user.
  • the transfer function is device-specific and e.g. determined by outputting, with the output speaker of the wearable audio device, a predefined excitation signal, and recording a response thereof with the accelerometer.
  • the excitation signal may for example be pulse, such as a short click-sound, or chirp signal outputted by the output audio device.
  • the acoustic transfer characteristic may vary between wearable audio devices, and hence be device-specific, and may be determined for an accurate estimation of the crosstalk transfer function for a specific wearable audio device.
  • the acoustic transfer characteristic of the device may further vary depending on the user wearing the audio device, as the device and the user interact.
  • the transfer function is optionally determined while the wearable audio device is worn by a user.
  • the crosstalk transfer function may accordingly also be user specific.
  • the excitation signal includes a maximum length sequence signal.
  • the determining of the crosstalk transfer function can be particularly computationally efficient when using a maximum length sequence signal, i.e. a pseudo-random binary sequence, as the excitation signal.
  • the crosstalk transfer function e.g. an impulse response function, can for example be extracted by a computationally efficient deconvolution using a Hadamard transform of the accelerometer signal after outputting the maximum length sequence signal with the output speaker.
  • Occlusion of the ear canal by wearable audio devices in particular hearing aids or earphones having an in-ear component, e.g. earbuds or hearing aid receivers, for being inserted into the ear canal of a user, may however be beneficial for several reasons.
  • various audio signal processing methods are often more effective when the ear canal is occluded by the in-ear component, for example because the occlusion prevents acoustic bypass. Users may however experience such occlusion as uncomfortable, i.a. because their own voice is perceived differently compared to when the ear canal is open. The inventors realized that this phenomenon is two-fold.
  • a first issue is that the occlusion blocks transmission of sounds from the surroundings of the user to the inner ear of the user. This first issue can be alleviated by recording surrounding sounds with one or more microphones and actively outputting the recorded sounds towards the inner ear of the user; effectively creating a "transparency mode" with the wearable audio device.
  • a second issue is, however, that the occlusion prevents tissue-conducted audio signals to exit through the ear canal, as they otherwise would in absence of the occlusion.
  • a method for actively reducing occlusion perception of a wearable audio device comprises obtaining an accelerometer signal with an accelerometer of the wearable audio device indicative of a tissue-conducted speech signal being generated by a user's own voice.
  • the method comprises inverting the accelerometer signal and presenting the inverted accelerometer signal at a speaker of the wearable audio device to be outputted to the user.
  • the method can comprise obtaining an input audio signal.
  • the input audio signal can be obtained with one or more microphones of the wearable audio device and/or wiredly or wirelessly received from an audio signal source.
  • the method can comprise determining a modified input audio signal by adding the inverted accelerometer signal to the input audio signal.
  • the modified input audio signal may be presented at the speaker of the wearable audio device and outputted to the user.
  • the inventors realized that the uncomfortable sense of occlusion can be reduced by presenting the inverted tissue-conducted speech signal, optionally together with the input audio signal, to be outputted by the wearable audio device towards the user.
  • an anti-sound can be created cancelling the acoustic signal that comes from bone conduction in the ear canal.
  • the user's sound perception is as if the tissue conducted speech signal exits the user's ear.
  • the tissue-conducted speech signal is transmitted internally, through a body of the user, for example substantially by bone e.g. the skull of the user.
  • the accelerometer may be arranged to sense vibrations of the user's tissue, and transduce these vibrations into an electronic signal.
  • the accelerometer may for instance be arranged to sense vibrations of a mechanical part of the hearing aid, such as a housing, wherein said part is configured to be positioned in contact with the user's body, e.g. to a wall of the ear canal or skull.
  • tissue-conducted audio signals may have different characteristics, such as frequency spectrum, compared to air-conducted audio signals in which sound waves propagate through the air.
  • the method comprises presenting a filtered inverted accelerometer signal at the speaker. It is possible to filter the accelerometer signal before or after inverting.
  • the filtering can e.g. include low pass filtering, e.g. for frequencies below 1 kHz, preferably below 500 Hz, more preferably between 50 and 400Hz. Low pass filtering may reduce the effect of phase differences between the tissue conducted speech signal and the inverted accelerometer signal presented by the speaker.
  • other forms of filtering are contemplated.
  • the method comprises determining a magnitude spectrum of the tissue-conducted speech signal and subtracting said determined magnitude spectrum from a magnitude spectrum of the input audio signal. It has particularly been found that the frequency spectrum of the tissue-conducted speech signal equals, or at least is similar to, the frequency spectrum of an acoustic occlusion audio signal that would otherwise, i.e. with an open ear canal, be transmitted outward through the open ear canal.
  • the method comprises obtaining from the accelerometer an accelerometer signal representative of voice pickup and modifying the obtained accelerometer signal by subtracting an estimated crosstalk signal from the accelerometer signal.
  • This step may particularly be executed according to a method of reducing crosstalk in tissue-conducted voice pickup as described herein in the first aspect.
  • the estimated cross talk signal to be subtracted from the accelerometer signal may be determined based on the speaker signal to be output by the speaker.
  • the crosstalk signal originates from an output speaker of the wearable audio device and is conducted through the audio device's hardware components to the accelerometer.
  • the accelerometer may sense vibrations that are originating from an output speaker via transmission through the wearable audio device, e.g. through the housing. This crosstalk signal may be considered as undesirable noise, and may therefore be cancelled from the accelerometer signal.
  • a method is provided of adjusting a relative contribution to an output audio signal of an accelerometer signal recorded with an accelerometer of a wearable audio device and a microphone signal recorded by one or more microphones of the wearable audio device.
  • the wearable audio device can be a hearing aid or earphone having an in-ear component, e.g. an earbud, for being inserted into the ear canal of a user.
  • the method comprises detecting whether or not a speech signal of a user wearing the audio device is present in the microphone signal and/or the accelerometer signal, determining a noise condition of the microphone signal when no speech signal is detected, and adjusting a contribution of the accelerometer signal relative to the microphone signal to the output audio signal based on the determined noise condition.
  • the output audio signal may be selected to be a combination of the microphone signal and the accelerometer signal according to an adjustable contribution ratio.
  • the audio signal may for example be selected to correspond to either the microphone signal or the accelerometer signal, or a combination of the microphone signal and the accelerometer signal.
  • the microphone signal and/or the accelerometer signal may have undergone processing, and that the output audio signal may accordingly correspond to, e.g. a processed microphone signal and/or a processed accelerometer signal.
  • the method comprises adjusting the contribution to the output audio signal of the microphone signal relative to the accelerometer signal based on a noise condition of the microphone signal.
  • Microphones may generally provide better overall sound quality compared to accelerometers, but are also prone to being noisy. Speech intelligibility may therefore be enhanced in certain situations by reducing the contribution of the microphone signal and increasing the contribution of accelerometer signal in the output audio signal, for example when the microphone signal is very noisy. For example, with many, and/or high volume, background noises speech intelligibility may be improved by increasing the accelerometer signal contribution relative to the microphone signal in the output audio signal.
  • the method comprises detecting whether or not a speech signal is present in the microphone signal and/or the accelerometer signal, and determining the noise condition of the microphone signal, e.g. only, when no speech signal is detected.
  • background noise may ideally be cancelled.
  • An accurate determination of the background noise can be established in absence of a speech signal.
  • a presence of a speech signal may be detected using various speech recognition methods.
  • the recorded signal in absence of speech is indicative of a background noise. This background noise may for example be filtered from the audio signal.
  • the method comprises detecting whether or not a speech signal originating from a user's own voice is present in the accelerator signal, and preventing an adjustment of the relative contributions to the output audio signal of the microphone signal and the accelerometer signal in case a presence of a speech signal originating from the user's own voice is detected. It may be unwanted to adjust signal contributions to the output audio signal, while the user is talking. Hence, an adjustment setting may be frozen, for as long as the user is talking.
  • the method comprises allowing adjustment of the relative contributions to the output audio signal of the microphone signal and the accelerometer signal only in case no speech signal originating from a user's own voice is detected for a predefined time period.
  • the adjustment setting may be changed if required.
  • the predefined time period may for example be at least an average pronunciation length of a phoneme, e.g. at least 0.15 seconds, preferably at least 0.25 seconds.
  • a presence of a speech signal originating from the user's own voice in the accelerometer signal is ascertained, in case a loudness level of the accelerator signal exceeds a predefined loudness threshold level.
  • the accelerometer signal may be filtered and/or cleaned up for increased accuracy of own voice detection.
  • the accelerometer may be particular sensitive to vibrations caused by a user's own speech, which may be transmitted through the user's body such as the skull. For example, a current magnitude of the accelerator signal exceeding a predefined magnitude threshold, may indicate that the user is currently talking. Conversely, a current magnitude of the accelerator signal being below the predefined magnitude threshold, may indicate that the user is not currently talking.
  • the method includes subtracting an estimated crosstalk signal from the accelerometer signal prior to determining whether the accelerometer signal includes a speech signal originating from the user's own voice.
  • This step may particularly be executed according to a method of reducing crosstalk in tissue-conducted voice pickup as described herein in the first aspect.
  • the estimated cross talk signal to be subtracted from the accelerometer signal may be determined based on the speaker signal to be output by the speaker as described herein in the second aspect.
  • the method includes increasing the relative contribution of the accelerometer signal, relative to the microphone signal, to the output audio signal, in case the noise condition of the microphone signal exceeds a predefined noise threshold.
  • the microphone signal is very noisy, it may be desirable to increase the accelerometer signal relative to the microphone signal.
  • such adjustment may be inhibited, and optionally executed after the user has stopped talking.
  • the method includes increasing the relative contribution of the accelerometer signal, relative to the microphone signal, to the output audio signal, in case a loudness level of the microphone signal exceeds a predefined loudness threshold level. Loud sounds may be very uncomfortably perceived by the user, and furthermore microphone saturation is likely to occur. If sounds above the predefined loudness threshold level are detected, the microphone signal may be partially or entirely suppressed, or the microphone may be partly or entirely muted. This adjustment may be given precedence over other conditionals, and may for instance be executed regardless of the user talking.
  • the method comprises eliminating the contribution of the microphone signal to the output audio signal entirely in case a loudness level of the microphone signal exceeds a predefined loudness threshold level.
  • the microphone signal may for example be completely suppressed or the microphone may be completely muted, e.g. by switching off the microphone.
  • the output audio signal may for example be selected to correspond to the accelerator signal in case the determined loudness level of the microphone signal exceeds the predetermined threshold level.
  • a wearable audio device such as a hearing aid or earphone having an in-ear component, e.g. an earbud, for being inserted into the ear canal of a user, is provided configured for executing a method as described herein.
  • the wearable audio device may for example comprise one or more microphones arranged for recording the microphone signal, an accelerometer arranged for recording the accelerometer signal, and a processing unit configured for receiving the microphone signal and the accelerometer signal and determining the output audio signal based on the microphone signal and the accelerometer signal.
  • the processing unit includes an active occlusion cancellation module for cancelling a user-perceived occlusion of its hearing canal.
  • the processing unit comprises a crosstalk cancellation module for canceling a crosstalk signal from the accelerometer signal.
  • the processing unit comprises a dynamic mixing module for receiving the microphone signal and the accelerometer signal, and outputting an audio signal being either one of the microphone signal and the accelerometer signal, or a mix of the microphone signal and accelerometer signal according to an adaptable contribution ratio.
  • a wearable audio device can be provided having at least an accelerometer for voice pickup and an output speaker.
  • the wearable audio device can include a crosstalk cancellation module configured for determining a characteristic crosstalk transfer function for audio signals being transferred from the output speaker to the accelerometer; determining, on the basis of a speaker signal to be output by the speaker and the cross-talk transfer function, an estimated cross-talk signal to be sensed by the accelerometer; obtaining from the accelerometer an accelerometer signal representative of voice pickup; and determining a modified accelerometer signal by subtracting the estimated crosstalk signal from the accelerometer signal.
  • a wearable audio device can be provided having an accelerometer and a speaker.
  • the wearable audio device can include an occlusion cancellation module configured for obtaining an accelerometer signal with an accelerometer of the wearable audio device indicative of a tissue-conducted speech signal being generated by a user's own voice; inverting the accelerometer signal; and presenting the inverted accelerometer signal at a speaker of the wearable audio device to be outputted to the user.
  • the occlusion cancellation module is configured for obtaining an input audio signal; determining a modified input audio signal by adding the inverted accelerometer signal to the input audio signal; and presenting the modified input audio signal at the speaker of the wearable audio device to be outputted to the user.
  • a wearable audio device can be provided having an accelerometer and one or more microphones.
  • the wearable audio device can include a dynamic mixing module configured for detecting whether or not a speech signal is present in the microphone signal and/or the accelerometer signal; determining a noise condition of the microphone signal when no speech signal is detected; and adjusting a contribution of the accelerometer signal relative to the microphone signal to the output audio signal based on the determined noise condition.
  • Figure 1 shows a schematic example of a wearable audio device 100, such as a hearing aid or earphone, e.g. an in-earphone, on-earphone, or over-earphone, such as an earbud.
  • Figure 2 shows an example of a wearable audio device 100, such as a hearing aid, an earphone, e.g. an in-earphone, on-earphone, or over-earphone.
  • Figure 2 particularly shows an in-earphone for being inserted into a hearing canal of a user.
  • the wearable audio device has an in-ear component for being inserted into the ear canal of a user.
  • the wearable audio device 100 is arranged for receiving an input audio signal that is to be communicated to the user.
  • the input audio signal can be retrieved by a first input transducer, such as a microphone 10.
  • the wearable audio device 100 can include more than one microphone, although only one microphone is shown in Figure 1 for clarity.
  • the input audio signal can be wiredly or wirelessly received, e.g. from an audio signal source, such as a mobile communications device, using a receiver 12.
  • the input audio signal can be received via an IEEE 802.15 connection, via Bluetooth, or BTLE.
  • the wearable audio device 100 can be arranged for generating an output audio signal that is to be communicated to a third party, e.g. for telephone communication.
  • the output audio signal can e.g. be transmitted using a transmitter 14.
  • the output audio signal can be representative of speech of the user.
  • the speech of the user can e.g. by retrieved by a second input transducer, such as one or more accelerometers 20.
  • the wearable audio device 100 also comprises one or more output transducers, including an output speaker 30, for outputting incoming sound representative of the input audio signal to the wearer of the audio device.
  • the wearable audio device 100 may particularly include an in-ear component for being inserted into a hearing canal of a user, such as an earbud, wherein the in-ear component may occlude the hearing canal.
  • the output speaker 30 may be arranged at an internal side of the in-ear component for transmitting the incoming sound substantially towards the inner ear of the user, while wearing the audio device.
  • the accelerometer 20 may also be arranged at the in-ear component, and may for example be configured to sense vibrations, e.g. vibrations transmitted through the user's tissue, e.g. via bone conduction and/or soft tissue conduction.
  • the microphone(s) 10 may be arranged for sensing sounds from an environment of the user, and may therefore also be arranged at an outward side of the in-ear component, but also at other positions such as at an optional behind-the-ear component of the wearable audio device.
  • the wearable audio device 100 may include multiple microphones and/or multiple accelerometers, arranged at various positions.
  • the wearable audio device includes a processing unit 16 for processing the input audio signals and/or output audio signals.
  • the wearable audio device 100 e.g. the processing unit 16 may comprise an analog to digital converter 18 configured for receiving an analog audio signal, e.g. from the microphone 10 and/or the accelerometer 20, and converting the analog audio signal to a digital audio signal.
  • the audio signal can be divided into multiple time frame signals.
  • the digital audio signal may be divided into successive time frame signals, either overlapping or non-overlapping, that are, at least partly, shifted in time.
  • Each time frame signal may be represented by a finite number of samples, particularly 2N samples wherein 2N is a power of 2, such as 256, 512 or 1024.
  • the wearable audio device 100 e.g. the processing unit 16, here, comprises an active occlusion cancellation module 40, configured for cancelling a user-perceived occlusion of the hearing canal.
  • Figure 3 shows an exemplary flow chart of a method performed by the occlusion cancellation module 40.
  • the occlusion cancellation module 40 is in this example configured to receive 210 an input audio signal, e.g. from the microphone 10 or receiver 12 and accelerometer 20 and output 260 a modified audio signal for being outputted by the output speaker 30.
  • the accelerometer signal is received via a crosstalk cancelation module 50.
  • the user-perceived occlusion of the hearing canal by the in-ear component may be caused by two mechanisms.
  • the in-ear component blocks external air-conducted audio signals from entering the hearing canal. Passing through external sound signal such as received by the microphone 10 via a transparency mixer 80, e.g. amplified, may substantially alleviate this effect.
  • the in-ear component blocks an escape through the hearing canal of tissue-conducted audio signals generated by the user itself. This second effect may particularly be actively alleviated by the occlusion cancellation module 40.
  • the occlusion cancellation module 40 is particularly configured to obtain an accelerometer signal from the accelerometer 20 (here via crosstalk cancelation module 50) indicative of a tissue-conducted speech signal being generated by a user's own voice.
  • the occlusion cancellation module 40 is hence configured to receive 220 the accelerometer signal.
  • a tissue-conducted speech signal is estimated 230 by the occlusion cancellation module 40.
  • the tissue-conducted speech signal is generated by a user's own voice and conducted through the user's body tissue to the accelerometer.
  • the tissue-conducted speech signal can e.g. be estimated to be proportional to, e.g. equal to, the accelerometer signal.
  • the occlusion cancellation module 40 may invert 240 the accelerometer signal representative of speech and present 260 the inverted tissue-conducted speech signal at the speaker 30. This may be perceived by the user as if the tissue-conducted speech signal has escaped through the hearing canal, as it would in absence of any occlusion.
  • the inverted tissue conducted speech signal may for example be filtered, e.g. in the frequency domain, by an occlusion cancellation filter, which occlusion cancellation filter is determined based on the accelerometer signal.
  • the wearable audio device 100 receives an input audio signal from the microphone(s) 10 and/or the receiver 12, such as via the transparency mixer 80, the inverted accelerometer signal representative of tissue-conducted speech may be added 250 to the input audio signal.
  • the tissue-conducted speech signal is suppressed in the sound provided to the user by the speaker 30, e.g. effectively by subtracting the estimated tissue-conducted speech signal from the input output audio signal.
  • the accelerometer 20 may also record a crosstalk signal originating from the output speaker and being transmitted through hardware components 60 of the wearable audio device 100 back towards the accelerometer 20.
  • the accelerometer signal may therefore be corrupted with a crosstalk signal.
  • it may be desired to remove the crosstalk signal from the accelerometer signal.
  • the wearable audio device 100 e.g. the processing unit 16, may hence, here further, comprise a crosstalk cancellation module 50.
  • Figure 4 shows an exemplary flow chart of a method performed by the crosstalk cancelation module 50.
  • the crosstalk cancellation module 50 is in this example arranged between the accelerometer 20 and the occlusion cancellation module 40.
  • the accelerometer signal is first processed by the crosstalk cancellation module 50 wherein the resultant accelerometer signal is subsequently used by the occlusion cancellation module 40 for modifying the speaker signal to be output by the speaker.
  • the accelerometer signal is first processed by the crosstalk cancellation module 50 wherein the resultant crosstalk-corrected accelerometer signal is subsequently used by the occlusion cancellation module 40 for modifying the speaker signal.
  • the crosstalk cancellation module 50 is particularly configured for determining 310 a device-specific crosstalk transfer function, characterizing a transfer of audio signals, e.g. vibrations, from the output speaker 30 to the accelerometer 20, e.g. via the wearable audio device hardware 60.
  • a crosstalk component is, at least partially, cancelled 320 from the accelerometer signal using the determined device-specific crosstalk transfer function.
  • the crosstalk transfer function of a device may for example be determined by outputting 311 an excitation signal with the output speaker 30, and recording 312 an accelerometer signal with the accelerometer 20.
  • the device-specific crosstalk transfer function may be determined 310 in various ways from the known excitation signal and the observed response as recorded by the accelerometer 20. Various excitation signals may be used.
  • a particularly computationally efficient determination of the device-specific crosstalk transfer function can be obtained by using a maximum length sequence (MLS) signal as the excitation signal.
  • the device-specific crosstalk transfer function can subsequently be used, in various ways, for filtering the accelerometer signal so as to effectively subtract the crosstalk signal from the accelerometer signal.
  • the crosstalk cancellation module 50 may receive a speaker signal, such as from the transparency mixer 80, to be outputted by the output speaker 30, and filter, e.g. with an adaptive filter, the accelerometer signal based on the speaker signal and the device-specific cross-talk transfer function.
  • the crosstalk cancellation module may calculate, using the device-specific crosstalk transfer function and the received speaker signal, the predicted signal component that is predicted to be recorded at the accelerometer in view of the to be outputted audio signal, and subtract said signal component from the actually recorded accelerometer signal.
  • the modified accelerometer signal may be provided to the transmitter 14. Alternatively, or additionally, the modified accelerometer signal may be provided to the occlusion cancellation module 40 as described above.
  • the wearable audio device 100 further comprises a dynamic mixing module 70 configured for receiving the microphone signal and the accelerometer signal, and generating an output audio signal, to be transmitted to a third party, being either one of the microphone signals and the accelerometer signal, or a mix of the microphone signal and accelerometer signal according to an adaptive mixing ratio, particularly to optimize speech intelligibility.
  • the mixing ratio can be adaptively determined based various audio conditions in the microphone signal and/or the accelerometer signal.
  • Figure 5 shows an exemplary flow chart of the dynamic mixing module 70.
  • the mixing module 70 is arranged to detect 410 whether or not a loudness level of the microphone signal exceeds a predefined loudness threshold.
  • the mixing module 70 detects 430 whether or not a speech signal originating from the user's own voice is present, e.g. based on the accelerometer signal. If such speech signal is detected 430 (yes), adjustment of the relative contributions of the accelerometer signal and the microphone signal is prohibited 440. If such speech signal is not detected (no), adjustment of the relative contributions of the accelerometer signal and the microphone signal is allowed 420. Whether or not the user currently speaks may be detected using the accelerometer. It may for example be ascertained that the user is currently speaking if a magnitude of the accelerometer signal exceeds a predefined threshold.
  • the adjustment of the relative contributions of the accelerometer signal and the microphone signal to the output signal may be based on a determined noise level of the microphone signal. For example, a relative contribution of the accelerometer signal to the mixing module output signal may be increased, relative to the microphone signal contribution, when the microphone signal becomes more noisy. Regardless of the noise level, the mixing module may for example adjust the relative contributions in such a way, that the mixing module output signal only contains the accelerometer signal, in case a loudness level of the microphone signal exceeds a predetermined loudness threshold, e.g. regardless of the noise level and regardless of the user talking.
  • the mixing module 70 may be used in conjunction with the crosstalk cancellation module 50. In that case, the mixing module can receive the accelerometer signal from the crosstalk cancellation module 50. In case the mixing module 70 is used without using the crosstalk cancellation module 50, the mixing module can receive the accelerometer signal directly from the accelerometer 20 (or converter 18), as indicated by the dashed arrow in Figure 1 .
  • Figure 1 shows an exemplary wearable audio device 100 comprising a particular combination of the occlusion cancellation module 40, the crosstalk cancelling module 50 and the mixing module 70, but it will be appreciated that the wearable audio device can include any combination of said modules.
  • the wearable audio device 100 may include only one, anyone, of said modules, or only two, any two, of said modules.
  • Some or all of the occlusion cancellation module 40, the crosstalk cancelling module 50 and/or the mixing module 70 may be part of the processing unit 16 of the wearable audio device 100.
  • the wearable audio device can include the occlusion cancellation module 40.
  • the wearable audio device can e.g. include the occlusion cancellation module while not including, or not using, the crosstalk cancellation module.
  • the wearable audio device can include the crosstalk cancelling module 50.
  • the wearable audio device can e.g. include the crosstalk cancellation module while not including, or not using, the occlusion cancellation module.
  • the wearable audio device can include the mixing module 70.
  • the wearable audio device can e.g. include the mixing module while not including, or not using, the occlusion cancellation module and/or the crosstalk cancellation module.
  • the wearable audio device can include the occlusion cancellation module and the crosstalk cancelling module, e.g. while not including the mixing module.
  • any reference signs placed between parentheses shall not be construed as limiting the claim.
  • the word 'comprising' does not exclude the presence of other features or steps than those listed in a claim.
  • the words 'a' and 'an' shall not be construed as limited to 'only one', but instead are used to mean 'at least one', and do not exclude a plurality.
  • the mere fact that certain measures are recited in mutually different claims does not indicate that a combination of these measures cannot be used to an advantage.

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Multimedia (AREA)
  • Neurosurgery (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The disclosure relates to a method of processing an audio signal in a wearable audio device. The method comprises recording a microphone signal with one or more microphones of the audio device, and further recording an accelerometer signal with one or more accelerometers of the audio device indicative of an acceleration of the audio device with respect to one or more axes; and determining an output audio signal based on the microphone signal(s) and the accelerometer signal(s).

Description

    FIELD
  • The invention relates to a wearable audio device, such as a hearing aid or earphone having an in-ear component, e.g. an earbud, for being inserted into the ear canal of a user and an audio signal processing method for such wearable audio device.
  • BACKGROUND
  • Wearable audio devices, such as hearing aids or earphones, can be used for providing an input audio signal to the wearer. Such input audio signal can be obtained from a microphone of the wearable audio device, or e.g. transmitted to the wearable audio device, e.g. wiredly or wirelessly. The wearable audio device can also be used for providing an output audio signal, e.g. for telephone conversation.
  • Hearing aids are developed to alleviate the effects of hearing loss in individuals. Modern hearing aids include a microphone for recording audio signals from the environment, and dedicated processing means for enhancing the audio signals so as to output an audio signal that is audible for the user. A variety of processing algorithms are employed in modern hearing aids, for example for noise reduction, audio compression, etc.
  • SUMMARY
  • It is aim to provide an improved processing method for processing audio signals in a wearable audio device.
  • Tissue conduction voice pickup is achieved by sensing the vibrations that the voice transmits through the user's body, predominantly through the skull, to the housing or other mechanical part of a wearable audio device (such as a hearing aid or earphone having an in-ear component, e.g. an earbud, for being inserted into the ear canal of a user) by an accelerometer of the wearable audio device. The tissue conduction can include bone conduction and/or soft tissue conduction. Herein, the term accelerometer is used to refer to any inertial sensor configured to detect vibration of the user's vocal chords, mouth or throat, based on vibrations in bones and tissue of the user's head. The accelerometer can e.g. be configured for determining acceleration along one or more, e.g. orthogonal, axis. The accelerometer can e.g. be a three-axis accelerometer. The wearable audio device however, typically also contains an additional input for obtaining an input audio signal. The input can e.g. be one or more microphones of the wearable audio device and/or a receiver for wiredly or wirelessly receiving an external input audio signal from an audio signal source, such as a mobile communications device. The wearable audio device also includes an output speaker for outputting sound representative of an input audio signal. A speaker signal can be provided to the speaker to be output by the speaker to the user. The speaker signal can be based on the input audio signal. The speaker, however, may also transmit vibrations via mechanical parts of the wearable audio device, such as the housing, to the accelerometer. This means that the accelerometer signal, recorded by the accelerometer, may not only include a tissue-conducted speech signal component originating from the user's own voice, but also a crosstalk signal component originating from the output speaker. This unwanted crosstalk component corrupts the accelerometer signal and can trouble the voice detection using the accelerometer signal. Herein the terms speech and speech signal are used interchangeably with voice and voice signal, respectively, and include speech, singing, humming and other sounds created using the user's mouth and/or throat.
  • Hereto, according to a first aspect, a method is provided of reducing crosstalk in tissue-conducted voice pickup in a wearable audio device having at least an accelerometer for voice pickup and an output speaker. The wearable audio device may for example be a hearing aid, or earphone having an in-ear component, e.g. an earbud, for being inserted into the ear canal of a user. The method comprises determining a characteristic crosstalk transfer function for audio signals being transferred from the output speaker to the accelerometer; determining, on the basis of a speaker signal to be output by the speaker and the crosstalk transfer function, an estimated crosstalk signal to be sensed by the accelerometer. The method comprises obtaining from the accelerometer an accelerometer signal representative of voice pickup, and determining a modified accelerometer signal by subtracting the estimated cross-talk signal from the accelerometer signal.
  • Hence, the accelerometer signal, recorded by the accelerometer, may for example include a crosstalk signal component and a tissue-conducted speech signal component, wherein an estimation of the tissue-conducted speech signal component may be estimated by subtracting the estimated crosstalk signal component from the recorded accelerometer signal.
  • The crosstalk transfer function represents an acoustic transfer characteristic of vibrations propagating from the output speaker to the accelerometer, particularly propagating through hardware components, such as a housing, of the wearable audio device. The accelerometer signal can be modified using the crosstalk transfer function to suppress a crosstalk signal originating from the output speaker being picked up by the accelerometer. The accelerometer can be filtered, e.g. in the time-domain or the frequency domain, using the cross-talk transfer function. The crosstalk transfer function may for example be represented as an impulse response function or a frequency response function. The crosstalk transfer function may be determined a priori, i.e. before the wearable audio device is being used by a user and/or while in use by the user. Optionally, a predetermined common crosstalk transfer function is used for plurality of wearable audio devices, such as hearing aids or earphones, wherein the common crosstalk transfer function is used for cancelling a crosstalk component from the accelerometer signal in each wearable audio device. Optionally, an average crosstalk transfer function is determined over a plurality of wearable audio devices, wherein the average crosstalk transfer function is used for cancelling a crosstalk component from the accelerometer signal in each wearable audio device. The average crosstalk transfer function may for example be adaptively updated while in use by a user.
  • Optionally, the transfer function is device-specific and e.g. determined by outputting, with the output speaker of the wearable audio device, a predefined excitation signal, and recording a response thereof with the accelerometer. The excitation signal may for example be pulse, such as a short click-sound, or chirp signal outputted by the output audio device. The acoustic transfer characteristic may vary between wearable audio devices, and hence be device-specific, and may be determined for an accurate estimation of the crosstalk transfer function for a specific wearable audio device. Moreover, the acoustic transfer characteristic of the device may further vary depending on the user wearing the audio device, as the device and the user interact. Hence, the transfer function is optionally determined while the wearable audio device is worn by a user. The crosstalk transfer function may accordingly also be user specific.
  • Optionally, the excitation signal includes a maximum length sequence signal. The determining of the crosstalk transfer function can be particularly computationally efficient when using a maximum length sequence signal, i.e. a pseudo-random binary sequence, as the excitation signal. The crosstalk transfer function, e.g. an impulse response function, can for example be extracted by a computationally efficient deconvolution using a Hadamard transform of the accelerometer signal after outputting the maximum length sequence signal with the output speaker.
  • When the wearable audio device occludes the ear canal of the user, this is often experienced as unpleasant. Occlusion of the ear canal by wearable audio devices, in particular hearing aids or earphones having an in-ear component, e.g. earbuds or hearing aid receivers, for being inserted into the ear canal of a user, may however be beneficial for several reasons. For example, various audio signal processing methods are often more effective when the ear canal is occluded by the in-ear component, for example because the occlusion prevents acoustic bypass. Users may however experience such occlusion as uncomfortable, i.a. because their own voice is perceived differently compared to when the ear canal is open. The inventors realized that this phenomenon is two-fold. A first issue is that the occlusion blocks transmission of sounds from the surroundings of the user to the inner ear of the user. This first issue can be alleviated by recording surrounding sounds with one or more microphones and actively outputting the recorded sounds towards the inner ear of the user; effectively creating a "transparency mode" with the wearable audio device. A second issue is, however, that the occlusion prevents tissue-conducted audio signals to exit through the ear canal, as they otherwise would in absence of the occlusion.
  • Hereto, according to a second aspect, a method for actively reducing occlusion perception of a wearable audio device is provided. The method comprises obtaining an accelerometer signal with an accelerometer of the wearable audio device indicative of a tissue-conducted speech signal being generated by a user's own voice. The method comprises inverting the accelerometer signal and presenting the inverted accelerometer signal at a speaker of the wearable audio device to be outputted to the user.
  • The method can comprise obtaining an input audio signal. The input audio signal can be obtained with one or more microphones of the wearable audio device and/or wiredly or wirelessly received from an audio signal source. The method can comprise determining a modified input audio signal by adding the inverted accelerometer signal to the input audio signal. The modified input audio signal may be presented at the speaker of the wearable audio device and outputted to the user.
  • Hence, the inventors realized that the uncomfortable sense of occlusion can be reduced by presenting the inverted tissue-conducted speech signal, optionally together with the input audio signal, to be outputted by the wearable audio device towards the user. Hence, an anti-sound can be created cancelling the acoustic signal that comes from bone conduction in the ear canal. By presenting the inverted tissue-conducted speech signal, or effectively subtracting the tissue-conducted speech signal from the incoming sound, the user's sound perception is as if the tissue conducted speech signal exits the user's ear. The tissue-conducted speech signal is transmitted internally, through a body of the user, for example substantially by bone e.g. the skull of the user. The accelerometer may be arranged to sense vibrations of the user's tissue, and transduce these vibrations into an electronic signal. The accelerometer may for instance be arranged to sense vibrations of a mechanical part of the hearing aid, such as a housing, wherein said part is configured to be positioned in contact with the user's body, e.g. to a wall of the ear canal or skull. It will be appreciated that tissue-conducted audio signals may have different characteristics, such as frequency spectrum, compared to air-conducted audio signals in which sound waves propagate through the air.
  • Optionally, the method comprises presenting a filtered inverted accelerometer signal at the speaker. It is possible to filter the accelerometer signal before or after inverting. The filtering can e.g. include low pass filtering, e.g. for frequencies below 1 kHz, preferably below 500 Hz, more preferably between 50 and 400Hz. Low pass filtering may reduce the effect of phase differences between the tissue conducted speech signal and the inverted accelerometer signal presented by the speaker. However, other forms of filtering are contemplated.
  • Optionally, the method comprises determining a magnitude spectrum of the tissue-conducted speech signal and subtracting said determined magnitude spectrum from a magnitude spectrum of the input audio signal. It has particularly been found that the frequency spectrum of the tissue-conducted speech signal equals, or at least is similar to, the frequency spectrum of an acoustic occlusion audio signal that would otherwise, i.e. with an open ear canal, be transmitted outward through the open ear canal.
  • Optionally, the method comprises obtaining from the accelerometer an accelerometer signal representative of voice pickup and modifying the obtained accelerometer signal by subtracting an estimated crosstalk signal from the accelerometer signal. This step may particularly be executed according to a method of reducing crosstalk in tissue-conducted voice pickup as described herein in the first aspect. The estimated cross talk signal to be subtracted from the accelerometer signal may be determined based on the speaker signal to be output by the speaker. The crosstalk signal originates from an output speaker of the wearable audio device and is conducted through the audio device's hardware components to the accelerometer. The accelerometer may sense vibrations that are originating from an output speaker via transmission through the wearable audio device, e.g. through the housing. This crosstalk signal may be considered as undesirable noise, and may therefore be cancelled from the accelerometer signal.
  • According to a third aspect, a method is provided of adjusting a relative contribution to an output audio signal of an accelerometer signal recorded with an accelerometer of a wearable audio device and a microphone signal recorded by one or more microphones of the wearable audio device. The wearable audio device can be a hearing aid or earphone having an in-ear component, e.g. an earbud, for being inserted into the ear canal of a user. The method comprises detecting whether or not a speech signal of a user wearing the audio device is present in the microphone signal and/or the accelerometer signal, determining a noise condition of the microphone signal when no speech signal is detected, and adjusting a contribution of the accelerometer signal relative to the microphone signal to the output audio signal based on the determined noise condition. For example, the output audio signal may be selected to be a combination of the microphone signal and the accelerometer signal according to an adjustable contribution ratio. The audio signal may for example be selected to correspond to either the microphone signal or the accelerometer signal, or a combination of the microphone signal and the accelerometer signal. It will be appreciated that the microphone signal and/or the accelerometer signal may have undergone processing, and that the output audio signal may accordingly correspond to, e.g. a processed microphone signal and/or a processed accelerometer signal.
  • Optionally, the method comprises adjusting the contribution to the output audio signal of the microphone signal relative to the accelerometer signal based on a noise condition of the microphone signal. Microphones may generally provide better overall sound quality compared to accelerometers, but are also prone to being noisy. Speech intelligibility may therefore be enhanced in certain situations by reducing the contribution of the microphone signal and increasing the contribution of accelerometer signal in the output audio signal, for example when the microphone signal is very noisy. For example, with many, and/or high volume, background noises speech intelligibility may be improved by increasing the accelerometer signal contribution relative to the microphone signal in the output audio signal.
  • Optionally, the method comprises detecting whether or not a speech signal is present in the microphone signal and/or the accelerometer signal, and determining the noise condition of the microphone signal, e.g. only, when no speech signal is detected. For enhancing speech intelligibility, background noise may ideally be cancelled. An accurate determination of the background noise can be established in absence of a speech signal. A presence of a speech signal may be detected using various speech recognition methods. The recorded signal in absence of speech, is indicative of a background noise. This background noise may for example be filtered from the audio signal.
  • Optionally, the method comprises detecting whether or not a speech signal originating from a user's own voice is present in the accelerator signal, and preventing an adjustment of the relative contributions to the output audio signal of the microphone signal and the accelerometer signal in case a presence of a speech signal originating from the user's own voice is detected. It may be unwanted to adjust signal contributions to the output audio signal, while the user is talking. Hence, an adjustment setting may be frozen, for as long as the user is talking.
  • Optionally, the method comprises allowing adjustment of the relative contributions to the output audio signal of the microphone signal and the accelerometer signal only in case no speech signal originating from a user's own voice is detected for a predefined time period. Hence, after elapse of the predefined time period of no speech detection, the adjustment setting may be changed if required. The predefined time period may for example be at least an average pronunciation length of a phoneme, e.g. at least 0.15 seconds, preferably at least 0.25 seconds.
  • Optionally, a presence of a speech signal originating from the user's own voice in the accelerometer signal is ascertained, in case a loudness level of the accelerator signal exceeds a predefined loudness threshold level. The accelerometer signal may be filtered and/or cleaned up for increased accuracy of own voice detection. The accelerometer may be particular sensitive to vibrations caused by a user's own speech, which may be transmitted through the user's body such as the skull. For example, a current magnitude of the accelerator signal exceeding a predefined magnitude threshold, may indicate that the user is currently talking. Conversely, a current magnitude of the accelerator signal being below the predefined magnitude threshold, may indicate that the user is not currently talking.
  • Optionally, the method includes subtracting an estimated crosstalk signal from the accelerometer signal prior to determining whether the accelerometer signal includes a speech signal originating from the user's own voice. This step may particularly be executed according to a method of reducing crosstalk in tissue-conducted voice pickup as described herein in the first aspect. The estimated cross talk signal to be subtracted from the accelerometer signal may be determined based on the speaker signal to be output by the speaker as described herein in the second aspect.
  • Optionally, the method includes increasing the relative contribution of the accelerometer signal, relative to the microphone signal, to the output audio signal, in case the noise condition of the microphone signal exceeds a predefined noise threshold. In case the microphone signal is very noisy, it may be desirable to increase the accelerometer signal relative to the microphone signal. However, if the user is currently speaking, such adjustment may be inhibited, and optionally executed after the user has stopped talking.
  • Optionally, the method includes increasing the relative contribution of the accelerometer signal, relative to the microphone signal, to the output audio signal, in case a loudness level of the microphone signal exceeds a predefined loudness threshold level. Loud sounds may be very uncomfortably perceived by the user, and furthermore microphone saturation is likely to occur. If sounds above the predefined loudness threshold level are detected, the microphone signal may be partially or entirely suppressed, or the microphone may be partly or entirely muted. This adjustment may be given precedence over other conditionals, and may for instance be executed regardless of the user talking.
  • Optionally, the method comprises eliminating the contribution of the microphone signal to the output audio signal entirely in case a loudness level of the microphone signal exceeds a predefined loudness threshold level. The microphone signal may for example be completely suppressed or the microphone may be completely muted, e.g. by switching off the microphone. The output audio signal may for example be selected to correspond to the accelerator signal in case the determined loudness level of the microphone signal exceeds the predetermined threshold level.
  • According to a further aspect, a wearable audio device, such as a hearing aid or earphone having an in-ear component, e.g. an earbud, for being inserted into the ear canal of a user, is provided configured for executing a method as described herein.
  • The wearable audio device may for example comprise one or more microphones arranged for recording the microphone signal, an accelerometer arranged for recording the accelerometer signal, and a processing unit configured for receiving the microphone signal and the accelerometer signal and determining the output audio signal based on the microphone signal and the accelerometer signal. Optionally, the processing unit includes an active occlusion cancellation module for cancelling a user-perceived occlusion of its hearing canal. Optionally, the processing unit comprises a crosstalk cancellation module for canceling a crosstalk signal from the accelerometer signal. Optionally, the processing unit comprises a dynamic mixing module for receiving the microphone signal and the accelerometer signal, and outputting an audio signal being either one of the microphone signal and the accelerometer signal, or a mix of the microphone signal and accelerometer signal according to an adaptable contribution ratio.
  • A wearable audio device can be provided having at least an accelerometer for voice pickup and an output speaker. The wearable audio device can include a crosstalk cancellation module configured for determining a characteristic crosstalk transfer function for audio signals being transferred from the output speaker to the accelerometer; determining, on the basis of a speaker signal to be output by the speaker and the cross-talk transfer function, an estimated cross-talk signal to be sensed by the accelerometer; obtaining from the accelerometer an accelerometer signal representative of voice pickup; and determining a modified accelerometer signal by subtracting the estimated crosstalk signal from the accelerometer signal.
  • A wearable audio device can be provided having an accelerometer and a speaker. The wearable audio device can include an occlusion cancellation module configured for obtaining an accelerometer signal with an accelerometer of the wearable audio device indicative of a tissue-conducted speech signal being generated by a user's own voice; inverting the accelerometer signal; and presenting the inverted accelerometer signal at a speaker of the wearable audio device to be outputted to the user. Optionally, the occlusion cancellation module is configured for obtaining an input audio signal; determining a modified input audio signal by adding the inverted accelerometer signal to the input audio signal; and presenting the modified input audio signal at the speaker of the wearable audio device to be outputted to the user.
  • A wearable audio device can be provided having an accelerometer and one or more microphones. The wearable audio device can include a dynamic mixing module configured for detecting whether or not a speech signal is present in the microphone signal and/or the accelerometer signal; determining a noise condition of the microphone signal when no speech signal is detected; and adjusting a contribution of the accelerometer signal relative to the microphone signal to the output audio signal based on the determined noise condition.
  • It will be appreciated that any of the aspects, features and options described herein can be combined. It will particularly be appreciated that any of the aspects, features and options described in view of the methods apply equally to the wearable hearing device, and vice versa.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • Embodiments of the present invention will now be described in detail with reference to the accompanying drawings in which:
    • Figures 1 and 2 show schematic examples of a wearable audio device;
    • Figures 3-5 show exemplary flow charts.
    DETAILED DESCRIPTION
  • Figure 1 shows a schematic example of a wearable audio device 100, such as a hearing aid or earphone, e.g. an in-earphone, on-earphone, or over-earphone, such as an earbud. Figure 2 shows an example of a wearable audio device 100, such as a hearing aid, an earphone, e.g. an in-earphone, on-earphone, or over-earphone. Figure 2 particularly shows an in-earphone for being inserted into a hearing canal of a user. In this example, the wearable audio device has an in-ear component for being inserted into the ear canal of a user. In this example, the wearable audio device 100 is arranged for receiving an input audio signal that is to be communicated to the user. The input audio signal can be retrieved by a first input transducer, such as a microphone 10. The wearable audio device 100 can include more than one microphone, although only one microphone is shown in Figure 1 for clarity. Alternatively, or additionally, the input audio signal can be wiredly or wirelessly received, e.g. from an audio signal source, such as a mobile communications device, using a receiver 12. In this example, the input audio signal can be received via an IEEE 802.15 connection, via Bluetooth, or BTLE. The wearable audio device 100 can be arranged for generating an output audio signal that is to be communicated to a third party, e.g. for telephone communication. The output audio signal can e.g. be transmitted using a transmitter 14. The output audio signal can be representative of speech of the user. The speech of the user can e.g. by retrieved by a second input transducer, such as one or more accelerometers 20.
  • The wearable audio device 100 also comprises one or more output transducers, including an output speaker 30, for outputting incoming sound representative of the input audio signal to the wearer of the audio device.
  • The wearable audio device 100 may particularly include an in-ear component for being inserted into a hearing canal of a user, such as an earbud, wherein the in-ear component may occlude the hearing canal. The output speaker 30 may be arranged at an internal side of the in-ear component for transmitting the incoming sound substantially towards the inner ear of the user, while wearing the audio device. The accelerometer 20 may also be arranged at the in-ear component, and may for example be configured to sense vibrations, e.g. vibrations transmitted through the user's tissue, e.g. via bone conduction and/or soft tissue conduction. The microphone(s) 10 may be arranged for sensing sounds from an environment of the user, and may therefore also be arranged at an outward side of the in-ear component, but also at other positions such as at an optional behind-the-ear component of the wearable audio device. The wearable audio device 100 may include multiple microphones and/or multiple accelerometers, arranged at various positions.
  • The wearable audio device includes a processing unit 16 for processing the input audio signals and/or output audio signals.
  • The wearable audio device 100, e.g. the processing unit 16 may comprise an analog to digital converter 18 configured for receiving an analog audio signal, e.g. from the microphone 10 and/or the accelerometer 20, and converting the analog audio signal to a digital audio signal. The audio signal can be divided into multiple time frame signals. For example, the digital audio signal may be divided into successive time frame signals, either overlapping or non-overlapping, that are, at least partly, shifted in time. Each time frame signal may be represented by a finite number of samples, particularly 2N samples wherein 2N is a power of 2, such as 256, 512 or 1024.
  • The wearable audio device 100, e.g. the processing unit 16, here, comprises an active occlusion cancellation module 40, configured for cancelling a user-perceived occlusion of the hearing canal. Figure 3 shows an exemplary flow chart of a method performed by the occlusion cancellation module 40. The occlusion cancellation module 40 is in this example configured to receive 210 an input audio signal, e.g. from the microphone 10 or receiver 12 and accelerometer 20 and output 260 a modified audio signal for being outputted by the output speaker 30. As will be explained below, in this example the accelerometer signal is received via a crosstalk cancelation module 50. The user-perceived occlusion of the hearing canal by the in-ear component may be caused by two mechanisms. Firstly, the in-ear component blocks external air-conducted audio signals from entering the hearing canal. Passing through external sound signal such as received by the microphone 10 via a transparency mixer 80, e.g. amplified, may substantially alleviate this effect. Secondly, the in-ear component blocks an escape through the hearing canal of tissue-conducted audio signals generated by the user itself. This second effect may particularly be actively alleviated by the occlusion cancellation module 40.
  • The occlusion cancellation module 40 is particularly configured to obtain an accelerometer signal from the accelerometer 20 (here via crosstalk cancelation module 50) indicative of a tissue-conducted speech signal being generated by a user's own voice. The occlusion cancellation module 40 is hence configured to receive 220 the accelerometer signal. Based on the accelerometer signal, a tissue-conducted speech signal is estimated 230 by the occlusion cancellation module 40. The tissue-conducted speech signal is generated by a user's own voice and conducted through the user's body tissue to the accelerometer. The tissue-conducted speech signal can e.g. be estimated to be proportional to, e.g. equal to, the accelerometer signal. The occlusion cancellation module 40 may invert 240 the accelerometer signal representative of speech and present 260 the inverted tissue-conducted speech signal at the speaker 30. This may be perceived by the user as if the tissue-conducted speech signal has escaped through the hearing canal, as it would in absence of any occlusion. The inverted tissue conducted speech signal may for example be filtered, e.g. in the frequency domain, by an occlusion cancellation filter, which occlusion cancellation filter is determined based on the accelerometer signal.
  • In case the wearable audio device 100 receives an input audio signal from the microphone(s) 10 and/or the receiver 12, such as via the transparency mixer 80, the inverted accelerometer signal representative of tissue-conducted speech may be added 250 to the input audio signal. Hence, a contribution of the tissue-conducted speech signal is suppressed in the sound provided to the user by the speaker 30, e.g. effectively by subtracting the estimated tissue-conducted speech signal from the input output audio signal.
  • In addition to tissue-conducted audio signals, such as originating from the user's own voice, the accelerometer 20 may also record a crosstalk signal originating from the output speaker and being transmitted through hardware components 60 of the wearable audio device 100 back towards the accelerometer 20. The accelerometer signal may therefore be corrupted with a crosstalk signal. For various applications, it may be desired to remove the crosstalk signal from the accelerometer signal. Particularly when performing occlusion cancellation it can be beneficial to also perform crosstalk cancellation. The wearable audio device 100, e.g. the processing unit 16, may hence, here further, comprise a crosstalk cancellation module 50. Figure 4 shows an exemplary flow chart of a method performed by the crosstalk cancelation module 50. The crosstalk cancellation module 50 is in this example arranged between the accelerometer 20 and the occlusion cancellation module 40. Hence, here, the accelerometer signal is first processed by the crosstalk cancellation module 50 wherein the resultant accelerometer signal is subsequently used by the occlusion cancellation module 40 for modifying the speaker signal to be output by the speaker. Hence, here, the accelerometer signal is first processed by the crosstalk cancellation module 50 wherein the resultant crosstalk-corrected accelerometer signal is subsequently used by the occlusion cancellation module 40 for modifying the speaker signal.
  • The crosstalk cancellation module 50 is particularly configured for determining 310 a device-specific crosstalk transfer function, characterizing a transfer of audio signals, e.g. vibrations, from the output speaker 30 to the accelerometer 20, e.g. via the wearable audio device hardware 60. A crosstalk component is, at least partially, cancelled 320 from the accelerometer signal using the determined device-specific crosstalk transfer function. The crosstalk transfer function of a device may for example be determined by outputting 311 an excitation signal with the output speaker 30, and recording 312 an accelerometer signal with the accelerometer 20. The device-specific crosstalk transfer function may be determined 310 in various ways from the known excitation signal and the observed response as recorded by the accelerometer 20. Various excitation signals may be used. A particularly computationally efficient determination of the device-specific crosstalk transfer function can be obtained by using a maximum length sequence (MLS) signal as the excitation signal. The device-specific crosstalk transfer function can subsequently be used, in various ways, for filtering the accelerometer signal so as to effectively subtract the crosstalk signal from the accelerometer signal. The crosstalk cancellation module 50 may receive a speaker signal, such as from the transparency mixer 80, to be outputted by the output speaker 30, and filter, e.g. with an adaptive filter, the accelerometer signal based on the speaker signal and the device-specific cross-talk transfer function. The crosstalk cancellation module may calculate, using the device-specific crosstalk transfer function and the received speaker signal, the predicted signal component that is predicted to be recorded at the accelerometer in view of the to be outputted audio signal, and subtract said signal component from the actually recorded accelerometer signal. The modified accelerometer signal may be provided to the transmitter 14. Alternatively, or additionally, the modified accelerometer signal may be provided to the occlusion cancellation module 40 as described above.
  • In this example, the wearable audio device 100 further comprises a dynamic mixing module 70 configured for receiving the microphone signal and the accelerometer signal, and generating an output audio signal, to be transmitted to a third party, being either one of the microphone signals and the accelerometer signal, or a mix of the microphone signal and accelerometer signal according to an adaptive mixing ratio, particularly to optimize speech intelligibility. The mixing ratio can be adaptively determined based various audio conditions in the microphone signal and/or the accelerometer signal. Figure 5 shows an exemplary flow chart of the dynamic mixing module 70. The mixing module 70 is arranged to detect 410 whether or not a loudness level of the microphone signal exceeds a predefined loudness threshold. If the microphone signal loudness exceeds the predefined loudness threshold (yes), adjustment of the relative contributions of the accelerometer signal and the microphone signal is allowed 420. If the microphone signal loudness does not exceed the predefined loudness threshold (no), the mixing module 70 detects 430 whether or not a speech signal originating from the user's own voice is present, e.g. based on the accelerometer signal. If such speech signal is detected 430 (yes), adjustment of the relative contributions of the accelerometer signal and the microphone signal is prohibited 440. If such speech signal is not detected (no), adjustment of the relative contributions of the accelerometer signal and the microphone signal is allowed 420. Whether or not the user currently speaks may be detected using the accelerometer. It may for example be ascertained that the user is currently speaking if a magnitude of the accelerometer signal exceeds a predefined threshold.
  • The adjustment of the relative contributions of the accelerometer signal and the microphone signal to the output signal may be based on a determined noise level of the microphone signal. For example, a relative contribution of the accelerometer signal to the mixing module output signal may be increased, relative to the microphone signal contribution, when the microphone signal becomes more noisy. Regardless of the noise level, the mixing module may for example adjust the relative contributions in such a way, that the mixing module output signal only contains the accelerometer signal, in case a loudness level of the microphone signal exceeds a predetermined loudness threshold, e.g. regardless of the noise level and regardless of the user talking.
  • It will be appreciated that the mixing module 70 may be used in conjunction with the crosstalk cancellation module 50. In that case, the mixing module can receive the accelerometer signal from the crosstalk cancellation module 50. In case the mixing module 70 is used without using the crosstalk cancellation module 50, the mixing module can receive the accelerometer signal directly from the accelerometer 20 (or converter 18), as indicated by the dashed arrow in Figure 1.
  • Figure 1 shows an exemplary wearable audio device 100 comprising a particular combination of the occlusion cancellation module 40, the crosstalk cancelling module 50 and the mixing module 70, but it will be appreciated that the wearable audio device can include any combination of said modules. For example, the wearable audio device 100 may include only one, anyone, of said modules, or only two, any two, of said modules. Some or all of the occlusion cancellation module 40, the crosstalk cancelling module 50 and/or the mixing module 70 may be part of the processing unit 16 of the wearable audio device 100. For example, the wearable audio device can include the occlusion cancellation module 40. The wearable audio device can e.g. include the occlusion cancellation module while not including, or not using, the crosstalk cancellation module. It is possible to minimize crosstalk by other, such as mechanical, means. In another example, the wearable audio device can include the crosstalk cancelling module 50. The wearable audio device can e.g. include the crosstalk cancellation module while not including, or not using, the occlusion cancellation module. In another example, the wearable audio device can include the mixing module 70. The wearable audio device can e.g. include the mixing module while not including, or not using, the occlusion cancellation module and/or the crosstalk cancellation module. In an example, the wearable audio device can include the occlusion cancellation module and the crosstalk cancelling module, e.g. while not including the mixing module.
  • Herein, the invention is described with reference to specific examples of embodiments of the invention. It will, however, be evident that various modifications and changes may be made therein, without departing from the essence of the invention. For the purpose of clarity and a concise description, features are described herein as part of the same or separate embodiments, however, alternative embodiments having combinations of all or some of the features described in these separate embodiments are also envisaged.
  • However, other modifications, variations, and alternatives are also possible. The specifications, drawings and examples are, accordingly, to be regarded in an illustrative sense rather than in a restrictive sense.
  • For the purpose of clarity and a concise description, features are described herein as part of the same or separate embodiments, however, it will be appreciated that the scope of the invention may include embodiments having combinations of all or some of the features described.
  • In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word 'comprising' does not exclude the presence of other features or steps than those listed in a claim. Furthermore, the words 'a' and 'an' shall not be construed as limited to 'only one', but instead are used to mean 'at least one', and do not exclude a plurality. The mere fact that certain measures are recited in mutually different claims does not indicate that a combination of these measures cannot be used to an advantage.

Claims (15)

  1. Method of reducing crosstalk in tissue-conduction voice pickup in a wearable audio device having at least an accelerometer for voice pickup and an output speaker, the method comprising:
    determining a characteristic crosstalk transfer function for audio signals being transferred from the output speaker to the accelerometer,
    determining, on the basis of a speaker signal to be output by the speaker and the crosstalk transfer function, an estimated crosstalk signal to be sensed by the accelerometer,
    obtaining from the accelerometer an accelerometer signal representative of voice pickup, and
    determining a modified accelerometer signal by subtracting the estimated crosstalk signal from the accelerometer signal.
  2. Method of claim 1, wherein the crosstalk transfer function is device-specific, and optionally determined, e.g. while the wearable audio device is worn by a user, by outputting, with the output speaker, a predefined excitation signal, and recording a response signal thereof with the accelerometer.
  3. Method of claim 2, wherein the excitation signal includes a maximum length sequence signal.
  4. Method of actively reducing occlusion perception of a wearable audio device, the method comprising:
    obtaining an accelerometer signal with an accelerometer of the wearable audio device indicative of a tissue-conducted speech signal being generated by a user's own voice;
    inverting the accelerometer signal; and
    presenting the inverted accelerometer signal at a speaker of the wearable audio device to be outputted to the user.
  5. Method of claim 4, comprising:
    obtaining an input audio signal;
    determining a modified input audio signal by adding the inverted accelerometer signal to the input audio signal; and
    presenting the modified input audio signal at the speaker of the wearable audio device to be outputted to the user.
  6. Method of claim 5, comprising determining a magnitude spectrum of the accelerometer signal and subtracting said determined magnitude spectrum from a magnitude spectrum of the input audio signal.
  7. Method of claim 4, 5 or 6, comprising obtaining from the accelerometer an accelerometer signal representative of voice pickup and modifying the obtained accelerometer signal by subtracting an estimated crosstalk signal from the accelerometer signal, wherein optionally the estimated crosstalk signal is determined according to any one of claims 1-3.
  8. Method of adjusting a relative contribution to an output audio signal of an accelerometer signal recorded with an accelerometer of a wearable audio device and a microphone signal recorded by one or more microphones of the wearable audio device, the method comprising
    detecting whether or not a speech signal is present in the microphone signal and/or the accelerometer signal,
    determining a noise condition of the microphone signal when no speech signal is detected, and
    adjusting a contribution of the accelerometer signal relative to the microphone signal to the output audio signal based on the determined noise condition.
  9. Method of claim 8, comprising detecting whether or not a speech signal originating from a user's own voice is present in the accelerator signal, and preventing an adjustment of the relative contributions to the output audio signal of the microphone signal and the accelerometer signal in case a presence of a speech signal originating from the user's own voice is detected.
  10. Method of claim 9, comprising allowing the adjustment of the relative contributions to the output audio signal of the microphone signal and the accelerometer signal only in case no speech signal originating from a user's own voice during a predefined time period is detected.
  11. Method of claim 9 or 10, wherein a presence of a speech signal originating from the user's own voice in the accelerometer signal is ascertained, in case a loudness level of the accelerator signal exceeds a predefined loudness threshold level.
  12. Method of any of claims 8-11, comprising increasing the relative contribution of the accelerometer signal, relative to the microphone signal, to the output audio signal, in case the noise condition of the microphone signal exceeds a predefined noise threshold.
  13. Method of any of claims 8-12, comprising increasing the relative contribution of the accelerometer signal, relative to the microphone signal, to the output audio signal, in case a loudness level of the microphone signal exceeds a predefined loudness threshold level.
  14. Method of claim 13, comprising eliminating the contribution of the microphone signal to the output audio signal entirely in case a loudness level of the microphone signal exceeds a predefined loudness threshold level.
  15. Wearable audio device, such as a hearing aid or earphone having an in-ear component, configured for executing a method of any preceding claim.
EP22386020.6A 2022-04-20 2022-04-20 Audio processing method for a wearable auto device Pending EP4266705A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP22386020.6A EP4266705A1 (en) 2022-04-20 2022-04-20 Audio processing method for a wearable auto device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP22386020.6A EP4266705A1 (en) 2022-04-20 2022-04-20 Audio processing method for a wearable auto device

Publications (1)

Publication Number Publication Date
EP4266705A1 true EP4266705A1 (en) 2023-10-25

Family

ID=81648642

Family Applications (1)

Application Number Title Priority Date Filing Date
EP22386020.6A Pending EP4266705A1 (en) 2022-04-20 2022-04-20 Audio processing method for a wearable auto device

Country Status (1)

Country Link
EP (1) EP4266705A1 (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010094812A2 (en) * 2010-06-07 2010-08-26 Phonak Ag Bone conduction hearing aid system
US20140270231A1 (en) * 2013-03-15 2014-09-18 Apple Inc. System and method of mixing accelerometer and microphone signals to improve voice quality in a mobile device
WO2016185441A1 (en) * 2015-05-21 2016-11-24 Cochlear Limited Advanced management of an implantable sound management system
EP3182721A1 (en) * 2015-12-15 2017-06-21 Sony Mobile Communications, Inc. Controlling own-voice experience of talker with occluded ear
US20200020313A1 (en) * 2018-07-16 2020-01-16 Apple Inc. Headphone transparency, occlusion effect mitigation and wind noise detection
CN111683319A (en) * 2020-06-08 2020-09-18 北京爱德发科技有限公司 Call pickup noise reduction method, earphone and storage medium
CN113645534A (en) * 2021-08-31 2021-11-12 歌尔科技有限公司 Earphone blocking effect eliminating method and earphone
US20220068298A1 (en) * 2020-08-25 2022-03-03 Bose Corporation Wearable mixed sensor array for self-voice capture
WO2022054008A1 (en) * 2020-09-11 2022-03-17 Cochlear Limited Advanced noise cancellation

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010094812A2 (en) * 2010-06-07 2010-08-26 Phonak Ag Bone conduction hearing aid system
US20140270231A1 (en) * 2013-03-15 2014-09-18 Apple Inc. System and method of mixing accelerometer and microphone signals to improve voice quality in a mobile device
WO2016185441A1 (en) * 2015-05-21 2016-11-24 Cochlear Limited Advanced management of an implantable sound management system
EP3182721A1 (en) * 2015-12-15 2017-06-21 Sony Mobile Communications, Inc. Controlling own-voice experience of talker with occluded ear
US20200020313A1 (en) * 2018-07-16 2020-01-16 Apple Inc. Headphone transparency, occlusion effect mitigation and wind noise detection
CN111683319A (en) * 2020-06-08 2020-09-18 北京爱德发科技有限公司 Call pickup noise reduction method, earphone and storage medium
US20220068298A1 (en) * 2020-08-25 2022-03-03 Bose Corporation Wearable mixed sensor array for self-voice capture
WO2022054008A1 (en) * 2020-09-11 2022-03-17 Cochlear Limited Advanced noise cancellation
CN113645534A (en) * 2021-08-31 2021-11-12 歌尔科技有限公司 Earphone blocking effect eliminating method and earphone

Similar Documents

Publication Publication Date Title
US11710473B2 (en) Method and device for acute sound detection and reproduction
DK180471B1 (en) Headset with active noise cancellation
US8675884B2 (en) Method and a system for processing signals
JP5315506B2 (en) Method and system for bone conduction sound propagation
US9191740B2 (en) Method and apparatus for in-ear canal sound suppression
JP6495448B2 (en) Self-voice blockage reduction in headset
WO2009136953A1 (en) Method and device for acoustic management control of multiple microphones
CN112543393B (en) Spectral mixing of built-in microphones
US20230011879A1 (en) Method and apparatus for in-ear canal sound suppression
WO2015191588A2 (en) Self-voice feedback in communications headsets
US11335315B2 (en) Wearable electronic device with low frequency noise reduction
EP4266705A1 (en) Audio processing method for a wearable auto device
US20230328462A1 (en) Method, device, headphones and computer program for actively suppressing the occlusion effect during the playback of audio signals
JP2004048207A (en) Hearing aid
US20240147169A1 (en) A hearing aid system and a method of operating a hearing aid system
US20230141100A1 (en) In-ear headphone device with active noise control

Legal Events

Date Code Title Description
STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION HAS BEEN PUBLISHED

PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

RIN1 Information on inventor provided before grant (corrected)

Inventor name: ARENDS, AERNOUT STEVEN FERDINAND

Inventor name: ANTONOPOULOS, ILIAS ATHANASIOS

Inventor name: LIJZENGA, JOHANNES

RBV Designated contracting states (corrected)

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR