WO2023119764A1 - 耳装着型デバイス、及び、再生方法 - Google Patents

耳装着型デバイス、及び、再生方法 Download PDF

Info

Publication number
WO2023119764A1
WO2023119764A1 PCT/JP2022/035130 JP2022035130W WO2023119764A1 WO 2023119764 A1 WO2023119764 A1 WO 2023119764A1 JP 2022035130 W JP2022035130 W JP 2022035130W WO 2023119764 A1 WO2023119764 A1 WO 2023119764A1
Authority
WO
WIPO (PCT)
Prior art keywords
sound
sound signal
ear
signal
ratio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/JP2022/035130
Other languages
English (en)
French (fr)
Japanese (ja)
Inventor
伸一郎 栗原
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Panasonic Intellectual Property Management Co Ltd
Original Assignee
Panasonic Intellectual Property Management Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Panasonic Intellectual Property Management Co Ltd filed Critical Panasonic Intellectual Property Management Co Ltd
Priority to JP2023569068A priority Critical patent/JPWO2023119764A1/ja
Priority to EP22910490.6A priority patent/EP4456561A4/en
Publication of WO2023119764A1 publication Critical patent/WO2023119764A1/ja
Priority to US18/745,000 priority patent/US20240339101A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • G10K11/1783Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase handling or detecting of non-standard events or conditions, e.g. changing operating modes under specific operating conditions
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • G10K11/1781Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase characterised by the analysis of input or output signals, e.g. frequency range, modes, transfer functions
    • G10K11/17821Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase characterised by the analysis of input or output signals, e.g. frequency range, modes, transfer functions characterised by the analysis of the input signals only
    • G10K11/17827Desired external signals, e.g. pass-through audio such as music or speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • G10K11/1781Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase characterised by the analysis of input or output signals, e.g. frequency range, modes, transfer functions
    • G10K11/17821Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase characterised by the analysis of input or output signals, e.g. frequency range, modes, transfer functions characterised by the analysis of the input signals only
    • G10K11/17823Reference signals, e.g. ambient acoustic environment
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • G10K11/1783Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase handling or detecting of non-standard events or conditions, e.g. changing operating modes under specific operating conditions
    • G10K11/17837Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase handling or detecting of non-standard events or conditions, e.g. changing operating modes under specific operating conditions by retaining part of the ambient acoustic environment, e.g. speech or alarm signals that the user needs to hear
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K11/00Methods or devices for transmitting, conducting or directing sound in general; Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/16Methods or devices for protecting against, or for damping, noise or other acoustic waves in general
    • G10K11/175Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound
    • G10K11/178Methods or devices for protecting against, or for damping, noise or other acoustic waves in general using interference effects; Masking sound by electro-acoustically regenerating the original acoustic waves in anti-phase
    • G10K11/1787General system configurations
    • G10K11/17873General system configurations using a reference signal without an error signal, e.g. pure feedforward
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/08Mouthpieces; Microphones; Attachments therefor
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1016Earpieces of the intra-aural type
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1041Mechanical or electronic switches, or control elements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1083Reduction of ambient noise
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1091Details not provided for in groups H04R1/1008 - H04R1/1083
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10KSOUND-PRODUCING DEVICES; METHODS OR DEVICES FOR PROTECTING AGAINST, OR FOR DAMPING, NOISE OR OTHER ACOUSTIC WAVES IN GENERAL; ACOUSTICS NOT OTHERWISE PROVIDED FOR
    • G10K2210/00Details of active noise control [ANC] covered by G10K11/178 but not provided for in any of its subgroups
    • G10K2210/10Applications
    • G10K2210/108Communication systems, e.g. where useful sound is kept and noise is cancelled
    • G10K2210/1081Earphones, e.g. for telephones, ear protectors or headsets
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L21/0232Processing in the frequency domain
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R2225/00Details of deaf aids covered by H04R25/00, not provided for in any of its subgroups
    • H04R2225/43Signal processing in hearing aids to enhance the speech intelligibility
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R2430/00Signal processing covered by H04R, not provided for in its groups
    • H04R2430/01Aspects of volume control, not necessarily automatic, in sound systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R2460/00Details of hearing devices, i.e. of ear- or headphones covered by H04R1/10 or H04R5/033 but not provided for in any of their subgroups, or of hearing aids covered by H04R25/00 but not provided for in any of its subgroups
    • H04R2460/01Hearing devices using active noise cancellation

Definitions

  • the present disclosure relates to an ear-worn device and a reproduction method.
  • Patent Literature 1 discloses a technology related to headphones.
  • the present disclosure provides an ear-worn device capable of reproducing the voices of people heard in the surroundings.
  • An ear-mounted device includes a microphone that acquires sound and outputs a first sound signal of the acquired sound, a determination regarding the S/N ratio of the first sound signal, and the sound Determining the bandwidth based on the peak frequency in the power spectrum and determining whether or not the sound includes a human voice, and at least one of the S / N ratio and the bandwidth is a predetermined
  • a signal processing circuit that outputs a second sound signal based on the first sound signal when it is determined that the requirements are satisfied and the sound includes a human voice; and a signal processing circuit based on the output second sound signal and a housing that accommodates the microphone, the signal processing circuit, and the speaker.
  • An ear-mounted device can reproduce human voices heard in the surroundings.
  • FIG. 1 is an external view of a device that constitutes a sound signal processing system according to an embodiment.
  • FIG. 2 is a block diagram showing the functional configuration of the sound signal processing system according to the embodiment.
  • FIG. 3 is a diagram for explaining a case in which the transition to the external sound capture mode is not made even though an announcement sound is being output.
  • FIG. 4 is a flowchart of Example 1 of the ear-mounted device according to the embodiment.
  • FIG. 5 is a first flow chart of the operation in the external sound capture mode of the ear-mounted device according to the embodiment.
  • FIG. 6 is a second flow chart of the operation in the external sound capture mode of the ear-worn device according to the embodiment.
  • FIG. 1 is an external view of a device that constitutes a sound signal processing system according to an embodiment.
  • FIG. 2 is a block diagram showing the functional configuration of the sound signal processing system according to the embodiment.
  • FIG. 3 is a diagram for explaining a case in which the transition to the external sound capture mode is not
  • FIG. 7 is a flowchart of operations in the noise canceling mode of the ear-worn device according to the embodiment.
  • FIG. 8 is a flow chart of Example 2 of the ear-mounted device according to the embodiment.
  • FIG. 9 is a diagram showing an example of an operation mode selection screen.
  • each figure is a schematic diagram and is not necessarily strictly illustrated. Moreover, in each figure, the same code
  • FIG. 1 is an external view of a device that constitutes a sound signal processing system according to an embodiment.
  • FIG. 2 is a block diagram showing the functional configuration of the sound signal processing system according to the embodiment.
  • the sound signal processing system 10 includes an ear-worn device 20 and a mobile terminal 30.
  • the ear-worn device 20 is an earphone-type device that reproduces the fourth sound signal provided from the mobile terminal 30 .
  • the fourth sound signal is, for example, a sound signal of music content.
  • the ear-worn device 20 has an external sound capturing function (also referred to as an external sound capturing mode) that captures sounds around the user during reproduction of the fourth sound signal.
  • the surrounding sounds here are, for example, announcement sounds.
  • the announcement sound is, for example, inside a moving body such as a train, a bus, or an airplane, and is output from a speaker provided in the moving body.
  • the announcement sound includes human voice.
  • the ear-mounted device 20 operates in a normal mode of reproducing the fourth sound signal provided from the mobile terminal 30, and operates in an external sound capture mode of capturing and reproducing the surrounding sounds of the user. For example, when the user wearing the ear-worn device 20 is on a mobile object that is moving and listening to music content in the normal mode, an announcement sound is output in the mobile object and is output. If the announced sound includes a human voice, the ear-worn device 20 automatically transitions from the normal mode to the external sound capture mode. This prevents the user from missing the announcement sound.
  • the ear-worn device 20 specifically includes a microphone 21, a DSP 22, a communication circuit 27a, a mixing circuit 27b, and a speaker 28.
  • the communication circuit 27a and the mixing circuit 27b may be included in the DSP 22.
  • Microphone 21, DSP 22, communication circuit 27a, mixing circuit 27b, and speaker 28 are housed in housing 29 (shown in FIG. 1).
  • the microphone 21 is a sound pickup device that acquires sounds around the ear-mounted device 20 and outputs a first sound signal based on the acquired sounds.
  • the microphone 21 is specifically a condenser microphone, a dynamic microphone, or a MEMS (Micro Electro Mechanical Systems) microphone, but is not particularly limited. Also, the microphone 21 may be omnidirectional or directional.
  • the DSP 22 implements an external sound capture function by performing signal processing on the first sound signal output from the microphone 21 .
  • the DSP 22 realizes an external sound capturing function by outputting a second sound signal based on the first sound signal to the speaker 28, for example.
  • the DSP 22 also has a noise canceling function, and can output to the speaker 28 a third sound signal obtained by performing signal processing including phase inversion processing on the first sound signal.
  • DSP22 is an example of a signal processing circuit.
  • the DSP 22 includes a high-pass filter 23, a noise extraction unit 24a, an S/N ratio calculation unit 24b, a bandwidth calculation unit 24c, a voice feature amount calculation unit 24d, a determination unit 24e, a switching unit 24f, and a memory 26.
  • the high-pass filter 23 attenuates the components in the band of 512 Hz or less included in the first sound signal output from the microphone 21 .
  • the high-pass filter 23 is, for example, a nonlinear digital filter.
  • the cutoff frequency of the high-pass filter 23 is an example, and the cutoff frequency may be determined empirically or experimentally. The cutoff frequency may be determined, for example, according to the type of mobile object in which the ear-worn device 20 is assumed to be used.
  • the noise extraction unit 24a, the S/N ratio calculation unit 24b, the bandwidth calculation unit 24c, the audio feature amount calculation unit 24d, the determination unit 24e, and the switching unit 24f are functional components.
  • the functions of these components are realized, for example, by DSP 22 executing a computer program stored in memory 26 .
  • the details of the functions of the noise extractor 24a, the S/N ratio calculator 24b, the bandwidth calculator 24c, the voice feature amount calculator 24d, the determiner 24e, and the switcher 24f will be described later.
  • the memory 26 is a storage device that stores computer programs executed by the DSP 22 and various information necessary for realizing the external sound capturing function.
  • the memory 26 is implemented by a semiconductor memory or the like. Note that the memory 26 may be realized as an external memory of the DSP 22 instead of an internal memory of the DSP 22 .
  • the communication circuit 27 a receives the fourth sound signal from the mobile terminal 30 .
  • the communication circuit 27a is, for example, a wireless communication circuit, and communicates with the mobile terminal 30 based on a communication standard such as Bluetooth (registered trademark) or BLE (Bluetooth (registered trademark) Low Energy).
  • the mixing circuit 27 b mixes the fourth sound signal received by the communication circuit 27 a with one of the second sound signal and the third sound signal output by the DSP 22 and outputs the result to the speaker 28 .
  • the communication circuit 27a and the mixing circuit 27b may be realized as one SoC (System-on-a-Chip).
  • the speaker 28 outputs reproduced sound based on the mixed sound signal obtained from the mixing circuit 27b.
  • the speaker 28 is a speaker that emits sound waves toward the ear canal (eardrum) of the user wearing the ear-worn device 20, but may be a bone conduction speaker.
  • the mobile terminal 30 is an information terminal that functions as a user interface device in the sound signal processing system 10 by installing a predetermined application program.
  • the mobile terminal 30 also functions as a sound source that provides the ear-worn device 20 with a fourth sound signal (music content). Specifically, by operating the mobile terminal 30 , the user can select music content to be reproduced by the speaker 28 , switch the operation mode of the ear-worn device 20 , and the like.
  • the mobile terminal 30 includes a UI (User Interface) 31 , a communication circuit 32 , a CPU 33 and a memory 34 .
  • the UI 31 is a user interface device that receives user operations and presents images to the user.
  • the UI 31 is implemented by an operation reception unit such as a touch panel and a display unit such as a display panel.
  • the UI 31 may be a voice UI that accepts user's voice, and in this case, the UI 31 is realized by a microphone and a speaker.
  • the communication circuit 32 transmits the fourth sound signal, which is the sound signal of the music content selected by the user, to the ear-mounted device 20 .
  • the communication circuit 32 is, for example, a wireless communication circuit, and communicates with the ear-worn device 20 based on a communication standard such as Bluetooth (registered trademark) or BLE (Bluetooth (registered trademark) Low Energy).
  • the CPU 33 performs information processing related to image display on the display unit, transmission of the fourth sound signal using the communication circuit 32, and the like.
  • the CPU 33 is implemented by, for example, a microcomputer, but may be implemented by a processor.
  • the image display function, the fourth sound signal transmission function, and the like are realized by the CPU 33 executing a computer program stored in the memory 34 .
  • the memory 34 is a storage device that stores various information necessary for the CPU 33 to process information, a computer program executed by the CPU 33, a fourth sound signal (music content), and the like.
  • the memory 34 is implemented by, for example, a semiconductor memory.
  • the ear-worn device 20 can automatically transition to the external sound capture mode when an announcement sound is output while the user is riding in the vehicle. For example, when the S/N ratio of the sound signal of the sound acquired by the microphone 21 is relatively high and the sound includes a human voice, an announcement sound (human (a relatively loud voice) is output.
  • the passenger's speech human This is considered to be the time when a relatively low voice of the
  • the external sound capture mode is an operation mode that makes it easier to hear the announcement sound instead of the passenger's voice. Therefore, in the ear-worn device 20, the S/N ratio of the sound signal of the sound acquired by the microphone 21 is higher than a threshold (hereinafter also referred to as a first threshold), and the sound includes human voice. It is conceivable that the operation of the external sound capturing mode should be performed at times.
  • FIG. 3 is a diagram for explaining such a case.
  • FIG. 3 is a diagram showing temporal changes in the power spectrum of the sound acquired by the microphone 21, where the vertical axis indicates frequency and the horizontal axis indicates time.
  • (b) of FIG. 3 is a diagram showing the temporal change of the bandwidth with reference to the peak frequency (the frequency at which the power is maximized) in the power spectrum of (a) of FIG. 3, and the vertical axis is the bandwidth. , the horizontal axis indicates time. As will be described later, more specifically, the peak frequency is the peak frequency in the frequency band of 512 Hz or higher.
  • (c) of FIG. 3 shows the period during which the announcement sound is actually output, and (d) of FIG. A period higher than the first threshold is shown.
  • the S/N ratio is determined to be equal to or less than the first threshold, but as shown in (c) of FIG. there is That is, when the S/N ratio of the sound signal of the sound acquired by the microphone 21 is higher than the first threshold and the sound includes a human voice, in the configuration in which the external sound capture mode is operated, the period At T, the external sound capturing mode operation is not performed.
  • the reason why the S/N ratio is low in period T is that the announcement sound is output, but the noise due to the movement of the moving object is larger than that.
  • the S/N ratio is low even if an announcement sound is output. becomes low.
  • the ear-worn device 20 determines whether the bandwidth is narrower than the threshold (hereinafter also referred to as the second threshold). make a judgment.
  • (e) of FIG. 3 shows a period in which the bandwidth is narrower than the second threshold.
  • the ear-mounted device 20 regards a period in which the bandwidth is narrower than the second threshold as a period in which an announcement sound may be output even if the S/N ratio is equal to or lower than the first threshold.
  • the period during which it is determined that there is a possibility that the announcement sound is being output based on both the S/N ratio and the bandwidth is as shown in FIG. 3(f). This period includes the period during which the announcement sound is actually output, as shown in FIG. 3(c).
  • the ear-worn device 20 performs the determination regarding the bandwidth in addition to the determination regarding the S/N ratio, and does not operate in the external sound capture mode even though the announcement sound is being output. You can suppress the occurrence of the situation.
  • Example 1 A plurality of embodiments of the ear-mounted device 20 will be described below, taking specific situations as examples. First, Example 1 of the ear-mounted device 20 will be described. FIG. 4 is a flow chart of Example 1 of the ear-worn device 20 . It should be noted that Example 1 shows an operation that is assumed to be used when the user wearing the ear-mounted device 20 is on a mobile object.
  • the microphone 21 acquires sound and outputs a first sound signal of the acquired sound (S11).
  • the S/N ratio calculator 24b calculates the S/N ratio based on the noise component of the first sound signal output from the microphone 21 and the signal component obtained by subtracting the noise component from the first sound signal. Calculate (S12). Extraction of the noise component is performed by the noise extractor 24a. Extraction of the noise component is performed based on the power spectrum estimation method of the noise component used in the spectral subtraction method.
  • the S/N ratio calculated in step S12 is, for example, a parameter obtained by dividing the average value of the power of the signal component in the frequency domain by the average value of the power of the noise component in the frequency domain.
  • the power spectrum of the noise component estimated separately is subtracted from the power spectrum of the sound signal containing the noise component, and the power spectrum of the sound signal after the power spectrum of the noise component is subtracted is subjected to inverse Fourier transform. is a method of obtaining a sound signal (the above-mentioned signal component) in which the noise component is reduced.
  • the power spectrum of the noise component can be estimated based on the signal belonging to the non-speech section (the section where the signal component is small and the noise component occupies most) in the sound signal.
  • the non-speech section may be specified in any manner, but is specified based on the determination result of the determination unit 24e, for example. As will be described later, the determination unit 24e determines whether or not the sound acquired by the microphone 21 includes a human voice. The determined segment can be adopted as the non-speech segment.
  • the bandwidth calculation unit 24c performs signal processing on the first sound signal to which the high-pass filter 23 is applied, thereby calculating the bandwidth based on the peak frequency in the power spectrum of the sound acquired by the microphone 21. Calculate (S13).
  • the bandwidth calculation unit 24c calculates the power spectrum of the sound by Fourier transforming the first sound signal to which the high-pass filter 23 is applied, and calculates the peak frequency (the maximum power) in the spectrum of the sound. frequency). Further, the bandwidth calculation unit 24c uses the power at the peak frequency as a reference (100%), and when the power at a frequency lower than the peak frequency in the power spectrum decreases by a predetermined rate (for example, 80%) from the peak frequency is specified as the lower frequency limit. The bandwidth calculation unit 24c uses the power at the peak frequency as a reference, and sets the frequency, which is higher than the peak frequency in the power spectrum and at which the power drops by a predetermined rate (eg, 80%) from the peak frequency, as the upper limit frequency. Identify. The bandwidth calculator 24c can calculate the width from the lower limit frequency to the upper limit frequency as the bandwidth.
  • the sound feature amount calculation unit 24d calculates MFCC (Mel-Frequency Cepstral Coefficient) by performing signal processing on the first sound signal output from the microphone 21 (S14).
  • MFCC is a coefficient of cepstrum that is used as a feature quantity in speech recognition, etc.
  • the determination unit 24e determines whether at least one of the S/N ratio calculated in step S12 and the bandwidth calculated in step S13 satisfies a predetermined requirement (S15).
  • a predetermined requirement for the S/N ratio is that the S/N ratio is higher than a first threshold
  • a predetermined requirement for the bandwidth is that the bandwidth is narrower than a second threshold. That is, in step S15, the determination unit 24e determines that the S/N ratio calculated in step S12 is higher than the first threshold, and that the bandwidth calculated in step S13 is narrower than the second threshold. Determine whether at least one of the requirements is satisfied.
  • the first threshold and the second threshold are appropriately determined empirically or experimentally.
  • the determination unit 24e determines the microphone 21 based on the MFCC calculated by the audio feature amount calculation unit 24d. It is determined whether or not the sound acquired by includes a human voice (S16).
  • the determination unit 24e includes, for example, a machine learning model (neural network) that receives the MFCC as an input and outputs a determination result as to whether or not the sound contains a human voice. Using such a machine learning model, the microphone 21 determines whether or not the sound acquired by includes a human voice.
  • the human voice here is assumed to be the human voice included in the announcement sound.
  • the switching unit 24f When it is determined that the sound acquired by the microphone 21 includes a human voice (Yes in S16), the switching unit 24f operates from the normal mode to the external sound capture mode (S17). That is, the ear-mounted device 20 (switching unit 24f) determines that at least one of the S/N ratio and the bandwidth satisfies the predetermined requirements (Yes in S15) and that human voice is being output. When it does (Yes in S16), the external sound capturing mode is operated (S17).
  • FIG. 5 is a first flow chart of operations in the ambient sound capture mode.
  • the switching unit 24f In the external sound capture mode, the switching unit 24f generates a second sound signal by performing equalizing processing for emphasizing a specific frequency component in the first sound signal output by the microphone 21, and generates the second sound signal. is output (S17a).
  • a specific frequency component is, for example, a frequency component of 100 Hz or more and 2 kHz or less. If the band corresponding to the frequency band of the human voice is emphasized in this way, the human voice is thereby emphasized, so the announcement sound (more specifically, the human voice included in the announcement sound) is emphasized. be.
  • the mixing circuit 27b mixes the fourth sound signal (music content) received by the communication circuit 27a with the second sound signal and outputs the result to the speaker 28 (S17b), and the speaker 28 outputs the mixed fourth sound signal.
  • a reproduced sound is output based on the second sound signal (S17c). Since the announcement sound is emphasized as a result of the process of step S17a, the user of the ear-worn device 20 can easily hear the announcement sound.
  • the switching unit 24f when it is determined that neither the S/N ratio nor the bandwidth satisfies the predetermined requirements (No in S15 of FIG. 4), and it is determined that the sound does not contain human voice If so (Yes in S15 and No in S16), the switching unit 24f operates in the normal mode (S18).
  • the reproduced sound (music content) of the fourth sound signal received by the communication circuit 27a is output from the speaker 28, and the reproduced sound based on the second sound signal is not output. That is, the switching unit 24f does not cause the speaker 28 to output the reproduced sound based on the second sound signal.
  • the processing shown in the flowchart of FIG. 4 above is repeated every predetermined time. In other words, it is determined in which mode, the normal mode or the external sound capturing mode, the operation is to be performed at predetermined time intervals.
  • the predetermined time is, for example, 1/60 second.
  • the DSP 22 determines the S/N ratio of the first sound signal of the sound acquired by the microphone 21, determines the bandwidth based on the peak frequency in the power spectrum of the sound, and determines the bandwidth of the sound. If it is determined that at least one of the S/N ratio and bandwidth satisfies the predetermined requirements and that the sound contains human voice , to output a second sound signal based on the first sound signal. Specifically, the DSP 22 outputs a second sound signal obtained by performing signal processing on the first sound signal. This signal processing includes equalizing processing for emphasizing specific frequency components of sound. Further, when the DSP 22 determines that neither the S/N ratio nor the bandwidth satisfies the predetermined requirements, and when it determines that the sound does not contain human voice, the speaker 28 Playback sound based on the second sound signal is not output.
  • the ear-worn device 20 can assist the user on the mobile body to hear the announcement sound while the mobile body is moving. Even if the user is immersed in the music content, it becomes difficult for the user to miss the announcement sound. Moreover, the ear-worn device 20 makes a determination regarding the bandwidth in addition to the determination regarding the S/N ratio, so that the external sound capture mode operation is not performed even though the announcement sound is being output. The occurrence can be suppressed.
  • the operation in the ambient sound capturing mode is not limited to the operation shown in FIG.
  • the equalizing process is performed in step S17a
  • the second sound signal may be generated by signal processing for increasing the gain (increasing the amplitude) of the first sound signal.
  • the signal processing performed on the first sound signal when generating the second sound signal does not include phase inversion processing.
  • the external sound capture mode it is not essential that the first sound signal is subjected to signal processing.
  • FIG. 6 is a second flowchart of the operation in the ambient sound capturing mode.
  • the switching unit 24f outputs the first sound signal output by the microphone 21 as the second sound signal (S17d). That is, the switching unit 24f outputs the first sound signal substantially as it is as the second sound signal.
  • the switching unit 24f also instructs the mixing circuit 27b to attenuate the fourth sound signal (gain down, amplitude attenuation) during mixing.
  • the mixing circuit 27b mixes the second sound signal with the fourth sound signal (music content) whose amplitude is attenuated compared to the normal mode, and outputs the result to the speaker 28 (S17e). A reproduced sound is output based on the second sound signal obtained by mixing the signals (S17f).
  • the amplitude is attenuated more than during the operation of the normal mode before the output of the second sound signal is started.
  • the resulting fourth sound signal may be mixed with the second sound signal.
  • the operation in the external sound capturing mode is not limited to the operation shown in FIGS. 5 and 6.
  • the fourth sound signal attenuated as in step S17e in FIG. May be mixed.
  • the process of attenuating the fourth sound signal may be omitted, and the unattenuated fourth sound signal may be mixed with the second sound signal.
  • the process of stopping the output of the fourth sound signal from the mobile terminal 30, the process of setting the amplitude of the fourth sound signal to 0, and the mixing in the mixing circuit 27b are stopped (by performing at least one process such as the process of not mixing the fourth sound signal, the music content does not have to be output from the speaker 28 . That is, in the external sound capture mode, the user does not have to hear the music content.
  • the ear-worn device 20 has a noise canceling function (hereinafter also referred to as a noise canceling mode) that reduces environmental sounds around the user wearing the ear-worn device 20 during reproduction of the fourth sound signal (music content). ).
  • a noise canceling mode that reduces environmental sounds around the user wearing the ear-worn device 20 during reproduction of the fourth sound signal (music content).
  • the noise cancellation mode will be explained.
  • the CPU 33 uses the communication circuit 32 to issue a setting command for setting the noise cancellation mode to the ear-worn device 20 .
  • the setting command is received by the communication circuit 27a of the ear-worn device 20, the switching section 24f operates in the noise canceling mode.
  • FIG. 7 is a flowchart of operations in noise cancellation mode.
  • the switching unit 24f performs signal processing including phase inversion processing on the first sound signal output from the microphone 21, and outputs the result as a third sound signal (S19a).
  • This signal processing may include equalizing processing, gain-up processing, or the like, in addition to phase inversion processing.
  • a specific frequency component is, for example, a frequency component of 100 Hz or more and 2 kHz or less.
  • the mixing circuit 27b mixes the fourth sound signal (music content) received by the communication circuit 27a with the third sound signal and outputs the result to the speaker 28 (S19b), and the speaker 28 outputs the mixed fourth sound signal.
  • a reproduced sound is output based on the third sound signal (S19c).
  • FIG. 8 is a flow chart of Example 2 of the ear-worn device 20 .
  • Example 2 shows the operation when the user wearing the ear-worn device 20 rides on a moving object.
  • steps S11 to S14 in FIG. 8 is the same as the processing of steps S11 to S14 in the first embodiment (FIG. 4).
  • the determination unit 24e determines whether at least one of the S/N ratio calculated in step S12 and the bandwidth calculated in step S13 satisfies a predetermined requirement (S15 ). Details of the processing in step S15 are the same as in step S15 of the first embodiment (FIG. 4). Specifically, the determining unit 24e satisfies the requirement that the S/N ratio calculated in step S12 is higher than the first threshold, and the requirement that the bandwidth calculated in step S13 is narrower than the second threshold. is satisfied.
  • the determination unit 24e determines the microphone 21 based on the MFCC calculated by the audio feature amount calculation unit 24d. It is determined whether or not the sound acquired by includes a human voice (S16). Details of the processing in step S16 are the same as in step S16 of the first embodiment (FIG. 4).
  • the switching unit 24f switches from the noise canceling mode to the external sound capturing mode (S16). That is, the ear-mounted device 20 (switching unit 24f) determines that at least one of the S/N ratio and the bandwidth satisfies the predetermined requirements (Yes in S15) and that human voice is being output.
  • the external sound capturing mode is operated (S17). The operation in the external sound capture mode is as described with reference to FIGS. 5 and 6 and the like. Since the announcement sound is emphasized according to the operation in the external sound capture mode, the user of the ear-worn device 20 can easily hear the announcement sound.
  • the switching unit 24f operates in the noise canceling mode (S19).
  • the noise cancellation mode operation is as described with reference to FIG.
  • the processing shown in the flowchart of FIG. 8 is repeated at predetermined time intervals. In other words, it is determined in which mode, the noise canceling mode or the external sound capturing mode, the operation is to be performed at predetermined time intervals.
  • the predetermined time is, for example, 1/60 second.
  • a third sound signal obtained by performing phase inversion processing on the first sound signal is output.
  • the speaker 28 outputs reproduced sound based on the outputted third sound signal.
  • the ear-worn device 20 can help the user on a mobile object to clearly listen to music content while the mobile object is moving.
  • FIG. 9 is a diagram showing an example of an operation mode selection screen.
  • the user-selectable operation modes include, for example, three modes: normal mode, noise cancellation mode, and ambient sound capture mode. That is, the ear-worn device 20 may operate in the external sound capturing mode based on the user's operation on the mobile terminal 30 .
  • the CPU 33 transmits an operation mode switching command to the ear-worn device 20 via the communication circuit 32 based on the operation mode selection operation accepted by the UI 31 .
  • the switching unit 24f of the ear-worn device 20 can acquire an operating mode switching command via the communication circuit 27a, and switch the operating mode based on the acquired operating mode switching command.
  • the ear-mounted device 20 includes the microphone 21 that acquires sound and outputs the first sound signal of the acquired sound, the determination of the S/N ratio of the first sound signal, the power of the sound, and the Determining the bandwidth based on the peak frequency in the spectrum and determining whether or not the sound includes a human voice, and at least one of the S/N ratio and the bandwidth satisfies predetermined requirements and a DSP 22 that outputs a second sound signal based on the first sound signal when it is determined that the sound includes a human voice, and a speaker that outputs a reproduced sound based on the output second sound signal.
  • DSP22 is an example of a signal processing circuit.
  • Such an ear-worn device 20 can reproduce the voices of people heard in the surroundings.
  • the ear-worn device 20 can output a reproduced sound including the announcement sound from the speaker 28 when an announcement sound is output inside the mobile object while the mobile object is moving.
  • the DSP 22 determines that at least one of the S/N ratio and the bandwidth satisfies a predetermined requirement and that the sound includes a human voice
  • the DSP 22 converts the first sound signal to the second sound signal. Output as sound signal.
  • Such an ear-mounted device 20 can reproduce the voice of a person who can be heard in the surroundings based on the first sound signal.
  • the DSP 22 determines that at least one of the S/N ratio and the bandwidth satisfies a predetermined requirement and that the sound includes a human voice
  • the signal processing is performed on the first sound signal. to output the second sound signal.
  • Such an ear-mounted device 20 can reproduce the voices of people heard around it based on the signal-processed first sound signal.
  • the signal processing includes equalizing processing for emphasizing a specific frequency component of the sound.
  • Such an ear-mounted device 20 can emphasize and reproduce the voices of people heard in the surroundings.
  • the speaker 28 is not caused to output the reproduced sound based on the second sound signal.
  • Such an ear-mounted device 20 can stop outputting the reproduced sound based on the second sound signal when, for example, no human voice can be heard in the surroundings.
  • a third sound signal obtained by phase-inverting the first sound signal is output, and the speaker 28 outputs a reproduced sound based on the output third sound signal.
  • Such an ear-mounted device 20 can make it difficult to hear surrounding sounds when, for example, people's voices cannot be heard around them.
  • the predetermined requirement for the S/N ratio is that the S/N ratio is higher than the first threshold
  • the predetermined requirement for the bandwidth is that the bandwidth is narrower than the second threshold
  • Such an ear-mounted device 20 is used when the S/N ratio is estimated to be low due to excessive noise, that is, when the voices of people heard in the surroundings are buried in excessive noise. In addition, it is possible to reproduce the voices of people heard in the surroundings.
  • the ear-worn device 20 further includes a mixing circuit 27b that mixes the outputted second sound signal with the fourth sound signal provided from the sound source.
  • a mixing circuit 27b that mixes the outputted second sound signal with the fourth sound signal provided from the sound source.
  • Such an ear-mounted device 20 can emphasize and reproduce the voices of people heard in the surroundings.
  • the reproduction method executed by a computer such as the DSP 22 is based on the first sound signal of the sound output by the microphone 21 that acquires the sound, the determination of the S / N ratio of the first sound signal, the sound Judgment steps S15 and S16 for judging the bandwidth based on the peak frequency in the power spectrum of and judging whether or not the sound contains a human voice, the S/N ratio, and the bandwidth an output step S17a (or S17d) of outputting a second sound signal based on the first sound signal when it is determined that at least one of them satisfies a predetermined requirement and the sound includes a human voice; and a reproducing step S17c (or S17f) of outputting a reproduced sound from the speaker 28 based on the second sound signal.
  • Such a reproduction method can reproduce the voices of people who can be heard in the surroundings.
  • the ear-mounted device was described as an earphone-type device, but it may be a headphone-type device. Further, in the above embodiments, the ear-mounted device has the function of reproducing music content, but may not have the function of reproducing music content (communication circuit and mixing circuit).
  • the ear-worn device may be earplugs or hearing aids with noise cancellation and ambient sound capture capabilities.
  • the machine learning model is used to determine whether or not the sound acquired by the microphone contains a human voice. It may also be based on other algorithms that do not use models.
  • the configuration of the ear-mounted device according to the above embodiment is an example.
  • the ear worn device may include components not shown such as D/A converters, filters, power amplifiers, or A/D converters.
  • the sound signal processing system is realized by a plurality of devices, but it may be realized by a single device.
  • the functional components included in the sound signal processing system may be distributed to the plurality of devices in any way.
  • the mobile terminal may include some or all of the functional components included in the ear-worn device.
  • the communication method between devices in the above embodiment is not particularly limited.
  • a relay device (not shown) may intervene between the two devices.
  • the order of processing described in the above embodiment is an example.
  • the order of multiple processes may be changed, and multiple processes may be executed in parallel.
  • a process executed by a specific processing unit may be executed by another processing unit.
  • part of the digital signal processing described in the above embodiments may be realized by analog signal processing.
  • each component may be realized by executing a software program suitable for each component.
  • Each component may be realized by reading and executing a software program recorded in a recording medium such as a hard disk or a semiconductor memory by a program execution unit such as a CPU or processor.
  • each component may be realized by hardware.
  • each component may be a circuit (or integrated circuit). These circuits may form one circuit as a whole, or may be separate circuits. These circuits may be general-purpose circuits or dedicated circuits.
  • general or specific aspects of the present disclosure may be implemented in a system, apparatus, method, integrated circuit, computer program, or recording medium such as a computer-readable CD-ROM.
  • any combination of systems, devices, methods, integrated circuits, computer programs and recording media may be implemented.
  • the present disclosure may be implemented as a reproduction method executed by a computer such as an ear-worn device or a mobile terminal, or may be implemented as a program for causing a computer to execute such a reproduction method.
  • the present disclosure may be implemented as a computer-readable non-temporary recording medium in which such a program is recorded.
  • the program here includes an application program for causing a general-purpose mobile terminal to function as the mobile terminal of the above embodiment.
  • the ear-mounted device of the present disclosure can output reproduced sounds including the voices of surrounding people according to the surrounding noise environment.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Human Computer Interaction (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Headphones And Earphones (AREA)
PCT/JP2022/035130 2021-12-21 2022-09-21 耳装着型デバイス、及び、再生方法 Ceased WO2023119764A1 (ja)

Priority Applications (3)

Application Number Priority Date Filing Date Title
JP2023569068A JPWO2023119764A1 (https=) 2021-12-21 2022-09-21
EP22910490.6A EP4456561A4 (en) 2021-12-21 2022-09-21 EAR-MOUNTED DEVICE AND REPRODUCTION METHOD
US18/745,000 US20240339101A1 (en) 2021-12-21 2024-06-17 Ear-worn device and reproduction method

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2021-207539 2021-12-21
JP2021207539 2021-12-21

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US18/745,000 Continuation US20240339101A1 (en) 2021-12-21 2024-06-17 Ear-worn device and reproduction method

Publications (1)

Publication Number Publication Date
WO2023119764A1 true WO2023119764A1 (ja) 2023-06-29

Family

ID=86901840

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2022/035130 Ceased WO2023119764A1 (ja) 2021-12-21 2022-09-21 耳装着型デバイス、及び、再生方法

Country Status (4)

Country Link
US (1) US20240339101A1 (https=)
EP (1) EP4456561A4 (https=)
JP (1) JPWO2023119764A1 (https=)
WO (1) WO2023119764A1 (https=)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP4354898A4 (en) * 2021-06-08 2024-10-16 Panasonic Intellectual Property Management Co., Ltd. Ear-mounted device and reproduction method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090010442A1 (en) * 2007-06-28 2009-01-08 Personics Holdings Inc. Method and device for background mitigation
JP2009021826A (ja) 2007-07-12 2009-01-29 Sony Corp 信号処理装置、信号処理方法、プログラム、ノイズキャンセリングシステム
JP2021511755A (ja) * 2017-12-07 2021-05-06 エイチイーディ・テクノロジーズ・エスアーエルエル 音声認識オーディオシステムおよび方法

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090010442A1 (en) * 2007-06-28 2009-01-08 Personics Holdings Inc. Method and device for background mitigation
JP2009021826A (ja) 2007-07-12 2009-01-29 Sony Corp 信号処理装置、信号処理方法、プログラム、ノイズキャンセリングシステム
JP2021511755A (ja) * 2017-12-07 2021-05-06 エイチイーディ・テクノロジーズ・エスアーエルエル 音声認識オーディオシステムおよび方法

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP4456561A4

Also Published As

Publication number Publication date
JPWO2023119764A1 (https=) 2023-06-29
US20240339101A1 (en) 2024-10-10
EP4456561A4 (en) 2025-04-16
EP4456561A1 (en) 2024-10-30

Similar Documents

Publication Publication Date Title
US11710473B2 (en) Method and device for acute sound detection and reproduction
CN109195045B (zh) 检测耳机佩戴状态的方法、装置及耳机
US9071900B2 (en) Multi-channel recording
KR101540896B1 (ko) 전자 디바이스 상에서의 마스킹 신호 생성
CN104918177B (zh) 信号处理装置、信号处理方法和程序
US20160086594A1 (en) Noise reduction audio reproducing device and noise reduction audio reproducing methods
EP3433857B1 (en) Method and apparatus for noise reduction
US12520080B2 (en) Audio processing based on target signal-to-noise ratio
JP2014514609A (ja) 改善されたオーディオのための統合されたサイコアコースティック・バス・エンハンスメント(pbe)
US20160267925A1 (en) Audio processing apparatus that outputs, among sounds surrounding user, sound to be provided to user
JP7576780B2 (ja) 耳装着型デバイス、及び、再生方法
JP7734346B2 (ja) 耳装着型デバイス、及び、再生方法
WO2024205944A1 (en) Audio processing based on target signal-to-noise ratio
US20240339101A1 (en) Ear-worn device and reproduction method
JPWO2016059878A1 (ja) 信号処理装置、信号処理方法及びコンピュータプログラム
JP5202021B2 (ja) 音声信号変換装置、音声信号変換方法、制御プログラム、および、コンピュータ読み取り可能な記録媒体
WO2025174605A1 (en) Artificial intelligence awareness modes for adjusting output of an audio device
JP2019016851A (ja) 音声処理装置、音声処理方法、及びプログラム
CN115580678A (zh) 一种数据处理方法、装置和设备
WO2024127986A1 (ja) 音声処理システム、音声処理方法、及びプログラム
CN119545274A (zh) 拾音方法及装置、声音播放方法及装置
WO2022230275A1 (ja) 情報処理装置、情報処理方法、及び、プログラム
CN115190212A (zh) 基于耳机设备的通话降噪方法、装置、耳机设备以及介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22910490

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2023569068

Country of ref document: JP

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 2022910490

Country of ref document: EP

Effective date: 20240722