WO2023214571A1 - Procédé et système de formation de faisceaux - Google Patents

Procédé et système de formation de faisceaux Download PDF

Info

Publication number
WO2023214571A1
WO2023214571A1 PCT/JP2023/017083 JP2023017083W WO2023214571A1 WO 2023214571 A1 WO2023214571 A1 WO 2023214571A1 JP 2023017083 W JP2023017083 W JP 2023017083W WO 2023214571 A1 WO2023214571 A1 WO 2023214571A1
Authority
WO
WIPO (PCT)
Prior art keywords
filter
mvdr
beamforming
signal
input
Prior art date
Application number
PCT/JP2023/017083
Other languages
English (en)
Japanese (ja)
Inventor
信彦 昼間
洋一 藤坂
Original Assignee
リオン株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by リオン株式会社 filed Critical リオン株式会社
Publication of WO2023214571A1 publication Critical patent/WO2023214571A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • G10L21/0308Voice signal separating characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques

Definitions

  • Binaural beamforming using MVDR is an algorithm that is guaranteed to preserve the desired audio spatial information.
  • this algorithm it is known that when this algorithm is used, the spatial information of the noise is distorted, and the noise is perceived as coming from the same direction as the desired voice (see, for example, Non-Patent Document 1).
  • the present invention employs the following beamforming method and a beamforming system (beamforming device) to which this method is applied. Note that the following words in parentheses are merely examples, and the present invention is not limited thereto.
  • the beamforming method of the fifth aspect various gain calculations are performed for each frequency band in the frequency domain on the second signal path branched from the first signal path, and in this process, the MVDR Gain is also applied. Since coefficients based on the results are calculated on the second signal path and supplied to the FIR filter on the first signal path, according to the beamforming method of the fifth aspect, there is no delay due to analysis and synthesis. , the filtering can be accomplished by an FIR filter on the first signal path. As a result, beamforming with lower delay and more natural hearing can be achieved.
  • FIG. 1 is a block diagram schematically showing a configuration example of a binaural hearing device 100 including a binaural beamformer 1 according to an embodiment.
  • FIG. 2 is a diagram showing an example of a basic configuration of binaural beamforming.
  • FIG. 3 is a diagram more specifically illustrating a basic configuration example of binaural beamforming.
  • FIG. 2 is a block diagram showing in detail a configuration example of a binaural hearing device 100 with two input channels. It is a figure showing an example of the flow of processing in a filter bank of an embodiment.
  • FIG. 7 is a diagram (1/3) showing an example of the flow of processing in a filter bank of a comparative example.
  • the sound input section 10 is a microphone, and converts the sound input into the plurality of microphones into an electrical signal (hereinafter, this signal is referred to as an "input signal"), and sends it to the signal processing section 20.
  • the binaural beamformer 1 performs various signal processing including beamforming using MVDR on the input signal of each microphone, and outputs the processed signal to the sound output section 30.
  • the MVDR-IC algorithm is applied to the binaural beamformer 1. Note that details of the MVDR filter will be described in detail later.
  • the sound output section 30 is a microphone or a speaker, and converts the signals for the left and right channels output from the binaural beamformer 1 into sound and outputs the sound to the outside.
  • the signal processing unit 20 can be implemented, for example, by signal processing by a processor such as a DSP (digital signal processor).
  • N indicates the number of input channels.
  • MVDR is an optimal filter for minimizing distortion of the audio signal
  • the problem is that noise signals are also perceived as coming from the same direction as the audio signal.
  • the above-mentioned non-patent document 4 states that in a diffuse noisy environment, when the desired speech component and the noise component both arrive from the same direction, the SRT corresponding to 50% speech intelligibility does not improve. It has been reported that Therefore, the binaural beamformer 1 employs an MVDR-IC that holds an IC in order to spatially separate the output audio component and the residual noise component.
  • the cost function J of MVDR- IC can be expressed by the following formula.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

Dans un formeur de faisceaux binaural (1) auquel est appliqué un algorithme pour MVDR-IC, un paramètre pour commander un compromis entre la préservation d'IC d'une composante de bruit provoquée par cet algorithme et la suppression de bruit peut être réglé depuis l'extérieur, afin de permettre à un utilisateur de régler le paramètre lui-même ou automatiquement, en fonction d'un environnement auditif et d'obtenir une formation de faisceaux appropriée.
PCT/JP2023/017083 2022-05-06 2023-05-01 Procédé et système de formation de faisceaux WO2023214571A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2022076676A JP2023165528A (ja) 2022-05-06 2022-05-06 ビームフォーミング方法、ビームフォーミングシステム
JP2022-076676 2022-05-06

Publications (1)

Publication Number Publication Date
WO2023214571A1 true WO2023214571A1 (fr) 2023-11-09

Family

ID=88646530

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2023/017083 WO2023214571A1 (fr) 2022-05-06 2023-05-01 Procédé et système de formation de faisceaux

Country Status (2)

Country Link
JP (1) JP2023165528A (fr)
WO (1) WO2023214571A1 (fr)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007123052A1 (fr) * 2006-04-20 2007-11-01 Nec Corporation Dispositif, procédé et programme de commande de réseau adaptatif et dispositif, procédé et programme associés de traitement de réseau adaptatif
US20180330726A1 (en) * 2017-05-15 2018-11-15 Baidu Online Network Technology (Beijing) Co., Ltd Speech recognition method and device based on artificial intelligence

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007123052A1 (fr) * 2006-04-20 2007-11-01 Nec Corporation Dispositif, procédé et programme de commande de réseau adaptatif et dispositif, procédé et programme associés de traitement de réseau adaptatif
US20180330726A1 (en) * 2017-05-15 2018-11-15 Baidu Online Network Technology (Beijing) Co., Ltd Speech recognition method and device based on artificial intelligence

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
DANIEL MARQUARDT ; SIMON DOCLO: "Interaural Coherence Preservation for Binaural Noise Reduction Using Partial Noise Estimation and Spectral Postfiltering", ARXIV:1806.04885V2, vol. 26, no. 7, 1 July 2018 (2018-07-01), pages 1257 - 1270, XP058403498, DOI: 10.1109/TASLP.2018.2823081 *
HIRUMA NOBUHIKO, FUJISAKA YOH-ICHI, MURAYAMA YOSHITAKA, CO RION, JAPAN TOKYO, CEAR ), JAPAN INC TOKYO: "Low-Latency Real-Time Binaural MVDR-IC for Hearing Assistive Device", CONFERENCE: 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), SHOW & TELL DEMONSTRATIONS, 11 May 2022 (2022-05-11), XP093105739, Retrieved from the Internet <URL: https://www.researchgate.net/profile/Nobuhiko-Hiruma/publication/360514499_Low-latency_real-time_binaural_MVDR-IC_for_hearing_assistive_device> [retrieved on 20231127] *
KATES JAMES M., AREHART KATHRYN HOBERG: "Multichannel Dynamic-Range Compression Using Digital Frequency Warping", EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, vol. 2005, no. 18, 1 November 2005 (2005-11-01), pages 3003 - 3014, XP093105735, DOI: 10.1155/ASP.2005.3003 *

Also Published As

Publication number Publication date
JP2023165528A (ja) 2023-11-16

Similar Documents

Publication Publication Date Title
DK2916321T3 (en) Processing a noisy audio signal to estimate target and noise spectral variations
EP2207168B1 (fr) Système robuste de suppression de bruit à deux microphones
US7054451B2 (en) Sound reinforcement system having an echo suppressor and loudspeaker beamformer
EP1417756B1 (fr) Traitement adaptatif du signal par sous-bandes dans un banc de filtres surechantillonne
EP2238592B1 (fr) Procédé de réduction de bruit dans un signal d&#39;entrée d&#39;un dispositif auditif et dispositif auditif
Gilloire et al. Using auditory properties to improve the behaviour of stereophonic acoustic echo cancellers
US10979100B2 (en) Audio signal processing with acoustic echo cancellation
US20030026437A1 (en) Sound reinforcement system having an multi microphone echo suppressor as post processor
US8892432B2 (en) Signal processing system, apparatus and method used on the system, and program thereof
DK3008924T3 (en) METHOD OF SIGNAL PROCESSING IN A HEARING SYSTEM AND HEARING SYSTEM
WO2009104252A1 (fr) Processeur de sons, procédé de traitement de sons et programme de traitement de sons
EP2466914B1 (fr) Réseau de haut-parleur pour rendu sonore ambiophonique virtuel
Marquardt et al. Interaural coherence preservation for binaural noise reduction using partial noise estimation and spectral postfiltering
CN107113484B (zh) 操作助听器系统的方法和助听器系统
DK180745B1 (en) Procedure by a hearing aid
US11373668B2 (en) Enhancement of audio from remote audio sources
WO2023214571A1 (fr) Procédé et système de formation de faisceaux
Corey et al. Binaural audio source remixing with microphone array listening devices
Puder Adaptive signal processing for interference cancellation in hearing aids
EP3886463A1 (fr) Procédé au niveau d&#39;un dispositif auditif
Xiao et al. Effect of target signals and delays on spatially selective active noise control for open-fitting hearables
Puder Acoustic noise control: An overview of several methods based on applications in hearing aids
CA2397084C (fr) Amelioration de l&#39;intelligibilite sonore a l&#39;aide d&#39;un modele psychoacoustique et d&#39;un signal de banc de filtres surechantillonne
CN113286227A (zh) 用于抑制麦克风装置的固有噪声的方法
Hongo et al. Two-input two-output speech enhancement with binaural spatial information using a soft decision mask filter

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23799500

Country of ref document: EP

Kind code of ref document: A1