WO2023214571A1 - Procédé et système de formation de faisceaux - Google Patents
Procédé et système de formation de faisceaux Download PDFInfo
- Publication number
- WO2023214571A1 WO2023214571A1 PCT/JP2023/017083 JP2023017083W WO2023214571A1 WO 2023214571 A1 WO2023214571 A1 WO 2023214571A1 JP 2023017083 W JP2023017083 W JP 2023017083W WO 2023214571 A1 WO2023214571 A1 WO 2023214571A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- filter
- mvdr
- beamforming
- signal
- input
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims description 44
- 230000008569 process Effects 0.000 claims description 17
- 238000004364 calculation method Methods 0.000 claims description 6
- 238000013461 design Methods 0.000 claims description 5
- 230000001629 suppression Effects 0.000 abstract description 5
- 238000004321 preservation Methods 0.000 abstract description 4
- 238000012545 processing Methods 0.000 description 49
- 230000000052 comparative effect Effects 0.000 description 15
- 238000010586 diagram Methods 0.000 description 15
- 238000004458 analytical method Methods 0.000 description 14
- 230000005236 sound signal Effects 0.000 description 9
- 230000006870 function Effects 0.000 description 8
- 230000014759 maintenance of location Effects 0.000 description 5
- 230000004044 response Effects 0.000 description 5
- 239000000872 buffer Substances 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 238000001914 filtration Methods 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 230000003595 spectral effect Effects 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000017105 transposition Effects 0.000 description 2
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000003139 buffering effect Effects 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- G10L21/0308—Voice signal separating characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
Definitions
- Binaural beamforming using MVDR is an algorithm that is guaranteed to preserve the desired audio spatial information.
- this algorithm it is known that when this algorithm is used, the spatial information of the noise is distorted, and the noise is perceived as coming from the same direction as the desired voice (see, for example, Non-Patent Document 1).
- the present invention employs the following beamforming method and a beamforming system (beamforming device) to which this method is applied. Note that the following words in parentheses are merely examples, and the present invention is not limited thereto.
- the beamforming method of the fifth aspect various gain calculations are performed for each frequency band in the frequency domain on the second signal path branched from the first signal path, and in this process, the MVDR Gain is also applied. Since coefficients based on the results are calculated on the second signal path and supplied to the FIR filter on the first signal path, according to the beamforming method of the fifth aspect, there is no delay due to analysis and synthesis. , the filtering can be accomplished by an FIR filter on the first signal path. As a result, beamforming with lower delay and more natural hearing can be achieved.
- FIG. 1 is a block diagram schematically showing a configuration example of a binaural hearing device 100 including a binaural beamformer 1 according to an embodiment.
- FIG. 2 is a diagram showing an example of a basic configuration of binaural beamforming.
- FIG. 3 is a diagram more specifically illustrating a basic configuration example of binaural beamforming.
- FIG. 2 is a block diagram showing in detail a configuration example of a binaural hearing device 100 with two input channels. It is a figure showing an example of the flow of processing in a filter bank of an embodiment.
- FIG. 7 is a diagram (1/3) showing an example of the flow of processing in a filter bank of a comparative example.
- the sound input section 10 is a microphone, and converts the sound input into the plurality of microphones into an electrical signal (hereinafter, this signal is referred to as an "input signal"), and sends it to the signal processing section 20.
- the binaural beamformer 1 performs various signal processing including beamforming using MVDR on the input signal of each microphone, and outputs the processed signal to the sound output section 30.
- the MVDR-IC algorithm is applied to the binaural beamformer 1. Note that details of the MVDR filter will be described in detail later.
- the sound output section 30 is a microphone or a speaker, and converts the signals for the left and right channels output from the binaural beamformer 1 into sound and outputs the sound to the outside.
- the signal processing unit 20 can be implemented, for example, by signal processing by a processor such as a DSP (digital signal processor).
- N indicates the number of input channels.
- MVDR is an optimal filter for minimizing distortion of the audio signal
- the problem is that noise signals are also perceived as coming from the same direction as the audio signal.
- the above-mentioned non-patent document 4 states that in a diffuse noisy environment, when the desired speech component and the noise component both arrive from the same direction, the SRT corresponding to 50% speech intelligibility does not improve. It has been reported that Therefore, the binaural beamformer 1 employs an MVDR-IC that holds an IC in order to spatially separate the output audio component and the residual noise component.
- the cost function J of MVDR- IC can be expressed by the following formula.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Circuit For Audible Band Transducer (AREA)
Abstract
Dans un formeur de faisceaux binaural (1) auquel est appliqué un algorithme pour MVDR-IC, un paramètre pour commander un compromis entre la préservation d'IC d'une composante de bruit provoquée par cet algorithme et la suppression de bruit peut être réglé depuis l'extérieur, afin de permettre à un utilisateur de régler le paramètre lui-même ou automatiquement, en fonction d'un environnement auditif et d'obtenir une formation de faisceaux appropriée.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2022076676A JP2023165528A (ja) | 2022-05-06 | 2022-05-06 | ビームフォーミング方法、ビームフォーミングシステム |
JP2022-076676 | 2022-05-06 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2023214571A1 true WO2023214571A1 (fr) | 2023-11-09 |
Family
ID=88646530
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2023/017083 WO2023214571A1 (fr) | 2022-05-06 | 2023-05-01 | Procédé et système de formation de faisceaux |
Country Status (2)
Country | Link |
---|---|
JP (1) | JP2023165528A (fr) |
WO (1) | WO2023214571A1 (fr) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007123052A1 (fr) * | 2006-04-20 | 2007-11-01 | Nec Corporation | Dispositif, procédé et programme de commande de réseau adaptatif et dispositif, procédé et programme associés de traitement de réseau adaptatif |
US20180330726A1 (en) * | 2017-05-15 | 2018-11-15 | Baidu Online Network Technology (Beijing) Co., Ltd | Speech recognition method and device based on artificial intelligence |
-
2022
- 2022-05-06 JP JP2022076676A patent/JP2023165528A/ja active Pending
-
2023
- 2023-05-01 WO PCT/JP2023/017083 patent/WO2023214571A1/fr unknown
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007123052A1 (fr) * | 2006-04-20 | 2007-11-01 | Nec Corporation | Dispositif, procédé et programme de commande de réseau adaptatif et dispositif, procédé et programme associés de traitement de réseau adaptatif |
US20180330726A1 (en) * | 2017-05-15 | 2018-11-15 | Baidu Online Network Technology (Beijing) Co., Ltd | Speech recognition method and device based on artificial intelligence |
Non-Patent Citations (3)
Title |
---|
DANIEL MARQUARDT ; SIMON DOCLO: "Interaural Coherence Preservation for Binaural Noise Reduction Using Partial Noise Estimation and Spectral Postfiltering", ARXIV:1806.04885V2, vol. 26, no. 7, 1 July 2018 (2018-07-01), pages 1257 - 1270, XP058403498, DOI: 10.1109/TASLP.2018.2823081 * |
HIRUMA NOBUHIKO, FUJISAKA YOH-ICHI, MURAYAMA YOSHITAKA, CO RION, JAPAN TOKYO, CEAR ), JAPAN INC TOKYO: "Low-Latency Real-Time Binaural MVDR-IC for Hearing Assistive Device", CONFERENCE: 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), SHOW & TELL DEMONSTRATIONS, 11 May 2022 (2022-05-11), XP093105739, Retrieved from the Internet <URL: https://www.researchgate.net/profile/Nobuhiko-Hiruma/publication/360514499_Low-latency_real-time_binaural_MVDR-IC_for_hearing_assistive_device> [retrieved on 20231127] * |
KATES JAMES M., AREHART KATHRYN HOBERG: "Multichannel Dynamic-Range Compression Using Digital Frequency Warping", EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, vol. 2005, no. 18, 1 November 2005 (2005-11-01), pages 3003 - 3014, XP093105735, DOI: 10.1155/ASP.2005.3003 * |
Also Published As
Publication number | Publication date |
---|---|
JP2023165528A (ja) | 2023-11-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DK2916321T3 (en) | Processing a noisy audio signal to estimate target and noise spectral variations | |
EP2207168B1 (fr) | Système robuste de suppression de bruit à deux microphones | |
US7054451B2 (en) | Sound reinforcement system having an echo suppressor and loudspeaker beamformer | |
EP1417756B1 (fr) | Traitement adaptatif du signal par sous-bandes dans un banc de filtres surechantillonne | |
EP2238592B1 (fr) | Procédé de réduction de bruit dans un signal d'entrée d'un dispositif auditif et dispositif auditif | |
Gilloire et al. | Using auditory properties to improve the behaviour of stereophonic acoustic echo cancellers | |
US10979100B2 (en) | Audio signal processing with acoustic echo cancellation | |
US20030026437A1 (en) | Sound reinforcement system having an multi microphone echo suppressor as post processor | |
US8892432B2 (en) | Signal processing system, apparatus and method used on the system, and program thereof | |
DK3008924T3 (en) | METHOD OF SIGNAL PROCESSING IN A HEARING SYSTEM AND HEARING SYSTEM | |
WO2009104252A1 (fr) | Processeur de sons, procédé de traitement de sons et programme de traitement de sons | |
EP2466914B1 (fr) | Réseau de haut-parleur pour rendu sonore ambiophonique virtuel | |
Marquardt et al. | Interaural coherence preservation for binaural noise reduction using partial noise estimation and spectral postfiltering | |
CN107113484B (zh) | 操作助听器系统的方法和助听器系统 | |
DK180745B1 (en) | Procedure by a hearing aid | |
US11373668B2 (en) | Enhancement of audio from remote audio sources | |
WO2023214571A1 (fr) | Procédé et système de formation de faisceaux | |
Corey et al. | Binaural audio source remixing with microphone array listening devices | |
Puder | Adaptive signal processing for interference cancellation in hearing aids | |
EP3886463A1 (fr) | Procédé au niveau d'un dispositif auditif | |
Xiao et al. | Effect of target signals and delays on spatially selective active noise control for open-fitting hearables | |
Puder | Acoustic noise control: An overview of several methods based on applications in hearing aids | |
CA2397084C (fr) | Amelioration de l'intelligibilite sonore a l'aide d'un modele psychoacoustique et d'un signal de banc de filtres surechantillonne | |
CN113286227A (zh) | 用于抑制麦克风装置的固有噪声的方法 | |
Hongo et al. | Two-input two-output speech enhancement with binaural spatial information using a soft decision mask filter |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23799500 Country of ref document: EP Kind code of ref document: A1 |