EP3029671A1 - Procédé et appareil d'amélioration de sources acoustiques - Google Patents
Procédé et appareil d'amélioration de sources acoustiques Download PDFInfo
- Publication number
- EP3029671A1 EP3029671A1 EP14306947.4A EP14306947A EP3029671A1 EP 3029671 A1 EP3029671 A1 EP 3029671A1 EP 14306947 A EP14306947 A EP 14306947A EP 3029671 A1 EP3029671 A1 EP 3029671A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- signal
- output
- audio
- source
- processing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000000034 method Methods 0.000 title claims description 50
- 230000002708 enhancing effect Effects 0.000 title description 5
- 230000004807 localization Effects 0.000 claims abstract description 20
- 239000000203 mixture Substances 0.000 claims abstract description 17
- 238000012545 processing Methods 0.000 claims description 31
- 230000005236 sound signal Effects 0.000 claims description 25
- 230000004044 response Effects 0.000 claims description 5
- 230000008569 process Effects 0.000 claims description 4
- 238000012805 post-processing Methods 0.000 description 19
- 230000003595 spectral effect Effects 0.000 description 12
- 238000000926 separation method Methods 0.000 description 7
- 238000004140 cleaning Methods 0.000 description 6
- 238000004891 communication Methods 0.000 description 4
- 238000001914 filtration Methods 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 238000013459 approach Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000001629 suppression Effects 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000000873 masking effect Effects 0.000 description 1
- 230000003446 memory effect Effects 0.000 description 1
- 229920001690 polydopamine Polymers 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
Definitions
- a method for processing an audio signal is presented, the audio signal being a mixture of at least a first signal from a first audio source and a second signal from a second audio source, comprising: processing the audio signal to generate a first output using a first beamformer pointing to a first direction, the first direction corresponding to the first audio source; processing the audio signal to generate a second output using a second beamformer pointing to a second direction, the second direction corresponding to the second audio source; and processing the first output and the second output to generate an enhanced first signal as described below.
- an apparatus for performing these steps is also presented.
- ⁇ j can also be set in an intermediate (i.e., between "soft” cleaning and “hard” cleaning) way by adjusting its values according to the level differences between
- Audio system 500 uses four microphones m 1 -m 4 (510, 512, 514, 516).
- the signal from each microphone is transformed from the time domain into the time-frequency domain, for example, using FFT modules (520, 522, 524, 526).
- Beamformers 530, 532 and 534 perform beamforming based on the time-frequency signals. In one example, beamformers 530, 532 and 534 may point to directions 0°, 90°, 180°, respectively, to sample the sound field (360°).
- Post-processor 540 performs post-processing based on the outputs of beamformers 530, 532 and 534, for example, using one of the methods described in Eqs. (2)-(7). When a reference signal is used for post-processor, post-processor 540 may use the signal from a microphone (for example, m 4 ) as the reference signal.
- Audio system 600 also uses four microphones m 1 -m 4 (610, 612, 614, 616).
- the signal from each microphone is transformed from the time domain into the time-frequency domain, for example, using FFT modules (620, 622, 624, 626).
- Beamformers 630, 632, 634, 636, and 638 perform beamforming based on the time-frequency signals, and they point to directions 0°, 45°, 90°, 135°, and 180°, respectively.
- Post-processor 640 performs post-processing based on the outputs of beamformers 630, 632, 634, 636, and 638, for example, using one of the methods described in Eqs. (2)-(7).
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Circuit For Audible Band Transducer (AREA)
Priority Applications (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP14306947.4A EP3029671A1 (fr) | 2014-12-04 | 2014-12-04 | Procédé et appareil d'amélioration de sources acoustiques |
EP15766406.1A EP3189521B1 (fr) | 2014-09-05 | 2015-08-25 | Procédé et appareil permettant d'améliorer des sources sonores |
PCT/EP2015/069417 WO2016034454A1 (fr) | 2014-09-05 | 2015-08-25 | Procédé et appareil permettant d'améliorer des sources sonores |
US15/508,925 US20170287499A1 (en) | 2014-09-05 | 2015-08-25 | Method and apparatus for enhancing sound sources |
KR1020177006109A KR102470962B1 (ko) | 2014-09-05 | 2015-08-25 | 사운드 소스들을 향상시키기 위한 방법 및 장치 |
CN201580047111.0A CN106716526B (zh) | 2014-09-05 | 2015-08-25 | 用于增强声源的方法和装置 |
JP2017512383A JP6703525B2 (ja) | 2014-09-05 | 2015-08-25 | 音源を強調するための方法及び機器 |
TW104128191A TW201621888A (zh) | 2014-09-05 | 2015-08-27 | 用於增強音源之方法及裝置 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP14306947.4A EP3029671A1 (fr) | 2014-12-04 | 2014-12-04 | Procédé et appareil d'amélioration de sources acoustiques |
Publications (1)
Publication Number | Publication Date |
---|---|
EP3029671A1 true EP3029671A1 (fr) | 2016-06-08 |
Family
ID=52338939
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP14306947.4A Withdrawn EP3029671A1 (fr) | 2014-09-05 | 2014-12-04 | Procédé et appareil d'amélioration de sources acoustiques |
Country Status (1)
Country | Link |
---|---|
EP (1) | EP3029671A1 (fr) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113593596A (zh) * | 2021-07-07 | 2021-11-02 | 中国科学院声学研究所 | 一种基于子阵划分的鲁棒自适应波束形成定向拾音方法 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090097670A1 (en) * | 2007-10-12 | 2009-04-16 | Samsung Electronics Co., Ltd. | Method, medium, and apparatus for extracting target sound from mixed sound |
US20100081487A1 (en) * | 2008-09-30 | 2010-04-01 | Apple Inc. | Multiple microphone switching and configuration |
EP2562752A1 (fr) * | 2010-08-25 | 2013-02-27 | Asahi Kasei Kabushiki Kaisha | Dispositif de séparation de sources sonores, procédé de séparation de sources sonores et programme |
-
2014
- 2014-12-04 EP EP14306947.4A patent/EP3029671A1/fr not_active Withdrawn
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090097670A1 (en) * | 2007-10-12 | 2009-04-16 | Samsung Electronics Co., Ltd. | Method, medium, and apparatus for extracting target sound from mixed sound |
US20100081487A1 (en) * | 2008-09-30 | 2010-04-01 | Apple Inc. | Multiple microphone switching and configuration |
EP2562752A1 (fr) * | 2010-08-25 | 2013-02-27 | Asahi Kasei Kabushiki Kaisha | Dispositif de séparation de sources sonores, procédé de séparation de sources sonores et programme |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113593596A (zh) * | 2021-07-07 | 2021-11-02 | 中国科学院声学研究所 | 一种基于子阵划分的鲁棒自适应波束形成定向拾音方法 |
CN113593596B (zh) * | 2021-07-07 | 2022-05-31 | 中国科学院声学研究所 | 一种基于子阵划分的鲁棒自适应波束形成定向拾音方法 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3189521B1 (fr) | Procédé et appareil permettant d'améliorer des sources sonores | |
US10650796B2 (en) | Single-channel, binaural and multi-channel dereverberation | |
US10395660B2 (en) | Apparatus and method for multichannel direct-ambient decompostion for audio signal processing | |
JP5007442B2 (ja) | 発話改善のためにマイク間レベル差を用いるシステム及び方法 | |
KR101726737B1 (ko) | 다채널 음원 분리 장치 및 그 방법 | |
EP2984852B1 (fr) | Procédé et appareil pour enregistrer du son spatial | |
EP3526979B1 (fr) | Procédé et appareil d'égalisation de signal de sortie entre microphones | |
CN112567763B (zh) | 用于音频信号处理的装置和方法 | |
US20130016854A1 (en) | Microphone array processing system | |
US9743215B2 (en) | Apparatus and method for center signal scaling and stereophonic enhancement based on a signal-to-downmix ratio | |
JP4448464B2 (ja) | 雑音低減方法、装置、プログラム及び記録媒体 | |
WO2022256577A1 (fr) | Procédé d'amélioration de la parole et dispositif informatique mobile mettant en oeuvre le procédé | |
US11962992B2 (en) | Spatial audio processing | |
EP3029671A1 (fr) | Procédé et appareil d'amélioration de sources acoustiques | |
Matsumoto | Vision-referential speech enhancement of an audio signal using mask information captured as visual data | |
Zou et al. | Speech enhancement with an acoustic vector sensor: an effective adaptive beamforming and post-filtering approach | |
CN117121104A (zh) | 估计用于处理所获取的声音数据的优化掩模 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
AX | Request for extension of the european patent |
Extension state: BA ME |
|
18D | Application deemed to be withdrawn |
Effective date: 20161209 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |