EP3029671A1 - Procédé et appareil d'amélioration de sources acoustiques - Google Patents

Procédé et appareil d'amélioration de sources acoustiques Download PDF

Info

Publication number
EP3029671A1
EP3029671A1 EP14306947.4A EP14306947A EP3029671A1 EP 3029671 A1 EP3029671 A1 EP 3029671A1 EP 14306947 A EP14306947 A EP 14306947A EP 3029671 A1 EP3029671 A1 EP 3029671A1
Authority
EP
European Patent Office
Prior art keywords
signal
output
audio
source
processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP14306947.4A
Other languages
German (de)
English (en)
Inventor
Quang Khanh Ngoc DUONG
Michel Kerdranvat
Pierre Berthet
Eric ZABRE
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Thomson Licensing SAS
Original Assignee
Thomson Licensing SAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Licensing SAS filed Critical Thomson Licensing SAS
Priority to EP14306947.4A priority Critical patent/EP3029671A1/fr
Priority to EP15766406.1A priority patent/EP3189521B1/fr
Priority to PCT/EP2015/069417 priority patent/WO2016034454A1/fr
Priority to US15/508,925 priority patent/US20170287499A1/en
Priority to KR1020177006109A priority patent/KR102470962B1/ko
Priority to CN201580047111.0A priority patent/CN106716526B/zh
Priority to JP2017512383A priority patent/JP6703525B2/ja
Priority to TW104128191A priority patent/TW201621888A/zh
Publication of EP3029671A1 publication Critical patent/EP3029671A1/fr
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166Microphone arrays; Beamforming
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating

Definitions

  • a method for processing an audio signal is presented, the audio signal being a mixture of at least a first signal from a first audio source and a second signal from a second audio source, comprising: processing the audio signal to generate a first output using a first beamformer pointing to a first direction, the first direction corresponding to the first audio source; processing the audio signal to generate a second output using a second beamformer pointing to a second direction, the second direction corresponding to the second audio source; and processing the first output and the second output to generate an enhanced first signal as described below.
  • an apparatus for performing these steps is also presented.
  • ⁇ j can also be set in an intermediate (i.e., between "soft” cleaning and “hard” cleaning) way by adjusting its values according to the level differences between
  • Audio system 500 uses four microphones m 1 -m 4 (510, 512, 514, 516).
  • the signal from each microphone is transformed from the time domain into the time-frequency domain, for example, using FFT modules (520, 522, 524, 526).
  • Beamformers 530, 532 and 534 perform beamforming based on the time-frequency signals. In one example, beamformers 530, 532 and 534 may point to directions 0°, 90°, 180°, respectively, to sample the sound field (360°).
  • Post-processor 540 performs post-processing based on the outputs of beamformers 530, 532 and 534, for example, using one of the methods described in Eqs. (2)-(7). When a reference signal is used for post-processor, post-processor 540 may use the signal from a microphone (for example, m 4 ) as the reference signal.
  • Audio system 600 also uses four microphones m 1 -m 4 (610, 612, 614, 616).
  • the signal from each microphone is transformed from the time domain into the time-frequency domain, for example, using FFT modules (620, 622, 624, 626).
  • Beamformers 630, 632, 634, 636, and 638 perform beamforming based on the time-frequency signals, and they point to directions 0°, 45°, 90°, 135°, and 180°, respectively.
  • Post-processor 640 performs post-processing based on the outputs of beamformers 630, 632, 634, 636, and 638, for example, using one of the methods described in Eqs. (2)-(7).

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Circuit For Audible Band Transducer (AREA)
EP14306947.4A 2014-09-05 2014-12-04 Procédé et appareil d'amélioration de sources acoustiques Withdrawn EP3029671A1 (fr)

Priority Applications (8)

Application Number Priority Date Filing Date Title
EP14306947.4A EP3029671A1 (fr) 2014-12-04 2014-12-04 Procédé et appareil d'amélioration de sources acoustiques
EP15766406.1A EP3189521B1 (fr) 2014-09-05 2015-08-25 Procédé et appareil permettant d'améliorer des sources sonores
PCT/EP2015/069417 WO2016034454A1 (fr) 2014-09-05 2015-08-25 Procédé et appareil permettant d'améliorer des sources sonores
US15/508,925 US20170287499A1 (en) 2014-09-05 2015-08-25 Method and apparatus for enhancing sound sources
KR1020177006109A KR102470962B1 (ko) 2014-09-05 2015-08-25 사운드 소스들을 향상시키기 위한 방법 및 장치
CN201580047111.0A CN106716526B (zh) 2014-09-05 2015-08-25 用于增强声源的方法和装置
JP2017512383A JP6703525B2 (ja) 2014-09-05 2015-08-25 音源を強調するための方法及び機器
TW104128191A TW201621888A (zh) 2014-09-05 2015-08-27 用於增強音源之方法及裝置

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP14306947.4A EP3029671A1 (fr) 2014-12-04 2014-12-04 Procédé et appareil d'amélioration de sources acoustiques

Publications (1)

Publication Number Publication Date
EP3029671A1 true EP3029671A1 (fr) 2016-06-08

Family

ID=52338939

Family Applications (1)

Application Number Title Priority Date Filing Date
EP14306947.4A Withdrawn EP3029671A1 (fr) 2014-09-05 2014-12-04 Procédé et appareil d'amélioration de sources acoustiques

Country Status (1)

Country Link
EP (1) EP3029671A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113593596A (zh) * 2021-07-07 2021-11-02 中国科学院声学研究所 一种基于子阵划分的鲁棒自适应波束形成定向拾音方法

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090097670A1 (en) * 2007-10-12 2009-04-16 Samsung Electronics Co., Ltd. Method, medium, and apparatus for extracting target sound from mixed sound
US20100081487A1 (en) * 2008-09-30 2010-04-01 Apple Inc. Multiple microphone switching and configuration
EP2562752A1 (fr) * 2010-08-25 2013-02-27 Asahi Kasei Kabushiki Kaisha Dispositif de séparation de sources sonores, procédé de séparation de sources sonores et programme

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090097670A1 (en) * 2007-10-12 2009-04-16 Samsung Electronics Co., Ltd. Method, medium, and apparatus for extracting target sound from mixed sound
US20100081487A1 (en) * 2008-09-30 2010-04-01 Apple Inc. Multiple microphone switching and configuration
EP2562752A1 (fr) * 2010-08-25 2013-02-27 Asahi Kasei Kabushiki Kaisha Dispositif de séparation de sources sonores, procédé de séparation de sources sonores et programme

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113593596A (zh) * 2021-07-07 2021-11-02 中国科学院声学研究所 一种基于子阵划分的鲁棒自适应波束形成定向拾音方法
CN113593596B (zh) * 2021-07-07 2022-05-31 中国科学院声学研究所 一种基于子阵划分的鲁棒自适应波束形成定向拾音方法

Similar Documents

Publication Publication Date Title
EP3189521B1 (fr) Procédé et appareil permettant d'améliorer des sources sonores
US10650796B2 (en) Single-channel, binaural and multi-channel dereverberation
US10395660B2 (en) Apparatus and method for multichannel direct-ambient decompostion for audio signal processing
JP5007442B2 (ja) 発話改善のためにマイク間レベル差を用いるシステム及び方法
KR101726737B1 (ko) 다채널 음원 분리 장치 및 그 방법
EP2984852B1 (fr) Procédé et appareil pour enregistrer du son spatial
EP3526979B1 (fr) Procédé et appareil d'égalisation de signal de sortie entre microphones
CN112567763B (zh) 用于音频信号处理的装置和方法
US20130016854A1 (en) Microphone array processing system
US9743215B2 (en) Apparatus and method for center signal scaling and stereophonic enhancement based on a signal-to-downmix ratio
JP4448464B2 (ja) 雑音低減方法、装置、プログラム及び記録媒体
WO2022256577A1 (fr) Procédé d'amélioration de la parole et dispositif informatique mobile mettant en oeuvre le procédé
US11962992B2 (en) Spatial audio processing
EP3029671A1 (fr) Procédé et appareil d'amélioration de sources acoustiques
Matsumoto Vision-referential speech enhancement of an audio signal using mask information captured as visual data
Zou et al. Speech enhancement with an acoustic vector sensor: an effective adaptive beamforming and post-filtering approach
CN117121104A (zh) 估计用于处理所获取的声音数据的优化掩模

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME

18D Application deemed to be withdrawn

Effective date: 20161209

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN