MX2021007323A - Aparato y metodo para separacion de fuente usando una estimacion y control de calidad de sonido. - Google Patents

Aparato y metodo para separacion de fuente usando una estimacion y control de calidad de sonido.

Info

Publication number
MX2021007323A
MX2021007323A MX2021007323A MX2021007323A MX2021007323A MX 2021007323 A MX2021007323 A MX 2021007323A MX 2021007323 A MX2021007323 A MX 2021007323A MX 2021007323 A MX2021007323 A MX 2021007323A MX 2021007323 A MX2021007323 A MX 2021007323A
Authority
MX
Mexico
Prior art keywords
signal
audio
residual
estimated
target
Prior art date
Application number
MX2021007323A
Other languages
English (en)
Inventor
Jürgen Herre
Harald Fuchs
Sascha Disch
Jouni Paulus
Oliver Hellmuth
Christian Uhle
Matteo Torcoli
Original Assignee
Fraunhofer Ges Forschung
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Ges Forschung filed Critical Fraunhofer Ges Forschung
Publication of MX2021007323A publication Critical patent/MX2021007323A/es

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • G10L21/0308Voice signal separating characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/60Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for measuring the quality of voice signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Theoretical Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Biomedical Technology (AREA)
  • Tone Control, Compression And Expansion, Limiting Amplitude (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)

Abstract

Se proporciona un aparato para generar una señal de audio separada de una señal de entrada de audio. La señal de audio de entrada comprende una porción de señal de audio objetivo y una porción de señal de audio residual. La porción de señal de audio residual indica un residuo entre la señal de audio de entrada y a porción de señal de audio objetivo: El aparato comprende un separador de fuente (110), un módulo de determinación (120) y un procesador de señales (130). El separador de fuente (110) se configura para determinar una señal objetivo estimada que depende la señal de audio de entrada, siendo la señal objetivo estimada un estimación de una señal que únicamente comprende la porción de señal de audio objetivo. El módulo de determinación (120) se configura para determinar uno o más valores de resultado dependiendo de una calidad de sonido estimada de la señal objetivo estimada para obtener uno o más valores de parámetro, donde uno o más de os valores de parámetro son uno o más valores de resultado o depende de uno o más de los valores de resultado. El procesador de señales (130) se configura para generarla señal de audio separada dependiendo de uno o más de los valores de parámetro y dependiendo de al menos una de la señal objetivo estimada y la señal de entrada de audio y una señal residual estimada, siendo la señal residual estimada una estimación de una señal que únicamente comprende la porción de señal de audio residual.
MX2021007323A 2018-12-21 2019-12-20 Aparato y metodo para separacion de fuente usando una estimacion y control de calidad de sonido. MX2021007323A (es)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP18215707.3A EP3671739A1 (en) 2018-12-21 2018-12-21 Apparatus and method for source separation using an estimation and control of sound quality
PCT/EP2019/086565 WO2020127900A1 (en) 2018-12-21 2019-12-20 Apparatus and method for source separation using an estimation and control of sound quality

Publications (1)

Publication Number Publication Date
MX2021007323A true MX2021007323A (es) 2021-08-24

Family

ID=65011753

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2021007323A MX2021007323A (es) 2018-12-21 2019-12-20 Aparato y metodo para separacion de fuente usando una estimacion y control de calidad de sonido.

Country Status (10)

Country Link
US (1) US20210312939A1 (es)
EP (2) EP3671739A1 (es)
JP (1) JP7314279B2 (es)
KR (1) KR102630449B1 (es)
CN (1) CN113574597B (es)
BR (1) BR112021012308A2 (es)
CA (1) CA3124017C (es)
ES (1) ES2966063T3 (es)
MX (1) MX2021007323A (es)
WO (1) WO2020127900A1 (es)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116997962A (zh) * 2020-11-30 2023-11-03 杜比国际公司 基于卷积神经网络的鲁棒侵入式感知音频质量评估
CN113470689B (zh) * 2021-08-23 2024-01-30 杭州国芯科技股份有限公司 一种语音分离方法
WO2023073596A1 (en) * 2021-10-27 2023-05-04 WingNut Films Productions Limited Audio source separation processing workflow systems and methods
US11763826B2 (en) 2021-10-27 2023-09-19 WingNut Films Productions Limited Audio source separation processing pipeline systems and methods
US20230126779A1 (en) * 2021-10-27 2023-04-27 WingNut Films Productions Limited Audio Source Separation Systems and Methods
CN113850246B (zh) * 2021-11-30 2022-02-18 杭州一知智能科技有限公司 基于对偶一致网络的声源定位与声源分离的方法和系统
CN117475360B (zh) * 2023-12-27 2024-03-26 南京纳实医学科技有限公司 基于改进型mlstm-fcn的音视频特点的生物特征提取与分析方法

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1808571A (zh) * 2005-01-19 2006-07-26 松下电器产业株式会社 声音信号分离系统及方法
US7464029B2 (en) * 2005-07-22 2008-12-09 Qualcomm Incorporated Robust separation of speech signals in a noisy environment
EP2375409A1 (en) * 2010-04-09 2011-10-12 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder, audio decoder and related methods for processing multi-channel audio signals using complex prediction
DE102011084035A1 (de) * 2011-10-05 2013-04-11 Nero Ag Vorrichtung, verfahren und computerprogramm zur bewertung einer wahrgenommenen audioqualität
EP2747081A1 (en) 2012-12-18 2014-06-25 Oticon A/s An audio processing device comprising artifact reduction
SG11201507066PA (en) * 2013-03-05 2015-10-29 Fraunhofer Ges Forschung Apparatus and method for multichannel direct-ambient decomposition for audio signal processing
EP2790419A1 (en) * 2013-04-12 2014-10-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for center signal scaling and stereophonic enhancement based on a signal-to-downmix ratio
GB2516483B (en) * 2013-07-24 2018-07-18 Canon Kk Sound source separation method
JP6143887B2 (ja) * 2013-12-26 2017-06-07 株式会社東芝 方法、電子機器およびプログラム
WO2016033269A1 (en) * 2014-08-28 2016-03-03 Analog Devices, Inc. Audio processing using an intelligent microphone
US10397711B2 (en) * 2015-09-24 2019-08-27 Gn Hearing A/S Method of determining objective perceptual quantities of noisy speech signals
MX2018003529A (es) * 2015-09-25 2018-08-01 Fraunhofer Ges Forschung Codificador y metodo para codificar una se?al de audio con ruido de fondo reducido que utiliza codificacion predictiva lineal.
KR20170101629A (ko) * 2016-02-29 2017-09-06 한국전자통신연구원 스테레오 오디오 신호 기반의 다국어 오디오 서비스 제공 장치 및 방법
EP3220661B1 (en) * 2016-03-15 2019-11-20 Oticon A/s A method for predicting the intelligibility of noisy and/or enhanced speech and a binaural hearing system
EP3453187B1 (en) * 2016-05-25 2020-05-13 Huawei Technologies Co., Ltd. Audio signal processing stage, audio signal processing apparatus and audio signal processing method
DK3252766T3 (da) * 2016-05-30 2021-09-06 Oticon As Audiobehandlingsanordning og fremgangsmåde til estimering af signal-til-støj-forholdet for et lydsignal
US10861478B2 (en) * 2016-05-30 2020-12-08 Oticon A/S Audio processing device and a method for estimating a signal-to-noise-ratio of a sound signal
CN106531190B (zh) * 2016-10-12 2020-05-05 科大讯飞股份有限公司 语音质量评价方法和装置
CN106847301A (zh) * 2017-01-03 2017-06-13 东南大学 一种基于压缩感知和空间方位信息的双耳语音分离方法
EP3474280B1 (en) * 2017-10-19 2021-07-07 Goodix Technology (HK) Company Limited Signal processor for speech signal enhancement
CN107993671A (zh) * 2017-12-04 2018-05-04 南京地平线机器人技术有限公司 声音处理方法、装置和电子设备
EP3573058B1 (en) * 2018-05-23 2021-02-24 Harman Becker Automotive Systems GmbH Dry sound and ambient sound separation

Also Published As

Publication number Publication date
EP3671739A1 (en) 2020-06-24
BR112021012308A2 (pt) 2021-09-08
EP3899936B1 (en) 2023-09-06
ES2966063T3 (es) 2024-04-18
CA3124017C (en) 2024-01-16
WO2020127900A1 (en) 2020-06-25
KR102630449B1 (ko) 2024-01-31
JP7314279B2 (ja) 2023-07-25
CN113574597B (zh) 2024-04-12
JP2022514878A (ja) 2022-02-16
CA3124017A1 (en) 2020-06-25
EP3899936C0 (en) 2023-09-06
KR20210110622A (ko) 2021-09-08
CN113574597A (zh) 2021-10-29
US20210312939A1 (en) 2021-10-07
EP3899936A1 (en) 2021-10-27

Similar Documents

Publication Publication Date Title
MX2021007323A (es) Aparato y metodo para separacion de fuente usando una estimacion y control de calidad de sonido.
MX2018004828A (es) Método y aparato para generar una señal de audio filtrada realizando representación de elevación.
WO2020031061A3 (en) Mvd precision for affine
MX2018005090A (es) Aparato, metodo o programa de computadora para generar una descripcion de campo de sonido.
EP3373292A3 (en) Method for controlling artificial intelligence system that performs multilingual processing
WO2016009444A3 (en) Music performance system and method thereof
MX2016012543A (es) Metodo y aparato de reproduccion de señal acustica y medio de grabacion susceptible de ser leido en computadora.
MY189000A (en) Audio processing device and method, and program therefor
PH12019500347A1 (en) Method for determining change in distance, location prompting method and apparatus and system thereof
PL3216236T3 (pl) Urządzenie i sposób generowania sygnałów wyjściowych w oparciu o sygnał źródła audio, system odtwarzania dźwięku i sygnał głośników
MX337845B (es) Aparato para proveer una señal de audio para reproduccion por un transductor de sonido, sistema, metodo y programa de computadora.
MY188581A (en) Headtracking for parametric binaural output system and method
MX2021006078A (es) Metodo para monitorear una instalacion ganadera y/o animales de ganado en una instalacion ganadera usando tecnicas de procesamiento de sonido mejoradas.
DK2306756T3 (da) Fremgangsmåde til finindstilling af et høreapparat samt høreapparat
US9530429B2 (en) Reverberation suppression apparatus used for auditory device
MY190143A (en) Device and method for generating a high-band signal from non-linearly processed sub-ranges
MX2019011522A (es) Aparato y métodos para procesar una señal de audio.
MX366125B (es) Reproduccion de sonido diferencial.
WO2016020511A3 (de) Verfahren zur senkung der verständlichkeit von sprachsignalen und trennbauteil zur beeinflussung der schallübertragung
JP2010136173A5 (es)
MX2016012695A (es) Metodo y aparato para emitir una señal acustica, y medio de grabacion legible en computadora.
ATE484160T1 (de) Verfahren zur rückkopplungslöschung in einem hörgerät und hörgerät
WO2011083979A3 (en) An apparatus for processing an audio signal and method thereof
EP2608201A3 (en) Signal processing apparatus and signal processing method
JP2017500780A5 (es)