MX2021007323A - Apparatus and method for source separation using an estimation and control of sound quality - Google Patents
Apparatus and method for source separation using an estimation and control of sound quality
- Publication number
- MX2021007323A
- Authority
- MX
- Mexico
- Prior art keywords
- signal
- audio
- residual
- estimated
- target
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- G10L21/0308—Voice signal separating characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/60—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for measuring the quality of voice signals
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Signal Processing (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Theoretical Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Biophysics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Biomedical Technology (AREA)
- Tone Control, Compression And Expansion, Limiting Amplitude (AREA)
- Circuit For Audible Band Transducer (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Abstract
An apparatus for generating a separated audio signal from an audio input signal is provided. The audio input signal comprises a target audio signal portion and a residual audio signal portion. The residual audio signal portion indicates a residual between the audio input signal and the target audio signal portion. The apparatus comprises a source separator (110), a determining module (120) and a signal processor (130). The source separator (110) is configured to determine an estimated target signal which depends on the audio input signal, the estimated target signal being an estimate of a signal that only comprises the target audio signal portion. The determining module (120) is configured to determine one or more result values depending on an estimated sound quality of the estimated target signal to obtain one or more parameter values, wherein the one or more parameter values are the one or more result values or depend on the one or more result values. The signal processor (130) is configured to generate the separated audio signal depending on the one or more parameter values and depending on at least one of the estimated target signal and the audio input signal and an estimated residual signal, the estimated residual signal being an estimate of a signal that only comprises the residual audio signal portion.
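The three-stage structure described in the abstract can be illustrated with a minimal Python sketch. Everything below is an assumption made for illustration and is not taken from the patent: the function names, the moving-average "separator", the quality score in the range 0 to 1, and the rule that remixes more of the estimated residual when the estimated quality is low. The abstract only states that the output depends on parameter values derived from an estimated sound quality.

```python
from typing import Callable, List, Sequence

Signal = List[float]


def separate_source(mixture: Sequence[float]) -> Signal:
    """Hypothetical stand-in for the source separator (110).

    A 3-tap moving average is used only so the sketch runs; a real
    separator would be a far more elaborate model.
    """
    n = len(mixture)
    out: Signal = []
    for i in range(n):
        window = mixture[max(0, i - 1):min(n, i + 2)]
        out.append(sum(window) / len(window))
    return out


def residual_gain_from_quality(quality: float, max_gain: float = 1.0) -> float:
    """Hypothetical mapping from a quality score to a parameter value.

    Assumes quality lies in [0, 1] (values outside are clipped). Lower
    estimated quality yields a larger gain for the residual.
    """
    q = min(1.0, max(0.0, quality))
    return max_gain * (1.0 - q)


def generate_separated_signal(
    mixture: Sequence[float],
    separator: Callable[[Sequence[float]], Signal],
    quality_estimator: Callable[[Signal], float],
) -> Signal:
    """Chain the three stages: separate, estimate quality, remix."""
    # Stage 1 (110): estimated target signal from the input mixture.
    target_est = separator(mixture)
    # Estimated residual taken as input minus estimated target (assumption).
    residual_est = [x - s for x, s in zip(mixture, target_est)]
    # Stage 2 (120): result value (quality) mapped to a parameter value (gain).
    gain = residual_gain_from_quality(quality_estimator(target_est))
    # Stage 3 (130): output built from estimated target and scaled residual.
    return [s + gain * r for s, r in zip(target_est, residual_est)]


if __name__ == "__main__":
    mix = [0.0, 1.0, 0.0, -1.0, 0.0]
    # A constant estimator stands in for a learned quality model.
    print(generate_separated_signal(mix, separate_source, lambda s: 0.5))
```

With this particular remixing rule, a quality score of 1 returns the estimated target unchanged, while a score of 0 adds the full estimated residual back and so recovers the input mixture. Other control rules are equally compatible with the abstract.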
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP18215707.3A EP3671739A1 (en) | 2018-12-21 | 2018-12-21 | Apparatus and method for source separation using an estimation and control of sound quality |
PCT/EP2019/086565 WO2020127900A1 (en) | 2018-12-21 | 2019-12-20 | Apparatus and method for source separation using an estimation and control of sound quality |
Publications (1)
Publication Number | Publication Date |
---|---|
MX2021007323A (es) | 2021-08-24 |
Family
ID=65011753
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
MX2021007323A (es) | 2018-12-21 | 2019-12-20 | Apparatus and method for source separation using an estimation and control of sound quality |
Country Status (10)
Country | Link |
---|---|
US (1) | US20210312939A1 (es) |
EP (2) | EP3671739A1 (es) |
JP (1) | JP7314279B2 (es) |
KR (1) | KR102630449B1 (es) |
CN (1) | CN113574597B (es) |
BR (1) | BR112021012308A2 (es) |
CA (1) | CA3124017C (es) |
ES (1) | ES2966063T3 (es) |
MX (1) | MX2021007323A (es) |
WO (1) | WO2020127900A1 (es) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116997962A (zh) * | 2020-11-30 | 2023-11-03 | 杜比国际公司 | Robust intrusive perceptual audio quality assessment based on convolutional neural networks |
CN113470689B (zh) * | 2021-08-23 | 2024-01-30 | 杭州国芯科技股份有限公司 | A speech separation method |
WO2023073596A1 (en) * | 2021-10-27 | 2023-05-04 | WingNut Films Productions Limited | Audio source separation processing workflow systems and methods |
US11763826B2 (en) | 2021-10-27 | 2023-09-19 | WingNut Films Productions Limited | Audio source separation processing pipeline systems and methods |
US20230126779A1 (en) * | 2021-10-27 | 2023-04-27 | WingNut Films Productions Limited | Audio Source Separation Systems and Methods |
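CN113850246B (zh) * | 2021-11-30 | 2022-02-18 | 杭州一知智能科技有限公司 | Method and system for sound source localization and sound source separation based on a dual-consistency network |
CN117475360B (zh) * | 2023-12-27 | 2024-03-26 | 南京纳实医学科技有限公司 | Biometric feature extraction and analysis method for audio and video characteristics based on improved MLSTM-FCN |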
Family Cites Families (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1808571A (zh) * | 2005-01-19 | 2006-07-26 | 松下电器产业株式会社 | Sound signal separation system and method |
US7464029B2 (en) * | 2005-07-22 | 2008-12-09 | Qualcomm Incorporated | Robust separation of speech signals in a noisy environment |
EP2375409A1 (en) * | 2010-04-09 | 2011-10-12 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder, audio decoder and related methods for processing multi-channel audio signals using complex prediction |
DE102011084035A1 (de) * | 2011-10-05 | 2013-04-11 | Nero Ag | Apparatus, method and computer program for evaluating a perceived audio quality |
EP2747081A1 (en) | 2012-12-18 | 2014-06-25 | Oticon A/s | An audio processing device comprising artifact reduction |
SG11201507066PA (en) * | 2013-03-05 | 2015-10-29 | Fraunhofer Ges Forschung | Apparatus and method for multichannel direct-ambient decomposition for audio signal processing |
EP2790419A1 (en) * | 2013-04-12 | 2014-10-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for center signal scaling and stereophonic enhancement based on a signal-to-downmix ratio |
GB2516483B (en) * | 2013-07-24 | 2018-07-18 | Canon Kk | Sound source separation method |
JP6143887B2 (ja) * | 2013-12-26 | 2017-06-07 | 株式会社東芝 | Method, electronic device and program |
WO2016033269A1 (en) * | 2014-08-28 | 2016-03-03 | Analog Devices, Inc. | Audio processing using an intelligent microphone |
US10397711B2 (en) * | 2015-09-24 | 2019-08-27 | Gn Hearing A/S | Method of determining objective perceptual quantities of noisy speech signals |
MX2018003529A (es) * | 2015-09-25 | 2018-08-01 | Fraunhofer Ges Forschung | Encoder and method for encoding an audio signal with reduced background noise using linear predictive coding. |
KR20170101629A (ko) * | 2016-02-29 | 2017-09-06 | 한국전자통신연구원 | Apparatus and method for providing a multilingual audio service based on a stereo audio signal |
EP3220661B1 (en) * | 2016-03-15 | 2019-11-20 | Oticon A/s | A method for predicting the intelligibility of noisy and/or enhanced speech and a binaural hearing system |
EP3453187B1 (en) * | 2016-05-25 | 2020-05-13 | Huawei Technologies Co., Ltd. | Audio signal processing stage, audio signal processing apparatus and audio signal processing method |
DK3252766T3 (da) * | 2016-05-30 | 2021-09-06 | Oticon As | Audio processing device and method for estimating the signal-to-noise ratio of a sound signal |
US10861478B2 (en) * | 2016-05-30 | 2020-12-08 | Oticon A/S | Audio processing device and a method for estimating a signal-to-noise-ratio of a sound signal |
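CN106531190B (zh) * | 2016-10-12 | 2020-05-05 | 科大讯飞股份有限公司 | Speech quality evaluation method and apparatus |
CN106847301A (zh) * | 2017-01-03 | 2017-06-13 | 东南大学 | A binaural speech separation method based on compressed sensing and spatial orientation information |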
EP3474280B1 (en) * | 2017-10-19 | 2021-07-07 | Goodix Technology (HK) Company Limited | Signal processor for speech signal enhancement |
CN107993671A (zh) * | 2017-12-04 | 2018-05-04 | 南京地平线机器人技术有限公司 | Sound processing method, apparatus and electronic device |
EP3573058B1 (en) * | 2018-05-23 | 2021-02-24 | Harman Becker Automotive Systems GmbH | Dry sound and ambient sound separation |
2018
- 2018-12-21: EP application EP18215707.3A, publication EP3671739A1 (en), status: Withdrawn (not active)
2019
- 2019-12-20: MX application MX2021007323A, publication MX2021007323A (es), status: unknown
- 2019-12-20: EP application EP19824332.1A, publication EP3899936B1 (en), status: Active
- 2019-12-20: WO application PCT/EP2019/086565, publication WO2020127900A1 (en), status: Search and Examination
- 2019-12-20: KR application KR1020217023148A, publication KR102630449B1 (ko), status: IP Right Grant
- 2019-12-20: BR application BR112021012308-3A, publication BR112021012308A2 (pt), status: unknown
- 2019-12-20: ES application ES19824332T, publication ES2966063T3 (es), status: Active
- 2019-12-20: JP application JP2021535739A, publication JP7314279B2 (ja), status: Active
- 2019-12-20: CA application CA3124017A, publication CA3124017C (en), status: Active
- 2019-12-20: CN application CN201980092879.8A, publication CN113574597B (zh), status: Active
2021
- 2021-06-21: US application US17/353,297, publication US20210312939A1 (en), status: Pending
Also Published As
Publication number | Publication date |
---|---|
EP3671739A1 (en) | 2020-06-24 |
BR112021012308A2 (pt) | 2021-09-08 |
EP3899936B1 (en) | 2023-09-06 |
ES2966063T3 (es) | 2024-04-18 |
CA3124017C (en) | 2024-01-16 |
WO2020127900A1 (en) | 2020-06-25 |
KR102630449B1 (ko) | 2024-01-31 |
JP7314279B2 (ja) | 2023-07-25 |
CN113574597B (zh) | 2024-04-12 |
JP2022514878A (ja) | 2022-02-16 |
CA3124017A1 (en) | 2020-06-25 |
EP3899936C0 (en) | 2023-09-06 |
KR20210110622A (ko) | 2021-09-08 |
CN113574597A (zh) | 2021-10-29 |
US20210312939A1 (en) | 2021-10-07 |
EP3899936A1 (en) | 2021-10-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
MX2021007323A (es) | | Apparatus and method for source separation using an estimation and control of sound quality. |
MX2018004828A (es) | | Method and apparatus for generating a filtered audio signal realizing elevation rendering. |
WO2020031061A3 (en) | | Mvd precision for affine |
MX2018005090A (es) | | Apparatus, method or computer program for generating a sound field description. |
EP3373292A3 (en) | | Method for controlling artificial intelligence system that performs multilingual processing |
WO2016009444A3 (en) | | Music performance system and method thereof |
MX2016012543A (es) | | Acoustic signal reproduction method and apparatus, and computer-readable recording medium. |
MY189000A (en) | | Audio processing device and method, and program therefor |
PH12019500347A1 (en) | | Method for determining change in distance, location prompting method and apparatus and system thereof |
PL3216236T3 (pl) | | Apparatus and method for generating output signals based on an audio source signal, sound reproduction system and loudspeaker signal |
MX337845B (es) | | Apparatus for providing an audio signal for reproduction by a sound transducer, system, method and computer program. |
MY188581A (en) | | Headtracking for parametric binaural output system and method |
MX2021006078A (es) | | Method for monitoring a livestock facility and/or livestock animals in a livestock facility using improved sound processing techniques. |
DK2306756T3 (da) | | Method for fine-tuning a hearing aid, and hearing aid |
US9530429B2 (en) | | Reverberation suppression apparatus used for auditory device |
MY190143A (en) | | Device and method for generating a high-band signal from non-linearly processed sub-ranges |
MX2019011522A (es) | | Apparatus and methods for processing an audio signal. |
MX366125B (es) | | Differential sound reproduction. |
WO2016020511A3 (de) | | Method for reducing the intelligibility of speech signals and separating component for influencing sound transmission |
JP2010136173A5 (es) | | |
MX2016012695A (es) | | Method and apparatus for outputting an acoustic signal, and computer-readable recording medium. |
ATE484160T1 (de) | | Method for feedback cancellation in a hearing aid, and hearing aid |
WO2011083979A3 (en) | | An apparatus for processing an audio signal and method thereof |
EP2608201A3 (en) | | Signal processing apparatus and signal processing method |
JP2017500780A5 (es) | | |