MX2020012886A - Evaluador de similitud de audio, codificador de audio, metodos y programa de computadora. - Google Patents
Evaluador de similitud de audio, codificador de audio, metodos y programa de computadora.Info
- Publication number
- MX2020012886A MX2020012886A MX2020012886A MX2020012886A MX2020012886A MX 2020012886 A MX2020012886 A MX 2020012886A MX 2020012886 A MX2020012886 A MX 2020012886A MX 2020012886 A MX2020012886 A MX 2020012886A MX 2020012886 A MX2020012886 A MX 2020012886A
- Authority
- MX
- Mexico
- Prior art keywords
- audio
- similarity evaluator
- evaluator
- modulation
- methods
- Prior art date
Links
- 238000004590 computer program Methods 0.000 title 1
- 238000000034 method Methods 0.000 title 1
- 230000005236 sound signal Effects 0.000 abstract 3
- 230000001537 neural effect Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
- G10L21/0388—Details of processing therefor
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/22—Mode decision, i.e. based on audio signal content versus external parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/09—Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/04—Inference or reasoning models
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Signal Processing (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Electrophonic Musical Instruments (AREA)
- Indexing, Searching, Synchronizing, And The Amount Of Synchronization Travel Of Record Carriers (AREA)
- Toys (AREA)
Abstract
Un evaluador de similitud de audio obtiene señales de envolvente para una pluralidad de rangos de frecuencia con base en una señal de audio de entrada. El evaluador de similitud de audio está configurado para obtener una información de modulación asociada con las señales de envolvente para una pluralidad de rangos de frecuencia de modulación, en donde la información de modulación describe la modulación de las señales de envolvente. El evaluador de similitud de audio está configurado para comparar la información de modulación obtenida con una información de modulación de referencia asociada con una señal de audio de referencia, con el fin de obtener una información acerca de una similitud entre la señal de audio de entrada y la señal de audio de referencia. Un codificador de audio utiliza tal evaluador de similitud de audio. Otro evaluador de similitud de audio utiliza una red neuronal entrenada utilizando el evaluador de similitud de audio.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP18175251 | 2018-05-30 | ||
EP18198992.2A EP3576088A1 (en) | 2018-05-30 | 2018-10-05 | Audio similarity evaluator, audio encoder, methods and computer program |
PCT/EP2019/064105 WO2019229190A1 (en) | 2018-05-30 | 2019-05-29 | Audio similarity evaluator, audio encoder, methods and computer program |
Publications (1)
Publication Number | Publication Date |
---|---|
MX2020012886A true MX2020012886A (es) | 2021-04-28 |
Family
ID=62567262
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
MX2020012886A MX2020012886A (es) | 2018-05-30 | 2019-05-29 | Evaluador de similitud de audio, codificador de audio, metodos y programa de computadora. |
Country Status (10)
Country | Link |
---|---|
US (1) | US12051431B2 (es) |
EP (3) | EP3576088A1 (es) |
JP (1) | JP7301073B2 (es) |
KR (1) | KR102640748B1 (es) |
CN (1) | CN112470220B (es) |
BR (1) | BR112020024361A2 (es) |
CA (2) | CA3165021A1 (es) |
ES (1) | ES2960785T3 (es) |
MX (1) | MX2020012886A (es) |
WO (1) | WO2019229190A1 (es) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR3084489B1 (fr) * | 2018-07-26 | 2020-09-11 | Etat Francais Represente Par Le Delegue General Pour Larmement | Procede de detection d’au moins un equipement informatique compromis au sein d’un systeme d’information |
CN113593586A (zh) * | 2020-04-15 | 2021-11-02 | 华为技术有限公司 | 音频信号编码方法、解码方法、编码设备以及解码设备 |
WO2022076404A1 (en) * | 2020-10-05 | 2022-04-14 | The Trustees Of Columbia University In The City Of New York | Systems and methods for brain-informed speech separation |
CN115497485B (zh) * | 2021-06-18 | 2024-10-18 | 华为技术有限公司 | 三维音频信号编码方法、装置、编码器和系统 |
CN116386611B (zh) * | 2023-04-20 | 2023-10-13 | 珠海谷田科技有限公司 | 一种教学声场环境的去噪方法 |
Family Cites Families (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3183072B2 (ja) * | 1994-12-19 | 2001-07-03 | 松下電器産業株式会社 | 音声符号化装置 |
JPH08263099A (ja) * | 1995-03-23 | 1996-10-11 | Toshiba Corp | 符号化装置 |
JP3762204B2 (ja) | 2000-09-07 | 2006-04-05 | 三菱電機株式会社 | 音声符号化・復号化機器の検査方法および検査装置 |
US6842733B1 (en) * | 2000-09-15 | 2005-01-11 | Mindspeed Technologies, Inc. | Signal processing system for filtering spectral content of a signal for speech coding |
DE10123366C1 (de) | 2001-05-14 | 2002-08-08 | Fraunhofer Ges Forschung | Vorrichtung zum Analysieren eines Audiosignals hinsichtlich von Rhythmusinformationen |
JP4272897B2 (ja) | 2002-01-30 | 2009-06-03 | パナソニック株式会社 | 符号化装置、復号化装置およびその方法 |
US7565213B2 (en) | 2004-05-07 | 2009-07-21 | Gracenote, Inc. | Device and method for analyzing an information signal |
EP1782419A1 (en) * | 2004-08-17 | 2007-05-09 | Koninklijke Philips Electronics N.V. | Scalable audio coding |
CN101053018A (zh) * | 2004-11-01 | 2007-10-10 | 皇家飞利浦电子股份有限公司 | 包括幅度包络的参数音频编码 |
KR100803205B1 (ko) | 2005-07-15 | 2008-02-14 | 삼성전자주식회사 | 저비트율 오디오 신호 부호화/복호화 방법 및 장치 |
WO2007034375A2 (en) * | 2005-09-23 | 2007-03-29 | Koninklijke Philips Electronics N.V. | Determination of a distortion measure for audio encoding |
US20070083365A1 (en) | 2005-10-06 | 2007-04-12 | Dts, Inc. | Neural network classifier for separating audio sources from a monophonic audio signal |
KR101149448B1 (ko) | 2007-02-12 | 2012-05-25 | 삼성전자주식회사 | 오디오 부호화 및 복호화 장치와 그 방법 |
EP2362375A1 (en) | 2010-02-26 | 2011-08-31 | Fraunhofer-Gesellschaft zur Förderung der Angewandten Forschung e.V. | Apparatus and method for modifying an audio signal using harmonic locking |
JP5533502B2 (ja) * | 2010-09-28 | 2014-06-25 | 富士通株式会社 | オーディオ符号化装置、オーディオ符号化方法及びオーディオ符号化用コンピュータプログラム |
AU2012218409B2 (en) * | 2011-02-18 | 2016-09-15 | Ntt Docomo, Inc. | Speech decoder, speech encoder, speech decoding method, speech encoding method, speech decoding program, and speech encoding program |
EP2951822B1 (en) | 2013-01-29 | 2019-11-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder, audio decoder, method for providing an encoded audio information, method for providing a decoded audio information, computer program and encoded representation using a signal-adaptive bandwidth extension |
EP2830061A1 (en) * | 2013-07-22 | 2015-01-28 | Fraunhofer Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for encoding and decoding an encoded audio signal using temporal noise/patch shaping |
JP6306175B2 (ja) | 2013-10-31 | 2018-04-04 | フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ | 時間ドメイン励振信号に基づくエラーコンシールメントを用いて、復号化されたオーディオ情報を提供するオーディオデコーダおよび復号化されたオーディオ情報を提供する方法 |
US10163447B2 (en) * | 2013-12-16 | 2018-12-25 | Qualcomm Incorporated | High-band signal modeling |
CN104485114B (zh) * | 2014-11-27 | 2018-03-06 | 湖南省计量检测研究院 | 一种基于听觉感知特性的语音质量客观评估的方法 |
JP6668372B2 (ja) * | 2015-02-26 | 2020-03-18 | フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ | 目標時間領域エンベロープを用いて処理されたオーディオ信号を得るためにオーディオ信号を処理するための装置および方法 |
EP3402217A1 (en) * | 2017-05-09 | 2018-11-14 | GN Hearing A/S | Speech intelligibility-based hearing devices and associated methods |
-
2018
- 2018-10-05 EP EP18198992.2A patent/EP3576088A1/en not_active Withdrawn
-
2019
- 2019-05-29 CA CA3165021A patent/CA3165021A1/en active Pending
- 2019-05-29 CN CN201980049602.7A patent/CN112470220B/zh active Active
- 2019-05-29 BR BR112020024361-2A patent/BR112020024361A2/pt unknown
- 2019-05-29 EP EP19737471.3A patent/EP3803865B1/en active Active
- 2019-05-29 MX MX2020012886A patent/MX2020012886A/es unknown
- 2019-05-29 EP EP23180176.2A patent/EP4270393A3/en active Pending
- 2019-05-29 WO PCT/EP2019/064105 patent/WO2019229190A1/en active Search and Examination
- 2019-05-29 ES ES19737471T patent/ES2960785T3/es active Active
- 2019-05-29 JP JP2020567028A patent/JP7301073B2/ja active Active
- 2019-05-29 CA CA3101911A patent/CA3101911C/en active Active
- 2019-05-29 KR KR1020207037819A patent/KR102640748B1/ko active IP Right Grant
-
2020
- 2020-11-27 US US17/105,845 patent/US12051431B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
JP2021526240A (ja) | 2021-09-30 |
KR102640748B1 (ko) | 2024-02-27 |
KR20210021490A (ko) | 2021-02-26 |
CA3165021A1 (en) | 2019-12-05 |
WO2019229190A1 (en) | 2019-12-05 |
CN112470220B (zh) | 2024-07-05 |
EP3803865C0 (en) | 2023-08-09 |
JP7301073B2 (ja) | 2023-06-30 |
BR112020024361A2 (pt) | 2021-03-02 |
EP4270393A3 (en) | 2023-12-20 |
US20210082447A1 (en) | 2021-03-18 |
CA3101911A1 (en) | 2019-12-05 |
US12051431B2 (en) | 2024-07-30 |
CA3101911C (en) | 2023-12-12 |
EP3576088A1 (en) | 2019-12-04 |
EP4270393A2 (en) | 2023-11-01 |
CN112470220A (zh) | 2021-03-09 |
ES2960785T3 (es) | 2024-03-06 |
EP3803865A1 (en) | 2021-04-14 |
EP3803865B1 (en) | 2023-08-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
MX2020012886A (es) | Evaluador de similitud de audio, codificador de audio, metodos y programa de computadora. | |
GB2574555A (en) | Adaptable processing components | |
EP3575980A3 (en) | Intelligent data quality | |
MX2018000989A (es) | Un metodo y un sistema para descomposicion de señal acustica en objetos de sonido, un objeto de sonido y su uso. | |
MX2019006756A (es) | Metodo, aparato y dispositivo electronico de reclamacion de productos basicos a base de cadena de bloques. | |
MX2019006199A (es) | Metodo y aparato de ejecucion de servicios basados en cadena de bloques y dispositivo electronico. | |
WO2015178992A3 (en) | Processing signals in a quantum computing system | |
AU2016409886A1 (en) | Intelligent list reading | |
EP4340397A3 (en) | Audio processing device and method, and program therefor | |
MX2015017316A (es) | Metodo y aparato para realizar conversion de analogico a digital de señales de entrada multiple. | |
MX2018001037A (es) | Extraccion de señales portadoras a partir de señales moduladas. | |
PH12016501396B1 (en) | Harmonic bandwidth extension of audio signals | |
MX2018005090A (es) | Aparato, metodo o programa de computadora para generar una descripcion de campo de sonido. | |
ZA202108890B (en) | Audio decoder, apparatus for determining a set of values defining characteristics of a filter, methods for providing a decoded audio representation, methods for determining a set of values defining characteristics of a filter and computer program | |
WO2015193226A9 (en) | System and methods for transmitting information using inaudible acoustic signals | |
GB201108885D0 (en) | Processing audio signals | |
MX2018001483A (es) | Sistemas y metodos para detectar tornados. | |
MY185944A (en) | Actuatable motion base system | |
GB2565701A (en) | Repair diagnostic system and method | |
WO2018151503A3 (ko) | 제스처 인식 방법 및 장치 | |
AU2017247045A1 (en) | Audio fingerprinting based on audio energy characteristics | |
GB2515920A (en) | Physical Performance Assessment | |
WO2019185529A9 (en) | Apparatus and method for providing a fingerprint of an input signal | |
MY180981A (en) | Aquatic time synchronisation system and method of determining a time offset | |
MX2019012095A (es) | Generacion de guia para contenido relacionado con la musica. |