MX2020012886A - Audio similarity evaluator, audio encoder, methods and computer program. - Google Patents

Audio similarity evaluator, audio encoder, methods and computer program.

Info

Publication number
MX2020012886A
Authority
MX
Mexico
Prior art keywords
audio
similarity evaluator
evaluator
modulation
methods
Prior art date
Application number
MX2020012886A
Other languages
English (en)
Inventor
Bernd Edler
Sascha Disch
Andreas Niedermeier
Steven van de Par
Elena Burdiel Pérez
Original Assignee
Fraunhofer Ges Forschung
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Ges Forschung
Publication of MX2020012886A

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038 Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • G10L21/0388 Details of processing therefor
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/22 Mode decision, i.e. based on audio signal content versus external parameters
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/09 Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26 Pre-filtering or post-filtering
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038 Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00 Computing arrangements using knowledge-based models
    • G06N5/04 Inference or reasoning models

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Electrophonic Musical Instruments (AREA)
  • Indexing, Searching, Synchronizing, And The Amount Of Synchronization Travel Of Record Carriers (AREA)
  • Toys (AREA)

Abstract

An audio similarity evaluator obtains envelope signals for a plurality of frequency ranges on the basis of an input audio signal. The audio similarity evaluator is configured to obtain modulation information associated with the envelope signals for a plurality of modulation frequency ranges, wherein the modulation information describes the modulation of the envelope signals. The audio similarity evaluator is configured to compare the obtained modulation information with reference modulation information associated with a reference audio signal, in order to obtain information about a similarity between the input audio signal and the reference audio signal. An audio encoder uses such an audio similarity evaluator. Another audio similarity evaluator uses a neural network trained using the audio similarity evaluator.
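The processing chain in the abstract (per-band envelopes, per-envelope modulation information, comparison against a reference) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the FFT-mask band split, the analytic-signal envelope, the summed modulation-band energy, the squared-difference distance, and all function and parameter names (`band_envelopes`, `modulation_info`, `similarity_distance`, `band_edges_hz`, `mod_edges_hz`) are hypothetical stand-ins for whatever filterbanks and comparison rule the actual evaluator uses.

```python
import numpy as np


def band_envelopes(x, fs, band_edges_hz):
    """Return one envelope signal per frequency range (assumed method).

    Each band is isolated with a brick-wall FFT mask over positive
    frequencies only; doubling that one-sided spectrum yields the analytic
    signal of the band, whose magnitude is used as the envelope.
    Lower band edges are assumed to be greater than 0 Hz.
    """
    spectrum = np.fft.fft(x)
    freqs = np.fft.fftfreq(len(x), d=1.0 / fs)
    envelopes = []
    for lo, hi in band_edges_hz:
        mask = (freqs >= lo) & (freqs < hi)  # positive-frequency bins in band
        analytic = np.fft.ifft(2.0 * spectrum * mask)
        envelopes.append(np.abs(analytic))
    return np.array(envelopes)


def modulation_info(envelopes, fs, mod_edges_hz):
    """Return envelope energy per (frequency band, modulation band).

    The envelope spectrum is computed with a real FFT and its power is
    summed inside each modulation frequency range (assumed feature).
    """
    n = envelopes.shape[1]
    env_power = np.abs(np.fft.rfft(envelopes, axis=1)) ** 2 / n
    mod_freqs = np.fft.rfftfreq(n, d=1.0 / fs)
    info = np.empty((envelopes.shape[0], len(mod_edges_hz)))
    for j, (lo, hi) in enumerate(mod_edges_hz):
        sel = (mod_freqs >= lo) & (mod_freqs < hi)
        info[:, j] = env_power[:, sel].sum(axis=1)
    return info


def similarity_distance(x, ref, fs, band_edges_hz, mod_edges_hz):
    """Compare modulation information of x against that of ref.

    Returns a summed squared difference (assumed measure): 0.0 means the
    two modulation representations are identical, larger means less similar.
    """
    a = modulation_info(band_envelopes(x, fs, band_edges_hz), fs, mod_edges_hz)
    b = modulation_info(band_envelopes(ref, fs, band_edges_hz), fs, mod_edges_hz)
    return float(np.sum((a - b) ** 2))
```

As a usage example, a 1 kHz tone amplitude-modulated at 8 Hz can be compared with the unmodulated tone: the modulated signal puts energy into a modulation range that contains 8 Hz, so the distance to the plain tone is positive, while the distance of any signal to itself is zero. A perceptually motivated evaluator would replace the brick-wall bands with an auditory filterbank; this sketch only shows the data flow.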
MX2020012886A 2018-05-30 2019-05-29 Audio similarity evaluator, audio encoder, methods and computer program. MX2020012886A (es)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP18175251 2018-05-30
EP18198992.2A EP3576088A1 (en) 2018-05-30 2018-10-05 Audio similarity evaluator, audio encoder, methods and computer program
PCT/EP2019/064105 WO2019229190A1 (en) 2018-05-30 2019-05-29 Audio similarity evaluator, audio encoder, methods and computer program

Publications (1)

Publication Number Publication Date
MX2020012886A (es) 2021-04-28

Family

ID=62567262

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2020012886A MX2020012886A (es) 2018-05-30 2019-05-29 Audio similarity evaluator, audio encoder, methods and computer program.

Country Status (10)

Country Link
US (1) US12051431B2 (es)
EP (3) EP3576088A1 (es)
JP (1) JP7301073B2 (es)
KR (1) KR102640748B1 (es)
CN (1) CN112470220B (es)
BR (1) BR112020024361A2 (es)
CA (2) CA3165021A1 (es)
ES (1) ES2960785T3 (es)
MX (1) MX2020012886A (es)
WO (1) WO2019229190A1 (es)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR3084489B1 (fr) * 2018-07-26 2020-09-11 Etat Francais Represente Par Le Delegue General Pour Larmement Method for detecting at least one compromised computing device within an information system
CN113593586A (zh) * 2020-04-15 2021-11-02 华为技术有限公司 Audio signal encoding method, decoding method, encoding device and decoding device
WO2022076404A1 (en) * 2020-10-05 2022-04-14 The Trustees Of Columbia University In The City Of New York Systems and methods for brain-informed speech separation
CN115497485B (zh) * 2021-06-18 2024-10-18 华为技术有限公司 Three-dimensional audio signal encoding method, apparatus, encoder and system
CN116386611B (zh) * 2023-04-20 2023-10-13 珠海谷田科技有限公司 Denoising method for a teaching sound field environment

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3183072B2 (ja) * 1994-12-19 2001-07-03 松下電器産業株式会社 Speech coding device
JPH08263099A (ja) * 1995-03-23 1996-10-11 Toshiba Corp Coding device
JP3762204B2 (ja) 2000-09-07 2006-04-05 三菱電機株式会社 Inspection method and inspection apparatus for speech encoding/decoding equipment
US6842733B1 (en) * 2000-09-15 2005-01-11 Mindspeed Technologies, Inc. Signal processing system for filtering spectral content of a signal for speech coding
DE10123366C1 (de) 2001-05-14 2002-08-08 Fraunhofer Ges Forschung Apparatus for analyzing an audio signal with regard to rhythm information
JP4272897B2 (ja) 2002-01-30 2009-06-03 パナソニック株式会社 Encoding device, decoding device and methods thereof
US7565213B2 (en) 2004-05-07 2009-07-21 Gracenote, Inc. Device and method for analyzing an information signal
EP1782419A1 (en) * 2004-08-17 2007-05-09 Koninklijke Philips Electronics N.V. Scalable audio coding
CN101053018A (zh) * 2004-11-01 2007-10-10 皇家飞利浦电子股份有限公司 Parametric audio coding comprising amplitude envelopes
KR100803205B1 (ko) 2005-07-15 2008-02-14 삼성전자주식회사 Method and apparatus for encoding/decoding a low bit-rate audio signal
WO2007034375A2 (en) * 2005-09-23 2007-03-29 Koninklijke Philips Electronics N.V. Determination of a distortion measure for audio encoding
US20070083365A1 (en) 2005-10-06 2007-04-12 Dts, Inc. Neural network classifier for separating audio sources from a monophonic audio signal
KR101149448B1 (ko) 2007-02-12 2012-05-25 삼성전자주식회사 Audio encoding and decoding apparatus and method thereof
EP2362375A1 (en) 2010-02-26 2011-08-31 Fraunhofer-Gesellschaft zur Förderung der Angewandten Forschung e.V. Apparatus and method for modifying an audio signal using harmonic locking
JP5533502B2 (ja) * 2010-09-28 2014-06-25 富士通株式会社 Audio encoding device, audio encoding method, and computer program for audio encoding
AU2012218409B2 (en) * 2011-02-18 2016-09-15 Ntt Docomo, Inc. Speech decoder, speech encoder, speech decoding method, speech encoding method, speech decoding program, and speech encoding program
EP2951822B1 (en) 2013-01-29 2019-11-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder, audio decoder, method for providing an encoded audio information, method for providing a decoded audio information, computer program and encoded representation using a signal-adaptive bandwidth extension
EP2830061A1 (en) * 2013-07-22 2015-01-28 Fraunhofer Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for encoding and decoding an encoded audio signal using temporal noise/patch shaping
JP6306175B2 (ja) 2013-10-31 2018-04-04 フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ Audio decoder for providing decoded audio information using error concealment based on a time-domain excitation signal, and method for providing decoded audio information
US10163447B2 (en) * 2013-12-16 2018-12-25 Qualcomm Incorporated High-band signal modeling
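CN104485114B (zh) * 2014-11-27 2018-03-06 湖南省计量检测研究院 Method for objective speech quality assessment based on auditory perception characteristics
JP6668372B2 (ja) * 2015-02-26 2020-03-18 フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ Apparatus and method for processing an audio signal to obtain a processed audio signal using a target time-domain envelope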
EP3402217A1 (en) * 2017-05-09 2018-11-14 GN Hearing A/S Speech intelligibility-based hearing devices and associated methods

Also Published As

Publication number Publication date
JP2021526240A (ja) 2021-09-30
KR102640748B1 (ko) 2024-02-27
KR20210021490A (ko) 2021-02-26
CA3165021A1 (en) 2019-12-05
WO2019229190A1 (en) 2019-12-05
CN112470220B (zh) 2024-07-05
EP3803865C0 (en) 2023-08-09
JP7301073B2 (ja) 2023-06-30
BR112020024361A2 (pt) 2021-03-02
EP4270393A3 (en) 2023-12-20
US20210082447A1 (en) 2021-03-18
CA3101911A1 (en) 2019-12-05
US12051431B2 (en) 2024-07-30
CA3101911C (en) 2023-12-12
EP3576088A1 (en) 2019-12-04
EP4270393A2 (en) 2023-11-01
CN112470220A (zh) 2021-03-09
ES2960785T3 (es) 2024-03-06
EP3803865A1 (en) 2021-04-14
EP3803865B1 (en) 2023-08-09

Similar Documents

Publication Publication Date Title
MX2020012886A (es) Audio similarity evaluator, audio encoder, methods and computer program.
GB2574555A (en) Adaptable processing components
EP3575980A3 (en) Intelligent data quality
MX2018000989A (es) A method and a system for decomposing an acoustic signal into sound objects, a sound object and its use.
MX2019006756A (es) Blockchain-based commodity claim method, apparatus and electronic device.
MX2019006199A (es) Blockchain-based service execution method and apparatus, and electronic device.
WO2015178992A3 (en) Processing signals in a quantum computing system
AU2016409886A1 (en) Intelligent list reading
EP4340397A3 (en) Audio processing device and method, and program therefor
MX2015017316A (es) Method and apparatus for performing analog-to-digital conversion of multiple input signals.
MX2018001037A (es) Extraction of carrier signals from modulated signals.
PH12016501396B1 (en) Harmonic bandwidth extension of audio signals
MX2018005090A (es) Apparatus, method or computer program for generating a sound field description.
ZA202108890B (en) Audio decoder, apparatus for determining a set of values defining characteristics of a filter, methods for providing a decoded audio representation, methods for determining a set of values defining characteristics of a filter and computer program
WO2015193226A9 (en) System and methods for transmitting information using inaudible acoustic signals
GB201108885D0 (en) Processing audio signals
MX2018001483A (es) Systems and methods for detecting tornadoes.
MY185944A (en) Actuatable motion base system
GB2565701A (en) Repair diagnostic system and method
WO2018151503A3 (ko) Gesture recognition method and apparatus
AU2017247045A1 (en) Audio fingerprinting based on audio energy characteristics
GB2515920A (en) Physical Performance Assessment
WO2019185529A9 (en) Apparatus and method for providing a fingerprint of an input signal
MY180981A (en) Aquatic time synchronisation system and method of determining a time offset
MX2019012095A (es) Guide generation for music-related content.