ES2531137T3 - Clasificación de señales de audio basada en marcos - Google Patents

Clasificación de señales de audio basada en marcos Download PDF

Info

Publication number
ES2531137T3
ES2531137T3 ES11717266T ES11717266T ES2531137T3 ES 2531137 T3 ES2531137 T3 ES 2531137T3 ES 11717266 T ES11717266 T ES 11717266T ES 11717266 T ES11717266 T ES 11717266T ES 2531137 T3 ES2531137 T3 ES 2531137T3
Authority
ES
Spain
Prior art keywords
frames
characteristic
range
fraction
speaking
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
ES11717266T
Other languages
English (en)
Inventor
Volodya Grancharov
Sebastian Näslund
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Telefonaktiebolaget LM Ericsson AB
Original Assignee
Telefonaktiebolaget LM Ericsson AB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telefonaktiebolaget LM Ericsson AB filed Critical Telefonaktiebolaget LM Ericsson AB
Application granted granted Critical
Publication of ES2531137T3 publication Critical patent/ES2531137T3/es
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

Un método de clasificación de señales de audio basado en marcos o cuadros, caracterizado por los pasos de: determinar (S1), para cada uno de un número predeterminado de marcos consecutivos, medidas de características que representan al menos las siguientes características: - un coeficiente de auto-correlación (Tn), - una energía de señal de marco (En) en un dominio comprimido, - una variación de energía entre marcos; comparar (S2) cada medida de característica determinada con al menos un correspondiente intervalo predeterminado de características; calcular (S3), para cada intervalo de características, una medida de fracción (Φ ;1 - Φ 5) que representa el número total de medidas correspondientes de características (Tn, En, Φ ;;En) que caen dentro del intervalo de características; clasificar (S4) el último de los marcos consecutivos como habla si cada medida de fracción se sitúa dentro de un intervalo de fracción correspondiente, y como no-habla en caso contrario.
ES11717266T 2011-04-28 2011-04-28 Clasificación de señales de audio basada en marcos Active ES2531137T3 (es)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/EP2011/056761 WO2012146290A1 (en) 2011-04-28 2011-04-28 Frame based audio signal classification

Publications (1)

Publication Number Publication Date
ES2531137T3 true ES2531137T3 (es) 2015-03-11

Family

ID=44626095

Family Applications (1)

Application Number Title Priority Date Filing Date
ES11717266T Active ES2531137T3 (es) 2011-04-28 2011-04-28 Clasificación de señales de audio basada en marcos

Country Status (5)

Country Link
US (1) US9240191B2 (es)
EP (1) EP2702585B1 (es)
BR (1) BR112013026333B1 (es)
ES (1) ES2531137T3 (es)
WO (1) WO2012146290A1 (es)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5850216B2 (ja) 2010-04-13 2016-02-03 ソニー株式会社 信号処理装置および方法、符号化装置および方法、復号装置および方法、並びにプログラム
JP6037156B2 (ja) * 2011-08-24 2016-11-30 ソニー株式会社 符号化装置および方法、並びにプログラム
US20130090926A1 (en) * 2011-09-16 2013-04-11 Qualcomm Incorporated Mobile device context information using speech detection
JP6593173B2 (ja) 2013-12-27 2019-10-23 ソニー株式会社 復号化装置および方法、並びにプログラム
CN104934032B (zh) * 2014-03-17 2019-04-05 华为技术有限公司 根据频域能量对语音信号进行处理的方法和装置
JP6596924B2 (ja) * 2014-05-29 2019-10-30 日本電気株式会社 音声データ処理装置、音声データ処理方法、及び、音声データ処理プログラム
CN105336338B (zh) 2014-06-24 2017-04-12 华为技术有限公司 音频编码方法和装置
CN106328169B (zh) * 2015-06-26 2018-12-11 中兴通讯股份有限公司 一种激活音修正帧数的获取方法、激活音检测方法和装置
EP3242295B1 (en) * 2016-05-06 2019-10-23 Nxp B.V. A signal processor
CN108074584A (zh) * 2016-11-18 2018-05-25 南京大学 一种基于信号多特征统计的音频信号分类方法
US10325588B2 (en) * 2017-09-28 2019-06-18 International Business Machines Corporation Acoustic feature extractor selected according to status flag of frame of acoustic signal
CN115294947B (zh) * 2022-07-29 2024-06-11 腾讯科技(深圳)有限公司 音频数据处理方法、装置、电子设备及介质

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE501981C2 (sv) * 1993-11-02 1995-07-03 Ericsson Telefon Ab L M Förfarande och anordning för diskriminering mellan stationära och icke stationära signaler
US5712953A (en) * 1995-06-28 1998-01-27 Electronic Data Systems Corporation System and method for classification of audio or audio/video signals based on musical content
SE9700772D0 (sv) 1997-03-03 1997-03-03 Ericsson Telefon Ab L M A high resolution post processing method for a speech decoder
US6983242B1 (en) 2000-08-21 2006-01-03 Mindspeed Technologies, Inc. Method for robust classification in speech coding
US6640208B1 (en) * 2000-09-12 2003-10-28 Motorola, Inc. Voiced/unvoiced speech classifier
US6993481B2 (en) * 2000-12-04 2006-01-31 Global Ip Sound Ab Detection of speech activity using feature model adaptation
US7127392B1 (en) * 2003-02-12 2006-10-24 The United States Of America As Represented By The National Security Agency Device for and method of detecting voice activity
CN100483509C (zh) 2006-12-05 2009-04-29 华为技术有限公司 声音信号分类方法和装置

Also Published As

Publication number Publication date
EP2702585B1 (en) 2014-12-31
US9240191B2 (en) 2016-01-19
BR112013026333A2 (pt) 2020-11-03
EP2702585A1 (en) 2014-03-05
BR112013026333B1 (pt) 2021-05-18
US20140046658A1 (en) 2014-02-13
WO2012146290A1 (en) 2012-11-01

Similar Documents

Publication Publication Date Title
ES2531137T3 (es) Clasificación de señales de audio basada en marcos
WO2014020502A3 (en) Markers associated with sensitivity to inhibitors of human double minute 2 (mdm2)
AR102615A1 (es) Analizador de analitos
ATE493290T1 (de) Alcotestgerät
MY170983A (en) Biomarker assays for detecting or measuring inhibition of tor kinase activity
MX2014004906A (es) Analisis y control de flujo de aerosol.
BR112016019836A2 (pt) método para analisar uma amostra de um sujeito, dispositivo de diagnóstico para utilização no diagnóstico da endometriose, kit, uso de um biomarcador, e, método para aumentar uma resposta de anticorpos em um sujeito
WO2012150993A3 (en) Accurate and fast neural network training for library-based critical dimension (cd) metrology
MX2015012303A (es) Un metodo no invasivo para medir el estres oxidativoy daño oxidativo de los biomarcadores cutaneos.
WO2012160527A3 (en) Integrity evaluation system in an implantable hearing prosthesis
AR102517A1 (es) Ensayos para detectar subgrupos inmunes de células t y sus métodos de uso
GB2498283A (en) Benchmarks for normal cell identification
MX2017014196A (es) Deteccion de factores de virulencia microbiana bucal.
WO2012122374A3 (en) Non-invasive methods for diagnosing chronic organ transplant rejection
UY36223A (es) Dispositivos para ensayos, métodos para realizar ensayos, kits para ensayos y método para fabricar dispositivos para ensayos
WO2015164747A8 (en) Methods for diagnosing celiac disease using circulating cytokines/chemokines
GB2530428A (en) Optical computing device having a redundant light source and optical train
TR201722520A2 (tr) Vehicle and method of associating vehicle settings with a user of the vehicle.
ES2659184T3 (es) Moduladores del receptor MRG
MY172155A (en) Method for assessing cell aging
WO2017223254A8 (en) Methods for cell proliferation and toxicity testing
NO20151763A1 (en) Portable arrangement for automatical annulus testing
CL2015001972A1 (es) Sistema y método para probar un sintetizador de frecuencia
CL2017001937A1 (es) Detección de desgaste o daño en ciclones utilizando mediciones individuales en el flujo de salida superior de un ciclón
WO2017071728A8 (en) Lateral flow immunoassay device