ES2531137T3 - Clasificación de señales de audio basada en marcos - Google Patents
Clasificación de señales de audio basada en marcos Download PDFInfo
- Publication number
- ES2531137T3 ES2531137T3 ES11717266T ES11717266T ES2531137T3 ES 2531137 T3 ES2531137 T3 ES 2531137T3 ES 11717266 T ES11717266 T ES 11717266T ES 11717266 T ES11717266 T ES 11717266T ES 2531137 T3 ES2531137 T3 ES 2531137T3
- Authority
- ES
- Spain
- Prior art keywords
- frames
- characteristic
- range
- fraction
- speaking
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000005236 sound signal Effects 0.000 title abstract 2
- 238000005259 measurement Methods 0.000 abstract 2
- 238000000034 method Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G10L2025/783—Detection of presence or absence of voice signals based on threshold decision
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Un método de clasificación de señales de audio basado en marcos o cuadros, caracterizado por los pasos de: determinar (S1), para cada uno de un número predeterminado de marcos consecutivos, medidas de características que representan al menos las siguientes características: - un coeficiente de auto-correlación (Tn), - una energía de señal de marco (En) en un dominio comprimido, - una variación de energía entre marcos; comparar (S2) cada medida de característica determinada con al menos un correspondiente intervalo predeterminado de características; calcular (S3), para cada intervalo de características, una medida de fracción (Φ ;1 - Φ 5) que representa el número total de medidas correspondientes de características (Tn, En, Φ ;;En) que caen dentro del intervalo de características; clasificar (S4) el último de los marcos consecutivos como habla si cada medida de fracción se sitúa dentro de un intervalo de fracción correspondiente, y como no-habla en caso contrario.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/EP2011/056761 WO2012146290A1 (en) | 2011-04-28 | 2011-04-28 | Frame based audio signal classification |
Publications (1)
Publication Number | Publication Date |
---|---|
ES2531137T3 true ES2531137T3 (es) | 2015-03-11 |
Family
ID=44626095
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
ES11717266T Active ES2531137T3 (es) | 2011-04-28 | 2011-04-28 | Clasificación de señales de audio basada en marcos |
Country Status (5)
Country | Link |
---|---|
US (1) | US9240191B2 (es) |
EP (1) | EP2702585B1 (es) |
BR (1) | BR112013026333B1 (es) |
ES (1) | ES2531137T3 (es) |
WO (1) | WO2012146290A1 (es) |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5850216B2 (ja) | 2010-04-13 | 2016-02-03 | ソニー株式会社 | 信号処理装置および方法、符号化装置および方法、復号装置および方法、並びにプログラム |
JP6037156B2 (ja) * | 2011-08-24 | 2016-11-30 | ソニー株式会社 | 符号化装置および方法、並びにプログラム |
US20130090926A1 (en) * | 2011-09-16 | 2013-04-11 | Qualcomm Incorporated | Mobile device context information using speech detection |
JP6593173B2 (ja) | 2013-12-27 | 2019-10-23 | ソニー株式会社 | 復号化装置および方法、並びにプログラム |
CN104934032B (zh) * | 2014-03-17 | 2019-04-05 | 华为技术有限公司 | 根据频域能量对语音信号进行处理的方法和装置 |
JP6596924B2 (ja) * | 2014-05-29 | 2019-10-30 | 日本電気株式会社 | 音声データ処理装置、音声データ処理方法、及び、音声データ処理プログラム |
CN105336338B (zh) | 2014-06-24 | 2017-04-12 | 华为技术有限公司 | 音频编码方法和装置 |
CN106328169B (zh) * | 2015-06-26 | 2018-12-11 | 中兴通讯股份有限公司 | 一种激活音修正帧数的获取方法、激活音检测方法和装置 |
EP3242295B1 (en) * | 2016-05-06 | 2019-10-23 | Nxp B.V. | A signal processor |
CN108074584A (zh) * | 2016-11-18 | 2018-05-25 | 南京大学 | 一种基于信号多特征统计的音频信号分类方法 |
US10325588B2 (en) * | 2017-09-28 | 2019-06-18 | International Business Machines Corporation | Acoustic feature extractor selected according to status flag of frame of acoustic signal |
CN115294947B (zh) * | 2022-07-29 | 2024-06-11 | 腾讯科技(深圳)有限公司 | 音频数据处理方法、装置、电子设备及介质 |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
SE501981C2 (sv) * | 1993-11-02 | 1995-07-03 | Ericsson Telefon Ab L M | Förfarande och anordning för diskriminering mellan stationära och icke stationära signaler |
US5712953A (en) * | 1995-06-28 | 1998-01-27 | Electronic Data Systems Corporation | System and method for classification of audio or audio/video signals based on musical content |
SE9700772D0 (sv) | 1997-03-03 | 1997-03-03 | Ericsson Telefon Ab L M | A high resolution post processing method for a speech decoder |
US6983242B1 (en) | 2000-08-21 | 2006-01-03 | Mindspeed Technologies, Inc. | Method for robust classification in speech coding |
US6640208B1 (en) * | 2000-09-12 | 2003-10-28 | Motorola, Inc. | Voiced/unvoiced speech classifier |
US6993481B2 (en) * | 2000-12-04 | 2006-01-31 | Global Ip Sound Ab | Detection of speech activity using feature model adaptation |
US7127392B1 (en) * | 2003-02-12 | 2006-10-24 | The United States Of America As Represented By The National Security Agency | Device for and method of detecting voice activity |
CN100483509C (zh) | 2006-12-05 | 2009-04-29 | 华为技术有限公司 | 声音信号分类方法和装置 |
-
2011
- 2011-04-28 BR BR112013026333-4A patent/BR112013026333B1/pt not_active IP Right Cessation
- 2011-04-28 ES ES11717266T patent/ES2531137T3/es active Active
- 2011-04-28 US US14/113,616 patent/US9240191B2/en not_active Expired - Fee Related
- 2011-04-28 EP EP11717266.8A patent/EP2702585B1/en not_active Not-in-force
- 2011-04-28 WO PCT/EP2011/056761 patent/WO2012146290A1/en active Application Filing
Also Published As
Publication number | Publication date |
---|---|
EP2702585B1 (en) | 2014-12-31 |
US9240191B2 (en) | 2016-01-19 |
BR112013026333A2 (pt) | 2020-11-03 |
EP2702585A1 (en) | 2014-03-05 |
BR112013026333B1 (pt) | 2021-05-18 |
US20140046658A1 (en) | 2014-02-13 |
WO2012146290A1 (en) | 2012-11-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ES2531137T3 (es) | Clasificación de señales de audio basada en marcos | |
WO2014020502A3 (en) | Markers associated with sensitivity to inhibitors of human double minute 2 (mdm2) | |
AR102615A1 (es) | Analizador de analitos | |
ATE493290T1 (de) | Alcotestgerät | |
MY170983A (en) | Biomarker assays for detecting or measuring inhibition of tor kinase activity | |
MX2014004906A (es) | Analisis y control de flujo de aerosol. | |
BR112016019836A2 (pt) | método para analisar uma amostra de um sujeito, dispositivo de diagnóstico para utilização no diagnóstico da endometriose, kit, uso de um biomarcador, e, método para aumentar uma resposta de anticorpos em um sujeito | |
WO2012150993A3 (en) | Accurate and fast neural network training for library-based critical dimension (cd) metrology | |
MX2015012303A (es) | Un metodo no invasivo para medir el estres oxidativoy daño oxidativo de los biomarcadores cutaneos. | |
WO2012160527A3 (en) | Integrity evaluation system in an implantable hearing prosthesis | |
AR102517A1 (es) | Ensayos para detectar subgrupos inmunes de células t y sus métodos de uso | |
GB2498283A (en) | Benchmarks for normal cell identification | |
MX2017014196A (es) | Deteccion de factores de virulencia microbiana bucal. | |
WO2012122374A3 (en) | Non-invasive methods for diagnosing chronic organ transplant rejection | |
UY36223A (es) | Dispositivos para ensayos, métodos para realizar ensayos, kits para ensayos y método para fabricar dispositivos para ensayos | |
WO2015164747A8 (en) | Methods for diagnosing celiac disease using circulating cytokines/chemokines | |
GB2530428A (en) | Optical computing device having a redundant light source and optical train | |
TR201722520A2 (tr) | Vehicle and method of associating vehicle settings with a user of the vehicle. | |
ES2659184T3 (es) | Moduladores del receptor MRG | |
MY172155A (en) | Method for assessing cell aging | |
WO2017223254A8 (en) | Methods for cell proliferation and toxicity testing | |
NO20151763A1 (en) | Portable arrangement for automatical annulus testing | |
CL2015001972A1 (es) | Sistema y método para probar un sintetizador de frecuencia | |
CL2017001937A1 (es) | Detección de desgaste o daño en ciclones utilizando mediciones individuales en el flujo de salida superior de un ciclón | |
WO2017071728A8 (en) | Lateral flow immunoassay device |