DK3379535T3 - Audiosignalklassifikator - Google Patents

Audiosignalklassifikator Download PDF

Info

Publication number
DK3379535T3
DK3379535T3 DK18172361.0T DK18172361T DK3379535T3 DK 3379535 T3 DK3379535 T3 DK 3379535T3 DK 18172361 T DK18172361 T DK 18172361T DK 3379535 T3 DK3379535 T3 DK 3379535T3
Authority
DK
Denmark
Prior art keywords
audio signal
signal classifier
classifier
audio
signal
Prior art date
Application number
DK18172361.0T
Other languages
English (en)
Inventor
Erik Norvell
Volodya Grancharov
Original Assignee
Ericsson Telefon Ab L M
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ericsson Telefon Ab L M filed Critical Ericsson Telefon Ab L M
Application granted granted Critical
Publication of DK3379535T3 publication Critical patent/DK3379535T3/da

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/22Mode decision, i.e. based on audio signal content versus external parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/81Detection of presence or absence of voice signals for discriminating voice from music

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
DK18172361.0T 2014-05-08 2015-05-07 Audiosignalklassifikator DK3379535T3 (da)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201461990354P 2014-05-08 2014-05-08
EP15724098.7A EP3140831B1 (en) 2014-05-08 2015-05-07 Audio signal discriminator and coder

Publications (1)

Publication Number Publication Date
DK3379535T3 true DK3379535T3 (da) 2019-12-16

Family

ID=53200274

Family Applications (2)

Application Number Title Priority Date Filing Date
DK15724098.7T DK3140831T3 (da) 2014-05-08 2015-05-07 Audiosignaldiskriminator og koder
DK18172361.0T DK3379535T3 (da) 2014-05-08 2015-05-07 Audiosignalklassifikator

Family Applications Before (1)

Application Number Title Priority Date Filing Date
DK15724098.7T DK3140831T3 (da) 2014-05-08 2015-05-07 Audiosignaldiskriminator og koder

Country Status (11)

Country Link
US (3) US9620138B2 (da)
EP (3) EP3379535B1 (da)
CN (3) CN110619891B (da)
BR (1) BR112016025850B1 (da)
DK (2) DK3140831T3 (da)
ES (3) ES2690577T3 (da)
HU (1) HUE046477T2 (da)
MX (2) MX356883B (da)
MY (1) MY182165A (da)
PL (2) PL3594948T3 (da)
WO (1) WO2015171061A1 (da)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3226242B1 (en) 2013-10-18 2018-12-19 Telefonaktiebolaget LM Ericsson (publ) Coding of spectral peak positions
WO2015171061A1 (en) * 2014-05-08 2015-11-12 Telefonaktiebolaget L M Ericsson (Publ) Audio signal discriminator and coder
JP6411509B2 (ja) * 2014-07-28 2018-10-24 日本電信電話株式会社 符号化方法、装置、プログラム及び記録媒体
CN110211580B (zh) * 2019-05-15 2021-07-16 海尔优家智能科技(北京)有限公司 多智能设备应答方法、装置、系统及存储介质

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100361405C (zh) * 1998-05-27 2008-01-09 微软公司 利用可升级的音频编码器和解码器处理输入信号的方法
US6226608B1 (en) * 1999-01-28 2001-05-01 Dolby Laboratories Licensing Corporation Data framing for adaptive-block-length coding system
US6959274B1 (en) * 1999-09-22 2005-10-25 Mindspeed Technologies, Inc. Fixed rate speech compression system and method
US6785645B2 (en) * 2001-11-29 2004-08-31 Microsoft Corporation Real-time speech and music classifier
KR100762596B1 (ko) * 2006-04-05 2007-10-01 삼성전자주식회사 음성 신호 전처리 시스템 및 음성 신호 특징 정보 추출방법
US20070282601A1 (en) * 2006-06-02 2007-12-06 Texas Instruments Inc. Packet loss concealment for a conjugate structure algebraic code excited linear prediction decoder
CN101145345B (zh) * 2006-09-13 2011-02-09 华为技术有限公司 音频分类方法
CA2690433C (en) * 2007-06-22 2016-01-19 Voiceage Corporation Method and device for sound activity detection and sound signal classification
CN101399039B (zh) * 2007-09-30 2011-05-11 华为技术有限公司 一种确定非噪声音频信号类别的方法及装置
KR101599875B1 (ko) * 2008-04-17 2016-03-14 삼성전자주식회사 멀티미디어의 컨텐트 특성에 기반한 멀티미디어 부호화 방법 및 장치, 멀티미디어의 컨텐트 특성에 기반한 멀티미디어 복호화 방법 및 장치
PL2346030T3 (pl) 2008-07-11 2015-03-31 Fraunhofer Ges Forschung Koder audio, sposób kodowania sygnału audio oraz program komputerowy
EP2210944A1 (en) 2009-01-22 2010-07-28 ATG:biosynthetics GmbH Methods for generation of RNA and (poly)peptide libraries and their use
CN102044246B (zh) * 2009-10-15 2012-05-23 华为技术有限公司 一种音频信号检测方法和装置
KR101754970B1 (ko) * 2010-01-12 2017-07-06 삼성전자주식회사 무선 통신 시스템의 채널 상태 측정 기준신호 처리 장치 및 방법
US9652999B2 (en) * 2010-04-29 2017-05-16 Educational Testing Service Computer-implemented systems and methods for estimating word accuracy for automatic speech recognition
CN102985966B (zh) * 2010-07-16 2016-07-06 瑞典爱立信有限公司 音频编码器和解码器及用于音频信号的编码和解码的方法
RU2010152225A (ru) * 2010-12-20 2012-06-27 ЭлЭсАй Корпорейшн (US) Обнаружение музыки с использованием анализа спектральных пиков
CN102982804B (zh) * 2011-09-02 2017-05-03 杜比实验室特许公司 音频分类方法和系统
CN102522082B (zh) * 2011-12-27 2013-07-10 重庆大学 一种公共场所异常声音的识别与定位方法
US9111531B2 (en) * 2012-01-13 2015-08-18 Qualcomm Incorporated Multiple coding mode signal classification
US20130282372A1 (en) * 2012-04-23 2013-10-24 Qualcomm Incorporated Systems and methods for audio signal processing
BR112014032735B1 (pt) * 2012-06-28 2022-04-26 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V Codificador e decodificador de áudio com base em predição linear e respectivos métodos para codificar e decodificar
US9401153B2 (en) * 2012-10-15 2016-07-26 Digimarc Corporation Multi-mode audio recognition and auxiliary data encoding and decoding
WO2015171061A1 (en) * 2014-05-08 2015-11-12 Telefonaktiebolaget L M Ericsson (Publ) Audio signal discriminator and coder
WO2015168925A1 (en) 2014-05-09 2015-11-12 Qualcomm Incorporated Restricted aperiodic csi measurement reporting in enhanced interference management and traffic adaptation
TWI602172B (zh) * 2014-08-27 2017-10-11 弗勞恩霍夫爾協會 使用參數以加強隱蔽之用於編碼及解碼音訊內容的編碼器、解碼器及方法

Also Published As

Publication number Publication date
US20160086615A1 (en) 2016-03-24
EP3379535A1 (en) 2018-09-26
PL3140831T3 (pl) 2018-12-31
HUE046477T2 (hu) 2020-03-30
US20170178660A1 (en) 2017-06-22
EP3140831B1 (en) 2018-07-11
CN110619891B (zh) 2023-01-17
EP3594948A1 (en) 2020-01-15
ES2690577T3 (es) 2018-11-21
MY182165A (en) 2021-01-18
MX2018007257A (es) 2022-08-25
CN110619891A (zh) 2019-12-27
ES2763280T3 (es) 2020-05-27
CN106463141A (zh) 2017-02-22
CN110619892A (zh) 2019-12-27
CN106463141B (zh) 2019-11-01
EP3379535B1 (en) 2019-09-18
US9620138B2 (en) 2017-04-11
BR112016025850B1 (pt) 2022-08-16
DK3140831T3 (da) 2018-10-15
US10242687B2 (en) 2019-03-26
BR112016025850A2 (da) 2017-08-15
WO2015171061A1 (en) 2015-11-12
EP3594948B1 (en) 2021-03-03
PL3594948T3 (pl) 2021-08-30
CN110619892B (zh) 2023-04-11
US20190198032A1 (en) 2019-06-27
EP3140831A1 (en) 2017-03-15
MX2016014534A (es) 2017-02-20
US10984812B2 (en) 2021-04-20
MX356883B (es) 2018-06-19
ES2874757T3 (es) 2021-11-05

Similar Documents

Publication Publication Date Title
DK3013070T3 (da) Høresystem
DK2947898T3 (da) Høreanordning
GB201406574D0 (en) Audio Signal Processing
DK3006079T3 (da) Høresystem
DK3132517T3 (da) Frekvenssvar
GB201405123D0 (en) Audio signal payload
DK3110442T3 (da) Modificerede meningokok-fhbp-polypeptider
HK1226169A1 (zh) 分析音頻數據
DK3001700T3 (da) Positioneret høresystem
GB201401626D0 (en) Audio signal analysis
MA41015A (fr) Hydrocyclone anti-boudinage
DK3148701T3 (da) Separator
ES2969748T3 (es) Concepto de empalme de audio
HK1214674A1 (zh) 音頻信號處理
FI20165684A (fi) Parannussignaaligeneraattori
GB201401689D0 (en) Audio signal processing
DK3379535T3 (da) Audiosignalklassifikator
FR3023088B1 (fr) Amplificateur audio
DK3191083T3 (da) Gelatine-/pektinpartikler
ES1134932Y (es) Reposacabezas anatomico
FR3024297B1 (fr) Goulotte
DK3422742T3 (da) Høreapparatkonfigurationsdetektering
FI20145150L (fi) Signaalinkäsittelyjärjestelmä
ITUB20161213A1 (it) Altoparlante
ITMI20140386U1 (it) Gioiello