CN110619891B - 音频信号区分器和编码器 - Google Patents

音频信号区分器和编码器 Download PDF

Info

Publication number
CN110619891B
CN110619891B CN201910918149.0A CN201910918149A CN110619891B CN 110619891 B CN110619891 B CN 110619891B CN 201910918149 A CN201910918149 A CN 201910918149A CN 110619891 B CN110619891 B CN 110619891B
Authority
CN
China
Prior art keywords
audio signal
peak
spectral
coefficients
average distance
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910918149.0A
Other languages
English (en)
Chinese (zh)
Other versions
CN110619891A (zh
Inventor
艾力克·诺维尔
沃洛佳·格兰恰诺夫
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Telefonaktiebolaget LM Ericsson AB
Original Assignee
Telefonaktiebolaget LM Ericsson AB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telefonaktiebolaget LM Ericsson AB filed Critical Telefonaktiebolaget LM Ericsson AB
Publication of CN110619891A publication Critical patent/CN110619891A/zh
Application granted granted Critical
Publication of CN110619891B publication Critical patent/CN110619891B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/22Mode decision, i.e. based on audio signal content versus external parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/81Detection of presence or absence of voice signals for discriminating voice from music
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
CN201910918149.0A 2014-05-08 2015-05-07 音频信号区分器和编码器 Active CN110619891B (zh)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201461990354P 2014-05-08 2014-05-08
US61/990,354 2014-05-08
CN201580023968.9A CN106463141B (zh) 2014-05-08 2015-05-07 音频信号区分器和编码器
PCT/SE2015/050503 WO2015171061A1 (en) 2014-05-08 2015-05-07 Audio signal discriminator and coder

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN201580023968.9A Division CN106463141B (zh) 2014-05-08 2015-05-07 音频信号区分器和编码器

Publications (2)

Publication Number Publication Date
CN110619891A CN110619891A (zh) 2019-12-27
CN110619891B true CN110619891B (zh) 2023-01-17

Family

ID=53200274

Family Applications (3)

Application Number Title Priority Date Filing Date
CN201910918149.0A Active CN110619891B (zh) 2014-05-08 2015-05-07 音频信号区分器和编码器
CN201910919030.5A Active CN110619892B (zh) 2014-05-08 2015-05-07 音频信号区分器和编码器
CN201580023968.9A Active CN106463141B (zh) 2014-05-08 2015-05-07 音频信号区分器和编码器

Family Applications After (2)

Application Number Title Priority Date Filing Date
CN201910919030.5A Active CN110619892B (zh) 2014-05-08 2015-05-07 音频信号区分器和编码器
CN201580023968.9A Active CN106463141B (zh) 2014-05-08 2015-05-07 音频信号区分器和编码器

Country Status (11)

Country Link
US (3) US9620138B2 (https=)
EP (3) EP3594948B1 (https=)
CN (3) CN110619891B (https=)
BR (1) BR112016025850B1 (https=)
DK (2) DK3379535T3 (https=)
ES (3) ES2690577T3 (https=)
HU (1) HUE046477T2 (https=)
MX (2) MX356883B (https=)
MY (1) MY182165A (https=)
PL (2) PL3140831T3 (https=)
WO (1) WO2015171061A1 (https=)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
MY176776A (en) 2013-10-18 2020-08-21 Ericsson Telefon Ab L M Coding and decoding of spectral peak positions
CN110619891B (zh) * 2014-05-08 2023-01-17 瑞典爱立信有限公司 音频信号区分器和编码器
CN112992164B (zh) * 2014-07-28 2024-12-06 日本电信电话株式会社 编码方法、装置、程序产品以及记录介质
CN110211580B (zh) * 2019-05-15 2021-07-16 海尔优家智能科技(北京)有限公司 多智能设备应答方法、装置、系统及存储介质
CA3184152A1 (en) * 2020-06-30 2022-01-06 Rivarol VERGIN Cumulative average spectral entropy analysis for tone and speech classification
CN113890492B (zh) * 2021-10-09 2025-07-18 深圳市创成微电子有限公司 音频功率放大器的供电电压控制方法、控制器和音频设备
US20250201255A1 (en) * 2023-12-13 2025-06-19 Qualcomm Incorporated Content-based switchable audio codec

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101145345A (zh) * 2006-09-13 2008-03-19 华为技术有限公司 音频分类方法
CN101399039A (zh) * 2007-09-30 2009-04-01 华为技术有限公司 一种确定非噪声音频信号类别的方法及装置
CN102044246A (zh) * 2009-10-15 2011-05-04 华为技术有限公司 一种音频信号检测方法和装置

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1146130C (zh) * 1998-05-27 2004-04-14 微软公司 输入信号处理系统的编码器和屏蔽频信号量化噪声方法
US6226608B1 (en) * 1999-01-28 2001-05-01 Dolby Laboratories Licensing Corporation Data framing for adaptive-block-length coding system
US6959274B1 (en) * 1999-09-22 2005-10-25 Mindspeed Technologies, Inc. Fixed rate speech compression system and method
US6785645B2 (en) * 2001-11-29 2004-08-31 Microsoft Corporation Real-time speech and music classifier
KR100762596B1 (ko) * 2006-04-05 2007-10-01 삼성전자주식회사 음성 신호 전처리 시스템 및 음성 신호 특징 정보 추출방법
US20070282601A1 (en) * 2006-06-02 2007-12-06 Texas Instruments Inc. Packet loss concealment for a conjugate structure algebraic code excited linear prediction decoder
US8990073B2 (en) * 2007-06-22 2015-03-24 Voiceage Corporation Method and device for sound activity detection and sound signal classification
KR101599875B1 (ko) * 2008-04-17 2016-03-14 삼성전자주식회사 멀티미디어의 컨텐트 특성에 기반한 멀티미디어 부호화 방법 및 장치, 멀티미디어의 컨텐트 특성에 기반한 멀티미디어 복호화 방법 및 장치
PL2346029T3 (pl) * 2008-07-11 2013-11-29 Fraunhofer Ges Forschung Koder sygnału audio, sposób kodowania sygnału audio i odpowiadający mu program komputerowy
EP2210944A1 (en) 2009-01-22 2010-07-28 ATG:biosynthetics GmbH Methods for generation of RNA and (poly)peptide libraries and their use
KR101754970B1 (ko) * 2010-01-12 2017-07-06 삼성전자주식회사 무선 통신 시스템의 채널 상태 측정 기준신호 처리 장치 및 방법
US9652999B2 (en) * 2010-04-29 2017-05-16 Educational Testing Service Computer-implemented systems and methods for estimating word accuracy for automatic speech recognition
US8977542B2 (en) * 2010-07-16 2015-03-10 Telefonaktiebolaget L M Ericsson (Publ) Audio encoder and decoder and methods for encoding and decoding an audio signal
RU2010152225A (ru) * 2010-12-20 2012-06-27 ЭлЭсАй Корпорейшн (US) Обнаружение музыки с использованием анализа спектральных пиков
CN102982804B (zh) * 2011-09-02 2017-05-03 杜比实验室特许公司 音频分类方法和系统
CN102522082B (zh) * 2011-12-27 2013-07-10 重庆大学 一种公共场所异常声音的识别与定位方法
US9111531B2 (en) * 2012-01-13 2015-08-18 Qualcomm Incorporated Multiple coding mode signal classification
US20130282373A1 (en) * 2012-04-23 2013-10-24 Qualcomm Incorporated Systems and methods for audio signal processing
AU2013283568B2 (en) * 2012-06-28 2016-05-12 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Linear prediction based audio coding using improved probability distribution estimation
US9401153B2 (en) * 2012-10-15 2016-07-26 Digimarc Corporation Multi-mode audio recognition and auxiliary data encoding and decoding
CN110619891B (zh) * 2014-05-08 2023-01-17 瑞典爱立信有限公司 音频信号区分器和编码器
WO2015168925A1 (en) 2014-05-09 2015-11-12 Qualcomm Incorporated Restricted aperiodic csi measurement reporting in enhanced interference management and traffic adaptation
TWI602172B (zh) * 2014-08-27 2017-10-11 弗勞恩霍夫爾協會 使用參數以加強隱蔽之用於編碼及解碼音訊內容的編碼器、解碼器及方法

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101145345A (zh) * 2006-09-13 2008-03-19 华为技术有限公司 音频分类方法
CN101399039A (zh) * 2007-09-30 2009-04-01 华为技术有限公司 一种确定非噪声音频信号类别的方法及装置
CN102044246A (zh) * 2009-10-15 2011-05-04 华为技术有限公司 一种音频信号检测方法和装置

Also Published As

Publication number Publication date
MX2018007257A (es) 2022-08-25
US20160086615A1 (en) 2016-03-24
HUE046477T2 (hu) 2020-03-30
EP3140831A1 (en) 2017-03-15
EP3140831B1 (en) 2018-07-11
CN106463141B (zh) 2019-11-01
MY182165A (en) 2021-01-18
EP3594948A1 (en) 2020-01-15
PL3140831T3 (pl) 2018-12-31
BR112016025850B1 (pt) 2022-08-16
MX2016014534A (es) 2017-02-20
MX356883B (es) 2018-06-19
DK3140831T3 (en) 2018-10-15
CN106463141A (zh) 2017-02-22
CN110619892B (zh) 2023-04-11
ES2763280T3 (es) 2020-05-27
WO2015171061A1 (en) 2015-11-12
US9620138B2 (en) 2017-04-11
US20190198032A1 (en) 2019-06-27
US20170178660A1 (en) 2017-06-22
PL3594948T3 (pl) 2021-08-30
DK3379535T3 (da) 2019-12-16
CN110619892A (zh) 2019-12-27
ES2874757T3 (es) 2021-11-05
US10984812B2 (en) 2021-04-20
ES2690577T3 (es) 2018-11-21
EP3594948B1 (en) 2021-03-03
EP3379535B1 (en) 2019-09-18
BR112016025850A2 (https=) 2017-08-15
EP3379535A1 (en) 2018-09-26
CN110619891A (zh) 2019-12-27
US10242687B2 (en) 2019-03-26

Similar Documents

Publication Publication Date Title
CN110619891B (zh) 音频信号区分器和编码器
CN105408956B (zh) 用于获取音频信号的替换帧的频谱系数的方法及相关产品
KR20180073649A (ko) 에코 지연을 추적하는 방법 및 장치
KR20140121443A (ko) 백그라운드 잡음의 존재에서 음성 액티비티 검출
CN117459157B (zh) 一种端到端的微弱卫星信号智能检测方法
US20210235226A1 (en) Information processing device
Kumar et al. MDI-SS: matched filter detection with inverse covariance matrix-based spectrum sensing in cognitive radio
CN118692488A (zh) 具有不确定性量化的音频设备及相关方法
CN108538290A (zh) 一种基于音频信号检测的智能家居控制方法
JP6558073B2 (ja) 移動目標の検出方法及び移動目標の検出装置
CN103915099A (zh) 语音基音周期检测方法和装置
JP2016017793A (ja) 無線測位装置、無線測位方法、無線測位システム、及び、コンピュータ・プログラム
CN116403595A (zh) 一种抗干扰无线对讲方法、系统、设备及介质
WO2010101527A1 (en) Methods for determining whether a signal includes a wanted signal and apparatuses configured to determine whether a signal includes a wanted signal
Treeumnuk et al. Energy detector with adaptive sensing window for improved spectrum utilization in dynamic cognitive radio systems
CN116318445B (zh) 频谱感知方法、装置、电子设备和存储介质
EP2770758A1 (en) Method and device for estimating speed, or speed class, of a user mobile communication device in a wireless communication network
Song et al. Voice Activity Detection Based on Generalized Normal-Laplace Distribution Incorporating Conditional MAP
CN120487070A (zh) 含煤地层岩性识别方法、装置、设备及存储介质
CN103560838A (zh) 一种抑制直流偏置的能量检测方法
Van et al. Malicious user suppression based on Kullback-Leibler divergence for cognitive radio.
Vu-Van et al. Goodness‐of‐Fit Based Secure Cooperative Spectrum Sensing for Cognitive Radio Network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant