CN105229736B - 用于选择第一编码算法与第二编码算法中的一个的装置及方法 - Google Patents

用于选择第一编码算法与第二编码算法中的一个的装置及方法 Download PDF

Info

Publication number
CN105229736B
CN105229736B CN201480019093.0A CN201480019093A CN105229736B CN 105229736 B CN105229736 B CN 105229736B CN 201480019093 A CN201480019093 A CN 201480019093A CN 105229736 B CN105229736 B CN 105229736B
Authority
CN
China
Prior art keywords
audio signal
encoding algorithm
estimated
encoding
quality measure
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201480019093.0A
Other languages
English (en)
Chinese (zh)
Other versions
CN105229736A (zh
Inventor
埃曼努埃尔·拉维利
斯特凡·多赫拉
纪尧姆·福奇斯
埃莱尼·福托普洛
克里斯蒂安·赫尔姆里希
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fraunhofer Gesellschaft zur Foerderung der Angewandten Forschung eV
Original Assignee
Fraunhofer Gesellschaft zur Foerderung der Angewandten Forschung eV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=50033499&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=CN105229736(B) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Fraunhofer Gesellschaft zur Foerderung der Angewandten Forschung eV filed Critical Fraunhofer Gesellschaft zur Foerderung der Angewandten Forschung eV
Priority to CN201910556401.8A priority Critical patent/CN110517700B/zh
Publication of CN105229736A publication Critical patent/CN105229736A/zh
Application granted granted Critical
Publication of CN105229736B publication Critical patent/CN105229736B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • G10L19/125Pitch excitation, e.g. pitch synchronous innovation CELP [PSI-CELP]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/22Mode decision, i.e. based on audio signal content versus external parameters
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
CN201480019093.0A 2013-01-29 2014-01-28 用于选择第一编码算法与第二编码算法中的一个的装置及方法 Active CN105229736B (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910556401.8A CN110517700B (zh) 2013-01-29 2014-01-28 用于选择第一编码算法与第二编码算法中的一个的装置

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201361758100P 2013-01-29 2013-01-29
US61/758,100 2013-01-29
PCT/EP2014/051557 WO2014118136A1 (en) 2013-01-29 2014-01-28 Apparatus and method for selecting one of a first audio encoding algorithm and a second audio encoding algorithm

Related Child Applications (1)

Application Number Title Priority Date Filing Date
CN201910556401.8A Division CN110517700B (zh) 2013-01-29 2014-01-28 用于选择第一编码算法与第二编码算法中的一个的装置

Publications (2)

Publication Number Publication Date
CN105229736A CN105229736A (zh) 2016-01-06
CN105229736B true CN105229736B (zh) 2019-07-19

Family

ID=50033499

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201480019093.0A Active CN105229736B (zh) 2013-01-29 2014-01-28 用于选择第一编码算法与第二编码算法中的一个的装置及方法
CN201910556401.8A Active CN110517700B (zh) 2013-01-29 2014-01-28 用于选择第一编码算法与第二编码算法中的一个的装置

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN201910556401.8A Active CN110517700B (zh) 2013-01-29 2014-01-28 用于选择第一编码算法与第二编码算法中的一个的装置

Country Status (18)

Country Link
US (4) US20150332698A1 (https=)
EP (1) EP2951820B1 (https=)
JP (1) JP6148810B2 (https=)
KR (1) KR101701081B1 (https=)
CN (2) CN105229736B (https=)
AR (1) AR094676A1 (https=)
AU (1) AU2014211583B2 (https=)
BR (1) BR112015018021B1 (https=)
CA (1) CA2899013C (https=)
ES (1) ES2616434T3 (https=)
MX (1) MX347410B (https=)
MY (1) MY189267A (https=)
PL (1) PL2951820T3 (https=)
PT (1) PT2951820T (https=)
RU (1) RU2618848C2 (https=)
SG (1) SG11201505947XA (https=)
TW (1) TWI549120B (https=)
WO (1) WO2014118136A1 (https=)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ES2616434T3 (es) * 2013-01-29 2017-06-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Aparato y método para seleccionar uno de un primer algoritmo de codificación de audio y un segundo algoritmo de codificación de audio
EP2830051A3 (en) 2013-07-22 2015-03-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder, audio decoder, methods and computer program using jointly encoded residual signals
CN105096958B (zh) 2014-04-29 2017-04-12 华为技术有限公司 音频编码方法及相关装置
EP3000110B1 (en) 2014-07-28 2016-12-07 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Selection of one of a first encoding algorithm and a second encoding algorithm using harmonics reduction
JP2016218345A (ja) * 2015-05-25 2016-12-22 ヤマハ株式会社 音素材処理装置および音素材処理プログラム
WO2017050398A1 (en) * 2015-09-25 2017-03-30 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoder, decoder and methods for signal-adaptive switching of the overlap ratio in audio transform coding
US10225730B2 (en) * 2016-06-24 2019-03-05 The Nielsen Company (Us), Llc Methods and apparatus to perform audio sensor selection in an audience measurement device
US11817111B2 (en) * 2018-04-11 2023-11-14 Dolby Laboratories Licensing Corporation Perceptually-based loss functions for audio encoding and decoding based on machine learning

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101261834A (zh) * 2007-03-09 2008-09-10 富士通株式会社 编码装置及编码方法
CN102113051A (zh) * 2008-07-11 2011-06-29 弗朗霍夫应用科学研究促进协会 具有级联开关的低比特率音频编码/解码方案
WO2012110448A1 (en) * 2011-02-14 2012-08-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for coding a portion of an audio signal using a transient detection and a quality result
CN102099856B (zh) * 2008-07-17 2012-11-07 弗劳恩霍夫应用研究促进协会 具有可切换旁路的音频编码/解码方法及设备

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002037688A1 (en) 2000-11-03 2002-05-10 Koninklijke Philips Electronics N.V. Parametric coding of audio signals
US6934676B2 (en) * 2001-05-11 2005-08-23 Nokia Mobile Phones Ltd. Method and system for inter-channel signal redundancy removal in perceptual audio coding
DE10124420C1 (de) * 2001-05-18 2002-11-28 Siemens Ag Verfahren zur Codierung und zur Übertragung von Sprachsignalen
DE102004007200B3 (de) 2004-02-13 2005-08-11 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audiocodierung
CA2457988A1 (en) * 2004-02-18 2005-08-18 Voiceage Corporation Methods and devices for audio compression based on acelp/tcx coding and multi-rate lattice vector quantization
FI118835B (fi) 2004-02-23 2008-03-31 Nokia Corp Koodausmallin valinta
FI119533B (fi) * 2004-04-15 2008-12-15 Nokia Corp Audiosignaalien koodaus
EP1747554B1 (en) 2004-05-17 2010-02-10 Nokia Corporation Audio encoding with different coding frame lengths
US7739120B2 (en) 2004-05-17 2010-06-15 Nokia Corporation Selection of coding models for encoding an audio signal
RU2393552C2 (ru) * 2004-09-17 2010-06-27 Конинклейке Филипс Электроникс Н.В. Комбинированное аудиокодирование, минимизирующее воспринимаемое искажение
WO2006048824A1 (en) * 2004-11-05 2006-05-11 Koninklijke Philips Electronics N.V. Efficient audio coding using signal properties
US7873511B2 (en) * 2006-06-30 2011-01-18 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder, audio decoder and audio processor having a dynamically variable warping characteristic
ATE408217T1 (de) * 2006-06-30 2008-09-15 Fraunhofer Ges Forschung Audiokodierer, audiodekodierer und audioprozessor mit einer dynamisch variablen warp-charakteristik
US7953595B2 (en) 2006-10-18 2011-05-31 Polycom, Inc. Dual-transform coding of audio signals
US8527265B2 (en) * 2007-10-22 2013-09-03 Qualcomm Incorporated Low-complexity encoding/decoding of quantized MDCT spectrum in scalable speech and audio codecs
KR101649376B1 (ko) * 2008-10-13 2016-08-31 한국전자통신연구원 Mdct 기반 음성/오디오 통합 부호화기의 lpc 잔차신호 부호화/복호화 장치
TWI435317B (zh) * 2009-10-20 2014-04-21 Fraunhofer Ges Forschung 音訊信號編碼器、音訊信號解碼器、用以提供音訊內容之編碼表示型態之方法、用以提供音訊內容之解碼表示型態之方法及使用於低延遲應用之電腦程式
RU2013110317A (ru) * 2010-09-10 2014-10-20 Панасоник Корпорэйшн Кодирующее устройство и способ кодирования
ES2616434T3 (es) * 2013-01-29 2017-06-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Aparato y método para seleccionar uno de un primer algoritmo de codificación de audio y un segundo algoritmo de codificación de audio
EP3000110B1 (en) * 2014-07-28 2016-12-07 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Selection of one of a first encoding algorithm and a second encoding algorithm using harmonics reduction

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101261834A (zh) * 2007-03-09 2008-09-10 富士通株式会社 编码装置及编码方法
CN102113051A (zh) * 2008-07-11 2011-06-29 弗朗霍夫应用科学研究促进协会 具有级联开关的低比特率音频编码/解码方案
CN102099856B (zh) * 2008-07-17 2012-11-07 弗劳恩霍夫应用研究促进协会 具有可切换旁路的音频编码/解码方法及设备
WO2012110448A1 (en) * 2011-02-14 2012-08-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for coding a portion of an audio signal using a transient detection and a quality result

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
EUROPEAN TELECOMMUNICATIONS STANDARDS INSTITUTE(ETSI)."DIGITAL CELLULAR TELECOMMUNICATIONS SYSTEM(PHASE 2+) *
EXTENDED ADAPTIVE MULTI-RATE-WIDEBAND(AMR-WB+)CODEC *
LTE;AUDIO CODEC PROCESSING FUNCTIONS *
TRANSCODING FUNCTIONS(3GPP TS 26.290 VERSION 11.0.0 RELEASE 11)".《TECHNICAL SPECIFICATION》.2012,第3GPP SA 4卷(第V11.0.0期), *
UNIVERSAL MOBILE TELECOMMUNICATIONS SYSTEM(UMTS) *

Also Published As

Publication number Publication date
US20150332698A1 (en) 2015-11-19
KR20150108848A (ko) 2015-09-30
AR094676A1 (es) 2015-08-19
JP6148810B2 (ja) 2017-06-14
RU2015136467A (ru) 2017-03-07
MY189267A (en) 2022-01-31
ES2616434T3 (es) 2017-06-13
CN105229736A (zh) 2016-01-06
US20190103121A1 (en) 2019-04-04
BR112015018021B1 (pt) 2022-10-11
AU2014211583B2 (en) 2017-01-05
HK1218461A1 (en) 2017-02-17
JP2016505902A (ja) 2016-02-25
MX347410B (es) 2017-04-26
CN110517700A (zh) 2019-11-29
EP2951820B1 (en) 2016-12-07
US20200227059A1 (en) 2020-07-16
US11521631B2 (en) 2022-12-06
PT2951820T (pt) 2017-03-02
MX2015009745A (es) 2015-11-06
WO2014118136A1 (en) 2014-08-07
US20230079574A1 (en) 2023-03-16
PL2951820T3 (pl) 2017-06-30
TWI549120B (zh) 2016-09-11
US10622000B2 (en) 2020-04-14
KR101701081B1 (ko) 2017-01-31
CN110517700B (zh) 2023-06-09
EP2951820A1 (en) 2015-12-09
CA2899013A1 (en) 2014-08-07
TW201434037A (zh) 2014-09-01
SG11201505947XA (en) 2015-09-29
CA2899013C (en) 2017-11-07
US11908485B2 (en) 2024-02-20
BR112015018021A2 (https=) 2017-07-11
AU2014211583A1 (en) 2015-09-17
RU2618848C2 (ru) 2017-05-12

Similar Documents

Publication Publication Date Title
US11908485B2 (en) Apparatus and method for selecting one of a first encoding algorithm and a second encoding algorithm
US10706865B2 (en) Apparatus and method for selecting one of a first encoding algorithm and a second encoding algorithm using harmonics reduction
JP2014510303A (ja) 過渡検出及び品質結果を使用してオーディオ信号の一部分を符号化する装置及び方法
CA2910878C (en) Apparatus and method for selecting one of a first encoding algorithm and a second encoding algorithm using harmonics reduction
HK1218461B (en) Apparatus and method for selecting one of a first audio encoding algorithm and a second audio encoding algorithm
HK1222943B (en) Selection of one of a first encoding algorithm and a second encoding algorithm using harmonics reduction

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant