CN101091108A - 用于对音频信号进行编码的编码模型的选择 - Google Patents
用于对音频信号进行编码的编码模型的选择 Download PDFInfo
- Publication number
- CN101091108A CN101091108A CNA200580015656XA CN200580015656A CN101091108A CN 101091108 A CN101091108 A CN 101091108A CN A200580015656X A CNA200580015656X A CN A200580015656XA CN 200580015656 A CN200580015656 A CN 200580015656A CN 101091108 A CN101091108 A CN 101091108A
- Authority
- CN
- China
- Prior art keywords
- encoding model
- audio content
- sound signal
- encoding
- model
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000005236 sound signal Effects 0.000 title claims abstract description 87
- 238000000034 method Methods 0.000 claims abstract description 42
- 238000011156 evaluation Methods 0.000 claims description 54
- 238000005457 optimization Methods 0.000 claims description 21
- 230000008569 process Effects 0.000 claims description 10
- 230000007704 transition Effects 0.000 claims description 10
- 238000012545 processing Methods 0.000 claims description 4
- 238000010972 statistical evaluation Methods 0.000 abstract 1
- 238000010586 diagram Methods 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 230000005284 excitation Effects 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 238000013461 design Methods 0.000 description 2
- 230000003595 spectral effect Effects 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000007635 classification algorithm Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 230000000873 masking effect Effects 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 238000005096 rolling process Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/22—Mode decision, i.e. based on audio signal content versus external parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
Abstract
Description
Claims (21)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/847,651 US7739120B2 (en) | 2004-05-17 | 2004-05-17 | Selection of coding models for encoding an audio signal |
US10/847,651 | 2004-05-17 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101091108A true CN101091108A (zh) | 2007-12-19 |
CN100485337C CN100485337C (zh) | 2009-05-06 |
Family
ID=34962977
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB200580015656XA Active CN100485337C (zh) | 2004-05-17 | 2005-04-06 | 用于对音频信号进行编码的编码模型的选择 |
Country Status (17)
Country | Link |
---|---|
US (1) | US7739120B2 (zh) |
EP (1) | EP1747442B1 (zh) |
JP (1) | JP2008503783A (zh) |
KR (1) | KR20080083719A (zh) |
CN (1) | CN100485337C (zh) |
AT (1) | ATE479885T1 (zh) |
AU (1) | AU2005242993A1 (zh) |
BR (1) | BRPI0511150A (zh) |
CA (1) | CA2566353A1 (zh) |
DE (1) | DE602005023295D1 (zh) |
HK (1) | HK1110111A1 (zh) |
MX (1) | MXPA06012579A (zh) |
PE (1) | PE20060385A1 (zh) |
RU (1) | RU2006139795A (zh) |
TW (1) | TW200606815A (zh) |
WO (1) | WO2005111567A1 (zh) |
ZA (1) | ZA200609479B (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7835906B1 (en) | 2009-05-31 | 2010-11-16 | Huawei Technologies Co., Ltd. | Encoding method, apparatus and device and decoding method |
CN101221766B (zh) * | 2008-01-23 | 2011-01-05 | 清华大学 | 音频编码器切换的方法 |
CN104919524A (zh) * | 2012-11-13 | 2015-09-16 | 三星电子株式会社 | 用于确定编码模式的方法和设备、用于对音频信号进行编码的方法和设备以及用于对音频信号进行解码的方法和设备 |
Families Citing this family (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2006136179A1 (en) * | 2005-06-20 | 2006-12-28 | Telecom Italia S.P.A. | Method and apparatus for transmitting speech data to a remote device in a distributed speech recognition system |
WO2007083931A1 (en) * | 2006-01-18 | 2007-07-26 | Lg Electronics Inc. | Apparatus and method for encoding and decoding signal |
JP5235684B2 (ja) * | 2006-02-24 | 2013-07-10 | フランス・テレコム | 信号包絡線の量子化インデックスをバイナリ符号化する方法、信号包絡線を復号化する方法、および、対応する符号化および復号化モジュール |
US9159333B2 (en) * | 2006-06-21 | 2015-10-13 | Samsung Electronics Co., Ltd. | Method and apparatus for adaptively encoding and decoding high frequency band |
KR101434198B1 (ko) * | 2006-11-17 | 2014-08-26 | 삼성전자주식회사 | 신호 복호화 방법 |
KR100964402B1 (ko) * | 2006-12-14 | 2010-06-17 | 삼성전자주식회사 | 오디오 신호의 부호화 모드 결정 방법 및 장치와 이를 이용한 오디오 신호의 부호화/복호화 방법 및 장치 |
US20080202042A1 (en) * | 2007-02-22 | 2008-08-28 | Azad Mesrobian | Drawworks and motor |
PL2165328T3 (pl) * | 2007-06-11 | 2018-06-29 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Kodowanie i dekodowanie sygnału audio zawierającego część impulsową i część stacjonarną |
US9653088B2 (en) * | 2007-06-13 | 2017-05-16 | Qualcomm Incorporated | Systems, methods, and apparatus for signal encoding using pitch-regularizing and non-pitch-regularizing coding |
EP2198426A4 (en) * | 2007-10-15 | 2012-01-18 | Lg Electronics Inc | METHOD AND DEVICE FOR PROCESSING A SIGNAL |
WO2010003254A1 (en) * | 2008-07-10 | 2010-01-14 | Voiceage Corporation | Multi-reference lpc filter quantization and inverse quantization device and method |
RU2515704C2 (ru) * | 2008-07-11 | 2014-05-20 | Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. | Аудиокодер и аудиодекодер для кодирования и декодирования отсчетов аудиосигнала |
EP2144230A1 (en) | 2008-07-11 | 2010-01-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Low bitrate audio encoding/decoding scheme having cascaded switches |
BR112012009032B1 (pt) * | 2009-10-20 | 2021-09-21 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e. V. | Codificador de sinal de áudio, decodificador de sinal de áudio, método para prover uma representação codificada de um conteúdo de áudio, método para prover uma representação decodificada de um conteúdo de áudio para uso em aplicações de baixo retardamento |
US8442837B2 (en) * | 2009-12-31 | 2013-05-14 | Motorola Mobility Llc | Embedded speech and audio coding using a switchable model core |
IL205394A (en) * | 2010-04-28 | 2016-09-29 | Verint Systems Ltd | A system and method for automatically identifying a speech encoding scheme |
CN105355209B (zh) | 2010-07-02 | 2020-02-14 | 杜比国际公司 | 音高增强后置滤波器 |
CN103180899B (zh) * | 2010-11-17 | 2015-07-22 | 松下电器(美国)知识产权公司 | 立体声信号的编码装置、解码装置、编码方法及解码方法 |
WO2014118136A1 (en) | 2013-01-29 | 2014-08-07 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for selecting one of a first audio encoding algorithm and a second audio encoding algorithm |
CN107452390B (zh) | 2014-04-29 | 2021-10-26 | 华为技术有限公司 | 音频编码方法及相关装置 |
CN107424622B (zh) * | 2014-06-24 | 2020-12-25 | 华为技术有限公司 | 音频编码方法和装置 |
EP2980794A1 (en) * | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder and decoder using a frequency domain processor and a time domain processor |
EP2980795A1 (en) * | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoding and decoding using a frequency domain processor, a time domain processor and a cross processor for initialization of the time domain processor |
EP3000110B1 (en) | 2014-07-28 | 2016-12-07 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Selection of one of a first encoding algorithm and a second encoding algorithm using harmonics reduction |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6134518A (en) * | 1997-03-04 | 2000-10-17 | International Business Machines Corporation | Digital audio signal coding using a CELP coder and a transform coder |
DE69926821T2 (de) | 1998-01-22 | 2007-12-06 | Deutsche Telekom Ag | Verfahren zur signalgesteuerten Schaltung zwischen verschiedenen Audiokodierungssystemen |
US6633841B1 (en) | 1999-07-29 | 2003-10-14 | Mindspeed Technologies, Inc. | Voice activity detection speech coding to accommodate music signals |
ATE341074T1 (de) | 2000-02-29 | 2006-10-15 | Qualcomm Inc | Multimodaler mischbereich-sprachkodierer mit geschlossener regelschleife |
WO2002023530A2 (en) | 2000-09-11 | 2002-03-21 | Matsushita Electric Industrial Co., Ltd. | Quantization of spectral sequences for audio signal coding |
US6658383B2 (en) | 2001-06-26 | 2003-12-02 | Microsoft Corporation | Method for coding speech and music signals |
US6785645B2 (en) * | 2001-11-29 | 2004-08-31 | Microsoft Corporation | Real-time speech and music classifier |
US7613606B2 (en) | 2003-10-02 | 2009-11-03 | Nokia Corporation | Speech codecs |
-
2004
- 2004-05-17 US US10/847,651 patent/US7739120B2/en active Active
-
2005
- 2005-04-06 AU AU2005242993A patent/AU2005242993A1/en not_active Abandoned
- 2005-04-06 MX MXPA06012579A patent/MXPA06012579A/es not_active Application Discontinuation
- 2005-04-06 KR KR1020087021059A patent/KR20080083719A/ko not_active Application Discontinuation
- 2005-04-06 WO PCT/IB2005/000924 patent/WO2005111567A1/en active Application Filing
- 2005-04-06 CN CNB200580015656XA patent/CN100485337C/zh active Active
- 2005-04-06 CA CA002566353A patent/CA2566353A1/en not_active Abandoned
- 2005-04-06 RU RU2006139795/28A patent/RU2006139795A/ru not_active Application Discontinuation
- 2005-04-06 AT AT05718394T patent/ATE479885T1/de not_active IP Right Cessation
- 2005-04-06 DE DE602005023295T patent/DE602005023295D1/de active Active
- 2005-04-06 JP JP2007517472A patent/JP2008503783A/ja not_active Withdrawn
- 2005-04-06 BR BRPI0511150-1A patent/BRPI0511150A/pt not_active IP Right Cessation
- 2005-04-06 EP EP05718394A patent/EP1747442B1/en active Active
- 2005-05-12 PE PE2005000527A patent/PE20060385A1/es not_active Application Discontinuation
- 2005-05-13 TW TW094115502A patent/TW200606815A/zh unknown
-
2006
- 2006-11-15 ZA ZA200609479A patent/ZA200609479B/xx unknown
-
2008
- 2008-04-21 HK HK08104429.5A patent/HK1110111A1/xx unknown
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101221766B (zh) * | 2008-01-23 | 2011-01-05 | 清华大学 | 音频编码器切换的方法 |
US7835906B1 (en) | 2009-05-31 | 2010-11-16 | Huawei Technologies Co., Ltd. | Encoding method, apparatus and device and decoding method |
CN104919524A (zh) * | 2012-11-13 | 2015-09-16 | 三星电子株式会社 | 用于确定编码模式的方法和设备、用于对音频信号进行编码的方法和设备以及用于对音频信号进行解码的方法和设备 |
CN104919524B (zh) * | 2012-11-13 | 2018-01-23 | 三星电子株式会社 | 用于确定编码模式的方法和设备、用于对音频信号进行编码的方法和设备以及用于对音频信号进行解码的方法和设备 |
US10468046B2 (en) | 2012-11-13 | 2019-11-05 | Samsung Electronics Co., Ltd. | Coding mode determination method and apparatus, audio encoding method and apparatus, and audio decoding method and apparatus |
US11004458B2 (en) | 2012-11-13 | 2021-05-11 | Samsung Electronics Co., Ltd. | Coding mode determination method and apparatus, audio encoding method and apparatus, and audio decoding method and apparatus |
Also Published As
Publication number | Publication date |
---|---|
HK1110111A1 (en) | 2008-07-04 |
ZA200609479B (en) | 2008-09-25 |
JP2008503783A (ja) | 2008-02-07 |
WO2005111567A1 (en) | 2005-11-24 |
PE20060385A1 (es) | 2006-05-19 |
BRPI0511150A (pt) | 2007-11-27 |
CA2566353A1 (en) | 2005-11-24 |
ATE479885T1 (de) | 2010-09-15 |
CN100485337C (zh) | 2009-05-06 |
DE602005023295D1 (de) | 2010-10-14 |
US20050256701A1 (en) | 2005-11-17 |
TW200606815A (en) | 2006-02-16 |
MXPA06012579A (es) | 2006-12-15 |
EP1747442B1 (en) | 2010-09-01 |
US7739120B2 (en) | 2010-06-15 |
RU2006139795A (ru) | 2008-06-27 |
EP1747442A1 (en) | 2007-01-31 |
KR20080083719A (ko) | 2008-09-18 |
AU2005242993A1 (en) | 2005-11-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN100485337C (zh) | 用于对音频信号进行编码的编码模型的选择 | |
CN1954365B (zh) | 使用不同编码模型的音频编码 | |
CN1954367B (zh) | 支持音频编码器模式间的转换 | |
CN1954364A (zh) | 带有不同编码帧长度的音频编码 | |
CN101681627B (zh) | 使用音调规则化及非音调规则化译码的信号编码方法及设备 | |
CN1957399B (zh) | 语音/音频解码装置以及语音/音频解码方法 | |
CN101320563B (zh) | 一种背景噪声编码/解码装置、方法和通信设备 | |
CN101622666B (zh) | 非因果后置滤波器 | |
CN101494055A (zh) | 用于码分多址无线系统的方法和装置 | |
CN1244090C (zh) | 具备背景噪声再现的语音编码 | |
CN102760441B (zh) | 一种背景噪声编码/解码装置、方法和通信设备 | |
KR20070017379A (ko) | 오디오 신호를 부호화하기 위한 부호화 모델들의 선택 | |
KR20080091305A (ko) | 서로 다른 코딩 모델들을 통한 오디오 인코딩 | |
Drygajilo | Speech Coding Techniques and Standards | |
KR20070017378A (ko) | 서로 다른 코딩 모델들을 통한 오디오 인코딩 | |
KR20070017380A (ko) | 서로 다른 코딩 프레임 길이의 오디오 인코딩 | |
ZA200609478B (en) | Audio encoding with different coding frame lengths |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1110111 Country of ref document: HK |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: GR Ref document number: 1110111 Country of ref document: HK |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20160206 Address after: Espoo, Finland Patentee after: Technology Co., Ltd. of Nokia Address before: Espoo, Finland Patentee before: Nokia Oyj |