CN1653521B - 用于音频代码转换中的自适应码本音调滞后计算的方法 - Google Patents
用于音频代码转换中的自适应码本音调滞后计算的方法 Download PDFInfo
- Publication number
- CN1653521B CN1653521B CN038106450A CN03810645A CN1653521B CN 1653521 B CN1653521 B CN 1653521B CN 038106450 A CN038106450 A CN 038106450A CN 03810645 A CN03810645 A CN 03810645A CN 1653521 B CN1653521 B CN 1653521B
- Authority
- CN
- China
- Prior art keywords
- subframe
- tone
- input
- sluggish
- module
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 230000003044 adaptive effect Effects 0.000 title claims description 61
- 238000000034 method Methods 0.000 title claims description 36
- 230000008878 coupling Effects 0.000 claims description 5
- 238000010168 coupling process Methods 0.000 claims description 5
- 238000005859 coupling reaction Methods 0.000 claims description 5
- 238000005070 sampling Methods 0.000 claims description 3
- 238000004364 calculation method Methods 0.000 abstract description 6
- 206010041052 Sluggishness Diseases 0.000 description 49
- 238000006243 chemical reaction Methods 0.000 description 29
- 230000008901 benefit Effects 0.000 description 6
- 238000012986 modification Methods 0.000 description 6
- 230000004048 modification Effects 0.000 description 6
- 238000004422 calculation algorithm Methods 0.000 description 5
- 230000006835 compression Effects 0.000 description 5
- 238000007906 compression Methods 0.000 description 5
- 238000010586 diagram Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000007796 conventional method Methods 0.000 description 2
- 230000003760 hair shine Effects 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000004913 activation Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 210000000088 lip Anatomy 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 210000000214 mouth Anatomy 0.000 description 1
- 210000003928 nasal cavity Anatomy 0.000 description 1
- 238000012856 packing Methods 0.000 description 1
- 238000005086 pumping Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/173—Transcoding, i.e. converting between two coded representations avoiding cascaded coding-decoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/09—Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims (20)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US36440302P | 2002-03-12 | 2002-03-12 | |
US60/364,403 | 2002-03-12 | ||
PCT/US2003/007901 WO2003079330A1 (en) | 2002-03-12 | 2003-03-12 | Method for adaptive codebook pitch-lag computation in audio transcoders |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1653521A CN1653521A (zh) | 2005-08-10 |
CN1653521B true CN1653521B (zh) | 2010-05-26 |
Family
ID=28041908
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN038106450A Expired - Fee Related CN1653521B (zh) | 2002-03-12 | 2003-03-12 | 用于音频代码转换中的自适应码本音调滞后计算的方法 |
Country Status (7)
Country | Link |
---|---|
US (2) | US7260524B2 (zh) |
EP (1) | EP1483758A4 (zh) |
JP (1) | JP2005520206A (zh) |
KR (1) | KR20040104508A (zh) |
CN (1) | CN1653521B (zh) |
AU (1) | AU2003214182A1 (zh) |
WO (1) | WO2003079330A1 (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9263051B2 (en) | 2009-01-06 | 2016-02-16 | Skype | Speech coding by quantizing with random-noise signal |
US9530423B2 (en) | 2009-01-06 | 2016-12-27 | Skype | Speech encoding by determining a quantization gain based on inverse of a pitch correlation |
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2003079330A1 (en) * | 2002-03-12 | 2003-09-25 | Dilithium Networks Pty Limited | Method for adaptive codebook pitch-lag computation in audio transcoders |
KR100546758B1 (ko) * | 2003-06-30 | 2006-01-26 | 한국전자통신연구원 | 음성의 상호부호화시 전송률 결정 장치 및 방법 |
US7433815B2 (en) * | 2003-09-10 | 2008-10-07 | Dilithium Networks Pty Ltd. | Method and apparatus for voice transcoding between variable rate coders |
US7519532B2 (en) * | 2003-09-29 | 2009-04-14 | Texas Instruments Incorporated | Transcoding EVRC to G.729ab |
US9058812B2 (en) * | 2005-07-27 | 2015-06-16 | Google Technology Holdings LLC | Method and system for coding an information signal using pitch delay contour adjustment |
US7602745B2 (en) * | 2005-12-05 | 2009-10-13 | Intel Corporation | Multiple input, multiple output wireless communication system, associated methods and data structures |
KR100900438B1 (ko) * | 2006-04-25 | 2009-06-01 | 삼성전자주식회사 | 음성 패킷 복구 장치 및 방법 |
US8218529B2 (en) * | 2006-07-07 | 2012-07-10 | Avaya Canada Corp. | Device for and method of terminating a VoIP call |
EP1903559A1 (en) * | 2006-09-20 | 2008-03-26 | Deutsche Thomson-Brandt Gmbh | Method and device for transcoding audio signals |
GB2466672B (en) | 2009-01-06 | 2013-03-13 | Skype | Speech coding |
GB2466670B (en) | 2009-01-06 | 2012-11-14 | Skype | Speech encoding |
GB2466673B (en) | 2009-01-06 | 2012-11-07 | Skype | Quantization |
GB2466669B (en) * | 2009-01-06 | 2013-03-06 | Skype | Speech coding |
US8243610B2 (en) * | 2009-04-21 | 2012-08-14 | Futurewei Technologies, Inc. | System and method for precoding codebook adaptation with low feedback overhead |
EP2249334A1 (en) * | 2009-05-08 | 2010-11-10 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio format transcoder |
US8452606B2 (en) | 2009-09-29 | 2013-05-28 | Skype | Speech encoding using multiple bit rates |
US8521541B2 (en) * | 2010-11-02 | 2013-08-27 | Google Inc. | Adaptive audio transcoding |
CN104243734B (zh) * | 2013-06-18 | 2019-03-01 | 深圳市共进电子股份有限公司 | 音频处理系统和方法 |
ES2671006T3 (es) | 2013-06-21 | 2018-06-04 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Reconstrucción de una trama de voz |
TWI569262B (zh) | 2013-06-21 | 2017-02-01 | 弗勞恩霍夫爾協會 | 用於將經編碼音訊信號解碼以獲得經重構音訊信號之裝置及方法與相關電腦程式 |
JP6482540B2 (ja) * | 2013-06-21 | 2019-03-13 | フラウンホーファーゲゼルシャフト ツール フォルデルング デル アンゲヴァンテン フォルシユング エー.フアー. | 改善されたピッチラグ推定を採用するacelp型封じ込めにおける適応型コードブックの改善された封じ込めのための装置および方法 |
EP2980799A1 (en) * | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for processing an audio signal using a harmonic post-filter |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5995923A (en) * | 1997-06-26 | 1999-11-30 | Nortel Networks Corporation | Method and apparatus for improving the voice quality of tandemed vocoders |
US6115687A (en) * | 1996-11-11 | 2000-09-05 | Matsushita Electric Industrial Co., Ltd. | Sound reproducing speed converter |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH08146997A (ja) | 1994-11-21 | 1996-06-07 | Hitachi Ltd | 符号変換装置および符号変換システム |
US6260009B1 (en) * | 1999-02-12 | 2001-07-10 | Qualcomm Incorporated | CELP-based to CELP-based vocoder packet translation |
WO2001020595A1 (en) * | 1999-09-14 | 2001-03-22 | Fujitsu Limited | Voice encoder/decoder |
US6760698B2 (en) * | 2000-09-15 | 2004-07-06 | Mindspeed Technologies Inc. | System for coding speech information using an adaptive codebook with enhanced variable resolution scheme |
JP2002202799A (ja) * | 2000-10-30 | 2002-07-19 | Fujitsu Ltd | 音声符号変換装置 |
JP2002229599A (ja) | 2001-02-02 | 2002-08-16 | Nec Corp | 音声符号列の変換装置および変換方法 |
WO2003079330A1 (en) * | 2002-03-12 | 2003-09-25 | Dilithium Networks Pty Limited | Method for adaptive codebook pitch-lag computation in audio transcoders |
JP2004222009A (ja) | 2003-01-16 | 2004-08-05 | Nec Corp | 異種網接続ゲートウェイおよび異種網間通信課金システム |
-
2003
- 2003-03-12 WO PCT/US2003/007901 patent/WO2003079330A1/en active Application Filing
- 2003-03-12 CN CN038106450A patent/CN1653521B/zh not_active Expired - Fee Related
- 2003-03-12 JP JP2003577246A patent/JP2005520206A/ja not_active Withdrawn
- 2003-03-12 KR KR10-2004-7014297A patent/KR20040104508A/ko not_active Application Discontinuation
- 2003-03-12 EP EP03711590A patent/EP1483758A4/en not_active Withdrawn
- 2003-03-12 AU AU2003214182A patent/AU2003214182A1/en not_active Abandoned
- 2003-03-12 US US10/350,349 patent/US7260524B2/en not_active Expired - Fee Related
-
2007
- 2007-07-26 US US11/881,742 patent/US7996217B2/en not_active Expired - Fee Related
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6115687A (en) * | 1996-11-11 | 2000-09-05 | Matsushita Electric Industrial Co., Ltd. | Sound reproducing speed converter |
US5995923A (en) * | 1997-06-26 | 1999-11-30 | Nortel Networks Corporation | Method and apparatus for improving the voice quality of tandemed vocoders |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9263051B2 (en) | 2009-01-06 | 2016-02-16 | Skype | Speech coding by quantizing with random-noise signal |
US9530423B2 (en) | 2009-01-06 | 2016-12-27 | Skype | Speech encoding by determining a quantization gain based on inverse of a pitch correlation |
Also Published As
Publication number | Publication date |
---|---|
US20040002855A1 (en) | 2004-01-01 |
JP2005520206A (ja) | 2005-07-07 |
AU2003214182A1 (en) | 2003-09-29 |
KR20040104508A (ko) | 2004-12-10 |
CN1653521A (zh) | 2005-08-10 |
WO2003079330A1 (en) | 2003-09-25 |
US7260524B2 (en) | 2007-08-21 |
EP1483758A1 (en) | 2004-12-08 |
US20080189101A1 (en) | 2008-08-07 |
US7996217B2 (en) | 2011-08-09 |
EP1483758A4 (en) | 2007-04-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1653521B (zh) | 用于音频代码转换中的自适应码本音调滞后计算的方法 | |
RU2675044C1 (ru) | Способ квантования коэффициентов кодирования с линейным предсказанием, способ кодирования звука, способ деквантования коэффициентов кодирования с линейным предсказанием, способ декодирования звука и носитель записи | |
TW519616B (en) | Method and apparatus for predictively quantizing voiced speech | |
US6625576B2 (en) | Method and apparatus for performing text-to-speech conversion in a client/server environment | |
CN104040626B (zh) | 多译码模式信号分类 | |
KR100837451B1 (ko) | 향상된 품질의 음성 변환부호화를 위한 방법 및 장치 | |
US6119086A (en) | Speech coding via speech recognition and synthesis based on pre-enrolled phonetic tokens | |
JP2004501391A (ja) | 可変レート音声符号器におけるフレーム消去補償方法 | |
KR20040028784A (ko) | 분산형 음성 인식 시스템에서 음성 활성을 송신하는 방법및 장치 | |
JP4511094B2 (ja) | 音声コーダにおける線スペクトル情報量子化方法を交錯するための方法および装置 | |
US20230197061A1 (en) | Method and System for Outputting Target Audio, Readable Storage Medium, and Electronic Device | |
CN102934162A (zh) | 搜索随后被重放的包括基本层和至少一个增强层分层分级比特流的方法和设备 | |
JP2003036097A (ja) | 情報検出装置及び方法、並びに情報検索装置及び方法 | |
US20020128826A1 (en) | Speech recognition system and method, and information processing apparatus and method used in that system | |
CN112908293B (zh) | 一种基于语义注意力机制的多音字发音纠错方法及装置 | |
US20040024589A1 (en) | Transmission apparatus, transmission method, reception apparatus, reception method, and transmission/reception apparatus | |
US20080162150A1 (en) | System and Method for a High Performance Audio Codec | |
JPH05265496A (ja) | 複数のコードブックを有する音声符号化方法 | |
US7200557B2 (en) | Method of reducing index sizes used to represent spectral content vectors | |
JP3700310B2 (ja) | ベクトル量子化装置及びベクトル量子化方法 | |
Huong et al. | A new vocoder based on AMR 7.4 kbit/s mode in speaker dependent coding system | |
JP4932530B2 (ja) | 音響処理装置、音響処理方法、音響処理プログラム、照合処理装置、照合処理方法及び照合処理プログラム | |
JPH09120300A (ja) | ベクトル量子化装置 | |
JPH08171400A (ja) | 音声符号化装置 | |
US7031914B2 (en) | Systems and methods for concatenating electronically encoded voice |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: ONMOBILE GLOBAL LTD. Free format text: FORMER OWNER: DILITHIUM (ASSIGNMENT FOR THE BENEFIT OF CREDITORS) INC. Effective date: 20130220 Owner name: DILITHIUM (ASSIGNMENT FOR THE BENEFIT OF CREDITORS Free format text: FORMER OWNER: DILITHIUM NETWORKS INC. Effective date: 20130220 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20130220 Address after: bangalore Patentee after: DILITHIUM NETWORKS, Inc. Address before: California, USA Patentee before: Di Lee Sim (for the benefit of creditors) Ltd. Effective date of registration: 20130220 Address after: California, USA Patentee after: Di Lee Sim (for the benefit of creditors) Ltd. Address before: California, USA Patentee before: Di Lee Sim Network Inc. Effective date of registration: 20130220 Address after: California, USA Patentee after: Di Lee Sim Network Inc. Address before: New South Wales Patentee before: DILITHIUM NETWORKS Pty Ltd. |
|
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20100526 Termination date: 20150312 |
|
EXPY | Termination of patent right or utility model |