ES2269112T3 - Codificador de voz multimodal en bucle cerrado de dominio mixto. - Google Patents
Codificador de voz multimodal en bucle cerrado de dominio mixto. Download PDFInfo
- Publication number
- ES2269112T3 ES2269112T3 ES00912053T ES00912053T ES2269112T3 ES 2269112 T3 ES2269112 T3 ES 2269112T3 ES 00912053 T ES00912053 T ES 00912053T ES 00912053 T ES00912053 T ES 00912053T ES 2269112 T3 ES2269112 T3 ES 2269112T3
- Authority
- ES
- Spain
- Prior art keywords
- voice
- frame
- coding
- domain
- encoder
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000000034 method Methods 0.000 claims description 51
- 238000001228 spectrum Methods 0.000 claims description 15
- 238000012545 processing Methods 0.000 claims description 14
- 238000010187 selection method Methods 0.000 claims description 3
- 230000007704 transition Effects 0.000 abstract description 16
- 230000007246 mechanism Effects 0.000 abstract description 5
- 230000003595 spectral effect Effects 0.000 description 45
- 230000005540 biological transmission Effects 0.000 description 17
- 206010011878 Deafness Diseases 0.000 description 15
- 230000015572 biosynthetic process Effects 0.000 description 14
- 238000003786 synthesis reaction Methods 0.000 description 14
- 230000000737 periodic effect Effects 0.000 description 10
- 239000002699 waste material Substances 0.000 description 10
- 238000011002 quantification Methods 0.000 description 6
- 230000001755 vocal effect Effects 0.000 description 6
- 238000004891 communication Methods 0.000 description 5
- 230000006835 compression Effects 0.000 description 5
- 238000007906 compression Methods 0.000 description 5
- 238000010586 diagram Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 230000005284 excitation Effects 0.000 description 4
- 238000013139 quantization Methods 0.000 description 4
- 238000005070 sampling Methods 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- 239000000284 extract Substances 0.000 description 3
- 230000002123 temporal effect Effects 0.000 description 3
- 230000008901 benefit Effects 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 238000010183 spectrum analysis Methods 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 238000005311 autocorrelation function Methods 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 231100000895 deafness Toxicity 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000009792 diffusion process Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 208000016354 hearing loss disease Diseases 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 231100000572 poisoning Toxicity 0.000 description 1
- 230000000607 poisoning effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012827 research and development Methods 0.000 description 1
- 238000010845 search algorithm Methods 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Physical Or Chemical Processes And Apparatus (AREA)
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| PCT/US2000/005140 WO2001065544A1 (en) | 2000-02-29 | 2000-02-29 | Closed-loop multimode mixed-domain linear prediction speech coder |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| ES2269112T3 true ES2269112T3 (es) | 2007-04-01 |
Family
ID=21741098
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| ES00912053T Expired - Lifetime ES2269112T3 (es) | 2000-02-29 | 2000-02-29 | Codificador de voz multimodal en bucle cerrado de dominio mixto. |
Country Status (9)
| Country | Link |
|---|---|
| EP (1) | EP1259957B1 (enExample) |
| JP (1) | JP4907826B2 (enExample) |
| KR (1) | KR100711047B1 (enExample) |
| CN (1) | CN1266674C (enExample) |
| AT (1) | ATE341074T1 (enExample) |
| AU (1) | AU2000233851A1 (enExample) |
| DE (1) | DE60031002T2 (enExample) |
| ES (1) | ES2269112T3 (enExample) |
| WO (1) | WO2001065544A1 (enExample) |
Families Citing this family (29)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6438518B1 (en) * | 1999-10-28 | 2002-08-20 | Qualcomm Incorporated | Method and apparatus for using coding scheme selection patterns in a predictive speech coder to reduce sensitivity to frame error conditions |
| CA2392640A1 (en) | 2002-07-05 | 2004-01-05 | Voiceage Corporation | A method and device for efficient in-based dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for cdma wireless systems |
| EP1700266A4 (en) * | 2003-12-19 | 2010-01-20 | Creative Tech Ltd | METHOD AND SYSTEM FOR PROCESSING A DIGITAL IMAGE |
| US7739120B2 (en) | 2004-05-17 | 2010-06-15 | Nokia Corporation | Selection of coding models for encoding an audio signal |
| CN101283406B (zh) * | 2005-10-05 | 2013-06-19 | Lg电子株式会社 | 信号处理的方法和装置以及编码和解码方法及其装置 |
| JP5319286B2 (ja) * | 2005-10-05 | 2013-10-16 | エルジー エレクトロニクス インコーポレイティド | データ処理方法及び装置、エンコーディング及びデコーディング方法並びにそのための装置 |
| KR100647336B1 (ko) * | 2005-11-08 | 2006-11-23 | 삼성전자주식회사 | 적응적 시간/주파수 기반 오디오 부호화/복호화 장치 및방법 |
| KR101390188B1 (ko) * | 2006-06-21 | 2014-04-30 | 삼성전자주식회사 | 적응적 고주파수영역 부호화 및 복호화 방법 및 장치 |
| US9159333B2 (en) | 2006-06-21 | 2015-10-13 | Samsung Electronics Co., Ltd. | Method and apparatus for adaptively encoding and decoding high frequency band |
| WO2007148925A1 (en) | 2006-06-21 | 2007-12-27 | Samsung Electronics Co., Ltd. | Method and apparatus for adaptively encoding and decoding high frequency band |
| CN101145345B (zh) * | 2006-09-13 | 2011-02-09 | 华为技术有限公司 | 音频分类方法 |
| KR101131880B1 (ko) * | 2007-03-23 | 2012-04-03 | 삼성전자주식회사 | 오디오 신호의 인코딩 방법 및 장치, 그리고 오디오 신호의디코딩 방법 및 장치 |
| DE112007003567A5 (de) * | 2007-04-26 | 2010-04-08 | Siemens Aktiengesellschaft | Baugruppe mit automatischer Erweiterung eines Überwachungskreises |
| KR101756834B1 (ko) | 2008-07-14 | 2017-07-12 | 삼성전자주식회사 | 오디오/스피치 신호의 부호화 및 복호화 방법 및 장치 |
| US8990094B2 (en) * | 2010-09-13 | 2015-03-24 | Qualcomm Incorporated | Coding and decoding a transient frame |
| MY159444A (en) | 2011-02-14 | 2017-01-13 | Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E V | Encoding and decoding of pulse positions of tracks of an audio signal |
| BR112013020588B1 (pt) | 2011-02-14 | 2021-07-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Aparelho e método para codificação de uma parte de um sinal de áudio utilizando uma detecção transiente e um resultado de qualidade |
| ES2639646T3 (es) | 2011-02-14 | 2017-10-27 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Codificación y decodificación de posiciones de impulso de pistas de una señal de audio |
| MX2013009306A (es) | 2011-02-14 | 2013-09-26 | Fraunhofer Ges Forschung | Aparato y metodo para codificar y decodificar una señal de audio utilizando una porcion alineada anticipada. |
| BR112013020592B1 (pt) | 2011-02-14 | 2021-06-22 | Fraunhofer-Gellschaft Zur Fôrderung Der Angewandten Forschung E. V. | Codec de áudio utilizando síntese de ruído durante fases inativas |
| WO2012110447A1 (en) | 2011-02-14 | 2012-08-23 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for error concealment in low-delay unified speech and audio coding (usac) |
| RU2580924C2 (ru) | 2011-02-14 | 2016-04-10 | Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. | Представление информационного сигнала с использованием преобразования с перекрытием |
| ES2535609T3 (es) | 2011-02-14 | 2015-05-13 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Codificador de audio con estimación de ruido de fondo durante fases activas |
| EP2676268B1 (en) | 2011-02-14 | 2014-12-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for processing a decoded audio signal in a spectral domain |
| EP2757558A1 (en) * | 2013-01-18 | 2014-07-23 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Time domain level adjustment for audio signal decoding or encoding |
| US9685166B2 (en) | 2014-07-26 | 2017-06-20 | Huawei Technologies Co., Ltd. | Classification between time-domain coding and frequency domain coding |
| EP3067886A1 (en) | 2015-03-09 | 2016-09-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal |
| US10957331B2 (en) | 2018-12-17 | 2021-03-23 | Microsoft Technology Licensing, Llc | Phase reconstruction in a speech decoder |
| CN115343007B (zh) * | 2022-07-20 | 2024-11-08 | 上海卫星工程研究所 | 卫星运输冲击过程的等效正弦模拟方法和系统 |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO1986005617A1 (en) * | 1985-03-18 | 1986-09-25 | Massachusetts Institute Of Technology | Processing of acoustic waveforms |
| US5023910A (en) * | 1988-04-08 | 1991-06-11 | At&T Bell Laboratories | Vector quantization in a harmonic speech coding arrangement |
| JPH02288739A (ja) * | 1989-04-28 | 1990-11-28 | Fujitsu Ltd | 音声符号復号化伝送方式 |
| JP3680374B2 (ja) * | 1995-09-28 | 2005-08-10 | ソニー株式会社 | 音声合成方法 |
| JPH10214100A (ja) * | 1997-01-31 | 1998-08-11 | Sony Corp | 音声合成方法 |
| US6233550B1 (en) * | 1997-08-29 | 2001-05-15 | The Regents Of The University Of California | Method and apparatus for hybrid coding of speech at 4kbps |
| ES2247741T3 (es) * | 1998-01-22 | 2006-03-01 | Deutsche Telekom Ag | Metodo para conmutacion controlada por señales entre esquemas de codificacion de audio. |
| JPH11224099A (ja) * | 1998-02-06 | 1999-08-17 | Sony Corp | 位相量子化装置及び方法 |
-
2000
- 2000-02-29 AU AU2000233851A patent/AU2000233851A1/en not_active Abandoned
- 2000-02-29 WO PCT/US2000/005140 patent/WO2001065544A1/en not_active Ceased
- 2000-02-29 CN CNB008192219A patent/CN1266674C/zh not_active Expired - Lifetime
- 2000-02-29 AT AT00912053T patent/ATE341074T1/de not_active IP Right Cessation
- 2000-02-29 ES ES00912053T patent/ES2269112T3/es not_active Expired - Lifetime
- 2000-02-29 DE DE60031002T patent/DE60031002T2/de not_active Expired - Lifetime
- 2000-02-29 EP EP00912053A patent/EP1259957B1/en not_active Expired - Lifetime
- 2000-02-29 KR KR1020027011306A patent/KR100711047B1/ko not_active Expired - Lifetime
- 2000-02-29 JP JP2001564148A patent/JP4907826B2/ja not_active Expired - Lifetime
Also Published As
| Publication number | Publication date |
|---|---|
| EP1259957B1 (en) | 2006-09-27 |
| KR20020081374A (ko) | 2002-10-26 |
| JP2003525473A (ja) | 2003-08-26 |
| CN1266674C (zh) | 2006-07-26 |
| KR100711047B1 (ko) | 2007-04-24 |
| EP1259957A1 (en) | 2002-11-27 |
| ATE341074T1 (de) | 2006-10-15 |
| DE60031002D1 (de) | 2006-11-09 |
| JP4907826B2 (ja) | 2012-04-04 |
| WO2001065544A1 (en) | 2001-09-07 |
| HK1055833A1 (en) | 2004-01-21 |
| CN1437747A (zh) | 2003-08-20 |
| DE60031002T2 (de) | 2007-05-10 |
| AU2000233851A1 (en) | 2001-09-12 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| ES2269112T3 (es) | Codificador de voz multimodal en bucle cerrado de dominio mixto. | |
| US6640209B1 (en) | Closed-loop multimode mixed-domain linear prediction (MDLP) speech coder | |
| KR100805983B1 (ko) | 가변율 음성 코더에서 프레임 소거를 보상하는 방법 | |
| ES2276845T3 (es) | Metodos y aparatos para la clasificacion de voz robusta. | |
| US7426466B2 (en) | Method and apparatus for quantizing pitch, amplitude, phase and linear spectrum of voiced speech | |
| EP1141947B1 (en) | Variable rate speech coding | |
| ES2302754T3 (es) | Procedimiento y aparato para codificacion de habla sorda. | |
| WO2005104095A1 (en) | Signal encoding | |
| ES2253226T3 (es) | Codigo interpolativo multipulso de tramas de voz. | |
| ES2297578T3 (es) | Procedimiento y aparato para submuestrear informacion del espectro de fase. | |
| ES2276690T3 (es) | Particion de espectro de frecuencia de una forma de onda prototipo. | |
| US6449592B1 (en) | Method and apparatus for tracking the phase of a quasi-periodic signal | |
| ES2254155T3 (es) | Procedimiento y aparato para realizar el seguimiento de la fase de una señal casi periodica. | |
| KR100711040B1 (ko) | 유사주기 신호의 위상을 추적하는 방법 및 장치 | |
| JP2011090311A (ja) | 閉ループのマルチモードの混合領域の線形予測音声コーダ | |
| HK1055834B (en) | Method and apparatus for tracking the phase of a quasi-periodic signal | |
| HK1091583B (en) | Method and apparatus for subsampling phase spectrum information | |
| HK1114684A (en) | Frame erasure compensation method in a variable rate speech coder |