BR0014212A - Sistema de compressão de conversação, módulo de processamento de excitação, e, corrente de bits que representa um quadro de um sinal de conversação - Google Patents
Sistema de compressão de conversação, módulo de processamento de excitação, e, corrente de bits que representa um quadro de um sinal de conversaçãoInfo
- Publication number
- BR0014212A BR0014212A BR0014212-3A BR0014212A BR0014212A BR 0014212 A BR0014212 A BR 0014212A BR 0014212 A BR0014212 A BR 0014212A BR 0014212 A BR0014212 A BR 0014212A
- Authority
- BR
- Brazil
- Prior art keywords
- conversation
- compression system
- rate
- speech
- frame
- Prior art date
Links
- 230000006835 compression Effects 0.000 title abstract 4
- 238000007906 compression Methods 0.000 title abstract 4
- 230000005284 excitation Effects 0.000 title 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03G—CONTROL OF AMPLIFICATION
- H03G3/00—Gain control in amplifiers or frequency changers
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
- Lubricants (AREA)
- Ink Jet (AREA)
- Graft Or Block Polymers (AREA)
Abstract
"SISTEMA DE COMPRESSãO DE CONVERSAçãO, MóDULO DE PROCESSAMENTO DE EXCITAçãO, E, CORRENTE DE BITS QUE REPRESENTA UM QUADRO DE UM SINAL DE CONVERSAçãO". Um sistema de compressão de conversação (10) capaz de codificar um sinal de conversação (18) em uma corrente de bits para subseq³ente decodificação para gerar conversação sintetizada (20) é revelado. O sistema de compressão de conversação (10) otimiza a largura de faixa consumida pela corrente de bits, equilibrando a desejada velocidade de bit média com a qualidade perceptual da conversação reconstruída. O sistema de compressão de conversação (10) compreende uma codec de velocidade total (22), uma codec de meia velocidade 24, uma codec de um quarto de velocidade (26) e uma codec de um oitavo de velocidade (28)AS codecs (22, 24-9 26 e 28) são seletivamente ativadas com base em uma seleção de velocidade. Em adição, as codecs de velocidade total e de meia velocidade (22 e 24) são ativadas seletivamente com base em uma classificação de tipo. Cada codec (22, 24, 26 e 28) é seletivamente ativada para codificar e decodificar o sinal de conversação (18) e diferentes velocidade de bits enfatizando diferentes aspectos do sinal de conversação (18) para realçar a qualidade global da conversação sintetizada (20).
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15532199P | 1999-09-22 | 1999-09-22 | |
US09/574,396 US6782360B1 (en) | 1999-09-22 | 2000-05-19 | Gain quantization for a CELP speech coder |
PCT/US2000/025182 WO2001022402A1 (en) | 1999-09-22 | 2000-09-15 | Multimode speech encoder |
Publications (2)
Publication Number | Publication Date |
---|---|
BR0014212A true BR0014212A (pt) | 2003-06-10 |
BRPI0014212B1 BRPI0014212B1 (pt) | 2016-07-26 |
Family
ID=26852220
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
BRPI0014212A BRPI0014212B1 (pt) | 1999-09-22 | 2000-09-15 | sistema de compressão de conversação para codificar e decodificar quadros de um sinal de conversação para gerar conversação sintetizada |
Country Status (8)
Country | Link |
---|---|
EP (1) | EP1214706B9 (pt) |
JP (2) | JP4176349B2 (pt) |
KR (1) | KR100488080B1 (pt) |
CN (1) | CN1245706C (pt) |
AT (1) | ATE272885T1 (pt) |
AU (1) | AU7486200A (pt) |
BR (1) | BRPI0014212B1 (pt) |
DE (1) | DE60012760T2 (pt) |
Families Citing this family (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100463418B1 (ko) * | 2002-11-11 | 2004-12-23 | 한국전자통신연구원 | Celp 음성 부호화기에서 사용되는 가변적인 고정코드북 검색방법 및 장치 |
FR2867649A1 (fr) * | 2003-12-10 | 2005-09-16 | France Telecom | Procede de codage multiple optimise |
WO2006098274A1 (ja) * | 2005-03-14 | 2006-09-21 | Matsushita Electric Industrial Co., Ltd. | スケーラブル復号化装置およびスケーラブル復号化方法 |
US7177804B2 (en) * | 2005-05-31 | 2007-02-13 | Microsoft Corporation | Sub-band voice codec with multi-stage codebooks and redundant coding |
CN101371296B (zh) * | 2006-01-18 | 2012-08-29 | Lg电子株式会社 | 用于编码和解码信号的设备和方法 |
US8451915B2 (en) | 2007-03-21 | 2013-05-28 | Samsung Electronics Co., Ltd. | Efficient uplink feedback in a wireless communication system |
KR20100006492A (ko) * | 2008-07-09 | 2010-01-19 | 삼성전자주식회사 | 부호화 방식 결정 방법 및 장치 |
CA2729665C (en) | 2008-07-10 | 2016-11-22 | Voiceage Corporation | Variable bit rate lpc filter quantizing and inverse quantizing device and method |
KR101170466B1 (ko) | 2008-07-29 | 2012-08-03 | 한국전자통신연구원 | Mdct 영역에서의 후처리 방법, 및 장치 |
JP2010122617A (ja) * | 2008-11-21 | 2010-06-03 | Yamaha Corp | ノイズゲート、及び収音装置 |
JP2010160496A (ja) * | 2010-02-15 | 2010-07-22 | Toshiba Corp | 信号処理装置および信号処理方法 |
US9047875B2 (en) * | 2010-07-19 | 2015-06-02 | Futurewei Technologies, Inc. | Spectrum flatness control for bandwidth extension |
DK2676271T3 (da) * | 2011-02-15 | 2020-08-24 | Voiceage Evs Llc | Anordning og fremgangsmåde til kvantisering af forstærkninger af adaptive og faste bidrag fra excitationen i en celp-koder-dekoder |
US9626982B2 (en) | 2011-02-15 | 2017-04-18 | Voiceage Corporation | Device and method for quantizing the gains of the adaptive and fixed contributions of the excitation in a CELP codec |
US9026434B2 (en) * | 2011-04-11 | 2015-05-05 | Samsung Electronic Co., Ltd. | Frame erasure concealment for a multi rate speech and audio codec |
US9336789B2 (en) * | 2013-02-21 | 2016-05-10 | Qualcomm Incorporated | Systems and methods for determining an interpolation factor set for synthesizing a speech signal |
CN104517612B (zh) * | 2013-09-30 | 2018-10-12 | 上海爱聊信息科技有限公司 | 基于amr-nb语音信号的可变码率编码器和解码器及其编码和解码方法 |
JP5981408B2 (ja) * | 2013-10-29 | 2016-08-31 | 株式会社Nttドコモ | 音声信号処理装置、音声信号処理方法、及び音声信号処理プログラム |
KR20240010550A (ko) | 2014-03-28 | 2024-01-23 | 삼성전자주식회사 | 선형예측계수 양자화방법 및 장치와 역양자화 방법 및 장치 |
WO2015170899A1 (ko) | 2014-05-07 | 2015-11-12 | 삼성전자 주식회사 | 선형예측계수 양자화방법 및 장치와 역양자화 방법 및 장치 |
JP6170575B2 (ja) * | 2014-07-28 | 2017-07-26 | テレフオンアクチーボラゲット エルエム エリクソン(パブル) | ピラミッドベクトル量子化器形状サーチ |
US10109284B2 (en) * | 2016-02-12 | 2018-10-23 | Qualcomm Incorporated | Inter-channel encoding and decoding of multiple high-band audio signals |
US10373630B2 (en) * | 2017-03-31 | 2019-08-06 | Intel Corporation | Systems and methods for energy efficient and low power distributed automatic speech recognition on wearable devices |
CN111183476B (zh) * | 2017-10-06 | 2024-03-22 | 索尼欧洲有限公司 | 基于子窗口序列内的rms功率的音频文件包络 |
CN108122552B (zh) * | 2017-12-15 | 2021-10-15 | 上海智臻智能网络科技股份有限公司 | 语音情绪识别方法和装置 |
CN113593521B (zh) * | 2021-07-29 | 2022-09-20 | 北京三快在线科技有限公司 | 语音合成方法、装置、设备及可读存储介质 |
CN118430508B (zh) * | 2024-05-29 | 2024-09-17 | 中国矿业大学 | 基于神经音频编解码器的语音合成方法 |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3353852B2 (ja) * | 1994-02-15 | 2002-12-03 | 日本電信電話株式会社 | 音声の符号化方法 |
US5701390A (en) * | 1995-02-22 | 1997-12-23 | Digital Voice Systems, Inc. | Synthesis of MBE-based coded speech using regenerated phase information |
-
2000
- 2000-09-12 AU AU74862/00A patent/AU7486200A/en not_active Abandoned
- 2000-09-15 KR KR10-2002-7003768A patent/KR100488080B1/ko active IP Right Grant
- 2000-09-15 DE DE60012760T patent/DE60012760T2/de not_active Expired - Lifetime
- 2000-09-15 AT AT00963447T patent/ATE272885T1/de not_active IP Right Cessation
- 2000-09-15 BR BRPI0014212A patent/BRPI0014212B1/pt not_active IP Right Cessation
- 2000-09-15 EP EP00963447A patent/EP1214706B9/en not_active Expired - Lifetime
- 2000-09-15 CN CNB008159408A patent/CN1245706C/zh not_active Expired - Fee Related
- 2000-09-15 JP JP2001525686A patent/JP4176349B2/ja not_active Expired - Fee Related
-
2005
- 2005-07-11 JP JP2005202337A patent/JP2005338872A/ja active Pending
Also Published As
Publication number | Publication date |
---|---|
JP4176349B2 (ja) | 2008-11-05 |
CN1451155A (zh) | 2003-10-22 |
CN1245706C (zh) | 2006-03-15 |
KR20020033819A (ko) | 2002-05-07 |
ATE272885T1 (de) | 2004-08-15 |
EP1214706A1 (en) | 2002-06-19 |
DE60012760T2 (de) | 2005-08-04 |
EP1214706B1 (en) | 2004-08-04 |
JP2003513296A (ja) | 2003-04-08 |
KR100488080B1 (ko) | 2005-05-06 |
AU7486200A (en) | 2001-04-24 |
DE60012760D1 (de) | 2004-09-09 |
EP1214706B9 (en) | 2005-01-05 |
JP2005338872A (ja) | 2005-12-08 |
BRPI0014212B1 (pt) | 2016-07-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
BR0014212A (pt) | Sistema de compressão de conversação, módulo de processamento de excitação, e, corrente de bits que representa um quadro de um sinal de conversação | |
AU2001287969A1 (en) | Codebook structure and search for speech coding | |
BR9805989B1 (pt) | método e aparelho para decodificar um sinal codificado. | |
AU2003278014A1 (en) | Methods for interoperation between adaptive multi-rate wideband (amr-wb) and multi-mode variable bit-rate wideband (wmr-wb) speech codecs | |
Valin | The speex codec manual version 1.2 beta 3 | |
US20050261900A1 (en) | Supporting a switch between audio coder modes | |
FI932465A0 (fi) | CELP-baserad talkompressor | |
WO2004006226A1 (en) | Method and device for efficient in-band dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for cdma wireless systems | |
DK1222659T3 (da) | LPC-harmonisk talekoder med superramme-struktur | |
ATE326122T1 (de) | Fgs dekodierung unter kontrolle eines im dekoder kalkulierten bildqualitätsparameters | |
CA2440348A1 (en) | Testing loops for channel codecs | |
EP1204092A3 (en) | Speech decoder capable of decoding background noise signal with high quality | |
DE60027140D1 (de) | Sprachsynthetisierer auf der basis von sprachkodierung mit veränderlicher bit-rate | |
WO2002023533A3 (en) | System for improved use of pitch enhancement with subcodebooks | |
Wang et al. | Transcoding Scheme between AMR-WB and VMR-WB | |
WO2002023537A8 (en) | System for enhancing perceptual quality of decoded speech | |
CA2491623C (en) | Method and device for efficient in-band dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for cdma wireless systems | |
WO2003042648A1 (fr) | Codeur de signal vocal, decodeur de signal vocal, procede de codage de signal vocal et procede de decodage de signal vocal | |
Xu et al. | A Novel Transcoding Algorithm between 3GPP AMR-NB (7.95 kbit/s) and ITU-T G. 729a (8kbit/s) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
B07A | Application suspended after technical examination (opinion) [chapter 7.1 patent gazette] | ||
B15K | Others concerning applications: alteration of classification |
Free format text: A CLASSIFICACAO ANTERIOR ERA: G10L 19/14 Ipc: G10L 19/10 (2013.01) |
|
B06A | Patent application procedure suspended [chapter 6.1 patent gazette] | ||
B09A | Decision: intention to grant [chapter 9.1 patent gazette] | ||
B16A | Patent or certificate of addition of invention granted [chapter 16.1 patent gazette] |
Free format text: PRAZO DE VALIDADE: 10 (DEZ) ANOS CONTADOS A PARTIR DE 26/07/2016, OBSERVADAS AS CONDICOES LEGAIS. |
|
B21F | Lapse acc. art. 78, item iv - on non-payment of the annual fees in time |
Free format text: REFERENTE A 24A ANUIDADE. |