BR0014212A - Speech compression system, excitation processing module, and bitstream representing a frame of a speech signal

Speech compression system, excitation processing module, and bitstream representing a frame of a speech signal

Info

Publication number
BR0014212A
Authority
BR
Brazil
Prior art keywords
conversation
compression system
rate
speech
frame
Prior art date
Application number
BR0014212-3A
Other languages
English (en)
Other versions
BRPI0014212B1 (pt)
Inventor
Yang Gao
Adil Benyassine
Jes Thyssen
Eyal Sholomot
Huang-Yu Su
Original Assignee
Conexant Systems Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/574,396 (US6782360B1)
Application filed by Conexant Systems Inc
Publication of BR0014212A
Publication of BRPI0014212B1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/167 Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03G CONTROL OF AMPLIFICATION
    • H03G3/00 Gain control in amplifiers or frequency changers
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/24 Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
  • Lubricants (AREA)
  • Ink Jet (AREA)
  • Graft Or Block Polymers (AREA)

Abstract

"SISTEMA DE COMPRESSãO DE CONVERSAçãO, MóDULO DE PROCESSAMENTO DE EXCITAçãO, E, CORRENTE DE BITS QUE REPRESENTA UM QUADRO DE UM SINAL DE CONVERSAçãO". Um sistema de compressão de conversação (10) capaz de codificar um sinal de conversação (18) em uma corrente de bits para subseq³ente decodificação para gerar conversação sintetizada (20) é revelado. O sistema de compressão de conversação (10) otimiza a largura de faixa consumida pela corrente de bits, equilibrando a desejada velocidade de bit média com a qualidade perceptual da conversação reconstruída. O sistema de compressão de conversação (10) compreende uma codec de velocidade total (22), uma codec de meia velocidade 24, uma codec de um quarto de velocidade (26) e uma codec de um oitavo de velocidade (28)AS codecs (22, 24-9 26 e 28) são seletivamente ativadas com base em uma seleção de velocidade. Em adição, as codecs de velocidade total e de meia velocidade (22 e 24) são ativadas seletivamente com base em uma classificação de tipo. Cada codec (22, 24, 26 e 28) é seletivamente ativada para codificar e decodificar o sinal de conversação (18) e diferentes velocidade de bits enfatizando diferentes aspectos do sinal de conversação (18) para realçar a qualidade global da conversação sintetizada (20).
BRPI0014212A 1999-09-22 2000-09-15 Speech compression system for encoding and decoding frames of a speech signal to generate synthesized speech BRPI0014212B1 (pt)
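
The two-stage selection described in the abstract (a rate selection among four codecs, followed by a type classification that applies only to the full-rate and half-rate codecs) can be sketched roughly as follows. This is an illustrative sketch, not the patented implementation: the names `Rate`, `CodecChoice`, `select_codec` and `average_rate` are hypothetical, the integer type labels are an assumption (the abstract does not name the types), and rates are expressed only as fractions of the full rate because the abstract gives no bit counts.

```python
from dataclasses import dataclass
from enum import Enum
from fractions import Fraction
from typing import Iterable, Optional


class Rate(Enum):
    """Codec rates named in the abstract, as fractions of the full rate."""
    FULL = Fraction(1, 1)
    HALF = Fraction(1, 2)
    QUARTER = Fraction(1, 4)
    EIGHTH = Fraction(1, 8)


@dataclass(frozen=True)
class CodecChoice:
    rate: Rate
    # Hypothetical type label; only set for the full- and half-rate codecs.
    frame_type: Optional[int]


def select_codec(rate: Rate, frame_type: int) -> CodecChoice:
    """Choose a codec from the rate selection, refined by type where it applies."""
    if rate in (Rate.FULL, Rate.HALF):
        # Per the abstract, only these two codecs depend on type classification.
        return CodecChoice(rate, frame_type)
    return CodecChoice(rate, None)


def average_rate(choices: Iterable[CodecChoice]) -> Fraction:
    """Mean bit rate over a run of frames, as a fraction of the full rate."""
    items = list(choices)
    return sum((c.rate.value for c in items), Fraction(0)) / len(items)
```

For example, encoding one frame at each of the four rates gives `average_rate(...) == Fraction(15, 32)`, which is the kind of average-bit-rate figure the system trades off against perceptual quality.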

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US15532199P 1999-09-22 1999-09-22
US09/574,396 US6782360B1 (en) 1999-09-22 2000-05-19 Gain quantization for a CELP speech coder
PCT/US2000/025182 WO2001022402A1 (en) 1999-09-22 2000-09-15 Multimode speech encoder

Publications (2)

Publication Number Publication Date
BR0014212A (pt) 2003-06-10
BRPI0014212B1 (pt) 2016-07-26

Family

ID=26852220

Family Applications (1)

Application Number Title Priority Date Filing Date
BRPI0014212A BRPI0014212B1 (pt) 1999-09-22 2000-09-15 Speech compression system for encoding and decoding frames of a speech signal to generate synthesized speech

Country Status (8)

Country Link
EP (1) EP1214706B9 (pt)
JP (2) JP4176349B2 (pt)
KR (1) KR100488080B1 (pt)
CN (1) CN1245706C (pt)
AT (1) ATE272885T1 (pt)
AU (1) AU7486200A (pt)
BR (1) BRPI0014212B1 (pt)
DE (1) DE60012760T2 (pt)

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100463418B1 (ko) * 2002-11-11 2004-12-23 한국전자통신연구원 Variable fixed-codebook search method and apparatus for use in a CELP speech coder
FR2867649A1 (fr) * 2003-12-10 2005-09-16 France Telecom Optimized multiple coding method
WO2006098274A1 (ja) * 2005-03-14 2006-09-21 Matsushita Electric Industrial Co., Ltd. Scalable decoding apparatus and scalable decoding method
US7177804B2 (en) * 2005-05-31 2007-02-13 Microsoft Corporation Sub-band voice codec with multi-stage codebooks and redundant coding
CN101371296B (zh) * 2006-01-18 2012-08-29 Lg电子株式会社 Apparatus and method for encoding and decoding a signal
US8451915B2 (en) 2007-03-21 2013-05-28 Samsung Electronics Co., Ltd. Efficient uplink feedback in a wireless communication system
KR20100006492A (ko) * 2008-07-09 2010-01-19 삼성전자주식회사 Method and apparatus for determining a coding scheme
CA2729665C (en) 2008-07-10 2016-11-22 Voiceage Corporation Variable bit rate lpc filter quantizing and inverse quantizing device and method
KR101170466B1 (ko) 2008-07-29 2012-08-03 한국전자통신연구원 Post-processing method and apparatus in the MDCT domain
JP2010122617A (ja) * 2008-11-21 2010-06-03 Yamaha Corp Noise gate and sound pickup device
JP2010160496A (ja) * 2010-02-15 2010-07-22 Toshiba Corp Signal processing apparatus and signal processing method
US9047875B2 (en) * 2010-07-19 2015-06-02 Futurewei Technologies, Inc. Spectrum flatness control for bandwidth extension
DK2676271T3 (da) * 2011-02-15 2020-08-24 Voiceage Evs Llc Device and method for quantizing the gains of the adaptive and fixed contributions of the excitation in a CELP codec
US9626982B2 (en) 2011-02-15 2017-04-18 Voiceage Corporation Device and method for quantizing the gains of the adaptive and fixed contributions of the excitation in a CELP codec
US9026434B2 (en) * 2011-04-11 2015-05-05 Samsung Electronic Co., Ltd. Frame erasure concealment for a multi rate speech and audio codec
US9336789B2 (en) * 2013-02-21 2016-05-10 Qualcomm Incorporated Systems and methods for determining an interpolation factor set for synthesizing a speech signal
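CN104517612B (zh) * 2013-09-30 2018-10-12 上海爱聊信息科技有限公司 Variable bit-rate encoder and decoder based on AMR-NB speech signals, and encoding and decoding methods thereof
JP5981408B2 (ja) * 2013-10-29 2016-08-31 株式会社Nttドコモ Audio signal processing device, audio signal processing method, and audio signal processing program
KR20240010550A (ko) 2014-03-28 2024-01-23 삼성전자주식회사 Method and apparatus for quantizing linear prediction coefficients, and method and apparatus for inverse quantization
WO2015170899A1 (ko) 2014-05-07 2015-11-12 삼성전자 주식회사 Method and apparatus for quantizing linear prediction coefficients, and method and apparatus for inverse quantization
JP6170575B2 (ja) * 2014-07-28 2017-07-26 テレフオンアクチーボラゲット エルエム エリクソン(パブル) Pyramid vector quantizer shape search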
US10109284B2 (en) * 2016-02-12 2018-10-23 Qualcomm Incorporated Inter-channel encoding and decoding of multiple high-band audio signals
US10373630B2 (en) * 2017-03-31 2019-08-06 Intel Corporation Systems and methods for energy efficient and low power distributed automatic speech recognition on wearable devices
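CN111183476B (zh) * 2017-10-06 2024-03-22 索尼欧洲有限公司 Audio file envelope based on RMS power within sequences of sub-windows
CN108122552B (zh) * 2017-12-15 2021-10-15 上海智臻智能网络科技股份有限公司 Speech emotion recognition method and apparatus
CN113593521B (zh) * 2021-07-29 2022-09-20 北京三快在线科技有限公司 Speech synthesis method, apparatus, device, and readable storage medium
CN118430508B (zh) * 2024-05-29 2024-09-17 中国矿业大学 Speech synthesis method based on a neural audio codec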

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3353852B2 (ja) * 1994-02-15 2002-12-03 日本電信電話株式会社 Speech coding method
US5701390A (en) * 1995-02-22 1997-12-23 Digital Voice Systems, Inc. Synthesis of MBE-based coded speech using regenerated phase information

Also Published As

Publication number Publication date
JP4176349B2 (ja) 2008-11-05
CN1451155A (zh) 2003-10-22
CN1245706C (zh) 2006-03-15
KR20020033819A (ko) 2002-05-07
ATE272885T1 (de) 2004-08-15
EP1214706A1 (en) 2002-06-19
DE60012760T2 (de) 2005-08-04
EP1214706B1 (en) 2004-08-04
JP2003513296A (ja) 2003-04-08
KR100488080B1 (ko) 2005-05-06
AU7486200A (en) 2001-04-24
DE60012760D1 (de) 2004-09-09
EP1214706B9 (en) 2005-01-05
JP2005338872A (ja) 2005-12-08
BRPI0014212B1 (pt) 2016-07-26

Similar Documents

Publication Publication Date Title
BR0014212A (pt) Speech compression system, excitation processing module, and bitstream representing a frame of a speech signal
AU2001287969A1 (en) Codebook structure and search for speech coding
BR9805989B1 (pt) Method and apparatus for decoding an encoded signal.
AU2003278014A1 (en) Methods for interoperation between adaptive multi-rate wideband (amr-wb) and multi-mode variable bit-rate wideband (wmr-wb) speech codecs
Valin The speex codec manual version 1.2 beta 3
US20050261900A1 (en) Supporting a switch between audio coder modes
FI932465A0 (fi) CELP-based speech compressor
WO2004006226A1 (en) Method and device for efficient in-band dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for cdma wireless systems
DK1222659T3 (da) LPC harmonic speech coder with superframe structure
ATE326122T1 (de) FGS decoding controlled by a picture quality parameter calculated in the decoder
CA2440348A1 (en) Testing loops for channel codecs
EP1204092A3 (en) Speech decoder capable of decoding background noise signal with high quality
DE60027140D1 (de) Speech synthesizer based on variable bit-rate speech coding
WO2002023533A3 (en) System for improved use of pitch enhancement with subcodebooks
Wang et al. Transcoding Scheme between AMR-WB and VMR-WB
WO2002023537A8 (en) System for enhancing perceptual quality of decoded speech
CA2491623C (en) Method and device for efficient in-band dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for cdma wireless systems
WO2003042648A1 (fr) Speech signal coder, speech signal decoder, speech signal coding method, and speech signal decoding method
Xu et al. A Novel Transcoding Algorithm between 3GPP AMR-NB (7.95 kbit/s) and ITU-T G. 729a (8kbit/s)

Legal Events

Date Code Title Description
B07A Application suspended after technical examination (opinion) [chapter 7.1 patent gazette]
B15K Others concerning applications: alteration of classification

Free format text: THE PREVIOUS CLASSIFICATION WAS: G10L 19/14

Ipc: G10L 19/10 (2013.01)

B06A Patent application procedure suspended [chapter 6.1 patent gazette]
B09A Decision: intention to grant [chapter 9.1 patent gazette]
B16A Patent or certificate of addition of invention granted [chapter 16.1 patent gazette]

Free format text: TERM OF VALIDITY: 10 (TEN) YEARS COUNTED FROM 26/07/2016, SUBJECT TO THE LEGAL CONDITIONS.

B21F Lapse acc. art. 78, item iv - on non-payment of the annual fees in time

Free format text: RELATING TO THE 24TH ANNUITY.