TWI340377B - Method and vocoders of communication speech - Google Patents

Method and vocoders of communication speech

Info

Publication number
TWI340377B
TWI340377B TW096129874A TW96129874A TWI340377B TW I340377 B TWI340377 B TW I340377B TW 096129874 A TW096129874 A TW 096129874A TW 96129874 A TW96129874 A TW 96129874A TW I340377 B TWI340377 B TW I340377B
Authority
TW
Taiwan
Prior art keywords
vocoders
communication speech
speech
communication
Prior art date
Application number
TW096129874A
Other languages
English (en)
Other versions
TW200822062A (en
Inventor
Rohit Kapoor
Serafin Diaz Spindola
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Publication of TW200822062A publication Critical patent/TW200822062A/zh
Application granted granted Critical
Publication of TWI340377B publication Critical patent/TWI340377B/zh

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/04Time compression or expansion
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003Changing voice quality, e.g. pitch or formants
    • G10L21/007Changing voice quality, e.g. pitch or formants characterised by the process used
    • G10L21/01Correction of time axis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/087Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters using mixed excitation models, e.g. MELP, MBE, split band LPC or HVXC

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
TW096129874A 2006-08-22 2007-08-13 Method and vocoders of communication speech TWI340377B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US11/508,396 US8239190B2 (en) 2006-08-22 2006-08-22 Time-warping frames of wideband vocoder

Publications (2)

Publication Number Publication Date
TW200822062A TW200822062A (en) 2008-05-16
TWI340377B true TWI340377B (en) 2011-04-11

Family

ID=38926197

Family Applications (1)

Application Number Title Priority Date Filing Date
TW096129874A TWI340377B (en) 2006-08-22 2007-08-13 Method and vocoders of communication speech

Country Status (10)

Country Link
US (1) US8239190B2 (zh)
EP (1) EP2059925A2 (zh)
JP (1) JP5006398B2 (zh)
KR (1) KR101058761B1 (zh)
CN (1) CN101506877B (zh)
BR (1) BRPI0715978A2 (zh)
CA (1) CA2659197C (zh)
RU (1) RU2414010C2 (zh)
TW (1) TWI340377B (zh)
WO (1) WO2008024615A2 (zh)

Families Citing this family (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7720677B2 (en) * 2005-11-03 2010-05-18 Coding Technologies Ab Time warped modified transform coding of audio signals
US9653088B2 (en) * 2007-06-13 2017-05-16 Qualcomm Incorporated Systems, methods, and apparatus for signal encoding using pitch-regularizing and non-pitch-regularizing coding
CN100524462C (zh) 2007-09-15 2009-08-05 华为技术有限公司 对高带信号进行帧错误隐藏的方法及装置
ES2747903T3 (es) * 2008-03-10 2020-03-12 Fraunhofer Ges Forschung Dispositivo y método para manipular una señal de audio que tiene un evento transitorio
US8768690B2 (en) 2008-06-20 2014-07-01 Qualcomm Incorporated Coding scheme selection for low-bit-rate applications
EP2410522B1 (en) 2008-07-11 2017-10-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio signal encoder, method for encoding an audio signal and computer program
MY154452A (en) * 2008-07-11 2015-06-15 Fraunhofer Ges Forschung An apparatus and a method for decoding an encoded audio signal
US8798776B2 (en) 2008-09-30 2014-08-05 Dolby International Ab Transcoding of audio metadata
US8428938B2 (en) * 2009-06-04 2013-04-23 Qualcomm Incorporated Systems and methods for reconstructing an erased speech frame
BR112012022744B1 (pt) 2010-03-10 2021-02-17 Fraunhofer - Gesellschaft Zur Förderung Der Angewandten Forschung E.V. decodificador de sinal de áudio, codificador de sinal de áudio, método para decodificar um sinal de áudio, método para codificar um sinal de áudio e programa de computador utilizando uma adaptação dependente de timbre de um contexto de codificação
US9117461B2 (en) 2010-10-06 2015-08-25 Panasonic Corporation Coding device, decoding device, coding method, and decoding method for audio signals
CN102201240B (zh) * 2011-05-27 2012-10-03 中国科学院自动化研究所 基于逆滤波的谐波噪声激励模型声码器
JP6303340B2 (ja) * 2013-08-30 2018-04-04 富士通株式会社 音声処理装置、音声処理方法及び音声処理用コンピュータプログラム
US10083708B2 (en) * 2013-10-11 2018-09-25 Qualcomm Incorporated Estimation of mixing factors to generate high-band excitation signal
PL3447766T3 (pl) * 2014-04-24 2020-08-24 Nippon Telegraph And Telephone Corporation Sposób kodowania, urządzenie kodujące, odpowiedni program i nośnik zapisu
PL3703051T3 (pl) 2014-05-01 2021-11-22 Nippon Telegraph And Telephone Corporation Koder, dekoder, sposób kodowania, sposób dekodowania, program kodujący, program dekodujący i nośnik rejestrujący
DE102018206689A1 (de) * 2018-04-30 2019-10-31 Sivantos Pte. Ltd. Verfahren zur Rauschunterdrückung in einem Audiosignal

Family Cites Families (55)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2412987A1 (fr) * 1977-12-23 1979-07-20 Ibm France Procede de compression de donnees relatives au signal vocal et dispositif mettant en oeuvre ledit procede
US4570232A (en) * 1981-12-21 1986-02-11 Nippon Telegraph & Telephone Public Corporation Speech recognition apparatus
CA1204855A (en) * 1982-03-23 1986-05-20 Phillip J. Bloom Method and apparatus for use in processing signals
US5210820A (en) * 1990-05-02 1993-05-11 Broadcast Data Systems Limited Partnership Signal recognition system and method
JP3277398B2 (ja) * 1992-04-15 2002-04-22 ソニー株式会社 有声音判別方法
DE4324853C1 (de) 1993-07-23 1994-09-22 Siemens Ag Spannungserzeugungsschaltung
US5517595A (en) * 1994-02-08 1996-05-14 At&T Corp. Decomposition in noise and periodic signal waveforms in waveform interpolation
US5717823A (en) 1994-04-14 1998-02-10 Lucent Technologies Inc. Speech-rate modification for linear-prediction based analysis-by-synthesis speech coders
US5594174A (en) * 1994-06-06 1997-01-14 University Of Washington System and method for measuring acoustic reflectance
US5787387A (en) * 1994-07-11 1998-07-28 Voxware, Inc. Harmonic adaptive speech coding method and system
US5598505A (en) * 1994-09-30 1997-01-28 Apple Computer, Inc. Cepstral correction vector quantizer for speech recognition
JP2976860B2 (ja) 1995-09-13 1999-11-10 松下電器産業株式会社 再生装置
AU7463696A (en) * 1995-10-23 1997-05-15 Regents Of The University Of California, The Control structure for sound synthesis
TW321810B (zh) * 1995-10-26 1997-12-01 Sony Co Ltd
US5749073A (en) * 1996-03-15 1998-05-05 Interval Research Corporation System for automatically morphing audio information
US5828994A (en) * 1996-06-05 1998-10-27 Interval Research Corporation Non-uniform time scale modification of recorded audio
US6766300B1 (en) * 1996-11-07 2004-07-20 Creative Technology Ltd. Method and apparatus for transient detection and non-distortion time scaling
US6233550B1 (en) * 1997-08-29 2001-05-15 The Regents Of The University Of California Method and apparatus for hybrid coding of speech at 4kbps
US7072832B1 (en) * 1998-08-24 2006-07-04 Mindspeed Technologies, Inc. System for speech encoding having an adaptive encoding arrangement
US7272556B1 (en) * 1998-09-23 2007-09-18 Lucent Technologies Inc. Scalable and embedded codec for speech and audio signals
FR2786308B1 (fr) * 1998-11-20 2001-02-09 Sextant Avionique Procede de reconnaissance vocale dans un signal acoustique bruite et systeme mettant en oeuvre ce procede
US6691084B2 (en) * 1998-12-21 2004-02-10 Qualcomm Incorporated Multiple mode variable rate speech coding
US6456964B2 (en) * 1998-12-21 2002-09-24 Qualcomm, Incorporated Encoding of periodic speech using prototype waveforms
US7315815B1 (en) 1999-09-22 2008-01-01 Microsoft Corporation LPC-harmonic vocoder with superframe structure
US6842735B1 (en) * 1999-12-17 2005-01-11 Interval Research Corporation Time-scale modification of data-compressed audio information
JP2001255882A (ja) * 2000-03-09 2001-09-21 Sony Corp 音声信号処理装置及びその信号処理方法
US6735563B1 (en) 2000-07-13 2004-05-11 Qualcomm, Inc. Method and apparatus for constructing voice templates for a speaker-independent voice recognition system
US6671669B1 (en) 2000-07-18 2003-12-30 Qualcomm Incorporated combined engine system and method for voice recognition
US6990453B2 (en) * 2000-07-31 2006-01-24 Landmark Digital Services Llc System and methods for recognizing sound and music signals in high noise and distortion
US6477502B1 (en) * 2000-08-22 2002-11-05 Qualcomm Incorporated Method and apparatus for using non-symmetric speech coders to produce non-symmetric links in a wireless communication system
US6754629B1 (en) 2000-09-08 2004-06-22 Qualcomm Incorporated System and method for automatic voice recognition using mapping
EP1340317A1 (en) * 2000-11-03 2003-09-03 Koninklijke Philips Electronics N.V. Parametric coding of audio signals
US7472059B2 (en) * 2000-12-08 2008-12-30 Qualcomm Incorporated Method and apparatus for robust speech classification
US20020133334A1 (en) * 2001-02-02 2002-09-19 Geert Coorman Time scale modification of digitally sampled waveforms in the time domain
US6999598B2 (en) * 2001-03-23 2006-02-14 Fuji Xerox Co., Ltd. Systems and methods for embedding data by dimensional compression and expansion
CA2365203A1 (en) 2001-12-14 2003-06-14 Voiceage Corporation A signal modification method for efficient coding of speech signals
US20030182106A1 (en) * 2002-03-13 2003-09-25 Spectral Design Method and device for changing the temporal length and/or the tone pitch of a discrete audio signal
US7254533B1 (en) * 2002-10-17 2007-08-07 Dilithium Networks Pty Ltd. Method and apparatus for a thin CELP voice codec
US7394833B2 (en) * 2003-02-11 2008-07-01 Nokia Corporation Method and apparatus for reducing synchronization delay in packet switched voice terminals using speech decoder modification
WO2004084467A2 (en) * 2003-03-15 2004-09-30 Mindspeed Technologies, Inc. Recovering an erased voice frame with time warping
US7433815B2 (en) * 2003-09-10 2008-10-07 Dilithium Networks Pty Ltd. Method and apparatus for voice transcoding between variable rate coders
US7672838B1 (en) * 2003-12-01 2010-03-02 The Trustees Of Columbia University In The City Of New York Systems and methods for speech recognition using frequency domain linear prediction polynomials to form temporal and spectral envelopes from frequency domain representations of signals
US20050137730A1 (en) * 2003-12-18 2005-06-23 Steven Trautmann Time-scale modification of audio using separated frequency bands
CA2457988A1 (en) * 2004-02-18 2005-08-18 Voiceage Corporation Methods and devices for audio compression based on acelp/tcx coding and multi-rate lattice vector quantization
JP4146489B2 (ja) 2004-05-26 2008-09-10 日本電信電話株式会社 音声パケット再生方法、音声パケット再生装置、音声パケット再生プログラム、記録媒体
BRPI0514801B1 (pt) * 2004-08-30 2019-06-18 Qualcomm Incorporated Buffer de-jitter adaptativo para voz sobre ip
US8085678B2 (en) * 2004-10-13 2011-12-27 Qualcomm Incorporated Media (voice) playback (de-jitter) buffer adjustments based on air interface
SG124307A1 (en) 2005-01-20 2006-08-30 St Microelectronics Asia Method and system for lost packet concealment in high quality audio streaming applications
US8155965B2 (en) * 2005-03-11 2012-04-10 Qualcomm Incorporated Time warping frames inside the vocoder by modifying the residual
US8355907B2 (en) * 2005-03-11 2013-01-15 Qualcomm Incorporated Method and apparatus for phase matching frames in vocoders
ES2340608T3 (es) * 2005-04-01 2010-06-07 Qualcomm Incorporated Aparato y procedimiento para codificar mediante banda dividida una señal de voz.
US7945305B2 (en) * 2005-04-14 2011-05-17 The Board Of Trustees Of The University Of Illinois Adaptive acquisition and reconstruction of dynamic MR images
US7490036B2 (en) * 2005-10-20 2009-02-10 Motorola, Inc. Adaptive equalizer for a coded speech signal
US7720677B2 (en) * 2005-11-03 2010-05-18 Coding Technologies Ab Time warped modified transform coding of audio signals
CN100524462C (zh) * 2007-09-15 2009-08-05 华为技术有限公司 对高带信号进行帧错误隐藏的方法及装置

Also Published As

Publication number Publication date
RU2414010C2 (ru) 2011-03-10
EP2059925A2 (en) 2009-05-20
KR20090053917A (ko) 2009-05-28
JP5006398B2 (ja) 2012-08-22
JP2010501896A (ja) 2010-01-21
WO2008024615A2 (en) 2008-02-28
US20080052065A1 (en) 2008-02-28
US8239190B2 (en) 2012-08-07
WO2008024615A3 (en) 2008-04-17
CN101506877A (zh) 2009-08-12
RU2009110202A (ru) 2010-10-27
CN101506877B (zh) 2012-11-28
KR101058761B1 (ko) 2011-08-24
BRPI0715978A2 (pt) 2013-08-06
CA2659197A1 (en) 2008-02-28
CA2659197C (en) 2013-06-25
TW200822062A (en) 2008-05-16

Similar Documents

Publication Publication Date Title
TWI340377B (en) Method and vocoders of communication speech
TWI349267B (en) Voice recognition system and method thereof
HK1132831A1 (en) Method and system for providing speech recognition
GB0623915D0 (en) Phonetic decoding and concatentive speech synthesis
TWI315295B (en) Mems microphone module and method thereof
GB2457855B (en) Speech recognition system and speech recognition system program
IL197660A0 (en) Methods and apparatus for communication of notifications
GB0610503D0 (en) Communication apparatus and method
EP2009509A4 (en) PROCESS OF DEVELOPMENT AND DEVELOPMENT DEVICE
GB0601076D0 (en) Acoustic device and method of making acoustic device
EP2095363A4 (en) RECOGNITION OF SPOKEN LANGUAGE IN WORKABLE AUDIO STREAMS
GB0525055D0 (en) Method and apparatus for labelling speech
GB0704622D0 (en) Speech coding system and method
EP1988191A4 (en) VENEER APPARATUS AND METHOD
TWI349266B (en) Voice recognition system and method
EP2199743A4 (en) INSTRUMENT AND APPEARANCE PRESENTED IN A VEHICLE
EP2011249A4 (en) COMMUNICATION PROCESS AND COMMUNICATION SYSTEM
EP2111726A4 (en) SYSTEM AND METHOD FOR DYNAMICALLY MODIFYING THE EVALUATION OF THE INTELLIGIBILITY OF WORDS
EP2055017A4 (en) PROCESS AND COMMUNICATIONS SYSTEM
EP2229675A4 (en) DEVICE AND METHOD FOR INCREASING THE QUALITY OF A LANGUAGE CODE
GB0424005D0 (en) Method of coating
GB2455697B (en) Method and apparatus for word forming and word building
GB2442515B (en) Communication system and method of operation therefor
EP2141693A4 (en) ENCODER / DECODER OF SPEECH
GB0602628D0 (en) Improved buoyancy apparatus and method of use

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees