DE60233238D1 - Method and apparatus for coding successive pitch periods in a speech signal - Google Patents

Method and apparatus for coding successive pitch periods in a speech signal

Info

Publication number
DE60233238D1
DE60233238D1 DE60233238T DE60233238T DE60233238D1 DE 60233238 D1 DE60233238 D1 DE 60233238D1 DE 60233238 T DE60233238 T DE 60233238T DE 60233238 T DE60233238 T DE 60233238T DE 60233238 D1 DE60233238 D1 DE 60233238D1
Authority
DE
Germany
Prior art keywords
pitch
loop
closed
periods
search
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60233238T
Inventor
Ari Heikkinen
Vesa Ruoppila
Samuli Pietilae
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2001-06-11
Filing date
2002-06-07
Publication date
2009-09-17
Application filed by Qualcomm Inc
Application granted
Publication of DE60233238D1
Anticipated expiration
Status: Expired - Lifetime

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/09 Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • H ELECTRICITY
    • H03 ELECTRONIC CIRCUITRY
    • H03M CODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00 Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30 Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
  • Selective Calling Equipment (AREA)
DE60233238T 2001-06-11 2002-06-07 Method and apparatus for coding successive pitch periods in a speech signal Expired - Lifetime DE60233238D1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/878,762 US6584437B2 (en) 2001-06-11 2001-06-11 Method and apparatus for coding successive pitch periods in speech signal
PCT/IB2002/002078 WO2002101718A2 (en) 2001-06-11 2002-06-07 Coding successive pitch periods in speech signal

Publications (1)

Publication Number Publication Date
DE60233238D1 (de) 2009-09-17

Family

ID=25372784

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60233238T Expired - Lifetime DE60233238D1 (de) 2001-06-11 2002-06-07 Method and apparatus for coding successive pitch periods in a speech signal

Country Status (8)

Country Link
US (1) US6584437B2 (de)
EP (1) EP1428202B1 (de)
KR (1) KR100896944B1 (de)
CN (1) CN1262993C (de)
AT (1) ATE438911T1 (de)
AU (1) AU2002258104A1 (de)
DE (1) DE60233238D1 (de)
WO (1) WO2002101718A2 (de)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1422690B1 (de) * 2001-08-31 2009-10-28 Kabushiki Kaisha Kenwood Apparatus and method for generating a pitch waveform signal and apparatus and method for compressing, decompressing and synthesizing a speech signal using the same
US7124075B2 (en) * 2001-10-26 2006-10-17 Dmitry Edward Terez Methods and apparatus for pitch determination
JP2005510925A (ja) * 2001-11-30 2005-04-21 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Signal coding
US7376553B2 (en) * 2003-07-08 2008-05-20 Robert Patel Quinn Fractal harmonic overtone mapping of speech and musical sounds
US7619995B1 (en) * 2003-07-18 2009-11-17 Nortel Networks Limited Transcoders and mixers for voice-over-IP conferencing
BRPI0517246A (pt) * 2004-10-28 2008-10-07 Matsushita Electric Ind Co Ltd Scalable encoding apparatus, scalable decoding apparatus, and methods thereof
WO2007111649A2 (en) * 2006-03-20 2007-10-04 Mindspeed Technologies, Inc. Open-loop pitch track smoothing
US20080097757A1 (en) * 2006-10-24 2008-04-24 Nokia Corporation Audio coding
EP2101319B1 (de) * 2006-12-15 2015-09-16 Panasonic Intellectual Property Corporation of America Device for adaptive sound source vector quantization and method therefor
CN101622664B (zh) * 2007-03-02 2012-02-01 松下电器产业株式会社 Adaptive excitation vector quantization apparatus and adaptive excitation vector quantization method
EP2301021B1 (de) 2008-07-10 2017-06-21 VoiceAge Corporation Device and method for quantizing LPC filters in a super-frame
US8670990B2 (en) * 2009-08-03 2014-03-11 Broadcom Corporation Dynamic time scale modification for reduced bit rate audio coding
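CN112233682A (zh) * 2019-06-29 2021-01-15 华为技术有限公司 Stereo encoding method, stereo decoding method, and apparatus
WO2021000724A1 (zh) * 2019-06-29 2021-01-07 华为技术有限公司 Stereo encoding method, stereo decoding method, and apparatus
CN110390953B (zh) * 2019-07-25 2023-11-17 腾讯科技(深圳)有限公司 Method, apparatus, terminal and storage medium for detecting howling voice signals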

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS58215822A (ja) 1982-06-10 1983-12-15 Toshiba Corp Predictive coding device for speech signals
JPS60501477A (ja) 1983-06-03 1985-09-05 ザ・ヴアリアブル・スピ−チ・コントロ−ル・カンパニイ Method for changing the pitch of an audio signal and pitch conversion device
US4704730A (en) * 1984-03-12 1987-11-03 Allophonix, Inc. Multi-state speech encoder and decoder
JPH0632021B2 (ja) 1987-07-15 1994-04-27 シャープ株式会社 Japanese speech recognition device
JPH0451200A (ja) 1990-06-18 1992-02-19 Fujitsu Ltd Speech coding system
JP3226180B2 (ja) * 1992-04-09 2001-11-05 日本電信電話株式会社 Pitch period coding method for speech
US5884253A (en) 1992-04-09 1999-03-16 Lucent Technologies, Inc. Prototype waveform speech coding with interpolation of pitch, pitch-period waveforms, and synthesis filter
US5388124A (en) * 1992-06-12 1995-02-07 University Of Maryland Precoding scheme for transmitting data using optimally-shaped constellations over intersymbol-interference channels
GB2282943B (en) 1993-03-26 1998-06-03 Motorola Inc Vector quantizer method and apparatus
US5504834A (en) * 1993-05-28 1996-04-02 Motorola, Inc. Pitch epoch synchronous linear predictive coding vocoder and method
WO1997017692A1 (en) * 1995-11-07 1997-05-15 Euphonics, Incorporated Parametric signal modeling musical synthesizer
US5799276A (en) 1995-11-07 1998-08-25 Accent Incorporated Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals
US5729694A (en) 1996-02-06 1998-03-17 The Regents Of The University Of California Speech coding, reconstruction and recognition using acoustics and electromagnetic waves
US6006175A (en) 1996-02-06 1999-12-21 The Regents Of The University Of California Methods and apparatus for non-acoustic speech characterization and recognition
US6009394A (en) * 1996-09-05 1999-12-28 The Board Of Trustees Of The University Of Illinois System and method for interfacing a 2D or 3D movement space to a high dimensional sound synthesis control space
US6185527B1 (en) 1999-01-19 2001-02-06 International Business Machines Corporation System and method for automatic audio content analysis for word spotting, indexing, classification and retrieval
US6704711B2 (en) * 2000-01-28 2004-03-09 Telefonaktiebolaget Lm Ericsson (Publ) System and method for modifying speech signals

Also Published As

Publication number Publication date
EP1428202A2 (de) 2004-06-16
CN1262993C (zh) 2006-07-05
EP1428202A4 (de) 2005-10-26
WO2002101718A2 (en) 2002-12-19
KR20040028774A (ko) 2004-04-03
EP1428202B1 (de) 2009-08-05
US6584437B2 (en) 2003-06-24
WO2002101718A3 (en) 2003-04-10
KR100896944B1 (ko) 2009-05-14
CN1514994A (zh) 2004-07-21
ATE438911T1 (de) 2009-08-15
US20030004709A1 (en) 2003-01-02
AU2002258104A1 (en) 2002-12-23

Similar Documents

Publication Publication Date Title
DE60233238D1 (de) Method and apparatus for coding successive pitch periods in a speech signal
DE602004007786D1 (de) Method and apparatus for quantizing the gain factor in a variable-bit-rate wideband speech coder
DE602004012909D1 (de) Method and apparatus for modeling a speech recognition system and for estimating a word error rate based on a text
ATE531031T1 (de) Segment-based tonal modeling for tonal languages
WO2006086053A3 (en) System and method for automatic enrichment of documents
DE60310785D1 (de) Method and apparatus for translating spoken language
DE69937176D1 (de) Segmentation method for expanding the active vocabulary of speech recognizers
DE69530066T2 (de) Method and apparatus for selecting the coding rate in a variable-rate vocoder
EP1629464A4 (de) Phonetically based speech recognition system and method
ATE233935T1 (de) Apparatus and method for distinguishing similar-sounding words in speech recognition
DE60023736D1 (de) Method and apparatus for speech recognition using different language models
DE69937854D1 (de) Method and apparatus for speech recognition using phonetic transcriptions
ATE533146T1 (de) Method and apparatus for searching for a fundamental frequency
DE602004011411D1 (de) Method for block-constrained trellis-coded quantization and its use in a method and apparatus for quantizing LSF parameters in a speech coding system
DE69025091T2 (de) Method and apparatus for translating a sentence containing a compound word formed by separation
DE60118627D1 (de) Apparatus and method for wideband coding of speech signals
ATE366431T1 (de) Method for controlling a thermodynamic process
ATE338330T1 (de) Method and apparatus for two-phase fundamental frequency detection
ATE423783T1 (de) Method for producing perfluoroalkylphosphines and their use as perfluoroalkylation reagents
ATE211291T1 (de) Method for speech recognition using a grammar
ATE480852T1 (de) Method and apparatus for speech coding in a mobile communication terminal using PLP
DE602006002279D1 (de) Method, computer program and apparatus for uniquely identifying a contact in a contact database by a single voice utterance
DE60122327D1 (de) Method and apparatus for mitigating transmission errors in a distributed speech recognition method and system
DE69424960T2 (de) Method and apparatus for speech coding with trellis-coded quantization for LPC quantization
Mertens Transcription of tonal aspects in speech and a system for automatic tonal annotation

Legal Events

Date Code Title Description
8328 Change in the person/name/address of the agent

Representative's name: WAGNER & GEYER PARTNERSCHAFT PATENT- UND RECHTSANW

8364 No opposition during term of opposition