DE60233238D1 - Verfahren und vorrichtung zur codierung aufeinanderfolgender grundperioden in einem sprachsignal - Google Patents
Verfahren und vorrichtung zur codierung aufeinanderfolgender grundperioden in einem sprachsignalInfo
- Publication number
- DE60233238D1 DE60233238D1 DE60233238T DE60233238T DE60233238D1 DE 60233238 D1 DE60233238 D1 DE 60233238D1 DE 60233238 T DE60233238 T DE 60233238T DE 60233238 T DE60233238 T DE 60233238T DE 60233238 D1 DE60233238 D1 DE 60233238D1
- Authority
- DE
- Germany
- Prior art keywords
- pitch
- loop
- closed
- periods
- search
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000000034 method Methods 0.000 title abstract 2
- 238000007670 refining Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/09—Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
- Selective Calling Equipment (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/878,762 US6584437B2 (en) | 2001-06-11 | 2001-06-11 | Method and apparatus for coding successive pitch periods in speech signal |
PCT/IB2002/002078 WO2002101718A2 (en) | 2001-06-11 | 2002-06-07 | Coding successive pitch periods in speech signal |
Publications (1)
Publication Number | Publication Date |
---|---|
DE60233238D1 true DE60233238D1 (de) | 2009-09-17 |
Family
ID=25372784
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE60233238T Expired - Lifetime DE60233238D1 (de) | 2001-06-11 | 2002-06-07 | Verfahren und vorrichtung zur codierung aufeinanderfolgender grundperioden in einem sprachsignal |
Country Status (8)
Country | Link |
---|---|
US (1) | US6584437B2 (de) |
EP (1) | EP1428202B1 (de) |
KR (1) | KR100896944B1 (de) |
CN (1) | CN1262993C (de) |
AT (1) | ATE438911T1 (de) |
AU (1) | AU2002258104A1 (de) |
DE (1) | DE60233238D1 (de) |
WO (1) | WO2002101718A2 (de) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1422690B1 (de) * | 2001-08-31 | 2009-10-28 | Kabushiki Kaisha Kenwood | Vorrichtung und verfahren zum erzeugen eines tonhöhen-kurvenformsignals und vorrichtung und verfahren zum komprimieren, dekomprimieren und synthetisieren eines sprachsignals damit |
US7124075B2 (en) * | 2001-10-26 | 2006-10-17 | Dmitry Edward Terez | Methods and apparatus for pitch determination |
JP2005510925A (ja) * | 2001-11-30 | 2005-04-21 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | 信号コード化 |
US7376553B2 (en) * | 2003-07-08 | 2008-05-20 | Robert Patel Quinn | Fractal harmonic overtone mapping of speech and musical sounds |
US7619995B1 (en) * | 2003-07-18 | 2009-11-17 | Nortel Networks Limited | Transcoders and mixers for voice-over-IP conferencing |
BRPI0517246A (pt) * | 2004-10-28 | 2008-10-07 | Matsushita Electric Ind Co Ltd | aparelho de codificação escalável, aparelho de decodificação escalável e métodos para os mesmos |
WO2007111649A2 (en) * | 2006-03-20 | 2007-10-04 | Mindspeed Technologies, Inc. | Open-loop pitch track smoothing |
US20080097757A1 (en) * | 2006-10-24 | 2008-04-24 | Nokia Corporation | Audio coding |
EP2101319B1 (de) * | 2006-12-15 | 2015-09-16 | Panasonic Intellectual Property Corporation of America | Einrichtung zur adaptiven schallquellen-vektorquantisierung und verfahren dafür |
CN101622664B (zh) * | 2007-03-02 | 2012-02-01 | 松下电器产业株式会社 | 自适应激励矢量量化装置和自适应激励矢量量化方法 |
EP2301021B1 (de) | 2008-07-10 | 2017-06-21 | VoiceAge Corporation | Vorrichtung und verfahren zur quantisierung von lpc-filtern in einem superrahmen |
US8670990B2 (en) * | 2009-08-03 | 2014-03-11 | Broadcom Corporation | Dynamic time scale modification for reduced bit rate audio coding |
CN112233682A (zh) * | 2019-06-29 | 2021-01-15 | 华为技术有限公司 | 一种立体声编码方法、立体声解码方法和装置 |
WO2021000724A1 (zh) * | 2019-06-29 | 2021-01-07 | 华为技术有限公司 | 一种立体声编码方法、立体声解码方法和装置 |
CN110390953B (zh) * | 2019-07-25 | 2023-11-17 | 腾讯科技(深圳)有限公司 | 啸叫语音信号的检测方法、装置、终端及存储介质 |
Family Cites Families (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS58215822A (ja) | 1982-06-10 | 1983-12-15 | Toshiba Corp | 音声信号の予測符号化装置 |
JPS60501477A (ja) | 1983-06-03 | 1985-09-05 | ザ・ヴアリアブル・スピ−チ・コントロ−ル・カンパニイ | オーディオ信号のピッチを変化させる方法およびピッチ変換装置 |
US4704730A (en) * | 1984-03-12 | 1987-11-03 | Allophonix, Inc. | Multi-state speech encoder and decoder |
JPH0632021B2 (ja) | 1987-07-15 | 1994-04-27 | シャープ株式会社 | 日本語音声認識装置 |
JPH0451200A (ja) | 1990-06-18 | 1992-02-19 | Fujitsu Ltd | 音声符号化方式 |
JP3226180B2 (ja) * | 1992-04-09 | 2001-11-05 | 日本電信電話株式会社 | 音声のピッチ周期符号化法 |
US5884253A (en) | 1992-04-09 | 1999-03-16 | Lucent Technologies, Inc. | Prototype waveform speech coding with interpolation of pitch, pitch-period waveforms, and synthesis filter |
US5388124A (en) * | 1992-06-12 | 1995-02-07 | University Of Maryland | Precoding scheme for transmitting data using optimally-shaped constellations over intersymbol-interference channels |
GB2282943B (en) | 1993-03-26 | 1998-06-03 | Motorola Inc | Vector quantizer method and apparatus |
US5504834A (en) * | 1993-05-28 | 1996-04-02 | Motrola, Inc. | Pitch epoch synchronous linear predictive coding vocoder and method |
WO1997017692A1 (en) * | 1995-11-07 | 1997-05-15 | Euphonics, Incorporated | Parametric signal modeling musical synthesizer |
US5799276A (en) | 1995-11-07 | 1998-08-25 | Accent Incorporated | Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals |
US5729694A (en) | 1996-02-06 | 1998-03-17 | The Regents Of The University Of California | Speech coding, reconstruction and recognition using acoustics and electromagnetic waves |
US6006175A (en) | 1996-02-06 | 1999-12-21 | The Regents Of The University Of California | Methods and apparatus for non-acoustic speech characterization and recognition |
US6009394A (en) * | 1996-09-05 | 1999-12-28 | The Board Of Trustees Of The University Of Illinois | System and method for interfacing a 2D or 3D movement space to a high dimensional sound synthesis control space |
US6185527B1 (en) | 1999-01-19 | 2001-02-06 | International Business Machines Corporation | System and method for automatic audio content analysis for word spotting, indexing, classification and retrieval |
US6704711B2 (en) * | 2000-01-28 | 2004-03-09 | Telefonaktiebolaget Lm Ericsson (Publ) | System and method for modifying speech signals |
-
2001
- 2001-06-11 US US09/878,762 patent/US6584437B2/en not_active Expired - Lifetime
-
2002
- 2002-06-07 DE DE60233238T patent/DE60233238D1/de not_active Expired - Lifetime
- 2002-06-07 WO PCT/IB2002/002078 patent/WO2002101718A2/en not_active Application Discontinuation
- 2002-06-07 AT AT02727961T patent/ATE438911T1/de not_active IP Right Cessation
- 2002-06-07 EP EP02727961A patent/EP1428202B1/de not_active Expired - Lifetime
- 2002-06-07 KR KR1020037016101A patent/KR100896944B1/ko not_active IP Right Cessation
- 2002-06-07 CN CNB028117263A patent/CN1262993C/zh not_active Expired - Fee Related
- 2002-06-07 AU AU2002258104A patent/AU2002258104A1/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
EP1428202A2 (de) | 2004-06-16 |
CN1262993C (zh) | 2006-07-05 |
EP1428202A4 (de) | 2005-10-26 |
WO2002101718A2 (en) | 2002-12-19 |
KR20040028774A (ko) | 2004-04-03 |
EP1428202B1 (de) | 2009-08-05 |
US6584437B2 (en) | 2003-06-24 |
WO2002101718A3 (en) | 2003-04-10 |
KR100896944B1 (ko) | 2009-05-14 |
CN1514994A (zh) | 2004-07-21 |
ATE438911T1 (de) | 2009-08-15 |
US20030004709A1 (en) | 2003-01-02 |
AU2002258104A1 (en) | 2002-12-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE60233238D1 (de) | Verfahren und vorrichtung zur codierung aufeinanderfolgender grundperioden in einem sprachsignal | |
DE602004007786D1 (de) | Verfahren und vorrichtung zur quantisierung des verstärkungsfaktors in einem breitbandsprachkodierer mit variabler bitrate | |
DE602004012909D1 (de) | Verfahren und Vorrichtung zur Modellierung eines Spracherkennungssystems und zur Schätzung einer Wort-Fehlerrate basierend auf einem Text | |
ATE531031T1 (de) | Segmentbasierte tonale modellierung für tonale sprachen | |
WO2006086053A3 (en) | System and method for automatic enrichment of documents | |
DE60310785D1 (de) | Verfahren und Vorrichtung zur Übersetzung von gesprochener Sprache | |
DE69937176D1 (de) | Segmentierungsverfahren zur Erweiterung des aktiven Vokabulars von Spracherkennern | |
DE69530066T2 (de) | Verfahren und vorrichtung zur auswahl der kodierrate in einem vocoder mit variabler rate | |
EP1629464A4 (de) | Spracherkennungssystem und verfahren auf phonetischer basis | |
ATE233935T1 (de) | Vorrichtung und verfahren zur unterscheidung von ähnlich klingenden wörtern in der spracherkennung | |
DE60023736D1 (de) | Verfahren und vorrichtung zur spracherkennung mit verschiedenen sprachmodellen | |
DE69937854D1 (de) | Verfahren und Vorrichtung zur Spracherkennung unter Verwendung von phonetischen Transkriptionen | |
ATE533146T1 (de) | Verfahren und vorrichtung zur suche einer grundfrequenz | |
DE602004011411D1 (de) | Verfahren zur blockbeschränkten trellis-kodierten Quantisierung und ihre Verwendung in einem Verfahren und einer Vorrichtung zur Quantisierung von LSF-Parametern in einem Sprachkodiersystem | |
DE69025091T2 (de) | Verfahren und Vorrichtung zur Übersetzung eines Satzes mit einem durch Trennung gebildeten, zusammengesetzten Wort | |
DE60118627D1 (de) | Vorrichtung und Verfahren zur Breitbandcodierung von Sprachsignalen | |
ATE366431T1 (de) | Verfahren zur regelung eines thermodynamischen prozesses | |
ATE338330T1 (de) | Verfahren und vorrichtung zur zweiphasen- grundfrequenzdetektion | |
ATE423783T1 (de) | Verfahren zur herstellung von perfluoralkylphosphinen und deren verwendung als perfluoralkylierungsreagenzien | |
ATE211291T1 (de) | Vefahren zur spracherkennung unter verwendung von einer grammatik | |
ATE480852T1 (de) | Verfahren, vorrichtung zur sprachkodierung in einem mobilen kommunikationsendgerät mittels plp | |
DE602006002279D1 (de) | Verfahren, Computerprogramm und Vorrichtung zur eindeutigen Identifizierung von einem Kontakt in einer Kontaktdatenbank durch eine einzige Sprachäusserung | |
DE60122327D1 (de) | Verfahren und vorrichtung zur abschwächung von übertragungsfehlern in einem verteilten spracherkennungsverfahren und system | |
DE69424960T2 (de) | Verfahren und Vorrichtung zur Sprachkodierung mit Trellis-kodierter Quantisierung für LPC- Quantisierung | |
Mertens | Transcription of tonal aspects in speech and a system for automatic tonal annotation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8328 | Change in the person/name/address of the agent |
Representative=s name: WAGNER & GEYER PARTNERSCHAFT PATENT- UND RECHTSANW |
|
8364 | No opposition during term of opposition |