DE69618408T2 - Verfahren und Vorrichtung zur Sprachkodierung - Google Patents

Verfahren und Vorrichtung zur Sprachkodierung

Info

Publication number
DE69618408T2
DE69618408T2 DE69618408T DE69618408T DE69618408T2 DE 69618408 T2 DE69618408 T2 DE 69618408T2 DE 69618408 T DE69618408 T DE 69618408T DE 69618408 T DE69618408 T DE 69618408T DE 69618408 T2 DE69618408 T2 DE 69618408T2
Authority
DE
Germany
Prior art keywords
speech coding
speech
coding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69618408T
Other languages
English (en)
Other versions
DE69618408D1 (de
Inventor
Masauki Nishiguchi
Jun Matsumoto
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp filed Critical Sony Corp
Application granted granted Critical
Publication of DE69618408D1 publication Critical patent/DE69618408D1/de
Publication of DE69618408T2 publication Critical patent/DE69618408T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/093Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters using sinusoidal excitation models
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/10Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
DE69618408T 1995-09-28 1996-09-26 Verfahren und Vorrichtung zur Sprachkodierung Expired - Lifetime DE69618408T2 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP25098395A JP3680374B2 (ja) 1995-09-28 1995-09-28 音声合成方法

Publications (2)

Publication Number Publication Date
DE69618408D1 DE69618408D1 (de) 2002-02-14
DE69618408T2 true DE69618408T2 (de) 2002-08-29

Family

ID=17215938

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69618408T Expired - Lifetime DE69618408T2 (de) 1995-09-28 1996-09-26 Verfahren und Vorrichtung zur Sprachkodierung

Country Status (8)

Country Link
US (1) US6029134A (de)
EP (1) EP0766230B1 (de)
JP (1) JP3680374B2 (de)
KR (1) KR100406674B1 (de)
CN (1) CN1132146C (de)
BR (1) BR9603941A (de)
DE (1) DE69618408T2 (de)
NO (1) NO312428B1 (de)

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6240384B1 (en) * 1995-12-04 2001-05-29 Kabushiki Kaisha Toshiba Speech synthesis method
JP3055608B2 (ja) * 1997-06-06 2000-06-26 日本電気株式会社 音声符号化方法および装置
US6449592B1 (en) 1999-02-26 2002-09-10 Qualcomm Incorporated Method and apparatus for tracking the phase of a quasi-periodic signal
SE9903223L (sv) * 1999-09-09 2001-05-08 Ericsson Telefon Ab L M Förfarande och anordning i telekommunikationssystem
KR100711047B1 (ko) * 2000-02-29 2007-04-24 퀄컴 인코포레이티드 폐루프 멀티모드 혼합영역 선형예측 (mdlp) 음성 코더
KR100711040B1 (ko) * 2000-02-29 2007-04-24 퀄컴 인코포레이티드 유사주기 신호의 위상을 추적하는 방법 및 장치
WO2004082288A1 (en) * 2003-03-11 2004-09-23 Nokia Corporation Switching between coding schemes
JP4992717B2 (ja) * 2005-09-06 2012-08-08 日本電気株式会社 音声合成装置及び方法とプログラム
JP2007114417A (ja) * 2005-10-19 2007-05-10 Fujitsu Ltd 音声データ処理方法及び装置
EP1918911A1 (de) * 2006-11-02 2008-05-07 RWTH Aachen University Zeitskalenmodifikation eines Audiosignals
US8121835B2 (en) * 2007-03-21 2012-02-21 Texas Instruments Incorporated Automatic level control of speech signals
WO2009004727A1 (ja) * 2007-07-04 2009-01-08 Fujitsu Limited 符号化装置、符号化方法および符号化プログラム
JP5262171B2 (ja) 2008-02-19 2013-08-14 富士通株式会社 符号化装置、符号化方法および符号化プログラム
CN102103855B (zh) * 2009-12-16 2013-08-07 北京中星微电子有限公司 一种检测音频片段的方法及装置
WO2012006770A1 (en) * 2010-07-12 2012-01-19 Huawei Technologies Co., Ltd. Audio signal generator
JP2012058358A (ja) * 2010-09-07 2012-03-22 Sony Corp 雑音抑圧装置、雑音抑圧方法およびプログラム
WO2016142002A1 (en) * 2015-03-09 2016-09-15 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder, audio decoder, method for encoding an audio signal and method for decoding an encoded audio signal
CN111862931A (zh) * 2020-05-08 2020-10-30 北京嘀嘀无限科技发展有限公司 一种语音生成方法及装置
CN112820267B (zh) * 2021-01-15 2022-10-04 科大讯飞股份有限公司 波形生成方法以及相关模型的训练方法和相关设备、装置

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA1242279A (en) * 1984-07-10 1988-09-20 Tetsu Taguchi Speech signal processor
US5179626A (en) * 1988-04-08 1993-01-12 At&T Bell Laboratories Harmonic speech coding arrangement where a set of parameters for a continuous magnitude spectrum is determined by a speech analyzer and the parameters are used by a synthesizer to determine a spectrum which is used to determine senusoids for synthesis
US5081681B1 (en) * 1989-11-30 1995-08-15 Digital Voice Systems Inc Method and apparatus for phase synthesis for speech processing
US5216747A (en) * 1990-09-20 1993-06-01 Digital Voice Systems, Inc. Voiced/unvoiced estimation of an acoustic signal
US5226108A (en) * 1990-09-20 1993-07-06 Digital Voice Systems, Inc. Processing a speech signal with estimated pitch
US5664051A (en) * 1990-09-24 1997-09-02 Digital Voice Systems, Inc. Method and apparatus for phase synthesis for speech processing
JP3218679B2 (ja) * 1992-04-15 2001-10-15 ソニー株式会社 高能率符号化方法
JP3277398B2 (ja) * 1992-04-15 2002-04-22 ソニー株式会社 有声音判別方法
US5504834A (en) * 1993-05-28 1996-04-02 Motrola, Inc. Pitch epoch synchronous linear predictive coding vocoder and method
JP3338885B2 (ja) * 1994-04-15 2002-10-28 松下電器産業株式会社 音声符号化復号化装置

Also Published As

Publication number Publication date
EP0766230B1 (de) 2002-01-09
EP0766230A3 (de) 1998-06-03
KR970017173A (ko) 1997-04-30
BR9603941A (pt) 1998-06-09
EP0766230A2 (de) 1997-04-02
CN1157452A (zh) 1997-08-20
KR100406674B1 (ko) 2004-01-28
CN1132146C (zh) 2003-12-24
DE69618408D1 (de) 2002-02-14
JP3680374B2 (ja) 2005-08-10
NO963935D0 (no) 1996-09-19
NO963935L (no) 1997-04-01
US6029134A (en) 2000-02-22
JPH0990968A (ja) 1997-04-04
NO312428B1 (no) 2002-05-06

Similar Documents

Publication Publication Date Title
DE69631728D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69625875T2 (de) Verfahren und Vorrichtung zur Sprachkodierung und -dekodierung
DE69727895D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69309557T2 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69518705T2 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69524829D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69632901D1 (de) Vorrichtung und Verfahren zur Sprachsynthese
DE69717899D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69806557D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69726235D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE59707384D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69519820D1 (de) Verfahren und Vorrichtung zur Sprachsynthese
DE69431445T2 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69726685D1 (de) Verfahren zur Sprachanalyse sowie Verfahren und Vorrichtung zur Sprachkodierung
DE69523998D1 (de) Verfahren und Vorrichtung zur Sprachsynthese
DE69710525T2 (de) Verfahren und Vorrichtung zur Sprachsynthese
DE69618408D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69715071T2 (de) Verfahren und Vorrichtung zur Sprachverarbeitung
DE69506449T2 (de) Verfahren und vorrichtung zur zwischenbildkodierung
DE69921066D1 (de) Verfahren und Vorrichtung zur Sprachkodierung
DE69517829T2 (de) Vorrichtung und Verfahren zur Spracherkennung
DE69519818D1 (de) Verfahren und Vorrichtung zur Sprachsynthese
DE69715281D1 (de) Verfahren und Vorrichtung zur Spracherkennung
DE69620304D1 (de) Vorrichtung und Verfahren zur Spracherkennung
DE69721108D1 (de) Verfahren und Vorrichtung zur Sprachsynthese

Legal Events

Date Code Title Description
8364 No opposition during term of opposition