AU2015411306A1 - Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system - Google Patents

Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system Download PDF

Info

Publication number
AU2015411306A1
AU2015411306A1 AU2015411306A AU2015411306A AU2015411306A1 AU 2015411306 A1 AU2015411306 A1 AU 2015411306A1 AU 2015411306 A AU2015411306 A AU 2015411306A AU 2015411306 A AU2015411306 A AU 2015411306A AU 2015411306 A1 AU2015411306 A1 AU 2015411306A1
Authority
AU
Australia
Prior art keywords
band
speech
glottal
sub
database
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
AU2015411306A
Other languages
English (en)
Inventor
Rajesh DACHIRAJU
Aravind GANAPATHIRAJU
E. Veera Raghavendra
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Interactive Intelligence Group Inc
Original Assignee
Interactive Intelligence Group Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Interactive Intelligence Group Inc filed Critical Interactive Intelligence Group Inc
Publication of AU2015411306A1 publication Critical patent/AU2015411306A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/75Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 for modelling vocal tract parameters

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)
AU2015411306A 2015-10-06 2015-10-06 Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system Abandoned AU2015411306A1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US2015/054122 WO2017061985A1 (en) 2015-10-06 2015-10-06 Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system

Publications (1)

Publication Number Publication Date
AU2015411306A1 true AU2015411306A1 (en) 2018-05-24

Family

ID=58488102

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2015411306A Abandoned AU2015411306A1 (en) 2015-10-06 2015-10-06 Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system

Country Status (6)

Country Link
EP (1) EP3363015A4 (de)
KR (1) KR20180078252A (de)
CN (1) CN108369803B (de)
AU (1) AU2015411306A1 (de)
CA (1) CA3004700C (de)
WO (1) WO2017061985A1 (de)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11869482B2 (en) 2018-09-30 2024-01-09 Microsoft Technology Licensing, Llc Speech waveform generation
CN109767755A (zh) * 2019-03-01 2019-05-17 广州多益网络股份有限公司 一种语音合成方法和系统
CN111862931A (zh) * 2020-05-08 2020-10-30 北京嘀嘀无限科技发展有限公司 一种语音生成方法及装置
CN112365875B (zh) * 2020-11-18 2021-09-10 北京百度网讯科技有限公司 语音合成方法、装置、声码器和电子设备
CN113571079A (zh) * 2021-02-08 2021-10-29 腾讯科技(深圳)有限公司 语音增强方法、装置、设备及存储介质

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6070140A (en) * 1995-06-05 2000-05-30 Tran; Bao Q. Speech recognizer
US5937384A (en) * 1996-05-01 1999-08-10 Microsoft Corporation Method and system for speech recognition using continuous density hidden Markov models
US20020116196A1 (en) * 1998-11-12 2002-08-22 Tran Bao Q. Speech recognizer
US6970820B2 (en) * 2001-02-26 2005-11-29 Matsushita Electric Industrial Co., Ltd. Voice personalization of speech synthesizer
EP1422690B1 (de) * 2001-08-31 2009-10-28 Kabushiki Kaisha Kenwood Vorrichtung und verfahren zum erzeugen eines tonhöhen-kurvenformsignals und vorrichtung und verfahren zum komprimieren, dekomprimieren und synthetisieren eines sprachsignals damit
EP2058803B1 (de) * 2007-10-29 2010-01-20 Harman/Becker Automotive Systems GmbH Partielle Sprachrekonstruktion
WO2009144368A1 (en) * 2008-05-30 2009-12-03 Nokia Corporation Method, apparatus and computer program product for providing improved speech synthesis
EP2242045B1 (de) * 2009-04-16 2012-06-27 Université de Mons Verfahren zur Sprachsynthese und Kodierung
CN102231275B (zh) * 2011-06-01 2013-10-16 北京宇音天下科技有限公司 一种基于加权混合激励的嵌入式语音合成方法
CN102270449A (zh) * 2011-08-10 2011-12-07 歌尔声学股份有限公司 参数语音合成方法和系统
US20130080172A1 (en) * 2011-09-22 2013-03-28 General Motors Llc Objective evaluation of synthesized speech attributes
US10453479B2 (en) * 2011-09-23 2019-10-22 Lessac Technologies, Inc. Methods for aligning expressive speech utterances with text and systems therefor
GB2508417B (en) * 2012-11-30 2017-02-08 Toshiba Res Europe Ltd A speech processing system
TWI573129B (zh) * 2013-02-05 2017-03-01 國立交通大學 編碼串流產生裝置、韻律訊息編碼裝置、韻律結構分析裝置與語音合成之裝置及方法

Also Published As

Publication number Publication date
CN108369803A (zh) 2018-08-03
EP3363015A1 (de) 2018-08-22
WO2017061985A1 (en) 2017-04-13
KR20180078252A (ko) 2018-07-09
CA3004700A1 (en) 2017-04-13
CA3004700C (en) 2021-03-23
CN108369803B (zh) 2023-04-04
EP3363015A4 (de) 2019-06-12

Similar Documents

Publication Publication Date Title
US10621969B2 (en) Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
US9368103B2 (en) Estimation system of spectral envelopes and group delays for sound analysis and synthesis, and audio signal synthesis system
US7035791B2 (en) Feature-domain concatenative speech synthesis
CA3004700C (en) Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
WO2011026247A1 (en) Speech enhancement techniques on the power spectrum
AU2020227065B2 (en) Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
US10014007B2 (en) Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
CN110459202A (zh) 一种韵律标注方法、装置、设备、介质
KR20230109630A (ko) 오디오 신호 생성 및 오디오 생성기 훈련을 위한 방법 및 오디오 생성기
US10446133B2 (en) Multi-stream spectral representation for statistical parametric speech synthesis
CN102231275B (zh) 一种基于加权混合激励的嵌入式语音合成方法
EP3113180B1 (de) Verfahren zur durchführung einer audio-einblendung in ein sprachsignal und vorrichtung zur durchführung einer audio-einblendung in ein sprachsignal
Kadyan et al. Prosody features based low resource Punjabi children ASR and T-NT classifier using data augmentation
CN111862931A (zh) 一种语音生成方法及装置
Alhanjouri et al. Robust speaker identification using denoised wave atom and GMM
CN116994553A (zh) 语音合成模型的训练方法、语音合成方法、装置及设备
Ye Efficient Approaches for Voice Change and Voice Conversion Systems
Jinachitra Robust structured voice extraction for flexible expressive resynthesis
HERMUS Exponentieel Sinusoıdale Spraakcompressie voor Corpus-gebaseerde Tekst-naar-Spraak Synthese
Gao et al. A new approach to generating Pitch Cycle Waveform (PCW) for Waveform Interpolation codec

Legal Events

Date Code Title Description
MK5 Application lapsed section 142(2)(e) - patent request and compl. specification not accepted