KR20180078252A - Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system - Google Patents

Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system

Info

Publication number
KR20180078252A
Authority
KR
South Korea
Prior art keywords
pulse
speech
database
way
subband
Prior art date
Application number
KR1020187012944A
Other languages
English (en)
Korean (ko)
Inventor
Rajesh Dachiraju
E. Veera Raghavendra
Aravind Ganapathiraju
Original Assignee
Interactive Intelligence Group, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Interactive Intelligence Group, Inc.
Publication of KR20180078252A

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/75 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 for modelling vocal tract parameters

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
KR1020187012944A 2015-10-06 2015-10-06 Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system KR20180078252A (ko)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US2015/054122 WO2017061985A1 (en) 2015-10-06 2015-10-06 Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system

Publications (1)

Publication Number Publication Date
KR20180078252A (ko) 2018-07-09

Family

ID=58488102

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020187012944A KR20180078252A (ko) 2015-10-06 2015-10-06 Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system

Country Status (6)

Country Link
EP (1) EP3363015A4 (de)
KR (1) KR20180078252A (de)
CN (1) CN108369803B (de)
AU (1) AU2015411306A1 (de)
CA (1) CA3004700C (de)
WO (1) WO2017061985A1 (de)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3857541B1 (de) * 2018-09-30 2023-07-19 Microsoft Technology Licensing, LLC Generation of speech waveforms
CN109767755A (zh) * 2019-03-01 2019-05-17 广州多益网络股份有限公司 Speech synthesis method and system
CN111862931B (zh) * 2020-05-08 2024-09-24 北京嘀嘀无限科技发展有限公司 Speech generation method and device
CN112365875B (zh) * 2020-11-18 2021-09-10 北京百度网讯科技有限公司 Speech synthesis method, apparatus, vocoder, and electronic device
CN113571079A (zh) * 2021-02-08 2021-10-29 腾讯科技(深圳)有限公司 Speech enhancement method, apparatus, device, and storage medium

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6070140A (en) * 1995-06-05 2000-05-30 Tran; Bao Q. Speech recognizer
US5937384A (en) * 1996-05-01 1999-08-10 Microsoft Corporation Method and system for speech recognition using continuous density hidden Markov models
US20020116196A1 (en) * 1998-11-12 2002-08-22 Tran Bao Q. Speech recognizer
US6970820B2 (en) * 2001-02-26 2005-11-29 Matsushita Electric Industrial Co., Ltd. Voice personalization of speech synthesizer
WO2003019527A1 (fr) * 2001-08-31 2003-03-06 Kabushiki Kaisha Kenwood Method and apparatus for generating a pitch-assigned signal, and method and apparatus for compressing/decompressing and synthesizing a speech signal using the same
ATE456130T1 (de) * 2007-10-29 2010-02-15 Harman Becker Automotive Sys Partial speech reconstruction
CA2724753A1 (en) * 2008-05-30 2009-12-03 Nokia Corporation Method, apparatus and computer program product for providing improved speech synthesis
PL2242045T3 (pl) * 2009-04-16 2013-02-28 Univ Mons Method for speech coding and synthesis
CN102231275B (zh) * 2011-06-01 2013-10-16 北京宇音天下科技有限公司 Embedded speech synthesis method based on weighted mixed excitation
CN102270449A (zh) * 2011-08-10 2011-12-07 歌尔声学股份有限公司 Parametric speech synthesis method and system
US20130080172A1 (en) * 2011-09-22 2013-03-28 General Motors Llc Objective evaluation of synthesized speech attributes
US10453479B2 (en) * 2011-09-23 2019-10-22 Lessac Technologies, Inc. Methods for aligning expressive speech utterances with text and systems therefor
GB2508417B (en) * 2012-11-30 2017-02-08 Toshiba Res Europe Ltd A speech processing system
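TWI573129B (zh) * 2013-02-05 2017-03-01 國立交通大學 Coded stream generation device, prosodic information encoding device, prosodic structure analysis device, and speech synthesis device and method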

Also Published As

Publication number Publication date
WO2017061985A1 (en) 2017-04-13
AU2015411306A1 (en) 2018-05-24
CN108369803A (zh) 2018-08-03
CA3004700A1 (en) 2017-04-13
CA3004700C (en) 2021-03-23
CN108369803B (zh) 2023-04-04
EP3363015A4 (de) 2019-06-12
EP3363015A1 (de) 2018-08-22

Similar Documents

Publication Publication Date Title
US9368103B2 (en) Estimation system of spectral envelopes and group delays for sound analysis and synthesis, and audio signal synthesis system
US10186252B1 (en) Text to speech synthesis using deep neural network with constant unit length spectrogram
US10621969B2 (en) Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
US7035791B2 (en) Feature-domain concatenative speech synthesis
US8280724B2 (en) Speech synthesis using complex spectral modeling
AU2020227065B2 (en) Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
KR20180078252A (ko) Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
US10014007B2 (en) Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
GB2603776A (en) Methods and systems for modifying speech generated by a text-to-speech synthesiser
RU2427044C1 (ru) Text-dependent voice conversion method
Ghai et al. Exploring the effect of differences in the acoustic correlates of adults' and children's speech in the context of automatic speech recognition
JP2017520016A5 (ja) Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
US10446133B2 (en) Multi-stream spectral representation for statistical parametric speech synthesis
Narendra et al. Time-domain deterministic plus noise model based hybrid source modeling for statistical parametric speech synthesis
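CN114270433A (zh) Acoustic model learning device, speech synthesis device, method, and program
JP5245962B2 (ja) Speech synthesis device, speech synthesis method, program, and recording medium
JPWO2009041402A1 (ja) Frequency axis warping coefficient estimation device, system, method, and program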
Yakoumaki et al. Emotional speech classification using adaptive sinusoidal modelling.
Sulír et al. The influence of adaptation database size on the quality of HMM-based synthetic voice based on the large average voice model
Achanta et al. Significance of Maximum Spectral Amplitude in Sub-bands for Spectral Envelope Estimation and Its Application to Statistical Parametric Speech Synthesis
Tychtl et al. Corpus-Based Database of Residual Excitations Used for Speech Reconstruction from MFCCs
Hermus Exponential Sinusoidal Speech Compression for Corpus-Based Text-to-Speech Synthesis

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
AMND Amendment
E902 Notification of reason for refusal
AMND Amendment
E601 Decision to refuse application
X091 Application refused [patent]
E601 Decision to refuse application
E801 Decision on dismissal of amendment