AU2014395554B2 - Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system - Google Patents

Info

Publication number
AU2014395554B2
AU2014395554B2 AU2014395554A AU2014395554A AU2014395554B2 AU 2014395554 B2 AU2014395554 B2 AU 2014395554B2 AU 2014395554 A AU2014395554 A AU 2014395554A AU 2014395554 A AU2014395554 A AU 2014395554A AU 2014395554 B2 AU2014395554 B2 AU 2014395554B2
Authority
AU
Australia
Prior art keywords
glottal
glottal pulse
signal
pulse
database
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
AU2014395554A
Other languages
English (en)
Other versions
AU2014395554A1 (en)
Inventor
Rajesh DACHIRAJU
Aravind GANAPATHIRAJU
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Interactive Intelligence Inc
Original Assignee
Interactive Intelligence Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Interactive Intelligence Inc
Publication of AU2014395554A1
Priority to AU2020227065A
Application granted
Publication of AU2014395554B2
Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90 Pitch determination of speech signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)
AU2014395554A 2014-05-28 2014-05-28 Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system Active AU2014395554B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2020227065A AU2020227065B2 (en) 2014-05-28 2020-09-03 Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US2014/039722 WO2015183254A1 (en) 2014-05-28 2014-05-28 Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system

Related Child Applications (1)

Application Number Priority Date Filing Date Title
AU2020227065A Division AU2020227065B2 (en) 2014-05-28 2020-09-03 Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system

Publications (2)

Publication Number Publication Date
AU2014395554A1 (en) 2016-11-24
AU2014395554B2 (en) 2020-09-24

Family

ID=54699420

Family Applications (2)

Application Number Priority Date Filing Date Title
AU2014395554A Active AU2014395554B2 (en) 2014-05-28 2014-05-28 Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
AU2020227065A Active AU2020227065B2 (en) 2014-05-28 2020-09-03 Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system

Family Applications After (1)

Application Number Priority Date Filing Date Title
AU2020227065A Active AU2020227065B2 (en) 2014-05-28 2020-09-03 Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system

Country Status (8)

Country Link
EP (1) EP3149727B1 (pt)
JP (1) JP6449331B2 (pt)
AU (2) AU2014395554B2 (pt)
BR (1) BR112016027537B1 (pt)
CA (2) CA3178027A1 (pt)
NZ (1) NZ725925A (pt)
WO (1) WO2015183254A1 (pt)
ZA (1) ZA201607696B (pt)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10014007B2 (en) 2014-05-28 2018-07-03 Interactive Intelligence, Inc. Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
US10255903B2 (en) 2014-05-28 2019-04-09 Interactive Intelligence Group, Inc. Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
US10614814B2 (en) 2016-06-02 2020-04-07 Interactive Intelligence Group, Inc. Technologies for authenticating a speaker using voice biometrics
JP2018040838A (ja) * 2016-09-05 2018-03-15 National Institute of Information and Communications Technology Method for extracting the intonation structure of speech and computer program therefor

Family Cites Families (8)

Publication number Priority date Publication date Assignee Title
US5400434A (en) * 1990-09-04 1995-03-21 Matsushita Electric Industrial Co., Ltd. Voice source for synthetic speech system
US6795807B1 (en) * 1999-08-17 2004-09-21 David R. Baraff Method and means for creating prosody in speech regeneration for laryngectomees
JP2002244689A (ja) * 2001-02-22 2002-08-30 Rikogaku Shinkokai Method for synthesizing an average voice and method for synthesizing an arbitrary speaker's voice from the average voice
EP2279507A4 (en) * 2008-05-30 2013-01-23 Nokia Corp METHOD, DEVICE AND COMPUTER PROGRAM PRODUCT FOR IMPROVED LANGUAGE SYNTHESIS
JP5075865B2 (ja) * 2009-03-25 2012-11-21 Toshiba Corp Speech processing device, method, and program
DK2242045T3 (da) * 2009-04-16 2012-09-24 Univ Mons Speech synthesis and coding methods
JP5085700B2 (ja) * 2010-08-30 2012-11-28 Toshiba Corp Speech synthesis device, speech synthesis method, and program
US8744854B1 (en) * 2012-09-24 2014-06-03 Chengjun Julian Chen System and method for voice transformation

Non-Patent Citations (2)

Title
RAITIO, T. et al., "Comparing glottal-flow-excited statistical parametric speech synthesis methods", IEEE International Conference on Acoustics, Speech and Signal Processing, 26-31 May 2013. *
RAITIO, T. et al., "Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis", 2011 IEEE International Conference on Acoustics, Speech and Signal Processing, 22-27 May 2011. *

Also Published As

Publication number Publication date
ZA201607696B (en) 2019-03-27
WO2015183254A1 (en) 2015-12-03
EP3149727A1 (en) 2017-04-05
BR112016027537A2 (pt) 2017-08-15
CA2947957C (en) 2023-01-03
AU2020227065B2 (en) 2021-11-18
EP3149727A4 (en) 2018-01-24
JP6449331B2 (ja) 2019-01-09
CA2947957A1 (en) 2015-12-03
AU2020227065A1 (en) 2020-09-24
NZ725925A (en) 2020-04-24
BR112016027537B1 (pt) 2022-05-10
AU2014395554A1 (en) 2016-11-24
JP2017520016A (ja) 2017-07-20
EP3149727B1 (en) 2021-01-27
CA3178027A1 (en) 2015-12-03

Similar Documents

Publication Publication Date Title
AU2020227065B2 (en) Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
US10621969B2 (en) Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
JP4802135B2 (ja) Speaker authentication registration and verification method and apparatus
US10014007B2 (en) Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
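KR20130133858A (ko) Speech syllable/vowel/phone boundary detection using auditory attention cues
CN108369803B (zh) Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
Ismail et al. MFCC-VQ approach for qalqalah tajweed rule checking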
US11929058B2 (en) Systems and methods for adapting human speaker embeddings in speech synthesis
JP2017520016A5 (ja) Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
EP3113180B1 (en) Method for performing audio inpainting on a speech signal and apparatus for performing audio inpainting on a speech signal
JP6142401B2 (ja) Speech synthesis model learning device, method, and program
Vasudev et al. Speaker identification using FBCC in Malayalam language
JP2012058293A (ja) Unvoiced filter learning device, speech synthesis device, unvoiced filter learning method, and program
Yakoumaki et al. Emotional speech classification using adaptive sinusoidal modelling.
KR100488121B1 (ko) Speaker verification apparatus and method applying individual cepstrum weights to improve discrimination between speakers
Pan et al. Comprehensive voice conversion analysis based on DGMM and feature combination
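CN116741156A (zh) Speech recognition method, apparatus, device, and storage medium based on semantic scenes
CN116884385A (zh) Speech synthesis method, apparatus, and computer-readable storage medium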
Apte Innovative wavelet based speech model using optimal mother wavelet generated from pitch synchronous LPC trajectory

Legal Events

Date Code Title Description
FGA Letters patent sealed or granted (standard patent)