NZ725925A - Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system - Google Patents

Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system

Info

Publication number
NZ725925A
NZ725925A NZ725925A NZ72592514A NZ725925A NZ 725925 A NZ725925 A NZ 725925A NZ 725925 A NZ725925 A NZ 725925A NZ 72592514 A NZ72592514 A NZ 72592514A NZ 725925 A NZ725925 A NZ 725925A
Authority
NZ
New Zealand
Prior art keywords
excitation signal
model based
glottal
forming
synthesis system
Prior art date
Application number
NZ725925A
Inventor
Rajesh Dachiraju
Aravind Ganapathiraju
Original Assignee
Interactive Intelligence Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Interactive Intelligence Inc filed Critical Interactive Intelligence Inc
Publication of NZ725925A publication Critical patent/NZ725925A/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90Pitch determination of speech signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers

Abstract

A method is presented for forming the excitation signal for a glottal pulse model based parametric speech synthesis system. In one embodiment, fundamental frequency values are used to form the excitation signal. The excitation is modeled using a voice source pulse selected from a database of a given speaker. The voice source signal is segmented into glottal segments, which are used in vector representation to identify the glottal pulse used for formation of the excitation signal. Use of a novel distance metric and preserving the original signals extracted from the speakers voice samples helps capture low frequency information of the excitation signal. In addition, segment edge artifacts are removed by applying a unique segment joining method to improve the quality of synthetic speech while creating a true representation of the voice quality of a speaker.
NZ725925A 2014-05-28 2014-05-28 Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system NZ725925A (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US2014/039722 WO2015183254A1 (en) 2014-05-28 2014-05-28 Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system

Publications (1)

Publication Number Publication Date
NZ725925A true NZ725925A (en) 2020-04-24

Family

ID=54699420

Family Applications (1)

Application Number Title Priority Date Filing Date
NZ725925A NZ725925A (en) 2014-05-28 2014-05-28 Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system

Country Status (8)

Country Link
EP (1) EP3149727B1 (en)
JP (1) JP6449331B2 (en)
AU (2) AU2014395554B2 (en)
BR (1) BR112016027537B1 (en)
CA (2) CA2947957C (en)
NZ (1) NZ725925A (en)
WO (1) WO2015183254A1 (en)
ZA (1) ZA201607696B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10255903B2 (en) 2014-05-28 2019-04-09 Interactive Intelligence Group, Inc. Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
US10014007B2 (en) 2014-05-28 2018-07-03 Interactive Intelligence, Inc. Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
NZ749370A (en) * 2016-06-02 2020-03-27 Genesys Telecommunications Laboratories Inc Technologies for authenticating a speaker using voice biometrics
JP2018040838A (en) * 2016-09-05 2018-03-15 国立研究開発法人情報通信研究機構 Method for extracting intonation structure of voice and computer program therefor

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5400434A (en) * 1990-09-04 1995-03-21 Matsushita Electric Industrial Co., Ltd. Voice source for synthetic speech system
US6795807B1 (en) * 1999-08-17 2004-09-21 David R. Baraff Method and means for creating prosody in speech regeneration for laryngectomees
JP2002244689A (en) * 2001-02-22 2002-08-30 Rikogaku Shinkokai Synthesizing method for averaged voice and method for synthesizing arbitrary-speaker's voice from averaged voice
CN102047321A (en) * 2008-05-30 2011-05-04 诺基亚公司 Method, apparatus and computer program product for providing improved speech synthesis
JP5075865B2 (en) * 2009-03-25 2012-11-21 株式会社東芝 Audio processing apparatus, method, and program
PL2242045T3 (en) * 2009-04-16 2013-02-28 Univ Mons Speech synthesis and coding methods
JP5085700B2 (en) * 2010-08-30 2012-11-28 株式会社東芝 Speech synthesis apparatus, speech synthesis method and program
US8744854B1 (en) * 2012-09-24 2014-06-03 Chengjun Julian Chen System and method for voice transformation

Also Published As

Publication number Publication date
ZA201607696B (en) 2019-03-27
CA2947957A1 (en) 2015-12-03
EP3149727A1 (en) 2017-04-05
AU2014395554A1 (en) 2016-11-24
JP6449331B2 (en) 2019-01-09
EP3149727A4 (en) 2018-01-24
CA2947957C (en) 2023-01-03
EP3149727B1 (en) 2021-01-27
JP2017520016A (en) 2017-07-20
WO2015183254A1 (en) 2015-12-03
AU2020227065B2 (en) 2021-11-18
BR112016027537B1 (en) 2022-05-10
CA3178027A1 (en) 2015-12-03
AU2020227065A1 (en) 2020-09-24
AU2014395554B2 (en) 2020-09-24
BR112016027537A2 (en) 2017-08-15

Similar Documents

Publication Publication Date Title
MX2018004828A (en) Apparatus and method for generating a filtered audio signal realizing elevation rendering.
MX364461B (en) Method and apparatus for implementing recording of object audio, and electronic device.
MX2016012317A (en) Apparatus and method for audio rendering employing a geometric distance definition.
WO2014145960A3 (en) Method and system for generating advanced feature discrimination vectors for use in speech recognition
WO2014133756A3 (en) Method and apparatus for learning-enhanced altas-based auto-segmentation
MY173561A (en) Audio signal classification method and apparatus
NZ725925A (en) Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
WO2014025682A3 (en) Acoustic data selection for training the parameters of an acoustic model
WO2012129255A3 (en) Systems and methods for segmenting and/or classifying an audio signal from transformed audio information
MX352154B (en) Unvoiced/voiced decision for speech processing.
EP2960866A3 (en) Method and apparatus for creating curved surface model
TW201614642A (en) Method and apparatus for separating speech data from background data in audio communication
JP2016071029A5 (en)
GB2539592A (en) Subsurface formation modeling with integrated stress profiles
MY172710A (en) Apparatus and method for generating a frequency enhancement signal using an energy limitation operation
PH12016500600B1 (en) Method, apparatus, device, computer-readable medium for bandwidth extension of an audio signal using a scaled high-band excitation
EP3363015A4 (en) Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system
GB2565744A (en) Full waveform inversion of vertical seismic profile data for anisotropic velocities using pseudo-acoustic wave equations
SG10201805102PA (en) Audio coding method and related apparatus
MX2016012004A (en) Apparatus, method and corresponding computer program for generating an error concealment signal using an adaptive noise estimation.
TW201612894A (en) Synthesis method of audio files and synthesis system of audio files using same
GB2557056A (en) Acoustic anisotrophy log visualization
GB2574164A (en) Sound identification utilizing periodic indications
Sahidullah Enhancement of speaker recognition performance using block level, relative and temporal information of subband energies
WO2016036163A3 (en) Method and apparatus for learning and recognizing audio signal

Legal Events

Date Code Title Description
PSEA Patent sealed
RENW Renewal (renewal fees accepted)

Free format text: PATENT RENEWED FOR 1 YEAR UNTIL 28 MAY 2022 BY FRKELLY

Effective date: 20210528

RENW Renewal (renewal fees accepted)

Free format text: PATENT RENEWED FOR 1 YEAR UNTIL 28 MAY 2023 BY FRKELLY

Effective date: 20220506

RENW Renewal (renewal fees accepted)

Free format text: PATENT RENEWED FOR 1 YEAR UNTIL 28 MAY 2024 BY FRKELLY

Effective date: 20230505