NZ725925A

NZ725925A - Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system

Info

Publication number: NZ725925A
Application number: NZ725925A
Authority: NZ
Inventors: Rajesh Dachiraju; Aravind Ganapathiraju
Original assignee: Interactive Intelligence Inc
Priority date: 2014-05-28
Filing date: 2014-05-28
Publication date: 2020-04-24
Also published as: ZA201607696B; CA2947957A1; EP3149727A1; AU2014395554A1; JP6449331B2; EP3149727A4; CA2947957C; EP3149727B1; JP2017520016A; WO2015183254A1; AU2020227065B2; BR112016027537B1; CA3178027A1; AU2020227065A1; AU2014395554B2; BR112016027537A2

Abstract

A method is presented for forming the excitation signal for a glottal pulse model based parametric speech synthesis system. In one embodiment, fundamental frequency values are used to form the excitation signal. The excitation is modeled using a voice source pulse selected from a database of a given speaker. The voice source signal is segmented into glottal segments, which are used in vector representation to identify the glottal pulse used for formation of the excitation signal. Use of a novel distance metric and preserving the original signals extracted from the speakers voice samples helps capture low frequency information of the excitation signal. In addition, segment edge artifacts are removed by applying a unique segment joining method to improve the quality of synthetic speech while creating a true representation of the voice quality of a speaker.