EP1860646A3

EP1860646A3 - Automatic segmentaion in speech synthesis

Info

Publication number: EP1860646A3
Application number: EP07116266A
Authority: EP
Inventors: Alistair D. Conkie; Yeon-Jun Kim
Original assignee: AT&T Corp
Current assignee: AT&T Corp
Priority date: 2002-03-29
Filing date: 2003-03-27
Publication date: 2008-09-03
Also published as: EP1860645A2; EP1860646A2; EP1860645A3

Abstract

A method for segmenting phone labels to reduce misalignments in order to improve synthetic speech when the phone labels are concatenated comprises:
training a set of HMMs using one of a specific speaker's hand-labeled speech data and speaker-independent speech data;
segmenting the trained set of HMMs using an alignment to produce phone labels, wherein each phone label has a spectral boundary;
using a weighted slope metric to identify bending points of spectral transitions, wherein each bending point corresponds to a spectral boundary; and
correcting a particular spectral boundary of a particular phone label if the particular spectral boundary does not coincide with a particular bending point.