EP1860646A3 - Automatic segmentaion in speech synthesis - Google Patents

Automatic segmentaion in speech synthesis Download PDF

Info

Publication number
EP1860646A3
EP1860646A3 EP07116266A EP07116266A EP1860646A3 EP 1860646 A3 EP1860646 A3 EP 1860646A3 EP 07116266 A EP07116266 A EP 07116266A EP 07116266 A EP07116266 A EP 07116266A EP 1860646 A3 EP1860646 A3 EP 1860646A3
Authority
EP
European Patent Office
Prior art keywords
phone
spectral
spectral boundary
boundary
labels
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP07116266A
Other languages
German (de)
French (fr)
Other versions
EP1860646A2 (en
Inventor
Alistair D. Conkie
Yeon-Jun Kim
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AT&T Corp
Original Assignee
AT&T Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US10/341,869 external-priority patent/US7266497B2/en
Application filed by AT&T Corp filed Critical AT&T Corp
Publication of EP1860646A2 publication Critical patent/EP1860646A2/en
Publication of EP1860646A3 publication Critical patent/EP1860646A3/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/06Elementary speech units used in speech synthesisers; Concatenation rules

Abstract

A method for segmenting phone labels to reduce misalignments in order to improve synthetic speech when the phone labels are concatenated comprises:
training a set of HMMs using one of a specific speaker's hand-labeled speech data and speaker-independent speech data;
segmenting the trained set of HMMs using an alignment to produce phone labels, wherein each phone label has a spectral boundary;
using a weighted slope metric to identify bending points of spectral transitions, wherein each bending point corresponds to a spectral boundary; and
correcting a particular spectral boundary of a particular phone label if the particular spectral boundary does not coincide with a particular bending point.
EP07116266A 2002-03-29 2003-03-27 Automatic segmentaion in speech synthesis Withdrawn EP1860646A3 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US36904302P 2002-03-29 2002-03-29
US10/341,869 US7266497B2 (en) 2002-03-29 2003-01-14 Automatic segmentation in speech synthesis
EP03100795A EP1394769B1 (en) 2002-03-29 2003-03-27 Automatic segmentation in speech synthesis

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
EP03100795A Division EP1394769B1 (en) 2002-03-29 2003-03-27 Automatic segmentation in speech synthesis

Publications (2)

Publication Number Publication Date
EP1860646A2 EP1860646A2 (en) 2007-11-28
EP1860646A3 true EP1860646A3 (en) 2008-09-03

Family

ID=38621945

Family Applications (2)

Application Number Title Priority Date Filing Date
EP07116265A Withdrawn EP1860645A3 (en) 2002-03-29 2003-03-27 Automatic segmentation in speech synthesis
EP07116266A Withdrawn EP1860646A3 (en) 2002-03-29 2003-03-27 Automatic segmentaion in speech synthesis

Family Applications Before (1)

Application Number Title Priority Date Filing Date
EP07116265A Withdrawn EP1860645A3 (en) 2002-03-29 2003-03-27 Automatic segmentation in speech synthesis

Country Status (1)

Country Link
EP (2) EP1860645A3 (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1035537A2 (en) * 1999-03-09 2000-09-13 Matsushita Electric Industrial Co., Ltd. Identification of unit overlap regions for concatenative speech synthesis system

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1035537A2 (en) * 1999-03-09 2000-09-13 Matsushita Electric Industrial Co., Ltd. Identification of unit overlap regions for concatenative speech synthesis system

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
BRUGNARA F ET AL: "AUTOMATIC SEGMENTATION AND LABELING OF SPEECH BASED ON HIDDEN MARKOV MODELS", SPEECH COMMUNICATION, ELSEVIER SCIENCE PUBLISHERS, AMSTERDAM, NL, vol. 12, no. 4, 1 August 1993 (1993-08-01), pages 357 - 370, XP000393652, ISSN: 0167-6393 *
HON H ET AL: "Automatic generation of synthesis units for trainable text-to-speech systems", ACOUSTICS, SPEECH AND SIGNAL PROCESSING, 1998. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON SEATTLE, WA, USA 12-15 MAY 1998, NEW YORK, NY, USA,IEEE, US, 12 May 1998 (1998-05-12), pages 293 - 296, XP010279159, ISBN: 0-7803-4428-6 *
TOLEDANO D T: "Neural network boundary refining for automatic speech segmentation", 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL, vol. 6, 5 June 2000 (2000-06-05), pages 3438 - 3441, XP010505636 *

Also Published As

Publication number Publication date
EP1860645A2 (en) 2007-11-28
EP1860646A2 (en) 2007-11-28
EP1860645A3 (en) 2008-09-03

Similar Documents

Publication Publication Date Title
CA2307300A1 (en) Method and system for proofreading and correcting dictated text
CA2299051A1 (en) Hierarchical subband linear predictive cepstral features for hmm-based speech recognition
ATE317583T1 (en) TEXT EDITING OF RECOGNIZED LANGUAGE WITH SIMULTANEOUS PLAYBACK
ATE319161T1 (en) CORRECTION DEVICE WITH MARKING PARTS OF A RECOGNIZED TEXT
AU1901101A (en) Information terminal with built-in fingerprint recognizer
EP1455268A3 (en) Presentation of data based on user input
WO2006023631A3 (en) Document transcription system training
WO2001001373A3 (en) Electronic book with voice synthesis and recognition
EP1205898A3 (en) Technique for mentoring pre-readers and early readers
EP1968030A3 (en) Method and terminal of producing and providing traffic signal information
WO2000039788A3 (en) Knowledge-based strategies applied to n-best lists in automatic speech recognition systems
WO2008142836A1 (en) Voice tone converting device and voice tone converting method
AU2003212510A1 (en) Unsupervised data segmentation
CA2233179A1 (en) Unsupervised hmm adaptation based on speech-silence discrimination
WO2004084467A3 (en) Recovering an erased voice frame with time warping
WO1999016052A3 (en) Speech recognition system for recognizing continuous and isolated speech
CA2423144A1 (en) Automatic segmentation in speech synthesis
CA2366892A1 (en) Method and apparatus for speaker recognition using a speaker dependent transform
WO2003041051A3 (en) Hmm-based text-to-phoneme parser and method for training same
DE60020504D1 (en) ADJUSTING A LANGUAGE IDENTIFIER TO CORRECTED TEXTS
EP2325234A3 (en) High impact poly(urethane urea) polysulfides
EP1860646A3 (en) Automatic segmentaion in speech synthesis
WO2002079744A3 (en) Sound characterisation and/or identification based on prosodic listening
DE60022976D1 (en) LANGUAGE RECOGNITION WITH TRANSFER
WO2004109508A3 (en) System and method for object navigation grammar completion

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AC Divisional application: reference to earlier application

Ref document number: 1394769

Country of ref document: EP

Kind code of ref document: P

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): DE FI FR GB NL

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): DE FI FR GB NL

RIC1 Information provided on ipc code assigned before grant

Ipc: G10L 15/14 20060101ALI20080725BHEP

Ipc: G10L 13/06 20060101AFI20071024BHEP

17P Request for examination filed

Effective date: 20090302

17Q First examination report despatched

Effective date: 20090403

AKX Designation fees paid

Designated state(s): DE FI FR GB NL

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20090814