EP1860645A3 - Automatic segmentation in speech synthesis - Google Patents

Automatic segmentation in speech synthesis Download PDF

Info

Publication number
EP1860645A3
EP1860645A3 EP07116265A EP07116265A EP1860645A3 EP 1860645 A3 EP1860645 A3 EP 1860645A3 EP 07116265 A EP07116265 A EP 07116265A EP 07116265 A EP07116265 A EP 07116265A EP 1860645 A3 EP1860645 A3 EP 1860645A3
Authority
EP
European Patent Office
Prior art keywords
phone
spectral
speech synthesis
spectral boundary
labels
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP07116265A
Other languages
German (de)
French (fr)
Other versions
EP1860645A2 (en
Inventor
Alistair D. Conkie
Yeon-Jun Kim
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AT&T Corp
Original Assignee
AT&T Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US10/341,869 external-priority patent/US7266497B2/en
Application filed by AT&T Corp filed Critical AT&T Corp
Publication of EP1860645A2 publication Critical patent/EP1860645A2/en
Publication of EP1860645A3 publication Critical patent/EP1860645A3/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/06Elementary speech units used in speech synthesisers; Concatenation rules

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)
  • Image Analysis (AREA)
  • Machine Translation (AREA)
  • Telephone Function (AREA)

Abstract

In a system having a speech inventory that includes phone labels that are concatenated to form synthetic speech, a method for segmenting the phone labels comprises:
performing an alignment on a trained set of HMMs to produce phone labels that are segmented, wherein each phone label has a spectral boundary; and
performing spectral boundary correction on the phone labels, wherein spectral boundary correction re-aligns each spectral boundary using bending points of spectral transitions,

wherein the phone labels having spectral boundary correction are used for speech synthesis.
EP07116265A 2002-03-29 2003-03-27 Automatic segmentation in speech synthesis Withdrawn EP1860645A3 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US36904302P 2002-03-29 2002-03-29
US10/341,869 US7266497B2 (en) 2002-03-29 2003-01-14 Automatic segmentation in speech synthesis
EP03100795A EP1394769B1 (en) 2002-03-29 2003-03-27 Automatic segmentation in speech synthesis

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
EP03100795A Division EP1394769B1 (en) 2002-03-29 2003-03-27 Automatic segmentation in speech synthesis

Publications (2)

Publication Number Publication Date
EP1860645A2 EP1860645A2 (en) 2007-11-28
EP1860645A3 true EP1860645A3 (en) 2008-09-03

Family

ID=38621945

Family Applications (2)

Application Number Title Priority Date Filing Date
EP07116266A Withdrawn EP1860646A3 (en) 2002-03-29 2003-03-27 Automatic segmentaion in speech synthesis
EP07116265A Withdrawn EP1860645A3 (en) 2002-03-29 2003-03-27 Automatic segmentation in speech synthesis

Family Applications Before (1)

Application Number Title Priority Date Filing Date
EP07116266A Withdrawn EP1860646A3 (en) 2002-03-29 2003-03-27 Automatic segmentaion in speech synthesis

Country Status (1)

Country Link
EP (2) EP1860646A3 (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1035537A2 (en) * 1999-03-09 2000-09-13 Matsushita Electric Industrial Co., Ltd. Identification of unit overlap regions for concatenative speech synthesis system

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1035537A2 (en) * 1999-03-09 2000-09-13 Matsushita Electric Industrial Co., Ltd. Identification of unit overlap regions for concatenative speech synthesis system

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
BRUGNARA F ET AL: "AUTOMATIC SEGMENTATION AND LABELING OF SPEECH BASED ON HIDDEN MARKOV MODELS", SPEECH COMMUNICATION, ELSEVIER SCIENCE PUBLISHERS, AMSTERDAM, NL, vol. 12, no. 4, 1 August 1993 (1993-08-01), pages 357 - 370, XP000393652, ISSN: 0167-6393 *
HON H ET AL: "Automatic generation of synthesis units for trainable text-to-speech systems", ACOUSTICS, SPEECH AND SIGNAL PROCESSING, 1998. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON SEATTLE, WA, USA 12-15 MAY 1998, NEW YORK, NY, USA,IEEE, US, 12 May 1998 (1998-05-12), pages 293 - 296, XP010279159, ISBN: 0-7803-4428-6 *
TOLEDANO D T: "Neural network boundary refining for automatic speech segmentation", 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL, vol. 6, 5 June 2000 (2000-06-05), pages 3438 - 3441, XP010505636 *

Also Published As

Publication number Publication date
EP1860646A2 (en) 2007-11-28
EP1860645A2 (en) 2007-11-28
EP1860646A3 (en) 2008-09-03

Similar Documents

Publication Publication Date Title
AU2003214684A1 (en) Creation method for characters/words and the information and communication service method thereby
WO2005111761A3 (en) System and method for creating tamper-resistant code
TWI316076B (en) An optical disk,a system for labeling a substrate and a method for labeling an optical disk
AU2003212510A1 (en) Unsupervised data segmentation
EP1339187B8 (en) Information communication method
AU2003296860A1 (en) Information extraction using an object based semantic network
WO2005055006A3 (en) Business software application generation system and method
AU2003226446A1 (en) Text structure for voice synthesis, voice synthesis method, voice synthesis apparatus, and computer program thereof
AU2003262646A1 (en) Information communication apparatus and method
WO2005036327A3 (en) System and method for providing information regarding an identity's media availability
AU2003262667A1 (en) Methods and apparatus for simultaneous independent voice and data services using a remote subscriber identity module (sim)
AU2003264141A1 (en) A method and a system for performing connectivity evaluations on data communication networks and related information technology product
MXPA03007178A (en) Method, module, device and server for voice recognition.
AU2002326089A1 (en) A method for creating multimedia messages with rfid tag information
AU2003211448A1 (en) Server, information providing method and program
AU2003215097A1 (en) System and method for concurrent multimodal communication using concurrent multimodal tags
AU2003281603A1 (en) System for extracting information from a natural language text
AU2003291397A1 (en) Method and apparatus for coding gain information in a speech coding system
EP1860645A3 (en) Automatic segmentation in speech synthesis
AU2003226449A1 (en) Display apparatus, information display method, information display program, readable recording medium, and information apparatus
AU1663100A (en) Method, system and business model for performing an auction
AU2003211970A1 (en) Information providing apparatus, provided information presenting apparatus, and information providing method
AU2003258092A1 (en) A smart audio guide system and method
WO2004109508A3 (en) System and method for object navigation grammar completion
AU2003261982A1 (en) Information processing apparatus, and information processing method

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AC Divisional application: reference to earlier application

Ref document number: 1394769

Country of ref document: EP

Kind code of ref document: P

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): DE FI FR GB NL

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): DE FI FR GB NL

RIC1 Information provided on ipc code assigned before grant

Ipc: G10L 15/14 20060101ALI20080725BHEP

Ipc: G10L 13/06 20060101AFI20071024BHEP

17P Request for examination filed

Effective date: 20090302

17Q First examination report despatched

Effective date: 20090403

AKX Designation fees paid

Designated state(s): DE FI FR GB NL

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 20090825