EP1860646A3 - Automatic segmentaion in speech synthesis - Google Patents
Automatic segmentaion in speech synthesis Download PDFInfo
- Publication number
- EP1860646A3 EP1860646A3 EP07116266A EP07116266A EP1860646A3 EP 1860646 A3 EP1860646 A3 EP 1860646A3 EP 07116266 A EP07116266 A EP 07116266A EP 07116266 A EP07116266 A EP 07116266A EP 1860646 A3 EP1860646 A3 EP 1860646A3
- Authority
- EP
- European Patent Office
- Prior art keywords
- phone
- spectral
- spectral boundary
- boundary
- labels
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
Abstract
training a set of HMMs using one of a specific speaker's hand-labeled speech data and speaker-independent speech data;
segmenting the trained set of HMMs using an alignment to produce phone labels, wherein each phone label has a spectral boundary;
using a weighted slope metric to identify bending points of spectral transitions, wherein each bending point corresponds to a spectral boundary; and
correcting a particular spectral boundary of a particular phone label if the particular spectral boundary does not coincide with a particular bending point.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US36904302P | 2002-03-29 | 2002-03-29 | |
US10/341,869 US7266497B2 (en) | 2002-03-29 | 2003-01-14 | Automatic segmentation in speech synthesis |
EP03100795A EP1394769B1 (en) | 2002-03-29 | 2003-03-27 | Automatic segmentation in speech synthesis |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP03100795A Division EP1394769B1 (en) | 2002-03-29 | 2003-03-27 | Automatic segmentation in speech synthesis |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1860646A2 EP1860646A2 (en) | 2007-11-28 |
EP1860646A3 true EP1860646A3 (en) | 2008-09-03 |
Family
ID=38621945
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP07116265A Withdrawn EP1860645A3 (en) | 2002-03-29 | 2003-03-27 | Automatic segmentation in speech synthesis |
EP07116266A Withdrawn EP1860646A3 (en) | 2002-03-29 | 2003-03-27 | Automatic segmentaion in speech synthesis |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP07116265A Withdrawn EP1860645A3 (en) | 2002-03-29 | 2003-03-27 | Automatic segmentation in speech synthesis |
Country Status (1)
Country | Link |
---|---|
EP (2) | EP1860645A3 (en) |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1035537A2 (en) * | 1999-03-09 | 2000-09-13 | Matsushita Electric Industrial Co., Ltd. | Identification of unit overlap regions for concatenative speech synthesis system |
-
2003
- 2003-03-27 EP EP07116265A patent/EP1860645A3/en not_active Withdrawn
- 2003-03-27 EP EP07116266A patent/EP1860646A3/en not_active Withdrawn
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1035537A2 (en) * | 1999-03-09 | 2000-09-13 | Matsushita Electric Industrial Co., Ltd. | Identification of unit overlap regions for concatenative speech synthesis system |
Non-Patent Citations (3)
Title |
---|
BRUGNARA F ET AL: "AUTOMATIC SEGMENTATION AND LABELING OF SPEECH BASED ON HIDDEN MARKOV MODELS", SPEECH COMMUNICATION, ELSEVIER SCIENCE PUBLISHERS, AMSTERDAM, NL, vol. 12, no. 4, 1 August 1993 (1993-08-01), pages 357 - 370, XP000393652, ISSN: 0167-6393 * |
HON H ET AL: "Automatic generation of synthesis units for trainable text-to-speech systems", ACOUSTICS, SPEECH AND SIGNAL PROCESSING, 1998. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON SEATTLE, WA, USA 12-15 MAY 1998, NEW YORK, NY, USA,IEEE, US, 12 May 1998 (1998-05-12), pages 293 - 296, XP010279159, ISBN: 0-7803-4428-6 * |
TOLEDANO D T: "Neural network boundary refining for automatic speech segmentation", 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL, vol. 6, 5 June 2000 (2000-06-05), pages 3438 - 3441, XP010505636 * |
Also Published As
Publication number | Publication date |
---|---|
EP1860645A2 (en) | 2007-11-28 |
EP1860646A2 (en) | 2007-11-28 |
EP1860645A3 (en) | 2008-09-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2307300A1 (en) | Method and system for proofreading and correcting dictated text | |
CA2299051A1 (en) | Hierarchical subband linear predictive cepstral features for hmm-based speech recognition | |
ATE317583T1 (en) | TEXT EDITING OF RECOGNIZED LANGUAGE WITH SIMULTANEOUS PLAYBACK | |
ATE319161T1 (en) | CORRECTION DEVICE WITH MARKING PARTS OF A RECOGNIZED TEXT | |
AU1901101A (en) | Information terminal with built-in fingerprint recognizer | |
EP1455268A3 (en) | Presentation of data based on user input | |
WO2006023631A3 (en) | Document transcription system training | |
WO2001001373A3 (en) | Electronic book with voice synthesis and recognition | |
EP1205898A3 (en) | Technique for mentoring pre-readers and early readers | |
EP1968030A3 (en) | Method and terminal of producing and providing traffic signal information | |
WO2000039788A3 (en) | Knowledge-based strategies applied to n-best lists in automatic speech recognition systems | |
WO2008142836A1 (en) | Voice tone converting device and voice tone converting method | |
AU2003212510A1 (en) | Unsupervised data segmentation | |
CA2233179A1 (en) | Unsupervised hmm adaptation based on speech-silence discrimination | |
WO2004084467A3 (en) | Recovering an erased voice frame with time warping | |
WO1999016052A3 (en) | Speech recognition system for recognizing continuous and isolated speech | |
CA2423144A1 (en) | Automatic segmentation in speech synthesis | |
CA2366892A1 (en) | Method and apparatus for speaker recognition using a speaker dependent transform | |
WO2003041051A3 (en) | Hmm-based text-to-phoneme parser and method for training same | |
DE60020504D1 (en) | ADJUSTING A LANGUAGE IDENTIFIER TO CORRECTED TEXTS | |
EP2325234A3 (en) | High impact poly(urethane urea) polysulfides | |
EP1860646A3 (en) | Automatic segmentaion in speech synthesis | |
WO2002079744A3 (en) | Sound characterisation and/or identification based on prosodic listening | |
DE60022976D1 (en) | LANGUAGE RECOGNITION WITH TRANSFER | |
WO2004109508A3 (en) | System and method for object navigation grammar completion |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AC | Divisional application: reference to earlier application |
Ref document number: 1394769 Country of ref document: EP Kind code of ref document: P |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): DE FI FR GB NL |
|
PUAL | Search report despatched |
Free format text: ORIGINAL CODE: 0009013 |
|
AK | Designated contracting states |
Kind code of ref document: A3 Designated state(s): DE FI FR GB NL |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G10L 15/14 20060101ALI20080725BHEP Ipc: G10L 13/06 20060101AFI20071024BHEP |
|
17P | Request for examination filed |
Effective date: 20090302 |
|
17Q | First examination report despatched |
Effective date: 20090403 |
|
AKX | Designation fees paid |
Designated state(s): DE FI FR GB NL |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20090814 |