EP1777697A3 - Method and apparatus for speech synthesis without prosody modification - Google Patents

Method and apparatus for speech synthesis without prosody modification Download PDF

Info

Publication number
EP1777697A3
EP1777697A3 EP07002565A EP07002565A EP1777697A3 EP 1777697 A3 EP1777697 A3 EP 1777697A3 EP 07002565 A EP07002565 A EP 07002565A EP 07002565 A EP07002565 A EP 07002565A EP 1777697 A3 EP1777697 A3 EP 1777697A3
Authority
EP
European Patent Office
Prior art keywords
speech
samples
present
produce
prosodic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP07002565A
Other languages
German (de)
French (fr)
Other versions
EP1777697B1 (en
EP1777697A2 (en
Inventor
Min Chu
Hu Peng
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/850,527 external-priority patent/US6978239B2/en
Application filed by Microsoft Corp filed Critical Microsoft Corp
Publication of EP1777697A2 publication Critical patent/EP1777697A2/en
Publication of EP1777697A3 publication Critical patent/EP1777697A3/en
Application granted granted Critical
Publication of EP1777697B1 publication Critical patent/EP1777697B1/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/06Elementary speech units used in speech synthesisers; Concatenation rules
    • G10L13/07Concatenation rules
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)

Abstract

A speech synthesizer is provided that concatenates stored samples of speech units without modifying the prosody of the samples. The present invention is able to achieve a high level of naturalness in synthesized speech with a carefully designed training speech corpus by storing samples based on the prosodic and phonetic context in which they occur. In particular, some embodiments of the present invention limit the training text to those sentences that will produce the most frequent sets of prosodic contexts for each speech unit. Further embodiments of the present invention also provide a multi-tier selection mechanism for selecting a set of samples that will produce the most natural sounding speech.
EP07002565A 2000-12-04 2001-12-03 Method for speech synthesis without prosody modification Expired - Lifetime EP1777697B1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US25116700P 2000-12-04 2000-12-04
US09/850,527 US6978239B2 (en) 2000-12-04 2001-05-07 Method and apparatus for speech synthesis without prosody modification
EP01128765A EP1213705B1 (en) 2000-12-04 2001-12-03 Method and apparatus for speech synthesis

Related Parent Applications (2)

Application Number Title Priority Date Filing Date
EP01128765A Division EP1213705B1 (en) 2000-12-04 2001-12-03 Method and apparatus for speech synthesis
EP01128765.3 Division 2001-12-03

Publications (3)

Publication Number Publication Date
EP1777697A2 EP1777697A2 (en) 2007-04-25
EP1777697A3 true EP1777697A3 (en) 2008-06-18
EP1777697B1 EP1777697B1 (en) 2013-03-20

Family

ID=37831625

Family Applications (1)

Application Number Title Priority Date Filing Date
EP07002565A Expired - Lifetime EP1777697B1 (en) 2000-12-04 2001-12-03 Method for speech synthesis without prosody modification

Country Status (1)

Country Link
EP (1) EP1777697B1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2559767A (en) * 2017-02-17 2018-08-22 Pastel Dreams Method and system for personalised voice synthesis
GB2559766A (en) * 2017-02-17 2018-08-22 Pastel Dreams Method and system for defining text content for speech segmentation
EP3895157A4 (en) * 2018-12-13 2022-07-27 Microsoft Technology Licensing, LLC Neural text-to-speech synthesis with multi-level text information

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0984426A2 (en) * 1998-08-31 2000-03-08 Canon Kabushiki Kaisha Speech synthesizing apparatus and method, and storage medium therefor
US6064960A (en) * 1997-12-18 2000-05-16 Apple Computer, Inc. Method and apparatus for improved duration modeling of phonemes

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6064960A (en) * 1997-12-18 2000-05-16 Apple Computer, Inc. Method and apparatus for improved duration modeling of phonemes
EP0984426A2 (en) * 1998-08-31 2000-03-08 Canon Kabushiki Kaisha Speech synthesizing apparatus and method, and storage medium therefor

Non-Patent Citations (7)

* Cited by examiner, † Cited by third party
Title
BIGORGNE D ET AL: "Multilingual PSOLA text-to-speech system", STATISTICAL SIGNAL AND ARRAY PROCESSING. MINNEAPOLIS, APR. 27 - 30, 1993, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), NEW YORK, IEEE, US, vol. VOL. 4, 27 April 1993 (1993-04-27), pages 187 - 190, XP010110425, ISBN: 0-7803-0946-4 *
BLACK A W ET AL: "OPTIMISING SELECTION OF UNITS FROM SPEECH DATABASES FOR CONCATENATIVE SYNTHESIS", 4TH EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY. EUROSPEECH '95. MADRID, SPAIN, SEPT. 18 - 21, 1995, EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY. (EUROSPEECH), MADRID : GRAFICAS BRENS, ES, vol. VOL. 1 CONF. 4, 18 September 1995 (1995-09-18), pages 581 - 584, XP000854776 *
FU-CHIANG CHOU ET AL: "A Chinese text-to-speech system based on part-of-speech analysis, prosodic modeling and non-uniform units", ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 1997. ICASSP-97., 1997 IEEE INTERNATIONAL CONFERENCE ON MUNICH, GERMANY 21-24 APRIL 1997, LOS ALAMITOS, CA, USA,IEEE COMPUT. SOC, US, 21 April 1997 (1997-04-21), pages 923 - 926, XP010225946, ISBN: 0-8186-7919-0 *
HUANG X ET AL: "Recent improvements on Microsoft's trainable text-to-speech system-Whistler", ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 1997. ICASSP-97., 1997 IEEE INTERNATIONAL CONFERENCE ON MUNICH, GERMANY 21-24 APRIL 1997, LOS ALAMITOS, CA, USA,IEEE COMPUT. SOC, US, 21 April 1997 (1997-04-21), pages 959 - 962, XP010225955, ISBN: 0-8186-7919-0 *
HUNT A J ET AL: "Unit selection in a concatenative speech synthesis system using a large speech database", 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING - PROCEEDINGS. (ICASSP). ATLANTA, MAY 7 - 10, 1996, IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING - PROCEEDINGS. (ICASSP), NEW YORK, IEEE, US, vol. VOL. 1 CONF. 21, 7 May 1996 (1996-05-07), pages 373 - 376, XP002133444, ISBN: 0-7803-3193-1 *
NAKAJIMA S ET AL: "Automatic generation of synthesis units based on context oriented clustering", ICASSP 88: 1988 INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND ICASSP 88: 1988 INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (CAT. NO.88CH2561-9) - 11-14 APRIL 1988, 11 April 1988 (1988-04-11), NEW YORK, USA, pages 659 - 662, XP010073228 *
TIEN YING FUNG ET AL: "Concatenating syllables for response generation in spoken language applications", IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2000; PROCEEDINGS, vol. 2, 5 June 2000 (2000-06-05), Istanbul, Turkey, 5-9 June 2000, pages 933 - 936, XP010504877 *

Also Published As

Publication number Publication date
EP1777697B1 (en) 2013-03-20
EP1777697A2 (en) 2007-04-25

Similar Documents

Publication Publication Date Title
EP1213705A3 (en) Method and apparatus for speech synthesis without prosody modification
US6308156B1 (en) Microsegment-based speech-synthesis process
EP1675101A3 (en) Singing voice-synthesizing method and apparatus and storage medium
WO2005088606B1 (en) Prosodic speech text codes and their use in computerized speech systems
EP1037195A3 (en) Generation and synthesis of prosody templates
WO2008114258A1 (en) Speech samples library for text-to-speech and methods and apparatus for generating and using same
EP1777697A3 (en) Method and apparatus for speech synthesis without prosody modification
Allen The PHONASCII system
JPH01284898A (en) Voice synthesizing device
JPH05100692A (en) Voice synthesizer
Rama et al. Thirukkural: a text-to-speech synthesis system
Lukaszewicz et al. Microphonemic method of speech synthesis
Terken et al. Automatic synthesis of natural-sounding intonation for text-to-speech conversion in Dutch
Ding et al. Glottalization in inventory construction: A cross-language study
Adeniyi An Outline of Ìgás̩i Tone System
Gros et al. Slovenian Text-to-Speech Synthesis for Speech User Interfaces.
JPH09292897A (en) Voice synthesizing device
Sokolovć Fundamental frequency change in two-syllable words with long rising accent
JPH0439698A (en) Speech synthesizer
Karjalainen Review of speech synthesis technology
JPH0313999A (en) Voice synthesizer
Paton Gateway to German diction: a guide for singers
Piccolo Where is the Hawaiian language headed? A phonetic study
JPH01112297A (en) Voice synthesizer
Christogiannis et al. Construction of the acoustic inventory for a greek text-to-speech concatenative synthesis system

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20070206

AC Divisional application: reference to earlier application

Ref document number: 1213705

Country of ref document: EP

Kind code of ref document: P

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

RIC1 Information provided on ipc code assigned before grant

Ipc: G10L 13/06 20060101ALI20080515BHEP

Ipc: G10L 13/08 20060101AFI20080515BHEP

17Q First examination report despatched

Effective date: 20080724

AKX Designation fees paid

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AC Divisional application: reference to earlier application

Ref document number: 1213705

Country of ref document: EP

Kind code of ref document: P

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: AT

Ref legal event code: REF

Ref document number: 602496

Country of ref document: AT

Kind code of ref document: T

Effective date: 20130415

REG Reference to a national code

Ref country code: DE

Ref legal event code: R096

Ref document number: 60147799

Country of ref document: DE

Effective date: 20130516

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130701

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

REG Reference to a national code

Ref country code: AT

Ref legal event code: MK05

Ref document number: 602496

Country of ref document: AT

Kind code of ref document: T

Effective date: 20130320

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130621

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

REG Reference to a national code

Ref country code: NL

Ref legal event code: VDEP

Effective date: 20130320

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130722

Ref country code: AT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

26N No opposition filed

Effective date: 20140102

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

REG Reference to a national code

Ref country code: DE

Ref legal event code: R097

Ref document number: 60147799

Country of ref document: DE

Effective date: 20140102

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20131203

Ref country code: MC

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

REG Reference to a national code

Ref country code: IE

Ref legal event code: MM4A

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20131231

Ref country code: IE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20131203

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20131231

REG Reference to a national code

Ref country code: DE

Ref legal event code: R082

Ref document number: 60147799

Country of ref document: DE

Representative=s name: GRUENECKER, KINKELDEY, STOCKMAIR & SCHWANHAEUS, DE

REG Reference to a national code

Ref country code: GB

Ref legal event code: 732E

Free format text: REGISTERED BETWEEN 20150108 AND 20150114

REG Reference to a national code

Ref country code: DE

Ref legal event code: R082

Ref document number: 60147799

Country of ref document: DE

Representative=s name: GRUENECKER PATENT- UND RECHTSANWAELTE PARTG MB, DE

Effective date: 20150126

Ref country code: DE

Ref legal event code: R081

Ref document number: 60147799

Country of ref document: DE

Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC, REDMOND, US

Free format text: FORMER OWNER: MICROSOFT CORP., REDMOND, WASH., US

Effective date: 20150126

Ref country code: DE

Ref legal event code: R081

Ref document number: 60147799

Country of ref document: DE

Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC, REDMOND, US

Free format text: FORMER OWNER: MICROSOFT CORP., REDMOND, WASH., US

Effective date: 20130320

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

REG Reference to a national code

Ref country code: FR

Ref legal event code: TP

Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC, US

Effective date: 20150724

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 15

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 16

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 17

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20171113

Year of fee payment: 17

Ref country code: DE

Payment date: 20171129

Year of fee payment: 17

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20171129

Year of fee payment: 17

REG Reference to a national code

Ref country code: DE

Ref legal event code: R119

Ref document number: 60147799

Country of ref document: DE

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20181203

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20181231

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20190702

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20181203