EP1777697A3 - Method and apparatus for speech synthesis without prosody modification

Info

Publication number
EP1777697A3
EP1777697A3 EP07002565A EP07002565A EP1777697A3 EP 1777697 A3 EP1777697 A3 EP 1777697A3 EP 07002565 A EP07002565 A EP 07002565A EP 07002565 A EP07002565 A EP 07002565A EP 1777697 A3 EP1777697 A3 EP 1777697A3
Authority
EP
European Patent Office
Prior art keywords
speech
samples
present invention
produce
apparatus
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP07002565A
Other languages
German (de)
French (fr)
Other versions
EP1777697A2 (en)
EP1777697B1 (en)
Inventor
Min Chu
Hu Peng
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to US25116700P
Priority to US09/850,527 (US6978239B2)
Application filed by Microsoft Corp
Priority to EP20010128765 (EP1213705B1)
Publication of EP1777697A2
Publication of EP1777697A3
Application granted
Publication of EP1777697B1
Application status is Expired - Fee Related
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/06 Elementary speech units used in speech synthesisers; Concatenation rules
    • G10L13/07 Concatenation rules
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04 Details of speech synthesis systems, e.g. synthesiser structure or memory management

Abstract

A speech synthesizer is provided that concatenates stored samples of speech units without modifying the prosody of the samples. With a carefully designed training speech corpus, the present invention achieves a high level of naturalness in synthesized speech by storing samples based on the prosodic and phonetic context in which they occur. In particular, some embodiments of the present invention limit the training text to those sentences that will produce the most frequent sets of prosodic contexts for each speech unit. Further embodiments of the present invention also provide a multi-tier selection mechanism for selecting a set of samples that will produce the most natural-sounding speech.
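
The two ideas in the abstract, restricting the training text to sentences that cover the most frequent prosodic contexts of each speech unit and concatenating the stored samples unmodified, can be illustrated with a short Python sketch. This is not the patented method: the greedy covering strategy, the plain string context labels, the toy corpus, and the names `frequent_contexts`, `pick_sentences`, and `synthesize` are all assumptions made for the example, and the multi-tier selection mechanism is not modeled.

```python
from collections import Counter, defaultdict


def frequent_contexts(corpus, per_unit):
    """Return the `per_unit` most frequent contexts of each unit as (unit, context) pairs.

    `corpus` maps a sentence id to a list of (unit, context) pairs (hypothetical format).
    """
    counts = defaultdict(Counter)
    for pairs in corpus.values():
        for unit, context in pairs:
            counts[unit][context] += 1
    targets = set()
    for unit, context_counts in counts.items():
        for context, _ in context_counts.most_common(per_unit):
            targets.add((unit, context))
    return targets


def pick_sentences(corpus, targets):
    """Greedily pick sentences until every target pair is covered (assumed strategy)."""
    remaining = set(targets)
    chosen = []
    while remaining:
        # Ties are broken by sentence id so the result is deterministic.
        best = max(sorted(corpus), key=lambda sid: len(remaining & set(corpus[sid])))
        gained = remaining & set(corpus[best])
        if not gained:
            break  # no sentence covers what is left
        chosen.append(best)
        remaining -= gained
    return chosen


def synthesize(sequence, store):
    """Concatenate the stored sample for each (unit, context) pair without altering it."""
    waveform = []
    for key in sequence:
        waveform.extend(store[key])
    return waveform


# Toy data: units are syllables, contexts are coarse pitch labels (illustrative only).
corpus = {
    "s1": [("ma", "high"), ("ma", "high"), ("shi", "low")],
    "s2": [("ma", "high"), ("shi", "low"), ("shi", "low")],
    "s3": [("ma", "low"), ("shi", "high")],
}
targets = frequent_contexts(corpus, per_unit=1)
chosen = pick_sentences(corpus, targets)
store = {("ma", "high"): [1, 2], ("shi", "low"): [3]}
waveform = synthesize([("ma", "high"), ("shi", "low")], store)
```

With this toy corpus, a single sentence already covers the most frequent context of both units, so only that sentence would need to be recorded. A real corpus design would also have to account for phonetic context.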

Priority Applications (3)

Application Number Publication Priority Date Filing Date Title
US25116700P 2000-12-04 2000-12-04
US09/850,527 US6978239B2 (en) 2000-12-04 2001-05-07 Method and apparatus for speech synthesis without prosody modification
EP20010128765 EP1213705B1 (en) 2000-12-04 2001-12-03 Method and apparatus for speech synthesis

Related Parent Applications (2)

Application Number Title Priority Date Filing Date
EP01128765.3 Division 2001-12-03
EP20010128765 Division EP1213705B1 (en) 2000-12-04 2001-12-03 Method and apparatus for speech synthesis

Publications (3)

Publication Number Publication Date
EP1777697A2 (en) 2007-04-25
EP1777697A3 (en) 2008-06-18
EP1777697B1 (en) 2013-03-20

Family

ID=37831625

Family Applications (1)

Application Number Status Publication Priority Date Filing Date Title
EP07002565A Expired - Fee Related EP1777697B1 (en) 2000-12-04 2001-12-03 Method for speech synthesis without prosody modification

Country Status (1)

Country Link
EP (1) EP1777697B1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2559767A (en) * 2017-02-17 2018-08-22 Pastel Dreams Method and system for personalised voice synthesis
GB2559766A (en) * 2017-02-17 2018-08-22 Pastel Dreams Method and system for defining text content for speech segmentation

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6064960A (en) * 1997-12-18 2000-05-16 Apple Computer, Inc. Method and apparatus for improved duration modeling of phonemes
EP0984426A2 (en) * 1998-08-31 2000-03-08 Canon Kabushiki Kaisha Speech synthesizing apparatus and method, and storage medium therefor

Non-Patent Citations (7)

* Cited by examiner, † Cited by third party
Title
BIGORGNE D ET AL: "Multilingual PSOLA text-to-speech system", STATISTICAL SIGNAL AND ARRAY PROCESSING. MINNEAPOLIS, APR. 27 - 30, 1993, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), NEW YORK, IEEE, US, vol. 4, 27 April 1993 (1993-04-27), pages 187 - 190, XP010110425, ISBN: 0-7803-0946-4 *
BLACK A W ET AL: "OPTIMISING SELECTION OF UNITS FROM SPEECH DATABASES FOR CONCATENATIVE SYNTHESIS", 4TH EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY. EUROSPEECH '95. MADRID, SPAIN, SEPT. 18 - 21, 1995, EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY. (EUROSPEECH), MADRID : GRAFICAS BRENS, ES, vol. 1, conf. 4, 18 September 1995 (1995-09-18), pages 581 - 584, XP000854776 *
FU-CHIANG CHOU ET AL: "A Chinese text-to-speech system based on part-of-speech analysis, prosodic modeling and non-uniform units", ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 1997. ICASSP-97., 1997 IEEE INTERNATIONAL CONFERENCE ON MUNICH, GERMANY 21-24 APRIL 1997, LOS ALAMITOS, CA, USA,IEEE COMPUT. SOC, US, 21 April 1997 (1997-04-21), pages 923 - 926, XP010225946, ISBN: 0-8186-7919-0 *
HUANG X ET AL: "Recent improvements on Microsoft's trainable text-to-speech system-Whistler", ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 1997. ICASSP-97., 1997 IEEE INTERNATIONAL CONFERENCE ON MUNICH, GERMANY 21-24 APRIL 1997, LOS ALAMITOS, CA, USA,IEEE COMPUT. SOC, US, 21 April 1997 (1997-04-21), pages 959 - 962, XP010225955, ISBN: 0-8186-7919-0 *
HUNT A J ET AL: "Unit selection in a concatenative speech synthesis system using a large speech database", 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING - PROCEEDINGS. (ICASSP). ATLANTA, MAY 7 - 10, 1996, IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING - PROCEEDINGS. (ICASSP), NEW YORK, IEEE, US, vol. 1, conf. 21, 7 May 1996 (1996-05-07), pages 373 - 376, XP002133444, ISBN: 0-7803-3193-1 *
NAKAJIMA S ET AL: "Automatic generation of synthesis units based on context oriented clustering", ICASSP 88: 1988 INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (CAT. NO.88CH2561-9) - 11-14 APRIL 1988, 11 April 1988 (1988-04-11), NEW YORK, USA, pages 659 - 662, XP010073228 *
TIEN YING FUNG ET AL: "Concatenating syllables for response generation in spoken language applications", IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2000; PROCEEDINGS, vol. 2, 5 June 2000 (2000-06-05), Istanbul, Turkey, 5-9 June 2000, pages 933 - 936, XP010504877 *

Also Published As

Publication number Publication date
EP1777697B1 (en) 2013-03-20
EP1777697A2 (en) 2007-04-25

Similar Documents

Publication Publication Date Title
Delattre Comparing the prosodic features of English, German, Spanish and French
Peterson et al. Segmentation techniques in speech synthesis
JP4067762B2 (en) Singing synthesis device
DE69821673T2 (en) Method and apparatus for editing synthetic voice messages, and storage means with the method
Klatt Review of text‐to‐speech conversion for English
CN1108603C (en) Voice synthesis method and device
US6865533B2 (en) Text to speech
KR100769033B1 (en) Method for synthesizing speech
CA2351842C (en) Synthesis-based pre-selection of suitable units for concatenative speech
US20080288257A1 (en) Application of emotion-based intonation and prosody to speech in text-to-speech systems
Gårding A generative model of intonation
Carlson et al. Experiments with voice modelling in speech synthesis
Keating et al. Domain-initial articulatory strengthening in four languages
US4692941A (en) Real-time text-to-speech conversion system
US6810378B2 (en) Method and apparatus for controlling a speech synthesis system to provide multiple styles of speech
US6470316B1 (en) Speech synthesis apparatus having prosody generator with user-set speech-rate- or adjusted phoneme-duration-dependent selective vowel devoicing
US5704007A (en) Utilization of multiple voice sources in a speech synthesizer
DE19610019C2 (en) Digital speech synthesis method
Montero et al. Analysis and modelling of emotional speech in Spanish
US4398059A (en) Speech producing system
Machač et al. Principles of phonetic segmentation
US5930755A (en) Utilization of a recorded sound sample as a voice source in a speech synthesizer
Lindau Testing a model of intonation in a tone language
Möhler et al. Parametric modeling of intonation using vector quantization
US20090048843A1 (en) System-effected text annotation for expressive prosody in speech synthesis and recognition

Legal Events

Date Code Title Description
AC Divisional application: reference to earlier application

Ref document number: 1213705

Country of ref document: EP

Kind code of ref document: P

17P Request for examination filed

Effective date: 20070206

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

RIC1 Information provided on ipc code assigned before grant

Ipc: G10L 13/06 20060101ALI20080515BHEP

Ipc: G10L 13/08 20060101AFI20080515BHEP

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

17Q First examination report despatched

Effective date: 20080724

AKX Designation fees paid

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

AC Divisional application: reference to earlier application

Ref document number: 1213705

Country of ref document: EP

Kind code of ref document: P

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: AT

Ref legal event code: REF

Ref document number: 602496

Country of ref document: AT

Kind code of ref document: T

Effective date: 20130415

REG Reference to a national code

Ref country code: DE

Ref legal event code: R096

Ref document number: 60147799

Country of ref document: DE

Effective date: 20130516

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130701

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

REG Reference to a national code

Ref country code: AT

Ref legal event code: MK05

Ref document number: 602496

Country of ref document: AT

Kind code of ref document: T

Effective date: 20130320

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130621

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

REG Reference to a national code

Ref country code: NL

Ref legal event code: VDEP

Effective date: 20130320

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130722

Ref country code: AT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

26N No opposition filed

Effective date: 20140102

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

REG Reference to a national code

Ref country code: DE

Ref legal event code: R097

Ref document number: 60147799

Country of ref document: DE

Effective date: 20140102

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20131203

Ref country code: MC

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

REG Reference to a national code

Ref country code: IE

Ref legal event code: MM4A

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20131231

Ref country code: IE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20131203

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20131231

REG Reference to a national code

Ref country code: DE

Ref legal event code: R082

Ref document number: 60147799

Country of ref document: DE

Representative's name: GRUENECKER, KINKELDEY, STOCKMAIR & SCHWANHAEUS, DE

REG Reference to a national code

Ref country code: GB

Ref legal event code: 732E

Free format text: REGISTERED BETWEEN 20150108 AND 20150114

REG Reference to a national code

Ref country code: DE

Ref legal event code: R082

Ref document number: 60147799

Country of ref document: DE

Representative's name: GRUENECKER PATENT- UND RECHTSANWAELTE PARTG MB, DE

Effective date: 20150126

Ref country code: DE

Ref legal event code: R081

Ref document number: 60147799

Country of ref document: DE

Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC, REDMOND, US

Free format text: FORMER OWNER: MICROSOFT CORP., REDMOND, WASH., US

Effective date: 20150126

Ref country code: DE

Ref legal event code: R081

Ref document number: 60147799

Country of ref document: DE

Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC, REDMOND, US

Free format text: FORMER OWNER: MICROSOFT CORP., REDMOND, WASH., US

Effective date: 20130320

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20130320

REG Reference to a national code

Ref country code: FR

Ref legal event code: TP

Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC, US

Effective date: 20150724

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 15

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 16

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 17

PGFP Annual fee paid to national office [announced from national office to epo]

Ref country code: FR

Payment date: 20171113

Year of fee payment: 17

Ref country code: DE

Payment date: 20171129

Year of fee payment: 17

PGFP Annual fee paid to national office [announced from national office to epo]

Ref country code: GB

Payment date: 20171129

Year of fee payment: 17

REG Reference to a national code

Ref country code: DE

Ref legal event code: R119

Ref document number: 60147799

Country of ref document: DE

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20181203

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20181231

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20190702

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20181203