EP1058235A3 - Reproduction method for voice controlled systems with text based speech synthesis - Google Patents

Reproduction method for voice controlled systems with text based speech synthesis Download PDF

Info

Publication number
EP1058235A3
EP1058235A3 EP00108486A EP00108486A EP1058235A3 EP 1058235 A3 EP1058235 A3 EP 1058235A3 EP 00108486 A EP00108486 A EP 00108486A EP 00108486 A EP00108486 A EP 00108486A EP 1058235 A3 EP1058235 A3 EP 1058235A3
Authority
EP
European Patent Office
Prior art keywords
characters
train
converted
reproduction
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP00108486A
Other languages
German (de)
French (fr)
Other versions
EP1058235A2 (en
EP1058235B1 (en
Inventor
Peter Buth
Frank Dufhues
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nokia Oyj
Original Assignee
Nokia Oyj
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nokia Oyj filed Critical Nokia Oyj
Publication of EP1058235A2 publication Critical patent/EP1058235A2/en
Publication of EP1058235A3 publication Critical patent/EP1058235A3/en
Application granted granted Critical
Publication of EP1058235B1 publication Critical patent/EP1058235B1/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management

Abstract

The invention specifies a simple reproduction method with improved pronunciation for voice-controlled systems with text-based speech synthesis even when the stored train of characters to be synthesized does not follow the general rules of speech reproduction. According to the invention, the method of "copying" the original spoken input text into the otherwise synthesized reproduction text, which is the current state of the art, is avoided, which will significantly increase the acceptance of the user of the voice-controlled system due to the process invented. More specifically, when there is actual spoken speech input that corresponds to a stored train of characters, the converted train of characters is compared to the speech input before reproduction of the train of characters described phonetically according to general rules and converted to a purely synthetic form. When the converted train of characters is found to deviate from the speech input by a value above a threshold value, at least one variation of the converted train of characters is created. This variation is then output instead of the converted train of characters as long as this variation deviates from the speech input by a value below the threshold value.
EP00108486A 1999-05-05 2000-04-19 Reproduction method for voice controlled systems with text based speech synthesis Expired - Lifetime EP1058235B1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
DE19920501A DE19920501A1 (en) 1999-05-05 1999-05-05 Speech reproduction method for voice-controlled system with text-based speech synthesis has entered speech input compared with synthetic speech version of stored character chain for updating latter
DE19920501 1999-05-05

Publications (3)

Publication Number Publication Date
EP1058235A2 EP1058235A2 (en) 2000-12-06
EP1058235A3 true EP1058235A3 (en) 2003-02-05
EP1058235B1 EP1058235B1 (en) 2003-11-05

Family

ID=7906935

Family Applications (1)

Application Number Title Priority Date Filing Date
EP00108486A Expired - Lifetime EP1058235B1 (en) 1999-05-05 2000-04-19 Reproduction method for voice controlled systems with text based speech synthesis

Country Status (5)

Country Link
US (1) US6546369B1 (en)
EP (1) EP1058235B1 (en)
JP (1) JP4602511B2 (en)
AT (1) ATE253762T1 (en)
DE (2) DE19920501A1 (en)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4759827B2 (en) * 2001-03-28 2011-08-31 日本電気株式会社 Voice segmentation apparatus and method, and control program therefor
US7107215B2 (en) * 2001-04-16 2006-09-12 Sakhr Software Company Determining a compact model to transcribe the arabic language acoustically in a well defined basic phonetic study
AT6920U1 (en) 2002-02-14 2004-05-25 Sail Labs Technology Ag METHOD FOR GENERATING NATURAL LANGUAGE IN COMPUTER DIALOG SYSTEMS
DE10253786B4 (en) * 2002-11-19 2009-08-06 Anwaltssozietät BOEHMERT & BOEHMERT GbR (vertretungsberechtigter Gesellschafter: Dr. Carl-Richard Haarmann, 28209 Bremen) Method for the computer-aided determination of a similarity of an electronically registered first identifier to at least one electronically detected second identifier as well as apparatus and computer program for carrying out the same
DE60314844T2 (en) * 2003-05-07 2008-03-13 Harman Becker Automotive Systems Gmbh Method and apparatus for voice output, data carrier with voice data
CN1879146B (en) * 2003-11-05 2011-06-08 皇家飞利浦电子股份有限公司 Error detection for speech to text transcription systems
JP2006047866A (en) * 2004-08-06 2006-02-16 Canon Inc Electronic dictionary device and control method thereof
US20060136195A1 (en) * 2004-12-22 2006-06-22 International Business Machines Corporation Text grouping for disambiguation in a speech application
JP4385949B2 (en) * 2005-01-11 2009-12-16 トヨタ自動車株式会社 In-vehicle chat system
US20070016421A1 (en) * 2005-07-12 2007-01-18 Nokia Corporation Correcting a pronunciation of a synthetically generated speech object
US20070129945A1 (en) * 2005-12-06 2007-06-07 Ma Changxue C Voice quality control for high quality speech reconstruction
US8504365B2 (en) * 2008-04-11 2013-08-06 At&T Intellectual Property I, L.P. System and method for detecting synthetic speaker verification
US8494854B2 (en) * 2008-06-23 2013-07-23 John Nicholas and Kristin Gross CAPTCHA using challenges optimized for distinguishing between humans and machines
US8752141B2 (en) * 2008-06-27 2014-06-10 John Nicholas Methods for presenting and determining the efficacy of progressive pictorial and motion-based CAPTCHAs
US9564120B2 (en) * 2010-05-14 2017-02-07 General Motors Llc Speech adaptation in speech synthesis
KR20170044849A (en) * 2015-10-16 2017-04-26 삼성전자주식회사 Electronic device and method for transforming text to speech utilizing common acoustic data set for multi-lingual/speaker

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE2435654C2 (en) * 1974-07-24 1983-11-17 Gretag AG, 8105 Regensdorf, Zürich Method and device for the analysis and synthesis of human speech
NL8302985A (en) * 1983-08-26 1985-03-18 Philips Nv MULTIPULSE EXCITATION LINEAR PREDICTIVE VOICE CODER.
US5029200A (en) * 1989-05-02 1991-07-02 At&T Bell Laboratories Voice message system using synthetic speech
US5293449A (en) * 1990-11-23 1994-03-08 Comsat Corporation Analysis-by-synthesis 2,4 kbps linear predictive speech codec
GB9223066D0 (en) * 1992-11-04 1992-12-16 Secr Defence Children's speech training aid
FI98163C (en) * 1994-02-08 1997-04-25 Nokia Mobile Phones Ltd Coding system for parametric speech coding
US6005549A (en) * 1995-07-24 1999-12-21 Forest; Donald K. User interface method and apparatus
US5913193A (en) * 1996-04-30 1999-06-15 Microsoft Corporation Method and system of runtime acoustic unit selection for speech synthesis
JPH10153998A (en) * 1996-09-24 1998-06-09 Nippon Telegr & Teleph Corp <Ntt> Auxiliary information utilizing type voice synthesizing method, recording medium recording procedure performing this method, and device performing this method
US6163769A (en) * 1997-10-02 2000-12-19 Microsoft Corporation Text-to-speech using clustered context-dependent phoneme-based units
US6081780A (en) * 1998-04-28 2000-06-27 International Business Machines Corporation TTS and prosody based authoring system
US6173263B1 (en) * 1998-08-31 2001-01-09 At&T Corp. Method and system for performing concatenative speech synthesis using half-phonemes
US6266638B1 (en) * 1999-03-30 2001-07-24 At&T Corp Voice quality compensation system for speech synthesis based on unit-selection speech database

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
DESHMUKH N ET AL: "Automated generation of N-best pronunciations of proper nouns", 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING - PROCEEDINGS. (ICASSP). ATLANTA, MAY 7 - 10, 1996, IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING - PROCEEDINGS. (ICASSP), NEW YORK, IEEE, US, vol. 1 CONF. 21, 7 May 1996 (1996-05-07), pages 283 - 286, XP002164538, ISBN: 0-7803-3193-1 *

Also Published As

Publication number Publication date
DE19920501A1 (en) 2000-11-09
US6546369B1 (en) 2003-04-08
JP4602511B2 (en) 2010-12-22
EP1058235A2 (en) 2000-12-06
JP2000347681A (en) 2000-12-15
EP1058235B1 (en) 2003-11-05
DE50004296D1 (en) 2003-12-11
ATE253762T1 (en) 2003-11-15

Similar Documents

Publication Publication Date Title
EP1058235A3 (en) Reproduction method for voice controlled systems with text based speech synthesis
US5546500A (en) Arrangement for increasing the comprehension of speech when translating speech from a first language to a second language
AU2002211438A1 (en) Language independent voice-based search system
EP0984428A3 (en) Method and system for automatically determining phonetic transciptions associated with spelled words
MX9703138A (en) Speech recognition.
CA2363561A1 (en) Automated transcription system and method using two speech converting instances and computer-assisted correction
ATE325413T1 (en) METHOD AND DEVICE FOR CONVERTING SPOKEN TEXTS INTO WRITTEN AND CORRECTING THE RECOGNIZED TEXTS
EP0749109A3 (en) Speech recognition for tonal languages
MX9505299A (en) Systems, methods and articles of manufacture for performing high resolution n-best string hypothesization.
MY117070A (en) Voice control of a user interface to service applications
AU2003299312A1 (en) Text-to-speech method and system, computer program product therefor
WO2004100638A3 (en) Source-dependent text-to-speech system
DE69022237D1 (en) Speech synthesis device based on the phonetic hidden Markov model.
DE69623364T2 (en) Device for recognizing continuously spoken language
EP1045372A3 (en) Speech sound communication system
ATE363120T1 (en) AUDIO DIALOGUE SYSTEM AND VOICE-CONTROLLED BROWSING PROCESS
AU2002233238A1 (en) Mobile terminal controllable by spoken utterances
Hackman Patterns in purported speech acts
JPH06342297A (en) Speech synthesizing device
JPH07200554A (en) Sentence read-aloud device
WO2006034152A3 (en) Discriminative training of document transcription system
JP2536896B2 (en) Speech synthesizer
JP3709436B2 (en) Fine segment acoustic model creation device for speech recognition
JPH02247696A (en) Text voice synthesizer
KR20140047722A (en) Method and device for slowing a digital audio signal

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: NOKIA CORPORATION

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Extension state: AL LT LV MK RO SI

17P Request for examination filed

Effective date: 20030120

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AKX Designation fees paid

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20031105

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT;WARNING: LAPSES OF ITALIAN PATENTS WITH EFFECTIVE DATE BEFORE 2007 MAY HAVE OCCURRED AT ANY TIME BEFORE 2007. THE CORRECT EFFECTIVE DATE MAY BE DIFFERENT FROM THE ONE RECORDED.

Effective date: 20031105

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20031105

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20031105

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

Free format text: NOT ENGLISH

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REF Corresponds to:

Ref document number: 50004296

Country of ref document: DE

Date of ref document: 20031211

Kind code of ref document: P

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

Free format text: GERMAN

GBT Gb: translation of ep patent filed (gb section 77(6)(a)/1977)

Effective date: 20031224

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040205

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040205

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040205

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040216

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040419

Ref country code: AT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040419

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040430

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040430

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040430

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040430

REG Reference to a national code

Ref country code: IE

Ref legal event code: FD4D

ET Fr: translation filed
PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed

Effective date: 20040806

BERE Be: lapsed

Owner name: *NOKIA CORP.

Effective date: 20040430

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040405

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20120425

Year of fee payment: 13

Ref country code: NL

Payment date: 20120413

Year of fee payment: 13

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20120504

Year of fee payment: 13

Ref country code: GB

Payment date: 20120418

Year of fee payment: 13

REG Reference to a national code

Ref country code: NL

Ref legal event code: V1

Effective date: 20131101

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20130419

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20131101

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20130419

REG Reference to a national code

Ref country code: FR

Ref legal event code: ST

Effective date: 20131231

REG Reference to a national code

Ref country code: DE

Ref legal event code: R119

Ref document number: 50004296

Country of ref document: DE

Effective date: 20131101

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20130430

Ref country code: NL

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20131101