WO2001031434A3 - Verfahren zum bestimmen des zeitlichen verlaufs einer grundfrequenz einer zu synthetisierenden sprachausgabe - Google Patents

Verfahren zum bestimmen des zeitlichen verlaufs einer grundfrequenz einer zu synthetisierenden sprachausgabe Download PDF

Info

Publication number
WO2001031434A3
WO2001031434A3 PCT/DE2000/003753 DE0003753W WO0131434A3 WO 2001031434 A3 WO2001031434 A3 WO 2001031434A3 DE 0003753 W DE0003753 W DE 0003753W WO 0131434 A3 WO0131434 A3 WO 0131434A3
Authority
WO
WIPO (PCT)
Prior art keywords
fundamental frequency
synthesised
audio
detecting
response unit
Prior art date
Application number
PCT/DE2000/003753
Other languages
English (en)
French (fr)
Other versions
WO2001031434A2 (de
Inventor
Martin Holzapfel
Caglayan Erdem
Original Assignee
Siemens Ag
Martin Holzapfel
Caglayan Erdem
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Siemens Ag, Martin Holzapfel, Caglayan Erdem filed Critical Siemens Ag
Priority to US10/111,695 priority Critical patent/US7219061B1/en
Priority to DE50008976T priority patent/DE50008976D1/de
Priority to EP00984858A priority patent/EP1224531B1/de
Priority to JP2001533505A priority patent/JP4005360B2/ja
Publication of WO2001031434A2 publication Critical patent/WO2001031434A2/de
Publication of WO2001031434A3 publication Critical patent/WO2001031434A3/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90Pitch determination of speech signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Die Erfindung betrifft ein Verfahren zum Bestimmen des zeitlichen Verlaufs einer Grundfrequenz einer zu synthetisierenden Sprachausgabe. Die Erfindung zeichnet sich dadurch aus, daß Vorgabemakrosegmente der Grundfrequenz mittels eines neuronalen Netzwerkes bestimmt werden, und diese Vorgabemakrosegmente mittels in einer Datenbasis gespeicherten Grundfrequenzsequenzen nachgebildet werden. Durch das erfindungsgemäße Verfahren wird die Grundfrequenz auf Grundlage eines größeren Textabschnittes, der mittels des neuronalen Netzwerkes analysiert wird, erzeugt, wobei aus der Datenbasis Mikrostrukturen in der Grundfrequenz aufgenommen werden. Die derart gebildete Grundfrequenz ist somit bezüglich ihrer Makro- als auch ihrer Mikrostruktur optimiert. Hierdurch wird ein äußerst natürlicher Klang erzielt.
PCT/DE2000/003753 1999-10-28 2000-10-24 Verfahren zum bestimmen des zeitlichen verlaufs einer grundfrequenz einer zu synthetisierenden sprachausgabe WO2001031434A2 (de)

Priority Applications (4)

Application Number Priority Date Filing Date Title
US10/111,695 US7219061B1 (en) 1999-10-28 2000-10-24 Method for detecting the time sequences of a fundamental frequency of an audio response unit to be synthesized
DE50008976T DE50008976D1 (de) 1999-10-28 2000-10-24 Verfahren zum bestimmen des zeitlichen verlaufs einer grundfrequenz einer zu synthetisierenden sprachausgabe
EP00984858A EP1224531B1 (de) 1999-10-28 2000-10-24 Verfahren zum bestimmen des zeitlichen verlaufs einer grundfrequenz einer zu synthetisierenden sprachausgabe
JP2001533505A JP4005360B2 (ja) 1999-10-28 2000-10-24 合成すべき音声応答の基本周波数の時間特性を定めるための方法

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
DE19952051.8 1999-10-28
DE19952051 1999-10-28

Publications (2)

Publication Number Publication Date
WO2001031434A2 WO2001031434A2 (de) 2001-05-03
WO2001031434A3 true WO2001031434A3 (de) 2002-02-14

Family

ID=7927243

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/DE2000/003753 WO2001031434A2 (de) 1999-10-28 2000-10-24 Verfahren zum bestimmen des zeitlichen verlaufs einer grundfrequenz einer zu synthetisierenden sprachausgabe

Country Status (5)

Country Link
US (1) US7219061B1 (de)
EP (1) EP1224531B1 (de)
JP (1) JP4005360B2 (de)
DE (1) DE50008976D1 (de)
WO (1) WO2001031434A2 (de)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AT6920U1 (de) 2002-02-14 2004-05-25 Sail Labs Technology Ag Verfahren zur erzeugung natürlicher sprache in computer-dialogsystemen
DE10230884B4 (de) * 2002-07-09 2006-01-12 Siemens Ag Vereinigung von Prosodiegenerierung und Bausteinauswahl bei der Sprachsynthese
JP4264030B2 (ja) * 2003-06-04 2009-05-13 株式会社ケンウッド 音声データ選択装置、音声データ選択方法及びプログラム
JP2005018036A (ja) * 2003-06-05 2005-01-20 Kenwood Corp 音声合成装置、音声合成方法及びプログラム
WO2005119650A1 (ja) * 2004-06-04 2005-12-15 Matsushita Electric Industrial Co., Ltd. 音声合成装置
US10453479B2 (en) * 2011-09-23 2019-10-22 Lessac Technologies, Inc. Methods for aligning expressive speech utterances with text and systems therefor
US10109014B1 (en) 2013-03-15 2018-10-23 Allstate Insurance Company Pre-calculated insurance premiums with wildcarding
CN105357613B (zh) * 2015-11-03 2018-06-29 广东欧珀移动通信有限公司 音频输出设备播放参数的调整方法及装置
CN106653056B (zh) * 2016-11-16 2020-04-24 中国科学院自动化研究所 基于lstm循环神经网络的基频提取模型及训练方法
CN108630190B (zh) * 2018-05-18 2019-12-10 百度在线网络技术(北京)有限公司 用于生成语音合成模型的方法和装置

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2325599A (en) * 1997-05-22 1998-11-25 Motorola Inc Speech synthesis with prosody enhancement
US5913194A (en) * 1997-07-14 1999-06-15 Motorola, Inc. Method, device and system for using statistical information to reduce computation and memory requirements of a neural network based speech synthesis system

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU675389B2 (en) 1994-04-28 1997-01-30 Motorola, Inc. A method and apparatus for converting text into audible signals using a neural network
US5787387A (en) * 1994-07-11 1998-07-28 Voxware, Inc. Harmonic adaptive speech coding method and system
JPH10153998A (ja) * 1996-09-24 1998-06-09 Nippon Telegr & Teleph Corp <Ntt> 補助情報利用型音声合成方法、この方法を実施する手順を記録した記録媒体、およびこの方法を実施する装置
US6064960A (en) * 1997-12-18 2000-05-16 Apple Computer, Inc. Method and apparatus for improved duration modeling of phonemes
US6078885A (en) * 1998-05-08 2000-06-20 At&T Corp Verbal, fully automatic dictionary updates by end-users of speech synthesis and recognition systems
US6665641B1 (en) * 1998-11-13 2003-12-16 Scansoft, Inc. Speech synthesis using concatenation of speech waveforms
US7222075B2 (en) * 1999-08-31 2007-05-22 Accenture Llp Detecting emotions using voice signal analysis

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2325599A (en) * 1997-05-22 1998-11-25 Motorola Inc Speech synthesis with prosody enhancement
US5913194A (en) * 1997-07-14 1999-06-15 Motorola, Inc. Method, device and system for using statistical information to reduce computation and memory requirements of a neural network based speech synthesis system

Also Published As

Publication number Publication date
US7219061B1 (en) 2007-05-15
JP2003513311A (ja) 2003-04-08
EP1224531B1 (de) 2004-12-15
WO2001031434A2 (de) 2001-05-03
EP1224531A2 (de) 2002-07-24
JP4005360B2 (ja) 2007-11-07
DE50008976D1 (de) 2005-01-20

Similar Documents

Publication Publication Date Title
WO2002041102A3 (en) Processing web editor for data processing in a digital oscilloscope or similar instrument
WO2001031627A3 (en) Pattern matching method and apparatus
AU2001247877A1 (en) Method and apparatus for input of alphanumeric text data from twelve key keyboards
DE50211346D1 (de) Verfahren zum Betrieb eines Hörgerätes sowie Hörgerät
WO2001031434A3 (de) Verfahren zum bestimmen des zeitlichen verlaufs einer grundfrequenz einer zu synthetisierenden sprachausgabe
WO2004061750A3 (en) Method and apparatus for displaying speech recognition results
AU2003222001A8 (en) Method and system for generating a likelihood of cardiovascular disease from analyzing cardiovascular sound signals.
WO2004075078A3 (en) Method and apparatus for fundamental operations on token sequences: computing similarity, extracting term values, and searching efficiently
AU2001293705A1 (en) Method and input device for inputting characters from a character set, especially one-handedly
WO2003015076A1 (fr) Dispositif et procede d&#39;evaluation des sentiments d&#39;un chien a partir d&#39;une analyse caracterielle des cris de l&#39;animal
AU2001245695A1 (en) Method and apparatus for producing an oil, water, and/or gas wel l
TW200419910A (en) Method and device for generating a clock signal having predetermined clock signal properties
EP1168306A3 (de) Verfahren und Vorrichtung zur Verbesserung von der Verständlichkeit eines digital komprimierten Sprachsignals
WO2008016595A3 (en) Method of and system for browsing of music
WO2005069679A3 (en) Audio signal enhancement
WO2005034398A3 (en) Data hiding via phase manipulation of audio signals
WO2005022318A3 (en) A method and system for generating acoustic fingerprints
MY134526A (en) Method for selecting an operating mode based on a detected synchronization pattern
GB0229940D0 (en) Audio signal analysing method and apparatus
EP1569199A4 (de) Datenerzeugungseinrichtung und verfahren für musikkompositionen
WO2002021458A3 (en) Document sensing apparatus and method
MY139788A (en) Method for performing a domain transformation of a digital signal from the time domain into the frequency domain and vice versa
TW200516554A (en) Method and device for information recovery
DE60113034T2 (de) Sinusoidale kodierung
NO20033892L (no) Fremgangsmate og anordning for ekspandering av et parti av et ror.

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): JP US

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: A3

Designated state(s): JP US

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE

WWE Wipo information: entry into national phase

Ref document number: 2000984858

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 10111695

Country of ref document: US

ENP Entry into the national phase

Ref country code: JP

Ref document number: 2001 533505

Kind code of ref document: A

Format of ref document f/p: F

WWP Wipo information: published in national office

Ref document number: 2000984858

Country of ref document: EP

WWG Wipo information: grant in national office

Ref document number: 2000984858

Country of ref document: EP