WO2003019528A1 - Intonation generation method, speech synthesis apparatus using the method and voice server - Google Patents

Intonation generation method, speech synthesis apparatus using the method and voice server

Publication number
WO2003019528A1
Authority
WO
WIPO (PCT)
Prior art keywords
intonation
speech
text
pattern
generating
Prior art date
Application number
PCT/JP2002/007882
Other languages
English (en)
French (fr)
Inventor
Takashi Saitoh
Masaharu Sakamoto
Original Assignee
International Business Machines Corporation
Priority date
2001-08-22
Filing date
2002-08-01
Publication date
2003-03-06
Application filed by International Business Machines Corporation
Priority to JP2003522906A (patent JP4056470B2)
Publication of WO2003019528A1
Priority to US10/784,044 (patent US7502739B2)

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/08 Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • G10L13/10 Prosody rules derived from text; Stress or intonation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04 Details of speech synthesis systems, e.g. synthesiser structure or memory management

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Computer And Data Communications (AREA)
  • Telephonic Communication Services (AREA)
PCT/JP2002/007882 2001-08-22 2002-08-01 Intonation generation method, speech synthesis apparatus using the method and voice server WO2003019528A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2003522906A JP4056470B2 (ja) 2001-08-22 2002-08-01 イントネーション生成方法、その方法を用いた音声合成装置及びボイスサーバ
US10/784,044 US7502739B2 (en) 2001-08-22 2005-01-24 Intonation generation method, speech synthesis apparatus using the method and voice server

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
JP2001-251903 2001-08-22
JP2001251903 2001-08-22
JP2002-72288 2002-03-15
JP2002072288 2002-03-15

Publications (1)

Publication Number Publication Date
WO2003019528A1 (fr) 2003-03-06

Family

ID=26620814

Family Applications (1)

Application Number Priority Date Filing Date Title
PCT/JP2002/007882 WO2003019528A1 (en) 2001-08-22 2002-08-01 Intonation generation method, speech synthesis apparatus using the method and voice server

Country Status (4)

Country Link
US (1) US7502739B2 (ja)
JP (1) JP4056470B2 (ja)
CN (1) CN1234109C (ja)
WO (1) WO2003019528A1 (ja)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006084967A (ja) * 2004-09-17 2006-03-30 Advanced Telecommunication Research Institute International 予測モデルの作成方法およびコンピュータプログラム
JP2006084666A (ja) * 2004-09-15 2006-03-30 Nippon Hoso Kyokai <Nhk> 韻律生成装置及び韻律生成プログラム
WO2006095925A1 (ja) * 2005-03-11 2006-09-14 Kabushiki Kaisha Kenwood 音声合成装置、音声合成方法及びプログラム
JP2007004011A (ja) * 2005-06-27 2007-01-11 Nippon Telegr & Teleph Corp <Ntt> 音声合成装置、音声合成方法、音声合成プログラムおよびその記録媒体
WO2009044596A1 (ja) * 2007-10-05 2009-04-09 Nec Corporation 音声合成装置、音声合成方法および音声合成プログラム
WO2016103652A1 (ja) * 2014-12-24 2016-06-30 日本電気株式会社 音声処理装置、音声処理方法、および記録媒体
JP6132077B1 (ja) * 2016-03-29 2017-05-24 三菱電機株式会社 韻律候補提示装置

Families Citing this family (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100547858B1 (ko) * 2003-07-07 2006-01-31 삼성전자주식회사 음성인식 기능을 이용하여 문자 입력이 가능한 이동통신단말기 및 방법
JP2006309162A (ja) * 2005-03-29 2006-11-09 Toshiba Corp ピッチパターン生成方法、ピッチパターン生成装置及びプログラム
JP4738057B2 (ja) * 2005-05-24 2011-08-03 株式会社東芝 ピッチパターン生成方法及びその装置
US8600753B1 (en) * 2005-12-30 2013-12-03 At&T Intellectual Property Ii, L.P. Method and apparatus for combining text to speech and recorded prompts
JP2007264503A (ja) * 2006-03-29 2007-10-11 Toshiba Corp 音声合成装置及びその方法
US8130679B2 (en) * 2006-05-25 2012-03-06 Microsoft Corporation Individual processing of VoIP contextual information
US20080154605A1 (en) * 2006-12-21 2008-06-26 International Business Machines Corporation Adaptive quality adjustments for speech synthesis in a real-time speech processing system based upon load
JP2008225254A (ja) * 2007-03-14 2008-09-25 Canon Inc 音声合成装置及び方法並びにプログラム
JP2009042509A (ja) * 2007-08-09 2009-02-26 Toshiba Corp アクセント情報抽出装置及びその方法
JP2009047957A (ja) * 2007-08-21 2009-03-05 Toshiba Corp ピッチパターン生成方法及びその装置
JP4455633B2 (ja) * 2007-09-10 2010-04-21 株式会社東芝 基本周波数パターン生成装置、基本周波数パターン生成方法及びプログラム
US9330720B2 (en) * 2008-01-03 2016-05-03 Apple Inc. Methods and apparatus for altering audio output signals
US8380503B2 (en) 2008-06-23 2013-02-19 John Nicholas and Kristin Gross Trust System and method for generating challenge items for CAPTCHAs
US9266023B2 (en) 2008-06-27 2016-02-23 John Nicholas and Kristin Gross Pictorial game system and method
US20100066742A1 (en) * 2008-09-18 2010-03-18 Microsoft Corporation Stylized prosody for speech synthesis-based applications
US9761219B2 (en) * 2009-04-21 2017-09-12 Creative Technology Ltd System and method for distributed text-to-speech synthesis and intelligibility
RU2421827C2 (ru) * 2009-08-07 2011-06-20 Общество с ограниченной ответственностью "Центр речевых технологий" Способ синтеза речи
JP2011180416A (ja) * 2010-03-02 2011-09-15 Denso Corp 音声合成装置、音声合成方法およびカーナビゲーションシステム
US8428759B2 (en) * 2010-03-26 2013-04-23 Google Inc. Predictive pre-recording of audio for voice input
CN102682767B (zh) * 2011-03-18 2015-04-08 株式公司Cs 一种应用于家庭网络的语音识别方法
RU2460154C1 (ru) * 2011-06-15 2012-08-27 Александр Юрьевич Бредихин Способ автоматизированной обработки текста и компьютерное устройство для реализации этого способа
US9240180B2 (en) 2011-12-01 2016-01-19 At&T Intellectual Property I, L.P. System and method for low-latency web-based text-to-speech without plugins
US10469623B2 (en) * 2012-01-26 2019-11-05 ZOOM International a.s. Phrase labeling within spoken audio recordings
US9390085B2 (en) 2012-03-23 2016-07-12 Tata Consultancy Services Limited Speech processing system and method for recognizing speech samples from a speaker with an oriyan accent when speaking english
JP2014038282A (ja) * 2012-08-20 2014-02-27 Toshiba Corp 韻律編集装置、方法およびプログラム
US9734819B2 (en) * 2013-02-21 2017-08-15 Google Technology Holdings LLC Recognizing accented speech
WO2014141054A1 (en) * 2013-03-11 2014-09-18 Video Dubber Ltd. Method, apparatus and system for regenerating voice intonation in automatically dubbed videos
JP5807921B2 (ja) * 2013-08-23 2015-11-10 国立研究開発法人情報通信研究機構 定量的f0パターン生成装置及び方法、f0パターン生成のためのモデル学習装置、並びにコンピュータプログラム
US9348812B2 (en) * 2014-03-14 2016-05-24 Splice Software Inc. Method, system and apparatus for assembling a recording plan and data driven dialogs for automated communications
US10803850B2 (en) * 2014-09-08 2020-10-13 Microsoft Technology Licensing, Llc Voice generation with predetermined emotion type
CN105788588B (zh) * 2014-12-23 2020-08-14 深圳市腾讯计算机系统有限公司 导航语音播报方法和装置
US11183170B2 (en) * 2016-08-17 2021-11-23 Sony Corporation Interaction control apparatus and method
CN112005298B (zh) * 2018-05-11 2023-11-07 谷歌有限责任公司 时钟式层次变分编码器
CN110619866A (zh) * 2018-06-19 2019-12-27 普天信息技术有限公司 语音合成方法及装置
US11227578B2 (en) * 2019-05-15 2022-01-18 Lg Electronics Inc. Speech synthesizer using artificial intelligence, method of operating speech synthesizer and computer-readable recording medium
CN112397050B (zh) * 2020-11-25 2023-07-07 北京百度网讯科技有限公司 韵律预测方法、训练方法、装置、电子设备和介质

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0419799A (ja) * 1990-05-15 1992-01-23 Matsushita Electric Works Ltd 音声合成装置
JPH04349499A (ja) * 1991-05-28 1992-12-03 Matsushita Electric Works Ltd 音声合成システム
JPH0990970A (ja) * 1995-09-20 1997-04-04 Atr Onsei Honyaku Tsushin Kenkyusho:Kk 音声合成装置
JPH1195783A (ja) * 1997-09-16 1999-04-09 Toshiba Corp 音声情報処理方法
JPH11265194A (ja) * 1998-03-17 1999-09-28 Toshiba Corp 音声情報処理方法
JP2000047681A (ja) * 1998-07-31 2000-02-18 Toshiba Corp 情報処理方法
JP2000148182A (ja) * 1998-11-03 2000-05-26 Internatl Business Mach Corp <Ibm> 電話メッセ―ジの転記のために使用される編集システム及び方法
JP2000250573A (ja) * 1999-03-01 2000-09-14 Nippon Telegr & Teleph Corp <Ntt> 音声素片データベース作成方法及びその装置並びにこの音声素片データベースを用いた音声合成方法及びその装置
JP2001034284A (ja) * 1999-07-23 2001-02-09 Toshiba Corp 音声合成方法及び装置、並びに文音声変換プログラムを記録した記録媒体

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2782147B2 (ja) * 1993-03-10 1998-07-30 日本電信電話株式会社 波形編集型音声合成装置
JP3093113B2 (ja) * 1994-09-21 2000-10-03 日本アイ・ビー・エム株式会社 音声合成方法及びシステム
JP3085631B2 (ja) * 1994-10-19 2000-09-11 日本アイ・ビー・エム株式会社 音声合成方法及びシステム
US5905972A (en) 1996-09-30 1999-05-18 Microsoft Corporation Prosodic databases holding fundamental frequency templates for use in speech synthesis
US6226614B1 (en) * 1997-05-21 2001-05-01 Nippon Telegraph And Telephone Corporation Method and apparatus for editing/creating synthetic speech message and recording medium with the method recorded thereon
JP3224760B2 (ja) * 1997-07-10 2001-11-05 インターナショナル・ビジネス・マシーンズ・コーポレーション 音声メールシステム、音声合成装置およびこれらの方法
US6260016B1 (en) * 1998-11-25 2001-07-10 Matsushita Electric Industrial Co., Ltd. Speech synthesis employing prosody templates
JP2000206982A (ja) * 1999-01-12 2000-07-28 Toshiba Corp 音声合成装置及び文音声変換プログラムを記録した機械読み取り可能な記録媒体
JP3420964B2 (ja) 1999-02-25 2003-06-30 日本電信電話株式会社 ピッチパタン生成方法、その装置及びプログラム記録媒体
JP2000305585A (ja) * 1999-04-23 2000-11-02 Oki Electric Ind Co Ltd 音声合成装置
JP3450237B2 (ja) * 1999-10-06 2003-09-22 株式会社アルカディア 音声合成装置および方法
US7035794B2 (en) * 2001-03-30 2006-04-25 Intel Corporation Compressing and using a concatenative speech database in text-to-speech systems
JP2003108178A (ja) * 2001-09-27 2003-04-11 Nec Corp 音声合成装置及び音声合成用素片作成装置
JP2006309162A (ja) * 2005-03-29 2006-11-09 Toshiba Corp ピッチパターン生成方法、ピッチパターン生成装置及びプログラム
JP4738057B2 (ja) * 2005-05-24 2011-08-03 株式会社東芝 ピッチパターン生成方法及びその装置

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2006084666A (ja) * 2004-09-15 2006-03-30 Nippon Hoso Kyokai <Nhk> 韻律生成装置及び韻律生成プログラム
JP4542400B2 (ja) * 2004-09-15 2010-09-15 日本放送協会 韻律生成装置及び韻律生成プログラム
JP2006084967A (ja) * 2004-09-17 2006-03-30 Advanced Telecommunication Research Institute International 予測モデルの作成方法およびコンピュータプログラム
JP4516863B2 (ja) * 2005-03-11 2010-08-04 株式会社ケンウッド 音声合成装置、音声合成方法及びプログラム
JP2006251538A (ja) * 2005-03-11 2006-09-21 Kenwood Corp 音声合成装置、音声合成方法及びプログラム
WO2006095925A1 (ja) * 2005-03-11 2006-09-14 Kabushiki Kaisha Kenwood 音声合成装置、音声合成方法及びプログラム
CN101171624B (zh) * 2005-03-11 2011-08-10 株式会社建伍 语音合成装置及语音合成方法
JP2007004011A (ja) * 2005-06-27 2007-01-11 Nippon Telegr & Teleph Corp <Ntt> 音声合成装置、音声合成方法、音声合成プログラムおよびその記録媒体
JP4533255B2 (ja) * 2005-06-27 2010-09-01 日本電信電話株式会社 音声合成装置、音声合成方法、音声合成プログラムおよびその記録媒体
WO2009044596A1 (ja) * 2007-10-05 2009-04-09 Nec Corporation 音声合成装置、音声合成方法および音声合成プログラム
WO2016103652A1 (ja) * 2014-12-24 2016-06-30 日本電気株式会社 音声処理装置、音声処理方法、および記録媒体
JP6132077B1 (ja) * 2016-03-29 2017-05-24 三菱電機株式会社 韻律候補提示装置
WO2017168544A1 (ja) * 2016-03-29 2017-10-05 三菱電機株式会社 韻律候補提示装置

Also Published As

Publication number Publication date
JPWO2003019528A1 (ja) 2004-12-16
JP4056470B2 (ja) 2008-03-05
US7502739B2 (en) 2009-03-10
US20050114137A1 (en) 2005-05-26
CN1234109C (zh) 2005-12-28
CN1545693A (zh) 2004-11-10

Similar Documents

Publication Publication Date Title
WO2003019528A1 (en) Intonation generation method, speech synthesis apparatus using the method and voice server
US9218803B2 (en) Method and system for enhancing a speech database
EP0140777B1 (en) Process for encoding speech and an apparatus for carrying out the process
US7565291B2 (en) Synthesis-based pre-selection of suitable units for concatenative speech
US7979274B2 (en) Method and system for preventing speech comprehension by interactive voice response systems
US6829581B2 (en) Method for prosody generation by unit selection from an imitation speech database
US7526430B2 (en) Speech synthesis apparatus
EP0710378A1 (en) A method and apparatus for converting text into audible signals using a neural network
WO2009023660A1 (en) Synthesis by generation and concatenation of multi-form segments
EP3065130B1 (en) Voice synthesis
WO1996023298A3 (en) System and method for generating and using context dependent sub-syllable models to recognize a tonal language
US7912718B1 (en) Method and system for enhancing a speech database
WO2004012183A3 (en) Concatenative text-to-speech conversion
US7280969B2 (en) Method and apparatus for producing natural sounding pitch contours in a speech synthesizer
Nose et al. Speaker-independent HMM-based voice conversion using adaptive quantization of the fundamental frequency
KR100373329B1 (ko) 음운환경과 묵음구간 길이를 이용한 텍스트/음성변환 장치 및그 방법
US8510112B1 (en) Method and system for enhancing a speech database
Kishore et al. Building Hindi and Telugu voices using festvox
Nitta et al. One-model speech recognition and synthesis based on articulatory movement HMMs.
Delmonte et al. A text-to-speech system for italian
EP1589524A1 (en) Method and device for speech synthesis
Law et al. Cantonese text-to-speech synthesis using sub-syllable units.
Thippareddy et al. Prosody transplantation using unit-selection: Principles and early results
Sun et al. Two-step generation of Mandarin F0 contours based on tone nucleus and superpositional models.
Paulo et al. Reducing the corpus-based TTS signal degradation due to speaker's word pronunciations.

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VN YU ZA ZM ZW

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BY BZ CA CH CN CO CR CU CZ DE DM DZ EC EE ES FI GB GD GE GH HR HU ID IL IN IS JP KE KG KP KR LC LK LR LS LT LU LV MA MD MG MN MW MX MZ NO NZ OM PH PL PT RU SD SE SG SI SK SL TJ TM TN TR TZ UA UG US UZ VN YU ZA ZM

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ UG ZM ZW AM AZ BY KG KZ RU TJ TM AT BE BG CH CY CZ DK EE ES FI FR GB GR IE IT LU MC PT SE SK TR BF BJ CF CG CI GA GN GQ GW ML MR NE SN TD TG

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2003522906

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 340/CHENP/2004

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 20028163397

Country of ref document: CN

122 Ep: pct application non-entry in european phase