JP6520108B2 - 音声合成装置、方法、およびプログラム - Google Patents

音声合成装置、方法、およびプログラム Download PDF

Info

Publication number
JP6520108B2
JP6520108B2 JP2014259485A JP2014259485A JP6520108B2 JP 6520108 B2 JP6520108 B2 JP 6520108B2 JP 2014259485 A JP2014259485 A JP 2014259485A JP 2014259485 A JP2014259485 A JP 2014259485A JP 6520108 B2 JP6520108 B2 JP 6520108B2
Authority
JP
Japan
Prior art keywords
speech
segment
voice
pitch
power
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2014259485A
Other languages
English (en)
Japanese (ja)
Other versions
JP2016118722A (ja
JP2016118722A5 (enrdf_load_stackoverflow
Inventor
飛雄太 田中
飛雄太 田中
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Casio Computer Co Ltd
Original Assignee
Casio Computer Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Casio Computer Co Ltd filed Critical Casio Computer Co Ltd
Priority to JP2014259485A priority Critical patent/JP6520108B2/ja
Priority to US14/969,150 priority patent/US9805711B2/en
Priority to CN201510968697.6A priority patent/CN105719640B/zh
Publication of JP2016118722A publication Critical patent/JP2016118722A/ja
Publication of JP2016118722A5 publication Critical patent/JP2016118722A5/ja
Application granted granted Critical
Publication of JP6520108B2 publication Critical patent/JP6520108B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/06Elementary speech units used in speech synthesisers; Concatenation rules
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • G10L13/10Prosody rules derived from text; Stress or intonation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/033Voice editing, e.g. manipulating the voice of the synthesiser
    • G10L13/0335Pitch control
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Quality & Reliability (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Signal Processing (AREA)
JP2014259485A 2014-12-22 2014-12-22 音声合成装置、方法、およびプログラム Active JP6520108B2 (ja)

Priority Applications (3)

Application Number Priority Date Filing Date Title
JP2014259485A JP6520108B2 (ja) 2014-12-22 2014-12-22 音声合成装置、方法、およびプログラム
US14/969,150 US9805711B2 (en) 2014-12-22 2015-12-15 Sound synthesis device, sound synthesis method and storage medium
CN201510968697.6A CN105719640B (zh) 2014-12-22 2015-12-22 声音合成装置及声音合成方法

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2014259485A JP6520108B2 (ja) 2014-12-22 2014-12-22 音声合成装置、方法、およびプログラム

Publications (3)

Publication Number Publication Date
JP2016118722A JP2016118722A (ja) 2016-06-30
JP2016118722A5 JP2016118722A5 (enrdf_load_stackoverflow) 2018-02-08
JP6520108B2 true JP6520108B2 (ja) 2019-05-29

Family

ID=56130165

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2014259485A Active JP6520108B2 (ja) 2014-12-22 2014-12-22 音声合成装置、方法、およびプログラム

Country Status (3)

Country Link
US (1) US9805711B2 (enrdf_load_stackoverflow)
JP (1) JP6520108B2 (enrdf_load_stackoverflow)
CN (1) CN105719640B (enrdf_load_stackoverflow)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109952609B (zh) * 2016-11-07 2023-08-15 雅马哈株式会社 声音合成方法
KR102304701B1 (ko) * 2017-03-28 2021-09-24 삼성전자주식회사 사용자의 음성 입력에 대한 답변을 제공하는 방법 및 장치
KR102079453B1 (ko) * 2018-07-31 2020-02-19 전자부품연구원 비디오 특성에 부합하는 오디오 합성 방법
CN113160792B (zh) * 2021-01-15 2023-11-17 广东外语外贸大学 一种多语种的语音合成方法、装置和系统
CN113409798B (zh) * 2021-06-22 2024-07-05 科大讯飞股份有限公司 车内含噪语音数据生成方法、装置以及设备
CN115148186A (zh) * 2022-06-29 2022-10-04 北京有竹居网络技术有限公司 语音合成方法、装置、可读介质及电子设备

Family Cites Families (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4692941A (en) * 1984-04-10 1987-09-08 First Byte Real-time text-to-speech conversion system
US5636325A (en) * 1992-11-13 1997-06-03 International Business Machines Corporation Speech synthesis and analysis of dialects
US5642466A (en) * 1993-01-21 1997-06-24 Apple Computer, Inc. Intonation adjustment in text-to-speech systems
US5796916A (en) * 1993-01-21 1998-08-18 Apple Computer, Inc. Method and apparatus for prosody for synthetic speech prosody determination
CN1032391C (zh) * 1994-04-01 1996-07-24 清华大学 基于波形编辑的汉语文字-语音转换方法及系统
CN1118493A (zh) * 1994-08-01 1996-03-13 中国科学院声学研究所 基音同步波形叠加汉语文语转换系统
US5832434A (en) * 1995-05-26 1998-11-03 Apple Computer, Inc. Method and apparatus for automatic assignment of duration values for synthetic speech
JP3173382B2 (ja) * 1996-08-06 2001-06-04 ヤマハ株式会社 楽音制御装置、カラオケ装置、音楽情報供給及び再生方法、音楽情報供給装置並びに音楽再生装置
JPH10153998A (ja) * 1996-09-24 1998-06-09 Nippon Telegr & Teleph Corp <Ntt> 補助情報利用型音声合成方法、この方法を実施する手順を記録した記録媒体、およびこの方法を実施する装置
JP2000010581A (ja) * 1998-06-19 2000-01-14 Nec Corp 音声合成装置
JP3515039B2 (ja) * 2000-03-03 2004-04-05 沖電気工業株式会社 テキスト音声変換装置におけるピッチパタン制御方法
JP2003223181A (ja) * 2002-01-29 2003-08-08 Yamaha Corp 文字−音声変換装置およびそれを用いた携帯端末装置
US6988064B2 (en) * 2003-03-31 2006-01-17 Motorola, Inc. System and method for combined frequency-domain and time-domain pitch extraction for speech signals
JP4428093B2 (ja) * 2004-03-05 2010-03-10 ヤマハ株式会社 ピッチパターン生成装置、ピッチパターン生成方法及びピッチパターン生成プログラム
JP2006309162A (ja) * 2005-03-29 2006-11-09 Toshiba Corp ピッチパターン生成方法、ピッチパターン生成装置及びプログラム
JP4738057B2 (ja) * 2005-05-24 2011-08-03 株式会社東芝 ピッチパターン生成方法及びその装置
CN100347741C (zh) * 2005-09-02 2007-11-07 清华大学 移动语音合成方法
JP4241762B2 (ja) * 2006-05-18 2009-03-18 株式会社東芝 音声合成装置、その方法、及びプログラム
CN101000764B (zh) * 2006-12-18 2011-05-18 黑龙江大学 基于韵律结构的语音合成文本处理方法
JP5434587B2 (ja) * 2007-02-20 2014-03-05 日本電気株式会社 音声合成装置及び方法とプログラム
JP2009048003A (ja) * 2007-08-21 2009-03-05 Toshiba Corp 音声翻訳装置及び方法
CN101452699A (zh) * 2007-12-04 2009-06-10 株式会社东芝 韵律自适应及语音合成的方法和装置
US8244546B2 (en) * 2008-05-28 2012-08-14 National Institute Of Advanced Industrial Science And Technology Singing synthesis parameter data estimation system
JP2010039277A (ja) * 2008-08-06 2010-02-18 Mitsubishi Electric Corp 音声合成装置
JP2012220701A (ja) * 2011-04-08 2012-11-12 Hitachi Ltd 音声合成装置及びその合成音声修正方法
TWI573129B (zh) * 2013-02-05 2017-03-01 國立交通大學 編碼串流產生裝置、韻律訊息編碼裝置、韻律結構分析裝置與語音合成之裝置及方法
US9208775B2 (en) * 2013-02-21 2015-12-08 Qualcomm Incorporated Systems and methods for determining pitch pulse period signal boundaries

Also Published As

Publication number Publication date
JP2016118722A (ja) 2016-06-30
CN105719640B (zh) 2019-11-05
US20160180833A1 (en) 2016-06-23
CN105719640A (zh) 2016-06-29
US9805711B2 (en) 2017-10-31

Similar Documents

Publication Publication Date Title
JP6520108B2 (ja) 音声合成装置、方法、およびプログラム
JP4025355B2 (ja) 音声合成装置及び音声合成方法
JP6342428B2 (ja) 音声合成装置、音声合成方法およびプログラム
JP6561499B2 (ja) 音声合成装置および音声合成方法
JP2007249212A (ja) テキスト音声合成のための方法、コンピュータプログラム及びプロセッサ
GB2603776A (en) Methods and systems for modifying speech generated by a text-to-speech synthesiser
JP4738057B2 (ja) ピッチパターン生成方法及びその装置
JP6821970B2 (ja) 音声合成装置および音声合成方法
JP6013104B2 (ja) 音声合成方法、装置、及びプログラム
US8478595B2 (en) Fundamental frequency pattern generation apparatus and fundamental frequency pattern generation method
JP2001265375A (ja) 規則音声合成装置
JP2003108178A (ja) 音声合成装置及び音声合成用素片作成装置
JP6314828B2 (ja) 韻律モデル学習装置、韻律モデル学習方法、音声合成システム、および韻律モデル学習プログラム
JP2016065900A (ja) 音声合成装置、方法、およびプログラム
WO2008056604A1 (fr) Système de collecte de son, procédé de collecte de son et programme de traitement de collecte
Wen et al. Prosody Conversion for Emotional Mandarin Speech Synthesis Using the Tone Nucleus Model.
JPH09319391A (ja) 音声合成方法
JP6213217B2 (ja) 音声合成装置及び音声合成用コンピュータプログラム
JP2008191477A (ja) ハイブリッド型音声合成方法、及びその装置とそのプログラムと、その記憶媒体
Huang et al. Hierarchical prosodic pattern selection based on Fujisaki model for natural mandarin speech synthesis
JP3854593B2 (ja) 音声合成装置及びそのためのコスト計算装置、並びにコンピュータプログラム
JP6519096B2 (ja) 音声合成装置、方法、およびプログラム
JP6519097B2 (ja) 音声合成装置、方法、およびプログラム
JP6056190B2 (ja) 音声合成装置
WO2014017024A1 (ja) 音声合成装置、音声合成方法、及び音声合成プログラム

Legal Events

Date Code Title Description
A521 Written amendment

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20171219

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20171219

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20180926

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20181002

A521 Written amendment

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20181025

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20190402

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20190415

R150 Certificate of patent or registration of utility model

Ref document number: 6520108

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150