JP6520108B2 - 音声合成装置、方法、およびプログラム - Google Patents
音声合成装置、方法、およびプログラム Download PDFInfo
- Publication number
- JP6520108B2 JP6520108B2 JP2014259485A JP2014259485A JP6520108B2 JP 6520108 B2 JP6520108 B2 JP 6520108B2 JP 2014259485 A JP2014259485 A JP 2014259485A JP 2014259485 A JP2014259485 A JP 2014259485A JP 6520108 B2 JP6520108 B2 JP 6520108B2
- Authority
- JP
- Japan
- Prior art keywords
- speech
- segment
- voice
- pitch
- power
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title description 28
- 230000015572 biosynthetic process Effects 0.000 claims description 34
- 238000003786 synthesis reaction Methods 0.000 claims description 32
- 230000006978 adaptation Effects 0.000 claims description 30
- 239000000284 extract Substances 0.000 claims description 15
- 238000000605 extraction Methods 0.000 claims description 13
- 238000009499 grossing Methods 0.000 claims description 7
- 238000004364 calculation method Methods 0.000 claims description 5
- 238000001308 synthesis method Methods 0.000 claims description 4
- 239000011295 pitch Substances 0.000 description 86
- 238000012545 processing Methods 0.000 description 29
- 238000004458 analytical method Methods 0.000 description 16
- 238000004891 communication Methods 0.000 description 7
- 238000010586 diagram Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 230000003044 adaptive effect Effects 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 238000011156 evaluation Methods 0.000 description 4
- 230000000877 morphologic effect Effects 0.000 description 4
- 230000008451 emotion Effects 0.000 description 3
- 230000006866 deterioration Effects 0.000 description 2
- 238000012935 Averaging Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 239000011306 natural pitch Substances 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000007639 printing Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 238000013179 statistical model Methods 0.000 description 1
- 210000001260 vocal cord Anatomy 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
- G10L13/10—Prosody rules derived from text; Stress or intonation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
- G10L13/0335—Pitch control
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Quality & Reliability (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Signal Processing (AREA)
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2014259485A JP6520108B2 (ja) | 2014-12-22 | 2014-12-22 | 音声合成装置、方法、およびプログラム |
US14/969,150 US9805711B2 (en) | 2014-12-22 | 2015-12-15 | Sound synthesis device, sound synthesis method and storage medium |
CN201510968697.6A CN105719640B (zh) | 2014-12-22 | 2015-12-22 | 声音合成装置及声音合成方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2014259485A JP6520108B2 (ja) | 2014-12-22 | 2014-12-22 | 音声合成装置、方法、およびプログラム |
Publications (3)
Publication Number | Publication Date |
---|---|
JP2016118722A JP2016118722A (ja) | 2016-06-30 |
JP2016118722A5 JP2016118722A5 (enrdf_load_stackoverflow) | 2018-02-08 |
JP6520108B2 true JP6520108B2 (ja) | 2019-05-29 |
Family
ID=56130165
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2014259485A Active JP6520108B2 (ja) | 2014-12-22 | 2014-12-22 | 音声合成装置、方法、およびプログラム |
Country Status (3)
Country | Link |
---|---|
US (1) | US9805711B2 (enrdf_load_stackoverflow) |
JP (1) | JP6520108B2 (enrdf_load_stackoverflow) |
CN (1) | CN105719640B (enrdf_load_stackoverflow) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109952609B (zh) * | 2016-11-07 | 2023-08-15 | 雅马哈株式会社 | 声音合成方法 |
KR102304701B1 (ko) * | 2017-03-28 | 2021-09-24 | 삼성전자주식회사 | 사용자의 음성 입력에 대한 답변을 제공하는 방법 및 장치 |
KR102079453B1 (ko) * | 2018-07-31 | 2020-02-19 | 전자부품연구원 | 비디오 특성에 부합하는 오디오 합성 방법 |
CN113160792B (zh) * | 2021-01-15 | 2023-11-17 | 广东外语外贸大学 | 一种多语种的语音合成方法、装置和系统 |
CN113409798B (zh) * | 2021-06-22 | 2024-07-05 | 科大讯飞股份有限公司 | 车内含噪语音数据生成方法、装置以及设备 |
CN115148186A (zh) * | 2022-06-29 | 2022-10-04 | 北京有竹居网络技术有限公司 | 语音合成方法、装置、可读介质及电子设备 |
Family Cites Families (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4692941A (en) * | 1984-04-10 | 1987-09-08 | First Byte | Real-time text-to-speech conversion system |
US5636325A (en) * | 1992-11-13 | 1997-06-03 | International Business Machines Corporation | Speech synthesis and analysis of dialects |
US5642466A (en) * | 1993-01-21 | 1997-06-24 | Apple Computer, Inc. | Intonation adjustment in text-to-speech systems |
US5796916A (en) * | 1993-01-21 | 1998-08-18 | Apple Computer, Inc. | Method and apparatus for prosody for synthetic speech prosody determination |
CN1032391C (zh) * | 1994-04-01 | 1996-07-24 | 清华大学 | 基于波形编辑的汉语文字-语音转换方法及系统 |
CN1118493A (zh) * | 1994-08-01 | 1996-03-13 | 中国科学院声学研究所 | 基音同步波形叠加汉语文语转换系统 |
US5832434A (en) * | 1995-05-26 | 1998-11-03 | Apple Computer, Inc. | Method and apparatus for automatic assignment of duration values for synthetic speech |
JP3173382B2 (ja) * | 1996-08-06 | 2001-06-04 | ヤマハ株式会社 | 楽音制御装置、カラオケ装置、音楽情報供給及び再生方法、音楽情報供給装置並びに音楽再生装置 |
JPH10153998A (ja) * | 1996-09-24 | 1998-06-09 | Nippon Telegr & Teleph Corp <Ntt> | 補助情報利用型音声合成方法、この方法を実施する手順を記録した記録媒体、およびこの方法を実施する装置 |
JP2000010581A (ja) * | 1998-06-19 | 2000-01-14 | Nec Corp | 音声合成装置 |
JP3515039B2 (ja) * | 2000-03-03 | 2004-04-05 | 沖電気工業株式会社 | テキスト音声変換装置におけるピッチパタン制御方法 |
JP2003223181A (ja) * | 2002-01-29 | 2003-08-08 | Yamaha Corp | 文字−音声変換装置およびそれを用いた携帯端末装置 |
US6988064B2 (en) * | 2003-03-31 | 2006-01-17 | Motorola, Inc. | System and method for combined frequency-domain and time-domain pitch extraction for speech signals |
JP4428093B2 (ja) * | 2004-03-05 | 2010-03-10 | ヤマハ株式会社 | ピッチパターン生成装置、ピッチパターン生成方法及びピッチパターン生成プログラム |
JP2006309162A (ja) * | 2005-03-29 | 2006-11-09 | Toshiba Corp | ピッチパターン生成方法、ピッチパターン生成装置及びプログラム |
JP4738057B2 (ja) * | 2005-05-24 | 2011-08-03 | 株式会社東芝 | ピッチパターン生成方法及びその装置 |
CN100347741C (zh) * | 2005-09-02 | 2007-11-07 | 清华大学 | 移动语音合成方法 |
JP4241762B2 (ja) * | 2006-05-18 | 2009-03-18 | 株式会社東芝 | 音声合成装置、その方法、及びプログラム |
CN101000764B (zh) * | 2006-12-18 | 2011-05-18 | 黑龙江大学 | 基于韵律结构的语音合成文本处理方法 |
JP5434587B2 (ja) * | 2007-02-20 | 2014-03-05 | 日本電気株式会社 | 音声合成装置及び方法とプログラム |
JP2009048003A (ja) * | 2007-08-21 | 2009-03-05 | Toshiba Corp | 音声翻訳装置及び方法 |
CN101452699A (zh) * | 2007-12-04 | 2009-06-10 | 株式会社东芝 | 韵律自适应及语音合成的方法和装置 |
US8244546B2 (en) * | 2008-05-28 | 2012-08-14 | National Institute Of Advanced Industrial Science And Technology | Singing synthesis parameter data estimation system |
JP2010039277A (ja) * | 2008-08-06 | 2010-02-18 | Mitsubishi Electric Corp | 音声合成装置 |
JP2012220701A (ja) * | 2011-04-08 | 2012-11-12 | Hitachi Ltd | 音声合成装置及びその合成音声修正方法 |
TWI573129B (zh) * | 2013-02-05 | 2017-03-01 | 國立交通大學 | 編碼串流產生裝置、韻律訊息編碼裝置、韻律結構分析裝置與語音合成之裝置及方法 |
US9208775B2 (en) * | 2013-02-21 | 2015-12-08 | Qualcomm Incorporated | Systems and methods for determining pitch pulse period signal boundaries |
-
2014
- 2014-12-22 JP JP2014259485A patent/JP6520108B2/ja active Active
-
2015
- 2015-12-15 US US14/969,150 patent/US9805711B2/en active Active
- 2015-12-22 CN CN201510968697.6A patent/CN105719640B/zh active Active
Also Published As
Publication number | Publication date |
---|---|
JP2016118722A (ja) | 2016-06-30 |
CN105719640B (zh) | 2019-11-05 |
US20160180833A1 (en) | 2016-06-23 |
CN105719640A (zh) | 2016-06-29 |
US9805711B2 (en) | 2017-10-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6520108B2 (ja) | 音声合成装置、方法、およびプログラム | |
JP4025355B2 (ja) | 音声合成装置及び音声合成方法 | |
JP6342428B2 (ja) | 音声合成装置、音声合成方法およびプログラム | |
JP6561499B2 (ja) | 音声合成装置および音声合成方法 | |
JP2007249212A (ja) | テキスト音声合成のための方法、コンピュータプログラム及びプロセッサ | |
GB2603776A (en) | Methods and systems for modifying speech generated by a text-to-speech synthesiser | |
JP4738057B2 (ja) | ピッチパターン生成方法及びその装置 | |
JP6821970B2 (ja) | 音声合成装置および音声合成方法 | |
JP6013104B2 (ja) | 音声合成方法、装置、及びプログラム | |
US8478595B2 (en) | Fundamental frequency pattern generation apparatus and fundamental frequency pattern generation method | |
JP2001265375A (ja) | 規則音声合成装置 | |
JP2003108178A (ja) | 音声合成装置及び音声合成用素片作成装置 | |
JP6314828B2 (ja) | 韻律モデル学習装置、韻律モデル学習方法、音声合成システム、および韻律モデル学習プログラム | |
JP2016065900A (ja) | 音声合成装置、方法、およびプログラム | |
WO2008056604A1 (fr) | Système de collecte de son, procédé de collecte de son et programme de traitement de collecte | |
Wen et al. | Prosody Conversion for Emotional Mandarin Speech Synthesis Using the Tone Nucleus Model. | |
JPH09319391A (ja) | 音声合成方法 | |
JP6213217B2 (ja) | 音声合成装置及び音声合成用コンピュータプログラム | |
JP2008191477A (ja) | ハイブリッド型音声合成方法、及びその装置とそのプログラムと、その記憶媒体 | |
Huang et al. | Hierarchical prosodic pattern selection based on Fujisaki model for natural mandarin speech synthesis | |
JP3854593B2 (ja) | 音声合成装置及びそのためのコスト計算装置、並びにコンピュータプログラム | |
JP6519096B2 (ja) | 音声合成装置、方法、およびプログラム | |
JP6519097B2 (ja) | 音声合成装置、方法、およびプログラム | |
JP6056190B2 (ja) | 音声合成装置 | |
WO2014017024A1 (ja) | 音声合成装置、音声合成方法、及び音声合成プログラム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A521 | Written amendment |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20171219 |
|
A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20171219 |
|
A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20180926 |
|
A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20181002 |
|
A521 | Written amendment |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20181025 |
|
TRDD | Decision of grant or rejection written | ||
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20190402 |
|
A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20190415 |
|
R150 | Certificate of patent or registration of utility model |
Ref document number: 6520108 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |