KR101882103B1 - 음성 합성 시스템의 최적화 방법 및 장치 - Google Patents

음성 합성 시스템의 최적화 방법 및 장치 Download PDF

Info

Publication number
KR101882103B1
KR101882103B1 KR1020160170531A KR20160170531A KR101882103B1 KR 101882103 B1 KR101882103 B1 KR 101882103B1 KR 1020160170531 A KR1020160170531 A KR 1020160170531A KR 20160170531 A KR20160170531 A KR 20160170531A KR 101882103 B1 KR101882103 B1 KR 101882103B1
Authority
KR
South Korea
Prior art keywords
speech synthesis
synthesis system
optimizing speech
optimizing
synthesis
Prior art date
Application number
KR1020160170531A
Other languages
English (en)
Other versions
KR20170087016A (ko
Inventor
칭창 하오
슈린 리
지에 바이
하이유안 탕
Original Assignee
바이두 온라인 네트웍 테크놀러지 (베이징) 캄파니 리미티드
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 바이두 온라인 네트웍 테크놀러지 (베이징) 캄파니 리미티드 filed Critical 바이두 온라인 네트웍 테크놀러지 (베이징) 캄파니 리미티드
Publication of KR20170087016A publication Critical patent/KR20170087016A/ko
Application granted granted Critical
Publication of KR101882103B1 publication Critical patent/KR101882103B1/ko

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management
    • G10L13/047Architecture of speech synthesisers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/06Elementary speech units used in speech synthesisers; Concatenation rules
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • G10L13/10Prosody rules derived from text; Stress or intonation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L45/00Routing or path finding of packets in data switching networks
    • H04L45/38Flow based routing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L2013/021Overlap-add techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)
  • Machine Translation (AREA)
KR1020160170531A 2016-01-19 2016-12-14 음성 합성 시스템의 최적화 방법 및 장치 KR101882103B1 (ko)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610034930.8 2016-01-19
CN201610034930.8A CN105489216B (zh) 2016-01-19 2016-01-19 语音合成系统的优化方法和装置

Publications (2)

Publication Number Publication Date
KR20170087016A KR20170087016A (ko) 2017-07-27
KR101882103B1 true KR101882103B1 (ko) 2018-07-25

Family

ID=55676163

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020160170531A KR101882103B1 (ko) 2016-01-19 2016-12-14 음성 합성 시스템의 최적화 방법 및 장치

Country Status (4)

Country Link
US (1) US10242660B2 (ko)
JP (1) JP6373924B2 (ko)
KR (1) KR101882103B1 (ko)
CN (1) CN105489216B (ko)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107749931A (zh) * 2017-09-29 2018-03-02 携程旅游信息技术(上海)有限公司 互动式语音应答的方法、系统、设备及存储介质
CN112837669B (zh) * 2020-05-21 2023-10-24 腾讯科技(深圳)有限公司 语音合成方法、装置及服务器

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3446764B2 (ja) * 1991-11-12 2003-09-16 富士通株式会社 音声合成システム及び音声合成サーバ
JP3083640B2 (ja) * 1992-05-28 2000-09-04 株式会社東芝 音声合成方法および装置
KR0140131B1 (ko) * 1995-04-26 1998-07-01 김주용 이동통신 시스템에서 셀렉터와 다수개의 보코더 인터페이스 장치 및 방법
US6052666A (en) * 1995-11-06 2000-04-18 Thomson Multimedia S.A. Vocal identification of devices in a home environment
US7136816B1 (en) * 2002-04-05 2006-11-14 At&T Corp. System and method for predicting prosodic parameters
JP2004020613A (ja) * 2002-06-12 2004-01-22 Canon Inc サーバ、受信端末
CN1261846C (zh) * 2004-08-03 2006-06-28 威盛电子股份有限公司 一种计算机系统的实时电源管理方法及其系统
CN1787072B (zh) * 2004-12-07 2010-06-16 北京捷通华声语音技术有限公司 基于韵律模型和参数选音的语音合成方法
US8023574B2 (en) * 2006-05-05 2011-09-20 Intel Corporation Method and apparatus to support scalability in a multicarrier network
US20080154605A1 (en) * 2006-12-21 2008-06-26 International Business Machines Corporation Adaptive quality adjustments for speech synthesis in a real-time speech processing system based upon load
WO2009059456A1 (en) * 2007-11-06 2009-05-14 Lucent Technologies Inc. Method for controlling load balance of network system, client, server and network system
CN102117614B (zh) * 2010-01-05 2013-01-02 索尼爱立信移动通讯有限公司 个性化文本语音合成和个性化语音特征提取
JP2013057734A (ja) * 2011-09-07 2013-03-28 Toshiba Corp 音声変換装置、音声変換装システム、プログラムおよび音声変換方法
CN103649948A (zh) * 2012-06-21 2014-03-19 华为技术有限公司 键值数据库的数据合并方法和装置
CN103841042B (zh) * 2014-02-19 2017-09-19 华为技术有限公司 在高运行效率下传输数据的方法和装置
CN104850612B (zh) * 2015-05-13 2020-08-04 中国电力科学研究院 一种基于增强凝聚层次聚类的配网用户负荷特征分类方法

Also Published As

Publication number Publication date
JP2017129840A (ja) 2017-07-27
US10242660B2 (en) 2019-03-26
US20170206886A1 (en) 2017-07-20
CN105489216A (zh) 2016-04-13
KR20170087016A (ko) 2017-07-27
JP6373924B2 (ja) 2018-08-15
CN105489216B (zh) 2020-03-03

Similar Documents

Publication Publication Date Title
EP3859731A4 (en) METHOD AND DEVICE FOR SPEECH SYNTHESIS
EP3183727A4 (en) System and method for speech validation
EP3180785A4 (en) Systems and methods for speech transcription
EP3543731A4 (en) POSITIONING METHOD AND SYSTEM, AND ASSOCIATED DEVICE
EP3371808B8 (en) Speech processing system and method
EP3373293A4 (en) METHOD AND APPARATUS FOR VOICE RECOGNITION
EP3401776A4 (en) METHOD AND SYSTEM FOR CONFIGURING SOUND EFFECT AND DEVICE THEREOF
EP3160138A4 (en) Image synthesis system, image synthesis device therefor, and image synthesis method
EP3511937A4 (en) DEVICE AND METHOD FOR SEPARATING SOUND SOURCES AND PROGRAM
EP3544002A4 (en) SPEECH RECOGNITION DEVICE AND SPEECH RECOGNITION SYSTEM
EP3136677A4 (en) Voice verification method, device and system
EP3211637A4 (en) Speech synthesis device and method
EP3537432A4 (en) LANGUAGE SYNTHESIS PROCEDURE
EP3364672A4 (en) Positioning system, method and device
EP3468132A4 (en) METHOD AND DEVICE FOR TRANSMITTING LANGUAGE DATA
EP3457301A4 (en) METHOD AND SYSTEM FOR STARTING AN APPLICATION
EP3321927A4 (en) LANGUAGE INTERACTION PROCEDURE AND LANGUAGE INTERACTION DEVICE
EP3389043A4 (en) VOICE INTERACTION DEVICE AND VOICE INTERACTION METHOD
EP3168839A4 (en) Voice recognition device and voice recognition system
EP3205116A4 (en) Method and apparatus for providing customised sound distributions
EP3441381A4 (en) PROCESS FOR PRODUCTION OF METHANOL AND DEVICE FOR PRODUCTION OF METHANOL
EP3714452A4 (en) LYRICAL IMPROVEMENT PROCESS AND SYSTEM
EP3095112A4 (en) System and method for synthesis of speech from provided text
EP3457666A4 (en) METHOD AND SYSTEM FOR STARTING AN APPLICATION
EP3516856A4 (en) SYSTEM AND METHOD FOR SAFE INTERACTIVE VOICE RESPONSE

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
E701 Decision to grant or registration of patent right
GRNT Written decision to grant