SG11202009556XA - Text-to-speech synthesis system and method - Google Patents

Text-to-speech synthesis system and method

Info

Publication number
SG11202009556XA
SG11202009556XA SG11202009556XA SG11202009556XA SG11202009556XA SG 11202009556X A SG11202009556X A SG 11202009556XA SG 11202009556X A SG11202009556X A SG 11202009556XA SG 11202009556X A SG11202009556X A SG 11202009556XA SG 11202009556X A SG11202009556X A SG 11202009556XA
Authority
SG
Singapore
Prior art keywords
text
speech synthesis
synthesis system
speech
synthesis
Prior art date
Application number
SG11202009556XA
Other languages
English (en)
Inventor
Piero Perucci
Martin Reber
Vijeta Avijeet
Original Assignee
Telepathy Labs Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telepathy Labs Inc filed Critical Telepathy Labs Inc
Publication of SG11202009556XA publication Critical patent/SG11202009556XA/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management
    • G10L13/047Architecture of speech synthesisers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Signal Processing (AREA)
  • Machine Translation (AREA)
SG11202009556XA 2018-03-28 2019-03-27 Text-to-speech synthesis system and method SG11202009556XA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201862649312P 2018-03-28 2018-03-28
PCT/US2019/024317 WO2019191251A1 (fr) 2018-03-28 2019-03-27 Procédé et système de synthèse vocale

Publications (1)

Publication Number Publication Date
SG11202009556XA true SG11202009556XA (en) 2020-10-29

Family

ID=68058486

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11202009556XA SG11202009556XA (en) 2018-03-28 2019-03-27 Text-to-speech synthesis system and method

Country Status (4)

Country Link
US (3) US11450307B2 (fr)
EP (1) EP3776532A4 (fr)
SG (1) SG11202009556XA (fr)
WO (1) WO2019191251A1 (fr)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019191251A1 (fr) * 2018-03-28 2019-10-03 Telepathy Labs, Inc. Procédé et système de synthèse vocale
US11545132B2 (en) * 2019-08-28 2023-01-03 International Business Machines Corporation Speech characterization using a synthesized reference audio signal
US11302300B2 (en) * 2019-11-19 2022-04-12 Applications Technology (Apptek), Llc Method and apparatus for forced duration in neural speech synthesis
CN111048116B (zh) * 2019-12-23 2022-08-19 度小满科技(北京)有限公司 一种数据处理方法、装置及电子设备
TWI749447B (zh) * 2020-01-16 2021-12-11 國立中正大學 同步語音產生裝置及其產生方法
US11574622B2 (en) * 2020-07-02 2023-02-07 Ford Global Technologies, Llc Joint automatic speech recognition and text to speech conversion using adversarial neural networks
CN111898755B (zh) * 2020-08-11 2023-09-12 中国人民解放军海军航空大学 单一航迹智能合成方法及装置
WO2022094740A1 (fr) * 2020-11-03 2022-05-12 Microsoft Technology Licensing, Llc Entraînement controlé et utilisation de modèles de texte-parole et voix générées par des modèles personnalisées
CN113411456B (zh) * 2021-06-29 2023-05-02 中国人民解放军63892部队 一种基于语音识别的话音质量评估方法及装置

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4624012A (en) * 1982-05-06 1986-11-18 Texas Instruments Incorporated Method and apparatus for converting voice characteristics of synthesized speech
JP3502247B2 (ja) * 1997-10-28 2004-03-02 ヤマハ株式会社 音声変換装置
KR100438826B1 (ko) * 2001-10-31 2004-07-05 삼성전자주식회사 스무딩 필터를 이용한 음성 합성 시스템 및 그 방법
US20070016421A1 (en) * 2005-07-12 2007-01-18 Nokia Corporation Correcting a pronunciation of a synthetically generated speech object
JP4878538B2 (ja) * 2006-10-24 2012-02-15 株式会社日立製作所 音声合成装置
US8886537B2 (en) * 2007-03-20 2014-11-11 Nuance Communications, Inc. Method and system for text-to-speech synthesis with personalized voice
US8954324B2 (en) * 2007-09-28 2015-02-10 Qualcomm Incorporated Multiple microphone voice activity detector
US9564120B2 (en) * 2010-05-14 2017-02-07 General Motors Llc Speech adaptation in speech synthesis
TWI413104B (zh) * 2010-12-22 2013-10-21 Ind Tech Res Inst 可調控式韻律重估測系統與方法及電腦程式產品
JP5664480B2 (ja) * 2011-06-30 2015-02-04 富士通株式会社 異常状態検出装置、電話機、異常状態検出方法、及びプログラム
TWI471854B (zh) * 2012-10-19 2015-02-01 Ind Tech Res Inst 引導式語者調適語音合成的系統與方法及電腦程式產品
US8527276B1 (en) * 2012-10-25 2013-09-03 Google Inc. Speech synthesis using deep neural networks
JP6268717B2 (ja) * 2013-03-04 2018-01-31 富士通株式会社 状態推定装置、状態推定方法及び状態推定用コンピュータプログラム
US9293129B2 (en) * 2013-03-05 2016-03-22 Microsoft Technology Licensing, Llc Speech recognition assisted evaluation on text-to-speech pronunciation issue detection
US10127901B2 (en) * 2014-06-13 2018-11-13 Microsoft Technology Licensing, Llc Hyper-structure recurrent neural networks for text-to-speech
US9824681B2 (en) * 2014-09-11 2017-11-21 Microsoft Technology Licensing, Llc Text-to-speech with emotional content
US9818396B2 (en) * 2015-07-24 2017-11-14 Yamaha Corporation Method and device for editing singing voice synthesis data, and method for analyzing singing
EP3151239A1 (fr) * 2015-09-29 2017-04-05 Yandex Europe AG Procedes et systemes pour la synthese de texte en discours
WO2019191251A1 (fr) * 2018-03-28 2019-10-03 Telepathy Labs, Inc. Procédé et système de synthèse vocale

Also Published As

Publication number Publication date
WO2019191251A1 (fr) 2019-10-03
US20220375452A1 (en) 2022-11-24
EP3776532A4 (fr) 2021-12-01
US20230368775A1 (en) 2023-11-16
US11741942B2 (en) 2023-08-29
US11450307B2 (en) 2022-09-20
EP3776532A1 (fr) 2021-02-17
US20210366460A1 (en) 2021-11-25

Similar Documents

Publication Publication Date Title
SG11202009556XA (en) Text-to-speech synthesis system and method
EP3688671A4 (fr) Procédé et système destinés à la synthèse d'un réseau neuronal
EP3625791A4 (fr) Système et procédé de texte-parole reposant sur l'intelligence artificielle
EP3859731A4 (fr) Procédé et dispositif de synthèse de parole
SG10202104872UA (en) Acoustic method and system for providing digital data
ZA202003999B (en) System and methods
IL254317A0 (en) A system and method for creating accurate speech transcription from natural speech sound signals
HUE064070T2 (hu) Nyelvek közötti hangátalakító rendszer és eljárás
ZA201604177B (en) System and method for synthesis of speech from provided text
GB201801768D0 (en) Synthesis method
EP3631789A4 (fr) Système et procédé de production automatique de sortie musicale
GB2596770B (en) Carrier-resolved photo-hall system and method
GB2572677B (en) System and method
GB2582865B (en) Biopolymer synthesis system and method
GB201913039D0 (en) Polynicleotide synthesis method kit and system
IL263348A (en) A method and system for creating a jewel in the shape of a snowflake
GB201809582D0 (en) System and method
IL257059B (en) Multi-beamforming system and method
KR101882103B1 (ko) 음성 합성 시스템의 최적화 방법 및 장치
PT3844135T (pt) Processo e sistema para a síntese do metanol
GB201816668D0 (en) System and method
GB201812593D0 (en) Illimination system and method
GB201817593D0 (en) An ATM-requesting-and-accessing system and method
GB201817404D0 (en) The speech synthesis system and its implementation method
GB201913041D0 (en) Polynicleotide synthesis method kit and system