GB202108468D0 - Text-to-speech system - Google Patents
Text-to-speech system
Info
- Publication number
- GB202108468D0
- Authority
- GB
- United Kingdom
- Prior art keywords
- text
- speech system
- speech
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/027—Concept to speech synthesisers; Generation of natural phrases from machine-based concepts
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
- G10L13/10—Prosody rules derived from text; Stress or intonation
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Document Processing Apparatus (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB2108468.6A GB2607903A (en) | 2021-06-14 | 2021-06-14 | Text-to-speech system |
PCT/GB2022/051491 WO2022263806A1 (en) | 2021-06-14 | 2022-06-14 | Text-to-speech system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB2108468.6A GB2607903A (en) | 2021-06-14 | 2021-06-14 | Text-to-speech system |
Publications (2)
Publication Number | Publication Date |
---|---|
GB202108468D0 (en) | 2021-07-28 |
GB2607903A (en) | 2022-12-21 |
Family
ID=76954504
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GB2108468.6A Pending GB2607903A (en) | 2021-06-14 | 2021-06-14 | Text-to-speech system |
Country Status (2)
Country | Link |
---|---|
GB (1) | GB2607903A (en) |
WO (1) | WO2022263806A1 (en) |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9570065B2 (en) * | 2014-09-29 | 2017-02-14 | Nuance Communications, Inc. | Systems and methods for multi-style speech synthesis |
KR102616214B1 (en) * | 2019-08-03 | 2023-12-21 | 구글 엘엘씨 | Expressive control in end-to-end speech synthesis systems |
WO2021034786A1 (en) * | 2019-08-21 | 2021-02-25 | Dolby Laboratories Licensing Corporation | Systems and methods for adapting human speaker embeddings in speech synthesis |
- 2021-06-14: GB application GB2108468.6A, published as GB2607903A (en), status: active, pending
- 2022-06-14: WO application PCT/GB2022/051491, published as WO2022263806A1 (en), status: unknown
Non-Patent Citations (3)
Title |
---|
A. VASWANI, N. SHAZEER, N. PARMAR, J. USZKOREIT, L. JONES, A. N. GOMEZ, Ł. KAISER, I. POLOSUKHIN: "Advances in Neural Information Processing Systems", vol. 30, 2017, CURRAN ASSOCIATES, INC., article "Attention is all you need" |
J. SHEN, R. PANG, R. J. WEISS, M. SCHUSTER, N. JAITLY, Z. YANG, Z. CHEN, Y. ZHANG, Y. WANG, R. SKERRY-RYAN: "Natural TTS synthesis by conditioning wavenet on mel spectrogram predictions", PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), CALGARY, CANADA, April 2018 (2018-04-01), pages 4779 - 4783 |
Y. WANG, D. STANTON, Y. ZHANG, R.-S. RYAN, E. BATTENBERG, J. SHOR, Y. XIAO, Y. JIA, F. REN, R. A. SAUROUS: "Proceedings of the 35th International Conference on Machine Learning", vol. 80, 10 July 2018, article "Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis", pages: 5180 - 5189 |
Also Published As
Publication number | Publication date |
---|---|
GB2607903A (en) | 2022-12-21 |
WO2022263806A1 (en) | 2022-12-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
GB2591245B (en) | | An expressive text-to-speech system |
SG11202009556XA (en) | | Text-to-speech synthesis system and method |
EP4128551A4 (en) | | Voice interactive system |
EP4250893A4 (en) | | Mounting-related system |
SG11202009311RA (en) | | Speech analysis system |
GB202010620D0 (en) | | System |
GB202108468D0 (en) | | Text-to-speech system |
GB2611336B (en) | | Net-launching system |
ZA202109464B (en) | | Employment-managing system |
EP4225612A4 (en) | | Illuminated-marking system |
GB202113813D0 (en) | | ADDOR system |
EP4318260A4 (en) | | Data-saving system |
GB202102363D0 (en) | | System |
GB2607345B (en) | | Connection system |
GB202101568D0 (en) | | System |
GB202018336D0 (en) | | VxRy system |
GB202020321D0 (en) | | X-smart system |
GB2591790B (en) | | Speaker system |
GB202314021D0 (en) | | Thethering point system |
GB202404685D0 (en) | | Livewell system |
GB202318654D0 (en) | | System |
GB202317255D0 (en) | | System |
GB202315789D0 (en) | | System |
GB202314700D0 (en) | | System |
GB202313710D0 (en) | | System |