PL401371A1 - Opracowanie głosu dla zautomatyzowanej zamiany tekstu na mowę - Google Patents
Opracowanie głosu dla zautomatyzowanej zamiany tekstu na mowęInfo
- Publication number
- PL401371A1 PL401371A1 PL401371A PL40137112A PL401371A1 PL 401371 A1 PL401371 A1 PL 401371A1 PL 401371 A PL401371 A PL 401371A PL 40137112 A PL40137112 A PL 40137112A PL 401371 A1 PL401371 A1 PL 401371A1
- Authority
- PL
- Poland
- Prior art keywords
- voice
- text
- development
- synthesized speech
- conversion system
- Prior art date
Links
- 238000006243 chemical reaction Methods 0.000 title 1
- 238000010801 machine learning Methods 0.000 abstract 1
- 238000000034 method Methods 0.000 abstract 1
- 238000012986 modification Methods 0.000 abstract 1
- 230000004048 modification Effects 0.000 abstract 1
- 230000009466 transformation Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Electrically Operated Instructional Devices (AREA)
Abstract
Wynalazek dotyczy opracowania głosu dla zautomatyzowanej zamiany tekstu na mowę. Grupie użytkowników przedstawia się tekst oraz nagrania mowy syntetyzowanej dla danego tekstu. Użytkownicy wysłuchują nagrania mowy syntetyzowanej i przekazują informacje zwrotne na temat błędów nagrania oraz innych kwestii dotyczących syntetyzowanej mowy. System obejmujący co najmniej jedno urządzenie obliczeniowe przeprowadza analizę informacji zwrotnych, dokonuje modyfikacji głosu lub reguł przekształcania i cyklicznie testuje zmodyfikowane nagrania. Modyfikacje są określane przy pomocy algorytmów uczenia maszynowego oraz innych zautomatyzowanych procesów.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PL401371A PL401371A1 (pl) | 2012-10-26 | 2012-10-26 | Opracowanie głosu dla zautomatyzowanej zamiany tekstu na mowę |
US13/720,925 US9196240B2 (en) | 2012-10-26 | 2012-12-19 | Automated text to speech voice development |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PL401371A PL401371A1 (pl) | 2012-10-26 | 2012-10-26 | Opracowanie głosu dla zautomatyzowanej zamiany tekstu na mowę |
Publications (1)
Publication Number | Publication Date |
---|---|
PL401371A1 true PL401371A1 (pl) | 2014-04-28 |
Family
ID=50515001
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PL401371A PL401371A1 (pl) | 2012-10-26 | 2012-10-26 | Opracowanie głosu dla zautomatyzowanej zamiany tekstu na mowę |
Country Status (2)
Country | Link |
---|---|
US (1) | US9196240B2 (pl) |
PL (1) | PL401371A1 (pl) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109634872A (zh) * | 2019-02-25 | 2019-04-16 | 北京达佳互联信息技术有限公司 | 应用测试方法、装置、终端及存储介质 |
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9275633B2 (en) * | 2012-01-09 | 2016-03-01 | Microsoft Technology Licensing, Llc | Crowd-sourcing pronunciation corrections in text-to-speech engines |
US9311913B2 (en) * | 2013-02-05 | 2016-04-12 | Nuance Communications, Inc. | Accuracy of text-to-speech synthesis |
US9524717B2 (en) * | 2013-10-15 | 2016-12-20 | Trevo Solutions Group LLC | System, method, and computer program for integrating voice-to-text capability into call systems |
US20150149178A1 (en) * | 2013-11-22 | 2015-05-28 | At&T Intellectual Property I, L.P. | System and method for data-driven intonation generation |
US9911408B2 (en) * | 2014-03-03 | 2018-03-06 | General Motors Llc | Dynamic speech system tuning |
US9384728B2 (en) | 2014-09-30 | 2016-07-05 | International Business Machines Corporation | Synthesizing an aggregate voice |
US10360716B1 (en) * | 2015-09-18 | 2019-07-23 | Amazon Technologies, Inc. | Enhanced avatar animation |
KR20170044849A (ko) * | 2015-10-16 | 2017-04-26 | 삼성전자주식회사 | 전자 장치 및 다국어/다화자의 공통 음향 데이터 셋을 활용하는 tts 변환 방법 |
US10074359B2 (en) | 2016-11-01 | 2018-09-11 | Google Llc | Dynamic text-to-speech provisioning |
DE212016000292U1 (de) * | 2016-11-03 | 2019-07-03 | Bayerische Motoren Werke Aktiengesellschaft | System zur Text-zu-Sprache-Leistungsbewertung |
US9741337B1 (en) * | 2017-04-03 | 2017-08-22 | Green Key Technologies Llc | Adaptive self-trained computer engines with associated databases and methods of use thereof |
US10319364B2 (en) | 2017-05-18 | 2019-06-11 | Telepathy Labs, Inc. | Artificial intelligence-based text-to-speech system and method |
CN111201565A (zh) | 2017-05-24 | 2020-05-26 | 调节股份有限公司 | 用于声对声转换的系统和方法 |
US10565981B2 (en) * | 2017-09-26 | 2020-02-18 | Microsoft Technology Licensing, Llc | Computer-assisted conversation using addressible conversation segments |
US11416801B2 (en) * | 2017-11-20 | 2022-08-16 | Accenture Global Solutions Limited | Analyzing value-related data to identify an error in the value-related data and/or a source of the error |
US10521946B1 (en) | 2017-11-21 | 2019-12-31 | Amazon Technologies, Inc. | Processing speech to drive animations on avatars |
US11232645B1 (en) | 2017-11-21 | 2022-01-25 | Amazon Technologies, Inc. | Virtual spaces as a platform |
US10732708B1 (en) * | 2017-11-21 | 2020-08-04 | Amazon Technologies, Inc. | Disambiguation of virtual reality information using multi-modal data including speech |
US10755725B2 (en) | 2018-06-04 | 2020-08-25 | Motorola Solutions, Inc. | Determining and remedying audio quality issues in a voice communication |
CN110032626B (zh) * | 2019-04-19 | 2022-04-12 | 百度在线网络技术(北京)有限公司 | 语音播报方法和装置 |
WO2021030759A1 (en) | 2019-08-14 | 2021-02-18 | Modulate, Inc. | Generation and detection of watermark for real-time voice conversion |
JP2023546989A (ja) | 2020-10-08 | 2023-11-08 | モジュレイト インク. | コンテンツモデレーションのためのマルチステージ適応型システム |
Family Cites Families (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5920840A (en) * | 1995-02-28 | 1999-07-06 | Motorola, Inc. | Communication system and method using a speaker dependent time-scaling technique |
JP4132109B2 (ja) * | 1995-10-26 | 2008-08-13 | ソニー株式会社 | 音声信号の再生方法及び装置、並びに音声復号化方法及び装置、並びに音声合成方法及び装置 |
DE19610019C2 (de) * | 1996-03-14 | 1999-10-28 | Data Software Gmbh G | Digitales Sprachsyntheseverfahren |
EP1187100A1 (en) * | 2000-09-06 | 2002-03-13 | Koninklijke KPN N.V. | A method and a device for objective speech quality assessment without reference signal |
US20020087224A1 (en) * | 2000-12-29 | 2002-07-04 | Barile Steven E. | Concatenated audio title |
US6487494B2 (en) * | 2001-03-29 | 2002-11-26 | Wingcast, Llc | System and method for reducing the amount of repetitive data sent by a server to a client for vehicle navigation |
US6658383B2 (en) * | 2001-06-26 | 2003-12-02 | Microsoft Corporation | Method for coding speech and music signals |
US6999066B2 (en) * | 2002-06-24 | 2006-02-14 | Xerox Corporation | System for audible feedback for touch screen displays |
AU2003299312A1 (en) * | 2003-12-16 | 2005-07-05 | Loquendo S.P.A. | Text-to-speech method and system, computer program product therefor |
US7454348B1 (en) * | 2004-01-08 | 2008-11-18 | At&T Intellectual Property Ii, L.P. | System and method for blending synthetic voices |
DE602005026778D1 (de) * | 2004-01-16 | 2011-04-21 | Scansoft Inc | Corpus-gestützte sprachsynthese auf der basis von segmentrekombination |
US20080140406A1 (en) * | 2004-10-18 | 2008-06-12 | Koninklijke Philips Electronics, N.V. | Data-Processing Device and Method for Informing a User About a Category of a Media Content Item |
US7735012B2 (en) * | 2004-11-04 | 2010-06-08 | Apple Inc. | Audio user interface for computing devices |
US20070124142A1 (en) * | 2005-11-25 | 2007-05-31 | Mukherjee Santosh K | Voice enabled knowledge system |
US7684991B2 (en) * | 2006-01-05 | 2010-03-23 | Alpine Electronics, Inc. | Digital audio file search method and apparatus using text-to-speech processing |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
US20080129520A1 (en) * | 2006-12-01 | 2008-06-05 | Apple Computer, Inc. | Electronic device with enhanced audio feedback |
US8321222B2 (en) * | 2007-08-14 | 2012-11-27 | Nuance Communications, Inc. | Synthesis by generation and concatenation of multi-form segments |
US8996376B2 (en) * | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
US20100082328A1 (en) * | 2008-09-29 | 2010-04-01 | Apple Inc. | Systems and methods for speech preprocessing in text to speech synthesis |
US8352268B2 (en) * | 2008-09-29 | 2013-01-08 | Apple Inc. | Systems and methods for selective rate of speech and speech preferences for text to speech synthesis |
KR101617461B1 (ko) * | 2009-11-17 | 2016-05-02 | 엘지전자 주식회사 | 이동 통신 단말기에서의 티티에스 음성 데이터 출력 방법 및 이를 적용한 이동 통신 단말기 |
US20110161085A1 (en) * | 2009-12-31 | 2011-06-30 | Nokia Corporation | Method and apparatus for audio summary of activity for user |
-
2012
- 2012-10-26 PL PL401371A patent/PL401371A1/pl unknown
- 2012-12-19 US US13/720,925 patent/US9196240B2/en active Active
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109634872A (zh) * | 2019-02-25 | 2019-04-16 | 北京达佳互联信息技术有限公司 | 应用测试方法、装置、终端及存储介质 |
Also Published As
Publication number | Publication date |
---|---|
US20140122081A1 (en) | 2014-05-01 |
US9196240B2 (en) | 2015-11-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
PL401371A1 (pl) | Opracowanie głosu dla zautomatyzowanej zamiany tekstu na mowę | |
BR112021018532A2 (pt) | Sistemas e métodos para fidedignidade de modelo | |
WO2015009586A3 (en) | Performing an operation relative to tabular data based upon voice input | |
MX2021008012A (es) | Esquema flexible de adaptacion de modelo de lenguaje. | |
BR112017010222A2 (pt) | discriminando expressões ambíguas para aprimorar experiência do usuário | |
WO2013134641A3 (en) | Recognizing speech in multiple languages | |
WO2014070306A3 (en) | System and method for applying a business rule management system to a customer relationship management system | |
RU2015106668A (ru) | Устранение неоднозначности динамических команд | |
IN2014CN03209A (pl) | ||
WO2014140816A3 (en) | Apparatus and method for performing actions based on captured image data | |
WO2014085776A3 (en) | Web search ranking | |
DOP2014000045A (es) | Sistema y método para el aprendizaje de idiomas | |
PL401372A1 (pl) | Hybrydowa kompresja danych głosowych w systemach zamiany tekstu na mowę | |
AU2014205024A8 (en) | Methods and apparatus for identifying concepts corresponding to input information | |
MX2014006124A (es) | Sistema de enseñanza de idiomas que facilita la implicación del mentor. | |
BR112015020015A8 (pt) | método, meio de armazenamento legível por computador e aparelho para premiar conteúdo gerado por usuário | |
WO2013131025A3 (en) | Product cycle analysis using social media data | |
WO2012057588A3 (ko) | 학습능력 진단 장치 및 방법 | |
IN2013MU02064A (pl) | ||
MY194297A (en) | A method and device for providing search engine label | |
MX2015015876A (es) | Dispositivo informatico optico con una fuente de luz redundante y tren optico. | |
IL230969B (en) | A network-based system and method for influence | |
Tronnier et al. | Tendencies of Swedish word accent production by L2-learners with tonal and non-tonal L1 | |
WO2016029045A3 (en) | Lexical dialect analysis system | |
MX2015012797A (es) | Sistemas y metodos para interpretar informacion medica. |