CN101727904B - 语音翻译方法和装置 - Google Patents
语音翻译方法和装置 Download PDFInfo
- Publication number
- CN101727904B CN101727904B CN2008101746288A CN200810174628A CN101727904B CN 101727904 B CN101727904 B CN 101727904B CN 2008101746288 A CN2008101746288 A CN 2008101746288A CN 200810174628 A CN200810174628 A CN 200810174628A CN 101727904 B CN101727904 B CN 101727904B
- Authority
- CN
- China
- Prior art keywords
- voice
- information
- translation
- unit
- legible
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 56
- 238000000605 extraction Methods 0.000 claims description 29
- 239000000284 extract Substances 0.000 claims description 14
- 230000033764 rhythmic process Effects 0.000 claims description 14
- 230000007613 environmental effect Effects 0.000 claims description 12
- 238000003066 decision tree Methods 0.000 claims description 10
- 230000000295 complement effect Effects 0.000 claims description 4
- 230000000717 retained effect Effects 0.000 abstract description 2
- 230000015572 biosynthetic process Effects 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 238000004590 computer program Methods 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 2
- 235000007926 Craterellus fallax Nutrition 0.000 description 1
- 241001269238 Data Species 0.000 description 1
- 240000007175 Datura inoxia Species 0.000 description 1
- 238000005452 bending Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/58—Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Human Computer Interaction (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (14)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2008101746288A CN101727904B (zh) | 2008-10-31 | 2008-10-31 | 语音翻译方法和装置 |
US12/609,647 US9342509B2 (en) | 2008-10-31 | 2009-10-30 | Speech translation method and apparatus utilizing prosodic information |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2008101746288A CN101727904B (zh) | 2008-10-31 | 2008-10-31 | 语音翻译方法和装置 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101727904A CN101727904A (zh) | 2010-06-09 |
CN101727904B true CN101727904B (zh) | 2013-04-24 |
Family
ID=42132508
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2008101746288A Expired - Fee Related CN101727904B (zh) | 2008-10-31 | 2008-10-31 | 语音翻译方法和装置 |
Country Status (2)
Country | Link |
---|---|
US (1) | US9342509B2 (zh) |
CN (1) | CN101727904B (zh) |
Families Citing this family (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101727904B (zh) * | 2008-10-31 | 2013-04-24 | 国际商业机器公司 | 语音翻译方法和装置 |
US9069757B2 (en) * | 2010-10-31 | 2015-06-30 | Speech Morphing, Inc. | Speech morphing communication system |
DE102011055672A1 (de) * | 2011-11-24 | 2013-05-29 | Ben Fredj Mehdi | Verfahren zur Extraktion und Übersetzung eines Sprachinhalts, Vorrichtung auf dem das Verfahren durchführbar gespeichert ist und Verwendung eines dezentralen Netzwerks zur Durchführung des Verfahrens |
CN104754536A (zh) * | 2013-12-27 | 2015-07-01 | 中国移动通信集团公司 | 一种不同语言间实现通信的方法和系统 |
US20170329766A1 (en) * | 2014-12-09 | 2017-11-16 | Sony Corporation | Information processing apparatus, control method, and program |
CN105786801A (zh) * | 2014-12-22 | 2016-07-20 | 中兴通讯股份有限公司 | 一种语音翻译方法、通讯方法及相关装置 |
KR102251832B1 (ko) | 2016-06-16 | 2021-05-13 | 삼성전자주식회사 | 번역 서비스를 제공하는 전자 장치 및 방법 |
WO2018025090A2 (en) * | 2016-08-01 | 2018-02-08 | Speech Morphing Systems, Inc. | Method to model and transfer prosody of tags across languages |
KR102580904B1 (ko) * | 2016-09-26 | 2023-09-20 | 삼성전자주식회사 | 음성 신호를 번역하는 방법 및 그에 따른 전자 디바이스 |
US20180174577A1 (en) * | 2016-12-19 | 2018-06-21 | Microsoft Technology Licensing, Llc | Linguistic modeling using sets of base phonetics |
CN107315742A (zh) * | 2017-07-03 | 2017-11-03 | 中国科学院自动化研究所 | 具有人机对话功能的拟人化口语翻译方法及系统 |
WO2019071541A1 (zh) * | 2017-10-12 | 2019-04-18 | 深圳市沃特沃德股份有限公司 | 语音翻译方法、装置和终端设备 |
CN107992485A (zh) * | 2017-11-27 | 2018-05-04 | 北京搜狗科技发展有限公司 | 一种同声传译方法及装置 |
CN108090051A (zh) * | 2017-12-20 | 2018-05-29 | 深圳市沃特沃德股份有限公司 | 连续长语音文件的翻译方法与翻译机 |
EP3739476A4 (en) * | 2018-01-11 | 2021-12-08 | Neosapience, Inc. | SPEECH SYNTHESIS PROCESS FROM MULTILINGUAL TEXT |
CN108231062B (zh) * | 2018-01-12 | 2020-12-22 | 科大讯飞股份有限公司 | 一种语音翻译方法及装置 |
CN108447486B (zh) * | 2018-02-28 | 2021-12-03 | 科大讯飞股份有限公司 | 一种语音翻译方法及装置 |
CN112037768A (zh) * | 2019-05-14 | 2020-12-04 | 北京三星通信技术研究有限公司 | 语音翻译方法、装置、电子设备及计算机可读存储介质 |
KR20220024049A (ko) * | 2019-05-14 | 2022-03-03 | 삼성전자주식회사 | 음성 번역을 위한 방법, 장치, 전자 디바이스 및 컴퓨터 판독가능 저장 매체 |
US11587561B2 (en) * | 2019-10-25 | 2023-02-21 | Mary Lee Weir | Communication system and method of extracting emotion data during translations |
CN111128116B (zh) * | 2019-12-20 | 2021-07-23 | 珠海格力电器股份有限公司 | 一种语音处理方法、装置、计算设备及存储介质 |
US11417337B1 (en) * | 2021-08-12 | 2022-08-16 | Cresta Intelligence Inc. | Initiating conversation monitoring system action based on conversational content |
CN113781997B (zh) * | 2021-09-22 | 2024-07-23 | 联想(北京)有限公司 | 语音合成方法及电子设备 |
CN113921011A (zh) * | 2021-10-14 | 2022-01-11 | 安徽听见科技有限公司 | 音频处理方法、装置及设备 |
US20230245644A1 (en) * | 2022-01-28 | 2023-08-03 | Speech Morphing Systems, Inc. | End-to-end modular speech synthesis systems and methods |
CN114495977B (zh) * | 2022-01-28 | 2024-01-30 | 北京百度网讯科技有限公司 | 语音翻译和模型训练方法、装置、电子设备以及存储介质 |
US20230274100A1 (en) * | 2022-02-28 | 2023-08-31 | Google Llc | Techniques and Models for Multilingual Text Rewriting |
CN118430513B (zh) * | 2024-07-03 | 2024-09-20 | 广州趣丸网络科技有限公司 | 一种自然语音翻译系统 |
Family Cites Families (71)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5615380A (en) * | 1969-11-24 | 1997-03-25 | Hyatt; Gilbert P. | Integrated circuit computer system having a keyboard input and a sound output |
US5278943A (en) * | 1990-03-23 | 1994-01-11 | Bright Star Technology, Inc. | Speech animation and inflection system |
US5384893A (en) * | 1992-09-23 | 1995-01-24 | Emerson & Stern Associates, Inc. | Method and apparatus for speech synthesis based on prosodic analysis |
SE500277C2 (sv) * | 1993-05-10 | 1994-05-24 | Televerket | Anordning för att öka talförståelsen vid översätttning av tal från ett första språk till ett andra språk |
US5860064A (en) * | 1993-05-13 | 1999-01-12 | Apple Computer, Inc. | Method and apparatus for automatic generation of vocal emotion in a synthetic text-to-speech system |
SE516526C2 (sv) * | 1993-11-03 | 2002-01-22 | Telia Ab | Metod och anordning vid automatisk extrahering av prosodisk information |
US5704007A (en) * | 1994-03-11 | 1997-12-30 | Apple Computer, Inc. | Utilization of multiple voice sources in a speech synthesizer |
US5734794A (en) * | 1995-06-22 | 1998-03-31 | White; Tom H. | Method and system for voice-activated cell animation |
US6006175A (en) * | 1996-02-06 | 1999-12-21 | The Regents Of The University Of California | Methods and apparatus for non-acoustic speech characterization and recognition |
US5850629A (en) * | 1996-09-09 | 1998-12-15 | Matsushita Electric Industrial Co., Ltd. | User interface controller for text-to-speech synthesizer |
US5884266A (en) | 1997-04-02 | 1999-03-16 | Motorola, Inc. | Audio interface for document based information resource navigation and method therefor |
US6226614B1 (en) * | 1997-05-21 | 2001-05-01 | Nippon Telegraph And Telephone Corporation | Method and apparatus for editing/creating synthetic speech message and recording medium with the method recorded thereon |
JP4197195B2 (ja) * | 1998-02-27 | 2008-12-17 | ヒューレット・パッカード・カンパニー | 音声情報の提供方法 |
US6236966B1 (en) * | 1998-04-14 | 2001-05-22 | Michael K. Fleming | System and method for production of audio control parameters using a learning machine |
US6631368B1 (en) * | 1998-11-13 | 2003-10-07 | Nortel Networks Limited | Methods and apparatus for operating on non-text messages |
JP2000187435A (ja) * | 1998-12-24 | 2000-07-04 | Sony Corp | 情報処理装置、携帯機器、電子ペット装置、情報処理手順を記録した記録媒体及び情報処理方法 |
US6356865B1 (en) * | 1999-01-29 | 2002-03-12 | Sony Corporation | Method and apparatus for performing spoken language translation |
US6442524B1 (en) * | 1999-01-29 | 2002-08-27 | Sony Corporation | Analyzing inflectional morphology in a spoken language translation system |
US6697780B1 (en) * | 1999-04-30 | 2004-02-24 | At&T Corp. | Method and apparatus for rapid acoustic unit selection from a large speech corpus |
JP2001034282A (ja) * | 1999-07-21 | 2001-02-09 | Konami Co Ltd | 音声合成方法、音声合成のための辞書構築方法、音声合成装置、並びに音声合成プログラムを記録したコンピュータ読み取り可能な媒体 |
US6535849B1 (en) * | 2000-01-18 | 2003-03-18 | Scansoft, Inc. | Method and system for generating semi-literal transcripts for speech recognition systems |
US20030028380A1 (en) * | 2000-02-02 | 2003-02-06 | Freeland Warwick Peter | Speech system |
US6847931B2 (en) * | 2002-01-29 | 2005-01-25 | Lessac Technology, Inc. | Expressive parsing in computerized conversion of text to speech |
US7254531B2 (en) * | 2000-09-05 | 2007-08-07 | Nir Einat H | In-context analysis and automatic translation |
US6731307B1 (en) * | 2000-10-30 | 2004-05-04 | Koninklije Philips Electronics N.V. | User interface/entertainment device that simulates personal interaction and responds to user's mental state and/or personality |
JP4687936B2 (ja) * | 2001-03-22 | 2011-05-25 | ソニー株式会社 | 音声出力装置および音声出力方法、並びにプログラムおよび記録媒体 |
US6895376B2 (en) * | 2001-05-04 | 2005-05-17 | Matsushita Electric Industrial Co., Ltd. | Eigenvoice re-estimation technique of acoustic models for speech recognition, speaker identification and speaker verification |
EP1256931A1 (en) * | 2001-05-11 | 2002-11-13 | Sony France S.A. | Method and apparatus for voice synthesis and robot apparatus |
GB0113583D0 (en) * | 2001-06-04 | 2001-07-25 | Hewlett Packard Co | Speech system barge-in control |
IL144818A (en) * | 2001-08-09 | 2006-08-20 | Voicesense Ltd | Method and apparatus for speech analysis |
US20080300856A1 (en) * | 2001-09-21 | 2008-12-04 | Talkflow Systems, Llc | System and method for structuring information |
US7401020B2 (en) * | 2002-11-29 | 2008-07-15 | International Business Machines Corporation | Application of emotion-based intonation and prosody to speech in text-to-speech systems |
JP2003295882A (ja) * | 2002-04-02 | 2003-10-15 | Canon Inc | 音声合成用テキスト構造、音声合成方法、音声合成装置及びそのコンピュータ・プログラム |
US7382868B2 (en) * | 2002-04-02 | 2008-06-03 | Verizon Business Global Llc | Telephony services system with instant communications enhancements |
US8494859B2 (en) * | 2002-10-15 | 2013-07-23 | Gh, Llc | Universal processing system and methods for production of outputs accessible by people with disabilities |
JP3667332B2 (ja) * | 2002-11-21 | 2005-07-06 | 松下電器産業株式会社 | 標準モデル作成装置及び標準モデル作成方法 |
US6961704B1 (en) * | 2003-01-31 | 2005-11-01 | Speechworks International, Inc. | Linguistic prosodic model-based text to speech |
JP3950802B2 (ja) * | 2003-01-31 | 2007-08-01 | 株式会社エヌ・ティ・ティ・ドコモ | 顔情報送信システム、顔情報送信方法、顔情報送信プログラム、及びコンピュータ読取可能な記録媒体 |
US7280968B2 (en) * | 2003-03-25 | 2007-10-09 | International Business Machines Corporation | Synthetically generated speech responses including prosodic characteristics of speech inputs |
JP2004349851A (ja) * | 2003-05-20 | 2004-12-09 | Ntt Docomo Inc | 携帯端末、画像通信プログラム、及び画像通信方法 |
US20050144002A1 (en) * | 2003-12-09 | 2005-06-30 | Hewlett-Packard Development Company, L.P. | Text-to-speech conversion with associated mood tag |
CN1894740B (zh) * | 2003-12-12 | 2012-07-04 | 日本电气株式会社 | 信息处理系统、信息处理方法以及信息处理用程序 |
KR100571831B1 (ko) * | 2004-02-10 | 2006-04-17 | 삼성전자주식회사 | 음성 식별 장치 및 방법 |
US7472065B2 (en) * | 2004-06-04 | 2008-12-30 | International Business Machines Corporation | Generating paralinguistic phenomena via markup in text-to-speech synthesis |
GB2415518A (en) * | 2004-06-24 | 2005-12-28 | Sharp Kk | Method and apparatus for translation based on a repository of existing translations |
JP4328698B2 (ja) * | 2004-09-15 | 2009-09-09 | キヤノン株式会社 | 素片セット作成方法および装置 |
DE102004050785A1 (de) * | 2004-10-14 | 2006-05-04 | Deutsche Telekom Ag | Verfahren und Anordnung zur Bearbeitung von Nachrichten im Rahmen eines Integrated Messaging Systems |
US20060122834A1 (en) * | 2004-12-03 | 2006-06-08 | Bennett Ian M | Emotion detection device & method for use in distributed systems |
TWI281145B (en) * | 2004-12-10 | 2007-05-11 | Delta Electronics Inc | System and method for transforming text to speech |
WO2006123539A1 (ja) * | 2005-05-18 | 2006-11-23 | Matsushita Electric Industrial Co., Ltd. | 音声合成装置 |
JP3910628B2 (ja) * | 2005-06-16 | 2007-04-25 | 松下電器産業株式会社 | 音声合成装置、音声合成方法およびプログラム |
JP4559950B2 (ja) * | 2005-10-20 | 2010-10-13 | 株式会社東芝 | 韻律制御規則生成方法、音声合成方法、韻律制御規則生成装置、音声合成装置、韻律制御規則生成プログラム及び音声合成プログラム |
CA2536976A1 (en) * | 2006-02-20 | 2007-08-20 | Diaphonics, Inc. | Method and apparatus for detecting speaker change in a voice transaction |
US7983910B2 (en) | 2006-03-03 | 2011-07-19 | International Business Machines Corporation | Communicating across voice and text channels with emotion preservation |
US8032356B2 (en) * | 2006-05-25 | 2011-10-04 | University Of Southern California | Spoken translation system using meta information strings |
JP4175390B2 (ja) * | 2006-06-09 | 2008-11-05 | ソニー株式会社 | 情報処理装置、および情報処理方法、並びにコンピュータ・プログラム |
JP4085130B2 (ja) * | 2006-06-23 | 2008-05-14 | 松下電器産業株式会社 | 感情認識装置 |
US7860719B2 (en) * | 2006-08-19 | 2010-12-28 | International Business Machines Corporation | Disfluency detection for a speech-to-speech translation system using phrase-level machine translation with weighted finite state transducers |
US7860705B2 (en) * | 2006-09-01 | 2010-12-28 | International Business Machines Corporation | Methods and apparatus for context adaptation of speech-to-speech translation systems |
US8027837B2 (en) * | 2006-09-15 | 2011-09-27 | Apple Inc. | Using non-speech sounds during text-to-speech synthesis |
KR100859532B1 (ko) * | 2006-11-06 | 2008-09-24 | 한국전자통신연구원 | 대응 문형 패턴 기반 자동통역 방법 및 장치 |
US8438032B2 (en) * | 2007-01-09 | 2013-05-07 | Nuance Communications, Inc. | System for tuning synthesized speech |
JP4213755B2 (ja) | 2007-03-28 | 2009-01-21 | 株式会社東芝 | 音声翻訳装置、方法およびプログラム |
US20080300855A1 (en) * | 2007-05-31 | 2008-12-04 | Alibaig Mohammad Munwar | Method for realtime spoken natural language translation and apparatus therefor |
JP2009048003A (ja) * | 2007-08-21 | 2009-03-05 | Toshiba Corp | 音声翻訳装置及び方法 |
CN101399044B (zh) * | 2007-09-29 | 2013-09-04 | 纽奥斯通讯有限公司 | 语音转换方法和系统 |
US7996214B2 (en) * | 2007-11-01 | 2011-08-09 | At&T Intellectual Property I, L.P. | System and method of exploiting prosodic features for dialog act tagging in a discriminative modeling framework |
US8224652B2 (en) * | 2008-09-26 | 2012-07-17 | Microsoft Corporation | Speech and text driven HMM-based body animation synthesis |
US8571849B2 (en) * | 2008-09-30 | 2013-10-29 | At&T Intellectual Property I, L.P. | System and method for enriching spoken language translation with prosodic information |
CN101727904B (zh) * | 2008-10-31 | 2013-04-24 | 国际商业机器公司 | 语音翻译方法和装置 |
CN102237081B (zh) * | 2010-04-30 | 2013-04-24 | 国际商业机器公司 | 语音韵律评估方法与系统 |
-
2008
- 2008-10-31 CN CN2008101746288A patent/CN101727904B/zh not_active Expired - Fee Related
-
2009
- 2009-10-30 US US12/609,647 patent/US9342509B2/en active Active
Also Published As
Publication number | Publication date |
---|---|
US9342509B2 (en) | 2016-05-17 |
CN101727904A (zh) | 2010-06-09 |
US20100114556A1 (en) | 2010-05-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101727904B (zh) | 语音翻译方法和装置 | |
CN111667814B (zh) | 一种多语种的语音合成方法及装置 | |
CN101178896B (zh) | 基于声学统计模型的单元挑选语音合成方法 | |
CN1169115C (zh) | 语音合成系统及方法 | |
CA2351988C (en) | Method and system for preselection of suitable units for concatenative speech | |
US7590540B2 (en) | Method and system for statistic-based distance definition in text-to-speech conversion | |
US9865251B2 (en) | Text-to-speech method and multi-lingual speech synthesizer using the method | |
KR20170041105A (ko) | 음성 인식에서의 음향 점수 계산 장치 및 방법과, 음향 모델 학습 장치 및 방법 | |
CN101872615A (zh) | 用于分布式文本到话音合成以及可理解性的系统和方法 | |
CN109326280B (zh) | 一种歌唱合成方法及装置、电子设备 | |
CN103632663B (zh) | 一种基于hmm的蒙古语语音合成前端处理的方法 | |
JP2024505076A (ja) | 多様で自然なテキスト読み上げサンプルを生成する | |
TWI503813B (zh) | 可控制語速的韻律訊息產生裝置及語速相依之階層式韻律模組 | |
KR20100068965A (ko) | 자동 통역 장치 및 그 방법 | |
CN101178895A (zh) | 基于生成参数听感误差最小化的模型自适应方法 | |
KR100669241B1 (ko) | 화행 정보를 이용한 대화체 음성합성 시스템 및 방법 | |
WO2023197206A1 (en) | Personalized and dynamic text to speech voice cloning using incompletely trained text to speech models | |
CN112242134A (zh) | 语音合成方法及装置 | |
EP1589524B1 (en) | Method and device for speech synthesis | |
CN1979636B (zh) | 一种音标到语音的转换方法 | |
CN117636842B (zh) | 基于韵律情感迁移的语音合成系统及方法 | |
CN102752239B (zh) | 一种提供音库混合训练模型的方法和系统 | |
CN114420086B (zh) | 语音合成方法和装置 | |
CN115910033B (zh) | 一种语音的合成方法、装置、电子设备及可读存储介质 | |
US11335321B2 (en) | Building a text-to-speech system from a small amount of speech data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: NUANCE COMMUNICATIONS, INC. Free format text: FORMER OWNER: INTERNATIONAL BUSINESS MACHINES CORPORATION Effective date: 20140108 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20140108 Address after: Massachusetts, USA Patentee after: Nuance Communications, Inc. Address before: American New York Patentee before: International Business Machines Corp. |
|
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20130424 |