CN1243339C - 为拼接的文语转换声音确定未对准语音单元的方法和系统 - Google Patents
为拼接的文语转换声音确定未对准语音单元的方法和系统 Download PDFInfo
- Publication number
- CN1243339C CN1243339C CN200410037463.1A CN200410037463A CN1243339C CN 1243339 C CN1243339 C CN 1243339C CN 200410037463 A CN200410037463 A CN 200410037463A CN 1243339 C CN1243339 C CN 1243339C
- Authority
- CN
- China
- Prior art keywords
- unit
- voice
- voice unit
- abnormal
- suspicious
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 52
- 238000001914 filtration Methods 0.000 claims abstract description 9
- 230000002159 abnormal effect Effects 0.000 claims description 99
- 230000006870 function Effects 0.000 claims description 20
- 239000000203 mixture Substances 0.000 claims description 14
- 238000006243 chemical reaction Methods 0.000 claims description 10
- 230000008569 process Effects 0.000 claims description 10
- 239000000284 extract Substances 0.000 claims description 6
- 238000012790 confirmation Methods 0.000 claims description 5
- 230000005856 abnormality Effects 0.000 abstract 4
- 238000013500 data storage Methods 0.000 description 28
- 238000001514 detection method Methods 0.000 description 11
- 238000005516 engineering process Methods 0.000 description 10
- 238000010586 diagram Methods 0.000 description 4
- 230000001960 triggered effect Effects 0.000 description 4
- 230000008859 change Effects 0.000 description 3
- 238000004590 computer program Methods 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 3
- 230000011218 segmentation Effects 0.000 description 3
- 230000007488 abnormal function Effects 0.000 description 2
- 230000010365 information processing Effects 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 210000004704 glottis Anatomy 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000002386 leaching Methods 0.000 description 1
- 230000005055 memory storage Effects 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephonic Communication Services (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (22)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/630,113 | 2003-07-30 | ||
US10/630,113 US7280967B2 (en) | 2003-07-30 | 2003-07-30 | Method for detecting misaligned phonetic units for a concatenative text-to-speech voice |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1577489A CN1577489A (zh) | 2005-02-09 |
CN1243339C true CN1243339C (zh) | 2006-02-22 |
Family
ID=34103774
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200410037463.1A Expired - Fee Related CN1243339C (zh) | 2003-07-30 | 2004-04-29 | 为拼接的文语转换声音确定未对准语音单元的方法和系统 |
Country Status (2)
Country | Link |
---|---|
US (1) | US7280967B2 (zh) |
CN (1) | CN1243339C (zh) |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4150645B2 (ja) * | 2003-08-27 | 2008-09-17 | 株式会社ケンウッド | 音声ラベリングエラー検出装置、音声ラベリングエラー検出方法及びプログラム |
TWI220511B (en) * | 2003-09-12 | 2004-08-21 | Ind Tech Res Inst | An automatic speech segmentation and verification system and its method |
US20080306727A1 (en) * | 2005-03-07 | 2008-12-11 | Linguatec Sprachtechnologien Gmbh | Hybrid Machine Translation System |
JP2006323538A (ja) * | 2005-05-17 | 2006-11-30 | Yokogawa Electric Corp | 異常監視システムおよび異常監視方法 |
US7630898B1 (en) * | 2005-09-27 | 2009-12-08 | At&T Intellectual Property Ii, L.P. | System and method for preparing a pronunciation dictionary for a text-to-speech voice |
US7693716B1 (en) | 2005-09-27 | 2010-04-06 | At&T Intellectual Property Ii, L.P. | System and method of developing a TTS voice |
US7711562B1 (en) * | 2005-09-27 | 2010-05-04 | At&T Intellectual Property Ii, L.P. | System and method for testing a TTS voice |
US7742919B1 (en) * | 2005-09-27 | 2010-06-22 | At&T Intellectual Property Ii, L.P. | System and method for repairing a TTS voice database |
US7742921B1 (en) | 2005-09-27 | 2010-06-22 | At&T Intellectual Property Ii, L.P. | System and method for correcting errors when generating a TTS voice |
JP5238205B2 (ja) * | 2007-09-07 | 2013-07-17 | ニュアンス コミュニケーションズ,インコーポレイテッド | 音声合成システム、プログラム及び方法 |
US20090172546A1 (en) * | 2007-12-31 | 2009-07-02 | Motorola, Inc. | Search-based dynamic voice activation |
US20140047332A1 (en) * | 2012-08-08 | 2014-02-13 | Microsoft Corporation | E-reader systems |
CN103903633B (zh) | 2012-12-27 | 2017-04-12 | 华为技术有限公司 | 检测语音信号的方法和装置 |
CN104795077B (zh) * | 2015-03-17 | 2018-02-02 | 北京航空航天大学 | 一种检验语音标注质量的一致性检测方法 |
CN108877765A (zh) * | 2018-05-31 | 2018-11-23 | 百度在线网络技术(北京)有限公司 | 语音拼接合成的处理方法及装置、计算机设备及可读介质 |
CN109166569B (zh) * | 2018-07-25 | 2020-01-31 | 北京海天瑞声科技股份有限公司 | 音素误标注的检测方法和装置 |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5142677A (en) * | 1989-05-04 | 1992-08-25 | Texas Instruments Incorporated | Context switching devices, systems and methods |
US5727125A (en) * | 1994-12-05 | 1998-03-10 | Motorola, Inc. | Method and apparatus for synthesis of speech excitation waveforms |
US5848163A (en) * | 1996-02-02 | 1998-12-08 | International Business Machines Corporation | Method and apparatus for suppressing background music or noise from the speech input of a speech recognizer |
US5937384A (en) * | 1996-05-01 | 1999-08-10 | Microsoft Corporation | Method and system for speech recognition using continuous density hidden Markov models |
US5884267A (en) * | 1997-02-24 | 1999-03-16 | Digital Equipment Corporation | Automated speech alignment for image synthesis |
WO2000030069A2 (en) * | 1998-11-13 | 2000-05-25 | Lernout & Hauspie Speech Products N.V. | Speech synthesis using concatenation of speech waveforms |
US6202049B1 (en) * | 1999-03-09 | 2001-03-13 | Matsushita Electric Industrial Co., Ltd. | Identification of unit overlap regions for concatenative speech synthesis system |
US6529866B1 (en) * | 1999-11-24 | 2003-03-04 | The United States Of America As Represented By The Secretary Of The Navy | Speech recognition system and associated methods |
US6792407B2 (en) * | 2001-03-30 | 2004-09-14 | Matsushita Electric Industrial Co., Ltd. | Text selection and recording by feedback and adaptation for development of personalized text-to-speech systems |
US7010488B2 (en) * | 2002-05-09 | 2006-03-07 | Oregon Health & Science University | System and method for compressing concatenative acoustic inventories for speech synthesis |
-
2003
- 2003-07-30 US US10/630,113 patent/US7280967B2/en active Active
-
2004
- 2004-04-29 CN CN200410037463.1A patent/CN1243339C/zh not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
US7280967B2 (en) | 2007-10-09 |
US20050027531A1 (en) | 2005-02-03 |
CN1577489A (zh) | 2005-02-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1243339C (zh) | 为拼接的文语转换声音确定未对准语音单元的方法和系统 | |
CN103035247B (zh) | 基于声纹信息对音频/视频文件进行操作的方法及装置 | |
DE60211197T2 (de) | Verfahren und vorrichtung zur wandlung gesprochener in geschriebene texte und korrektur der erkannten texte | |
EP2506252B1 (en) | Topic specific models for text formatting and speech recognition | |
CN101076851B (zh) | 口语识别系统以及用于训练和操作该系统的方法 | |
US7818308B2 (en) | System and method for document section segmentation | |
US20050144184A1 (en) | System and method for document section segmentation | |
CN112632326B (zh) | 一种基于视频脚本语义识别的视频生产方法及装置 | |
KR20070121810A (ko) | 복합 뉴스 스토리 합성 | |
CA2423033A1 (en) | A document categorisation system | |
CN106295717A (zh) | 一种基于稀疏表示和机器学习的西洋乐器分类方法 | |
CN110619115A (zh) | 一种模板创建方法、装置、电子设备及存储介质 | |
CN110428811A (zh) | 一种数据处理方法、装置及电子设备 | |
CN110942765B (zh) | 一种构建语料库的方法、设备、服务器和存储介质 | |
CN106897379B (zh) | 语音文件的lrc时间轴文件自动生成方法及相关设备 | |
Buist et al. | Automatic Summarization of Meeting Data: A Feasibility Study. | |
US20020184019A1 (en) | Method of using empirical substitution data in speech recognition | |
CN116246598A (zh) | 一种基于片段式的多阶段自动音准评分方法 | |
CN112231512B (zh) | 歌曲标注检测方法、装置和系统及存储介质 | |
CN100389421C (zh) | 一种快速构造用于关键词检出任务的语音数据库的方法 | |
JP2008083434A (ja) | 音声学習支援装置及び音声学習支援プログラム | |
CN114783424A (zh) | 文本语料筛选方法、装置、设备及存储介质 | |
CN1371090A (zh) | 一种将语音文件转换成文本文件的方法 | |
CN1074552C (zh) | 使用者行为记录装置及其方法 | |
Clavel et al. | Fear-type emotions of the SAFE Corpus: annotation issues. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: NEW ANST COMMUNICATION CO.,LTD. Free format text: FORMER OWNER: INTERNATIONAL BUSINESS MACHINE CORP. Effective date: 20090703 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20090703 Address after: Massachusetts, USA Patentee after: Nuance Communications Inc Address before: American New York Patentee before: International Business Machines Corp. |
|
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20060222 Termination date: 20170429 |