CN106710585A - Method and system for broadcasting polyphonic characters during voice interaction - Google Patents
Method and system for broadcasting polyphonic characters during voice interaction
- Publication number: CN106710585A
- Application number: CN201611199610.4A
- Authority: CN (China)
- Prior art keywords: information, polyphone, module, voice, feedback information
- Legal status: Granted (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Classifications
- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
      - G10L13/00—Speech synthesis; Text to speech systems
        - G10L13/02—Methods for producing synthetic speech; Speech synthesisers
        - G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
      - G10L15/00—Speech recognition
        - G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
          - G10L2015/025—Phonemes, fenemes or fenones being the recognition units
        - G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
          - G10L15/063—Training
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Electrically Operated Instructional Devices (AREA)
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611199610.4A CN106710585B (zh) | 2016-12-22 | 2016-12-22 | Method and system for broadcasting polyphonic characters during voice interaction |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611199610.4A CN106710585B (zh) | 2016-12-22 | 2016-12-22 | Method and system for broadcasting polyphonic characters during voice interaction |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106710585A (zh) | 2017-05-24 |
CN106710585B (zh) | 2019-11-08 |
Family
ID=58902972
Family Applications (1)
Application Number | Status | Publication | Priority Date | Filing Date | Title |
---|---|---|---|---|---|
CN201611199610.4A | Active | CN106710585B (zh) | 2016-12-22 | 2016-12-22 | Method and system for broadcasting polyphonic characters during voice interaction |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106710585B (zh) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
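CN1612209A (zh) * | 2003-10-29 | 2005-05-04 | 何佩娟 | Method and device for entering telephone-number entries by voice |
CN1697019A (zh) * | 2004-05-13 | 2005-11-16 | 深圳市移动核软件有限公司 | Method for automatically pronouncing Chinese characters and method for making a mobile phone read short messages aloud |
CN101033977A (zh) * | 2007-04-18 | 2007-09-12 | 江苏新科数字技术有限公司 | Voice navigation method for a navigator |
CN101324884A (zh) * | 2008-07-29 | 2008-12-17 | 无敌科技(西安)有限公司 | Method for pronouncing polyphonic characters |
CN103456297A (zh) * | 2012-05-29 | 2013-12-18 | 中国移动通信集团公司 | Method and device for speech recognition matching |
CN105336322A (zh) * | 2015-09-30 | 2016-02-17 | 百度在线网络技术(北京)有限公司 | Polyphone model training method, and speech synthesis method and device |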
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
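CN108364652A (zh) * | 2018-01-16 | 2018-08-03 | 成都易讯呼科技有限公司 | Intelligent voice question-and-answer interaction control system for artificial-intelligence telephones |
CN109616111A (zh) * | 2018-12-24 | 2019-04-12 | 北京恒泰实达科技股份有限公司 | Scene interaction control method based on speech recognition |
CN109616111B (zh) * | 2018-12-24 | 2023-03-14 | 北京恒泰实达科技股份有限公司 | Scene interaction control method based on speech recognition |
CN110032626A (zh) * | 2019-04-19 | 2019-07-19 | 百度在线网络技术(北京)有限公司 | Voice broadcast method and device |
CN110032626B (zh) * | 2019-04-19 | 2022-04-12 | 百度在线网络技术(北京)有限公司 | Voice broadcast method and device |
CN110277085A (zh) * | 2019-06-25 | 2019-09-24 | 腾讯科技(深圳)有限公司 | Method and device for determining the pronunciation of polyphonic characters |
CN110277085B (zh) * | 2019-06-25 | 2021-08-24 | 腾讯科技(深圳)有限公司 | Method and device for determining the pronunciation of polyphonic characters |
CN110264994A (zh) * | 2019-07-02 | 2019-09-20 | 珠海格力电器股份有限公司 | Speech synthesis method, electronic device, and smart home system |
CN110264994B (zh) * | 2019-07-02 | 2021-08-20 | 珠海格力电器股份有限公司 | Speech synthesis method, electronic device, and smart home system |
CN111128186B (zh) * | 2019-12-30 | 2022-06-17 | 云知声智能科技股份有限公司 | Method and device for phonetic annotation of polyphonic characters |
CN111128186A (zh) * | 2019-12-30 | 2020-05-08 | 云知声智能科技股份有限公司 | Method and device for phonetic annotation of polyphonic characters |
CN112259092A (zh) * | 2020-10-15 | 2021-01-22 | 深圳市同行者科技有限公司 | Voice broadcast method and device, and voice interaction equipment |
CN112259092B (zh) * | 2020-10-15 | 2023-09-01 | 深圳市同行者科技有限公司 | Voice broadcast method and device, and voice interaction equipment |
CN113658586A (zh) * | 2021-08-13 | 2021-11-16 | 北京百度网讯科技有限公司 | Training method for a speech recognition model, and voice interaction method and device |
CN113658586B (zh) * | 2021-08-13 | 2024-04-09 | 北京百度网讯科技有限公司 | Training method for a speech recognition model, and voice interaction method and device |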
Also Published As
Publication number | Publication date |
---|---|
CN106710585B (zh) | 2019-11-08 |
Similar Documents
Publication | Title |
---|---|
US11496582B2 (en) | Generation of automated message responses |
US11264030B2 (en) | Indicator for voice-based communications |
CN106710585B (zh) | Method and system for broadcasting polyphonic characters during voice interaction |
US10140973B1 (en) | Text-to-speech processing using previously speech processed data |
US10074363B2 (en) | Method and apparatus for keyword speech recognition |
US10074369B2 (en) | Voice-based communications |
US10453449B2 (en) | Indicator for voice-based communications |
US10917758B1 (en) | Voice-based messaging |
Ramani et al. | A common attribute based unified HTS framework for speech synthesis in Indian languages |
US20080177543A1 (en) | Stochastic Syllable Accent Recognition |
Prahallad et al. | Sub-phonetic modeling for capturing pronunciation variations for conversational speech synthesis |
CN105654943A (zh) | Voice wake-up method, device, and system |
US11798559B2 (en) | Voice-controlled communication requests and responses |
JPH0922297A (ja) | Method and apparatus for speech-to-text conversion |
CN108305611B (zh) | Text-to-speech method, device, storage medium, and computer equipment |
WO2018045154A1 (en) | Voice-based communications |
US11176943B2 (en) | Voice recognition device, voice recognition method, and computer program product |
JP2000172294A (ja) | Speech recognition method, device therefor, and program recording medium |
CN114822489A (zh) | Text transcription method and text transcription device |
KR100806287B1 (ko) | Method for predicting sentence-final intonation, and speech synthesis method and system based thereon |
CN110310620B (zh) | Speech fusion method based on native-pronunciation reinforcement learning |
JPH10173769A (ja) | Voice message retrieval device |
JP2004347732A (ja) | Automatic language identification method and device |
JP3727436B2 (ja) | Device and method for optimal matching of speech to manuscript |
Barnard et al. | Phone recognition for spoken web search |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
TA01 | Transfer of patent application right | Effective date of registration: 2017-09-29; Address after: 200233 Shanghai City, Xuhui District Guangxi 65 No. 1 Jinglu room 702 unit 03; Applicant after: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Address before: 200233 Shanghai, Qinzhou, North Road, No. 82, building 2, layer 1198; Applicant before: SHANGHAI YUZHIYI INFORMATION TECHNOLOGY Co.,Ltd. |
GR01 | Patent grant | |
PE01 | Entry into force of the registration of the contract for pledge of patent right | Denomination of invention: Method and system of polyphone broadcasting in speech interaction; Effective date of registration: 2020-12-01; Granted publication date: 2019-11-08; Pledgee: Bank of Hangzhou Limited by Share Ltd. Shanghai branch; Pledgor: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY Co.,Ltd.; Registration number: Y2020310000047 |
PC01 | Cancellation of the registration of the contract for pledge of patent right | Date of cancellation: 2022-03-07; Granted publication date: 2019-11-08; Pledgee: Bank of Hangzhou Limited by Share Ltd. Shanghai branch; Pledgor: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Registration number: Y2020310000047 |
PE01 | Entry into force of the registration of the contract for pledge of patent right | Denomination of invention: The method and system of polyphonic broadcasting in the process of voice interaction; Effective date of registration: 2023-02-10; Granted publication date: 2019-11-08; Pledgee: Bank of Hangzhou Limited by Share Ltd. Shanghai branch; Pledgor: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Registration number: Y2023310000028 |
PC01 | Cancellation of the registration of the contract for pledge of patent right | Granted publication date: 2019-11-08; Pledgee: Bank of Hangzhou Limited by Share Ltd. Shanghai branch; Pledgor: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Registration number: Y2023310000028 |
PE01 | Entry into force of the registration of the contract for pledge of patent right | Denomination of invention: The method and system for broadcasting polyphonic characters in the process of voice interaction; Granted publication date: 2019-11-08; Pledgee: Bank of Hangzhou Limited by Share Ltd. Shanghai branch; Pledgor: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Registration number: Y2024310000165 |