CN109920432B - 一种语音识别方法、装置、设备及存储介质 - Google Patents
一种语音识别方法、装置、设备及存储介质 Download PDFInfo
- Publication number
- CN109920432B CN109920432B CN201910163304.2A CN201910163304A CN109920432B CN 109920432 B CN109920432 B CN 109920432B CN 201910163304 A CN201910163304 A CN 201910163304A CN 109920432 B CN109920432 B CN 109920432B
- Authority
- CN
- China
- Prior art keywords
- text data
- data
- data table
- common
- voice
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 38
- 230000011218 segmentation Effects 0.000 claims description 13
- 238000004458 analytical method Methods 0.000 claims description 10
- 238000012545 processing Methods 0.000 claims description 8
- 238000004590 computer program Methods 0.000 claims description 3
- 230000015654 memory Effects 0.000 description 21
- 238000004891 communication Methods 0.000 description 9
- 230000006870 function Effects 0.000 description 8
- 230000003993 interaction Effects 0.000 description 8
- 238000010586 diagram Methods 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 4
- 230000009471 action Effects 0.000 description 3
- 238000012937 correction Methods 0.000 description 3
- 238000011022 operating instruction Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000003491 array Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000002093 peripheral effect Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000000802 evaporation-induced self-assembly Methods 0.000 description 1
- 239000013307 optical fiber Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/10—Speech classification or search using distance or distortion measures between unknown speech and reference templates
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/04—Segmentation; Word boundary detection
- G10L15/05—Word boundary detection
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1807—Speech classification or search using natural language modelling using prosody or stress
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Machine Translation (AREA)
- Telephonic Communication Services (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
常用文本数据 | 语义解析结果 |
打开车门 | 控制车门打开的操作指令 |
打开门 | 控制车门打开的操作指令 |
开启车门 | 控制车门打开的操作指令 |
打开车窗 | |
…… |
Claims (12)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910163304.2A CN109920432B (zh) | 2019-03-05 | 2019-03-05 | 一种语音识别方法、装置、设备及存储介质 |
US16/801,742 US11264034B2 (en) | 2019-03-05 | 2020-02-26 | Voice identification method, device, apparatus, and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910163304.2A CN109920432B (zh) | 2019-03-05 | 2019-03-05 | 一种语音识别方法、装置、设备及存储介质 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109920432A CN109920432A (zh) | 2019-06-21 |
CN109920432B true CN109920432B (zh) | 2024-06-18 |
Family
ID=66963254
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910163304.2A Active CN109920432B (zh) | 2019-03-05 | 2019-03-05 | 一种语音识别方法、装置、设备及存储介质 |
Country Status (2)
Country | Link |
---|---|
US (1) | US11264034B2 (zh) |
CN (1) | CN109920432B (zh) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110570842B (zh) * | 2019-10-25 | 2020-07-10 | 南京云白信息科技有限公司 | 基于音素近似度和发音标准度的语音识别方法及系统 |
CN111261166B (zh) * | 2020-01-15 | 2022-09-27 | 云知声智能科技股份有限公司 | 一种语音识别方法及装置 |
CN111399800A (zh) * | 2020-03-13 | 2020-07-10 | 胡勇军 | 一种语音输入法系统 |
CN113706977A (zh) * | 2020-08-13 | 2021-11-26 | 苏州韵果莘莘影视科技有限公司 | 基于译语智能手语翻译软件的播放方法及系统 |
CN113345442B (zh) * | 2021-06-30 | 2024-06-04 | 西安乾阳电子科技有限公司 | 语音识别方法、装置、电子设备及存储介质 |
CN114327355A (zh) * | 2021-12-30 | 2022-04-12 | 科大讯飞股份有限公司 | 语音输入方法、电子设备以及计算机存储介质 |
CN115547337B (zh) * | 2022-11-25 | 2023-03-03 | 深圳市人马互动科技有限公司 | 语音识别方法及相关产品 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107092606A (zh) * | 2016-02-18 | 2017-08-25 | 腾讯科技(深圳)有限公司 | 一种搜索方法、装置及服务器 |
CN107993654A (zh) * | 2017-11-24 | 2018-05-04 | 珠海格力电器股份有限公司 | 一种语音指令识别方法及系统 |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7103542B2 (en) * | 2001-12-14 | 2006-09-05 | Ben Franklin Patent Holding Llc | Automatically improving a voice recognition system |
KR100718147B1 (ko) * | 2005-02-01 | 2007-05-14 | 삼성전자주식회사 | 음성인식용 문법망 생성장치 및 방법과 이를 이용한 대화체음성인식장치 및 방법 |
CN1979638A (zh) * | 2005-12-02 | 2007-06-13 | 中国科学院自动化研究所 | 一种语音识别结果纠错方法 |
JP4869268B2 (ja) * | 2008-03-04 | 2012-02-08 | 日本放送協会 | 音響モデル学習装置およびプログラム |
CN103714048B (zh) * | 2012-09-29 | 2017-07-21 | 国际商业机器公司 | 用于校正文本的方法和系统 |
KR102247533B1 (ko) * | 2014-07-30 | 2021-05-03 | 삼성전자주식회사 | 음성 인식 장치 및 그 제어 방법 |
CN106340293B (zh) | 2015-07-06 | 2019-11-29 | 无锡天脉聚源传媒科技有限公司 | 一种音频数据识别结果的调整方法及装置 |
CN105389400B (zh) * | 2015-12-24 | 2020-02-14 | Tcl集团股份有限公司 | 语音交互方法及装置 |
CN107015969A (zh) * | 2017-05-19 | 2017-08-04 | 四川长虹电器股份有限公司 | 可自我更新的语义理解系统与方法 |
CN107679033B (zh) * | 2017-09-11 | 2021-12-14 | 百度在线网络技术(北京)有限公司 | 文本断句位置识别方法和装置 |
CN109065054A (zh) | 2018-08-31 | 2018-12-21 | 出门问问信息科技有限公司 | 语音识别纠错方法、装置、电子设备及可读存储介质 |
-
2019
- 2019-03-05 CN CN201910163304.2A patent/CN109920432B/zh active Active
-
2020
- 2020-02-26 US US16/801,742 patent/US11264034B2/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107092606A (zh) * | 2016-02-18 | 2017-08-25 | 腾讯科技(深圳)有限公司 | 一种搜索方法、装置及服务器 |
CN107993654A (zh) * | 2017-11-24 | 2018-05-04 | 珠海格力电器股份有限公司 | 一种语音指令识别方法及系统 |
Also Published As
Publication number | Publication date |
---|---|
US20200286486A1 (en) | 2020-09-10 |
CN109920432A (zh) | 2019-06-21 |
US11264034B2 (en) | 2022-03-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109920432B (zh) | 一种语音识别方法、装置、设备及存储介质 | |
CN107195295B (zh) | 基于中英文混合词典的语音识别方法及装置 | |
JP5901001B1 (ja) | 音響言語モデルトレーニングのための方法およびデバイス | |
CN110415705B (zh) | 一种热词识别方法、系统、装置及存储介质 | |
US6711542B2 (en) | Method of identifying a language and of controlling a speech synthesis unit and a communication device | |
CN107729313B (zh) | 基于深度神经网络的多音字读音的判别方法和装置 | |
CN111797632B (zh) | 信息处理方法、装置及电子设备 | |
EP3258417A1 (en) | Method and device for improving fingerprint template and terminal device | |
CN109710087B (zh) | 输入法模型生成方法及装置 | |
CN106897439A (zh) | 文本的情感识别方法、装置、服务器以及存储介质 | |
CN110188353B (zh) | 文本纠错方法及装置 | |
CN108682420A (zh) | 一种音视频通话方言识别方法及终端设备 | |
CN113506574A (zh) | 自定义命令词的识别方法、装置和计算机设备 | |
CN112256849B (zh) | 模型训练方法、文本检测方法、装置、设备和存储介质 | |
CN112509566B (zh) | 一种语音识别方法、装置、设备、存储介质及程序产品 | |
CN110689881A (zh) | 语音识别方法、装置、计算机设备和存储介质 | |
CN110309504B (zh) | 基于分词的文本处理方法、装置、设备及存储介质 | |
CN114420102B (zh) | 语音断句方法、装置、电子设备及存储介质 | |
CN113569021B (zh) | 用户分类的方法、计算机设备和可读存储介质 | |
CA2596126A1 (en) | Speech recognition by statistical language using square-root discounting | |
CN110929514B (zh) | 文本校对方法、装置、计算机可读存储介质及电子设备 | |
CN112559725A (zh) | 文本匹配方法、装置、终端和存储介质 | |
JP2000089786A (ja) | 音声認識結果の修正方法および装置 | |
US20100145677A1 (en) | System and Method for Making a User Dependent Language Model | |
CN112464644B (zh) | 自动断句模型建立方法及自动断句方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20211011 Address after: 100176 101, floor 1, building 1, yard 7, Ruihe West 2nd Road, economic and Technological Development Zone, Daxing District, Beijing Applicant after: Apollo Intelligent Connectivity (Beijing) Technology Co., Ltd. Address before: 100085 third floor, baidu building, No. 10, Shangdi 10th Street, Haidian District, Beijing Applicant before: BAIDU ONLINE NETWORK TECHNOLOGY (BEIJING) Co.,Ltd. |
|
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20240403 Address after: 510000 No. 106 Fengze East Road, Nansha District, Guangzhou City, Guangdong Province Applicant after: Guangzhou dinghang Information Technology Service Co.,Ltd. Country or region after: China Address before: 100176 Room 101, 1st floor, building 1, yard 7, Ruihe West 2nd Road, economic and Technological Development Zone, Daxing District, Beijing Applicant before: Apollo Intelligent Connectivity (Beijing) Technology Co., Ltd. Country or region before: China |
|
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20240520 Address after: 100176 No. 1, Zhonghe street, Beijing Economic and Technological Development Zone, Daxing District, Beijing Applicant after: CHINA UNICOM ONLINE INFORMATION TECHNOLOGY Co.,Ltd. Country or region after: China Address before: 510000 No. 106 Fengze East Road, Nansha District, Guangzhou City, Guangdong Province Applicant before: Guangzhou dinghang Information Technology Service Co.,Ltd. Country or region before: China |
|
TA01 | Transfer of patent application right | ||
GR01 | Patent grant |