CN110970027A - Speech recognition method, device, computer storage medium and system - Google Patents
Speech recognition method, device, computer storage medium and system
- Publication number
- CN110970027A (application number CN201911355864.4A)
- Authority
- CN
- China
- Prior art keywords
- terminal
- voice information
- played
- audio data
- voice
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 45
- 238000001514 detection method Methods 0.000 claims abstract description 37
- 230000015654 memory Effects 0.000 claims description 35
- 238000004590 computer program Methods 0.000 claims description 12
- 239000000126 substance Substances 0.000 claims description 3
- 238000005516 engineering process Methods 0.000 description 12
- 238000004422 calculation algorithm Methods 0.000 description 8
- 230000001360 synchronised effect Effects 0.000 description 6
- 238000013473 artificial intelligence Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000003068 static effect Effects 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/34—Adaptation of a single recogniser for parallel processing, e.g. by use of multiple processors or cloud computing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/06—Decision making techniques; Pattern matching strategies
- G10L17/14—Use of phonemic categorisation or speech recognition prior to speaker recognition or verification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/225—Feedback of the input speech
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Theoretical Computer Science (AREA)
- Business, Economics & Management (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Game Theory and Decision Science (AREA)
- Signal Processing (AREA)
- Telephonic Communication Services (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911355864.4A CN110970027B (zh) | 2019-12-25 | 2019-12-25 | Speech recognition method, device, computer storage medium and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911355864.4A CN110970027B (zh) | 2019-12-25 | 2019-12-25 | Speech recognition method, device, computer storage medium and system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110970027A (zh) | 2020-04-07 |
CN110970027B (zh) | 2023-07-25 |
Family
ID=70036337
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911355864.4A Active CN110970027B (zh) | Speech recognition method, device, computer storage medium and system | 2019-12-25 | 2019-12-25 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110970027B (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
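CN111524529A (zh) * | 2020-04-15 | 2020-08-11 | Guangzhou Xaircraft Technology Co., Ltd. | Audio data processing method, device and system, electronic equipment and storage medium |
CN115050366A (zh) * | 2022-07-08 | 2022-09-13 | Hozon New Energy Automobile Co., Ltd. | Speech recognition method, device and computer storage medium |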
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020123892A1 (en) * | 2001-03-01 | 2002-09-05 | International Business Machines Corporation | Detecting speech recognition errors in an embedded speech recognition system |
US20070106507A1 (en) * | 2005-11-09 | 2007-05-10 | International Business Machines Corporation | Noise playback enhancement of prerecorded audio for speech recognition operations |
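CN102917119A (zh) * | 2012-09-19 | 2013-02-06 | Dongguan Yulong Communication Technology Co., Ltd. | Method and system for processing music on a mobile terminal based on speech recognition |
CN106098054A (zh) * | 2016-06-13 | 2016-11-09 | Huizhou TCL Mobile Communication Co., Ltd. | Device and method for filtering loudspeaker noise in speech recognition |
CN106409294A (zh) * | 2016-10-18 | 2017-02-15 | Guangzhou Shiyuan Electronics Co., Ltd. | Method and device for preventing misrecognition of voice commands |
CN108447471A (zh) * | 2017-02-15 | 2018-08-24 | Tencent Technology (Shenzhen) Company Limited | Speech recognition method and speech recognition device |
CN109389976A (zh) * | 2018-09-27 | 2019-02-26 | Gree Electric Appliances, Inc. of Zhuhai | Smart home appliance control method and device, smart home appliance, and storage medium |
JP2019079070A (ja) * | 2019-01-28 | 2019-05-23 | Nippon Telegraph and Telephone Corporation | Speech recognition device, speech recognition method, and speech recognition program |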
US20190341035A1 (en) * | 2018-05-01 | 2019-11-07 | International Business Machines Corporation | Ignoring trigger words in streamed media content |
-
2019
- 2019-12-25 CN CN201911355864.4A patent/CN110970027B/zh active Active
Patent Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020123892A1 (en) * | 2001-03-01 | 2002-09-05 | International Business Machines Corporation | Detecting speech recognition errors in an embedded speech recognition system |
US20070106507A1 (en) * | 2005-11-09 | 2007-05-10 | International Business Machines Corporation | Noise playback enhancement of prerecorded audio for speech recognition operations |
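CN102917119A (zh) * | 2012-09-19 | 2013-02-06 | Dongguan Yulong Communication Technology Co., Ltd. | Method and system for processing music on a mobile terminal based on speech recognition |
CN106098054A (zh) * | 2016-06-13 | 2016-11-09 | Huizhou TCL Mobile Communication Co., Ltd. | Device and method for filtering loudspeaker noise in speech recognition |
CN106409294A (zh) * | 2016-10-18 | 2017-02-15 | Guangzhou Shiyuan Electronics Co., Ltd. | Method and device for preventing misrecognition of voice commands |
CN108447471A (zh) * | 2017-02-15 | 2018-08-24 | Tencent Technology (Shenzhen) Company Limited | Speech recognition method and speech recognition device |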
US20190295534A1 (en) * | 2017-02-15 | 2019-09-26 | Tencent Technology (Shenzhen) Company Limited | Speech recognition method, electronic device, and computer storage medium |
US20190341035A1 (en) * | 2018-05-01 | 2019-11-07 | International Business Machines Corporation | Ignoring trigger words in streamed media content |
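CN109389976A (zh) * | 2018-09-27 | 2019-02-26 | Gree Electric Appliances, Inc. of Zhuhai | Smart home appliance control method and device, smart home appliance, and storage medium |
JP2019079070A (ja) * | 2019-01-28 | 2019-05-23 | Nippon Telegraph and Telephone Corporation | Speech recognition device, speech recognition method, and speech recognition program |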
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
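CN111524529A (zh) * | 2020-04-15 | 2020-08-11 | Guangzhou Xaircraft Technology Co., Ltd. | Audio data processing method, device and system, electronic equipment and storage medium |
CN111524529B (zh) * | 2020-04-15 | 2023-11-24 | Guangzhou Xaircraft Technology Co., Ltd. | Audio data processing method, device and system, electronic equipment and storage medium |
CN115050366A (zh) * | 2022-07-08 | 2022-09-13 | Hozon New Energy Automobile Co., Ltd. | Speech recognition method, device and computer storage medium |
CN115050366B (zh) * | 2022-07-08 | 2024-05-17 | Hozon New Energy Automobile Co., Ltd. | Speech recognition method, device and computer storage medium |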
Also Published As
Publication number | Publication date |
---|---|
CN110970027B (zh) | 2023-07-25 |
Similar Documents
Publication | Title |
---|---|
WO2018188586A1 | User registration method and device, and electronic device |
US10819811B2 | Accumulation of real-time crowd sourced data for inferring metadata about entities |
US8700194B2 | Robust media fingerprints |
CN102568478B | Video playback control method and system based on speech recognition |
US7908338B2 | Content retrieval method and apparatus, communication system and communication method |
WO2017160498A1 | Audio scripts for various content |
CN102486920A | Audio event detection method and device |
JP2007534995A | Method and system for classifying audio signals |
KR20160106075A | Method and device for identifying a piece of music in an audio stream |
CN110970027B | Speech recognition method, device, computer storage medium and system |
US20200013422A1 | System, Method, and Apparatus for Morphing of an Audio Track |
CN116343771A | Knowledge-graph-based method and device for recognizing music-on-demand voice commands |
Hajihashemi et al. | Novel time-frequency based scheme for detecting sound events from sound background in audio segments |
CN111859008A | Method and terminal for recommending music |
CN111933176B | Method and device for locating speech content in batches |
KR101002732B1 | Online digital content management system |
CN109710798B | Method and device for evaluating the performance of a musical piece |
CN109377988B | Interaction method, medium, device and computing equipment for a smart speaker |
CN115699168A | Voiceprint management method and device |
CN113392262A | Music recognition method, recommendation method, apparatus, device and storage medium |
KR20150078239A | Method for providing sound source information through media applied to a vehicle |
CN114242120B | Audio editing method and audio marking method based on DTMF technology |
Huang et al. | VPCID—A VoIP phone call identification database |
JPH1051337A | FM text multiplex broadcast recording control program device |
US20240105203A1 | Enhanced audio file generator |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
CB03 | Change of inventor or designer information | Inventor after: Ying Zhenkai. Inventor before: Ying Yilun. |
SE01 | Entry into force of request for substantive examination | |
CB02 | Change of applicant information | Address after: Room 208, building 4, 1411 Yecheng Road, Jiading District, Shanghai, 201821. Applicant after: Botai vehicle networking technology (Shanghai) Co.,Ltd. Address before: Room 208, building 4, 1411 Yecheng Road, Jiading District, Shanghai, 201821. Applicant before: SHANGHAI PATEO ELECTRONIC EQUIPMENT MANUFACTURING Co.,Ltd. |
GR01 | Patent grant | |
CP03 | Change of name, title or address | Address after: Room 3701, No. 866 East Changzhi Road, Hongkou District, Shanghai, 200080. Patentee after: Botai vehicle networking technology (Shanghai) Co.,Ltd. Country or region after: China. Address before: Room 208, building 4, 1411 Yecheng Road, Jiading District, Shanghai, 201821. Patentee before: Botai vehicle networking technology (Shanghai) Co.,Ltd. Country or region before: China. |