CN107293293A - 一种语音指令识别方法、系统及机器人 - Google Patents
一种语音指令识别方法、系统及机器人 Download PDFInfo
- Publication number
- CN107293293A CN107293293A CN201710364233.3A CN201710364233A CN107293293A CN 107293293 A CN107293293 A CN 107293293A CN 201710364233 A CN201710364233 A CN 201710364233A CN 107293293 A CN107293293 A CN 107293293A
- Authority
- CN
- China
- Prior art keywords
- environment
- voice
- speech
- voice print
- speech data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 30
- 239000000284 extract Substances 0.000 claims abstract description 19
- 238000000605 extraction Methods 0.000 claims abstract description 18
- 230000001755 vocal effect Effects 0.000 claims description 42
- 238000011946 reduction process Methods 0.000 claims description 8
- 230000009467 reduction Effects 0.000 claims description 7
- 230000005534 acoustic noise Effects 0.000 description 15
- 230000008569 process Effects 0.000 description 9
- 230000006870 function Effects 0.000 description 8
- 230000005540 biological transmission Effects 0.000 description 7
- 230000001934 delay Effects 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 6
- 230000004044 response Effects 0.000 description 6
- 238000003860 storage Methods 0.000 description 5
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 238000001914 filtration Methods 0.000 description 3
- 238000004891 communication Methods 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 230000007474 system interaction Effects 0.000 description 2
- 241000208340 Araliaceae Species 0.000 description 1
- 235000005035 Panax pseudoginseng ssp. pseudoginseng Nutrition 0.000 description 1
- 235000003140 Panax quinquefolius Nutrition 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 238000004378 air conditioning Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000013075 data extraction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 235000008434 ginseng Nutrition 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000002045 lasting effect Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/22—Interactive procedures; Man-machine interfaces
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/233—Processing of audio elementary streams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710364233.3A CN107293293A (zh) | 2017-05-22 | 2017-05-22 | 一种语音指令识别方法、系统及机器人 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710364233.3A CN107293293A (zh) | 2017-05-22 | 2017-05-22 | 一种语音指令识别方法、系统及机器人 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107293293A true CN107293293A (zh) | 2017-10-24 |
Family
ID=60095151
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710364233.3A Pending CN107293293A (zh) | 2017-05-22 | 2017-05-22 | 一种语音指令识别方法、系统及机器人 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107293293A (zh) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108053828A (zh) * | 2017-12-25 | 2018-05-18 | 无锡小天鹅股份有限公司 | 确定控制指令的方法、装置和家用电器 |
CN108062949A (zh) * | 2017-12-11 | 2018-05-22 | 广州朗国电子科技有限公司 | 语音控制跑步机的方法及装置 |
CN108389578A (zh) * | 2018-02-09 | 2018-08-10 | 深圳市鹰硕技术有限公司 | 智能教室语音控制系统 |
CN108962235A (zh) * | 2017-12-27 | 2018-12-07 | 北京猎户星空科技有限公司 | 语音交互方法及装置 |
CN109524013A (zh) * | 2018-12-18 | 2019-03-26 | 北京猎户星空科技有限公司 | 一种语音处理方法、装置、介质和智能设备 |
CN110730274A (zh) * | 2019-10-17 | 2020-01-24 | 厦门快商通科技股份有限公司 | 语音抓包解析方法、系统、移动终端及存储介质 |
CN111009239A (zh) * | 2019-11-18 | 2020-04-14 | 北京小米移动软件有限公司 | 回声消除方法、回声消除装置及电子设备 |
CN111341325A (zh) * | 2020-02-13 | 2020-06-26 | 平安科技(深圳)有限公司 | 声纹识别方法、装置、存储介质、电子装置 |
CN111583934A (zh) * | 2020-04-30 | 2020-08-25 | 联想(北京)有限公司 | 一种数据处理方法及装置 |
CN112687274A (zh) * | 2019-10-17 | 2021-04-20 | 北京猎户星空科技有限公司 | 一种语音信息的处理方法、装置、设备及介质 |
CN116021250A (zh) * | 2023-03-29 | 2023-04-28 | 清华大学 | 智能装配系统 |
Citations (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101938610A (zh) * | 2010-09-27 | 2011-01-05 | 冠捷显示科技(厦门)有限公司 | 一种基于声纹识别的新型电视装置 |
US20110246495A1 (en) * | 2010-04-01 | 2011-10-06 | Sony Computer Entertainment Inc. | Media fingerprinting for social networking |
US20120323796A1 (en) * | 2011-06-17 | 2012-12-20 | Sanjay Udani | Methods and systems for recording verifiable documentation |
CN102843599A (zh) * | 2012-09-27 | 2012-12-26 | 北京导视互动网络技术有限公司 | 电视节目的互动方法及系统 |
CN103442290A (zh) * | 2013-08-15 | 2013-12-11 | 安徽科大讯飞信息科技股份有限公司 | 基于电视终端用户及语音的信息提供方法及系统 |
CN103607609A (zh) * | 2013-11-27 | 2014-02-26 | Tcl集团股份有限公司 | 一种电视机频道的语音切换方法和装置 |
CN103871419A (zh) * | 2012-12-11 | 2014-06-18 | 联想(北京)有限公司 | 一种信息处理方法及电子设备 |
US20150020087A1 (en) * | 2013-07-10 | 2015-01-15 | Anthony Rose | System for Identifying Features in a Television Signal |
CN104796729A (zh) * | 2015-04-09 | 2015-07-22 | 宁波创视信息技术有限公司 | 高清晰实时获取电视播放画面的方法 |
CN104796751A (zh) * | 2015-04-23 | 2015-07-22 | 福州大学 | 一种电视信号识别的方法及装置 |
US20160050457A1 (en) * | 2014-08-14 | 2016-02-18 | Sandipan Mondal | Method and system for tv channel content management and monetization based on content fingerprinting using a portable computing and communications device |
CN105701686A (zh) * | 2016-01-23 | 2016-06-22 | 北京掌阔移动传媒科技有限公司 | 一种声纹广告实现方法和装置 |
US9531993B1 (en) * | 2012-06-22 | 2016-12-27 | Google Inc. | Dynamic companion online campaign for television content |
CN106486130A (zh) * | 2015-08-25 | 2017-03-08 | 百度在线网络技术(北京)有限公司 | 噪声消除、语音识别方法及装置 |
US9646628B1 (en) * | 2015-06-26 | 2017-05-09 | Amazon Technologies, Inc. | Noise cancellation for open microphone mode |
CN106653024A (zh) * | 2016-12-30 | 2017-05-10 | 首都师范大学 | 语音控制方法和装置、平衡车控制方法和装置与平衡车 |
-
2017
- 2017-05-22 CN CN201710364233.3A patent/CN107293293A/zh active Pending
Patent Citations (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110246495A1 (en) * | 2010-04-01 | 2011-10-06 | Sony Computer Entertainment Inc. | Media fingerprinting for social networking |
CN101938610A (zh) * | 2010-09-27 | 2011-01-05 | 冠捷显示科技(厦门)有限公司 | 一种基于声纹识别的新型电视装置 |
US20120323796A1 (en) * | 2011-06-17 | 2012-12-20 | Sanjay Udani | Methods and systems for recording verifiable documentation |
US9531993B1 (en) * | 2012-06-22 | 2016-12-27 | Google Inc. | Dynamic companion online campaign for television content |
CN102843599A (zh) * | 2012-09-27 | 2012-12-26 | 北京导视互动网络技术有限公司 | 电视节目的互动方法及系统 |
CN103871419A (zh) * | 2012-12-11 | 2014-06-18 | 联想(北京)有限公司 | 一种信息处理方法及电子设备 |
US20150020087A1 (en) * | 2013-07-10 | 2015-01-15 | Anthony Rose | System for Identifying Features in a Television Signal |
CN103442290A (zh) * | 2013-08-15 | 2013-12-11 | 安徽科大讯飞信息科技股份有限公司 | 基于电视终端用户及语音的信息提供方法及系统 |
CN103607609A (zh) * | 2013-11-27 | 2014-02-26 | Tcl集团股份有限公司 | 一种电视机频道的语音切换方法和装置 |
US20160050457A1 (en) * | 2014-08-14 | 2016-02-18 | Sandipan Mondal | Method and system for tv channel content management and monetization based on content fingerprinting using a portable computing and communications device |
CN104796729A (zh) * | 2015-04-09 | 2015-07-22 | 宁波创视信息技术有限公司 | 高清晰实时获取电视播放画面的方法 |
CN104796751A (zh) * | 2015-04-23 | 2015-07-22 | 福州大学 | 一种电视信号识别的方法及装置 |
US9646628B1 (en) * | 2015-06-26 | 2017-05-09 | Amazon Technologies, Inc. | Noise cancellation for open microphone mode |
CN106486130A (zh) * | 2015-08-25 | 2017-03-08 | 百度在线网络技术(北京)有限公司 | 噪声消除、语音识别方法及装置 |
CN105701686A (zh) * | 2016-01-23 | 2016-06-22 | 北京掌阔移动传媒科技有限公司 | 一种声纹广告实现方法和装置 |
CN106653024A (zh) * | 2016-12-30 | 2017-05-10 | 首都师范大学 | 语音控制方法和装置、平衡车控制方法和装置与平衡车 |
Non-Patent Citations (1)
Title |
---|
李骏修等: "《世纪之光 科学家展望21世纪》", 30 November 1996 * |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108062949A (zh) * | 2017-12-11 | 2018-05-22 | 广州朗国电子科技有限公司 | 语音控制跑步机的方法及装置 |
CN108053828A (zh) * | 2017-12-25 | 2018-05-18 | 无锡小天鹅股份有限公司 | 确定控制指令的方法、装置和家用电器 |
CN108962235A (zh) * | 2017-12-27 | 2018-12-07 | 北京猎户星空科技有限公司 | 语音交互方法及装置 |
CN108389578A (zh) * | 2018-02-09 | 2018-08-10 | 深圳市鹰硕技术有限公司 | 智能教室语音控制系统 |
CN109524013A (zh) * | 2018-12-18 | 2019-03-26 | 北京猎户星空科技有限公司 | 一种语音处理方法、装置、介质和智能设备 |
CN109524013B (zh) * | 2018-12-18 | 2022-07-22 | 北京猎户星空科技有限公司 | 一种语音处理方法、装置、介质和智能设备 |
CN112687274A (zh) * | 2019-10-17 | 2021-04-20 | 北京猎户星空科技有限公司 | 一种语音信息的处理方法、装置、设备及介质 |
CN110730274B (zh) * | 2019-10-17 | 2021-11-19 | 厦门快商通科技股份有限公司 | 语音抓包解析方法、系统、移动终端及存储介质 |
CN110730274A (zh) * | 2019-10-17 | 2020-01-24 | 厦门快商通科技股份有限公司 | 语音抓包解析方法、系统、移动终端及存储介质 |
CN111009239A (zh) * | 2019-11-18 | 2020-04-14 | 北京小米移动软件有限公司 | 回声消除方法、回声消除装置及电子设备 |
CN111341325A (zh) * | 2020-02-13 | 2020-06-26 | 平安科技(深圳)有限公司 | 声纹识别方法、装置、存储介质、电子装置 |
CN111583934A (zh) * | 2020-04-30 | 2020-08-25 | 联想(北京)有限公司 | 一种数据处理方法及装置 |
CN116021250A (zh) * | 2023-03-29 | 2023-04-28 | 清华大学 | 智能装配系统 |
CN116021250B (zh) * | 2023-03-29 | 2023-06-06 | 清华大学 | 智能装配系统 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107293293A (zh) | 一种语音指令识别方法、系统及机器人 | |
CN109473123B (zh) | 语音活动检测方法及装置 | |
CN110517689B (zh) | 一种语音数据处理方法、装置及存储介质 | |
US20080091423A1 (en) | Generation of domain models from noisy transcriptions | |
WO2017031846A1 (zh) | 噪声消除、语音识别方法、装置、设备及非易失性计算机存储介质 | |
CN105979376A (zh) | 一种推荐方法和装置 | |
CN104575504A (zh) | 采用声纹和语音识别进行个性化电视语音唤醒的方法 | |
CN104766608A (zh) | 一种语音控制方法及装置 | |
CN109994106B (zh) | 一种语音处理方法及设备 | |
CN107241616A (zh) | 视频台词提取方法、装置及存储介质 | |
CN105635782A (zh) | 一种字幕输出方法及装置 | |
US9058384B2 (en) | System and method for identification of highly-variable vocalizations | |
WO2023222088A1 (zh) | 语音识别与分类方法和装置 | |
CN108491517A (zh) | 一种地域性农业信息服务语音查询终端 | |
CN103347070B (zh) | 推送语音数据的方法、终端、服务器及系统 | |
WO2023222089A1 (zh) | 基于深度学习的物品分类方法和装置 | |
CN109714608A (zh) | 视频数据处理方法、装置、计算机设备和存储介质 | |
CN101867742A (zh) | 一种基于声控控制下的电视系统 | |
CN111415128A (zh) | 控制会议的方法、系统、装置、设备和介质 | |
CN112530410A (zh) | 一种命令词识别方法及设备 | |
CN110211609A (zh) | 一种提升语音识别准确率的方法 | |
CN113779208A (zh) | 用于人机对话的方法和装置 | |
CN114996489A (zh) | 新闻数据的违规检测方法、装置、设备及存储介质 | |
Hori et al. | Real-time meeting recognition and understanding using distant microphones and omni-directional camera | |
US20140046967A1 (en) | System and method for pattern recognition and analysis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB03 | Change of inventor or designer information |
Inventor after: Yan Bin Inventor before: Wei Jinjing Inventor before: Xing Xueqiang |
|
TA01 | Transfer of patent application right |
Effective date of registration: 20171207 Address after: 510730 Guangdong city of Guangzhou Province Economic and Technological Development Zone Science 232 Xue Cheng Guang Bao Lu Building No. 2 room 507 Applicant after: Guangdong all intelligent engineering Co., Ltd. Address before: 518000 Guangdong city of Shenzhen province Nanshan District Nanhai Avenue West of Jinhui building area A 4 Building No. 127 Applicant before: Shenzhen search Fruit Technology Development Co., Ltd. |
|
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20171024 |
|
RJ01 | Rejection of invention patent application after publication |