CN103839549A - Voice instruction control method and system - Google Patents

Voice instruction control method and system Download PDF

Info

Publication number
CN103839549A
CN103839549A CN201210478777.XA CN201210478777A CN103839549A CN 103839549 A CN103839549 A CN 103839549A CN 201210478777 A CN201210478777 A CN 201210478777A CN 103839549 A CN103839549 A CN 103839549A
Authority
CN
China
Prior art keywords
speech
data
mobile terminal
server
identification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201210478777.XA
Other languages
Chinese (zh)
Inventor
曾亮
陈磊
薄川川
邓朔
郝宏伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201210478777.XA priority Critical patent/CN103839549A/en
Publication of CN103839549A publication Critical patent/CN103839549A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention discloses a voice instruction control method and system. The voice instruction control method comprises: packing voice data received by a mobile terminal for sending to a server; matching the voice data with training samples in the server, determining a proper identification voice text, and returning the identification voice text to the mobile terminal; and commanding the mobile terminal to execute corresponding operation according to the content of the identification voice text returned by the serer. By using the voice instruction control method and system provided by the invention, the voice data received by the mobile terminal is sent to the server, the server determines the proper identification voice text by matching the voice data with the training samples in the server, such that voice identification is more accurate, the voice instruction accuracy is improved, and the user application experience is improved.

Description

A kind of phonetic order control method and system
[technical field]
The present invention relates to voice control technology field, particularly a kind of phonetic order control method and system.
[background technology]
Siri is the critical function that iphone4S carries, user can directly simply be exchanged and mobile phone is sent to instruction with smart mobile phone by voice, along with the issue of Siri Chinese edition, people never stop the discussion of the intelligent human-machine interaction technology (HCI) such as voice.And the Voice Actions(phonetic order of Android system) very solid reliable voice recognition engine is also provided, its high resolution is startling, but require the language that user inputs to possess strict syntactic structure and form, otherwise system is by None-identified.The no matter Siri of iphone or the Voice Actions of Android system, all only be based on mobile terminal this locality and carry out speech recognition, but owing to being subject to the impact of the factors such as environment for use or user pronunciation and syntactic structure and form, mobile terminal there will be the situation of speech recognition errors or None-identified, affects user's experience.
Therefore, be necessary to propose a kind of new technical scheme, there is the technical matters of speech recognition errors or None-identified to solve existing voice recognition technology.
[summary of the invention]
One object of the present invention is to provide a kind of phonetic order control method and system, is intended to solve existing voice recognition technology and exists the technical matters of speech recognition errors or None-identified.
For achieving the above object, the invention provides a kind of phonetic order control method, comprising:
The speech data packing that mobile terminal is received sends to server;
Speech data is mated with the training sample in server, determine suitable identification speech text, and identification speech text is returned to mobile terminal;
The identification speech text contents command mobile terminal returning according to server is carried out corresponding operation.
In above-mentioned phonetic order control method, before sending to server step, the described speech data packing that mobile terminal is received also comprises: enter intelligent sound identification interface by intelligent sound entrance, the input of wait user speech, and judge efficient voice input within effective time, whether to be detected, if efficient voice input do not detected within effective time, finish this phonetic entry; If efficient voice input detected within effective time, receive user speech.
In above-mentioned phonetic order control method, in described reception user speech step, also comprise: judge whether to recognize user speech input endpoint or input overtime, if not recognizing user speech input endpoint or input does not have overtime, the speech data receiving is encoded, and continue to receive next section of user speech; If recognize user speech input endpoint or input overtimely, stop receiving speech data, complete all encoded speech datas.
In above-mentioned phonetic order control method, described, speech data is mated with the training sample in server, determine suitable identification speech text, and also comprise before identification speech text is returned to mobile terminal step: cloud server receives encoded speech data, encoded speech data is decoded and denoising.
In above-mentioned phonetic order control method, described, speech data is mated with the training sample in server, determine suitable identification speech text, and identification speech text is returned in mobile terminal step and also comprised: according to the additional steering order of speech text content.
In above-mentioned phonetic order control method, before carrying out corresponding operation steps, the described identification speech text contents command mobile terminal returning according to server also comprises: receive identification speech text and resolve steering order, carry out operation corresponding to speech text content according to steering order type command mobile terminal, wherein, described steering order type comprises plug-in application type, local function type, popular type of site and search-type.
The present invention also provides a kind of phonetic order control system, comprises mobile terminal and server, and described mobile terminal comprises data transmission blocks and command execution module, and described server comprises that Data Matching module and data return to module,
Data transmission blocks: for the speech data packing of reception is sent to server;
Command execution module: carry out corresponding operation for the identification speech text contents command mobile terminal returning according to server;
Data Matching module: mate with the training sample of server for the speech data that mobile terminal is sent, determine suitable identification speech text;
Data are returned to module: for identification speech text is returned to mobile terminal.
In above-mentioned phonetic order control system, described mobile terminal also comprises
Interface enters module: for enter intelligent sound identification interface by intelligent sound entrance;
Speech detection module: for waiting for user speech input, and judge efficient voice input whether detected within effective time, if efficient voice input do not detected within effective time, finish this phonetic entry; If efficient voice input detected within effective time, receive speech data by phonetic incepting module.
In above-mentioned phonetic order control system, described mobile terminal also comprises
Phonetic incepting module: for receiving user speech, and judge whether to recognize user speech input endpoint or input overtime, if not recognizing user speech input endpoint or input does not have overtime, by data coding module, the speech data receiving is encoded, phonetic incepting module continues to receive next section of user speech simultaneously; If recognize user speech input endpoint or input overtimely, stop receiving speech data, and complete all encoded speech datas by data coding module;
Data coding module: for all speech datas that receive are encoded, and send encoded speech data by data transmission blocks.
In above-mentioned phonetic order control system, described server also comprises data reception module: the encoded speech data sending for mobile terminal receive, encoded speech data is decoded and denoising.
In above-mentioned phonetic order control system, described Data Matching module is also for determining after suitable identification speech text according to the additional steering order of speech text content.
In above-mentioned phonetic order control system, described mobile terminal also comprises data resolution module: the identification speech text returning for reception server is also resolved steering order, and described command execution module is carried out operation corresponding to speech text content according to steering order type command mobile terminal.
In above-mentioned phonetic order control system, described steering order type comprises plug-in application type, local function type, popular type of site and search-type.
The speech data that phonetic order control method provided by the invention and system receive mobile terminal sends to server, server is by mating speech data with the training sample in server, determine suitable identification speech text, make speech recognition more accurate, improve the degree of accuracy of phonetic order, can greatly avoid the situation of mobile terminal sound identification error or None-identified, improve user's experience; In addition, the present invention classifies to the operating function of mobile terminal by the additional steering order of identification speech text content, improves the degree of accuracy of phonetic order.
For foregoing of the present invention can be become apparent, preferred embodiment cited below particularly, and coordinate appended graphicly, be described in detail below:
[accompanying drawing explanation]
Fig. 1 is the process flow diagram of the phonetic order control method of first embodiment of the invention;
Fig. 2 is the process flow diagram of the phonetic order control method of second embodiment of the invention;
Fig. 3 is the structural representation of the phonetic order control system of first embodiment of the invention;
Fig. 4 is the structural representation of the phonetic order control system of second embodiment of the invention.
[embodiment]
The explanation of following embodiment is graphic with reference to what add, can be in order to the specific embodiment of implementing in order to illustrate the present invention.
Please refer to Fig. 1, is the process flow diagram of the phonetic order control method of first embodiment of the invention.The phonetic order control method of first embodiment of the invention comprises the following steps:
Step S100: the speech data packing that mobile terminal is received sends to server;
Step S110: speech data is mated with the training sample in server, determine suitable identification speech text, and identification speech text is returned to mobile terminal;
In step S110, the present invention, by the speech data of user's input is uploaded onto the server and mated with the training sample in server, makes speech recognition more accurate, can greatly avoid the situation of mobile terminal sound identification error or None-identified;
Step S120: the identification speech text contents command mobile terminal returning according to server is carried out corresponding operation.
Please refer to Fig. 2, is the process flow diagram of the phonetic order control method of second embodiment of the invention.The phonetic order control method of second embodiment of the invention comprises the following steps:
Step S200: enter intelligent sound identification interface by intelligent sound entrance;
In step S200, user can be by clicking intelligent sound quick links icon or long by toolbar(tool bar) mode such as certain hour ejects intelligent sound and identifies interface, specifically seeing also Fig. 3, is mobile terminal intelligent sound identification interfacial effect figure of the present invention.In embodiments of the present invention, length, specifically can arrange according to different demands for being greater than 0.5s by the time of toolbar.
Step S210: wait for user speech input, and judge efficient voice input whether detected within effective time, if efficient voice input, execution step 220 do not detected within effective time; If efficient voice input detected within effective time, execution step 230;
In step 210, refer to the stand-by period of phonetic entry effective time, can arrange according to different demands, be set to 5s effective time in embodiments of the present invention; If user inputs voice within effective time, be efficient voice input, otherwise, if phonetic entry wait timeout finishes this input.
Step S220: finish this phonetic entry;
Step S230: receive user speech, and judge whether to recognize user speech input endpoint or input overtime, if do not recognize user speech input endpoint or input do not have overtime, execution step S240; If recognize user speech input endpoint or input overtime, execution step 250;
In step S230, recognize user speech input endpoint and refer to that user inputs a dead time after complete word or sentence and meets end points condition for identification, end points condition for identification can be set according to different situations, such as 5s, 10s etc.; If recognize user speech input endpoint or input overtimely, be defaulted as this phonetic entry complete, otherwise user can proceed phonetic entry.
Step S240: the speech data receiving is encoded, and again perform step next section of user speech of S230 continuation reception;
Step S250: stop receiving speech data, complete all encoded speech datas;
Step S260: all speech datas after coding are packed and asked to send to server by HTTP;
Step S270: cloud server receives encoded speech data, decodes encoded speech data denoising;
Step S280: decoded speech data is mated with the training sample in server, determine suitable identification speech text, according to the additional steering order of speech text content;
In step S280, the present invention, by the speech data of user's input is uploaded onto the server and mated with the training sample in server, makes speech recognition more accurate, can greatly avoid the situation of mobile terminal sound identification error or None-identified; Steering order is that cloud server is in determining identification speech text, according to the particular content of speech text, be mapped to the conventional operational instruction that client is supported, user side can carry out corresponding operation according to the steering order type command mobile terminal of speech text, for example, play music, send note, make a phone call, open webpage etc., have some situation of mistake identification, but along with the use result of a large number of users is constantly revised, it is accurate that this instruction also can be tending towards.
Step S290: speech text and steering order are returned to mobile terminal;
Step S300: receive speech text and resolve steering order, carrying out operation corresponding to speech text content according to steering order type command mobile terminal;
In step S300, steering order type comprises plug-in application type, local function type, popular type of site and search-type etc., wherein, if steering order type is plug-in application type, open corresponding application according to speech text content, as " music plug-in unit ", " Quick Response Code " etc.; If steering order type is local function type, call corresponding local function according to speech text content, as " opening bookmark ", " emptying all data " etc.; If steering order type is popular type of site, open corresponding webpage according to speech text content, as " Tengxun's homepage ", " Sina website "; Other speech texts that do not belong to above-mentioned three types, the present invention all thinks search-type, directly uses result corresponding to mobile terminal current search engine search speech text; Concrete key data structure is
typedef?enum?{
VoiceControlCmdUnkonwn?=?0x0,
VoiceControlCmdSerach,
VoiceControlCmdPlugin,
VoiceControlCmdLocalApp,
VoiceControlCmdWebSite
VoiceControlCmd; // voice control type
typedef?struct?{
Char * text; // speech recognition text
VoiceControlCmd controlCmd; // control type
Please refer to Fig. 3, is the structural representation of the phonetic order control system of first embodiment of the invention.The phonetic order control system of first embodiment of the invention comprises mobile terminal and server, and mobile terminal comprises data transmission blocks and command execution module, and server comprises that Data Matching module and data return to module, wherein
Data transmission blocks: for the speech data packing of reception is sent to server;
Command execution module: carry out corresponding operation for the identification speech text contents command mobile terminal returning according to server;
Data Matching module: mate with the training sample of server for the speech data that mobile terminal is sent, determine suitable identification speech text; Wherein, the present invention, by the speech data of user's input is uploaded onto the server and mated with the training sample in server, makes speech recognition more accurate, can greatly avoid the situation of mobile terminal sound identification error or None-identified;
Data are returned to module: for identification speech text is returned to mobile terminal;
Please refer to Fig. 4, is the structural representation of the phonetic order control system of second embodiment of the invention.The phonetic order control system of second embodiment of the invention comprises mobile terminal and server, mobile terminal comprises that interface enters module, speech detection module, phonetic incepting module, data coding module, data transmission blocks, data resolution module and command execution module, server comprises that data reception module, Data Matching module and data return to module, wherein
Interface enters module: for enter intelligent sound identification interface by intelligent sound entrance; Wherein, user can be by clicking intelligent sound quick links icon or long by toolbar(tool bar) mode such as certain hour ejects intelligent sound identification interface, specifically sees also Fig. 3, is that mobile terminal intelligent sound of the present invention is identified interfacial effect figure.In embodiments of the present invention, length, specifically can arrange according to different demands for being greater than 0.5s by the time of toolbar.
Speech detection module: for waiting for user speech input, and judge efficient voice input whether detected within effective time, if efficient voice input do not detected within effective time, finish this phonetic entry; If efficient voice input detected within effective time, receive speech data by phonetic incepting module; Wherein, refer to the stand-by period of phonetic entry effective time, can arrange according to different demands, be set to 5s effective time in embodiments of the present invention; If user inputs voice within effective time, be efficient voice input, otherwise, if phonetic entry wait timeout finishes this input.
Phonetic incepting module: for receiving user speech, and judge whether to recognize user speech input endpoint or input overtime, if not recognizing user speech input endpoint or input does not have overtime, by data coding module, the speech data receiving is encoded, phonetic incepting module continues to receive next section of user speech simultaneously; If recognize user speech input endpoint or input overtimely, stop receiving speech data, and complete all encoded speech datas by data coding module; Wherein, recognize user speech input endpoint and refer to that user inputs a dead time after complete word or sentence and meets end points condition for identification, end points condition for identification can be set according to different situations, such as 5s, 10s etc.; If recognize user speech input endpoint or input overtimely, be defaulted as this phonetic entry complete, otherwise user can proceed phonetic entry.
Data coding module: for all speech datas that receive are encoded, and send encoded speech data by data transmission blocks;
Data transmission blocks: for all speech datas after coding are packed and asked to send to server by HTTP;
Data resolution module: the identification speech text returning for reception server is also resolved steering order;
Command execution module: for carrying out operation corresponding to speech text content according to steering order type command mobile terminal; Wherein, steering order type comprises plug-in application type, local function type, popular type of site and search-type etc., wherein, if steering order type is plug-in application type, open corresponding application according to speech text content, as " music plug-in unit ", " Quick Response Code " etc.; If steering order type is local function type, call corresponding local function according to speech text content, as " opening bookmark ", " emptying all data " etc.; If steering order type is popular type of site, open corresponding webpage according to speech text content, as " Tengxun's homepage ", " Sina website "; Other speech texts that do not belong to above-mentioned three types, the present invention all thinks search-type, directly uses result corresponding to mobile terminal current search engine search speech text; Concrete key data structure is
typedef?enum?{
VoiceControlCmdUnkonwn?=?0x0,
VoiceControlCmdSerach,
VoiceControlCmdPlugin,
VoiceControlCmdLocalApp,
VoiceControlCmdWebSite
VoiceControlCmd; // voice control type
typedef?struct?{
Char * text; // speech recognition text
VoiceControlCmd controlCmd; // control type
Data reception module: the encoded speech data sending for mobile terminal receive, encoded speech data is decoded and denoising;
Data Matching module: for decoded speech data is mated with the training sample result of server, determine suitable identification speech text, according to the additional steering order of speech text content; Wherein, the present invention, by the speech data of user's input is uploaded onto the server and mated with the training sample in server, makes speech recognition more accurate, can greatly avoid the situation of mobile terminal sound identification error or None-identified; Steering order is that cloud server is in determining identification speech text, according to the particular content of speech text, be mapped to the conventional operational instruction that client is supported, user side can carry out corresponding operation according to the steering order type command mobile terminal of speech text, for example, play music, send note, make a phone call, open webpage etc., have some situation of mistake identification, but along with the use result of a large number of users is constantly revised, it is accurate that this instruction also can be tending towards.
Data are returned to module: for speech text and steering order are returned to mobile terminal;
The speech data that phonetic order control method provided by the invention and system receive mobile terminal sends to server, server is by mating speech data with the training sample in server, determine that returning to mobile terminal after suitable identification speech text carries out corresponding operation again, make speech recognition more accurate, can greatly avoid the situation of mobile terminal sound identification error or None-identified, improve user's experience; In addition, the present invention classifies to the operating function of mobile terminal by the additional steering order of identification speech text content, improves the degree of accuracy of phonetic order.
In sum; although the present invention discloses as above with preferred embodiment; but above preferred embodiment is not in order to limit the present invention; those of ordinary skill in the art; without departing from the spirit and scope of the present invention; all can do various changes and retouching, the scope that therefore protection scope of the present invention defines with claim is as the criterion.

Claims (13)

1. a phonetic order control method, comprising:
The speech data packing that mobile terminal is received sends to server;
Speech data is mated with the training sample in server, determine suitable identification speech text, and identification speech text is returned to mobile terminal;
The identification speech text contents command mobile terminal returning according to server is carried out corresponding operation.
2. phonetic order control method according to claim 1, it is characterized in that, before sending to server step, the described speech data packing that mobile terminal is received also comprises: enter intelligent sound identification interface by intelligent sound entrance, the input of wait user speech, and judge efficient voice input within effective time, whether to be detected, if efficient voice input do not detected within effective time, finish this phonetic entry; If efficient voice input detected within effective time, receive user speech.
3. phonetic order control method according to claim 2, it is characterized in that, in described reception user speech step, also comprise: judge whether to recognize user speech input endpoint or input overtime, if not recognizing user speech input endpoint or input does not have overtime, the speech data receiving is encoded, and continue to receive next section of user speech; If recognize user speech input endpoint or input overtimely, stop receiving speech data, complete all encoded speech datas.
4. phonetic order control method according to claim 3, it is characterized in that, described, speech data is mated with the training sample in server, determine suitable identification speech text, and also comprise before identification speech text is returned to mobile terminal step: cloud server receives encoded speech data, encoded speech data is decoded and denoising.
5. phonetic order control method according to claim 1, it is characterized in that, described, speech data is mated with the training sample in server, determine suitable identification speech text, and identification speech text is returned in mobile terminal step and also comprised: according to the additional steering order of speech text content.
6. phonetic order control method according to claim 1 or 5, it is characterized in that, before carrying out corresponding operation steps, the described identification speech text contents command mobile terminal returning according to server also comprises: receive identification speech text and resolve steering order, carry out operation corresponding to speech text content according to steering order type command mobile terminal, wherein, described steering order type comprises plug-in application type, local function type, popular type of site and search-type.
7. a phonetic order control system, is characterized in that, comprises mobile terminal and server, and described mobile terminal comprises data transmission blocks and command execution module, and described server comprises that Data Matching module and data return to module,
Data transmission blocks: for the speech data packing of reception is sent to server;
Command execution module: carry out corresponding operation for the identification speech text contents command mobile terminal returning according to server;
Data Matching module: mate with the training sample of server for the speech data that mobile terminal is sent, determine suitable identification speech text;
Data are returned to module: for identification speech text is returned to mobile terminal.
8. phonetic order control system according to claim 7, is characterized in that, described mobile terminal also comprises
Interface enters module: for enter intelligent sound identification interface by intelligent sound entrance;
Speech detection module: for waiting for user speech input, and judge efficient voice input whether detected within effective time, if efficient voice input do not detected within effective time, finish this phonetic entry; If efficient voice input detected within effective time, receive speech data by phonetic incepting module.
9. phonetic order control system according to claim 8, is characterized in that, described mobile terminal also comprises
Phonetic incepting module: for receiving user speech, and judge whether to recognize user speech input endpoint or input overtime, if not recognizing user speech input endpoint or input does not have overtime, by data coding module, the speech data receiving is encoded, phonetic incepting module continues to receive next section of user speech simultaneously; If recognize user speech input endpoint or input overtimely, stop receiving speech data, and complete all encoded speech datas by data coding module;
Data coding module: for all speech datas that receive are encoded, and send encoded speech data by data transmission blocks.
10. phonetic order control system according to claim 9, is characterized in that, described server also comprises data reception module: the encoded speech data sending for mobile terminal receive, encoded speech data is decoded and denoising.
11. phonetic order control system according to claim 7, is characterized in that, described Data Matching module is also for determining after suitable identification speech text according to the additional steering order of speech text content.
12. according to the phonetic order control system described in claim 7 or 11, it is characterized in that, described mobile terminal also comprises data resolution module: the identification speech text returning for reception server is also resolved steering order, and described command execution module is carried out operation corresponding to speech text content according to steering order type command mobile terminal.
13. phonetic order control system according to claim 12, is characterized in that, described steering order type comprises plug-in application type, local function type, popular type of site and search-type.
CN201210478777.XA 2012-11-22 2012-11-22 Voice instruction control method and system Pending CN103839549A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210478777.XA CN103839549A (en) 2012-11-22 2012-11-22 Voice instruction control method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210478777.XA CN103839549A (en) 2012-11-22 2012-11-22 Voice instruction control method and system

Publications (1)

Publication Number Publication Date
CN103839549A true CN103839549A (en) 2014-06-04

Family

ID=50802981

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210478777.XA Pending CN103839549A (en) 2012-11-22 2012-11-22 Voice instruction control method and system

Country Status (1)

Country Link
CN (1) CN103839549A (en)

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104183237A (en) * 2014-09-04 2014-12-03 百度在线网络技术(北京)有限公司 Speech processing method and device for portable terminal
CN104268195A (en) * 2014-09-19 2015-01-07 三星电子(中国)研发中心 Method and device for processing local resources in terminal
CN105094807A (en) * 2015-06-25 2015-11-25 三星电子(中国)研发中心 Method and device for implementing voice control
CN105609118A (en) * 2015-12-30 2016-05-25 生迪智慧科技有限公司 Speech detection method and device
CN105788594A (en) * 2016-03-01 2016-07-20 江西掌中无限网络科技股份有限公司 Voice and meaning identification method and system of flow-free APP
WO2016112634A1 (en) * 2015-01-12 2016-07-21 芋头科技(杭州)有限公司 Voice recognition system and method of robot system
CN105827878A (en) * 2015-01-04 2016-08-03 中国移动通信集团公司 Voice information conversion method and voice conversion gateway
CN106504753A (en) * 2015-09-07 2017-03-15 上海隆通网络系统有限公司 A kind of audio recognition method and system in IT operation management system
CN106847284A (en) * 2017-03-09 2017-06-13 深圳市八圈科技有限公司 Electronic equipment, computer-readable recording medium and voice interactive method
CN107086037A (en) * 2017-03-17 2017-08-22 上海庆科信息技术有限公司 A kind of voice interactive method of embedded device, device and embedded device
CN107146618A (en) * 2017-06-16 2017-09-08 北京云知声信息技术有限公司 Method of speech processing and device
CN107153499A (en) * 2016-03-04 2017-09-12 株式会社理光 The Voice command of interactive whiteboard equipment
CN107919130A (en) * 2017-11-06 2018-04-17 百度在线网络技术(北京)有限公司 Method of speech processing and device based on high in the clouds
CN108111696A (en) * 2017-12-29 2018-06-01 深圳市酷达通讯有限公司 A kind of wireless fixed telephone
CN108986811A (en) * 2018-08-31 2018-12-11 北京新能源汽车股份有限公司 A kind of detection method of speech recognition, device and equipment
CN109036430A (en) * 2018-09-29 2018-12-18 芜湖星途机器人科技有限公司 Voice control terminal
CN109120774A (en) * 2018-06-29 2019-01-01 深圳市九洲电器有限公司 Terminal applies voice control method and system
CN109118747A (en) * 2017-06-23 2019-01-01 中兴通讯股份有限公司 Infrared equipment control method, system, storage medium and computer equipment
CN109474843A (en) * 2017-09-08 2019-03-15 腾讯科技(深圳)有限公司 The method of speech control terminal, client, server
CN111225261A (en) * 2018-11-27 2020-06-02 Lg电子株式会社 Multimedia device for processing voice command and control method thereof
CN111261153A (en) * 2018-12-03 2020-06-09 现代自动车株式会社 Vehicle voice command processing device and method
CN111462738A (en) * 2019-01-18 2020-07-28 阿里巴巴集团控股有限公司 Voice recognition method and device
CN112565849A (en) * 2019-09-26 2021-03-26 深圳市茁壮网络股份有限公司 Voice control method of digital television, television control system and storage medium
CN112789561A (en) * 2018-10-15 2021-05-11 美的集团股份有限公司 System and method for customizing a portable natural language processing interface for an appliance

Citations (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1356688A (en) * 2000-11-27 2002-07-03 佳能株式会社 Speech recognition system, server and client, and control method thereof
US20020193998A1 (en) * 2001-05-31 2002-12-19 Dvorak Joseph L. Virtual speech interface system and method of using same
CN1627672A (en) * 2003-05-02 2005-06-15 索尼株式会社 Network system, electronic equipment terminal, server apparatus and method for distributing and reproducing the contents
CN1735027A (en) * 2004-08-13 2006-02-15 上海赢思软件技术有限公司 Chat robot system
KR20060034337A (en) * 2004-10-18 2006-04-24 주식회사 팬택 Mobile phone and server for managing home-network by voice, and system and method for home-network management using the same
CN101030994A (en) * 2007-04-11 2007-09-05 华为技术有限公司 Speech discriminating method system and server
CN101360118A (en) * 2007-08-02 2009-02-04 广东新支点技术服务有限公司 Method and protocol suitable for mobile terminal multimedia file sharing and searching
CN101420543A (en) * 2008-12-05 2009-04-29 天津三星电子显示器有限公司 Method for voice controlling television and television therewith
CN101437039A (en) * 2007-11-15 2009-05-20 华为技术有限公司 Mobile searching method, system and equipment
CN101599270A (en) * 2008-06-02 2009-12-09 海尔集团公司 Voice server and voice control method
US20100088100A1 (en) * 2008-10-02 2010-04-08 Lindahl Aram M Electronic devices with voice command and contextual data processing capabilities
CN101715018A (en) * 2009-11-03 2010-05-26 沈阳晨讯希姆通科技有限公司 Voice control method of functions of mobile phone
CN102270213A (en) * 2011-04-20 2011-12-07 深圳市凯立德科技股份有限公司 Searching method and device for interesting points of navigation system, and location service terminal
CN102316162A (en) * 2011-09-01 2012-01-11 深圳市子栋科技有限公司 Vehicle remote control method based on voice command, apparatus and system thereof
CN102316361A (en) * 2011-07-04 2012-01-11 深圳市子栋科技有限公司 Audio-frequency / video-frequency on demand method based on natural speech recognition and system thereof
US20120030712A1 (en) * 2010-08-02 2012-02-02 At&T Intellectual Property I, L.P. Network-integrated remote control with voice activation
CN102497481A (en) * 2011-12-02 2012-06-13 深圳市车音网科技有限公司 Method, device and system for voice dialing
CN102497391A (en) * 2011-11-21 2012-06-13 宇龙计算机通信科技(深圳)有限公司 Server, mobile terminal and prompt method
CN102541574A (en) * 2010-12-13 2012-07-04 鸿富锦精密工业(深圳)有限公司 Application program opening system and method
CN102541505A (en) * 2011-01-04 2012-07-04 中国移动通信集团公司 Voice input method and system thereof
CN102571882A (en) * 2010-12-31 2012-07-11 上海博泰悦臻电子设备制造有限公司 Network-based voice reminding method and system
CN102591932A (en) * 2011-12-23 2012-07-18 优视科技有限公司 Voice search method, voice search system, mobile terminal and transfer server
CN102629246A (en) * 2012-02-10 2012-08-08 北京百纳信息技术有限公司 Server used for recognizing browser voice commands and browser voice command recognition system
CN102650960A (en) * 2012-03-31 2012-08-29 奇智软件(北京)有限公司 Method and device for eliminating faults of terminal equipment
CN102724309A (en) * 2012-06-14 2012-10-10 广东好帮手电子科技股份有限公司 Vehicular voice network music system and control method thereof
CN102741146A (en) * 2010-02-23 2012-10-17 三菱电机株式会社 Elevator device
CN102760431A (en) * 2012-07-12 2012-10-31 上海语联信息技术有限公司 Intelligentized voice recognition system
CN102792320A (en) * 2010-01-18 2012-11-21 苹果公司 Intelligent automated assistant

Patent Citations (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1356688A (en) * 2000-11-27 2002-07-03 佳能株式会社 Speech recognition system, server and client, and control method thereof
US20020193998A1 (en) * 2001-05-31 2002-12-19 Dvorak Joseph L. Virtual speech interface system and method of using same
CN1627672A (en) * 2003-05-02 2005-06-15 索尼株式会社 Network system, electronic equipment terminal, server apparatus and method for distributing and reproducing the contents
CN1735027A (en) * 2004-08-13 2006-02-15 上海赢思软件技术有限公司 Chat robot system
KR20060034337A (en) * 2004-10-18 2006-04-24 주식회사 팬택 Mobile phone and server for managing home-network by voice, and system and method for home-network management using the same
CN101030994A (en) * 2007-04-11 2007-09-05 华为技术有限公司 Speech discriminating method system and server
CN101360118A (en) * 2007-08-02 2009-02-04 广东新支点技术服务有限公司 Method and protocol suitable for mobile terminal multimedia file sharing and searching
CN101437039A (en) * 2007-11-15 2009-05-20 华为技术有限公司 Mobile searching method, system and equipment
CN101599270A (en) * 2008-06-02 2009-12-09 海尔集团公司 Voice server and voice control method
US20100088100A1 (en) * 2008-10-02 2010-04-08 Lindahl Aram M Electronic devices with voice command and contextual data processing capabilities
CN101420543A (en) * 2008-12-05 2009-04-29 天津三星电子显示器有限公司 Method for voice controlling television and television therewith
CN101715018A (en) * 2009-11-03 2010-05-26 沈阳晨讯希姆通科技有限公司 Voice control method of functions of mobile phone
CN102792320A (en) * 2010-01-18 2012-11-21 苹果公司 Intelligent automated assistant
CN102741146A (en) * 2010-02-23 2012-10-17 三菱电机株式会社 Elevator device
US20120030712A1 (en) * 2010-08-02 2012-02-02 At&T Intellectual Property I, L.P. Network-integrated remote control with voice activation
CN102541574A (en) * 2010-12-13 2012-07-04 鸿富锦精密工业(深圳)有限公司 Application program opening system and method
CN102571882A (en) * 2010-12-31 2012-07-11 上海博泰悦臻电子设备制造有限公司 Network-based voice reminding method and system
CN102541505A (en) * 2011-01-04 2012-07-04 中国移动通信集团公司 Voice input method and system thereof
CN102270213A (en) * 2011-04-20 2011-12-07 深圳市凯立德科技股份有限公司 Searching method and device for interesting points of navigation system, and location service terminal
CN102316361A (en) * 2011-07-04 2012-01-11 深圳市子栋科技有限公司 Audio-frequency / video-frequency on demand method based on natural speech recognition and system thereof
CN102316162A (en) * 2011-09-01 2012-01-11 深圳市子栋科技有限公司 Vehicle remote control method based on voice command, apparatus and system thereof
CN102497391A (en) * 2011-11-21 2012-06-13 宇龙计算机通信科技(深圳)有限公司 Server, mobile terminal and prompt method
CN102497481A (en) * 2011-12-02 2012-06-13 深圳市车音网科技有限公司 Method, device and system for voice dialing
CN102591932A (en) * 2011-12-23 2012-07-18 优视科技有限公司 Voice search method, voice search system, mobile terminal and transfer server
CN102629246A (en) * 2012-02-10 2012-08-08 北京百纳信息技术有限公司 Server used for recognizing browser voice commands and browser voice command recognition system
CN102650960A (en) * 2012-03-31 2012-08-29 奇智软件(北京)有限公司 Method and device for eliminating faults of terminal equipment
CN102724309A (en) * 2012-06-14 2012-10-10 广东好帮手电子科技股份有限公司 Vehicular voice network music system and control method thereof
CN102760431A (en) * 2012-07-12 2012-10-31 上海语联信息技术有限公司 Intelligentized voice recognition system

Cited By (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104183237A (en) * 2014-09-04 2014-12-03 百度在线网络技术(北京)有限公司 Speech processing method and device for portable terminal
CN104183237B (en) * 2014-09-04 2017-10-31 百度在线网络技术(北京)有限公司 Method of speech processing and device for portable terminal
CN104268195A (en) * 2014-09-19 2015-01-07 三星电子(中国)研发中心 Method and device for processing local resources in terminal
CN105827878B (en) * 2015-01-04 2019-06-25 中国移动通信集团公司 Voice messaging conversion method and voice transfer gateway
CN105827878A (en) * 2015-01-04 2016-08-03 中国移动通信集团公司 Voice information conversion method and voice conversion gateway
JP2018507434A (en) * 2015-01-12 2018-03-15 ユウトウ・テクノロジー(ハンジョウ)・カンパニー・リミテッド Voice identification system and method for robot system
WO2016112634A1 (en) * 2015-01-12 2016-07-21 芋头科技(杭州)有限公司 Voice recognition system and method of robot system
CN105845135A (en) * 2015-01-12 2016-08-10 芋头科技(杭州)有限公司 Sound recognition system and method for robot system
CN105094807A (en) * 2015-06-25 2015-11-25 三星电子(中国)研发中心 Method and device for implementing voice control
CN106504753A (en) * 2015-09-07 2017-03-15 上海隆通网络系统有限公司 A kind of audio recognition method and system in IT operation management system
CN105609118A (en) * 2015-12-30 2016-05-25 生迪智慧科技有限公司 Speech detection method and device
CN105609118B (en) * 2015-12-30 2020-02-07 生迪智慧科技有限公司 Voice detection method and device
CN105788594A (en) * 2016-03-01 2016-07-20 江西掌中无限网络科技股份有限公司 Voice and meaning identification method and system of flow-free APP
CN107153499A (en) * 2016-03-04 2017-09-12 株式会社理光 The Voice command of interactive whiteboard equipment
CN106847284A (en) * 2017-03-09 2017-06-13 深圳市八圈科技有限公司 Electronic equipment, computer-readable recording medium and voice interactive method
CN107086037A (en) * 2017-03-17 2017-08-22 上海庆科信息技术有限公司 A kind of voice interactive method of embedded device, device and embedded device
CN107146618A (en) * 2017-06-16 2017-09-08 北京云知声信息技术有限公司 Method of speech processing and device
CN109118747A (en) * 2017-06-23 2019-01-01 中兴通讯股份有限公司 Infrared equipment control method, system, storage medium and computer equipment
CN109474843A (en) * 2017-09-08 2019-03-15 腾讯科技(深圳)有限公司 The method of speech control terminal, client, server
CN107919130A (en) * 2017-11-06 2018-04-17 百度在线网络技术(北京)有限公司 Method of speech processing and device based on high in the clouds
CN107919130B (en) * 2017-11-06 2021-12-17 百度在线网络技术(北京)有限公司 Cloud-based voice processing method and device
US11024332B2 (en) 2017-11-06 2021-06-01 Baidu Online Network Technology (Beijing) Co., Ltd. Cloud-based speech processing method and apparatus
CN108111696A (en) * 2017-12-29 2018-06-01 深圳市酷达通讯有限公司 A kind of wireless fixed telephone
CN109120774A (en) * 2018-06-29 2019-01-01 深圳市九洲电器有限公司 Terminal applies voice control method and system
CN108986811A (en) * 2018-08-31 2018-12-11 北京新能源汽车股份有限公司 A kind of detection method of speech recognition, device and equipment
CN109036430A (en) * 2018-09-29 2018-12-18 芜湖星途机器人科技有限公司 Voice control terminal
CN112789561A (en) * 2018-10-15 2021-05-11 美的集团股份有限公司 System and method for customizing a portable natural language processing interface for an appliance
CN112789561B (en) * 2018-10-15 2022-04-05 美的集团股份有限公司 System and method for customizing a portable natural language processing interface for an appliance
CN111225261A (en) * 2018-11-27 2020-06-02 Lg电子株式会社 Multimedia device for processing voice command and control method thereof
CN111225261B (en) * 2018-11-27 2021-11-26 Lg电子株式会社 Multimedia device for processing voice command and control method thereof
CN111261153A (en) * 2018-12-03 2020-06-09 现代自动车株式会社 Vehicle voice command processing device and method
CN111261153B (en) * 2018-12-03 2023-12-19 现代自动车株式会社 Vehicle voice command processing device and method
CN111462738A (en) * 2019-01-18 2020-07-28 阿里巴巴集团控股有限公司 Voice recognition method and device
CN112565849A (en) * 2019-09-26 2021-03-26 深圳市茁壮网络股份有限公司 Voice control method of digital television, television control system and storage medium

Similar Documents

Publication Publication Date Title
CN103839549A (en) Voice instruction control method and system
US20140379334A1 (en) Natural language understanding automatic speech recognition post processing
KR20200108775A (en) Training corpus generating method, apparatus, device and storage medium
CN108710704B (en) Method and device for determining conversation state, electronic equipment and storage medium
KR102046486B1 (en) Information inputting method
CN101221576B (en) Input method and device capable of implementing automatic translation
KR20190021338A (en) Subsequent voice query prediction
CN109559748B (en) A kind of method for recognizing semantics, device, smart machine and storage medium
CN111402861B (en) Voice recognition method, device, equipment and storage medium
CN106372054B (en) Method and device for multi-language semantic analysis
CN104575499B (en) Voice control method of mobile terminal and mobile terminal
RU2011130550A (en) LANGUAGE-BASED MARKING SELECTION AND USE OF RECOGNITORS FOR PROCESSING PROMOTION
CN109785829B (en) Customer service assisting method and system based on voice control
CN109256125B (en) Off-line voice recognition method and device and storage medium
CN105512182A (en) Speech control method and intelligent television
WO2020024620A1 (en) Voice information processing method and device, apparatus, and storage medium
CN112669842A (en) Man-machine conversation control method, device, computer equipment and storage medium
CN110991179A (en) Semantic analysis method based on electric power professional term
CN111933149A (en) Voice interaction method, wearable device, terminal and voice interaction system
CN112286485B (en) Method and device for controlling application through voice, electronic equipment and storage medium
CN110808031A (en) Voice recognition method and device and computer equipment
CN114299955B (en) Voice interaction method and device, electronic equipment and storage medium
CN112035648B (en) User data processing method and device and electronic equipment
CN114171016A (en) Voice interaction method and device, electronic equipment and storage medium
CN114781359A (en) Text error correction method and device, computer equipment and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20140604