CN103839549A - Voice instruction control method and system - Google Patents
Voice instruction control method and system Download PDFInfo
- Publication number
- CN103839549A CN103839549A CN201210478777.XA CN201210478777A CN103839549A CN 103839549 A CN103839549 A CN 103839549A CN 201210478777 A CN201210478777 A CN 201210478777A CN 103839549 A CN103839549 A CN 103839549A
- Authority
- CN
- China
- Prior art keywords
- speech
- data
- mobile terminal
- server
- identification
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Abstract
The invention discloses a voice instruction control method and system. The voice instruction control method comprises: packing voice data received by a mobile terminal for sending to a server; matching the voice data with training samples in the server, determining a proper identification voice text, and returning the identification voice text to the mobile terminal; and commanding the mobile terminal to execute corresponding operation according to the content of the identification voice text returned by the serer. By using the voice instruction control method and system provided by the invention, the voice data received by the mobile terminal is sent to the server, the server determines the proper identification voice text by matching the voice data with the training samples in the server, such that voice identification is more accurate, the voice instruction accuracy is improved, and the user application experience is improved.
Description
[technical field]
The present invention relates to voice control technology field, particularly a kind of phonetic order control method and system.
[background technology]
Siri is the critical function that iphone4S carries, user can directly simply be exchanged and mobile phone is sent to instruction with smart mobile phone by voice, along with the issue of Siri Chinese edition, people never stop the discussion of the intelligent human-machine interaction technology (HCI) such as voice.And the Voice Actions(phonetic order of Android system) very solid reliable voice recognition engine is also provided, its high resolution is startling, but require the language that user inputs to possess strict syntactic structure and form, otherwise system is by None-identified.The no matter Siri of iphone or the Voice Actions of Android system, all only be based on mobile terminal this locality and carry out speech recognition, but owing to being subject to the impact of the factors such as environment for use or user pronunciation and syntactic structure and form, mobile terminal there will be the situation of speech recognition errors or None-identified, affects user's experience.
Therefore, be necessary to propose a kind of new technical scheme, there is the technical matters of speech recognition errors or None-identified to solve existing voice recognition technology.
[summary of the invention]
One object of the present invention is to provide a kind of phonetic order control method and system, is intended to solve existing voice recognition technology and exists the technical matters of speech recognition errors or None-identified.
For achieving the above object, the invention provides a kind of phonetic order control method, comprising:
The speech data packing that mobile terminal is received sends to server;
Speech data is mated with the training sample in server, determine suitable identification speech text, and identification speech text is returned to mobile terminal;
The identification speech text contents command mobile terminal returning according to server is carried out corresponding operation.
In above-mentioned phonetic order control method, before sending to server step, the described speech data packing that mobile terminal is received also comprises: enter intelligent sound identification interface by intelligent sound entrance, the input of wait user speech, and judge efficient voice input within effective time, whether to be detected, if efficient voice input do not detected within effective time, finish this phonetic entry; If efficient voice input detected within effective time, receive user speech.
In above-mentioned phonetic order control method, in described reception user speech step, also comprise: judge whether to recognize user speech input endpoint or input overtime, if not recognizing user speech input endpoint or input does not have overtime, the speech data receiving is encoded, and continue to receive next section of user speech; If recognize user speech input endpoint or input overtimely, stop receiving speech data, complete all encoded speech datas.
In above-mentioned phonetic order control method, described, speech data is mated with the training sample in server, determine suitable identification speech text, and also comprise before identification speech text is returned to mobile terminal step: cloud server receives encoded speech data, encoded speech data is decoded and denoising.
In above-mentioned phonetic order control method, described, speech data is mated with the training sample in server, determine suitable identification speech text, and identification speech text is returned in mobile terminal step and also comprised: according to the additional steering order of speech text content.
In above-mentioned phonetic order control method, before carrying out corresponding operation steps, the described identification speech text contents command mobile terminal returning according to server also comprises: receive identification speech text and resolve steering order, carry out operation corresponding to speech text content according to steering order type command mobile terminal, wherein, described steering order type comprises plug-in application type, local function type, popular type of site and search-type.
The present invention also provides a kind of phonetic order control system, comprises mobile terminal and server, and described mobile terminal comprises data transmission blocks and command execution module, and described server comprises that Data Matching module and data return to module,
Data transmission blocks: for the speech data packing of reception is sent to server;
Command execution module: carry out corresponding operation for the identification speech text contents command mobile terminal returning according to server;
Data Matching module: mate with the training sample of server for the speech data that mobile terminal is sent, determine suitable identification speech text;
Data are returned to module: for identification speech text is returned to mobile terminal.
In above-mentioned phonetic order control system, described mobile terminal also comprises
Interface enters module: for enter intelligent sound identification interface by intelligent sound entrance;
Speech detection module: for waiting for user speech input, and judge efficient voice input whether detected within effective time, if efficient voice input do not detected within effective time, finish this phonetic entry; If efficient voice input detected within effective time, receive speech data by phonetic incepting module.
In above-mentioned phonetic order control system, described mobile terminal also comprises
Phonetic incepting module: for receiving user speech, and judge whether to recognize user speech input endpoint or input overtime, if not recognizing user speech input endpoint or input does not have overtime, by data coding module, the speech data receiving is encoded, phonetic incepting module continues to receive next section of user speech simultaneously; If recognize user speech input endpoint or input overtimely, stop receiving speech data, and complete all encoded speech datas by data coding module;
Data coding module: for all speech datas that receive are encoded, and send encoded speech data by data transmission blocks.
In above-mentioned phonetic order control system, described server also comprises data reception module: the encoded speech data sending for mobile terminal receive, encoded speech data is decoded and denoising.
In above-mentioned phonetic order control system, described Data Matching module is also for determining after suitable identification speech text according to the additional steering order of speech text content.
In above-mentioned phonetic order control system, described mobile terminal also comprises data resolution module: the identification speech text returning for reception server is also resolved steering order, and described command execution module is carried out operation corresponding to speech text content according to steering order type command mobile terminal.
In above-mentioned phonetic order control system, described steering order type comprises plug-in application type, local function type, popular type of site and search-type.
The speech data that phonetic order control method provided by the invention and system receive mobile terminal sends to server, server is by mating speech data with the training sample in server, determine suitable identification speech text, make speech recognition more accurate, improve the degree of accuracy of phonetic order, can greatly avoid the situation of mobile terminal sound identification error or None-identified, improve user's experience; In addition, the present invention classifies to the operating function of mobile terminal by the additional steering order of identification speech text content, improves the degree of accuracy of phonetic order.
For foregoing of the present invention can be become apparent, preferred embodiment cited below particularly, and coordinate appended graphicly, be described in detail below:
[accompanying drawing explanation]
Fig. 1 is the process flow diagram of the phonetic order control method of first embodiment of the invention;
Fig. 2 is the process flow diagram of the phonetic order control method of second embodiment of the invention;
Fig. 3 is the structural representation of the phonetic order control system of first embodiment of the invention;
Fig. 4 is the structural representation of the phonetic order control system of second embodiment of the invention.
[embodiment]
The explanation of following embodiment is graphic with reference to what add, can be in order to the specific embodiment of implementing in order to illustrate the present invention.
Please refer to Fig. 1, is the process flow diagram of the phonetic order control method of first embodiment of the invention.The phonetic order control method of first embodiment of the invention comprises the following steps:
Step S100: the speech data packing that mobile terminal is received sends to server;
Step S110: speech data is mated with the training sample in server, determine suitable identification speech text, and identification speech text is returned to mobile terminal;
In step S110, the present invention, by the speech data of user's input is uploaded onto the server and mated with the training sample in server, makes speech recognition more accurate, can greatly avoid the situation of mobile terminal sound identification error or None-identified;
Step S120: the identification speech text contents command mobile terminal returning according to server is carried out corresponding operation.
Please refer to Fig. 2, is the process flow diagram of the phonetic order control method of second embodiment of the invention.The phonetic order control method of second embodiment of the invention comprises the following steps:
Step S200: enter intelligent sound identification interface by intelligent sound entrance;
In step S200, user can be by clicking intelligent sound quick links icon or long by toolbar(tool bar) mode such as certain hour ejects intelligent sound and identifies interface, specifically seeing also Fig. 3, is mobile terminal intelligent sound identification interfacial effect figure of the present invention.In embodiments of the present invention, length, specifically can arrange according to different demands for being greater than 0.5s by the time of toolbar.
Step S210: wait for user speech input, and judge efficient voice input whether detected within effective time, if efficient voice input, execution step 220 do not detected within effective time; If efficient voice input detected within effective time, execution step 230;
In step 210, refer to the stand-by period of phonetic entry effective time, can arrange according to different demands, be set to 5s effective time in embodiments of the present invention; If user inputs voice within effective time, be efficient voice input, otherwise, if phonetic entry wait timeout finishes this input.
Step S220: finish this phonetic entry;
Step S230: receive user speech, and judge whether to recognize user speech input endpoint or input overtime, if do not recognize user speech input endpoint or input do not have overtime, execution step S240; If recognize user speech input endpoint or input overtime, execution step 250;
In step S230, recognize user speech input endpoint and refer to that user inputs a dead time after complete word or sentence and meets end points condition for identification, end points condition for identification can be set according to different situations, such as 5s, 10s etc.; If recognize user speech input endpoint or input overtimely, be defaulted as this phonetic entry complete, otherwise user can proceed phonetic entry.
Step S240: the speech data receiving is encoded, and again perform step next section of user speech of S230 continuation reception;
Step S250: stop receiving speech data, complete all encoded speech datas;
Step S260: all speech datas after coding are packed and asked to send to server by HTTP;
Step S270: cloud server receives encoded speech data, decodes encoded speech data denoising;
Step S280: decoded speech data is mated with the training sample in server, determine suitable identification speech text, according to the additional steering order of speech text content;
In step S280, the present invention, by the speech data of user's input is uploaded onto the server and mated with the training sample in server, makes speech recognition more accurate, can greatly avoid the situation of mobile terminal sound identification error or None-identified; Steering order is that cloud server is in determining identification speech text, according to the particular content of speech text, be mapped to the conventional operational instruction that client is supported, user side can carry out corresponding operation according to the steering order type command mobile terminal of speech text, for example, play music, send note, make a phone call, open webpage etc., have some situation of mistake identification, but along with the use result of a large number of users is constantly revised, it is accurate that this instruction also can be tending towards.
Step S290: speech text and steering order are returned to mobile terminal;
Step S300: receive speech text and resolve steering order, carrying out operation corresponding to speech text content according to steering order type command mobile terminal;
In step S300, steering order type comprises plug-in application type, local function type, popular type of site and search-type etc., wherein, if steering order type is plug-in application type, open corresponding application according to speech text content, as " music plug-in unit ", " Quick Response Code " etc.; If steering order type is local function type, call corresponding local function according to speech text content, as " opening bookmark ", " emptying all data " etc.; If steering order type is popular type of site, open corresponding webpage according to speech text content, as " Tengxun's homepage ", " Sina website "; Other speech texts that do not belong to above-mentioned three types, the present invention all thinks search-type, directly uses result corresponding to mobile terminal current search engine search speech text; Concrete key data structure is
typedef?enum?{
VoiceControlCmdUnkonwn?=?0x0,
VoiceControlCmdSerach,
VoiceControlCmdPlugin,
VoiceControlCmdLocalApp,
VoiceControlCmdWebSite
VoiceControlCmd; // voice control type
typedef?struct?{
Char * text; // speech recognition text
VoiceControlCmd controlCmd; // control type
Please refer to Fig. 3, is the structural representation of the phonetic order control system of first embodiment of the invention.The phonetic order control system of first embodiment of the invention comprises mobile terminal and server, and mobile terminal comprises data transmission blocks and command execution module, and server comprises that Data Matching module and data return to module, wherein
Data transmission blocks: for the speech data packing of reception is sent to server;
Command execution module: carry out corresponding operation for the identification speech text contents command mobile terminal returning according to server;
Data Matching module: mate with the training sample of server for the speech data that mobile terminal is sent, determine suitable identification speech text; Wherein, the present invention, by the speech data of user's input is uploaded onto the server and mated with the training sample in server, makes speech recognition more accurate, can greatly avoid the situation of mobile terminal sound identification error or None-identified;
Data are returned to module: for identification speech text is returned to mobile terminal;
Please refer to Fig. 4, is the structural representation of the phonetic order control system of second embodiment of the invention.The phonetic order control system of second embodiment of the invention comprises mobile terminal and server, mobile terminal comprises that interface enters module, speech detection module, phonetic incepting module, data coding module, data transmission blocks, data resolution module and command execution module, server comprises that data reception module, Data Matching module and data return to module, wherein
Interface enters module: for enter intelligent sound identification interface by intelligent sound entrance; Wherein, user can be by clicking intelligent sound quick links icon or long by toolbar(tool bar) mode such as certain hour ejects intelligent sound identification interface, specifically sees also Fig. 3, is that mobile terminal intelligent sound of the present invention is identified interfacial effect figure.In embodiments of the present invention, length, specifically can arrange according to different demands for being greater than 0.5s by the time of toolbar.
Speech detection module: for waiting for user speech input, and judge efficient voice input whether detected within effective time, if efficient voice input do not detected within effective time, finish this phonetic entry; If efficient voice input detected within effective time, receive speech data by phonetic incepting module; Wherein, refer to the stand-by period of phonetic entry effective time, can arrange according to different demands, be set to 5s effective time in embodiments of the present invention; If user inputs voice within effective time, be efficient voice input, otherwise, if phonetic entry wait timeout finishes this input.
Phonetic incepting module: for receiving user speech, and judge whether to recognize user speech input endpoint or input overtime, if not recognizing user speech input endpoint or input does not have overtime, by data coding module, the speech data receiving is encoded, phonetic incepting module continues to receive next section of user speech simultaneously; If recognize user speech input endpoint or input overtimely, stop receiving speech data, and complete all encoded speech datas by data coding module; Wherein, recognize user speech input endpoint and refer to that user inputs a dead time after complete word or sentence and meets end points condition for identification, end points condition for identification can be set according to different situations, such as 5s, 10s etc.; If recognize user speech input endpoint or input overtimely, be defaulted as this phonetic entry complete, otherwise user can proceed phonetic entry.
Data coding module: for all speech datas that receive are encoded, and send encoded speech data by data transmission blocks;
Data transmission blocks: for all speech datas after coding are packed and asked to send to server by HTTP;
Data resolution module: the identification speech text returning for reception server is also resolved steering order;
Command execution module: for carrying out operation corresponding to speech text content according to steering order type command mobile terminal; Wherein, steering order type comprises plug-in application type, local function type, popular type of site and search-type etc., wherein, if steering order type is plug-in application type, open corresponding application according to speech text content, as " music plug-in unit ", " Quick Response Code " etc.; If steering order type is local function type, call corresponding local function according to speech text content, as " opening bookmark ", " emptying all data " etc.; If steering order type is popular type of site, open corresponding webpage according to speech text content, as " Tengxun's homepage ", " Sina website "; Other speech texts that do not belong to above-mentioned three types, the present invention all thinks search-type, directly uses result corresponding to mobile terminal current search engine search speech text; Concrete key data structure is
typedef?enum?{
VoiceControlCmdUnkonwn?=?0x0,
VoiceControlCmdSerach,
VoiceControlCmdPlugin,
VoiceControlCmdLocalApp,
VoiceControlCmdWebSite
VoiceControlCmd; // voice control type
typedef?struct?{
Char * text; // speech recognition text
VoiceControlCmd controlCmd; // control type
Data reception module: the encoded speech data sending for mobile terminal receive, encoded speech data is decoded and denoising;
Data Matching module: for decoded speech data is mated with the training sample result of server, determine suitable identification speech text, according to the additional steering order of speech text content; Wherein, the present invention, by the speech data of user's input is uploaded onto the server and mated with the training sample in server, makes speech recognition more accurate, can greatly avoid the situation of mobile terminal sound identification error or None-identified; Steering order is that cloud server is in determining identification speech text, according to the particular content of speech text, be mapped to the conventional operational instruction that client is supported, user side can carry out corresponding operation according to the steering order type command mobile terminal of speech text, for example, play music, send note, make a phone call, open webpage etc., have some situation of mistake identification, but along with the use result of a large number of users is constantly revised, it is accurate that this instruction also can be tending towards.
Data are returned to module: for speech text and steering order are returned to mobile terminal;
The speech data that phonetic order control method provided by the invention and system receive mobile terminal sends to server, server is by mating speech data with the training sample in server, determine that returning to mobile terminal after suitable identification speech text carries out corresponding operation again, make speech recognition more accurate, can greatly avoid the situation of mobile terminal sound identification error or None-identified, improve user's experience; In addition, the present invention classifies to the operating function of mobile terminal by the additional steering order of identification speech text content, improves the degree of accuracy of phonetic order.
In sum; although the present invention discloses as above with preferred embodiment; but above preferred embodiment is not in order to limit the present invention; those of ordinary skill in the art; without departing from the spirit and scope of the present invention; all can do various changes and retouching, the scope that therefore protection scope of the present invention defines with claim is as the criterion.
Claims (13)
1. a phonetic order control method, comprising:
The speech data packing that mobile terminal is received sends to server;
Speech data is mated with the training sample in server, determine suitable identification speech text, and identification speech text is returned to mobile terminal;
The identification speech text contents command mobile terminal returning according to server is carried out corresponding operation.
2. phonetic order control method according to claim 1, it is characterized in that, before sending to server step, the described speech data packing that mobile terminal is received also comprises: enter intelligent sound identification interface by intelligent sound entrance, the input of wait user speech, and judge efficient voice input within effective time, whether to be detected, if efficient voice input do not detected within effective time, finish this phonetic entry; If efficient voice input detected within effective time, receive user speech.
3. phonetic order control method according to claim 2, it is characterized in that, in described reception user speech step, also comprise: judge whether to recognize user speech input endpoint or input overtime, if not recognizing user speech input endpoint or input does not have overtime, the speech data receiving is encoded, and continue to receive next section of user speech; If recognize user speech input endpoint or input overtimely, stop receiving speech data, complete all encoded speech datas.
4. phonetic order control method according to claim 3, it is characterized in that, described, speech data is mated with the training sample in server, determine suitable identification speech text, and also comprise before identification speech text is returned to mobile terminal step: cloud server receives encoded speech data, encoded speech data is decoded and denoising.
5. phonetic order control method according to claim 1, it is characterized in that, described, speech data is mated with the training sample in server, determine suitable identification speech text, and identification speech text is returned in mobile terminal step and also comprised: according to the additional steering order of speech text content.
6. phonetic order control method according to claim 1 or 5, it is characterized in that, before carrying out corresponding operation steps, the described identification speech text contents command mobile terminal returning according to server also comprises: receive identification speech text and resolve steering order, carry out operation corresponding to speech text content according to steering order type command mobile terminal, wherein, described steering order type comprises plug-in application type, local function type, popular type of site and search-type.
7. a phonetic order control system, is characterized in that, comprises mobile terminal and server, and described mobile terminal comprises data transmission blocks and command execution module, and described server comprises that Data Matching module and data return to module,
Data transmission blocks: for the speech data packing of reception is sent to server;
Command execution module: carry out corresponding operation for the identification speech text contents command mobile terminal returning according to server;
Data Matching module: mate with the training sample of server for the speech data that mobile terminal is sent, determine suitable identification speech text;
Data are returned to module: for identification speech text is returned to mobile terminal.
8. phonetic order control system according to claim 7, is characterized in that, described mobile terminal also comprises
Interface enters module: for enter intelligent sound identification interface by intelligent sound entrance;
Speech detection module: for waiting for user speech input, and judge efficient voice input whether detected within effective time, if efficient voice input do not detected within effective time, finish this phonetic entry; If efficient voice input detected within effective time, receive speech data by phonetic incepting module.
9. phonetic order control system according to claim 8, is characterized in that, described mobile terminal also comprises
Phonetic incepting module: for receiving user speech, and judge whether to recognize user speech input endpoint or input overtime, if not recognizing user speech input endpoint or input does not have overtime, by data coding module, the speech data receiving is encoded, phonetic incepting module continues to receive next section of user speech simultaneously; If recognize user speech input endpoint or input overtimely, stop receiving speech data, and complete all encoded speech datas by data coding module;
Data coding module: for all speech datas that receive are encoded, and send encoded speech data by data transmission blocks.
10. phonetic order control system according to claim 9, is characterized in that, described server also comprises data reception module: the encoded speech data sending for mobile terminal receive, encoded speech data is decoded and denoising.
11. phonetic order control system according to claim 7, is characterized in that, described Data Matching module is also for determining after suitable identification speech text according to the additional steering order of speech text content.
12. according to the phonetic order control system described in claim 7 or 11, it is characterized in that, described mobile terminal also comprises data resolution module: the identification speech text returning for reception server is also resolved steering order, and described command execution module is carried out operation corresponding to speech text content according to steering order type command mobile terminal.
13. phonetic order control system according to claim 12, is characterized in that, described steering order type comprises plug-in application type, local function type, popular type of site and search-type.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210478777.XA CN103839549A (en) | 2012-11-22 | 2012-11-22 | Voice instruction control method and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210478777.XA CN103839549A (en) | 2012-11-22 | 2012-11-22 | Voice instruction control method and system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103839549A true CN103839549A (en) | 2014-06-04 |
Family
ID=50802981
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210478777.XA Pending CN103839549A (en) | 2012-11-22 | 2012-11-22 | Voice instruction control method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103839549A (en) |
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104183237A (en) * | 2014-09-04 | 2014-12-03 | 百度在线网络技术(北京)有限公司 | Speech processing method and device for portable terminal |
CN104268195A (en) * | 2014-09-19 | 2015-01-07 | 三星电子(中国)研发中心 | Method and device for processing local resources in terminal |
CN105094807A (en) * | 2015-06-25 | 2015-11-25 | 三星电子(中国)研发中心 | Method and device for implementing voice control |
CN105609118A (en) * | 2015-12-30 | 2016-05-25 | 生迪智慧科技有限公司 | Speech detection method and device |
CN105788594A (en) * | 2016-03-01 | 2016-07-20 | 江西掌中无限网络科技股份有限公司 | Voice and meaning identification method and system of flow-free APP |
WO2016112634A1 (en) * | 2015-01-12 | 2016-07-21 | 芋头科技(杭州)有限公司 | Voice recognition system and method of robot system |
CN105827878A (en) * | 2015-01-04 | 2016-08-03 | 中国移动通信集团公司 | Voice information conversion method and voice conversion gateway |
CN106504753A (en) * | 2015-09-07 | 2017-03-15 | 上海隆通网络系统有限公司 | A kind of audio recognition method and system in IT operation management system |
CN106847284A (en) * | 2017-03-09 | 2017-06-13 | 深圳市八圈科技有限公司 | Electronic equipment, computer-readable recording medium and voice interactive method |
CN107086037A (en) * | 2017-03-17 | 2017-08-22 | 上海庆科信息技术有限公司 | A kind of voice interactive method of embedded device, device and embedded device |
CN107146618A (en) * | 2017-06-16 | 2017-09-08 | 北京云知声信息技术有限公司 | Method of speech processing and device |
CN107153499A (en) * | 2016-03-04 | 2017-09-12 | 株式会社理光 | The Voice command of interactive whiteboard equipment |
CN107919130A (en) * | 2017-11-06 | 2018-04-17 | 百度在线网络技术(北京)有限公司 | Method of speech processing and device based on high in the clouds |
CN108111696A (en) * | 2017-12-29 | 2018-06-01 | 深圳市酷达通讯有限公司 | A kind of wireless fixed telephone |
CN108986811A (en) * | 2018-08-31 | 2018-12-11 | 北京新能源汽车股份有限公司 | A kind of detection method of speech recognition, device and equipment |
CN109036430A (en) * | 2018-09-29 | 2018-12-18 | 芜湖星途机器人科技有限公司 | Voice control terminal |
CN109120774A (en) * | 2018-06-29 | 2019-01-01 | 深圳市九洲电器有限公司 | Terminal applies voice control method and system |
CN109118747A (en) * | 2017-06-23 | 2019-01-01 | 中兴通讯股份有限公司 | Infrared equipment control method, system, storage medium and computer equipment |
CN109474843A (en) * | 2017-09-08 | 2019-03-15 | 腾讯科技(深圳)有限公司 | The method of speech control terminal, client, server |
CN111225261A (en) * | 2018-11-27 | 2020-06-02 | Lg电子株式会社 | Multimedia device for processing voice command and control method thereof |
CN111261153A (en) * | 2018-12-03 | 2020-06-09 | 现代自动车株式会社 | Vehicle voice command processing device and method |
CN111462738A (en) * | 2019-01-18 | 2020-07-28 | 阿里巴巴集团控股有限公司 | Voice recognition method and device |
CN112565849A (en) * | 2019-09-26 | 2021-03-26 | 深圳市茁壮网络股份有限公司 | Voice control method of digital television, television control system and storage medium |
CN112789561A (en) * | 2018-10-15 | 2021-05-11 | 美的集团股份有限公司 | System and method for customizing a portable natural language processing interface for an appliance |
Citations (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1356688A (en) * | 2000-11-27 | 2002-07-03 | 佳能株式会社 | Speech recognition system, server and client, and control method thereof |
US20020193998A1 (en) * | 2001-05-31 | 2002-12-19 | Dvorak Joseph L. | Virtual speech interface system and method of using same |
CN1627672A (en) * | 2003-05-02 | 2005-06-15 | 索尼株式会社 | Network system, electronic equipment terminal, server apparatus and method for distributing and reproducing the contents |
CN1735027A (en) * | 2004-08-13 | 2006-02-15 | 上海赢思软件技术有限公司 | Chat robot system |
KR20060034337A (en) * | 2004-10-18 | 2006-04-24 | 주식회사 팬택 | Mobile phone and server for managing home-network by voice, and system and method for home-network management using the same |
CN101030994A (en) * | 2007-04-11 | 2007-09-05 | 华为技术有限公司 | Speech discriminating method system and server |
CN101360118A (en) * | 2007-08-02 | 2009-02-04 | 广东新支点技术服务有限公司 | Method and protocol suitable for mobile terminal multimedia file sharing and searching |
CN101420543A (en) * | 2008-12-05 | 2009-04-29 | 天津三星电子显示器有限公司 | Method for voice controlling television and television therewith |
CN101437039A (en) * | 2007-11-15 | 2009-05-20 | 华为技术有限公司 | Mobile searching method, system and equipment |
CN101599270A (en) * | 2008-06-02 | 2009-12-09 | 海尔集团公司 | Voice server and voice control method |
US20100088100A1 (en) * | 2008-10-02 | 2010-04-08 | Lindahl Aram M | Electronic devices with voice command and contextual data processing capabilities |
CN101715018A (en) * | 2009-11-03 | 2010-05-26 | 沈阳晨讯希姆通科技有限公司 | Voice control method of functions of mobile phone |
CN102270213A (en) * | 2011-04-20 | 2011-12-07 | 深圳市凯立德科技股份有限公司 | Searching method and device for interesting points of navigation system, and location service terminal |
CN102316162A (en) * | 2011-09-01 | 2012-01-11 | 深圳市子栋科技有限公司 | Vehicle remote control method based on voice command, apparatus and system thereof |
CN102316361A (en) * | 2011-07-04 | 2012-01-11 | 深圳市子栋科技有限公司 | Audio-frequency / video-frequency on demand method based on natural speech recognition and system thereof |
US20120030712A1 (en) * | 2010-08-02 | 2012-02-02 | At&T Intellectual Property I, L.P. | Network-integrated remote control with voice activation |
CN102497481A (en) * | 2011-12-02 | 2012-06-13 | 深圳市车音网科技有限公司 | Method, device and system for voice dialing |
CN102497391A (en) * | 2011-11-21 | 2012-06-13 | 宇龙计算机通信科技(深圳)有限公司 | Server, mobile terminal and prompt method |
CN102541574A (en) * | 2010-12-13 | 2012-07-04 | 鸿富锦精密工业(深圳)有限公司 | Application program opening system and method |
CN102541505A (en) * | 2011-01-04 | 2012-07-04 | 中国移动通信集团公司 | Voice input method and system thereof |
CN102571882A (en) * | 2010-12-31 | 2012-07-11 | 上海博泰悦臻电子设备制造有限公司 | Network-based voice reminding method and system |
CN102591932A (en) * | 2011-12-23 | 2012-07-18 | 优视科技有限公司 | Voice search method, voice search system, mobile terminal and transfer server |
CN102629246A (en) * | 2012-02-10 | 2012-08-08 | 北京百纳信息技术有限公司 | Server used for recognizing browser voice commands and browser voice command recognition system |
CN102650960A (en) * | 2012-03-31 | 2012-08-29 | 奇智软件(北京)有限公司 | Method and device for eliminating faults of terminal equipment |
CN102724309A (en) * | 2012-06-14 | 2012-10-10 | 广东好帮手电子科技股份有限公司 | Vehicular voice network music system and control method thereof |
CN102741146A (en) * | 2010-02-23 | 2012-10-17 | 三菱电机株式会社 | Elevator device |
CN102760431A (en) * | 2012-07-12 | 2012-10-31 | 上海语联信息技术有限公司 | Intelligentized voice recognition system |
CN102792320A (en) * | 2010-01-18 | 2012-11-21 | 苹果公司 | Intelligent automated assistant |
-
2012
- 2012-11-22 CN CN201210478777.XA patent/CN103839549A/en active Pending
Patent Citations (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1356688A (en) * | 2000-11-27 | 2002-07-03 | 佳能株式会社 | Speech recognition system, server and client, and control method thereof |
US20020193998A1 (en) * | 2001-05-31 | 2002-12-19 | Dvorak Joseph L. | Virtual speech interface system and method of using same |
CN1627672A (en) * | 2003-05-02 | 2005-06-15 | 索尼株式会社 | Network system, electronic equipment terminal, server apparatus and method for distributing and reproducing the contents |
CN1735027A (en) * | 2004-08-13 | 2006-02-15 | 上海赢思软件技术有限公司 | Chat robot system |
KR20060034337A (en) * | 2004-10-18 | 2006-04-24 | 주식회사 팬택 | Mobile phone and server for managing home-network by voice, and system and method for home-network management using the same |
CN101030994A (en) * | 2007-04-11 | 2007-09-05 | 华为技术有限公司 | Speech discriminating method system and server |
CN101360118A (en) * | 2007-08-02 | 2009-02-04 | 广东新支点技术服务有限公司 | Method and protocol suitable for mobile terminal multimedia file sharing and searching |
CN101437039A (en) * | 2007-11-15 | 2009-05-20 | 华为技术有限公司 | Mobile searching method, system and equipment |
CN101599270A (en) * | 2008-06-02 | 2009-12-09 | 海尔集团公司 | Voice server and voice control method |
US20100088100A1 (en) * | 2008-10-02 | 2010-04-08 | Lindahl Aram M | Electronic devices with voice command and contextual data processing capabilities |
CN101420543A (en) * | 2008-12-05 | 2009-04-29 | 天津三星电子显示器有限公司 | Method for voice controlling television and television therewith |
CN101715018A (en) * | 2009-11-03 | 2010-05-26 | 沈阳晨讯希姆通科技有限公司 | Voice control method of functions of mobile phone |
CN102792320A (en) * | 2010-01-18 | 2012-11-21 | 苹果公司 | Intelligent automated assistant |
CN102741146A (en) * | 2010-02-23 | 2012-10-17 | 三菱电机株式会社 | Elevator device |
US20120030712A1 (en) * | 2010-08-02 | 2012-02-02 | At&T Intellectual Property I, L.P. | Network-integrated remote control with voice activation |
CN102541574A (en) * | 2010-12-13 | 2012-07-04 | 鸿富锦精密工业(深圳)有限公司 | Application program opening system and method |
CN102571882A (en) * | 2010-12-31 | 2012-07-11 | 上海博泰悦臻电子设备制造有限公司 | Network-based voice reminding method and system |
CN102541505A (en) * | 2011-01-04 | 2012-07-04 | 中国移动通信集团公司 | Voice input method and system thereof |
CN102270213A (en) * | 2011-04-20 | 2011-12-07 | 深圳市凯立德科技股份有限公司 | Searching method and device for interesting points of navigation system, and location service terminal |
CN102316361A (en) * | 2011-07-04 | 2012-01-11 | 深圳市子栋科技有限公司 | Audio-frequency / video-frequency on demand method based on natural speech recognition and system thereof |
CN102316162A (en) * | 2011-09-01 | 2012-01-11 | 深圳市子栋科技有限公司 | Vehicle remote control method based on voice command, apparatus and system thereof |
CN102497391A (en) * | 2011-11-21 | 2012-06-13 | 宇龙计算机通信科技(深圳)有限公司 | Server, mobile terminal and prompt method |
CN102497481A (en) * | 2011-12-02 | 2012-06-13 | 深圳市车音网科技有限公司 | Method, device and system for voice dialing |
CN102591932A (en) * | 2011-12-23 | 2012-07-18 | 优视科技有限公司 | Voice search method, voice search system, mobile terminal and transfer server |
CN102629246A (en) * | 2012-02-10 | 2012-08-08 | 北京百纳信息技术有限公司 | Server used for recognizing browser voice commands and browser voice command recognition system |
CN102650960A (en) * | 2012-03-31 | 2012-08-29 | 奇智软件(北京)有限公司 | Method and device for eliminating faults of terminal equipment |
CN102724309A (en) * | 2012-06-14 | 2012-10-10 | 广东好帮手电子科技股份有限公司 | Vehicular voice network music system and control method thereof |
CN102760431A (en) * | 2012-07-12 | 2012-10-31 | 上海语联信息技术有限公司 | Intelligentized voice recognition system |
Cited By (34)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104183237A (en) * | 2014-09-04 | 2014-12-03 | 百度在线网络技术(北京)有限公司 | Speech processing method and device for portable terminal |
CN104183237B (en) * | 2014-09-04 | 2017-10-31 | 百度在线网络技术(北京)有限公司 | Method of speech processing and device for portable terminal |
CN104268195A (en) * | 2014-09-19 | 2015-01-07 | 三星电子(中国)研发中心 | Method and device for processing local resources in terminal |
CN105827878B (en) * | 2015-01-04 | 2019-06-25 | 中国移动通信集团公司 | Voice messaging conversion method and voice transfer gateway |
CN105827878A (en) * | 2015-01-04 | 2016-08-03 | 中国移动通信集团公司 | Voice information conversion method and voice conversion gateway |
JP2018507434A (en) * | 2015-01-12 | 2018-03-15 | ユウトウ・テクノロジー(ハンジョウ)・カンパニー・リミテッド | Voice identification system and method for robot system |
WO2016112634A1 (en) * | 2015-01-12 | 2016-07-21 | 芋头科技(杭州)有限公司 | Voice recognition system and method of robot system |
CN105845135A (en) * | 2015-01-12 | 2016-08-10 | 芋头科技(杭州)有限公司 | Sound recognition system and method for robot system |
CN105094807A (en) * | 2015-06-25 | 2015-11-25 | 三星电子(中国)研发中心 | Method and device for implementing voice control |
CN106504753A (en) * | 2015-09-07 | 2017-03-15 | 上海隆通网络系统有限公司 | A kind of audio recognition method and system in IT operation management system |
CN105609118A (en) * | 2015-12-30 | 2016-05-25 | 生迪智慧科技有限公司 | Speech detection method and device |
CN105609118B (en) * | 2015-12-30 | 2020-02-07 | 生迪智慧科技有限公司 | Voice detection method and device |
CN105788594A (en) * | 2016-03-01 | 2016-07-20 | 江西掌中无限网络科技股份有限公司 | Voice and meaning identification method and system of flow-free APP |
CN107153499A (en) * | 2016-03-04 | 2017-09-12 | 株式会社理光 | The Voice command of interactive whiteboard equipment |
CN106847284A (en) * | 2017-03-09 | 2017-06-13 | 深圳市八圈科技有限公司 | Electronic equipment, computer-readable recording medium and voice interactive method |
CN107086037A (en) * | 2017-03-17 | 2017-08-22 | 上海庆科信息技术有限公司 | A kind of voice interactive method of embedded device, device and embedded device |
CN107146618A (en) * | 2017-06-16 | 2017-09-08 | 北京云知声信息技术有限公司 | Method of speech processing and device |
CN109118747A (en) * | 2017-06-23 | 2019-01-01 | 中兴通讯股份有限公司 | Infrared equipment control method, system, storage medium and computer equipment |
CN109474843A (en) * | 2017-09-08 | 2019-03-15 | 腾讯科技(深圳)有限公司 | The method of speech control terminal, client, server |
CN107919130A (en) * | 2017-11-06 | 2018-04-17 | 百度在线网络技术(北京)有限公司 | Method of speech processing and device based on high in the clouds |
CN107919130B (en) * | 2017-11-06 | 2021-12-17 | 百度在线网络技术(北京)有限公司 | Cloud-based voice processing method and device |
US11024332B2 (en) | 2017-11-06 | 2021-06-01 | Baidu Online Network Technology (Beijing) Co., Ltd. | Cloud-based speech processing method and apparatus |
CN108111696A (en) * | 2017-12-29 | 2018-06-01 | 深圳市酷达通讯有限公司 | A kind of wireless fixed telephone |
CN109120774A (en) * | 2018-06-29 | 2019-01-01 | 深圳市九洲电器有限公司 | Terminal applies voice control method and system |
CN108986811A (en) * | 2018-08-31 | 2018-12-11 | 北京新能源汽车股份有限公司 | A kind of detection method of speech recognition, device and equipment |
CN109036430A (en) * | 2018-09-29 | 2018-12-18 | 芜湖星途机器人科技有限公司 | Voice control terminal |
CN112789561A (en) * | 2018-10-15 | 2021-05-11 | 美的集团股份有限公司 | System and method for customizing a portable natural language processing interface for an appliance |
CN112789561B (en) * | 2018-10-15 | 2022-04-05 | 美的集团股份有限公司 | System and method for customizing a portable natural language processing interface for an appliance |
CN111225261A (en) * | 2018-11-27 | 2020-06-02 | Lg电子株式会社 | Multimedia device for processing voice command and control method thereof |
CN111225261B (en) * | 2018-11-27 | 2021-11-26 | Lg电子株式会社 | Multimedia device for processing voice command and control method thereof |
CN111261153A (en) * | 2018-12-03 | 2020-06-09 | 现代自动车株式会社 | Vehicle voice command processing device and method |
CN111261153B (en) * | 2018-12-03 | 2023-12-19 | 现代自动车株式会社 | Vehicle voice command processing device and method |
CN111462738A (en) * | 2019-01-18 | 2020-07-28 | 阿里巴巴集团控股有限公司 | Voice recognition method and device |
CN112565849A (en) * | 2019-09-26 | 2021-03-26 | 深圳市茁壮网络股份有限公司 | Voice control method of digital television, television control system and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103839549A (en) | Voice instruction control method and system | |
US20140379334A1 (en) | Natural language understanding automatic speech recognition post processing | |
KR20200108775A (en) | Training corpus generating method, apparatus, device and storage medium | |
CN108710704B (en) | Method and device for determining conversation state, electronic equipment and storage medium | |
KR102046486B1 (en) | Information inputting method | |
CN101221576B (en) | Input method and device capable of implementing automatic translation | |
KR20190021338A (en) | Subsequent voice query prediction | |
CN109559748B (en) | A kind of method for recognizing semantics, device, smart machine and storage medium | |
CN111402861B (en) | Voice recognition method, device, equipment and storage medium | |
CN106372054B (en) | Method and device for multi-language semantic analysis | |
CN104575499B (en) | Voice control method of mobile terminal and mobile terminal | |
RU2011130550A (en) | LANGUAGE-BASED MARKING SELECTION AND USE OF RECOGNITORS FOR PROCESSING PROMOTION | |
CN109785829B (en) | Customer service assisting method and system based on voice control | |
CN109256125B (en) | Off-line voice recognition method and device and storage medium | |
CN105512182A (en) | Speech control method and intelligent television | |
WO2020024620A1 (en) | Voice information processing method and device, apparatus, and storage medium | |
CN112669842A (en) | Man-machine conversation control method, device, computer equipment and storage medium | |
CN110991179A (en) | Semantic analysis method based on electric power professional term | |
CN111933149A (en) | Voice interaction method, wearable device, terminal and voice interaction system | |
CN112286485B (en) | Method and device for controlling application through voice, electronic equipment and storage medium | |
CN110808031A (en) | Voice recognition method and device and computer equipment | |
CN114299955B (en) | Voice interaction method and device, electronic equipment and storage medium | |
CN112035648B (en) | User data processing method and device and electronic equipment | |
CN114171016A (en) | Voice interaction method and device, electronic equipment and storage medium | |
CN114781359A (en) | Text error correction method and device, computer equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20140604 |