CN105261366A - Voice identification method, voice engine and terminal - Google Patents

Voice identification method, voice engine and terminal Download PDF

Info

Publication number
CN105261366A
CN105261366A CN201510548423.1A CN201510548423A CN105261366A CN 105261366 A CN105261366 A CN 105261366A CN 201510548423 A CN201510548423 A CN 201510548423A CN 105261366 A CN105261366 A CN 105261366A
Authority
CN
China
Prior art keywords
local
speech engine
phonetic order
recognition result
instruction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510548423.1A
Other languages
Chinese (zh)
Other versions
CN105261366B (en
Inventor
王维平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nubia Technology Co Ltd
Original Assignee
Nubia Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nubia Technology Co Ltd filed Critical Nubia Technology Co Ltd
Priority to CN201510548423.1A priority Critical patent/CN105261366B/en
Publication of CN105261366A publication Critical patent/CN105261366A/en
Application granted granted Critical
Publication of CN105261366B publication Critical patent/CN105261366B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The invention discloses a voice identification method, a voice engine and a terminal. The voice identification method comprises: a voice instruction is received; whether a network is normal is determined; if so, the voice instruction is sent to a cloud voice engine to carry out identification; and an identification result returned by the cloud voice engine is received and a local voice base of a local voice engine is updated according to the identification result. With the voice identification method, when the network is normal, the cloud voice engine is used for identifying the voice instruction and the voice instruction is added to the local grammar base; and when the network is abnormal, the local grammar base with the added voice instruction is used for carrying out voice instruction identification. Therefore, the identification accuracy and efficiency of the local voice engine are improved.

Description

Audio recognition method, speech engine and terminal
Technical field
The present invention relates to field of speech recognition, more particularly, relate to a kind of audio recognition method, speech engine and terminal.
Background technology
Along with the significant increase of speech recognition technology, people more and more bring into use speech recognition technology, voice assistant, intelligent robot, Smart Home etc. on such as mobile phone.
In order to improve the accuracy of speech recognition, in prior art, corresponding speech engine is set beyond the clouds.The speech engine in high in the clouds can by analyzing large data and utilizing various resource thus identify instruction and the meaning of user accurately.Such as, the siri of apple is exactly the speech engine using high in the clouds completely, and its recognition accuracy is very high.
When utilizing high in the clouds to carry out speech recognition, local device needs to be connected with cloud device by network, mutual to realize.When there is not having network or the slow situation of network speed, because local device cannot be connected to cloud device, thus the speech identifying function in high in the clouds normally can not be used.This causes very large puzzlement to user.
On the other hand, also speech recognition is realized by the local speech engine of local device.It can overcome does not have network or the inconvenience that situation causes to user such as network speed is slow, but the restriction of the resources such as storage space, CPU and internal memory is limited to due to local speech engine, very simple phonetic order can only be identified, the instruction that None-identified is comparatively complicated, there is significant limitation, the demand of user can not be met.
Therefore, prior art existing defects, needs to improve.
Summary of the invention
The technical problem to be solved in the present invention is, for the above-mentioned defect of prior art, provides a kind of audio recognition method, speech engine and terminal.
The technical solution adopted for the present invention to solve the technical problems is: construct a kind of audio recognition method, comprising:
Receive phonetic order;
Judge that whether current network is normal, if normally, then send to high in the clouds speech engine to identify described phonetic order;
Receive the recognition result that high in the clouds speech engine returns, and according to described recognition result, the local syntax library of local speech engine is upgraded.
Wherein, the described local syntax library to local speech engine carries out renewal and comprises:
According to recognition result, carry out instruction classification;
Determine whether the instruction of preset kind based on instruction classification result, if so, then syntax parsing is carried out to described phonetic order, and extract grammer key message;
Judge whether the grammer key message of described extraction meets pre-conditioned, if meet, then described phonetic order is increased to described local syntax library according to after preset format conversion.
Wherein, describedly pre-conditionedly to comprise: the frequency of utilization of grammer exceedes preset value;
Describedly judge whether the grammer key message of described extraction meets pre-conditioned comprising:
According to the grammer key message of described extraction, judge its grammer whether belonging to frequency of utilization and exceed preset value.
Wherein, described method also comprises:
After receiving phonetic order, carry out speech recognition to obtain recognition result according to described local syntax library.
Wherein, described method also comprises: perform corresponding operation according to described recognition result.
Wherein, described method also comprises:
Preset a time period;
In described preset time period, according to the recognition result that high in the clouds speech engine returns, the local syntax library of described local speech engine is upgraded.
Wherein, described method also comprises:
When described preset time period then, from described local syntax library, delete upgraded phonetic order.
On the other hand, a kind of speech engine is provided, comprises:
Speech reception module, for receiving the phonetic order of user's input;
Judge module, for judging that whether current network is normal, if normally, then sends to high in the clouds speech engine to identify described phonetic order;
Receiver module, for receiving the recognition result that high in the clouds speech engine returns;
Update module, for according to described recognition result, upgrades syntax library.
Wherein, described update module comprises:
Taxon, for according to recognition result, carries out instruction classification;
Resolution unit, for determining whether the instruction of preset kind based on instruction classification result, if so, then carries out syntax parsing to described phonetic order, and extracts grammer key message;
Judging unit, for judging whether the grammer key message of described extraction meets pre-conditioned, if meet, is then increased to described syntax library by described phonetic order according to after preset format conversion.
The third aspect, provides a kind of terminal, comprises above-mentioned speech engine.
Implement audio recognition method of the present invention, speech engine and terminal, there is following beneficial effect: when network is normal, utilize high in the clouds speech engine to carry out the identification of phonetic order, and phonetic order is increased to local syntax library; When Network Abnormal, then utilize the local syntax library adding phonetic order to carry out phonetic order identification, improve recognition correct rate and the efficiency of local speech engine.By upgrading the study of local syntax library, increasing the phonetic order scope that local speech engine can identify, improving Consumer's Experience.
Accompanying drawing explanation
Below in conjunction with drawings and Examples, the invention will be further described, in accompanying drawing:
Fig. 1 is the process flow diagram of the audio recognition method of the embodiment of the present invention;
Fig. 2 is the identification process figure of the high in the clouds speech engine of the embodiment of the present invention;
Fig. 3 is the detail flowchart of the step S3 of the audio recognition method of the embodiment of the present invention;
Fig. 4 is the schematic flow sheet that the local speech engine of the embodiment of the present invention carries out identifying;
Fig. 5 is the process flow diagram of the audio recognition method of another embodiment of the present invention;
Fig. 6 is the structural representation of the speech engine of the embodiment of the present invention;
Fig. 7 is the schematic diagram being carried out speech recognition by the speech engine of the embodiment of the present invention;
Fig. 8 is the identifying schematic diagram of the intelligent learning module of the embodiment of the present invention;
Fig. 9 is the terminal of the embodiment of the present invention and the communication scheme of high in the clouds speech engine.
Embodiment
The circumscribed problem of network condition and local speech engine is limited to for solving audio recognition method of the prior art, the embodiment of the present invention provides a kind of audio recognition method, speech engine and terminal, by the recognition result of high in the clouds speech engine is increased to local syntax library, carry out study to local syntax library to upgrade, increase the phonetic order scope that local speech engine can identify, achieve the recognition correct rate and efficiency that improve local speech engine, speech recognition is made no longer to be limited to the limitation of network condition and local speech engine, improve the technique effect of Consumer's Experience.
The Integral Thought that the embodiment of the present invention solves the problems of the technologies described above is: receive phonetic order; Judge that whether current network is normal, if normally, then sent to by phonetic order high in the clouds speech engine to identify; Receive the recognition result that high in the clouds speech engine returns, and according to recognition result, the local syntax library of local speech engine is upgraded; If Network Abnormal, then local speech engine is utilized to carry out speech recognition.
In order to there be understanding clearly to technical characteristic of the present invention, object and effect, now contrast accompanying drawing and describe the specific embodiment of the present invention in detail.
It is the process flow diagram of the audio recognition method of the embodiment of the present invention see Fig. 1.The audio recognition method of the embodiment of the present invention comprises the following steps:
S1, reception phonetic order.
Concrete, the collection of phonetic order is carried out by voice acquisition device (such as, microphone).Phonetic order is that user sends, and such as, phones Xiao Ming, connects wifi, opens camera etc.
S2, judge that current network is whether normal, if normally, then send to high in the clouds speech engine to identify described phonetic order.
Concrete, by being connected to high in the clouds speech engine for networks such as wifi, 3G, 4G, GPRS.Can judge that whether network is normal by the signal quality etc. detecting interconnection network.If disconnection occurs network or network speed crosses the situations such as slow, then judged result is network is abnormal.
High in the clouds speech engine is a server in fact, and it has very strong computing power and syntax library.In an embodiment of the present invention, if network is normal, then carry out phonetic order identification by high in the clouds speech engine, thus, improve the accuracy rate identified.Being the identification process figure of the high in the clouds speech engine of the embodiment of the present invention see Fig. 2, when receiving phonetic order, after the steps such as grammatical analysis, large data analysis, feature extraction, pattern match, arithmetic analysis, result examination, exporting recognition result successively.The identifying that should be understood that high in the clouds speech engine is prior art, and the embodiment of the present invention is not described in detail this.
The recognition result that S3, reception high in the clouds speech engine return, and according to recognition result, the local syntax library of local speech engine is upgraded.
Concrete, according to the recognition result that high in the clouds speech engine returns, identifiable design goes out the phonetic order of user, performs corresponding operation thus.Such as, phonetic order " phones Xiao Ming ", according to the recognition result returned, can carry out the operation of dialling to " Xiao Ming ".
In an embodiment of the present invention, except performing except corresponding operation according to recognition result, also perform the operation upgrading local syntax library, the phonetic order identified by this is increased to local syntax library, thus, when to receive identical phonetic order next time, accurate identification can be realized by local syntax library.Should understand, when phonetic order is increased to local syntax library, need to carry out according to the preset format of local syntax library, such as, only the subject of storaged voice instruction and predicate are (such as, I will phone) or only store the modes such as predicate (such as, making a phone call) and store.
See Fig. 3, above-mentioned steps S3 specifically comprises:
S31, according to recognition result, carry out instruction classification.
In an embodiment of the present invention, instruction type comprises: make a phone call, inquire about, search for, take pictures, clear up mobile phone etc.Thus, recognition result corresponding for each phonetic order is unique an instruction type can be divided into.By instruction classification, better can realize identifying, and also provide reference to the need of being updated to local syntax library.Such as, phonetic order " please open camera ", the recognition result of " startup is taken pictures " is all taken pictures for opening camera, this instruction type that it all belongs to " taking pictures ".
S32, determine whether the instruction of preset kind based on instruction classification result, if so, then syntax parsing is carried out to phonetic order, and extract grammer key message.
In an embodiment of the present invention, the instruction of preset kind can be the instruction that user often uses or the instruction arranged voluntarily by user, and such as, the instruction of preset kind can comprise: make a phone call, take pictures, inquiry etc.In order to raise the efficiency, after carrying out instruction classification, judge that whether it is the instruction of preset kind, if not, then flow process terminates no longer to carry out follow-up process, namely instruction is not increased to the operation of local syntax library, thus, the instruction being increased to local syntax library can be made to be the instruction useful to user, not only can to save storage efficiency and also can improve Consumer's Experience.Such as, user seldom uses counter, and the instruction of preset kind does not comprise opens counter, when phonetic order is for " opening counter ", it does not belong to the instruction of preset kind, therefore, no longer carries out follow-up syntax parsing, extracts the operations such as grammer key message.
Syntax parsing and extraction key message are the filtrations to phonetic order, such as, phonetic order " I thinks opened bluetooth function ", " I wants opened bluetooth function ", " opening bluetooth ", the recognition result of " opened bluetooth function ", " please open bluetooth " etc. is all the operation of opening bluetooth.Therefore, by syntax parsing, identical grammer key message can be drawn into and " open bluetooth ".By extracting grammer key message, it is more accurate to make follow-up judgement and be increased to the operation of local syntax library.
Should be understood that syntax parsing realizes by sentence trunk extraction method, such as, realize syntax parsing by the analysis of " SVO is determined shape and mended ", if phonetic order is " I wants opened bluetooth function ", then subject is " I ", predicate is " opening ", and object is " bluetooth ".In addition, the mode also by other realizes syntax parsing, does not invent and is not restricted this.
Extracting key message is based on the result of syntax parsing, if syntax parsing is for realize by sentence trunk extraction method, then predicate wherein and object can be extracted as key message, for above-mentioned example, the key message of extraction is " opening bluetooth ".In addition, also SVO or object etc. can be extracted as key message, the present invention is not restricted this.According to pairing comparision
S33, judge whether the grammer key message of described extraction meets pre-conditioned, if meet, then described phonetic order is increased to described local syntax library according to after preset format conversion.
After being drawn into grammer key message, judge whether it meets according to grammer key message pre-conditioned.Pre-conditionedly can be, such as, the frequency of utilization of grammer exceedes preset value.Thus, judge according to grammer key message the grammer whether phonetic order belongs to frequency of utilization and exceed preset value.If exceed, then the frequency of utilization of this phonetic order is high, then perform the operation being increased to local syntax library, otherwise, be not increased to local syntax library.Thus, the storage space of local syntax library can be saved, and the real grammer useful to user can be increased, can Consumer's Experience be promoted.
In an embodiment of the present invention, the pre-conditioned mode of frequency of utilization and function combination that adopts pre-sets and stores, such as, for the operation of " making a phone call " (i.e. aforesaid function), according to the frequency of utilization of statistics, the frequency of utilization of following grammer exceedes preset value (such as, 10 times): " phoning ", " calling ", " dialing ", " dial to ", " calling out ", " making a phone call to ", then can as shown in table 1ly carry out arranging and storing about the pre-conditioned of " making a phone call " this function.
Table 1
After being drawn into grammer key message, by question blank 1, then can judge whether corresponding phonetic order belongs to the one in above-mentioned grammer.If belong to, carry out the operation being increased to local syntax library.When phonetic order is increased to local syntax library, the form according to presetting is needed to change phonetic order.This form preset is the discernible form of local speech engine, such as, or not object, interjection etc. during storage.And for avoiding repeated storage, when storing, the grammer of repetition is deleted.
Below with reference to the example of concrete " making a phone call ", the audio recognition method of the embodiment of the present invention is described.
See as follows be the grammatical form of the local syntax library of the local speech engine of the embodiment of the present invention:
<CallCmd>;
!tag(CALLCMD_TAG,
" phone "! Id (1001) |
" make a phone call to "! Id (1002) |
" call "! Id (1003) |
" dial to "! Id (1004) |
" calling "! Id (1005) |
" call out "! Id (1006) |
);
According to local syntax library, when user needs to carry out the operation of making a phone call, its phonetic order of saying is " phoning Xiao Ming ", " making a phone call to Xiao Ming ", " calling Xiao Ming " etc., and local speech engine all can successfully identify, and return recognition result, carry out the operation of phoning Xiao Ming.If phonetic order is " I wants to phone Xiao Ming ", then can not realize because local syntax library does not define corresponding grammer.According to the audio recognition method of the embodiment of the present invention, phonetic order " I wants to phone Xiao Ming " is sent to high in the clouds speech engine.High in the clouds speech engine obtains the operation of recognition result-make a phone call by identifying, and recognition result (operation of making a phone call) is returned, thus, the operation of making a phone call can be performed, and after above-mentioned steps S31-S33, this phonetic order will be added in local syntax library and (store with the form not comprising object), and the local syntax library after increase is as follows:
<CallCmd>;
!tag(CALLCMD_TAG,
" phone "! Id (1001) |
" make a phone call to "! Id (1002) |
" call "! Id (1003) |
" dial to "! Id (1004) |
" calling "! Id (1005) |
" call out "! Id (1006) |
" I wants to phone "! Id (1007) |
);
Thus, local syntax library adds a grammer, when identifying next time, does not identify even without network by high in the clouds speech engine, also can recognize the phonetic order of " I wants to phone Xiao Ming " quickly and accurately.
By the audio recognition method of the embodiment of the present invention, user uses high in the clouds to carry out speech recognition when there being network; When using local speech engine without when network.Because local speech engine is according to the recognition result in high in the clouds, constantly " enlarging one's knowledge ", local speech engine can identify large amount of complex instruction, reaches self-teaching, has the object of memory; Increase the scope of speech recognition on the one hand, improve Consumer's Experience on the other hand.
In an embodiment of the present invention, when receiving phonetic order at every turn, can determine by high in the clouds speech engine identification (network is normal) according to network state, or by local speech engine identification (network is abnormal); Also first by local speech engine identification, can be identified by high in the clouds speech engine again when its None-identified.Due to the audio recognition method according to the embodiment of the present invention, local speech engine " constantly can increase knowledge ", compared to the local speech engine of prior art, more complicated and the more instruction of identifiable design, therefore, a kind of mode after adopting, for user saves flow and cost on the basis ensureing accurately identification, can promote Consumer's Experience further.
Carry out the schematic flow sheet identified see Fig. 4 local speech engine that is the embodiment of the present invention, after it receives phonetic order, carry out grammatical analysis successively, dynamic syntax is resolved, pattern match and result export recognition result after screening.Should understand, the local syntax library of local speech engine is adopt the method for the embodiment of the present invention to carry out grammer to increase the local syntax library after upgrading, and the processes such as grammatical analysis, dynamic syntax parsing, pattern match and result examination are prior art, the embodiment of the present invention does not describe in detail this.
It is the process flow diagram of the audio recognition method of another embodiment of the present invention see Fig. 5.The audio recognition method of this embodiment comprises:
S100, a default time period.
Preset time period can sky, the moon or hour etc. be unit, such as, can 2 months be set to.
S101, in preset time period, according to the recognition result that high in the clouds speech engine returns, the local syntax library of local speech engine to be upgraded.
Concrete, in preset time period, receive phonetic order, then according to the method for above-described embodiment, judge that whether network is normal, if normal, sent to by phonetic order high in the clouds speech engine to identify.When high in the clouds returns recognition result, phonetic order is increased in local syntax library.Thus, when user says this phonetic order again next time, local speech engine can identify accurately to this.
S102, when preset time period then, from local syntax library, delete upgraded phonetic order.
According to this embodiment, deleted by the phonetic order that preset time period is increased, can storage space be discharged.In addition, in practice, user can change in different time period demands, such as, works at city A, and its conventional phonetic order is " making a phone call ", then conventional " taking pictures ", " inquiry " during city B travels.Therefore, if be preset time period by the set of time of tourism, then during can making tourism, increase the dependent parser instruction of " taking pictures " in local syntax library, and get back to the place of work, then delete these instructions and the instruction be of little use, to discharge storage space, save internal memory.
Should be understood that other implementation and the details (such as, the renewal process etc. to local syntax library) of this embodiment are identical with above-described embodiment, do not repeat them here.
Be the structural representation of the speech engine of the embodiment of the present invention see Fig. 6, it comprises:
Speech reception module 40, for receiving the phonetic order of user's input;
Judge module 41, for judging that whether current network is normal, if normally, then sends to high in the clouds speech engine to identify by phonetic order;
Receiver module 42, for receiving the recognition result that high in the clouds speech engine returns;
Update module 43, for according to recognition result, upgrades syntax library.
Wherein, update module 43 comprises:
Taxon 431, for according to recognition result, carries out instruction classification;
Resolution unit 432, for determining whether the instruction of preset kind based on instruction classification result, if so, then carries out syntax parsing to phonetic order, and extracts grammer key message;
Judging unit 433, for judging whether the grammer key message extracted meets pre-conditioned, if meet, is then increased to syntax library by phonetic order according to after preset format conversion.
In addition, the speech engine of the embodiment of the present invention also comprises one for the sound identification module 44 performing grammatical analysis described in above-mentioned Fig. 4, dynamic syntax parsing, pattern match, result are screened, and for storing the syntax library 45 of grammer.
Should be understood that the speech engine of the embodiment of the present invention is corresponding with above-mentioned audio recognition method, be not described in detail herein.
See Fig. 7, the above-mentioned speech reception module 40 of the embodiment of the present invention, judge module 41, receiver module 42, update module 43 form intelligent learning module (non-label).Concrete, see Fig. 8, intelligent learning module receives in step 70 recognition result that high in the clouds speech engine returns; Instruction classification is carried out in step 72; Carry out syntax parsing in step 74, after completing parsing, extract grammer key message in step 76, after determining whether renewal according to grammer key message, upgrade local syntax library in step 80.To learn the recognition result of high in the clouds speech engine, be then updated to the syntax library of local speech engine.Should be understood that intelligent learning module is included in (Fig. 7 this relation not shown) in local speech engine.
Thus, by the speech engine of the embodiment of the present invention, when user is when there being network or when network is good, using high in the clouds speech engine to carry out speech recognition, and recognition result is updated to the syntax library of local speech engine; When without when network or time network is bad, local speech engine is used to carry out speech recognition.Because local speech engine is according to the recognition result in high in the clouds, constantly " enlarging one's knowledge ", local speech engine can identify large amount of complex instruction, reaches self-teaching, has the object of memory, increases the scope of speech recognition, improves Consumer's Experience.
On the other hand, the embodiment of the present invention also provides a kind of terminal, and it comprises above-mentioned speech engine.Should be understood that terminal can be the hardware device such as mobile phone, panel computer, personal digital assistant, e-book comprising communication unit, audio/video (A/V) input block, user input unit, sensing cell, output unit, storer, interface unit, controller and power supply unit etc.
See Fig. 9, terminal 1 and the high in the clouds speech engine 2 of the embodiment of the present invention communicate to connect.Terminal 1 gathers the phonetic order of user by voice acquisition devices such as microphones.When there being network or when network is good, using high in the clouds speech engine 2 to carry out speech recognition, and recognition result being updated to the syntax library of the local speech engine of terminal 1; When without when network or time network is bad, the local speech engine of terminal 1 is used to carry out speech recognition.Because people's what someone said or phonetic order are in one day or a period of time, have is repeated greatly, therefore, by descending the recognition result of high in the clouds speech engine to be updated in local syntax library by having when network, local speech engine can constantly " enlarge one's knowledge " according to the recognition result in high in the clouds, makes local speech engine can identify large amount of complex instruction, reach self-teaching, there is the object of memory, increase the scope of speech recognition, improve Consumer's Experience.
The audio recognition method of the embodiment of the present invention, speech recognition engine and terminal, the knowledge method for distinguishing of local speech engine can be improved to a great extent, facilitate user also can accurately identify history phonetic order when there is no network, reach the object of self-teaching, Consumer's Experience can be improved greatly.
In process flow diagram or any process otherwise described in an embodiment of the present invention or method describe and can be understood to, represent and comprise one or more for realizing the module of the code of the executable instruction of the step of specific logical function or process, fragment or part, and the scope of embodiment of the present invention comprises other realization, wherein can not according to order that is shown or that discuss, comprise according to involved function by the mode while of basic or by contrary order, carry out n-back test, this should understand by those skilled in the art described in embodiments of the invention.
The present invention can also be implemented by computer program, and described routine package contains whole features that can realize the inventive method, when it is installed in computer system, by running, and can be to perform the methods of the present invention.Computer program in presents referred to: any expression formula of one group of instruction that any program language, code or symbol can be adopted to write, this instruction group makes system have information processing capability, directly to realize specific function, or realize specific function after carrying out one or two step following: a) convert other Languages, coding or symbol to; B) reproduce in a different format.
By reference to the accompanying drawings embodiments of the invention are described above; but the present invention is not limited to above-mentioned embodiment; above-mentioned embodiment is only schematic; instead of it is restrictive; those of ordinary skill in the art is under enlightenment of the present invention; do not departing under the ambit that present inventive concept and claim protect, also can make a lot of form, these all belong within protection of the present invention.

Claims (10)

1. an audio recognition method, is characterized in that, comprising:
Receive phonetic order;
Judge that whether current network is normal, if normally, then send to high in the clouds speech engine to identify described phonetic order;
Receive the recognition result that high in the clouds speech engine returns, and according to described recognition result, the local syntax library of local speech engine is upgraded.
2. audio recognition method according to claim 1, is characterized in that, the described local syntax library to local speech engine carries out renewal and comprises:
According to recognition result, carry out instruction classification;
Determine whether the instruction of preset kind based on instruction classification result, if so, then syntax parsing is carried out to described phonetic order, and extract grammer key message;
Judge whether the grammer key message of described extraction meets pre-conditioned, if meet, then described phonetic order is increased to described local syntax library according to after preset format conversion.
3. audio recognition method according to claim 2, is characterized in that, describedly pre-conditionedly to comprise: the frequency of utilization of grammer exceedes preset value;
Describedly judge whether the grammer key message of described extraction meets pre-conditioned comprising:
According to the grammer key message of described extraction, judge its grammer whether belonging to frequency of utilization and exceed preset value.
4. audio recognition method according to claim 1, is characterized in that, described method also comprises:
After receiving phonetic order, carry out speech recognition to obtain recognition result according to described local syntax library.
5. audio recognition method according to claim 1, is characterized in that, described method also comprises:
Corresponding operation is performed according to described recognition result.
6. the audio recognition method according to any one of claim 1-5, is characterized in that, described method also comprises:
Preset a time period;
In described preset time period, according to the recognition result that high in the clouds speech engine returns, the local syntax library of described local speech engine is upgraded.
7. audio recognition method according to claim 6, is characterized in that, described method also comprises:
When described preset time period then, from described local syntax library, delete upgraded phonetic order.
8. a speech engine, is characterized in that, comprising:
Speech reception module, for receiving the phonetic order of user's input;
Judge module, for judging that whether current network is normal, if normally, then sends to high in the clouds speech engine to identify described phonetic order;
Receiver module, for receiving the recognition result that high in the clouds speech engine returns;
Update module, for according to described recognition result, upgrades syntax library.
9. speech engine according to claim 8, is characterized in that, described update module comprises:
Taxon, for according to recognition result, carries out instruction classification;
Resolution unit, for determining whether the instruction of preset kind based on instruction classification result, if so, then carries out syntax parsing to described phonetic order, and extracts grammer key message;
Judging unit, for judging whether the grammer key message of described extraction meets pre-conditioned, if meet, is then increased to described syntax library by described phonetic order according to after preset format conversion.
10. a terminal, is characterized in that, comprising: the speech engine described in any one of claim 8-9.
CN201510548423.1A 2015-08-31 2015-08-31 Audio recognition method, speech engine and terminal Active CN105261366B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510548423.1A CN105261366B (en) 2015-08-31 2015-08-31 Audio recognition method, speech engine and terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510548423.1A CN105261366B (en) 2015-08-31 2015-08-31 Audio recognition method, speech engine and terminal

Publications (2)

Publication Number Publication Date
CN105261366A true CN105261366A (en) 2016-01-20
CN105261366B CN105261366B (en) 2016-11-09

Family

ID=55101025

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510548423.1A Active CN105261366B (en) 2015-08-31 2015-08-31 Audio recognition method, speech engine and terminal

Country Status (1)

Country Link
CN (1) CN105261366B (en)

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106373565A (en) * 2016-08-24 2017-02-01 安徽咪鼠科技有限公司 Method for automatically switching speech recognition engines
CN106384594A (en) * 2016-11-04 2017-02-08 湖南海翼电子商务股份有限公司 On-vehicle terminal for voice recognition and method thereof
CN106444479A (en) * 2016-08-29 2017-02-22 普天智能照明研究院有限公司 Voice-controlled switch system
CN106778927A (en) * 2016-12-30 2017-05-31 深圳Tcl新技术有限公司 Update TV semantics recognition dictionary method and device
CN106847291A (en) * 2017-02-20 2017-06-13 成都启英泰伦科技有限公司 Speech recognition system and method that a kind of local and high in the clouds is combined
CN107146617A (en) * 2017-06-15 2017-09-08 成都启英泰伦科技有限公司 A kind of novel voice identification equipment and method
CN108305628A (en) * 2017-06-27 2018-07-20 腾讯科技(深圳)有限公司 Audio recognition method, device, computer equipment and storage medium
CN108615529A (en) * 2018-04-13 2018-10-02 上海思依暄机器人科技股份有限公司 A kind of control method, device and domestic robot based on semantics recognition
CN108734294A (en) * 2017-04-19 2018-11-02 北京众星智联科技有限责任公司 A kind of realization of artificial intelligence complex control system
CN108806685A (en) * 2018-07-02 2018-11-13 英业达科技有限公司 Speech control system and its method
CN109495360A (en) * 2018-12-18 2019-03-19 深圳国美云智科技有限公司 A kind of smart home Internet of Things platform, offline sound control method and system
CN109658929A (en) * 2018-12-17 2019-04-19 北京车和家信息技术有限公司 A kind of vehicle-mounted voice recognition methods, device and engine end
CN109664307A (en) * 2018-12-16 2019-04-23 哈尔滨理工大学 A kind of dialogue robot based on deep learning
CN109961792A (en) * 2019-03-04 2019-07-02 百度在线网络技术(北京)有限公司 The method and apparatus of voice for identification
CN110866157A (en) * 2018-08-27 2020-03-06 北京猎户星空科技有限公司 Robot response method and device and robot
CN110970032A (en) * 2018-09-28 2020-04-07 深圳市冠旭电子股份有限公司 Sound box voice interaction control method and device
CN111415668A (en) * 2020-04-23 2020-07-14 惠州莫思特科技有限公司 Intelligent language control system and device
CN111554292A (en) * 2020-05-22 2020-08-18 广州三星通信技术研究有限公司 Method and device for man-machine interaction in mobile terminal
CN112334976A (en) * 2018-06-27 2021-02-05 谷歌有限责任公司 Presenting responses to a spoken utterance of a user using a local text response mapping
CN112837687A (en) * 2021-03-03 2021-05-25 北京百家科技集团有限公司 Answering method, answering device, computer equipment and storage medium
CN113129896A (en) * 2019-12-30 2021-07-16 北京猎户星空科技有限公司 Voice interaction method and device, electronic equipment and storage medium
CN113168836A (en) * 2018-09-27 2021-07-23 株式会社OPTiM Computer system, speech recognition method and program
CN115472156A (en) * 2022-09-05 2022-12-13 Oppo广东移动通信有限公司 Voice control method, device, storage medium and electronic equipment

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102496364A (en) * 2011-11-30 2012-06-13 苏州奇可思信息科技有限公司 Interactive speech recognition method based on cloud network
CN103078986A (en) * 2012-12-19 2013-05-01 北京百度网讯科技有限公司 Method and device for saving call information of mobile terminal and mobile terminal
CN103247291A (en) * 2013-05-07 2013-08-14 华为终端有限公司 Updating method, device, and system of voice recognition device
CN103440867A (en) * 2013-08-02 2013-12-11 安徽科大讯飞信息科技股份有限公司 Method and system for recognizing voice
CN103730119A (en) * 2013-12-18 2014-04-16 惠州市车仆电子科技有限公司 Vehicle-mounted man-machine voice interaction system
US8996372B1 (en) * 2012-10-30 2015-03-31 Amazon Technologies, Inc. Using adaptation data with cloud-based speech recognition

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102496364A (en) * 2011-11-30 2012-06-13 苏州奇可思信息科技有限公司 Interactive speech recognition method based on cloud network
US8996372B1 (en) * 2012-10-30 2015-03-31 Amazon Technologies, Inc. Using adaptation data with cloud-based speech recognition
CN103078986A (en) * 2012-12-19 2013-05-01 北京百度网讯科技有限公司 Method and device for saving call information of mobile terminal and mobile terminal
CN103247291A (en) * 2013-05-07 2013-08-14 华为终端有限公司 Updating method, device, and system of voice recognition device
CN103440867A (en) * 2013-08-02 2013-12-11 安徽科大讯飞信息科技股份有限公司 Method and system for recognizing voice
CN103730119A (en) * 2013-12-18 2014-04-16 惠州市车仆电子科技有限公司 Vehicle-mounted man-machine voice interaction system

Cited By (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106373565A (en) * 2016-08-24 2017-02-01 安徽咪鼠科技有限公司 Method for automatically switching speech recognition engines
CN106444479A (en) * 2016-08-29 2017-02-22 普天智能照明研究院有限公司 Voice-controlled switch system
CN106384594A (en) * 2016-11-04 2017-02-08 湖南海翼电子商务股份有限公司 On-vehicle terminal for voice recognition and method thereof
CN106778927A (en) * 2016-12-30 2017-05-31 深圳Tcl新技术有限公司 Update TV semantics recognition dictionary method and device
CN106847291A (en) * 2017-02-20 2017-06-13 成都启英泰伦科技有限公司 Speech recognition system and method that a kind of local and high in the clouds is combined
CN108734294A (en) * 2017-04-19 2018-11-02 北京众星智联科技有限责任公司 A kind of realization of artificial intelligence complex control system
CN107146617A (en) * 2017-06-15 2017-09-08 成都启英泰伦科技有限公司 A kind of novel voice identification equipment and method
CN108305628B (en) * 2017-06-27 2020-10-02 腾讯科技(深圳)有限公司 Speech recognition method, speech recognition device, computer equipment and storage medium
CN108305628A (en) * 2017-06-27 2018-07-20 腾讯科技(深圳)有限公司 Audio recognition method, device, computer equipment and storage medium
CN108615529A (en) * 2018-04-13 2018-10-02 上海思依暄机器人科技股份有限公司 A kind of control method, device and domestic robot based on semantics recognition
CN112334976A (en) * 2018-06-27 2021-02-05 谷歌有限责任公司 Presenting responses to a spoken utterance of a user using a local text response mapping
CN108806685A (en) * 2018-07-02 2018-11-13 英业达科技有限公司 Speech control system and its method
CN110866157A (en) * 2018-08-27 2020-03-06 北京猎户星空科技有限公司 Robot response method and device and robot
CN113168836B (en) * 2018-09-27 2024-04-23 株式会社OPTiM Computer system, voice recognition method and program product
CN113168836A (en) * 2018-09-27 2021-07-23 株式会社OPTiM Computer system, speech recognition method and program
CN110970032A (en) * 2018-09-28 2020-04-07 深圳市冠旭电子股份有限公司 Sound box voice interaction control method and device
CN109664307A (en) * 2018-12-16 2019-04-23 哈尔滨理工大学 A kind of dialogue robot based on deep learning
CN109658929A (en) * 2018-12-17 2019-04-19 北京车和家信息技术有限公司 A kind of vehicle-mounted voice recognition methods, device and engine end
CN109495360A (en) * 2018-12-18 2019-03-19 深圳国美云智科技有限公司 A kind of smart home Internet of Things platform, offline sound control method and system
CN109961792A (en) * 2019-03-04 2019-07-02 百度在线网络技术(北京)有限公司 The method and apparatus of voice for identification
CN109961792B (en) * 2019-03-04 2022-01-11 阿波罗智联(北京)科技有限公司 Method and apparatus for recognizing speech
CN113129896B (en) * 2019-12-30 2023-12-12 北京猎户星空科技有限公司 Voice interaction method and device, electronic equipment and storage medium
CN113129896A (en) * 2019-12-30 2021-07-16 北京猎户星空科技有限公司 Voice interaction method and device, electronic equipment and storage medium
CN111415668A (en) * 2020-04-23 2020-07-14 惠州莫思特科技有限公司 Intelligent language control system and device
CN111554292A (en) * 2020-05-22 2020-08-18 广州三星通信技术研究有限公司 Method and device for man-machine interaction in mobile terminal
CN112837687A (en) * 2021-03-03 2021-05-25 北京百家科技集团有限公司 Answering method, answering device, computer equipment and storage medium
CN115472156A (en) * 2022-09-05 2022-12-13 Oppo广东移动通信有限公司 Voice control method, device, storage medium and electronic equipment

Also Published As

Publication number Publication date
CN105261366B (en) 2016-11-09

Similar Documents

Publication Publication Date Title
CN105261366A (en) Voice identification method, voice engine and terminal
US20230072352A1 (en) Speech Recognition Method and Apparatus, Terminal, and Storage Medium
CN109753560B (en) Information processing method and device of intelligent question-answering system
CN103578463A (en) Automatic testing method and automatic testing device
CN103442130A (en) Voice control method, mobile terminal device and voice control system
CN103024169A (en) Method and device for starting communication terminal application program through voice
CN105469789A (en) Voice information processing method and voice information processing terminal
CN105224201A (en) A kind of method of interface display and terminal
CN108682421B (en) Voice recognition method, terminal equipment and computer readable storage medium
CN102932539A (en) Terminal and method for unlocking in application opening mode based on voice identification
CN105827804A (en) Control method and device of mobile terminal
CN105791557B (en) VoLTE functions automatic switching method, system and storage medium
CN105049579A (en) Call content based method and system for updating terminal address list
CN111192590B (en) Voice wake-up method, device, equipment and storage medium
CN104575494A (en) Speech processing method and terminal
CN103514882A (en) Voice identification method and system
CN104994547B (en) A kind of Network Access Method and user terminal
CN104462058A (en) Character string identification method and device
CN110290280A (en) A kind of recognition methods of the SOT state of termination, device and storage medium
CN103249027A (en) Method and equipment for terminal service processing
CN105049646A (en) Voice change conversation method, device and terminal
CN110808031A (en) Voice recognition method and device and computer equipment
CN105550361A (en) Log processing method and apparatus, and ask-answer information processing method and apparatus
CN113840040B (en) Man-machine cooperation outbound method, device, equipment and storage medium
CN110633318A (en) Data extraction processing method, device, equipment and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant