CN102708865A - Method, device and system for voice recognition - Google Patents

Method, device and system for voice recognition Download PDF

Info

Publication number
CN102708865A
CN102708865A CN2012101233692A CN201210123369A CN102708865A CN 102708865 A CN102708865 A CN 102708865A CN 2012101233692 A CN2012101233692 A CN 2012101233692A CN 201210123369 A CN201210123369 A CN 201210123369A CN 102708865 A CN102708865 A CN 102708865A
Authority
CN
China
Prior art keywords
recognition result
cloud computing
computing platform
platform server
local
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2012101233692A
Other languages
Chinese (zh)
Inventor
沈嘉鑫
王力劭
邵颖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BEIJING VCYBER TECHNOLOGY Co Ltd
Original Assignee
BEIJING VCYBER TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING VCYBER TECHNOLOGY Co Ltd filed Critical BEIJING VCYBER TECHNOLOGY Co Ltd
Priority to CN2012101233692A priority Critical patent/CN102708865A/en
Publication of CN102708865A publication Critical patent/CN102708865A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Telephonic Communication Services (AREA)

Abstract

The invention discloses a method, a device and a system for voice recognition, relating to voice recognition technology. The invention is invented in order to solve the problem that in the prior art, the network delay is caused, so that the accuracy rate of the voice recognition is lower. The technical scheme disclosed by the invention embodiment comprises the following steps of: receiving a voice message sent by a user; recognizing and analyzing the voice message by an embedded voice recognition database to obtain the local recognition result corresponding to the voice message and a reliability value of the local recognition result; outputting the local recognition result if the reliability value of the local recognition result is more than preset reliability threshold; if not, sending the voice message to a cloud computing platform server, so that the cloud computing platform server recognizes and analyzes the voice message by a remote end voice recognition database to obtain the remote end recognition result corresponding to the voice message; and outputting the remote end recognition result returned by the cloud computing platform server. The technical scheme disclosed by the embodiment of the invention can be applied to an information service system.

Description

Audio recognition method, Apparatus and system
Technical field
The present invention relates to speech recognition technology, relate in particular to a kind of audio recognition method, Apparatus and system.
Background technology
Along with the sustainable development of computing machine and infotech, interactive voice has become the necessary means of man-machine interaction.As one of important technology of interactive voice, speech recognition technology reaches its maturity, and is widely used through the development of nearly half a century.
The process of speech recognition comprises in the prior art: receive the voice messaging that the user sends; Connect with speech recognition server; This voice messaging is sent to speech recognition server, makes speech recognition server discern, resolve, obtain corresponding recognition result this voice messaging; Receive the recognition result that speech recognition server returns.
Because the speech recognition server through network side carries out speech recognition, makes each speech recognition all need carry out alternately with network side, produce network delay; And, when network condition is relatively poor, carry out may producing packet loss in the mutual process with network side, make that the accuracy rate of speech recognition is lower.
Summary of the invention
Embodiments of the invention provide a kind of audio recognition method, Apparatus and system, can reduce network delay, and improve the accuracy rate of speech recognition.
On the one hand, a kind of audio recognition method is provided, comprises: receive the voice messaging that the user sends; Through the Embedded Speech Recognition System database said voice messaging is discerned, resolved, obtain the corresponding local recognition result of said voice messaging and the confidence value of said local recognition result; If the confidence value of said local recognition result greater than preset reliable degree thresholding, is exported said local recognition result; Otherwise, send said voice messaging to the cloud computing platform server, make said cloud computing platform server discern, resolve said voice messaging through the far-end speech identification database, obtain the corresponding far-end recognition result of said voice messaging; Export the far-end recognition result that said cloud computing platform server returns.
On the other hand, a kind of speech recognition equipment is provided, comprises:
The voice receiver module is used to receive the voice messaging that the user sends;
Identification module is used for through the Embedded Speech Recognition System database said voice messaging being discerned, being resolved, and obtains the corresponding local recognition result of said voice messaging and the confidence value of said local recognition result;
First output module is if the confidence value that is used for said local recognition result is exported said local recognition result greater than preset reliable degree thresholding;
Information sending module; Be used for otherwise; Send said voice messaging to the cloud computing platform server, make said cloud computing platform server discern, resolve said voice messaging, obtain the corresponding far-end recognition result of said voice messaging through the far-end speech identification database;
Second output module is used to export the far-end recognition result that said cloud computing platform server returns.
Another aspect provides a kind of speech recognition system, comprising:
Speech recognition equipment is used to receive the voice messaging that the user sends; Through the Embedded Speech Recognition System database said voice messaging is discerned, resolved, obtain the corresponding local recognition result of said voice messaging and the confidence value of said local recognition result; If the confidence value of said local recognition result greater than preset reliable degree thresholding, is exported said local recognition result; Otherwise, send said voice messaging to the cloud computing platform server; Export the far-end recognition result that said cloud computing platform server returns;
Said cloud computing platform server is used to receive the voice messaging that said speech recognition equipment sends; Said voice messaging is discerned, resolved, obtain the corresponding far-end recognition result of said voice messaging; Send said far-end recognition result to said speech recognition equipment.
The audio recognition method that the embodiment of the invention provides, Apparatus and system combine Embedded Speech Recognition System with the high in the clouds speech recognition, if the confidence value of local recognition result greater than preset reliable degree thresholding, output should this locality recognition result; Otherwise, send voice messaging and export the far-end recognition result that it returns to the cloud computing platform server.Because the technical scheme that the embodiment of the invention provides combines Embedded Speech Recognition System with the high in the clouds speech recognition; Make that need not each speech recognition all carries out alternately with network side; Thereby under the prerequisite of the accuracy rate that guarantees speech recognition; Reduce the reciprocal process with network side, reduced network delay; And, when network condition is relatively poor, can reduce packet loss, thereby improve the accuracy rate of speech recognition; Solved prior art owing to the speech recognition server through network side carries out speech recognition, made each speech recognition all need carry out alternately, produced network delay with network side; And, when network condition is relatively poor, carry out may producing packet loss in the mutual process with network side, make that the accuracy rate of speech recognition is lower.
Description of drawings
In order to be illustrated more clearly in the embodiment of the invention or technical scheme of the prior art; To do to introduce simply to the accompanying drawing of required use in embodiment or the description of the Prior Art below; Obviously, the accompanying drawing in describing below only is some embodiments of the present invention, for those of ordinary skills; Under the prerequisite of not paying creative work, can also obtain other accompanying drawing according to these accompanying drawings.
The process flow diagram of the audio recognition method that Fig. 1 provides for the embodiment of the invention one;
The process flow diagram one of the audio recognition method that Fig. 2 provides for the embodiment of the invention two;
The flowchart 2 of the audio recognition method that Fig. 3 provides for the embodiment of the invention two;
The process flow diagram of the audio recognition method that Fig. 4 provides for the embodiment of the invention three;
The structural representation one of the speech recognition equipment that Fig. 5 provides for the embodiment of the invention four;
The structural representation two of the speech recognition equipment that Fig. 6 provides for the embodiment of the invention four;
The structural representation three of the speech recognition equipment that Fig. 7 provides for the embodiment of the invention four;
The structural representation of the speech recognition system that Fig. 8 provides for the embodiment of the invention five.
Embodiment
To combine the accompanying drawing in the embodiment of the invention below, the technical scheme in the embodiment of the invention is carried out clear, intactly description, obviously, described embodiment only is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills are not making the every other embodiment that is obtained under the creative work prerequisite, all belong to the scope of the present invention's protection.
In order to solve the problem that prior art produces the accuracy rate of network delay and speech recognition, the embodiment of the invention provides a kind of audio recognition method, Apparatus and system.
Embodiment one:
Audio recognition method as shown in Figure 1, that the embodiment of the invention provides comprises:
Step 101 receives the voice messaging that the user sends.
In the present embodiment, step 101 can receive the voice messaging that the user sends after the user presses voice typing key, also can carry out other operation backs the user and receive the voice messaging that the user sends, and does not limit at this.Wherein, the voice messaging of user's input can be simple phonetic order, also can give unnecessary details no longer one by one once more for comprising other information of phonetic order.
Step 102 is discerned, is resolved this voice messaging through the Embedded Speech Recognition System database, obtains corresponding local recognition result of voice messaging and confidence value that should this locality recognition result.
In the present embodiment, the Embedded Speech Recognition System database can be used to store any phonetic feature storehouse in the step 102, and in order to dwindle the scale of Embedded Speech Recognition System database, preferred, this Embedded Speech Recognition System database can be used for control store instruction.Be applied as example with music, the Embedded Speech Recognition System database can be used for storage broadcast, time-out, a last head, next etc. steering order; The steering order of Embedded Speech Recognition System database storing includes but are not limited to the above, gives unnecessary details no longer one by one at this.
In the present embodiment; Step 102 is discerned, is resolved voice messaging through the Embedded Speech Recognition System database; Obtain the process of local recognition result; Can obtain the confidence value of each phonetic feature in the Embedded Speech Recognition System database, and the phonetic feature that confidence value is the highest be as local recognition result for the phonetic feature in voice messaging and the Embedded Speech Recognition System database being carried out similarity respectively relatively; Step 102 also can obtain local recognition result through other modes, gives unnecessary details no longer one by one at this.Wherein, the confidence value of local recognition result can be confirmed through said process, also can confirm through other modes, does not limit at this.
In the present embodiment, the Embedded Speech Recognition System storehouse can be stored several kinds of typical phonetic feature storehouses in advance; Also can store multiple wide spectrum phonetic feature storehouse in advance.Need to prove; This wide spectrum phonetic feature storehouse can be through gathering the whole of China various places, various people and these people under varying environment behind the sound of (different noise background); The set of the wide spectrum phonetic feature that essence extracts; This wide spectrum phonetic feature storehouse only depends on the information in existing " phonetic feature storehouse ", and does not rely on someone's phonetic feature training result.Special, this wide spectrum phonetic feature storehouse can also comprise outer repertorie, wherein should can have the external language librarys of main flow such as English storehouse, method repertorie, German storehouse, day repertorie by outer repertorie.
Step 103, whether the confidence value of judging local recognition result is greater than preset reliable degree thresholding.
In the present embodiment, the confidence level thresholding can be provided with arbitrarily in the step 103, also can not limit at this according to the statistics setting.If the confidence value of passing through the definite local recognition result of step 103 can be through the local recognition result of step 104 output greater than preset reliable degree thresholding; Otherwise, send voice messaging to the cloud computing platform server through step 105.
Step 104 is exported local recognition result.
Step 105 is sent voice messaging to the cloud computing platform server, makes the cloud computing platform server discern, resolve voice messaging through the far-end identification database, obtains the corresponding far-end recognition result of voice messaging.
In the present embodiment, this locality can connect with the cloud computing platform server in advance, also can the confidence value of local recognition result during less than preset reliable degree thresholding and the cloud computing platform server connect, do not limit at this.Can be through connecting like multiple communication modes such as Internet, 3G mobile network and cloud computing platform server; Concrete; Can store cloud computing platform network address of server (like uniform resource position mark URL) or call number in advance, according to the network address or call number through establishing a communications link with the cloud computing platform server like Internet, 3G mobile network etc.
In the present embodiment; The cloud computing platform server can be stored multiple wide spectrum phonetic feature storehouse in advance; For example: the wide spectrum phonetic feature storehouse that is provided with according to place name, wide spectrum phonetic feature storehouse that is provided with according to the audio frequency and video title and the wide spectrum phonetic feature storehouse that is provided with according to name etc.Need to prove; This wide spectrum phonetic feature storehouse can be through gathering the whole of China various places, various people and these people under varying environment behind the sound of (different noise background); The set of the wide spectrum phonetic feature that essence extracts; This wide spectrum phonetic feature storehouse only depends on the information in existing " phonetic feature storehouse ", and does not rely on someone's phonetic feature training result.Special, this wide spectrum phonetic feature storehouse can also comprise outer repertorie, wherein should can have the external language librarys of main flow such as English storehouse, method repertorie, German storehouse, day repertorie by outer repertorie.
Step 106, the far-end recognition result that output cloud computing platform server returns.
The far-end recognition result that can directly return in the present embodiment, through step 106 output cloud computing platform server; In the time of also can being higher than the confidence value of local recognition result,, give unnecessary details no longer one by one at this through the far-end recognition result that step 106 output cloud computing platform server returns in the confidence value of far-end recognition result.
The audio recognition method that the embodiment of the invention provides combines Embedded Speech Recognition System with the high in the clouds speech recognition, if the confidence value of local recognition result greater than preset reliable degree thresholding, output should this locality recognition result; Otherwise, send voice messaging and export the far-end recognition result that it returns to the cloud computing platform server.Because the technical scheme that the embodiment of the invention provides combines Embedded Speech Recognition System with the high in the clouds speech recognition; Make that need not each speech recognition all carries out alternately with network side; Thereby under the prerequisite of the accuracy rate that guarantees speech recognition; Reduce the reciprocal process with network side, reduced network delay; And, when network condition is relatively poor, can reduce packet loss, thereby improve the accuracy rate of speech recognition; Solved prior art owing to the speech recognition server through network side carries out speech recognition, made each speech recognition all need carry out alternately, produced network delay with network side; And, when network condition is relatively poor, carry out may producing packet loss in the mutual process with network side, make that the accuracy rate of speech recognition is lower.
Embodiment two:
Audio recognition method as shown in Figure 2, that the embodiment of the invention provides comprises:
Step 201 to step 205 is obtained the confidence value of local recognition result and local recognition result, and the confidence value of local recognition result is exported during greater than preset reliable degree thresholding, otherwise sends voice command to the cloud computing platform server.Detailed process is similar with step 101 to step 105 shown in Figure 1, gives unnecessary details no longer one by one at this.
Step 206, the confidence value of sending local recognition result and local recognition result to the cloud computing platform server.
Whether step 207, the confidence value of judging the far-end recognition result be greater than the confidence value of local recognition result.
In the present embodiment, if the confidence value of confirming the far-end recognition result through step 207 during smaller or equal to the confidence value of local recognition result, could be through the local recognition result of step 208 output.
Step 208 according to the control command that the cloud computing platform server returns, is exported local recognition result.
In the present embodiment, control command is used for the local recognition result of indication output in the step 208.
Further, as shown in Figure 3, audio recognition method in the present embodiment can also comprise:
Step 209, the far-end recognition result that output cloud computing platform server returns.
In the present embodiment, if confirm the confidence value of the confidence value of far-end recognition result, can export the far-end recognition result that the cloud computing platform server returns through step 209 greater than local recognition result through step 207.
The audio recognition method that the embodiment of the invention provides combines Embedded Speech Recognition System with the high in the clouds speech recognition, if the confidence value of local recognition result greater than preset reliable degree thresholding, output should this locality recognition result; Otherwise, send voice messaging and export the far-end recognition result that it returns to the cloud computing platform server.Because the technical scheme that the embodiment of the invention provides combines Embedded Speech Recognition System with the high in the clouds speech recognition; Make that need not each speech recognition all carries out alternately with network side; Thereby under the prerequisite of the accuracy rate that guarantees speech recognition; Reduce the reciprocal process with network side, reduced network delay; And, when network condition is relatively poor, can reduce packet loss, thereby improve the accuracy rate of speech recognition; Solved prior art owing to the speech recognition server through network side carries out speech recognition, made each speech recognition all need carry out alternately, produced network delay with network side; And, when network condition is relatively poor, carry out may producing packet loss in the mutual process with network side, make that the accuracy rate of speech recognition is lower.
Embodiment three:
Audio recognition method as shown in Figure 4, that the embodiment of the invention provides, this method is similar with audio recognition method shown in Figure 1, and difference is, also comprises:
Step 107 is obtained database update information from the cloud computing platform server.
In the present embodiment, the database update information of obtaining from the cloud computing platform server through step 107 can be sent the database update request to the cloud computing platform server for this locality, and the corresponding information of returning according to database is obtained; Also can obtain for the information returned according to the cloud computing platform server; Can also give unnecessary details no longer one by one at this for what obtain through other modes.Wherein, the Data Update request is sent to the cloud computing platform server in this locality, can be timed sending, also can not limit at this for indicating transmission according to the user; The information that the cloud computing platform server returns can not limit at this for the information of returning according to other settings for the information of regularly returning yet.
In the present embodiment; Database update information in the step 107 can be the increase information of the phonetic feature of Embedded Speech Recognition System database, also can be the minimizing information of the phonetic feature of Embedded Speech Recognition System database; Also can be Embedded Speech Recognition System database deletion information; Can also be the stack of foregoing,, give unnecessary details no longer one by one at this like the increase information of the phonetic feature of Embedded Speech Recognition System database and Embedded Speech Recognition System database deletion information etc.
Step 108 is according to this database update information updating Embedded Speech Recognition System database.
In the present embodiment, obtain database update information through step 107 from the cloud computing platform server after, can upgrade operation accordingly to the Embedded Speech Recognition System database according to this database update information.For example: obtain Embedded Speech Recognition System database deletion information through step 107 from the cloud computing platform server, the Embedded Speech Recognition System database is carried out corresponding deletion action, give unnecessary details no longer one by one at this.
The audio recognition method that the embodiment of the invention provides combines Embedded Speech Recognition System with the high in the clouds speech recognition, if the confidence value of local recognition result greater than preset reliable degree thresholding, output should this locality recognition result; Otherwise, send voice messaging and export the far-end recognition result that it returns to the cloud computing platform server.Because the technical scheme that the embodiment of the invention provides combines Embedded Speech Recognition System with the high in the clouds speech recognition; Make that need not each speech recognition all carries out alternately with network side; Thereby under the prerequisite of the accuracy rate that guarantees speech recognition; Reduce the reciprocal process with network side, reduced network delay; And, when network condition is relatively poor, can reduce packet loss, thereby improve the accuracy rate of speech recognition; Solved prior art owing to the speech recognition server through network side carries out speech recognition, made each speech recognition all need carry out alternately, produced network delay with network side; And, when network condition is relatively poor, carry out may producing packet loss in the mutual process with network side, make that the accuracy rate of speech recognition is lower.
Embodiment four:
Speech recognition equipment as shown in Figure 5, that the embodiment of the invention provides comprises:
Voice receiver module 501 is used to receive the voice messaging that the user sends.
In the present embodiment, voice receiver module 501 can receive the voice messaging that the user sends after the user presses voice typing key, also can carry out other operation backs the user and receive the voice messaging that the user sends, and does not limit at this.Wherein, the voice messaging of user's input can be simple phonetic order, also can give unnecessary details no longer one by one once more for comprising other information of phonetic order.
Identification module 502 is used for through the Embedded Speech Recognition System database voice messaging being discerned, being resolved, and obtains the corresponding local recognition result of voice messaging and the confidence value of local recognition result.
In the present embodiment, the Embedded Speech Recognition System database can be used to store any phonetic feature storehouse in the identification module 502, and in order to dwindle the scale of Embedded Speech Recognition System database, preferred, this Embedded Speech Recognition System database can be used for control store instruction.Be applied as example with music, the Embedded Speech Recognition System database can be used for storage broadcast, time-out, a last head, next etc. steering order; The steering order of Embedded Speech Recognition System database storing includes but are not limited to the above, gives unnecessary details no longer one by one at this.
In the present embodiment; Identification module 502 is discerned, is resolved voice messaging through the Embedded Speech Recognition System database; Obtain the process of local recognition result; Can obtain the confidence value of each phonetic feature in the Embedded Speech Recognition System database, and the phonetic feature that confidence value is the highest be as local recognition result for the phonetic feature in voice messaging and the Embedded Speech Recognition System database being carried out similarity respectively relatively; Identification module 502 also can obtain local recognition result through other modes, gives unnecessary details no longer one by one at this.Wherein, the confidence value of local recognition result can be confirmed through said process, also can confirm through other modes, does not limit at this.
In the present embodiment, the Embedded Speech Recognition System storehouse can be stored several kinds of typical phonetic feature storehouses in advance; Also can store multiple wide spectrum phonetic feature storehouse in advance.Need to prove; This wide spectrum phonetic feature storehouse can be through gathering the whole of China various places, various people and these people under varying environment behind the sound of (different noise background); The set of the wide spectrum phonetic feature that essence extracts; This wide spectrum phonetic feature storehouse only depends on the information in existing " phonetic feature storehouse ", and does not rely on someone's phonetic feature training result.Special, this wide spectrum phonetic feature storehouse can also comprise outer repertorie, wherein should can have the external language librarys of main flow such as English storehouse, method repertorie, German storehouse, day repertorie by outer repertorie.
First output module 503 is if the confidence value that is used for local recognition result is exported local recognition result greater than preset reliable degree thresholding.
Information sending module 504, be used for otherwise, send voice messaging to the cloud computing platform server, make the cloud computing platform server discern, resolve voice messaging through the far-end speech identification database, obtain the corresponding far-end recognition result of voice messaging.
In the present embodiment, this locality can connect with the cloud computing platform server in advance, also can the confidence value of local recognition result during less than preset reliable degree thresholding and the cloud computing platform server connect, do not limit at this.Can be through connecting like multiple communication modes such as Internet, 3G mobile network and cloud computing platform server; Concrete; Can store cloud computing platform network address of server (like uniform resource position mark URL) or call number in advance, according to the network address or call number through establishing a communications link with the cloud computing platform server like Internet, 3G mobile network etc.
In the present embodiment; The cloud computing platform server can be stored multiple wide spectrum phonetic feature storehouse in advance; For example: the wide spectrum phonetic feature storehouse that is provided with according to place name, wide spectrum phonetic feature storehouse that is provided with according to the audio frequency and video title and the wide spectrum phonetic feature storehouse that is provided with according to name etc.Need to prove; This wide spectrum phonetic feature storehouse can be through gathering the whole of China various places, various people and these people under varying environment behind the sound of (different noise background); The set of the wide spectrum phonetic feature that essence extracts; This wide spectrum phonetic feature storehouse only depends on the information in existing " phonetic feature storehouse ", and does not rely on someone's phonetic feature training result.Special, this wide spectrum phonetic feature storehouse can also comprise outer repertorie, wherein should can have the external language librarys of main flow such as English storehouse, method repertorie, German storehouse, day repertorie by outer repertorie.
Second output module 505 is used to export the far-end recognition result that the cloud computing platform server returns.
The far-end recognition result that can directly return in the present embodiment, through second output module, 505 output cloud computing platform servers; In the time of also can being higher than the confidence value of local recognition result,, give unnecessary details no longer one by one at this through the far-end recognition result that second output module, 505 output cloud computing platform servers return in the confidence value of far-end recognition result.
Further, as shown in Figure 6, the speech recognition equipment that present embodiment provides also comprises:
Recognition result sending module 506 is used for sending to the cloud computing platform server confidence value of local recognition result and local recognition result.
At this moment; Second output module 505, if also be used for the confidence value of the confidence value of far-end recognition result smaller or equal to local recognition result, the control command of returning according to the cloud computing platform server; Export local recognition result, control command is used for the local recognition result of indication output.
Further, as shown in Figure 7, the speech recognition equipment that present embodiment provides can also comprise:
Lastest imformation acquisition module 507 is used for obtaining database update information from the cloud computing platform server.
In the present embodiment, the database update information of obtaining from the cloud computing platform server through lastest imformation acquisition module 507 can be sent the database update request to the cloud computing platform server for this locality, and the corresponding information of returning according to database is obtained; Also can obtain for the information returned according to the cloud computing platform server; Can also give unnecessary details no longer one by one at this for what obtain through other modes.Wherein, the Data Update request is sent to the cloud computing platform server in this locality, can be timed sending, also can not limit at this for indicating transmission according to the user; The information that the cloud computing platform server returns can not limit at this for the information of returning according to other settings for the information of regularly returning yet.
In the present embodiment; Database update information in the lastest imformation acquisition module 507 can be the increase information of the phonetic feature of Embedded Speech Recognition System database, also can be the minimizing information of the phonetic feature of Embedded Speech Recognition System database; Also can be Embedded Speech Recognition System database deletion information; Can also be the stack of foregoing,, give unnecessary details no longer one by one at this like the increase information of the phonetic feature of Embedded Speech Recognition System database and Embedded Speech Recognition System database deletion information etc.
Update module 508 is used for according to database update information updating Embedded Speech Recognition System database.
In the present embodiment, obtain database update information through lastest imformation acquisition module 507 from the cloud computing platform server after, can upgrade operation accordingly to the Embedded Speech Recognition System database according to this database update information.For example: obtain Embedded Speech Recognition System database deletion information through lastest imformation acquisition module 507 from the cloud computing platform server, the Embedded Speech Recognition System database is carried out corresponding deletion action, give unnecessary details no longer one by one at this.
The speech recognition equipment that the embodiment of the invention provides combines Embedded Speech Recognition System with the high in the clouds speech recognition, if the confidence value of local recognition result greater than preset reliable degree thresholding, output should this locality recognition result; Otherwise, send voice messaging and export the far-end recognition result that it returns to the cloud computing platform server.Because the technical scheme that the embodiment of the invention provides combines Embedded Speech Recognition System with the high in the clouds speech recognition; Make that need not each speech recognition all carries out alternately with network side; Thereby under the prerequisite of the accuracy rate that guarantees speech recognition; Reduce the reciprocal process with network side, reduced network delay; And, when network condition is relatively poor, can reduce packet loss, thereby improve the accuracy rate of speech recognition; Solved prior art owing to the speech recognition server through network side carries out speech recognition, made each speech recognition all need carry out alternately, produced network delay with network side; And, when network condition is relatively poor, carry out may producing packet loss in the mutual process with network side, make that the accuracy rate of speech recognition is lower.
Embodiment five:
Speech recognition system as shown in Figure 8, that the embodiment of the invention provides comprises:
Speech recognition equipment 801 is used to receive the voice messaging that the user sends; Through the Embedded Speech Recognition System database voice messaging is discerned, resolved, obtain the corresponding local recognition result of voice messaging and the confidence value of local recognition result; If the confidence value of local recognition result is exported local recognition result greater than preset reliable degree thresholding; Otherwise, send voice messaging to the cloud computing platform server; The far-end recognition result that output cloud computing platform server returns.
In the present embodiment, can after the user presses voice typing key, receive the voice messaging that the user sends, also can carry out other operation backs and receive the voice messaging that the user sends, not limit at this user.Wherein, the voice messaging of user's input can be simple phonetic order, also can give unnecessary details no longer one by one once more for comprising other information of phonetic order.
In the present embodiment, the Embedded Speech Recognition System database can be used to store any phonetic feature storehouse, and in order to dwindle the scale of Embedded Speech Recognition System database, preferred, this Embedded Speech Recognition System database can be used for control store instruction.Be applied as example with music, the Embedded Speech Recognition System database can be used for storage broadcast, time-out, a last head, next etc. steering order; The steering order of Embedded Speech Recognition System database storing includes but are not limited to the above, gives unnecessary details no longer one by one at this.
In the present embodiment; Through the Embedded Speech Recognition System database voice messaging is discerned, resolved; Obtain the process of local recognition result; Can obtain the confidence value of each phonetic feature in the Embedded Speech Recognition System database, and the phonetic feature that confidence value is the highest be as local recognition result for the phonetic feature in voice messaging and the Embedded Speech Recognition System database being carried out similarity respectively relatively; Also can obtain local recognition result, give unnecessary details no longer one by one at this through other modes.Wherein, the confidence value of local recognition result can be confirmed through said process, also can confirm through other modes, does not limit at this.
In the present embodiment, the Embedded Speech Recognition System storehouse can be stored several kinds of typical phonetic feature storehouses in advance; Also can store multiple wide spectrum phonetic feature storehouse in advance.Need to prove; This wide spectrum phonetic feature storehouse can be through gathering the whole of China various places, various people and these people under varying environment behind the sound of (different noise background); The set of the wide spectrum phonetic feature that essence extracts; This wide spectrum phonetic feature storehouse only depends on the information in existing " phonetic feature storehouse ", and does not rely on someone's phonetic feature training result.Special, this wide spectrum phonetic feature storehouse can also comprise outer repertorie, wherein should can have the external language librarys of main flow such as English storehouse, method repertorie, German storehouse, day repertorie by outer repertorie.
In the present embodiment, this locality can connect with the cloud computing platform server in advance, also can the confidence value of local recognition result during less than preset reliable degree thresholding and the cloud computing platform server connect, do not limit at this.Can be through connecting like multiple communication modes such as Internet, 3G mobile network and cloud computing platform server; Concrete; Can store cloud computing platform network address of server (like uniform resource position mark URL) or call number in advance, according to the network address or call number through establishing a communications link with the cloud computing platform server like Internet, 3G mobile network etc.
In the present embodiment, can directly export the far-end recognition result that the cloud computing platform server returns; In the time of also can being higher than the confidence value of local recognition result in the confidence value of far-end recognition result, the far-end recognition result that output cloud computing platform server returns is given unnecessary details at this no longer one by one.
Cloud computing platform server 802 is used to receive the voice messaging that speech recognition equipment sends; Voice messaging is discerned, resolved, obtain the corresponding far-end recognition result of voice messaging; Send the far-end recognition result to speech recognition equipment.
In the present embodiment; The cloud computing platform server can be stored multiple wide spectrum phonetic feature storehouse in advance; For example: the wide spectrum phonetic feature storehouse that is provided with according to place name, wide spectrum phonetic feature storehouse that is provided with according to the audio frequency and video title and the wide spectrum phonetic feature storehouse that is provided with according to name etc.Need to prove; This wide spectrum phonetic feature storehouse can be through gathering the whole of China various places, various people and these people under varying environment behind the sound of (different noise background); The set of the wide spectrum phonetic feature that essence extracts; This wide spectrum phonetic feature storehouse only depends on the information in existing " phonetic feature storehouse ", and does not rely on someone's phonetic feature training result.Special, this wide spectrum phonetic feature storehouse can also comprise outer repertorie, wherein should can have the external language librarys of main flow such as English storehouse, method repertorie, German storehouse, day repertorie by outer repertorie.
Further, in the speech recognition system that present embodiment provides, speech recognition equipment 801 also is used for sending to the cloud computing platform server confidence value of local recognition result and local recognition result; According to the control command that the cloud computing platform server returns, export local recognition result; Cloud computing platform server 802 also is used to obtain the confidence value of far-end recognition result; If the confidence value of far-end recognition result smaller or equal to the confidence value of local recognition result, is sent the control command of the local recognition result of indication output to speech recognition equipment.
The speech recognition system that the embodiment of the invention provides combines Embedded Speech Recognition System with the high in the clouds speech recognition, if the confidence value of local recognition result greater than preset reliable degree thresholding, output should this locality recognition result; Otherwise, send voice messaging and export the far-end recognition result that it returns to the cloud computing platform server.Because the technical scheme that the embodiment of the invention provides combines Embedded Speech Recognition System with the high in the clouds speech recognition; Make that need not each speech recognition all carries out alternately with network side; Thereby under the prerequisite of the accuracy rate that guarantees speech recognition; Reduce the reciprocal process with network side, reduced network delay; And, when network condition is relatively poor, can reduce packet loss, thereby improve the accuracy rate of speech recognition; Solved prior art owing to the speech recognition server through network side carries out speech recognition, made each speech recognition all need carry out alternately, produced network delay with network side; And, when network condition is relatively poor, carry out may producing packet loss in the mutual process with network side, make that the accuracy rate of speech recognition is lower.
The audio recognition method that the embodiment of the invention provides, Apparatus and system can be applied in as in the information service systems such as navigation, requesting song and contact person's inquiry.
The above; Be merely embodiment of the present invention, but protection scope of the present invention is not limited thereto, any technician who is familiar with the present technique field is in the technical scope that the present invention discloses; Can expect easily changing or replacement, all should be encompassed within protection scope of the present invention.Therefore, protection scope of the present invention should be as the criterion by said protection domain with claim.

Claims (10)

1. an audio recognition method is characterized in that, comprising:
Receive the voice messaging that the user sends;
Through the Embedded Speech Recognition System database said voice messaging is discerned, resolved, obtain the corresponding local recognition result of said voice messaging and the confidence value of said local recognition result;
If the confidence value of said local recognition result greater than preset reliable degree thresholding, is exported said local recognition result;
Otherwise, send said voice messaging to the cloud computing platform server, make said cloud computing platform server discern, resolve said voice messaging through the far-end speech identification database, obtain the corresponding far-end recognition result of said voice messaging;
Export the far-end recognition result that said cloud computing platform server returns.
2. audio recognition method according to claim 1 is characterized in that, also comprises:
The confidence value of sending said local recognition result and local recognition result to said cloud computing platform server;
The far-end recognition result that the said cloud computing platform server of then said output returns replaces with:
If the confidence value of said far-end recognition result smaller or equal to the confidence value of local recognition result, according to the control command that the cloud computing platform server returns, is exported local recognition result, said control command is used for the local recognition result of indication output.
3. audio recognition method according to claim 1 is characterized in that, also comprises:
Obtain database update information from said cloud computing platform server;
According to the said Embedded Speech Recognition System database of said database update information updating.
4. according to any described audio recognition method among the claim 1-3, it is characterized in that said Embedded Speech Recognition System database is used for control store instruction.
5. a speech recognition equipment is characterized in that, comprising:
The voice receiver module is used to receive the voice messaging that the user sends;
Identification module is used for through the Embedded Speech Recognition System database said voice messaging being discerned, being resolved, and obtains the corresponding local recognition result of said voice messaging and the confidence value of said local recognition result;
First output module is if the confidence value that is used for said local recognition result is exported said local recognition result greater than preset reliable degree thresholding;
Information sending module; Be used for otherwise; Send said voice messaging to the cloud computing platform server, make said cloud computing platform server discern, resolve said voice messaging, obtain the corresponding far-end recognition result of said voice messaging through the far-end speech identification database;
Second output module is used to export the far-end recognition result that said cloud computing platform server returns.
6. speech recognition equipment according to claim 5 is characterized in that, also comprises:
The recognition result sending module is used for the confidence value of sending said local recognition result and local recognition result to said cloud computing platform server;
Said second output module; If also be used for the confidence value of the confidence value of said far-end recognition result smaller or equal to local recognition result; According to the control command that the cloud computing platform server returns, export local recognition result, said control command is used for the local recognition result of indication output.
7. speech recognition equipment according to claim 5 is characterized in that, also comprises:
The lastest imformation acquisition module is used for obtaining database update information from said cloud computing platform server;
Update module is used for according to the said Embedded Speech Recognition System database of said database update information updating.
8. according to any described speech recognition equipment among the claim 5-7, it is characterized in that said Embedded Speech Recognition System database is used for control store instruction.
9. a speech recognition system is characterized in that, comprising:
Speech recognition equipment is used to receive the voice messaging that the user sends; Through the Embedded Speech Recognition System database said voice messaging is discerned, resolved, obtain the corresponding local recognition result of said voice messaging and the confidence value of said local recognition result; If the confidence value of said local recognition result greater than preset reliable degree thresholding, is exported said local recognition result; Otherwise, send said voice messaging to the cloud computing platform server; Export the far-end recognition result that said cloud computing platform server returns;
Said cloud computing platform server is used to receive the voice messaging that said speech recognition equipment sends; Said voice messaging is discerned, resolved, obtain the corresponding far-end recognition result of said voice messaging; Send said far-end recognition result to said speech recognition equipment.
10. speech recognition system according to claim 9 is characterized in that,
Said speech recognition equipment also is used for the confidence value of sending said local recognition result and local recognition result to said cloud computing platform server; According to the control command that the cloud computing platform server returns, export local recognition result;
Said cloud computing platform server also is used to obtain the confidence value of said far-end recognition result; If the confidence value of said far-end recognition result smaller or equal to the confidence value of local recognition result, is sent the control command of the local recognition result of indication output to said speech recognition equipment.
CN2012101233692A 2012-04-25 2012-04-25 Method, device and system for voice recognition Pending CN102708865A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2012101233692A CN102708865A (en) 2012-04-25 2012-04-25 Method, device and system for voice recognition

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2012101233692A CN102708865A (en) 2012-04-25 2012-04-25 Method, device and system for voice recognition

Publications (1)

Publication Number Publication Date
CN102708865A true CN102708865A (en) 2012-10-03

Family

ID=46901567

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2012101233692A Pending CN102708865A (en) 2012-04-25 2012-04-25 Method, device and system for voice recognition

Country Status (1)

Country Link
CN (1) CN102708865A (en)

Cited By (43)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102968992A (en) * 2012-11-26 2013-03-13 北京奇虎科技有限公司 Voice identification processing method for internet explorer and internet explorer
CN103247291A (en) * 2013-05-07 2013-08-14 华为终端有限公司 Updating method, device, and system of voice recognition device
CN103440867A (en) * 2013-08-02 2013-12-11 安徽科大讯飞信息科技股份有限公司 Method and system for recognizing voice
CN103488384A (en) * 2013-09-30 2014-01-01 乐视致新电子科技(天津)有限公司 Voice assistant application interface display method and device
CN103488401A (en) * 2013-09-30 2014-01-01 乐视致新电子科技(天津)有限公司 Voice assistant activating method and device
CN103489444A (en) * 2013-09-30 2014-01-01 乐视致新电子科技(天津)有限公司 Speech recognition method and device
CN104240707A (en) * 2012-11-26 2014-12-24 北京奇虎科技有限公司 Browser and voice identification processing method for same
CN104407834A (en) * 2014-11-13 2015-03-11 腾讯科技(成都)有限公司 Message input method and device
CN104536978A (en) * 2014-12-05 2015-04-22 奇瑞汽车股份有限公司 Voice data identifying method and device
CN104575494A (en) * 2013-10-16 2015-04-29 中兴通讯股份有限公司 Speech processing method and terminal
CN104681026A (en) * 2013-11-27 2015-06-03 夏普株式会社 Voice Recognition Terminal, Server, Method Of Controlling Server, Voice Recognition System,non-transitory Storage Medium
CN104795069A (en) * 2014-01-21 2015-07-22 腾讯科技(深圳)有限公司 Speech recognition method and server
CN104916283A (en) * 2015-06-11 2015-09-16 百度在线网络技术(北京)有限公司 Voice recognition method and device
CN104978971A (en) * 2014-04-08 2015-10-14 安徽科大讯飞信息科技股份有限公司 Oral evaluation method and system
CN105118508A (en) * 2015-09-14 2015-12-02 百度在线网络技术(北京)有限公司 Voice recognition method and device
CN105578240A (en) * 2015-12-23 2016-05-11 广州视源电子科技股份有限公司 Television terminal interaction method and system
CN105824857A (en) * 2015-01-08 2016-08-03 中兴通讯股份有限公司 Voice search method, device and terminal
CN105931633A (en) * 2016-05-30 2016-09-07 深圳市鼎盛智能科技有限公司 Speech recognition method and system
CN105931645A (en) * 2016-04-12 2016-09-07 深圳市京华信息技术有限公司 Control method of virtual reality device, apparatus, virtual reality device and system
CN106019993A (en) * 2016-06-01 2016-10-12 佛山市顺德区美的电热电器制造有限公司 Cooking system
CN106098062A (en) * 2016-06-16 2016-11-09 杭州古北电子科技有限公司 Intelligent sound control system for identifying that processing locality is combined with wireless network and method
CN106126714A (en) * 2016-06-30 2016-11-16 联想(北京)有限公司 Information processing method and information processor
CN106228975A (en) * 2016-09-08 2016-12-14 康佳集团股份有限公司 The speech recognition system of a kind of mobile terminal and method
CN106328148A (en) * 2016-08-19 2017-01-11 上汽通用汽车有限公司 Natural speech recognition method, natural speech recognition device and natural speech recognition system based on local and cloud hybrid recognition
CN106847287A (en) * 2017-01-22 2017-06-13 陈海峰 Word read recognition methods, user terminal and word read identifying system
CN106847291A (en) * 2017-02-20 2017-06-13 成都启英泰伦科技有限公司 Speech recognition system and method that a kind of local and high in the clouds is combined
CN106910504A (en) * 2015-12-22 2017-06-30 北京君正集成电路股份有限公司 A kind of speech reminding method and device based on speech recognition
CN106992009A (en) * 2017-05-03 2017-07-28 深圳车盒子科技有限公司 Vehicle-mounted voice exchange method, system and computer-readable recording medium
CN107146617A (en) * 2017-06-15 2017-09-08 成都启英泰伦科技有限公司 A kind of novel voice identification equipment and method
CN107785019A (en) * 2017-10-26 2018-03-09 西安Tcl软件开发有限公司 Mobile unit and its audio recognition method, readable storage medium storing program for executing
CN109869862A (en) * 2019-01-23 2019-06-11 四川虹美智能科技有限公司 The control method and a kind of air-conditioning system of a kind of air-conditioning, a kind of air-conditioning
CN109949815A (en) * 2014-04-07 2019-06-28 三星电子株式会社 Electronic device
CN110299136A (en) * 2018-03-22 2019-10-01 上海擎感智能科技有限公司 A kind of processing method and its system for speech recognition
CN110706711A (en) * 2014-01-17 2020-01-17 微软技术许可有限责任公司 Merging of exogenous large vocabulary models into rule-based speech recognition
WO2020119438A1 (en) * 2018-12-11 2020-06-18 青岛海尔洗衣机有限公司 Voice control method, cloud server and terminal device
WO2020119437A1 (en) * 2018-12-11 2020-06-18 青岛海尔洗衣机有限公司 Voice control method, cloud server and terminal device
CN112509585A (en) * 2020-12-22 2021-03-16 北京百度网讯科技有限公司 Voice processing method, device and equipment of vehicle-mounted equipment and storage medium
CN112562660A (en) * 2019-09-25 2021-03-26 深圳云端生活科技有限公司 Combined speech recognition processing method
CN112714284A (en) * 2020-12-22 2021-04-27 全球能源互联网研究院有限公司 Power equipment detection method and device and mobile terminal
CN113129896A (en) * 2019-12-30 2021-07-16 北京猎户星空科技有限公司 Voice interaction method and device, electronic equipment and storage medium
WO2022063288A1 (en) * 2020-09-27 2022-03-31 中国商用飞机有限责任公司北京民用飞机技术研究中心 On-board information assisting system and method
WO2022217621A1 (en) * 2021-04-17 2022-10-20 华为技术有限公司 Speech interaction method and apparatus
US11817101B2 (en) 2013-09-19 2023-11-14 Microsoft Technology Licensing, Llc Speech recognition using phoneme matching

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1448915A (en) * 2002-04-01 2003-10-15 欧姆龙株式会社 Sound recognition system, device, sound recognition method and sound recognition program
US20060009980A1 (en) * 2004-07-12 2006-01-12 Burke Paul M Allocation of speech recognition tasks and combination of results thereof
CN101454775A (en) * 2006-05-23 2009-06-10 摩托罗拉公司 Grammar adaptation through cooperative client and server based speech recognition
CN102196207A (en) * 2011-05-12 2011-09-21 深圳市子栋科技有限公司 Method, device and system for controlling television by using voice

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1448915A (en) * 2002-04-01 2003-10-15 欧姆龙株式会社 Sound recognition system, device, sound recognition method and sound recognition program
US20060009980A1 (en) * 2004-07-12 2006-01-12 Burke Paul M Allocation of speech recognition tasks and combination of results thereof
CN101454775A (en) * 2006-05-23 2009-06-10 摩托罗拉公司 Grammar adaptation through cooperative client and server based speech recognition
CN102196207A (en) * 2011-05-12 2011-09-21 深圳市子栋科技有限公司 Method, device and system for controlling television by using voice

Cited By (56)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104240707A (en) * 2012-11-26 2014-12-24 北京奇虎科技有限公司 Browser and voice identification processing method for same
CN102968992A (en) * 2012-11-26 2013-03-13 北京奇虎科技有限公司 Voice identification processing method for internet explorer and internet explorer
CN102968992B (en) * 2012-11-26 2014-11-05 北京奇虎科技有限公司 Voice identification processing method for internet explorer and internet explorer
CN103247291A (en) * 2013-05-07 2013-08-14 华为终端有限公司 Updating method, device, and system of voice recognition device
WO2014180218A1 (en) * 2013-05-07 2014-11-13 华为终端有限公司 Update method, apparatus and system for voice recognition device
CN103440867A (en) * 2013-08-02 2013-12-11 安徽科大讯飞信息科技股份有限公司 Method and system for recognizing voice
CN103440867B (en) * 2013-08-02 2016-08-10 科大讯飞股份有限公司 Audio recognition method and system
US11817101B2 (en) 2013-09-19 2023-11-14 Microsoft Technology Licensing, Llc Speech recognition using phoneme matching
CN103488401A (en) * 2013-09-30 2014-01-01 乐视致新电子科技(天津)有限公司 Voice assistant activating method and device
CN103489444A (en) * 2013-09-30 2014-01-01 乐视致新电子科技(天津)有限公司 Speech recognition method and device
CN103488384A (en) * 2013-09-30 2014-01-01 乐视致新电子科技(天津)有限公司 Voice assistant application interface display method and device
CN104575494A (en) * 2013-10-16 2015-04-29 中兴通讯股份有限公司 Speech processing method and terminal
CN104681026A (en) * 2013-11-27 2015-06-03 夏普株式会社 Voice Recognition Terminal, Server, Method Of Controlling Server, Voice Recognition System,non-transitory Storage Medium
CN104681026B (en) * 2013-11-27 2019-03-15 夏普株式会社 Voice recognition terminal and system, server and its control method
CN110706711A (en) * 2014-01-17 2020-01-17 微软技术许可有限责任公司 Merging of exogenous large vocabulary models into rule-based speech recognition
CN110706711B (en) * 2014-01-17 2023-11-28 微软技术许可有限责任公司 Merging exogenous large vocabulary models into rule-based speech recognition
CN104795069A (en) * 2014-01-21 2015-07-22 腾讯科技(深圳)有限公司 Speech recognition method and server
CN109949815A (en) * 2014-04-07 2019-06-28 三星电子株式会社 Electronic device
CN109949815B (en) * 2014-04-07 2024-06-07 三星电子株式会社 Electronic device
CN104978971B (en) * 2014-04-08 2019-04-05 科大讯飞股份有限公司 A kind of method and system for evaluating spoken language
CN104978971A (en) * 2014-04-08 2015-10-14 安徽科大讯飞信息科技股份有限公司 Oral evaluation method and system
CN104407834A (en) * 2014-11-13 2015-03-11 腾讯科技(成都)有限公司 Message input method and device
CN104536978A (en) * 2014-12-05 2015-04-22 奇瑞汽车股份有限公司 Voice data identifying method and device
CN105824857A (en) * 2015-01-08 2016-08-03 中兴通讯股份有限公司 Voice search method, device and terminal
CN104916283A (en) * 2015-06-11 2015-09-16 百度在线网络技术(北京)有限公司 Voice recognition method and device
CN105118508B (en) * 2015-09-14 2018-10-23 百度在线网络技术(北京)有限公司 Audio recognition method and device
CN105118508A (en) * 2015-09-14 2015-12-02 百度在线网络技术(北京)有限公司 Voice recognition method and device
CN106910504A (en) * 2015-12-22 2017-06-30 北京君正集成电路股份有限公司 A kind of speech reminding method and device based on speech recognition
CN105578240A (en) * 2015-12-23 2016-05-11 广州视源电子科技股份有限公司 Television terminal interaction method and system
CN105931645A (en) * 2016-04-12 2016-09-07 深圳市京华信息技术有限公司 Control method of virtual reality device, apparatus, virtual reality device and system
CN105931633A (en) * 2016-05-30 2016-09-07 深圳市鼎盛智能科技有限公司 Speech recognition method and system
WO2017206661A1 (en) * 2016-05-30 2017-12-07 深圳市鼎盛智能科技有限公司 Voice recognition method and system
CN106019993A (en) * 2016-06-01 2016-10-12 佛山市顺德区美的电热电器制造有限公司 Cooking system
CN106098062A (en) * 2016-06-16 2016-11-09 杭州古北电子科技有限公司 Intelligent sound control system for identifying that processing locality is combined with wireless network and method
CN106126714A (en) * 2016-06-30 2016-11-16 联想(北京)有限公司 Information processing method and information processor
CN106328148A (en) * 2016-08-19 2017-01-11 上汽通用汽车有限公司 Natural speech recognition method, natural speech recognition device and natural speech recognition system based on local and cloud hybrid recognition
CN106228975A (en) * 2016-09-08 2016-12-14 康佳集团股份有限公司 The speech recognition system of a kind of mobile terminal and method
CN106847287A (en) * 2017-01-22 2017-06-13 陈海峰 Word read recognition methods, user terminal and word read identifying system
CN106847291A (en) * 2017-02-20 2017-06-13 成都启英泰伦科技有限公司 Speech recognition system and method that a kind of local and high in the clouds is combined
CN106992009A (en) * 2017-05-03 2017-07-28 深圳车盒子科技有限公司 Vehicle-mounted voice exchange method, system and computer-readable recording medium
CN107146617A (en) * 2017-06-15 2017-09-08 成都启英泰伦科技有限公司 A kind of novel voice identification equipment and method
CN107785019A (en) * 2017-10-26 2018-03-09 西安Tcl软件开发有限公司 Mobile unit and its audio recognition method, readable storage medium storing program for executing
CN110299136A (en) * 2018-03-22 2019-10-01 上海擎感智能科技有限公司 A kind of processing method and its system for speech recognition
US11705129B2 (en) 2018-12-11 2023-07-18 Qingdao Haier Washing Machine Co., Ltd. Voice control method, cloud server and terminal device
CN111312234A (en) * 2018-12-11 2020-06-19 青岛海尔洗衣机有限公司 Voice control method, cloud processor and terminal equipment
WO2020119437A1 (en) * 2018-12-11 2020-06-18 青岛海尔洗衣机有限公司 Voice control method, cloud server and terminal device
US11967320B2 (en) 2018-12-11 2024-04-23 Qingdao Haier Washing Machine Co., Ltd. Processing voice information with a terminal device and a cloud server to control an operation
WO2020119438A1 (en) * 2018-12-11 2020-06-18 青岛海尔洗衣机有限公司 Voice control method, cloud server and terminal device
CN109869862A (en) * 2019-01-23 2019-06-11 四川虹美智能科技有限公司 The control method and a kind of air-conditioning system of a kind of air-conditioning, a kind of air-conditioning
CN112562660A (en) * 2019-09-25 2021-03-26 深圳云端生活科技有限公司 Combined speech recognition processing method
CN113129896A (en) * 2019-12-30 2021-07-16 北京猎户星空科技有限公司 Voice interaction method and device, electronic equipment and storage medium
CN113129896B (en) * 2019-12-30 2023-12-12 北京猎户星空科技有限公司 Voice interaction method and device, electronic equipment and storage medium
WO2022063288A1 (en) * 2020-09-27 2022-03-31 中国商用飞机有限责任公司北京民用飞机技术研究中心 On-board information assisting system and method
CN112509585A (en) * 2020-12-22 2021-03-16 北京百度网讯科技有限公司 Voice processing method, device and equipment of vehicle-mounted equipment and storage medium
CN112714284A (en) * 2020-12-22 2021-04-27 全球能源互联网研究院有限公司 Power equipment detection method and device and mobile terminal
WO2022217621A1 (en) * 2021-04-17 2022-10-20 华为技术有限公司 Speech interaction method and apparatus

Similar Documents

Publication Publication Date Title
CN102708865A (en) Method, device and system for voice recognition
CN1333385C (en) Voice browser dialog enabler for a communication system
CN102196207B (en) Method, device and system for controlling television by using voice
CN104715752A (en) Voice recognition method, voice recognition device and voice recognition system
CN104123940A (en) Voice control system and method based on intelligent home system
CN104754536A (en) Method and system for realizing communication between different languages
CN105206272A (en) Voice transmission control method and system
CN101576901A (en) Method for generating search request and mobile communication equipment
CN102708858A (en) Voice bank realization voice recognition system and method based on organizing way
CN103377652A (en) Method, device and equipment for carrying out voice recognition
CN103956167A (en) Visual sign language interpretation method and device based on Web
CN104091478A (en) Answering-while-questioning learning machine and network learning system
CN110992955A (en) Voice operation method, device, equipment and storage medium of intelligent equipment
CN106205613B (en) A kind of navigation audio recognition method and system
CN102004624A (en) Voice recognition control system and method
CN103076893A (en) Method and equipment for realizing voice input
CN108538289A (en) The method, apparatus and terminal device of voice remote control are realized based on bluetooth
CN103701994A (en) Automatic responding method and automatic responding device
CN110139127A (en) Audio file play method, server, intelligent sound box and play system
CN114155855A (en) Voice recognition method, server and electronic equipment
CN109670109A (en) Information acquisition method, device, server, terminal and medium
CN101943991A (en) Input method and equipment based on cloud computing
CN108540677A (en) Method of speech processing and system
CN104216982A (en) Information processing method and electronic equipment
CN104135569A (en) Method for seeking for help, method for processing help-seeking behavior and smart mobile apparatus for seeking for help

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20121003