CN104778946A - Voice control method and system

Publication number
CN104778946A
Authority
CN
China
Prior art keywords
user
command information
corpus
speech command
control instruction
Legal status
Pending
Application number
CN201410011484.XA
Other languages
Chinese (zh)
Inventor
马宇飞
邓佳佳
林毅
Current Assignee
China Telecom Corp Ltd
Original Assignee
China Telecom Corp Ltd

Abstract

The invention discloses a voice control method and a system. In the voice control method, a mobile terminal sends acquired user voice instruction information to a voiceprint recognition server via a network access terminal; the voiceprint recognition server carries out voiceprint recognition on the user voice instruction information and sends a user identifier corresponding to the recognized voiceprint to the network access equipment; the network access equipment sends the user voice instruction information and the user identifier to a voice recognition server; the voice recognition server extracts a control instruction corresponding to the user voice instruction information in a user corpus related to the user identifier and sends the control instruction to the network access equipment so as to enable the network access equipment to perform corresponding operation according to the control instruction. The voiceprint recognition technology is used for distinguishing users, voice recognition is carried out on the basis of the user personalized corpus, voice recognition accuracy is improved, the voice recognition consumption time is shortened, and the user can acquire better use experience.

Description

Voice control method and system
Technical field
The present invention relates to the field of communications, and in particular to a voice control method and system.
Background
Speech recognition is a technology that lets a machine convert a voice signal into the corresponding text or command by means of speech recognition and semantic understanding. Voice control is the use of speech recognition to control real-world objects.
A corpus stores real language material that accumulates during actual use of a voice control system. A corpus can improve the accuracy and efficiency of semantic understanding in the voice control system.
Voiceprint recognition is a technology that automatically identifies a speaker from the speaker information contained in the speech waveform. Because of physiological differences in the vocal organs and behavioral differences acquired later in life, every person's voice carries strong individual characteristics, and it is hard to find two people with identical voiceprints. This property can therefore be used for identity authentication.
However, in scenarios with many users, each user has different usage habits and common expressions, so it is difficult for the voice server to build an accurate corpus. Several rounds of interaction are often needed to confirm what the user means, which degrades the user experience.
Summary of the invention
Embodiments of the present invention provide a voice control method and system. Voiceprint recognition is used to distinguish users, and speech recognition is performed on the basis of a personalized user corpus, which improves the accuracy and efficiency of speech recognition.
According to one aspect of the present invention, a voice control method is provided, comprising:
A mobile terminal sends the collected user speech command information to a network access terminal;
The network access terminal sends the user speech command information to a voiceprint recognition server;
The voiceprint recognition server performs voiceprint recognition on the user speech command information and sends the user ID corresponding to the recognized voiceprint to the network access equipment;
The network access equipment sends the user speech command information and the user ID to a speech recognition server;
The speech recognition server queries the user corpus associated with the user ID;
The speech recognition server extracts, from the user corpus associated with the user ID, the control instruction corresponding to the user speech command information and sends the control instruction to the network access equipment, so that the network access equipment performs the corresponding operation according to the control instruction.
In one embodiment, the step in which the speech recognition server extracts, from the user corpus associated with the user ID, the control instruction corresponding to the user speech command information comprises:
The speech recognition server determines whether the user corpus associated with the user ID contains a control instruction corresponding to the user speech command information;
If the user corpus associated with the user ID contains a control instruction corresponding to the user speech command information, the step of extracting that control instruction is performed.
In one embodiment, if the user corpus associated with the user ID does not contain a control instruction corresponding to the user speech command information, speech recognition is performed on the user speech command information using a general corpus to obtain the control instruction, and the control instruction and the corresponding user speech command information are stored in the user corpus associated with the user ID.
In one embodiment, the step in which the speech recognition server queries the user corpus associated with the user ID comprises:
The speech recognition server determines whether a user corpus associated with the user ID is found;
If a user corpus associated with the user ID is found, the step in which the speech recognition server extracts, from that user corpus, the control instruction corresponding to the user speech command information is performed.
In one embodiment, if no user corpus associated with the user ID is found, the speech recognition server creates a user corpus associated with the user ID, performs speech recognition on the user speech command information to obtain the control instruction, stores the control instruction and the corresponding user speech command information in the user corpus associated with the user ID, and then performs the step of sending the control instruction to the network access equipment.
In one embodiment, the step in which the voiceprint recognition server performs voiceprint recognition on the user speech command information and sends the user ID corresponding to the recognized voiceprint to the network access equipment comprises:
The voiceprint recognition server performs voiceprint recognition on the user speech command information to obtain voiceprint information;
The voiceprint recognition server determines whether the voiceprint information exists in the voiceprint library;
If the voiceprint information exists in the voiceprint library, the step of sending the user ID corresponding to the recognized voiceprint to the network access equipment is performed.
In one embodiment, if the voiceprint information does not exist in the voiceprint library, the voiceprint information is stored in the voiceprint library, a corresponding user ID is assigned to the voiceprint information, and the assigned user ID is then sent to the network access equipment.
In one embodiment, the mobile terminal is a remote control and the network access terminal is a set-top box.
According to another aspect of the present invention, a voice control system is provided, comprising a mobile terminal, a network access terminal, a voiceprint recognition server, and a speech recognition server, wherein:
The mobile terminal is configured to collect user speech command information and send the collected user speech command information to the network access terminal;
The network access terminal is configured to send the user speech command information to the voiceprint recognition server upon receiving it from the mobile terminal, and to send the user speech command information and the user ID to the speech recognition server upon receiving the user ID from the voiceprint recognition server;
The voiceprint recognition server is configured to perform voiceprint recognition on the user speech command information upon receiving it from the network access terminal, and to send the user ID corresponding to the recognized voiceprint to the network access equipment;
The speech recognition server is configured, upon receiving the user speech command information and the user ID from the network access terminal, to query the user corpus associated with the user ID, extract from that user corpus the control instruction corresponding to the user speech command information, and send the control instruction to the network access equipment, so that the network access equipment performs the corresponding operation according to the control instruction.
In one embodiment, upon receiving the user speech command information and the user ID from the network access terminal, the speech recognition server specifically determines whether the user corpus associated with the user ID contains a control instruction corresponding to the user speech command information; if it does, the speech recognition server performs the operation of extracting that control instruction.
In one embodiment, the speech recognition server is further configured, when the user corpus associated with the user ID does not contain a control instruction corresponding to the user speech command information, to perform speech recognition on the user speech command information using a general corpus to obtain the control instruction, and to store the control instruction and the corresponding user speech command information in the user corpus associated with the user ID.
In one embodiment, upon receiving the user speech command information and the user ID from the network access terminal, the speech recognition server specifically determines whether a user corpus associated with the user ID is found; if one is found, the speech recognition server performs the operation of extracting, from that user corpus, the control instruction corresponding to the user speech command information.
In one embodiment, the speech recognition server is further configured, when no user corpus associated with the user ID is found, to create a user corpus associated with the user ID, perform speech recognition on the user speech command information to obtain the control instruction, store the control instruction and the corresponding user speech command information in the user corpus associated with the user ID, and then perform the operation of sending the control instruction to the network access equipment.
In one embodiment, upon receiving the user speech command information from the network access terminal, the voiceprint recognition server specifically performs voiceprint recognition on the user speech command information to obtain voiceprint information and determines whether the voiceprint information exists in the voiceprint library; if it does, the voiceprint recognition server performs the operation of sending the user ID corresponding to the recognized voiceprint to the network access equipment.
In one embodiment, the voiceprint recognition server is further configured, when the voiceprint information does not exist in the voiceprint library, to store the voiceprint information in the voiceprint library, assign a corresponding user ID to the voiceprint information, and then send the assigned user ID to the network access equipment.
In one embodiment, the mobile terminal is a remote control and the network access terminal is a set-top box.
The present invention confirms the identity of the current user through voiceprint recognition and uses the personalized corpus associated with that identity to extract the control instruction corresponding to the user's voice instruction. This can improve the accuracy of speech recognition, shorten the time speech recognition takes, and give the user a better experience.
The description of the present invention is given for the purposes of illustration and description and is not intended to be exhaustive or to limit the invention to the disclosed form. Many modifications and variations will be apparent to those of ordinary skill in the art. The embodiments were chosen and described in order to better explain the principles and practical application of the invention, and to enable those of ordinary skill in the art to understand the invention and design various embodiments, with various modifications, suited to particular uses.
Brief description of the drawings
To illustrate the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawings needed to describe the embodiments or the prior art are briefly introduced below. The drawings described below are only some embodiments of the present invention; those of ordinary skill in the art can obtain other drawings from them without creative effort.
Fig. 1 is a schematic diagram of one embodiment of the voice control method of the present invention.
Fig. 2 is a schematic diagram of another embodiment of the voice control method of the present invention.
Fig. 3 is a schematic diagram of yet another embodiment of the voice control method of the present invention.
Fig. 4 is a schematic diagram of one embodiment of the voice control system of the present invention.
Detailed description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. The following description of at least one exemplary embodiment is merely illustrative and in no way limits the invention, its application, or its use. All other embodiments that those of ordinary skill in the art obtain from the embodiments of the present invention without creative effort fall within the scope of protection of the present invention.
Unless specifically stated otherwise, the relative arrangement of components and steps, the numerical expressions, and the numerical values set forth in these embodiments do not limit the scope of the invention.
It should also be understood that, for ease of description, the parts shown in the drawings are not drawn to actual scale.
Techniques, methods, and devices known to those of ordinary skill in the relevant art may not be discussed in detail, but where appropriate they should be regarded as part of the granted specification.
In all examples shown and discussed here, any specific value should be interpreted as merely exemplary and not as a limitation. Other examples of the exemplary embodiments may therefore have different values.
It should be noted that similar reference numerals and letters denote similar items in the following drawings, so once an item is defined in one drawing it need not be discussed further in subsequent drawings.
Fig. 1 is a schematic diagram of one embodiment of the voice control method of the present invention. As shown in Fig. 1, the steps of the method in this embodiment are as follows:
Step 101, the mobile terminal sends the collected user speech command information to the network access terminal.
Step 102, the network access terminal sends the user speech command information to the voiceprint recognition server.
Step 103, the voiceprint recognition server performs voiceprint recognition on the user speech command information and sends the user ID corresponding to the recognized voiceprint to the network access equipment.
Step 104, the network access equipment sends the user speech command information and the user ID to the speech recognition server.
Step 105, the speech recognition server queries the user corpus associated with the user ID.
Step 106, the speech recognition server extracts, from the user corpus associated with the user ID, the control instruction corresponding to the user speech command information and sends the control instruction to the network access equipment, so that the network access equipment performs the corresponding operation according to the control instruction.
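Steps 101 to 106 can be sketched as a short message flow. This is a minimal illustration under stated assumptions, not the patented implementation: the class names, the `Utterance` type, and the instruction strings are hypothetical, and a plain string stands in for both the acoustic voiceprint and the spoken content, where a real system would carry audio.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Utterance:
    """Hypothetical stand-in for captured audio."""
    voiceprint: str  # placeholder for the speaker's acoustic features
    phrase: str      # placeholder for the spoken content


class VoiceprintServer:
    def __init__(self, voiceprint_to_user):
        self.voiceprint_to_user = voiceprint_to_user

    def identify(self, utterance):
        # Step 103: map the recognized voiceprint to a user ID.
        return self.voiceprint_to_user[utterance.voiceprint]


class SpeechServer:
    def __init__(self, user_corpora):
        # user ID -> {phrase: control instruction}
        self.user_corpora = user_corpora

    def recognise(self, utterance, user_id):
        corpus = self.user_corpora[user_id]  # step 105: query the user corpus
        return corpus[utterance.phrase]      # step 106: extract the instruction


class AccessTerminal:
    def __init__(self, voiceprint_server, speech_server):
        self.voiceprint_server = voiceprint_server
        self.speech_server = speech_server
        self.executed = []

    def on_voice(self, utterance):
        # Step 101 is the caller (the mobile terminal) handing over the utterance.
        user_id = self.voiceprint_server.identify(utterance)            # steps 102-103
        instruction = self.speech_server.recognise(utterance, user_id)  # steps 104-106
        self.executed.append(instruction)  # the terminal acts on the instruction
        return instruction


terminal = AccessTerminal(
    VoiceprintServer({"vp-alice": "user-1", "vp-bob": "user-2"}),
    SpeechServer({
        "user-1": {"news": "TUNE_CHANNEL_13"},
        "user-2": {"news": "OPEN_NEWS_APP"},
    }),
)
result_alice = terminal.on_voice(Utterance("vp-alice", "news"))
result_bob = terminal.on_voice(Utterance("vp-bob", "news"))
```

The same phrase resolves to a different control instruction for each speaker, which is the effect the per-user corpus is meant to achieve.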
With the voice control method provided by the above embodiment of the present invention, the identity of the current user is confirmed through voiceprint recognition, and the personalized corpus associated with that identity is used to extract the control instruction corresponding to the user's voice instruction. This can improve the accuracy of speech recognition, shorten the time speech recognition takes, and give the user a better experience.
In one embodiment, the method can be applied to an IPTV (Internet Protocol Television) voice control system, in which the mobile terminal may be a remote control and the network access terminal may be a set-top box. In the IPTV voice control system, the IPTV voice remote control collects the voice of each new user and passes it to the set-top box, forming a voiceprint library entry for that new user under the set-top box. When a user controls IPTV by voice, the user's identity can be recognized from the user's voiceprint features, and a personalized corpus for the user is built up gradually during everyday use. The IPTV speech recognition server can then search the user's personalized corpus first during speech recognition, which improves the accuracy of the recognition result, shortens the time recognition takes, and improves the product's user experience.
Likewise, the present invention can be used in other similar voice control scenarios with a fixed set of users, such as smart home voice control systems, home KTV song-request voice control systems, and in-vehicle on-demand voice control systems.
Fig. 2 is a schematic diagram of another embodiment of the voice control method of the present invention. Compared with the embodiment shown in Fig. 1, the embodiment shown in Fig. 2 additionally supplements the user corpus when the user corpus contains no corresponding information, which can improve the user experience.
Step 201, the mobile terminal sends the collected user speech command information to the network access terminal.
Step 202, the network access terminal sends the user speech command information to the voiceprint recognition server.
Step 203, the voiceprint recognition server performs voiceprint recognition on the user speech command information and sends the user ID corresponding to the recognized voiceprint to the network access equipment.
Step 204, the network access equipment sends the user speech command information and the user ID to the speech recognition server.
Step 205, the speech recognition server queries the user corpus associated with the user ID.
Step 206, the speech recognition server determines whether the user corpus associated with the user ID contains a control instruction corresponding to the user speech command information. If it does not, step 207 is performed; if it does, step 208 is performed.
Step 207, the speech recognition server performs speech recognition on the user speech command information using the general corpus to obtain the control instruction, and stores the control instruction and the corresponding user speech command information in the user corpus associated with the user ID. Step 209 is then performed.
Step 208, the speech recognition server extracts the control instruction corresponding to the user speech command information.
Step 209, the speech recognition server sends the control instruction to the network access equipment, so that the network access equipment performs the corresponding operation according to the control instruction.
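The decision in steps 205 to 209 can be sketched as a lookup with a fallback and a write-back. This is a hedged illustration: the function name `resolve_instruction`, the dictionary layout, and the sample entries are assumptions, and a dictionary lookup stands in for real speech recognition against the general corpus.

```python
# Hypothetical general corpus shared by all users: phrase -> control instruction.
GENERAL_CORPUS = {"volume up": "VOLUME_UP", "next channel": "CHANNEL_NEXT"}


def resolve_instruction(phrase, user_id, user_corpora, general_corpus=GENERAL_CORPUS):
    """Return (control instruction, source), where source is 'user' or 'general'."""
    user_corpus = user_corpora[user_id]    # step 205: query the user's corpus
    if phrase in user_corpus:              # step 206: is there a matching entry?
        return user_corpus[phrase], "user"  # step 208: extract it directly
    # Step 207: fall back to the general corpus, then store the result in the
    # user corpus. The text does not say what happens when the general corpus
    # also has no match; this sketch simply lets the KeyError propagate.
    instruction = general_corpus[phrase]
    user_corpus[phrase] = instruction
    return instruction, "general"


corpora = {"user-1": {}}
first = resolve_instruction("volume up", "user-1", corpora)
second = resolve_instruction("volume up", "user-1", corpora)
```

The first call is answered from the general corpus and populates the user corpus; the second call for the same phrase is answered from the user corpus alone.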
Fig. 3 is a schematic diagram of yet another embodiment of the voice control method of the present invention. In this embodiment, when a new user appears or when the user corpus does not exist, the system automatically adds the relevant information, which improves the user experience.
Step 301, the mobile terminal sends the collected user speech command information to the network access terminal.
Step 302, the network access terminal sends the user speech command information to the voiceprint recognition server.
Step 303, the voiceprint recognition server performs voiceprint recognition on the user speech command information to obtain voiceprint information.
Step 304, the voiceprint recognition server determines whether the voiceprint information exists in the voiceprint library. If it does not, step 305 is performed; if it does, step 307 is performed.
Step 305, the voiceprint information is stored in the voiceprint library and a corresponding user ID is assigned to it.
Step 306, the assigned user ID is sent to the network access equipment. Step 308 is then performed.
Step 307, the user ID corresponding to the recognized voiceprint is sent to the network access equipment.
Step 308, the network access equipment sends the user speech command information and the user ID to the speech recognition server.
Step 309, the speech recognition server determines whether a user corpus associated with the user ID is found. If none is found, step 310 is performed; if one is found, step 311 is performed.
Step 310, the speech recognition server creates a user corpus associated with the user ID, performs speech recognition on the user speech command information to obtain the control instruction, and stores the control instruction and the corresponding user speech command information in the user corpus associated with the user ID. Step 312 is then performed.
Step 311, the speech recognition server extracts, from the user corpus associated with the user ID, the control instruction corresponding to the user speech command information.
Preferably, the extraction operation in step 311 can use the processing of the embodiment shown in Fig. 2.
Step 312, the speech recognition server sends the control instruction to the network access equipment, so that the network access equipment performs the corresponding operation according to the control instruction.
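The two decision points of Fig. 3 (step 304 and step 309) can be sketched together. This is an illustration under stated assumptions: the class `VoiceControlBackend` and its members are hypothetical, voiceprints are compared by exact key rather than by acoustic similarity, and step 310 is modelled as a general-corpus lookup even though the text only says that speech recognition is performed.

```python
import itertools


class VoiceControlBackend:
    """Hypothetical combination of both servers, for illustration only."""

    def __init__(self, general_corpus):
        self.voiceprint_library = {}  # voiceprint key -> user ID
        self.user_corpora = {}        # user ID -> {phrase: control instruction}
        self.general_corpus = general_corpus
        self._next_id = itertools.count(1)

    def user_id_for(self, voiceprint):
        if voiceprint not in self.voiceprint_library:  # step 304
            # Step 305: store the new voiceprint and assign it a user ID.
            self.voiceprint_library[voiceprint] = f"user-{next(self._next_id)}"
        return self.voiceprint_library[voiceprint]     # step 306 or 307

    def instruction_for(self, user_id, phrase):
        if user_id not in self.user_corpora:           # step 309
            # Step 310: create the corpus, recognize, and store the result.
            instruction = self.general_corpus[phrase]  # assumed recognition path
            self.user_corpora[user_id] = {phrase: instruction}
            return instruction                         # then step 312
        corpus = self.user_corpora[user_id]            # step 311
        if phrase not in corpus:
            # Fig. 2 handling, which the text suggests for step 311.
            corpus[phrase] = self.general_corpus[phrase]
        return corpus[phrase]                          # then step 312


backend = VoiceControlBackend({"mute": "MUTE", "pause": "PAUSE"})
uid = backend.user_id_for("vp-a")              # new voiceprint, new user ID
same_uid = backend.user_id_for("vp-a")         # known voiceprint, same user ID
other_uid = backend.user_id_for("vp-b")
first_instruction = backend.instruction_for(uid, "mute")    # creates the corpus
second_instruction = backend.instruction_for(uid, "pause")  # extends the corpus
```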
Fig. 4 is a schematic diagram of one embodiment of the voice control system of the present invention. As shown in Fig. 4, the system comprises a mobile terminal 401, a network access terminal 402, a voiceprint recognition server 403, and a speech recognition server 404, wherein:
The mobile terminal 401 is configured to collect user speech command information and send the collected user speech command information to the network access terminal 402.
The network access terminal 402 is configured to send the user speech command information to the voiceprint recognition server 403 upon receiving it from the mobile terminal 401, and to send the user speech command information and the user ID to the speech recognition server 404 upon receiving the user ID from the voiceprint recognition server 403.
The voiceprint recognition server 403 is configured to perform voiceprint recognition on the user speech command information upon receiving it from the network access terminal 402, and to send the user ID corresponding to the recognized voiceprint to the network access equipment 402.
The speech recognition server 404 is configured, upon receiving the user speech command information and the user ID from the network access terminal 402, to query the user corpus associated with the user ID, extract from that user corpus the control instruction corresponding to the user speech command information, and send the control instruction to the network access equipment 402, so that the network access equipment 402 performs the corresponding operation according to the control instruction.
With the voice control system provided by the above embodiment of the present invention, the identity of the current user is confirmed through voiceprint recognition, and the personalized corpus associated with that identity is used to extract the control instruction corresponding to the user's voice instruction. This can improve the accuracy of speech recognition, shorten the time speech recognition takes, and give the user a better experience.
Preferably, the system can be an IPTV voice control system, in which the mobile terminal is a remote control and the network access terminal is a set-top box. Likewise, the present invention can be used in other similar voice control scenarios with a fixed set of users, such as smart home voice control systems, home KTV song-request voice control systems, and in-vehicle on-demand voice control systems.
Preferably, upon receiving the user speech command information and the user ID from the network access terminal 402, the speech recognition server 404 specifically determines whether the user corpus associated with the user ID contains a control instruction corresponding to the user speech command information; if it does, the speech recognition server 404 performs the operation of extracting that control instruction.
Preferably, the speech recognition server 404 is further configured, when the user corpus associated with the user ID does not contain a control instruction corresponding to the user speech command information, to perform speech recognition on the user speech command information using a general corpus to obtain the control instruction, and to store the control instruction and the corresponding user speech command information in the user corpus associated with the user ID.
Preferably, upon receiving the user speech command information and the user ID from the network access terminal, the speech recognition server 404 specifically determines whether a user corpus associated with the user ID is found; if one is found, the speech recognition server 404 performs the operation of extracting, from that user corpus, the control instruction corresponding to the user speech command information.
Preferably, the speech recognition server 404 is further configured, when no user corpus associated with the user ID is found, to create a user corpus associated with the user ID, perform speech recognition on the user speech command information to obtain the control instruction, store the control instruction and the corresponding user speech command information in the user corpus associated with the user ID, and then perform the operation of sending the control instruction to the network access equipment.
Preferably, upon receiving the user speech command information from the network access terminal, the voiceprint recognition server 403 specifically performs voiceprint recognition on the user speech command information to obtain voiceprint information and determines whether the voiceprint information exists in the voiceprint library; if it does, the voiceprint recognition server 403 performs the operation of sending the user ID corresponding to the recognized voiceprint to the network access equipment.
Preferably, the voiceprint recognition server 403 is further configured, when the voiceprint information does not exist in the voiceprint library, to store the voiceprint information in the voiceprint library, assign a corresponding user ID to the voiceprint information, and then send the assigned user ID to the network access equipment.
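The voiceprint server's behaviour (match against the voiceprint library, otherwise store the voiceprint and assign a new user ID) can be sketched as follows. The source does not say how two voiceprints are compared; cosine similarity over a feature vector and the 0.9 threshold are assumptions chosen only for illustration, as are the class and method names.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class VoiceprintLibrary:
    def __init__(self, threshold=0.9):  # hypothetical similarity threshold
        self.threshold = threshold
        self.entries = []  # list of (user ID, stored feature vector)

    def identify_or_enrol(self, features):
        """Return (user ID, enrolled), where enrolled is True for a new voiceprint."""
        best_id, best_score = None, -1.0
        for user_id, stored in self.entries:
            score = cosine_similarity(features, stored)
            if score > best_score:
                best_id, best_score = user_id, score
        if best_id is not None and best_score >= self.threshold:
            return best_id, False  # voiceprint already in the library
        # Not found: store the voiceprint and assign a new user ID.
        new_id = f"user-{len(self.entries) + 1}"
        self.entries.append((new_id, list(features)))
        return new_id, True


library = VoiceprintLibrary()
first_speaker = library.identify_or_enrol([1.0, 0.0, 0.0])
same_speaker = library.identify_or_enrol([0.99, 0.05, 0.0])
second_speaker = library.identify_or_enrol([0.0, 1.0, 0.0])
```

A threshold is needed because two recordings of the same speaker never produce identical features; the right value depends on the feature extractor and is not given in the source.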
For example, a new IPTV user allows the IPTV service provider to set up the user's own voiceprint library and corpus, and records a segment of speech. The voiceprint recognition server extracts the user's voiceprint features from this speech and stores them in the household's voiceprint library (the voiceprint recognition server and its storage can be located locally on the set-top box or in the speech recognition server). During actual use, the speech recognition server builds the user's personalized corpus from the user's common expressions and speech habits when using IPTV, and stores it under the household's corpus.
Finally, when this user issues a voice instruction, the voiceprint recognition server identifies the user, and the speech recognition server searches this user's personalized corpus and returns the corresponding result.
In the present invention, because the corpus is specific to each individual user, speech recognition accuracy can improve significantly. A household has few users, so the voiceprint library is small and the time spent on voiceprint recognition is almost negligible. An individual corpus is also much smaller than a household corpus, so the time spent on speech recognition can be greatly shortened as well.
Those of ordinary skill in the art will understand that all or some of the steps of the above embodiments can be implemented in hardware, or by a program that instructs the relevant hardware. The program can be stored in a computer-readable storage medium, such as a read-only memory, a magnetic disk, or an optical disc.
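The effect of a smaller, per-user corpus can be illustrated with a toy fuzzy match. This is only a sketch: the corpora and instruction names are invented, and `difflib.get_close_matches` from the Python standard library stands in for a real recognizer, so it says nothing about actual recognition accuracy or timing.

```python
from difflib import get_close_matches

# Hypothetical corpora: phrase -> control instruction.
household_corpus = {
    "play the news": "TUNE_NEWS_CHANNEL",
    "play the blues": "OPEN_MUSIC_BLUES",
    "play cartoons": "TUNE_KIDS_CHANNEL",
}
personal_corpus = {"play the news": "TUNE_NEWS_CHANNEL"}


def candidates(heard, corpus):
    """Return every corpus phrase that is a close match for the heard text."""
    return get_close_matches(heard, list(corpus), n=len(corpus), cutoff=0.8)


heard = "play the nues"  # a slightly misrecognized phrase
household_hits = candidates(heard, household_corpus)
personal_hits = candidates(heard, personal_corpus)
```

Against the pooled household corpus the misheard phrase has two plausible candidates, which would need disambiguation; against the speaker's own corpus there is one candidate and fewer entries to compare.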

Claims (16)

1. A voice control method, characterized by comprising:
A mobile terminal sends the collected user speech command information to a network access terminal;
The network access terminal sends the user speech command information to a voiceprint recognition server;
The voiceprint recognition server performs voiceprint recognition on the user speech command information and sends the user ID corresponding to the recognized voiceprint to the network access equipment;
The network access equipment sends the user speech command information and the user ID to a speech recognition server;
The speech recognition server queries the user corpus associated with the user ID;
The speech recognition server extracts, from the user corpus associated with the user ID, the control instruction corresponding to the user speech command information and sends the control instruction to the network access equipment, so that the network access equipment performs the corresponding operation according to the control instruction.
2. The method according to claim 1, characterized in that
The step in which the speech recognition server extracts, from the user corpus associated with the user identifier, the control instruction corresponding to the user voice command information comprises:
The speech recognition server determines whether a control instruction corresponding to the user voice command information exists in the user corpus associated with the user identifier;
If a control instruction corresponding to the user voice command information exists in the user corpus associated with the user identifier, the step of extracting the control instruction corresponding to the user voice command information is performed.
3. The method according to claim 2, characterized in that
If no control instruction corresponding to the user voice command information exists in the user corpus associated with the user identifier, speech recognition is performed on the user voice command information using a general corpus to obtain a control instruction, and the control instruction and the corresponding user voice command information are stored in the user corpus associated with the user identifier.
4. The method according to any one of claims 1-3, characterized in that
The step in which the speech recognition server queries the user corpus associated with the user identifier comprises:
The speech recognition server determines whether a user corpus associated with the user identifier is found;
If a user corpus associated with the user identifier is found, the step in which the speech recognition server extracts, from the user corpus associated with the user identifier, the control instruction corresponding to the user voice command information is performed.
5. The method according to claim 4, characterized in that
If no user corpus associated with the user identifier is found, the speech recognition server creates a user corpus associated with the user identifier, performs speech recognition on the user voice command information to obtain a control instruction, stores the control instruction and the corresponding user voice command information in the user corpus associated with the user identifier, and then performs the step of sending the control instruction to the network access equipment.
6. The method according to any one of claims 1-3, characterized in that
The step in which the voiceprint recognition server performs voiceprint recognition on the user voice command information and sends the user identifier corresponding to the recognized voiceprint to the network access equipment comprises:
The voiceprint recognition server performs voiceprint recognition on the user voice command information to obtain voiceprint information;
It is determined whether the voiceprint information exists in a voiceprint library;
If the voiceprint information exists in the voiceprint library, the step of sending the user identifier corresponding to the recognized voiceprint to the network access equipment is performed.
7. The method according to claim 6, characterized in that
If the voiceprint information does not exist in the voiceprint library, the voiceprint information is stored in the voiceprint library, a corresponding user identifier is allocated to the voiceprint information, and the allocated user identifier is then sent to the network access equipment.
8. The method according to any one of claims 1-3, characterized in that
The mobile terminal is a remote control;
The network access terminal is a set-top box.
9. A voice control system, characterized by comprising a mobile terminal, a network access terminal, a voiceprint recognition server and a speech recognition server, wherein:
The mobile terminal is configured to collect user voice command information and send the collected user voice command information to the network access terminal;
The network access terminal is configured to, upon receiving the user voice command information sent by the mobile terminal, send the user voice command information to the voiceprint recognition server; and, upon receiving the user identifier sent by the voiceprint recognition server, send the user voice command information and the user identifier to the speech recognition server;
The voiceprint recognition server is configured to, upon receiving the user voice command information sent by the network access terminal, perform voiceprint recognition on the user voice command information and send a user identifier corresponding to the recognized voiceprint to network access equipment;
The speech recognition server is configured to, upon receiving the user voice command information and the user identifier sent by the network access terminal, query a user corpus associated with the user identifier, extract from the user corpus associated with the user identifier a control instruction corresponding to the user voice command information, and send the control instruction to the network access equipment, so that the network access equipment performs a corresponding operation according to the control instruction.
10. The system according to claim 9, characterized in that
The speech recognition server is specifically configured to, upon receiving the user voice command information and the user identifier sent by the network access terminal, determine whether a control instruction corresponding to the user voice command information exists in the user corpus associated with the user identifier; and, if such a control instruction exists in the user corpus associated with the user identifier, perform the operation of extracting the control instruction corresponding to the user voice command information.
11. The system according to claim 10, characterized in that
The speech recognition server is further configured to, when no control instruction corresponding to the user voice command information exists in the user corpus associated with the user identifier, perform speech recognition on the user voice command information using a general corpus to obtain a control instruction, and store the control instruction and the corresponding user voice command information in the user corpus associated with the user identifier.
12. The system according to any one of claims 9-11, characterized in that
The speech recognition server is specifically configured to, upon receiving the user voice command information and the user identifier sent by the network access terminal, determine whether a user corpus associated with the user identifier is found; and, if a user corpus associated with the user identifier is found, perform the operation of extracting, from the user corpus associated with the user identifier, the control instruction corresponding to the user voice command information.
13. The system according to claim 12, characterized in that
The speech recognition server is further configured to, when no user corpus associated with the user identifier is found, create a user corpus associated with the user identifier, perform speech recognition on the user voice command information to obtain a control instruction, store the control instruction and the corresponding user voice command information in the user corpus associated with the user identifier, and then perform the operation of sending the control instruction to the network access equipment.
14. The system according to any one of claims 9-11, characterized in that
The voiceprint recognition server is specifically configured to, upon receiving the user voice command information sent by the network access terminal, perform voiceprint recognition on the user voice command information to obtain voiceprint information; determine whether the voiceprint information exists in a voiceprint library; and, if the voiceprint information exists in the voiceprint library, perform the operation of sending the user identifier corresponding to the recognized voiceprint to the network access equipment.
15. The system according to claim 14, characterized in that
The voiceprint recognition server is further configured to, when the voiceprint information does not exist in the voiceprint library, store the voiceprint information in the voiceprint library, allocate a corresponding user identifier to the voiceprint information, and then send the allocated user identifier to the network access equipment.
16. The system according to any one of claims 9-11, characterized in that
The mobile terminal is a remote control;
The network access terminal is a set-top box.
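
The branching recited in the dependent claims (an unknown voiceprint is registered and given a new identifier; a missing user corpus is created; a missing control instruction is resolved through a general corpus and then stored in the user corpus) can be read as a lookup with fallback and caching. The Python sketch below is illustrative only and is not part of the claims. It assumes the voiceprint has already been reduced to an exact, hashable key and the voice command to text, which real systems would not do this simply; `VoiceprintServer`, `SpeechServer`, `GENERAL_CORPUS`, the `user-N` identifier format, and the instruction strings are hypothetical names.

```python
import itertools

# Hypothetical general corpus shared by all users.
GENERAL_CORPUS = {"volume up": "VOL_UP", "next channel": "CH_NEXT"}


class VoiceprintServer:
    def __init__(self):
        self.library = {}               # voiceprint key -> user identifier
        self._ids = itertools.count(1)  # source of new identifiers

    def identify(self, voiceprint):
        # Unknown voiceprint: store it and allocate a new user identifier.
        if voiceprint not in self.library:
            self.library[voiceprint] = f"user-{next(self._ids)}"
        return self.library[voiceprint]


class SpeechServer:
    def __init__(self, general):
        self.general = general
        self.user_corpora = {}          # user identifier -> {utterance: instruction}

    def resolve(self, user_id, utterance):
        # No corpus for this user yet: create one.
        corpus = self.user_corpora.get(user_id)
        if corpus is None:
            corpus = self.user_corpora[user_id] = {}
        instruction = corpus.get(utterance)
        if instruction is None:
            # Fall back to the general corpus and remember the result
            # in the user's own corpus for next time.
            instruction = self.general.get(utterance)
            if instruction is not None:
                corpus[utterance] = instruction
        return instruction


vp = VoiceprintServer()
sp = SpeechServer(GENERAL_CORPUS)
uid = vp.identify("vp-A")              # first contact: new identifier allocated
first = sp.resolve(uid, "volume up")   # resolved via the general corpus, then cached
second = sp.resolve(uid, "volume up")  # now served from the user's own corpus
print(uid, first, second)
```

In this reading the user corpus grows only with commands the user has actually issued, so repeated commands avoid the larger general corpus.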
CN201410011484.XA 2014-01-10 2014-01-10 Voice control method and system Pending CN104778946A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410011484.XA CN104778946A (en) 2014-01-10 2014-01-10 Voice control method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410011484.XA CN104778946A (en) 2014-01-10 2014-01-10 Voice control method and system

Publications (1)

Publication Number Publication Date
CN104778946A true CN104778946A (en) 2015-07-15

Family

ID=53620374

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410011484.XA Pending CN104778946A (en) 2014-01-10 2014-01-10 Voice control method and system

Country Status (1)

Country Link
CN (1) CN104778946A (en)

Cited By (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104036778A (en) * 2014-05-20 2014-09-10 安徽科大讯飞信息科技股份有限公司 Equipment control method, device and system
CN105137789A (en) * 2015-08-28 2015-12-09 青岛海尔科技有限公司 Control method and device of intelligent IoT electrical appliances, and related devices
CN105374355A (en) * 2015-12-17 2016-03-02 厦门科牧智能技术有限公司 Electronic pedestal pan voice control and interaction system and method and electronic pedestal pan
CN105791931A (en) * 2016-02-26 2016-07-20 深圳Tcl数字技术有限公司 Smart television and voice control method of the smart television
CN105810200A (en) * 2016-02-04 2016-07-27 深圳前海勇艺达机器人有限公司 Man-machine dialogue apparatus and method based on voiceprint identification
CN105825856A (en) * 2016-05-16 2016-08-03 四川长虹电器股份有限公司 Independent learning method for vehicle-mounted speech recognition module
CN106782535A (en) * 2016-12-26 2017-05-31 深圳前海勇艺达机器人有限公司 Data processing method and device based on intelligent appliance
CN107346568A (en) * 2016-05-05 2017-11-14 阿里巴巴集团控股有限公司 The authentication method and device of a kind of gate control system
CN107911386A (en) * 2017-12-06 2018-04-13 北京小米移动软件有限公司 Obtain the method and device of service authorization information
CN108369806A (en) * 2016-01-22 2018-08-03 微软技术许可有限责任公司 Configurable all-purpose language understands model
CN108428446A (en) * 2018-03-06 2018-08-21 北京百度网讯科技有限公司 Audio recognition method and device
CN109036424A (en) * 2018-08-30 2018-12-18 出门问问信息科技有限公司 Audio recognition method, device, electronic equipment and computer readable storage medium
CN109018778A (en) * 2018-08-31 2018-12-18 深圳市研本品牌设计有限公司 Rubbish put-on method and system based on speech recognition
CN109065056A (en) * 2018-09-26 2018-12-21 珠海格力电器股份有限公司 A kind of method and device of voice control air-conditioning
CN109104634A (en) * 2017-06-20 2018-12-28 中兴通讯股份有限公司 A kind of set-top box working method, set-top box and computer readable storage medium
CN109119071A (en) * 2018-09-26 2019-01-01 珠海格力电器股份有限公司 A kind of training method and device of speech recognition modeling
CN109360563A (en) * 2018-12-10 2019-02-19 珠海格力电器股份有限公司 A kind of sound control method, device, storage medium and air-conditioning
WO2019051902A1 (en) * 2017-09-18 2019-03-21 广东美的制冷设备有限公司 Terminal control method, air conditioner and computer-readable storage medium
CN109617772A (en) * 2018-12-11 2019-04-12 鹤壁国立光电科技股份有限公司 A kind of smart home system based on speech recognition
CN109920429A (en) * 2017-12-13 2019-06-21 上海擎感智能科技有限公司 It is a kind of for vehicle-mounted voice recognition data processing method and system
WO2019137066A1 (en) * 2018-01-15 2019-07-18 格力电器(武汉)有限公司 Electric appliance control method and device
CN110232457A (en) * 2019-04-15 2019-09-13 广东康云科技有限公司 A kind of government affairs service hall system
CN110400568A (en) * 2018-04-20 2019-11-01 比亚迪股份有限公司 Awakening method, intelligent voice system and the vehicle of intelligent voice system
CN110570843A (en) * 2019-06-28 2019-12-13 北京蓦然认知科技有限公司 user voice recognition method and device
CN110853637A (en) * 2019-10-17 2020-02-28 北京雷石天地电子技术有限公司 Intelligent terminal control system and method
CN110910875A (en) * 2019-11-13 2020-03-24 秒针信息技术有限公司 Member management method and system based on voice recognition
CN110931018A (en) * 2019-12-03 2020-03-27 珠海格力电器股份有限公司 Intelligent voice interaction method and device and computer readable storage medium
CN112599136A (en) * 2020-12-15 2021-04-02 江苏惠通集团有限责任公司 Voice recognition method and device based on voiceprint recognition, storage medium and terminal
CN112651526A (en) * 2020-12-21 2021-04-13 北京百度网讯科技有限公司 Method, device, equipment and storage medium for reserving target service
CN114494267A (en) * 2021-11-30 2022-05-13 北京国网富达科技发展有限责任公司 Substation and cable tunnel scene semantic construction system and method
CN115277279A (en) * 2022-08-03 2022-11-01 海南创兴高科技有限公司 Intelligent bedside cabinet control method, voice recognition method, computer and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003122395A (en) * 2001-10-19 2003-04-25 Asahi Kasei Corp Voice recognition system, terminal and program, and voice recognition method
CN102316162A (en) * 2011-09-01 2012-01-11 深圳市子栋科技有限公司 Vehicle remote control method based on voice command, apparatus and system thereof
CN202679415U (en) * 2011-09-01 2013-01-16 深圳市车音网科技有限公司 Vehicle remote control system based on voice command, communication terminal and cloud computing platform server
CN102945669A (en) * 2012-11-14 2013-02-27 四川长虹电器股份有限公司 Household appliance voice control method
CN103414560A (en) * 2013-07-05 2013-11-27 北京车音网科技有限公司 Starting method of application, device thereof, system thereof and application server
CN103456303A (en) * 2013-08-08 2013-12-18 四川长虹电器股份有限公司 Method for controlling voice and intelligent air-conditionier system

Cited By (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104036778A (en) * 2014-05-20 2014-09-10 安徽科大讯飞信息科技股份有限公司 Equipment control method, device and system
CN105137789A (en) * 2015-08-28 2015-12-09 青岛海尔科技有限公司 Control method and device of intelligent IoT electrical appliances, and related devices
CN105374355A (en) * 2015-12-17 2016-03-02 厦门科牧智能技术有限公司 Electronic pedestal pan voice control and interaction system and method and electronic pedestal pan
CN108369806B (en) * 2016-01-22 2022-07-22 微软技术许可有限责任公司 Configurable generic language understanding model
CN108369806A (en) * 2016-01-22 2018-08-03 微软技术许可有限责任公司 Configurable all-purpose language understands model
CN105810200A (en) * 2016-02-04 2016-07-27 深圳前海勇艺达机器人有限公司 Man-machine dialogue apparatus and method based on voiceprint identification
CN105791931A (en) * 2016-02-26 2016-07-20 深圳Tcl数字技术有限公司 Smart television and voice control method of the smart television
WO2017143692A1 (en) * 2016-02-26 2017-08-31 深圳Tcl数字技术有限公司 Smart television and voice control method therefor
CN107346568A (en) * 2016-05-05 2017-11-14 阿里巴巴集团控股有限公司 The authentication method and device of a kind of gate control system
CN105825856A (en) * 2016-05-16 2016-08-03 四川长虹电器股份有限公司 Independent learning method for vehicle-mounted speech recognition module
CN105825856B (en) * 2016-05-16 2019-11-08 四川长虹电器股份有限公司 The autonomous learning method of vehicle-mounted voice identification module
CN106782535A (en) * 2016-12-26 2017-05-31 深圳前海勇艺达机器人有限公司 Data processing method and device based on intelligent appliance
CN109104634A (en) * 2017-06-20 2018-12-28 中兴通讯股份有限公司 A kind of set-top box working method, set-top box and computer readable storage medium
WO2019051902A1 (en) * 2017-09-18 2019-03-21 广东美的制冷设备有限公司 Terminal control method, air conditioner and computer-readable storage medium
CN107911386A (en) * 2017-12-06 2018-04-13 北京小米移动软件有限公司 Obtain the method and device of service authorization information
CN107911386B (en) * 2017-12-06 2020-12-04 北京小米移动软件有限公司 Method and device for acquiring service authorization information
CN109920429A (en) * 2017-12-13 2019-06-21 上海擎感智能科技有限公司 It is a kind of for vehicle-mounted voice recognition data processing method and system
WO2019137066A1 (en) * 2018-01-15 2019-07-18 格力电器(武汉)有限公司 Electric appliance control method and device
US10978047B2 (en) 2018-03-06 2021-04-13 Beijing Baidu Netcom Science And Technology Co., Ltd. Method and apparatus for recognizing speech
CN108428446A (en) * 2018-03-06 2018-08-21 北京百度网讯科技有限公司 Audio recognition method and device
CN108428446B (en) * 2018-03-06 2020-12-25 北京百度网讯科技有限公司 Speech recognition method and device
CN110400568A (en) * 2018-04-20 2019-11-01 比亚迪股份有限公司 Awakening method, intelligent voice system and the vehicle of intelligent voice system
CN109036424A (en) * 2018-08-30 2018-12-18 出门问问信息科技有限公司 Audio recognition method, device, electronic equipment and computer readable storage medium
CN109018778A (en) * 2018-08-31 2018-12-18 深圳市研本品牌设计有限公司 Rubbish put-on method and system based on speech recognition
CN109119071A (en) * 2018-09-26 2019-01-01 珠海格力电器股份有限公司 A kind of training method and device of speech recognition modeling
CN109065056B (en) * 2018-09-26 2021-05-11 珠海格力电器股份有限公司 Method and device for controlling air conditioner through voice
CN109065056A (en) * 2018-09-26 2018-12-21 珠海格力电器股份有限公司 A kind of method and device of voice control air-conditioning
CN109360563A (en) * 2018-12-10 2019-02-19 珠海格力电器股份有限公司 A kind of sound control method, device, storage medium and air-conditioning
CN109617772A (en) * 2018-12-11 2019-04-12 鹤壁国立光电科技股份有限公司 A kind of smart home system based on speech recognition
CN110232457A (en) * 2019-04-15 2019-09-13 广东康云科技有限公司 A kind of government affairs service hall system
CN110570843A (en) * 2019-06-28 2019-12-13 北京蓦然认知科技有限公司 user voice recognition method and device
CN110570843B (en) * 2019-06-28 2021-03-05 北京蓦然认知科技有限公司 User voice recognition method and device
CN110853637A (en) * 2019-10-17 2020-02-28 北京雷石天地电子技术有限公司 Intelligent terminal control system and method
CN110910875A (en) * 2019-11-13 2020-03-24 秒针信息技术有限公司 Member management method and system based on voice recognition
CN110931018A (en) * 2019-12-03 2020-03-27 珠海格力电器股份有限公司 Intelligent voice interaction method and device and computer readable storage medium
CN112599136A (en) * 2020-12-15 2021-04-02 江苏惠通集团有限责任公司 Voice recognition method and device based on voiceprint recognition, storage medium and terminal
CN112651526A (en) * 2020-12-21 2021-04-13 北京百度网讯科技有限公司 Method, device, equipment and storage medium for reserving target service
CN114494267A (en) * 2021-11-30 2022-05-13 北京国网富达科技发展有限责任公司 Substation and cable tunnel scene semantic construction system and method
CN115277279A (en) * 2022-08-03 2022-11-01 海南创兴高科技有限公司 Intelligent bedside cabinet control method, voice recognition method, computer and storage medium

Similar Documents

Publication Publication Date Title
CN104778946A (en) Voice control method and system
CN105120304B (en) Information display method, apparatus and system
CN107105340A (en) People information methods, devices and systems are shown in video based on artificial intelligence
CN105336324B (en) A kind of Language Identification and device
KR101315970B1 (en) Apparatus and method for recognizing content using audio signal
CN103607609B (en) The method for switching languages and device of a kind of TV channel
CN105488135B (en) Live content classification method and device
CN106663129A (en) A sensitive multi-round dialogue management system and method based on state machine context
CN105872619A (en) Video playing record matching method and matching device
CN103092928B (en) Voice inquiry method and system
CN107613400A (en) A kind of implementation method and device of voice barrage
CN106789543A (en) The method and apparatus that facial expression image sends are realized in session
CN104602130A (en) Interactive advertisement implementation method and interactive advertisement implementation system
KR20140098525A (en) Speech recognition apparatus and method for providing response information
CN104519124A (en) Allocation method and device of virtual resources
CN105898525A (en) Method of searching videos in specific video database, and video terminal thereof
KR20130055748A (en) System and method for recommending of contents
CN107943914A (en) Voice information processing method and device
CN108933730A (en) Information-pushing method and device
CN103943111A (en) Method and device for identity recognition
US20170134806A1 (en) Selecting content based on media detected in environment
CN104965594A (en) Intelligent face identification cloud sound control method, device and system thereof
WO2017164510A3 (en) Voice data-based multimedia content tagging method, and system using same
CN106874451A (en) A kind of method of the personal exclusive corpus of automatic foundation
CN103984699B (en) The method for pushing and device of promotion message

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by sipo to initiate substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20150715