CN109032039A - Method and device for voice control
- Publication number
- CN109032039A; CN201811031798.0A
- Authority
- CN
- China
- Prior art keywords
- user
- controlled device
- angle
- target
- moment
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B19/00—Programme-control systems
- G05B19/02—Programme-control systems electric
- G05B19/04—Programme control other than numerical control, i.e. in sequence controllers or logic controllers
- G05B19/042—Programme control other than numerical control, i.e. in sequence controllers or logic controllers using digital processors
- G05B19/0423—Input/output
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B2219/00—Program-control systems
- G05B2219/20—Pc systems
- G05B2219/25—Pc structure of the system
- G05B2219/25257—Microcontroller
Abstract
Embodiments of the invention disclose a method and device for voice control, which avoid the misoperation caused when multiple devices respond simultaneously to a user's voice instruction. The method comprises: obtaining a user posture image, the user posture image being captured at a first moment by at least one acquisition device located in a preset space; determining, according to the user posture image, the target controlled device that the user intends to control from among at least one controlled device in the preset space; and controlling the target controlled device to respond to the voice instruction that the user input at the first moment.
Description
Technical field
The present invention relates to the field of terminal applications, and in particular to a method and device for voice control.
Background
Traditionally, multiple devices are each controlled with their own remote control. These remote controls are usually not interchangeable, and operating them is cumbersome. Voice control emerged to allow devices to be controlled in a simpler and more natural way.
At present, to support voice control, a controlled device is equipped with a camera or a voice device so that it can perform visual recognition or speech recognition. In a real application environment, however, the same space may contain several devices that support voice control, all of which have cameras and voice software. This easily leads to misoperation during voice control.
Summary of the invention
In view of this, embodiments of the present invention provide a method and device for voice control, whose main purpose is to avoid the misoperation caused when multiple devices respond simultaneously to a user's voice instruction.
According to a first aspect of the embodiments of the present invention, a method of voice control is provided, comprising: obtaining a user posture image, the user posture image being captured at a first moment by at least one acquisition device located in a preset space; determining, according to the user posture image, the target controlled device that the user intends to control from among at least one controlled device in the preset space; and controlling the target controlled device to respond to the voice instruction that the user input at the first moment.
In an embodiment of the present invention, determining, according to the user posture image, the target controlled device that the user intends to control from among the at least one controlled device in the preset space comprises: determining, according to the user posture image, the body angle of the user, the face angle of the user and/or the gaze angle of the user; and determining, according to the body angle of the user, the face angle of the user and/or the gaze angle of the user, the controlled device that the user is facing among the at least one controlled device as the target controlled device.
In an embodiment of the present invention, obtaining the user posture image comprises: receiving at least one image from the at least one acquisition device; determining, from the at least one image, the images whose timestamp is the first moment; performing target detection on the images whose timestamp is the first moment according to a preset target user model, and determining the images that contain a target user, the target user being a user of the at least one controlled device; and taking the determined images that contain the target user as the user posture image.
In an embodiment of the present invention, controlling the target controlled device to respond to the voice instruction that the user input at the first moment comprises: sending a control instruction to the target controlled device, the control instruction instructing the target controlled device to respond to the voice instruction that the user input at the first moment.
In an embodiment of the present invention, obtaining the user posture image comprises: obtaining the voice instruction that the user input at the first moment; identifying the user who input the voice instruction according to a preset user voiceprint model; and capturing the user posture image when the user is identified as a legitimate user.
In an embodiment of the present invention, controlling the target controlled device to respond to the voice instruction that the user input at the first moment comprises: performing speech recognition on the voice instruction; and responding to the voice instruction by executing the corresponding target operation.
According to a second aspect of the embodiments of the present invention, a device for voice control is provided, comprising: an obtaining unit, configured to obtain a user posture image, the user posture image being captured at a first moment by at least one acquisition device located in a preset space; a determination unit, configured to determine, according to the user posture image, the target controlled device that the user intends to control from among at least one controlled device in the preset space; and a control unit, configured to control the target controlled device to respond to the voice instruction that the user input at the first moment.
In an embodiment of the present invention, the determination unit is specifically configured to determine, according to the user posture image, the body angle of the user, the face angle of the user and/or the gaze angle of the user; and to determine, according to the body angle of the user, the face angle of the user and/or the gaze angle of the user, the controlled device that the user is facing among the at least one controlled device as the target controlled device.
According to a third aspect of the embodiments of the present invention, an electronic device is provided, comprising: at least one processor; and at least one memory and a bus connected to the processor; wherein the processor and the memory communicate with each other through the bus; and the processor is configured to call program instructions in the memory to execute the method of voice control described in one or more of the above technical solutions.
According to a fourth aspect of the embodiments of the present invention, a computer-readable storage medium is provided. The computer-readable storage medium stores computer instructions, and the computer instructions cause a computer to execute the method of voice control described in one or more of the above technical solutions.
With the above technical solution, the method and device for voice control provided by the embodiments of the present invention first obtain a user posture image captured at a first moment by at least one acquisition device located in a preset space; then determine, according to the user posture image, the target controlled device that the user intends to control from among at least one controlled device in the preset space; and finally control the target controlled device to respond to the voice instruction that the user input at the first moment. In other words, in the embodiments of the present invention, the target controlled device that the user intends to control is determined among multiple controlled devices according to the user posture image, and only that device is controlled to respond to the user's voice instruction. This avoids the misoperation caused when multiple devices respond simultaneously to the user's voice instruction.
The above is only an overview of the technical solution of the present invention. So that the technical means of the present invention can be understood more clearly and implemented in accordance with the specification, and so that the above and other objects, features and advantages of the present invention are easier to understand, specific embodiments of the present invention are set out below.
Brief description of the drawings
Various other advantages and benefits will become clear to those of ordinary skill in the art from the following detailed description of the preferred embodiments. The drawings serve only to illustrate the preferred embodiments and are not to be regarded as limiting the present invention. Throughout the drawings, the same reference numerals denote the same parts. In the drawings:
Fig. 1 is a schematic flowchart of the voice control method in an embodiment of the present invention;
Fig. 2 is a schematic diagram of the preset space in an embodiment of the present invention;
Fig. 3 is a schematic flowchart of determining the target controlled device in an embodiment of the present invention;
Fig. 4 is a structural schematic diagram of the voice control device in an embodiment of the present invention;
Fig. 5 is a structural schematic diagram of the electronic device in an embodiment of the present invention.
Detailed description of the embodiments
Exemplary embodiments of the present disclosure are described in more detail below with reference to the accompanying drawings. Although the drawings show exemplary embodiments of the present disclosure, it should be understood that the present disclosure may be implemented in various forms and should not be limited by the embodiments set forth here. Rather, these embodiments are provided so that the present disclosure can be understood more thoroughly and so that its scope can be fully conveyed to those skilled in the art.
In the embodiments of the present invention, a single preset space, such as a living room, a bedroom, an office or a vehicle interior, may contain at least one electronic device at the same time, such as a smartphone, a smartwatch, a tablet computer, a laptop, a smart air conditioner, a network camera or a smart speaker. The user can control these electronic devices by inputting voice instructions. However, because multiple electronic devices are present in the same space, the user may intend to control device A while device B responds to the voice instruction, which results in misoperation.
To solve this problem, in the embodiments of the present invention the electronic devices in the same preset space are divided, according to whether they have an image acquisition function, into acquisition devices and controlled devices. An acquisition device captures images of the preset space; a controlled device responds to the user's voice instruction and has a voice acquisition function. In practice, the same electronic device may be both an acquisition device and a controlled device, for example a smart TV, a network camera, a smartphone or a smartwatch. Alternatively, an electronic device may be only a controlled device, for example a smart speaker or a smart air conditioner. The embodiments of the present invention do not specifically limit this.
In practical applications, the electronic devices in the same preset space may communicate with each other directly or indirectly. For example, the electronic devices may log in to a background server with the same user account and then communicate through the background server. Alternatively, to improve communication efficiency, the electronic devices may communicate through a smart gateway arranged in the preset space. Furthermore, to improve communication efficiency further, the electronic devices may communicate through wireless communication technologies such as Zigbee or Wi-Fi (Wireless Fidelity). The electronic devices may of course also use other communication modes, which the embodiments of the present invention do not specifically limit.
Further, an embodiment of the present invention provides a method of voice control. The method can be applied to a voice control device, and the voice control device can be deployed in, for example, the background server, the smart gateway, an acquisition device or a controlled device described above.
Fig. 1 is a schematic flowchart of the voice control method in an embodiment of the present invention. Referring to Fig. 1, the method includes:
S101: obtain a user posture image;
Here, the user posture image is captured at a first moment by at least one acquisition device located in the preset space;
Here, when the user performs voice control in the preset space at the first moment, the user speaks a voice instruction. The electronic devices in the preset space that have a voice acquisition function, namely the controlled devices described above, then receive the voice instruction. If the controlled device and the acquisition device are the same device, the controlled device controls itself to capture an image of the preset space at the first moment. If the controlled device and the acquisition device are different devices, the controlled device sends an acquisition instruction to the acquisition device, and the acquisition device, after receiving the acquisition instruction, responds by capturing an image of the preset space. Because the user is in the preset space at the first moment, the images captured by the acquisition device may include an image of the user, namely the user posture image. Finally, the acquisition device sends the captured user posture image to the voice control device, and the voice control device thereby obtains the user posture image.
S102: determine, according to the user posture image, the target controlled device that the user intends to control from among at least one controlled device in the preset space;
Here, after obtaining the user posture image, the voice control device recognizes the user posture image to obtain the user's posture information, such as the body angle of the user, the face angle of the user and/or the gaze angle of the user. From this posture information it can determine the direction the user was facing at the first moment, and hence the controlled device the user was facing. That controlled device is the target controlled device that the user intends to control.
S103: control the target controlled device to respond to the voice instruction that the user input at the first moment.
Here, after determining the target controlled device, the voice control device can control the target controlled device to respond to the voice instruction that the user input at the first moment. If the voice control device is deployed in the background server or the smart gateway, it sends a control instruction to the target controlled device, and the target controlled device executes the control instruction and responds to the user's voice instruction that it received at the first moment. If the voice control device is deployed in a device other than the target controlled device, it likewise sends a control instruction to the target controlled device, and the target controlled device executes the control instruction and responds to the user's voice instruction that it received at the first moment. If the voice control device is deployed in the target controlled device, the voice control device controls itself to respond to the user's voice instruction received at the first moment.
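The three deployment cases for S103 described above can be sketched as follows. This is a minimal illustration only: the names `Host`, `Action` and `dispatch` are hypothetical and do not appear in the original text, and the sketch only decides which kind of action is taken, without modelling any real transport.

```python
from dataclasses import dataclass
from enum import Enum, auto


class Host(Enum):
    """Where the voice control device runs (hypothetical labels)."""
    SERVER_OR_GATEWAY = auto()
    OTHER_DEVICE = auto()
    TARGET_DEVICE = auto()


@dataclass
class Action:
    kind: str       # "send_control_instruction" or "respond_locally"
    target_id: str  # identifier of the target controlled device


def dispatch(host: Host, target_id: str) -> Action:
    """Decide how S103 is carried out for a given deployment."""
    if host is Host.TARGET_DEVICE:
        # The voice control device is the target itself: respond directly.
        return Action("respond_locally", target_id)
    # Background server, smart gateway, or a different device:
    # send a control instruction to the target controlled device.
    return Action("send_control_instruction", target_id)
```

In both remote cases the outcome is the same control instruction; only the case where the voice control device is itself the target skips the message.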
The above method of voice control is illustrated below with a specific example.
For example, Fig. 2 is a schematic diagram of the preset space in an embodiment of the present invention. Referring to Fig. 2, a network camera 201, a smart TV 202 and a smart speaker 203 are installed in the preset space 200. The network camera 201 and the smart TV 202 are each provided with a camera, and the network camera 201, the smart TV 202 and the smart speaker 203 are each provided with a microphone.
First, the user 204 stands in the preset space 200 facing the smart speaker 203 and inputs a voice instruction, such as "What is the weather like today?". The smart TV 202 and the smart speaker 203 both receive the voice instruction input by the user. The smart TV 202 and/or the smart speaker 203 then send an acquisition instruction to all acquisition devices, and the network camera 201 and the smart TV 202 each capture images with their own cameras. The network camera 201 sends the user posture image it captured to the smart TV 202. The smart TV 202 recognizes the user's posture from the user posture images captured by itself and by the network camera 201, obtains the user's posture information, and determines from the posture information that the user is facing the smart speaker 203. It therefore determines that the smart speaker 203 is the target controlled device that the user intends to control. Next, the smart TV 202 controls the smart speaker 203 to respond to the voice instruction "What is the weather like today?" that the user input at the first moment. Through speech recognition, the smart speaker 203 obtains the reply to "What is the weather like today?", namely "Today, Beijing, sunny, 35 °C to 25 °C", converts the reply into a voice signal and outputs it.
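The original text does not say how the smart TV maps the recognized posture to a particular device. One possible reading, sketched here purely as an assumption, is a geometric comparison: given a 2-D floor plan with known device positions and an estimated facing direction for the user, pick the device whose bearing deviates least from that direction. The function names, the coordinate convention and the example layout are all hypothetical.

```python
import math


def angular_deviation(user_pos, facing_deg, device_pos):
    """Absolute angle in degrees between the user's facing direction
    and the bearing from the user to the device (2-D floor plan)."""
    dx = device_pos[0] - user_pos[0]
    dy = device_pos[1] - user_pos[1]
    bearing = math.degrees(math.atan2(dy, dx))
    # Wrap the difference into the range [-180, 180) before taking abs().
    diff = (bearing - facing_deg + 180.0) % 360.0 - 180.0
    return abs(diff)


def pick_target(user_pos, facing_deg, devices):
    """Return the name of the device the user is most nearly facing."""
    return min(
        devices,
        key=lambda name: angular_deviation(user_pos, facing_deg, devices[name]),
    )
```

With a hypothetical layout such as `{"camera_201": (0, 3), "tv_202": (3, 0), "speaker_203": (-3, 0)}`, a user at the origin facing 180 degrees would be matched to `speaker_203`.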
At this point, the user's voice control process is complete.
Based on the foregoing embodiment, Fig. 3 is a schematic flowchart of determining the target controlled device in an embodiment of the present invention. Referring to Fig. 3, S102 above may specifically include:
S301: determine, according to the user posture image, the body angle of the user, the face angle of the user and/or the gaze angle of the user;
S302: determine, according to the body angle of the user, the face angle of the user and/or the gaze angle of the user, the controlled device that the user is facing among the at least one controlled device as the target controlled device.
In a specific implementation, the voice control device may use an image analysis algorithm to analyze the user posture image, determine whether the user is facing a given acquisition device, and from that determine the target controlled device that the user is actually facing.
Specifically, the angle of the user's body, the angle of the face and/or the gaze angle are determined from the user posture image. For example, a face in an image can be oriented at several angles: a full frontal orientation is called a ten-tenths face, in which the subject is symmetrical on both sides in the picture; for a nine-tenths face, the face orientation forms an 18-degree side angle with the taking lens; an eight-tenths face corresponds to a 36-degree side angle; a seven-tenths face to a 54-degree side angle; a six-tenths face to a 72-degree side angle; and a five-tenths face to a 90-degree side angle, at which point the subject shows only one eye in the picture, that is, a full profile. An image analysis algorithm can be used to analyze the symmetry of the left and right sides of the face image and thereby judge whether the face is substantially facing this device. If the angle of the user's body, the angle of the face and/or the gaze angle substantially face this device, the user is considered to have an intent to control this device.
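The tenths scale in the paragraph above follows a simple rule: each tenth below full frontal adds 18 degrees of side angle. A minimal sketch of that arithmetic is given below; the function names are illustrative only, and snapping a measured angle to the nearest step is an assumption added for the example rather than something the text specifies.

```python
def side_angle_deg(tenths: int) -> int:
    """Side angle between face orientation and lens for a face
    described as `tenths` out of ten (10 = full frontal, 5 = profile)."""
    if not 5 <= tenths <= 10:
        raise ValueError("tenths must be between 5 and 10")
    return (10 - tenths) * 18


def nearest_tenths(angle_deg: float) -> int:
    """Snap a measured side angle (degrees) to the nearest step of the scale."""
    clamped = min(max(abs(angle_deg), 0.0), 90.0)
    return 10 - round(clamped / 18)
```

For instance, `side_angle_deg(8)` gives 36, matching the eight-tenths case listed above.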
In other embodiments of the present invention, the angles that the user's body, face and/or gaze form with the taking lens can be recognized. The angle formed between the direction of the plane of the user's body, the direction of the plane of the face and/or the direction of the plane perpendicular to the gaze, on the one hand, and the direction of the plane of the taking lens, on the other, is taken as the user's orientation angle. When the user's orientation angle is less than or equal to a preset angle threshold, the user is considered to be substantially facing this device. The angle threshold can be set to 5° to 10°.
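The threshold test above can be illustrated with a small sketch. It assumes, as a simplification not stated in the original, that both the user's orientation and the direction toward the lens are available as direction vectors, so that the orientation angle is the angle between the two vectors. The function names and the default threshold of 10 degrees (the upper end of the range given in the text) are choices made for the example.

```python
import math


def orientation_angle_deg(facing, to_lens):
    """Angle in degrees between the user's facing direction and the
    direction from the user to the lens; both are 2-D or 3-D vectors."""
    dot = sum(a * b for a, b in zip(facing, to_lens))
    norm = math.hypot(*facing) * math.hypot(*to_lens)
    if norm == 0:
        raise ValueError("direction vectors must be non-zero")
    # Clamp to guard against rounding slightly outside [-1, 1].
    cos_angle = max(-1.0, min(1.0, dot / norm))
    return math.degrees(math.acos(cos_angle))


def is_facing(facing, to_lens, threshold_deg=10.0):
    """True when the orientation angle is within the preset threshold."""
    return orientation_angle_deg(facing, to_lens) <= threshold_deg
```

A stricter deployment could pass `threshold_deg=5.0`, the lower end of the range mentioned in the text.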
If the difference between any two of the body orientation angle, the face orientation angle and the gaze orientation angle is greater than or equal to a preset angle difference threshold, the user is judged to have an intent to control the controlled device when the gaze orientation angle is less than or equal to the preset angle threshold. If the angles that the body, the face and the gaze each form with the taking lens differ, the angle formed between the gaze and the taking lens is used preferentially as the user's orientation angle. This is because the gaze angle best reflects a person's conscious intent; when the angle data are inconsistent, giving priority to the gaze angle identifies the user's control intent most accurately.
Further, when the user's gaze angle cannot be determined from the captured user posture image, the user is judged to have an intent to control the controlled device if the face orientation angle is less than or equal to the preset angle threshold. If the angle between the gaze and the taking lens cannot be recognized, for example because of lighting, the angle formed between the face and the taking lens is used as the user's orientation angle. If the angle between the face and the taking lens cannot be recognized either, for example because of lighting, the angle formed between the body and the taking lens is used as the user's orientation angle. In other words, the three kinds of angle data are prioritized from high to low as gaze angle, face angle and body angle, and data with higher priority reflect the user's control intent more accurately.
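The priority order described above (gaze first, then face, then body) amounts to a simple fallback chain. The sketch below illustrates it under the assumption that each angle is either a number of degrees or `None` when it could not be recognized; the function names and the default threshold are hypothetical.

```python
from typing import Optional


def select_orientation_angle(
    gaze: Optional[float], face: Optional[float], body: Optional[float]
) -> Optional[float]:
    """Return the highest-priority angle that could be recognized:
    gaze first, then face, then body; None if nothing is available."""
    for angle in (gaze, face, body):
        if angle is not None:
            return angle
    return None


def has_control_intent(
    gaze: Optional[float] = None,
    face: Optional[float] = None,
    body: Optional[float] = None,
    threshold_deg: float = 10.0,
) -> bool:
    """Judge control intent from the selected orientation angle."""
    angle = select_orientation_angle(gaze, face, body)
    return angle is not None and angle <= threshold_deg
```

Because the gaze angle always wins when it is present, a user whose body is turned away but whose eyes are on the device is still treated as intending to control it.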
The voice control device may of course also use other methods to determine the target controlled device from the user posture image; the embodiments of the present invention do not specifically limit this.
In an embodiment of the present invention, to reduce the amount of analysis and computation that the voice control device performs on user posture images, S101 may include: receiving at least one image from at least one acquisition device; determining, from the at least one image, the images whose timestamp is the first moment; performing target detection on the images whose timestamp is the first moment according to a preset target user model, and determining the images that contain a target user, the target user being a user of the at least one controlled device; and taking the determined images that contain the target user as the user posture image.
Specifically, the voice control device receives at least one image from at least one acquisition device and first selects from them the images whose timestamp is the first moment. It then performs target detection on those images according to the preset target user model and determines which of them contain the target user. Here, the target user may be a preset user who has permission to use the at least one controlled device. Finally, the determined images that contain the target user are taken as the user posture image.
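The two-stage filtering described above (first by timestamp, then by target detection) can be sketched as follows. The `Frame` type, the `tolerance` parameter and the `contains_target_user` callback are hypothetical stand-ins: the original text does not specify an image format, how exactly timestamps are matched, or what the target user model is.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Frame:
    device_id: str    # which acquisition device captured the image
    timestamp: float  # capture time, in seconds
    pixels: bytes     # raw image data (placeholder)


def select_posture_images(
    frames: List[Frame],
    first_moment: float,
    contains_target_user: Callable[[Frame], bool],
    tolerance: float = 0.0,
) -> List[Frame]:
    """Keep frames stamped at the first moment, then keep only those
    in which the (caller-supplied) detector finds the target user."""
    at_moment = [f for f in frames if abs(f.timestamp - first_moment) <= tolerance]
    # The detector only runs on the frames that passed the cheap timestamp
    # filter, which is how the analysis workload is reduced.
    return [f for f in at_moment if contains_target_user(f)]
```

Running the inexpensive timestamp comparison before the detector reflects the stated goal of reducing the analysis workload.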
Alternatively, in other embodiments of the present invention, if the controlled device and the acquisition device are the same device, S101 may include: obtaining the voice instruction that the user input at the first moment; identifying the user who input the voice instruction according to a preset user voiceprint model; and capturing the user posture image when the user is identified as a legitimate user.
Here, the acquisition device may verify the user's identity before capturing the user posture image. That is, after receiving the voice instruction that the user input at the first moment, the acquisition device performs voiceprint recognition on the input voice instruction according to the preset user voiceprint model and determines the identity of the user who input it. When the identified user is a legitimate user, the acquisition device controls itself to capture the user posture image. When the identified user is not a legitimate user, the acquisition device makes no response. In this way the controlled device responds only to the voice instructions of specific users, which avoids misoperation by other users.
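The voiceprint gate described above can be sketched as a small guard function. The speaker identification itself is passed in as a callback, since the original text does not say how the voiceprint model works; `identify_speaker`, `legitimate_users` and `capture_posture_image` are hypothetical names introduced for this example.

```python
from typing import Callable, Optional, Set, TypeVar

Image = TypeVar("Image")


def capture_if_legitimate(
    audio: bytes,
    identify_speaker: Callable[[bytes], Optional[str]],
    legitimate_users: Set[str],
    capture_posture_image: Callable[[], Image],
) -> Optional[Image]:
    """Capture a posture image only when the speaker is a legitimate user."""
    speaker = identify_speaker(audio)
    if speaker is None or speaker not in legitimate_users:
        # Unrecognized or unauthorized speaker: make no response at all.
        return None
    return capture_posture_image()
```

Returning `None` without touching the camera mirrors the behaviour in the text, where the device makes no response for a user who is not legitimate.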
Further, in the above case, that is, when the controlled device and the acquisition device are the same device, S103 may include: performing speech recognition on the voice instruction, and responding to the voice instruction by executing the corresponding target operation.
Here, the controlled device performs speech recognition on the user's voice instruction that it received itself, and then responds to the recognized voice instruction by executing the target operation corresponding to that instruction.
With the above technical solution, the method and device for voice control provided by the embodiments of the present invention first obtain a user posture image captured at a first moment by at least one acquisition device located in a preset space; then determine, according to the user posture image, the target controlled device that the user intends to control from among at least one controlled device in the preset space; and finally control the target controlled device to respond to the voice instruction that the user input at the first moment. In other words, the target controlled device is determined among multiple controlled devices according to the user posture image, and only that device is controlled to respond, which avoids the misoperation caused when multiple devices respond simultaneously to the user's voice instruction.
Based on the same inventive concept, an embodiment of the present invention provides a device for voice control. This voice control device is consistent with the voice control device described in one or more of the above embodiments.
Fig. 4 is a structural schematic diagram of the voice control device in an embodiment of the present invention. Referring to Fig. 4, the voice control device 400 includes: an obtaining unit 401, configured to obtain a user posture image, the user posture image being captured at a first moment by at least one acquisition device located in a preset space; a determination unit 402, configured to determine, according to the user posture image, the target controlled device that the user intends to control from among at least one controlled device in the preset space; and a control unit 403, configured to control the target controlled device to respond to the voice instruction that the user input at the first moment.
In an embodiment of the present invention, the determination unit is specifically configured to determine, according to the user posture image, the body angle of the user, the face angle of the user and/or the gaze angle of the user; and to determine, according to the body angle of the user, the face angle of the user and/or the gaze angle of the user, the controlled device that the user is facing among the at least one controlled device as the target controlled device.
In an embodiment of the present invention, the obtaining unit is specifically configured to receive at least one image from at least one acquisition device; determine, from the at least one image, the images whose timestamp is the first moment; perform target detection on the images whose timestamp is the first moment according to a preset target user model, and determine the images that contain a target user, the target user being a user of the at least one controlled device; and take the determined images that contain the target user as the user posture image.
In an embodiment of the present invention, the control unit is specifically configured to send a control instruction to the target controlled device, the control instruction instructing the target controlled device to respond to the voice instruction that the user input at the first moment.
In an embodiment of the present invention, when the voice control device is arranged on the target controlled device, the obtaining unit is further specifically configured to obtain the voice instruction that the user input at the first moment; identify the user who input the voice instruction according to a preset user voiceprint model; and capture the user posture image when the user is identified as a legitimate user.
Further, the control unit is further specifically configured to perform speech recognition on the voice instruction, and to respond to the voice instruction by executing the corresponding target operation.
It should be noted that the description of the above device embodiment is similar to the description of the above method embodiments and has beneficial effects similar to those of the method embodiments. For technical details not disclosed in the device embodiment of the present invention, refer to the description of the method embodiments of the present invention.
Based on the same inventive concept, an embodiment of the present invention provides an electronic device. This electronic device is consistent with the electronic device described in one or more of the above embodiments.
Fig. 5 is a structural schematic diagram of the electronic device in an embodiment of the present invention. Referring to Fig. 5, the electronic device 500 includes: at least one processor 501; and at least one memory 502 and a bus 503 connected to the processor 501; wherein the processor 501 and the memory 502 communicate with each other through the bus 503; and the processor 501 is configured to call program instructions in the memory 502 to execute the steps of the method of voice control described in one or more of the above embodiments.
It should be noted that the description of the above electronic device embodiment is similar to the description of the above device embodiment and has beneficial effects similar to those of the device embodiment. For technical details not disclosed in the electronic device embodiment of the present invention, refer to the description of the device embodiment of the present invention.
Based on the same inventive concept, an embodiment of the present invention provides a computer-readable storage medium. The computer-readable storage medium stores computer instructions, and the computer instructions cause a computer to execute the steps of the voice control method described in one or more of the embodiments above.
As can be seen from the above, in the embodiments of the present invention, the target controlled device that the user intends to control is determined from among multiple controlled devices according to the user posture image, and the target controlled device is then controlled to respond to the user's voice instruction. This avoids the misoperation caused when multiple devices respond to the user's voice instruction at the same time.
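One way to picture the selection of the target controlled device is as a comparison between the direction the user faces and the bearing from the user to each device. The Python sketch below illustrates that idea only; the 2D coordinates, the single `facing_deg` angle (which could be derived from the body, face, and/or sight angle), the `tolerance_deg` threshold, and the function names are all assumptions made for illustration and are not taken from the embodiments.

```python
import math
from typing import Dict, Optional, Tuple

Point = Tuple[float, float]


def angular_difference(a: float, b: float) -> float:
    """Smallest absolute difference between two angles, in degrees (0 to 180)."""
    d = (a - b) % 360.0
    return min(d, 360.0 - d)


def pick_target_device(
    user_pos: Point,
    facing_deg: float,             # assumed: one angle summarizing the user's orientation
    devices: Dict[str, Point],     # assumed: known device positions in the preset space
    tolerance_deg: float = 30.0,   # assumed: maximum deviation still counted as "facing"
) -> Optional[str]:
    """Return the device the user faces most directly, or None if none is within tolerance."""
    best_name: Optional[str] = None
    best_diff = tolerance_deg
    for name, (dx, dy) in devices.items():
        # Bearing from the user to the device, measured from the positive x axis.
        bearing = math.degrees(math.atan2(dy - user_pos[1], dx - user_pos[0]))
        diff = angular_difference(facing_deg, bearing)
        if diff <= best_diff:
            best_name, best_diff = name, diff
    return best_name
```

Because at most one device name is returned, only that device would be instructed to respond, which matches the stated goal of preventing several devices from reacting to the same voice instruction.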
Those skilled in the art should understand that embodiments of the present invention may be provided as a method, a system, or a computer program product. Therefore, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Moreover, the present invention may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage, CD-ROM, optical storage, and the like) that contain computer-usable program code.
The present invention is described with reference to flowcharts and/or block diagrams of the method, device (system), and computer program product according to embodiments of the present invention. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor, or another programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device produce means for implementing the functions specified in one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing device to work in a specific manner, such that the instructions stored in the computer-readable memory produce an article of manufacture that includes instruction means, the instruction means implementing the functions specified in one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions may also be loaded onto a computer or other programmable data processing device, such that a series of operation steps is executed on the computer or other programmable device to produce computer-implemented processing. The instructions executed on the computer or other programmable device thus provide steps for implementing the functions specified in one or more flows of the flowchart and/or one or more blocks of the block diagram.
Although preferred embodiments of the present invention have been described, those skilled in the art can make additional changes and modifications to these embodiments once they learn the basic inventive concept. Therefore, the appended claims are intended to be interpreted as including the preferred embodiments and all changes and modifications that fall within the scope of the present invention.
Obviously, those skilled in the art can make various changes and modifications to the present invention without departing from its spirit and scope. Thus, if these modifications and variations fall within the scope of the claims of the present invention and their equivalent technologies, the present invention is also intended to include them.
Claims (10)
1. A voice control method, characterized by comprising:
obtaining a user posture image, the user posture image being acquired at a first moment by at least one acquisition device located in a preset space;
determining, according to the user posture image, a target controlled device that the user intends to control from among at least one controlled device in the preset space; and
controlling the target controlled device to respond to the voice instruction that the user inputs at the first moment.
2. The method according to claim 1, characterized in that the determining, according to the user posture image, a target controlled device that the user intends to control from among at least one controlled device in the preset space comprises:
determining, according to the user posture image, the body angle of the user, the face angle of the user, and/or the sight angle of the user; and
determining, according to the body angle of the user, the face angle of the user, and/or the sight angle of the user, the controlled device that the user faces among the at least one controlled device as the target controlled device.
3. The method according to claim 1, characterized in that the obtaining a user posture image comprises:
receiving at least one image from the at least one acquisition device;
determining, from the at least one image, the image whose timestamp is the first moment;
performing, according to a preset target user model, target detection on the image whose timestamp is the first moment, and determining an image that contains a target user, the target user being a user of the at least one controlled device; and
determining the image that contains the target user as the user posture image.
4. The method according to claim 1, characterized in that the controlling the target controlled device to respond to the voice instruction that the user inputs at the first moment comprises:
sending a control instruction to the target controlled device, the control instruction being used to instruct the target controlled device to respond to the voice instruction that the user inputs at the first moment.
5. The method according to claim 1, characterized in that the obtaining a user posture image comprises:
obtaining the voice instruction that the user inputs at the first moment;
identifying, according to a preset user voiceprint model, the user who input the voice instruction; and
acquiring the user posture image when the user is identified as a legitimate user.
6. The method according to claim 5, characterized in that the controlling the target controlled device to respond to the voice instruction that the user inputs at the first moment comprises:
performing speech recognition on the voice instruction; and
executing, in response to the voice instruction, the corresponding target operation.
7. A voice control apparatus, characterized by comprising:
an obtaining unit, configured to obtain a user posture image, the user posture image being acquired at a first moment by at least one acquisition device located in a preset space;
a determination unit, configured to determine, according to the user posture image, a target controlled device that the user intends to control from among at least one controlled device in the preset space; and
a control unit, configured to control the target controlled device to respond to the voice instruction that the user inputs at the first moment.
8. The apparatus according to claim 7, characterized in that the determination unit is specifically configured to: determine, according to the user posture image, the body angle of the user, the face angle of the user, and/or the sight angle of the user; and determine, according to the body angle of the user, the face angle of the user, and/or the sight angle of the user, the controlled device that the user faces among the at least one controlled device as the target controlled device.
9. An electronic device, characterized by comprising:
at least one processor; and
at least one memory and a bus connected to the processor; wherein
the processor and the memory communicate with each other through the bus; and
the processor is configured to call the program instructions in the memory to execute the voice control method according to any one of claims 1 to 6.
10. A computer-readable storage medium, characterized in that the computer-readable storage medium stores computer instructions, and the computer instructions cause a computer to execute the voice control method according to any one of claims 1 to 6.
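For illustration only, the image selection steps recited in claim 3 (filter received images by timestamp, then keep those in which a target user is detected) can be sketched in Python as follows. The `Frame` type, the `select_posture_images` function, the `contains_target_user` callable standing in for the preset target user model, and the `max_skew` tolerance on timestamps are all hypothetical; the claim itself only requires that the timestamp be the first moment.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Frame:
    camera_id: str    # which acquisition device produced the image (hypothetical field)
    timestamp: float  # capture time in seconds
    pixels: bytes     # image payload (hypothetical representation)


def select_posture_images(
    frames: List[Frame],
    first_moment: float,
    contains_target_user: Callable[[Frame], bool],  # stand-in for the preset target user model
    max_skew: float = 0.05,                         # assumed tolerance when matching timestamps
) -> List[Frame]:
    """Keep frames captured at the first moment in which a target user is detected."""
    # Keep only the frames whose timestamp matches the first moment.
    at_moment = [f for f in frames if abs(f.timestamp - first_moment) <= max_skew]
    # Run target detection on those frames and keep the ones showing a target user.
    return [f for f in at_moment if contains_target_user(f)]
```

Filtering by timestamp before running detection keeps the more expensive detection step limited to the few frames captured when the voice instruction was spoken.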
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811031798.0A CN109032039B (en) | 2018-09-05 | 2018-09-05 | Voice control method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811031798.0A CN109032039B (en) | 2018-09-05 | 2018-09-05 | Voice control method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109032039A (en) | 2018-12-18 |
CN109032039B (en) | 2021-05-11 |
Family
ID=64623565
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811031798.0A Active CN109032039B (en) | 2018-09-05 | 2018-09-05 | Voice control method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109032039B (en) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110047494A (en) * | 2019-04-15 | 2019-07-23 | 北京小米智能科技有限公司 | Equipment response method, equipment and storage medium |
CN110136714A (en) * | 2019-05-14 | 2019-08-16 | 北京探境科技有限公司 | Natural interaction sound control method and device |
CN110556115A (en) * | 2019-09-10 | 2019-12-10 | 深圳创维-Rgb电子有限公司 | IOT equipment control method based on multiple control terminals, control terminal and storage medium |
CN110730115A (en) * | 2019-09-11 | 2020-01-24 | 北京小米移动软件有限公司 | Voice control method and device, terminal and storage medium |
CN112207812A (en) * | 2019-07-12 | 2021-01-12 | 阿里巴巴集团控股有限公司 | Device control method, device, system and storage medium |
CN112417712A (en) * | 2021-01-21 | 2021-02-26 | 深圳市友杰智新科技有限公司 | Target device determination method and device, computer device and storage medium |
CN112908321A (en) * | 2020-12-02 | 2021-06-04 | 青岛海尔科技有限公司 | Device control method, device, storage medium, and electronic apparatus |
WO2022188552A1 (en) * | 2021-03-10 | 2022-09-15 | Oppo广东移动通信有限公司 | Device control method and related apparatus |
CN115086096A (en) * | 2021-03-15 | 2022-09-20 | Oppo广东移动通信有限公司 | Method, apparatus, device and storage medium for responding control voice |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103730120A (en) * | 2013-12-27 | 2014-04-16 | 深圳市亚略特生物识别科技有限公司 | Voice control method and system for electronic device |
US20160162039A1 (en) * | 2013-07-21 | 2016-06-09 | Pointgrab Ltd. | Method and system for touchless activation of a device |
CN105785782A (en) * | 2016-03-29 | 2016-07-20 | 北京小米移动软件有限公司 | Intelligent household equipment control method and device |
CN107728482A (en) * | 2016-08-11 | 2018-02-23 | 阿里巴巴集团控股有限公司 | Control system, control process method and device |
CN108089695A (en) * | 2016-11-23 | 2018-05-29 | 纳恩博(北京)科技有限公司 | A kind of method and apparatus for controlling movable equipment |
CN108398906A (en) * | 2018-03-27 | 2018-08-14 | 百度在线网络技术(北京)有限公司 | Apparatus control method, device, electric appliance, total control equipment and storage medium |
CN108490832A (en) * | 2018-03-27 | 2018-09-04 | 百度在线网络技术(北京)有限公司 | Method and apparatus for sending information |
- 2018-09-05: CN application CN201811031798.0A, patent CN109032039B (en), status: Active
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20160162039A1 (en) * | 2013-07-21 | 2016-06-09 | Pointgrab Ltd. | Method and system for touchless activation of a device |
CN103730120A (en) * | 2013-12-27 | 2014-04-16 | 深圳市亚略特生物识别科技有限公司 | Voice control method and system for electronic device |
CN105785782A (en) * | 2016-03-29 | 2016-07-20 | 北京小米移动软件有限公司 | Intelligent household equipment control method and device |
CN107728482A (en) * | 2016-08-11 | 2018-02-23 | 阿里巴巴集团控股有限公司 | Control system, control process method and device |
CN108089695A (en) * | 2016-11-23 | 2018-05-29 | 纳恩博(北京)科技有限公司 | A kind of method and apparatus for controlling movable equipment |
CN108398906A (en) * | 2018-03-27 | 2018-08-14 | 百度在线网络技术(北京)有限公司 | Apparatus control method, device, electric appliance, total control equipment and storage medium |
CN108490832A (en) * | 2018-03-27 | 2018-09-04 | 百度在线网络技术(北京)有限公司 | Method and apparatus for sending information |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110047494A (en) * | 2019-04-15 | 2019-07-23 | 北京小米智能科技有限公司 | Equipment response method, equipment and storage medium |
CN110047494B (en) * | 2019-04-15 | 2022-06-03 | 北京小米智能科技有限公司 | Device response method, device and storage medium |
CN110136714A (en) * | 2019-05-14 | 2019-08-16 | 北京探境科技有限公司 | Natural interaction sound control method and device |
CN112207812A (en) * | 2019-07-12 | 2021-01-12 | 阿里巴巴集团控股有限公司 | Device control method, device, system and storage medium |
CN110556115A (en) * | 2019-09-10 | 2019-12-10 | 深圳创维-Rgb电子有限公司 | IOT equipment control method based on multiple control terminals, control terminal and storage medium |
EP3793202A1 (en) * | 2019-09-11 | 2021-03-17 | Beijing Xiaomi Mobile Software Co., Ltd. | Method and device for voice control,terminal, and storage |
CN110730115B (en) * | 2019-09-11 | 2021-11-09 | 北京小米移动软件有限公司 | Voice control method and device, terminal and storage medium |
US11335345B2 (en) | 2019-09-11 | 2022-05-17 | Beijing Xiaomi Mobile Software Co., Ltd. | Method for voice control, terminal, and non-transitory computer-readable storage medium |
CN110730115A (en) * | 2019-09-11 | 2020-01-24 | 北京小米移动软件有限公司 | Voice control method and device, terminal and storage medium |
CN112908321A (en) * | 2020-12-02 | 2021-06-04 | 青岛海尔科技有限公司 | Device control method, device, storage medium, and electronic apparatus |
CN112417712A (en) * | 2021-01-21 | 2021-02-26 | 深圳市友杰智新科技有限公司 | Target device determination method and device, computer device and storage medium |
WO2022188552A1 (en) * | 2021-03-10 | 2022-09-15 | Oppo广东移动通信有限公司 | Device control method and related apparatus |
CN115086095A (en) * | 2021-03-10 | 2022-09-20 | Oppo广东移动通信有限公司 | Equipment control method and related device |
CN115086096A (en) * | 2021-03-15 | 2022-09-20 | Oppo广东移动通信有限公司 | Method, apparatus, device and storage medium for responding control voice |
Also Published As
Publication number | Publication date |
---|---|
CN109032039B (en) | 2021-05-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109032039A (en) | A kind of method and device of voice control | |
CN103716669B (en) | Electronic device and control method thereof | |
US10938725B2 (en) | Load balancing multimedia conferencing system, device, and methods | |
CN107748500A (en) | Method and apparatus for controlling smart machine | |
KR102077887B1 (en) | Enhancing video conferences | |
US20170251138A1 (en) | Video call method and device | |
CN107655154A (en) | Terminal control method, air conditioner and computer-readable recording medium | |
CN109243448A (en) | A kind of sound control method and device | |
US11503110B2 (en) | Method for presenting schedule reminder information, terminal device, and cloud server | |
CN109360558A (en) | A kind of method and apparatus of voice answer-back | |
CN113238727A (en) | Screen switching method and device, computer readable medium and electronic equipment | |
CN111881740A (en) | Face recognition method, face recognition device, electronic equipment and medium | |
CN106375594A (en) | Method and device for adjusting equipment, and electronic equipment | |
CN107622241A (en) | Display methods and device for mobile device | |
CN112533070B (en) | Video sound and picture adjusting method, terminal and computer readable storage medium | |
CN111126159A (en) | Method, apparatus, electronic device, and medium for tracking pedestrian in real time | |
CN109388238A (en) | The control method and device of a kind of electronic equipment | |
CN110570383A (en) | image processing method and device, electronic equipment and storage medium | |
CN207718803U (en) | Multiple source speech differentiation identifying system | |
CN113342170A (en) | Gesture control method, device, terminal and storage medium | |
CN105991825A (en) | Voice control method, device and system | |
CA3040268A1 (en) | Methods, systems, and media for pairing devices to complete a task using an application request | |
CN108415572B (en) | Module control method and device applied to mobile terminal and storage medium | |
CN109976169B (en) | Internet television intelligent control method and system based on self-learning technology | |
CN116560512A (en) | Virtual digital human interaction method, electronic equipment, system and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | Effective date of registration: 20210402. Address after: 210038 8th floor, building D11, Hongfeng science and Technology Park, Nanjing Economic and Technological Development Zone, Jiangsu Province. Applicant after: New Technology Co.,Ltd. Address before: 100080 Suzhou Street, Haidian District, Haidian District, No. 1602, room 16, room 16. Applicant before: Beijing Yufanzhi Information Technology Co.,Ltd. |
GR01 | Patent grant | ||