CN103634442A - Three-dimensional gesture and voice-based autodialing method and mobile terminal - Google Patents

Three-dimensional gesture and voice-based autodialing method and mobile terminal Download PDF

Info

Publication number
CN103634442A
CN103634442A CN201210308653.7A CN201210308653A CN103634442A CN 103634442 A CN103634442 A CN 103634442A CN 201210308653 A CN201210308653 A CN 201210308653A CN 103634442 A CN103634442 A CN 103634442A
Authority
CN
China
Prior art keywords
mobile terminal
user
voice
default position
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201210308653.7A
Other languages
Chinese (zh)
Other versions
CN103634442B (en
Inventor
齐颖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201210308653.7A priority Critical patent/CN103634442B/en
Publication of CN103634442A publication Critical patent/CN103634442A/en
Application granted granted Critical
Publication of CN103634442B publication Critical patent/CN103634442B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Telephone Function (AREA)

Abstract

The invention brings forward a three-dimensional gesture and voice-based autodialing method. The method comprises the following steps that: a mobile terminal detects whether the motion of a user conforms to a preset track; if so, an image sensor is started to detect whether an image of a preset part of the user is captured and a range sensor is started to detect whether a distance between the mobile terminal and the preset part of the user is less than or equal to a preset distance; if the image sensor captures the preset part of the user, the distance between the mobile terminal and the preset part is less than or equal to the preset distance and a voice dialing mode is turned on; under the voice dialing mode, a voice signal emitted by the user is collected and is parsed to obtain voice information; and a contact telephone number corresponding to the voice information is dialed automatically. According to the method, the task flow for dialing is simplified and the operation is rapid and efficient; and the dialing interaction mode is natural and humanized. In addition, the invention also brings forward a mobile terminal.

Description

Automatic dial method based on three-dimension gesture and voice and mobile terminal
Technical field
The present invention relates to mobile communication technology field, particularly a kind of automatic dial method and a kind of mobile terminal based on three-dimension gesture and voice.
Background technology
Three-dimension gesture be the motion in three dimensions of limbs based on people and make there is identity and semantic gesture.Three-dimension gesture can not contact machinery equipment to carry out separately, after also can contacting with machinery equipment, carries out, and jointly completes three-dimension gesture.Three-dimension gesture technology is at present in extensive use of field of play, also very wide in the application prospect in other field.
Speech recognition technology, is also referred to as automatic speech recognition, and its target is that the vocabulary content in the mankind's voice is converted to computer-readable input, for example button, binary coding or character string.
At present, existing mobile terminal for example the form of mobile phone dialing phone mainly contain two kinds.A kind of is to trigger by touch screen button, for example, phone mother, needs first release mobile phone, at telephone dial input mother's telephone number, then taps " dialing " button and transfers to; Or find the mother's who has stored telephone number, then tap " dialing " button and transfer to.Another kind is to trigger by complete phonetic order, for example, phone mother, needs first opening voice pattern, then says " phoning mother ", and phone is transferred to automatically.
The shortcoming of prior art is: adopt the first touch screen button to trigger while calling, at least need to operate four steps:
(1) by power key, wake for example screen of mobile phone of mobile terminal up;
(2) mobile phone release;
(3) find make a phone call application or object contact person;
(4) extract phone.
As from the foregoing, prior art call complex steps, poor in timeliness, especially in case of emergency, more inconvenient operation.
While adopting the second to call by complete phonetic order triggering, memory capacitance is large, and interactive form is inflexible, not fault-tolerant.And, need to remember phonetic order complete, standard, during operation, accurately export phonetic order, once make mistakes, just cannot finish the work.In addition, inflexible interactive form makes user and machinery equipment produce estrangement, lacks humanistic care breath.
Summary of the invention
The present invention is intended to one of solve the problems of the technologies described above at least to a certain extent.
For this reason, one object of the present invention is to propose a kind of automatic dial method based on three-dimension gesture and voice, and the flow of task of calling is simplified, and operates quickness and high efficiency more, and make interactive mode naturalization, the hommization more of calling, easily understand, remember, operate.
Second object of the present invention is also to propose a kind of mobile terminal.
For achieving the above object, a kind of automatic dial method based on three-dimension gesture and voice that a first aspect of the present invention embodiment proposes, comprises the steps:
Whether the action that mobile terminal detects user meets desired trajectory, if met, start imageing sensor and detect the image at the default position whether capture described user and start the distance that range sensor detects described mobile terminal and described user's default position whether be less than or equal to predeterminable range;
If described image capture sensor is to described user's default position, and described mobile terminal with described default position apart from being less than or equal to described predeterminable range, start phonetic dialing pattern;
Under described phonetic dialing pattern, gather the voice signal that described user sends, and described voice signal is resolved to obtain voice messaging; And
Automatically dial the contact phone number corresponding with described voice messaging.
According to the automatic dial method based on three-dimension gesture and voice of the embodiment of the present invention, by three-dimension gesture, mutual and these two kinds of natural, abundant interactive modes of interactive voice combine, intellectuality, hommization, the naturalized interactive mode of calling on mobile terminal, have been realized, the quickness and high efficiency more that operates, easily understands, memory, operation.Meanwhile, effectively reduce the embarrassment that user brings because making a mistake, avoid the misoperation bringing because of single channel error.In addition in emergency circumstances, can at utmost save time,, avoid misoperation, avoid dangerous and injury.
In one embodiment of the invention, whether the action that described mobile terminal utilizes gesture transducer to detect described user meets desired trajectory, wherein, the outside utilizing emitted light signal of described gesture transducer, and according to the reverberation receiving, obtain the track of described user action.
In one embodiment of the invention, described startup imageing sensor detects the image at the default position that whether captures described user, comprises the steps:
Described imageing sensor gathers the image information in current window, and judges whether described image information mates with described default position, and if so, judgement captures the image at described user's default position.
According to the automatic dial method based on three-dimension gesture and voice of the embodiment of the present invention, make interaction style naturalization, the hommization more of calling.
In one embodiment of the invention, describedly automatically dial the contact phone number corresponding with described voice messaging, comprise the steps:
Judge whether described voice messaging belongs to contact name or the contact phone prestoring, wherein, described contacts list comprises described contact name and corresponding contact phone;
If so, automatically dial the contact phone number corresponding with described voice messaging.
With three-dimension gesture, two passages of voice, locate dialing to certain contact person's task operating, can greatly increase the accuracy of task operating location, avoid the misoperation bringing because of single channel error.
Second aspect present invention embodiment has proposed a kind of mobile terminal, comprise: three-dimension gesture detection module, whether the action for detection of user meets desired trajectory, if met,, when capturing described user's the image at default position and the distance at described mobile terminal and described user's default position and be less than or equal to predeterminable range, send phonetic dialing enabling signal; Acquisition module, the voice signal sending for gathering described user; Voice parsing module, described voice parsing module is connected with described acquisition module, for described voice signal is resolved to obtain voice messaging; And dial module, the described module of dialing is connected with described voice parsing module with described three-dimension gesture detection module respectively, for after receiving described phonetic dialing enabling signal, start phonetic dialing pattern, and automatically dial the contact phone number corresponding with described voice messaging.
According to the mobile terminal of the embodiment of the present invention, by three-dimension gesture is mutual and these two kinds of natural, abundant interactive modes of interactive voice, combine, can make the flow of task of calling simplify, operate more simple and convenient, effectively reduce in addition the embarrassment that user brings because making a mistake, especially in case of emergency, can at utmost save time, avoid misoperation, avoid dangerous and injury.
In one embodiment of the invention, described three-dimension gesture detection module comprises: whether gesture transducer, meet desired trajectory for detection of described user's action, wherein, the outside utilizing emitted light signal of described gesture transducer, and according to the track that receives reverberation and obtain described user action; Imageing sensor, for gathering the image information in current window, and judges whether described image information mates with described default position, and if so, judgement captures the image at described user's default position; And range sensor, for detection of the current distance at described mobile terminal and described default position, and judge whether current distance is less than or equal to described predeterminable range.
Wherein, described gesture transducer comprises: light source, for outside utilizing emitted light signal; Optical controller, for receiving reverberation, and focuses on described reverberation; Photo-detector, for the reverberation after collectiong focusing, and is converted to the signal of telecommunication by the reverberation after described focusing; Controller, for receiving the described signal of telecommunication, and is converted to application readable format by the described signal of telecommunication.
In an example of the present invention, described light source is LED or laser diode.
The mobile terminal of the embodiment of the present invention, make interactive mode naturalization, hommization, the intellectuality more of calling, and the interaction style of calling is more friendly.
In one embodiment of the invention, described mobile terminal also comprises memory module, and for storing described contacts list, wherein, described contacts list comprises described contact name and corresponding contact phone.
Further, described voice parsing module is connected with described memory module, for reading described contacts list, and judges whether described voice messaging belongs to contact name or the contact phone prestoring.
Adopt three-dimension gesture, two passages of voice to locate dialing to certain contact person's task operating, can greatly increase the accuracy of task operating location, avoid the misoperation bringing because of single channel error.
In an example of the present invention, described acquisition module is receiver.
Additional aspect of the present invention and advantage in the following description part provide, and part will become obviously from the following description, or recognize by practice of the present invention.
Accompanying drawing explanation
Above-mentioned and/or additional aspect of the present invention and advantage accompanying drawing below combination obviously and is easily understood becoming the description of embodiment, wherein:
Fig. 1 is according to the flow chart of the automatic dial method based on three-dimension gesture and voice of the embodiment of the present invention;
Fig. 2 is according to the structural representation of gesture transducer in the embodiment of the present invention;
Fig. 3 is the particular flow sheet of the automatic dial method based on three-dimension gesture and voice according to an embodiment of the invention;
Fig. 4 is according to the structural representation of the mobile terminal of the embodiment of the present invention; With
Fig. 5 is according to the structural representation of the three-dimension gesture detection module of the embodiment of the present invention.
Embodiment
Describe embodiments of the invention below in detail, the example of described embodiment is shown in the drawings, and wherein same or similar label represents same or similar element or has the element of identical or similar functions from start to finish.Below by the embodiment being described with reference to the drawings, be exemplary, be intended to for explaining the present invention, and can not be interpreted as limitation of the present invention.
In the present invention, unless otherwise clearly defined and limited, the terms such as term " installation ", " being connected ", " connection ", " fixing " should be interpreted broadly, and for example, can be to be fixedly connected with, and can be also to removably connect, or connect integratedly; Can be mechanical connection, can be to be also electrically connected to; Can be to be directly connected, also can indirectly be connected by intermediary, can be the connection of two element internals.For the ordinary skill in the art, can understand as the case may be above-mentioned term concrete meaning in the present invention.
The automatic dial method based on three-dimension gesture and voice proposing according to first aspect present invention embodiment is described below with reference to Fig. 1 to Fig. 3.
As shown in Figure 1, the automatic dial method that the embodiment of the present invention provides comprises the steps:
S101, whether the action that mobile terminal detects user meets desired trajectory, if met, start whether imageing sensor detection captures this user's the image at default position and whether the distance at startup range sensor detection mobile terminal and this user's default position is less than or equal to predeterminable range.
In one embodiment of the invention, whether the action that mobile terminal utilizes gesture transducer to detect user meets desired trajectory, wherein, and the outside utilizing emitted light signal of gesture transducer, and according to the reverberation receiving, obtain the track of user action.
Particularly, as shown in Figure 2, gesture transducer comprises light source 201, optical controller 202, photo-detector 203 and controller 204.
Light source 201 is for outside utilizing emitted light signal, the general LED(Light Emitting Diode that adopts, light-emitting diode) or laser diode, conventionally can produce infrared light or near infrared light, this light is generally difficult for discovering for user, and mostly pass through light modulation, can improve the resolution of gesture transducer.
Optical controller 202 is for receiving reverberation, and reverberation is focused on.That is to say, optical controller 202 contributes to the ambient lighting of realizing ideal, and reverberation is focused on the surface of photo-detector 203.In addition, the band pass filter in optical controller 202 can filtering affects bias light and other stray lights of performance, and the reverberation that only has the optical frequency with light source 201 to match just can enter the light-sensitive element of optical controller 202.
Photo-detector 203 is for the reverberation after collectiong focusing, and the reverberation after focusing on is converted to the signal of telecommunication.That is to say, photo-detector 203 can detect the reverberation through filtering, and is converted into the signal of telecommunication, for controller 204, processes.
Controller 204 is for receiving the signal of telecommunication after photo-detector 203 conversion, and converts electrical signals to application readable format.For example, controller 204 can be super high-speed A SIC(Application Specific Integrated Circuit, the integrated circuit of specialized application) or DSP(Digital Signal Processing, Digital Signal Processing) chip, can for example, to the information receiving (signal of telecommunication), process, be converted into the form that terminal use's application (for example software in mobile terminal) can be understood.In one embodiment of the invention, in step S101, start the image that imageing sensor detects the default position that whether captures this user, further comprise:
Imageing sensor gathers the image information in current window, and judges whether image information mates with default position, and if so, judgement captures the image at user's default position.
In an example of the present invention, desired trajectory can be for user by mobile terminal from far-end the track near health, default position can be ear.In other words, user by mobile terminal from far-end progressively near health, until the position of ear.
Particularly, light source 201, for outside utilizing emitted light signal, comprises to user's utilizing emitted light signal.Optical controller 202 receives from outside reverberation, and reverberation is focused on.Wherein, optical controller 202 receives by the reverberation of user's the health reflection line focusing of going forward side by side.Photo-detector 203 is converted to the signal of telecommunication by the reverberation after focusing on, and is sent to controller 204.By 204 pairs of these signals of telecommunication of controller, analyzed, draw the transmission path of light, and then judge whether this transmission path meets default track.Under the condition meeting, further by imageing sensor, detected the image of the ear that whether captures user.Wherein, imageing sensor can be the camera head of mobile terminal.
S102, if image capture sensor to user's default position, and mobile terminal with default position apart from being less than or equal to predeterminable range, start phonetic dialing pattern.
That is to say, in this reciprocal process, mobile terminal detects user by gesture transducer and picks up this action that mobile terminal is pressed close to health, meet desired trajectory, then for example, by imageing sensor (image information of taking according to camera), catch user's body part, be above-mentioned default position, for example ear.
Particularly, mobile terminal detects user by gesture transducer and picks up this action that mobile terminal is pressed close to health, then according to image capture sensor, to mobile terminal, be the default position ear for example that is attached to user's body, by range sensor, judge that the distance of mobile terminal and ear is to be less than or equal to predeterminable range (for example 1 centimetre) again, when the judgement of above these three conditions all meets the requirements while making a phone call sight, mobile terminal starts phonetic dialing pattern automatically.
In an example of the present invention, mobile terminal can be mobile phone, is understandable that, the mobile terminal in example of the present invention is not limited in this.
S103, under phonetic dialing pattern, gathers the voice signal that user sends, and voice signal is resolved to obtain voice messaging.
That is to say, the voice signal that the language parsing module of mobile terminal sends the user who collects converts phonetic order to, directly controls mobile terminal and dials phone number.
S104, dials the contact phone number corresponding with voice messaging automatically.
That is to say, step S104 dials the contact phone number corresponding with voice messaging automatically, also further comprises:
Judge whether voice messaging belongs to contact name or the contact phone prestoring, wherein, contacts list comprises contact name and corresponding contact phone; If so, automatically dial the contact phone number corresponding with voice messaging.For example, user says " Xiao Ming ", and mobile terminal is the information of " Xiao Ming " dial out the telephone number corresponding with contact person in retrieves contact list immediately.Or user says and wants the telephone number dialed, mobile terminal retrieves immediately this number in contacts list and dials out.
Particularly, take mobile phone is below described in detail the flow process of the automatic dial method based on three-dimension gesture and voice of the embodiment of the present invention as example.As shown in Figure 3, the above-mentioned automatic dial method based on three-dimension gesture and voice, comprises the steps:
S301, whether the action that mobile phone detects user is to pick up the action that mobile phone is pressed close to health.If so, enter next step S302; If not, return to step S301, proceed to detect.
S302, starts handset image transducer, and gathers the image information in current window.
S303, judges whether image information is user's ear.If so, enter next step S304; If not, return to step S302.
S304, handset image transducer captures presses close to the ear that region is user.
S305, starts mobile phone range sensor.
S306, whether the distance of mobile phone range sensor detection of handset and user's ear is less than or equal to 1 centimetre.Preferably, in an example of the present invention, predeterminable range judges with 1 centimetre.If so, enter next step S307; If not, return to step S305.
S307, starts phonetic dialing pattern.
S308, gathers the voice signal that user sends, and voice signal is resolved to obtain voice messaging.
S309, judges whether voice messaging belongs to contact name or the contact phone prestoring in mobile phone.If so, enter next step; If not, return to step S308, Resurvey user's voice signal.
S310, dials the contact phone number corresponding with voice messaging automatically.
In an example of the present invention, mobile phone is placed on the table, be screen lock state, at this moment little U.S.A comes over, pick up the mobile phone on table and press close in ear to 1 centimetre, directly saying: " mother ", then mobile phone carries out voice feedback and " to mother, dials ", after several seconds, mother's phone has just been connected.
According to the automatic dial method based on three-dimension gesture and voice of the embodiment of the present invention, by three-dimension gesture is mutual and these two kinds of natural, abundant interactive modes of interactive voice, combine, there is following advantage: (1) simplifies the flow of task of calling, operate more quick, efficient; (2) make interactive mode naturalization, the hommization more of calling, easily understand, memory, operation; (3) make the interaction style called more friendly, effectively reduce the embarrassment that user brings because making a mistake; (4) with three-dimension gesture, two passages of voice, locate dialing to contact person's task operating, can greatly increase the accuracy of task operating location, avoid the misoperation bringing because of single channel error; (5) in emergency circumstances, can at utmost save time, avoid misoperation, avoid dangerous and injury.
Below with reference to Fig. 4 and Fig. 5, the mobile terminal proposing according to second aspect present invention embodiment is described.
As shown in Figure 4, this mobile terminal comprises three-dimension gesture detection module 401, acquisition module 402, voice parsing module 403 and dials module 404.
Wherein, whether three-dimension gesture detection module 401 meets desired trajectory for detection of user's action, if met,, when capturing user's the image at default position and the distance at mobile terminal and user's default position and be less than or equal to predeterminable range, send phonetic dialing enabling signal.The voice signal that acquisition module 402 sends for gathering user.Voice parsing module 403 is connected with acquisition module 402, for voice signal being resolved to obtain voice messaging.Dial module 404 and be connected with voice parsing module 403 with three-dimension gesture detection module 401 respectively, for after receiving phonetic dialing enabling signal, start phonetic dialing pattern, and automatically dial the contact phone number corresponding with voice messaging.That is to say, the voice signal that the user that language parsing module 403 collects acquisition module 402 sends converts phonetic order to, and direct control is dialed module 404 and dialed phone number.
Further, in one embodiment of the invention, as shown in Figure 5, three-dimension gesture detection module 401 comprises gesture transducer 501, imageing sensor 502 and range sensor 503.
Whether gesture transducer 501 meets desired trajectory for detection of user's action, wherein, and the outside utilizing emitted light signal of gesture transducer, and according to the track that receives reverberation and obtain user action.
Imageing sensor 502 is for gathering the image information in current window, and judges whether image information mates with default position, and if so, judgement captures the image at user's default position.
Range sensor 503 is for detection of the current distance at mobile terminal and default position, and judges whether current distance is less than or equal to predeterminable range.Preferably, in an example of the present invention, predeterminable range can be 1 centimetre.
That is to say, in this reciprocal process, gesture transducer 501 detects user and picks up this action that mobile terminal is pressed close to health, be that this action meets desired trajectory, the image information of then for example taking according to camera by imageing sensor 502() catch user's body part, be above-mentioned default position, in an example of the present invention, default position can be ear.
Particularly, gesture transducer 501 detects user and picks up this action that mobile terminal is pressed close to health, then according to imageing sensor 502, capturing mobile terminal is the default position ear for example that is attached to user's body, distance by range sensor 503 judgement mobile terminals and ear is to be less than or equal to predeterminable range (for example 1 centimetre) again, when the judgement of above these three conditions all meets the requirements while making a phone call sight, three-dimension gesture detection module 401 sends phonetic dialing enabling signal automatically.
Particularly, in one embodiment of the invention, as shown in Figure 2, gesture transducer 501 comprises light source 201, optical controller 202, photo-detector 203 and controller 204.
Wherein, light source 201, for outside utilizing emitted light signal, generally adopts LED or laser diode, conventionally can produce infrared light or near infrared light, and this light is generally difficult for for user discovers, and mostly passes through light modulation, can improve the resolution of gesture transducer 501.
Optical controller 202 is for receiving reverberation, and reverberation is focused on.That is to say, optical controller 202 contributes to the ambient lighting of realizing ideal, and reverberation is focused on the surface of photo-detector 203.In addition, the band pass filter in optical controller 202 can filtering affects bias light and other stray lights of performance, and the reverberation that only has the optical frequency with light source 201 to match just can enter the light-sensitive element of optical controller 202.
Photo-detector 203 is for the reverberation after collectiong focusing, and the reverberation after focusing on is converted to the signal of telecommunication.That is to say, photo-detector 203 can detect the reverberation through filtering, and is converted into the signal of telecommunication, for controller 204, processes.
Controller 204 is for receiving the signal of telecommunication after photo-detector 203 conversion, and converts electrical signals to application readable format.For example, controller 204 can be super high-speed A SIC or dsp chip, can for example, to the information receiving (signal of telecommunication), process, and is converted into the form that terminal use's application (for example software in mobile terminal) can be understood.
The mobile terminal of the embodiment of the present invention, make interactive mode naturalization, hommization, the intellectuality more of calling, and the interaction style of calling is more friendly.
In one embodiment of the invention, as shown in Figure 4, this mobile terminal also comprises memory module 405, and for storing contact list, wherein, contacts list comprises contact name and corresponding contact phone.
Further, as shown in Figure 4, voice parsing module 403 is connected with memory module 405, for reading contacts list, and judges whether voice messaging belongs to contact name or the contact phone prestoring.For example, acquisition module 402 collects the voice signal that user says " Xiao Ming ", voice parsing module 403 is resolved and the information of " Xiao Ming " in retrieves contact list immediately, dials module 404 automatic pokings and gets the telephone number corresponding with contact person " Xiao Ming ".Or acquisition module 402 collects user and says the voice signal of wanting the telephone number dialed, voice parsing module 403 is resolved and is retrieved immediately this number in contacts list, then dials module 404 automatic pokings and gets this number.
In an example of the present invention, acquisition module 402 is receiver or audio monitoring module.
Adopt three-dimension gesture, two passages of voice to locate dialing to contact person's task operating, can greatly increase the accuracy of task operating location, avoid the misoperation bringing because of single channel error.
In one embodiment of the invention, this mobile terminal can be mobile phone, is understandable that, the mobile terminal of the embodiment of the present invention is not limited in this.
In an example of the present invention, mobile phone is placed on the table, be screen lock state, at this moment little U.S.A comes over, pick up the mobile phone on table and press close in ear to 1 centimetre, directly saying: " mother ", then mobile phone carries out voice feedback and " to mother, dials ", after several seconds, mother's phone has just been connected.
According to the mobile terminal of the embodiment of the present invention, by three-dimension gesture is mutual and these two kinds of natural, abundant interactive modes of interactive voice, combine, can make the flow of task of calling simplify, operate more simple and convenient, effectively reduce in addition the embarrassment that user brings because making a mistake, especially in case of emergency, can at utmost save time, avoid misoperation, avoid dangerous and injury.
In flow chart or any process of otherwise describing at this or method describe and can be understood to, represent to comprise that one or more is for realizing module, fragment or the part of code of executable instruction of the step of specific logical function or process, and the scope of the preferred embodiment of the present invention comprises other realization, wherein can be not according to order shown or that discuss, comprise according to related function by the mode of basic while or by contrary order, carry out function, this should be understood by embodiments of the invention person of ordinary skill in the field.
The logic and/or the step that in flow chart, represent or otherwise describe at this, for example, can be considered to for realizing the sequencing list of the executable instruction of logic function, may be embodied in any computer-readable medium, for instruction execution system, device or equipment (as computer based system, comprise that the system of processor or other can and carry out the system of instruction from instruction execution system, device or equipment instruction fetch), use, or use in conjunction with these instruction execution systems, device or equipment.With regard to this specification, " computer-readable medium " can be anyly can comprise, storage, communication, propagation or transmission procedure be for instruction execution system, device or equipment or the device that uses in conjunction with these instruction execution systems, device or equipment.The example more specifically of computer-readable medium (non-exhaustive list) comprises following: the electrical connection section (electronic installation) with one or more wirings, portable computer diskette box (magnetic device), random-access memory (ram), read-only memory (ROM), the erasable read-only memory (EPROM or flash memory) of editing, fiber device, and portable optic disk read-only memory (CDROM).In addition, computer-readable medium can be even paper or other the suitable medium that can print described program thereon, because can be for example by paper or other media be carried out to optical scanner, then edit, decipher or process in electronics mode and obtain described program with other suitable methods if desired, be then stored in computer storage.
Should be appreciated that each several part of the present invention can realize with hardware, software, firmware or their combination.In the above-described embodiment, a plurality of steps or method can realize with being stored in memory and by software or the firmware of suitable instruction execution system execution.For example, if realized with hardware, the same in another embodiment, can realize by any one in following technology well known in the art or their combination: have for data-signal being realized to the discrete logic of the logic gates of logic function, the application-specific integrated circuit (ASIC) with suitable combinational logic gate circuit, programmable gate array (PGA), field programmable gate array (FPGA) etc.
Those skilled in the art are appreciated that realizing all or part of step that above-described embodiment method carries is to come the hardware that instruction is relevant to complete by program, described program can be stored in a kind of computer-readable recording medium, this program, when carrying out, comprises step of embodiment of the method one or a combination set of.
In addition, each functional unit in each embodiment of the present invention can be integrated in a processing module, can be also that the independent physics of unit exists, and also can be integrated in a module two or more unit.Above-mentioned integrated module both can adopt the form of hardware to realize, and also can adopt the form of software function module to realize.If described integrated module usings that the form of software function module realizes and during as production marketing independently or use, also can be stored in a computer read/write memory medium.
The above-mentioned storage medium of mentioning can be read-only memory, disk or CD etc.
In the description of this specification, the description of reference term " embodiment ", " some embodiment ", " example ", " concrete example " or " some examples " etc. means to be contained at least one embodiment of the present invention or example in conjunction with specific features, structure, material or the feature of this embodiment or example description.In this manual, the schematic statement of above-mentioned term is not necessarily referred to identical embodiment or example.And the specific features of description, structure, material or feature can be with suitable mode combinations in any one or more embodiment or example.
Although illustrated and described embodiments of the invention above, be understandable that, above-described embodiment is exemplary, can not be interpreted as limitation of the present invention, those of ordinary skill in the art can change above-described embodiment within the scope of the invention in the situation that not departing from principle of the present invention and aim, modification, replacement and modification.

Claims (11)

1. the automatic dial method based on three-dimension gesture and voice, is characterized in that, comprises the steps:
Whether the action that mobile terminal detects user meets desired trajectory, if met, start imageing sensor and detect the image at the default position whether capture described user and start the distance that range sensor detects described mobile terminal and described user's default position whether be less than or equal to predeterminable range;
If described image capture sensor is to described user's default position, and described mobile terminal with described default position apart from being less than or equal to described predeterminable range, start phonetic dialing pattern;
Under described phonetic dialing pattern, gather the voice signal that described user sends, and described voice signal is resolved to obtain voice messaging; And
Automatically dial the contact phone number corresponding with described voice messaging.
2. automatic dial method as claimed in claim 1, it is characterized in that, whether the action that described mobile terminal utilizes gesture transducer to detect described user meets desired trajectory, wherein, the outside utilizing emitted light signal of described gesture transducer, and according to the reverberation receiving, obtain the track of described user action.
3. automatic dial method as claimed in claim 1, is characterized in that, described startup imageing sensor detects the image at the default position that whether captures described user, comprises the steps:
Described imageing sensor gathers the image information in current window, and judges whether described image information mates with described default position, and if so, judgement captures the image at described user's default position.
4. automatic dial method as claimed in claim 1, is characterized in that, describedly automatically dials the contact phone number corresponding with described voice messaging, comprises the steps:
Judge whether described voice messaging belongs to contact name or the contact phone prestoring, wherein, described contacts list comprises described contact name and corresponding contact phone;
If so, automatically dial the contact phone number corresponding with described voice messaging.
5. a mobile terminal, is characterized in that, comprising:
Three-dimension gesture detection module, whether the action for detection of user meets desired trajectory, if met,, when capturing described user's the image at default position and the distance at described mobile terminal and described user's default position and be less than or equal to predeterminable range, send phonetic dialing enabling signal;
Acquisition module, the voice signal sending for gathering described user;
Voice parsing module, described voice parsing module is connected with described acquisition module, for described voice signal is resolved to obtain voice messaging; And
Dial module, the described module of dialing is connected with described voice parsing module with described three-dimension gesture detection module respectively, for after receiving described phonetic dialing enabling signal, start phonetic dialing pattern, and automatically dial the contact phone number corresponding with described voice messaging.
6. mobile terminal as claimed in claim 5, is characterized in that, described three-dimension gesture detection module comprises:
Whether gesture transducer, meet desired trajectory for detection of described user's action, wherein, and the outside utilizing emitted light signal of described gesture transducer, and according to the track that receives reverberation and obtain described user action;
Imageing sensor, for gathering the image information in current window, and judges whether described image information mates with described default position, and if so, judgement captures the image at described user's default position; And
Range sensor, for detection of the current distance at described mobile terminal and described default position, and judges whether current distance is less than or equal to described predeterminable range.
7. mobile terminal as claimed in claim 6, is characterized in that, described gesture transducer comprises:
Light source, for outside utilizing emitted light signal;
Optical controller, for receiving reverberation, and focuses on described reverberation;
Photo-detector, for the reverberation after collectiong focusing, and is converted to the signal of telecommunication by the reverberation after described focusing;
Controller, for receiving the described signal of telecommunication, and is converted to application readable format by the described signal of telecommunication.
8. mobile terminal as claimed in claim 7, is characterized in that, described light source is LED or laser diode.
9. mobile terminal as claimed in claim 5, is characterized in that, also comprises memory module, and for storing described contacts list, wherein, described contacts list comprises described contact name and corresponding contact phone.
10. mobile terminal as claimed in claim 9, is characterized in that, described voice parsing module is connected with described memory module, for reading described contacts list, and judges whether described voice messaging belongs to contact name or the contact phone prestoring.
11. mobile terminals as claimed in claim 5, is characterized in that, described acquisition module is receiver.
CN201210308653.7A 2012-08-27 2012-08-27 Based on three-dimension gesture and the automatic dial method of voice and mobile terminal Active CN103634442B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210308653.7A CN103634442B (en) 2012-08-27 2012-08-27 Based on three-dimension gesture and the automatic dial method of voice and mobile terminal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210308653.7A CN103634442B (en) 2012-08-27 2012-08-27 Based on three-dimension gesture and the automatic dial method of voice and mobile terminal

Publications (2)

Publication Number Publication Date
CN103634442A true CN103634442A (en) 2014-03-12
CN103634442B CN103634442B (en) 2016-08-03

Family

ID=50215054

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210308653.7A Active CN103634442B (en) 2012-08-27 2012-08-27 Based on three-dimension gesture and the automatic dial method of voice and mobile terminal

Country Status (1)

Country Link
CN (1) CN103634442B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104010059A (en) * 2014-06-09 2014-08-27 深圳市中兴移动通信有限公司 Mobile terminal and method and device for making call through mobile terminal
CN104111728A (en) * 2014-06-26 2014-10-22 联想(北京)有限公司 Electronic device and voice command input method based on operation gestures
CN106325481A (en) * 2015-06-30 2017-01-11 展讯通信(天津)有限公司 A non-contact type control system and method and a mobile terminal
CN109542235A (en) * 2018-12-04 2019-03-29 广东小天才科技有限公司 Screen operation method and device of intelligent terminal and intelligent terminal
WO2019100331A1 (en) * 2017-11-24 2019-05-31 深圳传音通讯有限公司 Method for answering incoming call, terminal, storage medium, and computer program
CN113094483A (en) * 2021-03-30 2021-07-09 东风柳州汽车有限公司 Vehicle feedback information processing method and device, terminal equipment and storage medium

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN2369426Y (en) * 1998-11-13 2000-03-15 清华大学 Speech dialing telephone
CN102469300A (en) * 2010-11-19 2012-05-23 赵秋娴 Intelligent visual doorbell capable of inducing and wirelessly switching

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN2369426Y (en) * 1998-11-13 2000-03-15 清华大学 Speech dialing telephone
CN102469300A (en) * 2010-11-19 2012-05-23 赵秋娴 Intelligent visual doorbell capable of inducing and wirelessly switching

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104010059A (en) * 2014-06-09 2014-08-27 深圳市中兴移动通信有限公司 Mobile terminal and method and device for making call through mobile terminal
CN104010059B (en) * 2014-06-09 2018-08-07 努比亚技术有限公司 A kind of mobile terminal and its realize the method and apparatus made a phone call
CN104111728A (en) * 2014-06-26 2014-10-22 联想(北京)有限公司 Electronic device and voice command input method based on operation gestures
CN104111728B (en) * 2014-06-26 2017-09-29 联想(北京)有限公司 Phonetic order input method and electronic equipment based on operating gesture
CN106325481A (en) * 2015-06-30 2017-01-11 展讯通信(天津)有限公司 A non-contact type control system and method and a mobile terminal
WO2019100331A1 (en) * 2017-11-24 2019-05-31 深圳传音通讯有限公司 Method for answering incoming call, terminal, storage medium, and computer program
CN109542235A (en) * 2018-12-04 2019-03-29 广东小天才科技有限公司 Screen operation method and device of intelligent terminal and intelligent terminal
CN113094483A (en) * 2021-03-30 2021-07-09 东风柳州汽车有限公司 Vehicle feedback information processing method and device, terminal equipment and storage medium
CN113094483B (en) * 2021-03-30 2023-04-25 东风柳州汽车有限公司 Method and device for processing vehicle feedback information, terminal equipment and storage medium

Also Published As

Publication number Publication date
CN103634442B (en) 2016-08-03

Similar Documents

Publication Publication Date Title
CN103634442A (en) Three-dimensional gesture and voice-based autodialing method and mobile terminal
CN106954115B (en) Equipment control method and device
CN107978316A (en) The method and device of control terminal
CN104916287A (en) Voice control method and device and mobile device
US20170244821A1 (en) Information processing device
US20080212753A1 (en) Terminal apparatus, call switching method, and recording medium having stored therein call switching program
CN102546953A (en) System and method for full voice control of mobile terminal
CN103002147A (en) Auto-answer method and device for mobile terminal (MT)
CN104317417B (en) A kind of method that key mouse takes over seamlessly, apparatus and system
CN109634495A (en) Method of payment, device and user equipment
CN109582976A (en) A kind of interpretation method and electronic equipment based on voice communication
CN103257594A (en) Universal voice control device, voice control system and voice control method
CN108595003A (en) Function control method and relevant device
JP2021150946A (en) Wireless earphone device and method for using the same
CN104301522A (en) Information input method in communication and communication terminal
CN109067965A (en) Interpretation method, translating equipment, wearable device and storage medium
CN113301465A (en) Bluetooth headset play control method and Bluetooth headset
CN109510891B (en) Voice-controlled recording device and method
CN102104651A (en) Method for playing reserved voice in incoming call reception of mobile terminal and mobile terminal
CN106023566A (en) Voice-recognition-based Bluetooth remote control device, system and method
CN110312031A (en) Incoming number processing method and electronic equipment
CN106027753B (en) Control method and device for click-to-read story teller
CN103258417A (en) Universal sound-controlled remote control device, system and method
US20110045772A1 (en) Triggering control device and method thereof
CN101645980A (en) Hand-held device and power saving method thereof

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant