CN103634442A

CN103634442A - Three-dimensional gesture and voice-based autodialing method and mobile terminal

Info

Publication number: CN103634442A
Application number: CN201210308653.7A
Authority: CN
Inventors: 齐颖
Original assignee: Beijing Baidu Netcom Science and Technology Co Ltd
Current assignee: Beijing Baidu Netcom Science and Technology Co Ltd
Priority date: 2012-08-27
Filing date: 2012-08-27
Publication date: 2014-03-12
Anticipated expiration: 2032-08-27
Also published as: CN103634442B

Abstract

The invention provides an automatic dialing method based on three-dimensional gestures and voice. The method comprises the following steps: a mobile terminal detects whether a motion of a user conforms to a preset trajectory; if so, an image sensor is started to detect whether an image of a preset body part of the user is captured, and a range sensor is started to detect whether the distance between the mobile terminal and the preset body part of the user is less than or equal to a preset distance; if the image sensor captures the preset body part of the user and the distance between the mobile terminal and the preset body part is less than or equal to the preset distance, a voice dialing mode is started; in the voice dialing mode, a voice signal uttered by the user is collected and parsed to obtain voice information; and the contact telephone number corresponding to the voice information is dialed automatically. The method simplifies the task flow for dialing, makes the operation fast and efficient, and makes the dialing interaction natural and user-friendly. The invention also provides a mobile terminal.

Description

Automatic dialing method based on three-dimensional gestures and voice, and mobile terminal

Technical field

The present invention relates to the field of mobile communication technology, and in particular to an automatic dialing method based on three-dimensional gestures and voice and to a mobile terminal.

Background technology

A three-dimensional gesture is a recognizable, semantically meaningful gesture made by moving a person's limbs in three-dimensional space. A three-dimensional gesture can be performed without touching the device, or it can be performed after contact with the device, with the two together completing the gesture. Three-dimensional gesture technology is currently widely used in gaming and has broad prospects in other fields.

Speech recognition technology, also known as automatic speech recognition, aims to convert the lexical content of human speech into computer-readable input, such as key presses, binary codes, or character strings.

At present, there are two main ways of placing a call on an existing mobile terminal such as a mobile phone. The first is triggered through touch-screen buttons. For example, to call Mom, the user must first unlock the phone, enter Mom's telephone number on the dial pad, and then tap the "Dial" button; or find Mom's stored telephone number and then tap the "Dial" button. The second is triggered by a complete voice command. For example, to call Mom, the user must first turn on voice mode and then say "Call Mom", and the call is placed automatically.

The shortcoming of the prior art is that placing a call through touch-screen buttons requires at least four steps:

(1) wake the screen of the mobile terminal, for example a mobile phone, with the power key;

(2) unlock the phone;

(3) find the phone application or the target contact;

(4) place the call.

As can be seen from the above, placing a call in the prior art involves cumbersome steps and is slow, and it is especially inconvenient in an emergency.

When a call is triggered by a complete voice command, the memory burden on the user is heavy, and the form of interaction is rigid and not fault-tolerant. The user must remember the complete, standard voice command and utter it accurately during operation; once a mistake is made, the task cannot be completed. In addition, the rigid form of interaction creates a sense of distance between the user and the device and lacks a human touch.

Summary of the invention

The present invention aims to solve at least one of the above technical problems, at least to some extent.

To this end, one object of the present invention is to propose an automatic dialing method based on three-dimensional gestures and voice, which simplifies the task flow of placing a call, makes the operation faster and more efficient, and makes the interaction for placing a call more natural and user-friendly, and easy to understand, remember, and operate.

A second object of the present invention is to propose a mobile terminal.

To achieve the above objects, an embodiment of a first aspect of the present invention proposes an automatic dialing method based on three-dimensional gestures and voice, comprising the following steps:

A mobile terminal detects whether a motion of a user conforms to a preset trajectory and, if so, starts an image sensor to detect whether an image of a preset body part of the user is captured and starts a range sensor to detect whether the distance between the mobile terminal and the preset body part of the user is less than or equal to a preset distance;

If the image sensor captures the preset body part of the user and the distance between the mobile terminal and the preset body part is less than or equal to the preset distance, a voice dialing mode is started;

In the voice dialing mode, a voice signal uttered by the user is collected and parsed to obtain voice information; and

The contact telephone number corresponding to the voice information is dialed automatically.

According to the automatic dialing method based on three-dimensional gestures and voice of the embodiment of the present invention, three-dimensional gesture interaction and voice interaction, two natural and rich modes of interaction, are combined to achieve an intelligent, user-friendly, and natural way of placing a call on a mobile terminal, which is faster and more efficient to operate and easy to understand, remember, and operate. At the same time, the method effectively reduces the embarrassment a user feels after making a mistake and avoids misoperation caused by an error in a single channel. In addition, in an emergency, it saves as much time as possible, avoids misoperation, and avoids danger and injury.

In one embodiment of the present invention, the mobile terminal uses a gesture sensor to detect whether the motion of the user conforms to the preset trajectory, wherein the gesture sensor emits an optical signal outward and obtains the trajectory of the user's motion from the reflected light it receives.

In one embodiment of the present invention, starting the image sensor to detect whether an image of the preset body part of the user is captured comprises the following step:

The image sensor collects image information in the current window and judges whether the image information matches the preset body part; if so, it is determined that an image of the preset body part of the user has been captured.

The automatic dialing method based on three-dimensional gestures and voice according to the embodiment of the present invention makes the interaction style for placing a call more natural and user-friendly.

In one embodiment of the present invention, automatically dialing the contact telephone number corresponding to the voice information comprises the following steps:

Judging whether the voice information matches a contact name or contact telephone number pre-stored in a contact list, wherein the contact list comprises contact names and the corresponding contact telephone numbers;

If so, automatically dialing the contact telephone number corresponding to the voice information.

Using two channels, three-dimensional gesture and voice, to locate the task of dialing a particular contact greatly increases the accuracy with which the task is located and avoids misoperation caused by an error in a single channel.

An embodiment of a second aspect of the present invention proposes a mobile terminal, comprising: a three-dimensional gesture detection module, configured to detect whether a motion of a user conforms to a preset trajectory and, if so, to send a voice dialing start signal when an image of a preset body part of the user is captured and the distance between the mobile terminal and the preset body part of the user is less than or equal to a preset distance; an acquisition module, configured to collect a voice signal uttered by the user; a voice parsing module, connected to the acquisition module and configured to parse the voice signal to obtain voice information; and a dialing module, connected to the three-dimensional gesture detection module and to the voice parsing module and configured to start a voice dialing mode after receiving the voice dialing start signal and to automatically dial the contact telephone number corresponding to the voice information.

According to the mobile terminal of the embodiment of the present invention, three-dimensional gesture interaction and voice interaction, two natural and rich modes of interaction, are combined, which simplifies the task flow of placing a call and makes the operation simpler and more convenient. It also effectively reduces the embarrassment a user feels after making a mistake and, especially in an emergency, saves as much time as possible, avoids misoperation, and avoids danger and injury.

In one embodiment of the present invention, the three-dimensional gesture detection module comprises: a gesture sensor, configured to detect whether the motion of the user conforms to the preset trajectory, wherein the gesture sensor emits an optical signal outward and obtains the trajectory of the user's motion from the reflected light it receives; an image sensor, configured to collect image information in the current window and judge whether the image information matches the preset body part and, if so, to determine that an image of the preset body part of the user has been captured; and a range sensor, configured to detect the current distance between the mobile terminal and the preset body part and judge whether the current distance is less than or equal to the preset distance.

The gesture sensor comprises: a light source, configured to emit an optical signal outward; an optical controller, configured to receive the reflected light and focus it; a photodetector, configured to receive the focused reflected light and convert it into an electrical signal; and a controller, configured to receive the electrical signal and convert it into an application-readable format.

In one example of the present invention, the light source is an LED or a laser diode.

The mobile terminal of the embodiment of the present invention makes the interaction for placing a call more natural, user-friendly, and intelligent, and makes the interaction style friendlier.

In one embodiment of the present invention, the mobile terminal further comprises a storage module configured to store a contact list, wherein the contact list comprises contact names and the corresponding contact telephone numbers.

Further, the voice parsing module is connected to the storage module and is configured to read the contact list and judge whether the voice information matches a pre-stored contact name or contact telephone number.

Using two channels, three-dimensional gesture and voice, to locate the task of dialing a particular contact greatly increases the accuracy with which the task is located and avoids misoperation caused by an error in a single channel.

In one example of the present invention, the acquisition module is a receiver.

Additional aspects and advantages of the present invention will be set forth in part in the following description, and in part will become apparent from the following description or be learned through practice of the present invention.

Brief description of the drawings

The above and/or additional aspects and advantages of the present invention will become apparent and easy to understand from the following description of the embodiments taken in conjunction with the accompanying drawings, in which:

Fig. 1 is a flow chart of the automatic dialing method based on three-dimensional gestures and voice according to an embodiment of the present invention;

Fig. 2 is a schematic structural diagram of the gesture sensor in an embodiment of the present invention;

Fig. 3 is a detailed flow chart of the automatic dialing method based on three-dimensional gestures and voice according to one embodiment of the present invention;

Fig. 4 is a schematic structural diagram of the mobile terminal according to an embodiment of the present invention; and

Fig. 5 is a schematic structural diagram of the three-dimensional gesture detection module according to an embodiment of the present invention.

Embodiment

Embodiments of the present invention are described in detail below. Examples of the embodiments are shown in the accompanying drawings, in which identical or similar reference numerals denote, throughout, identical or similar elements or elements having identical or similar functions. The embodiments described below with reference to the drawings are exemplary, are intended only to explain the present invention, and should not be construed as limiting it.

In the present invention, unless otherwise expressly specified and limited, terms such as "mounted", "connected", "coupled", and "fixed" should be understood broadly. For example, a connection may be a fixed connection, a detachable connection, or an integral connection; it may be a mechanical connection or an electrical connection; it may be a direct connection or an indirect connection through an intermediary, or an internal connection between two elements. Those of ordinary skill in the art can understand the specific meanings of the above terms in the present invention according to the specific circumstances.

The automatic dialing method based on three-dimensional gestures and voice proposed according to an embodiment of the first aspect of the present invention is described below with reference to Fig. 1 to Fig. 3.

As shown in Fig. 1, the automatic dialing method provided by the embodiment of the present invention comprises the following steps:

S101, the mobile terminal detects whether the motion of the user conforms to a preset trajectory and, if so, starts the image sensor to detect whether an image of a preset body part of the user is captured and starts the range sensor to detect whether the distance between the mobile terminal and the preset body part of the user is less than or equal to a preset distance.

In one embodiment of the present invention, the mobile terminal uses a gesture sensor to detect whether the motion of the user conforms to the preset trajectory, wherein the gesture sensor emits an optical signal outward and obtains the trajectory of the user's motion from the reflected light it receives.

Specifically, as shown in Fig. 2, the gesture sensor comprises a light source 201, an optical controller 202, a photodetector 203, and a controller 204.

The light source 201 emits an optical signal outward. It is generally an LED (light-emitting diode) or a laser diode and usually produces infrared or near-infrared light, which is generally not perceptible to the user; the light is usually modulated, which can improve the resolution of the gesture sensor.

The optical controller 202 receives the reflected light and focuses it. That is, the optical controller 202 helps achieve ideal ambient illumination and focuses the reflected light onto the surface of the photodetector 203. In addition, a band-pass filter in the optical controller 202 filters out background light and other stray light that would degrade performance, so that only reflected light matching the optical frequency of the light source 201 can enter the light-sensitive element of the optical controller 202.

The photodetector 203 receives the focused reflected light and converts it into an electrical signal. That is, the photodetector 203 detects the filtered reflected light and converts it into an electrical signal for processing by the controller 204.

The controller 204 receives the electrical signal produced by the photodetector 203 and converts it into an application-readable format. For example, the controller 204 may be an ultra-high-speed ASIC (application-specific integrated circuit) or a DSP (digital signal processing) chip that processes the received information (for example, the electrical signal) and converts it into a format that an end-user application (for example, software in the mobile terminal) can understand.

In one embodiment of the present invention, starting the image sensor in step S101 to detect whether an image of the preset body part of the user is captured further comprises:

The image sensor collects image information in the current window and judges whether the image information matches the preset body part; if so, it is determined that an image of the preset body part of the user has been captured.

In one example of the present invention, the preset trajectory may be the trajectory along which the user moves the mobile terminal from a distance toward the body, and the preset body part may be the ear. In other words, the user brings the mobile terminal progressively closer to the body from a distance until it reaches the ear.

Specifically, the light source 201 emits an optical signal outward, including toward the user. The optical controller 202 receives reflected light from outside, including the light reflected by the user's body, and focuses it. The photodetector 203 converts the focused reflected light into an electrical signal and sends it to the controller 204. The controller 204 analyzes the electrical signal to derive the propagation path of the light and then judges whether this path conforms to the preset trajectory. If it does, the image sensor then detects whether an image of the user's ear is captured. The image sensor may be the camera of the mobile terminal.
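
The trajectory judgment described above can be sketched as follows. This is a minimal illustration only, assuming the controller 204 has already reduced the electrical signal to a series of distance estimates in centimetres; the function name, the sample format, and all thresholds are hypothetical and are not specified in this document.

```python
from typing import Sequence


def matches_approach_trajectory(
    distances_cm: Sequence[float],
    min_samples: int = 3,          # hypothetical: fewest samples accepted as a track
    min_travel_cm: float = 10.0,   # hypothetical: required far-to-near travel
    jitter_cm: float = 0.5,        # hypothetical: tolerated sensor noise
) -> bool:
    """Return True if the samples move from far to near (an approach)."""
    if len(distances_cm) < min_samples:
        return False
    # A sample may exceed the previous one only by the noise tolerance.
    for previous, current in zip(distances_cm, distances_cm[1:]):
        if current > previous + jitter_cm:
            return False
    # The overall travel must be large enough to count as a deliberate motion.
    return distances_cm[0] - distances_cm[-1] >= min_travel_cm
```

A real gesture sensor would likely work on richer data than a one-dimensional distance series; the sketch only shows the shape of a "far to near" test of the kind the preset trajectory in this example implies.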

S102, if the image sensor captures the preset body part of the user and the distance between the mobile terminal and the preset body part is less than or equal to the preset distance, the voice dialing mode is started.

That is, in this interaction, the mobile terminal uses the gesture sensor to detect that the user's motion of picking up the mobile terminal and bringing it close to the body conforms to the preset trajectory, and then uses the image sensor (for example, based on image information taken by the camera) to capture the user's body part, namely the above preset body part, for example the ear.

Specifically, the mobile terminal uses the gesture sensor to detect the user's motion of picking up the mobile terminal and bringing it close to the body, then uses the image sensor to determine that the mobile terminal is held against the preset body part of the user, for example the ear, and then uses the range sensor to determine that the distance between the mobile terminal and the ear is less than or equal to the preset distance (for example, 1 cm). When all three conditions are judged to match the phone-call scenario, the mobile terminal automatically starts the voice dialing mode.
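
The three conditions can be expressed as a single gate. The sketch below is illustrative and assumes each sensor's result has already been reduced to a simple value; the `SensorReadings` type and the function name are hypothetical, while the 1 cm default comes from the example in the text.

```python
from dataclasses import dataclass

PRESET_DISTANCE_CM = 1.0  # example value given in the text


@dataclass
class SensorReadings:
    trajectory_matched: bool    # gesture sensor: motion conforms to the preset trajectory
    preset_part_captured: bool  # image sensor: preset body part (e.g. ear) is in the image
    distance_cm: float          # range sensor: distance to the preset body part


def should_start_voice_dialing(
    readings: SensorReadings, preset_distance_cm: float = PRESET_DISTANCE_CM
) -> bool:
    """Start voice dialing mode only when all three conditions hold."""
    return (
        readings.trajectory_matched
        and readings.preset_part_captured
        and readings.distance_cm <= preset_distance_cm
    )
```

Requiring all three results together is what keeps an error in any single sensor from starting the voice dialing mode on its own.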

In one example of the present invention, the mobile terminal may be a mobile phone; it will be understood that the mobile terminal in examples of the present invention is not limited to this.

S103, in the voice dialing mode, the voice signal uttered by the user is collected and parsed to obtain voice information.

That is, the voice parsing module of the mobile terminal converts the collected voice signal uttered by the user into a voice command, which directly controls the mobile terminal to dial the telephone number.

S104, the contact telephone number corresponding to the voice information is dialed automatically.

That is, automatically dialing the contact telephone number corresponding to the voice information in step S104 further comprises:

Judging whether the voice information matches a contact name or contact telephone number pre-stored in the contact list, wherein the contact list comprises contact names and the corresponding contact telephone numbers; if so, automatically dialing the contact telephone number corresponding to the voice information. For example, when the user says "Xiao Ming", the mobile terminal immediately looks up "Xiao Ming" in the contact list and dials the telephone number of that contact. Alternatively, when the user speaks the telephone number to be dialed, the mobile terminal immediately looks up that number in the contact list and dials it.
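
The lookup in step S104 can be sketched as below. This assumes the voice information has already been recognized as text and that the contact list is a simple name-to-number mapping; the function names and the digit normalization are hypothetical additions for illustration and are not described in this document.

```python
import re
from typing import Dict, Optional


def normalize_number(text: str) -> str:
    """Keep only the digits so that '138-0000-0001' and '138 0000 0001' compare equal."""
    return re.sub(r"\D", "", text)


def resolve_number(voice_info: str, contacts: Dict[str, str]) -> Optional[str]:
    """Return the stored number to dial, or None if nothing in the list matches."""
    spoken = voice_info.strip()
    # Case 1: the user said a contact name.
    if spoken in contacts:
        return contacts[spoken]
    # Case 2: the user said a telephone number that is stored in the list.
    digits = normalize_number(spoken)
    if digits:
        for number in contacts.values():
            if normalize_number(number) == digits:
                return number
    return None
```

For example, with `contacts = {"Xiao Ming": "138-0000-0001"}`, both `resolve_number("Xiao Ming", contacts)` and `resolve_number("138 0000 0001", contacts)` resolve to the stored number, while an unknown name yields `None` so that the caller can ask for the voice input again.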

Specifically, the flow of the automatic dialing method based on three-dimensional gestures and voice of the embodiment of the present invention is described in detail below using a mobile phone as an example. As shown in Fig. 3, the method comprises the following steps:

S301, the phone detects whether the user's motion is that of picking up the phone and bringing it close to the body. If so, proceed to step S302; if not, return to step S301 and continue detecting.

S302, start the phone's image sensor and collect image information in the current window.

S303, judge whether the image information is the user's ear. If so, proceed to step S304; if not, return to step S302.

S304, the phone's image sensor has captured that the region the phone is close to is the user's ear.

S305, start the phone's range sensor.

S306, the phone's range sensor detects whether the distance between the phone and the user's ear is less than or equal to 1 cm. Preferably, in one example of the present invention, 1 cm is used as the preset distance. If so, proceed to step S307; if not, return to step S305.

S307, start the voice dialing mode.

S308, collect the voice signal uttered by the user and parse it to obtain voice information.

S309, judge whether the voice information matches a contact name or contact telephone number pre-stored in the phone. If so, proceed to the next step; if not, return to step S308 and collect the user's voice signal again.

S310, automatically dial the contact telephone number corresponding to the voice information.
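
Steps S301 to S310 can be read as a sequence of retry loops. The following sketch is one possible rendering under stated assumptions: the sensors are modelled as hypothetical callables, recognized utterances arrive as text, and `max_polls` is an added bound so that the sketch terminates, whereas the flow in Fig. 3 simply keeps returning to the earlier step.

```python
from typing import Callable, Dict, Iterable, Optional


def _wait_until(condition: Callable[[], bool], max_polls: int) -> bool:
    """Poll a condition until it holds or the (hypothetical) poll budget runs out."""
    for _ in range(max_polls):
        if condition():
            return True
    return False


def run_dial_flow(
    pickup_detected: Callable[[], bool],   # S301: gesture sensor result
    ear_in_frame: Callable[[], bool],      # S302-S304: image sensor result
    distance_cm: Callable[[], float],      # S305-S306: range sensor reading
    utterances: Iterable[str],             # S308: recognized voice information
    contacts: Dict[str, str],              # contact name -> telephone number
    preset_distance_cm: float = 1.0,
    max_polls: int = 100,
) -> Optional[str]:
    """Return the number that would be dialed at S310, or None if the flow stalls."""
    if not _wait_until(pickup_detected, max_polls):
        return None
    if not _wait_until(ear_in_frame, max_polls):
        return None
    if not _wait_until(lambda: distance_cm() <= preset_distance_cm, max_polls):
        return None
    # S307: voice dialing mode is now on; S308-S309 repeat until a match is found.
    for spoken in utterances:
        spoken = spoken.strip()
        if spoken in contacts:
            return contacts[spoken]
        if spoken in contacts.values():
            return spoken
    return None
```

Passing the sensors in as callables keeps the control flow testable without hardware; on a device they would wrap the actual sensor drivers.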

In one example of the present invention, a mobile phone is lying on a table with its screen locked. Xiao Mei walks over, picks up the phone, brings it to within 1 cm of her ear, and simply says "Mom". The phone then gives the voice feedback "Calling Mom", and a few seconds later the call to Mom is connected.

The automatic dialing method based on three-dimensional gestures and voice of the embodiment of the present invention combines three-dimensional gesture interaction and voice interaction, two natural and rich modes of interaction, and has the following advantages: (1) it simplifies the task flow of placing a call and makes the operation faster and more efficient; (2) it makes the interaction for placing a call more natural and user-friendly, and easy to understand, remember, and operate; (3) it makes the interaction style for placing a call friendlier and effectively reduces the embarrassment a user feels after making a mistake; (4) by using two channels, three-dimensional gesture and voice, to locate the task of dialing a contact, it greatly increases the accuracy with which the task is located and avoids misoperation caused by an error in a single channel; (5) in an emergency, it saves as much time as possible, avoids misoperation, and avoids danger and injury.

The mobile terminal proposed according to an embodiment of the second aspect of the present invention is described below with reference to Fig. 4 and Fig. 5.

As shown in Fig. 4, the mobile terminal comprises a three-dimensional gesture detection module 401, an acquisition module 402, a voice parsing module 403, and a dialing module 404.

The three-dimensional gesture detection module 401 is configured to detect whether the motion of the user conforms to the preset trajectory and, if so, to send a voice dialing start signal when an image of the preset body part of the user is captured and the distance between the mobile terminal and the preset body part of the user is less than or equal to the preset distance. The acquisition module 402 is configured to collect the voice signal uttered by the user. The voice parsing module 403 is connected to the acquisition module 402 and is configured to parse the voice signal to obtain voice information. The dialing module 404 is connected to the three-dimensional gesture detection module 401 and to the voice parsing module 403 and is configured to start the voice dialing mode after receiving the voice dialing start signal and to automatically dial the contact telephone number corresponding to the voice information. That is, the voice parsing module 403 converts the voice signal collected by the acquisition module 402 into a voice command, which directly controls the dialing module 404 to dial the telephone number.
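
The connections between modules 401, 403, and 404 can be illustrated with a small sketch. The class and method names are hypothetical and chosen only to mirror the description; the `place_call` callback stands in for whatever telephony interface a real terminal would use.

```python
from typing import Callable, List, Optional


class DialingModule:
    """Illustrative stand-in for dialing module 404."""

    def __init__(self, place_call: Callable[[str], None]) -> None:
        self._place_call = place_call
        self.voice_dialing_mode = False

    def on_start_signal(self) -> None:
        # Voice dialing start signal from three-dimensional gesture detection module 401.
        self.voice_dialing_mode = True

    def on_voice_information(self, number: Optional[str]) -> bool:
        # Result from voice parsing module 403; ignored until voice dialing mode is on.
        if not self.voice_dialing_mode or number is None:
            return False
        self._place_call(number)
        return True


calls: List[str] = []
dialer = DialingModule(calls.append)
ignored = dialer.on_voice_information("13900000002")  # no start signal yet
dialer.on_start_signal()
dialed = dialer.on_voice_information("13900000002")   # now the call is placed
```

In this sketch the first voice result is dropped because no start signal has arrived, which reflects the description that the dialing module starts the voice dialing mode only after receiving the signal from module 401.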

Further, in one embodiment of the present invention, as shown in Fig. 5, the three-dimensional gesture detection module 401 comprises a gesture sensor 501, an image sensor 502, and a range sensor 503.

The gesture sensor 501 is configured to detect whether the motion of the user conforms to the preset trajectory, wherein the gesture sensor emits an optical signal outward and obtains the trajectory of the user's motion from the reflected light it receives.

The image sensor 502 is configured to collect image information in the current window and judge whether the image information matches the preset body part; if so, it is determined that an image of the preset body part of the user has been captured.

The range sensor 503 is configured to detect the current distance between the mobile terminal and the preset body part and judge whether the current distance is less than or equal to the preset distance. Preferably, in one example of the present invention, the preset distance may be 1 cm.

That is, in this interaction, the gesture sensor 501 detects that the user's motion of picking up the mobile terminal and bringing it close to the body conforms to the preset trajectory, and the image sensor 502 then captures (for example, based on image information taken by the camera) the user's body part, namely the above preset body part. In one example of the present invention, the preset body part may be the ear.

Specifically, the gesture sensor 501 detects the user's motion of picking up the mobile terminal and bringing it close to the body, the image sensor 502 then determines that the mobile terminal is held against the preset body part of the user, for example the ear, and the range sensor 503 then determines that the distance between the mobile terminal and the ear is less than or equal to the preset distance (for example, 1 cm). When all three conditions are judged to match the phone-call scenario, the three-dimensional gesture detection module 401 automatically sends the voice dialing start signal.

Specifically, in one embodiment of the present invention, as shown in Fig. 2, the gesture sensor 501 comprises a light source 201, an optical controller 202, a photodetector 203, and a controller 204.

The light source 201 emits an optical signal outward. It is generally an LED or a laser diode and usually produces infrared or near-infrared light, which is generally not perceptible to the user; the light is usually modulated, which can improve the resolution of the gesture sensor 501.

The controller 204 receives the electrical signal produced by the photodetector 203 and converts it into an application-readable format. For example, the controller 204 may be an ultra-high-speed ASIC or a DSP chip that processes the received information (for example, the electrical signal) and converts it into a format that an end-user application (for example, software in the mobile terminal) can understand.

In one embodiment of the present invention, as shown in Fig. 4, the mobile terminal further comprises a storage module 405 configured to store a contact list, wherein the contact list comprises contact names and the corresponding contact telephone numbers.

Further, as shown in Fig. 4, the voice parsing module 403 is connected to the storage module 405 and is configured to read the contact list and judge whether the voice information matches a pre-stored contact name or contact telephone number. For example, when the acquisition module 402 collects the voice signal of the user saying "Xiao Ming", the voice parsing module 403 parses it and immediately looks up "Xiao Ming" in the contact list, and the dialing module 404 automatically dials the telephone number of the contact "Xiao Ming". Alternatively, when the acquisition module 402 collects the voice signal of the user speaking the telephone number to be dialed, the voice parsing module 403 parses it and immediately looks up that number in the contact list, and the dialing module 404 then automatically dials the number.

In one example of the present invention, the acquisition module 402 is a receiver or an audio monitoring module.

Using two channels, three-dimensional gesture and voice, to locate the task of dialing a contact greatly increases the accuracy with which the task is located and avoids misoperation caused by an error in a single channel.

In one embodiment of the present invention, the mobile terminal may be a mobile phone; it will be understood that the mobile terminal of the embodiment of the present invention is not limited to this.

In flow chart or any process of otherwise describing at this or method describe and can be understood to, represent to comprise that one or more is for realizing module, fragment or the part of code of executable instruction of the step of specific logical function or process, and the scope of the preferred embodiment of the present invention comprises other realization, wherein can be not according to order shown or that discuss, comprise according to related function by the mode of basic while or by contrary order, carry out function, this should be understood by embodiments of the invention person of ordinary skill in the field.

The logic and/or the step that in flow chart, represent or otherwise describe at this, for example, can be considered to for realizing the sequencing list of the executable instruction of logic function, may be embodied in any computer-readable medium, for instruction execution system, device or equipment (as computer based system, comprise that the system of processor or other can and carry out the system of instruction from instruction execution system, device or equipment instruction fetch), use, or use in conjunction with these instruction execution systems, device or equipment.With regard to this specification, " computer-readable medium " can be anyly can comprise, storage, communication, propagation or transmission procedure be for instruction execution system, device or equipment or the device that uses in conjunction with these instruction execution systems, device or equipment.The example more specifically of computer-readable medium (non-exhaustive list) comprises following: the electrical connection section (electronic installation) with one or more wirings, portable computer diskette box (magnetic device), random-access memory (ram), read-only memory (ROM), the erasable read-only memory (EPROM or flash memory) of editing, fiber device, and portable optic disk read-only memory (CDROM).In addition, computer-readable medium can be even paper or other the suitable medium that can print described program thereon, because can be for example by paper or other media be carried out to optical scanner, then edit, decipher or process in electronics mode and obtain described program with other suitable methods if desired, be then stored in computer storage.

It should be understood that the parts of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, multiple steps or methods may be implemented in software or firmware that is stored in a memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, they may be implemented by any one or a combination of the following techniques well known in the art: a discrete logic circuit having logic gates for implementing logic functions on data signals, an application-specific integrated circuit (ASIC) having suitable combinational logic gates, a programmable gate array (PGA), a field-programmable gate array (FPGA), and so on.

Those of ordinary skill in the art will understand that all or part of the steps of the methods in the above embodiments may be completed by a program instructing the relevant hardware. The program may be stored in a computer-readable storage medium and, when executed, performs one or a combination of the steps of the method embodiments.

In addition, the functional units in the embodiments of the present invention may be integrated into one processing module, each unit may exist physically on its own, or two or more units may be integrated into one module. The integrated module may be implemented in the form of hardware or in the form of a software functional module. If the integrated module is implemented in the form of a software functional module and is sold or used as an independent product, it may also be stored in a computer-readable storage medium.

The storage medium mentioned above may be a read-only memory, a magnetic disk, an optical disc, or the like.

In the description of this specification, reference to the terms "one embodiment", "some embodiments", "an example", "a specific example", or "some examples" means that a specific feature, structure, material, or characteristic described in connection with that embodiment or example is included in at least one embodiment or example of the present invention. In this specification, schematic uses of these terms do not necessarily refer to the same embodiment or example. Moreover, the specific features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.

Although embodiments of the present invention have been shown and described above, it is to be understood that the above embodiments are exemplary and are not to be construed as limiting the present invention. Those of ordinary skill in the art may make changes, modifications, substitutions, and variations to the above embodiments within the scope of the present invention without departing from its principles and spirit.

Claims

1. An automatic dialing method based on three-dimensional gestures and voice, characterized by comprising the following steps:

2. The automatic dialing method of claim 1, characterized in that the mobile terminal uses a gesture sensor to detect whether the user's action conforms to the preset trajectory, wherein the gesture sensor emits a light signal outward and obtains the trajectory of the user's action from the reflected light it receives.

3. The automatic dialing method of claim 1, characterized in that starting the image sensor to detect whether an image of the preset part of the user is captured comprises the following steps:

4. The automatic dialing method of claim 1, characterized in that automatically dialing the contact telephone number corresponding to the voice information comprises the following steps:

5. A mobile terminal, characterized by comprising:

a three-dimensional gesture detection module, configured to detect whether a user's action conforms to a preset trajectory and, if so, to send a voice-dialing start signal when an image of a preset part of the user is captured and the distance between the mobile terminal and the preset part of the user is less than or equal to a preset distance;

an acquisition module, configured to collect a voice signal uttered by the user;

a voice parsing module, connected to the acquisition module and configured to parse the voice signal to obtain voice information; and

a dialing module, connected to the three-dimensional gesture detection module and to the voice parsing module, and configured to start a voice dialing mode after receiving the voice-dialing start signal and to automatically dial the contact telephone number corresponding to the voice information.
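The gating behaviour of the dialing module in claim 5 can be illustrated with a minimal Python sketch. It is not the patented implementation: the class name `DialingModule`, the `contacts` dictionary, and the `place_call` callback are hypothetical stand-ins for the terminal's contact storage and telephony interface, and the sketch assumes the voice information has already been parsed into a plain string.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass
class DialingModule:
    """Dials only after the gesture detection module has sent its start signal."""

    contacts: Dict[str, str]           # hypothetical map: contact name -> number
    place_call: Callable[[str], None]  # hypothetical telephony hook
    voice_mode_on: bool = False

    def on_start_signal(self) -> None:
        # Voice-dialing start signal from the three-dimensional gesture module.
        self.voice_mode_on = True

    def on_voice_info(self, voice_info: str) -> Optional[str]:
        # Voice information from the voice parsing module is ignored
        # until voice dialing mode has been started.
        if not self.voice_mode_on:
            return None
        number = self.contacts.get(voice_info)
        if number is not None:
            self.place_call(number)
        return number
```

In this sketch, voice information that arrives before the start signal is dropped, which mirrors the ordering in the claim: the dialing module starts voice dialing mode only after receiving the start signal.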

6. The mobile terminal of claim 5, characterized in that the three-dimensional gesture detection module comprises:

a gesture sensor, configured to detect whether the user's action conforms to the preset trajectory, wherein the gesture sensor emits a light signal outward and obtains the trajectory of the user's action from the reflected light it receives;

an image sensor, configured to collect image information in the current window and to determine whether the image information matches the preset part and, if so, to determine that an image of the preset part of the user has been captured; and

a distance sensor, configured to detect the current distance between the mobile terminal and the preset part and to determine whether the current distance is less than or equal to the preset distance.
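The way the three sensors of claim 6 combine into a single start decision can be sketched as follows. This is an illustrative Python sketch under stated assumptions: the names `SensorReadings` and `should_send_start_signal` are hypothetical, the distance is assumed to be in centimetres, and each sensor is assumed to have already reduced its raw data to the values shown.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SensorReadings:
    trajectory_matches: bool  # gesture sensor: action conforms to preset trajectory
    part_in_image: bool       # image sensor: preset part of the user was captured
    distance_cm: float        # distance sensor: terminal to preset part (assumed cm)


def should_send_start_signal(r: SensorReadings, preset_distance_cm: float) -> bool:
    """Return True when all three conditions for voice dialing are satisfied."""
    # The trajectory check comes first; the other two sensors matter only
    # once the gesture has matched.
    if not r.trajectory_matches:
        return False
    return r.part_in_image and r.distance_cm <= preset_distance_cm
```

The comparison uses `<=` because the claim requires the current distance to be less than or equal to the preset distance.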

7. The mobile terminal of claim 6, characterized in that the gesture sensor comprises:

a light source, configured to emit a light signal outward;

an optical controller, configured to receive the reflected light and focus it;

a photodetector, configured to receive the focused reflected light and convert it into an electrical signal; and

a controller, configured to receive the electrical signal and convert it into an application-readable format.

8. The mobile terminal of claim 7, characterized in that the light source is an LED or a laser diode.

9. The mobile terminal of claim 5, characterized by further comprising a storage module configured to store a contact list, wherein the contact list comprises contact names and the corresponding contact telephone numbers.

10. The mobile terminal of claim 9, characterized in that the voice parsing module is connected to the storage module and is configured to read the contact list and to determine whether the voice information matches a pre-stored contact name or contact telephone number.
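The check in claim 10, whether the voice information is a stored contact name or a stored contact telephone number, can be sketched as a simple lookup. This is a hedged Python illustration rather than the claimed implementation: `resolve_number` and the dictionary layout are hypothetical, the voice information is assumed to be already transcribed to text, and comparing numbers by their digits only is an assumption made here to tolerate formatting differences.

```python
from typing import Dict, Optional


def _digits(text: str) -> str:
    """Keep only the decimal digits of a string."""
    return "".join(ch for ch in text if ch.isdigit())


def resolve_number(voice_info: str, contacts: Dict[str, str]) -> Optional[str]:
    """Return the stored number for a spoken name or spoken number, else None."""
    text = voice_info.strip()
    # Case 1: the voice information is a stored contact name.
    if text in contacts:
        return contacts[text]
    # Case 2: the voice information is a stored contact telephone number.
    spoken = _digits(text)
    if spoken:
        for number in contacts.values():
            if _digits(number) == spoken:
                return number
    return None
```

For example, with a contact list of `{"Zhang San": "138-0000-0000"}`, both the spoken name and the spoken digits resolve to the same stored number, while an unknown name yields `None`.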

11. The mobile terminal of claim 5, characterized in that the acquisition module is a receiver.