CN104092936A - Automatic focusing method and apparatus - Google Patents
- Publication number: CN104092936A (application CN201410261049.2A)
- Authority
- CN
- China
- Prior art keywords
- acoustic information
- sound source
- source position
- sound
- target object
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Landscapes
- Studio Devices (AREA)
Abstract
The invention relates to an automatic focusing method and apparatus, and belongs to the technical field of photography. The method comprises: during a focusing process, collecting sound information of the surrounding environment; analyzing, according to the sound information, a sound source position of the sound information; and automatically focusing on a target object at the sound source position. According to the invention, a sounding object is focused on by means of its sound source position. This solves the problem that a conventional touch-based automatic focusing method requires the user to control focusing through a touch screen and therefore cannot be used when the user is in a state in which it is inconvenient to operate the electronic device, for example when holding a tablet device with both hands or when controlling the electronic device with a remote controller. The effect of focusing normally even when the user cannot conveniently operate the electronic device is thereby achieved.
Description
Technical field
The present invention relates to the field of photography, and in particular to an automatic focusing method and apparatus.
Background technology
Focusing refers to the process in which a focusing mechanism in a camera changes the object distance and the image distance so that the photographed object is imaged clearly. With the rapid development of electronic equipment, electronic devices with a shooting function are used more and more frequently, and users' requirements on the focusing function are correspondingly higher.
An automatic focusing method provided in the related art comprises: during shooting, an electronic device displays a viewfinder picture on a touch screen; the electronic device receives a click signal from the user on the touch screen, and automatically focuses on the object clicked in the viewfinder picture.
In the process of implementing the present disclosure, the inventors found that the above manner has at least the following defect: although the focusing process of the above method is automatic, the selection of the focus target mainly relies on user operation. When the user is in a state in which it is inconvenient to operate the electronic device, for example holding a tablet device with both hands or controlling the electronic device with a remote controller, the above automatic focusing method cannot be used. In addition, the user's tap on the touch screen also shakes the electronic device, which affects the focusing process.
Summary of the invention
In order to solve the problem that the current touch-based automatic focusing method requires the user to control focusing through a touch screen and therefore cannot be used when the user is in a state in which it is inconvenient to operate the electronic device, embodiments of the present invention provide an automatic focusing method and apparatus. The technical solutions are as follows:
According to a first aspect of the embodiments of the present invention, an automatic focusing method is provided, the method comprising:
during a focusing process, collecting sound information of the surrounding environment;
analyzing, according to the sound information, a sound source position of the sound information; and
automatically focusing on a target object at the sound source position.
Optionally, the analyzing the sound source position of the sound information according to the sound information comprises:
when there are two or more pieces of sound information, parsing each piece of sound information to obtain its sound features;
detecting whether the sound features match the sound features of preset sound information; and
if the sound features match the sound features of the preset sound information, analyzing the sound source position of the sound information.
Optionally, the method further comprises:
obtaining a scene mode corresponding to the surrounding environment; and
selecting, from at least one piece of preset sound information, the sound information matching the scene mode as the preset sound information.
Optionally, the automatically focusing on the target object at the sound source position comprises:
preliminarily focusing on the sound source position to obtain image information;
identifying, in the image information, the target object at the sound source position;
detecting whether the target object is the sounding body of the sound information; and
if the target object is the sounding body of the sound information, automatically focusing on the target object.
Optionally, the preliminarily focusing on the sound source position to obtain image information comprises:
when the sound source position is not within the range of the current lens, adjusting the orientation and attitude of the lens according to the sound source position; and
preliminarily focusing on the sound source position through the adjusted lens to obtain image information.
Optionally, the method further comprises:
continuously collecting sound information of the target object; and
tracking and focusing on the target object according to the continuously collected sound information.
According to a second aspect of the embodiments of the present invention, an automatic focusing apparatus is provided, the apparatus comprising:
a sound collection module configured to collect, during a focusing process, sound information of the surrounding environment;
a sound source positioning module configured to analyze, according to the sound information, a sound source position of the sound information; and
an image collection module configured to automatically focus on a target object at the sound source position.
Optionally, the sound source positioning module further comprises:
a sound parsing unit, a feature detection unit and a sound source positioning unit;
the sound parsing unit is configured to, when there are two or more pieces of sound information, parse each piece of sound information to obtain its sound features;
the feature detection unit is configured to detect whether the sound features match the sound features of preset sound information; and
the sound source positioning unit is configured to, when the sound features match the sound features of the preset sound information, analyze the sound source position of the sound information.
Optionally, the apparatus further comprises:
a scene matching module configured to obtain a scene mode corresponding to the surrounding environment, and to select, from at least one piece of preset sound information, the sound information matching the scene mode as the preset sound information.
Optionally, the image collection module further comprises: a preliminary focusing unit, an image identification unit, an image detection unit and an automatic focusing unit;
the preliminary focusing unit is configured to preliminarily focus on the sound source position to obtain image information;
the image identification unit is configured to identify, in the image information, the target object at the sound source position;
the image detection unit is configured to detect whether the target object is the sounding body of the sound information; and
the automatic focusing unit is configured to, when the target object is the sounding body of the sound information, automatically focus on the target object.
Optionally, the preliminary focusing unit comprises:
a lens adjustment subunit configured to, when the sound source position is not within the range of the current lens, adjust the orientation and attitude of the lens according to the sound source position; and
a preliminary focusing subunit configured to preliminarily focus on the sound source position through the adjusted lens and obtain image information.
Optionally, the apparatus further comprises: a tracking focusing module;
the tracking focusing module is configured to continuously collect sound information of the target object, and to track and focus on the target object according to the continuously collected sound information.
According to a third aspect of the embodiments of the present invention, an automatic focusing apparatus is provided, the apparatus comprising:
a processor; and
a memory for storing instructions executable by the processor;
wherein the processor is configured to:
during a focusing process, collect sound information of the surrounding environment;
analyze, according to the sound information, a sound source position of the sound information; and
automatically focus on a target object at the sound source position.
The technical solutions provided by the embodiments of the present disclosure may include the following beneficial effects:
When the photographed object can make a sound, the sounding body is focused on by means of its sound source position. This solves the problem that the current touch-based automatic focusing method requires the user to control focusing through a touch screen and therefore cannot be used when the user is in a state in which it is inconvenient to operate the electronic device, and the problem that the user's tap on the touch screen shakes the electronic device and affects the focusing process. The effect achieved is that focusing proceeds normally even when it is inconvenient to operate the electronic device, and the focusing process does not introduce device shake caused by tapping the touch screen.
It should be understood that the above general description and the following detailed description are merely exemplary and explanatory, and do not limit the present disclosure.
Accompanying drawing explanation
The accompanying drawings herein are incorporated into and constitute a part of this specification, illustrate embodiments consistent with the present disclosure, and together with the specification serve to explain the principles of the present disclosure.
Fig. 1 is a flowchart of an automatic focusing method according to an exemplary embodiment;
Fig. 2A is a flowchart of an automatic focusing method according to another exemplary embodiment;
Fig. 2B is a schematic diagram of the appearance of a terminal according to an exemplary embodiment;
Fig. 2C is a schematic diagram of recording preset sound information in an automatic focusing process according to an exemplary embodiment;
Fig. 2D is a schematic diagram of two-dimensional localization of a sound source in an automatic focusing process according to an exemplary embodiment;
Fig. 2E is a schematic diagram of an automatic focusing process according to an exemplary embodiment;
Fig. 3A is a flowchart of an automatic focusing method according to another exemplary embodiment;
Fig. 3B is a schematic diagram of selecting a scene mode in an automatic focusing process according to an exemplary embodiment;
Fig. 3C is a schematic diagram of selecting preset sound information in an automatic focusing process according to an exemplary embodiment;
Fig. 3D is a schematic diagram of adjusting the orientation and attitude of a lens in an automatic focusing process according to an exemplary embodiment;
Fig. 4 is a block diagram of an automatic focusing apparatus according to an exemplary embodiment;
Fig. 5 is a block diagram of an automatic focusing apparatus according to another exemplary embodiment;
Fig. 6 is a block diagram of an automatic focusing apparatus according to another exemplary embodiment.
The above drawings show explicit embodiments of the present disclosure, which are described in more detail hereinafter. These drawings and textual descriptions are not intended to limit the scope of the disclosed concept in any way, but to illustrate the concept of the present disclosure for those skilled in the art by reference to specific embodiments.
Embodiment
Exemplary embodiments will now be described in detail, examples of which are shown in the accompanying drawings. Where the following description refers to the drawings, unless otherwise indicated, the same numerals in different drawings denote the same or similar elements. The implementations described in the following exemplary embodiments do not represent all implementations consistent with the present disclosure; rather, they are merely examples of apparatuses and methods consistent with some aspects of the present disclosure as detailed in the appended claims.
The terminal described herein may be any electronic product with a shooting function, such as a camera phone, a camera, a video camera or a surveillance camera.
Fig. 1 is a flowchart of an automatic focusing method according to an exemplary embodiment; this embodiment is illustrated by applying the automatic focusing method in a terminal. The automatic focusing method may comprise the following steps:
In step 102, during a focusing process, sound information of the surrounding environment is collected.
In step 104, a sound source position of the sound information is analyzed according to the sound information.
In step 106, a target object at the sound source position is automatically focused on.
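Purely as an illustration, the three-step flow above can be sketched as follows; the function names, the stubbed return values, and the position dictionary are hypothetical stand-ins and do not appear in the embodiments:

```python
# Hypothetical sketch of steps 102-106; all names are illustrative only.

def capture_ambient_sound():
    # Step 102: collect sound information of the surrounding environment
    # (stubbed here with two fake microphone channels).
    return {"mic_top": [0.1, 0.2], "mic_bottom": [0.1, 0.3]}

def locate_sound_source(sound_info):
    # Step 104: analyze the sound source position from the sound information
    # (a fixed result stands in for the real localization).
    return {"direction_deg": 30.0, "distance_m": 2.5}

def autofocus_on(position):
    # Step 106: focus automatically on the target object at that position.
    return f"focused at {position['direction_deg']} deg, {position['distance_m']} m"

sound = capture_ambient_sound()
position = locate_sound_source(sound)
print(autofocus_on(position))
```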
In summary, in the automatic focusing method provided in this embodiment, when the photographed object can make a sound, the sounding body is focused on by means of its sound source position. This solves the problem that the current touch-based automatic focusing method requires the user to control focusing through a touch screen and therefore cannot be used when the user is in a state in which it is inconvenient to operate the electronic device, for example holding a tablet device with both hands or controlling the electronic device with a remote controller or by voice control, as well as the problem that the user's tap on the touch screen shakes the electronic device and affects the focusing process. The effect achieved is that focusing proceeds normally even when it is inconvenient to operate the electronic device, and the focusing process does not introduce device shake caused by tapping the touch screen.
Fig. 2A is a flowchart of an automatic focusing method according to another exemplary embodiment; this embodiment is illustrated by applying the automatic focusing method in a terminal. The automatic focusing method may comprise the following steps:
In step 201, during a focusing process, sound information of the surrounding environment is collected.
Because this embodiment needs to obtain the sound source position of the sounding body, the sound information needs to be collected by two or more detection points with different orientations in a plane or in space, where each detection point may be a microphone.
Taking a mobile phone as an example of the terminal, and with reference to Fig. 2B, which shows the appearance of a mobile phone 20: there is one microphone 22 at the top of the mobile phone 20 and another microphone 24 at the bottom. These two microphones form two detection points that collect the sound information of the environment in which the mobile phone 20 is located. Optionally, in order to realize three-dimensional sound source localization, the two or more detection points may be implemented by a microphone array arranged in the terminal; the microphone array may be a three-element, four-element, five-element or six-element microphone array, and so on.
During focusing, after the terminal receives a start-focusing instruction, the two or more detection points with different orientations in a plane or in space begin to collect the sound information of the surrounding environment.
In step 202, the sound information is parsed to obtain the sound features of the sound information.
The ambient sound of the environment in which the terminal is located may be a mixture of two or more pieces of sound information. One part is sound information that is effective for the focusing process, such as the sound made by the photographed object; the other part is sound information that is ineffective for, or even interferes with, the focusing process, such as environmental noise. Usually, the terminal only needs to focus according to the effective sound information. For this purpose, the terminal can identify the sound information to be used for this focusing by analyzing the sound features of the sound information.
The sound feature obtained in this embodiment may be, but is not limited to, a cepstral feature. A cepstral feature is obtained by applying an inverse Fourier transform to the log power spectrum of the sound information; it can effectively separate the vocal tract characteristics from the excitation characteristics, and can therefore better reveal the essential characteristics of the sound information.
After collecting at least one piece of sound information of the surrounding environment, the terminal parses each piece of sound information to obtain its cepstral feature.
It should be noted that only the cepstral feature of the sound information obtained by any one of the two or more detection points needs to be obtained.
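As a minimal sketch of the cepstral feature described above (inverse Fourier transform of the log power spectrum), assuming a single frame of audio; the frame length, sampling rate and the small epsilon guarding the logarithm are illustrative choices, not values from the embodiments:

```python
import numpy as np

def cepstrum(frame):
    # Forward transform, then log power spectrum, then inverse transform:
    # the real part of the result is the (real) cepstrum of the frame.
    spectrum = np.fft.fft(frame)
    log_power = np.log(np.abs(spectrum) ** 2 + 1e-12)  # epsilon avoids log(0)
    return np.real(np.fft.ifft(log_power))

# Example: a 1 kHz tone sampled at 8 kHz, 256-sample frame (assumed values).
t = np.arange(256) / 8000.0
frame = np.sin(2 * np.pi * 1000.0 * t)
feat = cepstrum(frame)
print(feat.shape)  # one cepstral coefficient per sample in the frame
```

In practice a windowed frame and a truncated set of low-order coefficients would typically be used before matching; the sketch keeps the full coefficient vector for simplicity.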
In step 203, it is detected whether the sound features match the sound features of preset sound information.
The preset sound information may be sound information built into the terminal itself, or sound information prerecorded by the user in the terminal.
For example, user A often uses the terminal to shoot images or videos of his own son; user A can prerecord his son's sound information as the preset sound information.
For another example, user B often uses the terminal to shoot the teaching content of his own teacher; user B can prerecord the teacher's sound information as the preset sound information. As shown in Fig. 2C, the user can click the record button 26 in the camera settings interface 21 and record a segment of sound as the preset sound information.
During focusing, the terminal detects whether the obtained sound features match the sound features of the preset sound information. If they match, step 204 is entered.
It should be noted that if n pieces of sound information are collected from the surrounding environment and there is 1 piece of preset sound information, this step needs to be performed n times; if 1 piece of sound information is collected and there are m pieces of preset sound information, this step needs to be performed m times; and if n pieces of sound information are collected and there are m pieces of preset sound information, this step needs to be performed n*m times.
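The n*m comparison count above can be sketched as a nested loop; `features_match` is a hypothetical placeholder for the real feature comparison (the description uses cepstral features, compared later by DTW):

```python
# Sketch of matching n captured sounds against m preset sounds: every pair
# is compared once, so n*m comparisons in total.

def match_all(captured, presets, features_match):
    comparisons = 0
    matches = []
    for c in captured:
        for p in presets:
            comparisons += 1
            if features_match(c, p):
                matches.append((c, p))
    return matches, comparisons

# Toy example with string labels standing in for feature vectors.
matches, n_cmp = match_all(["a", "b", "c"], ["a", "x"], lambda c, p: c == p)
print(n_cmp)  # 3 captured * 2 presets = 6 comparisons
```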
In step 204, if the sound features match the sound features of the preset sound information, the sound source position of the sound information is analyzed.
If the voice recognition model of the obtained sound information matches the voice recognition model of the preset sound information, the terminal analyzes the sound source position of the sound information.
In this embodiment, the sound source position can be analyzed from the differences in the arrival times of the same sound information at the two or more detection points with different orientations in a plane or in space.
The sound source position may include a sound source direction and a sound source distance. The sound source direction refers to the direction of the sound source relative to the terminal, and the sound source distance refers to the distance between the sound source and the terminal.
This step may comprise the following substeps:
1. Obtain the time differences with which the same sound information arrives at the two or more detection points with different orientations in a plane or in space.
Because the spatial position of each detection point is different, the times at which the same sound information arrives at the detection points differ, so there are time differences between them.
2. According to the arrival-time differences of the same sound information at the different detection points, the spatial distances between the detection points and a delay-difference algorithm, calculate the sound source direction and sound source distance of the sound information relative to the terminal.
As shown in Fig. 2D, taking two-dimensional sound source localization as an example: because the distance a between the sound source 23 and microphone 27a differs from the distance b between the sound source 23 and microphone 27b, the sound emitted by the sound source 23 arrives at microphones 27a and 27b at different times. From the distance c between the two microphones and a delay-difference algorithm, the sound source direction α and the sound source distance d between the sound source 23 and the terminal can be calculated. Of course, with three or more detection points, three-dimensional sound source localization can also be realized.
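A hedged sketch of the two-microphone geometry of Fig. 2D: under a far-field assumption, the path difference (speed of sound times the arrival-time difference) and the microphone spacing c give the source direction. Recovering the distance d as well, as the description states, requires a near-field model or additional detection points, which is omitted here; the speed-of-sound constant is an assumed room-temperature value:

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value at room temperature

def source_direction_deg(tdoa_s, mic_spacing_m):
    # Path difference between the two microphones, in metres.
    delta = SPEED_OF_SOUND * tdoa_s
    # Clamp against rounding so asin stays in its domain.
    ratio = max(-1.0, min(1.0, delta / mic_spacing_m))
    # Far-field approximation: sin(angle) = path difference / spacing.
    return math.degrees(math.asin(ratio))

# Example: microphones 0.1 m apart, sound arrives 0.2 ms earlier at one.
print(source_direction_deg(2.0e-4, 0.1))
```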
In step 205, the target object at the sound source position is automatically focused on.
After determining the sound source position, the terminal can focus automatically according to the sound source position.
For example, as shown in Fig. 2E, a child 29 is at the sound source position. The terminal collects the sound information of the child 29, and when the sound features of the collected sound information match the sound features of the preset sound information (the child's), the terminal localizes the child 29 and obtains the child's sound source position. The terminal can then focus automatically according to the sound source position of the child 29, for subsequent photographing or video recording.
In summary, in the automatic focusing method provided in this embodiment, when the photographed object can make a sound, the sounding body is focused on by means of its sound source position. This solves the problem that the current touch-based automatic focusing method requires the user to control focusing through a touch screen and therefore cannot be used when the user is in a state in which it is inconvenient to operate the electronic device, for example holding a tablet device with both hands or controlling the electronic device with a remote controller or by voice control, as well as the problem that the user's tap on the touch screen shakes the electronic device and affects the focusing process. The effect achieved is that focusing proceeds normally even when it is inconvenient to operate the electronic device, and the focusing process does not introduce device shake caused by tapping the touch screen.
Fig. 3A shows a flowchart of an automatic focusing method provided by another embodiment of the present invention; this embodiment is illustrated by applying the automatic focusing method in a terminal. The automatic focusing method may comprise the following steps:
In step 301, during a focusing process, sound information of the surrounding environment is collected.
Because this embodiment needs to obtain the sound source position of the sounding body, the sound information needs to be collected by two or more detection points with different orientations in a plane or in space, where each detection point may be a microphone.
Optionally, in order to realize three-dimensional sound source localization, the two or more detection points may be implemented by a microphone array arranged in the terminal; the microphone array may be a three-element, four-element, five-element or six-element microphone array, and so on.
During focusing, after the terminal receives a start-focusing instruction, the two or more detection points with different orientations in a plane or in space begin to collect the sound information of the surrounding environment.
In step 302, the sound information is parsed to obtain the sound features of the sound information.
The ambient sound of the environment in which the terminal is located may be a mixture of two or more pieces of sound information. One part is sound information that is effective for the focusing process, such as the sound made by the photographed object; the other part is sound information that is ineffective for, or even interferes with, the focusing process, such as environmental noise. Usually, the terminal only needs to focus according to the effective sound information. For this purpose, the terminal can identify the sound information to be used for this focusing by analyzing the sound features of the sound information.
The sound feature obtained in this embodiment may be, but is not limited to, a cepstral feature. A cepstral feature is obtained by applying an inverse Fourier transform to the log power spectrum of the sound information; it can effectively separate the vocal tract characteristics from the excitation characteristics, and can therefore better reveal the essential characteristics of the sound information.
After collecting at least one piece of sound information of the surrounding environment, the terminal parses each piece of sound information to obtain its cepstral feature.
It should be noted that only the cepstral feature of the sound information obtained by any one of the two or more detection points needs to be obtained.
In step 303, a scene mode corresponding to the surrounding environment is obtained.
The terminal can obtain the scene mode corresponding to the surrounding environment in the following two ways:
1) Receiving the scene mode selected by the user from at least one preset scene mode.
That is, the terminal can provide several scene modes in advance, and the user selects among them; the terminal then receives the scene mode the user selected from the at least one preset scene mode. The scene modes include, but are not limited to: a children scene mode, a party scene mode, a racetrack scene mode, a classroom scene mode, a conference scene mode, and so on.
For example, if the user needs to shoot people, the party scene mode is selected; if the user needs to shoot automobiles, the racetrack scene mode is selected; if the user needs to shoot a teacher's lecture, the classroom scene mode is selected, as shown in Fig. 3B.
2) The terminal automatically selects the scene mode according to the current geographic position of the environment.
For example, if the terminal determines through GPS positioning that the current geographic position is an assembly place, the terminal sets the scene mode to the party scene mode; if the current geographic position is a racetrack, the terminal sets the scene mode to the racetrack scene mode; if the current geographic position is a classroom, the terminal sets the scene mode to the classroom scene mode.
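The automatic selection in way 2) amounts to a lookup from a place category to a scene mode. A minimal sketch, assuming the place category has already been resolved from the GPS position (the lookup table and the fallback behaviour are illustrative assumptions, not specified in the embodiments):

```python
# Hypothetical mapping from a resolved place category to a scene mode,
# following the examples in the description.
SCENE_BY_PLACE = {
    "assembly_place": "party",
    "racetrack": "racetrack",
    "classroom": "classroom",
}

def scene_mode_for(place_category, default="children"):
    # Fall back to a default scene mode for unknown locations
    # (an assumption; the description does not specify a fallback).
    return SCENE_BY_PLACE.get(place_category, default)

print(scene_mode_for("classroom"))  # classroom
print(scene_mode_for("mountain"))   # children (fallback)
```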
In step 304, the sound information matching the scene mode is selected from at least one piece of preset sound information as the preset sound information.
If a plurality of pieces of sound information are prestored in the terminal, this step can determine which of the at least one piece of prestored sound information the sound information collected in step 301 is to be matched against.
The preset sound information may be sound information built into the terminal itself, or sound information prerecorded by the user in the terminal.
For example, the terminal provides a racetrack scene mode and several pieces of sound information under that mode, each corresponding to a kind of engine sound.
For example, user A1 often uses the terminal to shoot images or videos of his own son; user A1 can prerecord the sound information of his son A2 as preset sound information.
For another example, user B1 often uses the terminal to shoot the teaching content of his own teacher; user B1 can prerecord the sound information of the teacher B2 as preset sound information.
Table 1
After determining the current scene mode, the terminal selects, from the at least one piece of preset sound information, the sound information matching the current scene mode as the preset sound information used in this matching process.
For example, in the children scene mode, the terminal selects the sound information of the son A2 as the preset sound information used in this matching process.
In the racetrack scene mode, the terminal can display the user interface 32 shown in Fig. 3C, receive the sound information "2.0 engine" 34 selected by the user in the user interface 32, and use this sound information 34 as the preset sound information for this matching process.
In the classroom scene mode, the terminal selects the sound information of the teacher B2 as the preset sound information used in this matching process.
This step makes it possible to focus on a specific target. For example, among a group of children, to focus only on one's own child, the child's sound information is prestored, and during shooting the collected sound information is matched against the preset sound information of one's own child.
In step 305, it is detected whether the sound features match the sound features of the preset sound information.
The terminal detects whether the sound features match the sound features of the preset sound information.
The terminal can establish a voice recognition model of the sound information by performing sound modeling according to the cepstral features of the sound information.
In this embodiment, whether the sound features match the sound features of the preset sound information can be detected by the DTW (Dynamic Time Warping) algorithm; that is, the DTW algorithm detects whether the voice recognition model of the obtained sound information matches the voice recognition model of the preset sound information. DTW is a non-linear warping algorithm that combines time warping with distance measure calculation.
After establishing the voice recognition models of the obtained sound information and of the preset sound information, the terminal can detect by the DTW algorithm whether the two voice recognition models match.
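As a minimal sketch of the DTW distance underlying this matching step: the classic dynamic-programming recurrence aligns two sequences non-linearly in time. Real systems would compare sequences of cepstral feature vectors and apply a match threshold; plain 1-D sequences are used here purely for illustration:

```python
def dtw_distance(a, b):
    # Classic DTW: cost[i][j] is the best cumulative alignment cost of
    # a[:i] against b[:j].
    n, m = len(a), len(b)
    INF = float("inf")
    cost = [[INF] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            # Best of insertion, deletion, match: the DTW recurrence.
            cost[i][j] = d + min(cost[i - 1][j],
                                 cost[i][j - 1],
                                 cost[i - 1][j - 1])
    return cost[n][m]

# A time-stretched copy of a sequence aligns at zero cost...
print(dtw_distance([1, 2, 3], [1, 1, 2, 2, 3, 3]))  # 0.0
# ...while a dissimilar sequence accumulates a large cost.
print(dtw_distance([1, 2, 3], [5, 5, 5]))
```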
It should be noted that step 303, step 304 and step 305 are optional steps.
In step 306, if the sound features match the sound features of the preset sound information, the sound source position of the sound information is analyzed.
If the voice recognition model of the obtained sound information matches the voice recognition model of the preset sound information, the sound source position of the sound information is analyzed.
In this embodiment, the sound source position can be determined from the time differences with which the same sound information is obtained by the two or more detection points with different orientations in a plane or in space.
The sound source position may include a sound source direction and a sound source distance. The sound source direction refers to the direction of the sound source relative to the terminal, and the sound source distance refers to the distance between the sound source and the terminal.
This step may comprise the following substeps:
1. Obtain the time differences with which the same sound information arrives at the two or more detection points with different orientations in a plane or in space.
Because the spatial position of each detection point is different, the times at which the same sound information arrives at the detection points differ, so there are time differences between them.
2. According to the arrival-time differences of the same sound information at the different detection points, the spatial distances between the detection points and a delay-difference algorithm, calculate the sound source direction and sound source distance of the sound information relative to the terminal.
In step 307, preliminary focusing is performed on the sound source position to obtain image information;
The terminal performs preliminary focusing on the sounding object according to the sound source position: from the sound source position the terminal can derive the distance between the target object and the terminal, and adjust the lens to focus at that distance.
Because the sound source position may not lie within the current lens range, this step may include the following sub-steps:
1. When the sound source position is not within the current lens range, adjust the orientation and attitude of the lens according to the sound source position;
Because the sound source position may lie to the side of or below the terminal, the terminal may adjust the orientation and attitude of the lens through an internal mechanical structure.
As shown in Fig. 3D, the terminal includes an electrically controlled rotating bracket 36. After obtaining the sound source position of sounding object A, the terminal detects whether the sound source position of sounding object A is within the current viewfinder range. If sounding object A is not within the current viewfinder range, the terminal calculates the angle x between the current shooting optical axis and the position of sounding object A and sends this information to the rotating bracket 36; the rotating bracket 36 rotates until the current shooting optical axis is aligned with sounding object A, and preliminary focusing is then performed.
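The angle x sent to the bracket can be sketched as the signed difference between the source bearing and the bearing of the current optical axis; the wrap-to-(-180, 180] degree convention and the sign choice are illustrative assumptions:

```python
def rotation_needed(source_bearing_deg, axis_bearing_deg):
    """Signed rotation in degrees that brings the optical axis onto the
    sound source; positive is counter-clockwise, result in (-180, 180]."""
    x = (source_bearing_deg - axis_bearing_deg) % 360.0
    return x - 360.0 if x > 180.0 else x
```

Wrapping keeps the bracket turning through the shorter arc, e.g. from 350 degrees to 30 degrees it rotates +40 degrees rather than -320.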
2. Perform preliminary focusing on the sound source position through the adjusted lens, and obtain image information.
The terminal obtains the image information of the preliminary focusing; this image information may be the whole image obtained directly when the lens performs preliminary focusing, or partial image information of the focusing area.
In step 308, the target object at the sound source position is identified in the image information;
The terminal identifies the main subject in the image information through image recognition technology and takes this subject as the target object.
In step 309, it is detected whether the target object is the sounding object of the sound information;
Because the detected target object may not be the real sounding object at the sound source, the terminal also needs to detect whether the target object in the image information is the sounding object of the sound information.
As one implementation, this step includes the following sub-steps:
1. Query the preset image information associated with the preset sound information;
For example, if the preset sound information matched in step 305 is a teacher's voice, the image information associated with this preset sound information is a photo of that teacher; if the matched preset sound information is a child's voice, the associated image information is a photo of that child; if the matched preset sound information is a car engine sound, the associated image information is a photo of the car corresponding to that engine sound.
2. Detect whether the preset image information associated with the preset sound information matched by the sound information matches the image information of the target object;
3. If they match, determine that the target object is the sounding object of the sound information.
This embodiment may use one of the most common image matching methods, a frequency-domain matching algorithm: through a time-frequency transform, spatial-domain data are converted into frequency-domain data, and a similarity measure then determines the matching parameters between the two images. The transform adopted in this embodiment may be the Fourier transform, and the similarity measure may be the phase correlation.
The terminal may convert the acquired image information into frequency-domain data through the Fourier transform, and then use the phase correlation to detect the degree of matching between the preset image information and the image information of the target object. A threshold may be set here: if the detection result is less than the threshold, the preset image information is considered to match the image information of the target object, and the target object is determined to be the sounding object; if the detection result is greater than the threshold, the preset image information is considered not to match the image information of the target object.
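A minimal sketch of the Fourier-transform-plus-phase-correlation matching described above. Note that the measure here is a similarity peak (near 1 for a matching, possibly shifted, image), so the comparison direction is the opposite of a distance measure; the 0.5 threshold is an illustrative assumption:

```python
import numpy as np

def phase_correlation_peak(img_a, img_b):
    """Phase-correlation similarity between two equal-size grayscale images.

    Returns the peak of the inverse transform of the normalized cross-power
    spectrum: close to 1.0 for a (possibly translated) copy, near 0 for
    unrelated images.
    """
    fa = np.fft.fft2(img_a)
    fb = np.fft.fft2(img_b)
    cross = fa * np.conj(fb)
    cross /= np.abs(cross) + 1e-12        # keep phase only
    corr = np.fft.ifft2(cross).real
    return corr.max()

def images_match(img_a, img_b, threshold=0.5):
    """Similarity measure, so a match is a peak ABOVE the threshold."""
    return phase_correlation_peak(img_a, img_b) >= threshold
```

Because only the phase is kept, the peak is insensitive to pure translation between the two images.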
In step 310, if the target object is the sounding object of the sound information, automatic focusing is performed on the target object;
If the preset image information matches the image information of the target object, further focusing is performed on the target object; if they do not match, no operation is performed and the user is prompted.
In step 311, sound information of the target object is continuously collected;
Because the target object may move, if step 310 determines that the target object is the sounding object of the sound information, the terminal continuously collects sound information of the sounding object.
In step 312, tracking focusing is performed on the target object according to the continuously collected sound information.
If the continuously collected sound information is detected, through steps 302 to 305, to match the preset sound information, tracking focusing on the target object continues.
In summary, when the photographed subject can make sound, this embodiment focuses on the sounding object through the sound source position. This solves the problems of current touch-based automatic focusing methods: because the user must control focusing through the touch screen, such methods cannot be used when the user is in a state in which operating the electronic device is inconvenient, such as holding a tablet device with both hands or controlling the device by remote control or voice control; moreover, the user's tapping of the touch screen can shake the device and disturb the focusing process. The effect achieved is that focusing proceeds normally even when operating the device is inconvenient, and the focusing process is not disturbed by device shake caused by tapping the screen.
By pre-focusing on the sound source position and checking whether the image information obtained by pre-focusing corresponds to the acquired sound information, this embodiment increases the accuracy of focusing on the sounding object.
It should be added that the acquired sound information is matched against the preset sound information, and focusing is performed according to the sound information only when they match. This makes the method more practical: the terminal can accurately focus on the object to be photographed even in a noisy environment. For example, when taking a selfie in a noisy park, the user can make the terminal focus only on himself or herself by speaking.
It should be added that when preliminary focusing is performed on the sound source position in step 307, the lens direction may be rotated so that the sound source position falls within the viewfinder range of the lens, the rotation angle being determined from the sound source position information; likewise, when tracking focusing is performed on the target object in step 312, the lens direction may also be rotated so that the target object stays within the viewfinder range, achieving better tracking focusing. This feature allows the method to be applied flexibly to surveillance: when the terminal serves as a surveillance camera, a very large area can be monitored accurately and flexibly, with a certain intelligent monitoring effect, because as soon as sound is collected in the monitored area, the camera turns toward the sound direction, focuses and frames the shot. The feature also suits tracking photography of moving objects: on a racing circuit, for example, several cameras applying this method can be placed around the track with the scene mode set to a racing scene, and the cameras will automatically track, focus on and shoot passing race cars, greatly saving manpower.
It should be added that when tracking focusing is performed on the target object in step 312, the sounding object being tracked may be re-identified at any time through steps 307 to 310 to improve tracking accuracy, determining whether the currently tracked target object is correct; if not, the automatic focusing process restarts from step 301. This feature greatly improves the reliability of tracking focusing. For example, at a concert, several cameras applying this method can be placed around the stage; while the singer performs, the cameras can track, focus on and shoot the singer fairly accurately. Because the concert venue is crowded and chaotic, ordinary tracking focusing may go wrong, but this feature greatly improves the reliability of tracking focusing and ensures that the tracked target is always the singer.
The following are device embodiments of the present disclosure, which may be used to perform the method embodiments of the present disclosure. For details not disclosed in the device embodiments, please refer to the method embodiments of the present disclosure.
Fig. 4 is a block diagram of an automatic focusing apparatus according to an exemplary embodiment. The apparatus may be implemented in software, in hardware, or in a combination of both, as all or part of a terminal. The apparatus includes:
a sound acquisition module 410, configured to collect sound information of the surrounding environment during focusing;
a sound source positioning module 420, configured to analyze the sound source position of the sound information according to the sound information;
an image acquisition module 430, configured to automatically focus on the target object at the sound source position.
In summary, when the photographed subject can make sound, this embodiment focuses on the sounding object through the sound source position. This solves the problems of current touch-based automatic focusing methods: because the user must control focusing through the touch screen, such methods cannot be used when the user is in a state in which operating the electronic device is inconvenient, such as holding a tablet device with both hands or controlling the device by remote control or voice control; moreover, the user's tapping of the touch screen can shake the device and disturb the focusing process. The effect achieved is that focusing proceeds normally even when operating the device is inconvenient.
Fig. 5 is a block diagram of an automatic focusing apparatus according to an exemplary embodiment. The apparatus may be implemented in software, in hardware, or in a combination of both, as all or part of a terminal. The apparatus includes:
a sound acquisition module 410, configured to collect sound information of the surrounding environment during focusing;
a sound source positioning module 420, configured to analyze the sound source position of the sound information according to the sound information;
an image acquisition module 430, configured to automatically focus on the target object at the sound source position.
Optionally, the sound source positioning module 420 further includes:
a sound parsing unit 421, a feature detection unit 422 and a sound source positioning unit 423;
the sound parsing unit 421 is configured to parse the sound information to obtain the sound features of the sound information;
the feature detection unit 422 is configured to detect whether the sound features match the preset sound information;
the sound source positioning unit 423 is configured to analyze the sound source position of the sound information when the sound features match the sound features of the preset sound information.
Optionally, the apparatus further includes:
a scene matching module 440, configured to obtain the scene mode corresponding to the surrounding environment and to select, from at least one piece of preset sound information, the sound information matching the scene mode as the preset sound information.
Optionally, the image acquisition module 430 further includes: a preliminary focusing unit 431, an image recognition unit 432, an image detection unit 433 and an automatic focusing unit 434;
the preliminary focusing unit 431 is configured to perform preliminary focusing on the sound source position and obtain image information;
the image recognition unit 432 is configured to identify the target object at the sound source position in the image information;
the image detection unit 433 is configured to detect whether the target object is the sounding object of the sound information;
the automatic focusing unit 434 is configured to automatically focus on the target object when the target object is the sounding object of the sound information.
Optionally, the preliminary focusing unit 431 includes:
a lens adjustment subunit 431a, configured to adjust the orientation and attitude of the lens according to the sound source position when the sound source position is not within the current lens range;
a preliminary focusing subunit 431b, configured to perform preliminary focusing on the sound source position through the adjusted lens and obtain image information.
Optionally, the apparatus further includes: a tracking focusing module 450;
the tracking focusing module 450 is configured to continuously collect sound information of the target object and to perform tracking focusing on the target object according to the continuously collected sound information.
In summary, when the photographed subject can make sound, this embodiment focuses on the sounding object through the sound source position. This solves the problems of current touch-based automatic focusing methods: because the user must control focusing through the touch screen, such methods cannot be used when the user is in a state in which operating the electronic device is inconvenient, such as holding a tablet device with both hands or controlling the device by remote control or voice control; moreover, the user's tapping of the touch screen can shake the device and disturb the focusing process. The effect achieved is that focusing proceeds normally even when operating the device is inconvenient, and the focusing process is not disturbed by device shake caused by tapping the screen.
Fig. 6 is a block diagram of an automatic focusing device 600 according to an exemplary embodiment. For example, the device 600 may be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a game console, a tablet device, a medical device, fitness equipment, a personal digital assistant, and the like.
Referring to Fig. 6, the device 600 may include one or more of the following components: a processing component 602, a memory 604, a power component 606, a multimedia component 608, an audio component 610, an input/output (I/O) interface 612, a sensor component 614, and a communication component 616.
The processing component 602 generally controls the overall operation of the device 600, such as operations associated with display, telephone calls, data communication, camera operation and recording. The processing component 602 may include one or more processors 620 to execute instructions so as to complete all or part of the steps of the above method. In addition, the processing component 602 may include one or more modules to facilitate interaction between the processing component 602 and other components; for example, it may include a multimedia module to facilitate interaction between the multimedia component 608 and the processing component 602.
The memory 604 is configured to store various types of data to support the operation of the device 600. Examples of such data include instructions for any application or method operated on the device 600, contact data, phonebook data, messages, pictures, video, and so on. The memory 604 may be implemented by any type of volatile or non-volatile storage device, or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic disk or optical disk.
The power component 606 provides power to the various components of the device 600. It may include a power management system, one or more power supplies, and other components associated with generating, managing and distributing power for the device 600.
The multimedia component 608 includes a screen providing an output interface between the device 600 and the user. In some embodiments, the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, it may be implemented as a touch screen to receive input signals from the user. The touch panel includes one or more touch sensors to sense touches, swipes and gestures on the touch panel. The touch sensors may sense not only the boundary of a touch or swipe action but also the duration and pressure associated with the touch or swipe. In some embodiments, the multimedia component 608 includes a front camera and/or a rear camera. When the device 600 is in an operating mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front or rear camera may be a fixed optical lens system or have focal length and optical zoom capability.
The audio component 610 is configured to output and/or input audio signals. For example, the audio component 610 includes a microphone (MIC) configured to receive external audio signals when the device 600 is in an operating mode, such as a call mode, a recording mode or a speech recognition mode. The received audio signal may be further stored in the memory 604 or transmitted via the communication component 616. In some embodiments, the audio component 610 also includes a speaker for outputting audio signals.
The I/O interface 612 provides an interface between the processing component 602 and peripheral interface modules such as a keyboard, a click wheel or buttons. The buttons may include, but are not limited to, a home button, volume buttons, a start button and a lock button.
The sensor component 614 includes one or more sensors to provide status assessments of various aspects of the device 600. For example, the sensor component 614 can detect the open/closed state of the device 600 and the relative positioning of components, such as the display and keypad of the device 600; it can also detect a change in position of the device 600 or of one of its components, the presence or absence of user contact with the device 600, the orientation or acceleration/deceleration of the device 600, and temperature changes of the device 600. The sensor component 614 may include a proximity sensor configured to detect the presence of nearby objects without any physical contact, and may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor component 614 may further include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor or a temperature sensor.
The communication component 616 is configured to facilitate wired or wireless communication between the device 600 and other devices. The device 600 can access a wireless network based on a communication standard, such as WiFi, 2G or 3G, or a combination thereof. In one exemplary embodiment, the communication component 616 receives a broadcast signal or broadcast-related information from an external broadcast management system via a broadcast channel. In one exemplary embodiment, the communication component 616 further includes a near field communication (NFC) module to facilitate short-range communication; the NFC module may be implemented based on radio frequency identification (RFID) technology, Infrared Data Association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies.
In an exemplary embodiment, the device 600 may be implemented by one or more application-specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field-programmable gate arrays (FPGAs), controllers, microcontrollers, microprocessors or other electronic components, for performing the automatic focusing method provided by the above embodiments.
In an exemplary embodiment, there is also provided a non-transitory computer-readable storage medium including instructions, such as the memory 604 including instructions executable by the processor 620 of the device 600 to perform the above method. For example, the non-transitory computer-readable storage medium may be a ROM, a random access memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, and the like.
A non-transitory computer-readable storage medium: when the instructions in the storage medium are executed by the processor of the device 600, the device 600 is enabled to perform the automatic focusing method provided by the above embodiments.
Those skilled in the art, having considered the specification and practiced the invention disclosed herein, will readily conceive of other embodiments of the present disclosure. The present application is intended to cover any variations, uses or adaptations of the present disclosure that follow its general principles and include common knowledge or conventional technical means in the art not disclosed herein. The specification and embodiments are to be regarded as exemplary only, with the true scope and spirit of the disclosure indicated by the following claims.
It should be understood that the present disclosure is not limited to the precise structures described above and illustrated in the accompanying drawings, and that various modifications and changes may be made without departing from its scope. The scope of the present disclosure is limited only by the appended claims.
Claims (13)
1. An automatic focusing method, characterized in that the method comprises:
during focusing, collecting sound information of the surrounding environment;
analyzing the sound source position of the sound information according to the sound information; and
automatically focusing on a target object at the sound source position.
2. The method according to claim 1, characterized in that analyzing the sound source position of the sound information according to the sound information comprises:
when there are two or more pieces of sound information, parsing each piece of sound information to obtain sound features of the sound information;
detecting whether the sound features match sound features of preset sound information; and
if the sound features match the sound features of the preset sound information, analyzing the sound source position of the sound information.
3. The method according to claim 2, characterized in that the method further comprises:
obtaining a scene mode corresponding to the surrounding environment; and
selecting, from at least one piece of preset sound information, the sound information matching the scene mode as the preset sound information.
4. The method according to any one of claims 1 to 3, characterized in that automatically focusing on the target object at the sound source position comprises:
performing preliminary focusing on the sound source position to obtain image information;
identifying the target object at the sound source position in the image information;
detecting whether the target object is the sounding object of the sound information; and
if the target object is the sounding object of the sound information, automatically focusing on the target object.
5. The method according to claim 4, characterized in that performing preliminary focusing on the sound source position to obtain image information comprises:
when the sound source position is not within the current lens range, adjusting the orientation and attitude of the lens according to the sound source position; and
performing preliminary focusing on the sound source position through the adjusted lens, and obtaining image information.
6. The method according to any one of claims 1 to 3, characterized in that the method further comprises:
continuously collecting sound information of the target object; and
performing tracking focusing on the target object according to the continuously collected sound information.
7. An automatic focusing apparatus, characterized in that the apparatus comprises:
a sound acquisition module, configured to collect sound information of the surrounding environment during focusing;
a sound source positioning module, configured to analyze the sound source position of the sound information according to the sound information; and
an image acquisition module, configured to automatically focus on a target object at the sound source position.
8. The apparatus according to claim 7, characterized in that the sound source positioning module further comprises:
a sound parsing unit, a feature detection unit and a sound source positioning unit;
the sound parsing unit is configured to, when there are two or more pieces of sound information, parse each piece of sound information to obtain sound features of the sound information;
the feature detection unit is configured to detect whether the sound features match preset sound information;
the sound source positioning unit is configured to analyze the sound source position of the sound information when the sound features match sound features of the preset sound information.
9. The apparatus according to claim 8, characterized in that the apparatus further comprises:
a scene matching module, configured to obtain a scene mode corresponding to the surrounding environment and to select, from at least one piece of preset sound information, the sound information matching the scene mode as the preset sound information.
10. The apparatus according to any one of claims 7 to 9, characterized in that the image acquisition module comprises: a preliminary focusing unit, an image recognition unit, an image detection unit and an automatic focusing unit;
the preliminary focusing unit is configured to perform preliminary focusing on the sound source position and obtain image information;
the image recognition unit is configured to identify the target object at the sound source position in the image information;
the image detection unit is configured to detect whether the target object is the sounding object of the sound information;
the automatic focusing unit is configured to automatically focus on the target object when the target object is the sounding object of the sound information.
11. The apparatus according to claim 10, characterized in that the preliminary focusing unit comprises:
a lens adjustment subunit, configured to adjust the orientation and attitude of the lens according to the sound source position when the sound source position is not within the current lens range;
a preliminary focusing subunit, configured to perform preliminary focusing on the sound source position through the adjusted lens and obtain image information.
12. The apparatus according to any one of claims 7 to 9, characterized in that the apparatus further comprises: a tracking focusing module;
the tracking focusing module is configured to continuously collect sound information of the target object and to perform tracking focusing on the target object according to the continuously collected sound information.
13. An automatic focusing apparatus, characterized by comprising:
a processor; and
a memory for storing instructions executable by the processor;
wherein the processor is configured to:
during focusing, collect sound information of the surrounding environment;
analyze the sound source position of the sound information according to the sound information; and
automatically focus on a target object at the sound source position.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410261049.2A CN104092936B (en) | 2014-06-12 | 2014-06-12 | Automatic focusing method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410261049.2A CN104092936B (en) | 2014-06-12 | 2014-06-12 | Automatic focusing method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104092936A true CN104092936A (en) | 2014-10-08 |
CN104092936B CN104092936B (en) | 2017-01-04 |
Family
ID=51640616
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410261049.2A Active CN104092936B (en) | Automatic focusing method and device | 2014-06-12 | 2014-06-12 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104092936B (en) |
- 2014-06-12: Application CN201410261049.2A filed in China; granted as patent CN104092936B (status: Active)
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101068308A (en) * | 2007-05-10 | 2007-11-07 | 华为技术有限公司 | System and method for controlling an image collector to perform target positioning |
CN101345668A (en) * | 2008-08-22 | 2009-01-14 | 中兴通讯股份有限公司 | Control method and apparatus for monitoring equipment |
CN101770139A (en) * | 2008-12-29 | 2010-07-07 | 鸿富锦精密工业(深圳)有限公司 | Focusing control system and method |
CN101593522A (en) * | 2009-07-08 | 2009-12-02 | 清华大学 | Full-frequency-domain digital hearing aid method and apparatus |
CN102413276A (en) * | 2010-09-21 | 2012-04-11 | 天津三星光电子有限公司 | Digital video camera having sound-controlled focusing function |
CN103841357A (en) * | 2012-11-21 | 2014-06-04 | 中兴通讯股份有限公司 | Microphone array sound source positioning method, device and system based on video tracking |
CN103051838A (en) * | 2012-12-25 | 2013-04-17 | 广东欧珀移动通信有限公司 | Shoot control method and device |
CN103957359A (en) * | 2014-05-15 | 2014-07-30 | 深圳市中兴移动通信有限公司 | Camera shooting device and focusing method thereof |
Cited By (157)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2016097887A1 (en) * | 2014-12-19 | 2016-06-23 | Sony Corporation | Image forming method and apparatus and electronic device |
CN105763787A (en) * | 2014-12-19 | 2016-07-13 | 索尼公司 | Image forming method, device and electronic device |
CN105827928A (en) * | 2015-01-05 | 2016-08-03 | 中兴通讯股份有限公司 | Focusing area selection method and focusing area selection device |
WO2016110012A1 (en) * | 2015-01-05 | 2016-07-14 | 中兴通讯股份有限公司 | Focus region selection method and apparatus, and computer-readable storage medium |
CN106155050A (en) * | 2015-04-15 | 2016-11-23 | 小米科技有限责任公司 | The mode of operation method of adjustment of intelligent cleaning equipment and device, electronic equipment |
CN104967771A (en) * | 2015-04-30 | 2015-10-07 | 广东欧珀移动通信有限公司 | Method of controlling camera and mobile terminal |
CN104883524A (en) * | 2015-06-02 | 2015-09-02 | 阔地教育科技有限公司 | Method and system for automatically tracking and shooting moving object in online class |
CN104883524B (en) * | 2015-06-02 | 2018-09-11 | 阔地教育科技有限公司 | Moving target automatic tracking image pickup method and system in a kind of Online class |
CN104954673A (en) * | 2015-06-11 | 2015-09-30 | 广东欧珀移动通信有限公司 | Camera rotating control method and user terminal |
CN104954673B (en) * | 2015-06-11 | 2018-01-19 | 广东欧珀移动通信有限公司 | Camera rotation control method and user terminal |
WO2016131361A1 (en) * | 2015-07-29 | 2016-08-25 | 中兴通讯股份有限公司 | Monitoring system and method |
CN106412488A (en) * | 2015-07-29 | 2017-02-15 | 中兴通讯股份有限公司 | Monitoring system and method |
CN107018306A (en) * | 2015-09-09 | 2017-08-04 | 美商富迪科技股份有限公司 | Electronic device |
CN105208283A (en) * | 2015-10-13 | 2015-12-30 | 广东欧珀移动通信有限公司 | Soundsnap method and device |
CN105227849A (en) * | 2015-10-29 | 2016-01-06 | 维沃移动通信有限公司 | Front-facing camera auto-focusing method and electronic device |
CN105657253A (en) * | 2015-12-28 | 2016-06-08 | 联想(北京)有限公司 | Focusing method and electronic device |
CN105657253B (en) * | 2015-12-28 | 2019-03-29 | 联想(北京)有限公司 | Focusing method and electronic device |
CN105611167A (en) * | 2015-12-30 | 2016-05-25 | 联想(北京)有限公司 | Focusing plane adjusting method and electronic device |
CN105791674A (en) * | 2016-02-05 | 2016-07-20 | 联想(北京)有限公司 | Electronic device and focusing method |
CN105791674B (en) * | 2016-02-05 | 2019-06-25 | 联想(北京)有限公司 | Electronic device and focusing method |
US11726742B2 (en) | 2016-02-22 | 2023-08-15 | Sonos, Inc. | Handling of loss of pairing between networked devices |
US11736860B2 (en) | 2016-02-22 | 2023-08-22 | Sonos, Inc. | Voice control of a media playback system |
US11042355B2 (en) | 2016-02-22 | 2021-06-22 | Sonos, Inc. | Handling of loss of pairing between networked devices |
US11405430B2 (en) | 2016-02-22 | 2022-08-02 | Sonos, Inc. | Networked microphone device control |
US11983463B2 (en) | 2016-02-22 | 2024-05-14 | Sonos, Inc. | Metadata exchange involving a networked playback system and a networked microphone system |
US11212612B2 (en) | 2016-02-22 | 2021-12-28 | Sonos, Inc. | Voice control of a media playback system |
US11184704B2 (en) | 2016-02-22 | 2021-11-23 | Sonos, Inc. | Music service selection |
US11513763B2 (en) | 2016-02-22 | 2022-11-29 | Sonos, Inc. | Audio response playback |
US11556306B2 (en) | 2016-02-22 | 2023-01-17 | Sonos, Inc. | Voice controlled media playback system |
US11006214B2 (en) | 2016-02-22 | 2021-05-11 | Sonos, Inc. | Default playback device designation |
US12047752B2 (en) | 2016-02-22 | 2024-07-23 | Sonos, Inc. | Content mixing |
US11514898B2 (en) | 2016-02-22 | 2022-11-29 | Sonos, Inc. | Voice control of a media playback system |
US10971139B2 (en) | 2016-02-22 | 2021-04-06 | Sonos, Inc. | Voice control of a media playback system |
US11832068B2 (en) | 2016-02-22 | 2023-11-28 | Sonos, Inc. | Music service selection |
US10743101B2 (en) | 2016-02-22 | 2020-08-11 | Sonos, Inc. | Content mixing |
US10764679B2 (en) | 2016-02-22 | 2020-09-01 | Sonos, Inc. | Voice control of a media playback system |
US11863593B2 (en) | 2016-02-22 | 2024-01-02 | Sonos, Inc. | Networked microphone device control |
US10970035B2 (en) | 2016-02-22 | 2021-04-06 | Sonos, Inc. | Audio response playback |
US11750969B2 (en) | 2016-02-22 | 2023-09-05 | Sonos, Inc. | Default playback device designation |
US10847143B2 (en) | 2016-02-22 | 2020-11-24 | Sonos, Inc. | Voice control of a media playback system |
US10714115B2 (en) | 2016-06-09 | 2020-07-14 | Sonos, Inc. | Dynamic player selection for audio signal processing |
US11133018B2 (en) | 2016-06-09 | 2021-09-28 | Sonos, Inc. | Dynamic player selection for audio signal processing |
US11545169B2 (en) | 2016-06-09 | 2023-01-03 | Sonos, Inc. | Dynamic player selection for audio signal processing |
US11664023B2 (en) | 2016-07-15 | 2023-05-30 | Sonos, Inc. | Voice detection by multiple devices |
US11184969B2 (en) | 2016-07-15 | 2021-11-23 | Sonos, Inc. | Contextualization of voice inputs |
US11979960B2 (en) | 2016-07-15 | 2024-05-07 | Sonos, Inc. | Contextualization of voice inputs |
US11531520B2 (en) | 2016-08-05 | 2022-12-20 | Sonos, Inc. | Playback device supporting concurrent voice assistants |
US10847164B2 (en) | 2016-08-05 | 2020-11-24 | Sonos, Inc. | Playback device supporting concurrent voice assistants |
CN106385540A (en) * | 2016-09-26 | 2017-02-08 | 珠海格力电器股份有限公司 | Focal length control method, device and system, and mobile device |
US11641559B2 (en) | 2016-09-27 | 2023-05-02 | Sonos, Inc. | Audio playback settings for voice interaction |
US11516610B2 (en) | 2016-09-30 | 2022-11-29 | Sonos, Inc. | Orientation-based playback device microphone selection |
CN109997370B (en) * | 2016-09-30 | 2021-03-02 | 搜诺思公司 | Multi-orientation playback device microphone |
US10873819B2 (en) | 2016-09-30 | 2020-12-22 | Sonos, Inc. | Orientation-based playback device microphone selection |
CN109997370A (en) * | 2016-09-30 | 2019-07-09 | 搜诺思公司 | Multi-orientation playback device microphones |
US11727933B2 (en) | 2016-10-19 | 2023-08-15 | Sonos, Inc. | Arbitration-based voice recognition |
US11308961B2 (en) | 2016-10-19 | 2022-04-19 | Sonos, Inc. | Arbitration-based voice recognition |
US10614807B2 (en) | 2016-10-19 | 2020-04-07 | Sonos, Inc. | Arbitration-based voice recognition |
US11183181B2 (en) | 2017-03-27 | 2021-11-23 | Sonos, Inc. | Systems and methods of multiple voice services |
US11380322B2 (en) | 2017-08-07 | 2022-07-05 | Sonos, Inc. | Wake-word detection suppression |
US11900937B2 (en) | 2017-08-07 | 2024-02-13 | Sonos, Inc. | Wake-word detection suppression |
US11080005B2 (en) | 2017-09-08 | 2021-08-03 | Sonos, Inc. | Dynamic computation of system response volume |
US11500611B2 (en) | 2017-09-08 | 2022-11-15 | Sonos, Inc. | Dynamic computation of system response volume |
US11646045B2 (en) | 2017-09-27 | 2023-05-09 | Sonos, Inc. | Robust short-time fourier transform acoustic echo cancellation during audio playback |
US11017789B2 (en) | 2017-09-27 | 2021-05-25 | Sonos, Inc. | Robust Short-Time Fourier Transform acoustic echo cancellation during audio playback |
US11769505B2 (en) | 2017-09-28 | 2023-09-26 | Sonos, Inc. | Echo of tone interferance cancellation using two acoustic echo cancellers |
US12047753B1 (en) | 2017-09-28 | 2024-07-23 | Sonos, Inc. | Three-dimensional beam forming with a microphone array |
US11102389B2 (en) | 2017-09-28 | 2021-08-24 | Canon Kabushiki Kaisha | Image pickup apparatus and control method therefor |
US11302326B2 (en) | 2017-09-28 | 2022-04-12 | Sonos, Inc. | Tone interference cancellation |
US10891932B2 (en) | 2017-09-28 | 2021-01-12 | Sonos, Inc. | Multi-channel acoustic echo cancellation |
US11538451B2 (en) | 2017-09-28 | 2022-12-27 | Sonos, Inc. | Multi-channel acoustic echo cancellation |
US10880644B1 (en) | 2017-09-28 | 2020-12-29 | Sonos, Inc. | Three-dimensional beam forming with a microphone array |
US11288039B2 (en) | 2017-09-29 | 2022-03-29 | Sonos, Inc. | Media playback system with concurrent voice assistance |
US11175888B2 (en) | 2017-09-29 | 2021-11-16 | Sonos, Inc. | Media playback system with concurrent voice assistance |
US11893308B2 (en) | 2017-09-29 | 2024-02-06 | Sonos, Inc. | Media playback system with concurrent voice assistance |
US10606555B1 (en) | 2017-09-29 | 2020-03-31 | Sonos, Inc. | Media playback system with concurrent voice assistance |
CN107800967A (en) * | 2017-10-30 | 2018-03-13 | 维沃移动通信有限公司 | Image capturing method and mobile terminal |
US11451908B2 (en) | 2017-12-10 | 2022-09-20 | Sonos, Inc. | Network microphone devices with automatic do not disturb actuation capabilities |
US10880650B2 (en) | 2017-12-10 | 2020-12-29 | Sonos, Inc. | Network microphone devices with automatic do not disturb actuation capabilities |
US11676590B2 (en) | 2017-12-11 | 2023-06-13 | Sonos, Inc. | Home graph |
US11343614B2 (en) | 2018-01-31 | 2022-05-24 | Sonos, Inc. | Device designation of playback and network microphone device arrangements |
US11689858B2 (en) | 2018-01-31 | 2023-06-27 | Sonos, Inc. | Device designation of playback and network microphone device arrangements |
CN108494465A (en) * | 2018-03-14 | 2018-09-04 | 维沃移动通信有限公司 | Smart antenna beam adjustment method and mobile terminal |
CN110351476A (en) * | 2018-04-03 | 2019-10-18 | 佳能株式会社 | Image capturing apparatus and non-transitory recording medium |
US11265477B2 (en) | 2018-04-03 | 2022-03-01 | Canon Kabushiki Kaisha | Image capturing apparatus and non-transitory recording medium |
CN110351476B (en) * | 2018-04-03 | 2021-07-13 | 佳能株式会社 | Image pickup apparatus and non-transitory recording medium |
US11797263B2 (en) | 2018-05-10 | 2023-10-24 | Sonos, Inc. | Systems and methods for voice-assisted media content selection |
US11175880B2 (en) | 2018-05-10 | 2021-11-16 | Sonos, Inc. | Systems and methods for voice-assisted media content selection |
US10847178B2 (en) | 2018-05-18 | 2020-11-24 | Sonos, Inc. | Linear filtering for noise-suppressed speech detection |
US11715489B2 (en) | 2018-05-18 | 2023-08-01 | Sonos, Inc. | Linear filtering for noise-suppressed speech detection |
US10959029B2 (en) | 2018-05-25 | 2021-03-23 | Sonos, Inc. | Determining and adapting to changes in microphone performance of playback devices |
US11792590B2 (en) | 2018-05-25 | 2023-10-17 | Sonos, Inc. | Determining and adapting to changes in microphone performance of playback devices |
US11696074B2 (en) | 2018-06-28 | 2023-07-04 | Sonos, Inc. | Systems and methods for associating playback devices with voice assistant services |
US11197096B2 (en) | 2018-06-28 | 2021-12-07 | Sonos, Inc. | Systems and methods for associating playback devices with voice assistant services |
CN108989683A (en) * | 2018-08-20 | 2018-12-11 | 崔跃 | Automatic shooting system for children |
US11563842B2 (en) | 2018-08-28 | 2023-01-24 | Sonos, Inc. | Do not disturb feature for audio notifications |
US11076035B2 (en) | 2018-08-28 | 2021-07-27 | Sonos, Inc. | Do not disturb feature for audio notifications |
US11482978B2 (en) | 2018-08-28 | 2022-10-25 | Sonos, Inc. | Audio notifications |
CN110874909A (en) * | 2018-08-29 | 2020-03-10 | 杭州海康威视数字技术股份有限公司 | Monitoring method, system and readable storage medium |
US11432030B2 (en) | 2018-09-14 | 2022-08-30 | Sonos, Inc. | Networked devices, systems, and methods for associating playback devices based on sound codes |
US10878811B2 (en) | 2018-09-14 | 2020-12-29 | Sonos, Inc. | Networked devices, systems, and methods for intelligently deactivating wake-word engines |
US11778259B2 (en) | 2018-09-14 | 2023-10-03 | Sonos, Inc. | Networked devices, systems and methods for associating playback devices based on sound codes |
US11551690B2 (en) | 2018-09-14 | 2023-01-10 | Sonos, Inc. | Networked devices, systems, and methods for intelligently deactivating wake-word engines |
CN109194918B (en) * | 2018-09-17 | 2022-04-19 | 东莞市丰展电子科技有限公司 | Shooting system based on mobile carrier |
CN109194918A (en) * | 2018-09-17 | 2019-01-11 | 东莞市丰展电子科技有限公司 | Shooting system based on mobile carrier |
US11024331B2 (en) | 2018-09-21 | 2021-06-01 | Sonos, Inc. | Voice detection optimization using sound metadata |
US11790937B2 (en) | 2018-09-21 | 2023-10-17 | Sonos, Inc. | Voice detection optimization using sound metadata |
US11031014B2 (en) | 2018-09-25 | 2021-06-08 | Sonos, Inc. | Voice detection optimization based on selected voice assistant service |
US11727936B2 (en) | 2018-09-25 | 2023-08-15 | Sonos, Inc. | Voice detection optimization based on selected voice assistant service |
US10811015B2 (en) | 2018-09-25 | 2020-10-20 | Sonos, Inc. | Voice detection optimization based on selected voice assistant service |
US11790911B2 (en) | 2018-09-28 | 2023-10-17 | Sonos, Inc. | Systems and methods for selective wake word detection using neural network models |
US11100923B2 (en) | 2018-09-28 | 2021-08-24 | Sonos, Inc. | Systems and methods for selective wake word detection using neural network models |
US12062383B2 (en) | 2018-09-29 | 2024-08-13 | Sonos, Inc. | Linear filtering for noise-suppressed speech detection via multiple network microphone devices |
US11501795B2 (en) | 2018-09-29 | 2022-11-15 | Sonos, Inc. | Linear filtering for noise-suppressed speech detection via multiple network microphone devices |
US10692518B2 (en) | 2018-09-29 | 2020-06-23 | Sonos, Inc. | Linear filtering for noise-suppressed speech detection via multiple network microphone devices |
US11899519B2 (en) | 2018-10-23 | 2024-02-13 | Sonos, Inc. | Multiple stage network microphone device with reduced power consumption and processing load |
US11200889B2 (en) | 2018-11-15 | 2021-12-14 | Sonos, Inc. | Dilated convolutions and gating for efficient keyword spotting |
US11741948B2 (en) | 2018-11-15 | 2023-08-29 | Sonos Vox France Sas | Dilated convolutions and gating for efficient keyword spotting |
US11557294B2 (en) | 2018-12-07 | 2023-01-17 | Sonos, Inc. | Systems and methods of operating media playback systems having multiple voice assistant services |
US11183183B2 (en) | 2018-12-07 | 2021-11-23 | Sonos, Inc. | Systems and methods of operating media playback systems having multiple voice assistant services |
US11132989B2 (en) | 2018-12-13 | 2021-09-28 | Sonos, Inc. | Networked microphone devices, systems, and methods of localized arbitration |
US11538460B2 (en) | 2018-12-13 | 2022-12-27 | Sonos, Inc. | Networked microphone devices, systems, and methods of localized arbitration |
CN109683709A (en) * | 2018-12-17 | 2019-04-26 | 苏州思必驰信息科技有限公司 | Human-machine interaction method and system based on emotion recognition |
US11540047B2 (en) | 2018-12-20 | 2022-12-27 | Sonos, Inc. | Optimization of network microphone devices using noise classification |
US11159880B2 (en) | 2018-12-20 | 2021-10-26 | Sonos, Inc. | Optimization of network microphone devices using noise classification |
US11315556B2 (en) | 2019-02-08 | 2022-04-26 | Sonos, Inc. | Devices, systems, and methods for distributed voice processing by transmitting sound data associated with a wake word to an appropriate device for identification |
US11646023B2 (en) | 2019-02-08 | 2023-05-09 | Sonos, Inc. | Devices, systems, and methods for distributed voice processing |
US11798553B2 (en) | 2019-05-03 | 2023-10-24 | Sonos, Inc. | Voice assistant persistence across multiple network microphone devices |
US11854547B2 (en) | 2019-06-12 | 2023-12-26 | Sonos, Inc. | Network microphone device with command keyword eventing |
US11361756B2 (en) | 2019-06-12 | 2022-06-14 | Sonos, Inc. | Conditional wake word eventing based on environment |
US11200894B2 (en) | 2019-06-12 | 2021-12-14 | Sonos, Inc. | Network microphone device with command keyword eventing |
US11501773B2 (en) | 2019-06-12 | 2022-11-15 | Sonos, Inc. | Network microphone device with command keyword conditioning |
US11138969B2 (en) | 2019-07-31 | 2021-10-05 | Sonos, Inc. | Locally distributed keyword detection |
US10871943B1 (en) | 2019-07-31 | 2020-12-22 | Sonos, Inc. | Noise classification for event detection |
US11710487B2 (en) | 2019-07-31 | 2023-07-25 | Sonos, Inc. | Locally distributed keyword detection |
US11551669B2 (en) | 2019-07-31 | 2023-01-10 | Sonos, Inc. | Locally distributed keyword detection |
US11354092B2 (en) | 2019-07-31 | 2022-06-07 | Sonos, Inc. | Noise classification for event detection |
US11138975B2 (en) | 2019-07-31 | 2021-10-05 | Sonos, Inc. | Locally distributed keyword detection |
US11714600B2 (en) | 2019-07-31 | 2023-08-01 | Sonos, Inc. | Noise classification for event detection |
CN110428850A (en) * | 2019-08-02 | 2019-11-08 | 深圳市无限动力发展有限公司 | Voice pick-up method, device, storage medium and mobile robot |
US11189286B2 (en) | 2019-10-22 | 2021-11-30 | Sonos, Inc. | VAS toggle based on device orientation |
US11862161B2 (en) | 2019-10-22 | 2024-01-02 | Sonos, Inc. | VAS toggle based on device orientation |
US11200900B2 (en) | 2019-12-20 | 2021-12-14 | Sonos, Inc. | Offline voice control |
US11869503B2 (en) | 2019-12-20 | 2024-01-09 | Sonos, Inc. | Offline voice control |
US11562740B2 (en) | 2020-01-07 | 2023-01-24 | Sonos, Inc. | Voice verification for media playback |
US11556307B2 (en) | 2020-01-31 | 2023-01-17 | Sonos, Inc. | Local voice data processing |
US11961519B2 (en) | 2020-02-07 | 2024-04-16 | Sonos, Inc. | Localized wakeword verification |
US11308958B2 (en) | 2020-02-07 | 2022-04-19 | Sonos, Inc. | Localized wakeword verification |
US11308962B2 (en) | 2020-05-20 | 2022-04-19 | Sonos, Inc. | Input detection windowing |
US11482224B2 (en) | 2020-05-20 | 2022-10-25 | Sonos, Inc. | Command keywords with input detection windowing |
US11694689B2 (en) | 2020-05-20 | 2023-07-04 | Sonos, Inc. | Input detection windowing |
US11727919B2 (en) | 2020-05-20 | 2023-08-15 | Sonos, Inc. | Memory allocation for keyword spotting engines |
US11698771B2 (en) | 2020-08-25 | 2023-07-11 | Sonos, Inc. | Vocal guidance engines for playback devices |
CN112073639A (en) * | 2020-09-11 | 2020-12-11 | Oppo(重庆)智能科技有限公司 | Shooting control method and device, computer readable medium and electronic equipment |
US11984123B2 (en) | 2020-11-12 | 2024-05-14 | Sonos, Inc. | Network device interaction by range |
CN112565598A (en) * | 2020-11-26 | 2021-03-26 | Oppo广东移动通信有限公司 | Focusing method and apparatus, terminal, computer-readable storage medium, and electronic device |
US11551700B2 (en) | 2021-01-25 | 2023-01-10 | Sonos, Inc. | Systems and methods for power-efficient keyword detection |
CN113840087A (en) * | 2021-09-09 | 2021-12-24 | Oppo广东移动通信有限公司 | Sound processing method, sound processing device, electronic equipment and computer readable storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN104092936B (en) | 2017-01-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104092936A (en) | Automatic focusing method and apparatus | |
CN105828201B (en) | Video processing method and device | |
RU2647093C2 (en) | Speech control method and apparatus for smart device, control device and smart device | |
CN105791958A (en) | Method and device for live broadcasting game | |
WO2020103548A1 (en) | Video synthesis method and device, and terminal and storage medium | |
CN111641794B (en) | Sound signal acquisition method and electronic equipment | |
CN107515925A (en) | Music playing method and device | |
CN103916711A (en) | Method and device for playing video signals | |
CN105120191A (en) | Video recording method and device | |
CN106303187B (en) | Voice information acquisition method, device and terminal | |
CN104038827A (en) | Multimedia playing method and device | |
CN104112129A (en) | Image identification method and apparatus | |
CN106231378A (en) | Live streaming room display method, apparatus and system | |
CN106331761A (en) | Live broadcast list display method and apparatuses | |
CN105487863A (en) | Interface setting method and device based on scene | |
CN105843503B (en) | Application opening method, device and terminal device | |
WO2017181545A1 (en) | Object monitoring method and device | |
CN103986999A (en) | Method, device and terminal equipment for detecting earphone impedance | |
CN105406882A (en) | Terminal equipment control method and device | |
CN105959587A (en) | Shutter speed acquisition method and device | |
CN108108671A (en) | Product description information acquisition method and device | |
CN106303198A (en) | Photographing information acquisition method and device | |
CN104243829A (en) | Self-shooting method and self-shooting device | |
CN104156993A (en) | Method and device for switching face image in picture | |
CN103955274A (en) | Application control method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |