Specific embodiment
The application is described in further detail with reference to the accompanying drawings and examples.It is understood that this place is retouched
The specific embodiment stated is used only for explaining related invention, rather than the restriction to the invention.It also should be noted that in order to
Convenient for description, part relevant to related invention is illustrated only in attached drawing.
It should be noted that in the absence of conflict, the features in the embodiments and the embodiments of the present application can phase
Mutually combination.The application is described in detail below with reference to the accompanying drawings and in conjunction with the embodiments.
Fig. 1 is shown can be using the detection method of the application or the exemplary system architecture 100 of detection device.
As shown in Figure 1, system architecture 100 may include terminal device 101,102,103, network 104 and server 105.
Network 104 between terminal device 101,102,103 and server 105 to provide the medium of communication link.Network 104 can be with
Including various connection types, such as wired, wireless communication link or fiber optic cables etc..
User can be used terminal device 101,102,103 and be interacted by network 104 with server 105, to receive or send out
Send message etc..Various telecommunication customer end applications can be installed, such as interactive voice class is answered on terminal device 101,102,103
With, shopping class application, searching class application, instant messaging tools, mailbox client, social platform software etc..
Terminal device 101,102,103 can be hardware, be also possible to software.When terminal device 101,102,103 is hard
When part, it can be the various electronic equipments with display screen and supported web page browsing, including but not limited to smart phone, plate
Computer, E-book reader, MP3 player (Moving Picture Experts Group Audio Layer III, dynamic
Image expert's compression standard audio level 3), MP4 (Moving Picture Experts Group Audio Layer IV, move
State image expert's compression standard audio level 4) player, pocket computer on knee and desktop computer etc..When terminal is set
Standby 101,102,103 when being software, may be mounted in above-mentioned cited electronic equipment.Its may be implemented into multiple softwares or
Software module (such as providing Distributed Services), also may be implemented into single software or software module.It does not do herein specific
It limits.
When terminal device 101,102,103 is hardware, it is also equipped with image capture device thereon.Image Acquisition is set
It is standby to can be the various equipment for being able to achieve acquisition image function, such as camera, sensor.User can use terminal device
101, the image capture device on 102,103, to acquire video.
Frame in the video that terminal device 101,102,103 can record the video or user that it is played carries out people
The processing such as face detection, face critical point detection;Face critical point detection result can also be analyzed, be calculated, determine view
The mouth of the face object of each frame opens distance in frequency;The mouth open and-shut mode for being also based on a certain frame chooses targets threshold,
To open distance using the mouth in the targets threshold and next frame, to the mouth open and-shut mode of the face object in next frame into
Row detection, obtains testing result.
Server 105 can be to provide the server of various services, such as uploading to terminal device 101,102,103
The video video processing service device that is stored, managed or analyzed.Video processing service device can store a large amount of view
Frequently, and video can be sent to terminal device 101,102,103.
It should be noted that server 105 can be hardware, it is also possible to software.When server is hardware, Ke Yishi
The distributed server cluster of ready-made multiple server compositions, also may be implemented into individual server.When server is software,
Multiple softwares or software module (such as providing Distributed Services) may be implemented into, single software or soft also may be implemented into
Part module.It is not specifically limited herein.
It should be noted that detection method provided by the embodiment of the present application is generally held by terminal device 101,102,103
Row, correspondingly, detection device is generally positioned in terminal device 101,102,103.
It should be pointed out that the case where the correlation function of server 105 may be implemented in terminal device 101,102,103
Under, server 105 can be not provided in system architecture 100.
It may also be noted that server 105 can also be to the video or terminal device 101,102,103 that it is stored
The video uploaded carries out the processing such as Face datection, face critical point detection, the detection of mouth open and-shut mode, and processing result is returned
Back to terminal device 101,102,103.At this point, detection method provided by the embodiment of the present application can also be held by server 105
Row, correspondingly, detection device also can be set in server 105.
It should be understood that the number of terminal device, network and server in Fig. 1 is only schematical.According to realization need
It wants, can have any number of terminal device, network and server.
With continued reference to Fig. 2, the process 200 of one embodiment of the detection method according to the application is shown.The detection side
Method, comprising the following steps:
Step 201, it obtains to acquired after the face object progress face critical point detection in the present frame of target video
Face critical point detection result.
In the present embodiment, the executing subject (such as terminal device shown in FIG. 1 101,102,103) of detection method can be with
Carry out the recording or broadcasting of video.Its video played, which can be, is stored in advance in local video;It is also possible to by having
Line connection or radio connection, from the video obtained in server (such as server 105 shown in FIG. 1).Herein, when into
When the recording of row video, image collecting device (such as camera) can be installed or be connected with to above-mentioned executing subject.It may be noted that
, above-mentioned radio connection can include but is not limited to 3G/4G connection, WiFi connection, bluetooth connection, WiMAX connection,
Zigbee connection, UWB (ultra wideband) connection and other currently known or exploitation in the future radio connections.
In the present embodiment, the face object in the available present frame to target video of above-mentioned executing subject carries out people
Obtained face critical point detection result after face critical point detection.Wherein, above-mentioned target video, which can be, is currently played
Video, be also possible to the video that user is recording.It is not construed as limiting herein.Above-mentioned face key testing result may include each
The position (coordinate representation can be used) of a face key point.In practice, face key point can be the crucial point (example in face
Such as with the point of semantic information, face mask or the point of face shape etc. are either influenced).It can be in face key testing result
Coordinate including upper lip center, the coordinate etc. of lower lip center.
Herein, the present frame of target video can be in target video to carry out mouth opening and closing to face object therein
The frame of state-detection.As an example, above-mentioned executing subject can be according to the sequence of the timestamp of frame, successively in target video
Face object in each frame carries out the detection of mouth open and-shut mode.The frame of current pending mouth open and-shut mode detection, it can
The referred to as present frame of target video.By taking the following two kinds scene as an example:
In a kind of scene, target video can be above-mentioned executing subject video being played on.In broadcasting for target video
During putting, above-mentioned executing subject seriatim can carry out face critical point detection to each frame to be played, obtain the frame
In face object face critical point detection as a result, so as to in the frame face object carry out the detection of mouth open and-shut mode,
And then carry out the broadcasting of the frame.It can be present frame in the frame that current time will play.
In another scene, target video can be the video that above-mentioned executing subject is being recorded.In target video
In recording process, above-mentioned executing subject seriatim can carry out face critical point detection to the frame that each has been captured, and be somebody's turn to do
The face critical point detection of face object in frame is as a result, to carry out the inspection of mouth open and-shut mode to the face object in the frame
It surveys, and then the frame is shown.Newest frame acquired in current time can be present frame.
It should be noted that can use various modes carries out face critical point detection.For example, can in above-mentioned executing subject
To be previously stored with the face critical point detection model for carrying out face critical point detection to image.For the every of target video
The frame can be input in above-mentioned face critical point detection model by one frame, obtain face critical point detection result.Here, people
Face critical point detection model can be using machine learning method, is based on sample set, has to existing convolutional neural networks
What supervised training obtained.Wherein, convolutional neural networks can be used various existing structures, such as DenseBox, VGGNet,
ResNet, SegNet etc..It should be noted that above-mentioned machine learning method, Training method be at present extensively research and
The well-known technique of application, details are not described herein.
In some optional implementations of the present embodiment, can also be previously stored in above-mentioned executing subject for pair
The Face datection model of image progress Face datection.At this point, when carrying out the detection of mouth open and-shut mode to a certain frame, it is above-mentioned to hold
The frame can be input to Face datection model first by row main body, obtained Face datection result and (such as be used to indicate face object
The position of region, that is, the position of Face datection frame).Then, screenshot can be carried out to the face object region, i.e.,
Facial image can be obtained.Later, which can be input to face critical point detection model, obtain the inspection of face key point
Survey result.
Step 202, based on face critical point detection as a result, determining that the mouth of the face object in present frame opens distance.
In the present embodiment, above-mentioned executing subject can be primarily based on face critical point detection result adjustment face object
Scaling.For example, can calculate the coordinate of the forehead in face critical point detection result to chin coordinate distance, then
The ratio is determined as scaling by the ratio for determining the distance with pre-determined distance.Then, due to can be in Face datection result
The coordinate of upper lip center including face object, the coordinate of lower lip center, therefore, above-mentioned executing subject can be with
The distance between the two coordinates are calculated, by the distance divided by above-mentioned scaling, determine that mouth opens distance.It may be noted that
, other distances can also be used to determine scaling, be not construed as limiting herein.It is, for example, possible to use the distances of the left and right corners of the mouth
Ratio with another pre-determined distance is as scaling.
It should be noted that above-mentioned executing subject can also carry out Face datection in advance, face subject area is being determined
The region is zoomed in and out afterwards, then carries out face critical point detection again.At this point, the upper lip in face critical point detection result
The coordinate of center is at a distance from the coordinate of lower lip center, and as mouth opens distance.
Step 203, the mouth open and-shut mode based on the face object in predetermined, present frame previous frame determines
Targets threshold.
In the present embodiment, since above-mentioned executing subject can successively carry out the face object in the frame in target video
The detection of mouth open and-shut mode, therefore, above-mentioned executing subject have been predefined when carrying out the detection of mouth open and-shut mode to present frame
The testing result of the mouth open and-shut mode of face object in the previous frame of present frame out.At this point, above-mentioned executing subject can be with base
The mouth open and-shut mode of face object in predetermined, present frame previous frame, determines targets threshold.Herein, target
Threshold value can be mouth open and-shut mode of the above-mentioned executing subject based on the face object in previous frame, pre-set from user institute
Selected threshold value in multiple threshold values.Herein, the mouth open and-shut mode of the face object in previous frame is different, and targets threshold is not yet
Together.
It should be noted that the face object in previous frame and present frame herein can be the face of the same person.Example
Such as, during user's recording shoots the video certainly, the face object in present frame and previous frame is all the face of the user.
In some optional implementations of the present embodiment, in response to determining that the mouth of the face object in previous frame is opened
Closed state is open configuration, preset first threshold can be determined as targets threshold.In response to determining the face in previous frame
The mouth open and-shut mode of object is closed state, preset second threshold can be determined as targets threshold.Wherein, above-mentioned first
Threshold value can be less than above-mentioned second threshold.It should be noted that in this implementation, technical staff can be in advance based on greatly
The data statistics of amount and test two threshold values (respectively first threshold and second threshold) of setting.
In some optional implementations of the present embodiment, technical staff can be in advance based on a large amount of data statistics and
Test sets multiple threshold values, and multiple threshold values is of different sizes.In response to determining the mouth opening and closing of the face object in previous frame
State is open configuration, above-mentioned executing subject can using any threshold value for being greater than default median in above-mentioned multiple threshold values as
Targets threshold.In response to determining that the mouth open and-shut mode of the face object in previous frame is closed state, above-mentioned executing subject can
Using by any threshold value for being less than default median in above-mentioned multiple threshold values as targets threshold.Herein, default median can be with
It is the average value of above-mentioned multiple threshold values, is also possible to any minimum value greater than in above-mentioned multiple threshold values and is less than above-mentioned multiple thresholds
The numerical value of maximum value in value.
In previous mode, a single threshold value is usually set.When mouth is opened apart from preset greater than this, then it is assumed that
It is the state of opening one's mouth;If mouth, which opens distance, is less than the threshold value, then it is assumed that be the state of shutting up.Previous this mode, in mouth
When opening distance in the Near Threshold, it will cause testing result and beat back and forth, cause the stability of testing result and accuracy equal
It is poor.And in such a way that the mouth open and-shut mode in the present embodiment based on the face object in previous frame determines targets threshold,
It by the selection of different threshold values, can frequently beat to avoid testing result, improve the stability and accuracy of testing result.
In some optional implementations of the present embodiment, the previous frame of present frame if it does not exist, i.e. present frame are mesh
Mark the first frame of video.Above-mentioned executing subject can be using pre-set initial threshold as targets threshold, based on above-mentioned mouth
Distance is opened compared with above-mentioned targets threshold, determines the mouth open and-shut mode of the face object in present frame.For example, if mouth
It opens distance and is greater than targets threshold, can be determined as mouth is open configuration;If mouth, which opens distance, is not more than above-mentioned targets threshold,
Can be determined as mouth is closed state.Herein, above-mentioned initial threshold can be set according to actual needs.
In some optional implementations of the present embodiment, above-mentioned initial threshold can be greater than above-mentioned first threshold and small
In above-mentioned second threshold.
Step 204, distance is opened compared with targets threshold based on mouth, determines the mouth of the face object in present frame
Open and-shut mode.
In the present embodiment, above-mentioned executing subject can the opening distance of the mouth based on determined by step 202 and step 203
The comparison of identified targets threshold determines the mouth open and-shut mode of the face object in present frame.
In some optional implementations of the present embodiment, distance is opened in response to the above-mentioned mouth of determination and is greater than above-mentioned mesh
Threshold value is marked, above-mentioned executing subject can determine that the mouth open and-shut mode of the face object in above-mentioned present frame is open configuration.It rings
Distance should be opened no more than above-mentioned targets threshold in determining above-mentioned mouth, above-mentioned executing subject can determine in above-mentioned present frame
The mouth open and-shut mode of face object is closed state.
In some optional implementations of the present embodiment, distance is opened in response to the above-mentioned mouth of determination and is greater than above-mentioned mesh
Threshold value is marked, above-mentioned executing subject can determine that the mouth open and-shut mode of the face object in above-mentioned present frame is open configuration.It rings
It should open distance in determining above-mentioned mouth and be less than above-mentioned targets threshold, above-mentioned executing subject can determine the people in above-mentioned present frame
The mouth open and-shut mode of face object is closed state.Distance, which is opened, in response to the above-mentioned mouth of determination is equal to above-mentioned targets threshold, on
Stating executing subject can be using the mouth open and-shut mode of previous frame as the mouth open and-shut mode of present frame.
In some optional implementations of the present embodiment, in response to the mouth of the face object in the above-mentioned present frame of determination
Bar open and-shut mode is open configuration, and above-mentioned executing subject obtains target special efficacy (such as paster of mouth), in above-mentioned present frame
The mouth position of face object shows above-mentioned target special efficacy.
With continued reference to the schematic diagram that Fig. 3, Fig. 3 are according to the application scenarios of the detection method of the present embodiment.Fig. 3's
In application scenarios, the self-timer mode of user's using terminal equipment 301 records target video.Terminal device is capturing present frame
Afterwards, face critical point detection has been carried out to present frame using the face critical point detection model that it is stored, and has obtained face
Critical point detection result 302.Then, terminal device 301 determines the people in present frame based on face critical point detection result 302
The mouth of face object opens distance 303.Then, terminal device 301 gets the mouth opening and closing shape of the face object in previous frame
State 304 may thereby determine that out targets threshold 305.Finally, above-mentioned terminal device 301 is based on targets threshold 305 and mouth opens
The comparison of distance 303 can determine the mouth open and-shut mode 306 of the face object in present frame.
The method provided by the above embodiment of the application, by obtain to the face object in the present frame of target video into
Obtained face critical point detection after pedestrian's face critical point detection is as a result, so as to be based on the face critical point detection knot
Fruit determines that the mouth of the face object in present frame opens distance.Mouth then based on the face object in previous frame is opened
Closed state can determine targets threshold, so as to based on targets threshold with identified mouth open distance compared with, really
The mouth open and-shut mode of face object in settled previous frame.The targets threshold compared with distance carries out data is opened with mouth as a result,
It is that the mouth open and-shut mode based on the face object in previous frame is determined, that is, it is considered that the face in previous frame
Influence of the mouth open and-shut mode of object to the mouth open and-shut mode of the face object in present frame.Pass through different target as a result,
The selection of threshold value can frequently beat to avoid testing result, improve the inspection of the mouth open and-shut mode of the face object in video
Survey the stability and accuracy of result.
With further reference to Fig. 4, it illustrates the processes 400 of another embodiment of detection method.The stream of the detection method
Journey 400, comprising the following steps:
Step 401, it obtains to acquired after the face object progress face critical point detection in the present frame of target video
Face critical point detection result.
In the present embodiment, the executing subject (such as terminal device shown in FIG. 1 101,102,103) of detection method obtains
To obtained face critical point detection result after the face object progress face critical point detection in the present frame of target video.
It in the present embodiment, can also be in advance before carrying out face critical point detection to the face object in present frame
The Face datection of present frame is carried out, to determine face object region.In addition, determine face object region it
Afterwards, which can also be zoomed in and out, so that the size (such as length) in the region and preset size (such as length) phase
Together.
Step 402, based on face critical point detection as a result, determining that the mouth of the face object in present frame opens distance.
In the present embodiment, due to may include in Face datection result face object upper lip center seat
Mark, the coordinate of lower lip center, therefore, above-mentioned executing subject can calculate the distance between the two coordinates, by this away from
From be determined as mouth open with a distance from.
It step 403, will be preset in response to determining that the mouth open and-shut mode of the face object in previous frame is open configuration
First threshold is determined as targets threshold.
In the present embodiment, in response to determining that the mouth open and-shut mode of the face object in previous frame is open configuration, on
Preset first threshold can be determined as targets threshold (such as 0.2) by stating executing subject.
It step 404, will be preset in response to determining that the mouth open and-shut mode of the face object in previous frame is closed state
Second threshold is determined as targets threshold.
In the present embodiment, in response to determining that the mouth open and-shut mode of the face object in previous frame is closed state, on
Targets threshold can be determined as preset second threshold by stating executing subject.Wherein, above-mentioned first threshold can be less than above-mentioned the
Two threshold values.
Step 405, distance is opened compared with targets threshold based on mouth, determines the mouth of the face object in present frame
Open and-shut mode.
In the present embodiment, above-mentioned electronic equipment can open the ratio of distance and above-mentioned targets threshold based on above-mentioned mouth
Compared with determining the mouth open and-shut mode of the face object in above-mentioned present frame.Specifically, distance is opened in response to the above-mentioned mouth of determination
Greater than above-mentioned targets threshold, it can determine that the mouth open and-shut mode of the face object in above-mentioned present frame is open configuration.Response
In determining that above-mentioned mouth opens distance no more than above-mentioned targets threshold, the mouth of the face object in above-mentioned present frame can be determined
Open and-shut mode is closed state.
Step 406, in response to determining that the mouth open and-shut mode of the face object in present frame is open configuration, target is obtained
Special efficacy shows target special efficacy in the mouth position of the face object of present frame.
It in the present embodiment, is to open shape in response to the mouth open and-shut mode of the face object in the above-mentioned present frame of determination
State, the above-mentioned available target special efficacy of executing subject (such as paster of mouth), in the mouth of the face object of above-mentioned present frame
Position shows above-mentioned target special efficacy.
Figure 4, it is seen that the process 400 of the detection method in the present embodiment relates to compared with the corresponding embodiment of Fig. 2
And to by the way that the step of dual threshold detects mouth open and-shut mode is arranged.The scheme of the present embodiment description can be with as a result,
Determine that the mode of targets threshold can by the selection of different threshold values based on the mouth open and-shut mode of the face object in previous frame
It frequently beats to avoid testing result, improves the stability and accuracy of testing result.In addition, having further related to determining mouth
Open and-shut mode is the displaying step of progress target special efficacy after open configuration.Thus, it is possible to which abundant video shows form.
With further reference to Fig. 5, as the realization to method shown in above-mentioned each figure, this application provides a kind of detection devices
One embodiment, the Installation practice is corresponding with embodiment of the method shown in Fig. 2, which specifically can be applied to various electricity
In sub- equipment.
As shown in figure 5, detection device 500 described in the present embodiment includes: acquiring unit 501, it is configured to obtain to mesh
The face object marked in the present frame of video carries out obtained face critical point detection result after face critical point detection;First
Determination unit 502 is configured to based on above-mentioned face critical point detection as a result, determining the mouth of the face object in above-mentioned present frame
Ba Zhangkai distance;Second determination unit 503, the face being configured in the previous frame based on predetermined, above-mentioned present frame
The mouth open and-shut mode of object, determines targets threshold;Third determination unit 504 is configured to open distance based on above-mentioned mouth
Compared with above-mentioned targets threshold, the mouth open and-shut mode of the face object in above-mentioned present frame is determined.
In some optional implementations of the present embodiment, above-mentioned second determination unit 503 may include first determining
Module and the second determining module (not shown).Wherein, above-mentioned first determining module may be configured in response in determination
The mouth open and-shut mode of face object in one frame is open configuration, and preset first threshold is determined as targets threshold.It is above-mentioned
Second determining module may be configured in response to the mouth open and-shut mode for determining the face object in previous frame be closed state,
Preset second threshold is determined as targets threshold, wherein above-mentioned first threshold is less than above-mentioned second threshold.
In some optional implementations of the present embodiment, above-mentioned third determination unit 504 may include that third determines
Module and the 4th determining module (not shown).Wherein, above-mentioned third determining module may be configured in response in determination
It states mouth and opens distance greater than above-mentioned targets threshold, determine the mouth open and-shut mode of the face object in above-mentioned present frame to open
State.Above-mentioned 4th determining module may be configured to open distance no more than above-mentioned target threshold in response to the above-mentioned mouth of determination
Value determines that the mouth open and-shut mode of the face object in above-mentioned present frame is closed state.
In some optional implementations of the present embodiment, which can also be including the 4th determination unit (in figure not
It shows).Wherein, above-mentioned 4th determination unit may be configured to the previous frame that present frame is not present in response to determining, will be preparatory
The initial threshold of setting opens distance compared with above-mentioned targets threshold as targets threshold, based on above-mentioned mouth, determines current
The mouth open and-shut mode of face object in frame.
In some optional implementations of the present embodiment, which can also include that display unit (does not show in figure
Out).Wherein, above-mentioned display unit may be configured to be opened and closed shape in response to the mouth of the face object in the above-mentioned present frame of determination
State is open configuration, obtains target special efficacy, shows above-mentioned target special efficacy in the mouth position of the face object of above-mentioned present frame.
The device provided by the above embodiment of the application is obtained by acquiring unit 501 in the present frame of target video
Face object carry out face critical point detection after obtained face critical point detection as a result, to the first determination unit 502
It can be based on the face critical point detection as a result, determining that the mouth of the face object in present frame opens distance.Then second
Mouth open and-shut mode of the determination unit 503 based on the face object in previous frame, can determine targets threshold, so that third is true
Order member 504 can determine the face object in present frame based on targets threshold compared with identified mouth opens distance
Mouth open and-shut mode.The targets threshold compared with mouth opens distance progress data is based on the face in previous frame as a result,
What the mouth open and-shut mode of object was determined, that is, it is considered that the mouth open and-shut mode pair of the face object in previous frame
The influence of the mouth open and-shut mode of face object in present frame.The selection for passing through different target threshold value as a result, can be to avoid inspection
It surveys result frequently to beat, improves the stability of the testing result of the mouth open and-shut mode of the face object in video and accurate
Property.
Below with reference to Fig. 6, it illustrates the computer systems 600 for the electronic equipment for being suitable for being used to realize the embodiment of the present application
Structural schematic diagram.Electronic equipment shown in Fig. 6 is only an example, function to the embodiment of the present application and should not use model
Shroud carrys out any restrictions.
As shown in fig. 6, computer system 600 includes central processing unit (CPU) 601, it can be read-only according to being stored in
Program in memory (ROM) 602 or be loaded into the program in random access storage device (RAM) 603 from storage section 608 and
Execute various movements appropriate and processing.In RAM 603, also it is stored with system 600 and operates required various programs and data.
CPU 601, ROM 602 and RAM 603 are connected with each other by bus 604.Input/output (I/O) interface 605 is also connected to always
Line 604.
I/O interface 605 is connected to lower component: the importation 606 including keyboard, mouse etc.;It is penetrated including such as cathode
The output par, c 607 of spool (CRT), liquid crystal display (LCD) etc. and loudspeaker etc.;Storage section 608 including hard disk etc.;
And the communications portion 609 of the network interface card including LAN card, modem etc..Communications portion 609 via such as because
The network of spy's net executes communication process.Driver 610 is also connected to I/O interface 605 as needed.Detachable media 611, such as
Disk, CD, magneto-optic disk, semiconductor memory etc. are mounted on as needed on driver 610, in order to read from thereon
Computer program be mounted into storage section 608 as needed.
Particularly, in accordance with an embodiment of the present disclosure, it may be implemented as computer above with reference to the process of flow chart description
Software program.For example, embodiment of the disclosure includes a kind of computer program product comprising be carried on computer-readable medium
On computer program, which includes the program code for method shown in execution flow chart.In such reality
It applies in example, which can be downloaded and installed from network by communications portion 609, and/or from detachable media
611 are mounted.When the computer program is executed by central processing unit (CPU) 601, limited in execution the present processes
Above-mentioned function.It should be noted that computer-readable medium described herein can be computer-readable signal media or
Computer readable storage medium either the two any combination.Computer readable storage medium for example can be --- but
Be not limited to --- electricity, magnetic, optical, electromagnetic, infrared ray or semiconductor system, device or device, or any above combination.
The more specific example of computer readable storage medium can include but is not limited to: have one or more conducting wires electrical connection,
Portable computer diskette, hard disk, random access storage device (RAM), read-only memory (ROM), erasable type may be programmed read-only deposit
Reservoir (EPROM or flash memory), optical fiber, portable compact disc read-only memory (CD-ROM), light storage device, magnetic memory
Part or above-mentioned any appropriate combination.In this application, computer readable storage medium, which can be, any include or stores
The tangible medium of program, the program can be commanded execution system, device or device use or in connection.And
In the application, computer-readable signal media may include in a base band or the data as the propagation of carrier wave a part are believed
Number, wherein carrying computer-readable program code.The data-signal of this propagation can take various forms, including but not
It is limited to electromagnetic signal, optical signal or above-mentioned any appropriate combination.Computer-readable signal media can also be computer
Any computer-readable medium other than readable storage medium storing program for executing, the computer-readable medium can send, propagate or transmit use
In by the use of instruction execution system, device or device or program in connection.Include on computer-readable medium
Program code can transmit with any suitable medium, including but not limited to: wireless, electric wire, optical cable, RF etc., Huo Zheshang
Any appropriate combination stated.
Flow chart and block diagram in attached drawing are illustrated according to the system of the various embodiments of the application, method and computer journey
The architecture, function and operation in the cards of sequence product.In this regard, each box in flowchart or block diagram can generation
A part of one module, program segment or code of table, a part of the module, program segment or code include one or more use
The executable instruction of the logic function as defined in realizing.It should also be noted that in some implementations as replacements, being marked in box
The function of note can also occur in a different order than that indicated in the drawings.For example, two boxes succeedingly indicated are actually
It can be basically executed in parallel, they can also be executed in the opposite order sometimes, and this depends on the function involved.Also it to infuse
Meaning, the combination of each box in block diagram and or flow chart and the box in block diagram and or flow chart can be with holding
The dedicated hardware based system of functions or operations as defined in row is realized, or can use specialized hardware and computer instruction
Combination realize.
Being described in unit involved in the embodiment of the present application can be realized by way of software, can also be by hard
The mode of part is realized.Described unit also can be set in the processor, for example, can be described as: a kind of processor packet
Include acquiring unit, the first determination unit, the second determination unit and third determination unit.Wherein, the title of these units is at certain
In the case of do not constitute restriction to the unit itself, for example, acquiring unit is also described as " obtaining to target video
Face object in present frame carries out the unit of obtained face critical point detection result after face critical point detection ".
As on the other hand, present invention also provides a kind of computer-readable medium, which be can be
Included in device described in above-described embodiment;It is also possible to individualism, and without in the supplying device.Above-mentioned calculating
Machine readable medium carries one or more program, when said one or multiple programs are executed by the device, so that should
Device: it obtains to obtained face key point after the face object progress face critical point detection in the present frame of target video
Testing result;Based on the face critical point detection as a result, determining that the mouth of the face object in the present frame opens distance;It is based on
The mouth open and-shut mode of face object in predetermined, the present frame previous frame, determines targets threshold;Based on the mouth
Distance is opened compared with the targets threshold, determines the mouth open and-shut mode of the face object in the present frame.
Above description is only the preferred embodiment of the application and the explanation to institute's application technology principle.Those skilled in the art
Member is it should be appreciated that invention scope involved in the application, however it is not limited to technology made of the specific combination of above-mentioned technical characteristic
Scheme, while should also cover in the case where not departing from foregoing invention design, it is carried out by above-mentioned technical characteristic or its equivalent feature
Any combination and the other technical solutions formed.Such as features described above has similar function with (but being not limited to) disclosed herein
Can technical characteristic replaced mutually and the technical solution that is formed.