CN110381368A

CN110381368A - Video cover generation method, device and electronic equipment

Info

Publication number: CN110381368A
Application number: CN201910622565.6A
Authority: CN
Inventors: 黄凯; 王长虎
Original assignee: Beijing ByteDance Network Technology Co Ltd
Current assignee: Beijing ByteDance Network Technology Co Ltd
Priority date: 2019-07-11
Filing date: 2019-07-11
Publication date: 2019-10-25
Also published as: WO2021004247A1

Abstract

A kind of video cover generation method, device and electronic equipment are provided in the embodiment of the present disclosure, are belonged to technical field of image processing, this method comprises: parsing to target video, are obtained multiple parsing images；Clustering processing is carried out to the multiple parsing image, obtains multiple candidate cover images；Based on the critical point detection to the target object execution for including in the candidate cover image as a result, line-of-sight detection is executed and detection of opening eyes to the target object for including in the multiple candidate cover image, to filter out the candidate cover image for not meeting preset requirement；Based on image quality evaluation network to the quality score of candidate's cover image remaining after filtering, final cover image of the target image as the target video is chosen from candidate's cover image remaining after filtering.By the processing scheme of the disclosure, the video cover of high quality can be automatically generated.

Description

Video cover generation method, device and electronic equipment

Technical field

This disclosure relates to which technical field of image processing more particularly to a kind of video cover generation method, device and electronics are set It is standby.

Background technique

Image procossing (image processing) is also known as image processing, is needed for being reached with computer to image As a result technology.Originating from the 1920s, generally Digital Image Processing.The main contents of image processing techniques include figure As compression, enhancing restore, matching description identification 3 parts, common processing have image digitazation, image coding, image enhancement, Image restoration, image segmentation and image analysis etc..Image procossing is to be processed image information to meet people using computer Visual psychology or application demand behavior, be widely used, be chiefly used in mapping science, atmospheric science, astronomy, U.S. figure, make figure As improving identification etc..

One application scenarios of image procossing are the views that a video frame is selected in one section of video as this section of video How frequency cover selects the video frame that one representative, picture quality is high as video from numerous video frames Cover becomes the technical issues that need to address.

Summary of the invention

In view of this, the embodiment of the present disclosure provides a kind of video cover generation method, device and electronic equipment, at least partly Solve problems of the prior art.

In a first aspect, the embodiment of the present disclosure provides a kind of video cover generation method, comprising:

Target video is parsed, multiple parsing images are obtained；

Clustering processing is carried out to the multiple parsing image, obtains multiple candidate cover images；

Based on the critical point detection executed to the target object for including in the candidate cover image as a result, to the multiple The target object for including in candidate cover image executes line-of-sight detection and detection of opening eyes, to filter out the time for not meeting preset requirement Select cover image；

It is remaining after filtering based on image quality evaluation network to the quality score of candidate's cover image remaining after filtering Candidate cover image in choose final cover image of the target image as the target video.

It is described that target video is parsed according to a kind of specific implementation of the embodiment of the present disclosure, obtain multiple solutions Analyse image, comprising:

All video frame performance objectives detection to including in the target video；

It is based on target detection as a result, whether judging to form in the video frame of target video comprising the target object；

If so, setting the parsing image for the video frame comprising the target object.

It is described that the multiple parsing image is carried out at cluster according to a kind of specific implementation of the embodiment of the present disclosure Reason obtains multiple candidate cover images, comprising:

The cluster calculation of k class is executed to the multiple parsing image；

Select the image for meeting preset condition as the candidate cover image in each cluster.

It is described that the multiple parsing image execution k class is gathered according to a kind of specific implementation of the embodiment of the present disclosure Class calculates, comprising:

It is initial cluster center that k sample point is chosen in the multiple parsing image, is denoted as z1 (l), z2 (l) ... ... Zk (l), iteration serial number l=1；

All samples are assigned in k class ω j (k) representated by each cluster centre using Nearest Neighbor Method, it is all kinds of to be wrapped The sample number contained is Nj (l)；

All kinds of centers of gravity is calculated, the center of gravity being calculated is determined as to new cluster centre；

For iteration j, judge whether the value of zj (l+1) and zj (l) is identical, continues to change as zj (l+1) ≠ zj (l) In generation, calculates, and as zj (l+1)=zj (l), stops iterative calculation.

It is described to including in the multiple candidate cover image according to a kind of specific implementation of the embodiment of the present disclosure Before target object executes line-of-sight detection and detection of opening eyes, the method also includes:

Quality evaluation is executed to the multiple candidate cover image using pre-set convolutional neural networks；

Based on the quality evaluation as a result, determining that the quality of each image in the multiple candidate cover image is commented Point.

It is described to including in the multiple candidate cover image according to a kind of specific implementation of the embodiment of the present disclosure Target object executes line-of-sight detection and detection of opening eyes, comprising:

Obtain the critical point detection result executed to the target object for including in candidate cover image；

Based on the critical point detection as a result, determining pupil key point, eyeball key point and eye profile key point；

Based on the pupil key point, eyeball key point and eye profile key point, the eyeball of the target object is executed Modelling operability；

Based on modelling operability as a result, being detected the target object is executed line-of-sight detection and be opened eyes.

According to a kind of specific implementation of the embodiment of the present disclosure, it is described based on modelling operability as a result, come to the mesh It marks object and executes line-of-sight detection and detection of opening eyes, comprising:

Go out the key point of eyeball using CLNF model inspection；

Three-dimensional modeling is carried out to eyes；

It connects origin and forms a ray to pupil center, its intersection point with eyeball is calculated, by eyeball center to intersection point side To vector as direction of visual lines.

By modelling operability as a result, calculating the ratio of width to height of external eyes profile；

Judge whether the target object is in eyes-open state by judging whether described the ratio of width to height is greater than preset threshold.

It is described to be remained based on image quality evaluation network to after filtering according to a kind of specific implementation of the embodiment of the present disclosure The quality score of remaining candidate cover image chooses target image as the mesh from candidate's cover image remaining after filtering Mark the final cover image of video, comprising:

The highest candidate cover image of quality score is chosen from candidate's cover image remaining after filtering；

Using the highest candidate cover image of the quality score as the final cover image of the target video.

Second aspect, the embodiment of the present disclosure provide a kind of video cover generating means, comprising:

Parsing module obtains multiple parsing images for parsing to target video；

Cluster module obtains multiple candidate cover images for carrying out clustering processing to the multiple parsing image；

Filtering module, for based on the critical point detection knot executed to the target object for including in the candidate cover image Fruit executes line-of-sight detection and opens eyes and detect, is not inconsistent with filtering out to the target object for including in the multiple candidate cover image Close the candidate cover image of preset requirement；

Selecting module, for being commented based on quality of the image quality evaluation network to candidate's cover image remaining after filtering Point, final cover image of the target image as the target video is chosen from candidate's cover image remaining after filtering.

The third aspect, the embodiment of the present disclosure additionally provide a kind of electronic equipment, which includes:

At least one processor；And

The memory being connect at least one processor communication；Wherein,

The memory is stored with the instruction that can be executed by least one processor, and the instruction is by least one processor It executes, so that at least one processor is able to carry out the view in any implementation of aforementioned first aspect or first aspect Frequency cover generation method.

Fourth aspect, the embodiment of the present disclosure additionally provide a kind of non-transient computer readable storage medium, the non-transient meter Calculation machine readable storage medium storing program for executing stores computer instruction, and the computer instruction is for making the computer execute aforementioned first aspect or the Video cover generation method in any implementation of one side.

5th aspect, the embodiment of the present disclosure additionally provide a kind of computer program product, which includes The calculation procedure being stored in non-transient computer readable storage medium, the computer program include program instruction, when the program When instruction is computer-executed, the computer is made to execute the video in aforementioned first aspect or any implementation of first aspect Cover generation method.

Video cover in the embodiment of the present disclosure generates scheme, including parses to target video, obtains multiple parsings Image；Clustering processing is carried out to the multiple parsing image, obtains multiple candidate cover images；Based on to the candidate surface plot The critical point detection that the target object for including as in executes is as a result, to the target object for including in the multiple candidate cover image Line-of-sight detection is executed and detection of opening eyes, to filter out the candidate cover image for not meeting preset requirement；Based on image quality evaluation Network chooses target from candidate's cover image remaining after filtering to the quality score of candidate's cover image remaining after filtering Final cover image of the image as the target video.By the scheme of the disclosure, the allusion quotation of high quality can be automatically selected Cover image of the type video frame as video.

Detailed description of the invention

It, below will be to needed in the embodiment attached in order to illustrate more clearly of the technical solution of the embodiment of the present disclosure Figure is briefly described, it should be apparent that, the accompanying drawings in the following description is only some embodiments of the present disclosure, for this field For those of ordinary skill, without creative efforts, it can also be obtained according to these attached drawings other attached drawings.

Fig. 1 is a kind of video cover product process schematic diagram that the embodiment of the present disclosure provides；

Fig. 2 is another video cover product process schematic diagram that the embodiment of the present disclosure provides；

Fig. 3 is another video cover product process schematic diagram that the embodiment of the present disclosure provides；

Fig. 4 is another video cover product process schematic diagram that the embodiment of the present disclosure provides；

Fig. 5 is a kind of video cover generating means structural schematic diagram that the embodiment of the present disclosure provides；

Fig. 6 is the electronic equipment schematic diagram that the embodiment of the present disclosure provides.

Specific embodiment

The embodiment of the present disclosure is described in detail with reference to the accompanying drawing.

Illustrate embodiment of the present disclosure below by way of specific specific example, those skilled in the art can be by this specification Disclosed content understands other advantages and effect of the disclosure easily.Obviously, described embodiment is only the disclosure A part of the embodiment, instead of all the embodiments.The disclosure can also be subject to reality by way of a different and different embodiment It applies or applies, the various details in this specification can also be based on different viewpoints and application, in the spirit without departing from the disclosure Lower carry out various modifications or alterations.It should be noted that in the absence of conflict, the feature in following embodiment and embodiment can To be combined with each other.Based on the embodiment in the disclosure, those of ordinary skill in the art are without creative efforts Every other embodiment obtained belongs to the range of disclosure protection.

It should be noted that the various aspects of embodiment within the scope of the appended claims are described below.Ying Xian And be clear to, aspect described herein can be embodied in extensive diversified forms, and any specific structure described herein And/or function is only illustrative.Based on the disclosure, it will be understood by one of ordinary skill in the art that one described herein Aspect can be independently implemented with any other aspect, and can combine the two or both in these aspects or more in various ways. For example, carry out facilities and equipments in terms of any number set forth herein can be used and/or practice method.In addition, can make With other than one or more of aspect set forth herein other structures and/or it is functional implement this equipment and/or Practice the method.

It should also be noted that, diagram provided in following embodiment only illustrates the basic structure of the disclosure in a schematic way Think, component count, shape and the size when only display is with component related in the disclosure rather than according to actual implementation in schema are drawn System, when actual implementation kenel, quantity and the ratio of each component can arbitrarily change for one kind, and its assembly layout kenel can also It can be increasingly complex.

In addition, in the following description, specific details are provided for a thorough understanding of the examples.However, fields The skilled person will understand that the aspect can be practiced without these specific details.

The embodiment of the present disclosure provides a kind of video cover generation method.Video cover generation method provided in this embodiment can To be executed by a computing device, which can be implemented as software, or be embodied as the combination of software and hardware, the meter It calculates device and can integrate and be arranged in server, terminal device etc..

Referring to Fig. 1, a kind of video cover generation method of embodiment of the present disclosure offer, comprising:

S101 parses target video, obtains multiple parsing images.

Target video is one section of video file for recording audio and image data, and target video can be arbitrary format Video file is also possible to other lattice for example, target video can be the video file of mpg, mp4, rm, rmvb, wax format The video file of formula.

It include video frame in target video, video frame is the set of all video pictures in target video, for example, for For the video that one frame per second is 30fps, according to normal video playout speed, it can be split out in the video of 1 second length 30 video frames.Certainly actual needs are based on, can also be obtained more by way of carrying out interleave in 30 video frames Video frame, alternatively, the selected section video frame from 30 video frames.

It, can be to the view in target video in order to improve the efficiency of parsing during being parsed to target video Frequency frame is screened, and will meet the video frame of screening conditions as parsing image.As an example, the target can be regarded All video frame performance objectives detection for including in frequency, passes through target detection, it can be determined that whether video frame includes target object (for example, people) then sets the parsing for the video frame comprising the target object when in video frame including target object Image.In this way, the specific aim of multiple parsing images can further be improved.Carrying out target detection to video frame can To use a variety of object detection methods for image, mode of target detection is not construed as limiting at this.

S102 carries out clustering processing to the multiple parsing image, obtains multiple candidate cover images.

Contain target object in parsing image, in order to be sieved to the parsing image comprising target object is further Choosing, can execute clustering processing to multiple parsing images can be selected by clustering to image in multiple parsing images A part of typical image (multiple candidate's cover images) is further to be handled.

Specifically, k class can be arranged to multiple parsing images in advance, to carry out cluster calculation.Firstly, the multiple Parsing and choosing k sample point in image is initial cluster center, is denoted as z1 (l), z2 (l) ... ... zk (l), iteration serial number l=1； Next, all samples are assigned in k class ω j (k) representated by each cluster centre using Nearest Neighbor Method, it is all kinds of to be included Sample number be Nj (l)；Next, the center of gravity being calculated is determined as new gather by way of calculating all kinds of centers of gravity Class center；Finally, judge whether the value of zj (l+1) and zj (l) is identical for iteration j by way of loop iteration, when Continue to iterate to calculate when zj (l+1) ≠ zj (l), as zj (l+1)=zj (l), stops iterative calculation.After iteration stopping, Can by it is finally clustering as a result, in each cluster select a parsing image, ultimately form k candidate cover image.

S103, based on the critical point detection executed to the target object for including in the candidate cover image as a result, to institute It states the target object for including in multiple candidate cover images and executes line-of-sight detection and detection of opening eyes, do not meet default want to filter out The candidate cover image asked.

After getting multiple candidate cover images, key point inspection can be executed to the target object in candidate cover image It surveys, by critical point detection, the key point on available target image head, the key point based on head can be to target pair As the sight in candidate cover image and whether opening eyes is detected, to target object sight be deviated or eyes are not complete The image filtering opened entirely falls.

Specifically, the available critical point detection executed to the target object for including in candidate cover image is as a result, base In these critical point detections as a result, determining pupil key point, eyeball key point and eye profile key point；It is closed based on the pupil Key point, eyeball key point and eye profile key point execute modelling operability to the eyeball of the target object, pass through modelling operability It is detected as a result, it is possible to which the target object is executed line-of-sight detection and opened eyes.

As a kind of mode, the key point of eyeball can be gone out using CLNF model inspection by the result of modelling operability, together When three-dimensional modeling is carried out to eyes, and after completing three-dimensional modeling to eyes, connection origin forms one to pupil center and penetrates Line calculates its intersection point with eyeball, using the vector at eyeball center to intersection point direction as direction of visual lines.To further judge Whether the sight of target object meets the requirements.

Alternatively mode, can be by modelling operability as a result, the ratio of width to height of calculating external eyes profile, passes through judgement Whether described the ratio of width to height is greater than preset threshold to judge whether the target object is in eyes-open state.

It, can will be undesirable when finding that candidate cover image is undesirable by line-of-sight detection and detection of opening eyes Candidate cover image filter out, thus remaining candidate's cover image set after being filtered.

S104, based on image quality evaluation network to the quality score of candidate's cover image remaining after filtering, from filtering Final cover image of the target image as the target video is chosen in remaining candidate cover image afterwards.

After being filtered after remaining candidate's cover image set, a figure can be further screened from this collection Picture, so that it is determined that the final cover image of target video.

As a kind of mode, the target object for including in the multiple candidate cover image can be executed line-of-sight detection and It opens eyes before detection, quality score, matter is carried out to all candidate cover images by using convolutional neural networks trained in advance During amount scoring, overall merit can be carried out from many aspects such as image quality, color, environment and expressions, thus to every A candidate's cover image provides a specific quality score.

It can pass through after the quality score of each image in remaining candidate's cover image set after being filtered The mode of sequence, final surface plot of the candidate cover image of the one or more for selecting quality score high as the target video Picture.

By above step, screening can be filtered to the video frame in target video in several ways, thus Screen the final cover image to match for target video.

Target video is parsed referring to fig. 2 according to a kind of optional implementation of the embodiment of the present disclosure, is obtained more A parsing image, may include steps of:

S201 detects all video frame performance objectives for including in the target video.

All video frame performance objectives for including in the target video can be detected, by target detection, can be sentenced Whether disconnected video frame includes target object (for example, people), and carrying out target detection to video frame can be using a variety of for image Object detection method is not construed as limiting the mode of target detection at this.

S202, it is based on target detection as a result, whether judging to form in the video frame of target video comprising the target pair As.

By detecting to video frame performance objective, multiple object detection results can be obtained, by by multiple object detections As a result similarity is carried out with target object to compare, can further judge whether contain target pair in the video frame of target video As.

S203, if so, setting the parsing image for the video frame comprising the target object.

By the step in step S201-203, targetedly video frame images can be selected, obtained multiple Parse image.

Alternatively situation is carrying out clustering processing to the multiple parsing image, obtains multiple candidate surface plots As during, the cluster calculation of k class can be executed to the multiple parsing image, selection one meets pre- in each cluster If the image of condition is as the candidate cover image.

According to a kind of optional implementation of the embodiment of the present disclosure, referring to Fig. 3, k class is executed to the multiple parsing image Cluster calculation, may include steps of:

S301, choosing k sample point in the multiple parsing image is initial cluster center, is denoted as z1 (l), z2 (l) ... ... zk (l), iteration serial number l=1；

All samples are assigned in k class ω j (k) representated by each cluster centre by S302 using Nearest Neighbor Method, all kinds of The sample number for being included is Nj (l)；

S303 calculates all kinds of centers of gravity, and the center of gravity being calculated is determined as to new cluster centre；

S304 judges whether the value of zj (l+1) and zj (l) is identical, as zj (l+1) ≠ zj (l) for iteration j Continue to iterate to calculate, as zj (l+1)=zj (l), stops iterative calculation.

It is described to including in the multiple candidate cover image according to a kind of optional implementation of the embodiment of the present disclosure Before target object executes line-of-sight detection and detection of opening eyes, the method also includes: utilize pre-set convolutional neural networks Quality evaluation is executed to the multiple candidate cover image；Based on the quality evaluation as a result, determining the multiple candidate envelope The quality score of each image in the image of face.

Referring to fig. 4, according to a kind of optional implementation of the embodiment of the present disclosure, to being wrapped in the multiple candidate cover image The target object contained executes line-of-sight detection and detection of opening eyes, comprising:

S401 obtains the critical point detection result executed to the target object for including in candidate cover image.

Specifically, critical point detection can be executed for the head zone of target object in cover image, pass through key point Detection, can obtain the crucial point data of the multiple organs of head zone (for example, eyes).

S402, based on the critical point detection as a result, determining that pupil key point, eyeball key point and eye profile are crucial Point.

By carrying out target identification to the key point that detects, be capable of determining that out pupil key point, eyeball key point and Eye profile key point.

S403 is based on the pupil key point, eyeball key point and eye profile key point, to the eyeball of the target object Execute modelling operability.

It, can be based on these crucial point datas to target after pupil key point, eyeball key point and eye profile key point The eyeball of object executes modelling operability, and after modeling, more careful quantization can be carried out to the eyeball of target object.For mesh The modelling operability for marking object eyeball can be carried out using various ways, be not limited thereto.

S404, based on modelling operability as a result, being detected the target object is executed line-of-sight detection and be opened eyes.

During realizing step S404, eyeball can be gone out using CLNF model inspection by the result of modelling operability Key point carries out three-dimensional modeling to eyes by the key point of eyeball, to the model after three-dimensional modeling, connects origin to pupil It is centrally formed a ray, calculates its intersection point with eyeball, using the vector at eyeball center to intersection point direction as direction of visual lines.From And whether sight requirement is met based on the sight of the walking direction target object of determining sight.

It, can also be by modelling operability as a result, the ratio of width to height of calculating external eyes profile, leads to other than carrying out line-of-sight detection It crosses and judges whether described the ratio of width to height is greater than preset threshold to judge whether the target object is in eyes-open state.To delete The candidate cover image of eyes-open state is not met.

It is described to be remained based on image quality evaluation network to after filtering according to a kind of optional implementation of the embodiment of the present disclosure The quality score of remaining candidate cover image chooses target image as the mesh from candidate's cover image remaining after filtering Mark the final cover image of video, comprising: choose the highest candidate of quality score from candidate's cover image remaining after filtering Cover image；Using the highest candidate cover image of the quality score as the final cover image of the target video.

Corresponding with above method embodiment, referring to Fig. 5, the embodiment of the present disclosure additionally provides a kind of video cover generation Device 50, comprising:

Parsing module 501 obtains multiple parsing images for parsing to target video.

Cluster module 502 obtains multiple candidate cover images for carrying out clustering processing to the multiple parsing image.

Filtering module 503, for being examined based on the key point executed to the target object for including in the candidate cover image It surveys and is detected as a result, executing line-of-sight detection and eye opening to the target object for including in the multiple candidate cover image, to filter out The candidate cover image of preset requirement is not met.

Eye can be gone out using CLNF model inspection by the result of modelling operability as a kind of mode as a kind of mode The key point of ball, while three-dimensional modeling is carried out to eyes, and after completing three-dimensional modeling to eyes, connect origin into pupil It is formed centrally a ray, calculates its intersection point with eyeball, using the vector at eyeball center to intersection point direction as direction of visual lines.To Further judge whether the sight of target object meets the requirements.

Selecting module 504, for the quality based on image quality evaluation network to candidate's cover image remaining after filtering Scoring chooses final cover image of the target image as the target video from candidate's cover image remaining after filtering.

Fig. 5 shown device can it is corresponding execute above method embodiment in content, what the present embodiment was not described in detail Part, referring to the content recorded in above method embodiment, details are not described herein.

Referring to Fig. 6, the embodiment of the present disclosure additionally provides a kind of electronic equipment 60, which includes:

At least one processor；And

The memory being connect at least one processor communication；Wherein,

The memory is stored with the instruction that can be executed by least one processor, and the instruction is by least one processor It executes, so that at least one processor is able to carry out video cover generation method in preceding method embodiment.

The embodiment of the present disclosure additionally provides a kind of non-transient computer readable storage medium, and the non-transient computer is readable to deposit Storage media stores computer instruction, and the computer instruction is for executing the computer in preceding method embodiment.

The embodiment of the present disclosure additionally provides a kind of computer program product, and the computer program product is non-temporary including being stored in Calculation procedure on state computer readable storage medium, the computer program include program instruction, when the program instruction is calculated When machine executes, the computer is made to execute the video cover generation method in preceding method embodiment.

Below with reference to Fig. 6, it illustrates the structural schematic diagrams for the electronic equipment 60 for being suitable for being used to realize the embodiment of the present disclosure. Electronic equipment in the embodiment of the present disclosure can include but is not limited to such as mobile phone, laptop, Digital Broadcasting Receiver Device, PDA (personal digital assistant), PAD (tablet computer), PMP (portable media player), car-mounted terminal are (such as vehicle-mounted Navigation terminal) etc. mobile terminal and such as number TV, desktop computer etc. fixed terminal.Electronics shown in Fig. 6 Equipment is only an example, should not function to the embodiment of the present disclosure and use scope bring any restrictions.

As shown in fig. 6, electronic equipment 60 may include processing unit (such as central processing unit, graphics processor etc.) 601, It can be loaded into random access storage according to the program being stored in read-only memory (ROM) 602 or from storage device 608 Program in device (RAM) 603 and execute various movements appropriate and processing.In RAM 603, it is also stored with the behaviour of electronic equipment 60 Various programs and data needed for making.Processing unit 601, ROM 602 and RAM 603 are connected with each other by bus 604.It is defeated Enter/export (I/O) interface 605 and is also connected to bus 604.

In general, following device can connect to I/O interface 605: including such as touch screen, touch tablet, keyboard, mouse, figure As the input unit 606 of sensor, microphone, accelerometer, gyroscope etc.；Including such as liquid crystal display (LCD), loudspeaking The output device 607 of device, vibrator etc.；Storage device 608 including such as tape, hard disk etc.；And communication device 609.It is logical T unit 609 can permit electronic equipment 60 and wirelessly or non-wirelessly be communicated with other equipment to exchange data.Although showing in figure The electronic equipment 60 with various devices is gone out, it should be understood that being not required for implementing or having all devices shown. It can alternatively implement or have more or fewer devices.

Particularly, in accordance with an embodiment of the present disclosure, it may be implemented as computer above with reference to the process of flow chart description Software program.For example, embodiment of the disclosure includes a kind of computer program product comprising be carried on computer-readable medium On computer program, which includes the program code for method shown in execution flow chart.In such reality It applies in example, which can be downloaded and installed from network by communication device 609, or from storage device 608 It is mounted, or is mounted from ROM 602.When the computer program is executed by processing unit 601, the embodiment of the present disclosure is executed Method in the above-mentioned function that limits.

It should be noted that the above-mentioned computer-readable medium of the disclosure can be computer-readable signal media or meter Calculation machine readable storage medium storing program for executing either the two any combination.Computer readable storage medium for example can be --- but not Be limited to --- electricity, magnetic, optical, electromagnetic, infrared ray or semiconductor system, device or device, or any above combination.Meter The more specific example of calculation machine readable storage medium storing program for executing can include but is not limited to: have the electrical connection, just of one or more conducting wires Taking formula computer disk, hard disk, random access storage device (RAM), read-only memory (ROM), erasable type may be programmed read-only storage Device (EPROM or flash memory), optical fiber, portable compact disc read-only memory (CD-ROM), light storage device, magnetic memory device, Or above-mentioned any appropriate combination.In the disclosure, computer readable storage medium can be it is any include or storage journey The tangible medium of sequence, the program can be commanded execution system, device or device use or in connection.And at this In open, computer-readable signal media may include in a base band or as the data-signal that carrier wave a part is propagated, In carry computer-readable program code.The data-signal of this propagation can take various forms, including but not limited to Electromagnetic signal, optical signal or above-mentioned any appropriate combination.Computer-readable signal media can also be computer-readable and deposit Any computer-readable medium other than storage media, the computer-readable signal media can send, propagate or transmit and be used for By the use of instruction execution system, device or device or program in connection.Include on computer-readable medium Program code can transmit with any suitable medium, including but not limited to: electric wire, optical cable, RF (radio frequency) etc. are above-mentioned Any appropriate combination.

Above-mentioned computer-readable medium can be included in above-mentioned electronic equipment；It is also possible to individualism, and not It is fitted into the electronic equipment.

Above-mentioned computer-readable medium carries one or more program, when said one or multiple programs are by the electricity When sub- equipment executes, so that the electronic equipment: obtaining at least two internet protocol addresses；Send to Node evaluation equipment includes institute State the Node evaluation request of at least two internet protocol addresses, wherein the Node evaluation equipment is internet from described at least two In protocol address, chooses internet protocol address and return；Receive the internet protocol address that the Node evaluation equipment returns；Its In, the fringe node in acquired internet protocol address instruction content distributing network.

Alternatively, above-mentioned computer-readable medium carries one or more program, when said one or multiple programs When being executed by the electronic equipment, so that the electronic equipment: receiving the Node evaluation including at least two internet protocol addresses and request； From at least two internet protocol address, internet protocol address is chosen；Return to the internet protocol address selected；Wherein, The fringe node in internet protocol address instruction content distributing network received.

The calculating of the operation for executing the disclosure can be write with one or more programming languages or combinations thereof Machine program code, above procedure design language include object oriented program language-such as Java, Smalltalk, C+ +, it further include conventional procedural programming language-such as " C " language or similar programming language.Program code can Fully to execute, partly execute on the user computer on the user computer, be executed as an independent software package, Part executes on the remote computer or executes on a remote computer or server completely on the user computer for part. In situations involving remote computers, remote computer can pass through the network of any kind --- including local area network (LAN) Or wide area network (WAN)-is connected to subscriber computer, or, it may be connected to outer computer (such as utilize Internet service Provider is connected by internet).

Flow chart and block diagram in attached drawing are illustrated according to the system of the various embodiments of the disclosure, method and computer journey The architecture, function and operation in the cards of sequence product.In this regard, each box in flowchart or block diagram can generation A part of one module, program segment or code of table, a part of the module, program segment or code include one or more use The executable instruction of the logic function as defined in realizing.It should also be noted that in some implementations as replacements, being marked in box The function of note can also occur in a different order than that indicated in the drawings.For example, two boxes succeedingly indicated are actually It can be basically executed in parallel, they can also be executed in the opposite order sometimes, and this depends on the function involved.Also it to infuse Meaning, the combination of each box in block diagram and or flow chart and the box in block diagram and or flow chart can be with holding The dedicated hardware based system of functions or operations as defined in row is realized, or can use specialized hardware and computer instruction Combination realize.

Being described in unit involved in the embodiment of the present disclosure can be realized by way of software, can also be by hard The mode of part is realized.Wherein, the title of unit does not constitute the restriction to the unit itself under certain conditions, for example, the One acquiring unit is also described as " obtaining the unit of at least two internet protocol addresses ".

It should be appreciated that each section of the disclosure can be realized with hardware, software, firmware or their combination.

The above, the only specific embodiment of the disclosure, but the protection scope of the disclosure is not limited thereto, it is any Those familiar with the art is in the technical scope that the disclosure discloses, and any changes or substitutions that can be easily thought of, all answers Cover within the protection scope of the disclosure.Therefore, the protection scope of the disclosure should be subject to the protection scope in claims.

Claims

1. a kind of video cover generation method characterized by comprising

Target video is parsed, multiple parsing images are obtained；

Based on the critical point detection executed to the target object for including in the candidate cover image as a result, to the multiple candidate The target object for including in cover image executes line-of-sight detection and detection of opening eyes, to filter out the candidate envelope for not meeting preset requirement Face image；

Based on image quality evaluation network to the quality score of candidate's cover image remaining after filtering, the remaining time after filtering It selects and chooses final cover image of the target image as the target video in cover image.

2. obtaining multiple parsings the method according to claim 1, wherein described parse target video Image, comprising:

3. the method according to claim 1, wherein it is described to the multiple parsing image carry out clustering processing, Obtain multiple candidate cover images, comprising:

The cluster calculation of k class is executed to the multiple parsing image；

4. according to the method described in claim 3, it is characterized in that, the cluster for executing k class to the multiple parsing image It calculates, comprising:

All samples are assigned in k class ω j (k) representated by each cluster centre using Nearest Neighbor Method, it is all kinds of included Sample number is Nj (l)；

For iteration j, judge whether the value of zj (l+1) and zj (l) is identical, when zj (l+1) ≠ zj (l) Shi Jixu iteration meter It calculates, as zj (l+1)=zj (l), stops iterative calculation.

5. the method according to claim 1, wherein described to the mesh for including in the multiple candidate cover image Before marking object execution line-of-sight detection and detection of opening eyes, the method also includes:

Based on the quality evaluation as a result, determining the quality score of each image in the multiple candidate cover image.

6. the method according to claim 1, wherein described to the mesh for including in the multiple candidate cover image It marks object and executes line-of-sight detection and detection of opening eyes, comprising:

Based on the pupil key point, eyeball key point and eye profile key point, modeling is executed to the eyeball of the target object Operation；

7. according to the method described in claim 6, it is characterized in that, it is described based on modelling operability as a result, come to the target Object executes line-of-sight detection and detection of opening eyes, comprising:

Go out the key point of eyeball using CLNF model inspection；

Three-dimensional modeling is carried out to eyes；

It connects origin and forms a ray to pupil center, its intersection point with eyeball is calculated, by eyeball center to intersection point direction Vector is as direction of visual lines.

8. according to the method described in claim 6, it is characterized in that, it is described based on modelling operability as a result, come to the target Object executes line-of-sight detection and detection of opening eyes, comprising:

9. the method according to claim 1, wherein the image quality evaluation network that is based on is to remaining after filtering Candidate cover image quality score, from after filtering it is remaining candidate cover image in choose target image as the target The final cover image of video, comprising:

10. a kind of video cover generating means characterized by comprising

Parsing module obtains multiple parsing images for parsing to target video；

Filtering module, for based on the critical point detection executed to the target object for including in the candidate cover image as a result, Line-of-sight detection is executed and detection of opening eyes to the target object for including in the multiple candidate cover image, with filter out do not meet it is pre- If it is required that candidate cover image；

Selecting module, for based on image quality evaluation network to after filtering it is remaining candidate cover image quality score, from Final cover image of the target image as the target video is chosen in remaining candidate's cover image after filtering.

11. a kind of electronic equipment, which is characterized in that the electronic equipment includes:

At least one processor；And

The memory being connect at least one described processor communication；Wherein,

The memory is stored with the instruction that can be executed by least one described processor, and described instruction is by described at least one It manages device to execute, so that at least one described processor is able to carry out the generation of video cover described in aforementioned any claim 1-9 Method.

12. a kind of non-transient computer readable storage medium, which stores computer instruction, The computer instruction is for making the computer execute video cover generation method described in aforementioned any claim 1-9.