CN109271542A - Cover determines method, apparatus, equipment and readable storage medium storing program for executing - Google Patents

Cover determines method, apparatus, equipment and readable storage medium storing program for executing Download PDF

Info

Publication number
CN109271542A
CN109271542A CN201811140229.XA CN201811140229A CN109271542A CN 109271542 A CN109271542 A CN 109271542A CN 201811140229 A CN201811140229 A CN 201811140229A CN 109271542 A CN109271542 A CN 109271542A
Authority
CN
China
Prior art keywords
candidate image
information
file
entity
degree
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811140229.XA
Other languages
Chinese (zh)
Inventor
赵翔
李鑫
刘霄
李旭斌
孙昊
文石磊
丁二锐
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Baidu Online Network Technology Beijing Co Ltd
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201811140229.XA priority Critical patent/CN109271542A/en
Publication of CN109271542A publication Critical patent/CN109271542A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the invention discloses a kind of covers to determine method, apparatus, equipment and readable storage medium storing program for executing.Wherein, method includes: the information for obtaining the candidate image set of file, and obtaining from each candidate image the entity of display;Obtain the content information of the file;Calculate the degree of correlation of the information for the entity that each candidate image is shown and the content information of file;From candidate image set, determine that the degree of correlation meets cover of the candidate image of the first preset requirement as the file, so that the main contents of the cover fitting file determined, realize that picture and text are consistent;Moreover, determining the high cover of the degree of correlation by intelligence, the human cost for manually selecting cover can be saved, the high-quality cover of selection is able to ascend user's clicking rate.

Description

Cover determines method, apparatus, equipment and readable storage medium storing program for executing
Technical field
The present embodiments relate to computer vision techniques more particularly to a kind of cover to determine method, apparatus, equipment and can Read storage medium.
Background technique
Cover original refers to one layer outside books and periodicals, refers in particular to be printed on the first of title, author or editor, publisher's title etc. sometimes Face.In recent years, the electronic documents such as webpage, video, electronics comic books, electronics atlas emerged in multitude, generally using image as this The cover of a little files.
The prior art arbitrarily chooses an image as cover, for example, arbitrarily choosing one from video generally from file Cover of a video frame as the video, in another example, cover of the image as the webpage is arbitrarily chosen from webpage.
Since the image in file often has many and picture material numerous and jumbled, the cover quality for causing existing method to be chosen It is low, it is low with the relevance of file.
Summary of the invention
The embodiment of the present invention provides a kind of cover and determines method, apparatus, equipment and readable storage medium storing program for executing, so that determine Cover is bonded the main contents of file, realizes that picture and text are consistent.
In a first aspect, the embodiment of the invention provides a kind of covers to determine method, comprising:
The candidate image set of file is obtained, and obtains from each candidate image the information of the entity of display;
Obtain the content information of the file;
Calculate the degree of correlation of the information for the entity that each candidate image is shown and the content information of file;
From candidate image set, determine that the degree of correlation meets envelope of the candidate image of the first preset requirement as the file Face.
Second aspect, the embodiment of the invention also provides a kind of cover determining device, which includes: the first acquisition mould Block for obtaining the candidate image set of file, and obtains from each candidate image the information of the entity of display;
Second obtains module, for obtaining the content information of the file;
Computing module, for calculating the degree of correlation of the information for the entity that each candidate image is shown and the content information of file;
Determining module, for from candidate image set, determining that the degree of correlation meets the candidate image work of the first preset requirement For the cover of the file.
The third aspect, the embodiment of the invention also provides a kind of electronic equipment, the electronic equipment includes:
One or more processors;
Memory, for storing one or more programs,
When one or more of programs are executed by one or more of processors, so that one or more of processing Device realizes that cover described in any embodiment determines method.
Fourth aspect, the embodiment of the invention also provides a kind of computer readable storage mediums, are stored thereon with computer Program realizes that cover described in any embodiment determines method when the program is executed by processor.
In the embodiment of the present invention, by obtaining the candidate image set of file, and display is obtained from each candidate image The information of entity obtains the content information of the file, calculates the information for the entity that each candidate image is shown and the content of file The degree of correlation of information, to obtain the information of the entity of candidate image and the correlation degree of content information;By from candidate image In set, determine that the degree of correlation meets cover of the candidate image of the first preset requirement as the file, to scheme according to candidate The entity information of picture and the correlation degree of content information determine cover, so that the main contents of the cover fitting file determined, real Existing picture and text are consistent;Moreover, determining the high cover of the degree of correlation by intelligence, the human cost for manually selecting cover can be saved, is chosen High-quality cover be able to ascend user's clicking rate.
Detailed description of the invention
Fig. 1 is the flow chart that a kind of cover that the embodiment of the present invention one provides determines method;
Fig. 2 is the flow chart that a kind of cover provided by Embodiment 2 of the present invention determines method;
Fig. 3 is the flow chart that a kind of cover that the embodiment of the present invention three provides determines method;
Fig. 4 is a kind of structure chart for cover determining device that the embodiment of the present invention four provides;
Fig. 5 is the structural schematic diagram for a kind of electronic equipment that the embodiment of the present invention five provides.
Specific embodiment
The present invention is described in further detail with reference to the accompanying drawings and examples.It is understood that this place is retouched The specific embodiment stated is used only for explaining the present invention rather than limiting the invention.It also should be noted that in order to just Only the parts related to the present invention are shown in description, attached drawing rather than entire infrastructure.
Embodiment one
Fig. 1 is the flow chart that a kind of cover that the embodiment of the present invention one provides determines method, and the present embodiment is applicable to really The case where determining the cover of file.It is alternatively possible to which the modification in response to the content information to file operates or to candidate image The modification of set operates, and executes cover provided in this embodiment and determines method;File can also pulled or distributed When, it executes cover provided in this embodiment and determines method;Cover provided in this embodiment can also periodically be executed and determine method.
This method can be executed by cover determining device, which can collect by hardware and/or software sharing, and generally At in the electronic device, specifically including following operation:
S110, the candidate image set for obtaining file, and obtain from each candidate image the information of the entity of display.
Optionally, the file in the present embodiment includes but is not limited to video file, atlas, comic books, audio file and net Page.Correspondingly, the set and/or file association that the candidate image set of file includes at least two images in file are at least The set of two images, optionally, the image of file association can be the image that user is directed to this document input.
The present embodiment will determine cover image from these images, describe and distinguish for convenience, these images are known as Candidate image, the collection that candidate image is constituted are collectively referred to as candidate image set.
In one example, this document is video file, then extracts at least two views according to setpoint frequency from video file Frequency frame, as candidate image set.Setpoint frequency can be preset, such as one 1 video frame of extraction in 1 second, extraction in 2 seconds view Frequency frame.Further, at least two video frames are extracted from the preset time period in video file according to setpoint frequency.Based on video The beginning part of file and ending are usually that advertisement, prelude etc. are associated with little video frame, preset time with video content Section can be the Central Time section of video file, such as 30%-70% period, so as to be drawn into be associated with video content it is larger Video frame.
In another example, this document is audio file, then obtains the associated singer's image of audio file, disc image etc. Set.
After getting the candidate image set of file, display is obtained from each candidate image of candidate image set The information of entity.For example, the entity that candidate image is shown includes personage, object, the scene etc. that candidate image is shown.Optionally, The information of entity is information of text type, including characters name, object names, scene information etc..
S120, the content information for obtaining file.
Wherein, the content information of file refers to embody the information of file main contents, optionally, the content information of file It is the information of text type, such as the title content information of file, the clip Text information of file, the brief introduction content of file letter Breath, classification information of file etc..
It is worth noting that S110 and S120 can be executed successively, or execute side by side, operation order is without limiting.
The degree of correlation of the content information of S130, the information for calculating the entity that each candidate image is shown and file.
In the present embodiment, the degree of correlation refers to the information of entity and the correlation degree of content information, or is referred to as similar Degree.
Optionally, the degree of correlation of the information for the entity that each candidate image is shown and the content information of file is calculated separately.Phase Pass degree calculation method includes but is not limited to cosine similarity, Euclidean distance, Pearson came correlation and Spearman rank correlation system Number.
S140, from candidate image set, determine the degree of correlation meet the first preset requirement candidate image as file Cover.
Bigger with the content information degree of correlation of file in the present embodiment, corresponding candidate image can more characterize file Content also more meets the cover requirement of file.
In order to determine the image for being best suitable for cover from candidate cover set, the first preset requirement is preset.Optionally, First preset requirement includes that the degree of correlation is more than or equal to preset threshold or degree of correlation maximum.Correspondingly, determine that the degree of correlation is greater than In cover of the candidate image as file of preset threshold;Alternatively, determining the maximum candidate figure of the degree of correlation in candidate image set As the cover as file.Have multiple if it is larger than or equal to the candidate image of preset threshold, then chooses any one candidate image Cover as file.
In the embodiment of the present invention, by obtaining the candidate image set of file, and display is obtained from each candidate image The information of entity obtains the content information of file, calculates the information for the entity that each candidate image is shown and the content information of file The degree of correlation, to obtain the information of the entity of candidate image and the correlation degree of content information;By from candidate image set In, determine that the degree of correlation meets cover of the candidate image of the first preset requirement as file, thus according to the entity of candidate image The correlation degree of information and content information determines cover, so that the main contents of the cover fitting file determined, realize picture and text phase Symbol;Moreover, determining the high cover of the degree of correlation by intelligence, the human cost for manually selecting cover, the high-quality envelope of selection can be saved Face is able to ascend user's clicking rate.
Further, the information of entity can be the information of text type, and the content information of file is also possible to text class The information of type, text are a kind of semantic spaces of higher-dimension, are capable of the information and content information of accurate characterization entity;Then, pass through The degree of correlation of the information of two text types is calculated, can relatively accurately obtain the correlation degree of candidate image and file, in turn Improve the accuracy that cover determines.
Embodiment two
Fig. 2 is the flow chart that a kind of cover provided by Embodiment 2 of the present invention determines method.The embodiment of the present invention is above-mentioned It is refined on the basis of the technical solution of each embodiment.
Further, it is refined as operation " obtaining the information of the entity of display from each candidate image " " to identify each candidate The entity that image is shown obtains the description information of entity in each candidate image;By the description information of entity in each candidate image or The information for the entity that keyword in person's description information is shown as each candidate image ", is embodied as reality for the information of entity The description information of body or the keyword in description information.
Further, operation " is calculated into the related of the information for the entity that each candidate image is shown and the content information of file Degree " is refined as " content information of file being generated the first text vector;The information for the entity that each candidate image is shown generates Second text vector;According to the first text vector and each second text vector, the information for the entity that each candidate image is shown is calculated With the degree of correlation of the content information of file ", to calculate the degree of correlation by text vector.
A kind of cover as shown in Figure 2 determines method, comprising:
S210, the candidate image set for obtaining file.
The entity that S220, each candidate image of identification are shown, obtains the description information of entity in each candidate image.
Optionally, description information is the information of text type, name description information, the description of the title of article including personage Information, description information of scene etc..
Optionally, each candidate image is input to entity recognition model, obtains the description information of entity in each candidate image. Optionally, entity recognition model can be convolutional neural networks model.Entity recognition model includes at least: human face recognition model, One of article identification model and gesture recognition model.
Optionally, human face recognition model is used to carry out the face in each candidate image public figure's Sex criminals, output The public figure's name recognized.Article identification model is used to carry out each candidate image the classification of multiclass common user item, and output is known The Item Title being clipped to, such as mountain, desk, ocean etc..Gesture recognition model is used to carry out gesture recognition to each candidate image, defeated The posture recognized out, such as play soccer, jump square dance, on a treadmill running etc..
S230, using the keyword in the description information or description information of entity in each candidate image as each candidate image The information of the entity of display.
Optionally, description is believed by natural language processing (Natural Language Processing, NLP) technology Breath carries out word cutting, obtains description information keyword, the information as entity.For example, " running on a treadmill " from description information In obtain keyword " treadmill " and " running ".
S240, the content information for obtaining file.
Optionally, the content information of file includes file content itself or content keyword.
Optionally, by natural language processing (Natural Language Processing, NLP) technology in file Hold, such as title, abstract, brief introduction or classification, carries out word cutting, obtain content keyword, the content information as file.Example Such as, keyword " TV play A " and " end " are obtained from title " TV play A end ".
S250, the content information of file is generated into the first text vector.
Optionally, the content information of file is mapped as by N-dimensional vector using vector space model.Further, by file Content itself or content keyword are mapped as N-dimensional text vector, referred to as the first text vector.When content keyword have it is multiple When, the first text vector includes multiple vectors of corresponding each keyword.
Information the second text vector of generation of S260, the entity for showing each candidate image.
Optionally, use vector space model by the information MAP of entity for N-dimensional vector.Further, retouching entity It states the keyword in information or description information and is mapped as N-dimensional text vector, referred to as the second text vector.When in description information When keyword has multiple, the second text vector includes multiple vectors of corresponding each keyword.
S270, according to the first text vector and each second text vector, calculate the information for the entity that each candidate image is shown With the degree of correlation of the content information of file.
Optionally, for each second text vector, each second text of each vector sum in the first text vector is calculated The degree of correlation of each vector in vector, chooses the maximum degree of correlation, as the final degree of correlation.
Optionally, cosine similarity is used to calculate the degree of correlation between two text vectors: with each in the first text vector The angle of each vector is to consider angle in each second text vector of vector sum, respectively with inner product (each corresponding element of two vectors Be multiplied summation) compare the product of two vector field homoemorphisms for calculated result, the cosine similarity between two text vectors is obtained, by each second text The content information of the information and file for the entity that maximum cosine similarity is shown respectively as each candidate image in this vector The degree of correlation.
Optionally, the inverse of the distance of the first text vector and each second text vector is calculated, it is aobvious as each candidate image The degree of correlation of the content information of the information and file of the entity shown.
Specifically, it is calculated using NLP technology every in each second text vector of each vector sum in the first text vector Distance of a vector, such as literal distance, Levenstein distance, Jaro distance etc., by minimum range in each second text vector The entity shown respectively as each candidate image reciprocal information and file content information the degree of correlation.
As it can be seen that the first text vector and the second text vector are closer, the degree of correlation is higher, and corresponding candidate image is determined It is bigger for the probability of cover.
S280, from candidate image set, determine the degree of correlation meet the first preset requirement candidate image as file Cover.
In the present embodiment, by using the keyword in the description information or description information of entity in each candidate image as The content information of file is generated the first text vector, each candidate image is shown by the information for the entity that each candidate image is shown The information of the entity shown generates the second text vector, according to the first text vector and each second text vector, calculates each candidate figure The degree of correlation of the content information of the information and file of entity as shown in improves to calculate the degree of correlation according to text vector The accuracy of relatedness computation, and then improve the accuracy that cover determines.
Embodiment three
Fig. 3 is the flow chart that a kind of cover that the embodiment of the present invention three provides determines method.The embodiment of the present invention is above-mentioned It is optimized on the basis of the technical solution of each embodiment.
Further, additional operation " the image quality information for obtaining each candidate image ", and will " from candidate image set, Determine that the degree of correlation meets cover of the candidate image of the first preset requirement as file " it is refined as " being shown according to each candidate image Entity information and file content information the degree of correlation and corresponding image quality information, calculate the score of each candidate image; From candidate image set, determine that score meets cover of the candidate image of the second preset requirement as file ", to pass through image quality Information and the comprehensive determining cover of the degree of correlation.
A kind of cover as shown in Figure 3 determines method, comprising:
S310, the candidate image set for obtaining file, and obtain from each candidate image the information of the entity of display.
S320, the content information for obtaining file.
S330, the image quality information for obtaining each candidate image.
Optionally, image quality information includes but is not limited to clarity, aesthetics and whether there is or not texts.
Optionally, each candidate image is input to image quality identification model, obtains the image quality information of candidate image.Image quality identification Model can be convolutional neural networks model.Image quality identification model includes at least: aesthetics identification model, clarity identification model One of with Text region model.
Optionally, aesthetics identification model is used to carry out aesthetics identification to each candidate image, exports aesthetics.Aesthetics The training process of identification model includes: to obtain multiple image patterns, and mark the aesthetics 1-10 of each image pattern, and numerical value is higher Illustrate more beautiful.Aesthetics identification model is trained according to multiple image patterns and aesthetics, makes aesthetics identification model Output approach the aesthetics of mark.
Clarity identification model is used to carry out clarity identification to each candidate image, exports clarity.Clarity identifies mould The training process of type includes: to obtain multiple image patterns, and mark the clarity 1-10 of each image pattern, and the higher explanation of numerical value is more Clearly.Clarity identification model is trained according to multiple image patterns and clarity, makes the output of clarity identification model Approach the clarity of mark.
Text region model is used to carry out each candidate image the identification of text existence, and whether there is or not texts, such as 1 expression for output Text is indicated without text, 0.
As it can be seen that the image quality value of information is bigger, illustrate that image quality is better, the probability that corresponding candidate image is confirmed as cover is got over Greatly.
It is worth noting that S330 can be executed after the candidate image set for obtaining file, before S350, this implementation Example, which provides only, a kind of optional executes sequence.
The degree of correlation of the content information of S340, the information for calculating the entity that each candidate image is shown and file.
The degree of correlation of the content information of S350, the information of the entity shown according to each candidate image and file, and it is corresponding Image quality information, calculate the score of each candidate image.
Optionally, the degree of correlation of the content information of the information and file of the entity each candidate image shown, and it is corresponding Image quality information be weighted summation, obtain the score of each candidate image.
Wherein, when image quality information includes multiple information, the weight of image quality information includes the corresponding power of each information Weight.Optionally, the weight of the weight of image quality information and the degree of correlation can require to be arranged according to practical cover.For example, cover requires Closer to file content, then the weight of the degree of correlation is arranged larger.In another example cover requires image quality more preferable, then image quality is believed The weight of breath is arranged larger.
Optionally, the weight of the weight of image quality information and the degree of correlation can also be obtained by study.Specifically include following three Step operation:
Step 1: obtaining multiple image patterns and corresponding target score.
Optionally, multiple image patterns are obtained from sample file, such as multiple video frames are extracted from Sample video.People Work gives a mark to every image pattern according to image quality information and the degree of correlation, as target score.Such as 0-4 points, the bigger expression of score value More meet cover requirement.
Step 2: the degree of correlation of the content information of the information and file for the entity that every image pattern of acquisition is shown and every The image quality information of image pattern.
It is obtained in the information and file for the entity that every image pattern is shown using the method that the various embodiments described above provide Hold the degree of correlation of information and the image quality information of every image pattern, details are not described herein again.
Step 3: the degree of correlation of the content information of the information and file of the entity shown by every image pattern, and The image quality information fit object score of every image pattern, obtains the weight of the information of entity and the weight of image quality information.
Optionally, using logistical regression fit object score.Logistic regression is linear separable problem A kind of disaggregated model for being easily achieved and haveing excellent performance is one of the most widely used disaggregated model.Building prediction first FunctionZ is the weighted sum of the degree of correlation and image quality information, and weight is variable.φ (Z) is predicted value.Then, will Target score maps between 0-1, by gradient descent method, so that predicted value approaches the mapping value of target score, to obtain Suitable weight.
S360, from candidate image set, determine that score meets envelope of the candidate image as file of the second preset requirement Face.
Optionally it is determined that score is more than or equal to cover of the candidate image of preset fraction as file;Alternatively, determining candidate Cover of the maximum candidate image of score as file in image collection.If score is more than or equal to the candidate image of preset fraction Have multiple, then therefrom chooses cover of any candidate image as file.
In the present embodiment, the information of the entity by being shown according to each candidate image is related to the content information of file Degree and corresponding image quality information, calculate the score of each candidate image, so that synthesis pertinence and image quality information, determine file Cover so that determine cover be bonded file content while, image quality is also preferable.
In the above embodiments, after the information of entity for obtaining display in each candidate image, further includes: according to each The information for the entity that candidate image is shown judges the legitimacy of corresponding candidate image.Correspondingly, from candidate image set, Determine the degree of correlation meet the first preset requirement candidate image as the cover of file include: from candidate image set, determination The degree of correlation meets the first preset requirement and cover of the legal candidate image as file.
Optionally, information unauthorized, such as Drug Reference, gambling information, violence information, pornography etc. are collected in advance. Information such as sporocarp is consistent with information unauthorized, then determines that corresponding candidate image is illegal;As sporocarp information with not Legal information is inconsistent, then determines that corresponding candidate image is legal.In turn, determine that the degree of correlation meets the first preset requirement and legal Cover of the candidate image as file, can guarantee the legitimacy for the cover chosen in this way.
Example IV
Fig. 4 is a kind of structure chart for cover determining device that the embodiment of the present invention four provides.The embodiment of the present invention is suitable for The case where determining the cover of file.As shown in figure 4, the device specifically includes: first, which obtains module 410, second, obtains module 420, computing module 430 and determining module 440.
First obtains module 410, for obtaining the candidate image set of file, and display is obtained from each candidate image The information of entity;
Second obtains module 420, for obtaining the content information of file;
Computing module 430, for calculating the related of the information for the entity that each candidate image is shown and the content information of file Degree;
Determining module 440, for from candidate image set, determining that the degree of correlation meets the candidate image of the first preset requirement Cover as file.
In the embodiment of the present invention, by obtaining the candidate image set of file, and display is obtained from each candidate image The information of entity obtains the content information of file, calculates the information for the entity that each candidate image is shown and the content information of file The degree of correlation, to obtain the information of the entity of candidate image and the correlation degree of content information;By from candidate image set In, determine that the degree of correlation meets cover of the candidate image of the first preset requirement as file, thus according to the entity of candidate image The correlation degree of information and content information determines cover, so that the main contents of the cover fitting file determined, realize picture and text phase Symbol;Moreover, determining the high cover of the degree of correlation by intelligence, the human cost for manually selecting cover, the high-quality envelope of selection can be saved Face is able to ascend user's clicking rate.
Further, the information of entity is the information of text type, and the content information of file is also the information of text type, Text is a kind of semantic space of higher-dimension, is capable of the information and content information of accurate characterization entity;Then, by calculating two texts The degree of correlation of the information of this type can relatively accurately obtain the correlation degree of candidate image and file, and then it is true to improve cover Fixed accuracy.
Optionally, first module 410 is obtained when obtaining the information of entity of display from each candidate image, it is specific to use In: it identifies the entity that each candidate image shows, obtains the description information of entity in each candidate image;By entity in each candidate image Description information or description information in the information of entity that is shown as each candidate image of keyword.Further, first Module 410 is obtained in the entity for identifying that each candidate image is shown, when obtaining the description information of entity in each candidate image, is specifically used In: each candidate image is input to entity recognition model, obtains the description information of entity in each candidate image, entity recognition model It includes at least: one of human face recognition model, article identification model and gesture recognition model.
Optionally, content information of the computing module 430 in the information and file that calculate the entity that each candidate image is shown When the degree of correlation, it is specifically used for: the content information of file is generated into the first text vector;The entity that each candidate image is shown Information generates the second text vector;According to the first text vector and each second text vector, the reality that each candidate image is shown is calculated The degree of correlation of the content information of the information and file of body.Further, computing module 430 is according to the first text vector and Ge Two text vectors are specifically used for when calculating the degree of correlation of the information for the entity that each candidate image is shown and the content information of file: The inverse for calculating the distance of the first text vector and each second text vector, the information of the entity shown as each candidate image with The degree of correlation of the content information of file.
Optionally it is determined that module 440 is in the candidate for from candidate image set, determining the degree of correlation the first preset requirement of satisfaction When cover of the image as file, it is specifically used for: determines that the degree of correlation is more than or equal to the candidate image of preset threshold as file Cover;Alternatively, determining cover of the maximum candidate image of the degree of correlation as file in candidate image set.
Optionally, which further includes that third obtains module, for obtaining the image quality information of each candidate image.Correspondingly, Determining module 44 from candidate image set, is determining envelope of the candidate image of the degree of correlation the first preset requirement of satisfaction as file When face, it is specifically used for: the degree of correlation of the content information of the information and file of the entity shown according to each candidate image, and it is corresponding Image quality information, calculate the score of each candidate image;From candidate image set, determine that score meets the time of the second preset requirement Select cover of the image as file.
Optionally, third obtains module when obtaining the image quality information of each candidate image, is specifically used for: by each candidate image Be input to image quality identification model, obtain the image quality information of candidate image, image quality identification model includes at least: aesthetics identifies mould One of type, clarity identification model and Text region model.
Optionally it is determined that module 440 is in the information of the entity shown according to each candidate image and the content information of file The degree of correlation and corresponding image quality information are specifically used for when calculating the score of each candidate image: showing to each candidate image The degree of correlation of the content information of the information and file of entity and corresponding image quality information are weighted summation, obtain each candidate The score of image.
Optionally, which further includes training module, for the information and file in the entity shown to each candidate image Content information the degree of correlation and corresponding image quality information be weighted before summation, obtain multiple image patterns and correspondence Target score;Obtain the degree of correlation and every image of the information for the entity that every image pattern is shown and the content information of file The image quality information of sample;The degree of correlation of the content information of the information and file of the entity shown by every cover sample and every The image quality information fit object score of cover sample, obtains the weight of the information of entity and the weight of image quality information.
Optionally, which further includes judgment module, for obtained from each candidate image display entity information it Afterwards, the information of the entity shown according to each candidate image judges the legitimacy of corresponding candidate image.Correspondingly, determining module 440 when from candidate image set, determining that the degree of correlation meets cover of the candidate image of the first preset requirement as file, tool Body is used for: from candidate image set, determining that the degree of correlation meets the first preset requirement and legal candidate image is as file Cover.
Optionally, the content information of file is the title content information of file.First, which obtains module 410, is obtaining file When candidate image set, it is specifically used for: extracts at least two video frames according to setpoint frequency from video file, schemes as candidate Image set closes.
It is true that cover provided by any embodiment of the invention can be performed in cover determining device provided by the embodiment of the present invention Determine method, has the corresponding functional module of execution method and beneficial effect.
Embodiment five
Fig. 5 is the structural schematic diagram for a kind of electronic equipment that the embodiment of the present invention five provides.Fig. 5, which is shown, to be suitable for being used in fact The block diagram of the example electronic device 12 of existing embodiment of the present invention.The electronic equipment 12 that Fig. 5 is shown is only an example, no The function and use scope for coping with the embodiment of the present invention bring any restrictions.
As shown in figure 5, electronic equipment 12 is showed in the form of universal computing device.The component of electronic equipment 12 may include But be not limited to: one or more processor or processing unit 16, system storage 28, connect different system components (including System storage 28 and processing unit 16) bus 18.
Bus 18 indicates one of a few class bus structures or a variety of, including memory bus or Memory Controller, Peripheral bus, graphics acceleration port, processor or the local bus using any bus structures in a variety of bus structures.It lifts For example, these architectures include but is not limited to industry standard architecture (ISA) bus, microchannel architecture (MAC) Bus, enhanced isa bus, Video Electronics Standards Association (VESA) local bus and peripheral component interconnection (PCI) bus.
Electronic equipment 12 typically comprises a variety of computer system readable media.These media can be it is any can be electric The usable medium that sub- equipment 12 accesses, including volatile and non-volatile media, moveable and immovable medium.
System storage 28 may include the computer system readable media of form of volatile memory, such as arbitrary access Memory (RAM) 30 and/or cache memory 32.Electronic equipment 12 may further include other removable/not removable Dynamic, volatile/non-volatile computer system storage medium.Only as an example, storage system 34 can be used for read and write can not Mobile, non-volatile magnetic media (Fig. 5 do not show, commonly referred to as " hard disk drive ").Although being not shown in Fig. 5, Ke Yiti For the disc driver for being read and write to removable non-volatile magnetic disk (such as " floppy disk "), and to moving non-volatile light The CD drive of disk (such as CD-ROM, DVD-ROM or other optical mediums) read-write.In these cases, each driver It can be connected by one or more data media interfaces with bus 18.Memory 28 may include that at least one program produces Product, the program product have one group of (for example, at least one) program module, these program modules are configured to perform of the invention each The function of embodiment.
Program/utility 40 with one group of (at least one) program module 42 can store in such as memory 28 In, such program module 42 include but is not limited to operating system, one or more application program, other program modules and It may include the realization of network environment in program data, each of these examples or certain combination.Program module 42 is usual Execute the function and/or method in embodiment described in the invention.
Electronic equipment 12 can also be with one or more external equipments 14 (such as keyboard, sensing equipment, display 24 etc.) Communication, can also be enabled a user to one or more equipment interact with the electronic equipment 12 communicate, and/or with make the electricity Any equipment (such as network interface card, modem etc.) that sub- equipment 12 can be communicated with one or more of the other calculating equipment Communication.This communication can be carried out by input/output (I/O) interface 22.Also, electronic equipment 12 can also be suitable by network Orchestration 20 and one or more network (such as local area network (LAN), wide area network (WAN) and/or public network, such as internet) Communication.As shown, network adapter 20 is communicated by bus 18 with other modules of equipment/terminal/server 12.It should be bright It is white, although not shown in the drawings, other hardware and/or software module can be used in conjunction with electronic equipment 12, including but not limited to: micro- Code, device driver, redundant processing unit, external disk drive array, RAID system, tape drive and data backup Storage system etc..
Processing unit 16 by the program that is stored in system storage 28 of operation, thereby executing various function application and Data processing, such as realize that cover provided by the embodiment of the present invention determines method.
Embodiment six
The embodiment of the present invention six additionally provides a kind of computer readable storage medium, is stored thereon with computer program, should Realize that cover provided by any embodiment of the present invention determines method when program is executed by processor.
The computer storage medium of the embodiment of the present invention, can be using any of one or more computer-readable media Combination.Computer-readable medium can be computer-readable signal media or computer readable storage medium.It is computer-readable Storage medium for example may be-but not limited to-the system of electricity, magnetic, optical, electromagnetic, infrared ray or semiconductor, device or Device, or any above combination.The more specific example (non exhaustive list) of computer readable storage medium includes: tool There are electrical connection, the portable computer diskette, hard disk, random access memory (RAM), read-only memory of one or more conducting wires (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber, portable compact disc read-only memory (CD- ROM), light storage device, magnetic memory device or above-mentioned any appropriate combination.In this document, computer-readable storage Medium can be any tangible medium for including or store program, which can be commanded execution system, device or device Using or it is in connection.
Computer-readable signal media may include in a base band or as carrier wave a part propagate data-signal, Wherein carry computer-readable program code.The data-signal of this propagation can take various forms, including but unlimited In electromagnetic signal, optical signal or above-mentioned any appropriate combination.Computer-readable signal media can also be that computer can Any computer-readable medium other than storage medium is read, which can send, propagates or transmit and be used for By the use of instruction execution system, device or device or program in connection.
The program code for including on computer-readable medium can transmit with any suitable medium, including --- but it is unlimited In wireless, electric wire, optical cable, RF etc. or above-mentioned any appropriate combination.
The computer for executing operation of the present invention can be write with one or more programming languages or combinations thereof Program code, programming language include object oriented program language-such as Java, Smalltalk, C++, are also wrapped Include conventional procedural programming language-such as " C " language or similar programming language.Program code can be complete Ground executes on the user computer, partly executes on the user computer, executing as an independent software package, partially existing Part executes on the remote computer or executes on a remote computer or server completely on subscriber computer.It is being related to In the situation of remote computer, remote computer can pass through the network of any kind --- including local area network (LAN) or wide area Net (WAN)-be connected to subscriber computer, or, it may be connected to outer computer (such as utilize ISP To be connected by internet).
Note that the above is only a better embodiment of the present invention and the applied technical principle.It will be appreciated by those skilled in the art that The invention is not limited to the specific embodiments described herein, be able to carry out for a person skilled in the art it is various it is apparent variation, It readjusts and substitutes without departing from protection scope of the present invention.Therefore, although being carried out by above embodiments to the present invention It is described in further detail, but the present invention is not limited to the above embodiments only, without departing from the inventive concept, also It may include more other equivalent embodiments, and the scope of the invention is determined by the scope of the appended claims.

Claims (15)

1. a kind of cover determines method characterized by comprising
The candidate image set of file is obtained, and obtains from each candidate image the information of the entity of display;
Obtain the content information of the file;
Calculate the degree of correlation of the information for the entity that each candidate image is shown and the content information of file;
From candidate image set, determine that the degree of correlation meets cover of the candidate image of the first preset requirement as the file.
2. the method according to claim 1, wherein the letter of the entity for obtaining display from each candidate image Breath, comprising:
It identifies the entity that each candidate image is shown, obtains the description information of entity in each candidate image;
The reality that keyword in the description information or description information of entity in each candidate image is shown as each candidate image The information of body.
3. according to the method described in claim 2, it is characterized in that, the entity that each candidate image of identification is shown, obtains each The description information of entity in candidate image, comprising:
Each candidate image is input to entity recognition model, obtains the description information of entity in each candidate image, the entity is known Other model includes at least: one of human face recognition model, article identification model and gesture recognition model.
4. the method according to claim 1, wherein the information for calculating the entity that each candidate image show and The degree of correlation of the content information of file, comprising:
The content information of the file is generated into the first text vector;
The information for the entity that each candidate image is shown generates the second text vector;
According to first text vector and each second text vector, the information and file of the entity that each candidate image is shown are calculated Content information the degree of correlation.
5. according to the method described in claim 4, it is characterized in that, described according to first text vector and each second text Vector calculates the degree of correlation of the information for the entity that each candidate image is shown and the content information of file, comprising:
Calculate the inverse of the distance of first text vector and each second text vector, the entity shown as each candidate image Information and file content information the degree of correlation.
6. determining that the degree of correlation meets the method according to claim 1, wherein described from candidate image set Cover of the candidate image of first preset requirement as the file, comprising:
Determine that the degree of correlation is more than or equal to cover of the candidate image of preset threshold as the file;Alternatively,
Determine cover of the maximum candidate image of the degree of correlation as the file in candidate image set.
7. method according to claim 1-6, which is characterized in that the method also includes:
Obtain the image quality information of each candidate image;
Correspondingly, determining that the degree of correlation meets the candidate image of the first preset requirement as the file from candidate image set Cover include:
The degree of correlation and corresponding image quality of the content information of the information and file of the entity shown according to each candidate image are believed Breath, calculates the score of each candidate image;
From candidate image set, determine that score meets cover of the candidate image of the second preset requirement as the file.
8. the method according to the description of claim 7 is characterized in that the image quality information for obtaining each candidate image, comprising:
Each candidate image is input to image quality identification model, obtains the image quality information of the candidate image, the image quality identifies mould Type includes at least: one of aesthetics identification model, clarity identification model and Text region model.
9. the method according to the description of claim 7 is characterized in that the information of the entity shown according to each candidate image with The degree of correlation of the content information of file and corresponding image quality information, calculate the score of each candidate image, comprising:
The degree of correlation and corresponding image quality information of the content information of the information and file for the entity that each candidate image is shown into Row weighted sum obtains the score of each candidate image.
10. according to the method described in claim 9, it is characterized in that, the entity that each candidate image is shown information and text The degree of correlation of the content information of part and corresponding image quality information are weighted before summation, further includes:
Obtain multiple image patterns and corresponding target score;
Obtain the degree of correlation and every image pattern of the information for the entity that every image pattern is shown and the content information of file Image quality information;
The degree of correlation and every cover sample of the content information of the information and file of the entity shown by every cover sample Image quality information fit object score, obtain the weight of the information of entity and the weight of image quality information.
11. method according to claim 1-6, which is characterized in that obtain the reality of display from each candidate image After the information of body, further includes:
According to the information for the entity that each candidate image is shown, the legitimacy of corresponding candidate image is judged;
Correspondingly, determining that the degree of correlation meets the candidate image of the first preset requirement as the file from candidate image set Cover include:
From candidate image set, determine that the degree of correlation meets the first preset requirement and legal candidate image is as the file Cover.
12. method according to claim 1-6, which is characterized in that the content information of the file is the text The title content information of part, also, the candidate image set for obtaining file includes:
At least two video frames are extracted according to setpoint frequency from video file, as candidate image set.
13. a kind of cover determining device characterized by comprising
First obtains module, for obtaining the candidate image set of file, and obtain from each candidate image the entity of display Information;
Second obtains module, for obtaining the content information of the file;
Computing module, for calculating the degree of correlation of the information for the entity that each candidate image is shown and the content information of file;
Determining module, for from candidate image set, determining that the degree of correlation meets the candidate image of the first preset requirement as institute State the cover of file.
14. a kind of electronic equipment, which is characterized in that the electronic equipment includes:
One or more processors;
Memory, for storing one or more programs,
When one or more of programs are executed by one or more of processors, so that one or more of processors are real Now the cover as described in any in claim 1-12 determines method.
15. a kind of computer readable storage medium, is stored thereon with computer program, which is characterized in that the program is by processor Realize that the cover as described in any in claim 1-12 determines method when execution.
CN201811140229.XA 2018-09-28 2018-09-28 Cover determines method, apparatus, equipment and readable storage medium storing program for executing Pending CN109271542A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811140229.XA CN109271542A (en) 2018-09-28 2018-09-28 Cover determines method, apparatus, equipment and readable storage medium storing program for executing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811140229.XA CN109271542A (en) 2018-09-28 2018-09-28 Cover determines method, apparatus, equipment and readable storage medium storing program for executing

Publications (1)

Publication Number Publication Date
CN109271542A true CN109271542A (en) 2019-01-25

Family

ID=65199086

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811140229.XA Pending CN109271542A (en) 2018-09-28 2018-09-28 Cover determines method, apparatus, equipment and readable storage medium storing program for executing

Country Status (1)

Country Link
CN (1) CN109271542A (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109918513A (en) * 2019-03-12 2019-06-21 北京百度网讯科技有限公司 Image processing method, device, server and storage medium
CN109996091A (en) * 2019-03-28 2019-07-09 苏州八叉树智能科技有限公司 Generate method, apparatus, electronic equipment and the computer readable storage medium of video cover
CN110457523A (en) * 2019-08-12 2019-11-15 腾讯科技(深圳)有限公司 The choosing method of cover picture, the training method of model, device and medium
CN111143613A (en) * 2019-12-30 2020-05-12 携程计算机技术(上海)有限公司 Method, system, electronic device and storage medium for selecting video cover
CN111918130A (en) * 2020-08-11 2020-11-10 北京达佳互联信息技术有限公司 Video cover determining method and device, electronic equipment and storage medium
CN112231504A (en) * 2020-09-30 2021-01-15 北京三快在线科技有限公司 Method and device for determining cover picture, electronic equipment and storage medium
CN112561334A (en) * 2020-12-16 2021-03-26 咪咕文化科技有限公司 Grading method and device for reading object, electronic equipment and storage medium
CN112613293A (en) * 2020-12-29 2021-04-06 北京中科闻歌科技股份有限公司 Abstract generation method and device, electronic equipment and storage medium
CN112866801A (en) * 2021-03-11 2021-05-28 北京小米移动软件有限公司 Video cover determining method and device, electronic equipment and storage medium
CN113157963A (en) * 2021-04-28 2021-07-23 维沃移动通信有限公司 Image screening method, device electronic equipment and readable storage medium
CN113254696A (en) * 2021-06-30 2021-08-13 腾讯科技(深圳)有限公司 Cover image acquisition method and device
CN113343832A (en) * 2021-06-01 2021-09-03 北京奇艺世纪科技有限公司 Video cover judging method, device, equipment and computer readable medium
CN113591513A (en) * 2020-04-30 2021-11-02 北京字节跳动网络技术有限公司 Method and apparatus for processing image

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060140469A1 (en) * 2003-05-15 2006-06-29 Warren Joel E Method for communicating and matching electronic files for financial transactions
CN105389593A (en) * 2015-11-16 2016-03-09 上海交通大学 Image object recognition method based on SURF
CN106503693A (en) * 2016-11-28 2017-03-15 北京字节跳动科技有限公司 The offer method and device of video front cover
CN106973324A (en) * 2017-03-28 2017-07-21 深圳市茁壮网络股份有限公司 A kind of poster generation method and device
CN107832725A (en) * 2017-11-17 2018-03-23 北京奇虎科技有限公司 Video front cover extracting method and device based on evaluation index
CN108062533A (en) * 2017-12-28 2018-05-22 北京达佳互联信息技术有限公司 Analytic method, system and the mobile terminal of user's limb action
CN108415961A (en) * 2018-02-06 2018-08-17 厦门集微科技有限公司 A kind of advertising pictures recommendation method and device

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060140469A1 (en) * 2003-05-15 2006-06-29 Warren Joel E Method for communicating and matching electronic files for financial transactions
CN105389593A (en) * 2015-11-16 2016-03-09 上海交通大学 Image object recognition method based on SURF
CN106503693A (en) * 2016-11-28 2017-03-15 北京字节跳动科技有限公司 The offer method and device of video front cover
CN106973324A (en) * 2017-03-28 2017-07-21 深圳市茁壮网络股份有限公司 A kind of poster generation method and device
CN107832725A (en) * 2017-11-17 2018-03-23 北京奇虎科技有限公司 Video front cover extracting method and device based on evaluation index
CN108062533A (en) * 2017-12-28 2018-05-22 北京达佳互联信息技术有限公司 Analytic method, system and the mobile terminal of user's limb action
CN108415961A (en) * 2018-02-06 2018-08-17 厦门集微科技有限公司 A kind of advertising pictures recommendation method and device

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109918513A (en) * 2019-03-12 2019-06-21 北京百度网讯科技有限公司 Image processing method, device, server and storage medium
US11475588B2 (en) 2019-03-12 2022-10-18 Beijing Baidu Netcom Science And Technology Co., Ltd. Image processing method and device for processing image, server and storage medium
CN109996091A (en) * 2019-03-28 2019-07-09 苏州八叉树智能科技有限公司 Generate method, apparatus, electronic equipment and the computer readable storage medium of video cover
CN110457523A (en) * 2019-08-12 2019-11-15 腾讯科技(深圳)有限公司 The choosing method of cover picture, the training method of model, device and medium
CN110457523B (en) * 2019-08-12 2022-03-08 腾讯科技(深圳)有限公司 Cover picture selection method, model training method, device and medium
CN111143613A (en) * 2019-12-30 2020-05-12 携程计算机技术(上海)有限公司 Method, system, electronic device and storage medium for selecting video cover
CN111143613B (en) * 2019-12-30 2024-02-06 携程计算机技术(上海)有限公司 Method, system, electronic device and storage medium for selecting video cover
CN113591513A (en) * 2020-04-30 2021-11-02 北京字节跳动网络技术有限公司 Method and apparatus for processing image
CN113591513B (en) * 2020-04-30 2024-03-29 北京字节跳动网络技术有限公司 Method and apparatus for processing image
CN111918130A (en) * 2020-08-11 2020-11-10 北京达佳互联信息技术有限公司 Video cover determining method and device, electronic equipment and storage medium
CN112231504A (en) * 2020-09-30 2021-01-15 北京三快在线科技有限公司 Method and device for determining cover picture, electronic equipment and storage medium
CN112561334A (en) * 2020-12-16 2021-03-26 咪咕文化科技有限公司 Grading method and device for reading object, electronic equipment and storage medium
CN112613293A (en) * 2020-12-29 2021-04-06 北京中科闻歌科技股份有限公司 Abstract generation method and device, electronic equipment and storage medium
CN112866801A (en) * 2021-03-11 2021-05-28 北京小米移动软件有限公司 Video cover determining method and device, electronic equipment and storage medium
CN112866801B (en) * 2021-03-11 2023-10-03 北京小米移动软件有限公司 Video cover determining method and device, electronic equipment and storage medium
CN113157963A (en) * 2021-04-28 2021-07-23 维沃移动通信有限公司 Image screening method, device electronic equipment and readable storage medium
CN113343832A (en) * 2021-06-01 2021-09-03 北京奇艺世纪科技有限公司 Video cover judging method, device, equipment and computer readable medium
CN113343832B (en) * 2021-06-01 2024-02-02 北京奇艺世纪科技有限公司 Video cover distinguishing method, device, equipment and computer readable medium
CN113254696A (en) * 2021-06-30 2021-08-13 腾讯科技(深圳)有限公司 Cover image acquisition method and device

Similar Documents

Publication Publication Date Title
CN109271542A (en) Cover determines method, apparatus, equipment and readable storage medium storing program for executing
CN110287479B (en) Named entity recognition method, electronic device and storage medium
CN109117777B (en) Method and device for generating information
CN111062871B (en) Image processing method and device, computer equipment and readable storage medium
US10642887B2 (en) Multi-modal image ranking using neural networks
US7853582B2 (en) Method and system for providing information services related to multimodal inputs
CN106776503B (en) Text semantic similarity determination method and device
CN111198958A (en) Method, device and terminal for matching background music
US20190065492A1 (en) Zero-shot event detection using semantic embedding
CN109657054A (en) Abstraction generating method, device, server and storage medium
CN103988202A (en) Image attractiveness based indexing and searching
CN109660865B (en) Method and device for automatically labeling videos, medium and electronic equipment
US9639633B2 (en) Providing information services related to multimodal inputs
WO2023197749A9 (en) Background music insertion time point determining method and apparatus, device, and storage medium
CN112085120B (en) Multimedia data processing method and device, electronic equipment and storage medium
CN113515669A (en) Data processing method based on artificial intelligence and related equipment
CN110263218A (en) Video presentation document creation method, device, equipment and medium
CN112995757B (en) Video clipping method and device
CN114358203A (en) Training method and device for image description sentence generation module and electronic equipment
CN111783734A (en) Original edition video identification method and device
CN104462151B (en) Assess the method and relevant apparatus of Homepage Publishing time
CN116955707A (en) Content tag determination method, device, equipment, medium and program product
CN114329049A (en) Video search method and device, computer equipment and storage medium
CN115439922A (en) Object behavior identification method, device, equipment and medium
CN113642536A (en) Data processing method, computer device and readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination