WO2022156534A1 - Video quality assessment method and device - Google Patents

Video quality assessment method and device Download PDF

Info

Publication number
WO2022156534A1
WO2022156534A1 PCT/CN2022/070276 CN2022070276W WO2022156534A1 WO 2022156534 A1 WO2022156534 A1 WO 2022156534A1 CN 2022070276 W CN2022070276 W CN 2022070276W WO 2022156534 A1 WO2022156534 A1 WO 2022156534A1
Authority
WO
WIPO (PCT)
Prior art keywords
video frame
frame sequence
sub
score
scoring
Prior art date
Application number
PCT/CN2022/070276
Other languages
French (fr)
Chinese (zh)
Inventor
周芳汝
安山
Original Assignee
北京沃东天骏信息技术有限公司
北京京东世纪贸易有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京沃东天骏信息技术有限公司, 北京京东世纪贸易有限公司 filed Critical 北京沃东天骏信息技术有限公司
Publication of WO2022156534A1 publication Critical patent/WO2022156534A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/90Determination of colour characteristics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N17/00Diagnosis, testing or measuring for television systems or their details
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]

Definitions

  • the present application relates to the field of computer technologies, in particular to the field of video processing technologies, and in particular, to a video quality assessment method and apparatus.
  • Video quality is evaluated.
  • Embodiments of the present application provide a video quality assessment method, apparatus, device, and storage medium.
  • an embodiment of the present application provides a video quality assessment method, the method includes: acquiring a video frame sequence to be evaluated; and determining at least one sub-video frame sequence from the to-be-evaluated video frame sequence; It is a continuous video frame whose color mean gradient of a group of video frames satisfies the preset condition; according to the preset scoring rule, each sub-video frame sequence is scored, and the preset scoring rule is associated with the attribute information of each sub-video frame sequence ; According to the score of each sub-video frame sequence, the quality of the video frame sequence to be evaluated is evaluated.
  • the preset scoring rules include at least one of the following: a first scoring rule associated with the average duration of each sub-video frame sequence; a second scoring associated with the target display object of each sub-video frame sequence rule.
  • the preset scoring rule includes: a first scoring rule and a second scoring rule, and scoring each sub-video frame sequence according to the preset scoring rule includes: scoring each sub-video frame sequence according to the first scoring rule The video frame sequence is scored to obtain the first score of each sub-video frame sequence; the sub-video frame sequence is scored according to the second scoring rule to obtain the second score of each sub-video frame sequence; The second score is to score each sub-video frame sequence.
  • the second scoring rule includes at least one of the following: a third scoring rule associated with the frequency with which the target presentation object of each sub-video frame sequence appears in the corresponding sub-video frame sequence; a third scoring rule associated with each sub-video frame sequence The target shows the fourth scoring rule associated with the area of the object in the corresponding sub-video frame sequence.
  • the preset scoring rule includes a third scoring rule and a fourth scoring rule
  • scoring each sub-video frame sequence according to the preset scoring rule includes: scoring each sub-video frame sequence according to the third scoring rule The frame sequence is scored to obtain the third score of each sub-video frame sequence; the sub-video frame sequence is scored according to the fourth scoring rule to obtain the fourth score of each sub-video frame sequence; according to the third score and the fourth score Score, score each sub-video frame sequence.
  • an embodiment of the present application provides an apparatus for evaluating video quality, the apparatus including: an obtaining module configured to obtain a sequence of video frames to be evaluated; a determining module configured to determine from the sequence of video frames to be evaluated At least one sub-video frame sequence is obtained, and the sub-video frame sequence is a continuous video frame whose color value gradients of a group of video frames meet preset conditions; the scoring module is configured to be configured according to preset scoring rules. The frame sequence is scored, and the preset scoring rule is associated with attribute information of each sub-video frame sequence; the evaluation module is configured to perform quality evaluation on the video frame sequence to be evaluated according to the score of each sub-video frame sequence.
  • the preset scoring rules include at least one of the following: a first scoring rule associated with the average duration of each sub-video frame sequence; a second scoring associated with the target display object of each sub-video frame sequence rule.
  • the preset scoring rule includes a first scoring rule and a second scoring rule
  • the scoring module is further configured to: score each sub-video frame sequence according to the first scoring rule, and obtain the score of each sub-video frame sequence. First scoring; scoring each sub video frame sequence according to the second scoring rule to obtain a second score of each sub video frame sequence; scoring each sub video frame sequence according to the first score and the second score.
  • the second scoring rule includes at least one of the following: a third scoring rule associated with the frequency with which the target presentation object of each sub-video frame sequence appears in the corresponding sub-video frame sequence; a third scoring rule associated with each sub-video frame sequence The target shows the fourth scoring rule associated with the area of the object in the corresponding sub-video frame sequence.
  • the preset scoring rules include a third scoring rule and a fourth scoring rule
  • the scoring module is further configured to: score each sub-video frame sequence according to the third scoring rule, and obtain the score of each sub-video frame sequence. the third score; score each sub-video frame sequence according to the fourth scoring rule to obtain a fourth score of each sub-video frame sequence; score each sub-video frame sequence according to the third score and the fourth score.
  • an embodiment of the present application provides an electronic device, the electronic device includes at least one processor; and a memory communicatively connected to the at least one processor; wherein the memory stores instructions executable by the at least one processor , the instruction is executed by at least one processor, so that when the at least one processor is executed, the video quality assessment method according to any embodiment of the first aspect can be implemented.
  • an embodiment of the present application provides a non-transitory computer-readable storage medium storing computer instructions, where the computer instructions are used to enable a computer to implement the video quality assessment according to any embodiment of the first aspect. method.
  • an embodiment of the present application provides a computer program product including a computer program, the computer program can implement the video quality assessment method according to any embodiment of the first aspect when the computer program is executed by a processor.
  • the present application obtains the video frame sequence to be evaluated; determines at least one sub-video frame sequence from the video frame sequence to be evaluated; scores each sub-video frame sequence according to a preset scoring rule; Scoring, to evaluate the quality of the video frame sequence to be evaluated, that is, first divide the video frame sequence to be evaluated into different pictures, and then score each picture according to the attribute information of each picture, and then evaluate the video quality, which effectively improves the quality of the video. Reasonableness and validity of quality assessment.
  • FIG. 1 is an exemplary system architecture diagram to which the present application can be applied;
  • FIG. 2 is a flowchart of an embodiment of a video quality assessment method according to the present application.
  • FIG. 3 is a schematic diagram of an application scenario of the video quality assessment method according to the present application.
  • FIG. 4 is a flowchart of another embodiment of a video quality assessment method according to the present application.
  • FIG. 5 is a schematic diagram of an embodiment of a video quality assessment apparatus according to the present application.
  • FIG. 6 is a schematic structural diagram of a computer system suitable for implementing the server of the embodiment of the present application.
  • FIG. 1 illustrates an exemplary system architecture 100 to which embodiments of the video quality assessment method of the present application may be applied.
  • the system architecture 100 may include terminal devices 101 , 102 , and 103 , a network 104 and a server 105 .
  • the network 104 is a medium used to provide a communication link between the terminal devices 101 , 102 , 103 and the server 105 .
  • the network 104 may include various connection types, such as wired, wireless communication links, or fiber optic cables, among others.
  • the terminal devices 101, 102, and 103 interact with the server 105 through the network 104 to receive or send messages and the like.
  • Various communication client applications may be installed on the terminal devices 101 , 102 and 103 , for example, video playback applications, communication applications, and the like.
  • the terminal devices 101, 102, and 103 may be hardware or software.
  • the terminal devices 101, 102, and 103 can be various electronic devices with display screens, including but not limited to mobile phones and notebook computers.
  • the terminal devices 101, 102, and 103 are software, they can be installed in the electronic devices listed above. It can be implemented as a plurality of software or software modules (for example, to provide video quality assessment services), or can be implemented as a single software or software module. There is no specific limitation here.
  • the server 105 may be a server that provides various services, for example, acquiring the video frame sequence to be evaluated; determining at least one sub-video frame sequence from the video frame sequence to be evaluated; Score; according to the score of each sub-video frame sequence, perform quality assessment on the video frame sequence to be evaluated.
  • the server 105 may be hardware or software.
  • the server 105 can be implemented as a distributed server cluster composed of multiple servers, or can be implemented as a single server.
  • the server is software, it can be implemented as a plurality of software or software modules (for example, used to provide a video quality assessment service), or can be implemented as a single software or software module. There is no specific limitation here.
  • each part (for example, each unit, sub-unit, module, sub-module) included in the video quality evaluation apparatus may be all set in the server 105, or all may be set in the terminal devices 101, 102, 103, or may be set separately in the server 105 and the terminal devices 101, 102, and 103.
  • terminal devices, networks and servers in FIG. 1 are merely illustrative. There can be any number of terminal devices, networks and servers according to implementation needs.
  • FIG. 2 shows a schematic flowchart 200 of an embodiment of a video quality assessment method that can be applied to the present application.
  • the video quality assessment method includes the following steps:
  • Step 201 acquiring a sequence of video frames to be evaluated.
  • the execution subject can obtain the video frame sequence to be evaluated locally, or can obtain the video frame sequence to be evaluated from a remote video database in a wired or wireless manner.
  • the server obtains the video frame sequence to be evaluated, which is not limited in this application.
  • Step 202 Determine at least one sub-video frame sequence from the to-be-evaluated video frame sequence.
  • the executing subject may determine at least one sub-video frame sequence from the to-be-evaluated video frame sequence, and the sub-video frame sequence is a group of video frames whose color mean gradient satisfies a predetermined Conditioned consecutive video frames.
  • the preset condition may be whether the color mean gradient of the video frame is greater than or equal to a preset color mean gradient threshold, whether it satisfies the preset color mean gradient threshold range, and the like.
  • the color mean gradient threshold and the color mean gradient threshold range can be determined according to experience, actual needs and specific application scenarios, which are not limited in this application.
  • the color mean gradient of the kth frame is denoted by Dk
  • Step 203 Score each sub-video frame sequence according to a preset scoring rule.
  • the execution body may further score each video frame sequence according to a preset scoring rule.
  • the preset scoring rule and the attribute information of each sub-video frame sequence Correlation that is, the preset scoring rule is set according to the attribute information of each sub-video frame.
  • the attribute information of each sub-video frame sequence may include the target presentation object corresponding to each sub-video frame sequence, the average presentation duration of each sub-video frame sequence, the number of presentation objects included in each sub-video frame sequence, and the like.
  • the preset scoring rule may be set according to one or more of the attribute information of each sub-video frame sequence, which is not limited in this application.
  • the target display object is the target object that each sub-video frame sequence mainly wants to display.
  • the execution subject can determine the target display object according to the area of each display object in the sub-video frame sequence, or can also determine the target display object according to the area of each display object in the sub-video frame sequence.
  • the frequency determines the target display object, which is not limited in this application.
  • the execution body may determine the display object corresponding to the category with the highest occurrence frequency in the display object category set corresponding to each sub-video frame sequence as the target display object, wherein the display object category set is composed of each sub-video frame sequence.
  • the category to which the display object of the video frame belongs if the determined category to which the display object of each video frame belongs has the same category, the same category is deleted, and only one of the same categories is retained.
  • the corresponding display object category is The collection of display object categories is If the same category exists in M n display objects, There are C n different categories of display objects in the nth video frame, then the category set after deduplication of L n is: C n ⁇ Mn .
  • the display object category l i ′ whose frequency is p i
  • the category of the target impression object is Its index is i *
  • the category in the sub-video frame sequence is The display object of is the target display object.
  • the execution subject can use the existing technology or future development technology to detect the category of the display object in the video frame, for example, SSD (Single Shot MultiBox Detector, one-stage multi-box detection algorithm), R-CNN (Region -based Convolution Neural Networks, region-based convolutional neural network algorithm), etc., to detect the categories of display objects contained in each video frame in each sub-video frame sequence.
  • SSD Single Shot MultiBox Detector, one-stage multi-box detection algorithm
  • R-CNN Region -based Convolution Neural Networks, region-based convolutional neural network algorithm
  • the sub-video frame sequence M includes video frame A (contained display objects are two people and a dog) and video frame B (contained display objects are one person and three vehicles).
  • a and video frame B are detected, and it is obtained that the categories to which the display objects included in video frame A belong are people and animals, and the categories to which the display objects included in video frame B belong are people and cars, so the display corresponding to the sub-video frame sequence M
  • the set of object categories is ⁇ person, animal, person, car ⁇ .
  • the execution subject may determine the display objects (two persons in the video frame A and one person in the video frame B) corresponding to the persons in the presentation object category set as the target presentation objects.
  • the preset scoring rules include at least one of the following scoring sub-rules: a first scoring rule associated with the average duration of each sub-video frame sequence; a target display object associated with each sub-video frame sequence The second scoring rule of the association.
  • the execution subject scores each sub-video frame sequence according to at least one of the first scoring rule and the second scoring rule.
  • the first scoring rule is associated with the average duration of each sub-video frame sequence, that is, the first scoring rule is set according to the average duration of each sub-video frame sequence.
  • the manner in which the execution subject scores each sub-video frame sequence according to the first scoring rule may be to score each sub-video frame sequence according to a preset duration scoring comparison table, or may be based on whether the average duration of each sub-video frame sequence is Each sub-video frame sequence is scored if the preset duration threshold range is satisfied, which is not limited in this application.
  • the preset duration threshold range may be determined according to experience, actual needs and specific application scenarios, which is not limited in this application.
  • Q sub-video frame sequences are determined from the to-be-evaluated video frame sequence L, the frame rate of the to-be-evaluated video frame sequence L is f, and the number of frames of the qth (value range of 1 to Q) sub-video frame sequence is F q , then the average duration MT of each sub-video frame sequence is
  • the execution subject can determine whether the MT meets the preset duration threshold range, and if so, the score is 1, and if not, the score is 0.
  • the second scoring rule is associated with the target display object of each sub-video frame sequence, that is, the second scoring rule is set according to the target display object of each sub-video frame sequence.
  • the second scoring rule may be set according to the target display object of each sub-video frame sequence in a variety of ways, for example, it may be set according to the type of the target display object of each sub-video frame sequence; The frequency of the target display object appearing in the corresponding sub video frame sequence is set, and the setting is based on the area occupied by the target display object of each sub video frame sequence in the corresponding sub video frame sequence, etc. This application does not limit this. .
  • the execution subject may score each sub-video frame sequence according to the second scoring rule in various ways, for example, scoring each sub-video frame sequence according to a preset target display object category scoring comparison table; according to each sub-video frame sequence; Whether the frequency of the target display object of the video frame sequence in the corresponding sub-video sequence frame meets the preset frequency threshold range to score each sub-video frame sequence; according to the area of the target display object of each sub-video frame sequence and the target display Whether the ratio of the area of the video frame of the object satisfies the preset ratio threshold range is used to score each sub-video frame sequence, etc., which is not limited in this application.
  • the video frame sequence to be evaluated includes a sub video frame sequence A and a sub video frame sequence B, wherein the target display object of the sub video frame sequence A is a person; and the target display object of the sub video frame sequence B is an item, and the preset
  • the scoring rule is: when the target display object is a person, the corresponding sub-video frame sequence is scored as 1 point; when the target display object is an item, the corresponding sub-video frame sequence is scored as 0 points, so the video frame sequence to be evaluated is scored as 0 points.
  • the score of sub-video frame sequence A is 1 point
  • the score of sub-video frame sequence B is 0 point.
  • the execution subject will score each sub-video frame sequence according to the first scoring rule and the second scoring rule.
  • the first scoring rule is associated with the average duration of each sub video frame sequence
  • the second scoring rule is associated with the category of the target display object of each sub video frame sequence
  • the video frame sequence to be evaluated includes the sub video frame sequence A and sub-video frame sequence B, wherein, the target display object of sub-video frame sequence A is a person, and the average duration is 5s; while the target display object of sub-video frame sequence B is an item, the average duration is 5s
  • the preset scoring rule is: when When the target display object is a person and the average duration is greater than or equal to 3s, the score of the corresponding sub-video frame sequence is 1 point; otherwise, the score of the corresponding sub-video frame sequence is 0 point, so the sub-video frame sequence A in the video frame sequence to be evaluated The score is 1 point, and the score of the sub-video frame sequence B is 0 point.
  • each sub-video frame sequence is scored according to the first scoring rule and/or the second scoring rule, and then the quality of the video frame sequence to be evaluated is evaluated according to the score of each sub-video frame sequence, fully considering each sub-video frame sequence.
  • the average duration in the attribute information of the frame sequence and/or the influence of the target display object on the video quality effectively improves the accuracy and validity of the evaluated video quality.
  • Step 204 according to the score of each sub-video frame sequence, perform quality assessment on the video frame sequence to be evaluated.
  • the execution body may evaluate the video frame sequence to be evaluated according to the score of each sub-video frame sequence and the corresponding weight coefficient.
  • the weight coefficients respectively corresponding to the first score and the second score may be determined according to experience, actual needs and specific application scenarios, which are not limited in this application.
  • FIG. 3 is a schematic diagram of an application scenario of the video quality method according to this embodiment.
  • the execution body 301 acquires the video frame sequence 302 to be evaluated, and determines three sub-video frame sequences from the to-be-evaluated video frame sequence 302, and the sub-video frame sequence is the color average value of a group of video frames
  • the gradient satisfies a preset condition (for example, less than a preset color mean gradient threshold) for continuous video frames
  • the three sub-video frame sequences are respectively sub-video frame sequence A303 (including the number of display objects), sub-video frame sequence B304 (including The number of display objects is 10) and the sub-video frame sequence C305 (including the number of display objects is 20); according to the preset scoring rules, each sub-video frame sequence is scored to obtain the score of each sub-video frame sequence, that is, the sub-video
  • the score 306 of the frame A, the score 307 of the sub video frame sequence B, and the score 308 of the sub video frame sequence C wherein the preset scoring rules are associated with the attribute information of each sub video frame sequence, and the
  • the video quality evaluation method of the present disclosure obtains the video frame sequence to be evaluated; determines at least one sub-video frame sequence from the video frame sequence to be evaluated; scores each sub-video frame sequence according to a preset scoring rule; The score of each sub-video frame sequence is used to evaluate the quality of the video frame sequence to be evaluated, which effectively improves the rationality and effectiveness of evaluating the video quality.
  • FIG. 4 shows a flow 400 of yet another embodiment of the video quality assessment method shown in FIG. 2 .
  • the preset scoring rules include a first scoring rule and a second scoring rule.
  • the process 400 of the video quality assessment method of the present embodiment may include the following steps:
  • Step 401 acquiring a sequence of video frames to be evaluated.
  • step 401 for the implementation details and technical effects of step 401, reference may be made to the description of step 201, and details are not repeated here.
  • Step 402 Determine at least one sub-video frame sequence based on the color mean gradient of each video frame in the to-be-evaluated video frame sequence.
  • step 402 can be referred to the description of step 202, which will not be repeated here.
  • Step 403 Score each sub-video frame sequence according to the first scoring rule to obtain a first score of each sub-video frame sequence.
  • the way for the execution subject to score each sub-video frame sequence according to the first scoring rule may be to score each sub-video frame sequence according to a preset time-length scoring comparison table, or to score each sub-video frame sequence according to Whether the average duration of each sub-video frame sequence meets the preset duration threshold range, the sub-video frame sequence is scored, which is not limited in this application.
  • Step 404 Score each sub-video frame sequence according to the second scoring rule to obtain a second score of each sub-video frame sequence.
  • the execution subject may score each sub-video frame sequence according to the second scoring rule in a variety of ways, for example, scoring each sub-video frame sequence according to a preset target display object category scoring comparison table; Each sub-video frame sequence is scored according to whether the frequency of occurrence of the target display object of each sub-video frame sequence meets the preset frequency threshold range; according to the area occupied by the target display object of each sub-video frame sequence and the video containing the target display object Whether the ratio of the area of the frame satisfies the preset ratio threshold range is used to score each sub-video frame sequence, etc., which is not limited in this application.
  • the second scoring rule includes at least one of the following: a third scoring rule associated with the frequency of occurrence of the target display object of each sub-video frame sequence; Area is associated with the fourth scoring rule.
  • the execution subject scores each sub-video frame sequence according to at least one of the third scoring rule and the fourth scoring rule.
  • the third scoring rule is associated with the frequency of the target display object of each sub-video frame sequence appearing in the corresponding sub-video frame sequence, that is, the third scoring rule is based on the target display object of each sub-video frame sequence in the corresponding sub-video frame sequence. The frequency setting that appears in the frame sequence.
  • the frequency at which the target display object of each sub-video frame sequence appears in the corresponding sub-video frame sequence may be the frequency at which the target display object appears in the corresponding sub-video frame sequence, or the type of the target display object in the corresponding sub-video frame.
  • the frequency of occurrence in the set of display object types corresponding to the sequence is not limited in this application.
  • the third scoring rule associated with the frequency of occurrence of the target display object may be a preset frequency score comparison table, or may be based on whether the frequency of the target display object of each sub-video sequence appearing in the corresponding sub-video frame sequence satisfies the preset frequency The frequency threshold range of the sub-video frame sequence is scored.
  • the display object category set corresponding to the sub-video frame sequence M is ⁇ person, person, person, animal ⁇
  • the category of the target display object is human, and the frequency of people appearing in the display object category set is 0.75
  • the display object category set corresponding to the sequence N is ⁇ car, car, animal ⁇ , the target display object category is car, and the frequency of the car in the display object category set is 0.67.
  • the third scoring rule is that if the preset frequency threshold range is met, the score is 1, and if the preset frequency threshold range is not met, the score is 0, according to In the third scoring rule, the score of the sub-video frame sequence M is 1, and the score of the sub-video frame sequence N is 0.
  • the fourth scoring rule is associated with the area of the target display object of each sub video frame sequence in the corresponding sub video frame sequence, that is, the fourth scoring rule is set according to the target display area of each sub video frame sequence.
  • the fourth scoring rule associated with the area of the target display object in each sub-video frame sequence may be the first score between the total area of the target display object in each sub-video frame sequence and the total area of the video frame containing the target display object
  • the fourth scoring rule associated with the ratio can also be the sum of the area of the target display object with the largest area in each video frame containing the target display object in each sub-video frame sequence and the total area of the video frame containing the target display object.
  • the fourth scoring rule associated with the second ratio is not limited in this application.
  • the corresponding display object category is The collection of display object categories is The set of categories after deduplication of L n is C n ⁇ Mn . for the category Required in all categories Among the display objects, keep the display object with the largest area, and record the display object The area of is then for the category After screening, the display object with the largest area is
  • the set of display objects of the filtered n-th video frame is
  • the display object set composed of the display object sets corresponding to N video frames is:
  • the fourth scoring rule may be a ratio scoring comparison table, or may be scoring each sub-video frame sequence according to whether the ratio of each sub-video sequence satisfies a preset ratio threshold range.
  • the corresponding display object category set For ⁇ person, car, person, animal ⁇ , its target display object is person.
  • the execution subject calculates the total area of the target display object (the sum of the area of the two people in the video frame C1 and the area of the two people in the video frame C2) and the total area of the video frame containing the target display object (the video frame C1 and the video The ratio of the total area of the frame C2), and the fourth scoring rule is to judge whether the ratio satisfies the preset ratio threshold range, if so, the score is 1, if not, the score is 0. If the ratio calculated above is 0.5, and the preset ratio threshold range is greater than 0.3 and less than or equal to 0.6, the score of the sub video frame sequence M is 1.
  • each sub-video frame sequence is scored according to the third scoring rule and/or the fourth scoring rule, and then the quality of the video frame sequence to be evaluated is evaluated according to the score of each sub-video frame sequence, fully considering each sub-video frame sequence.
  • the influence of the area and/or the frequency of occurrence of the target object in the frame sequence on the video quality further improves the accuracy and validity of the assessed video quality.
  • scoring each sub-video frame sequence according to a preset scoring rule includes: scoring each sub-video frame sequence according to a third scoring rule to obtain a third score, and according to a fourth scoring rule Each sub-video frame sequence is scored to obtain a fourth score, and each sub-video frame sequence is scored according to the third score and the fourth score.
  • the preset scoring rules include a third scoring rule and a fourth scoring rule
  • the execution subject may first follow the frequency associated with the frequency of the target display object of each sub-video frame sequence appearing in the corresponding sub-video frame sequence.
  • the third scoring rule scores each sub-video frame to obtain a third score; then, according to the fourth scoring rule associated with the area of the target display object of each sub-video frame sequence in the corresponding sub-video frame sequence
  • the frame sequence is scored to obtain the fourth score, and finally the score of each sub-video frame sequence is obtained according to the third score and the fourth score.
  • the executive body may obtain the score of each sub-video frame sequence according to the third score and the fourth score and their respective weight coefficients.
  • the weight coefficients corresponding to the third score and the fourth score respectively may be set according to experience, actual needs and specific application scenarios, which are not limited in this application.
  • each sub-video frame sequence is scored independently according to the third scoring rule and the fourth scoring rule, which is helpful for comprehensively considering the frequency of the target display objects of each sub-video frame sequence and the target display of each sub-video frame sequence.
  • the accuracy of the obtained scores of each sub-video frame sequence is improved, thereby further improving the validity and rationality of evaluating the video quality.
  • each sub-video frame sequence is scored, including: according to the first scoring rule According to a scoring rule, each sub-video frame sequence is scored to obtain the first score; according to the third scoring rule, each sub-video frame sequence is scored to obtain a third score; according to the fourth scoring rule, each sub-video frame sequence is scored according to the fourth scoring rule. Scoring is performed to obtain a fourth score; according to the first score, the third score and the fourth score, each sub-video frame sequence is scored.
  • each sub-video frame sequence is scored to obtain the first score of each sub-video frame sequence.
  • the first score is Score 1
  • the third score is The fourth rating is Then the final score Score of the video frame sequence can be expressed as
  • the average duration of the sub-video frame sequence M is 2s
  • the corresponding display object category set is ⁇ person, car, person, animal ⁇
  • the target display object is human.
  • the execution subject calculates the total area of the target display object (the sum of the area of the two people in the video frame C1 and the area of the two people in the video frame C2) and the total area of the video frame containing the target display object (the video frame C1 and the video The ratio of the total area of the frame C2), and the frequency 0.5 that the target display object appears in the display object category set, and the first scoring rule is to judge whether the average duration of the sub-video frame sequence satisfies the preset duration threshold range, if If it is satisfied, the score is 1; if not, the score is 0.
  • the third scoring rule is to judge whether the frequency meets the preset frequency threshold range. If it is satisfied, the score is 1; if not, the score is 0.
  • the four scoring rules are to judge whether the ratio satisfies the preset ratio threshold range. If so, the score is 1; if not, the score is 0. If the ratio calculated above is 0.5, the preset ratio threshold range is greater than 0.3 and less than or equal to 0.6, the preset duration threshold range is greater than 1s and less than or equal to 2s, and the preset frequency threshold range is greater than 0.4 and less than or equal to 0.6, then the sub- The score of the video frame sequence M can be expressed as the sum 3 of the first score, the third score and the fourth score.
  • each sub-video frame sequence is scored independently according to the first scoring, the third scoring rule and the fourth scoring rule, which is helpful for comprehensively considering the average duration of each sub-video frame sequence and the target of each sub-video frame sequence.
  • the accuracy of the obtained score of each sub-video frame sequence is improved, thereby further improving the effectiveness and efficiency of evaluating the video quality. rationality.
  • Step 405 Obtain a score of each sub-video frame sequence according to the first score of each sub-video frame sequence and the second score of each sub-video frame sequence.
  • the execution body can directly obtain the score of each sub-video frame sequence according to the first score and the second score, or obtain each sub-video frame sequence according to the first score and the second score and their respective weight coefficients score, which is not limited in this application.
  • Step 406 according to the score of each sub-video frame sequence, perform quality assessment on the video frame sequence to be evaluated.
  • step 406 for the implementation details and technical effects of step 406, reference may be made to the description of step 204, and details are not repeated here.
  • the process 400 of the video quality evaluation method in this embodiment embodies that each sub-video frame sequence is evaluated according to the first scoring rule and the second scoring rule.
  • the first score and the second score are obtained, and each sub-video frame sequence is scored according to the first score and the second score, and then the quality of the video frame sequence to be evaluated is evaluated according to the score of each sub-video frame sequence.
  • the solution described in this embodiment grades each sub-video frame sequence independently according to the first scoring rule and the second scoring rule, which helps to comprehensively consider the average duration of each sub-video frame sequence and the target display of each sub-video frame sequence Under the condition of the influence of the object on the video index, the accuracy of the obtained scores of each sub-video frame sequence is improved, and the validity and rationality of the video quality evaluation are improved.
  • the present application provides an embodiment of a device for evaluating video quality.
  • the embodiment of the device corresponds to the embodiment of the method shown in FIG. 1 .
  • the video quality evaluation apparatus 500 in this embodiment includes: an acquisition module 501 , a determination module 502 , a scoring module 503 and an evaluation module 504 .
  • the obtaining module 501 may be configured to obtain a sequence of video frames to be evaluated.
  • the determining module 502 may be configured to determine at least one sub-video frame sequence from the video frame sequence to be evaluated.
  • the scoring module 503 may be configured to score each sub-video frame sequence according to a preset scoring rule.
  • the evaluation module 504 may be configured to perform quality evaluation on the video frame sequence to be evaluated according to the score of each sub-video frame sequence.
  • the preset scoring rules include at least one of the following: a first scoring rule associated with the average duration of each sub-video frame sequence; a target display object associated with each sub-video frame sequence The associated second scoring rule.
  • the preset scoring rule includes a first scoring rule and a second scoring rule
  • the scoring module is further configured to: score each sub-video frame sequence according to the first scoring rule, and obtain The first score of each sub-video frame sequence; the sub-video frame sequence is scored according to the second scoring rule to obtain the second score of each sub-video frame sequence; according to the first score and the second score, each sub-video frame sequence is scored. to rate.
  • the second scoring rule includes at least one of the following: a third scoring rule associated with the frequency with which the target display object of each sub-video frame sequence appears in the corresponding sub-video frame sequence; A fourth scoring rule associated with the area of each sub-video frame sequence's target presentation object in the corresponding sub-video frame sequence.
  • the preset scoring rules include a third scoring rule and a fourth scoring rule
  • the scoring module is further configured to: score each sub-video frame sequence according to the third scoring rule, and obtain The third score of each sub-video frame sequence; according to the fourth scoring rule, score each sub-video frame sequence to obtain the fourth score of each sub-video frame sequence; according to the third score and the fourth score, to each sub-video frame sequence to rate.
  • the present application further provides an electronic device and a readable storage medium.
  • FIG. 6 it is a block diagram of an electronic device according to the video quality assessment method according to an embodiment of the present application.
  • Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframe computers, and other suitable computers.
  • Electronic devices may also represent various forms of mobile devices, such as personal digital processors, cellular phones, smart phones, wearable devices, and other similar computing devices.
  • the components shown herein, their connections and relationships, and their functions are by way of example only, and are not intended to limit implementations of the application described and/or claimed herein.
  • the electronic device includes: one or more processors 601, a memory 602, and interfaces for connecting various components, including a high-speed interface and a low-speed interface.
  • the various components are interconnected using different buses and may be mounted on a common motherboard or otherwise as desired.
  • the processor may process instructions executed within the electronic device, including instructions stored in or on memory to display graphical information of the GUI on an external input/output device, such as a display device coupled to the interface.
  • multiple processors and/or multiple buses may be used with multiple memories and multiple memories, if desired.
  • multiple electronic devices may be connected, each providing some of the necessary operations (eg, as a server array, a group of blade servers, or a multiprocessor system).
  • a processor 601 is taken as an example in FIG. 6 .
  • the memory 602 is the non-transitory computer-readable storage medium provided by the present application.
  • the memory stores instructions executable by at least one processor, so that the at least one processor executes the video quality assessment method provided by the present application.
  • the non-transitory computer-readable storage medium of the present application stores computer instructions, and the computer instructions are used to cause the computer to execute the video quality assessment method provided by the present application.
  • the memory 602 can be used to store non-transitory software programs, non-transitory computer-executable programs and modules, such as program instructions/modules corresponding to the video quality assessment method in the embodiments of the present application (for example, The acquisition module 501, the determination module 502, the scoring module 503 and the evaluation module 504 shown in FIG. 5).
  • the processor 601 executes various functional applications and data processing of the server by running the non-transitory software programs, instructions and modules stored in the memory 602, ie, implements the video quality assessment method in the above method embodiments.
  • the memory 602 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function; the storage data area may store data created by the use of electronic equipment for video quality assessment, and the like. Additionally, memory 602 may include high-speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid state storage device. In some embodiments, memory 602 may optionally include memory located remotely relative to processor 601, and these remote memories may be connected via a network to the electronic device for video quality assessment. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • the electronic device of the video quality assessment method may further include: an input device 603 and an output device 604 .
  • the processor 601 , the memory 602 , the input device 603 and the output device 604 may be connected by a bus or in other ways, and the connection by a bus is taken as an example in FIG. 6 .
  • the input device 603 may receive input numerical or character information, such as a touch screen, a keypad, a mouse, a trackpad, a touchpad, a pointing stick, one or more mouse buttons, a trackball, a joystick, and other input devices.
  • Output devices 604 may include display devices, auxiliary lighting devices (eg, LEDs), haptic feedback devices (eg, vibration motors), and the like.
  • the display device may include, but is not limited to, a liquid crystal display (LCD), a light emitting diode (LED) display, and a plasma display. In some implementations, the display device may be a touch screen.
  • Various implementations of the systems and techniques described herein can be implemented in digital electronic circuitry, integrated circuit systems, application specific ASICs (application specific integrated circuits), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include being implemented in one or more computer programs executable and/or interpretable on a programmable system including at least one programmable processor that The processor, which may be a special purpose or general-purpose programmable processor, may receive data and instructions from a storage system, at least one input device, and at least one output device, and transmit data and instructions to the storage system, the at least one input device, and the at least one output device an output device.
  • the processor which may be a special purpose or general-purpose programmable processor, may receive data and instructions from a storage system, at least one input device, and at least one output device, and transmit data and instructions to the storage system, the at least one input device, and the at least one output device an output device.
  • machine-readable medium and “computer-readable medium” refer to any computer program product, apparatus, and/or apparatus for providing machine instructions and/or data to a programmable processor ( For example, magnetic disks, optical disks, memories, programmable logic devices (PLDs), including machine-readable media that receive machine instructions as machine-readable signals.
  • machine-readable signal refers to any signal used to provide machine instructions and/or data to a programmable processor.
  • the systems and techniques described herein may be implemented on a computer having a display device (eg, a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to the user ); and a keyboard and pointing device (eg, a mouse or trackball) through which a user can provide input to the computer.
  • a display device eg, a CRT (cathode ray tube) or LCD (liquid crystal display) monitor
  • a keyboard and pointing device eg, a mouse or trackball
  • Other kinds of devices can also be used to provide interaction with the user; for example, the feedback provided to the user can be any form of sensory feedback (eg, visual feedback, auditory feedback, or tactile feedback); and can be in any form (including acoustic input, voice input, or tactile input) to receive input from the user.
  • the systems and techniques described herein may be implemented on a computing system that includes back-end components (eg, as a data server), or a computing system that includes middleware components (eg, an application server), or a computing system that includes front-end components (eg, a user computer having a graphical user interface or web browser through which a user may interact with implementations of the systems and techniques described herein), or including such backend components, middleware components, Or any combination of front-end components in a computing system.
  • the components of the system may be interconnected by any form or medium of digital data communication (eg, a communication network). Examples of communication networks include: Local Area Networks (LANs), Wide Area Networks (WANs), and the Internet.
  • a computer system can include clients and servers.
  • Clients and servers are generally remote from each other and usually interact through a communication network.
  • the relationship of client and server arises by computer programs running on the respective computers and having a client-server relationship to each other.

Abstract

The present application relates to the technical field of video processing, and discloses a video quality assessment method and device. In one specific embodiment, the method comprises: obtaining a video frame sequence to be assessed; determining at least one sub video frame sequence from the video frame sequence to be assessed; scoring sub video frame sequences according to a preset scoring rule; and performing, according to scores of the sub video frame sequences, quality assessment on the video frame sequence to be assessed. The embodiment facilitates performing reasonable and effective assessment on video quality.

Description

视频质量评估方法和装置Video quality assessment method and device
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本专利申请要求于2021年01月21日提交的、申请号为202110083566.5、发明名称为“视频质量评估方法和装置”的中国专利申请的优先权,该申请的全文以引用的方式并入本申请中。This patent application claims the priority of the Chinese patent application with the application number of 202110083566.5 and the invention titled "video quality assessment method and device", filed on January 21, 2021, the full text of which is incorporated into this application by reference middle.
技术领域technical field
本申请涉及计算机技术领域,具体涉及视频处理技术领域,尤其涉及一种视频质量评估方法和装置。The present application relates to the field of computer technologies, in particular to the field of video processing technologies, and in particular, to a video quality assessment method and apparatus.
背景技术Background technique
随着信息时代的到来,视频用户增长迅速,各个平台上的视频出现了爆发式的增长,为了保证用户对视频的观感体验,我们需要对大量的视频进行质量评估。With the advent of the information age, video users have grown rapidly, and videos on various platforms have experienced explosive growth. In order to ensure users' viewing experience of videos, we need to evaluate the quality of a large number of videos.
目前对视频进行质量评估的方式主要包括两种:一、通过用户观看视频的点击率、点赞率、观看时长等人为方式对视频质量进行评估;二、通过视频的每帧图像的画质对视频质量进行评估。At present, there are two main ways to assess the quality of videos: first, the video quality is assessed by artificial methods such as the click rate, like rate, and viewing time of the video; second, the quality of each frame of the video is compared Video quality is evaluated.
发明内容SUMMARY OF THE INVENTION
本申请实施例提供了一种视频质量评估方法、装置、设备以及存储介质。Embodiments of the present application provide a video quality assessment method, apparatus, device, and storage medium.
根据第一方面,本申请实施例提供了一种视频质量评估方法,该方法包括:获取待评估视频帧序列;从待评估视频帧序列中,确定出至少一个子视频帧序列,子视频帧序列为一组视频帧的颜色均值梯度满足预设条件的连续视频帧;按照预设的评分规则,对各子视频帧序列进行评分,预设的评分规则与各子视频帧序列的属性信息相关联;根据各子视频帧序列的评分,对待评估视频帧序列进行质量评估。According to a first aspect, an embodiment of the present application provides a video quality assessment method, the method includes: acquiring a video frame sequence to be evaluated; and determining at least one sub-video frame sequence from the to-be-evaluated video frame sequence; It is a continuous video frame whose color mean gradient of a group of video frames satisfies the preset condition; according to the preset scoring rule, each sub-video frame sequence is scored, and the preset scoring rule is associated with the attribute information of each sub-video frame sequence ; According to the score of each sub-video frame sequence, the quality of the video frame sequence to be evaluated is evaluated.
在一些实施例中,预设的评分规则包括以下至少一项:与各子视频帧序列的平均时长相关联的第一评分规则;与各子视频帧序列的目标展示对象相关联的第二评分规则。In some embodiments, the preset scoring rules include at least one of the following: a first scoring rule associated with the average duration of each sub-video frame sequence; a second scoring associated with the target display object of each sub-video frame sequence rule.
在一些实施例中,预设的评分规则包括:第一评分规则和第二评分规则,以及按照预设的评分规则,对各子视频帧序列进行评分,包括:按照第一评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第一评分;按照第二评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第二评分;根据第一评分和所述第二评分,对各子视频帧序列进行评分。In some embodiments, the preset scoring rule includes: a first scoring rule and a second scoring rule, and scoring each sub-video frame sequence according to the preset scoring rule includes: scoring each sub-video frame sequence according to the first scoring rule The video frame sequence is scored to obtain the first score of each sub-video frame sequence; the sub-video frame sequence is scored according to the second scoring rule to obtain the second score of each sub-video frame sequence; The second score is to score each sub-video frame sequence.
在一些实施例中,第二评分规则包括以下至少一项:与各子视频帧序列的目标展示对象在相应子视频帧序列中出现的频率相关联的第三评分规则;与各子视频帧序列的目标展示对象在相应子视频帧序列中的面积相关联的第四评分规则。In some embodiments, the second scoring rule includes at least one of the following: a third scoring rule associated with the frequency with which the target presentation object of each sub-video frame sequence appears in the corresponding sub-video frame sequence; a third scoring rule associated with each sub-video frame sequence The target shows the fourth scoring rule associated with the area of the object in the corresponding sub-video frame sequence.
在一些实施例中,预设的评分规则包括第三评分规则和第四评分规则,以及按照预设的评分规则,对各子视频帧序列进行评分,包括:按照第三评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第三评分;按照第四评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第四评分;根据第三评分和所述第四评分,对各子视频帧序列进行评分。In some embodiments, the preset scoring rule includes a third scoring rule and a fourth scoring rule, and scoring each sub-video frame sequence according to the preset scoring rule includes: scoring each sub-video frame sequence according to the third scoring rule The frame sequence is scored to obtain the third score of each sub-video frame sequence; the sub-video frame sequence is scored according to the fourth scoring rule to obtain the fourth score of each sub-video frame sequence; according to the third score and the fourth score Score, score each sub-video frame sequence.
根据第二方面,本申请实施例提供了一种视频质量评估装置,该装置包括:获取模块,被配置成获取待评估视频帧序列;确定模块,被配置成从待评估视频帧序列中,确定出至少一个子视频帧序列,所述子视频帧序列为一组视频帧的颜色值梯度均满足预设条件的连续视频帧;评分模块,被配置成按照预设的评分规则,对各子视频帧序列进行评分,所述预设的评分规则与各子视频帧序列的属性信息相关联;评估模块,被配置成根据各子视频帧序列的评分,对待评估视频帧序列进行质量评估。According to a second aspect, an embodiment of the present application provides an apparatus for evaluating video quality, the apparatus including: an obtaining module configured to obtain a sequence of video frames to be evaluated; a determining module configured to determine from the sequence of video frames to be evaluated At least one sub-video frame sequence is obtained, and the sub-video frame sequence is a continuous video frame whose color value gradients of a group of video frames meet preset conditions; the scoring module is configured to be configured according to preset scoring rules. The frame sequence is scored, and the preset scoring rule is associated with attribute information of each sub-video frame sequence; the evaluation module is configured to perform quality evaluation on the video frame sequence to be evaluated according to the score of each sub-video frame sequence.
在一些实施例中,预设的评分规则包括以下至少一项:与各子视频帧序列的平均时长相关联的第一评分规则;与各子视频帧序列的目标展示对象相关联的第二评分规则。In some embodiments, the preset scoring rules include at least one of the following: a first scoring rule associated with the average duration of each sub-video frame sequence; a second scoring associated with the target display object of each sub-video frame sequence rule.
在一些实施例中,预设的评分规则包括第一评分规则和第二评分规则,评分模块进一步被配置成:按照第一评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第一评分;按照第二评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第二评分;根据第一评分和第二评分,对各子视频帧序列进行评分。In some embodiments, the preset scoring rule includes a first scoring rule and a second scoring rule, and the scoring module is further configured to: score each sub-video frame sequence according to the first scoring rule, and obtain the score of each sub-video frame sequence. First scoring; scoring each sub video frame sequence according to the second scoring rule to obtain a second score of each sub video frame sequence; scoring each sub video frame sequence according to the first score and the second score.
在一些实施例中,第二评分规则包括以下至少一项:与各子视频帧序列的目标展示对象在相应子视频帧序列中出现的频率相关联的第三评分规则;与各子视频帧序列的目标展示对象在相应子视频帧序列中的面积相关联的第四评分规则。In some embodiments, the second scoring rule includes at least one of the following: a third scoring rule associated with the frequency with which the target presentation object of each sub-video frame sequence appears in the corresponding sub-video frame sequence; a third scoring rule associated with each sub-video frame sequence The target shows the fourth scoring rule associated with the area of the object in the corresponding sub-video frame sequence.
在一些实施例中,预设的评分规则包括第三评分规则和第四评分规则,评分模块进一步被配置成:按照第三评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第三评分;按照第四评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第四评分;根据第三评分和第四评分,对各子视频帧序列进行评分。In some embodiments, the preset scoring rules include a third scoring rule and a fourth scoring rule, and the scoring module is further configured to: score each sub-video frame sequence according to the third scoring rule, and obtain the score of each sub-video frame sequence. the third score; score each sub-video frame sequence according to the fourth scoring rule to obtain a fourth score of each sub-video frame sequence; score each sub-video frame sequence according to the third score and the fourth score.
根据第三方面,本申请实施例提供了一种电子设备,该电子设备包括至少一个处理器;以及与至少一个处理器通信连接的存储器;其中,存储器存储有可被至少一个处理器执行的指令,该指令被至少一个处理器执行,以使至少一个处理器执行时能够实现如第一方面的任一实施例的视频质量评估方法。According to a third aspect, an embodiment of the present application provides an electronic device, the electronic device includes at least one processor; and a memory communicatively connected to the at least one processor; wherein the memory stores instructions executable by the at least one processor , the instruction is executed by at least one processor, so that when the at least one processor is executed, the video quality assessment method according to any embodiment of the first aspect can be implemented.
根据第四方面,本申请实施例提供了一种存储有计算机指令的非瞬时计算机可读存储介质,该计算机指令用于使计算机执行时能够实现如第一方面的任一实施例的视频质量评估方法。According to a fourth aspect, an embodiment of the present application provides a non-transitory computer-readable storage medium storing computer instructions, where the computer instructions are used to enable a computer to implement the video quality assessment according to any embodiment of the first aspect. method.
根据第五方面,本申请实施例提供了一种包括计算机程序的计算机程序产品,该计算机程序在被处理器执行时能够实现如第一方面的任一实施例的视频质量评估方法。According to a fifth aspect, an embodiment of the present application provides a computer program product including a computer program, the computer program can implement the video quality assessment method according to any embodiment of the first aspect when the computer program is executed by a processor.
本申请通过获取待评估视频帧序列;从待评估视频帧序列中,确定出至少一个子视频帧序列;按照预设的评分规则,对各子视频帧序列进行评分;根据各子视频帧序列的评分,对待评估视频帧序列进行质量评估,即先将待评估视频帧序列划分为不同的画面,然后根据各 画面的属性信息对各画面进行评分,进而对视频质量进行评估,有效提升了对视频质量进行评估的合理性和有效性。The present application obtains the video frame sequence to be evaluated; determines at least one sub-video frame sequence from the video frame sequence to be evaluated; scores each sub-video frame sequence according to a preset scoring rule; Scoring, to evaluate the quality of the video frame sequence to be evaluated, that is, first divide the video frame sequence to be evaluated into different pictures, and then score each picture according to the attribute information of each picture, and then evaluate the video quality, which effectively improves the quality of the video. Reasonableness and validity of quality assessment.
应当理解,本部分所描述的内容并非旨在标识本公开的实施例的关键或重要特征,也不用于限制本公开的范围。本公开的其他特征将通过以下的说明书而变得容易理解。It should be understood that what is described in this section is not intended to identify key or critical features of embodiments of the disclosure, nor is it intended to limit the scope of the disclosure. Other features of the present disclosure will become readily understood from the following description.
附图说明Description of drawings
图1是本申请可以应用于其中的示例性系统架构图;FIG. 1 is an exemplary system architecture diagram to which the present application can be applied;
图2是根据本申请的视频质量评估方法的一个实施例的流程图;2 is a flowchart of an embodiment of a video quality assessment method according to the present application;
图3是根据本申请的视频质量评估方法的一个应用场景的示意图;3 is a schematic diagram of an application scenario of the video quality assessment method according to the present application;
图4是根据本申请的视频质量评估方法的另一个实施例的流程图;4 is a flowchart of another embodiment of a video quality assessment method according to the present application;
图5是根据本申请的视频质量评估装置的一个实施例的示意图;5 is a schematic diagram of an embodiment of a video quality assessment apparatus according to the present application;
图6是适于用来实现本申请实施例的服务器的计算机系统的结构示意图。FIG. 6 is a schematic structural diagram of a computer system suitable for implementing the server of the embodiment of the present application.
具体实施方式Detailed ways
以下结合附图对本申请的示范性实施例做出说明,其中包括本申请实施例的各种细节以助于理解,应当将它们认为仅仅是示范性的。因此,本领域普通技术人员应当认识到,可以对这里描述的实施例做出各种改变和修改,而不会背离本申请的范围和精神。同样,为了清楚和简明,以下的描述中省略了对公知功能和结构的描述。Exemplary embodiments of the present application are described below with reference to the accompanying drawings, which include various details of the embodiments of the present application to facilitate understanding, and should be considered as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present application. Also, descriptions of well-known functions and constructions are omitted from the following description for clarity and conciseness.
需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互组合。下面将参考附图并结合实施例来详细说明本申请。It should be noted that the embodiments in the present application and the features of the embodiments may be combined with each other in the case of no conflict. The present application will be described in detail below with reference to the accompanying drawings and in conjunction with the embodiments.
图1示出了可以应用本申请的视频质量评估方法的实施例的示例性系统架构100。FIG. 1 illustrates an exemplary system architecture 100 to which embodiments of the video quality assessment method of the present application may be applied.
如图1所示,系统架构100可以包括终端设备101、102、103,网络104和服务器105。网络104用以在终端设备101、102、103和服务器105之间提供通信链路的介质。网络104可以包括各种连接类型,例如有线、无线通信链路或者光纤电缆等等。As shown in FIG. 1 , the system architecture 100 may include terminal devices 101 , 102 , and 103 , a network 104 and a server 105 . The network 104 is a medium used to provide a communication link between the terminal devices 101 , 102 , 103 and the server 105 . The network 104 may include various connection types, such as wired, wireless communication links, or fiber optic cables, among others.
终端设备101、102、103通过网络104与服务器105交互,以接收或发送消息等。终端设备101、102、103上可以安装有各种通讯客户端应用,例如,视频播放类应用、通讯类应用等。The terminal devices 101, 102, and 103 interact with the server 105 through the network 104 to receive or send messages and the like. Various communication client applications may be installed on the terminal devices 101 , 102 and 103 , for example, video playback applications, communication applications, and the like.
终端设备101、102、103可以是硬件,也可以是软件。当终端设备101、102、103为硬件时,可以是具有显示屏的各种电子设备,包括但不限于手机和笔记本电脑。当终端设备101、102、103为软件时,可以安装在上述所列举的电子设备中。其可以实现成多个软件或软件模块(例如用来提供视频质量评估服务),也可以实现成单个软件或软件模块。在此不做具体限定。The terminal devices 101, 102, and 103 may be hardware or software. When the terminal devices 101, 102, and 103 are hardware, they can be various electronic devices with display screens, including but not limited to mobile phones and notebook computers. When the terminal devices 101, 102, and 103 are software, they can be installed in the electronic devices listed above. It can be implemented as a plurality of software or software modules (for example, to provide video quality assessment services), or can be implemented as a single software or software module. There is no specific limitation here.
服务器105可以是提供各种服务的服务器,例如,获取待评估视频帧序列;从待评估视频帧序列中,确定出至少一个子视频帧序列;按照预设的评分规则,对各子视频帧序列进行评分;根据各子视频帧序列的评分,对待评估视频帧序列进行质量评估。The server 105 may be a server that provides various services, for example, acquiring the video frame sequence to be evaluated; determining at least one sub-video frame sequence from the video frame sequence to be evaluated; Score; according to the score of each sub-video frame sequence, perform quality assessment on the video frame sequence to be evaluated.
需要说明的是,服务器105可以是硬件,也可以是软件。当服务器105为硬件时,可以实现成多个服务器组成的分布式服务器集群,也可以实现成单个服务器。当服务器为软件时,可以实现成多个软件或软件模块(例如用来提供视频质量评估服务),也可以实现成单个软件或软件模块。在此不做具体限定。It should be noted that the server 105 may be hardware or software. When the server 105 is hardware, it can be implemented as a distributed server cluster composed of multiple servers, or can be implemented as a single server. When the server is software, it can be implemented as a plurality of software or software modules (for example, used to provide a video quality assessment service), or can be implemented as a single software or software module. There is no specific limitation here.
需要指出的是,本公开的实施例所提供的视频质量评估方法可以由服务器105执行,也可以由终端设备101、102、103执行,还可以由服务器105和终端设备101、102、103彼此配合执行。相应地,视频质量评估装置包括的各个部分(例如各个单元、子单元、模块、子模块)可以全部设置于服务器105中,也可以全部设置于终端设备101、102、103中,还可以分别设置于服务器105和终端设备101、102、103中。It should be pointed out that the video quality assessment method provided by the embodiments of the present disclosure may be executed by the server 105, or by the terminal devices 101, 102, 103, or by the server 105 and the terminal devices 101, 102, 103 cooperate with each other implement. Correspondingly, each part (for example, each unit, sub-unit, module, sub-module) included in the video quality evaluation apparatus may be all set in the server 105, or all may be set in the terminal devices 101, 102, 103, or may be set separately in the server 105 and the terminal devices 101, 102, and 103.
应该理解,图1中的终端设备、网络和服务器的数目仅仅是示意性的。根据实现需要,可以具有任意数目的终端设备、网络和服务器。It should be understood that the numbers of terminal devices, networks and servers in FIG. 1 are merely illustrative. There can be any number of terminal devices, networks and servers according to implementation needs.
图2示出了可以应用于本申请的视频质量评估方法的实施例的流程示意图200。在本实施例中,视频质量评估方法包括以下步骤:FIG. 2 shows a schematic flowchart 200 of an embodiment of a video quality assessment method that can be applied to the present application. In this embodiment, the video quality assessment method includes the following steps:
步骤201,获取待评估视频帧序列。 Step 201, acquiring a sequence of video frames to be evaluated.
在本实施例中,执行主体(如图1中所示的服务器105或终端设备101、102、103)可以从本地获取待评估的视频帧序列,也可以通过有线或无线的方式从远程视频数据库服务器获取待评估的视频帧序列,本申请对此不作限定。In this embodiment, the execution subject (the server 105 or the terminal devices 101, 102, 103 as shown in FIG. 1 ) can obtain the video frame sequence to be evaluated locally, or can obtain the video frame sequence to be evaluated from a remote video database in a wired or wireless manner. The server obtains the video frame sequence to be evaluated, which is not limited in this application.
步骤202,从所述待评估视频帧序列中,确定出至少一个子视频帧序列。Step 202: Determine at least one sub-video frame sequence from the to-be-evaluated video frame sequence.
在本实施例中,执行主体在获取到待评估视频帧序列后,可以从待评估视频帧序列中确定出至少一个子视频帧序列,子视频帧序列为一组视频帧的颜色均值梯度满足预设条件的连续视频帧。In this embodiment, after acquiring the video frame sequence to be evaluated, the executing subject may determine at least one sub-video frame sequence from the to-be-evaluated video frame sequence, and the sub-video frame sequence is a group of video frames whose color mean gradient satisfies a predetermined Conditioned consecutive video frames.
其中,预设条件可以是视频帧的颜色均值梯度是否大于等于预设的颜色均值梯度阈值,是否满足预设的颜色均值梯度阈值范围等。这里,颜色均值梯度阈值、颜色均值梯度阈值范围可根据经验、实际需求和具体地应用场景确定,本申请对此不作限定。The preset condition may be whether the color mean gradient of the video frame is greater than or equal to a preset color mean gradient threshold, whether it satisfies the preset color mean gradient threshold range, and the like. Here, the color mean gradient threshold and the color mean gradient threshold range can be determined according to experience, actual needs and specific application scenarios, which are not limited in this application.
这里,颜色均值梯度通过如下形式表示,若当前待评估视频帧序列可以拆分为K个视频帧,其中,第k(k=1,2,…,K)个视频帧使用I k表示,视频帧中第i行、第j列的像素值为I k(i,j),视频帧的大小为(W,H),该视频帧的RGB(Red Green Blue,红色、绿色、蓝色)通道分别表示为R k、G k、B k。计算每个视频帧R、G、B通道的像素均值,则第k帧的三个通道的均值分别为: Here, the color mean gradient is expressed in the following form. If the current video frame sequence to be evaluated can be divided into K video frames, where the kth (k=1, 2, . . . , K) video frame is represented by I k , the video frame The pixel value of the i-th row and the j-th column in the frame is I k (i, j), the size of the video frame is (W, H), and the RGB (Red Green Blue, red, green, blue) channels of the video frame are are denoted as R k , G k , and B k , respectively. Calculate the pixel mean of the R, G, and B channels of each video frame, then the mean of the three channels of the kth frame are:
Figure PCTCN2022070276-appb-000001
Figure PCTCN2022070276-appb-000001
Figure PCTCN2022070276-appb-000002
Figure PCTCN2022070276-appb-000002
Figure PCTCN2022070276-appb-000003
Figure PCTCN2022070276-appb-000003
第k帧的颜色均值梯度由D k表示 The color mean gradient of the kth frame is denoted by Dk
Figure PCTCN2022070276-appb-000004
Figure PCTCN2022070276-appb-000004
具体地,颜色均值梯度阈值由th(例如,th=100)表示,若第k帧的颜色均值梯度大于等于预设颜色均值梯度阈值,表示为T k=1;若第k帧的颜色均值梯度小于预设颜色均值梯度阈值,表示为T k=0,具体如下所示 Specifically, the color mean gradient threshold is represented by th (for example, th=100). If the color mean gradient of the kth frame is greater than or equal to the preset color mean gradient threshold, it is expressed as Tk = 1; if the color mean gradient of the kth frame is equal to or greater than less than the preset color mean gradient threshold, expressed as T k =0, as shown below
Figure PCTCN2022070276-appb-000005
Figure PCTCN2022070276-appb-000005
其中,至少一个子视频帧序列中的各子视频帧序列中的视频帧可以由T k=0的视频帧组成,且通常T k=0的视频帧为连续帧,用于表征一个画面,而对于T k=1的视频帧通常用于表示用于表征画面的各子视频帧序列之间的过渡帧。 Wherein, the video frames in each sub-video frame sequence in the at least one sub-video frame sequence may be composed of video frames with T k =0, and usually the video frames with T k =0 are continuous frames, which are used to represent a picture, and Video frames for Tk = 1 are generally used to represent transition frames between sequences of sub-video frames used to characterize a picture.
步骤203,按照预设的评分规则,对各子视频帧序列进行评分。Step 203: Score each sub-video frame sequence according to a preset scoring rule.
在本实施例中,执行主体在确定出各子视频帧序列后,可进一步按照预设的评分规则对各视频帧序列进行评分,这里,预设的评分规则与各子视频帧序列的属性信息相关联,也即预设的评分规则依据各子视频帧的属性信息设定。In this embodiment, after determining each sub-video frame sequence, the execution body may further score each video frame sequence according to a preset scoring rule. Here, the preset scoring rule and the attribute information of each sub-video frame sequence Correlation, that is, the preset scoring rule is set according to the attribute information of each sub-video frame.
这里,各子视频帧序列的属性信息可以包括各子视频帧序列对应的目标展示对象、各子视频帧序列的平均展示时长、各子视频帧序列包括的展示对象的数量等等。预设的评分规则可以根据上述各子视频帧序列的属性信息中的一项或多项进行设定,本申请对此不做限定。Here, the attribute information of each sub-video frame sequence may include the target presentation object corresponding to each sub-video frame sequence, the average presentation duration of each sub-video frame sequence, the number of presentation objects included in each sub-video frame sequence, and the like. The preset scoring rule may be set according to one or more of the attribute information of each sub-video frame sequence, which is not limited in this application.
这里,目标展示对象为各子视频帧序列主要想展示的目标物体,执行主体可以依据子视频帧序列中各展示对象的面积确定目标展示对象,也可以依据子视频帧序列中各展示对象出现的频率确定目标展示对象,本申请对此不作限定。Here, the target display object is the target object that each sub-video frame sequence mainly wants to display. The execution subject can determine the target display object according to the area of each display object in the sub-video frame sequence, or can also determine the target display object according to the area of each display object in the sub-video frame sequence. The frequency determines the target display object, which is not limited in this application.
可选地,执行主体可将各子视频帧序列对应的展示对象类别集合中出现频率最高的类别对应的展示对象确定为目标展示对象,其中,展示对象类别集合由相应子视频帧序列中每一视频帧的展示对象所属的类别组成。这里,若确定出的每一视频帧的展示对象所属的类别存 在相同的类别,则对相同类别进行删减,仅保留相同类别中的一个。Optionally, the execution body may determine the display object corresponding to the category with the highest occurrence frequency in the display object category set corresponding to each sub-video frame sequence as the target display object, wherein the display object category set is composed of each sub-video frame sequence. The category to which the display object of the video frame belongs. Here, if the determined category to which the display object of each video frame belongs has the same category, the same category is deleted, and only one of the same categories is retained.
具体地,一个子视频帧序列中包括N(N=1,2,…,n)个视频帧,假设第n帧视频帧中检测到M n个展示对象,则展示对象集合为
Figure PCTCN2022070276-appb-000006
Figure PCTCN2022070276-appb-000007
其中第i(i=1……M n)个展示对象
Figure PCTCN2022070276-appb-000008
对应的展示对象类别为
Figure PCTCN2022070276-appb-000009
展示对象类别集合为
Figure PCTCN2022070276-appb-000010
若M n个展示对象中存在相同类别,
Figure PCTCN2022070276-appb-000011
第n帧视频帧中共存在C n种不同类别的展示对象,则对L n去重后的类别集合为
Figure PCTCN2022070276-appb-000012
C n≤M n
Specifically, a sub-video frame sequence includes N ( N =1, 2, .
Figure PCTCN2022070276-appb-000006
Figure PCTCN2022070276-appb-000007
where the i-th (i=1...M n ) display object
Figure PCTCN2022070276-appb-000008
The corresponding display object category is
Figure PCTCN2022070276-appb-000009
The collection of display object categories is
Figure PCTCN2022070276-appb-000010
If the same category exists in M n display objects,
Figure PCTCN2022070276-appb-000011
There are C n different categories of display objects in the nth video frame, then the category set after deduplication of L n is:
Figure PCTCN2022070276-appb-000012
C n ≤ Mn .
由N个视频帧中去重后的类别集合构成的展示对象类别集合为L={l 1,l 2,…,l M},共包括M个对象的类别,经类别去重后,子视频帧序列对应的类别集合为L′={l 1′,l 2′,…,l C′},共包括C种类别,计算集合L′中各展示对象类别出现的频率,对于展示对象类别l i′,其出现的频率为p i The display object category set composed of the deduplicated category sets in the N video frames is L={l 1 ,l 2 ,...,l M }, which includes M object categories in total. After category deduplication, the sub-video The category set corresponding to the frame sequence is L′={l 1 ′,l 2 ′,...,l C ′}, including C categories in total, and the frequency of each display object category in the set L′ is calculated. For the display object category l i ′, whose frequency is p i
Figure PCTCN2022070276-appb-000013
Figure PCTCN2022070276-appb-000013
目标展示对象的类别为
Figure PCTCN2022070276-appb-000014
其索引为i *
The category of the target impression object is
Figure PCTCN2022070276-appb-000014
Its index is i *
Figure PCTCN2022070276-appb-000015
Figure PCTCN2022070276-appb-000015
即可认为子视频帧序列中类别为
Figure PCTCN2022070276-appb-000016
的展示对象为目标展示对象。
It can be considered that the category in the sub-video frame sequence is
Figure PCTCN2022070276-appb-000016
The display object of is the target display object.
执行主体可以采用现有技术或未来发展技术中的对视频帧中的展示对象的类别进行检测的算法,例如,SSD(Single Shot MultiBox Detector,one-stage多框检测算法),R-CNN(Region-based Convolution Neural Networks,基于区域的卷积神经网络算法)等,检测各子视频帧序列中每一视频帧所包含的展示对象的类别。The execution subject can use the existing technology or future development technology to detect the category of the display object in the video frame, for example, SSD (Single Shot MultiBox Detector, one-stage multi-box detection algorithm), R-CNN (Region -based Convolution Neural Networks, region-based convolutional neural network algorithm), etc., to detect the categories of display objects contained in each video frame in each sub-video frame sequence.
具体地,子视频帧序列M包括视频帧A(包含的展示对象为两个人和一只狗)和视频帧B(包含的展示对象为一个人和三辆车),经目标检测算法对视频帧A和视频帧B进行检测,得到视频帧A中所包括的展示对象所属的类别为人和动物,视频帧B中所包括的展示对象所属的类别为人和汽车,故子视频帧序列M对应的展示对象类别集合为{人、动物、人、汽车}。执行主体可将展示对象类别集合中的人对应的展示对象(视频帧A中的两个人和视频帧B中的一个人)确定为目标展示对象。Specifically, the sub-video frame sequence M includes video frame A (contained display objects are two people and a dog) and video frame B (contained display objects are one person and three vehicles). A and video frame B are detected, and it is obtained that the categories to which the display objects included in video frame A belong are people and animals, and the categories to which the display objects included in video frame B belong are people and cars, so the display corresponding to the sub-video frame sequence M The set of object categories is {person, animal, person, car}. The execution subject may determine the display objects (two persons in the video frame A and one person in the video frame B) corresponding to the persons in the presentation object category set as the target presentation objects.
在一些可选的方式中,预设的评分规则包括以下至少一项评分子 规则:与各子视频帧序列的平均时长相关联的第一评分规则;与各子视频帧序列的目标展示对象相关联的第二评分规则。In some optional manners, the preset scoring rules include at least one of the following scoring sub-rules: a first scoring rule associated with the average duration of each sub-video frame sequence; a target display object associated with each sub-video frame sequence The second scoring rule of the association.
在本实现方式中,执行主体按照第一评分规则、第二评分规则中的至少一项,对各子视频帧序列进行评分。In this implementation manner, the execution subject scores each sub-video frame sequence according to at least one of the first scoring rule and the second scoring rule.
其中,第一评分规则与各子视频帧序列的平均时长相关联,也即第一评分规则依据各子视频帧序列的平均时长设定。The first scoring rule is associated with the average duration of each sub-video frame sequence, that is, the first scoring rule is set according to the average duration of each sub-video frame sequence.
这里,执行主体按照第一评分规则对各子视频帧序列进行评分的方式可以是根据预设的时长评分对照表对各子视频帧序列进行评分,也可以根据各子视频帧序列的平均时长是否满足预设的时长阈值范围,对各子视频帧序列进行评分,本申请对此不作限定。Here, the manner in which the execution subject scores each sub-video frame sequence according to the first scoring rule may be to score each sub-video frame sequence according to a preset duration scoring comparison table, or may be based on whether the average duration of each sub-video frame sequence is Each sub-video frame sequence is scored if the preset duration threshold range is satisfied, which is not limited in this application.
其中,预设的时长阈值范围可根据经验、实际需求和具体的应用场景确定,本申请对此不作限定。The preset duration threshold range may be determined according to experience, actual needs and specific application scenarios, which is not limited in this application.
具体地,从待评估视频帧序列L中确定出Q个子视频帧序列,待评估视频帧序列L的帧率为f,第q(取值范围为1~Q)个子视频帧序列的帧数为F q,则各子视频帧序列的平均时长MT为 Specifically, Q sub-video frame sequences are determined from the to-be-evaluated video frame sequence L, the frame rate of the to-be-evaluated video frame sequence L is f, and the number of frames of the qth (value range of 1 to Q) sub-video frame sequence is F q , then the average duration MT of each sub-video frame sequence is
Figure PCTCN2022070276-appb-000017
Figure PCTCN2022070276-appb-000017
进一步地,执行主体可以判断MT是否满足预设的时长阈值范围,若满足,则评分为1,若不满足,则评分为0。Further, the execution subject can determine whether the MT meets the preset duration threshold range, and if so, the score is 1, and if not, the score is 0.
其中,第二评分规则与各子视频帧序列的目标展示对象相关联,也即第二评分规则依据各子视频帧序列的目标展示对象设定。The second scoring rule is associated with the target display object of each sub-video frame sequence, that is, the second scoring rule is set according to the target display object of each sub-video frame sequence.
这里,第二评分规则依据各子视频帧序列的目标展示对象进行设定的方式可以包括多种,例如,根据各子视频帧序列的目标展示对象的类别进行设定;根据各子视频帧序列的目标展示对象在相应子视频帧序列中出现的频率进行设定,根据各子视频帧序列的目标展示对象在相应子视频帧序列中占据的面积进行设定等等,本申请对此不作限定。Here, the second scoring rule may be set according to the target display object of each sub-video frame sequence in a variety of ways, for example, it may be set according to the type of the target display object of each sub-video frame sequence; The frequency of the target display object appearing in the corresponding sub video frame sequence is set, and the setting is based on the area occupied by the target display object of each sub video frame sequence in the corresponding sub video frame sequence, etc. This application does not limit this. .
相应地,执行主体按照第二评分规则对各子视频帧序列进行评分的方式可以包括多种,例如,根据预设的目标展示对象类别评分对照表对各子视频帧序列进行评分;根据各子视频帧序列的目标展示对象 在相应子视频序列帧中出现的频率是否满足预设的频率阈值范围对各子视频帧序列进行评分;根据各子视频帧序列的目标展示对象的面积与包含目标展示对象的视频帧的面积的比值是否满足预设的比值阈值范围对各子视频帧序列进行评分等等,本申请对此不作限定。Correspondingly, the execution subject may score each sub-video frame sequence according to the second scoring rule in various ways, for example, scoring each sub-video frame sequence according to a preset target display object category scoring comparison table; according to each sub-video frame sequence; Whether the frequency of the target display object of the video frame sequence in the corresponding sub-video sequence frame meets the preset frequency threshold range to score each sub-video frame sequence; according to the area of the target display object of each sub-video frame sequence and the target display Whether the ratio of the area of the video frame of the object satisfies the preset ratio threshold range is used to score each sub-video frame sequence, etc., which is not limited in this application.
具体地,待评估视频帧序列中包含子视频帧序列A和子视频帧序列B,其中,子视频帧序列A的目标展示对象为人;而子视频帧序列B的目标展示对象为物品,预设的评分规则为:当目标展示对象为人时,对应的子视频帧序列的评分为1分;当目标展示对象为物品时,对应的子视频帧序列的评分为0分,故待评估视频帧序列中子视频帧序列A的评分为1分,子视频帧序列B的评分为0分。Specifically, the video frame sequence to be evaluated includes a sub video frame sequence A and a sub video frame sequence B, wherein the target display object of the sub video frame sequence A is a person; and the target display object of the sub video frame sequence B is an item, and the preset The scoring rule is: when the target display object is a person, the corresponding sub-video frame sequence is scored as 1 point; when the target display object is an item, the corresponding sub-video frame sequence is scored as 0 points, so the video frame sequence to be evaluated is scored as 0 points. The score of sub-video frame sequence A is 1 point, and the score of sub-video frame sequence B is 0 point.
需要指出的是,若预设的评分规则包括第一评分规则和第二评分规则,则执行主体按照第一评分规则和第二评分规则对各子视频帧序列进行评分。It should be noted that, if the preset scoring rule includes the first scoring rule and the second scoring rule, the execution subject will score each sub-video frame sequence according to the first scoring rule and the second scoring rule.
具体地,第一评分规则与各子视频帧序列的平均时长相关联,第二评分规则与各子视频帧序列的目标展示对象的类别相关联,待评估视频帧序列中包含子视频帧序列A和子视频帧序列B,其中,子视频帧序列A的目标展示对象为人,平均时长为5s;而子视频帧序列B的目标展示对象为物品,平均时长为5s,预设的评分规则为:当目标展示对象为人且平均时长大于等于3s时,对应的子视频帧序列的评分为1分;否则,对应的子视频帧序列的评分为0分,故待评估视频帧序列中子视频帧序列A的评分为1分,子视频帧序列B的评分为0分。Specifically, the first scoring rule is associated with the average duration of each sub video frame sequence, the second scoring rule is associated with the category of the target display object of each sub video frame sequence, and the video frame sequence to be evaluated includes the sub video frame sequence A and sub-video frame sequence B, wherein, the target display object of sub-video frame sequence A is a person, and the average duration is 5s; while the target display object of sub-video frame sequence B is an item, the average duration is 5s, and the preset scoring rule is: when When the target display object is a person and the average duration is greater than or equal to 3s, the score of the corresponding sub-video frame sequence is 1 point; otherwise, the score of the corresponding sub-video frame sequence is 0 point, so the sub-video frame sequence A in the video frame sequence to be evaluated The score is 1 point, and the score of the sub-video frame sequence B is 0 point.
该实现通过按照第一评分规则和/或第二评分规则,对各子视频帧序列进行评分,进而根据各子视频帧序列的评分,对待评估视频帧序列进行质量评估,充分考虑了各子视频帧序列的属性信息中的平均时长和/或目标展示对象对视频质量的影响,有效提升了评估的视频质量的准确性和有效性。In this implementation, each sub-video frame sequence is scored according to the first scoring rule and/or the second scoring rule, and then the quality of the video frame sequence to be evaluated is evaluated according to the score of each sub-video frame sequence, fully considering each sub-video frame sequence. The average duration in the attribute information of the frame sequence and/or the influence of the target display object on the video quality effectively improves the accuracy and validity of the evaluated video quality.
步骤204,根据各子视频帧序列的评分,对待评估视频帧序列进行质量评估。 Step 204, according to the score of each sub-video frame sequence, perform quality assessment on the video frame sequence to be evaluated.
在本实施例中,执行主体在确定各子视频帧序列的评分后,可以根据各子视频帧序列的评分以及相应的权重系数,对待评估视频帧序 列进行评估。In this embodiment, after determining the score of each sub-video frame sequence, the execution body may evaluate the video frame sequence to be evaluated according to the score of each sub-video frame sequence and the corresponding weight coefficient.
其中,分别与第一评分和第二评分相对应的权重系数可以根据经验、实际需求和具体地应用场景确定,本申请对此不作限定。The weight coefficients respectively corresponding to the first score and the second score may be determined according to experience, actual needs and specific application scenarios, which are not limited in this application.
继续参见图3,图3是根据本实施例的视频质量方法的应用场景的一个示意图。Continue to refer to FIG. 3 , which is a schematic diagram of an application scenario of the video quality method according to this embodiment.
在图3的应用场景中,执行主体301获取到待评估视频帧序列302,从待评估待评估视频帧序列302中确定出3个子视频帧序列,子视频帧序列为一组视频帧的颜色均值梯度满足预设条件(例如,小于预设的颜色均值梯度阈值)连续视频帧,3个子视频帧序列分别为子视频帧序列A303(包含展示对象的数量为5)、子视频帧序列B304(包含展示对象的数量为10)和子视频帧序列C305(包含展示对象的数量为20);按照预设的评分规则,对各子视频帧序列进行评分,得到各子视频帧序列的评分,即子视频帧A的评分306、子视频帧序列B的评分307和子视频帧序列C的评分308的评分,其中,预设的评分规则与各子视频帧序列的属性信息相关联,属性信息包括展示对象的数量,预设的评分规则为数量评分对照表,如,数量20对应2分,数量10对应1分,数量5对应0分;进一步地,执行主体根据各子视频帧序列的评分,对待评估视频帧序列进行质量评估,如直接将子视频帧A的评分306、子视频帧序列B的评分307、子视频帧序列C的评分308的评分直接相加得到待评估视频帧序列的评分309。In the application scenario of FIG. 3 , the execution body 301 acquires the video frame sequence 302 to be evaluated, and determines three sub-video frame sequences from the to-be-evaluated video frame sequence 302, and the sub-video frame sequence is the color average value of a group of video frames The gradient satisfies a preset condition (for example, less than a preset color mean gradient threshold) for continuous video frames, and the three sub-video frame sequences are respectively sub-video frame sequence A303 (including the number of display objects), sub-video frame sequence B304 (including The number of display objects is 10) and the sub-video frame sequence C305 (including the number of display objects is 20); according to the preset scoring rules, each sub-video frame sequence is scored to obtain the score of each sub-video frame sequence, that is, the sub-video The score 306 of the frame A, the score 307 of the sub video frame sequence B, and the score 308 of the sub video frame sequence C, wherein the preset scoring rules are associated with the attribute information of each sub video frame sequence, and the attribute information includes the display object's Quantity, the preset scoring rule is a quantitative score comparison table, for example, the number of 20 corresponds to 2 points, the number of 10 corresponds to 1 point, and the number of 5 corresponds to 0 points; further, the execution subject according to the score of each sub-video frame sequence, the video to be evaluated The quality of the frame sequence is evaluated, for example, the score 309 of the video frame sequence to be evaluated is obtained by directly adding the score 306 of the sub video frame sequence B, the score 307 of the sub video frame sequence B, and the score 308 of the sub video frame sequence C.
本公开的视频质量评估方法,通过获取待评估视频帧序列;从待评估视频帧序列中,确定出至少一个子视频帧序列;按照预设的评分规则,对各子视频帧序列进行评分;根据各子视频帧序列的评分,对待评估视频帧序列进行质量评估,有效提升了对视频质量进行评估的合理性和有效性。The video quality evaluation method of the present disclosure obtains the video frame sequence to be evaluated; determines at least one sub-video frame sequence from the video frame sequence to be evaluated; scores each sub-video frame sequence according to a preset scoring rule; The score of each sub-video frame sequence is used to evaluate the quality of the video frame sequence to be evaluated, which effectively improves the rationality and effectiveness of evaluating the video quality.
进一步参考图4,其示出了图2所示的视频质量评估方法的又一个实施例的流程400。在本实施例中,预设的评分规则包括第一评分规则和第二评分规则。本实施例的视频质量评估方法的流程400,可 包括以下步骤:Further reference is made to FIG. 4 , which shows a flow 400 of yet another embodiment of the video quality assessment method shown in FIG. 2 . In this embodiment, the preset scoring rules include a first scoring rule and a second scoring rule. The process 400 of the video quality assessment method of the present embodiment may include the following steps:
步骤401,获取待评估视频帧序列。 Step 401, acquiring a sequence of video frames to be evaluated.
在本实施例中,步骤401的实现细节和技术效果,可以参考对步骤201的描述,在此不再赘述。In this embodiment, for the implementation details and technical effects of step 401, reference may be made to the description of step 201, and details are not repeated here.
步骤402,基于所述待评估视频帧序列中每一视频帧的颜色均值梯度,确定出至少一个子视频帧序列。Step 402: Determine at least one sub-video frame sequence based on the color mean gradient of each video frame in the to-be-evaluated video frame sequence.
在本实施例中,步骤402实现细节和技术效果,可以参考对步骤202描述,在此不再赘述。In this embodiment, the implementation details and technical effects of step 402 can be referred to the description of step 202, which will not be repeated here.
步骤403,按照第一评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第一评分。Step 403: Score each sub-video frame sequence according to the first scoring rule to obtain a first score of each sub-video frame sequence.
在本实施例中,执行主体按照第一评分规则对各子视频帧序列进行评分的方式可以是根据预设的时长评分对照表对各子视频帧序列进行评分,也可以根据各子视频帧序列的平均时长是否满足预设的时长阈值范围,对各子视频帧序列进行评分,本申请对此不作限定。In this embodiment, the way for the execution subject to score each sub-video frame sequence according to the first scoring rule may be to score each sub-video frame sequence according to a preset time-length scoring comparison table, or to score each sub-video frame sequence according to Whether the average duration of each sub-video frame sequence meets the preset duration threshold range, the sub-video frame sequence is scored, which is not limited in this application.
步骤404,按照第二评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第二评分。Step 404: Score each sub-video frame sequence according to the second scoring rule to obtain a second score of each sub-video frame sequence.
在本实施例中,执行主体按照第二评分规则对各子视频帧序列进行评分的方式可以包括多种,例如,根据预设的目标展示对象类别评分对照表对各子视频帧序列进行评分;根据各子视频帧序列的目标展示对象出现的频率是否满足预设的频率阈值范围对各子视频帧序列进行评分;根据各子视频帧序列的目标展示对象占据的面积与包含目标展示对象的视频帧的面积的比值是否满足预设的比值阈值范围对各子视频帧序列进行评分等等,本申请对此不作限定。In this embodiment, the execution subject may score each sub-video frame sequence according to the second scoring rule in a variety of ways, for example, scoring each sub-video frame sequence according to a preset target display object category scoring comparison table; Each sub-video frame sequence is scored according to whether the frequency of occurrence of the target display object of each sub-video frame sequence meets the preset frequency threshold range; according to the area occupied by the target display object of each sub-video frame sequence and the video containing the target display object Whether the ratio of the area of the frame satisfies the preset ratio threshold range is used to score each sub-video frame sequence, etc., which is not limited in this application.
在一些可选的方式中,第二评分规则包括以下至少一项:与各子视频帧序列的目标展示对象出现的频率相关联的第三评分规则;与各子视频帧序列的目标展示对象的面积相关联的第四评分规则。In some optional manners, the second scoring rule includes at least one of the following: a third scoring rule associated with the frequency of occurrence of the target display object of each sub-video frame sequence; Area is associated with the fourth scoring rule.
在本实现方式中,执行主体按照第三评分规则、第四评分规则中的至少一项,对各子视频帧序列进行评分。In this implementation manner, the execution subject scores each sub-video frame sequence according to at least one of the third scoring rule and the fourth scoring rule.
其中,第三评分规则与各子视频帧序列的目标展示对象在相应子视频帧序列中出现的频率相关联,也即,第三评分规则依据各子视频 帧序列的目标展示对象在相应子视频帧序列中出现的频率设定。Wherein, the third scoring rule is associated with the frequency of the target display object of each sub-video frame sequence appearing in the corresponding sub-video frame sequence, that is, the third scoring rule is based on the target display object of each sub-video frame sequence in the corresponding sub-video frame sequence. The frequency setting that appears in the frame sequence.
这里,各子视频帧序列的目标展示对象在相应子视频帧序列中出现的频率可以是目标展示对象在相应子视频帧序列中出现的频率,也可以是目标展示对象的类型在相应子视频帧序列对应的展示对象类型集合中出现的频率,本申请对此不作限定。Here, the frequency at which the target display object of each sub-video frame sequence appears in the corresponding sub-video frame sequence may be the frequency at which the target display object appears in the corresponding sub-video frame sequence, or the type of the target display object in the corresponding sub-video frame. The frequency of occurrence in the set of display object types corresponding to the sequence is not limited in this application.
与目标展示对象出现的频率相关联的第三评分规则可以是预设的频率评分对照表,也可以是根据各子视频序列的目标展示对象在相应子视频帧序列中出现的频率是否满足预设的频率阈值范围对各子视频帧序列进行评分。The third scoring rule associated with the frequency of occurrence of the target display object may be a preset frequency score comparison table, or may be based on whether the frequency of the target display object of each sub-video sequence appearing in the corresponding sub-video frame sequence satisfies the preset frequency The frequency threshold range of the sub-video frame sequence is scored.
具体地,若子视频帧序列M对应的展示对象类别集合为{人,人,人,动物},其目标展示对象的类别为人,且人在展示对象类别集合中出现的频率为0.75;子视频帧序列N对应的展示对象类别集合为{车,车,动物},其目标展示对象的类别为车,且车在展示对象类别集合中出现的频率为0.67。若预设的频率阈值范围为大于等于0.7小于等于0.8,第三评分规则为若满足预设的频率阈值范围则评分为1,若不满足预设的频率阈值范围,则评分为0,则根据第三评分规则,子视频帧序列M的评分为1,子视频帧序列N的评分为0。Specifically, if the display object category set corresponding to the sub-video frame sequence M is {person, person, person, animal}, the category of the target display object is human, and the frequency of people appearing in the display object category set is 0.75; The display object category set corresponding to the sequence N is {car, car, animal}, the target display object category is car, and the frequency of the car in the display object category set is 0.67. If the preset frequency threshold range is greater than or equal to 0.7 and less than or equal to 0.8, the third scoring rule is that if the preset frequency threshold range is met, the score is 1, and if the preset frequency threshold range is not met, the score is 0, according to In the third scoring rule, the score of the sub-video frame sequence M is 1, and the score of the sub-video frame sequence N is 0.
其中,第四评分规则与各子视频帧序列的目标展示对象在相应子视频帧序列中的面积相关联,也即,第四评分规则依据各子视频帧序列的目标展示的面积设定。The fourth scoring rule is associated with the area of the target display object of each sub video frame sequence in the corresponding sub video frame sequence, that is, the fourth scoring rule is set according to the target display area of each sub video frame sequence.
这里,与各子视频帧序列中目标展示对象的面积相关联的第四评分规则可以是与各子视频帧序列中目标展示对象的总面积与包含目标展示对象的视频帧的总面积的第一比值相关联的第四评分规则,也可以是与各子视频帧序列中包含目标展示对象的各视频帧中面积最大的目标展示对象的面积的总和与包含目标展示对象的视频帧的总面积的第二比值相关联的第四评分规则,本申请对此不作限定。Here, the fourth scoring rule associated with the area of the target display object in each sub-video frame sequence may be the first score between the total area of the target display object in each sub-video frame sequence and the total area of the video frame containing the target display object The fourth scoring rule associated with the ratio can also be the sum of the area of the target display object with the largest area in each video frame containing the target display object in each sub-video frame sequence and the total area of the video frame containing the target display object. The fourth scoring rule associated with the second ratio is not limited in this application.
具体地,一个子视频帧序列中包括N(N=1,2,…,n)个视频帧,假设第n帧视频帧中检测到M n个展示对象,则展示对象集合为
Figure PCTCN2022070276-appb-000018
Figure PCTCN2022070276-appb-000019
其中第i个展示对象
Figure PCTCN2022070276-appb-000020
对应的展示对象类别为
Figure PCTCN2022070276-appb-000021
展示对象类别集合为
Figure PCTCN2022070276-appb-000022
对L n去重后的类别集合为
Figure PCTCN2022070276-appb-000023
Figure PCTCN2022070276-appb-000024
C n≤M n。对于类别
Figure PCTCN2022070276-appb-000025
需在所有类别为
Figure PCTCN2022070276-appb-000026
的展示对象中,保留面积最大的展示对象,记展示对象
Figure PCTCN2022070276-appb-000027
的面积为
Figure PCTCN2022070276-appb-000028
则对于类别
Figure PCTCN2022070276-appb-000029
筛选后的面积最大的展示对象为
Figure PCTCN2022070276-appb-000030
Specifically, a sub-video frame sequence includes N ( N =1, 2, .
Figure PCTCN2022070276-appb-000018
Figure PCTCN2022070276-appb-000019
where the i-th display object
Figure PCTCN2022070276-appb-000020
The corresponding display object category is
Figure PCTCN2022070276-appb-000021
The collection of display object categories is
Figure PCTCN2022070276-appb-000022
The set of categories after deduplication of L n is
Figure PCTCN2022070276-appb-000023
Figure PCTCN2022070276-appb-000024
C n ≤ Mn . for the category
Figure PCTCN2022070276-appb-000025
Required in all categories
Figure PCTCN2022070276-appb-000026
Among the display objects, keep the display object with the largest area, and record the display object
Figure PCTCN2022070276-appb-000027
The area of is
Figure PCTCN2022070276-appb-000028
then for the category
Figure PCTCN2022070276-appb-000029
After screening, the display object with the largest area is
Figure PCTCN2022070276-appb-000030
Figure PCTCN2022070276-appb-000031
Figure PCTCN2022070276-appb-000031
筛选后的第n帧视频帧的展示对象集合为
Figure PCTCN2022070276-appb-000032
The set of display objects of the filtered n-th video frame is
Figure PCTCN2022070276-appb-000032
由N个视频帧对应的展示对象集合构成的展示对象集合为
Figure PCTCN2022070276-appb-000033
Figure PCTCN2022070276-appb-000034
展示对象集合S对应的展示对象类别集合为L={l 1,l 2,…,l M},经类别去重后,得到的类别集合为L′={l 1′,l 2′,…,l C′}。在展示对象集合S中找到所有类别为目标展示对象类别
Figure PCTCN2022070276-appb-000035
的展示对象,并计算他们的面积在视频帧中的比例,每一视频帧的大小为W×H,s i的面积为area(s i),则第q个子视频帧序列的目标展示对象的平均面积比例R为
The display object set composed of the display object sets corresponding to N video frames is:
Figure PCTCN2022070276-appb-000033
Figure PCTCN2022070276-appb-000034
The display object category set corresponding to the display object set S is L={l 1 ,l 2 ,...,l M }. After the categories are deduplicated, the obtained category set is L'={l 1 ',l 2 ',... , l C ′}. Find all categories in the display object set S as the target display object category
Figure PCTCN2022070276-appb-000035
, and calculate the proportion of their area in the video frame, the size of each video frame is W×H, and the area of s i is area(s i ), then the target display object of the qth sub-video frame sequence is The average area ratio R is
Figure PCTCN2022070276-appb-000036
Figure PCTCN2022070276-appb-000036
相应地,第四评分规则可以是比值评分对照表,也可以是根据各子视频序列的比值是否满足预设的比值阈值范围对各子视频帧序列进行评分。Correspondingly, the fourth scoring rule may be a ratio scoring comparison table, or may be scoring each sub-video frame sequence according to whether the ratio of each sub-video sequence satisfies a preset ratio threshold range.
具体地,若子视频帧序列M包括视频帧C1(包含的展示对象两个人和一辆车)和视频帧C2(包含的展示对象为两个人和一只动物),故其对应的展示对象类别集合为{人,车,人,动物},其目标展示对象为人。进一步地,执行主体计算目标展示对象的总面积(视频帧C1中两个人的面积与视频帧C2中两个人的面积的和)与包含目标展示对象的视频帧的总面积(视频帧C1和视频帧C2的总面积)的比值,并且,第四评分规则为判断比值是否满足预设的比值阈值范围,若满足,则评分为1,若不满足,则评分为0。若上述计算得到的比值为0.5,预设的比值阈值范围为大于0.3小于等于0.6,则子视频帧序列M的评分为1。Specifically, if the sub-video frame sequence M includes video frame C1 (including two display objects and one car) and video frame C2 (including two display objects and one animal), the corresponding display object category set For {person, car, person, animal}, its target display object is person. Further, the execution subject calculates the total area of the target display object (the sum of the area of the two people in the video frame C1 and the area of the two people in the video frame C2) and the total area of the video frame containing the target display object (the video frame C1 and the video The ratio of the total area of the frame C2), and the fourth scoring rule is to judge whether the ratio satisfies the preset ratio threshold range, if so, the score is 1, if not, the score is 0. If the ratio calculated above is 0.5, and the preset ratio threshold range is greater than 0.3 and less than or equal to 0.6, the score of the sub video frame sequence M is 1.
该实现通过按照第三评分规则和/或第四评分规则,对各子视频帧序列进行评分,进而根据各子视频帧序列的评分,对待评估视频帧序列进行质量评估,充分考虑了各子视频帧序列中的目标展示对象的面 积和/或出现的频率对视频质量的影响,进一步提升了评估的视频质量的准确性和有效性。In this implementation, each sub-video frame sequence is scored according to the third scoring rule and/or the fourth scoring rule, and then the quality of the video frame sequence to be evaluated is evaluated according to the score of each sub-video frame sequence, fully considering each sub-video frame sequence. The influence of the area and/or the frequency of occurrence of the target object in the frame sequence on the video quality further improves the accuracy and validity of the assessed video quality.
在一些可选的方式中,按照预设的评分规则,对各子视频帧序列进行评分,包括:按照第三评分规则对各子视频帧序列进行评分,得到第三评分,按照第四评分规则对各子视频帧序列进行评分,得到第四评分,根据第三评分和第四评分对各子视频帧序列进行评分。In some optional manners, scoring each sub-video frame sequence according to a preset scoring rule includes: scoring each sub-video frame sequence according to a third scoring rule to obtain a third score, and according to a fourth scoring rule Each sub-video frame sequence is scored to obtain a fourth score, and each sub-video frame sequence is scored according to the third score and the fourth score.
在本实现方式中,预设的评分规则包括第三评分规则和第四评分规则,执行主体可以首先按照与各子视频帧序列的目标展示对象在相应子视频帧序列中出现的频率相关联的第三评分规则对各子视频帧进行评分,得到第三评分;然后,按照与各子视频帧序列的目标展示对象在相应子视频帧序列中的面积相关联的第四评分规则对各子视频帧序列进行评分,得到第四评分,最后根据第三评分和第四评分,得到各子视频帧序列的评分。In this implementation manner, the preset scoring rules include a third scoring rule and a fourth scoring rule, and the execution subject may first follow the frequency associated with the frequency of the target display object of each sub-video frame sequence appearing in the corresponding sub-video frame sequence. The third scoring rule scores each sub-video frame to obtain a third score; then, according to the fourth scoring rule associated with the area of the target display object of each sub-video frame sequence in the corresponding sub-video frame sequence The frame sequence is scored to obtain the fourth score, and finally the score of each sub-video frame sequence is obtained according to the third score and the fourth score.
这里,执行主体可以根据第三评分和第四评分以及各自的权重系数,得到各子视频帧序列的评分。Here, the executive body may obtain the score of each sub-video frame sequence according to the third score and the fourth score and their respective weight coefficients.
其中,分别与第三评分和第四评分相对应的权重系数,可根据经验、实际需求和具体的应用场景进行设定,本申请不作限定。The weight coefficients corresponding to the third score and the fourth score respectively may be set according to experience, actual needs and specific application scenarios, which are not limited in this application.
该实现方式通过根据第三评分规则和第四评分规则单独对各子视频帧序列进行评分,有助于在综合考虑各子视频帧序列的目标展示对象的频率以及各子视频帧序列的目标展示对象的面积对视频质量影响的条件下,提升得到的各子视频帧序列的评分的准确性,进而进一步提升对视频质量进行评估的有效性和合理性。In this implementation, each sub-video frame sequence is scored independently according to the third scoring rule and the fourth scoring rule, which is helpful for comprehensively considering the frequency of the target display objects of each sub-video frame sequence and the target display of each sub-video frame sequence. Under the condition that the area of the object affects the video quality, the accuracy of the obtained scores of each sub-video frame sequence is improved, thereby further improving the validity and rationality of evaluating the video quality.
此外,需要指出的是,若预设的评分规则包括第一评分规则、第三评分规则、第四评分规则,则按照预设的评分规则,对各子视频帧序列进行评分,包括:按照第一评分规则,对各子视频帧序列进行评分,得到第一评分;按照第三评分规则,对各子视频帧序列进行评分,得到第三评分;按照第四评分规则,对各子视频帧序列进行评分,得到第四评分;根据第一评分、第三评分和第四评分,对各子视频帧序列进行评分。In addition, it should be pointed out that if the preset scoring rules include the first scoring rule, the third scoring rule, and the fourth scoring rule, then according to the preset scoring rules, each sub-video frame sequence is scored, including: according to the first scoring rule According to a scoring rule, each sub-video frame sequence is scored to obtain the first score; according to the third scoring rule, each sub-video frame sequence is scored to obtain a third score; according to the fourth scoring rule, each sub-video frame sequence is scored according to the fourth scoring rule. Scoring is performed to obtain a fourth score; according to the first score, the third score and the fourth score, each sub-video frame sequence is scored.
这里,若对于一个视频帧序列,共包括Q个子视频帧序列,按照 第一评分规则、第三评分规则、第四评分规则,对各子视频帧序列进行评分,得到各子视频帧序列的第一评分为Score 1,第三评分为
Figure PCTCN2022070276-appb-000037
Figure PCTCN2022070276-appb-000038
第四评分为
Figure PCTCN2022070276-appb-000039
则视频帧序列的最终得分Score可以表示为
Here, if a video frame sequence includes a total of Q sub-video frame sequences, according to the first scoring rule, the third scoring rule, and the fourth scoring rule, each sub-video frame sequence is scored to obtain the first score of each sub-video frame sequence. The first score is Score 1 , and the third score is
Figure PCTCN2022070276-appb-000037
Figure PCTCN2022070276-appb-000038
The fourth rating is
Figure PCTCN2022070276-appb-000039
Then the final score Score of the video frame sequence can be expressed as
Figure PCTCN2022070276-appb-000040
Figure PCTCN2022070276-appb-000040
具体地,若子视频帧序列M包括视频帧C1(包含的展示对象两个人和一辆车)和视频帧C2(包含的展示对象为两个人和一只动物),子视频帧序列M的平均时长为2s,其对应的展示对象类别集合为{人,车,人,动物},目标展示对象为人。进一步地,执行主体计算目标展示对象的总面积(视频帧C1中两个人的面积与视频帧C2中两个人的面积的和)与包含目标展示对象的视频帧的总面积(视频帧C1和视频帧C2的总面积)的比值,以及计算目标展示对象在展示对象类别集合中出现的频率0.5,并且,第一评分规则为判断子视频帧序列的平均时长是否满足预设的时长阈值范围,若满足,则评分为1,若不满足,则评分为0,第三评分规则为判断频率是否满足预设的频率阈值范围,若满足,则评分为1,若不满足,则评分为0,第四评分规则为判断比值是否满足预设的比值阈值范围,若满足,则评分为1,若不满足,则评分为0。若上述计算得到的比值为0.5,预设的比值阈值范围为大于0.3小于等于0.6,预设的时长阈值范围为大于1s小于等于2s,预设的频率阈值范围为大于0.4小于等于0.6,则子视频帧序列M的评分可以表示为第一评分、第三评分和第四评分的和3。Specifically, if the sub-video frame sequence M includes video frame C1 (including two display objects and one vehicle) and video frame C2 (including two display objects and one animal), the average duration of the sub-video frame sequence M is is 2s, the corresponding display object category set is {person, car, person, animal}, and the target display object is human. Further, the execution subject calculates the total area of the target display object (the sum of the area of the two people in the video frame C1 and the area of the two people in the video frame C2) and the total area of the video frame containing the target display object (the video frame C1 and the video The ratio of the total area of the frame C2), and the frequency 0.5 that the target display object appears in the display object category set, and the first scoring rule is to judge whether the average duration of the sub-video frame sequence satisfies the preset duration threshold range, if If it is satisfied, the score is 1; if not, the score is 0. The third scoring rule is to judge whether the frequency meets the preset frequency threshold range. If it is satisfied, the score is 1; if not, the score is 0. The four scoring rules are to judge whether the ratio satisfies the preset ratio threshold range. If so, the score is 1; if not, the score is 0. If the ratio calculated above is 0.5, the preset ratio threshold range is greater than 0.3 and less than or equal to 0.6, the preset duration threshold range is greater than 1s and less than or equal to 2s, and the preset frequency threshold range is greater than 0.4 and less than or equal to 0.6, then the sub- The score of the video frame sequence M can be expressed as the sum 3 of the first score, the third score and the fourth score.
该实现方式通过根据第一评分、第三评分规则和第四评分规则单独对各子视频帧序列进行评分,有助于在综合考虑各子视频帧序列的平均时长、各子视频帧序列的目标展示对象的频率以及各子视频帧序列的目标展示对象的面积对视频质量影响的条件下,提升得到的各子视频帧序列的评分的准确性,进而进一步提升对视频质量进行评估的有效性和合理性。In this implementation, each sub-video frame sequence is scored independently according to the first scoring, the third scoring rule and the fourth scoring rule, which is helpful for comprehensively considering the average duration of each sub-video frame sequence and the target of each sub-video frame sequence. Under the condition that the frequency of the display objects and the area of the target display object of each sub-video frame sequence affect the video quality, the accuracy of the obtained score of each sub-video frame sequence is improved, thereby further improving the effectiveness and efficiency of evaluating the video quality. rationality.
步骤405,根据各子视频帧序列的第一评分和各子视频帧序列的第二评分,得到各子视频帧序列的评分。Step 405: Obtain a score of each sub-video frame sequence according to the first score of each sub-video frame sequence and the second score of each sub-video frame sequence.
在本实施例中,执行主体可以直接根据第一评分和第二评分,得到各子视频帧序列的评分,也可以根据第一评分和第二评分以及各自的权重系数,得到各子视频帧序列的评分,本申请对此不作限定。In this embodiment, the execution body can directly obtain the score of each sub-video frame sequence according to the first score and the second score, or obtain each sub-video frame sequence according to the first score and the second score and their respective weight coefficients score, which is not limited in this application.
步骤406,根据各子视频帧序列的评分,对待评估视频帧序列进行质量评估。 Step 406, according to the score of each sub-video frame sequence, perform quality assessment on the video frame sequence to be evaluated.
在本实施例中,步骤406的实现细节和技术效果,可以参考对步骤204的描述,在此不再赘述。In this embodiment, for the implementation details and technical effects of step 406, reference may be made to the description of step 204, and details are not repeated here.
从图4中可以看出,与图2对应的实施例相比,本实施例中的视频质量评估方法的流程400体现了按照第一评分规则和第二评分规则分别对各子视频帧序列进行评分,得到第一评分和第二评分,并根据第一评分和第二评分,对各子视频帧序列进行评分,进而根据各子视频帧序列的评分,对待评估视频帧序列进行质量评估。本实施例描述的方案通过根据第一评分规则和第二评分规则单独对各子视频帧序列进行评分,有助于在综合考虑各子视频帧序列的平均时长以及各子视频帧序列的目标展示对象对视频指量影响的条件下,提升得到的各子视频帧序列的评分的准确性,提升对视频质量进行评估的有效性和合理性。As can be seen from FIG. 4 , compared with the embodiment corresponding to FIG. 2 , the process 400 of the video quality evaluation method in this embodiment embodies that each sub-video frame sequence is evaluated according to the first scoring rule and the second scoring rule. The first score and the second score are obtained, and each sub-video frame sequence is scored according to the first score and the second score, and then the quality of the video frame sequence to be evaluated is evaluated according to the score of each sub-video frame sequence. The solution described in this embodiment grades each sub-video frame sequence independently according to the first scoring rule and the second scoring rule, which helps to comprehensively consider the average duration of each sub-video frame sequence and the target display of each sub-video frame sequence Under the condition of the influence of the object on the video index, the accuracy of the obtained scores of each sub-video frame sequence is improved, and the validity and rationality of the video quality evaluation are improved.
进一步参考图5,作为对上述各图所示方法的实现,本申请提供了一种视频质量评估装置的一个实施例,该装置实施例与图1所示的方法实施例相对应,该装置具体可以应用于各种电子设备中。Further referring to FIG. 5 , as an implementation of the methods shown in the above figures, the present application provides an embodiment of a device for evaluating video quality. The embodiment of the device corresponds to the embodiment of the method shown in FIG. 1 . Can be used in various electronic devices.
如图5所示,本实施例的视频质量评估装置500包括:获取模块501、确定模块502和评分模块503和评估模块504。As shown in FIG. 5 , the video quality evaluation apparatus 500 in this embodiment includes: an acquisition module 501 , a determination module 502 , a scoring module 503 and an evaluation module 504 .
其中,获取模块501,可被配置成获取待评估视频帧序列。The obtaining module 501 may be configured to obtain a sequence of video frames to be evaluated.
确定模块502,可被配置成从待评估视频帧序列中,确定出至少一个子视频帧序列。The determining module 502 may be configured to determine at least one sub-video frame sequence from the video frame sequence to be evaluated.
评分模块503,可被配置成按照预设的评分规则,对各子视频帧序列进行评分。The scoring module 503 may be configured to score each sub-video frame sequence according to a preset scoring rule.
评估模块504,可被配置成根据各子视频帧序列的评分,对待评估视频帧序列进行质量评估。The evaluation module 504 may be configured to perform quality evaluation on the video frame sequence to be evaluated according to the score of each sub-video frame sequence.
在本实施例的一些可选的方式中,预设的评分规则包括以下至少一项:与各子视频帧序列的平均时长相关联的第一评分规则;与各子视频帧序列的目标展示对象相关联的第二评分规则。In some optional ways of this embodiment, the preset scoring rules include at least one of the following: a first scoring rule associated with the average duration of each sub-video frame sequence; a target display object associated with each sub-video frame sequence The associated second scoring rule.
在本实施例的一些可选的方式中,预设的评分规则包括第一评分规则和第二评分规则,评分模块进一步被配置成:按照第一评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第一评分;按照第二评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第二评分;根据第一评分和第二评分,对各子视频帧序列进行评分。In some optional ways of this embodiment, the preset scoring rule includes a first scoring rule and a second scoring rule, and the scoring module is further configured to: score each sub-video frame sequence according to the first scoring rule, and obtain The first score of each sub-video frame sequence; the sub-video frame sequence is scored according to the second scoring rule to obtain the second score of each sub-video frame sequence; according to the first score and the second score, each sub-video frame sequence is scored. to rate.
在本实施例的一些可选的方式中,第二评分规则包括以下至少一项:与各子视频帧序列的目标展示对象在相应子视频帧序列中出现的频率相关联的第三评分规则;与各子视频帧序列的目标展示对象在相应子视频帧序列中的面积相关联的第四评分规则。In some optional manners of this embodiment, the second scoring rule includes at least one of the following: a third scoring rule associated with the frequency with which the target display object of each sub-video frame sequence appears in the corresponding sub-video frame sequence; A fourth scoring rule associated with the area of each sub-video frame sequence's target presentation object in the corresponding sub-video frame sequence.
在本实施例的一些可选的方式中,预设的评分规则包括第三评分规则和第四评分规则,评分模块进一步被配置成:按照第三评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第三评分;按照第四评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第四评分;根据第三评分和第四评分,对各子视频帧序列进行评分。In some optional ways of this embodiment, the preset scoring rules include a third scoring rule and a fourth scoring rule, and the scoring module is further configured to: score each sub-video frame sequence according to the third scoring rule, and obtain The third score of each sub-video frame sequence; according to the fourth scoring rule, score each sub-video frame sequence to obtain the fourth score of each sub-video frame sequence; according to the third score and the fourth score, to each sub-video frame sequence to rate.
根据本申请的实施例,本申请还提供了一种电子设备和一种可读存储介质。According to the embodiments of the present application, the present application further provides an electronic device and a readable storage medium.
如图6所示,是根据本申请实施例的视频质量评估方法的电子设备的框图。As shown in FIG. 6 , it is a block diagram of an electronic device according to the video quality assessment method according to an embodiment of the present application.
600是根据本申请实施例的视频质量评估方法的电子设备的框图。电子设备旨在表示各种形式的数字计算机,诸如,膝上型计算机、台式计算机、工作台、个人数字助理、服务器、刀片式服务器、大型计算机、和其它适合的计算机。电子设备还可以表示各种形式的移动装置,诸如,个人数字处理、蜂窝电话、智能电话、可穿戴设备和其它类似的计算装置。本文所示的部件、它们的连接和关系、以及它们的功能仅仅作为示例,并且不意在限制本文中描述的和/或者要求的本申请的实现。600 is a block diagram of an electronic device for a video quality assessment method according to an embodiment of the present application. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframe computers, and other suitable computers. Electronic devices may also represent various forms of mobile devices, such as personal digital processors, cellular phones, smart phones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions are by way of example only, and are not intended to limit implementations of the application described and/or claimed herein.
如图6所示,该电子设备包括:一个或多个处理器601、存储器602,以及用于连接各部件的接口,包括高速接口和低速接口。各个部件利用不同的总线互相连接,并且可以被安装在公共主板上或者根据需要以其它方式安装。处理器可以对在电子设备内执行的指令进行处理,包括存储在存储器中或者存储器上以在外部输入/输出装置(诸如,耦合至接口的显示设备)上显示GUI的图形信息的指令。在其它实施方式中,若需要,可以将多个处理器和/或多条总线与多个存储器和多个存储器一起使用。同样,可以连接多个电子设备,各个设备提供部分必要的操作(例如,作为服务器阵列、一组刀片式服务器、或者多处理器系统)。图6中以一个处理器601为例。As shown in FIG. 6, the electronic device includes: one or more processors 601, a memory 602, and interfaces for connecting various components, including a high-speed interface and a low-speed interface. The various components are interconnected using different buses and may be mounted on a common motherboard or otherwise as desired. The processor may process instructions executed within the electronic device, including instructions stored in or on memory to display graphical information of the GUI on an external input/output device, such as a display device coupled to the interface. In other embodiments, multiple processors and/or multiple buses may be used with multiple memories and multiple memories, if desired. Likewise, multiple electronic devices may be connected, each providing some of the necessary operations (eg, as a server array, a group of blade servers, or a multiprocessor system). A processor 601 is taken as an example in FIG. 6 .
存储器602即为本申请所提供的非瞬时计算机可读存储介质。其中,所述存储器存储有可由至少一个处理器执行的指令,以使所述至少一个处理器执行本申请所提供的视频质量评估方法。本申请的非瞬时计算机可读存储介质存储计算机指令,该计算机指令用于使计算机执行本申请所提供的视频质量评估方法。The memory 602 is the non-transitory computer-readable storage medium provided by the present application. Wherein, the memory stores instructions executable by at least one processor, so that the at least one processor executes the video quality assessment method provided by the present application. The non-transitory computer-readable storage medium of the present application stores computer instructions, and the computer instructions are used to cause the computer to execute the video quality assessment method provided by the present application.
存储器602作为一种非瞬时计算机可读存储介质,可用于存储非瞬时软件程序、非瞬时计算机可执行程序以及模块,如本申请实施例中的视频质量评估方法对应的程序指令/模块(例如,附图5所示的获取模块501、确定模块502、评分模块503和评估模块504)。处理器601通过运行存储在存储器602中的非瞬时软件程序、指令以及模块,从而执行服务器的各种功能应用以及数据处理,即实现上述方法实施例中的视频质量评估方法。As a non-transitory computer-readable storage medium, the memory 602 can be used to store non-transitory software programs, non-transitory computer-executable programs and modules, such as program instructions/modules corresponding to the video quality assessment method in the embodiments of the present application (for example, The acquisition module 501, the determination module 502, the scoring module 503 and the evaluation module 504 shown in FIG. 5). The processor 601 executes various functional applications and data processing of the server by running the non-transitory software programs, instructions and modules stored in the memory 602, ie, implements the video quality assessment method in the above method embodiments.
存储器602可以包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需要的应用程序;存储数据区可存储视频质量评估的电子设备的使用所创建的数据等。此外,存储器602可以包括高速随机存取存储器,还可以包括非瞬时存储器,例如至少一个磁盘存储器件、闪存器件、或其他非瞬时固态存储器件。在一些实施例中,存储器602可选包括相对于处理器601远程设置的存储器,这些远程存储器可以通过网络连接至视频质量评估的电子设备。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通 信网及其组合。The memory 602 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function; the storage data area may store data created by the use of electronic equipment for video quality assessment, and the like. Additionally, memory 602 may include high-speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid state storage device. In some embodiments, memory 602 may optionally include memory located remotely relative to processor 601, and these remote memories may be connected via a network to the electronic device for video quality assessment. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
视频质量评估方法的电子设备还可以包括:输入装置603和输出装置604。处理器601、存储器602、输入装置603和输出装置604可以通过总线或者其他方式连接,图6中以通过总线连接为例。The electronic device of the video quality assessment method may further include: an input device 603 and an output device 604 . The processor 601 , the memory 602 , the input device 603 and the output device 604 may be connected by a bus or in other ways, and the connection by a bus is taken as an example in FIG. 6 .
输入装置603可接收输入的数字或字符信息,例如触摸屏、小键盘、鼠标、轨迹板、触摸板、指示杆、一个或者多个鼠标按钮、轨迹球、操纵杆等输入装置。输出装置604可以包括显示设备、辅助照明装置(例如,LED)和触觉反馈装置(例如,振动电机)等。该显示设备可以包括但不限于,液晶显示器(LCD)、发光二极管(LED)显示器和等离子体显示器。在一些实施方式中,显示设备可以是触摸屏。The input device 603 may receive input numerical or character information, such as a touch screen, a keypad, a mouse, a trackpad, a touchpad, a pointing stick, one or more mouse buttons, a trackball, a joystick, and other input devices. Output devices 604 may include display devices, auxiliary lighting devices (eg, LEDs), haptic feedback devices (eg, vibration motors), and the like. The display device may include, but is not limited to, a liquid crystal display (LCD), a light emitting diode (LED) display, and a plasma display. In some implementations, the display device may be a touch screen.
此处描述的系统和技术的各种实施方式可以在数字电子电路系统、集成电路系统、专用ASIC(专用集成电路)、计算机硬件、固件、软件、和/或它们的组合中实现。这些各种实施方式可以包括:实施在一个或者多个计算机程序中,该一个或者多个计算机程序可在包括至少一个可编程处理器的可编程系统上执行和/或解释,该可编程处理器可以是专用或者通用可编程处理器,可以从存储系统、至少一个输入装置、和至少一个输出装置接收数据和指令,并且将数据和指令传输至该存储系统、该至少一个输入装置、和该至少一个输出装置。Various implementations of the systems and techniques described herein can be implemented in digital electronic circuitry, integrated circuit systems, application specific ASICs (application specific integrated circuits), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include being implemented in one or more computer programs executable and/or interpretable on a programmable system including at least one programmable processor that The processor, which may be a special purpose or general-purpose programmable processor, may receive data and instructions from a storage system, at least one input device, and at least one output device, and transmit data and instructions to the storage system, the at least one input device, and the at least one output device an output device.
这些计算程序(也称作程序、软件、软件应用、或者代码)包括可编程处理器的机器指令,并且可以利用高级过程和/或面向对象的编程语言、和/或汇编/机器语言来实施这些计算程序。如本文使用的,术语“机器可读介质”和“计算机可读介质”指的是用于将机器指令和/或数据提供给可编程处理器的任何计算机程序产品、设备、和/或装置(例如,磁盘、光盘、存储器、可编程逻辑装置(PLD)),包括,接收作为机器可读信号的机器指令的机器可读介质。术语“机器可读信号”指的是用于将机器指令和/或数据提供给可编程处理器的任何信号。These computational programs (also referred to as programs, software, software applications, or codes) include machine instructions for programmable processors, and may be implemented using high-level procedural and/or object-oriented programming languages, and/or assembly/machine languages calculation program. As used herein, the terms "machine-readable medium" and "computer-readable medium" refer to any computer program product, apparatus, and/or apparatus for providing machine instructions and/or data to a programmable processor ( For example, magnetic disks, optical disks, memories, programmable logic devices (PLDs), including machine-readable media that receive machine instructions as machine-readable signals. The term "machine-readable signal" refers to any signal used to provide machine instructions and/or data to a programmable processor.
为了提供与用户的交互,可以在计算机上实施此处描述的系统和技术,该计算机具有:用于向用户显示信息的显示装置(例如,CRT(阴极射线管)或者LCD(液晶显示器)监视器);以及键盘和指向装置(例如,鼠标或者轨迹球),用户可以通过该键盘和该指向装置 来将输入提供给计算机。其它种类的装置还可以用于提供与用户的交互;例如,提供给用户的反馈可以是任何形式的传感反馈(例如,视觉反馈、听觉反馈、或者触觉反馈);并且可以用任何形式(包括声输入、语音输入或者、触觉输入)来接收来自用户的输入。To provide interaction with a user, the systems and techniques described herein may be implemented on a computer having a display device (eg, a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to the user ); and a keyboard and pointing device (eg, a mouse or trackball) through which a user can provide input to the computer. Other kinds of devices can also be used to provide interaction with the user; for example, the feedback provided to the user can be any form of sensory feedback (eg, visual feedback, auditory feedback, or tactile feedback); and can be in any form (including acoustic input, voice input, or tactile input) to receive input from the user.
可以将此处描述的系统和技术实施在包括后台部件的计算系统(例如,作为数据服务器)、或者包括中间件部件的计算系统(例如,应用服务器)、或者包括前端部件的计算系统(例如,具有图形用户界面或者网络浏览器的用户计算机,用户可以通过该图形用户界面或者该网络浏览器来与此处描述的系统和技术的实施方式交互)、或者包括这种后台部件、中间件部件、或者前端部件的任何组合的计算系统中。可以通过任何形式或者介质的数字数据通信(例如,通信网络)来将系统的部件相互连接。通信网络的示例包括:局域网(LAN)、广域网(WAN)和互联网。The systems and techniques described herein may be implemented on a computing system that includes back-end components (eg, as a data server), or a computing system that includes middleware components (eg, an application server), or a computing system that includes front-end components (eg, a user computer having a graphical user interface or web browser through which a user may interact with implementations of the systems and techniques described herein), or including such backend components, middleware components, Or any combination of front-end components in a computing system. The components of the system may be interconnected by any form or medium of digital data communication (eg, a communication network). Examples of communication networks include: Local Area Networks (LANs), Wide Area Networks (WANs), and the Internet.
计算机系统可以包括客户端和服务器。客户端和服务器一般远离彼此并且通常通过通信网络进行交互。通过在相应的计算机上运行并且彼此具有客户端-服务器关系的计算机程序来产生客户端和服务器的关系。A computer system can include clients and servers. Clients and servers are generally remote from each other and usually interact through a communication network. The relationship of client and server arises by computer programs running on the respective computers and having a client-server relationship to each other.
根据本申请实施例的技术方案,有助于对视频质量进行更加合理、有效地评估。According to the technical solutions of the embodiments of the present application, it is helpful to evaluate the video quality more reasonably and effectively.
应该理解,可以使用上面所示的各种形式的流程,重新排序、增加或删除步骤。例如,本发申请中记载的各步骤可以并行地执行也可以顺序地执行也可以不同的次序执行,只要能够实现本申请公开的技术方案所期望的结果,本文在此不进行限制。It should be understood that steps may be reordered, added or deleted using the various forms of flow shown above. For example, the steps described in the present application can be performed in parallel, sequentially or in different orders, and as long as the desired results of the technical solutions disclosed in the present application can be achieved, no limitation is imposed herein.
上述具体实施方式,并不构成对本申请保护范围的限制。本领域技术人员应该明白的是,根据设计要求和其他因素,可以进行各种修改、组合、子组合和替代。任何在本申请的精神和原则之内所作的修改、等同替换和改进等,均应包含在本申请保护范围之内。The above-mentioned specific embodiments do not constitute a limitation on the protection scope of the present application. It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions may occur depending on design requirements and other factors. Any modifications, equivalent replacements and improvements made within the spirit and principles of this application shall be included within the protection scope of this application.

Claims (13)

  1. 一种视频质量评估方法,所述方法包括:A video quality assessment method, the method comprising:
    获取待评估视频帧序列;Obtain the video frame sequence to be evaluated;
    从所述待评估视频帧序列中,确定出至少一个子视频帧序列,所述子视频帧序列为一组视频帧的颜色均值梯度满足预设条件的连续视频帧;From the to-be-evaluated video frame sequence, determine at least one sub-video frame sequence, where the sub-video frame sequence is a group of consecutive video frames whose color mean gradients of the video frames satisfy a preset condition;
    按照预设的评分规则,对各子视频帧序列进行评分,所述预设的评分规则与各子视频帧序列的属性信息相关联;According to a preset scoring rule, each sub-video frame sequence is scored, and the preset scoring rule is associated with the attribute information of each sub-video frame sequence;
    根据各子视频帧序列的评分,对所述待评估视频帧序列进行质量评估。According to the score of each sub-video frame sequence, quality assessment is performed on the video frame sequence to be evaluated.
  2. 根据权利要求1所述的方法,其中,所述预设的评分规则包括以下至少一项:The method according to claim 1, wherein the preset scoring rules include at least one of the following:
    与各子视频帧序列的平均时长相关联的第一评分规则;a first scoring rule associated with the average duration of each sub-video frame sequence;
    与各子视频帧序列的目标展示对象相关联的第二评分规则。A second scoring rule associated with the target presentation object of each sub-video frame sequence.
  3. 根据权利要求2所述的方法,其中,所述预设的评分规则包括:所述第一评分规则和所述第二评分规则,以及所述按照预设的评分规则,对各子视频帧序列进行评分,包括:The method according to claim 2, wherein the preset scoring rule comprises: the first scoring rule and the second scoring rule, and the preset scoring rule for each sub-video frame sequence Score, including:
    按照所述第一评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第一评分;Score each sub-video frame sequence according to the first scoring rule to obtain the first score of each sub-video frame sequence;
    按照所述第二评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第二评分;Scoring each sub-video frame sequence according to the second scoring rule to obtain a second score of each sub-video frame sequence;
    根据所述第一评分和所述第二评分,对各子视频帧序列进行评分。Each sub-video frame sequence is scored according to the first score and the second score.
  4. 根据权利要求2所述的方法,其中,所述第二评分规则包括以下至少一项:The method of claim 2, wherein the second scoring rule includes at least one of the following:
    与各子视频帧序列的目标展示对象在相应子视频帧序列中出现的频率相关联的第三评分规则;a third scoring rule associated with the frequency with which the target presentation object of each sub-video frame sequence appears in the corresponding sub-video frame sequence;
    与各子视频帧序列的目标展示对象在相应子视频帧序列中的面积相关联的第四评分规则。A fourth scoring rule associated with the area of each sub-video frame sequence's target presentation object in the corresponding sub-video frame sequence.
  5. 根据权利要求4所述的方法,其中,所述预设的评分规则包括所述第三评分规则和所述第四评分规则,以及所述按照预设的评分规则,对各子视频帧序列进行评分,包括:The method according to claim 4, wherein the preset scoring rule includes the third scoring rule and the fourth scoring rule, and the preset scoring rule is used to perform the evaluation on each sub-video frame sequence. Scoring, including:
    按照所述第三评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第三评分;Scoring each sub-video frame sequence according to the third scoring rule to obtain a third score of each sub-video frame sequence;
    按照所述第四评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第四评分;Scoring each sub-video frame sequence according to the fourth scoring rule to obtain a fourth score of each sub-video frame sequence;
    根据所述第三评分和所述第四评分,对各子视频帧序列进行评分。Each sub-video frame sequence is scored according to the third score and the fourth score.
  6. 一种视频质量评估装置,所述装置包括:A device for evaluating video quality, the device comprising:
    获取模块,被配置成获取待评估视频帧序列;an acquisition module, configured to acquire a sequence of video frames to be evaluated;
    确定模块,被配置成从所述待评估视频帧序列中,确定出至少一个子视频帧序列,所述子视频帧序列为一组视频帧的颜色值梯度均满足预设条件的连续视频帧;A determination module, configured to determine at least one sub-video frame sequence from the video frame sequence to be evaluated, where the sub-video frame sequence is a group of video frames whose color value gradients all satisfy a preset condition of continuous video frames;
    评分模块,被配置成按照预设的评分规则,对各子视频帧序列进行评分,所述预设的评分规则与各子视频帧序列的属性信息相关联;a scoring module, configured to score each sub-video frame sequence according to a preset scoring rule, and the preset scoring rule is associated with attribute information of each sub-video frame sequence;
    评估模块,被配置成根据各子视频帧序列的评分,对所述待评估视频帧序列进行质量评估。The evaluation module is configured to perform quality evaluation on the to-be-evaluated video frame sequence according to the score of each sub-video frame sequence.
  7. 根据权利要求6所述的装置,其中,所述预设的评分规则包括以下至少一项:The device according to claim 6, wherein the preset scoring rules include at least one of the following:
    与各子视频帧序列的平均时长相关联的第一评分规则;a first scoring rule associated with the average duration of each sub-video frame sequence;
    与各子视频帧序列的目标展示对象相关联的第二评分规则。A second scoring rule associated with the target presentation object of each sub-video frame sequence.
  8. 根据权利要求7所述的装置,其中,所述预设的评分规则包括所述第一评分规则和所述第二评分规则,所述评分模块进一步被配置成:The apparatus according to claim 7, wherein the preset scoring rules include the first scoring rules and the second scoring rules, and the scoring module is further configured to:
    按照所述第一评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第一评分;Score each sub-video frame sequence according to the first scoring rule to obtain the first score of each sub-video frame sequence;
    按照所述第二评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第二评分;Scoring each sub-video frame sequence according to the second scoring rule to obtain a second score of each sub-video frame sequence;
    根据所述第一评分和所述第二评分,对各子视频帧序列进行评分。Each sub-video frame sequence is scored according to the first score and the second score.
  9. 根据权利要求7所述的装置,其中,所述第二评分规则包括以下至少一项:The apparatus of claim 7, wherein the second scoring rule includes at least one of the following:
    与各子视频帧序列的目标展示对象在相应子视频帧序列中出现的频率相关联的第三评分规则;a third scoring rule associated with the frequency with which the target presentation object of each sub-video frame sequence appears in the corresponding sub-video frame sequence;
    与各子视频帧序列的目标展示对象在相应子视频帧序列中的面积相关联的第四评分规则。A fourth scoring rule associated with the area of each sub-video frame sequence's target presentation object in the corresponding sub-video frame sequence.
  10. 根据权利要求9所述的装置,其中,所述预设的评分规则包括所述第三评分规则和所述第四评分规则,所述评分模块进一步被配置成:The apparatus according to claim 9, wherein the preset scoring rule includes the third scoring rule and the fourth scoring rule, and the scoring module is further configured to:
    按照所述第三评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第三评分;Scoring each sub-video frame sequence according to the third scoring rule to obtain a third score of each sub-video frame sequence;
    按照所述第四评分规则对各子视频帧序列进行评分,得到各子视频帧序列的第四评分;Scoring each sub-video frame sequence according to the fourth scoring rule to obtain a fourth score of each sub-video frame sequence;
    根据所述第三评分和所述第四评分,对各子视频帧序列进行评分。Each sub-video frame sequence is scored according to the third score and the fourth score.
  11. 一种电子设备,其特征在于,包括:An electronic device, comprising:
    至少一个处理器;以及at least one processor; and
    与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein,
    所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理区执行,以使所述至少一个处理器能够执行权利要求1-5中任一项所述的方法。The memory stores instructions executable by the at least one processor, the instructions being executed by the at least one processing area to enable the at least one processor to perform the execution of any of claims 1-5 Methods.
  12. 一种存储有计算机指令的非瞬时计算机可读存储介质,其特征在于,所述计算机指令用于使所述计算机执行权利要求1-5中任一项所述的方法。A non-transitory computer-readable storage medium storing computer instructions, wherein the computer instructions are used to cause the computer to perform the method of any one of claims 1-5.
  13. 一种计算机程序产品,包括计算机程序,所述计算机程序在被处理器执行时实现根据权利要求1-5中任一项所述的方法。A computer program product comprising a computer program which, when executed by a processor, implements the method of any of claims 1-5.
PCT/CN2022/070276 2021-01-21 2022-01-05 Video quality assessment method and device WO2022156534A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110083566.5 2021-01-21
CN202110083566.5A CN113781384A (en) 2021-01-21 2021-01-21 Video quality evaluation method and device

Publications (1)

Publication Number Publication Date
WO2022156534A1 true WO2022156534A1 (en) 2022-07-28

Family

ID=78835499

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/070276 WO2022156534A1 (en) 2021-01-21 2022-01-05 Video quality assessment method and device

Country Status (2)

Country Link
CN (1) CN113781384A (en)
WO (1) WO2022156534A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116484051A (en) * 2023-02-24 2023-07-25 广州沐思信息科技有限公司 Course assessment method based on knowledge training platform

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113781384A (en) * 2021-01-21 2021-12-10 北京沃东天骏信息技术有限公司 Video quality evaluation method and device

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170154415A1 (en) * 2015-11-30 2017-06-01 Disney Enterprises, Inc. Saliency-weighted video quality assessment
CN108376147A (en) * 2018-01-24 2018-08-07 北京览科技有限公司 A kind of method and apparatus for obtaining the evaluation result information of video
CN110267119A (en) * 2019-06-28 2019-09-20 北京奇艺世纪科技有限公司 The evaluation method and relevant device of video highlight degree
CN111008578A (en) * 2019-11-26 2020-04-14 天津易华录信息技术有限公司 Video file data value evaluation method
CN112233075A (en) * 2020-09-30 2021-01-15 腾讯科技(深圳)有限公司 Video definition evaluation method and device, storage medium and electronic equipment
CN113781384A (en) * 2021-01-21 2021-12-10 北京沃东天骏信息技术有限公司 Video quality evaluation method and device

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110490845A (en) * 2019-07-26 2019-11-22 北京大米科技有限公司 A kind of image characteristic extracting method, device, storage medium and electronic equipment
CN110798735B (en) * 2019-08-28 2022-11-18 腾讯科技(深圳)有限公司 Video processing method and device and electronic equipment

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170154415A1 (en) * 2015-11-30 2017-06-01 Disney Enterprises, Inc. Saliency-weighted video quality assessment
CN108376147A (en) * 2018-01-24 2018-08-07 北京览科技有限公司 A kind of method and apparatus for obtaining the evaluation result information of video
CN110267119A (en) * 2019-06-28 2019-09-20 北京奇艺世纪科技有限公司 The evaluation method and relevant device of video highlight degree
CN111008578A (en) * 2019-11-26 2020-04-14 天津易华录信息技术有限公司 Video file data value evaluation method
CN112233075A (en) * 2020-09-30 2021-01-15 腾讯科技(深圳)有限公司 Video definition evaluation method and device, storage medium and electronic equipment
CN113781384A (en) * 2021-01-21 2021-12-10 北京沃东天骏信息技术有限公司 Video quality evaluation method and device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116484051A (en) * 2023-02-24 2023-07-25 广州沐思信息科技有限公司 Course assessment method based on knowledge training platform

Also Published As

Publication number Publication date
CN113781384A (en) 2021-12-10

Similar Documents

Publication Publication Date Title
WO2022156534A1 (en) Video quality assessment method and device
EP3869403A2 (en) Image recognition method, apparatus, electronic device, storage medium and program product
US7831111B2 (en) Method and mechanism for retrieving images
US9740789B2 (en) Search engine analytics and optimization for media content in social networks
US11182447B2 (en) Customized display of emotionally filtered social media content
KR20210132578A (en) Method, apparatus, device and storage medium for constructing knowledge graph
WO2021139221A1 (en) Method and apparatus for query auto-completion, device and computer storage medium
WO2019118236A1 (en) Deep learning on image frames to generate a summary
US20190122667A1 (en) Question Urgency in QA System with Visual Representation in Three Dimensional Space
CN111143613A (en) Method, system, electronic device and storage medium for selecting video cover
US20220027854A1 (en) Data processing method and apparatus, electronic device and storage medium
CN111368153A (en) Searching method and device
US20220036085A1 (en) Video event recognition method, electronic device and storage medium
US8977061B2 (en) Merging face clusters
CN113765873A (en) Method and apparatus for detecting abnormal access traffic
CN110427436B (en) Method and device for calculating entity similarity
CN108197203A (en) A kind of shop front head figure selection method, device, server and storage medium
CN116468479A (en) Method for determining page quality evaluation dimension, and page quality evaluation method and device
CN113032251B (en) Method, device and storage medium for determining service quality of application program
CN114639143B (en) Portrait archiving method, device and storage medium based on artificial intelligence
CN113435523B (en) Method, device, electronic equipment and storage medium for predicting content click rate
CN111695036B (en) Content recommendation method and device
CN112733879A (en) Model distillation method and device for different scenes
CN112542244A (en) Auxiliary information generation method, related device and computer program product
CN113378781B (en) Training method and device of video feature extraction model and electronic equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22742013

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 22742013

Country of ref document: EP

Kind code of ref document: A1