WO2022087826A1 - Video processing method and apparatus, mobile device, and readable storage medium - Google Patents

Video processing method and apparatus, mobile device, and readable storage medium Download PDF

Info

Publication number
WO2022087826A1
WO2022087826A1 PCT/CN2020/123998 CN2020123998W WO2022087826A1 WO 2022087826 A1 WO2022087826 A1 WO 2022087826A1 CN 2020123998 W CN2020123998 W CN 2020123998W WO 2022087826 A1 WO2022087826 A1 WO 2022087826A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
target
picture
preset
processed
Prior art date
Application number
PCT/CN2020/123998
Other languages
French (fr)
Chinese (zh)
Inventor
周娴
李鑫超
刘志鹏
Original Assignee
深圳市大疆创新科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市大疆创新科技有限公司 filed Critical 深圳市大疆创新科技有限公司
Priority to CN202080044431.1A priority Critical patent/CN114026874A/en
Priority to PCT/CN2020/123998 priority patent/WO2022087826A1/en
Publication of WO2022087826A1 publication Critical patent/WO2022087826A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/23418Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234381Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by altering the temporal resolution, e.g. decreasing the frame rate by frame skipping
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/44008Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • H04N21/440281Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by altering the temporal resolution, e.g. by frame skipping

Definitions

  • the present invention belongs to the field of network technologies, and in particular, relates to a video processing method, device, removable device and readable storage medium.
  • the present invention provides a video processing method, device, removable device and readable storage medium, so as to solve the problems of high screening cost and low screening efficiency.
  • an embodiment of the present invention provides a video processing method, which includes:
  • the target information For the target picture in the video to be processed, determine the target information corresponding to the picture parameter of the target picture; the target information is used to represent whether the picture parameter of the target picture meets the preset requirements;
  • an embodiment of the present invention provides a video processing apparatus, and the apparatus includes a memory and a processor;
  • the memory for storing program codes
  • the processor calls the program code, and when the program code is executed, is configured to perform the following operations:
  • the target information For the target picture in the video to be processed, determine the target information corresponding to the picture parameter of the target picture; the target information is used to represent whether the picture parameter of the target picture meets the preset requirements;
  • an embodiment of the present invention provides a movable device, where the movable device is configured to execute the steps in the video processing method described in the first aspect.
  • an embodiment of the present invention provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the following operations are implemented:
  • the target information For the target picture in the video to be processed, determine the target information corresponding to the picture parameter of the target picture; the target information is used to represent whether the picture parameter of the target picture meets the preset requirements;
  • target information corresponding to the picture parameters of the target picture is determined, wherein the target information is used to indicate whether the picture parameters of the target picture meet the preset requirements.
  • the target information of the included target picture is screened for the video to be processed, wherein the screening process includes deleting the target picture that does not meet the preset requirements or one of the video clips to which the target picture belongs, and finally, according to the screened and processed video to be processed , generate the target video.
  • FIG. 1 is a flowchart of steps of a video processing method provided by an embodiment of the present invention.
  • FIG. 2 is a schematic diagram of a clipping process provided by an embodiment of the present invention.
  • FIG. 3 is a block diagram of a video processing apparatus provided by an embodiment of the present invention.
  • FIG. 4 is a block diagram of a computing processing device according to an embodiment of the present invention.
  • FIG. 5 is a block diagram of a portable or fixed storage unit according to an embodiment of the present invention.
  • FIG. 1 is a flowchart of steps of a video processing method provided by an embodiment of the present invention. As shown in FIG. 1 , the method may include:
  • Step 101 For a target picture in the video to be processed, determine target information corresponding to a picture parameter of the target picture; the target information is used to represent whether the picture parameter of the target picture meets a preset requirement.
  • the video to be processed may be a video that needs to be screened for pictures with poor quality.
  • the video to be processed may be a video shot by the user, or a video downloaded from a network, which is not limited in this embodiment of the present invention.
  • a video can be essentially understood as a picture sequence composed of multiple frames of images, and the target picture in the video to be processed may be all pictures in the picture sequence, or a part of the pictures in the picture sequence.
  • the picture parameter of the target picture may be a parameter that can characterize the picture quality, and the specific number and type of the picture parameter may be set according to actual requirements.
  • the picture parameters may be set as the clarity of the picture, the exposure level of the picture, and the like.
  • the preset requirement may be a quality requirement that needs to be satisfied by the picture parameters of the picture. If the picture parameters of the picture meet the preset requirements, it means that the picture quality is high and can meet the quality requirements. On the contrary, if the picture parameters of the picture do not meet the preset requirements, it means that the quality of the picture is poor and cannot meet the quality requirements.
  • the specific content of the preset requirements can be set according to actual needs. For example, the preset requirement may be that the degree of sharpness is greater than the preset sharpness threshold, or the degree of exposure is within the range of the preset exposure degree, and so on.
  • the target information may be a tag that can characterize whether the picture parameters of the target picture meet the preset requirements.
  • the specific content of the label can be set according to actual needs. For example, numbers, letters, special symbols, etc. can be used as labels.
  • the specific content of the label when the picture parameters meet the preset requirements is different from the specific content of the label when the picture parameters do not meet the preset requirements. For example, "0" may be used as a label indicating that the picture parameters of the target picture do not meet the preset requirements, and "1" may be used as a label indicating that the picture parameters of the target picture meet the preset requirements.
  • Step 102 Perform screening processing on the video to be processed according to the target information of the target picture contained in the video to be processed; wherein, the screening processing includes deleting target pictures or the target pictures that do not meet the preset requirements One of the video clips the image belongs to.
  • whether the picture parameters of the target picture satisfy a preset condition may be first determined according to the target information of the target picture. For example, it may be determined that the picture parameter does not meet the preset requirement when the target information is 0, and it can be determined that the picture parameter meets the preset requirement when the target information is 1. Further, if the picture parameters of the target picture do not meet the preset requirements, the target picture may be determined as the target picture that does not meet the preset requirements. Correspondingly, the target picture may be deleted or one of the video clips to which the target picture that does not meet the preset requirements belongs may be directly deleted. One of the video clips to which the target picture belongs may be a video clip that includes the target picture in the video to be processed, and the specific length of the video clip can be set according to actual requirements, which is not limited in this embodiment of the present invention.
  • Step 103 generating a target video according to the to-be-processed video after screening and processing.
  • the video to be processed is screened, which can reduce the pictures of poor quality included in the video to be processed, and further improve the image of the video to be processed after the screening process. quality.
  • the target video is generated according to the video to be processed after screening, which can ensure that the final generated target video has higher image quality.
  • the screened video to be processed may be directly used as the target video.
  • the video processing method can determine the target information corresponding to the picture parameter of the target picture for the target picture in the video to be processed, wherein the target information is used to indicate whether the picture parameter of the target picture satisfies the According to the preset requirements, the to-be-processed video is screened according to the target information of the target picture contained in the to-be-processed video, wherein the screening process includes deleting the target picture that does not meet the preset requirements or one of the video clips to which the target picture belongs, and finally , and generate the target video according to the to-be-processed video after screening.
  • the screening cost can be reduced to a certain extent and the screening efficiency can be improved.
  • the above picture parameters may include one or more of a blur degree, a shake degree, an exposure degree, and a color change degree of the picture.
  • the picture parameters may also include other types of parameters, which are not limited in this embodiment of the present invention.
  • the degree of blurring of the picture it can be determined whether the overall picture is clear, based on the degree of shaking of the picture, it can be determined whether the picture of the picture has a shaking problem, based on the degree of exposure of the picture, it can be determined whether the brightness of the picture is appropriate, and the degree of color change of the picture can be determined. Is the color difference of the picture appropriate? However, in practical application scenarios, when the picture is blurred and unclear, the picture is shaken, the brightness is too high or too low, and the color difference is too high or too low, it often means that the quality of the picture is poor.
  • the pictures in the video to be repaired can be screened from these parameters in the future, and then certain To a certain extent, it can ensure that pictures with poor quality can be deleted more accurately.
  • the calculation amount that the hardware device can bear may be determined first, and the number of selected picture parameters is determined according to the amount of calculation that the hardware device can bear, so as to ensure that the hardware device can run normally. Wherein, the number of selected picture parameters is positively related to the loadable calculation amount.
  • the fuzzy label information corresponding to the degree of blur the information of the shaking label corresponding to the degree of shaking, the information of the exposure label corresponding to the degree of exposure, and the corresponding degree of color change can be obtained.
  • color difference label information the fuzzy label information, the shake label information, the exposure label information and the color difference label information are the target information.
  • the fuzzy label information can be used to characterize whether the blur degree of the target image meets the preset requirement corresponding to the blur degree
  • the jitter label information can be used to characterize whether the jitter degree of the target image meets the preset requirement corresponding to the jitter degree
  • the exposure label information is used to characterize the target Whether the exposure degree of the picture meets the preset requirements corresponding to the exposure degree
  • the color difference label information is used to indicate whether the color change degree of the target image meets the preset requirements corresponding to the color change degree.
  • the above-mentioned operation of screening the video to be processed according to the target information of the target picture contained in the video to be processed may be implemented by the following sub-steps:
  • Sub-step (1) segment the to-be-processed video to obtain multiple video segments.
  • the video to be processed may be divided into video segments containing the same number of frames by using a preset segmentation algorithm in a manner of dividing at equal intervals.
  • a preset segmentation algorithm may be used to randomly divide the video to be processed into video segments containing different numbers of frames in a manner of unequal interval division, which is not limited in this embodiment of the present invention.
  • other division manners may also be used to implement division, which is not limited in this embodiment of the present invention.
  • Sub-step (2) For each of the video clips, perform screening processing on the video clips according to the target information of the target picture contained in the video clips.
  • a video clip may be used as a processing unit, and screening processing may be performed for each video clip.
  • the screening process for the video clip may include deleting the target picture or the video clip that does not meet the preset requirements in the video clip.
  • the video to be processed is first divided to obtain a plurality of video segments.
  • the video clip is used as a processing unit to filter the video clip.
  • the video clips after the screening process can be merged to obtain the target video. Since the screening process deletes pictures with poor quality in each video clip, the target video is obtained by merging the video clips after the screening process, which can ensure the quality of the target video to a certain extent. Specifically, when merging, the filtered video segments may be merged according to the sequence in the video to be processed, so as to ensure that the final target video can be played normally.
  • the video to be processed may be independently selected by the user. Specifically, before the target information corresponding to the picture parameter of the target picture is determined, a user's selection operation on an optional video may be received; the optional video selected by the selection operation may be determined as the to-be-processed video; the optional video For videos stored in electronic devices. Correspondingly, after the target video is generated, the target video can be displayed to the user, so as to ensure that the user can obtain the processing result in time and improve the interaction effect.
  • each video stored in the electronic device can be displayed in the client, and the user can click the optional video to be processed to realize the input selection operation.
  • the electronic device may receive the selection operation, and determine the selected optional video as the video to be processed.
  • target information may be added to each target picture and displayed to the user.
  • the user can select the target image to be deleted according to the requirements and in combination with the target information of the target image. Accordingly, the electronic device can delete the target picture selected by the user. In this way, by displaying the target picture with the target information added to the user, the user can independently decide the specific target picture to be deleted, which can improve the flexibility of the operation.
  • the following steps may also be performed:
  • Step A Determine the target picture according to the pictures contained in the video to be processed; extract the target picture from the video to be processed, and add timestamp information to each of the target pictures; the timestamp information Used to characterize the order of the target picture in the target picture.
  • the appearance time point of each target picture in the video to be processed may be used as timestamp information.
  • the target picture includes picture a, picture b, and picture c.
  • the appearance time points of the picture a, the picture b, and the picture c are respectively: the 10th second, the 15th second, and the 21st second.
  • the 10th second, the 15th second, and the 21st second can be used as the timestamp information including the picture a, the picture b, and the picture c, respectively.
  • an association relationship between each target picture and the corresponding time stamp information may be established, or, the corresponding time stamp information may also be written into the target picture. .
  • Step B According to the timestamp information of the target picture, the The target picture is merged into the to-be-processed video.
  • the location of the target picture may be determined according to the timestamp information first, and then the target picture is inserted into the corresponding location. For example, picture a may be inserted into the video to be processed at the 10th second, picture b may be inserted into the video to be processed at the 15th second, and picture c may be inserted into the video to be processed at the 21st second.
  • the preset requirement may include one or more of normal blur degree, normal jitter degree, normal exposure degree and normal color change degree.
  • the target information corresponding to the picture parameter is a label used to indicate that the picture parameter does not meet the preset requirements, and if so, it can be determined that none of the picture parameters meet the preset requirements. .
  • the third target picture can be directly deleted before being merged. In this way, the burden of subsequent screening processing can be reduced, and unnecessary target pictures can be avoided from performing merging operations, thereby ensuring processing efficiency.
  • the target picture before the operation of determining the target information corresponding to the picture parameters of the target picture, the target picture is determined and the target picture is extracted. In this way, by extracting the target picture separately, it is possible to avoid interference from other pictures in the video to be processed when the target information corresponding to the picture parameters of the target picture is subsequently determined, thereby ensuring the processing effect to a certain extent.
  • the target picture is combined into the video to be processed, so as to ensure that the subsequent steps can be performed normally. And by adding time stamp information to the target picture, the target picture can be conveniently combined into the video to be processed according to the time stamp information, thereby ensuring the processing effect.
  • the operation of determining the target picture can be implemented by the following sub-steps:
  • Sub-step (3) when the total number of pictures contained in the video to be processed is greater than the first preset number threshold, select m frames of pictures from the video to be processed according to a preset frame rate as the target picture; the m is not greater than the first preset number threshold.
  • the first preset number threshold and the preset frame rate may be set according to actual requirements.
  • the first preset number threshold may be the number of pictures required to implement screening.
  • the first preset number threshold may be 50, 60, and so on.
  • the preset frame rate can be 4 frames per second (fps).
  • the total number of pictures included in the video to be processed may be determined first, for example, the configuration information of the video to be processed may be read to obtain the total number.
  • selection can be made according to the preset frame rate, and accordingly, these target pictures can be correspondingly extracted subsequently. That is, frame extraction can be performed according to a preset frame rate to obtain m target pictures, and the m target pictures form a picture sequence.
  • Sub-step (4) under the condition that the total number is not greater than the first preset number threshold, use all pictures included in the video to be processed as the target picture.
  • the total number is not greater than the first preset number threshold, it means that if all the pictures are directly used as the target pictures, too much calculation amount will not be introduced. Therefore, all pictures can be directly used as target pictures without other processing. In this way, it is possible to avoid performing unnecessary other processing while realizing the determination of the target picture.
  • all pictures included in the video to be processed are used as target pictures only when the total number is not greater than the first preset number threshold, and when the total number is greater than the first preset number threshold , select only part of the image as the target image. In this way, it can be ensured that there are enough target pictures to a certain extent while avoiding too many target pictures, so as to ensure that more information can be provided for subsequent screening, and the screening effect can be ensured.
  • the preset requirements in this embodiment of the present invention may include preset requirements corresponding to the degree of blur: normal degree of blur, preset requirements corresponding to the degree of jitter: normal degree of jitter and preset requirements corresponding to the degree of exposure: normal degree of exposure,
  • the preset requirements corresponding to the degree of color change the degree of color change is normal.
  • the target information corresponding to the blur degree of the target picture can be determined through the following sub-steps:
  • Sub-step (5) determine the first ratio of the first number of pixels with the fuzzy confidence greater than the preset reliability threshold in the target picture to the total number of pixels of the target picture; according to the first ratio and the first ratio; A magnitude relationship between preset ratio thresholds determines blur label information; the blur label information is used to represent whether the blur degree of the target image is normal.
  • the fuzzy confidence level can be used to represent the probability that the pixel point is a fuzzy pixel point.
  • the target image may be input into a preset blur detection module, and the blur detection module may be implemented based on a semantic segmentation network composed of convolutional neural networks (Convolutional Neural Networks, CNN).
  • CNN convolutional Neural Networks
  • sample images with different degrees of blur can be obtained, and the neural network can be trained by using these sample images to generate a blur detection module.
  • the blur detection module can extract the picture features of the input target picture, and then determine the blur confidence of each pixel in the target picture according to the extracted picture features.
  • the preset reliability threshold and the first preset ratio threshold may be set according to actual requirements.
  • the preset reliability threshold may be 80%.
  • the first preset ratio threshold may be 0.2.
  • the fuzzy confidence of the pixel point is greater than the preset confidence threshold, it can be considered that the pixel point has a fuzzy problem, and the pixel point is a fuzzy pixel point.
  • the fuzzy confidence of each pixel can be compared with a preset confidence threshold to determine the first number. Then, the first number is divided by the total number of pixels of the target image to obtain a first ratio.
  • the blurring label information of the target image may be set as the first blurring label.
  • the first fuzzy label indicates that the degree of blurring of the target image is abnormal.
  • the blur tag information of the target image can be set as the second blur tag.
  • the second fuzzy label indicates that the degree of blurring of the target image is normal.
  • the first fuzzy label may be "unclear"
  • the second fuzzy label may be "clear".
  • the background area in the target image can also be determined, and then the coincidence ratio of the blurred area and the background area can be detected; wherein, the blurred area is an area composed of pixels whose blur confidence is greater than a preset confidence threshold.
  • the first ratio is greater than the first preset ratio threshold and the overlap ratio is not greater than the preset overlap ratio threshold
  • the blur tag information of the target image is set as the first blur tag.
  • the preset coincidence ratio threshold can be set according to actual needs. For example, the preset coincidence ratio threshold can be 90%.
  • the coincidence ratio is greater than the preset coincidence ratio threshold, it can be considered that the blurred area in the target image is a normal phenomenon. If the coincidence ratio is not greater than the preset coincidence ratio threshold, it can be considered that the blurred area in the target image is caused by abnormal factors.
  • the target image sets a first blur label representing an abnormal blur degree. In this way, in the case of blurred background caused by focusing, it can be avoided that the target picture is misjudged as a picture with an abnormal degree of blurring, and then inappropriate fuzzy label information is set for the target picture.
  • the target information corresponding to the degree of shaking of the target picture can be determined by the following sub-steps:
  • Sub-step (6) take the target picture as the input of the first preset classification model, and determine the jitter label information according to the output category of the first preset classification model; the first preset classification model is used for Whether the degree of jitter is normal for image classification.
  • the shaking label information is used to represent whether the shaking degree of the target image is normal.
  • the first preset classification model may be based on a CNN neural network, and the first preset classification model may be obtained by training sample pictures with different degrees of jitter (including normal and abnormal degrees of jitter), and the first preset classification model may be. During the training process, through deep learning to learn the ability to distinguish whether the jitter of the picture is normal or not. Specifically, after the target picture is input into the first preset classification model, the first preset classification model can extract the picture features of the input target picture, and then according to the extracted picture features, determine whether the degree of shaking of the target picture is normal, and output category.
  • the shaking label information of the target image may be set as the first fuzzy label.
  • the output category of the first preset classification model is a category representing a normal degree of shaking
  • the shaking label information of the target image may be set as the second fuzzy label.
  • the first jitter tag may be "tremble”
  • the second jitter tag may be "untremble”.
  • the jitter label information is determined by the output category of the first preset classification model. In this way, it is only necessary to input the target image into the first preset classification model to conveniently determine whether the target image jitters, which in turn can facilitate Set the jitter label information later to improve the setting efficiency.
  • the target information corresponding to the exposure degree of the target picture can be determined by the following sub-steps:
  • Sub-step (7) take the target picture as the input of the second preset classification model, and determine the exposure label information according to the output category of the second preset classification model; the second preset classification model is used for Whether the exposure level is normal for image classification.
  • the exposure label information is used to represent whether the exposure degree of the target image is normal.
  • the second preset classification model may be based on a CNN neural network, and the second preset classification model may be obtained by training sample pictures with different exposure levels (including normal exposure and abnormal exposure).
  • the second preset classification model The ability to distinguish whether the exposure level of a picture is normal can be learned during the training process. Specifically, after the target picture is input into the second preset classification model, the second preset classification model can extract the picture features of the input target picture, and then judge whether the exposure degree of the target picture is normal according to the extracted picture features, and output category.
  • the exposure label information of the target image may be set as the first exposure label.
  • the output category of the second preset classification model is a category representing a normal exposure degree
  • the exposure label information of the target image may be set as the second exposure label.
  • the first exposure tag may be "expose F”
  • the second exposure tag may be "expose R”.
  • the exposure label information is determined by the output category of the second preset classification model. In this way, it is only necessary to input the target image into the second preset classification model to conveniently determine whether the target image has abnormal exposure ( Overexposure and overdarkness), which can facilitate subsequent setting of exposure label information and improve setting efficiency.
  • abnormal exposure Overexposure and overdarkness
  • the target information corresponding to the color change degree of the target picture can be determined by the following sub-steps:
  • Sub-step (8) determine the second ratio between the second number of pixels whose color value exceeds the preset color value range in the target picture and the total number of pixels; according to the second ratio and the second preset ratio The size relationship between the thresholds determines the color difference label information; the color difference label information is used to represent whether the color change degree of the target picture is normal.
  • the color value of the pixel point may be the color channel value of the pixel point, for example, the red, green and blue (RGB) color channel value.
  • the preset color value range and the second preset ratio threshold can be set according to actual needs. For example, the preset color value range may be determined according to the lowest color value and the highest color value in multiple pictures with normal color difference. The second preset ratio threshold may be based on the lowest ratio threshold that will affect the user's viewing experience. If the color value falls within the preset color value range, it can be considered that the color of the pixel point is normal, and if the color value does not fall within the preset color value range, it can be considered that the color of the pixel point is abnormal.
  • a color value detection algorithm may be used to determine the color value of each pixel in the target image. The color value is then compared to a preset range of color values to determine the second quantity. The second ratio is then divided by the total number of pixels in the target image.
  • the color difference label information of the target picture may be set as the first color difference label.
  • the first color difference label indicates that the color change degree of the target picture is abnormal.
  • the second ratio is not greater than the second preset ratio threshold, it can be considered that the color change degree of the target picture is normal, and accordingly, the color difference label information of the target picture can be set as the second color difference label.
  • the second color difference label indicates that the color change degree of the target picture is normal.
  • the first color difference label may be "Color F"
  • the second color difference label may be "Color R".
  • the present invention by determining the proportion of pixels with abnormal colors in the target picture, determining whether the color change degree of the target picture is normal according to the proportion of the proportion, and setting the corresponding color difference label information, to a certain extent, it is possible to ensure that the picture quality The accuracy of the color difference judgment result, and the reliability of the set color difference label information.
  • the above-mentioned step of screening the video clips according to the target information of the target pictures included in the video clips may include:
  • the target pictures that appear continuously refer to the pictures that are adjacent to the target picture in the forward direction and/or the backward direction in the video segment are also target pictures.
  • the target pictures whose picture parameters do not meet the preset requirements may be determined according to the target information of each target picture in the video clip.
  • the pictures of the target pictures whose picture parameters do not meet the preset requirements that appear continuously are used as the first target pictures.
  • the picture parameters do not meet the preset requirements may be that all the picture parameters do not meet the preset requirements, or some picture parameters do not meet the preset requirements.
  • the target picture that all picture parameters do not meet the preset requirements and appears continuously can be used as the first target picture
  • the target information can be: the first fuzzy label "unclear”, the first jitter label "tremble", The consecutively appearing target pictures of the first exposure label "expose F” and the first color difference label "Color F" are used as the first target picture.
  • a target picture in which some of the picture parameters do not meet the preset requirements and which appear continuously may also be used as the first target picture.
  • the video clip includes picture 1 to picture 20.
  • the target pictures whose picture parameters do not meet the preset requirements include: picture 1, picture 10, picture 11, and picture 13. Because the picture 1 to the picture 10 are the target pictures whose parameters that appear continuously do not meet the preset requirements. Therefore, picture 1 to picture 10 can be determined as the first target picture.
  • the second preset number threshold may be set according to actual needs. If the third number is greater than the second preset number threshold, it can be considered that if all the first target pictures are directly deleted, there is a high probability that the video will appear in the video later. The front and back of the screen are not harmoniously connected, and the video playback is not smooth. Therefore, only part of the first target picture can be culled. Specifically, n first target pictures may be randomly selected for deletion.
  • the present invention by determining the first target pictures that appear continuously and the picture parameters do not meet the preset requirements, and in the case of a large number of first target pictures, n partial first target pictures are eliminated, and the continuous low-quality pictures can be In the case of occurrence, avoid the problem that the subsequent video playback is not smooth due to culling too many consecutive pictures.
  • a picture transition effect may be added to the remaining first target pictures, wherein the picture transition effect may be used to control the first target picture
  • the display effect when it appears can include one or more of right out, left in, spin out and spin in, and fade in and fade out.
  • the picture transition effect may also include other types of effects, which are not limited in this embodiment of the present invention.
  • an operation of adding a picture transition effect may be performed when an adding instruction sent by a user is received. In this way, unnecessary adding operations can be avoided, resulting in that the final generated target video cannot satisfy the user’s needs. question of needs.
  • a transition video frame may be generated according to the picture content of the remaining first target pictures; wherein, the picture parameters of the transition video frame meet preset requirements and The picture similarity is greater than the preset similarity threshold, and the picture similarity is the similarity between the picture content of the transition video frame and the picture content of the remaining first target picture.
  • the preset similarity threshold may be set according to actual requirements. For example, the preset similarity threshold may be 99%.
  • transition video frames are added to the remaining first target pictures.
  • the image quality of the video can be ensured to a certain extent, and at the same time, the culling of the first target picture can be avoided to a greater extent.
  • the front and back of the video are not harmoniously connected, and the video playback is not smooth.
  • a target picture whose picture parameters do not meet the preset requirements can be determined, and then a second target picture is obtained.
  • the number of the determined second target pictures may be counted to obtain a fourth number.
  • a ratio of the fourth number to the total number of pictures in the video clip may be calculated to obtain a third ratio.
  • the third preset ratio threshold and the third preset quantity threshold may be set according to actual requirements. If the third ratio is greater than the third preset ratio threshold, it may be considered that the low-quality pictures in the video clip occupy the proportion of If the fourth number is greater than the third preset number threshold, it can be considered that the number of low-quality pictures in the video clip is large, and it can be determined that the overall quality of the video clip is poor. Therefore, the video clip can be directly discarded. video clips.
  • the overall quality of the video clip is measured in terms of the proportion of low-quality second target pictures and the specific quantity, which can avoid the existence of multiple second targets in the video clip due to the large number of pictures in the video clip.
  • the video clips whose third ratio is greater than the third preset ratio threshold, and/or the fourth number is greater than the third preset number threshold may also be displayed to the user, and the user receives the video clips for these video clips. select operation, and then delete the video clip selected by the selection operation to ensure the flexibility of user operation.
  • the proportion or specific number of the second target pictures is relatively high, that is, the When the overall quality of the video clip is poor, the video clip is directly discarded, thereby reducing the computation amount of subsequent related operations to a certain extent and saving processing resources.
  • the above-mentioned sub-steps may be executed before sub-step (2a), or may be executed after sub-step (2a).
  • it can be performed before sub-step (2a), so that unnecessary culling operations can be avoided, and processing resources can be saved to a greater extent.
  • the above-mentioned step of determining the third preset ratio threshold and the third preset quantity threshold may include: sub-step (2d1): determining, according to the video clip template corresponding to the video to be processed, the third preset ratio threshold and the third preset quantity threshold corresponding to the video clip templates; wherein, the third preset ratio thresholds corresponding to different video clip templates are different, and different video clip templates correspond to The third preset number threshold is different, and the third preset ratio threshold and the third preset number threshold are related to the video content type corresponding to the video clip template.
  • the video processing method in the embodiment of the present invention can be applied to an automatic video editing scene.
  • waste films pictures with lower quality
  • the automatic rejection method provided by the embodiments of the present invention can be performed online in real time, thereby reducing the time-consuming of rejection to a certain extent and ensuring the timeliness of processing.
  • FIG. 2 is a schematic diagram of an editing process provided by an embodiment of the present invention.
  • a user may first select a material (video to be processed), and then perform frame extraction according to a preset frame rate to obtain a picture sequence composed of target pictures. Then, through the CNN network, the fuzzy label information, jitter label information and exposure label information of the target image are determined. Select the clip template, select the clip suitable for clipping.
  • the clips suitable for editing may be the remaining video clips after being discarded through the above sub-step (2e).
  • the video clip template corresponding to the video to be processed may be selected from optional clip templates.
  • the optional editing template may be preset in the electronic device and may be used for video editing.
  • the optional editing template may include a sports template, a gourmet template, a dynamic template, and the like.
  • the editing template commonly used by the user may be determined according to the user's historical usage record. Then, the commonly used editing template is determined as the video editing template corresponding to the video to be processed.
  • the optional editing template may be directly displayed to the user, and the optional editing template selected by the user is determined as the video editing template corresponding to the video to be processed.
  • the third preset ratio threshold and the third preset quantity threshold corresponding to the video clip template corresponding to the video to be processed may be searched in the preset correspondence between the video clip template and the threshold.
  • the video is often edited according to the video editing template.
  • the editing methods of the video editing templates are often different.
  • the corresponding video content type is dynamic content, such as a video editing template that highlights the motion process of the moving subject in the video, such as motion template and dynamic template.
  • the focus is often on the coherence of the video screen. Since the content in such video images is often in motion, it is difficult for users to perceive low-quality images.
  • a video editing template that highlights the appearance of a static subject in a video, such as a gourmet template
  • the focus is often on the appearance details of the subject in the video screen.
  • the motion range of the content in the video picture is often small, so it is easier for the user to perceive that there are more low-quality pictures.
  • different thresholds may be set for different video editing templates according to the picture characteristics of each video editing template.
  • a higher third preset ratio threshold and a third preset number threshold are set, and in the case where the video content type corresponding to the video clip template is dynamic content, A lower third preset ratio threshold and third preset quantity threshold are set.
  • different third preset ratio thresholds and third preset quantity thresholds are correspondingly set for different video editing templates according to the video content types corresponding to the video editing templates, and when editing, according to the currently used Video clip template, select the corresponding threshold to filter video clips, and then to a certain extent, the screening operation can be more adapted to the current clipping needs, thereby improving the effect of clip screening.
  • FIG. 3 is a block diagram of a video processing apparatus provided by an embodiment of the present invention.
  • the apparatus may include: a memory 301 and a processor 302 .
  • the memory 301 is used to store program codes.
  • the processor 302 calls the program code, and when the program code is executed, is configured to perform the following operations: for the target picture in the video to be processed, determine the target information corresponding to the picture parameter of the target picture;
  • the target information is used to represent whether the picture parameters of the target picture meet the preset requirements; according to the target information of the target picture contained in the video to be processed, the video to be processed is screened; wherein, the screening process It includes deleting a target picture that does not meet the preset requirements or one of the video clips to which the target picture belongs; and generating a target video according to the video to be processed after screening.
  • the processor 302 includes deleting a target picture that does not meet the preset requirements or one of the video clips to which the target picture belongs; and generating a target video according to the
  • the picture parameters include one or more of a blur degree, a shake degree, an exposure degree, and a color change degree of the picture.
  • the processor 302 is further configured to: segment the video to be processed to obtain multiple video segments; for each of the video segments, according to the target information of the target picture included in the video segment, Screening processing is performed on the video clips.
  • the processor 302 is further configured to: for the target picture in the video to be processed, before determining the target information corresponding to the picture parameter of the target picture, determine the target picture according to the pictures included in the video to be processed.
  • the target picture is extracted; the target picture is extracted from the video to be processed, and timestamp information is added to each of the target pictures; the timestamp information is used to characterize the order of the target picture in the target picture; For the target picture in the video to be processed, after determining the target information corresponding to the picture parameter of the target picture, the target picture is combined into the video to be processed according to the timestamp information of the target picture.
  • the processor 302 is further configured to: in the case that the total number of pictures included in the to-be-processed video is greater than a first preset number threshold, extract data from the to-be-processed video according to a preset frame rate.
  • the processor 302 is further configured to: determine the first ratio of the first number of pixels with a fuzzy confidence level greater than a preset confidence threshold in the target picture to the total number of pixels in the target picture; According to the magnitude relationship between the first ratio and the first preset ratio threshold, the fuzzy label information is determined; the fuzzy label information is used to indicate whether the fuzzy degree of the target picture is normal; and/or, the target image is The picture is used as the input of the first preset classification model, and according to the output category of the first preset classification model, the shaking label information is determined; the first preset classification model is used for classifying pictures according to whether the shaking degree is normal; and/ Or, taking the target picture as the input of the second preset classification model, and determining the exposure label information according to the output category of the second preset classification model;
  • the processor 302 is further configured to: determine a third number of first target pictures in the video clip; the first target picture is target information indicating that the picture parameters do not meet the preset requirements and are continuous The target pictures that appear; if the third number is greater than the second preset number threshold, remove n of the first target pictures; the n is less than the third number.
  • the processor 302 is further configured to: add a picture transition effect to the remaining first target picture; wherein, the picture transition effect includes right-out, left-in, spin-out, spin-in, and fade-in and fade-out one or more of.
  • the processor 302 is further configured to: determine a third ratio between the fourth number of second target pictures included in the video clip and the total number of pictures in the video clip; the second target The picture is a target picture whose target information indicates that the picture parameters do not meet the preset requirements; determine a third preset ratio threshold and a third preset number threshold; if the third ratio is greater than the third preset ratio threshold, and /or, if the fourth quantity is greater than the third preset quantity threshold, the video clip is discarded.
  • the processor 302 is further configured to: determine, according to the video clip template corresponding to the video to be processed, the third preset ratio threshold and the third preset number corresponding to the video clip template threshold; wherein the third preset ratio threshold corresponding to different video clip templates is different, the third preset number threshold corresponding to different video clip templates is different, the third preset ratio threshold and the third preset ratio threshold
  • the three preset quantity thresholds are related to the video content type corresponding to the video clip template.
  • the processor 302 is further configured to: determine a third target picture whose all picture parameters do not meet the preset requirements; delete the third target picture; wherein the preset requirements include the One or more of normal blur, normal jitter, normal exposure, and normal color variation.
  • the processor 302 is further configured to: combine the screened and processed video clips to obtain the target video.
  • the processor 302 is further configured to: for the target picture in the video to be processed, before determining the target information corresponding to the picture parameter of the target picture, receive the user's selection operation on the optional video; The optional video selected by the selection operation is determined to be the video to be processed; the optional video is the video stored in the electronic device; after the target video is generated according to the screened video to be processed, the target video is displayed to the user. video.
  • the video processing apparatus determines, for the target picture in the video to be processed, target information corresponding to the picture parameters of the target picture, wherein the target information is used to indicate whether the picture parameters of the target picture meet the predetermined requirements.
  • Setting requirements according to the target information of the target picture contained in the video to be processed, the video to be processed is screened, wherein the screening process includes deleting the target picture that does not meet the preset requirements or one of the video clips to which the target picture belongs, and finally, Generate a target video according to the to-be-processed video after screening.
  • an embodiment of the present invention further provides a movable device, where the movable device includes a video capture device, and the movable device is configured to capture a video to be processed through the video capture device.
  • the method processes the video to be processed.
  • the movable device is a drone and/or an unmanned vehicle.
  • an embodiment of the present invention also provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, each step in the above video processing method is implemented, and can To achieve the same technical effect, in order to avoid repetition, details are not repeated here.
  • the device embodiments described above are only illustrative, wherein the units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in One place, or it can be distributed over multiple network elements. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution in this embodiment. Those of ordinary skill in the art can understand and implement it without creative effort.
  • Various component embodiments of the present invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof.
  • a microprocessor or a digital signal processor may be used in practice to implement some or all of the functions of some or all of the components in the computing processing device according to the embodiments of the present invention.
  • the present invention can also be implemented as apparatus or apparatus programs (eg, computer programs and computer program products) for performing part or all of the methods described herein.
  • Such a program implementing the present invention may be stored on a computer-readable medium, or may be in the form of one or more signals. Such signals may be downloaded from Internet sites, or provided on carrier signals, or in any other form.
  • FIG. 4 is a block diagram of a computing processing device provided by an embodiment of the present invention. As shown in FIG. 4 , FIG. 4 shows a computing processing device that can implement the method according to the present invention.
  • the computing processing device traditionally includes a processor 710 and a computer program product or computer readable medium in the form of a memory 720 .
  • the memory 720 may be electronic memory such as flash memory, EEPROM (electrically erasable programmable read only memory), EPROM, hard disk, or ROM.
  • the memory 720 has storage space 730 for program code for performing any of the method steps in the above-described methods.
  • the storage space 730 for program codes may include various program codes for implementing various steps in the above methods, respectively.
  • These program codes can be read from or written to one or more computer program products.
  • These computer program products include program code carriers such as hard disks, compact disks (CDs), memory cards or floppy disks.
  • Such computer program products are typically portable or fixed storage units as described with reference to FIG. 5 .
  • the storage unit may have storage segments, storage spaces, etc. arranged similarly to the memory 720 in the computing processing device of FIG. 4 .
  • the program code may, for example, be compressed in a suitable form.
  • the storage unit includes computer readable code, ie code readable by a processor such as 710 for example, which when executed by a computing processing device, causes the computing processing device to perform each of the methods described above. step.
  • any reference signs placed between parentheses shall not be construed as limiting the claim.
  • the word “comprising” does not exclude the presence of elements or steps not listed in a claim.
  • the word “a” or “an” preceding an element does not exclude the presence of a plurality of such elements.
  • the invention can be implemented by means of hardware comprising several different elements and by means of a suitably programmed computer. In a unit claim enumerating several means, several of these means can be embodied by one and the same item of hardware.
  • the use of the words first, second, and third, etc. do not denote any order. These words can be interpreted as names.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Television Signal Processing For Recording (AREA)

Abstract

A video processing method and apparatus, a mobile device, and a readable storage medium. The method comprises: for a target picture in a video to be processed, determining target information corresponding to a picture parameter of the target picture, the target information being used for representing whether the picture parameter of the target picture satisfies a preset requirement (101); according to the target information of the target picture comprised in the video to be processed, performing screening processing on the video to be processed, wherein the screening processing comprises deleting the target picture that does not satisfy the preset requirement or one of video clips to which the target picture belongs (102); and finally, according to the video to be processed subjected to the screening processing, generating a target video (103). By means of the present invention, during video processing, pictures in a video are automatically screened according to picture parameters of the pictures, so that the screening cost can be reduced to a certain extent, and the screening efficiency is improved.

Description

视频处理方法、装置、可移动设备及可读存储介质Video processing method, apparatus, removable device and readable storage medium 技术领域technical field
本发明属于网络技术领域,特别是涉及一种视频处理方法、装置、可移动设备及可读存储介质。The present invention belongs to the field of network technologies, and in particular, relates to a video processing method, device, removable device and readable storage medium.
背景技术Background technique
目前,视频作为获取信息的优良途径,越来越多的场景下会产出视频。为了提高视频的质量,经常需要对视频中质量较差的图片进行筛选。现有方式中,往往是直接通过人工筛选的方式进行筛选。但是,这种筛选方式的成本较大且效率较低。At present, video is an excellent way to obtain information, and video is produced in more and more scenarios. In order to improve the quality of the video, it is often necessary to filter the pictures with poor quality in the video. In the existing methods, screening is often performed directly through manual screening. However, this screening method is costly and inefficient.
发明内容SUMMARY OF THE INVENTION
本发明提供一种视频处理方法、装置、可移动设备及可读存储介质,以便解决筛选成本较大且筛选效率较低的问题。The present invention provides a video processing method, device, removable device and readable storage medium, so as to solve the problems of high screening cost and low screening efficiency.
为了解决上述技术问题,本发明是这样实现的:In order to solve the above-mentioned technical problems, the present invention is achieved in this way:
第一方面,本发明实施例提供了一种视频处理方法,该方法包括:In a first aspect, an embodiment of the present invention provides a video processing method, which includes:
对于待处理视频中的目标图片,确定所述目标图片的图片参数对应的目标信息;所述目标信息用于表征所述目标图片的图片参数是否满足预设要求;For the target picture in the video to be processed, determine the target information corresponding to the picture parameter of the target picture; the target information is used to represent whether the picture parameter of the target picture meets the preset requirements;
根据所述待处理视频中包含的目标图片的目标信息,对所述待处理视频进行筛选处理;其中,所述筛选处理包括删除不满足所述预设要求的目标图片或所述目标图片所属的其中一个视频片段;Screening the video to be processed according to the target information of the target picture contained in the video to be processed; wherein, the screening process includes deleting target pictures that do not meet the preset requirements or the target pictures to which the target pictures belong. one of the video clips;
根据筛选处理后的待处理视频,生成目标视频。Generate a target video according to the to-be-processed video after screening.
第二方面,本发明实施例提供了一种视频处理装置,所述装置包括存储器和处理器;In a second aspect, an embodiment of the present invention provides a video processing apparatus, and the apparatus includes a memory and a processor;
所述存储器,用于存储程序代码;the memory for storing program codes;
所述处理器,调用所述程序代码,当所述程序代码被执行时,用于执行以下操作:The processor calls the program code, and when the program code is executed, is configured to perform the following operations:
对于待处理视频中的目标图片,确定所述目标图片的图片参数对应的目标信息;所述目标信息用于表征所述目标图片的图片参数是否满足预设要求;For the target picture in the video to be processed, determine the target information corresponding to the picture parameter of the target picture; the target information is used to represent whether the picture parameter of the target picture meets the preset requirements;
根据所述待处理视频中包含的目标图片的目标信息,对所述待处理视频进行筛选处理;其中,所述筛选处理包括删除不满足所述预设要求的目标图片或所述目标图片所属的其中一个视频片段;Screening the video to be processed according to the target information of the target picture contained in the video to be processed; wherein, the screening process includes deleting target pictures that do not meet the preset requirements or the target pictures to which the target pictures belong. one of the video clips;
根据筛选处理后的待处理视频,生成目标视频。Generate a target video according to the to-be-processed video after screening.
第三方面,本发明实施例提供了一种可移动设备,所述可移动设备用于执行第 一方面中所述的视频处理方法中的步骤。In a third aspect, an embodiment of the present invention provides a movable device, where the movable device is configured to execute the steps in the video processing method described in the first aspect.
第四方面,本发明实施例提供了一种计算机可读存储介质,所述计算机可读存储介质上存储计算机程序,所述计算机程序被处理器执行时实现以下操作:In a fourth aspect, an embodiment of the present invention provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the following operations are implemented:
对于待处理视频中的目标图片,确定所述目标图片的图片参数对应的目标信息;所述目标信息用于表征所述目标图片的图片参数是否满足预设要求;For the target picture in the video to be processed, determine the target information corresponding to the picture parameter of the target picture; the target information is used to represent whether the picture parameter of the target picture meets the preset requirements;
根据所述待处理视频中包含的目标图片的目标信息,对所述待处理视频进行筛选处理;其中,所述筛选处理包括删除不满足所述预设要求的目标图片或所述目标图片所属的其中一个视频片段;Screening the video to be processed according to the target information of the target picture contained in the video to be processed; wherein, the screening process includes deleting target pictures that do not meet the preset requirements or the target pictures to which the target pictures belong. one of the video clips;
根据筛选处理后的待处理视频,生成目标视频。Generate a target video according to the to-be-processed video after screening.
在本发明实施例中,对于待处理视频中的目标图片,确定目标图片的图片参数对应的目标信息,其中,目标信息用于表征目标图片的图片参数是否满足预设要求,根据待处理视频中包含的目标图片的目标信息,对待处理视频进行筛选处理,其中,筛选处理包括删除不满足预设要求的目标图片或该目标图片所属的其中一个视频片段,最后,根据筛选处理后的待处理视频,生成目标视频。这样,通过在进行视频处理时,根据图片的图片参数自动对视频中的图片进行筛选,进而一定程度上可以降低筛选成本,提高筛选效率。In the embodiment of the present invention, for the target picture in the video to be processed, target information corresponding to the picture parameters of the target picture is determined, wherein the target information is used to indicate whether the picture parameters of the target picture meet the preset requirements. The target information of the included target picture is screened for the video to be processed, wherein the screening process includes deleting the target picture that does not meet the preset requirements or one of the video clips to which the target picture belongs, and finally, according to the screened and processed video to be processed , generate the target video. In this way, by automatically screening the pictures in the video according to the picture parameters of the pictures during the video processing, the screening cost can be reduced to a certain extent and the screening efficiency can be improved.
附图说明Description of drawings
图1是本发明实施例提供的一种视频处理方法的步骤流程图;1 is a flowchart of steps of a video processing method provided by an embodiment of the present invention;
图2是本发明实施例提供的一种剪辑过程示意图;2 is a schematic diagram of a clipping process provided by an embodiment of the present invention;
图3是本发明实施例提供的一种视频处理装置的框图;3 is a block diagram of a video processing apparatus provided by an embodiment of the present invention;
图4为本发明实施例提供的一种计算处理设备的框图;FIG. 4 is a block diagram of a computing processing device according to an embodiment of the present invention;
图5为本发明实施例提供的一种便携式或者固定存储单元的框图。FIG. 5 is a block diagram of a portable or fixed storage unit according to an embodiment of the present invention.
具体实施方式Detailed ways
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.
图1是本发明实施例提供的一种视频处理方法的步骤流程图,如图1所示,所述方法可以包括:FIG. 1 is a flowchart of steps of a video processing method provided by an embodiment of the present invention. As shown in FIG. 1 , the method may include:
步骤101、对于待处理视频中的目标图片,确定所述目标图片的图片参数对应的目标信息;所述目标信息用于表征所述目标图片的图片参数是否满足预设要求。Step 101: For a target picture in the video to be processed, determine target information corresponding to a picture parameter of the target picture; the target information is used to represent whether the picture parameter of the target picture meets a preset requirement.
本发明实施例中,待处理视频可以是需要筛选质量较差的图片的视频。示例的, 待处理视频可以是用户自己拍摄的视频,也可以是从网络上下载的视频,本发明实施例对此不作限定。进一步地,视频本质上可以理解为多帧图像组成的图片序列,待处理视频中的目标图片可以是图片序列中的全部图片,也可以是图片序列中的部分图片。目标图片的图片参数可以是能够表征图片质量的参数,具体的图片参数的数量以及种类可以根据实际需求设定。示例的,可以设定图片参数为图片的清晰程度、图片的曝光程度等等。In this embodiment of the present invention, the video to be processed may be a video that needs to be screened for pictures with poor quality. For example, the video to be processed may be a video shot by the user, or a video downloaded from a network, which is not limited in this embodiment of the present invention. Further, a video can be essentially understood as a picture sequence composed of multiple frames of images, and the target picture in the video to be processed may be all pictures in the picture sequence, or a part of the pictures in the picture sequence. The picture parameter of the target picture may be a parameter that can characterize the picture quality, and the specific number and type of the picture parameter may be set according to actual requirements. For example, the picture parameters may be set as the clarity of the picture, the exposure level of the picture, and the like.
进一步地,预设要求可以是图片的图片参数所需满足的质量要求。如果图片的图片参数满足该预设要求,则说明图片的质量较高,可以满足质量要求。反之,如果图片的图片参数不满足该预设要求,则说明图片的质量较差,不能满足质量要求。预设要求的具体内容可以根据实际需求设置。示例的,预设要求可以为清晰程度大于预设清晰度阈值、或者,曝光程度在预设曝光程度范围内,等等。进一步地,目标信息可以是能够表征目标图片的图片参数是否满足预设要求的标签(tag)。标签的具体内容可以根据实际需求设置,示例的,可以以数字、字母、特殊符号,等等作为标签。其中,图片参数满足预设要求时标签的具体内容与图片参数不满足预设要求时标签的具体内容不同。例如,可以以“0”作为表征目标图片的图片参数不满足预设要求的标签,以“1”作为表征目标图片的图片参数满足预设要求的标签。Further, the preset requirement may be a quality requirement that needs to be satisfied by the picture parameters of the picture. If the picture parameters of the picture meet the preset requirements, it means that the picture quality is high and can meet the quality requirements. On the contrary, if the picture parameters of the picture do not meet the preset requirements, it means that the quality of the picture is poor and cannot meet the quality requirements. The specific content of the preset requirements can be set according to actual needs. For example, the preset requirement may be that the degree of sharpness is greater than the preset sharpness threshold, or the degree of exposure is within the range of the preset exposure degree, and so on. Further, the target information may be a tag that can characterize whether the picture parameters of the target picture meet the preset requirements. The specific content of the label can be set according to actual needs. For example, numbers, letters, special symbols, etc. can be used as labels. The specific content of the label when the picture parameters meet the preset requirements is different from the specific content of the label when the picture parameters do not meet the preset requirements. For example, "0" may be used as a label indicating that the picture parameters of the target picture do not meet the preset requirements, and "1" may be used as a label indicating that the picture parameters of the target picture meet the preset requirements.
步骤102、根据所述待处理视频中包含的目标图片的目标信息,对所述待处理视频进行筛选处理;其中,所述筛选处理包括删除不满足所述预设要求的目标图片或所述目标图片所属的其中一个视频片段。Step 102: Perform screening processing on the video to be processed according to the target information of the target picture contained in the video to be processed; wherein, the screening processing includes deleting target pictures or the target pictures that do not meet the preset requirements One of the video clips the image belongs to.
本发明实施例在进行筛选处理时,对于各个目标图片,可以先根据目标图片的目标信息确定该目标图片的图片参数是否满足预设条件。示例的,可以在目标信息为0的情况下确定该图片参数不满足预设要求,在目标信息为1的情况下确定该图片参数满足预设要求。进一步地,如果目标图片的图片参数不满足预设要求,则可以将该目标图片确定为不满足预设要求的目标图片。相应地,可以删除该目标图片或者是直接删除该不满足预设要求的目标图片所属的其中一个视频片段。其中,目标图片所属的其中一个视频片段,可以是待处理视频中包含该目标图片的视频片段,该视频片段的具体长度可以根据实际需求设置,本发明实施例对此不作限定。When performing the screening process in this embodiment of the present invention, for each target picture, whether the picture parameters of the target picture satisfy a preset condition may be first determined according to the target information of the target picture. For example, it may be determined that the picture parameter does not meet the preset requirement when the target information is 0, and it can be determined that the picture parameter meets the preset requirement when the target information is 1. Further, if the picture parameters of the target picture do not meet the preset requirements, the target picture may be determined as the target picture that does not meet the preset requirements. Correspondingly, the target picture may be deleted or one of the video clips to which the target picture that does not meet the preset requirements belongs may be directly deleted. One of the video clips to which the target picture belongs may be a video clip that includes the target picture in the video to be processed, and the specific length of the video clip can be set according to actual requirements, which is not limited in this embodiment of the present invention.
步骤103、根据筛选处理后的待处理视频,生成目标视频。 Step 103 , generating a target video according to the to-be-processed video after screening and processing.
本发明实施例中,先根据待处理视频中目标图片的目标信息,对待处理视频进行筛选处理,可以减少待处理视频中包含的质量较差的图片,进而提高筛选处理后的待处理视频的图像质量。相应地,在完成筛选之后,根据筛选处理后的待处理视频生成目标视频,可以确保最后生成的目标视频具有较高的图像质量。示例的,在生成目标视频时,可以直接将筛选处理后的待处理视频作为目标视频。In the embodiment of the present invention, firstly, according to the target information of the target picture in the video to be processed, the video to be processed is screened, which can reduce the pictures of poor quality included in the video to be processed, and further improve the image of the video to be processed after the screening process. quality. Correspondingly, after the screening is completed, the target video is generated according to the video to be processed after screening, which can ensure that the final generated target video has higher image quality. For example, when the target video is generated, the screened video to be processed may be directly used as the target video.
综上所述,本发明实施例提供的视频处理方法,可以对于待处理视频中的目标图片,确定目标图片的图片参数对应的目标信息,其中,目标信息用于表征目标图片的图片参数是否满足预设要求,根据待处理视频中包含的目标图片的目标信息,对待处理视频进行筛选处理,其中,筛选处理包括删除不满足预设要求的目标图片或该目标图片所属的其中一个视频片段,最后,根据筛选处理后的待处理视频,生成目标视频。这样,通过在进行视频处理时,根据图片的图片参数自动对视频中的图片进行筛选,进而一定程度上可以降低筛选成本,提高筛选效率。To sum up, the video processing method provided by the embodiment of the present invention can determine the target information corresponding to the picture parameter of the target picture for the target picture in the video to be processed, wherein the target information is used to indicate whether the picture parameter of the target picture satisfies the According to the preset requirements, the to-be-processed video is screened according to the target information of the target picture contained in the to-be-processed video, wherein the screening process includes deleting the target picture that does not meet the preset requirements or one of the video clips to which the target picture belongs, and finally , and generate the target video according to the to-be-processed video after screening. In this way, by automatically screening the pictures in the video according to the picture parameters of the pictures during the video processing, the screening cost can be reduced to a certain extent and the screening efficiency can be improved.
可选的,本发明实施例的一种实现方式中,上述图片参数可以包括图片的模糊程度、抖动程度、曝光程度以及色彩变化程度中的一种或多种。当然,图片参数也可以包括其他种类的参数,本发明实施例对此不作限定。Optionally, in an implementation manner of the embodiment of the present invention, the above picture parameters may include one or more of a blur degree, a shake degree, an exposure degree, and a color change degree of the picture. Certainly, the picture parameters may also include other types of parameters, which are not limited in this embodiment of the present invention.
其中,基于图片的模糊程度可以确定图片整体是否清晰,基于图片的抖动程度可以确定图片的画面是否存在抖动问题,基于图片的曝光程度可以确定图片的亮度是否合适,基于图片的色彩变化程度可以确定图片的色彩差是否合适。而实际应用场景中,在图片中存在模糊不清楚、画面抖动、亮度过高或者过低、色彩差过高或者过低的情况下,往往说明该图片的质量较差。因此,本发明实施例中,通过设置图片的模糊程度、抖动程度、曝光程度和/或色彩变化程度为图片参数,使得后续可以从这几个参数维度对待修理视频中的图片进行筛选,进而一定程度上可以确保能够较为准确的将质量较差的图片删除。Among them, based on the degree of blurring of the picture, it can be determined whether the overall picture is clear, based on the degree of shaking of the picture, it can be determined whether the picture of the picture has a shaking problem, based on the degree of exposure of the picture, it can be determined whether the brightness of the picture is appropriate, and the degree of color change of the picture can be determined. Is the color difference of the picture appropriate? However, in practical application scenarios, when the picture is blurred and unclear, the picture is shaken, the brightness is too high or too low, and the color difference is too high or too low, it often means that the quality of the picture is poor. Therefore, in the embodiment of the present invention, by setting the degree of blurring, the degree of shaking, the degree of exposure and/or the degree of color change of the picture as the picture parameters, the pictures in the video to be repaired can be screened from these parameters in the future, and then certain To a certain extent, it can ensure that pictures with poor quality can be deleted more accurately.
需要说明的是,图片参数的种类越多,后续筛选时参照的维度就会越多,相应地,最终筛选的结果也会更精确,但是计算量也会相应增加。反之,如果图片参数的种类越少,后续筛选时参照的维度就会越少,相应地,最终筛选的结果的精确就会降低,但是计算量会较小。因此,在具体实施时,可以先确定硬件设备可承载的计算量,根据可承载的计算量确定所选用的图片参数的数量,以确保硬件设备能够正常运行。其中,所选用的图片参数的数量与所述可承载的计算量正相关。It should be noted that the more types of picture parameters, the more dimensions will be referenced in subsequent screening, and accordingly, the final screening results will be more accurate, but the amount of calculation will also increase accordingly. Conversely, if there are fewer types of picture parameters, the less dimensions will be referenced in subsequent screening, and accordingly, the accuracy of the final screening results will be reduced, but the amount of calculation will be smaller. Therefore, in the specific implementation, the calculation amount that the hardware device can bear may be determined first, and the number of selected picture parameters is determined according to the amount of calculation that the hardware device can bear, so as to ensure that the hardware device can run normally. Wherein, the number of selected picture parameters is positively related to the loadable calculation amount.
需要说明的是,在选择了多种图片参数的情况下,图片参数对应的目标信息可以为多个。示例的,假设选择了模糊程度、抖动程度、曝光程度以及色彩变化程度,那么可以得到模糊程度对应的模糊标签信息、抖动程度对应的抖动标签信息、曝光程度对应的曝光标签信息以及色彩变化程度对应的色彩差标签信息。其中,模糊标签信息、抖动标签信息、曝光标签信息以及色彩差标签信息即为目标信息。模糊标签信息可以用于表征目标图片的模糊程度是否满足模糊程度对应的预设要求,抖动标签信息用于表征目标图片的抖动程度是否满足抖动程度对应的预设要求,曝光标签信息用于表征目标图片的曝光程度是否满足曝光程度对应的预设要求,色彩差标签信息用于表征目标图片的色彩变化程度是否满足色彩变化程度对应的预设要求。It should be noted that, in the case where multiple picture parameters are selected, there may be multiple pieces of target information corresponding to the picture parameters. As an example, assuming that the degree of blurring, the degree of shaking, the degree of exposure, and the degree of color change are selected, then the fuzzy label information corresponding to the degree of blur, the information of the shaking label corresponding to the degree of shaking, the information of the exposure label corresponding to the degree of exposure, and the corresponding degree of color change can be obtained. color difference label information. Among them, the fuzzy label information, the shake label information, the exposure label information and the color difference label information are the target information. The fuzzy label information can be used to characterize whether the blur degree of the target image meets the preset requirement corresponding to the blur degree, the jitter label information can be used to characterize whether the jitter degree of the target image meets the preset requirement corresponding to the jitter degree, and the exposure label information is used to characterize the target Whether the exposure degree of the picture meets the preset requirements corresponding to the exposure degree, and the color difference label information is used to indicate whether the color change degree of the target image meets the preset requirements corresponding to the color change degree.
可选的,本发明实施例中,上述根据待处理视频中包含的目标图片的目标信息,对待处理视频进行筛选处理的操作,可以通过下述子步骤实现:Optionally, in this embodiment of the present invention, the above-mentioned operation of screening the video to be processed according to the target information of the target picture contained in the video to be processed may be implemented by the following sub-steps:
子步骤(1):对所述待处理视频进行分割,得到多个视频片段。Sub-step (1): segment the to-be-processed video to obtain multiple video segments.
示例的,可以按照等间隔划分的方式,利用预设的分割算法将待处理视频划分为包含相同帧数的视频片段。或者,也可以是按照非等间隔划分的方式,利用预设的分割算法将待处理视频随机划分为包含不同帧数的视频片段,本发明实施例对此不作限定。当然,也可以采用其他分割方式实现划分,本发明实施例对此不作限定。For example, the video to be processed may be divided into video segments containing the same number of frames by using a preset segmentation algorithm in a manner of dividing at equal intervals. Alternatively, a preset segmentation algorithm may be used to randomly divide the video to be processed into video segments containing different numbers of frames in a manner of unequal interval division, which is not limited in this embodiment of the present invention. Of course, other division manners may also be used to implement division, which is not limited in this embodiment of the present invention.
子步骤(2):对于各个所述视频片段,根据所述视频片段中包含的目标图片的目标信息,对所述视频片段进行筛选处理。Sub-step (2): For each of the video clips, perform screening processing on the video clips according to the target information of the target picture contained in the video clips.
本步骤中,可以以视频片段为处理单位,针对每个视频片段进行筛选处理。其中,针对视频片段的筛选处理可以包括删除该视频片段中不满足预设要求的目标图片或该视频片段。In this step, a video clip may be used as a processing unit, and screening processing may be performed for each video clip. The screening process for the video clip may include deleting the target picture or the video clip that does not meet the preset requirements in the video clip.
本发明实施例中,在具体进行筛选处理时,先对待处理视频进行分割,得到多个视频片段。接着,根据视频片段中包含的目标图片的目标信息,以视频片段为处理单位,对视频片段进行筛选处理。这样,通过将待处理视频分割为更小粒度的视频片段之后再进行处理,一定程度上可以降低每次处理操作的负担,进而一定程度上可以确保处理效率。In the embodiment of the present invention, when the screening process is specifically performed, the video to be processed is first divided to obtain a plurality of video segments. Next, according to the target information of the target picture contained in the video clip, the video clip is used as a processing unit to filter the video clip. In this way, by dividing the video to be processed into video segments with smaller granularity and then processing, the burden of each processing operation can be reduced to a certain extent, and further processing efficiency can be ensured to a certain extent.
进一步地,在将待处理视频分割为视频片段的情况下,后续在实现根据筛选处理后的待处理视频生成目标视频的操作时,可以对筛选处理后的视频片段进行合并,得到目标视频。由于筛选处理将各个视频片段中质量较差的图片进行了删除,因此,通过将筛选处理后的视频片段合并得到目标视频,一定程度上可以确保目标视频的质量。具体在合并时,可以按照各个筛选处理后的视频片段在待处理视频中的先后顺序进行合并,以确保最终得到的目标视频能够正常播放。Further, in the case of dividing the video to be processed into video segments, in the subsequent operation of generating the target video according to the video to be processed after the screening process, the video clips after the screening process can be merged to obtain the target video. Since the screening process deletes pictures with poor quality in each video clip, the target video is obtained by merging the video clips after the screening process, which can ensure the quality of the target video to a certain extent. Specifically, when merging, the filtered video segments may be merged according to the sequence in the video to be processed, so as to ensure that the final target video can be played normally.
可选的,在本发明的一种实施方式中,待处理视频可以是用户自主选择的。具体的,可以在确定目标图片的图片参数对应的目标信息之前,接收用户对可选视频的选择操作;将所述选择操作选中的可选视频确定为所述待处理视频;所述可选视频为电子设备中存储的视频。相应地,在生成目标视频之后,可以向所述用户显示所述目标视频,以确保用户能够及时获取到处理结果,提高交互效果。Optionally, in an embodiment of the present invention, the video to be processed may be independently selected by the user. Specifically, before the target information corresponding to the picture parameter of the target picture is determined, a user's selection operation on an optional video may be received; the optional video selected by the selection operation may be determined as the to-be-processed video; the optional video For videos stored in electronic devices. Correspondingly, after the target video is generated, the target video can be displayed to the user, so as to ensure that the user can obtain the processing result in time and improve the interaction effect.
其中,可以在客户端中显示电子设备中存储的各个视频,用户可以点击需要处理的可选视频,实现输入选择操作。相应地,电子设备可以接收该选择操作,并将选中的可选视频确定为待处理视频。需要说明的是,本发明实施例中还可以在确定出目标信息之后,为各个目标图片添加目标信息,并显示给用户。用户可以根据需求以及结合目标图片的目标信息,选择需要删除的目标图片。相应地,电子设备可 以将用户选中的目标图片删除。这样,通过向用户显示添加有目标信息的目标图片,由用户自主决定具体要删除的目标图片,可以提高操作的灵活性。Wherein, each video stored in the electronic device can be displayed in the client, and the user can click the optional video to be processed to realize the input selection operation. Correspondingly, the electronic device may receive the selection operation, and determine the selected optional video as the video to be processed. It should be noted that, in this embodiment of the present invention, after the target information is determined, target information may be added to each target picture and displayed to the user. The user can select the target image to be deleted according to the requirements and in combination with the target information of the target image. Accordingly, the electronic device can delete the target picture selected by the user. In this way, by displaying the target picture with the target information added to the user, the user can independently decide the specific target picture to be deleted, which can improve the flexibility of the operation.
可选的,本发明实施例在对于待处理视频中的目标图片,确定目标图片的图片参数对应的目标信息的操作之前,还可以执行下述步骤:Optionally, in this embodiment of the present invention, for the target picture in the video to be processed, before the operation of determining the target information corresponding to the picture parameter of the target picture, the following steps may also be performed:
步骤A、根据所述待处理视频中包含的图片,确定所述目标图片;从所述待处理视频中提取所述目标图片,并为各个所述目标图片添加时间戳信息;所述时间戳信息用于表征所述目标图片在所述目标图片中的次序。Step A: Determine the target picture according to the pictures contained in the video to be processed; extract the target picture from the video to be processed, and add timestamp information to each of the target pictures; the timestamp information Used to characterize the order of the target picture in the target picture.
本步骤中,可以将各个目标图片在待处理视频中的出现时间点作为时间戳信息。示例的,假设目标图片包含图片a、图片b以及图片c。其中,图片a、图片b及图片c的出现时间点分别为:第10秒,第15秒,第21秒。那么可以将第10秒,第15秒,第21秒分别作为含图片a、图片b,图片c的时间戳信息。进一步地,在为各个目标图片添加时间戳信息时,可以是建立各个目标图片与各自对应的时间戳信息之间的关联关系,或者,也可以是将各自对应的时间戳信息写入目标图片中。In this step, the appearance time point of each target picture in the video to be processed may be used as timestamp information. For example, it is assumed that the target picture includes picture a, picture b, and picture c. Among them, the appearance time points of the picture a, the picture b, and the picture c are respectively: the 10th second, the 15th second, and the 21st second. Then, the 10th second, the 15th second, and the 21st second can be used as the timestamp information including the picture a, the picture b, and the picture c, respectively. Further, when adding time stamp information to each target picture, an association relationship between each target picture and the corresponding time stamp information may be established, or, the corresponding time stamp information may also be written into the target picture. .
相应地,在对于待处理视频中的目标图片,确定所述目标图片的图片参数对应的目标信息的操作之后,还可以执行下述步骤:步骤B、根据所述目标图片的时间戳信息,将所述目标图片合入所述待处理视频。具体实施时,可以先根据时间戳信息确定目标图片所在位置,然后将目标图片插入对应位置中。示例的,可以将图片a插入待处理视频中第10秒的位置处,将图片b插入待处理视频中第15秒的位置处,将图片c插入待处理视频中第21秒的位置处。Correspondingly, after the operation of determining the target information corresponding to the picture parameter of the target picture for the target picture in the video to be processed, the following steps can also be performed: Step B: According to the timestamp information of the target picture, the The target picture is merged into the to-be-processed video. During specific implementation, the location of the target picture may be determined according to the timestamp information first, and then the target picture is inserted into the corresponding location. For example, picture a may be inserted into the video to be processed at the 10th second, picture b may be inserted into the video to be processed at the 15th second, and picture c may be inserted into the video to be processed at the 21st second.
需要说明的是,本发明实施例中,还可以在将目标图片合入待处理视频之前,确定所有图片参数均不满足预设要求的第三目标图片,然后删除该第三目标图片。其中,该预设要求可以包括模糊程度正常、抖动程度正常、曝光程度正常及色彩变化程度正常中的一种或多种。具体的,对于每个图片参数,可以先确定该图片参数对应的目标信息是否是用于表征该图片参数不满足预设要求的标签,如果是,则可以确定该图片参数均不满足预设要求。进一步地,如果所有图片参数均不满足预设要求,则说明该图片的质量很差,因此,可以直接在合入之前直接删除该第三目标图片。这样,可以减轻后续筛选处理的负担,以及避免对不必要的目标图片执行合入操作,进而确保处理效率。It should be noted that, in this embodiment of the present invention, before incorporating the target image into the video to be processed, it is also possible to determine a third target image whose parameters do not meet the preset requirements, and then delete the third target image. Wherein, the preset requirement may include one or more of normal blur degree, normal jitter degree, normal exposure degree and normal color change degree. Specifically, for each picture parameter, it can be first determined whether the target information corresponding to the picture parameter is a label used to indicate that the picture parameter does not meet the preset requirements, and if so, it can be determined that none of the picture parameters meet the preset requirements. . Further, if all the picture parameters do not meet the preset requirements, it means that the quality of the picture is very poor, and therefore, the third target picture can be directly deleted before being merged. In this way, the burden of subsequent screening processing can be reduced, and unnecessary target pictures can be avoided from performing merging operations, thereby ensuring processing efficiency.
本发明实施例中,通过在确定目标图片的图片参数对应的目标信息的操作之前,先确定目标图片并提取目标图片。这样,通过单独将目标图片提取出来,可以避免后续确定目标图片的图片参数对应的目标信息时,受到待处理视频中其他图片的干扰,进而一定程度上可以确保处理效果。同时,在根据提取出的目标图片确定出目标信息之后,再将目标图片合入待处理视频,可以确保后续步骤可以正常进行。且 通过为目标图片加入时间戳信息,使得可以便捷的根据时间戳信息将目标图片合入待处理视频,进而确保处理效果。In the embodiment of the present invention, before the operation of determining the target information corresponding to the picture parameters of the target picture, the target picture is determined and the target picture is extracted. In this way, by extracting the target picture separately, it is possible to avoid interference from other pictures in the video to be processed when the target information corresponding to the picture parameters of the target picture is subsequently determined, thereby ensuring the processing effect to a certain extent. At the same time, after the target information is determined according to the extracted target picture, the target picture is combined into the video to be processed, so as to ensure that the subsequent steps can be performed normally. And by adding time stamp information to the target picture, the target picture can be conveniently combined into the video to be processed according to the time stamp information, thereby ensuring the processing effect.
可选的,在本发明的一种实施方式中,根据待处理视频中包含的图片,确定目标图片的操作可以通过下述子步骤实现:Optionally, in an embodiment of the present invention, according to the pictures contained in the video to be processed, the operation of determining the target picture can be implemented by the following sub-steps:
子步骤(3):在所述待处理视频中包含的图片的总数量大于第一预设数量阈值的情况下,按照预设帧率从所述待处理视频中选取m帧图片,作为所述目标图片;所述m不大于所述第一预设数量阈值。Sub-step (3): when the total number of pictures contained in the video to be processed is greater than the first preset number threshold, select m frames of pictures from the video to be processed according to a preset frame rate as the target picture; the m is not greater than the first preset number threshold.
本步骤中,第一预设数量阈值以及预设帧率可以是根据实际需求设置的。其中,第一预设数量阈值可以为实现筛选所需的图片数量。示例的,第一预设数量阈值可以为50,60,等等。预设帧率可以为4帧/秒(frames per second,fps)。具体的,可以先确定待处理视频中包含的图片的总数量,示例的,可以读取待处理视频的配置信息,以获取总数量。接着,判断总数量是否大于第一预设数量阈值。如果总数量大于第一预设数量阈值,则可以认为图片的总数量较多,若直接将所有图片作为目标图片,会引入过多的计算量。因此,可以按照预设帧率进行选取,相应地后续可以对应提取这些目标图片。即,可以按照预设帧率进行抽帧,得到m个目标图片,这m个目标图片组成一个图片序列。In this step, the first preset number threshold and the preset frame rate may be set according to actual requirements. Wherein, the first preset number threshold may be the number of pictures required to implement screening. For example, the first preset number threshold may be 50, 60, and so on. The preset frame rate can be 4 frames per second (fps). Specifically, the total number of pictures included in the video to be processed may be determined first, for example, the configuration information of the video to be processed may be read to obtain the total number. Next, it is determined whether the total number is greater than the first preset number threshold. If the total number is greater than the first preset number threshold, it can be considered that the total number of pictures is large, and if all pictures are directly used as target pictures, too much calculation amount will be introduced. Therefore, selection can be made according to the preset frame rate, and accordingly, these target pictures can be correspondingly extracted subsequently. That is, frame extraction can be performed according to a preset frame rate to obtain m target pictures, and the m target pictures form a picture sequence.
子步骤(4):在所述总数量不大于所述第一预设数量阈值的情况下,将所述待处理视频中的包含的所有图片,作为所述目标图片。Sub-step (4): under the condition that the total number is not greater than the first preset number threshold, use all pictures included in the video to be processed as the target picture.
如果总数量不大于第一预设数量阈值,则说明若直接将所有图片作为目标图片,不会引入过多的计算量。因此,可以不作其他处理,直接将所有图片作为目标图片。这样,可以在实现确定目标图片的同时,避免执行不必要的其他处理。If the total number is not greater than the first preset number threshold, it means that if all the pictures are directly used as the target pictures, too much calculation amount will not be introduced. Therefore, all pictures can be directly used as target pictures without other processing. In this way, it is possible to avoid performing unnecessary other processing while realizing the determination of the target picture.
本发明实施例中,通过在总数量不大于第一预设数量阈值的情况下,才将待处理视频中的包含的所有图片作为目标图片,在总数量大于第一预设数量阈值的情况下,仅选取部分图片作为目标图片。这样,可一定程度上以在避免目标图片过多的情况下,确保有足够的目标图片,进而可以确保能够为后续的筛选提供更多的信息,确保筛选效果。In this embodiment of the present invention, all pictures included in the video to be processed are used as target pictures only when the total number is not greater than the first preset number threshold, and when the total number is greater than the first preset number threshold , select only part of the image as the target image. In this way, it can be ensured that there are enough target pictures to a certain extent while avoiding too many target pictures, so as to ensure that more information can be provided for subsequent screening, and the screening effect can be ensured.
可选的,本发明实施例中的预设要求可以包括模糊程度对应的预设要求:模糊程度正常、抖动程度对应的预设要求:抖动程度正常曝光程度对应的预设要求:曝光程度正常、色彩变化程度对应的预设要求:色彩变化程度正常。Optionally, the preset requirements in this embodiment of the present invention may include preset requirements corresponding to the degree of blur: normal degree of blur, preset requirements corresponding to the degree of jitter: normal degree of jitter and preset requirements corresponding to the degree of exposure: normal degree of exposure, The preset requirements corresponding to the degree of color change: the degree of color change is normal.
进一步地,对于目标图片的模糊程度对应的目标信息,可以通过下述子步骤确定:Further, the target information corresponding to the blur degree of the target picture can be determined through the following sub-steps:
子步骤(5):确定所述目标图片中模糊置信度大于预设置信度阈值的像素点的第一数量与所述目标图片的总像素数量的第一比值;根据所述第一比值与第一预设 比值阈值之间的大小关系,确定模糊标签信息;所述模糊标签信息用于表征所述目标图片的模糊程度是否正常。Sub-step (5): determine the first ratio of the first number of pixels with the fuzzy confidence greater than the preset reliability threshold in the target picture to the total number of pixels of the target picture; according to the first ratio and the first ratio; A magnitude relationship between preset ratio thresholds determines blur label information; the blur label information is used to represent whether the blur degree of the target image is normal.
本步骤中,模糊置信度可以用于表征像素点为模糊像素点的概率。具体的,可以将目标图片输入至预设的模糊检测模块中,该模糊检测模块可以基于卷积神经网络(Convolutional Neural Networks,CNN)构成的语义分割网络实现。实际应用场景中,可以获取不同模糊程度的样本图片,并利用这些样本图片训练神经网络,以生成模糊检测模块。进一步地,在使用时,该模糊检测模块可以提取输入的目标图片的图片特征,然后根据提取到的图片特征,确定该目标图片中各个像素点的模糊置信度。In this step, the fuzzy confidence level can be used to represent the probability that the pixel point is a fuzzy pixel point. Specifically, the target image may be input into a preset blur detection module, and the blur detection module may be implemented based on a semantic segmentation network composed of convolutional neural networks (Convolutional Neural Networks, CNN). In practical application scenarios, sample images with different degrees of blur can be obtained, and the neural network can be trained by using these sample images to generate a blur detection module. Further, in use, the blur detection module can extract the picture features of the input target picture, and then determine the blur confidence of each pixel in the target picture according to the extracted picture features.
进一步地,预设置信度阈值以及第一预设比值阈值可以是根据实际需求设置,示例的,预设置信度阈值可以为80%。第一预设比值阈值可以为0.2。进一步地,如果像素点的模糊置信度大于预设置信度阈值,则可以认为该像素点存在模糊问题,该像素点为模糊像素点。相应地,本步骤中,可以将各个像素点的模糊置信度与预设置信度阈值进行比较,以确定第一数量。接着将第一数量与目标图片的总像素数量相除,得到第一比值。Further, the preset reliability threshold and the first preset ratio threshold may be set according to actual requirements. For example, the preset reliability threshold may be 80%. The first preset ratio threshold may be 0.2. Further, if the fuzzy confidence of the pixel point is greater than the preset confidence threshold, it can be considered that the pixel point has a fuzzy problem, and the pixel point is a fuzzy pixel point. Correspondingly, in this step, the fuzzy confidence of each pixel can be compared with a preset confidence threshold to determine the first number. Then, the first number is divided by the total number of pixels of the target image to obtain a first ratio.
如果第一比值大于第一预设比值阈值,则可以认为该目标图片的模糊程度较为严重,相应地,可以将目标图片的模糊标签信息设置为第一模糊标签。其中,第一模糊标签表征目标图片的模糊程度不正常。如果第一比值不大于第一预设比值阈值,则可以认为该目标图片的模糊程度在可接受范围内,相应地,可以将目标图片的模糊标签信息设置为第二模糊标签。其中,第二模糊标签表征目标图片的模糊程度正常。示例的,第一模糊标签可以为“unclear”,第二模糊标签可以为“clear”。If the first ratio is greater than the first preset ratio threshold, it may be considered that the degree of blurring of the target image is relatively serious, and accordingly, the blurring label information of the target image may be set as the first blurring label. Wherein, the first fuzzy label indicates that the degree of blurring of the target image is abnormal. If the first ratio is not greater than the first preset ratio threshold, it can be considered that the blur degree of the target image is within an acceptable range, and accordingly, the blur tag information of the target image can be set as the second blur tag. Wherein, the second fuzzy label indicates that the degree of blurring of the target image is normal. For example, the first fuzzy label may be "unclear", and the second fuzzy label may be "clear".
本发明实施例中,通过确定目标图片中模糊的像素点所占的比重,根据比重的大小确定目标图片是否模糊并设置相应的模糊标签信息。一定程度上可以确保图片模糊的判断结果的准确性,以及确保设置的模糊标签信息的可信度。In the embodiment of the present invention, by determining the proportion of the blurred pixels in the target picture, whether the target picture is blurred is determined according to the size of the proportion, and corresponding fuzzy label information is set. To a certain extent, it can ensure the accuracy of the judgment result of image blurring, and ensure the credibility of the set fuzzy label information.
需要说明的是,实际应用场景有时会在拍摄过程中主要对焦于被摄主体,而故意模糊背景,以凸显被摄主体。因此,本发明实施例中还可以确定目标图片中的背景区域,然后检测模糊区域与背景区域的重合比例;其中,模糊区域为模糊置信度大于预设置信度阈值的像素点组成的区域。在第一比值大于第一预设比值阈值且重合比例不大于预设重合比例阈值的情况下,将目标图片的模糊标签信息设置为第一模糊标签。其中,预设重合比例阈值可以根据实际需求设置,示例的,预设重合比例阈值可以为90%,如果重合比例大于预设重合比例阈值,则可以认为该目标图片中的模糊区域属于正常现象,如果重合比例不大于预设重合比例阈值,则可以认为该目标图片中的模糊区域属于非正常因素导致的。It should be noted that, in actual application scenarios, sometimes the subject is mainly focused during the shooting process, and the background is intentionally blurred to highlight the subject. Therefore, in the embodiment of the present invention, the background area in the target image can also be determined, and then the coincidence ratio of the blurred area and the background area can be detected; wherein, the blurred area is an area composed of pixels whose blur confidence is greater than a preset confidence threshold. In the case where the first ratio is greater than the first preset ratio threshold and the overlap ratio is not greater than the preset overlap ratio threshold, the blur tag information of the target image is set as the first blur tag. The preset coincidence ratio threshold can be set according to actual needs. For example, the preset coincidence ratio threshold can be 90%. If the coincidence ratio is greater than the preset coincidence ratio threshold, it can be considered that the blurred area in the target image is a normal phenomenon. If the coincidence ratio is not greater than the preset coincidence ratio threshold, it can be considered that the blurred area in the target image is caused by abnormal factors.
本发明实施例中,通过检测模糊区域是否为背景区域,在模糊区域不为背景区域(重合比例不大于预设重合比例阈值)且第一比值大于第一预设比值阈值的情况下,才为目标图片设置表征模糊程度不正常的第一模糊标签。这样,可以避免在对焦导致的背景模糊的情况下,将目标图片误判为模糊程度不正常的图片,进而为目标图片设置不合适的模糊标签信息。In this embodiment of the present invention, by detecting whether the blurred area is a background area, only when the blurred area is not a background area (the overlap ratio is not greater than the preset overlap ratio threshold) and the first ratio is greater than the first preset ratio threshold The target image sets a first blur label representing an abnormal blur degree. In this way, in the case of blurred background caused by focusing, it can be avoided that the target picture is misjudged as a picture with an abnormal degree of blurring, and then inappropriate fuzzy label information is set for the target picture.
进一步地,对于目标图片的抖动程度对应的目标信息,可以通过下述子步骤确定:Further, the target information corresponding to the degree of shaking of the target picture can be determined by the following sub-steps:
子步骤(6):将所述目标图片作为第一预设分类模型的输入,根据所述第一预设分类模型的输出类别,确定抖动标签信息;所述第一预设分类模型用于按照抖动程度是否正常进行图片分类。Sub-step (6): take the target picture as the input of the first preset classification model, and determine the jitter label information according to the output category of the first preset classification model; the first preset classification model is used for Whether the degree of jitter is normal for image classification.
本步骤中,抖动标签信息用于表征目标图片的抖动程度是否正常。第一预设分类模型可以为基于CNN神经网络,该第一预设分类模型可以是以不同抖动程度(包含抖动程度正常以及不正常)的样本图片训练得到的,该第一预设分类模型可以在训练过程中,进过深度学习学习到分辨图片的抖动程度是否正常的能力。具体的,将目标图片输入第一预设分类模型之后,第一预设分类模型可以提取输入的目标图片的图片特征,然后根据提取到的图片特征,判断该目标图片的抖动程度是否正常,并输出类别。相应地,若第一预设分类模型的输出类别为表征抖动程度不正常的类别,则可以将目标图片的抖动标签信息设置为第一模糊标签。若第一预设分类模型的输出类别为表征抖动程度正常的类别,则可以将目标图片的抖动标签信息设置为第二模糊标签。示例的,第一抖动标签可以为“tremble”,第二抖动标签可以为“untremble”。In this step, the shaking label information is used to represent whether the shaking degree of the target image is normal. The first preset classification model may be based on a CNN neural network, and the first preset classification model may be obtained by training sample pictures with different degrees of jitter (including normal and abnormal degrees of jitter), and the first preset classification model may be. During the training process, through deep learning to learn the ability to distinguish whether the jitter of the picture is normal or not. Specifically, after the target picture is input into the first preset classification model, the first preset classification model can extract the picture features of the input target picture, and then according to the extracted picture features, determine whether the degree of shaking of the target picture is normal, and output category. Correspondingly, if the output category of the first preset classification model is a category representing an abnormal degree of shaking, the shaking label information of the target image may be set as the first fuzzy label. If the output category of the first preset classification model is a category representing a normal degree of shaking, the shaking label information of the target image may be set as the second fuzzy label. For example, the first jitter tag may be "tremble", and the second jitter tag may be "untremble".
本发明实施例中,通过第一预设分类模型的输出类别确定抖动标签信息,这样,仅需将目标图片输入第一预设分类模型,即可便捷的确定出目标图片是否抖动,进而可以方便后续设置抖动标签信息,提高设置效率。In this embodiment of the present invention, the jitter label information is determined by the output category of the first preset classification model. In this way, it is only necessary to input the target image into the first preset classification model to conveniently determine whether the target image jitters, which in turn can facilitate Set the jitter label information later to improve the setting efficiency.
进一步地,对于目标图片的曝光程度对应的目标信息,可以通过下述子步骤确定:Further, the target information corresponding to the exposure degree of the target picture can be determined by the following sub-steps:
子步骤(7):将所述目标图片作为第二预设分类模型的输入,根据所述第二预设分类模型的输出类别,确定曝光标签信息;所述第二预设分类模型用于按照曝光程度是否正常进行图片分类。Sub-step (7): take the target picture as the input of the second preset classification model, and determine the exposure label information according to the output category of the second preset classification model; the second preset classification model is used for Whether the exposure level is normal for image classification.
本步骤中,曝光标签信息用于表征目标图片的曝光程度是否正常。第二预设分类模型可以为基于CNN神经网络,该第二预设分类模型可以是以不同曝光程度(包含曝光程度正常以及曝光不正常)的样本图片训练得到的,该第二预设分类模型可以在训练过程中学习到分辨图片的曝光程度是否正常的能力。具体的,将目标图片 输入第二预设分类模型之后,第二预设分类模型可以提取输入的目标图片的图片特征,然后根据提取到的图片特征,判断该目标图片的曝光程度是否正常,并输出类别。相应地,若第二预设分类模型的输出类别为表征曝光程度不正常的类别,则可以将目标图片的曝光标签信息设置为第一曝光标签。若第二预设分类模型的输出类别为表征曝光程度正常的类别,则可以将目标图片的曝光标签信息设置为第二曝光标签。示例的,第一曝光标签可以为“expose F”,第二曝光标签可以为“expose R”。In this step, the exposure label information is used to represent whether the exposure degree of the target image is normal. The second preset classification model may be based on a CNN neural network, and the second preset classification model may be obtained by training sample pictures with different exposure levels (including normal exposure and abnormal exposure). The second preset classification model The ability to distinguish whether the exposure level of a picture is normal can be learned during the training process. Specifically, after the target picture is input into the second preset classification model, the second preset classification model can extract the picture features of the input target picture, and then judge whether the exposure degree of the target picture is normal according to the extracted picture features, and output category. Correspondingly, if the output category of the second preset classification model is a category representing an abnormal degree of exposure, the exposure label information of the target image may be set as the first exposure label. If the output category of the second preset classification model is a category representing a normal exposure degree, the exposure label information of the target image may be set as the second exposure label. For example, the first exposure tag may be "expose F", and the second exposure tag may be "expose R".
本发明实施例中,通过第二预设分类模型的输出类别确定曝光标签信息,这样,仅需将目标图片输入第二预设分类模型,即可便捷的确定出目标图片是否存在曝光不正常(过曝过暗),进而可以方便后续设置曝光标签信息,提高设置效率。In the embodiment of the present invention, the exposure label information is determined by the output category of the second preset classification model. In this way, it is only necessary to input the target image into the second preset classification model to conveniently determine whether the target image has abnormal exposure ( Overexposure and overdarkness), which can facilitate subsequent setting of exposure label information and improve setting efficiency.
进一步地,对于目标图片的色彩变化程度对应的目标信息,可以通过下述子步骤确定:Further, the target information corresponding to the color change degree of the target picture can be determined by the following sub-steps:
子步骤(8):确定所述目标图片中颜色值超出预设颜色值范围的像素点的第二数量与所述总像素数量的第二比值;根据所述第二比值与第二预设比值阈值之间的大小关系,确定色彩差标签信息;所述色差标签信息用于表征所述目标图片的色彩变化程度是否正常。Sub-step (8): determine the second ratio between the second number of pixels whose color value exceeds the preset color value range in the target picture and the total number of pixels; according to the second ratio and the second preset ratio The size relationship between the thresholds determines the color difference label information; the color difference label information is used to represent whether the color change degree of the target picture is normal.
本发明实施例中,像素点的颜色值可以为像素点的颜色通道值,,例如,红绿蓝(RGB)颜色通道值。预设颜色值范围以及第二预设比值阈值可以根据实际需求设置。示例的,预设颜色值范围可以是根据色彩差正常的多个图片中的最低颜色值以及最高颜色值确定的。第二预设比值阈值可以是根据会影响到用户观看体验的最低比值阈值。如果颜色值落入预设颜色值范围,则可以认为该像素点的色彩正常,如果颜色值未落入预设颜色值范围,则可以认为该像素点的色彩存在异常。In the embodiment of the present invention, the color value of the pixel point may be the color channel value of the pixel point, for example, the red, green and blue (RGB) color channel value. The preset color value range and the second preset ratio threshold can be set according to actual needs. For example, the preset color value range may be determined according to the lowest color value and the highest color value in multiple pictures with normal color difference. The second preset ratio threshold may be based on the lowest ratio threshold that will affect the user's viewing experience. If the color value falls within the preset color value range, it can be considered that the color of the pixel point is normal, and if the color value does not fall within the preset color value range, it can be considered that the color of the pixel point is abnormal.
进一步地,可以先利用颜色值检测算法确定目标图片中各个像素点的颜色值。然后将颜色值与预设颜色值范围进行比对,以确定第二数量。接着将第二数量与目标图片的总像素数量相除,得到第二比值。Further, a color value detection algorithm may be used to determine the color value of each pixel in the target image. The color value is then compared to a preset range of color values to determine the second quantity. The second ratio is then divided by the total number of pixels in the target image.
如果第二比值大于第二预设比值阈值,则可以认为该目标图片的色彩差异常,即,色彩变化程度不正常。相应地,可以将目标图片的色彩差标签信息设置为第一色彩差标签。其中,第一色彩差标签表征目标图片的色彩变化程度不正常。如果第二比值不大于第二预设比值阈值,则可以认为该目标图片的色彩变化程度正常,相应地,可以将目标图片的色彩差标签信息设置为第二色彩差标签。其中,第二色彩差标签表征目标图片的色彩变化程度正常。示例的,第一色彩差标签可以为“Color F”,第二色彩差标签可以为“Color R”。If the second ratio is greater than the second preset ratio threshold, it can be considered that the color difference of the target picture is normal, that is, the degree of color change is abnormal. Correspondingly, the color difference label information of the target picture may be set as the first color difference label. The first color difference label indicates that the color change degree of the target picture is abnormal. If the second ratio is not greater than the second preset ratio threshold, it can be considered that the color change degree of the target picture is normal, and accordingly, the color difference label information of the target picture can be set as the second color difference label. The second color difference label indicates that the color change degree of the target picture is normal. For example, the first color difference label may be "Color F", and the second color difference label may be "Color R".
本发明实施例中,通过确定目标图片中色彩异常的像素点所占的比重,根据比重的大小确定目标图片的色彩变化程度是否正常并设置相应的色彩差标签信息,一 定程度上可以确保图片的色彩差的判断结果的准确性,以及确保设置的色彩差标签信息的可信度。In the embodiment of the present invention, by determining the proportion of pixels with abnormal colors in the target picture, determining whether the color change degree of the target picture is normal according to the proportion of the proportion, and setting the corresponding color difference label information, to a certain extent, it is possible to ensure that the picture quality The accuracy of the color difference judgment result, and the reliability of the set color difference label information.
可选的,本发明实施例中,上述根据视频片段中包含的目标图片的目标信息,对视频片段进行筛选处理的步骤,可以包括:Optionally, in the embodiment of the present invention, the above-mentioned step of screening the video clips according to the target information of the target pictures included in the video clips may include:
子步骤(2a):确定所述视频片段中第一目标图片的第三数量;所述第一目标图片为目标信息表征图片参数不满足所述预设要求且连续出现的目标图片。Sub-step (2a): Determine a third number of first target pictures in the video segment; the first target pictures are target pictures whose target information indicates that picture parameters do not meet the preset requirements and appear continuously.
本步骤中,连续出现的目标图片指的是该目标图片在视频片段中前向邻接和/或后向邻接的图片也为目标图片。具体的,可以先根据该视频片段中各个目标图片的目标信息,确定图片参数不满足预设要求的目标图片。然后将连续出现的图片参数不满足预设要求的目标图片的图片作为第一目标图片。其中,图片参数不满足预设要求可以是所有图片参数均不满足预设要求,也可以是部分图片参数不满足预设要求。相应地,可以将所有图片参数均不满足预设要求且连续出现的目标图片作为第一目标图片,例如,可以将目标信息为:第一模糊标签“unclear”、第一抖动标签“tremble”、第一曝光标签“expose F”以及第一色彩差标签“Color F”的连续出现的目标图片作为第一目标图片。或者,也可以是将部分图片参数不满足预设要求且连续出现的目标图片作为第一目标图片。In this step, the target pictures that appear continuously refer to the pictures that are adjacent to the target picture in the forward direction and/or the backward direction in the video segment are also target pictures. Specifically, the target pictures whose picture parameters do not meet the preset requirements may be determined according to the target information of each target picture in the video clip. Then, the pictures of the target pictures whose picture parameters do not meet the preset requirements that appear continuously are used as the first target pictures. Wherein, the picture parameters do not meet the preset requirements may be that all the picture parameters do not meet the preset requirements, or some picture parameters do not meet the preset requirements. Correspondingly, the target picture that all picture parameters do not meet the preset requirements and appears continuously can be used as the first target picture, for example, the target information can be: the first fuzzy label "unclear", the first jitter label "tremble", The consecutively appearing target pictures of the first exposure label "expose F" and the first color difference label "Color F" are used as the first target picture. Alternatively, a target picture in which some of the picture parameters do not meet the preset requirements and which appear continuously may also be used as the first target picture.
示例的,假设该视频片段中包括图片1ˉ图片20。其中,图片参数不满足预设要求的目标图片包括:图片1ˉ图片10、图片11、图片13。由于图片1ˉ图片10为连续出现的图片参数不满足预设要求的目标图片。因此,可以将图片1ˉ图片10确定为第一目标图片。For example, it is assumed that the video clip includes picture 1 to picture 20. The target pictures whose picture parameters do not meet the preset requirements include: picture 1, picture 10, picture 11, and picture 13. Because the picture 1 to the picture 10 are the target pictures whose parameters that appear continuously do not meet the preset requirements. Therefore, picture 1 to picture 10 can be determined as the first target picture.
子步骤(2b):若所述第三数量大于第二预设数量阈值,则剔除n个所述第一目标图片;所述n小于所述第三数量。Sub-step (2b): if the third number is greater than the second preset number threshold, remove n of the first target pictures; the n is less than the third number.
本步骤中,第二预设数量阈值可以是根据实际需求设置,如果第三数量大于第二预设数量阈值,则可以认为如果直接删除全部第一目标图片,很大概率会导致后续出现视频中画面前后衔接不和谐,视频播放不流畅的问题。因此,可以仅剔除部分第一目标图片。具体的,可以随机选择n个第一目标图片删除。In this step, the second preset number threshold may be set according to actual needs. If the third number is greater than the second preset number threshold, it can be considered that if all the first target pictures are directly deleted, there is a high probability that the video will appear in the video later. The front and back of the screen are not harmoniously connected, and the video playback is not smooth. Therefore, only part of the first target picture can be culled. Specifically, n first target pictures may be randomly selected for deletion.
本发明实施例中,通过确定连续出现且图片参数不满足预设要求的第一目标图片,在第一目标图片较多的情况下,剔除n个部分第一目标图片,可以在低质量图片连续出现的情况下,避免由于剔除过多连续的图片导致后续视频播放不流畅的问题。In the embodiment of the present invention, by determining the first target pictures that appear continuously and the picture parameters do not meet the preset requirements, and in the case of a large number of first target pictures, n partial first target pictures are eliminated, and the continuous low-quality pictures can be In the case of occurrence, avoid the problem that the subsequent video playback is not smooth due to culling too many consecutive pictures.
进一步地,本发明实施例中,还可以在剔除n个第一目标图片之后,为剩余的所述第一目标图片添加图片转场效果,其中,图片转场效果可以用于控制第一目标图片出现时的显示效果,该图片转场效果可以包括右出左进、旋出旋入以及淡入淡 出中的一种或多种。当然,图片转场效果还可以包括其他类型的效果,本发明实施例对此不作限定。这样,通过为剩余的第一目标图片添加图片转场效果,一定程度上可以提高第一目标图片显示时的显示效果。需要说明的是,本发明实施例可以是在接收到用户发送的添加指令时,执行添加图片转场效果的操作,这样,可以避免执行不必要的添加操作,导致最终生成的目标视频无法满足用户需求的问题。Further, in this embodiment of the present invention, after removing n first target pictures, a picture transition effect may be added to the remaining first target pictures, wherein the picture transition effect may be used to control the first target picture The display effect when it appears, the picture transition effect can include one or more of right out, left in, spin out and spin in, and fade in and fade out. Certainly, the picture transition effect may also include other types of effects, which are not limited in this embodiment of the present invention. In this way, by adding a picture transition effect to the remaining first target picture, the display effect when the first target picture is displayed can be improved to a certain extent. It should be noted that, in this embodiment of the present invention, an operation of adding a picture transition effect may be performed when an adding instruction sent by a user is received. In this way, unnecessary adding operations can be avoided, resulting in that the final generated target video cannot satisfy the user’s needs. question of needs.
进一步地,本发明实施例中,还可以在剔除n个第一目标图片之后,根据剩余的第一目标图片的图片内容,生成过渡视频帧;其中,过渡视频帧的图片参数满足预设要求且图片相似度大于预设相似度阈值,图片相似度为过渡视频帧的图片内容与剩余的第一目标图片的图片内容之间的相似度。预设相似度阈值可以是根据实际需求设置,示例的,预设相似度阈值可以为99%。接着,在剩余的第一目标图片中添加过渡视频帧。本发明实施例中,通过进一步在剩余的第一目标图片中插入内容相似的高质量图片,一定程度上可以在确保视频的图像质量的同时,更大程度的避免由于剔除第一目标图片,导致的视频中画面前后衔接不和谐,视频播放不流畅的问题。Further, in this embodiment of the present invention, after removing n first target pictures, a transition video frame may be generated according to the picture content of the remaining first target pictures; wherein, the picture parameters of the transition video frame meet preset requirements and The picture similarity is greater than the preset similarity threshold, and the picture similarity is the similarity between the picture content of the transition video frame and the picture content of the remaining first target picture. The preset similarity threshold may be set according to actual requirements. For example, the preset similarity threshold may be 99%. Next, transition video frames are added to the remaining first target pictures. In the embodiment of the present invention, by further inserting high-quality pictures with similar content into the remaining first target pictures, the image quality of the video can be ensured to a certain extent, and at the same time, the culling of the first target picture can be avoided to a greater extent. In the video, the front and back of the video are not harmoniously connected, and the video playback is not smooth.
可选的,本发明实施例中还可以在上述子步骤(2a)之前执行下述步骤:Optionally, in this embodiment of the present invention, the following steps may also be performed before the foregoing sub-step (2a):
子步骤(2c):确定所述视频片段中包含的第二目标图片的第四数量与所述视频片段中图片的总数量的第三比值;所述第二目标图片为目标信息表征图片参数不满足所述预设要求的目标图片。Sub-step (2c): determine the third ratio between the fourth number of the second target pictures included in the video clip and the total number of pictures in the video clip; the second target picture is the target information indicating that the picture parameters are different. A target image that meets the preset requirements.
本步骤中,可以先根据该视频片段中各个目标图片的目标信息,确定图片参数不满足预设要求的目标图片,进而得到第二目标图片。具体确定方式可以参照前述相关步骤中的描述,此处不再赘述。然后,可以统计确定出的第二目标图片的数量,得到第四数量。接着,可以计算第四数量与该视频片段中图片的总数量的比值,得到第三比值。In this step, first, according to the target information of each target picture in the video clip, a target picture whose picture parameters do not meet the preset requirements can be determined, and then a second target picture is obtained. For the specific determination method, reference may be made to the descriptions in the foregoing related steps, which will not be repeated here. Then, the number of the determined second target pictures may be counted to obtain a fourth number. Next, a ratio of the fourth number to the total number of pictures in the video clip may be calculated to obtain a third ratio.
子步骤(2d):确定第三预设比值阈值及第三预设数量阈值。Sub-step (2d): determine a third preset ratio threshold and a third preset number threshold.
子步骤(2e):若所述第三比值大于所述第三预设比值阈值,和/或,所述第四数量大于所述第三预设数量阈值,则丢弃所述视频片段。Sub-step (2e): if the third ratio is greater than the third preset ratio threshold, and/or the fourth number is greater than the third preset number threshold, discard the video clip.
本发明实施例中,第三预设比值阈值及第三预设数量阈值可以是根据实际需求设置,如果第三比值大于第三预设比值阈值,则可以认为该视频片段中低质量图片所占的比重较大,如果第四数量大于第三预设数量阈值,则可以认为该视频片段中低质量图片的数量较多,进而可以确定该视频片段的整体质量较差,因此,可以直接丢弃该视频片段。In this embodiment of the present invention, the third preset ratio threshold and the third preset quantity threshold may be set according to actual requirements. If the third ratio is greater than the third preset ratio threshold, it may be considered that the low-quality pictures in the video clip occupy the proportion of If the fourth number is greater than the third preset number threshold, it can be considered that the number of low-quality pictures in the video clip is large, and it can be determined that the overall quality of the video clip is poor. Therefore, the video clip can be directly discarded. video clips.
同时,以低质量的第二目标图片所占的比重以及具体数量两个维度衡量视频片段的整体质量,可以避免由于视频片段中图片的整体数量太多,导致视频片段中存 在多张第二目标图片时未被识别的情况,以及避免主观上第二目标图片数量较少,但相对视频片段而言第二目标图片占据较大比重却未被识别的情况,进而确保确定整理质量较差的视频片段的准确性。At the same time, the overall quality of the video clip is measured in terms of the proportion of low-quality second target pictures and the specific quantity, which can avoid the existence of multiple second targets in the video clip due to the large number of pictures in the video clip. The situation where the picture is not recognized, and avoid the situation that the number of second target pictures is subjectively small, but the second target picture occupies a large proportion in the video clip but is not recognized, so as to ensure that the video with poor quality is determined. Fragment accuracy.
需要说明的是,本发明实施例中还可以向用户显示第三比值大于第三预设比值阈值,和/或,第四数量大于第三预设数量阈值的视频片段,接收用户针对这些视频片段的选择操作,然后将选择操作选中的视频片段删除,以确保用户操作的灵活性。It should be noted that, in this embodiment of the present invention, the video clips whose third ratio is greater than the third preset ratio threshold, and/or the fourth number is greater than the third preset number threshold may also be displayed to the user, and the user receives the video clips for these video clips. select operation, and then delete the video clip selected by the selection operation to ensure the flexibility of user operation.
本发明实施例中,通过确定各个视频片段中出现的质量较差的第二目标图片所占的比例或具体数量,在第二目标图片所占的比例较高或具体数量较大,即,该视频片段的整体质量较差的情况下,直接丢弃该视频片段,进而一定程度上可以减少后续相关操作的计算量,节省处理资源。需要说明的是,上述子步骤可以是在子步骤(2a)之前执行,也可以是在子步骤(2a)之后执行。可选的,可以是在子步骤(2a)之前执行,这样,可以避免执行不必要的剔除操作,进而可以更大程度的节省处理资源。In the embodiment of the present invention, by determining the proportion or specific number of second target pictures with poor quality appearing in each video segment, the proportion or specific number of the second target pictures is relatively high, that is, the When the overall quality of the video clip is poor, the video clip is directly discarded, thereby reducing the computation amount of subsequent related operations to a certain extent and saving processing resources. It should be noted that, the above-mentioned sub-steps may be executed before sub-step (2a), or may be executed after sub-step (2a). Optionally, it can be performed before sub-step (2a), so that unnecessary culling operations can be avoided, and processing resources can be saved to a greater extent.
可选的,本发明实施例中,上述确定第三预设比值阈值及第三预设数量阈值的步骤,可以包括:子步骤(2d1):根据所述待处理视频对应的视频剪辑模板,确定所述视频剪辑模板对应的所述第三预设比值阈值及所述第三预设数量阈值;其中,不同的视频剪辑模板对应的所述第三预设比值阈值不同,不同的视频剪辑模板对应的所述第三预设数量阈值不同,所述第三预设比值阈值及所述第三预设数量阈值与所述视频剪辑模板对应视频内容类型相关。Optionally, in this embodiment of the present invention, the above-mentioned step of determining the third preset ratio threshold and the third preset quantity threshold may include: sub-step (2d1): determining, according to the video clip template corresponding to the video to be processed, the third preset ratio threshold and the third preset quantity threshold corresponding to the video clip templates; wherein, the third preset ratio thresholds corresponding to different video clip templates are different, and different video clip templates correspond to The third preset number threshold is different, and the third preset ratio threshold and the third preset number threshold are related to the video content type corresponding to the video clip template.
本发明实施例中的视频处理方法可以应用在视频自动剪辑场景中。一般,为了提高视频剪辑效果,往往需要将视频中的废片(质量较低的图片)剔除。因此,可以基于本发明实施例中的视频处理方法实现视频剪辑中的废片剔除,以提高视频剪辑的效率以及后续的观看体验。同时,相较于人工检测剔除的方式,本发明实施例提供的自动剔除的方式可以实时在线的进行,进而一定程度上可以降低剔除的耗时,以及确保处理的及时性。示例的,以目标参数包括模糊程度、抖动程度以及曝光程度为例,图2是本发明实施例提供的一种剪辑过程示意图。如图2所示,可以先由用户选取素材(待处理视频),然后按照预设帧率进行抽帧,得到目标图片组成的图片序列。接着,经过CNN网络,确定目标图片的模糊标签信息、抖动标签信息以及曝光标签信息,根据目标图片的时间戳信息,将目标图片及标签信息重新合入待处理视频,按照用户偏好的剪辑模板或选择的剪辑模板,选取适合剪辑的片段进行剪辑。其中,适合剪辑的片段可以是在经过上述子步骤(2e)丢弃之后,剩余的视频片段。The video processing method in the embodiment of the present invention can be applied to an automatic video editing scene. Generally, in order to improve the effect of video editing, it is often necessary to remove waste films (pictures with lower quality) in the video. Therefore, based on the video processing method in the embodiment of the present invention, waste pieces in the video clip can be eliminated, so as to improve the efficiency of the video clip and the subsequent viewing experience. Meanwhile, compared with the manual detection and rejection method, the automatic rejection method provided by the embodiments of the present invention can be performed online in real time, thereby reducing the time-consuming of rejection to a certain extent and ensuring the timeliness of processing. By way of example, taking the target parameters including the degree of blur, the degree of shaking, and the degree of exposure as an example, FIG. 2 is a schematic diagram of an editing process provided by an embodiment of the present invention. As shown in FIG. 2 , a user may first select a material (video to be processed), and then perform frame extraction according to a preset frame rate to obtain a picture sequence composed of target pictures. Then, through the CNN network, the fuzzy label information, jitter label information and exposure label information of the target image are determined. Select the clip template, select the clip suitable for clipping. The clips suitable for editing may be the remaining video clips after being discarded through the above sub-step (2e).
进一步地,待处理视频对应的视频剪辑模板可以是从可选剪辑模板中选择的。 其中,可选剪辑模板可以是电子设备中预置的,可以用于进行视频剪辑的模板,例如,可选剪辑模板可以包括运动模板、美食模板、动感模板,等等。在选择时,可以是按照用户偏好选择,例如,可以根据用户的历史使用记录,确定用户常用的剪辑模板。然后将该常用的剪辑模板确定为该待处理视频对应的视频剪辑模板。或者,也可以是直接向用户显示可选剪辑模板,将用户选择的可选剪辑模板确定为待处理视频对应的视频剪辑模板。进一步地,可以在预先设置的视频剪辑模板与阈值对应关系中,查找该待处理视频对应的视频剪辑模板对应的第三预设比值阈值及第三预设数量阈值。Further, the video clip template corresponding to the video to be processed may be selected from optional clip templates. The optional editing template may be preset in the electronic device and may be used for video editing. For example, the optional editing template may include a sports template, a gourmet template, a dynamic template, and the like. When selecting, it may be selected according to the user's preference, for example, the editing template commonly used by the user may be determined according to the user's historical usage record. Then, the commonly used editing template is determined as the video editing template corresponding to the video to be processed. Alternatively, the optional editing template may be directly displayed to the user, and the optional editing template selected by the user is determined as the video editing template corresponding to the video to be processed. Further, the third preset ratio threshold and the third preset quantity threshold corresponding to the video clip template corresponding to the video to be processed may be searched in the preset correspondence between the video clip template and the threshold.
为了提高视频的视觉效果,在对视频进行剪辑时,往往会按照视频剪辑模板对视频进行剪辑。其中,对应不同的视频内容类型,视频剪辑模板的剪辑方式往往不同。示例的,对应视频内容类型为动态内容,例如突出视频中运动主体的运动过程的视频剪辑模板,例如,运动模板、动感模板,利用这些模板进行剪辑时,往往是重点突出视频画面的连贯性,由于这类视频画面中内容往往处于运动状态,因此,用户较难察觉到低质量图片。而对应视频内容类型静态内容,例如,突出视频中静态主体的外观的视频剪辑模板,例如,美食模板,利用这些模板进行剪辑时,往往是重点突出视频画面中主体的外观细节,这种情况下视频画面中内容的运动幅度往往较小,因此,用户较容易察觉在到低质量图片较多。相应地,本发明实施例中,可以针对各个视频剪辑模板的画面特点,为不同视频剪辑模板设置不同的阈值。例如,在视频剪辑模板对应视频内容类型为动态内容的情况下,设置较高的第三预设比值阈值及第三预设数量阈值,在视频剪辑模板对应视频内容类型为动态内容的情况下,设置较低的第三预设比值阈值及第三预设数量阈值。In order to improve the visual effect of the video, when editing the video, the video is often edited according to the video editing template. Among them, corresponding to different video content types, the editing methods of the video editing templates are often different. For example, the corresponding video content type is dynamic content, such as a video editing template that highlights the motion process of the moving subject in the video, such as motion template and dynamic template. When using these templates for editing, the focus is often on the coherence of the video screen. Since the content in such video images is often in motion, it is difficult for users to perceive low-quality images. For static content corresponding to the type of video content, for example, a video editing template that highlights the appearance of a static subject in a video, such as a gourmet template, when using these templates for editing, the focus is often on the appearance details of the subject in the video screen. In this case The motion range of the content in the video picture is often small, so it is easier for the user to perceive that there are more low-quality pictures. Correspondingly, in the embodiment of the present invention, different thresholds may be set for different video editing templates according to the picture characteristics of each video editing template. For example, in the case where the video content type corresponding to the video clip template is dynamic content, a higher third preset ratio threshold and a third preset number threshold are set, and in the case where the video content type corresponding to the video clip template is dynamic content, A lower third preset ratio threshold and third preset quantity threshold are set.
本发明实施例中,通过根据视频剪辑模板对应视频内容类型,为不同的视频剪辑模板对应设置不同的第三预设比值阈值以及第三预设数量阈值,并在剪辑时,根据当前所使用的视频剪辑模板,选择对应的阈值对视频片段进行筛选,进而一定程度上可以使筛选操作更加适配当前的剪辑需求,进而提高片段筛选的效果。In the embodiment of the present invention, different third preset ratio thresholds and third preset quantity thresholds are correspondingly set for different video editing templates according to the video content types corresponding to the video editing templates, and when editing, according to the currently used Video clip template, select the corresponding threshold to filter video clips, and then to a certain extent, the screening operation can be more adapted to the current clipping needs, thereby improving the effect of clip screening.
图3是本发明实施例提供的一种视频处理装置的框图,该装置可以包括:存储器301和处理器302。所述存储器301,用于存储程序代码。所述处理器302,调用所述程序代码,当所述程序代码被执行时,用于执行以下操作:对于待处理视频中的目标图片,确定所述目标图片的图片参数对应的目标信息;所述目标信息用于表征所述目标图片的图片参数是否满足预设要求;根据所述待处理视频中包含的目标图片的目标信息,对所述待处理视频进行筛选处理;其中,所述筛选处理包括删除不满足所述预设要求的目标图片或所述目标图片所属的其中一个视频片段;根据筛选处理后的待处理视频,生成目标视频。具体的,处理器302执行的各个操作的具 体实现过程可以参照前述方法实施例中的相关描述,此处不再赘述。FIG. 3 is a block diagram of a video processing apparatus provided by an embodiment of the present invention. The apparatus may include: a memory 301 and a processor 302 . The memory 301 is used to store program codes. The processor 302 calls the program code, and when the program code is executed, is configured to perform the following operations: for the target picture in the video to be processed, determine the target information corresponding to the picture parameter of the target picture; The target information is used to represent whether the picture parameters of the target picture meet the preset requirements; according to the target information of the target picture contained in the video to be processed, the video to be processed is screened; wherein, the screening process It includes deleting a target picture that does not meet the preset requirements or one of the video clips to which the target picture belongs; and generating a target video according to the video to be processed after screening. Specifically, for the specific implementation process of each operation performed by the processor 302, reference may be made to the relevant descriptions in the foregoing method embodiments, which will not be repeated here.
可选的,所述图片参数包括图片的模糊程度、抖动程度、曝光程度以及色彩变化程度中的一种或多种。可选的,所述处理器302,还用于:对所述待处理视频进行分割,得到多个视频片段;对于各个所述视频频段,根据所述视频片段中包含的目标图片的目标信息,对所述视频片段进行筛选处理。可选的,所述处理器302,还用于:对于待处理视频中的目标图片,确定所述目标图片的图片参数对应的目标信息之前,根据所述待处理视频中包含的图片,确定所述目标图片;从所述待处理视频中提取所述目标图片,并为各个所述目标图片添加时间戳信息;所述时间戳信息用于表征所述目标图片在所述目标图片中的次序;对于待处理视频中的目标图片,确定所述目标图片的图片参数对应的目标信息之后,根据所述目标图片的时间戳信息,将所述目标图片合入所述待处理视频。可选的,所述处理器302,还用于:在所述待处理视频中包含的图片的总数量大于第一预设数量阈值的情况下,按照预设帧率从所述待处理视频中选取m帧图片,作为所述目标图片;所述m不大于所述第一预设数量阈值;在所述总数量不大于所述第一预设数量阈值的情况下,将所述待处理视频中的包含的所有图片,作为所述目标图片。可选的,所述处理器302,还用于:确定所述目标图片中模糊置信度大于预设置信度阈值的像素点的第一数量与所述目标图片的总像素数量的第一比值;根据所述第一比值与第一预设比值阈值之间的大小关系,确定模糊标签信息;所述模糊标签信息用于表征所述目标图片的模糊程度是否正常;和/或,将所述目标图片作为第一预设分类模型的输入,根据所述第一预设分类模型的输出类别,确定抖动标签信息;所述第一预设分类模型用于按照抖动程度是否正常进行图片分类;和/或,将所述目标图片作为第二预设分类模型的输入,根据所述第二预设分类模型的输出类别,确定曝光标签信息;所述第二预设分类模型用于按照曝光程度是否正常进行图片分类;和/或,确定所述目标图片中颜色值超出预设颜色值范围的像素点的第二数量与所述总像素数量的第二比值;根据所述第二比值与第二预设比值阈值之间的大小关系,确定色彩差标签信息;所述色差标签信息用于表征所述目标图片的色彩变化程度是否正常。Optionally, the picture parameters include one or more of a blur degree, a shake degree, an exposure degree, and a color change degree of the picture. Optionally, the processor 302 is further configured to: segment the video to be processed to obtain multiple video segments; for each of the video segments, according to the target information of the target picture included in the video segment, Screening processing is performed on the video clips. Optionally, the processor 302 is further configured to: for the target picture in the video to be processed, before determining the target information corresponding to the picture parameter of the target picture, determine the target picture according to the pictures included in the video to be processed. The target picture is extracted; the target picture is extracted from the video to be processed, and timestamp information is added to each of the target pictures; the timestamp information is used to characterize the order of the target picture in the target picture; For the target picture in the video to be processed, after determining the target information corresponding to the picture parameter of the target picture, the target picture is combined into the video to be processed according to the timestamp information of the target picture. Optionally, the processor 302 is further configured to: in the case that the total number of pictures included in the to-be-processed video is greater than a first preset number threshold, extract data from the to-be-processed video according to a preset frame rate. Select m frames of pictures as the target pictures; the m is not greater than the first preset number threshold; when the total number is not greater than the first preset number threshold, the video to be processed is All pictures contained in , as the target picture. Optionally, the processor 302 is further configured to: determine the first ratio of the first number of pixels with a fuzzy confidence level greater than a preset confidence threshold in the target picture to the total number of pixels in the target picture; According to the magnitude relationship between the first ratio and the first preset ratio threshold, the fuzzy label information is determined; the fuzzy label information is used to indicate whether the fuzzy degree of the target picture is normal; and/or, the target image is The picture is used as the input of the first preset classification model, and according to the output category of the first preset classification model, the shaking label information is determined; the first preset classification model is used for classifying pictures according to whether the shaking degree is normal; and/ Or, taking the target picture as the input of the second preset classification model, and determining the exposure label information according to the output category of the second preset classification model; the second preset classification model is used to determine whether the exposure level is normal or not Perform picture classification; and/or, determine the second ratio of the second number of pixels whose color values exceed the preset color value range in the target picture to the total number of pixels; according to the second ratio and the second predetermined The magnitude relationship between the ratio thresholds is set to determine the color difference label information; the color difference label information is used to represent whether the color change degree of the target picture is normal.
可选的,所述处理器302,还用于:确定所述视频片段中第一目标图片的第三数量;所述第一目标图片为目标信息表征图片参数不满足所述预设要求且连续出现的目标图片;若所述第三数量大于第二预设数量阈值,则剔除n个所述第一目标图片;所述n小于所述第三数量。可选的,所述处理器302,还用于:为剩余的所述第一目标图片添加图片转场效果;其中,所述图片转场效果包括右出左进、旋出旋入以及淡入淡出中的一种或多种。可选的,所述处理器302,还用于:确定所述视频片段中包含的第二目标图片的第四数量与所述视频片段中图片的总数量的第三比值;所述 第二目标图片为目标信息表征图片参数不满足所述预设要求的目标图片;确定第三预设比值阈值及第三预设数量阈值;若所述第三比值大于所述第三预设比值阈值,和/或,所述第四数量大于所述第三预设数量阈值,则丢弃所述视频片段。Optionally, the processor 302 is further configured to: determine a third number of first target pictures in the video clip; the first target picture is target information indicating that the picture parameters do not meet the preset requirements and are continuous The target pictures that appear; if the third number is greater than the second preset number threshold, remove n of the first target pictures; the n is less than the third number. Optionally, the processor 302 is further configured to: add a picture transition effect to the remaining first target picture; wherein, the picture transition effect includes right-out, left-in, spin-out, spin-in, and fade-in and fade-out one or more of. Optionally, the processor 302 is further configured to: determine a third ratio between the fourth number of second target pictures included in the video clip and the total number of pictures in the video clip; the second target The picture is a target picture whose target information indicates that the picture parameters do not meet the preset requirements; determine a third preset ratio threshold and a third preset number threshold; if the third ratio is greater than the third preset ratio threshold, and /or, if the fourth quantity is greater than the third preset quantity threshold, the video clip is discarded.
可选的,所述处理器302,还用于:根据所述待处理视频对应的视频剪辑模板,确定所述视频剪辑模板对应的所述第三预设比值阈值及所述第三预设数量阈值;其中,不同的视频剪辑模板对应的所述第三预设比值阈值不同,不同的视频剪辑模板对应的所述第三预设数量阈值不同,所述第三预设比值阈值及所述第三预设数量阈值与所述视频剪辑模板对应视频内容类型相关。可选的,所述处理器302,还用于:确定所有图片参数均不满足所述预设要求的第三目标图片;删除所述第三目标图片;其中,所述预设要求包括所述模糊程度正常、抖动程度正常、曝光程度正常及色彩变化程度正常中的一种或多种。可选的,所述处理器302,还用于:对所述筛选处理后的视频片段进行合并,得到所述目标视频。可选的,所述处理器302,还用于:对于待处理视频中的目标图片,确定所述目标图片的图片参数对应的目标信息之前,接收用户对可选视频的选择操作;将所述选择操作选中的可选视频确定为所述待处理视频;所述可选视频为电子设备中存储的视频;根据筛选处理后的待处理视频,生成目标视频之后,向所述用户显示所述目标视频。Optionally, the processor 302 is further configured to: determine, according to the video clip template corresponding to the video to be processed, the third preset ratio threshold and the third preset number corresponding to the video clip template threshold; wherein the third preset ratio threshold corresponding to different video clip templates is different, the third preset number threshold corresponding to different video clip templates is different, the third preset ratio threshold and the third preset ratio threshold The three preset quantity thresholds are related to the video content type corresponding to the video clip template. Optionally, the processor 302 is further configured to: determine a third target picture whose all picture parameters do not meet the preset requirements; delete the third target picture; wherein the preset requirements include the One or more of normal blur, normal jitter, normal exposure, and normal color variation. Optionally, the processor 302 is further configured to: combine the screened and processed video clips to obtain the target video. Optionally, the processor 302 is further configured to: for the target picture in the video to be processed, before determining the target information corresponding to the picture parameter of the target picture, receive the user's selection operation on the optional video; The optional video selected by the selection operation is determined to be the video to be processed; the optional video is the video stored in the electronic device; after the target video is generated according to the screened video to be processed, the target video is displayed to the user. video.
综上所述,本发明实施例提供的视频处理装置,对于待处理视频中的目标图片,确定目标图片的图片参数对应的目标信息,其中,目标信息用于表征目标图片的图片参数是否满足预设要求,根据待处理视频中包含的目标图片的目标信息,对待处理视频进行筛选处理,其中,筛选处理包括删除不满足预设要求的目标图片或该目标图片所属的其中一个视频片段,最后,根据筛选处理后的待处理视频,生成目标视频。这样,通过在进行视频处理时,根据图片的图片参数自动对视频中的图片进行筛选,进而一定程度上可以降低筛选成本,提高筛选效率。进一步地,本发明实施例还提供一种可移动设备,所述可移动设备包括视频采集设备,所述可移动设备用于通过所述视频采集设备采集待处理视频,根据上述所述的视频处理方法对所述待处理视频进行处理。可选的,所述可移动设备为无人机和/或无人车。进一步地,本发明实施例还提供一种计算机可读存储介质,所述计算机可读存储介质上存储计算机程序,所述计算机程序被处理器执行时实现上述视频处理方法中的各个步骤,且能达到相同的技术效果,为避免重复,这里不再赘述。以上所描述的装置实施例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。本领域普通技术人员在不付出创造性的劳动 的情况下,即可以理解并实施。本发明的各个部件实施例可以以硬件实现,或者以在一个或者多个处理器上运行的软件模块实现,或者以它们的组合实现。本领域的技术人员应当理解,可以在实践中使用微处理器或者数字信号处理器来实现根据本发明实施例的计算处理设备中的一些或者全部部件的一些或者全部功能。本发明还可以实现为用于执行这里所描述的方法的一部分或者全部的设备或者装置程序(例如,计算机程序和计算机程序产品)。这样的实现本发明的程序可以存储在计算机可读介质上,或者可以具有一个或者多个信号的形式。这样的信号可以从因特网网站上下载得到,或者在载体信号上提供,或者以任何其他形式提供。To sum up, the video processing apparatus provided by the embodiment of the present invention determines, for the target picture in the video to be processed, target information corresponding to the picture parameters of the target picture, wherein the target information is used to indicate whether the picture parameters of the target picture meet the predetermined requirements. Setting requirements, according to the target information of the target picture contained in the video to be processed, the video to be processed is screened, wherein the screening process includes deleting the target picture that does not meet the preset requirements or one of the video clips to which the target picture belongs, and finally, Generate a target video according to the to-be-processed video after screening. In this way, by automatically screening the pictures in the video according to the picture parameters of the pictures during the video processing, the screening cost can be reduced to a certain extent and the screening efficiency can be improved. Further, an embodiment of the present invention further provides a movable device, where the movable device includes a video capture device, and the movable device is configured to capture a video to be processed through the video capture device. According to the above video processing The method processes the video to be processed. Optionally, the movable device is a drone and/or an unmanned vehicle. Further, an embodiment of the present invention also provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, each step in the above video processing method is implemented, and can To achieve the same technical effect, in order to avoid repetition, details are not repeated here. The device embodiments described above are only illustrative, wherein the units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in One place, or it can be distributed over multiple network elements. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution in this embodiment. Those of ordinary skill in the art can understand and implement it without creative effort. Various component embodiments of the present invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof. Those skilled in the art should understand that a microprocessor or a digital signal processor may be used in practice to implement some or all of the functions of some or all of the components in the computing processing device according to the embodiments of the present invention. The present invention can also be implemented as apparatus or apparatus programs (eg, computer programs and computer program products) for performing part or all of the methods described herein. Such a program implementing the present invention may be stored on a computer-readable medium, or may be in the form of one or more signals. Such signals may be downloaded from Internet sites, or provided on carrier signals, or in any other form.
例如,图4为本发明实施例提供的一种计算处理设备的框图,如图4所示,图4示出了可以实现根据本发明的方法的计算处理设备。该计算处理设备传统上包括处理器710和以存储器720形式的计算机程序产品或者计算机可读介质。存储器720可以是诸如闪存、EEPROM(电可擦除可编程只读存储器)、EPROM、硬盘或者ROM之类的电子存储器。存储器720具有用于执行上述方法中的任何方法步骤的程序代码的存储空间730。例如,用于程序代码的存储空间730可以包括分别用于实现上面的方法中的各种步骤的各个程序代码。这些程序代码可以从一个或者多个计算机程序产品中读出或者写入到这一个或者多个计算机程序产品中。这些计算机程序产品包括诸如硬盘,紧致盘(CD)、存储卡或者软盘之类的程序代码载体。这样的计算机程序产品通常为如参考图5所述的便携式或者固定存储单元。该存储单元可以具有与图4的计算处理设备中的存储器720类似布置的存储段、存储空间等。程序代码可以例如以适当形式进行压缩。通常,存储单元包括计算机可读代码,即可以由例如诸如710之类的处理器读取的代码,这些代码当由计算处理设备运行时,导致该计算处理设备执行上面所描述的方法中的各个步骤。本说明书中的各个实施例均采用递进的方式描述,每个实施例重点说明的都是与其他实施例的不同之处,各个实施例之间相同相似的部分互相参见即可。本文中所称的“一个实施例”、“实施例”或者“一个或者多个实施例”意味着,结合实施例描述的特定特征、结构或者特性包括在本发明的至少一个实施例中。此外,请注意,这里“在一个实施例中”的词语例子不一定全指同一个实施例。在此处所提供的说明书中,说明了大量具体细节。然而,能够理解,本发明的实施例可以在没有这些具体细节的情况下被实践。在一些实例中,并未详细示出公知的方法、结构和技术,以便不模糊对本说明书的理解。在权利要求中,不应将位于括号之间的任何参考符号构造成对权利要求的限制。单词“包含”不排除存在未列在权利要求中的元件或步骤。位于元件之前的单词“一”或“一个”不排除存在多个这样的元件。本发明可以借助于包括有若干不同元件的硬件以及借助于适当编程的计算机来实现。在列举了若干装置的单元权利要求中, 这些装置中的若干个可以是通过同一个硬件项来具体体现。单词第一、第二、以及第三等的使用不表示任何顺序。可将这些单词解释为名称。For example, FIG. 4 is a block diagram of a computing processing device provided by an embodiment of the present invention. As shown in FIG. 4 , FIG. 4 shows a computing processing device that can implement the method according to the present invention. The computing processing device traditionally includes a processor 710 and a computer program product or computer readable medium in the form of a memory 720 . The memory 720 may be electronic memory such as flash memory, EEPROM (electrically erasable programmable read only memory), EPROM, hard disk, or ROM. The memory 720 has storage space 730 for program code for performing any of the method steps in the above-described methods. For example, the storage space 730 for program codes may include various program codes for implementing various steps in the above methods, respectively. These program codes can be read from or written to one or more computer program products. These computer program products include program code carriers such as hard disks, compact disks (CDs), memory cards or floppy disks. Such computer program products are typically portable or fixed storage units as described with reference to FIG. 5 . The storage unit may have storage segments, storage spaces, etc. arranged similarly to the memory 720 in the computing processing device of FIG. 4 . The program code may, for example, be compressed in a suitable form. Typically, the storage unit includes computer readable code, ie code readable by a processor such as 710 for example, which when executed by a computing processing device, causes the computing processing device to perform each of the methods described above. step. The various embodiments in this specification are described in a progressive manner, and each embodiment focuses on the differences from other embodiments, and the same and similar parts between the various embodiments may be referred to each other. Reference herein to "one embodiment," "an embodiment," or "one or more embodiments" means that a particular feature, structure, or characteristic described in connection with an embodiment is included in at least one embodiment of the present invention. Also, please note that instances of the phrase "in one embodiment" herein are not necessarily all referring to the same embodiment. In the description provided herein, numerous specific details are set forth. It will be understood, however, that embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure an understanding of this description. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention can be implemented by means of hardware comprising several different elements and by means of a suitably programmed computer. In a unit claim enumerating several means, several of these means can be embodied by one and the same item of hardware. The use of the words first, second, and third, etc. do not denote any order. These words can be interpreted as names.
最后应说明的是:以上实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present invention, but not to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that it can still be The technical solutions described in the foregoing embodiments are modified, or some technical features thereof are equivalently replaced; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (17)

  1. 一种视频处理方法,其特征在于,所述方法包括:A video processing method, characterized in that the method comprises:
    对于待处理视频中的目标图片,确定所述目标图片的图片参数对应的目标信息;所述目标信息用于表征所述目标图片的图片参数是否满足预设要求;For the target picture in the video to be processed, determine the target information corresponding to the picture parameter of the target picture; the target information is used to represent whether the picture parameter of the target picture meets the preset requirements;
    根据所述待处理视频中包含的目标图片的目标信息,对所述待处理视频进行筛选处理;其中,所述筛选处理包括删除不满足所述预设要求的目标图片或所述目标图片所属的其中一个视频片段;Screening the video to be processed according to the target information of the target picture contained in the video to be processed; wherein, the screening process includes deleting target pictures that do not meet the preset requirements or the target pictures to which the target pictures belong. one of the video clips;
    根据筛选处理后的待处理视频,生成目标视频。Generate a target video according to the to-be-processed video after screening.
  2. 根据权利要求1所述方法,其特征在于,所述图片参数包括图片的模糊程度、抖动程度、曝光程度以及色彩变化程度中的一种或多种。The method according to claim 1, wherein the picture parameters include one or more of the degree of blurring, the degree of shaking, the degree of exposure, and the degree of color change of the picture.
  3. 根据权利要求1或2所述的方法,其特征在于,所述根据所述待处理视频中包含的目标图片的目标信息,对所述待处理视频进行筛选处理,包括:The method according to claim 1 or 2, characterized in that, performing screening processing on the to-be-processed video according to target information of a target picture included in the to-be-processed video comprises:
    对所述待处理视频进行分割,得到多个视频片段;segmenting the to-be-processed video to obtain a plurality of video clips;
    对于各个所述视频频段,根据所述视频片段中包含的目标图片的目标信息,对所述视频片段进行筛选处理。For each of the video segments, the video segment is screened according to the target information of the target picture contained in the video segment.
  4. 根据权利要求1至3任一所述的方法,其特征在于,所述对于待处理视频中的目标图片,确定所述目标图片的图片参数对应的目标信息之前,所述方法还包括:根据所述待处理视频中包含的图片,确定所述目标图片;从所述待处理视频中提取所述目标图片,并为各个所述目标图片添加时间戳信息;所述时间戳信息用于表征所述目标图片在所述目标图片中的次序;The method according to any one of claims 1 to 3, wherein, for the target picture in the video to be processed, before determining the target information corresponding to the picture parameter of the target picture, the method further comprises: according to the The picture contained in the video to be processed is determined, and the target picture is determined; the target picture is extracted from the video to be processed, and timestamp information is added to each of the target pictures; the timestamp information is used to represent the the order of the target picture in the target picture;
    所述对于待处理视频中的目标图片,确定所述目标图片的图片参数对应的目标信息之后,所述方法还包括:根据所述目标图片的时间戳信息,将所述目标图片合入所述待处理视频。For the target picture in the video to be processed, after determining the target information corresponding to the picture parameter of the target picture, the method further includes: combining the target picture into the target picture according to the timestamp information of the target picture. Video to be processed.
  5. 根据权利要求4所述的方法,其特征在于,所述根据所述待处理视频中包含的图片,确定所述目标图片,包括:The method according to claim 4, wherein the determining the target picture according to the picture included in the to-be-processed video comprises:
    在所述待处理视频中包含的图片的总数量大于第一预设数量阈值的情况下,按照预设帧率从所述待处理视频中选取m帧图片,作为所述目标图片;所述m不大于所述第一预设数量阈值;When the total number of pictures included in the video to be processed is greater than the first preset number threshold, select m frames of pictures from the video to be processed according to a preset frame rate as the target picture; the m not greater than the first preset number threshold;
    在所述总数量不大于所述第一预设数量阈值的情况下,将所述待处理视频中的包含的所有图片,作为所述目标图片。Under the condition that the total number is not greater than the first preset number threshold, all pictures included in the video to be processed are used as the target picture.
  6. 根据权利要求1或2所述的方法,其特征在于,所述对于待处理视频中的目标图片,确定所述目标图片的图片参数对应的目标信息,包括:The method according to claim 1 or 2, wherein, for the target picture in the video to be processed, determining the target information corresponding to the picture parameter of the target picture, comprising:
    确定所述目标图片中模糊置信度大于预设置信度阈值的像素点的第一数量与所述目 标图片的总像素数量的第一比值;根据所述第一比值与第一预设比值阈值之间的大小关系,确定模糊标签信息;所述模糊标签信息用于表征所述目标图片的模糊程度是否正常;Determine the first ratio of the first number of pixels whose fuzzy confidence is greater than a preset reliability threshold in the target picture to the total number of pixels in the target picture; according to the ratio between the first ratio and the first preset ratio threshold The size relationship between the two is to determine the fuzzy label information; the fuzzy label information is used to represent whether the fuzzy degree of the target picture is normal;
    和/或,将所述目标图片作为第一预设分类模型的输入,根据所述第一预设分类模型的输出类别,确定抖动标签信息;所述第一预设分类模型用于按照抖动程度是否正常进行图片分类;And/or, taking the target picture as the input of the first preset classification model, and determining the jitter label information according to the output category of the first preset classification model; the first preset classification model is used to determine the jitter label information according to the degree of jitter Whether the image classification is performed normally;
    和/或,将所述目标图片作为第二预设分类模型的输入,根据所述第二预设分类模型的输出类别,确定曝光标签信息;所述第二预设分类模型用于按照曝光程度是否正常进行图片分类;And/or, the target picture is used as the input of the second preset classification model, and the exposure label information is determined according to the output category of the second preset classification model; the second preset classification model is used for according to the exposure degree. Whether image classification is performed normally;
    和/或,确定所述目标图片中颜色值超出预设颜色值范围的像素点的第二数量与所述总像素数量的第二比值;根据所述第二比值与第二预设比值阈值之间的大小关系,确定色彩差标签信息;所述色差标签信息用于表征所述目标图片的色彩变化程度是否正常。And/or, determining the second ratio between the second number of pixels whose color value exceeds the preset color value range in the target picture and the total number of pixels; according to the ratio between the second ratio and the second preset ratio threshold; The size relationship between the two is determined, and the color difference label information is determined; the color difference label information is used to represent whether the color change degree of the target picture is normal.
  7. 根据权利要求3所述的方法,其特征在于,所述根据所述视频片段中包含的目标图片的目标信息,对所述视频片段进行筛选处理,包括:The method according to claim 3, wherein the filtering of the video clip according to the target information of the target picture included in the video clip comprises:
    确定所述视频片段中第一目标图片的第三数量;所述第一目标图片为目标信息表征图片参数不满足所述预设要求且连续出现的目标图片;determining the third quantity of the first target picture in the video clip; the first target picture is the target picture whose target information indicates that the picture parameter does not meet the preset requirement and appears continuously;
    若所述第三数量大于第二预设数量阈值,则剔除n个所述第一目标图片;所述n小于所述第三数量。If the third number is greater than the second preset number threshold, remove n of the first target pictures; the n is less than the third number.
  8. 根据权利要求7所述的方法,其特征在于,所述剔除n个所述第一目标图片之后,所述方法还包括:The method according to claim 7, wherein after removing the n first target pictures, the method further comprises:
    为剩余的所述第一目标图片添加图片转场效果;adding a picture transition effect to the remaining first target pictures;
    其中,所述图片转场效果包括右出左进、旋出旋入以及淡入淡出中的一种或多种。Wherein, the picture transition effects include one or more of right-out, left-in, spin-out and spin-in, and fade-in and fade-out.
  9. 根据权利要求7所述的方法,其特征在于,所述方法还包括:The method according to claim 7, wherein the method further comprises:
    确定所述视频片段中包含的第二目标图片的第四数量与所述视频片段中图片的总数量的第三比值;所述第二目标图片为目标信息表征图片参数不满足所述预设要求的目标图片;Determine the third ratio of the fourth number of second target pictures included in the video clip to the total number of pictures in the video clip; the second target picture is target information indicating that the picture parameters do not meet the preset requirements the target image;
    确定第三预设比值阈值及第三预设数量阈值;determining a third preset ratio threshold and a third preset number threshold;
    若所述第三比值大于所述第三预设比值阈值,和/或,所述第四数量大于所述第三预设数量阈值,则丢弃所述视频片段。If the third ratio is greater than the third preset ratio threshold, and/or the fourth number is greater than the third preset number threshold, discarding the video segment.
  10. 根据权利要求9所述的方法,其特征在于,所述确定第三预设比值阈值及第三预设数量阈值,包括:The method according to claim 9, wherein the determining the third preset ratio threshold and the third preset quantity threshold comprises:
    根据所述待处理视频对应的视频剪辑模板,确定所述视频剪辑模板对应的所述第三预设比值阈值及所述第三预设数量阈值;According to the video clip template corresponding to the video to be processed, determine the third preset ratio threshold and the third preset number threshold corresponding to the video clip template;
    其中,不同的视频剪辑模板对应的所述第三预设比值阈值不同,不同的视频剪辑模 板对应的所述第三预设数量阈值不同,所述第三预设比值阈值及所述第三预设数量阈值与所述视频剪辑模板对应视频内容类型相关。The third preset ratio threshold corresponding to different video clip templates is different, the third preset number threshold corresponding to different video clip templates is different, the third preset ratio threshold and the third preset ratio threshold are different. The number threshold is set to be related to the video content type corresponding to the video clip template.
  11. 根据权利要求4所述的方法,其特征在于,所述根据所述目标图片的时间戳信息,将所述目标图片合入所述待处理视频之前,所述方法还包括:The method according to claim 4, wherein, before incorporating the target picture into the to-be-processed video according to the timestamp information of the target picture, the method further comprises:
    确定所有图片参数均不满足所述预设要求的第三目标图片;determining a third target picture for which all picture parameters do not meet the preset requirements;
    删除所述第三目标图片;delete the third target picture;
    其中,所述预设要求包括所述模糊程度正常、抖动程度正常、曝光程度正常及色彩变化程度正常中的一种或多种。Wherein, the preset requirements include one or more of the normal blur degree, normal jitter degree, normal exposure degree and normal color change degree.
  12. 根据权利要求3所述的方法,其特征在于,所述根据筛选处理后的待处理视频,生成目标视频,包括:The method according to claim 3, wherein generating the target video according to the to-be-processed video after screening and processing comprises:
    对所述筛选处理后的视频片段进行合并,得到所述目标视频。The target video is obtained by merging the screened video clips.
  13. 根据权利要求1所述的方法,其特征在于,所述对于待处理视频中的目标图片,确定所述目标图片的图片参数对应的目标信息之前,所述方法还包括:接收用户对可选视频的选择操作;将所述选择操作选中的可选视频确定为所述待处理视频;所述可选视频为电子设备中存储的视频;The method according to claim 1, wherein, for the target picture in the to-be-processed video, before determining the target information corresponding to the picture parameter of the target picture, the method further comprises: receiving user feedback on the optional video the selection operation; determine the optional video selected by the selection operation as the to-be-processed video; the optional video is the video stored in the electronic device;
    所述根据筛选处理后的待处理视频,生成目标视频之后,所述方法还包括:向所述用户显示所述目标视频。After generating the target video according to the screened video to be processed, the method further includes: displaying the target video to the user.
  14. 一种视频处理装置,其特征在于,所述装置包括存储器和处理器;A video processing device, characterized in that the device includes a memory and a processor;
    所述存储器,用于存储程序代码;the memory for storing program codes;
    所述处理器,调用所述程序代码,当所述程序代码被执行时,用于执行以下操作:The processor calls the program code, and when the program code is executed, is configured to perform the following operations:
    对于待处理视频中的目标图片,确定所述目标图片的图片参数对应的目标信息;所述目标信息用于表征所述目标图片的图片参数是否满足预设要求;For the target picture in the video to be processed, determine the target information corresponding to the picture parameter of the target picture; the target information is used to represent whether the picture parameter of the target picture meets the preset requirements;
    根据所述待处理视频中包含的目标图片的目标信息,对所述待处理视频进行筛选处理;其中,所述筛选处理包括删除不满足所述预设要求的目标图片或所述目标图片所属的其中一个视频片段;Screening the to-be-processed video according to the target information of the target picture contained in the to-be-processed video; wherein the screening process includes deleting the target picture that does not meet the preset requirements or the target picture to which the target picture belongs. one of the video clips;
    根据筛选处理后的待处理视频,生成目标视频。Generate a target video according to the to-be-processed video after screening.
  15. 一种可移动设备,其特征在于,所述可移动设备包括视频采集设备,所述可移动设备用于通过所述视频采集设备采集待处理视频,根据权利要求1至13中任一项所述的视频处理方法对所述待处理视频进行处理。A movable device, characterized in that the movable device includes a video capture device, and the movable device is configured to capture a video to be processed through the video capture device, according to any one of claims 1 to 13 The video processing method processes the to-be-processed video.
  16. 根据权利要求14所述方法,其特征在于,所述可移动设备为无人机和/或无人车。The method according to claim 14, wherein the movable device is an unmanned aerial vehicle and/or an unmanned vehicle.
  17. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质上存储计算机程序,所述计算机程序被处理器执行时实现以下操作:A computer-readable storage medium, characterized in that a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the following operations are implemented:
    对于待处理视频中的目标图片,确定所述目标图片的图片参数对应的目标信息;所述目标信息用于表征所述目标图片的图片参数是否满足预设要求;For the target picture in the video to be processed, determine the target information corresponding to the picture parameter of the target picture; the target information is used to represent whether the picture parameter of the target picture meets the preset requirements;
    根据所述待处理视频中包含的目标图片的目标信息,对所述待处理视频进行筛选处理;其中,所述筛选处理包括删除不满足所述预设要求的目标图片或所述目标图片所属的其中一个视频片段;Screening the to-be-processed video according to the target information of the target picture contained in the to-be-processed video; wherein the screening process includes deleting the target picture that does not meet the preset requirements or the target picture to which the target picture belongs. one of the video clips;
    根据筛选处理后的待处理视频,生成目标视频。Generate a target video according to the to-be-processed video after screening.
PCT/CN2020/123998 2020-10-27 2020-10-27 Video processing method and apparatus, mobile device, and readable storage medium WO2022087826A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202080044431.1A CN114026874A (en) 2020-10-27 2020-10-27 Video processing method and device, mobile device and readable storage medium
PCT/CN2020/123998 WO2022087826A1 (en) 2020-10-27 2020-10-27 Video processing method and apparatus, mobile device, and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2020/123998 WO2022087826A1 (en) 2020-10-27 2020-10-27 Video processing method and apparatus, mobile device, and readable storage medium

Publications (1)

Publication Number Publication Date
WO2022087826A1 true WO2022087826A1 (en) 2022-05-05

Family

ID=80053986

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/123998 WO2022087826A1 (en) 2020-10-27 2020-10-27 Video processing method and apparatus, mobile device, and readable storage medium

Country Status (2)

Country Link
CN (1) CN114026874A (en)
WO (1) WO2022087826A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114598925B (en) * 2022-03-18 2023-10-20 脸萌有限公司 Video editing method, device, equipment and storage medium
CN114446331B (en) * 2022-04-07 2022-06-24 深圳爱卓软科技有限公司 Video editing software system capable of rapidly cutting video
CN114979705A (en) * 2022-04-12 2022-08-30 杭州电子科技大学 Automatic editing method based on deep learning, self-attention mechanism and symbolic reasoning
CN116433538A (en) * 2023-06-15 2023-07-14 加之创(厦门)科技有限公司 Image processing method, medium and device for video image health monitoring

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105704398A (en) * 2016-03-11 2016-06-22 咸阳师范学院 Video processing method
CN107977463A (en) * 2017-12-21 2018-05-01 广东欧珀移动通信有限公司 image processing method, device, storage medium and terminal
CN110650367A (en) * 2019-08-30 2020-01-03 维沃移动通信有限公司 Video processing method, electronic device, and medium

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5332773B2 (en) * 2009-03-18 2013-11-06 ソニー株式会社 Image processing apparatus and method
EP3364342A1 (en) * 2017-02-17 2018-08-22 Cogisen SRL Method for image processing and video compression
CN109862394A (en) * 2019-03-27 2019-06-07 北京周同科技有限公司 Checking method, device, equipment and the storage medium of video content
CN110290320B (en) * 2019-06-27 2021-01-22 Oppo广东移动通信有限公司 Video preview generation method and device, electronic equipment and computer-readable storage medium
CN110929070A (en) * 2019-12-09 2020-03-27 北京字节跳动网络技术有限公司 Image processing method, image processing device, electronic equipment and storage medium
CN111090778B (en) * 2019-12-26 2023-06-27 北京百度网讯科技有限公司 Picture generation method, device, equipment and storage medium
CN111143613B (en) * 2019-12-30 2024-02-06 携程计算机技术(上海)有限公司 Method, system, electronic device and storage medium for selecting video cover
CN111356016B (en) * 2020-03-11 2022-04-22 北京小米松果电子有限公司 Video processing method, video processing apparatus, and storage medium
CN111416950B (en) * 2020-03-26 2023-11-28 腾讯科技(深圳)有限公司 Video processing method and device, storage medium and electronic equipment

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105704398A (en) * 2016-03-11 2016-06-22 咸阳师范学院 Video processing method
CN107977463A (en) * 2017-12-21 2018-05-01 广东欧珀移动通信有限公司 image processing method, device, storage medium and terminal
CN110650367A (en) * 2019-08-30 2020-01-03 维沃移动通信有限公司 Video processing method, electronic device, and medium

Also Published As

Publication number Publication date
CN114026874A (en) 2022-02-08

Similar Documents

Publication Publication Date Title
WO2022087826A1 (en) Video processing method and apparatus, mobile device, and readable storage medium
US11093754B2 (en) Method, system and apparatus for selecting frames of a video sequence
US10706892B2 (en) Method and apparatus for finding and using video portions that are relevant to adjacent still images
CN111327945B (en) Method and apparatus for segmenting video
US8379154B2 (en) Key-frame extraction from video
US8195038B2 (en) Brief and high-interest video summary generation
US8818037B2 (en) Video scene detection
US9275683B1 (en) Systems and methods for identifying a scene-change/non-scene-change transition between frames
WO2021017406A1 (en) Video clip extraction method and apparatus, device and storage medium
CN107430780B (en) Method for output creation based on video content characteristics
CA3039239C (en) Conformance of media content to original camera source using optical character recognition
US20110255844A1 (en) System and method for parsing a video sequence
KR101709085B1 (en) Shot Boundary Detection method and apparatus using Convolutional Neural Networks
US20220172476A1 (en) Video similarity detection method, apparatus, and device
CN110996183B (en) Video abstract generation method, device, terminal and storage medium
CN114302226B (en) Intelligent cutting method for video picture
Husa et al. Automatic thumbnail selection for soccer videos using machine learning
CN108985244B (en) Television program type identification method and device
CN101304483A (en) Method and apparatus for image processing by using stored image
US10923154B2 (en) Systems and methods for determining highlight segment sets
JP2005167377A (en) Motion picture editor and motion picture editing method
Tsao et al. Thumbnail image selection for VOD services
CN113542909A (en) Video processing method and device, electronic equipment and computer storage medium
CN113762016A (en) Key frame selection method and device
CN111797912B (en) System and method for identifying film age type and construction method of identification model

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20958991

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20958991

Country of ref document: EP

Kind code of ref document: A1