WO2022156294A1

WO2022156294A1 - Video processing method and apparatus, computer readable storage medium, and electronic device

Info

Publication number: WO2022156294A1
Application number: PCT/CN2021/126446
Authority: WO
Inventors: 成云峰; 杨太任
Original assignee: Oppo广东移动通信有限公司
Priority date: 2021-01-21
Filing date: 2021-10-26
Publication date: 2022-07-28
Also published as: CN114827713A; CN114827713B

Abstract

A video processing method, a video processing apparatus, a computer readable storage medium, and an electronic device, which relate to the technical field of video processing. The video processing method comprises: when a first event occurs in a video, starting a video interception task (S70); within a predetermined duration after the first event ends, determining whether a second event occurs in the video (S72); if the second event occurs, then, within a predetermined duration after the second event ends, determining whether a third event occurs in the video (S74); if the third event occurs, using the third event as the second event (S76); and if the second event or the third event does not occur, ending the video interception task so as to determine an intercepted video clip (S78). At least two among the first event, the second event, and the third event are associated events. According to said method, a video clip covering a plurality of associated events can be intercepted from a video, and the video clip is continuous, thus ensuring that the video clip viewed by a user is continuous and the events are complete. (FIG. 7)

Description

Video processing method and apparatus, computer-readable storage medium and electronic device

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority to Chinese patent application No. 202110082809.3, titled "video processing method and device, computer-readable storage medium and electronic equipment" and filed on January 21, 2021, the entire content of which is incorporated herein by reference.

TECHNICAL FIELD

The present disclosure relates to the technical field of video processing, and in particular, to a video processing method, a video processing apparatus, a computer-readable storage medium, and an electronic device.

BACKGROUND

As an important way to transmit information, video has been widely used in many fields such as monitoring, education, entertainment, medical care, and intelligent driving.

Videos often contain content that users do not care about, and this content may account for a large proportion of the video, which degrades the user's viewing experience and increases storage pressure. Some solutions for clipping video currently exist. However, these solutions may lose information that the user cares about or produce poor clipping results.

SUMMARY OF THE INVENTION

According to a first aspect of the present disclosure, a video processing method is provided, including: when a first event occurs in a video, starting a video capture task; within a predetermined time period after the first event ends, determining whether a second event occurs in the video; if the second event occurs, then within a predetermined time period after the second event ends, determining whether a third event occurs in the video; if the third event occurs, using the third event as the second event; and if the second event or the third event does not occur, ending the video capture task to determine the clipped video segment; wherein at least two of the first event, the second event and the third event are mutually associated events.

According to a second aspect of the present disclosure, there is provided a video processing method, comprising: starting a video capture task when a first event occurs in a video; if no event associated with the first event occurs within a predetermined time period after the end of the first event, ending the video capture task to determine the clipped video segment; and if a second event associated with the first event occurs within the predetermined time period after the first event ends, and no event associated with the first event occurs within a predetermined time period after the second event ends, ending the video capture task to determine the clipped video segment.

According to a third aspect of the present disclosure, there is provided a video processing apparatus, comprising: a task initiation module for initiating a video capture task when a first event occurs in a video; an event determination module for determining, within a predetermined duration after the first event ends, whether a second event occurs in the video; if the second event occurs, determining, within a predetermined duration after the second event ends, whether a third event occurs in the video; and if the third event occurs, using the third event as the second event; and a first video clipping module for ending the video capture task if the second event or the third event does not occur, so as to determine the clipped video segment; wherein at least two of the first event, the second event and the third event are mutually associated events.

According to a fourth aspect of the present disclosure, there is provided a video processing apparatus, comprising: a task initiation module for initiating a video capture task when a first event occurs in a video; a second video clipping module for ending the video capture task to determine the clipped video segment if no event associated with the first event occurs within a predetermined time period after the first event ends; and a third video clipping module for ending the video capture task to determine the clipped video segment if a second event associated with the first event occurs within the predetermined time period after the first event ends and no event associated with the first event occurs within a predetermined time period after the second event ends.

According to a fifth aspect of the present disclosure, there is provided a computer-readable storage medium on which a computer program is stored, and the program, when executed by a processor, implements the above-mentioned video processing method.

According to a sixth aspect of the present disclosure, there is provided an electronic device including a processor and a memory for storing one or more programs, wherein the one or more programs, when executed by the processor, enable the processor to implement the above-mentioned video processing method.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows a schematic diagram of a video containing user movement events in some technologies;

FIG. 2 shows a schematic diagram of fixed-duration interception applied to the video of FIG. 1;

FIG. 3 shows a schematic diagram of another example of fixed-duration interception;

FIG. 4 shows a schematic diagram of a video including user movement events in other technologies;

FIG. 5 shows a schematic diagram of an exemplary system architecture of a video processing solution according to an embodiment of the present disclosure;

FIG. 6 shows a schematic structural diagram of an electronic device suitable for implementing an embodiment of the present disclosure;

FIG. 7 schematically shows a flowchart of a video processing method according to an exemplary embodiment of the present disclosure;

FIG. 8 schematically shows a flowchart of the entire process of the video processing solution according to an embodiment of the present disclosure;

FIG. 9 schematically shows a flowchart of a solution for participating in video capture by the cloud according to another embodiment of the present disclosure;

FIG. 10 schematically shows a flowchart of a video processing method according to another exemplary embodiment of the present disclosure;

FIG. 11 schematically shows a block diagram of a video processing apparatus according to an exemplary embodiment of the present disclosure;

FIG. 12 schematically shows a block diagram of a video processing apparatus according to another exemplary embodiment of the present disclosure;

FIG. 13 schematically shows a block diagram of a video processing apparatus according to yet another exemplary embodiment of the present disclosure;

FIG. 14 schematically shows a block diagram of a video processing apparatus according to still another exemplary embodiment of the present disclosure.

DETAILED DESCRIPTION

Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments, however, can be embodied in various forms and should not be construed as limited to the examples set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of example embodiments to those skilled in the art. The described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are provided in order to give a thorough understanding of the embodiments of the present disclosure. However, those skilled in the art will appreciate that the technical solutions of the present disclosure may be practiced without one or more of the specific details, or other methods, components, devices, steps, etc. may be employed. In other instances, well-known solutions have not been shown or described in detail to avoid obscuring aspects of the present disclosure.

Furthermore, the drawings are merely schematic illustrations of the present disclosure and are not necessarily drawn to scale. The same reference numerals in the drawings denote the same or similar parts, and thus their repeated descriptions will be omitted. Some of the block diagrams shown in the figures are functional entities that do not necessarily correspond to physically or logically separate entities. These functional entities may be implemented in software, or in one or more hardware modules or integrated circuits, or in different networks and/or processor devices and/or microcontroller devices.

The flow charts shown in the figures are merely illustrative and do not necessarily include all steps. For example, some steps can be decomposed, and some steps can be combined or partially combined, so the actual execution order may be changed according to the actual situation. In addition, the terms "first", "second" and "third" below are used only for the purpose of distinction and should not be construed as limiting the present disclosure; the examples, implementations and specific technical features of the present application can be combined with each other where no conflict arises.

Figure 1 shows a schematic diagram of a video involving user movement events in some technologies. Referring to FIG. 1 , during the 1 minute from 13:00:00 to 13:01:00, user movement appears in the video.

According to some technical solutions of the present disclosure, the video shown in FIG. 1 can be intercepted by using a video interception method of intercepting with a fixed duration, so as to obtain a video picture of the user moving. For example, referring to FIG. 2 , the fixed duration is 5 minutes. In this case, a 5-minute video clip from 13:00:00 to 13:05:00 can be captured.

However, there is no user movement event during the 4 minutes from 13:01:00 to 13:05:00 or after that. If these 4 minutes are also intercepted, storage space is wasted and the user's time is wasted when watching, resulting in a poor experience.

In addition, a mismatch between the fixed interception duration and the event occurrence duration can cause another result. For example, referring to FIG. 3 , from 13:00:00 to 13:06:00 for a total of 6 minutes, the event of the user moving appears in the video, and the fixed duration is configured to be 5 minutes. In this case, the 1-minute video clip from 13:05:00 to 13:06:00 cannot be captured, resulting in incomplete user movement events and missing event information.
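The two mismatch cases of FIG. 2 and FIG. 3 can be checked with a small calculation. The following is only an illustrative sketch; the function name `fixed_duration_mismatch` and the use of seconds as the unit are assumptions made for the example and do not appear in the disclosure.

```python
def fixed_duration_mismatch(event_seconds, fixed_seconds):
    """Return (wasted, missing) seconds when a clip of fixed length is cut
    starting at the beginning of an event of the given length."""
    wasted = max(0, fixed_seconds - event_seconds)   # clip footage with no event
    missing = max(0, event_seconds - fixed_seconds)  # event footage left out of the clip
    return wasted, missing


# FIG. 2 case: 1-minute event, 5-minute fixed clip
print(fixed_duration_mismatch(60, 300))   # (240, 0): 4 minutes wasted
# FIG. 3 case: 6-minute event, 5-minute fixed clip
print(fixed_duration_mismatch(360, 300))  # (0, 60): 1 minute missing
```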

FIG. 4 shows a schematic diagram of a video that includes user movement events in other technologies. Referring to FIG. 4, from 13:00:00 to 13:00:50, there are two events, user movement 1 and user movement 2, which occur from 13:00:00 to 13:00:10 and from 13:00:30 to 13:00:50, respectively.

With some solutions of the present disclosure, the video clips corresponding to the user movement 1 and the user movement 2 can be extracted separately, and then combined to obtain the clipped video clips.

However, on the one hand, such merging will cause discontinuity of the finally obtained video clips, which affects the viewing of users; on the other hand, the processing of splicing between video clips is complicated and difficult to implement.

In view of this, the present disclosure provides a new video processing solution.

FIG. 5 shows a schematic diagram of an exemplary system architecture of a video processing solution according to an embodiment of the present disclosure.

As shown in FIG. 5 , the system architecture may include a terminal device 51 and a cloud 53 . The terminal device 51 and the cloud 53 may be connected through a network, and the network may include various connection types, such as wired, wireless communication links, or optical fiber cables, and so on.

The terminal device 51 can interact with the cloud 53 through the network to receive or send messages and the like. The terminal device 51 may be a mobile phone, a tablet computer, a smart wearable device, a personal computer, various video surveillance devices (doorbell, camera), and the like. In different scenarios, the terminal device may also be referred to as a terminal, a mobile terminal, a smart terminal, and the like. In addition, the cloud 53 may be a single server or a server cluster composed of multiple servers, and the cloud 53 may also be referred to as a cloud server or a server.

In some instances where the terminal device 51 performs the video processing solution of the present disclosure, the terminal device 51 may initiate a video capture task when a first event occurs in the video. Within a predetermined time period after the end of the first event, it is determined whether the second event occurs in the video. If the second event occurs, within a predetermined period of time after the end of the second event, it is determined whether the third event occurs in the video. If the third event occurs, the third event is regarded as the second event, and the terminal device continues to determine whether a third event occurs within a new predetermined time period, so that the process is executed in a loop. If the terminal device 51 determines that the second event or the third event does not occur, the video capture task is ended to determine the clipped video segment. At least two of the first event, the second event and the third event are mutually associated events. More specifically, the first event, the second event and the third event may all be associated with one another, or the first event may be associated with the second event and with the third event, respectively. It should be noted that an associated event may be the same event or a related event, and related events may be user-defined or preset by the system; for example, a fall event and a crying event may be set as associated events.
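The notion of associated events described above (the same event, or a related event configured by the user or the system) can be sketched as a simple lookup. This is a minimal illustration only; the event labels, the `RELATED` table and the function `are_associated` are hypothetical names introduced for the example.

```python
# Hypothetical table of related event pairs; in practice this could be
# user-defined or preset by the system, as described above.
RELATED = {frozenset({"fall", "crying"})}


def are_associated(event_a, event_b, related=RELATED):
    """Two events are associated if they are the same event type or if the
    pair has been configured as related (order does not matter)."""
    return event_a == event_b or frozenset({event_a, event_b}) in related


print(are_associated("fall", "crying"))  # True: configured as related
print(are_associated("fall", "fall"))    # True: same event
print(are_associated("fall", "face"))    # False: not configured
```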

In one embodiment, the terminal device 51 can remove the video clip of the last predetermined duration from the clipped video clip to generate the target video clip, and further, can upload the target video clip to the cloud 53 for storage. It can be understood that the target video clip can also be stored locally (that is, on the device that performs the video capture task, such as a camera or a mobile phone) or on other devices (that is, devices connected to the local device); for example, it may be stored in the memory of another device such as a TV or a mobile phone by means of wireless or cable transmission.

It can be understood that the terminal device 51 can transmit the clipped video clips to the designated device, so that the designated device can remove the video clips of the last predetermined duration from the clipped video clips to generate the target video clips. The specified device may be other devices than the terminal device 51, such as a cloud server, a mobile phone, a TV, and the like.

In another embodiment, the terminal device 51 may upload the clipped video clip to the cloud 53. The cloud 53 may, in response to a video acquisition request corresponding to the video clip, remove the video clip of the last predetermined duration from the video clip, generate the target video clip, and send the target video clip to the requesting end that initiated the request. The requesting end may be the terminal device 51 or another device, which is not limited in the present disclosure.

In addition, after receiving the clipped video clip sent by the terminal device 51, the cloud 53 can immediately remove the video clip of the last predetermined duration from the video clip, and generate and store the target video clip, so that when the cloud 53 receives the above video acquisition request, it sends the target video clip to the requesting end.
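The removal of the last predetermined duration can be illustrated as follows. Because the capture task only ends after a full waiting window has passed without a further event, the tail of the clipped video clip contains no event. The sketch below assumes clips are represented as (start, end) offsets in seconds and assumes a predetermined duration of 30 seconds; both are illustrative choices, and `trim_trailing_wait` is a hypothetical name.

```python
def trim_trailing_wait(clip_start, clip_end, predetermined):
    """Remove the last `predetermined` seconds (the event-free waiting
    window) from a clip and return the (start, end) of the target clip."""
    target_end = max(clip_start, clip_end - predetermined)  # never before the start
    return clip_start, target_end


# Events of FIG. 4 expressed as offsets from 13:00:00: the last event ends at
# 50 s, so with an assumed 30 s waiting window the capture task ends at 80 s.
print(trim_trailing_wait(0, 80, 30))  # (0, 50): target clip ends with the last event
```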

In other examples in which the terminal device 51 performs the video processing solution of the present disclosure, the terminal device 51 may start a video capture task when the first event occurs in the video. If no event associated with the first event occurs within a predetermined time period after the end of the first event, the video capture task is ended to determine the clipped video segment. If a second event associated with the first event occurs within a predetermined time period after the end of the first event, and no event associated with the first event occurs within a predetermined period of time after the end of the second event, the video capture task is ended to determine the clipped video segment.

In some instances where the video processing scheme of the present disclosure is performed by the cloud 53 , the cloud 53 may receive video data from the terminal device 51 . Subsequently, the cloud 53 may analyze the video data, and start the video capture task when the first event occurs in the video. Within a predetermined time period after the end of the first event, it is determined whether the second event occurs in the video. If the second event occurs, within a predetermined period of time after the end of the second event, it is determined whether the third event occurs in the video. If the third event occurs, the third event is regarded as the second event, and the loop process is executed. If the cloud 53 determines that the second event or the third event does not occur, the video clipping task is ended to determine the clipped video segment. Wherein, at least two of the first event, the second event and the third event are mutually associated events, and more specifically, the first event, the second event and the third event are mutually associated events.

The cloud 53 may further intercept the clipped video clips, so as to eliminate the video clips of the last predetermined duration, and generate target video clips for storage. This process may be performed immediately after the cut-out video segment is determined, or may be performed after a corresponding video acquisition request is received, which is not limited in the present disclosure.

In other instances where the cloud 53 performs the video processing solution of the present disclosure, the cloud 53 may start a video capture task when the first event occurs in the video. If the associated event of the first event does not occur within a predetermined time period after the end of the first event, the video clipping task is ended to determine the clipped video segment. If a second event associated with the first event occurs within a predetermined time period after the end of the first event, the video clipping task is terminated after the predetermined time period elapses after the end of the second event to determine the clipped video segment.

In addition, it should be noted that, on the one hand, the video processing solution of the present disclosure can be applied in a video surveillance scenario, that is, the video is captured by a camera in real time and analyzed in real time to intercept video clips that meet user needs. On the other hand, the video processing solution of the present disclosure can also be used to analyze existing videos.

FIG. 6 shows a schematic diagram of an electronic device suitable for use in implementing exemplary embodiments of the present disclosure. The terminal device described in the present disclosure may be configured in the form of an electronic device as shown in FIG. 6 . It should be noted that the electronic device shown in FIG. 6 is only an example, and should not impose any limitations on the functions and scope of use of the embodiments of the present disclosure.

The electronic device of the present disclosure includes at least a processor and a memory for storing one or more programs, which, when executed by the processor, enable the processor to implement the video processing method of the exemplary embodiment of the present disclosure.

Specifically, as shown in FIG. 6 , the electronic device 600 may include: a processor 610, an internal memory 621, an external memory interface 622, a Universal Serial Bus (USB) interface 630, a charging management module 640, and a power management module 641, battery 642, antenna 1, antenna 2, mobile communication module 650, wireless communication module 660, audio module 670, speaker 671, receiver 672, microphone 673, headphone jack 674, sensor module 680, display screen 690, camera module 691 , an indicator 692, a motor 693, a key 694, a Subscriber Identification Module (SIM) card interface 695, and the like. The sensor module 680 may include a depth sensor, a pressure sensor, a gyroscope sensor, an air pressure sensor, a magnetic sensor, an acceleration sensor, a distance sensor, a proximity light sensor, a fingerprint sensor, a temperature sensor, a touch sensor, an ambient light sensor, a bone conduction sensor, and the like.

It can be understood that the structures illustrated in the embodiments of the present disclosure do not constitute a specific limitation on the electronic device 600 . In other embodiments of the present disclosure, the electronic device 600 may include more or less components than shown, or some components may be combined, or some components may be separated, or different component arrangements. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.

The processor 610 may include one or more processing units, for example, the processor 610 may include an application processor (Application Processor, AP), a modem processor, a graphics processor (Graphics Processing Unit, GPU), an image signal processor (Image Signal Processor, ISP), controller, video codec, digital signal processor (Digital Signal Processor, DSP), baseband processor and/or neural network processor (Neural-network Processing Unit, NPU), etc. Wherein, different processing units may be independent devices, or may be integrated in one or more processors. In addition, a memory may also be provided in the processor 610 for storing instructions and data.

The electronic device 600 can realize the shooting function through the ISP, the camera module 691, the video codec, the GPU, the display screen 690, the application processor, and the like. In some embodiments, the electronic device 600 may include 1 or N camera modules 691, where N is a positive integer greater than 1. If the electronic device 600 includes N cameras, one of the N cameras is the main camera.

Internal memory 621 may be used to store computer executable program code, which includes instructions. The internal memory 621 may include a storage program area and a storage data area. The external memory interface 622 can be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the electronic device 600.

The present disclosure also provides a computer-readable storage medium. The computer-readable storage medium may be included in the electronic device described in the above embodiments, or may exist alone without being assembled into the electronic device.

The computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above. More specific examples of computer readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), fiber optics, portable compact disk read only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the foregoing. In this disclosure, a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.

The computer-readable storage medium can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. Program code embodied on a computer-readable storage medium may be transmitted using any suitable medium including, but not limited to, wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.

The computer-readable storage medium carries one or more programs, which, when executed by an electronic device, cause the electronic device to implement the methods described in the following embodiments.

The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more executable instructions for implementing the specified logical functions. It should also be noted that, in some alternative implementations, the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It is also noted that each block of the block diagrams or flowchart illustrations, and combinations of blocks in the block diagrams or flowchart illustrations, can be implemented in special purpose hardware-based systems that perform the specified functions or operations, or can be implemented using a combination of dedicated hardware and computer instructions.

The units involved in the embodiments of the present disclosure may be implemented in software or hardware, and the described units may also be provided in a processor. Among them, the names of these units do not constitute a limitation on the unit itself under certain circumstances.

The video processing method of the exemplary embodiment of the present disclosure may include steps 1 to 4, specifically:

In step 1, when the first event occurs in the video, a video capture task is started.

In step 2, within a predetermined period of time after the end of the first event, it is determined whether the second event occurs in the video. If the second event does not occur, go to step 3; if the second event occurs, go to step 4.

In step 3, the video clipping task is ended to determine the clipped video segment.

In step 4, within a predetermined time period after the end of the second event, it is determined whether the third event occurs in the video. If the third event occurs, the third event is regarded as the second event to execute step 4 cyclically; if the third event does not occur, step 3 is executed.

Wherein, at least two of the first event, the second event and the third event are mutually associated events.
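Steps 1 to 4 can be sketched as a loop over detected events. The following is a minimal illustration under stated assumptions, not the implementation of the disclosure: events are assumed to be given as (start, end) offsets in seconds, sorted by start time, and all mutually associated; an event that begins exactly at the end of the waiting window is assumed to count as occurring within it; and `capture_clips` is a hypothetical name.

```python
def capture_clips(events, predetermined):
    """Group associated events into continuous clips.

    events: list of (start, end) tuples in seconds, sorted by start time.
    Returns a list of (clip_start, clip_end) tuples, where each clip ends
    `predetermined` seconds after the last event it contains.
    """
    clips = []
    i = 0
    while i < len(events):
        # Step 1: the first event starts a video capture task.
        clip_start, last_end = events[i]
        i += 1
        # Steps 2 and 4: keep waiting while another event begins within the
        # predetermined duration after the end of the most recent event.
        while i < len(events) and events[i][0] <= last_end + predetermined:
            # The newly found event takes the role of the "second event".
            last_end = max(last_end, events[i][1])
            i += 1
        # Step 3: no further event occurred in the window, so the task ends.
        clips.append((clip_start, last_end + predetermined))
    return clips


# Events of FIG. 4 as offsets from 13:00:00, with an assumed 30 s window.
print(capture_clips([(0, 10), (30, 50)], 30))    # [(0, 80)]: one continuous clip
print(capture_clips([(0, 10), (100, 120)], 30))  # [(0, 40), (100, 150)]: two clips
```

Each returned clip still contains the final event-free waiting window, which corresponds to the tail that may later be removed to produce the target video clip.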

FIG. 7 schematically shows a flowchart of a video processing method according to an exemplary embodiment of the present disclosure. Each step of the video processing method of the present disclosure will be described below by taking the terminal device performing the steps shown in FIG. 7 as an example. Referring to Figure 7, the video processing method may include the following steps:

S70. When the first event occurs in the video, start the video capture task.

The video targeted by the solution of the present disclosure may be a video captured by a camera in real time, and the present disclosure does not limit the content of the video (ie, the object captured by the camera). Wherein, the camera may be a fixed camera, such as a monitoring camera in a parking lot or a manufacturing workshop. In addition, the camera may also be a mobile camera, for example, a camera on a mobile phone, and a user can perform mobile shooting through the camera to obtain surrounding scene information.

The video targeted by the solution of the present disclosure may also be a video that has been shot, and is obtained from the memory when the video needs to be analyzed. Similarly, the present disclosure does not limit the video type of the video that has been shot.

In an exemplary embodiment of the present disclosure, the first event may be a preset event, and the preset event may include a user preset event or a system preset event. The user preset event may be an event demonstrated by the user in advance, and the terminal device may photograph and save the event demonstrated by the user. For example, if the preset event is the appearance of a human face, the terminal device can capture an image containing a human face and an image that does not contain a human face, and the user can then select the image containing a human face on a preset event configuration interface as an image containing the preset event. In addition, the preset event may also be an event preset by the system when the terminal device leaves the factory, and the present disclosure does not limit the type of the preset event.

For another example, the first event may be an event of interest to the user, or an event of a predetermined type preset by the system. For example, the first event may be any one or more of situations such as the presence of a human face in the shooting scene, the movement of an object (such as a person, an animal, etc.), the device sending a prompt signal in the scene, crying, screaming, falling, etc. The type of the first event is not limited.

According to some embodiments of the present disclosure, first, the terminal device may extract video frame images from the video at predetermined time intervals. The predetermined time interval is related to the scene type and can be set based on the scene, and the value of the predetermined time interval is not limited in the present disclosure.

Since not every frame is processed, and video frame images are instead extracted at predetermined time intervals, the processing pressure on the terminal device is greatly reduced and resources are saved.

It is easy to understand that in some scenes with drastic changes in image content or other scenes that require careful analysis, each frame of image in the video can be extracted for processing.
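Frame extraction at a predetermined time interval can be sketched as an index calculation. The function name `sampled_frame_indices`, the frame rate and the interval values below are assumptions chosen for illustration; an interval of zero is treated here as processing every frame.

```python
def sampled_frame_indices(total_frames, fps, interval_seconds):
    """Return the indices of the frames to analyse when one frame is
    extracted every `interval_seconds` from a video with `fps` frames/s."""
    step = max(1, round(fps * interval_seconds))  # at least every frame
    return list(range(0, total_frames, step))


print(sampled_frame_indices(100, 25, 1.0))  # [0, 25, 50, 75]
print(sampled_frame_indices(5, 25, 0))      # [0, 1, 2, 3, 4]: every frame
```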

Next, feature extraction can be performed on the video frame images. Specifically, a machine learning model based on deep learning can be used to process the video frame images to extract their features. The present disclosure does not limit the structure or training process of the machine learning model. In addition, a method such as a histogram can also be used to extract the features of the video frame images, which is not limited in the present disclosure.
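As a non-limiting sketch of the histogram alternative mentioned above, the following computes a normalised intensity histogram from the 8-bit gray values of a frame. The function names, the choice of gray values as input, and the L1 distance used to compare two histograms are assumptions made for illustration only; the present disclosure does not specify them.

```python
def gray_histogram(pixels, bins=8):
    """Normalised histogram of 8-bit gray values (0..255).

    `pixels` is any iterable of integers; `bins` must divide 256.
    """
    if bins <= 0 or 256 % bins != 0:
        raise ValueError("bins must be a positive divisor of 256")
    width = 256 // bins
    counts = [0] * bins
    total = 0
    for value in pixels:
        if not 0 <= value <= 255:
            raise ValueError("pixel values must be in 0..255")
        counts[value // width] += 1
        total += 1
    if total == 0:
        return [0.0] * bins
    return [count / total for count in counts]


def histogram_distance(h1, h2):
    """L1 distance between two histograms (an assumed comparison metric)."""
    return sum(abs(a - b) for a, b in zip(h1, h2))
```

A large distance between the histograms of two sampled frames may indicate a change in image content, which a subsequent step could analyse further.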

Then, according to the extracted features, it can be determined whether the above-mentioned preset event occurs in the video frame image. Understandably, the output of a machine learning model can be the result of whether a preset event occurs. In addition, further analysis can be performed according to the features extracted by the machine learning model to obtain a result of whether a preset event occurs.

For example, in the case where the first event is the presence of a cat in the scene, the video frame image can be input into a trained convolutional neural network, and the convolutional neural network can perform feature extraction to classify whether there is a cat in the video frame image.

In addition, since the judgment of a single frame may be wrong, according to other embodiments of the present disclosure, a solution for judging whether a first event (or referred to as a preset event) occurs based on multiple frames is also provided.

Specifically, first, the target video frame image in which the preset object appears for the first time can be determined from the video according to the extracted features. The preset object is an object that determines that an event is a preset event. It can be understood that the preset object can be used as an identifier of the preset event. Next, if there are preset objects in one or more frames of video frame images after the target video frame image, it is determined that a preset event occurs in the video, and the target video frame image is used as the starting point of the preset event.

For example, in 100 consecutive frames of images, if a face appears in the 5th frame, it is judged whether a face also appears in the 6th frame, or whether a face appears in a predetermined number of subsequent video frames (such as the 6th to the 10th frames). If it is determined that there is a human face in these frames, it can be determined that a human face appears in the video, and the 5th frame is taken as the starting point of the appearance of the human face.
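The multi-frame judgment described above can be sketched as follows, as one possible reading rather than a definitive implementation. It assumes that per-frame detection results are already available as a list of booleans, and that the preset object must be present in all of the `confirm_frames` frames that follow the candidate frame; both the function name and this "all frames" rule are assumptions.

```python
def find_event_start(detections, confirm_frames=5):
    """Return the index of the target video frame, or None.

    detections[i] is True when the preset object is detected in frame i
    (0-based). A frame is accepted as the start of the event only if the
    object is also detected in each of the next `confirm_frames` frames.
    """
    for i, present in enumerate(detections):
        if not present:
            continue
        following = detections[i + 1 : i + 1 + confirm_frames]
        if len(following) == confirm_frames and all(following):
            return i
    return None
```

For the example above, a face detected in the 5th to 10th frames yields index 4 (the 5th frame in 1-based counting), whereas an isolated single-frame detection is ignored.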

When the first event occurs in the video, the terminal device may start a video capture task.

As described above, in this case, the video clipping task can be started from the target video frame image. Continuing the above example, the video capture task can be started from the fifth frame image.

According to some embodiments of the present disclosure, the operation of initiating a video capture task includes starting a video capture operation. Specifically, in the case that the video is a video captured by a camera in real time, the operation of initiating the video capture task includes starting to record the video.

According to other embodiments of the present disclosure, the operation of initiating the video capture task includes recording the time when the first event begins to appear in the video as the video capture start time. It can be understood that the time when the first event begins to appear is the time point in the video when the first event starts from nothing, that is, the instantaneous time point from the absence of the first event to the appearance of the first event. In addition, the video clipping start time may be a time in the video, that is, it represents a relative time. However, the video clipping start time may also represent an absolute time in reality, which is not limited in the present disclosure.

S72. Within a predetermined period of time after the end of the first event, determine whether the second event occurs in the video.

The following description will be given by taking as an example that the second event is associated with the first event.

In some embodiments of the present disclosure, the association of the second event with the first event means that the second event and the first event are of the same event type. For example, both events are the appearance of a face, both are the movement of a user, or both are the presence of another designated object (e.g., a cat or a designated device).

In other embodiments of the present disclosure, the association between the second event and the first event may also mean that the second event is the same as the first event. For example, both the second event and the first event are the appearance of the face of user A. It can be understood that "the same" here means that the subjects of the images corresponding to the events are the same; the positions and sizes at which they appear in the images need not be exactly the same.

In still other embodiments of the present disclosure, the association of the second event with the first event means that the second event is a subsequent event of the first event. For example, assembling an item includes two steps, process a and process b, and process a must be executed before process b. In this case, the event corresponding to process a is the first event, and the event corresponding to process b is the second event.
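The three kinds of association described above can be illustrated with the following sketch. The `Event` record and its `kind` and `subject` fields are hypothetical labels introduced only for this example; the present disclosure does not define how events are represented.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    kind: str     # hypothetical event type label, e.g. "face" or "motion"
    subject: str  # hypothetical identity label, e.g. "user_a"


def same_type(first, second):
    """Association by event type: both events are of the same kind."""
    return first.kind == second.kind


def same_event(first, second):
    """Association by identity: same kind and same subject."""
    return first == second  # dataclass equality compares both fields


def is_successor(first, second, sequence):
    """Association by order: `second` directly follows `first` in `sequence`."""
    kinds = list(sequence)
    if first.kind not in kinds or second.kind not in kinds:
        return False
    return kinds.index(second.kind) == kinds.index(first.kind) + 1
```

For instance, the faces of user A and user B are associated by type but not by identity, and process b is a successor of process a in the sequence `["process_a", "process_b"]`.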

When the first event detected by step S70 ends, a timer may be started to determine whether the second event occurs in the video within a predetermined period of time. The predetermined duration is related to the application scenario of the solution of the present disclosure, and may be, for example, 10 seconds, 30 seconds, etc., which is not limited in the present disclosure.

That is, after the event detected in step S70 ends, it is determined whether the next event corresponding to the event occurs within a predetermined period of time. For example, in the case of detecting a human face, when the human face disappears from the video, the timer starts, and within a predetermined period of time, it is detected whether there is another human face.

In addition, the manner of determining whether the second event occurs may be the same as the manner of determining the first event in step S70, that is, whether the event occurs may be determined by analyzing the video frame images.

If it is determined that the second event occurs in the video, the terminal device executes step S74; if it is determined that the second event does not appear in the video, the terminal device executes step S78.

In addition, for the process of detecting the end of the first event, similar to the above-mentioned case of detecting the occurrence of the first event, one or more frames may be combined to detect whether the first event ends.

For example, suppose it is found in the 20th frame image that the first event has ended. In this case, the judgment can be continued on one or more subsequent frames. If the first event does not appear in those frames either, the 20th frame is regarded as the end image of the first event.
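A minimal sketch of multi-frame end detection is given below, under the assumption that the end image is the first frame in which the event is absent and that the event must also remain absent in each of the next `confirm_frames` frames. The function name and parameters are hypothetical.

```python
def find_event_end(detections, start, confirm_frames=3):
    """Return the index of the end image of the event, or None.

    detections[i] is True when the event is detected in frame i (0-based).
    Scanning begins at `start`. A frame is accepted as the end image only
    if the event is absent there and in the next `confirm_frames` frames.
    """
    for i in range(start, len(detections)):
        if detections[i]:
            continue
        following = detections[i + 1 : i + 1 + confirm_frames]
        if len(following) == confirm_frames and not any(following):
            return i
    return None
```

An event that is present in the first 19 frames and absent afterwards yields index 19, that is, the 20th frame in 1-based counting; a brief one-frame dropout does not end the event.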

In addition, it can be understood that when multiple frames of images are used to determine whether the second event occurs, it can be set so that the second event is determined to occur in the video only when multiple frames of images (for example, 3 frames or 5 frames) containing the object corresponding to the second event all appear within the predetermined time period. Alternatively, the second event can be determined to occur in the video as long as one image containing the object corresponding to the second event appears within the predetermined time period.

S74. Within a predetermined period of time after the end of the second event, determine whether a third event occurs in the video.

In some embodiments of the present disclosure, the third event may be associated with the first event or the second event, and the meaning of association mentioned here is the same as the association described in step S72, and details are not repeated here. It should be noted that at least two of the first event, the second event and the third event are mutually related events, and more specifically, the first event, the second event and the third event are mutually related events.

If it is determined in step S72 that the second event occurs in the video, the terminal device may determine whether the third event occurs in the video within a predetermined period of time after the end of the second event.

Specifically, it is also possible to determine whether the third event occurs in the video by extracting and analyzing the features of the video frame images.

If it is determined that the third event occurs in the video, the terminal device executes step S76; if it is determined that the third event does not appear in the video, the terminal device executes step S78.

In addition, it can be understood that when multiple frames of images are used to determine whether the third event occurs, it can be set so that the third event is determined to occur in the video only when multiple frames of images (for example, 3 frames or 5 frames) containing the object corresponding to the third event all appear within the predetermined time period. Alternatively, the third event can be determined to occur in the video as long as one image containing the object corresponding to the third event appears within the predetermined time period.

S76. Use the third event as the second event.

If it is determined in step S74 that the third event occurs in the video, the third event is regarded as the second event, and the process returns to step S74 to perform the operation of determining whether the third event occurs in the video within a predetermined time period after the end of the second event. Thus, as shown in FIG. 7 , a loop process of steps S74 and S76 is formed.

It can be seen that as long as another associated event occurs within a predetermined period of time after the end of an event, the loop process continues to be executed. Once no associated event occurs within a predetermined period of time after the end of an event, the process jumps from step S74 to step S78.

For example, suppose the predetermined duration is 10 seconds. If event b associated with event a occurs within 10 seconds after event a ends, it is then judged whether an event associated with event a (or event b) occurs within 10 seconds after event b ends. If associated event c occurs, it is then judged whether an event associated with the previous event occurs within 10 seconds after the end of event c, and so on.
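The loop formed by steps S72 to S78 can be sketched as follows, assuming that the associated events have already been detected and are given as a time-ordered list of `(start, end)` pairs in seconds. The function name `clip_window` and this input representation are assumptions for illustration; the returned end time includes the trailing predetermined duration, as in step S78.

```python
def clip_window(events, gap):
    """Return (clip_start, clip_end) for the clip that begins at events[0].

    events: associated events as (start, end) pairs, sorted by start time.
    gap:    the predetermined duration, in the same time unit.
    """
    if not events:
        return None
    clip_start, last_end = events[0]          # S70: start at the first event
    for start, end in events[1:]:
        if start - last_end > gap:            # S72/S74: nothing within the gap
            break                             # fall through to S78
        last_end = max(last_end, end)         # S76: this event becomes the "second event"
    return clip_start, last_end + gap         # S78: end after the final waiting period
```

For example, with a predetermined duration of 10 seconds and events at 0 to 5 s, 12 to 20 s, 28 to 30 s and 50 to 55 s, the first three events are chained into a single clip from 0 s to 40 s, and the fourth event is left for a later clip.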

S78. If the second event or the third event does not occur, end the video clipping task to determine the clipped video segment.

In the embodiment where the operation of initiating the video capture task includes starting the video capture operation, the operation of the terminal device to end the video capture task includes: ending the video capture operation. Specifically, when the video is a video captured by a camera in real time, ending the video capture task includes stopping the video recording.

In the embodiment where the operation of starting the video interception task includes recording the video interception start time, if it is determined in step S72 that the second event does not occur, the operation of the terminal device to end the video interception task includes: recording, as the video interception end time, the time at which the predetermined duration has elapsed after the first event is determined to have ended. In this case, a video clipping time period may be determined based on the video clipping start time and the video clipping end time, and a clipping operation is performed on the video over that time period to determine the clipped video segment.

In the case where it is determined in step S74 that the third event does not occur, the operation of the terminal device to end the video capture task includes: recording, as the video capture end time, the time at which the predetermined duration has elapsed after the second event is determined to have ended. In this case, a video clipping time period may be determined based on the video clipping start time and the video clipping end time, and a clipping operation is performed on the video over that time period to determine the clipped video segment.

For example, in the video, the video clipping start time is 01:30 and the video clipping end time is 03:00. In this case, the terminal device can clip the video clip corresponding to 01:30 to 03:00 from the video, that is, determine the clipped video clip.

After the clipped video clip is determined, since no corresponding event occurs within the last predetermined duration of the clip, the terminal device can remove the portion of the last predetermined duration from the clipped video clip to generate the target video clip. In addition, the terminal device can upload the target video clip to the cloud for storage.
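The removal of the last predetermined duration can be illustrated with the following sketch, which works on timestamps only. The helper names and the `mm:ss` text format are assumptions, and the 10-second value used in the usage note is merely one of the example durations mentioned earlier.

```python
def parse_mmss(text):
    """Convert an 'mm:ss' string into a number of seconds."""
    minutes, seconds = text.split(":")
    return int(minutes) * 60 + int(seconds)


def format_mmss(total_seconds):
    """Convert a number of seconds into an 'mm:ss' string."""
    return f"{total_seconds // 60:02d}:{total_seconds % 60:02d}"


def target_window(clip_start, clip_end, gap):
    """Drop the trailing `gap` seconds, in which no event occurs."""
    if clip_end - clip_start < gap:
        raise ValueError("clip is shorter than the predetermined duration")
    return clip_start, clip_end - gap
```

Assuming a predetermined duration of 10 seconds, a clipped video clip spanning 01:30 to 03:00 gives a target video clip spanning 01:30 to 02:50.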

Thus, the cloud can respond to a video acquisition request corresponding to the target video clip sent by the terminal device or other device, and send the target video clip to the device that sends the request.

In addition, considering the limited processing resources of the terminal device, the terminal device can directly upload the clipped video clips to the cloud.

In this case, in some embodiments, the cloud may, in response to a video acquisition request corresponding to the clipped video clip, remove the video clip of the last predetermined duration from the clipped video clip to generate the target video clip, and send the target video clip to the requester that initiated the video acquisition request, so that the user can watch it.

In other embodiments, the cloud can remove the video clip of the last predetermined duration from the clipped video clip, and generate and store the target video clip, so that the cloud can, in response to a video acquisition request corresponding to the clipped video clip, send the stored target video clip to the requester that initiated the video acquisition request, so that the user can watch it.

It can be understood that in some embodiments of the present application, at least two consecutive associated events (including identical events) that occur within a preset time interval of each other can be clipped from the video without interruption between them (the video between any two such consecutive associated events is also clipped), thereby improving the user's viewing experience. Because the preset time is set, two consecutive associated events separated by a long interval are not included in the same clipped video clip, which reduces the storage volume to a certain extent and makes it easier to balance storage volume against viewing experience.

Referring to FIG. 8 , the entire process of the video processing solution according to the embodiment of the present disclosure will be described below by taking the occurrence of the same preset event as an example.

In step S802, the terminal device monitors the video captured by the camera in real time. The camera can be integrated on the terminal device, and in addition, the camera can also establish a connection with the terminal device in a wired or wireless manner, so that the terminal device can obtain the video.

In step S804, the terminal device determines whether a preset event occurs in the video. If it appears, go to step S806; if not, go back to step S802.

In step S806, after the preset event ends, the video recording is extended for N seconds, where N seconds corresponds to the above-mentioned predetermined duration, for example, 10 seconds, 30 seconds, and the like.

In step S808, the terminal device determines whether a preset event occurs again within N seconds. If so, go back to step S806; if not, go to step S810.

In step S810, the terminal device determines a clipped video clip, where the clipped video clip includes a video clip of N seconds after the end of the last preset event.

In step S812, the terminal device truncates the last N seconds from the video clip determined in step S810, and uploads the resulting video clip to the cloud for saving.
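The flow of steps S802 to S812 can be sketched as a simple streaming loop, under the simplifying assumption that the detection result arrives once per second as a boolean. The function name `record_clips` and the half-open `(start, end)` output in seconds are assumptions; uploading to the cloud is outside the scope of the sketch.

```python
def record_clips(detections, n):
    """Return the target clips as (start, end) second ranges, end exclusive.

    detections: iterable of booleans, one per second of monitored video.
    n:          the predetermined duration N, in seconds (n >= 1).
    """
    clips = []
    start = None  # second at which recording began, None while idle
    quiet = 0     # seconds elapsed since the preset event was last seen
    for t, present in enumerate(detections):
        if start is None:
            if present:                # S804: the preset event appears
                start, quiet = t, 0
            continue
        if present:                    # S808: the event re-occurs within N seconds
            quiet = 0
            continue
        quiet += 1                     # S806: recording is extended
        if quiet == n:                 # S810: clip ends, N-second tail included
            clip_end = t + 1
            clips.append((start, clip_end - n))  # S812: truncate the tail
            start = None
    return clips
```

A recording that is still in progress when the input ends is not emitted by this sketch. For example, with N equal to 3 and the event present in seconds 2 to 4 and 7 to 8, a single target clip covering seconds 2 to 8 is produced.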

FIG. 9 schematically shows a flowchart of a solution in which the cloud participates in video clipping according to another embodiment of the present disclosure.

In step S902, the cloud acquires and stores the video clips cut out by the terminal device. The process of determining the clipped video segment by the terminal device may be as shown in the above steps S802 to S810.

In step S904, the cloud receives a video acquisition request corresponding to the video clip.

In step S906, the cloud may truncate the last N seconds of the video clip and send it to the requester of the video acquisition request.

In addition, the present disclosure also provides another video processing method for a scene that only needs to output a video containing two related events. Referring to Figure 10, the video processing method may include the following steps:

S102. When the first event occurs in the video, start the video capture task.

Step S102 is the same as the above-mentioned step S70, and is not repeated here.

S104. If the associated event of the first event does not occur within a predetermined time period after the end of the first event, end the video clipping task to determine the clipped video segment.

The determination of whether two events are associated is similar to the case where the first event and the second event are associated in step S72. After the first event occurs, the terminal device can determine whether an event associated with the first event occurs within a predetermined time period after the first event ends, and if not, end the video capture task to determine the clipped video segment.

The process of ending the video clipping task to determine the clipped video segment is similar to the process of step S78, and will not be repeated.

S106. If the second event associated with the first event occurs within a predetermined duration after the first event ends, and no event associated with the first event occurs within the predetermined duration after the second event ends, end the video interception task to determine the clipped video clip.

If an event associated with the first event occurs within a predetermined period of time after the first event ends, it is recorded as a second event; if no event associated with the first event (or the second event) then occurs within a predetermined period of time after the second event ends, the terminal device can end the video clipping task to determine the clipped video segment.

In this exemplary solution, considering that events in some scenarios often have strong continuity, the video capture task is terminated only after a predetermined period of time has elapsed following the end of the second event, so as to avoid missing or discarding relevant information that may exist in the video within that predetermined period.
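One possible reading of steps S102 to S106 is sketched below. It assumes, as in the scenario stated for this method, that at most two associated events are included in the output, so the task ends once the predetermined duration has elapsed after the second event. The function name and the `(start, end)` representation of events in seconds are hypothetical.

```python
def clip_window_two_events(events, gap):
    """Return (clip_start, clip_end) covering at most two associated events.

    events: associated events as (start, end) pairs, sorted by start time.
    gap:    the predetermined duration, in the same time unit.
    """
    if not events:
        return None
    clip_start, first_end = events[0]                 # S102
    if len(events) == 1 or events[1][0] - first_end > gap:
        return clip_start, first_end + gap            # S104: no associated event in time
    second_end = max(first_end, events[1][1])
    return clip_start, second_end + gap               # S106: end after the second event
```

For example, with a predetermined duration of 10 seconds, events at 0 to 5 s and 12 to 20 s yield a clip from 0 s to 30 s, while events at 0 to 5 s and 30 to 40 s yield a clip from 0 s to 15 s.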

In addition, for other scenarios, the solution of the present disclosure can also eliminate the video of the last predetermined duration, and generate a target video segment for storage.

In some embodiments of the present disclosure, the terminal device may remove the video clip of the last predetermined duration from the clipped video clip to generate the target video clip. In addition, the terminal device can upload the target video clip to the cloud for storage.

In some embodiments, the cloud may, in response to a video acquisition request corresponding to the clipped video clip, remove the video clip of the last predetermined duration from the clipped video clip to generate the target video clip, and send the target video clip to the requester that initiated the video acquisition request, so that the user can watch it.

In other embodiments, the cloud can remove the video clip of the last predetermined duration from the clipped video clip, and generate and store the target video clip, so that the cloud can, in response to a video acquisition request corresponding to the clipped video clip, send the stored target video clip to the requester that initiated the video acquisition request, so that the user can watch it.

Based on the above-mentioned video processing method of the present disclosure, on the one hand, the solution of the present disclosure can clip video clips of multiple associated events from the video; on the other hand, the clipped video clips are continuous, ensuring that the video viewed by the user is continuous and the events are complete; in addition, storing only the clipped video clips can greatly save storage space.

It should be noted that although the various steps of the methods of the present disclosure are depicted in the figures in a particular order, this does not require or imply that the steps must be performed in that particular order, or that all illustrated steps must be performed to achieve the desired result. Additionally or alternatively, certain steps may be omitted, multiple steps may be combined into one step for execution, and/or one step may be decomposed into multiple steps for execution, and the like.

Further, this exemplary embodiment also provides a video processing apparatus.

FIG. 11 schematically shows a block diagram of a video processing apparatus of an exemplary embodiment of the present disclosure. Referring to FIG. 11 , the video processing apparatus 11 according to an exemplary embodiment of the present disclosure may include a task initiation module 111 , an event detection module 113 and a first video capture module 115 .

Specifically, the task initiation module 111 can be used to start the video capture task when the first event occurs in the video; the event detection module 113 can be used to determine whether the second event occurs in the video within a predetermined time period after the first event ends, to determine, if the second event occurs, whether the third event occurs in the video within a predetermined period of time after the second event ends, and to take the third event as the second event if the third event occurs; the first video interception module 115 can be used to end the video clipping task if the second event or the third event does not occur, so as to determine the clipped video segment; wherein at least two of the first event, the second event and the third event are mutually associated events.

According to an exemplary embodiment of the present disclosure, the first video clipping module 115 may be further configured to perform: excluding a video clip of the last predetermined duration from the clipped video clips to generate a target video clip.

According to an exemplary embodiment of the present disclosure, the first video clipping module 115 may be further configured to perform: transmitting the clipped video segment to a designated device, so that the designated device can remove the video segment of the last predetermined duration from the clipped video segment to generate the target video segment.

According to an exemplary embodiment of the present disclosure, referring to FIG. 12 , compared to the video processing apparatus 11 , the video processing apparatus 12 may further include a video segment uploading module 121 .

Specifically, the video clip uploading module 121 may be configured to perform: uploading the clipped video clip to the cloud. In this case, in response to a video acquisition request corresponding to the clipped video clip, the cloud removes the video clip of the last predetermined duration from the clipped video clip, generates the target video clip, and sends the target video clip to the requester that initiated the video acquisition request; or, the cloud removes the video clip of the last predetermined duration from the clipped video clip, and generates and stores the target video clip, so that the cloud can, in response to a video acquisition request corresponding to the clipped video clip, send the target video clip to the requester that initiated the video acquisition request.

According to an exemplary embodiment of the present disclosure, the process in which the task initiating module 111 initiates a video capture task may be configured to perform: when a first event occurs in the video, start a video capture operation. In this case, the process of the first video capture module 115 ending the video capture task may be configured to perform: end the video capture operation.

According to an exemplary embodiment of the present disclosure, the process of initiating the video capture task by the task initiation module 111 may be configured to perform: recording the time when the first event starts to appear in the video as the video capture start time. In this case, the process of the first video clipping module 115 ending the video clipping task to determine the clipped video segment may be configured to perform: in the case that the second event does not occur, recording, as the video clipping end time, the time at which the predetermined duration has elapsed after the first event is determined to have ended, and clipping the video based on the video clipping start time and the video clipping end time to determine the clipped video segment; in the case that the third event does not occur, recording, as the video clipping end time, the time at which the predetermined duration has elapsed after the second event is determined to have ended, and clipping the video based on the video clipping start time and the video clipping end time to determine the clipped video segment.

According to an exemplary embodiment of the present disclosure, the first event is a preset event, and the preset event includes a user preset event or a system preset event. In this case, referring to FIG. 13 , compared to the video processing apparatus 11 , the video processing apparatus 13 may further include an image analysis module 131 .

Specifically, the image analysis module 131 may be configured to perform: extracting features from video frame images in the video; and determining whether a preset event occurs in the video according to the extracted features.

According to an exemplary embodiment of the present disclosure, the process in which the image analysis module 131 determines whether a preset event occurs in the video according to the extracted features may be configured to perform: determining from the video, according to the extracted features, the target video frame image in which the preset object first appears, the preset object being an object that determines an event to be a preset event; and if the preset object is present in one or more video frame images after the target video frame image, determining that a preset event occurs in the video; wherein the video interception task is started from the target video frame image.

According to an exemplary embodiment of the present disclosure, the above video is a video captured by a camera in real time.

Further, another video processing apparatus is also provided in this exemplary embodiment.

FIG. 14 schematically shows a block diagram of a video processing apparatus according to another exemplary embodiment of the present disclosure. Referring to FIG. 14 , the video processing apparatus 14 according to an exemplary embodiment of the present disclosure may include a task initiation module 111 , a second video capture module 141 and a third video capture module 143 .

Specifically, the task initiation module 111 can be used to start the video capture task when the first event occurs in the video; the second video capture module 141 can be used to end the video clipping task, so as to determine the clipped video segment, if no event associated with the first event occurs within a predetermined period of time after the end of the first event; the third video clipping module 143 can be used to end the video interception task, so as to determine the intercepted video segment, if the second event associated with the first event occurs within a predetermined time period after the first event ends and no event associated with the first event occurs within a predetermined period of time after the second event ends.

According to an exemplary embodiment of the present disclosure, the third video clipping module 143 may be further configured to perform: excluding a video clip of the last predetermined duration from the clipped video clips to generate a target video clip.

According to an exemplary embodiment of the present disclosure, the third video clipping module 143 may be further configured to perform: transmitting the clipped video segment to a designated device, so that the designated device can remove the video segment of the last predetermined duration from the clipped video segment to generate the target video segment.

According to an exemplary embodiment of the present disclosure, the video processing apparatus 14 may further include the above-mentioned video clip uploading module 121 .

Since the functions of the modules of the video processing apparatus in the embodiments of the present disclosure are the same as the corresponding steps in the above-mentioned method embodiments, they are not repeated here.

From the description of the above embodiments, those skilled in the art can easily understand that the exemplary embodiments described herein may be implemented by software, or may be implemented by software combined with necessary hardware. Therefore, the technical solutions according to the embodiments of the present disclosure may be embodied in the form of a software product, and the software product may be stored in a non-volatile storage medium (which may be a CD-ROM, a USB flash drive, a mobile hard disk, etc.) or on the network, and includes several instructions to cause a computing device (which may be a personal computer, a server, a terminal device, or a network device, etc.) to execute the method according to an embodiment of the present disclosure.

In addition, the above-mentioned figures are merely schematic illustrations of the processes included in the methods according to the exemplary embodiments of the present disclosure, and are not intended to be limiting. It is easy to understand that the processes shown in the above figures do not indicate or limit the chronological order of these processes. In addition, it is also readily understood that these processes may be performed synchronously or asynchronously, for example, in multiple modules.

It should be noted that although several modules or units of the apparatus for action performance are mentioned in the above detailed description, this division is not mandatory. Indeed, according to embodiments of the present disclosure, the features and functions of two or more modules or units described above may be embodied in one module or unit. Conversely, the features and functions of one module or unit described above may be further divided into multiple modules or units to be embodied.

Other embodiments of the present disclosure will readily suggest themselves to those skilled in the art upon consideration of the specification and practice of what is disclosed herein. This application is intended to cover any variations, uses, or adaptations of the present disclosure that follow the general principles of the present disclosure and include common knowledge or techniques in the technical field not disclosed by the present disclosure. The specification and examples are to be regarded as exemplary only, with the true scope and spirit of the disclosure being indicated by the claims.

It is to be understood that the present disclosure is not limited to the precise structures described above and illustrated in the accompanying drawings, and that various modifications and changes may be made without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.

Claims

A video processing method, comprising:

when a first event occurs in a video, starting a video clipping task;

within a predetermined duration after the first event ends, determining whether a second event occurs in the video;

if the second event occurs, determining, within the predetermined duration after the second event ends, whether a third event occurs in the video;

if the third event occurs, using the third event as the second event; and

if the second event or the third event does not occur, ending the video clipping task to determine a clipped video clip;

wherein at least two of the first event, the second event, and the third event are mutually associated events.
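By way of non-limiting illustration only, the chained waiting-window logic recited above may be sketched as follows. The sketch assumes a hypothetical detector that yields `(timestamp, active)` samples, where `active` is true while the first event or an event associated with it is occurring; the function name `clip_interval`, the sample format, and the handling of a stream that ends early are assumptions of this example and are not taken from the disclosure.

```python
from typing import Iterable, Optional, Tuple


def clip_interval(
    samples: Iterable[Tuple[float, bool]], wait: float
) -> Optional[Tuple[float, float]]:
    """Return (start, end) of the clipped video, or None if no event occurs.

    samples: (timestamp, active) pairs in increasing time order (hypothetical
             detector output); the end of an event is approximated by the
             last sample at which it was active.
    wait:    the predetermined duration, in the same unit as the timestamps.
    """
    start: Optional[float] = None  # time at which the first event appeared
    last_active = 0.0              # latest time at which an event was active
    last_seen = 0.0                # latest timestamp read from the stream
    for t, active in samples:
        last_seen = t
        if start is None:
            if active:
                start = last_active = t  # first event: start the clipping task
            continue
        if active:
            last_active = t              # follow-up event becomes the "second event"
        elif t - last_active >= wait:
            # No follow-up event within the waiting window: end the task.
            return (start, last_active + wait)
    if start is None:
        return None
    # Assumption: if the stream ends while the window is open, stop there.
    return (start, last_seen)
```

Because each follow-up event simply resets `last_active`, the loop treats a third event as the new second event without any extra state, so an arbitrarily long chain of associated events yields a single continuous clip.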
The video processing method according to claim 1, wherein the video processing method further comprises:

removing the last video clip of the predetermined duration from the clipped video clip to generate a target video clip.
The video processing method according to claim 1, wherein the video processing method further comprises:

transmitting the clipped video clip to a designated device, so that the designated device removes the last video clip of the predetermined duration from the clipped video clip to generate a target video clip.
The video processing method according to claim 1, wherein the video processing method further comprises:

uploading the clipped video clip to the cloud,

so that the cloud, in response to a video acquisition request corresponding to the clipped video clip, removes the last video clip of the predetermined duration from the clipped video clip, generates a target video clip, and sends the target video clip to the requesting end that initiated the video acquisition request; or, so that the cloud removes the last video clip of the predetermined duration from the clipped video clip, generates and stores a target video clip, and, in response to a video acquisition request corresponding to the clipped video clip, sends the target video clip to the requesting end that initiated the video acquisition request.
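By way of non-limiting illustration only, the two cloud-side alternatives recited above (trimming when a request arrives, or trimming once at upload and storing the result) may be sketched as follows. The sketch represents a clip by its `(start, end)` times rather than by video data; the names `CloudStore`, `trim_tail`, and `trim_on_upload` are hypothetical and do not correspond to any real cloud service API.

```python
from dataclasses import dataclass, field
from typing import Dict, Tuple

Clip = Tuple[float, float]  # (start, end) in seconds; hypothetical representation


def trim_tail(clip: Clip, wait: float) -> Clip:
    """Remove the trailing waiting window of length `wait` from a clip."""
    start, end = clip
    return (start, max(start, end - wait))


@dataclass
class CloudStore:
    wait: float                   # the predetermined duration
    trim_on_upload: bool = False  # False: trim per request; True: trim and store
    _clips: Dict[str, Clip] = field(default_factory=dict)

    def upload(self, clip_id: str, clip: Clip) -> None:
        # Second alternative: generate and store the target clip at upload time.
        self._clips[clip_id] = (
            trim_tail(clip, self.wait) if self.trim_on_upload else clip
        )

    def request(self, clip_id: str) -> Clip:
        # First alternative: generate the target clip when it is requested.
        clip = self._clips[clip_id]
        return clip if self.trim_on_upload else trim_tail(clip, self.wait)
```

Both modes return the same target clip to the requester; the difference lies only in whether the trimming cost is paid once at upload or on every request, and in whether the untrimmed clip is retained.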
The video processing method according to claim 1, wherein starting the video clipping task comprises: starting a clipping operation on the video; and

ending the video clipping task comprises: ending the clipping operation on the video.
The video processing method according to claim 1, wherein starting the video clipping task comprises: recording the time at which the first event begins to appear in the video as a video clipping start time; and

ending the video clipping task to determine the clipped video clip comprises: in a case where the second event does not occur, recording the time at which the predetermined duration has elapsed after the first event ends as a video clipping end time, and performing a clipping operation on the video based on the video clipping start time and the video clipping end time to determine the clipped video clip; and in a case where the third event does not occur, recording the time at which the predetermined duration has elapsed after the second event ends as the video clipping end time, and performing the clipping operation on the video based on the video clipping start time and the video clipping end time to determine the clipped video clip.
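By way of non-limiting illustration only, the recording of a clipping start time and a clipping end time, followed by a clipping operation based on those two times, may be sketched as follows. The sketch assumes that the associated events are already available as `(begin, end)` pairs sorted by begin time and that frames carry timestamps; the names `clip_times` and `select_frames` and these data representations are assumptions of this example.

```python
from typing import List, Tuple

Event = Tuple[float, float]  # (begin, end) in seconds; hypothetical representation


def clip_times(events: List[Event], wait: float) -> Tuple[float, float]:
    """Compute (clipping start time, clipping end time).

    events: associated events sorted by begin time; events[0] is the first event.
    wait:   the predetermined duration in seconds.
    """
    if not events:
        raise ValueError("at least the first event is required")
    clip_start, current_end = events[0]
    for begin, end in events[1:]:
        if begin - current_end > wait:
            break  # next event falls outside the waiting window
        current_end = max(current_end, end)  # it becomes the new "second event"
    # End time: the predetermined duration after the last chained event ends.
    return clip_start, current_end + wait


def select_frames(frame_times: List[float], start: float, end: float) -> List[int]:
    """Indices of frames whose timestamps lie within [start, end]."""
    return [i for i, t in enumerate(frame_times) if start <= t <= end]
```

For example, with events at 10 to 12 s, 14 to 15 s, and 30 to 31 s and a predetermined duration of 5 s, the first two events are chained and the third is excluded, giving a clip from 10 s to 20 s.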
The video processing method according to claim 1, wherein the first event is a preset event, and the preset event includes a user preset event or a system preset event; wherein, the video processing method further comprises:

performing feature extraction on video frame images in the video; and

determining, according to the extracted features, whether the preset event occurs in the video.
The video processing method according to claim 7, wherein determining whether the preset event occurs in the video according to the extracted features comprises:

determining from the video, according to the extracted features, a target video frame image in which a preset object appears for the first time, the preset object being an object by which an event is determined to be the preset event; and

if the preset object is present in one or more video frame images after the target video frame image, determining that the preset event occurs in the video;

wherein the video clipping task is started from the target video frame image.
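By way of non-limiting illustration only, the determination of the target video frame image and the confirmation of the preset event may be sketched as follows. The sketch assumes that feature extraction has already produced one boolean per frame indicating whether the preset object is present; the function name `find_event_start` and the `lookahead` bound on how many subsequent frames are examined are assumptions of this example, since the claim does not specify a window.

```python
from typing import Optional, Sequence


def find_event_start(
    detections: Sequence[bool], lookahead: int = 5
) -> Optional[int]:
    """Return the index of the target frame if the preset event is confirmed.

    detections: per-frame flags, True where the preset object was detected
                (hypothetical output of the feature-extraction step).
    lookahead:  number of subsequent frames checked for confirmation
                (an assumed parameter).
    """
    try:
        target = list(detections).index(True)  # first appearance of the object
    except ValueError:
        return None                            # the object never appears
    following = detections[target + 1 : target + 1 + lookahead]
    # The event is confirmed if the object is seen again in any later frame.
    return target if any(following) else None
```

Requiring at least one confirming frame guards against a single spurious detection starting a clipping task, while the returned index still points at the first appearance, so that clipping begins at the target frame.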
The video processing method according to any one of claims 1 to 8, wherein the video is a video captured by a camera in real time.
A video processing method, comprising:

when a first event occurs in a video, starting a video clipping task;

if no event associated with the first event occurs within a predetermined duration after the first event ends, ending the video clipping task to determine a clipped video clip; and

if a second event associated with the first event occurs within the predetermined duration after the first event ends, and no event associated with the first event occurs within the predetermined duration after the second event ends, ending the video clipping task to determine the clipped video clip.
The video processing method according to claim 10, wherein the video processing method further comprises:

removing the last video clip of the predetermined duration from the clipped video clip to generate a target video clip.
The video processing method according to claim 10, wherein the video processing method further comprises:

transmitting the clipped video clip to a designated device, so that the designated device removes the last video clip of the predetermined duration from the clipped video clip to generate a target video clip.
The video processing method according to claim 10, wherein the video processing method further comprises:

uploading the clipped video clip to the cloud,

so that the cloud, in response to a video acquisition request corresponding to the clipped video clip, removes the last video clip of the predetermined duration from the clipped video clip, generates a target video clip, and sends the target video clip to the requesting end that initiated the video acquisition request; or, so that the cloud removes the last video clip of the predetermined duration from the clipped video clip, generates and stores a target video clip, and, in response to a video acquisition request corresponding to the clipped video clip, sends the target video clip to the requesting end that initiated the video acquisition request.
A video processing device, comprising:

a task initiation module, configured to start a video clipping task when a first event occurs in a video;

an event determining module, configured to: determine whether a second event occurs in the video within a predetermined duration after the first event ends; if the second event occurs, determine whether a third event occurs in the video within the predetermined duration after the second event ends; and if the third event occurs, use the third event as the second event; and

a first video clipping module, configured to end the video clipping task if the second event or the third event does not occur, to determine a clipped video clip;

wherein at least two of the first event, the second event, and the third event are mutually associated events.
The video processing apparatus according to claim 14, wherein the first video clipping module is further configured to remove the last video clip of the predetermined duration from the clipped video clip to generate a target video clip.
The video processing apparatus according to claim 14, wherein the video processing apparatus further comprises:

a video clip uploading module, configured to upload the clipped video clip to the cloud,

so that the cloud, in response to a video acquisition request corresponding to the clipped video clip, removes the last video clip of the predetermined duration from the clipped video clip, generates a target video clip, and sends the target video clip to the requesting end that initiated the video acquisition request; or, so that the cloud removes the last video clip of the predetermined duration from the clipped video clip, generates and stores a target video clip, and, in response to a video acquisition request corresponding to the clipped video clip, sends the target video clip to the requesting end that initiated the video acquisition request.
A video processing device, comprising:

a task initiation module, configured to start a video clipping task when a first event occurs in a video;

a second video clipping module, configured to end the video clipping task to determine a clipped video clip if no event associated with the first event occurs within a predetermined duration after the first event ends; and

a third video clipping module, configured to end the video clipping task to determine the clipped video clip if a second event associated with the first event occurs within the predetermined duration after the first event ends, and no event associated with the first event occurs within the predetermined duration after the second event ends.
The video processing apparatus according to claim 17, wherein the video processing apparatus further comprises:

a video clip uploading module, configured to upload the clipped video clip to the cloud,

so that the cloud, in response to a video acquisition request corresponding to the clipped video clip, removes the last video clip of the predetermined duration from the clipped video clip, generates a target video clip, and sends the target video clip to the requesting end that initiated the video acquisition request; or, so that the cloud removes the last video clip of the predetermined duration from the clipped video clip, generates and stores a target video clip, and, in response to a video acquisition request corresponding to the clipped video clip, sends the target video clip to the requesting end that initiated the video acquisition request.
A computer-readable storage medium on which a computer program is stored, wherein the computer program, when executed by a processor, implements the video processing method according to any one of claims 1 to 13.
An electronic device comprising:

a processor; and

a memory configured to store one or more programs that, when executed by the processor, cause the processor to implement the video processing method according to any one of claims 1 to 13.