CN114494950A

CN114494950A - Video processing method and device, electronic equipment and storage medium

Info

Publication number: CN114494950A
Application number: CN202210032588.3A
Authority: CN
Inventors: 刘俊启
Original assignee: Beijing Baidu Netcom Science and Technology Co Ltd
Current assignee: Beijing Baidu Netcom Science and Technology Co Ltd
Priority date: 2022-01-12
Filing date: 2022-01-12
Publication date: 2022-05-13

Abstract

The disclosure provides a video processing method, a video processing apparatus, an electronic device and a storage medium, and relates to the fields of computer vision and video processing within the field of artificial intelligence. The method comprises: identifying roles in a video to be processed; dividing the video to be processed into a plurality of video segments according to the roles; determining the association relationship between the roles and the video segments; determining, according to the association relationship, the video segments associated with a target role as target video segments, wherein the target role is selected by a user from the roles; and generating a target video according to the target video segments. Because the video to be processed is divided into video segments according to its roles, the association relationship between the roles and the video segments is determined in response to the user's selection, and the video segments associated with the target role are used to generate the target video, the user does not need to repeat steps such as watching, adjusting and cutting, which avoids consuming a large amount of manpower and improves video processing efficiency.

Description

Video processing method and device, electronic equipment and storage medium

Technical Field

The present disclosure relates to the field of computer vision technologies and video processing technologies in the field of artificial intelligence technologies, and in particular, to a video processing method and apparatus, an electronic device, and a storage medium.

Background

Video is a form of information carrier widely used on the internet. With the continuous development of internet technology, sharing one's life by publishing videos on social software or short video platforms has become an important form of everyday entertainment. When video clips are acquired or target videos are generated, video processing is usually performed on a terminal device through interaction modes such as gesture sliding or mouse operation. However, the display space of the terminal device is limited, so it is difficult to position the target location accurately when editing video content. The target video content can be obtained only by repeating steps such as watching, adjusting and cutting, which consumes a large amount of manpower and makes video processing inefficient.

Disclosure of Invention

A video processing method, an apparatus, an electronic device and a storage medium are provided.

According to a first aspect, there is provided a video processing method comprising: identifying roles in a video to be processed; dividing the video to be processed into a plurality of video segments according to the roles; determining the association relationship between the roles and the video segments; determining, according to the association relationship, the video segments associated with a target role as target video segments, wherein the target role is selected by a user from the roles; and generating a target video according to the target video segments.

According to a second aspect, there is provided a video processing apparatus comprising: an identification module, configured to identify roles in a video to be processed; a dividing module, configured to divide the video to be processed into a plurality of video segments according to the roles; a first determining module, configured to determine the association relationship between the roles and the video segments; a second determining module, configured to determine, according to the association relationship, the video segments associated with a target role as target video segments, wherein the target role is selected by a user from the roles; and a generating module, configured to generate a target video according to the target video segments.

According to a third aspect, there is provided an electronic device comprising: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the video processing method of the first aspect of the disclosure.

According to a fourth aspect, there is provided a non-transitory computer readable storage medium having stored thereon computer instructions for causing a computer to perform the video processing method according to the first aspect of the present disclosure.

According to a fifth aspect, there is provided a computer program product comprising a computer program which, when executed by a processor, implements the video processing method according to the first aspect of the disclosure.

It should be understood that the statements in this section do not necessarily identify key or critical features of the embodiments of the present disclosure, nor do they limit the scope of the present disclosure. Other features of the present disclosure will become apparent from the following description.

Drawings

The drawings are included to provide a better understanding of the present solution and are not to be construed as limiting the present disclosure. Wherein:

Fig. 1 is a schematic flow chart of a video processing method according to a first embodiment of the present disclosure;

Fig. 2 is a schematic flow chart of a video processing method according to a second embodiment of the present disclosure;

Fig. 3 is a schematic flow chart of a video processing method according to a third embodiment of the present disclosure;

Fig. 4 is a schematic flow chart of a video processing method according to a fourth embodiment of the present disclosure;

Fig. 5 is a block diagram of a video processing apparatus according to a first embodiment of the present disclosure;

Fig. 6 is a block diagram of a video processing apparatus according to a second embodiment of the present disclosure;

Fig. 7 is a block diagram of an electronic device used to implement methods of embodiments of the present disclosure.

Detailed Description

Exemplary embodiments of the present disclosure are described below with reference to the accompanying drawings, in which various details of the embodiments of the disclosure are included to assist understanding, and which are to be considered as merely exemplary. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

Artificial Intelligence (AI) is a technical science that studies and develops theories, methods, techniques and application systems for simulating, extending and expanding human intelligence. At present, the AI technology has the advantages of high automation degree, high accuracy and low cost, and is widely applied.

Computer vision (also known as machine vision) uses a computer and related equipment to simulate biological vision. More specifically, it uses cameras and computers in place of human eyes to identify, track and measure targets, and further processes the resulting images so that they are more suitable for human observation or for transmission to instruments for detection.

Video processing includes video recording, playing, editing, clipping, adding special effects and the like. Video processing is an important component of a video security and protection (video security for short) system. Because video security is visual, accurate, timely and rich in information, it is widely applied in many settings, such as public scenes (communities, schools, hospitals and the like) and small-scale civil scenes (families, individuals and the like).

A video processing method, an apparatus, an electronic device, and a storage medium according to embodiments of the present disclosure are described below with reference to the drawings.

Fig. 1 is a schematic flow chart of a video processing method according to a first embodiment of the present disclosure.

As shown in fig. 1, the video processing method according to the embodiment of the present disclosure may specifically include the following steps:

S101, identifying roles in the video to be processed.

Specifically, the executing body of the video processing method according to the embodiment of the present disclosure may be the video processing apparatus provided in the embodiment of the present disclosure, and the video processing apparatus may be a hardware device having a data information processing capability and/or necessary software for driving the hardware device to operate. Alternatively, the execution body may include a workstation, a server, a computer, a user terminal, and other devices. The user terminal includes, but is not limited to, a mobile phone, a computer, an intelligent voice interaction device, an intelligent household appliance, a vehicle-mounted terminal, and the like.

According to the embodiment of the disclosure, a video clip can be acquired from a local storage device or a remote server to serve as a to-be-processed video, wherein the to-be-processed video comprises a role, a scene and the like. The roles in the video to be processed are identified, and one or more roles can be included in one video to be processed.

S102, dividing the video to be processed into a plurality of video segments according to roles.

In the embodiment of the present disclosure, the video to be processed is divided into a plurality of video segments according to the identified one or more roles. For example, the video may be divided according to whether any role is present and which roles are present, yielding video segments without any role, video segments with role 1 and video segments with role 2.

S103, determining the association relationship between the roles and the video clips.

In the embodiment of the present disclosure, the association relationship between roles and video clips may take one of two forms: a role is associated with the video clips in which only that role appears, or a role is associated with the video clips of the scenes in which that role appears. In the latter case, other roles may also exist in a video clip associated with the role; for example, a video clip containing a dialog scene includes both role 1 and role 2. The embodiment of the disclosure can, in response to the selection of the user, associate each role with the corresponding video clips, thereby determining the association relationship between the roles and the video clips.

S104, determining the video clip associated with the target role as the target video clip according to the association relationship, wherein the target role is selected by the user from the roles.

In embodiments of the present disclosure, the identified roles are presented to the user so that the user can select a target role from them. For example, a picture of each role is presented to the user, and the user selects a target role by clicking on its picture. The user may select one role or a plurality of roles as the target role. According to the association relationship, the video clips associated with the target role are obtained from the plurality of video clips and are determined as the target video clips.

S105, generating a target video according to the target video clip.

In the embodiment of the disclosure, a target video is generated according to the determined target video segment, the target video includes video content corresponding to a target role selected by a user, and video content irrelevant to the target role is cut off. The user does not need to watch the clips repeatedly, and the video processing efficiency is improved.

For example, suppose two roles (role 1 and role 2) exist in a video to be processed. Performing role identification on the video yields the video segments in which each role appears, so the video is divided into video segments in which only role 1 exists, video segments in which only role 2 exists, and video segments in which role 1 and role 2 coexist. In response to the user's selection, the association relationship is determined to be that each role is associated with the video segments of the scenes in which the role appears. That is, role 1 is associated with the video segments in which only role 1 exists and the video segments in which role 1 and role 2 coexist, and role 2 is associated with the video segments in which only role 2 exists and the video segments in which role 1 and role 2 coexist. The roles in the video to be processed are displayed to the user for selecting a target role. If the user selects role 1 as the target role, the video segments in which only role 1 exists and the video segments in which role 1 and role 2 coexist, both associated with role 1, are determined as target video segments, and a target video is generated according to the target video segments.

In summary, the video processing method according to the embodiment of the present disclosure identifies roles in a video to be processed, divides the video to be processed into a plurality of video segments according to the roles, determines the association relationship between the roles and the video segments, determines the video segments associated with a target role as target video segments according to the association relationship, where the target role is selected by a user from the roles, and generates the target video according to the target video segments. Because the video to be processed is divided into video segments according to its roles, the association relationship between the roles and the video segments is determined in response to the user's selection, and the video segments associated with the target role are used to generate the target video, the user does not need to repeat steps such as watching, adjusting and cutting, which avoids consuming a large amount of manpower and improves video processing efficiency.
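The two-role example above can be sketched in a few lines of Python. This is only an illustrative sketch, not the disclosed implementation: the `Segment` type, the function names and the `include_shared` flag are hypothetical, frames are represented by integer indices, and "generating" the target video is reduced to listing the frame indices that would be kept.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    start: int         # index of the first frame (inclusive)
    end: int           # index of the last frame (inclusive)
    roles: frozenset   # roles visible in this segment


def associate(segments, include_shared=True):
    """S103 (hypothetical helper): map each role to its associated segments.

    With include_shared=False, a role is associated only with segments in
    which it appears alone.
    """
    relation = {}
    for seg in segments:
        for role in seg.roles:
            if include_shared or len(seg.roles) == 1:
                relation.setdefault(role, []).append(seg)
    return relation


def select_target_segments(relation, target_roles):
    """S104: collect the segments associated with any target role, in time order."""
    chosen = {seg for role in target_roles for seg in relation.get(role, [])}
    return sorted(chosen, key=lambda seg: seg.start)


def generate_target_video(target_segments):
    """S105 stand-in: return the frame indices that make up the target video."""
    return [i for seg in target_segments for i in range(seg.start, seg.end + 1)]


# Assumed result of S101-S102 for the example: three segments.
segments = [
    Segment(0, 2, frozenset({"role1"})),
    Segment(3, 5, frozenset({"role2"})),
    Segment(6, 8, frozenset({"role1", "role2"})),
]
relation = associate(segments)
target_segments = select_target_segments(relation, ["role1"])
frames = generate_target_video(target_segments)
```

With role 1 as the target role, `frames` covers the role-1-only segment and the shared segment, and the role-2-only segment is dropped.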

Fig. 2 is a schematic flow chart of a video processing method according to a second embodiment of the present disclosure.

As shown in fig. 2, based on the embodiment shown in fig. 1, the video processing method according to the embodiment of the disclosure may specifically include the following steps:

Step S101 may include steps S201 to S202:

S201, identifying features of the roles in the video to be processed.

In the embodiment of the present disclosure, the features of a role can be obtained by identifying a plurality of key frames corresponding to the role in the video to be processed.

S202, identifying the role according to the features.

In the embodiment of the disclosure, the role in the video to be processed is identified according to the features obtained for that role, so as to determine the video segments corresponding to the role.
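The disclosure does not specify how features are extracted or matched. As a minimal sketch under stated assumptions, suppose each key frame has already been converted into a numeric feature vector by some upstream model (not shown), a role's feature is the mean of its key-frame vectors, and a frame is matched to a role by cosine similarity against a hypothetical threshold. All names and the threshold value are illustrative.

```python
import math


def mean_vector(vectors):
    """Average a non-empty list of equal-length feature vectors."""
    count = len(vectors)
    return [sum(v[i] for v in vectors) / count for i in range(len(vectors[0]))]


def cosine(a, b):
    """Cosine similarity of two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def build_role_features(key_frame_features):
    """S201 (assumed form): one reference feature per role from its key frames."""
    return {role: mean_vector(vecs) for role, vecs in key_frame_features.items()}


def identify_role(frame_feature, role_features, threshold=0.8):
    """S202 (assumed form): best-matching role, or None if below the threshold."""
    best_role, best_score = None, threshold
    for role, reference in role_features.items():
        score = cosine(frame_feature, reference)
        if score >= best_score:
            best_role, best_score = role, score
    return best_role


# Toy two-dimensional features standing in for real key-frame embeddings.
key_frames = {
    "role1": [[1.0, 0.0], [0.9, 0.1]],
    "role2": [[0.0, 1.0], [0.1, 0.9]],
}
role_features = build_role_features(key_frames)
matched = identify_role([0.95, 0.05], role_features)
unmatched = identify_role([0.7, 0.7], role_features)
```

Here `matched` is the role whose reference feature is closest to the frame, while the ambiguous frame falls below the threshold and is left unassigned.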

S203, dividing the video to be processed into a plurality of video segments according to roles.

S204, determining the association relationship between the role and the video clip.

S205, determining the video clip associated with the target role as the target video clip according to the association relation, wherein the target role is selected by the user in the roles.

S206, generating the target video according to the target video clip.

Specifically, steps S203-S206 are the same as steps S102-S105, and are not described herein again.

In some embodiments, the video processing method according to the embodiments of the present disclosure may further include: adjusting the target video clip according to a video clip adjustment instruction of the user; and generating the target video according to the adjusted target video clip.

In the embodiment of the present disclosure, multiple target video clips may be previewed in the editing region, and a user may adjust the positions of the multiple target video clips by selecting and dragging one of the target video clips, or adjust the lengths of the target video clips by dragging the timeline of one of the target video clips. In the embodiment of the disclosure, adjustment operations such as selection, dragging, clipping and the like can be performed on the target video segment according to the video segment adjustment instruction of the user, and the target video is generated according to the adjusted target video segment.
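The two adjustments described above, reordering by dragging a clip and shortening a clip by dragging its timeline, can be sketched as follows. This is an assumption-laden illustration: clips are modelled as `(start_seconds, end_seconds)` tuples, and `move_clip` and `trim_clip` are hypothetical helper names rather than part of the disclosure.

```python
def move_clip(clips, from_index, to_index):
    """Return a new clip order with one clip dragged to a new position."""
    reordered = list(clips)
    reordered.insert(to_index, reordered.pop(from_index))
    return reordered


def trim_clip(clip, new_start=None, new_end=None):
    """Return a shortened clip; a trim can only shrink the clip, never extend it."""
    start, end = clip
    if new_start is not None:
        start = max(start, new_start)
    if new_end is not None:
        end = min(end, new_end)
    if start >= end:
        raise ValueError("trim would leave an empty clip")
    return (start, end)


# Three target clips, given as (start, end) times in the source video.
clips = [(0.0, 4.0), (10.0, 15.0), (20.0, 22.0)]
clips = move_clip(clips, 2, 0)                # drag the last clip to the front
clips[1] = trim_clip(clips[1], new_end=3.0)   # shorten the clip now in second place
```

The target video would then be generated from the adjusted `clips` list in its new order.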

On the basis of the above embodiment, as shown in fig. 3, the step S203 of dividing the video to be processed into a plurality of video segments according to roles may specifically include the following steps:

S301, determining video segmentation points in the video to be processed according to roles.

In the embodiment of the disclosure, a video segmentation point is determined according to whether a role exists in a given portion of the video to be processed. For example, if role 1 exists in the current video frame and does not exist in the next video frame, the current video frame can be used as a video segmentation point. One role may correspond to a plurality of video segmentation points.

S302, dividing the video to be processed into a plurality of video segments according to the video segmentation points.

In the embodiment of the present disclosure, the video segmentation point corresponding to the role divides the video to be processed into a plurality of video segments, and the video segments can be divided into video segments containing the role and video segments not containing the role.
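Steps S301 and S302 can be illustrated with a short sketch. It assumes that role identification has already produced, for each frame, the set of roles visible in it, and it generalizes the example in the text slightly: a frame is treated as a segmentation point whenever the set of visible roles differs in the next frame (a role disappearing or appearing). The function names are hypothetical.

```python
def find_segmentation_points(frame_roles):
    """S301: indices of frames after which the set of visible roles changes."""
    return [
        i
        for i in range(len(frame_roles) - 1)
        if frame_roles[i] != frame_roles[i + 1]
    ]


def split_at_points(frame_roles, points):
    """S302: split into (first_frame, last_frame, roles) segments at the points."""
    segments, start = [], 0
    for point in points:
        segments.append((start, point, frame_roles[start]))
        start = point + 1
    if frame_roles:
        segments.append((start, len(frame_roles) - 1, frame_roles[start]))
    return segments


# Per-frame role sets for a six-frame toy video.
frame_roles = [
    frozenset(),
    frozenset(),
    frozenset({"role1"}),
    frozenset({"role1"}),
    frozenset({"role1", "role2"}),
    frozenset({"role2"}),
]
points = find_segmentation_points(frame_roles)
segments = split_at_points(frame_roles, points)
```

Each resulting segment carries the role set it contains, so segments with and without a given role can be told apart directly.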

On the basis of the above embodiment, as shown in fig. 4, the "determining an association relationship between a character and a video clip" in step S204 may specifically include the following steps:

S401, acquiring the association mode.

The embodiment of the present disclosure provides two association modes: role association and scene association. The two association modes can be shown to the user on a display screen of the terminal device so that the user can select the required one. Alternatively, one of the association modes can be set as the default, and the roles and the video clips are associated in the default association mode when the user does not select an association mode.

S402, determining the association relation between the role and the video clip according to the association mode.

In the embodiment of the disclosure, the association relationship between the role and the video clip is determined according to the acquired association mode. As one possible implementation, in the role association mode, the video segments in which only a specific role exists are encapsulated into one video interval, so as to obtain a role-associated video interval corresponding to the role. As another possible implementation, in the scene association mode, scene recognition is performed on the divided video segments, and the video segments belonging to the scenes in which the same role appears are included in one video interval, so as to obtain a scene-associated video interval corresponding to the role. It should be noted that, in the scene association mode, the video segments associated with a role may also include other roles that are in the same scene as the role; for example, role 1 and role 2 may exist in the same video frame. Likewise, in a scene in which role 1 and role 2 converse, shot switching may cause only role 1 to exist in some video frames of the scene and only role 2 to exist in other video frames. In this case, both the video frames in which only role 1 exists and the video frames in which only role 2 exists can be taken as video frames associated with role 1 in the scene, and all of them are included in one video interval.
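The difference between the two association modes can be made concrete with a small sketch. It assumes that each divided segment already carries a scene label produced by some scene recognition step (not shown) together with the set of roles visible in it; the dictionary layout, the function names and the choice of `"role"` as the default mode are all hypothetical.

```python
def role_association(segments):
    """Role mode: a role is associated only with segments where it appears alone."""
    relation = {}
    for seg in segments:
        if len(seg["roles"]) == 1:
            (role,) = seg["roles"]
            relation.setdefault(role, []).append(seg["id"])
    return relation


def scene_association(segments):
    """Scene mode: a role is associated with every segment of each scene it appears in."""
    scene_roles, scene_segments = {}, {}
    for seg in segments:
        scene_roles.setdefault(seg["scene"], set()).update(seg["roles"])
        scene_segments.setdefault(seg["scene"], []).append(seg["id"])
    relation = {}
    for scene, roles in scene_roles.items():
        for role in roles:
            relation.setdefault(role, []).extend(scene_segments[scene])
    return relation


def associate(segments, mode=None, default_mode="role"):
    """S401-S402: use the user's mode if given, otherwise the default mode."""
    chosen = mode or default_mode
    if chosen == "role":
        return role_association(segments)
    if chosen == "scene":
        return scene_association(segments)
    raise ValueError(f"unknown association mode: {chosen}")


# A dialog scene cut into three shots, followed by a street scene.
segments = [
    {"id": 0, "scene": "dialog", "roles": {"role1"}},
    {"id": 1, "scene": "dialog", "roles": {"role2"}},
    {"id": 2, "scene": "dialog", "roles": {"role1", "role2"}},
    {"id": 3, "scene": "street", "roles": {"role2"}},
]
by_role = associate(segments)                  # no user choice: default mode
by_scene = associate(segments, mode="scene")
```

In scene mode, segment 1 (only role 2 on screen) is still associated with role 1 because it belongs to the dialog scene in which role 1 takes part, which mirrors the shot-switching case described above.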

Fig. 5 is a block diagram of a video processing apparatus according to a first embodiment of the present disclosure.

As shown in fig. 5, the video processing apparatus 500 of the embodiment of the present disclosure includes: an identification module 501, a division module 502, a first determination module 503, a second determination module 504, and a generation module 505.

The identifying module 501 is configured to identify a role in a video to be processed.

A dividing module 502, configured to divide the video to be processed into a plurality of video segments according to roles.

A first determining module 503, configured to determine an association relationship between the character and the video segment.

A second determining module 504, configured to determine, according to the association relationship, a video segment associated with the target role as a target video segment, where the target role is selected by the user from the roles.

And a generating module 505, configured to generate a target video according to the target video segment.

It should be noted that the above explanation of the embodiment of the video processing method is also applicable to the video processing apparatus in the embodiment of the present disclosure, and the detailed process is not repeated here.

In summary, the video processing apparatus according to the embodiment of the present disclosure identifies roles in a video to be processed, divides the video to be processed into a plurality of video segments according to the roles, determines the association relationship between the roles and the video segments, determines the video segments associated with a target role as target video segments according to the association relationship, where the target role is selected by a user from the roles, and generates the target video according to the target video segments. Because the video to be processed is divided into video segments according to its roles, the association relationship between the roles and the video segments is determined in response to the user's selection, and the video segments associated with the target role are used to generate the target video, the user does not need to repeat steps such as watching, adjusting and cutting, which avoids consuming a large amount of manpower and improves video processing efficiency.

Fig. 6 is a block diagram of a video processing apparatus according to a second embodiment of the present disclosure.

As shown in fig. 6, a video processing apparatus 600 according to an embodiment of the present disclosure includes: an identification module 601, a dividing module 602, a first determination module 603, a second determination module 604, and a generation module 605.

The identifying module 601 has the same structure and function as the identifying module 501 in the previous embodiment, the dividing module 602 has the same structure and function as the dividing module 502 in the previous embodiment, the first determining module 603 has the same structure and function as the first determining module 503 in the previous embodiment, the second determining module 604 has the same structure and function as the second determining module 504 in the previous embodiment, and the generating module 605 has the same structure and function as the generating module 505 in the previous embodiment.

Further, the identification module 601 includes: a first identification unit, configured to identify features of the roles in the video to be processed; and a second identification unit, configured to identify the roles according to the features.

Further, the dividing module 602 includes: a first determining unit, configured to determine video segmentation points in the video to be processed according to the roles; and a dividing unit, configured to divide the video to be processed into a plurality of video segments according to the video segmentation points.

Further, the first determining module 603 includes: an acquisition unit, configured to acquire an association mode; and a second determining unit, configured to determine the association relationship between the roles and the video clips according to the association mode.

Further, the association mode includes role association or scene association.

Further, there may be a plurality of target roles.

Further, the video processing apparatus 600 further includes: the adjusting module 606 is configured to adjust the target video segment according to the video segment adjusting instruction of the user.

In the technical solution of the present disclosure, the collection, storage, use, processing, transmission, provision, disclosure and other handling of users' personal information all comply with relevant laws and regulations and do not violate public order and good customs.

The present disclosure also provides an electronic device, a readable storage medium, and a computer program product according to embodiments of the present disclosure.

Fig. 7 illustrates a schematic block diagram of an example electronic device 700 that can be used to implement embodiments of the present disclosure. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The electronic device may also represent various forms of mobile devices, such as personal digital assistants, cellular phones, smart phones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be examples only, and are not meant to limit implementations of the disclosure described and/or claimed herein.

As shown in fig. 7, the electronic device 700 comprises a computing unit 701, which may perform various appropriate actions and processes according to a computer program stored in a Read Only Memory (ROM) 702 or a computer program loaded from a storage unit 708 into a Random Access Memory (RAM) 703. The RAM 703 can also store various programs and data required for the operation of the electronic device 700. The computing unit 701, the ROM 702, and the RAM 703 are connected to each other by a bus 704. An input/output (I/O) interface 705 is also connected to the bus 704.

A number of components in the electronic device 700 are connected to the I/O interface 705, including: an input unit 706 such as a keyboard, a mouse, or the like; an output unit 707 such as various types of displays, speakers, and the like; a storage unit 708 such as a magnetic disk, optical disk, or the like; and a communication unit 709 such as a network card, modem, wireless communication transceiver, etc. The communication unit 709 allows the electronic device 700 to exchange information/data with other devices via a computer network such as the internet and/or various telecommunication networks.

The computing unit 701 may be any of a variety of general purpose and/or special purpose processing components with processing and computing capabilities. Some examples of the computing unit 701 include, but are not limited to, a Central Processing Unit (CPU), a Graphics Processing Unit (GPU), various specialized Artificial Intelligence (AI) computing chips, various computing units running machine learning model algorithms, a Digital Signal Processor (DSP), and any suitable processor, controller, microcontroller, and so forth. The computing unit 701 executes the respective methods and processes described above, such as the video processing methods shown in fig. 1 to 4. For example, in some embodiments, the video processing method may be implemented as a computer software program tangibly embodied in a machine-readable medium, such as the storage unit 708. In some embodiments, part or all of the computer program may be loaded and/or installed onto the electronic device 700 via the ROM 702 and/or the communication unit 709. When the computer program is loaded into the RAM 703 and executed by the computing unit 701, one or more steps of the video processing method described above may be performed. Alternatively, in other embodiments, the computing unit 701 may be configured to perform the video processing method by any other suitable means (e.g., by means of firmware).

Various implementations of the systems and techniques described above may be realized in digital electronic circuitry, integrated circuitry, Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), Systems on a Chip (SOCs), Complex Programmable Logic Devices (CPLDs), computer hardware, firmware, software, and/or combinations thereof. These various implementations may include implementation in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor, which may be special or general purpose, receiving data and instructions from, and transmitting data and instructions to, a storage system, at least one input device, and at least one output device.

Program code for implementing the methods of the present disclosure may be written in any combination of one or more programming languages. These program codes may be provided to a processor or controller of a general purpose computer, special purpose computer, or other programmable data processing apparatus, such that the program codes, when executed by the processor or controller, cause the functions/operations specified in the flowchart and/or block diagram to be performed. The program code may execute entirely on the machine, partly on the machine, as a stand-alone software package partly on the machine and partly on a remote machine or entirely on the remote machine or server.

In the context of this disclosure, a machine-readable medium may be a tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. A machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.

To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and a pointing device (e.g., a mouse or a trackball) by which a user can provide input to the computer. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic, speech, or tactile input.

The systems and techniques described here can be implemented in a computing system that includes a back-end component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include Local Area Networks (LANs), Wide Area Networks (WANs), the Internet, and blockchain networks.

The computer system may include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. The server may be a cloud server, also called a cloud computing server or a cloud host, which is a host product in a cloud computing service system that addresses the high management difficulty and weak service scalability of traditional physical hosts and Virtual Private Server (VPS) services. The server may also be a server of a distributed system, or a server incorporating a blockchain.

According to an embodiment of the present disclosure, there is also provided a computer program product comprising a computer program, wherein the computer program, when executed by a processor, implements the steps of the video processing method according to the above-mentioned embodiment of the present disclosure.
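As a rough illustration only, the steps of the video processing method (identify roles, divide by role, determine the association relationship, select the segments of the target roles, generate the target video) might be sketched as follows. This is a minimal sketch under stated assumptions, not the implementation of the disclosure: role recognition is assumed to have already produced a set of role names per frame, a segment boundary is assumed wherever that set changes, and the names `Segment`, `divide_by_roles`, `associate` and `generate_target_video` are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List, Set


@dataclass(frozen=True)
class Segment:
    start: int               # index of the first frame (inclusive)
    end: int                 # index one past the last frame (exclusive)
    roles: FrozenSet[str]    # roles visible throughout the segment


def divide_by_roles(frame_roles: List[FrozenSet[str]]) -> List[Segment]:
    """Assumed rule: start a new segment whenever the set of roles changes."""
    segments: List[Segment] = []
    start = 0
    for i in range(1, len(frame_roles) + 1):
        if i == len(frame_roles) or frame_roles[i] != frame_roles[start]:
            segments.append(Segment(start, i, frame_roles[start]))
            start = i
    return segments


def associate(segments: List[Segment]) -> Dict[str, List[Segment]]:
    """Map each role to the segments in which it appears."""
    relation: Dict[str, List[Segment]] = {}
    for seg in segments:
        for role in seg.roles:
            relation.setdefault(role, []).append(seg)
    return relation


def generate_target_video(relation: Dict[str, List[Segment]],
                          target_roles: Set[str]) -> List[Segment]:
    """Collect the segments of the user-selected roles in playback order.

    The 'target video' is represented here as an ordered list of segments;
    actual cutting and encoding is outside the scope of this sketch.
    """
    chosen = {seg for role in target_roles for seg in relation.get(role, [])}
    return sorted(chosen, key=lambda seg: seg.start)
```

In this sketch the user selection is simply a set of role names passed to `generate_target_video`; a segment shared by several selected roles is included once.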

It should be understood that various forms of the flows shown above may be used, with steps reordered, added, or deleted. For example, the steps described in the present disclosure may be executed in parallel or sequentially or in different orders, and are not limited herein as long as the desired results of the technical solutions disclosed in the present disclosure can be achieved.
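To make the point about execution order concrete, the sketch below runs a per-frame role identification step once sequentially and once in parallel and obtains the same result. It is illustrative only: `identify_roles` is a hypothetical stand-in for a real recognizer, and a "frame" is assumed to be a comma-separated string of role names.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import FrozenSet, List


def identify_roles(frame: str) -> FrozenSet[str]:
    """Hypothetical recognizer: parse role names out of a toy 'frame'."""
    return frozenset(name for name in frame.split(",") if name)


def identify_sequentially(frames: List[str]) -> List[FrozenSet[str]]:
    return [identify_roles(frame) for frame in frames]


def identify_in_parallel(frames: List[str], workers: int = 4) -> List[FrozenSet[str]]:
    # Executor.map yields results in input order, so the output matches
    # the sequential version even though frames are processed concurrently.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(identify_roles, frames))
```

Because each frame is processed independently, either variant can feed the later steps without changing the outcome.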

The above detailed description should not be construed as limiting the scope of the disclosure. It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions may be made in accordance with design requirements and other factors. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present disclosure should be included in the scope of protection of the present disclosure.

Claims

1. A video processing method, comprising:

identifying roles in a video to be processed;

dividing the video to be processed into a plurality of video segments according to the roles;

determining an association relationship between the roles and the video segments;

determining, according to the association relationship, the video segments associated with a target role as target video segments, wherein the target role is selected by a user from the roles; and

generating a target video according to the target video segments.

2. The video processing method of claim 1, wherein identifying the roles in the video to be processed comprises:

identifying characteristics of the roles in the video to be processed; and

identifying the roles according to the characteristics.

3. The video processing method according to claim 1, wherein said dividing the video to be processed into a plurality of video segments according to the roles comprises:

determining video segmentation points in the video to be processed according to the roles; and

dividing the video to be processed into the plurality of video segments according to the video segmentation points.

4. The video processing method of claim 1, wherein determining the association relationship between the roles and the video segments comprises:

acquiring an association mode; and

determining the association relationship between the roles and the video segments according to the association mode.

5. The video processing method according to claim 4, wherein the association mode comprises role association or scene association.

6. The video processing method according to claim 1, wherein there are a plurality of target roles.

7. The video processing method of claim 1, further comprising:

adjusting the target video segments according to a video segment adjustment instruction from the user.

8. A video processing apparatus comprising:

an identification module configured to identify roles in a video to be processed;

a dividing module configured to divide the video to be processed into a plurality of video segments according to the roles;

a first determining module configured to determine an association relationship between the roles and the video segments;

a second determining module configured to determine, according to the association relationship, the video segments associated with a target role as target video segments, wherein the target role is selected by a user from the roles; and

a generating module configured to generate a target video according to the target video segments.

9. The video processing apparatus of claim 8, wherein the identification module comprises:

a first identification unit configured to identify characteristics of the roles in the video to be processed; and

a second identification unit configured to identify the roles according to the characteristics.

10. The video processing apparatus of claim 8, wherein the dividing module comprises:

a first determining unit configured to determine video segmentation points in the video to be processed according to the roles; and

a dividing unit configured to divide the video to be processed into the plurality of video segments according to the video segmentation points.

11. The video processing apparatus of claim 8, wherein the first determining module comprises:

an acquisition unit configured to acquire an association mode; and

a second determining unit configured to determine the association relationship between the roles and the video segments according to the association mode.

12. The video processing apparatus according to claim 11, wherein the association mode comprises role association or scene association.

13. The video processing apparatus according to claim 8, wherein there are a plurality of target roles.

14. The video processing apparatus of claim 8, further comprising:

an adjusting module configured to adjust the target video segments according to a video segment adjustment instruction from the user.

15. An electronic device, comprising:

at least one processor; and

a memory communicatively coupled to the at least one processor; wherein

the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 1-7.

16. A non-transitory computer-readable storage medium storing computer instructions for causing a computer to perform the method of any one of claims 1-7.

17. A computer program product comprising a computer program which, when executed by a processor, implements the steps of the method according to any one of claims 1-7.
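The association mode recited in claims 4, 5, 11 and 12 could be modeled, purely as an illustration, by a switch between two ways of building the role-to-segment relationship. The claims do not define the two modes in detail, so the semantics below are assumptions: role association is taken to link a role to the segments in which it appears, and scene association is taken to additionally link it to every segment sharing a scene with one of those segments. The names `AssociationMode`, `Clip` and `associate`, and the per-segment scene label, are hypothetical.

```python
from enum import Enum
from typing import Dict, FrozenSet, List, NamedTuple, Set


class AssociationMode(Enum):
    ROLE = "role"
    SCENE = "scene"


class Clip(NamedTuple):
    index: int              # position of the segment in the video
    scene: str              # hypothetical scene label for the segment
    roles: FrozenSet[str]   # roles visible in the segment


def associate(clips: List[Clip], mode: AssociationMode) -> Dict[str, List[int]]:
    """Return, for each role, the indices of its associated segments."""
    by_index = {clip.index: clip for clip in clips}

    # Role association (assumed): segments in which the role appears.
    relation: Dict[str, Set[int]] = {}
    for clip in clips:
        for role in clip.roles:
            relation.setdefault(role, set()).add(clip.index)

    # Scene association (assumed): also include every segment that shares
    # a scene with a segment in which the role appears.
    if mode is AssociationMode.SCENE:
        for indices in relation.values():
            scenes = {by_index[i].scene for i in indices}
            indices.update(c.index for c in clips if c.scene in scenes)

    return {role: sorted(indices) for role, indices in relation.items()}
```

Under these assumptions, scene association yields a superset of the segments produced by role association, which gives the user a wider set of target video segments to adjust afterwards.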