CN111416939A

CN111416939A - Video processing method, video processing equipment and computer readable storage medium

Info

Publication number: CN111416939A
Application number: CN202010239753.3A
Authority: CN
Inventors: 李立锋; 白保军; 王科; 夏伟; 杜欧杰
Original assignee: Migu Cultural Technology Co Ltd; MIGU Video Technology Co Ltd
Current assignee: Migu Cultural Technology Co Ltd; MIGU Video Technology Co Ltd
Priority date: 2020-03-30
Filing date: 2020-03-30
Publication date: 2020-07-14

Abstract

The invention provides a video processing method, video processing equipment and a computer readable storage medium, which solve the problem of poor compression flexibility of the existing video compression technology. The method of the invention comprises the following steps: receiving a first input to a video call interface, wherein the video call interface displays a first video image corresponding to first terminal equipment and a second video image corresponding to second terminal equipment which carries out video call with the first terminal equipment; in response to the first input, determining a selected target image area, the target image area being a partial area of an image acted on by the first input; acquiring a compressed image of a target image area; and obtaining a final video image according to the image acted by the first input and the compressed image. Therefore, the compression mode is more flexible by compressing the partial area in the video image, the compression requirement of the user can be met, and the video call experience of the user is improved.

Description

Video processing method, video processing equipment and computer readable storage medium

Technical Field

The embodiments of the present invention relate to the field of multimedia communication technologies, and in particular, to a video processing method, a video processing device, and a computer-readable storage medium.

Background

In recent years, video communication has become a hot spot for new application requirements because of a series of advantages such as intuitiveness and reliability. Video compression techniques have been introduced to reduce the bandwidth required for video transmission.

In the existing video compression mode, a user is usually required to open a setting, find a compression setting, select a video compression rate, and then execute video compression processing. The existing video compression technology has single compression mode and poor compression flexibility.

Disclosure of Invention

Embodiments of the present invention provide a video processing method, a device, and a computer-readable storage medium, so as to solve the problem of poor compression flexibility in the existing video compression technology.

In order to solve the above problems, the present invention is realized by:

in a first aspect, an embodiment of the present invention provides a video processing method, which is applied to a terminal device, where the terminal device is a first terminal device, and the method includes:

receiving a first input to a video call interface, wherein the video call interface displays a first video image corresponding to the first terminal device and a second video image corresponding to a second terminal device which performs video call with the first terminal device;

in response to the first input, determining a selected target image area, the target image area being a partial area of an image acted on by the first input;

acquiring a compressed image of the target image area;

and obtaining a final video image according to the image acted by the first input and the compressed image.

Optionally, the acquiring a compressed image of the target image region includes:

and compressing the image in the target image area to obtain a compressed image under the condition that the target image area is the image area of the first video image.

Optionally, the obtaining a final video image according to the image acted by the first input and the compressed image includes:

and synthesizing the image acted by the first input and the compressed image to obtain a third video image.

synthesizing the image acted by the first input and the compressed image to obtain a fourth video image;

receiving a second input to the target image area in the fourth video image;

in response to the second input, deleting the compressed image to obtain a color block area;

and repairing and completing the color block area to obtain a fifth video image.

sending a request to a network device under the condition that the target image area is the image area of the second video image, and compressing the image in the target image area by the network device;

and receiving a compressed image sent by the network equipment, wherein the compressed image is an image obtained by compressing the image in the target image area by the network equipment.

Optionally, obtaining a final video image according to the image acted by the first input and the compressed image, includes:

and receiving a sixth video image sent by the network equipment, wherein the sixth video image is obtained by synthesizing the compressed image and the image acted by the first input by the network equipment.

receiving a seventh video image sent by the network device, wherein the seventh video image is obtained by synthesizing the compressed image and the image acted by the first input by the network device;

receiving a third input to the target image area in the seventh video image;

and responding to the third input, receiving an eighth video image sent by the network equipment, wherein the eighth video image is obtained by deleting the compressed image and repairing and complementing a color block area obtained after deleting the compressed image by the network equipment.

Optionally, the determining the selected target image area in response to the first input includes:

acquiring a first position corresponding to the first input in the video call interface;

and determining the selected target image area according to the first position.

Optionally, the determining the selected target image area according to the first position includes:

determining an image area where the first object identified at the first position is located as a selected target image area; or,

and determining an image area where a second object identified at the first position is located and image areas where other objects of the same type as the second object are located as selected target image areas, wherein the second object and the other objects are located in the same video image.

Optionally, in response to the first input, after determining the selected target image area, before acquiring a compressed image of the target image area, the method further includes:

and displaying the image in the target image area with a preset display effect.

Optionally, after obtaining a final video image according to the image acted by the first input and the compressed image, the method further includes:

and sending the final video image to second terminal equipment after video coding.

According to another aspect of the present invention, there is provided a video processing method applied to a network device, including:

receiving a request sent by first terminal equipment;

responding to the request, compressing an image in a target image area to obtain a compressed image, wherein the target image area is an image area in a second video image selected by a first input to a video call interface when the first terminal device and a second terminal device carry out video call, and the second video image is a video image corresponding to the second terminal device;

obtaining a final video image according to the image acted by the first input and the compressed image;

and sending the final video image to the first terminal equipment after video coding.

and synthesizing the image acted by the first input and the compressed image to obtain a sixth video image.

synthesizing the image acted by the first input and the compressed image to obtain a seventh video image;

deleting the compressed image to obtain a color block area under the condition that the third input of the first terminal device to the target image area of the seventh video image is obtained;

and repairing and completing the color block area to obtain an eighth video image.

According to another aspect of the present invention, there is provided a terminal device including: a processor, a memory and a computer program stored on the memory and executable on the processor, the computer program, when executed by the processor, implementing the steps of the video processing method as described above.

According to still another aspect of the present invention, there is provided a network apparatus including: a processor, a memory and a computer program stored on the memory and executable on the processor, the computer program, when executed by the processor, implementing the steps of the video processing method as described above.

According to a further aspect of the present invention, a computer-readable storage medium is provided, having stored thereon a computer program which, when being executed by a processor, carries out the steps of the video processing method as set forth above.

According to the embodiment of the invention, a first input of a video call interface is received, and the video call interface displays a first video image corresponding to first terminal equipment and a second video image corresponding to second terminal equipment which carries out video call with the first terminal equipment; in response to the first input, determining a selected target image area, the target image area being a partial area of an image acted on by the first input; acquiring a compressed image of a target image area; according to the image acted by the first input and the compressed image, the final video image is obtained, so that the compression mode is more flexible by compressing partial areas in the video image, the compression requirement of a user can be met, and the video call experience of the user is improved.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the description of the embodiments of the present invention will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to these drawings without inventive exercise.

Fig. 1 is a flow chart of a video processing method according to an embodiment of the present invention;

FIG. 2 is a schematic diagram illustrating a processing procedure of a patch area after compressed image deletion according to an embodiment of the present invention;

FIG. 3 is a second flowchart of a video processing method according to an embodiment of the present invention;

FIG. 4 is a schematic structural diagram of a video processing apparatus according to an embodiment of the present invention;

fig. 5 is a schematic diagram illustrating an implementation structure of a terminal device according to an embodiment of the present invention;

fig. 6 is a second schematic structural diagram of a video processing apparatus according to an embodiment of the invention;

fig. 7 is a schematic diagram illustrating an implementation structure of a network device according to an embodiment of the present invention;

fig. 8 is a schematic structural diagram of a video processing system according to an embodiment of the present invention.

Detailed Description

In order to make the technical problems, technical solutions and advantages of the present invention more apparent, the following detailed description is given with reference to the accompanying drawings and specific embodiments. In the following description, specific details such as specific configurations and components are provided only to help the full understanding of the embodiments of the present invention. Thus, it will be apparent to those skilled in the art that various changes and modifications may be made to the embodiments described herein without departing from the scope and spirit of the invention. In addition, descriptions of well-known functions and constructions are omitted for clarity and conciseness.

It should be appreciated that reference throughout this specification to "one embodiment" or "an embodiment" means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, the appearances of the phrases "in one embodiment" or "in an embodiment" in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments.

In various embodiments of the present invention, it should be understood that the sequence numbers of the following processes do not mean the execution sequence, and the execution sequence of each process should be determined by its function and inherent logic, and should not constitute any limitation to the implementation process of the embodiments of the present invention. In addition, the terms "system" and "network" are often used interchangeably herein.

As shown in fig. 1, an embodiment of the present invention provides a video processing method, which is applied to a terminal device, where the terminal device is a first terminal device, and the method specifically includes the following steps:

step 101, receiving a first input to a video call interface, where the video call interface displays a first video image corresponding to a first terminal device and a second video image corresponding to a second terminal device performing a video call with the first terminal device;

in this step, there may be one or at least two, that is, a plurality of second terminal devices that perform a video call with the first terminal device.

Note that the first input is a preset input. Optionally, the first input may include, but is not limited to, at least one of a click input, a press input, a long press input, a pinch input, a drag input, a slide input, and a swipe input, that is, the first input may be one of the above-mentioned inputs, or may also be a combined operation of two or more of the above-mentioned inputs.

Step 102, responding to the first input, determining a selected target image area, wherein the target image area is a partial area of an image acted by the first input;

in this step, the image acted by the first input may be a first video image or a second video image.

Here, the electronic device determines the selected target image area in response to the first input to the video call interface, that is, a partial area of the image acted by the first input is selected through the first input to the video call interface by the user, which indicates that the user wants to perform corresponding compression adjustment on the partial area, that is, the target image area. In other words, it is considered that the partial area is not a display focus for the user during the video call, and the partial area may be compressed to save the transmission bandwidth.

Step 103, acquiring a compressed image of the target image area;

in this step, the compression ratio corresponding to the compressed image of the target image region exceeds a preset threshold, that is, the compression ratio corresponding to the compressed image is a high compression ratio.

In particular, the compression effect may be determined by the range of colors in the image within the target image area. Wherein the less the image color range, the higher its compression rate.

And 104, obtaining a final video image according to the image acted by the first input and the compressed image.

According to the video processing method, a first input of a video call interface is received, and the video call interface displays a first video image corresponding to first terminal equipment and a second video image corresponding to second terminal equipment which carries out video call with the first terminal equipment; in response to the first input, determining a selected target image area, the target image area being a partial area of an image acted on by the first input; acquiring a compressed image of a target image area; according to the image acted by the first input and the compressed image, the final video image is obtained, so that the compression mode is more flexible by compressing partial areas in the video image, the compression requirement of a user can be met, and the video call experience of the user is improved.

As an optional implementation manner, step 103 of the method according to the embodiment of the present invention may specifically include:

Here, when the target image area is an image area of the first video image, it is described that the target image area is an image area in the first video image captured by the first terminal device itself, and the first terminal device may perform compression processing on the locally captured video image, that is, compress an image in the target image area to obtain a compressed image.

Based on this, as an optional implementation manner, the method step 104 of the embodiment of the present invention may specifically include:

Here, the image to which the first input is applied (in the present embodiment, the first video image) and the compressed image are subjected to the combining process, and specifically, the image region in which the compressed image is present, that is, the target image region is left at the original position, and the image to which the first input is applied is subjected to the combining process. In short, compressing an image reduces the local color range in the image affected by the first input. And finally, the obtained third video image, namely the composite image, is the final video image and is presented in the video call interface.

It should be noted that, since the compressed image is a high-compression-rate image combined with the image acted on by the first input, the image becomes partially blurred and becomes sharp.

And obtaining a final video image based on the third video image.

The step may specifically include:

obtaining a final video image from the third video image;

or,

receiving a second input to the target image area in the third video image;

note that the second input is a preset input. Optionally, the second input may include, but is not limited to, at least one of a click input, a press input, a long press input, a pinch input, a drag input, a slide input, and a swipe input, that is, the second input may be one of the above-mentioned inputs, or may also be a combined operation of two or more of the above-mentioned inputs.

Optionally, the second input is the same input as the first input.

In addition, although the image in the target image area of the third video image is compressed, the target image area position is not changed, but the initial image area position, that is, the target image area is input twice, once as the first input, and the second time as the second input.

in this step, the electronic device deletes the compressed image in response to the received second input, which indicates that the compressed image is not a display focus for the user during the video call, and may delete the image in order to further reduce the bandwidth requirement for video transmission.

And repairing and completing the color block area to obtain four video images.

In this step, in order not to affect the image display effect after the compressed image is deleted, the color block region is repaired and complemented, and a fourth video image is obtained.

Optionally, if the compressed image is a person who moves in the video image, the color block region and the non-color block region in the video at the same coordinate can be found in a front-back frame comparison manner according to the image similarity comparison manner, and then filling is performed.

As shown in fig. 2, the person a moves from left to right, but the background of the person a does not change much, and the images except the region of the color block (i.e., the black region in the figure after the person a is removed) can be synthesized by referring to the coordinates of the background map by the similarity of the background, so that the rightmost complete background map in the figure is finally obtained.

here, when the target image area is an image area of the second video image, it is described that the target image area is an opposite end of a video call with the first terminal device, that is, an image area in the second video image captured by the second terminal device is displayed on a video call interface of the first electronic device through video transmission. To reduce the bandwidth requirements of video transmission, a request is sent to a network device (e.g., a server) to compress the image within the target image area. In this way, in the subsequent process of transmitting the second video image to the first terminal device through the network device, the file size is reduced due to compression, and the requirement of video transmission on the bandwidth can be reduced.

The purpose of transmitting the compression processing of the local image in the target image area of the second video image by the network device to the first network device is to enable the operator of the first network device to know the effect of the compressed image in real time.

receiving a fifth video image sent by the network device, wherein the fifth video image is obtained by synthesizing the compressed image and the image acted by the first input by the network device;

in this embodiment, the image acted by the first input is a second video image.

Here, the receiving of the fifth video image sent by the network device may specifically include:

receiving a first video stream sent by the network equipment;

and carrying out video decoding on the first video stream to obtain a fifth video image.

Here, the network device synthesizes the compressed image and the image acted by the first input to obtain a fifth video image, which is a final video image and is presented in a video call interface.

And obtaining a final video image based on the fifth video image.

The step may specifically include:

taking the fifth video image as a final video image;

or,

receiving a third input to the target image area in the fifth video image;

note that the third input is a preset input. Optionally, the third input may include, but is not limited to, at least one of a click input, a press input, a long press input, a pinch input, a drag input, a slide input, and a swipe input, that is, the third input may be one of the above-mentioned inputs, or may also be a combined operation of two or more of the above-mentioned inputs.

Optionally, the third input is the same input as the third input.

In the fifth video image, although the image in the target image area is compressed, the position of the target image area is not changed, but the position of the target image area is the initial image area position, that is, the target image area is input twice, once as the first input, and the second time as the third input.

And responding to the third input, receiving a sixth video image sent by the network equipment, wherein the sixth video image is obtained by deleting the compressed image and repairing and complementing a color block area obtained after deleting the compressed image by the network equipment.

This step of receiving the sixth video image sent by the network device may specifically include:

receiving a third video stream sent by the network equipment;

and carrying out video decoding on the third video stream to obtain a sixth video image.

As an optional implementation manner, the method step 102 in the embodiment of the present invention may specifically include:

it should be noted that, in step 101, when a first input to the video call interface is received, a video image in the current video is acquired.

Specifically, the position coordinate corresponding to the first input in the video image is obtained, that is, the first position corresponding to the first input in the video call interface is obtained.

And determining the selected target image area according to the first position.

Here, the step may specifically include:

determining an image area where the first object identified at the first position is located as a selected target image area;

in this step, a first object, such as a person or an object, corresponding to the first position is identified through an image identification technology, such as human body identification, object identification, and the like, and an image area where the first object is located is determined as a target image area, which indicates that a user wants to perform a related process on the target image area.

It should be noted that, the human body identification only needs to identify a person; object recognition requires only two-level classification of objects, i.e., recognizing vehicles, regardless of which vehicle is recognized (the first classification is vehicles).

Here, preferably, the first input is a single click input.

Alternatively, the step may specifically include:

Here, in this step, the second object corresponding to the first position and the other objects of the same type as the second object are recognized by image recognition techniques, such as human body recognition, object recognition, and the like, and the image area in which each of the recognized objects is located is determined as the selected target image area.

Preferably, the first input is a long press input.

Further, in order to facilitate the operator of the first terminal device to view the selected target image area and enhance the intuitive experience of the operator of the first terminal device, after step 102 and before step 103 of the method according to the embodiment of the present invention, the method according to the embodiment of the present invention may further include:

and displaying the image in the target image area with a preset display effect.

In one example, the special effect is displayed on the image in the target image area, including but not limited to, gradually displaying the semi-transparent layer on the image in the target image area, and then gradually disappearing; alternatively, a scan effect, i.e., a semi-transparent layer is displayed on the image in the target image area, and then scan lines are displayed up and down and then disappear.

It should be noted that, when the preset display effect disappears, the compression rate of the image in the target image area is in a high compression effect, and other areas except the target image area are in a normal display effect.

As an optional implementation manner, after the method step 104 of the embodiment of the present invention, the method may further include:

It should be noted that the final video image is either an image obtained by compressing and synthesizing the image in the local area, or an image obtained by deleting and repairing the image in the local area. Here, the final video image is video-encoded, and the purpose is to reduce the size of the video file and reduce the bandwidth requirement of video transmission.

As shown in fig. 3, an embodiment of the present invention provides a video processing method, which is applied to a network device, and specifically includes the following steps:

step 301, receiving a request sent by a first terminal device;

in this step, the network device and the first terminal device in the previous embodiment can perform information interaction.

When the network device receives a request sent by the first terminal device, it is described that a target image area selected by a first input to a video call interface of the first terminal device received by the first terminal device is an image area in a second video image, that is, the target image area is a video image shot by the second terminal device which performs a video call with the first terminal device.

Step 302, in response to the request, compressing an image in a target image area to obtain a compressed image, where the target image area is an image area in a second video image selected by a first input to a video call interface when the first terminal device and a second terminal device perform a video call, and the second video image is a video image corresponding to the second terminal device;

step 303, obtaining a final video image according to the image acted by the first input and the compressed image;

as an optional implementation manner, this step may specifically include:

and synthesizing the image acted by the first input and the compressed image to obtain a fifth video image.

The fifth video image is the fifth video image in the previous embodiment.

And obtaining a final video image based on the fifth video image.

The step may specifically include:

taking the fifth video image as a final video image;

or,

deleting the compressed image to obtain a color block area under the condition that the third input of the first terminal device to the target image area of the fifth video image is obtained;

and repairing and completing the color block area to obtain a sixth video image.

And 304, sending the final video image to the first terminal equipment after video coding.

The video processing method of the embodiment of the invention receives the request sent by the first terminal device; responding to the request, compressing the image in the target image area to obtain a compressed image, wherein the target image area is an image area in a second video image selected by a first input to a video call interface when the first terminal device and the second terminal device carry out video call, and the second video image is a video image corresponding to the second terminal device; obtaining a final video image according to the image acted by the first input and the compressed image; the final video image is sent to the first terminal device after being subjected to video coding, so that the compression mode is more flexible by compressing partial areas in the video image, the compression requirements of a user can be met, and the video call experience of the user is improved.

As shown in fig. 4, an embodiment of the present invention further provides a video processing apparatus, which is applied to a terminal device, where the electronic device is a first terminal device, and includes:

a first receiving module 401, configured to receive a first input to a video call interface, where the video call interface displays a first video image corresponding to the first terminal device and a second video image corresponding to a second terminal device that performs a video call with the first terminal device;

a region determining module 402, configured to determine, in response to the first input, a selected target image region, where the target image region is a partial region of an image affected by the first input;

a first obtaining module 403, configured to obtain a compressed image of the target image area;

a second obtaining module 404, configured to obtain a final video image according to the image acted by the first input and the compressed image.

Optionally, the first obtaining module 403 includes:

and the first acquisition unit is used for compressing the image in the target image area to obtain a compressed image under the condition that the target image area is the image area of the first video image.

Optionally, the second obtaining module 404 includes:

the second acquisition unit is used for carrying out synthesis processing on the image acted by the first input and the compressed image to obtain a third video image;

and the first processing unit is used for obtaining a final video image based on the third video image.

Optionally, the first processing unit is specifically configured to:

taking the third video image as a final video image;

or,

receiving a second input to the target image area in the third video image;

and repairing and completing the color block area to obtain a fourth video image.

Optionally, the first obtaining module 403 includes:

a first sending unit, configured to send a request to a network device when the target image area is an image area of the second video image, and compress, by the network device, an image in the target image area;

a second receiving unit, configured to receive a compressed image sent by the network device, where the compressed image is an image obtained by compressing, by the network device, an image in the target image area.

Optionally, the second obtaining module 404 includes:

a third receiving unit, configured to receive a fifth video image sent by the network device, where the fifth video image is a video image obtained by synthesizing, by the network device, the compressed image and an image acted by the first input;

and the second processing unit is used for obtaining a final video image based on the fifth video image.

Optionally, the second processing unit is specifically configured to:

taking the fifth video image as a final video image;

or,

receiving a third input to the target image area in the fifth video image;

Optionally, the area determining module 402 includes:

a third obtaining unit, configured to obtain a first position, corresponding to the first input, in the video call interface;

and the area determining unit is used for determining the selected target image area according to the first position.

Optionally, the area determining unit is specifically configured to:

It should be noted that the apparatus is an apparatus corresponding to the above-mentioned method applied to video processing, and all the implementations in the above-mentioned method embodiment are applicable to the embodiment of the apparatus, and the same technical effects can be achieved.

The video processing device receives a first input to a video call interface through a first receiving module, wherein the video call interface displays a first video image corresponding to first terminal equipment and a second video image corresponding to second terminal equipment which carries out video call with the first terminal equipment; the area determining module is used for responding to the first input and determining a selected target image area, wherein the target image area is a partial area of an image acted by the first input; a first acquisition module acquires a compressed image of a target image area; the second acquisition module acquires the final video image according to the image acted by the first input and the compressed image, so that the compression mode is more flexible by compressing partial areas in the video image, the compression requirement of a user can be met, and the video call experience of the user is improved.

As shown in fig. 5, an embodiment of the present invention further provides a terminal device, where the terminal device is a first terminal device, and the terminal device includes: a transceiver 501, a processor 502, a memory 503 and a computer program stored on the memory 502 and executable on the processor 501, the processor 501 implementing the steps of the video processing method described above when executing the computer program. Specifically, the transceiver 501 receives a first input to a video call interface, where the video call interface displays a first video image corresponding to the first terminal device and a second video image corresponding to a second terminal device performing a video call with the first terminal device, and the processor 502 is configured to determine, in response to the first input, a selected target image area, where the target image area is a partial area of an image acted on by the first input; acquiring a compressed image of the target image area; and obtaining a final video image according to the image acted by the first input and the compressed image.

Optionally, the processor 502 is further configured to:

synthesizing the image acted by the first input and the compressed image to obtain a third video image;

and obtaining a final video image based on the third video image.

Optionally, the processor 502 is further configured to:

taking the third video image as a final video image;

or,

receiving a second input to the target image area in the third video image; in response to the second input, deleting the compressed image to obtain a color block area; and repairing and completing the color block area to obtain a third video image.

Optionally, the transceiver 501 is further configured to:

and obtaining a final video image based on the fifth video image.

Optionally, the transceiver 501 is further configured to:

taking the fifth video image as a final video image;

or,

receiving a third input to the target image area in the fifth video image;

Optionally, the processor 502 is further configured to:

acquiring a first position corresponding to the first input in the video call interface; and determining the selected target image area according to the first position.

Optionally, the processor 502 is further configured to:

determining an image area where the first object identified at the first position is located as a selected target image area; or determining an image area where a second object identified at the first position is located and an image area where other objects of the same type as the second object are located as the selected target image area, wherein the second object and the other objects are located in the same video image.

The bus architecture may include any number of interconnected buses and bridges, among which are linked together by one or more processors 501, represented by processor 501, and various circuits of memory 502, represented by memory 502. The bus architecture may also link together various other circuits such as peripherals, voltage regulators, power management circuits, and the like, which are well known in the art, and therefore, will not be described any further herein. The bus interface provides an interface. The transceiver 503 may be a number of elements, including a transmitter and a transceiver, providing a means for communicating with various other apparatus over a transmission medium. The processor 501 is responsible for managing the bus architecture and general processing, and the memory 502 may store data used by the processor in performing operations.

As shown in fig. 6, an embodiment of the present invention further provides a video processing apparatus, applied to a network device, including:

a second receiving module 601, configured to receive a request sent by a first terminal device;

a processing module 602, configured to, in response to the request, compress an image in a target image area to obtain a compressed image, where the target image area is an image area in a second video image selected by a first input to a video call interface when the first terminal device and a second terminal device perform a video call, and the second video image is a video image corresponding to the second terminal device;

a third obtaining module 603, configured to obtain a final video image according to the image acted by the first input and the compressed image;

a sending module 604, configured to send the final video image to the first terminal device after video coding.

Optionally, the third obtaining module 603 includes:

a fifth obtaining unit, configured to perform synthesis processing on the image acted by the first input and the compressed image to obtain a fifth video image;

and the third processing unit is used for obtaining a final video image based on the fifth video image.

Optionally, the third processing unit is specifically configured to:

taking the fifth video image as a final video image;

or,

It should be noted that the apparatus is an apparatus corresponding to the above-mentioned method applied to video processing, and all the implementations in the above-mentioned method embodiment are applicable to the embodiment of the apparatus, and the same technical effects can be achieved. By passing

The video processing device of the embodiment of the invention receives a request sent by first terminal equipment through a second receiving module; the processing module responds to the request, and compresses the image in the target image area to obtain a compressed image, wherein the target image area is an image area in a second video image selected by a first input to a video call interface when the first terminal device and the second terminal device carry out video call, and the second video image is a video image corresponding to the second terminal device; the third acquisition module acquires a final video image according to the image acted by the first input and the compressed image; the final video image is sent to the first terminal device after being subjected to video coding, so that the compression mode is more flexible by compressing partial areas in the video image, the compression requirements of a user can be met, and the video call experience of the user is improved.

As shown in fig. 7, an embodiment of the present invention further provides a terminal device, where the terminal device is a first terminal device, and the terminal device includes: a transceiver 701, a processor 702, a memory 703 and a computer program stored on the memory 702 and executable on the processor 701, the processor 701 implementing the steps of the video processing method described above when executing the computer program. Specifically, the transceiver 701 is configured to receive a request sent by a first terminal device, and the processor 702 is configured to, in response to the request, compress an image in a target image area to obtain a compressed image, where the target image area is an image area in a second video image selected by a first input to a video call interface when a video call is performed between the first terminal device and a second terminal device, and the second video image is a video image corresponding to the second terminal device; obtaining a final video image according to the image acted by the first input and the compressed image; the transceiver 701 is configured to send the final video image to the first terminal device after video coding.

Optionally, the processor 702 is further configured to:

synthesizing the image acted by the first input and the compressed image to obtain a fifth video image;

and obtaining a final video image based on the fifth video image.

Optionally, the processor 702 is further configured to:

taking the fifth video image as a final video image;

or deleting the compressed image to obtain a color block area under the condition that the third input of the first terminal device to the target image area of the fifth video image is obtained; and repairing and completing the color block area to obtain a sixth video image.

The bus architecture may include, among other things, any number of interconnected buses and bridges, with one or more processors 701, represented by processor 701, and various circuits of memory 702, represented by memory 702, being linked together. The bus architecture may also link together various other circuits such as peripherals, voltage regulators, power management circuits, and the like, which are well known in the art, and therefore, will not be described any further herein. The bus interface provides an interface. The transceiver 703 may be a number of elements, including a transmitter and a transceiver, providing a means for communicating with various other apparatus over a transmission medium. The processor 701 is responsible for managing the bus architecture and general processing, and the memory 702 may store data used by the processor in performing operations.

Those skilled in the art will appreciate that all or part of the steps for implementing the above embodiments may be performed by hardware, or may be instructed to be performed by associated hardware by a computer program that includes instructions for performing some or all of the steps of the above methods; and the computer program may be stored in a readable storage medium, which may be any form of storage medium.

An embodiment of the present invention further provides a video processing system, as shown in fig. 8, including: the terminal comprises a first terminal device and a second terminal device which carries out video call with the first terminal.

The first terminal device is configured to receive a first input to a video call interface, where the video call interface displays a first video image corresponding to the first terminal device and a second video image corresponding to the second terminal device; in response to the first input, determining a selected target image area, the target image area being a partial area of an image acted on by the first input; compressing the image in the target image area to obtain a compressed image under the condition that the target image area is the image area of the first video image; obtaining a final video image according to the image acted by the first input and the compressed image;

and the second terminal equipment is used for receiving the final video image sent by the first terminal equipment.

It should be noted that the first terminal device performs the method steps in the embodiment shown in fig. 1.

Optionally, the video processing system may further include: and the network equipment is respectively connected with the first terminal equipment and the second terminal equipment.

The first terminal device is used for sending a request to the network device under the condition that the target image area is the image area of the second video image;

the network equipment is used for receiving a request sent by the first terminal equipment; responding to the request, compressing the image in the target image area to obtain a compressed image; obtaining a final video image according to the image acted by the first input and the compressed image; and sending the final video image to the first terminal equipment after video coding.

It should be noted that the network device performs the method steps in the embodiment shown in fig. 3.

In addition, the embodiments of the present invention also provide a computer-readable storage medium, on which a computer program is stored, and the computer program, when executed by a processor, implements the steps in the video processing method. And the same technical effect can be achieved, and in order to avoid repetition, the description is omitted.

In the several embodiments provided in the present application, it should be understood that the disclosed method and apparatus may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units is only one logical division, and other divisions may be realized in practice, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.

In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may be physically included alone, or two or more units may be integrated into one unit. The integrated unit can be realized in a form of hardware, or in a form of hardware plus a software functional unit.

The integrated unit implemented in the form of a software functional unit may be stored in a computer readable storage medium. The software functional unit is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute some steps of the transceiving method according to various embodiments of the present invention. And the aforementioned storage medium includes: various media capable of storing program codes, such as a usb disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.

While the preferred embodiments of the present invention have been described, it will be understood by those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the invention as defined in the following claims.

Claims

1. A video processing method is applied to a terminal device, wherein the terminal device is a first terminal device, and the method comprises the following steps:

acquiring a compressed image of the target image area;

2. The video processing method according to claim 1, wherein said obtaining a compressed image of the target image region comprises:

3. The method of claim 2, wherein obtaining a final video image based on the image acted on by the first input and the compressed image comprises:

and obtaining a final video image based on the third video image.

4. The video processing method according to claim 3, wherein said deriving a final video image based on the third video image comprises:

taking the third video image as a final video image;

or,

receiving a second input to the target image area in the third video image;

5. The video processing method according to claim 1, wherein said obtaining a compressed image of the target image region comprises:

6. The video processing method according to claim 5, wherein obtaining a final video image based on the image acted on by the first input and the compressed image comprises:

and obtaining a final video image based on the fifth video image.

7. The video processing method according to claim 6, wherein obtaining a final video image based on the fifth video image comprises:

taking the fifth video image as a final video image;

or,

receiving a third input to the target image area in the fifth video image;

8. The video processing method of claim 1, wherein said determining the selected target image area in response to the first input comprises:

and determining the selected target image area according to the first position.

9. The video processing method of claim 8, wherein determining the selected target image area based on the first location comprises:

10. A video processing method is applied to network equipment and is characterized by comprising the following steps:

receiving a request sent by first terminal equipment;

11. The video processing method of claim 10, wherein obtaining a final video image based on the image acted on by the first input and the compressed image comprises:

and obtaining a final video image based on the fifth video image.

12. The video processing method according to claim 11, wherein said deriving a final video image based on the fifth video image comprises:

taking the fifth video image as a final video image;

or,

13. A terminal device, comprising: processor, memory and a computer program stored on the memory and executable on the processor, which computer program, when executed by the processor, carries out the steps of the video processing method according to any one of claims 1 to 9.

14. A network device, comprising: processor, memory and a computer program stored on the memory and executable on the processor, which computer program, when executed by the processor, carries out the steps of the video processing method according to any of claims 10 to 12.

15. A video processing system, comprising: the system comprises a first terminal device and a second terminal device which carries out video call with the first terminal device;

the first terminal device is used for receiving a first input to a video call interface, and the video call interface displays a first video image corresponding to the first terminal device and a second video image corresponding to the second terminal device; in response to the first input, determining a selected target image area, the target image area being a partial area of an image acted on by the first input; compressing the image in the target image area to obtain a compressed image under the condition that the target image area is the image area of the first video image; obtaining a final video image according to the image acted by the first input and the compressed image;

16. The video processing system of claim 15, wherein the video processing system further comprises: network equipment respectively connected with the first terminal equipment and the second terminal equipment;

17. A computer-readable storage medium, characterized in that a computer program is stored thereon, which computer program, when being executed by a processor, realizes the video processing method according to one of the claims 1 to 9 or the steps of the video processing method according to one of the claims 10 to 12.