WO2020181540A1

WO2020181540A1 - Video processing method and device, encoding apparatus, and decoding apparatus

Info

Publication number: WO2020181540A1
Application number: PCT/CN2019/078050
Authority: WO
Inventors: 孟学苇; 郑萧桢; 王苫社; 马思伟
Original assignee: 北京大学; 深圳市大疆创新科技有限公司
Priority date: 2019-03-13
Filing date: 2019-03-13
Publication date: 2020-09-17
Also published as: CN111567044A

Abstract

A video processing method and device, an encoding apparatus, and a decoding apparatus. The method can comprise: determining a video type of an acquired video to be processed; if the video type of said video is a preset video type, modifying respective pixel precision values in an initial pixel precision set to obtain a target pixel precision set; and performing encoding processing on said video on the basis of the target pixel precision set to obtain an encoded video. The embodiments of the present invention can enhance encoding performance of a terminal apparatus.

Description

Video processing method, device, coding equipment and decoding equipment

Technical field

The present invention relates to the field of communication technology, and in particular to a video processing method, device, encoding device and decoding device.

Background technique

With the continuous development of the information age, in daily life, more and more users will record or store certain content by shooting video with a camera or video camera. Due to the large amount of video data obtained by shooting, the terminal device is storing Or when transmitting a video, the video content needs to be encoded, and then the encoded video is stored or transmitted. When the video needs to be displayed, the encoded video is decoded and displayed through the decoding method corresponding to the encoding process.

In the process of encoding video, one of the key technologies is inter-frame prediction. The main idea of inter-frame prediction is to obtain the predicted frame from the motion vector of the current frame and the reference frame in the video. In this process, the selection of the pixel accuracy of the motion vector is directly related to the quality of inter-frame prediction, which in turn affects the quality of video coding. Therefore, in the field of digital video coding technology, how to choose the pixel accuracy during coding processing has become a hot issue in current research.

Summary of the invention

The embodiments of the present invention provide a video processing method, device, encoding device and decoding device, which can improve the encoding performance of terminal equipment.

In the first aspect, an embodiment of the present invention provides a video processing method, including:

Determine the video type of the acquired video to be processed;

If the video type of the video to be processed is a preset video type, modify each pixel precision value in the initial pixel precision set to obtain a target pixel precision set;

Perform encoding processing on the to-be-processed video based on the target pixel accuracy set to obtain an encoded video.

In the second aspect, embodiments of the present invention provide another video processing method, including:

Receive encoded video;

When the coded video includes identification information, determining that the video type corresponding to the coded video is a preset video type;

Decoding the encoded video based on the target pixel accuracy set;

The target pixel accuracy set is obtained by modifying each pixel accuracy value in the initial pixel accuracy set.

In a third aspect, an embodiment of the present invention provides a video processing device, including a determining unit and a processing unit:

The determining unit is used to determine the video type of the acquired video to be processed;

A processing unit, configured to, if the determining unit determines that the video type of the video to be processed is a preset video type, modify each pixel precision value in the initial pixel precision set to obtain a target pixel precision set;

The processing unit is further configured to perform encoding processing on the to-be-processed video based on the target pixel accuracy set to obtain an encoded video.

In a fourth aspect, an embodiment of the present invention also provides another video processing device, including a receiving unit and a processing unit:

Receiving unit for receiving encoded video;

A processing unit, configured to determine that the video type corresponding to the coded video is a preset video type when the coded video includes identification information;

The processing unit is further configured to decode the encoded video based on the target pixel accuracy set;

In a fifth aspect, an embodiment of the present invention provides an encoding device, which is characterized by comprising a memory and a processor, the memory is connected to the processor, the memory is used to store a computer program, and the computer program includes Program instructions, the processor is configured to call the program instructions to execute the video processing method of the first aspect described above.

In a sixth aspect, an embodiment of the present invention provides a decoding device, which is characterized in that it includes a memory and a processor, the memory is connected to the processor, the memory is used to store a computer program, and the computer program includes Program instructions, the processor is configured to call the program instructions to execute the video processing method of the second aspect described above.

In a seventh aspect, an embodiment of the present invention also provides a computer storage medium, in which a first computer program instruction is stored, and when the first computer program instruction is executed by a processor, it is used to execute the first aspect. The video processing method; the computer storage medium also stores a second computer program instruction, when the second computer program instruction is executed by the processor, it is used to execute the video processing method of the second aspect.

In the embodiment of the present invention, the terminal device judges the acquired video type of the video to be processed, and if the video type of the video to be processed is a preset video type, the accuracy value of each pixel in the initial pixel accuracy set is increased , Obtain a target pixel accuracy set, and further, perform encoding processing on the video to be processed based on the target pixel accuracy set to obtain an encoded video. In the above process of encoding the video to be processed, the target pixel accuracy set used in the encoding process is determined according to the video type of the video to be processed, so that targeted pixel accuracy sets are selected for the to-be-processed videos of different video types. Can improve the quality of encoded video.

Description of the drawings

In order to explain the embodiments of the present invention or the technical solutions in the prior art more clearly, the following will briefly introduce the drawings needed in the embodiments. Obviously, the drawings in the following description are only some of the present invention. Embodiments, for those of ordinary skill in the art, without creative work, other drawings can be obtained from these drawings.

FIG. 1 is a scene diagram of drone aerial photography provided by an embodiment of the present invention;

Figure 2a is a schematic diagram of a motion estimation provided by an embodiment of the present invention;

Figure 2b is a schematic diagram of determining a motion vector provided by an embodiment of the present invention;

Figure 3a is a schematic diagram of another motion estimation provided by an embodiment of the present invention;

Figure 3b is a schematic diagram of yet another motion estimation provided by an embodiment of the present invention;

4 is a schematic flowchart of a video processing method provided by an embodiment of the present invention;

Figure 5 is a schematic diagram of an encoding system provided by an embodiment of the present invention;

Figure 6 is an interaction diagram provided by an embodiment of the present invention;

Figure 7 is a schematic structural diagram of an encoding device provided by an embodiment of the present invention;

Fig. 8 is a schematic structural diagram of a decoding device provided by an embodiment of the present invention.

detailed description

The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.

The embodiment of the present invention proposes a video processing method for the pixel precision selection problem in video encoding. The method can set the pixel precision set for encoding processing according to the video type of the video to be processed, and can improve the video encoding. Performance. Specifically, the video processing method provided by the embodiment of the present invention may include: determining the acquired video type of the to-be-processed video; if the video type of the to-be-processed video is a preset video type, adding each pixel in the initial pixel accuracy set The accuracy value is modified to obtain a target pixel accuracy set; the to-be-processed video is encoded based on the target pixel accuracy set to obtain an encoded video. In the foregoing process of encoding the video to be processed, the initial pixel accuracy set is modified accordingly according to the video type of the video to be processed to obtain a pixel accuracy set suitable for the video to be processed, which achieves targeted behavior The target pixel precision set of the to-be-processed videos of different video types can be selected to improve the quality of the encoded video.

The video processing method provided by the embodiment of the present invention can be applied to various application scenarios of video encoding transmission. The following takes the application scenario of drone aerial photography as an example to specifically introduce the video processing method. Referring to FIG. 1, which is a scene diagram of drone aerial photography provided by an embodiment of the present invention, it is assumed that FIG. 1 includes a drone 101, a camera area 102, and a display device 103. Among them, a camera device 1011 is mounted on the drone 101, and the camera device can be used to shoot videos and images. The drone 101 can also be equipped with a pan/tilt 1012, and the camera device 1011 can pass through the cloud. The station 1012 is mounted on the drone 101. The camera area 102 includes vehicles, trees, and rivers. The camera device 1011 captures the camera area 102 to obtain a video to be processed.

The initial pixel accuracy set can be set in the drone 101 by default. Each pixel accuracy in the initial pixel precision set may not be applicable to all video types. Therefore, after obtaining the video to be processed, the drone 101 is not used directly The initial pixel accuracy set encodes the video to be processed, but determines the video type of the video to be processed, and further determines whether the video type to which the video to be processed belongs is suitable for encoding using the initial pixel accuracy set. If it is determined that the video type to which the to-be-processed video belongs can be encoded using the initial pixel accuracy set, then the to-be-processed video is encoded based on the initial pixel accuracy set; if it is determined that the video to be processed belongs to If the type is not suitable for using the initial pixel accuracy set, each pixel accuracy value included in the initial pixel accuracy set is modified so that each modified pixel accuracy value is applicable to the video type to which the video to be processed belongs, and each modified pixel The precision value constitutes the target pixel precision set. Next, the drone 101 encodes the video to be processed based on the target pixel accuracy set.

Optionally, the UAV 101 sends the encoded video obtained by encoding the video to be processed to the decoding end. The decoding end described here may be configured in the UAV 101 or may be independent of the UAV 101. The decoding device, the decoding end uses the corresponding decoding strategy to decode the encoded video, and finally sends the decoded video to the display device 103. The display device 101 can be an encoding device with a display screen. After receiving the decoding sent by the decoding end When the video is displayed, the decoded video can be displayed on the display screen so that the user can watch the video.

In one embodiment, video refers to various technologies that capture, record, process, store, transmit, and reproduce a series of static images in the form of electrical signals. The original video captured by a camera or video camera or other shooting device Contains a lot of redundant information, so the amount of uncompressed video data is very large, it is difficult to store, and it is not convenient to transmit on the network. For example, the data volume of one second of digital TV video is about 1113KB. If the transmission bandwidth is 1M and the bit rate is 9123840, it takes 9 seconds to transmit one second of digital TV video, that is, the user wants to watch one second Zhong's digital TV video needs to wait up to 9 seconds, which greatly reduces the user experience. For another example, the data volume of an uncompressed 10-second video is about 2.4G. Assuming a mobile phone with 16G memory, excluding the part occupied by the system, the remaining storage space is at most 12G and can only store at most 50 seconds. video.

Therefore, in order to solve the problem of difficulty in video storage and transmission, it is necessary to compress the original video. The so-called compression processing on the original video is to remove a large amount of redundant information contained in the original video, such as temporal redundancy, visual redundancy, and spatial redundancy. The process of compressing the original video is essentially The process of video encoding. In the embodiment of the present invention, the video to be processed is the original video, and one of the key technologies in the video encoding process is inter-frame prediction. The inter-frame prediction technique utilizes the temporal correlation between adjacent frames of the video. , Use the previously coded reconstructed frame as the reference frame, and predict the current frame through motion estimation and motion compensation, thereby removing the temporal redundant information of the video. In simple terms, the theoretical basis of inter-frame prediction is that there is a certain correlation between the scenes in the adjacent frames of the moving image. When encoding, it is not necessary to transmit all the information of each frame, but only the difference between frames. OK.

Video can be regarded as composed of multiple frames of images. Encoding the video refers to encoding each frame of image included in the video. In one embodiment, when encoding any frame image in the video, the frame image is first divided into multiple coding regions, and then each coding region is divided into multiple coding units, and each coding unit includes multiple coding units. For each coding block, perform inter-frame prediction in turn. The following takes the target coding block in a certain coding unit of the current frame as an example to introduce the process of inter-frame prediction: find the reference frame corresponding to the current frame in the time domain, and the reference frame is the coded near the current frame in the time domain Any frame in the frame; search for similar blocks similar to the target coded block in the reference frame, and determine the relative position between the target coded block and the similar block (as shown in Figure 2a and Figure 2b). The relative position is called Motion Vector (MV) (for the convenience of description, the process of determining the motion vector is called motion estimation below); according to the motion vector, the related information of the motion vector and the reference frame, the prediction block corresponding to the target encoding block is obtained. A similar process can obtain the prediction block corresponding to each coding block of the current frame, so that the prediction frame of the current frame can be obtained.

In an embodiment, the above-mentioned related information of the motion vector includes the pixel accuracy (also can be understood as the pixel accuracy of the motion vector) used in the motion estimation process, the motion vector difference (MVD), etc. Among them, MVD refers to the difference between a motion vector obtained through a motion estimation process and a motion vector prediction (MVP). The MVP uses multiple adjacent coded blocks and multiple current coded blocks. MV is calculated. The larger the pixel accuracy value, the lower the accuracy of the pixel accuracy, the lower the accuracy of the motion estimation, the smaller the pixel accuracy value, the higher the accuracy of the pixel accuracy, and the higher the accuracy of motion estimation. For example, referring to Figures 3a and 3b are schematic diagrams of two kinds of motion estimation provided by the embodiments of the present invention. In the above two figures, the black dots represent the whole pixel, the white dots represent 1/2 pixel, assuming that 301 represents the current A target coding block in the frame, 302 represents a similar block in the reference frame. Assuming that the pixel precision used in FIG. 3a is the integer pixel precision, the arrow 303 represents the motion vector corresponding to the target coding block, that is, the position difference between the target coding block in the previous frame image and the current frame image. Assuming that the pixel precision used in Figure 3b is 1/2 pixel precision, 304 represents the motion vector corresponding to the target coding block. From the comparison between Figure 3a and Figure 3b, it can be seen that the motion vector represented by 304 is more accurate than the motion vector represented by 303.

Considering that the human visual system is not sensitive to some details, some motion details in a video may be encoded with low pixel accuracy, and some motion details need to be encoded with high pixel accuracy. Based on this situation In order to improve the quality of inter-frame prediction, the encoding device can use Adaptive Motion Vector Resolution (AMVR) technology to determine the pixel accuracy of the motion vector used in the inter-frame prediction process. The above description shows that AMVR technology is essentially What determines the pixel accuracy of MVD. In one embodiment, the main principle of the AMVR technology is: the encoding device can set a set of pixel precisions, and the set of pixel precisions can include at least two pixel precisions. When encoding a certain coding unit of a certain video, At this time, the corresponding pixel precision can be selected adaptively from the pixel precision set according to the characteristics of the coding unit as the pixel precision of the MVD.

It should be understood that the encoding device sets a pixel accuracy set, and when encoding a video, selects an appropriate pixel accuracy for each coding unit from the pixel accuracy set for encoding, so as to ensure that while removing visual redundancy in the video, It also reduces the amount of data processed by the encoding device and saves some terminal power consumption. For example, the pixel accuracy set may be (integer pixel accuracy, 1/2 pixel accuracy, 1/4 pixel accuracy), or the pixel accuracy set may also be (integer pixel accuracy, 4 pixel accuracy, and 1/4 pixel accuracy).

In an embodiment, the characteristics of the video content under different video types are different, and the characteristics of the video content are different, resulting in different sets of pixel accuracy used when encoding the video using the AMVR technology. Generally, video types can include natural video and screen content video. Natural video refers to a video that is obtained by shooting certain scenes through a camera device without other processing; screen content video generally refers to the video displayed on the screen of an encoding device Content, mainly including computer screens, TV screens, mobile phone screens and other content video. This type of video includes not only some natural images, but also some visual content generated by computers such as text, graphics, animation, and games. It is a kind of video formed by a mixture of natural and artificial images. Compared with natural videos, screen content videos often have steep edges, high-purity colors, strong contrasts, etc., as well as more regular and simple motion information.

Due to the difference between natural video and screen content video, several pixel precisions in this pixel precision set may not be suitable for encoding screen content video or natural video. Therefore, this application sets the above pixel precision set by the encoding device The set (for example, the integer pixel precision, 1/2 pixel precision and 1/4 pixel precision as mentioned above) is called the initial pixel precision set. When the screen content video is encoded, the precision value of each pixel in the initial pixel precision set will be calculated Make modifications to obtain the target pixel accuracy set suitable for the screen content video. Similarly, the initial pixel accuracy set set by the encoding device may be suitable for screen content videos. If the video to be processed is a natural video, it is also necessary to modify the accuracy of each pixel in the initial pixel accuracy set to obtain a target suitable for natural video. Pixel accuracy collection.

Refer to FIG. 4, which is a video processing method provided by an embodiment of the present invention. The video processing method can be used in any encoding device capable of implementing an encoding function, and the video processing method can be specifically executed by a processor of the encoding device. The video processing method may include the following steps:

Step S401: The encoding device determines the video type of the acquired video to be processed.

In an embodiment, the to-be-processed video acquired by the encoding device may be obtained by photographing the camera object through a camera device configured on the encoding device, or the to-be-processed video may also be an independent camera device that captures the camera. The subject is photographed and sent to the encoding device.

In one embodiment, the video formats obtained by shooting the camera object by different camera devices may also be different, so a video may include multiple video formats, such as avi, mp4, mts, and mp3. Optionally, the video format of the to-be-processed video acquired by the encoding device may be any one of the foregoing video formats.

As a feasible implementation, the video can be classified into natural video and screen content video according to the way the video content is generated. Natural video can refer to a video directly shot by a camera or a video camera, that is to say, the natural video includes multiple frames Natural images, for example, daily small videos shot by mobile phones; screen content videos generally refer to content displayed on the screen of an encoding device. The encoding devices mentioned here may mainly include encoding devices such as computers, televisions, and mobile phones. Specifically, the screen content video includes not only some natural images, but also some computer-generated visual content such as text, graphics, animation, or games. The screen content video is a mixture of natural videos and artificial images. video. For example, a movie, or an animation added to a presentation through the computer. In the embodiment of the present invention, the video type of the to-be-processed video acquired by the encoding device may be any one of natural video or screen content video.

It should be understood that the above classification of videos is only a feasible video classification method listed in the embodiment of the present invention. Other classification criteria can also be used to classify video types. For example, videos can be classified into long videos and videos based on the duration of the video content. Short video.

From the description of the embodiments in Figure 2a, Figure 2b, and Figure 3a and Figure 3b, it can be seen that when the encoding device encodes videos of different video types, if the corresponding pixel precision sets are set for different video types in a targeted manner, you can While improving the quality of encoded video, it also saves the power consumption of some terminals. Therefore, in the embodiment of the present invention, after the video type of the video to be processed is determined, a suitable pixel precision set for the video to be processed is selected, and then the video to be processed is encoded based on the selected pixel precision set. In an embodiment, the encoding device may select a suitable pixel accuracy set for the video to be processed through step S402.

It should be understood that the to-be-processed video is composed of multiple frames of images. When the to-be-processed video is encoded, each frame of the multiple-frame images is processed. Therefore, the following determination Obtaining the video type of the to-be-processed video is essentially determining the video type of the current frame of the to-be-processed video being processed.

In an embodiment, the implementation of step S401 may include: determining a hash value corresponding to the video to be processed; if the hash value is not greater than a threshold, determining that the video type of the video to be processed is a pre-processing Set the video type; if the hash value is greater than the threshold, it is determined that the video type of the video to be processed is not the preset video type. That is, the encoding device can determine the video type to which the video to be processed belongs according to the hash value corresponding to the video to be processed. Wherein, the threshold is a preset value used to determine the video type, and the value can be set by the encoding device. In the embodiment of the present invention, the preset video type may be set by the encoding device, and the preset video type may include any one or more of screen content video or natural video. In other embodiments, the preset video type It is assumed that the video type may also include any one of a long video or a short video.

Suppose that after the encoding device obtains the video to be processed, before starting to encode the target frame of the video to be processed, it first needs to calculate the hash value of the target frame. If the hash value of the target frame is less than or equal to the threshold, the target frame is determined Is the preset video type; if the hash value of the target frame is greater than the threshold, it is determined that the target frame is not the preset video type. Further, according to the result of the judgment, a set of pixel accuracy required for encoding processing is selected.

In another embodiment, the implementation of step S401 may further include: calling a video type recognition model to recognize the to-be-processed video to obtain a recognition result; if the video type indicated by the recognition result is a preset video Type, it is determined that the video type of the to-be-processed video is the preset video type. That is to say, the encoding device may store a video type recognition model. The video type recognition mode is obtained through training of video samples containing different video types. The encoding device calls the model to identify the video to be processed and obtains the recognition result. The recognition result may include the probability that the video to be processed belongs to a certain video type, and the video type with a higher probability is determined as the video type of the video to be processed. For example, suppose that the encoding device calls the video type recognition model to recognize the video to be processed, and the obtained recognition result can be 30% of natural video and 70% of screen content video. According to the recognition result, the video to be processed is determined to be screen content video. .

Step S402: If the video type of the to-be-processed video is a preset video type, increase the accuracy value of each pixel in the initial pixel accuracy set to obtain the target pixel accuracy set.

In an embodiment, the initial pixel accuracy set may be the default pixel accuracy set of the encoding device, or may be the pixel accuracy set used by the encoding device in the encoding process of the previous frame. Assuming that the initial pixel accuracy set is the default pixel accuracy set of the encoding device, the encoding device may set the initial pixel accuracy set by: determining the initial pixel accuracy set according to the pixel accuracy set used in historical video encoding processing, for example, encoding The device obtains the pixel accuracy set used in the last 5 video processing, and the pixel accuracy set used in the 4 video encoding processing is the same (1/2 pixel accuracy, 1/4 pixel accuracy, integer pixel accuracy), then Determine (1/2 pixel accuracy, 1/4 pixel accuracy, integer pixel accuracy) as the default initial pixel accuracy. In other embodiments, the initial pixel accuracy may also be set by the encoding device according to the acquired setting operation. Before the encoding device performs encoding processing, the user can set the initial pixel accuracy set during encoding through the user interface of the encoding device. , Or the user can also perform some other coding-related configuration operations through the user interface.

In one embodiment, in the embodiment of the present invention, if it is determined in step S401 that the video type of the video to be processed is the preset video type, the initial pixel accuracy set currently set in the encoding device is acquired, if the initial pixel The accuracy of each pixel included in the accuracy set meets the pixel accuracy requirement when encoding the video of the preset video type, then the initial pixel accuracy set can be used directly to encode the image to be processed; if the initial pixel accuracy Each pixel accuracy included in the set has one or more pixel accuracy, which does not meet the pixel accuracy requirements when encoding the video of the preset video type, then the corresponding pixel accuracy value is modified, and finally the modified The initial pixel accuracy set of is used as the target pixel accuracy set for encoding the to-be-processed video. Wherein, the modification of the pixel accuracy value may include increase or decrease.

In one embodiment, the method of modifying each pixel precision value in the initial pixel precision set to obtain the target pixel precision set may be: determining the pixel precision value adjustment rule according to the preset video type; The value adjustment rule modifies each pixel precision value included in the initial pixel precision set to obtain the target pixel precision set. Wherein, in the embodiment of the present invention, the implementation manner of determining the pixel accuracy value adjustment rule according to the preset video type may be to determine the pixel accuracy value adjustment rule according to the motion rule of the preset video type and the video content of the video to be processed. For example, the pixel accuracy value adjustment rule is: the difference between the modified pixel accuracy value and the corresponding pixel accuracy value before the modification is less than or equal to 7 pixels. For another example, the adjustment rule can also be set as: the difference between the modified pixel accuracy and the corresponding pixel accuracy value before the modification is less than or equal to 1/2 pixel accuracy, etc. It should be understood that the foregoing is only a method for modifying the pixel accuracy value listed in the embodiment of the present invention, and the specific modification method is not specifically limited in the embodiment of the present invention.

In one embodiment, even for the same piece of video, the set of pixel precisions required when encoding it with different encoding modes is different. In the embodiment of the present invention, it is assumed that the coding mode includes the first type of coding mode and the second type of coding mode. The first type of coding mode may refer to any one of the inter coding mode and the affine affine coding mode. The second type of coding mode refers to the other of the inter coding mode and the affine coding mode. Corresponding to two encoding modes, the initial pixel accuracy set may include a first initial pixel accuracy set and a second initial pixel accuracy set. Wherein, the initial pixel accuracy set corresponding to the first type of encoding mode may be the first initial pixel accuracy set, and the corresponding initial pixel accuracy set in the second type of encoding mode may be the second initial pixel accuracy set. In other embodiments, the initial pixel accuracy set corresponding to the first type of encoding mode may also be the second initial pixel accuracy set, and the initial pixel accuracy set corresponding to the second type of encoding mode may also be the first initial pixel accuracy set.

In one embodiment, the main difference between the inter coding mode and the affine coding mode is that the inter coding mode only pays attention to the motion information of translational motion in the video, and the affine coding mode pays attention to more motion information, such as zoom, rotation, and perspective. Irregular sports such as sports. From the foregoing description, it can be seen that when using inter coding mode for inter-frame prediction, the processing object of inter-frame prediction is a certain coding block in an image, and the processing object of affine coding mode is no longer the entire coding block, but The entire coding block is divided into multiple coding sub-blocks, and each coding sub-block is used as a processing object. In this way, each coding sub-block in affine coding mode corresponds to a motion vector, and the motion vectors corresponding to multiple coding sub-blocks form the motion vector field in affine coding mode. Motion compensation in affine coding mode refers to the use of motion The vector field and the reference frame get the predicted frame. In an embodiment, the motion vector of each coding sub-block included in each coding block in the affine coding mode may be calculated through the parameters of the control points on the coding block. Generally, in the affine coding mode, the number of control points on each coding block can be two or three. As to how to calculate the motion vector of each code block corresponding to each code block in the affine coding mode through the parameter calculation of the control point, the relevant method in the prior art can be used, and it will not be repeated here.

In one embodiment, it can be seen from the above description that after determining that the video type of the video to be processed is a preset video type, before determining the corresponding target pixel accuracy set for the video to be processed, it is first necessary to determine The encoding mode used in the video encoding process, and then further select the initial pixel accuracy set to be adjusted according to the encoding mode, and finally increase the accuracy of each pixel in the initial pixel accuracy set to be adjusted to obtain the target pixel accuracy set.

Specifically, the initial pixel accuracy set includes a first initial pixel accuracy set and a second initial pixel accuracy set, the target pixel accuracy set includes a first target pixel accuracy set and a second target pixel accuracy set, and the initial pixel accuracy set The precision value of each pixel in the precision set is increased to obtain the target pixel precision set, including: acquiring the coding mode used when encoding the video to be processed; if the coding mode is the first type of coding mode, then Each pixel accuracy value in the first initial pixel accuracy set is modified to obtain the first target pixel accuracy set; if the encoding mode is the second type of encoding mode, each pixel accuracy value in the second initial pixel accuracy set is performed Modify to obtain the second target pixel accuracy set.

In an embodiment, the encoding mode used when encoding the video to be processed may be selected by the encoding device according to the motion information included in the video to be processed, specifically, if the encoding device determines that If the video to be processed includes multiple motion information such as rotation, translation, zooming, etc., the first type of encoding mode can be selected to encode the video to be processed; if the encoding device determines that the video to be processed only includes translation motion information , The second type of encoding mode can be selected to encode the to-be-processed video. In other embodiments, the encoding mode used when encoding the to-be-processed video may also be determined by the encoding device according to a setting operation input by the user on the user interface.

For example, assuming that the encoding device is set in the first encoding mode, the first initial pixel accuracy set is (1/2 pixel accuracy, integer pixel accuracy, 1/4 pixel accuracy); in the second encoding mode, the second The initial pixel accuracy set is (1/4 pixel accuracy, 1/8 pixel accuracy, 1/16 pixel accuracy). Assuming that the encoding device obtains a piece of video to be processed, and determines that the video type of the to-be-processed video is a preset video type, further, the encoding device determines the encoding mode required for encoding the to-be-processed video, if it is determined If the coding mode is the first type of coding mode, each pixel precision value in the first initial pixel precision set (integer pixel precision, 1/2 pixel precision, 1/4 pixel precision) is modified to obtain the first target pixel precision The set can be expressed as (integer pixel accuracy, 4 pixel accuracy, and 8 pixel accuracy); if the encoding mode is determined to be the second type of encoding mode, the second initial pixel accuracy set (1/4 pixel accuracy, 1/8 Each pixel precision value in the pixel precision, 1/16 pixel precision) is modified to obtain the second target pixel precision set, which can be expressed as (1/2 pixel precision, integer pixel precision, 2 pixel precision).

Step S403: Perform encoding processing on the to-be-processed video based on the target pixel accuracy set to obtain an encoded video.

In one embodiment, after the target pixel accuracy set corresponding to the video to be processed is determined in step S402, the to-be-processed video is inter-predicted based on the target pixel accuracy set, and then the to-be-processed video is inter-predicted. Process the video and perform other encoding processing, such as transformation, quantization, entropy encoding, etc., to obtain an encoded video. Wherein, the performing inter-frame prediction on the to-be-processed video based on the target pixel accuracy set is essentially performing inter-frame prediction on each frame of the to-be-processed video.

In one embodiment, after encoding the processed video to obtain the encoded video, the encoded video can be transmitted to the decoding end in the form of a bit stream, and the decoding end can decode the encoded video, and the decoding end can decode the encoded video. The obtained video is transmitted to the display device and displayed by the display device. After the above-mentioned video to be processed is encoded, decoded, and displayed, some redundant information included in the video to be processed is eliminated, which greatly reduces the data volume of the video to be processed, improves the video transmission efficiency, and also improves the user The viewing experience. For example, suppose a one-second digital TV video is transmitted through 1M transmission broadband. Without encoding processing, it needs to be transmitted for 9 seconds. That is, the user needs to wait for 9 seconds to watch a one-second video. Digital TV video; after the video processing method of the embodiment of the present invention is used for processing, it may only need to be transmitted for 1 second from time to time.

In one embodiment, after encoding each frame of the target video to be processed based on the target pixel accuracy, each frame of image is added with identification information that can identify the video type to which the frame of image belongs, and the to-be-processed After each frame image in the video is processed as described above, an encoded video is obtained. The encoded video includes identification information. The encoded video and identification information are sent to the decoding end, so that the decoding end decodes according to the encoding situation of each frame image . Specifically, the encoding process on the to-be-processed video based on the target pixel accuracy set to obtain the encoded video includes: adding identification information to the encoded video; and sending the encoded video with the identification information added to the decoding At the end, the identification information is used to instruct the decoding end to decode the encoded video based on the identification information.

In one embodiment, after determining the target pixel accuracy set corresponding to the video to be processed, the encoding device may set an index mark for each pixel accuracy included in the target pixel accuracy set. In this way, the encoding device can use the index mark You can know the pixel accuracy used when encoding a certain frame of image. Among them, the index identification of each pixel accuracy setting included in the target pixel accuracy set may be: assuming that the target pixel accuracy set is (integer pixel accuracy, 4 pixel accuracy, 8 pixel accuracy), the index identification is: 0 means integer pixel accuracy, 00 Represents 4-pixel accuracy, and 01 represents 8-pixel accuracy.

After encoding the video to be processed to obtain the encoded video, the encoding device may send the index identifier and the encoded video together to the decoder, where the index identifier is used to instruct the decoding end to decode the encoded video based on the index identifier. Specifically, the encoding process of the to-be-processed video based on the target pixel accuracy set to obtain the encoded video includes: setting an index identifier for each pixel accuracy included in the target pixel accuracy set; determining that the to-be-processed video The target pixel precision in the target pixel precision set used in encoding processing, and the index mark corresponding to the target pixel precision; add the index mark to the encoded video, and add the code of the index mark The video is sent to the decoding terminal, and the index identifier is used to instruct the decoding terminal to decode the encoded video based on the index identifier.

In the embodiment of the present invention, the encoding device judges the acquired video type of the to-be-processed video, and if the video type of the to-be-processed video is a preset video type, the accuracy value of each pixel in the initial pixel accuracy set is increased , Obtain a target pixel accuracy set, and further, perform encoding processing on the video to be processed based on the target pixel accuracy set to obtain an encoded video. In the above process of encoding the video to be processed, the target pixel accuracy set used in the encoding process is determined according to the video type of the video to be processed, so that targeted pixel accuracy sets are selected for the to-be-processed videos of different video types. Can improve the quality of encoded video.

5, it is an encoding system provided by an embodiment of the present invention. The encoding system may include an encoding terminal 501 and a decoding terminal 502. The encoding terminal 501 and the decoding terminal 502 may be configured in the same terminal device, Or the encoding end 501 and the decoding end 502 may also be two independent devices. In the encoding system shown in FIG. 5, the encoding terminal 501 is used to compress and encode the video to be processed using a suitable pixel accuracy set to reduce redundant information included in the video to be processed, and the encoding terminal 501 will compress and encode the video to be processed. The encoded video obtained after the encoding process is sent to the decoding end, and the decoding end 502 uses the pixel precision set corresponding to the encoding end and other encoding information to decode the encoded video.

Referring to FIG. 6, which is an interaction diagram provided by an embodiment of the present invention, the interaction flow between the encoding end 501 and the decoding end 502 in FIG. 5 when encoding and decoding a video is described below in conjunction with FIG. 6. In one embodiment, after obtaining the video to be processed in step S601, the encoding terminal 501 determines the video type of the video to be processed. According to the video type of the video to be processed, the target pixel accuracy set required for encoding the video to be processed is selected. Specifically, if the encoding terminal 501 determines in step S602 that the video type of the video to be processed is the preset video type, each pixel accuracy in the initial pixel accuracy set stored in the encoding terminal 501 is modified to obtain the target pixel accuracy set. Further, in step S603, the to-be-processed video is encoded based on the target pixel accuracy set; if the encoding end 501 determines that the video type of the to-be-processed video is not a preset video type, the initial pixel stored in the decoding end 501 is used for processing. The collection performs encoding processing on the to-be-processed video.

Optionally, the preset video type described in step S602 may include any one or more of screen content video and natural video, or the preset video type may also include any one of long video and short video. Or the preset video type may also be other video types, and the embodiment of the present invention does not limit the preset video type. In an embodiment, for the method for the encoding terminal 501 to determine the video type of the video to be processed, refer to the description of the related content in the embodiment of FIG. 4, which will not be repeated here.

The encoding mode of the encoding terminal 501 for encoding the video to be processed can be the first type of encoding mode and the second type of encoding mode. The first type of encoding mode may include any one of the inter encoding mode and the affine encoding mode. The coding mode may include one of inter coding mode and affine coding mode. In different encoding modes, the initial pixel accuracy sets corresponding to the encoding end 501 are different. Specifically, the initial pixel accuracy set corresponding to the first type of encoding mode may be the first initial pixel accuracy set, and the initial pixel accuracy set corresponding to the second type of encoding mode may be the second initial pixel accuracy set. After the encoding end 501 determines that the video type of the video to be processed is the preset video type, before determining the target pixel accuracy set, the encoding end 501 also needs to determine the encoding mode: if the encoding mode is the first type encoding mode, then encode The end 501 adjusts the precision values of each pixel in the first initial pixel accuracy set to obtain the first target pixel accuracy set; if the encoding mode is the second type of encoding mode, the encoding end 501 will adjust each pixel in the second initial pixel accuracy set The precision value is adjusted to obtain the second target pixel precision set.

In the encoding system described in FIG. 5, the encoding terminal 501 performs encoding processing on the video to be processed based on the target pixel accuracy set in step S603, and after obtaining the encoded video, it can also add identification information to the encoded video in step S604, and add the identification The encoded video of the information is sent to the decoding terminal 502. Wherein, the identification information refers to information used to identify that the video type corresponding to the encoded video is a preset video type. In other words, if the decoding terminal 502 detects that the encoded video includes this identification information, it can determine that the encoded video corresponds to the The video type is the preset video type.

The decoding terminal 502 receives the encoded video sent by the encoding terminal 501 in step S605, and can extract the information included in the encoded video. If it is determined in step S606 that the encoded video includes identification information, then the received encoded video is determined The corresponding video type is the preset video type; further, the decoding end 502 can obtain the target pixel accuracy set set by the encoding end 501 for the preset video type, and decode the encoded video based on the target pixel accuracy set through step S607 deal with.

In one embodiment, after the encoding terminal 501 performs step S603 to obtain the encoded video, in addition to adding identification information to the encoded video, it can also add an index identifier to the encoded video. The index identifier is used to identify the video to be processed by the encoding terminal 501 The target pixel accuracy used in the encoding process, and the target pixel accuracy belongs to any one of the target pixel accuracy set. It is understandable that the video to be processed is composed of multiple frames of images. Therefore, the video encoding process of the video to be processed is also processed in units of frames. After encoding each frame of image, it will include each frame in the form of code stream. The encoded video of the frame image encoding result is sent to the decoding end. Therefore, the foregoing target pixel accuracy used when encoding the video to be processed is essentially for each frame of the video to be processed.

The encoding end 501 sends the encoded video with the index mark to the decoding end 502, and the decoding end 502 determines the target pixel accuracy identified by the index mark from the target pixel accuracy set; The encoded video is decoded. After the decoding end 502 decodes the encoded video based on the target accuracy, a video with a small amount of data that can be displayed or stored is obtained.

In the encoding system provided by the embodiment of the present invention, the encoding terminal 501 sets the corresponding target pixel accuracy set according to the video type of the video to be processed to encode the video to be processed, which improves the quality of video encoding. The identification information included in the transmitted coded video can accurately select the target pixel accuracy set, and decode the coded video, which can ensure that the video content is not damaged and improve the decoding quality.

Referring to FIG. 7, it is a schematic structural diagram of an encoding device provided by an embodiment of the present invention. The encoding device as described in FIG. 7 may include: a memory 701 and a processor 702, where the memory 701 and the processor 702 are connected through a bus 703, and the memory Program codes are stored in 701, and the memory 702 calls the program codes in the memory 701.

The memory 701 may include volatile memory (volatile memory), such as random-access memory (RAM); the memory 701 may also include non-volatile memory (non-volatile memory), such as flash memory (flash memory), solid-state drive (SSD), etc.; the memory 701 may also include a combination of the foregoing types of memories.

The processor 702 may be a central processing unit (Central Processing Unit, CPU). The processor 702 may further include a hardware chip. The aforementioned hardware chip may be an application-specific integrated circuit (ASIC), a programmable logic device (PLD), etc. The PLD may be a field-programmable gate array (FPGA), a general array logic (generic array logic, GAL), etc. The processor 702 may also be a combination of the foregoing structures.

In the embodiment of the present invention, the memory 701 is used to store a computer program, and the computer program includes program instructions. The processor 702 is used to execute the program instructions stored in the memory 701 to implement the above-mentioned embodiment shown in FIG. 4 The steps of the corresponding method.

In one embodiment, the processor 702 is configured to execute when the program instructions are called: determine the acquired video type of the to-be-processed video; if the video type of the to-be-processed video is a preset video type, the initial Each pixel precision value in the pixel precision set is modified to obtain a target pixel precision set; the to-be-processed video is encoded based on the target pixel precision set to obtain an encoded video.

In one embodiment, when determining the video type of the acquired video to be processed, the processor 702 performs the following operations: determining the hash value corresponding to the video to be processed; if the hash value is not greater than the threshold, It is determined that the video type of the video to be processed is a preset video type; if the hash value is greater than the threshold, it is determined that the video type of the video to be processed is not the preset video type.

In one embodiment, when the processor 702 determines the acquired video type of the to-be-processed video, it performs the following operations: calling the video type recognition model to recognize the to-be-processed video and obtain the recognition result; if the recognition is The result indicates that the video type is the preset video type, and it is determined that the video type of the to-be-processed video is the preset video type.

In one embodiment, the initial pixel accuracy set includes a first initial pixel accuracy set and a second initial pixel accuracy set, and the target pixel accuracy set includes a first target pixel accuracy set and a second target pixel accuracy set. When the processor modifies each pixel precision value in the initial pixel precision set to obtain the target pixel precision set, it performs the following operations: obtains the encoding mode used when encoding the video to be processed; if the encoding mode is the first For the first type of encoding mode, each pixel accuracy value in the first initial pixel accuracy set is modified to obtain the first target pixel accuracy set; if the encoding mode is the second type of encoding mode, the second initial Each pixel precision value in the pixel precision set is modified to obtain a second target pixel precision set.

In an embodiment, the first type of coding mode may include any one of an inter coding mode and an affine coding mode, and the second type of coding mode may include an inter coding mode and the affine coding mode. Another encoding mode.

In one embodiment, when the processor 702 encodes the to-be-processed video based on the target pixel accuracy set to obtain an encoded video, it performs the following operations: add identification information to the encoded video; add The encoded video of the identification information is sent to the decoding end, and the identification information is used to instruct the decoding end to decode the encoded video based on the identification information.

In one embodiment, when the processor 702 encodes the to-be-processed video based on the target pixel accuracy set to obtain an encoded video, it performs the following operations: set the accuracy of each pixel included in the target pixel accuracy set Index identification; determining the target pixel accuracy in the target pixel accuracy set used when encoding the video to be processed, and the index identification corresponding to the target pixel accuracy; adding the index identification to the encoded video, And send the coded video to which the index identifier is added to the decoding terminal, where the index identifier is used to instruct the decoding terminal to decode the coded video based on the index identifier.

Referring to FIG. 8, it is a schematic structural diagram of a decoding device provided by an embodiment of the present invention. The decoding device described in FIG. 8 may include: a memory 801 and a processor 802. The memory 801 and the processor 802 are connected through a bus 803. Program codes are stored in 801, and the memory 802 calls the program codes in the memory 801.

The memory 801 may include volatile memory (volatile memory), such as random-access memory (RAM); the memory 801 may also include non-volatile memory (non-volatile memory), such as flash memory (flash memory), solid-state drive (solid-state drive, SSD), etc.; the memory 801 may also include a combination of the foregoing types of memories.

The processor 802 may be a central processing unit (Central Processing Unit, CPU). The processor 802 may further include a hardware chip. The aforementioned hardware chip may be an application-specific integrated circuit (ASIC), a programmable logic device (PLD), etc. The PLD may be a field-programmable gate array (FPGA), a general array logic (generic array logic, GAL), etc. The processor 802 may also be a combination of the foregoing structures.

In the embodiment of the present invention, the memory 801 is used to store a computer program, and the computer program includes program instructions, and the processor 802 is used to execute the program instructions stored in the memory 801.

In one embodiment, the processor 802 is configured to call the program instructions to execute: receive an encoded video; when the encoded video includes identification information, determine that the video type corresponding to the encoded video is a preset video type The coded video is decoded based on the target pixel accuracy set; the target pixel accuracy set is obtained by modifying each pixel accuracy value in the initial pixel accuracy set.

In one embodiment, the coded video further includes an index identifier, and the processor 802 performs the following operations when decoding the coded video based on the target pixel accuracy set: From the target pixel accuracy set, Determine the target pixel accuracy identified by the index identifier; and perform decoding processing on the encoded video based on the target pixel accuracy.

A person of ordinary skill in the art can understand that all or part of the processes in the above-mentioned embodiment methods can be implemented by instructing relevant hardware through a computer program. The program can be stored in a computer readable storage medium. During execution, it may include the procedures of the above-mentioned method embodiments. Wherein, the storage medium may be a magnetic disk, an optical disc, a read-only memory (Read-Only Memory, ROM), or a random access memory (Random Access Memory, RAM), etc.

The above-disclosed are only some embodiments of the present invention, which of course cannot be used to limit the scope of rights of the present invention. Therefore, equivalent changes made according to the claims of the present invention still fall within the scope of the present invention.

Claims

A video processing method, characterized by comprising:

Determine the video type of the acquired video to be processed;

If the video type of the video to be processed is a preset video type, modify each pixel precision value in the initial pixel precision set to obtain a target pixel precision set;

Perform encoding processing on the to-be-processed video based on the target pixel accuracy set to obtain an encoded video.
The method of claim 1, wherein the determining the acquired video type of the to-be-processed video comprises:

Determine the hash value corresponding to the video to be processed;

If the hash value is not greater than the threshold, determining that the video type of the video to be processed is a preset video type;

If the hash value is greater than the threshold, it is determined that the video type of the video to be processed is not the preset video type.
The method of claim 1, wherein the determining the acquired video type of the to-be-processed video comprises:

Calling a video type recognition model to recognize the to-be-processed video, and obtain a recognition result;

If the video type indicated by the recognition result is a preset video type, it is determined that the video type of the video to be processed is the preset video type.
The method of claim 1, wherein the initial pixel accuracy set includes a first initial pixel accuracy set and a second initial pixel accuracy set, and the target pixel accuracy set includes a first target pixel accuracy set and a second initial pixel accuracy set. Target pixel accuracy collection,

The modification of each pixel precision value in the initial pixel precision set to obtain the target pixel precision set includes:

Acquiring an encoding mode used when encoding the to-be-processed video;

If the coding mode is the first type of coding mode, modify each pixel precision value in the first initial pixel precision set to obtain a first target pixel precision set;

If the coding mode is the second type of coding mode, each pixel precision value in the second initial pixel precision set is modified to obtain a second target pixel precision set.
The method according to claim 4, wherein the first type of coding mode includes any one of an inter-frame coding mode and an affine coding mode, and the second type of coding mode includes the inter coding mode. Mode and the other of the affine coding mode.
The method according to claim 1, wherein the encoding the to-be-processed video based on the target pixel accuracy set to obtain the encoded video comprises:

Adding identification information to the encoded video;

Send the encoded video to which the identification information is added to the decoding end, where the identification information is used to instruct the decoding end to decode the encoded video based on the identification information.
The method according to claim 1, wherein the encoding the to-be-processed video based on the target pixel accuracy set to obtain the encoded video comprises:

Set an index mark for each pixel precision included in the target pixel precision set;

Determining the target pixel accuracy in the target pixel accuracy set used when encoding the video to be processed, and the index identifier corresponding to the target pixel accuracy;

The index identifier is added to the encoded video, and the encoded video added with the index identifier is sent to a decoding end, where the index identifier is used to instruct the decoding end to decode the encoded video based on the index identifier.
A video processing method, characterized by comprising:

Receive encoded video;

When the coded video includes identification information, determining that the video type corresponding to the coded video is a preset video type;

Decoding the encoded video based on the target pixel accuracy set;

The target pixel accuracy set is obtained by modifying each pixel accuracy value in the initial pixel accuracy set.
The method according to claim 8, wherein the coded video further includes an index identifier, and the decoding processing of the coded video based on the target pixel precision set comprises:

From the target pixel accuracy set, determine the target pixel accuracy identified by the index identifier;

Performing decoding processing on the encoded video based on the target pixel accuracy.
A video processing device, characterized by comprising:

The determining unit is used to determine the video type of the acquired video to be processed;

A processing unit, configured to, if the determining unit determines that the video type of the video to be processed is a preset video type, modify each pixel precision value in the initial pixel precision set to obtain a target pixel precision set;

The processing unit is further configured to perform encoding processing on the to-be-processed video based on the target pixel accuracy set to obtain an encoded video.
A video processing device, characterized by comprising:

Receiving unit for receiving encoded video;

A processing unit, configured to determine that the video type corresponding to the coded video is a preset video type when the coded video includes identification information;

The processing unit is further configured to decode the encoded video based on the target pixel accuracy set;

The target pixel accuracy set is obtained by modifying each pixel accuracy value in the initial pixel accuracy set.
An encoding device, characterized in that it comprises a memory and a processor:

The memory is used to store program code;

The processor calls the program code, and when the program code is executed, is used to perform the following operations:

Determine the video type of the acquired video to be processed;

If the video type of the video to be processed is a preset video type, modify each pixel precision value in the initial pixel precision set to obtain a target pixel precision set;

Perform encoding processing on the to-be-processed video based on the target pixel accuracy set to obtain an encoded video.
The encoding device according to claim 12, wherein the processor performs the following operations when determining the acquired video type of the to-be-processed video:

Determine the hash value corresponding to the video to be processed;

If the hash value is not greater than the threshold, determining that the video type of the video to be processed is a preset video type;

If the hash value is greater than the threshold, it is determined that the video type of the video to be processed is not the preset video type.
The encoding device according to claim 12, wherein the processor performs the following operations when determining the acquired video type of the to-be-processed video:

Calling a video type recognition model to recognize the to-be-processed video, and obtain a recognition result;

If the video type indicated by the recognition result is a preset video type, it is determined that the video type of the video to be processed is the preset video type.
The encoding device of claim 12, wherein the initial pixel accuracy set includes a first initial pixel accuracy set and a second initial pixel accuracy set, and the target pixel accuracy set includes a first target pixel accuracy set and a second initial pixel accuracy set. Two target pixel accuracy sets,

When the processor modifies each pixel precision value in the initial pixel precision set to obtain the target pixel precision set, it performs the following operations:

Acquiring an encoding mode used when encoding the to-be-processed video;

If the coding mode is the first type of coding mode, modify each pixel precision value in the first initial pixel precision set to obtain a first target pixel precision set;

If the coding mode is the second type of coding mode, each pixel precision value in the second initial pixel precision set is modified to obtain a second target pixel precision set.
The encoding device according to claim 15, wherein the first type of encoding mode includes any one of an inter encoding mode and an affine encoding mode, and the second type of encoding mode includes the inter The other of the encoding mode and the affine encoding mode.
The encoding device according to claim 12, wherein the processor performs the following operations when encoding the video to be processed based on the target pixel accuracy set to obtain the encoded video:

Adding identification information to the encoded video;

Send the encoded video to which the identification information is added to the decoding end, where the identification information is used to instruct the decoding end to decode the encoded video based on the identification information.
The encoding device according to claim 12, wherein the processor performs the following operations when encoding the video to be processed based on the target pixel accuracy set to obtain the encoded video:

Set an index mark for each pixel precision included in the target pixel precision set;

Determining the target pixel accuracy in the target pixel accuracy set used when encoding the video to be processed, and the index identifier corresponding to the target pixel accuracy;

The index identifier is added to the encoded video, and the encoded video added with the index identifier is sent to a decoding end, where the index identifier is used to instruct the decoding end to decode the encoded video based on the index identifier.
A decoding device, characterized in that it comprises a memory and a processor:

The memory is used to store program code;

The processor calls the program code, and when the program code is executed, is used to perform the following operations:

Receive encoded video;

When the coded video includes identification information, determining that the video type corresponding to the coded video is a preset video type;

Decoding the encoded video based on the target pixel accuracy set;

The target pixel accuracy set is obtained by modifying each pixel accuracy value in the initial pixel accuracy set.
The decoding device according to claim 19, wherein the coded video further includes an index identifier, and the processor performs the following operations when decoding the coded video based on the target pixel accuracy set:

From the target pixel accuracy set, determine the target pixel accuracy identified by the index identifier;

Performing decoding processing on the encoded video based on the target pixel accuracy.
A computer-readable storage medium, wherein the computer-readable storage medium stores a first computer program, the first computer program includes first program instructions, and the first program instructions are executed by a processor When making the processor execute the video processing method according to any one of claims 1-7; or, the computer-readable storage medium stores a second computer program, and the second computer program includes a second program Instruction, the second program instruction when executed by the processor causes the processor to execute the video processing method according to claim 8 or 9.