CN116527914A - Decoding method and device suitable for space image - Google Patents

Decoding method and device suitable for space image Download PDF

Info

Publication number
CN116527914A
CN116527914A CN202310483604.5A CN202310483604A CN116527914A CN 116527914 A CN116527914 A CN 116527914A CN 202310483604 A CN202310483604 A CN 202310483604A CN 116527914 A CN116527914 A CN 116527914A
Authority
CN
China
Prior art keywords
target
picture
key frame
latitude
longitude
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310483604.5A
Other languages
Chinese (zh)
Inventor
张夏杰
魏伟
郭景昊
杜峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Jingdong Century Trading Co Ltd
Beijing Wodong Tianjun Information Technology Co Ltd
Original Assignee
Beijing Jingdong Century Trading Co Ltd
Beijing Wodong Tianjun Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Jingdong Century Trading Co Ltd, Beijing Wodong Tianjun Information Technology Co Ltd filed Critical Beijing Jingdong Century Trading Co Ltd
Priority to CN202310483604.5A priority Critical patent/CN116527914A/en
Publication of CN116527914A publication Critical patent/CN116527914A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/44Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/593Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial prediction techniques

Abstract

The application discloses a decoding method and device suitable for a spatial image. One embodiment of the method comprises the following steps: according to the obtained operation information, determining the target longitude and latitude corresponding to the target space picture expected by the user; determining the position information of a target space picture in a coding file and a target picture group to which the target space picture belongs according to the longitude and latitude of the target; determining a target key frame corresponding to the target picture group from a preloaded key frame sequence, wherein the key frame sequence comprises key frames related to the coding file; and decoding according to the target key frame and the position information to obtain the target space picture. The method and the device improve the space decoding efficiency on the basis of ensuring the flexibility of the space decoding process.

Description

Decoding method and device suitable for space image
Technical Field
The embodiment of the application relates to the technical field of computers, in particular to a decoding technology, and especially relates to a decoding method and device suitable for space images, a computer readable medium and electronic equipment.
Background
The spatial code stream is sequential in time, compared to the conventional timing-based video code stream, which is spatially sequential. One of the basic requirements of spatial code streams is the ability to decode spatially at the time of decoding, rather than in the order of storage in the code stream, which is the greatest distinction from time-sequential code streams. For convenience of storage and transmission, I frames (key frames) and P frames (predicted frames) are often used in encoding, so that frames with dependency relationships cause several times of time sequence decoding pressure in decoding, and the efficiency of space decoding is greatly reduced.
Disclosure of Invention
The embodiment of the application provides a decoding method and device suitable for a space image, a computer readable medium and electronic equipment.
In a first aspect, an embodiment of the present application provides a decoding method applicable to a spatial image, including: according to the obtained operation information, determining the target longitude and latitude corresponding to the target space picture expected by the user; determining the position information of a target space picture in a coding file and a target picture group to which the target space picture belongs according to the longitude and latitude of the target; determining a target key frame corresponding to the target picture group from a preloaded key frame sequence, wherein the key frame sequence comprises key frames related to the coding file; and decoding according to the target key frame and the position information to obtain the target space picture.
In some examples, determining the position information of the target spatial picture in the encoded file and the target picture group to which the target spatial picture belongs according to the target longitude and latitude includes: determining a target picture group identifier of a target picture group to which the target space picture belongs and a target picture identifier of the target space picture according to the target longitude and latitude and the key frame longitude and latitude of key frames in each picture group in the coding file; and determining the position information of the target space picture in the coding file and the target picture group to which the target space picture belongs according to the target picture group identifier and the target picture identifier.
In some examples, determining the target picture group identifier of the target picture group to which the target spatial picture belongs and the target picture identifier of the target spatial picture according to the target longitude and latitude and the key frame longitude and latitude of the key frame in each picture group in the encoded file includes: the picture group which belongs to the key frame and corresponds to the key frame with the longitude and latitude closest to the target longitude and latitude in the key frame longitude and latitude of the key frames in each picture group in the coding file is taken as the target picture group, and the target picture group identification is determined; and determining the target picture identification according to the offset between the target longitude and latitude and the key frame longitude and latitude corresponding to the key frame in the target picture group.
In some examples, determining the target key frame corresponding to the target picture group from the preloaded key frame sequence includes: and determining a target key frame corresponding to the target picture group from the key frame sequence according to the target picture group identification.
In some examples, before determining the target key frame corresponding to the target picture group from the preloaded key frame sequence, the method further includes: in the initialization process, the decoded key frames included in the encoded file are loaded to generate a key frame sequence.
In some examples, decoding the target spatial picture according to the target key frame and the position information includes: determining whether the target space picture is a key frame in the target picture group or a predicted frame in the target picture group according to the target picture identification; and in response to determining that the target space picture is a predicted frame in the target picture group, decoding the predicted frame corresponding to the target picture identifier according to the target key frame and the position information to obtain the target space picture.
In some examples, the decoding to obtain the target spatial picture according to the target key frame and the position information further includes: in response to determining that the target spatial picture is a key frame in the target group of pictures, the target key frame is taken as the target spatial picture.
In a second aspect, an embodiment of the present application provides a decoding apparatus applicable to a spatial image, including: the first determining unit is configured to determine the target longitude and latitude corresponding to the target space picture expected by the user according to the acquired operation information; the second determining unit is configured to determine the position information of the target space picture in the coding file and the target picture group to which the target space picture belongs according to the target longitude and latitude; the third determining unit is configured to determine a target key frame corresponding to the target picture group from the preloaded key frame sequence, wherein the key frame sequence comprises key frames related to the coding file; and the decoding unit is configured to decode to obtain a target space picture according to the target key frame and the position information.
In some examples, the second determining unit is further configured to: determining a target picture group identifier of a target picture group to which the target space picture belongs and a target picture identifier of the target space picture according to the target longitude and latitude and the key frame longitude and latitude of key frames in each picture group in the coding file; and determining the position information of the target space picture in the coding file and the target picture group to which the target space picture belongs according to the target picture group identifier and the target picture identifier.
In some examples, the second determining unit is further configured to: the picture group which belongs to the key frame and corresponds to the key frame with the longitude and latitude closest to the target longitude and latitude in the key frame longitude and latitude of the key frames in each picture group in the coding file is taken as the target picture group, and the target picture group identification is determined; and determining the target picture identification according to the offset between the target longitude and latitude and the key frame longitude and latitude corresponding to the key frame in the target picture group.
In some examples, the third determining unit is further configured to: and determining a target key frame corresponding to the target picture group from the key frame sequence according to the target picture group identification.
In some examples, the apparatus further comprises: a loading unit configured to: in the initialization process, the decoded key frames included in the encoded file are loaded to generate a key frame sequence.
In some examples, the decoding unit described above is further configured to: determining whether the target space picture is a key frame in the target picture group or a predicted frame in the target picture group according to the target picture identification; and in response to determining that the target space picture is a predicted frame in the target picture group, decoding the predicted frame corresponding to the target picture identifier according to the target key frame and the position information to obtain the target space picture.
In some examples, the decoding unit described above is further configured to: in response to determining that the target spatial picture is a key frame in the target group of pictures, the target key frame is taken as the target spatial picture.
In a third aspect, embodiments of the present application provide a computer readable medium having a computer program stored thereon, wherein the program, when executed by a processor, implements a method as described in any of the implementations of the first aspect.
In a fourth aspect, an embodiment of the present application provides an electronic device, including: one or more processors; a storage device having one or more programs stored thereon, which when executed by one or more processors, cause the one or more processors to implement the method as described in any of the implementations of the first aspect.
According to the decoding method and device suitable for the space image, the target longitude and latitude corresponding to the target space image expected by the user is determined according to the acquired operation information; determining the position information of a target space picture in a coding file and a target picture group to which the target space picture belongs according to the longitude and latitude of the target; determining a target key frame corresponding to the target picture group from a preloaded key frame sequence, wherein the key frame sequence comprises key frames related to the coding file; according to the target key frames and the position information, the target space picture is obtained through decoding, so that the decoding method suitable for the space picture is provided, in the decoding process, the target longitude and latitude corresponding to the target space picture expected by a user are mapped to corresponding positions in the encoded file, so that data of the corresponding positions are decoded, all the key frames in the encoded file are preloaded and used as external reference frames of predicted frames in the encoded file, time loss caused by repeated decoding of the key frames in the decoding process is avoided, and the space decoding efficiency is improved on the basis of ensuring the flexibility of the space decoding process.
Drawings
Other features, objects and advantages of the present application will become more apparent upon reading of the detailed description of non-limiting embodiments, made with reference to the following drawings, in which:
FIG. 1 is an exemplary system architecture diagram in which an embodiment of the present application may be applied;
FIG. 2 is a schematic diagram of the spatial layout of sampling points in the warp and weft sampling mode of the present application;
FIG. 3 is a schematic illustration of a spatial picture matrix of the present application;
FIG. 4 is a schematic diagram of a reference manner between a predicted frame and a key frame in a group of pictures of the present application
FIG. 5 is a flow chart of one embodiment of a decoding method suitable for spatial image according to the present application;
fig. 6 is a schematic diagram of an operation locus of a user in a space picture matrix according to the present embodiment;
FIGS. 7A-7C are diagrams of various reference frame management approaches according to the present embodiment;
fig. 8 is a schematic diagram of an application scenario of a decoding method applicable to a spatial image according to the present embodiment;
FIG. 9 is a flow chart of yet another embodiment of a decoding method applicable to spatial images according to the present application;
FIG. 10 is a block diagram of one embodiment of a decoding apparatus suitable for spatial image according to the present application;
FIG. 11 is a schematic diagram of a computer system suitable for use in implementing embodiments of the present application.
Detailed Description
The present application is described in further detail below with reference to the drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the invention and are not limiting of the invention. It should be noted that, for convenience of description, only the portions related to the present invention are shown in the drawings.
It should be noted that, in the case of no conflict, the embodiments and features in the embodiments may be combined with each other. The present application will be described in detail below with reference to the accompanying drawings in conjunction with embodiments.
It should be noted that, in the technical solution of the present disclosure, the related aspects of collecting, updating, analyzing, processing, using, transmitting, storing, etc. of the personal information of the user all conform to the rules of the related laws and regulations, and are used for legal purposes without violating the public order colloquial. Necessary measures are taken for the personal information of the user, illegal access to the personal information data of the user is prevented, and the personal information security, network security and national security of the user are maintained.
Fig. 1 illustrates an exemplary architecture 100 to which the present decoding methods and apparatus for spatial image may be applied.
As shown in fig. 1, a system architecture 100 may include terminal devices 101, 102, 103, a network 104, and a server 105. The communication connection between the terminal devices 101, 102, 103 constitutes a topology network, the network 104 being the medium for providing the communication link between the terminal devices 101, 102, 103 and the server 105. The network 104 may include various connection types, such as wired, wireless communication links, or fiber optic cables, among others.
The user may interact with the server 105 via the network 104 using the terminal devices 101, 102, 103 to receive or send messages or the like. The terminal devices 101, 102, 103 may be hardware devices or software supporting network connections for data interaction and data processing. When the terminal device 101, 102, 103 is hardware, it may be various electronic devices supporting network connection, information acquisition, interaction, display, processing, etc., including but not limited to smartphones, tablet computers, electronic book readers, laptop and desktop computers, etc. When the terminal devices 101, 102, 103 are software, they can be installed in the above-listed electronic devices. It may be implemented as a plurality of software or software modules, for example, for providing distributed services, or as a single software or software module. The present invention is not particularly limited herein.
The server 105 may be a server providing various services, for example, may be a background processing server decoding to obtain a target spatial picture desired by a user according to operation information of the terminal devices 101, 102, 103 and preloaded key frames. As an example, the server 105 may be a cloud server.
The server may be hardware or software. When the server is hardware, the server may be implemented as a distributed server cluster formed by a plurality of servers, or may be implemented as a single server. When the server is software, it may be implemented as a plurality of software or software modules (e.g., software or software modules for providing distributed services), or as a single software or software module. The present invention is not particularly limited herein.
It should be further noted that, the decoding method applicable to the spatial image provided by the embodiments of the present application may be executed by a server, may be executed by a terminal device, or may be executed by the server and the terminal device in cooperation with each other. Accordingly, the respective portions (for example, the respective units) included in the decoding apparatus suitable for the spatial image may be provided in the server, may be provided in the terminal device, or may be provided in the server and the terminal device, respectively.
It should be understood that the number of terminal devices, networks and servers in fig. 1 is merely illustrative. There may be any number of terminal devices, networks, and servers, as desired for implementation. When the electronic device on which the decoding method applicable to the spatial image is operated does not need to perform data transmission with other electronic devices, the system architecture may include only the electronic device (e.g., a server or a terminal device) on which the decoding method applicable to the spatial image is operated.
For a full description of a decoding method applicable to a spatial image, a flow of a corresponding embodiment of a decoding method applicable to a spatial image is given, comprising the steps of:
the first step, according to longitude and latitude corresponding to a plurality of sampling points under different space angles, a space picture matrix comprising a plurality of space pictures acquired by the plurality of sampling points is generated.
The space picture represents pictures of the target object obtained under different space angles. The target object may be various objects such as a person, an object, and the like.
As an example, the plurality of sampling points may be uniformly disposed at different spatial angles to take spatial pictures of the target object at different spatial angles, that is, a uniform sampling manner.
As yet another example, the plurality of sampling points may be unevenly disposed at different spatial angles in a manner that the number of longitude and latitude of the preset interval surrounds the target object, so as to take spatial pictures at different spatial angles of the target object, that is, the longitude and latitude sampling manner.
As shown in fig. 2, a schematic diagram 200 of the spatial layout of sampling points in a warp and weft sampling manner is shown. For the upper hemisphere of the target object, sampling is performed at intervals of 10 °, then there are 36 sampling points in the longitudinal direction and 9 sampling points in the latitudinal direction, and 324 (36×9) sampling points are in the upper hemisphere of the target object. The left sub-graph in fig. 2 is a top view of the spatial layout of the sampling points, and the right sub-graph is a front view of the spatial layout of the sampling points.
Longitude and latitude corresponding to the sampling points under different space angles are different, so that space pictures under different space angles are obtained through sampling. And arranging the space pictures under different space angles according to a certain arrangement sequence to obtain a space picture matrix. For each space picture, naming the space picture according to the longitude and latitude of the sampling point corresponding to the space picture; further, a plurality of spatial pictures are arranged according to the naming information of each spatial picture, and a spatial picture matrix is generated. For example, the longitude and latitude of the sampling point corresponding to the spatial picture is (10 ° ,20 ° ) The spatial picture may be named "10-20.Jpg".
In some optional implementations of this embodiment, the executing body may execute the first step by: and arranging a plurality of space pictures by taking the longitude of the sampling point corresponding to the space picture as the horizontal axis and taking the latitude of the sampling point corresponding to the space picture as the vertical axis to generate a space picture matrix.
Specifically, the longitude of the sampling point corresponding to the space picture is taken as the horizontal axis, the latitude of the sampling point corresponding to the space picture is taken as the vertical axis, and the plurality of space pictures are arranged in the order from the small longitude to the large latitude, so as to obtain the space picture matrix.
Continuing with the example of the sampling points shown in fig. 2, a spatial picture matrix is generated as shown in fig. 3. Wherein the latitude in the space picture matrix is in the range of 0 ° -80 ° Longitude range of 0 ° -350 °
In the implementation manner, based on the longitude and latitude of the sampling point, an arrangement manner which is more in line with the spatial correlation among a plurality of spatial pictures is provided, so that the generated spatial picture matrix is more beneficial to spatial coding, and the spatial coding efficiency is improved.
And a second step of dividing the space picture matrix by a sub-matrix with a preset size to obtain a plurality of picture groups.
As an example, the preset size may be a fixed size set in advance. The length and width of the sub-matrix may be the same or different. For example, the preset size is 3×3.
As yet another example, the preset size may be flexibly determined according to the density of the sampling points. When the sampling points are denser, a larger preset size can be set; when the sampling points are sparse, a smaller preset size can be set, that is, the size of the preset size is positively correlated with the density of the sampling points.
Dividing the space picture matrix by a sub-matrix with a preset size, wherein the number of the space pictures included in each picture group is the same as the number of elements included in the sub-matrix. With continued reference to fig. 3, the spatial picture matrix is divided by a sub-matrix of a preset size of 3×3, resulting in 36 groups of GOP1-GOP36, each including 9 spatial pictures.
And thirdly, coding the space pictures in the space picture matrix according to the key frames and the predicted frames respectively included in the plurality of picture groups to generate a coding file.
In compression encoding, each frame of spatial picture represents a still image. While actual compression is performed, various compression algorithms are employed to reduce the data capacity, with IPB frames being the most common one. The I frames in the IPB frames are also called key frames, intra-coded frames. The key frame is typically the first frame of each group of pictures, and is moderately compressed to serve as a reference point for random access to generate a still image. The key frame can be regarded as a compressed product of an image, and the compression can remove redundant information of the video. P frames, also called predicted frames, forward predictive coded frames. And removing redundant information which is the same as the compressed data corresponding to the key frames in the picture group from the compressed data corresponding to the predicted frames to obtain the encoded data corresponding to the predicted frames. The predicted frame represents a difference between the predicted frame and the corresponding key frame, and when decoding, it is necessary to refer to the corresponding key frame and the decoded data corresponding to the predicted frame, and generate a spatial picture corresponding to the predicted frame.
In this embodiment, a determination manner may be preset to determine a key frame and a predicted frame in a picture group; and further, encoding the spatial picture in the spatial picture matrix according to the key frame and the predicted frame respectively included in the plurality of picture groups, and generating an encoded file.
For example, for each of the plurality of picture groups, a spatial picture corresponding to a sampling point with the smallest longitude and latitude in the picture group is used as a key frame, and the rest of the spatial pictures in the picture group are used as prediction frames, so that the spatial pictures in the spatial picture matrix are encoded, and an encoded file is generated.
In some optional implementations of this embodiment, the executing body may execute the third step by:
(1) And arranging the picture groups corresponding to the sampling points with the same latitude in the plurality of picture groups according to the order of the sampling points corresponding to the picture groups from small longitude to large longitude, and generating a plurality of picture group subsequences.
With continued reference to fig. 3, the plurality of picture group sub-sequences includes a first picture group sub-sequence, a second picture group sub-sequence, and a third picture group sub-sequence. Wherein the first group of pictures sub-sequence is "GOP 1- > GOP 2- > … … - > GOP12", the second group of pictures sub-sequence is "GOP 13- > GOP 14- > … … - > GOP24", and the third group of pictures sub-sequence is "GOP 25- > GOP 26- > … … - > GOP36".
(2) And arranging a plurality of picture group subsequences according to the order of the latitudes of sampling points corresponding to the picture groups from small to large, determining picture group identifiers corresponding to the picture groups, and generating a picture group sequence.
With continued reference to fig. 3, the group of pictures sequence is "GOP 1- > GOP 2- > GOP3 … … - > GOP 35- > GOP36"
(3) And arranging the space pictures in each picture group in the picture group sequence according to the sequence of the key frame and the predicted frame, determining the picture identification of the space picture in each picture group in the plurality of picture groups, and generating the space picture sequence.
Specifically, on the basis of the determined picture group sequences, for the spatial pictures in each picture group, the picture sequences in the picture groups are determined according to the sequence of the key frames and the predicted frames, so that the picture identification of the spatial pictures in each picture group is determined according to the picture sequences corresponding to the picture groups, and finally the spatial picture sequences are obtained.
For a plurality of predicted frames in each picture group, an arrangement order of the plurality of predicted frames may be determined in a preset determination manner. For example, a plurality of prediction frames are arranged in the order of the latitude from small to large and the longitude from small to large of the sampling point corresponding to the spatial picture.
With continued reference to fig. 3, the spatial picture sequence is "0 >1 >2 >3 … … >322 >323". The spatial picture sequence includes a plurality of picture sequences such as "0- (1) - (2) - (3- (… …) - (8) - (9) - (10) - (11) - (… …) - (17)". Taking the picture sequence of "0- (1- (2) -3- (… …) -8" as an example, "0" is the picture identification of the key frame in the picture sequence, and "1-8" is the picture identification of the predicted frame in the picture sequence.
(4) And encoding the spatial picture sequence according to the key frames and the predicted frames included in each of the plurality of picture groups to generate an encoded file.
In the implementation manner, the spatial picture matrix is firstly arranged to obtain the spatial picture sequence, and then coding is carried out according to the arrangement sequence of the spatial pictures in the spatial picture sequence and the key frames and the predicted frames in each picture group, so that the efficiency and the accuracy of the coding process are further improved.
In some optional implementations of this embodiment, the executing body may execute the step (3) by: and arranging the space pictures in each picture group in the picture group sequence according to the sequence of the first key frame and the second prediction frame by taking the space picture in the central position of the sub-matrix corresponding to the picture group as a key frame and taking the space picture adjacent to the key frame as a prediction frame, and determining the picture identification of the space picture in each picture group in the plurality of picture groups to generate the space picture sequence.
Continuing with the 3×3 sub-matrix shown in fig. 3 as an example, for 9 space pictures in the divided picture group, 8 space pictures around the center position have higher relevance to the space picture at the center position, because all the 8 space pictures around can be regarded as 10 from the space picture at the center position in the longitudinal direction and/or the latitudinal direction ° Obtained by a variation of (a).
The spatial picture at the central position of the sub-matrix corresponding to the picture group is used as a key frame, and the spatial picture adjacent to the key frame is used as a predicted frame, so that the predicted frame and the key frame have direct and strong relevance, direct reference of the predicted frame to the key frame in the encoding process of the spatial picture is facilitated, the encoding efficiency is improved, and the data volume of the encoded file is reduced.
In some optional implementations of this embodiment, the executing body may execute the step (4) as follows: and for each of the plurality of picture groups, encoding the spatial picture sequence by adopting a reference mode that the predicted frame in the picture group only references the key frame in the picture group, and generating an encoding file.
With continued reference to fig. 4, a schematic diagram 400 of a reference pattern between predicted frames and key frames in a group of pictures is shown. For each predicted frame in a group of pictures, the key frames in the group of pictures are uniquely referenced for encoding to obtain an encoded file.
In the implementation mode, the spatial picture sequence is encoded by adopting the reference mode of the predicted frame in the picture group and the key frame in the unique reference picture group, so that the complexity of the relation between the predicted frame and the key frame in the encoded file is reduced, and the determination speed and the decoding efficiency of data in the decoding process are improved.
To more fully describe the decoding method applicable to the spatial image, a corresponding flow of a further embodiment of the decoding method applicable to the spatial image is provided, comprising the following steps:
the first step is to arrange a plurality of space pictures by taking longitude of sampling points corresponding to the space pictures as a horizontal axis and latitude of sampling points corresponding to the space pictures as a vertical axis, so as to generate a space picture matrix.
And a second step of dividing the space picture matrix by a sub-matrix with a preset size to obtain a plurality of picture groups.
And thirdly, arranging the picture groups corresponding to the sampling points with the same latitude in the plurality of picture groups according to the order of the longitude of the sampling points corresponding to the picture groups from small to large, and generating a plurality of picture group subsequences.
And a fourth step of arranging a plurality of picture group subsequences according to the order of the latitudes of the sampling points corresponding to the picture groups from small to large, determining the picture group identifications corresponding to the picture groups, and generating a picture group sequence.
And fifthly, arranging the space pictures in each picture group in the picture group sequence according to the sequence of the first key frame and the second predicted frame by taking the space picture in the central position of the sub-matrix corresponding to the picture group as a key frame and taking the space picture adjacent to the key frame as a predicted frame, and determining the picture identification of the space picture in each picture group in the plurality of picture groups to generate the space picture sequence.
And a sixth step of encoding the spatial picture sequence by adopting a reference mode that the predicted frame in the picture group only references the key frame in the picture group for each picture group in the plurality of picture groups, and generating an encoding file.
With continued reference to fig. 5, a flow 500 of one embodiment of a decoding method suitable for spatial image is shown, comprising the steps of:
step 501, determining the target longitude and latitude corresponding to the target space picture expected by the user according to the acquired operation information.
In this embodiment, an execution body (for example, a terminal device or a server in fig. 1) of the decoding method applicable to the spatial image determines, according to the acquired operation information, a target longitude and latitude corresponding to a target spatial image desired by a user.
The operation information may be an action instruction corresponding to a sliding operation of the user or a voice instruction corresponding to voice information. With continued reference to fig. 6, an operation track of the user in the spatial picture matrix is shown, along with the operation track, the executing body aims to decode the spatial picture data at the corresponding position, so as to obtain and display the target spatial picture expected by the user.
As an example, the executing body may pre-establish a correspondence between an operation position of the user on the screen and a target longitude and latitude corresponding to a target space picture expected by the user, so as to determine, in real time, the target longitude and latitude corresponding to the target space picture expected by the user in a process of executing the operation action by the user. The longitude and latitude of the target corresponding to the target space picture is the longitude and latitude corresponding to the sampling point corresponding to the target space picture.
Step 502, determining the position information of the target space picture in the encoding file and the target picture group to which the target space picture belongs according to the target longitude and latitude.
In this embodiment, the executing body may determine, according to the latitude and longitude of the target, the position information of the target spatial picture in the encoded file and the target picture group to which the target spatial picture belongs.
As an example, the execution subject may previously establish a correspondence between longitude and latitude corresponding to each spatial picture referred to in the encoded file and position information of the spatial picture in the encoded file. Therefore, the position information of the target space picture in the coding file is determined according to the longitude and latitude of the target; and determining the key frame in the target picture group to which the target space picture belongs and the key frame longitude and latitude of the key frame according to the position information, and further determining the picture group to which the key frame belongs.
In some optional implementations of this embodiment, the executing body may execute the step 502 as follows:
first, determining a target picture group identifier of a target picture group to which a target space picture belongs and a target picture identifier of the target space picture according to the target longitude and latitude and the key frame longitude and latitude of key frames in each picture group in the coding file.
As an example, the executing body may determine a key frame longitude and latitude of a key frame in each picture group in the encoded file, and generate a key frame longitude and latitude set; further, comparing the target longitude and latitude with the longitude and latitude of the key frames in the longitude and latitude set of the key frames, and determining a target picture group to which the target space picture belongs according to a comparison result between the target longitude and latitude and the longitude and latitude of each key frame in the longitude and latitude set of the key frames; further, a target picture group identification of the target picture group and a target picture identification of the target spatial picture are determined.
With continued reference to FIG. 3, the corresponding keyframe longitude and latitude set is
And secondly, determining the position information of the target space picture in the coding file and the target picture group to which the target space picture belongs according to the target picture group identifier and the target picture identifier.
In the process of obtaining an encoded file through encoding, a spatial picture identifier of a spatial picture and a picture group identifier of a picture group to which the spatial picture belongs are generally encoded. After the target picture group identifier and the target picture identifier are determined, the position information of the target space picture in the encoded file and the target picture group to which the target space picture belongs can be determined.
In the implementation manner, the specific implementation manner for determining the position information of the target space picture in the coding file and the target picture group to which the target space picture belongs is provided, and the determination efficiency and accuracy of the information determination process are improved.
In some optional implementations of this embodiment, the executing body may execute the first step by: firstly, taking a picture group which belongs to a key frame and corresponds to the key frame with the longitude and latitude closest to the target longitude and latitude as a target picture group in the key frame longitudes and latitudes of key frames in each picture group in an encoding file, and determining a target picture group identifier; and then, determining the target picture identification according to the offset between the target longitude and latitude and the longitude and latitude of the key frame corresponding to the key frame in the target picture group.
As an example, the target longitude and latitude is (60, 20), which is closest to the distance between the key frame longitude and latitude (70, 10) in the key frame longitude and latitude set, the group of pictures GOP3 to which the key frame corresponding to the key frame longitude and latitude (70, 10) belongs is taken as the target group of pictures, and the target group of pictures is determined to be 3.
Specifically, the above-mentioned determination process of the target picture group identifier may be represented by the following formula:
GOPid=minl2_arg((longitude,latitude)-GOPs)
the GOPid represents a target picture group identifier, the minl2_arg represents that the minimum distance requirement is met, the target longitude and latitude is represented by (longitude, latitude), and the key frame longitude and latitude of the key frames in each picture group is represented by GOPs.
Then, the offset between the target longitude and latitude and the key frame longitude and latitude corresponding to the key frame in the target picture group is determined to be (-10, 10), and the target picture identification is determined to be 8.
Continuing with the 3×3 submatrix shown in fig. 3 as an example, the offset between the longitude and latitude of the spatial picture in each picture group and the longitude and latitude of the key frame corresponding to the key frame is:
reordering the offset sets according to the arrangement sequence of the space pictures in the picture sequence corresponding to the picture group in the coding process, and obtaining an ordered offset sequence as follows:
OFFSETs=
{(0,0),(-10,0),(-10,-10),(0,-10),(10,-10),(10,0),(10,10),(0,10),(-10,10)}
And according to the offset sequence, determining a target picture identifier corresponding to the target space picture.
Specifically, the above-mentioned determination process of the target picture identifier may be represented by the following formula:
OFFSETid=
minl2_arg(((longtitude,latitude)-GOPs[GOPid])-OFFSETs)
the offset represents a target picture identifier, minl2_arg represents that minimum distance requirements are met, (longtitude, latitudes) represents target longitude and latitude, the gop [ gipid ] represents the key frame longitude and latitude of a key frame corresponding to the target picture group identifier gipid, and the offset represents an offset between the longitude and latitude of a space picture in the picture group and the longitude and latitude of the key frame corresponding to the key frame.
In the implementation manner, the target picture group identification of the target picture group is determined according to the comparison result of the key frame longitude and latitude of each key frame related in the coding file and the target longitude and latitude of the target space picture expected by the user, so that the target picture identification is determined, and the universality and the accuracy of the identification information determination process are improved.
Step 503, determining a target key frame corresponding to the target picture group from the preloaded key frame sequence.
In this embodiment, the execution body may determine a target key frame corresponding to the target picture group from the preloaded key frame sequence. Wherein the key frame sequence includes key frames involved in the encoded file.
As an example, for a preloaded key frame sequence, a correspondence between each key frame in the key frame sequence and the group of pictures to which it belongs is determined during the loading process. And further, after the target picture group is determined, determining a target key frame corresponding to the target picture group from the key frame sequence according to the corresponding relation.
In some optional implementations of this embodiment, the executing body may execute the step 503 as follows: and determining a target key frame corresponding to the target picture group from the key frame sequence according to the target picture group identification.
As an example, for a preloaded key frame sequence, a correspondence between each key frame in the key frame sequence and the picture group identity of the belonging picture group is determined during the loading process. And matching the target picture group identifier with the picture group identifiers in the corresponding relation, so that the key frame corresponding to the picture group identifier matched with the target picture group identifier is determined as the target key frame.
In the implementation manner, based on the target picture group identification, the target key frame corresponding to the target picture group can be rapidly determined from the key frame sequence, so that the determination efficiency of the target key frame is improved.
In some optional implementations of this embodiment, before the step 503, the executing body may further perform the following operations: in the initialization process, the decoded key frames included in the encoded file are loaded to generate a key frame sequence.
As an example, in an initialization process of decoding logic for an encoded file, location information of key frames in each picture group is determined from the encoded file; and decoding according to the position information to obtain all the key frames, thereby caching the obtained key frames and generating a key frame sequence.
In the implementation manner, the key frames included in the encoded file are decoded and preloaded in the initialization process, so that the key frames referenced by the data to be decoded are directly determined from the key frame sequence in the subsequent decoding process, and the decoding efficiency is improved.
And step 504, decoding to obtain a target space picture according to the target key frame and the position information.
In this embodiment, the execution body may decode to obtain the target spatial image according to the target key frame and the position information.
After the position information of the target data to be decoded is determined, the target key frame can be referred to, and the coded data at the corresponding position in the coded file is decoded, so that the target space picture is obtained and displayed.
In some optional implementations of this embodiment, the executing body may execute the step 504 as follows:
first, according to the target picture identification, it is determined that the target spatial picture is a key frame in the target picture group or a predicted frame in the target picture group.
As an example, when it is determined that the target picture identification is the same as the key frame identification of the key frames in the target picture group, it is determined that the target spatial picture is a key frame in the target picture group; when the target picture identification is determined to be the same as the predicted frame identification of the predicted frames in the target picture group, the target spatial picture is determined to be the predicted frame in the target picture group.
Second, in response to determining that the target spatial picture is a predicted frame in the target picture group, decoding the predicted frame corresponding to the target picture identifier according to the target key frame and the position information to obtain the target spatial picture.
When the target spatial picture is a predicted frame in the target picture group, the predicted frame refers to a key frame in the picture group, so that the predicted frame at the position represented by the position information needs to be decoded by referring to the target key frame to obtain the target spatial picture.
In some optional implementations of this embodiment, the executing body may further execute the step 504 as follows: in response to determining that the target spatial picture is a key frame in the target group of pictures, the target key frame is taken as the target spatial picture.
When the target spatial picture is a key frame in the target picture group, the decoded target key frame in the key frame sequence is directly used as the target spatial picture, and the decoding process is not needed to be executed.
In view of the spatial decoding process, B frames are not used in the encoding stage, so B frames are generally not included in the encoded file. However, if only I frames are used in the encoding phase, the compression efficiency is too low to facilitate storage and transmission. Therefore, I-frames and P-frames are often used in encoding files.
Not only can the frame type affect the decoding efficiency, but the reference frame management approach also affects significantly the decoding efficiency. Taking the following three reference frame management as an example, fig. 7A is the most common management method, i.e., the previous I frame or P frame within the P frame reference picture group. At this time, decoding a P frame requires first decoding all I frames and P frames on which the P frame depends. Then, the number of decoded frames for each display frame in the group of pictures in fig. 7A is {1,2,3,4,5,6,7,8,9}, which on average shows that one frame needs to be decoded 45/9=5 frames.
In fig. 7B, although there is only one reference frame per P frame in the group of pictures, its reference frames depend on other frames as well, so the later reference frames actually depend more. For fig. 7B, the number of frames to be decoded per frame is shown as {1,2,3,4,5,6,7,8,9}, and 45/9=5 frames are shown on average per frame.
For fig. 7C, all predicted frames in the group of pictures refer to I frames only, then the number of frames to decode per frame is shown to be {1,2,2,2,2,2,2,2,2}, and on average 17/9 = 1.889 frames per frame shown to be decoded.
In the above embodiment, since the key frames in the picture group are preloaded, the number of frames to be decoded of each frame is {0,1,1,1,1,1,1,1,1}, and then an average of 8/9=0.889 frames to be decoded of one frame is shown, so that the decoding efficiency in the above embodiment is far higher than that of other reference modes.
With continued reference to fig. 8, fig. 8 is a schematic diagram 800 of an application scenario of the decoding method for spatial image according to the present embodiment. In the application scenario of fig. 8, a server obtains in advance an encoded file obtained by encoding spatial pictures under different spatial angles of an object to be displayed. In the process of displaying an object to be displayed by a user, firstly, determining a target longitude and latitude 801 corresponding to a target space picture expected by the user according to the acquired operation information of the user, wherein the determined target longitude and latitude 801 is (60, 20); then, according to the target longitude and latitude 801, determining the position information of the target space picture in the coding file and a target picture group to which the target space picture belongs; then, determining a target key frame corresponding to the target picture group from a preloaded key frame sequence 802, wherein the key frame sequence comprises key frames related to the encoded file; and finally, decoding to obtain the target space picture according to the target key frame and the position information.
According to the method provided by the embodiment of the application, the target longitude and latitude corresponding to the target space picture expected by the user is determined according to the acquired operation information; determining the position information of a target space picture in a coding file and a target picture group to which the target space picture belongs according to the longitude and latitude of the target; determining a target key frame corresponding to the target picture group from a preloaded key frame sequence, wherein the key frame sequence comprises key frames related to the coding file; according to the target key frames and the position information, the target space picture is obtained through decoding, so that the decoding method suitable for the space picture is provided, in the decoding process, the target longitude and latitude corresponding to the target space picture expected by a user are mapped to corresponding positions in the encoded file, so that data of the corresponding positions are decoded, all the key frames in the encoded file are preloaded and used as external reference frames of predicted frames in the encoded file, time loss caused by repeated decoding of the key frames in the decoding process is avoided, and the space decoding efficiency is improved on the basis of ensuring the flexibility of the space decoding process.
With continued reference to fig. 9, there is shown a schematic flow 900 of a further embodiment of a decoding method applicable to spatial images according to the present application, comprising the steps of:
Step 901, in the initialization process, the decoded key frames included in the encoded file are loaded to generate a key frame sequence.
Step 902, determining the target longitude and latitude corresponding to the target space picture expected by the user according to the acquired operation information.
Step 903, the picture group to which the key frame corresponding to the key frame longitude and latitude closest to the target longitude and latitude belongs from the key frame longitudes and latitudes of the key frames in each picture group in the encoded file is used as the target picture group, and the target picture group identifier is determined.
Step 904, determining the target picture identification according to the offset between the target longitude and latitude and the longitude and latitude of the key frame corresponding to the key frame in the target picture group.
In step 905, the position information of the target spatial picture in the encoded file and the target picture group to which the target spatial picture belongs are determined according to the target picture group identifier and the target picture identifier.
Step 906, determining a target key frame corresponding to the target picture group from the key frame sequence according to the target picture group identifier.
In step 907, it is determined that the target spatial picture is a key frame in the target picture group or a predicted frame in the target picture group according to the target picture identification.
In step 908, in response to determining that the target spatial picture is a predicted frame in the target picture group, the predicted frame corresponding to the target picture identifier is decoded according to the target key frame and the position information, so as to obtain the target spatial picture.
In response to determining that the target spatial picture is a key frame in the target group of pictures, step 909, the target key frame is taken as the target spatial picture.
As can be seen from this embodiment, compared with the embodiment corresponding to fig. 6, the process 900 of the decoding method applicable to the spatial image in this embodiment specifically illustrates the preloading process of the key frame sequence, the position information of the target spatial image in the encoded file and the information determining process of the target image group to which the target spatial image belongs, and the decoding process based on the key frame sequence, so that the spatial decoding efficiency is further improved on the basis of ensuring the flexibility of the spatial decoding process.
With continued reference to fig. 10, as an implementation of the method shown in the foregoing figures, the present application provides an embodiment of a decoding apparatus suitable for spatial image, where the embodiment of the apparatus corresponds to the embodiment of the method shown in fig. 6, and the apparatus may be specifically applied to various electronic devices.
As shown in fig. 10, the decoding apparatus applied to a spatial image includes: a first determining unit 1001 configured to determine, according to the acquired operation information, a target longitude and latitude corresponding to a target space picture desired by a user; a second determining unit 1002 configured to determine, according to the target longitude and latitude, position information of the target spatial picture in the encoded file and a target picture group to which the target spatial picture belongs; a third determining unit 1003, configured to determine a target key frame corresponding to the target picture group from the preloaded key frame sequence, where the key frame sequence includes key frames involved in the encoded file; and a decoding unit 1004 configured to decode the target spatial picture according to the target key frame and the position information.
In some examples, the second determining unit 1002 is further configured to: determining a target picture group identifier of a target picture group to which the target space picture belongs and a target picture identifier of the target space picture according to the target longitude and latitude and the key frame longitude and latitude of key frames in each picture group in the coding file; and determining the position information of the target space picture in the coding file and the target picture group to which the target space picture belongs according to the target picture group identifier and the target picture identifier.
In some examples, the second determining unit 1002 is further configured to: the picture group which belongs to the key frame and corresponds to the key frame with the longitude and latitude closest to the target longitude and latitude in the key frame longitude and latitude of the key frames in each picture group in the coding file is taken as the target picture group, and the target picture group identification is determined; and determining the target picture identification according to the offset between the target longitude and latitude and the key frame longitude and latitude corresponding to the key frame in the target picture group.
In some examples, the third determining unit 1003 is further configured to: and determining a target key frame corresponding to the target picture group from the key frame sequence according to the target picture group identification.
In some examples, the apparatus further comprises: a loading unit (not shown in the figure) configured to: in the initialization process, the decoded key frames included in the encoded file are loaded to generate a key frame sequence.
In some examples, the decoding unit 1004 described above is further configured to: determining whether the target space picture is a key frame in the target picture group or a predicted frame in the target picture group according to the target picture identification; and in response to determining that the target space picture is a predicted frame in the target picture group, decoding the predicted frame corresponding to the target picture identifier according to the target key frame and the position information to obtain the target space picture.
In some examples, the decoding unit 1004 described above is further configured to: in response to determining that the target spatial picture is a key frame in the target group of pictures, the target key frame is taken as the target spatial picture.
In this embodiment, a first determining unit in a decoding device applicable to a spatial image determines, according to the acquired operation information, a target longitude and latitude corresponding to a target spatial image desired by a user; the second determining unit determines the position information of the target space picture in the encoding file and the target picture group to which the target space picture belongs according to the target longitude and latitude; the third determining unit determines a target key frame corresponding to the target picture group from a preloaded key frame sequence, wherein the key frame sequence comprises key frames related to the coding file; the decoding unit decodes the target space picture according to the target key frame and the position information, thereby providing a decoding device suitable for the space picture, mapping the target longitude and latitude corresponding to the target space picture expected by a user to the corresponding position in the encoded file in the decoding process, so as to decode the data of the corresponding position, preloading all key frames in the encoded file as external reference frames of predicted frames in the encoded file, avoiding time loss caused by repeated decoding of the key frames in the decoding process, and improving the space decoding efficiency on the basis of ensuring the flexibility of the space decoding process.
Referring now to FIG. 11, there is illustrated a schematic diagram of a computer system 1100 suitable for use in implementing the apparatus of embodiments of the present application (e.g., apparatus 101, 102, 103, 105 illustrated in FIG. 1). The apparatus shown in fig. 11 is merely an example, and should not be construed as limiting the functionality and scope of use of the embodiments herein.
As shown in fig. 11, the computer system 1100 includes a processor (e.g., CPU, central processing unit) 1101 that can perform various appropriate actions and processes according to a program stored in a Read Only Memory (ROM) 1102 or a program loaded from a storage section 1108 into a Random Access Memory (RAM) 1103. In the RAM1103, various programs and data required for the operation of the system 1100 are also stored. The processor 1101, ROM1102, and RAM1103 are connected to each other by a bus 1104. An input/output (I/O) interface 1105 is also connected to bus 1104.
The following components are connected to the I/O interface 1105: an input section 1106 including a keyboard, a mouse, and the like; an output portion 1107 including a Cathode Ray Tube (CRT), a Liquid Crystal Display (LCD), and the like, a speaker, and the like; a storage section 1108 including a hard disk or the like; and a communication section 1109 including a network interface card such as a LAN card, a modem, and the like. The communication section 1109 performs communication processing via a network such as the internet. The drive 1110 is also connected to the I/O interface 1105 as needed. Removable media 1111, such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like, is installed as needed in drive 1110, so that a computer program read therefrom is installed as needed in storage section 1108.
In particular, according to embodiments of the present application, the processes described above with reference to flowcharts may be implemented as computer software programs. For example, embodiments of the present application include a computer program product comprising a computer program embodied on a computer readable medium, the computer program comprising program code for performing the method shown in the flowcharts. In such an embodiment, the computer program can be downloaded and installed from a network via the communication portion 1109, and/or installed from the removable media 1111. The above-described functions defined in the methods of the present application are performed when the computer program is executed by the processor 1101.
It should be noted that the computer readable medium of the present application may be a computer readable signal medium or a computer readable storage medium, or any combination of the two. The computer readable storage medium can be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or a combination of any of the foregoing. More specific examples of the computer-readable storage medium may include, but are not limited to: an electrical connection having one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing. In the context of this document, a computer readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. In the present application, however, a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, with computer-readable program code embodied therein. Such a propagated data signal may take any of a variety of forms, including, but not limited to, electro-magnetic, optical, or any suitable combination of the foregoing. A computer readable signal medium may also be any computer readable medium that is not a computer readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a computer readable medium may be transmitted using any appropriate medium, including but not limited to: wireless, wire, fiber optic cable, RF, etc., or any suitable combination of the foregoing.
Computer program code for carrying out operations of the present application may be written in one or more programming languages, including an object oriented programming language such as Java, smalltalk, C ++ and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the client computer, partly on the client computer, as a stand-alone software package, partly on the client computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the client computer through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider).
The flowcharts and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of apparatus, methods and computer program products according to various embodiments of the present application. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The units involved in the embodiments of the present application may be implemented by software, or may be implemented by hardware. The described units may also be provided in a processor, for example, described as: a processor includes a first determination unit, a second determination unit, a third determination unit, and a decoding unit. The names of these units do not in some cases limit the unit itself, and for example, the third determining unit may also be described as "a unit for determining a target key frame corresponding to the target group of pictures from the preloaded key frame sequence".
As another aspect, the present application also provides a computer-readable medium that may be contained in the apparatus described in the above embodiments; or may be present alone without being fitted into the device. The computer readable medium carries one or more programs which, when executed by the apparatus, cause the computer device to: according to the obtained operation information, determining the target longitude and latitude corresponding to the target space picture expected by the user; determining the position information of a target space picture in a coding file and a target picture group to which the target space picture belongs according to the longitude and latitude of the target; determining a target key frame corresponding to the target picture group from a preloaded key frame sequence, wherein the key frame sequence comprises key frames related to the coding file; and decoding according to the target key frame and the position information to obtain the target space picture.
The foregoing description is only of the preferred embodiments of the present application and is presented as a description of the principles of the technology being utilized. It will be appreciated by persons skilled in the art that the scope of the invention referred to in this application is not limited to the specific combinations of features described above, but it is intended to cover other embodiments in which any combination of features described above or equivalents thereof is possible without departing from the spirit of the invention. Such as the above-described features and technical features having similar functions (but not limited to) disclosed in the present application are replaced with each other.

Claims (10)

1. A decoding method applicable to spatial images, comprising:
according to the obtained operation information, determining the target longitude and latitude corresponding to the target space picture expected by the user;
determining position information of the target space picture in a coding file and a target picture group to which the target space picture belongs according to the target longitude and latitude;
determining a target key frame corresponding to the target picture group from a preloaded key frame sequence, wherein the key frame sequence comprises key frames related to the coding file;
and decoding to obtain the target space picture according to the target key frame and the position information.
2. The method of claim 1, wherein the determining, according to the target longitude and latitude, the location information of the target spatial picture in the encoding file and the target picture group to which the target spatial picture belongs includes:
determining a target picture group identifier of a target picture group to which the target space picture belongs and a target picture identifier of the target space picture according to the target longitude and latitude and the key frame longitude and latitude of a key frame in each picture group in the coding file;
and determining the position information of the target space picture in the coding file and the target picture group to which the target space picture belongs according to the target picture group identifier and the target picture identifier.
3. The method of claim 2, wherein the determining the target picture group identifier of the target picture group to which the target spatial picture belongs and the target picture identifier of the target spatial picture according to the target longitude and latitude and the key frame longitude and latitude of the key frame in each picture group in the encoded file includes:
the picture group which belongs to the key frame corresponding to the key frame longitude and latitude closest to the target longitude and latitude in the key frame longitude and latitude in each picture group in the coding file is used as the target picture group, and the target picture group identification is determined;
And determining the target picture identification according to the offset between the target longitude and latitude and the longitude and latitude of the key frame corresponding to the key frame in the target picture group.
4. A method according to claim 2 or 3, wherein the determining, from the preloaded key frame sequence, a target key frame corresponding to the target group of pictures comprises:
and determining a target key frame corresponding to the target picture group from the key frame sequence according to the target picture group identifier.
5. The method of claim 1, wherein prior to the determining the target key frame corresponding to the target group of pictures from the preloaded sequence of key frames, further comprising:
and in the initialization process, loading the decoded key frames included in the coded file, and generating the key frame sequence.
6. The method of claim 1, wherein the decoding the target spatial picture according to the target key frame and the location information comprises:
determining whether the target space picture is a key frame in the target picture group or a predicted frame in the target picture group according to the target picture identification;
and in response to determining that the target space picture is a predicted frame in the target picture group, decoding the predicted frame corresponding to the target picture identifier according to the target key frame and the position information to obtain the target space picture.
7. The method of claim 6, wherein the decoding the target spatial picture according to the target key frame and the location information further comprises:
in response to determining that the target spatial picture is a key frame in the target group of pictures, the target key frame is taken as the target spatial picture.
8. A decoding device suitable for spatial image, comprising:
the first determining unit is configured to determine the target longitude and latitude corresponding to the target space picture expected by the user according to the acquired operation information;
the second determining unit is configured to determine position information of the target space picture in the encoding file and a target picture group to which the target space picture belongs according to the target longitude and latitude;
a third determining unit, configured to determine a target key frame corresponding to the target picture group from a preloaded key frame sequence, where the key frame sequence includes key frames involved in the encoded file;
and the decoding unit is configured to decode the target space picture according to the target key frame and the position information.
9. A computer readable medium having stored thereon a computer program, wherein the program when executed by a processor implements the method of any of claims 1-7.
10. An electronic device, comprising:
one or more processors;
a storage device having one or more programs stored thereon,
when executed by the one or more processors, causes the one or more processors to implement the method of any of claims 1-7.
CN202310483604.5A 2023-04-28 2023-04-28 Decoding method and device suitable for space image Pending CN116527914A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310483604.5A CN116527914A (en) 2023-04-28 2023-04-28 Decoding method and device suitable for space image

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310483604.5A CN116527914A (en) 2023-04-28 2023-04-28 Decoding method and device suitable for space image

Publications (1)

Publication Number Publication Date
CN116527914A true CN116527914A (en) 2023-08-01

Family

ID=87393654

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310483604.5A Pending CN116527914A (en) 2023-04-28 2023-04-28 Decoding method and device suitable for space image

Country Status (1)

Country Link
CN (1) CN116527914A (en)

Similar Documents

Publication Publication Date Title
US9961398B2 (en) Method and device for switching video streams
CN110321958B (en) Training method of neural network model and video similarity determination method
US8787460B1 (en) Method and apparatus for motion vector estimation for an image sequence
CN108965907B (en) Method, device and system for playing video
CN110413812B (en) Neural network model training method and device, electronic equipment and storage medium
US11057628B2 (en) Effective intra encoding for screen data
CN109255337B (en) Face key point detection method and device
US11514263B2 (en) Method and apparatus for processing image
CN110738657B (en) Video quality evaluation method and device, electronic equipment and storage medium
CN111382123B (en) File storage method, device, equipment and storage medium
CN110248189B (en) Video quality prediction method, device, medium and electronic equipment
CN109934142B (en) Method and apparatus for generating feature vectors of video
CN113327599B (en) Voice recognition method, device, medium and electronic equipment
CN109145813B (en) Image matching algorithm testing method and device
CN112423140A (en) Video playing method and device, electronic equipment and storage medium
CN109919220B (en) Method and apparatus for generating feature vectors of video
CN113177450A (en) Behavior recognition method and device, electronic equipment and storage medium
CN113038176A (en) Video frame extraction method and device and electronic equipment
CN110891195B (en) Method, device and equipment for generating screen image and storage medium
CN110751251B (en) Method and device for generating and transforming two-dimensional code image matrix
CN116527914A (en) Decoding method and device suitable for space image
CN116708806A (en) Encoding method, decoding method and device applicable to spatial pictures
CN115706810A (en) Video frame adjusting method and device, electronic equipment and storage medium
CN116527936A (en) Coding method and device suitable for space image
CN113177483B (en) Video object segmentation method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination