CN115379114A - Panoramic video processing method and device and electronic equipment

Info

Publication number: CN115379114A
Application number: CN202210872089.5A
Authority: CN
Inventors: 吴晓东
Original assignee: Alibaba China Co Ltd
Current assignee: Alibaba China Co Ltd
Priority date: 2022-07-19
Filing date: 2022-07-19
Publication date: 2022-11-22

Abstract

The embodiment of the application discloses a panoramic video processing method, a panoramic video processing device and electronic equipment, wherein the method comprises the following steps: determining a first video to be processed, wherein the first video is a first panoramic video generated in a mode of carrying out panoramic shooting through a fixed machine position; respectively intercepting local image content in a target area from a plurality of panoramic image frames included in the first panoramic video and generating a second video; the target area is related to a motion range of a video object which moves in the first panoramic video; generating a panoramic picture by extracting a target image frame from the plurality of panoramic image frames; and transmitting the second video, the position information of the target area and the panoramic picture to the client for assembling into a second panoramic video and playing. Through the embodiment of the application, the playing image quality of the panoramic video can be improved.

Description

Panoramic video processing method and device and electronic equipment

Technical Field

The present application relates to the field of panoramic video and terminal technologies, and in particular, to a panoramic video processing method and apparatus, and an electronic device.

Background

A panoramic video is a video captured in all directions (360 degrees) by a 3D camera; while watching, a user can freely adjust the viewing direction up, down, left, and right. Panoramic video can be applied in various scenarios. For example, a traditional commodity information service system generally describes commodities using pictures, videos, text, and similar content. With the development of panoramic video technology, some commodities are now also described in the form of panoramic video.

For example, for commodities in the furniture and home furnishing category, specific commodities may be physically arranged in advance in an offline physical venue. An instructor then explains the commodities in that venue, a panoramic video is recorded during the explanation, and the recorded panoramic video may then be published online. When a user accesses a page such as the detail page of a specific commodity, a playback entry for the panoramic video can be provided in, for example, the main picture container, so that the user can gain a more comprehensive and intuitive understanding of the commodity through the panoramic video, including how it looks when used in a specific setting and how it matches other furniture and home furnishings.

However, the data volume of a panoramic video is usually large, so the video usually has to be compressed, and at a high compression ratio, before it is transmitted to the client in order to keep the network transmission delay within an acceptable range. After compression at a high compression ratio, the image quality of the panoramic video is significantly degraded, which affects the user's browsing experience.

Disclosure of Invention

The application provides a panoramic video processing method and device and an electronic device, which can improve the playing image quality of a panoramic video.

The application provides the following scheme:

a panoramic video processing method, comprising:

determining a first video to be processed, wherein the first video is a first panoramic video generated in a mode of carrying out panoramic shooting through a fixed machine position;

respectively intercepting local image content in a target area from a plurality of panoramic image frames included in the first panoramic video and generating a second video; the target area is related to a motion range of a video object which moves in the first panoramic video;

generating a panoramic picture by extracting a target image frame from the plurality of panoramic image frames;

and transmitting the second video, the position information of the target area and the panoramic picture to the client for assembling into a second panoramic video and playing.

A panoramic video display method comprises the following steps:

responding to a panoramic video playing request of a user, and requesting a server to acquire corresponding video resources, wherein the video resources comprise a second video, position information of a target area and a panoramic picture, the second video and the panoramic picture are generated according to a first video, the first video is a first panoramic video generated in a mode of carrying out panoramic shooting through a fixed machine position, and the target area is related to the motion range of a video object which moves in the first panoramic video; the second video is generated by respectively intercepting local image contents located in the target area from a plurality of panoramic image frames, and the panoramic image is generated according to a target image frame extracted from the plurality of panoramic image frames;

and assembling the second video and the panoramic picture into a second panoramic video according to the position information of the target area and playing the second panoramic video.

A panoramic video processing apparatus comprising:

the first video determining unit is used for determining a first video to be processed, wherein the first video is a first panoramic video generated in a mode of carrying out panoramic shooting through a fixed machine position;

a second video generation unit configured to intercept local image content in a target area from a plurality of panoramic image frames included in the first panoramic video and generate a second video; the target area is related to a motion range of a video object which moves in the first panoramic video;

a panoramic picture generation unit configured to generate a panoramic picture by extracting a target image frame from the plurality of panoramic image frames;

and the data transmission unit is used for transmitting the second video, the position information of the target area and the panoramic picture to the client so as to be assembled into a second panoramic video and play the second panoramic video.

A panoramic video presentation device comprising:

the video resource acquisition unit is used for responding to a panoramic video playing request of a user and requesting a server to acquire corresponding video resources, wherein the video resources comprise a second video, position information of a target area and a panoramic picture, the second video and the panoramic picture are generated according to a first video, the first video is a first panoramic video generated in a mode of carrying out panoramic shooting through a fixed machine position, and the target area is related to the motion range of a video object which moves in the first panoramic video; the second video is generated by respectively intercepting local image contents located in the target area from a plurality of panoramic image frames, and the panoramic image is generated according to a target image frame extracted from the plurality of panoramic image frames;

and the panoramic video assembling unit is used for assembling the second video and the panoramic picture into a second panoramic video according to the position information of the target area and playing the second panoramic video.

A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method of any of the preceding claims.

An electronic device, comprising:

one or more processors; and

a memory associated with the one or more processors for storing program instructions that, when read and executed by the one or more processors, perform the steps of the method of any of the preceding claims.

According to the specific embodiments provided herein, the present application discloses the following technical effects:

According to the embodiments of the present application, for a first panoramic video shot from a fixed camera position, a target area related to the motion range of the video object moving in the first panoramic video can be determined, and the local image content in the target area is intercepted from the plurality of panoramic image frames included in the first panoramic video to generate the second video. In addition, a panoramic picture may be generated by extracting a target image frame from the plurality of panoramic image frames. The second video, the position information of the target area, and the panoramic picture may then be transmitted to the client, where they are assembled into a panoramic video and played. Because only the second video, the position information of the target area, and the panoramic picture need to be transmitted, the client's requirements on data transmission delay and the like can be met without compressing the data at a high compression ratio. Accordingly, the playback end can display the panoramic video with higher image quality, which improves the user's browsing experience.

Of course, it is not necessary for any product to achieve all of the above-described advantages at the same time for the practice of the present application.

Drawings

In order to illustrate the embodiments of the present application or the technical solutions in the prior art more clearly, the drawings needed in the embodiments are briefly described below. The drawings in the following description show only some embodiments of the present application, and those skilled in the art can obtain other drawings from them without creative effort.

FIG. 1 is a schematic diagram of a system architecture provided by an embodiment of the present application;

FIG. 2 is a flow chart of a first method provided by an embodiment of the present application;

FIG. 3 is a schematic diagram of a target area determination process provided in an embodiment of the present application;

FIG. 4 is a schematic diagram of an interaction flow provided by an embodiment of the application;

FIG. 5 is a flow chart of a second method provided by embodiments of the present application;

FIG. 6 is a schematic diagram of a first apparatus provided by an embodiment of the present application;

FIG. 7 is a schematic diagram of a second apparatus provided by an embodiment of the present application;

FIG. 8 is a schematic view of an electronic device provided in an embodiment of the present application.

Detailed Description

The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments in the present application are within the scope of protection of the present application.

It should be noted that, in some scenarios, a panoramic video may be shot from a fixed camera position, that is, the position of the camera does not need to change during shooting. For example, when shooting an explanation video for a specific commodity, the instructor usually explains the commodity in front of the lens, and because the instructor's range of movement is usually limited, the panoramic video can be shot from a fixed camera position. In this case, across the panoramic image frames of the panoramic video, everything other than the part where motion occurs is identical and static. The moving part is mainly the instructor's body and shadow; in addition, if actions such as opening a cabinet door are performed during the explanation, the commodity itself may also move.

Therefore, based on the above situation, in the embodiments of the present application, an original panoramic video may be analyzed to determine the motion range of a moving video object and the target area that this motion range occupies in the panoramic image frames, and a local video composed of the local image contents may then be cut out of the panoramic video according to the target area. In addition, any one panoramic image frame may be extracted from the panoramic video to generate a panoramic picture. Thus, a panoramic video may be converted into one or more local videos, a panoramic picture, and one or more pieces of position information about the target area. When the data of a specific panoramic video is transmitted to a client, the converted result is transmitted rather than the panoramic video itself. On the client side, the panoramic video can be assembled from the converted result and played.

Thus, the size of the local video is much smaller than that of the panoramic video, and accordingly, the data amount is greatly reduced, so that the requirement on the transmission delay can be met only by using a relatively low compression ratio and even without compression. In addition, the panoramic picture is in a picture format and only needs one picture, so that the data volume of the part of information is low. Regarding the position information of the target area, since the target area is usually a rectangular area, it can be expressed by the coordinates of the start point of the rectangle in the panoramic image frame and the width and height values, the required data amount is very small, and thus the data amount as a whole is reduced. At the same time, however, since compression processing at a high compression ratio is not required, a panoramic video with higher image quality can be restored at the client.
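The data volume argument above can be made concrete with a rough calculation. The following is a minimal sketch that compares raw (uncompressed) pixel data only; the frame size, target area size, frame rate, and duration are hypothetical figures chosen for illustration and do not come from the application.

```python
# Rough comparison of raw pixel data volumes (before any compression).
# All sizes below are hypothetical illustration values.

def raw_bytes(width, height, frames, bytes_per_pixel=3):
    """Raw size in bytes of `frames` images of the given dimensions."""
    return width * height * bytes_per_pixel * frames

PANO_W, PANO_H = 3840, 1920   # assumed panoramic frame size
AREA_W, AREA_H = 600, 1000    # assumed target area size
FRAMES = 30 * 60              # assumed 30 fps for 60 seconds

# Transmitting the panoramic video itself.
full = raw_bytes(PANO_W, PANO_H, FRAMES)

# Transmitting the local video plus a single panoramic picture.
# The position information (four numbers) is negligible and omitted.
converted = raw_bytes(AREA_W, AREA_H, FRAMES) + raw_bytes(PANO_W, PANO_H, 1)

ratio = converted / full
print(f"converted result is {ratio:.1%} of the original raw data")
```

Under these assumed figures the converted result is less than a tenth of the original raw data, which is why a lower compression ratio may suffice.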

Specifically, from the perspective of system architecture, as shown in FIG. 1, the embodiments of the present application may be applied to scenarios in which content needs to be presented through panoramic video, for example, a commodity display scenario. A server-client model may be used. The server side may include a video processing module configured to process the panoramic video so as to generate and store the aforementioned local video, the position information of the target area, the panoramic picture, and the like. When the client needs to display a requested panoramic video, the server can provide the local video, the panoramic picture, and the position information of the target area to the client through a data transmission module. After receiving the data, the data receiving module of the client can pass it to a panoramic video restoration module, which can restore and play the panoramic video using the received information. For example, the panoramic picture may be duplicated into multiple copies, each mapped onto the inner surface of a spherical geometry, and each image frame of the local video may then be superimposed on top of the panoramic picture on the spherical geometry according to the position information of the target area.

The following describes specific implementations provided in embodiments of the present application in detail.

Example one

First, the embodiment provides a panoramic video processing method from the perspective of a server, and referring to fig. 2, the method may include:

S201: determining a first video to be processed, wherein the first video is a first panoramic video generated in a manner of panoramic shooting through a fixed machine position.

For example, in the aforementioned commodity information service system, when a first panoramic video of a commodity has been obtained, for example from materials provided by a merchant, and needs to be displayed to consumer users through a client page, that panoramic video can be determined as the first video to be processed. Similar videos to be processed may also arise in other scenarios. For example, a museum may need to provide information about itself to visitors through a page such as a mini program; this information may be presented in the form of a panoramic video shot while an instructor explains the collections in the museum, and such a video may likewise serve as the first video to be processed in the embodiments of the present application.

As described above, the first video is a panoramic video, and in the embodiments of the present application it is specifically a panoramic video shot from a fixed camera position. This ensures that, in every panoramic image frame of the first panoramic video, the parts other than the moving part are static and identical. In contrast, a panoramic video shot without a fixed camera position, for example one shot by an unmanned aerial vehicle, is not suitable as the first video in the embodiments of the present application.

S202: respectively intercepting local image content in a target area from a plurality of panoramic image frames included in the first panoramic video and generating a second video; the target area is associated with a range of motion of a video object that is in motion in the first panoramic video.

After determining a specific first panoramic video, the local image content in the target area may be first cut out from a plurality of panoramic image frames in the first panoramic video and a second video may be generated, and the target area may be related to the motion range of the video object moving in the first panoramic video. That is, after a certain target area is determined, for each panoramic image frame in the first panoramic video, the local image frame may be cut out within the same area. Thus, the size of each local image frame and the position in the panoramic image frame are the same.

It should be noted here that each panoramic image frame may correspond to a rectangular image space, in which a coordinate system may be established with, for example, the upper left corner as the origin. Once the origins of the panoramic image frames are aligned, it can be seen that only the image content within the motion range of the aforementioned moving video object differs between panoramic image frames, and all other portions are the same; therefore, in the embodiments of the present application, the content outside the motion range may be treated as static image content. In addition, since the panoramic image frames can be aligned at the origin, they all share the same coordinate system, and the position of the target area is its position in this coordinate system. For example, for convenience of expression, the target area may be a rectangular area, in which case it may be expressed by the coordinates of one vertex of the rectangle in the coordinate system together with the width and height of the rectangle. Alternatively, the vertex coordinates may be expressed as UV coordinates, which makes it easier to determine where the area lies when mapped onto the inner surface of the sphere, and so on.
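As an illustration of the two ways of expressing the target area, the sketch below converts a pixel rectangle (vertex plus width and height) into normalized UV coordinates. The `Rect` type and `rect_to_uv` function are hypothetical names, and the convention used here (origin at the upper left corner, both axes normalized to the range 0 to 1) is an assumption; some rendering systems place the V origin at the bottom instead.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    """Target area in pixel coordinates, origin at the upper left corner."""
    x: int       # left edge of the rectangle
    y: int       # top edge of the rectangle
    width: int
    height: int


def rect_to_uv(rect, frame_width, frame_height):
    """Return (u0, v0, u1, v1): the rectangle's opposite corners in UV space.

    Assumes U grows to the right and V grows downward, both in [0, 1].
    """
    u0 = rect.x / frame_width
    v0 = rect.y / frame_height
    u1 = (rect.x + rect.width) / frame_width
    v1 = (rect.y + rect.height) / frame_height
    return (u0, v0, u1, v1)


# Hypothetical example: a 960 x 960 area inside a 3840 x 1920 panoramic frame.
area = Rect(x=960, y=480, width=960, height=960)
print(rect_to_uv(area, 3840, 1920))
```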

Specifically, the target area may be determined in a plurality of ways, for example, in one way, a plurality of similar panoramic videos may be observed or analyzed, and if it is found that most of the video objects in which motion occurs move within a certain range, the target area may be determined according to the range. That is, panoramic videos of the same type may all default to the video objects in which motion occurs moving within the same range. Of course, in this way, the target area may be too large for a single panoramic video, which may result in a larger size of the captured second video, which is not favorable for subsequent transmission.

Therefore, in another implementation, after the current first panoramic video is determined, a plurality of panoramic image frames included in the first panoramic video may be analyzed to determine the position information of the target area corresponding to the motion range of the video object in which the motion occurs in the panoramic image frames. Then, a plurality of local image frames are obtained by respectively intercepting the local image content in the target area from the plurality of panoramic image frames, and a second video is generated according to the plurality of local image frames. That is, the specific target area may be determined by analyzing each panoramic image frame, which may be more accurate and also advantageous to reduce the size of the second video.
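One simple way to approximate this kind of analysis is frame differencing. The application does not prescribe a particular analysis technique, so the sketch below is a stand-in under stated assumptions: frames are small grayscale images stored as nested lists, the first frame serves as the reference, and `motion_bounding_box` is a hypothetical helper name. A pixel that the moving object covers identically in every frame would not be detected by this simplification.

```python
def motion_bounding_box(frames, threshold=0):
    """Bounding box (x, y, width, height) of all pixels that differ from the
    first frame in any later frame, or None if nothing changes.

    frames: list of 2D lists of grayscale values, all the same size.
    """
    reference = frames[0]
    min_x = min_y = max_x = max_y = None
    for frame in frames[1:]:
        for y, (ref_row, row) in enumerate(zip(reference, frame)):
            for x, (ref_px, px) in enumerate(zip(ref_row, row)):
                if abs(px - ref_px) > threshold:
                    # Grow the box so that it covers this changed pixel.
                    min_x = x if min_x is None else min(min_x, x)
                    max_x = x if max_x is None else max(max_x, x)
                    min_y = y if min_y is None else min(min_y, y)
                    max_y = y if max_y is None else max(max_y, y)
    if min_x is None:
        return None
    return (min_x, min_y, max_x - min_x + 1, max_y - min_y + 1)
```

Because the camera position is fixed, everything outside the returned box is identical across frames, which is what makes a single static picture sufficient for the background.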

For example, the position of a human body and/or its shadow may first be identified in one panoramic image frame and enclosed in a rectangular area. In the next frame, or several frames later, the human body may have moved a certain distance in some direction, and the rectangular area may then be enlarged so that it also covers the position of the human body after the movement; repeating this determines the motion range of the human body. Specifically, as shown in FIG. 3, assume that the video object moving in the first panoramic video is a human body located, at the initial time, at position a shown in FIG. 3 (1); a rectangular range 31 can then be determined at the position of the human body. In a certain later frame, the human body moves to position b in FIG. 3 (2), and the motion range of the human body can be expanded to the rectangular range shown at 32. If the human body then moves to position c in FIG. 3 (3), the rectangular area representing its motion range is further enlarged, as shown at 33. Suppose the human body does not continue moving to the right but instead returns to the position shown as d in FIG. 3 (4); since this position lies within the preceding rectangular area 33, the rectangular area does not change, and the area shown at 34 is equal to the area shown at 33 in FIG. 3 (3). Subsequently, if the human body is found to move back and forth between position a and position c, its motion range can be determined as the area shown at 33 in FIG. 3 (3), and so on.

Naturally, the above description mainly takes the human body as an example of the moving object being photographed. In practical applications, the instructor may also move a commodity while explaining it, for example by opening a cabinet door, and such movement can be recognized as well. However, since the motion of the commodity is passively caused by the movement of the human body, the moving human body and commodity may be treated as one group of moving objects and assigned the same target area. For example, in the example shown in FIG. 3, assume that the human body still moves back and forth between a and c as in FIG. 3 (4), but also opens a cabinet door during the movement, and the cabinet door may be taller than the human body; the target area corresponding to the motion range may then also be expanded in the height direction, and so on.
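The progressive enlargement of the rectangular area in FIG. 3 amounts to taking the union of the per-frame bounding rectangles. The sketch below illustrates this under the assumption that each detected position is already available as an `(x, y, width, height)` tuple; the function names and the coordinates are hypothetical.

```python
def union(a, b):
    """Smallest rectangle (x, y, width, height) that covers both a and b."""
    x0 = min(a[0], b[0])
    y0 = min(a[1], b[1])
    x1 = max(a[0] + a[2], b[0] + b[2])
    y1 = max(a[1] + a[3], b[1] + b[3])
    return (x0, y0, x1 - x0, y1 - y0)


def motion_range(boxes):
    """Accumulate per-frame boxes into one target area, as in FIG. 3."""
    boxes = iter(boxes)
    area = next(boxes)
    for box in boxes:
        area = union(area, box)
    return area


# Hypothetical positions a, b, c, d of the human body in successive frames.
a = (100, 200, 50, 150)
b = (180, 200, 50, 150)
c = (260, 200, 50, 150)
d = (150, 200, 50, 150)   # lies inside the range already covered
body_range = motion_range([a, b, c, d])

# A taller cabinet door opened by the instructor extends the area upward.
door = (200, 120, 80, 230)
target_area = union(body_range, door)
```

Returning to position d leaves the area unchanged, matching areas 33 and 34 in FIG. 3, while the cabinet door enlarges the area only in the height direction.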

After the position, within the panoramic image frame, of the target area corresponding to the motion range of the moving video object has been determined, the local image content in the target area can be cut out from each of the plurality of panoramic image frames; the local image content cut from every panoramic image frame has the same size and the same position in the panoramic image frame. A second video may then be generated from the plurality of cut-out local image contents. Since the second video includes only the image content within the motion range of the video object that moves in the panoramic video, its data volume is greatly reduced compared with the original panoramic video.
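The cutting step can be sketched as follows, assuming each panoramic image frame is a 2D list of pixel values and the target area is an `(x, y, width, height)` tuple; `crop` and `make_second_video` are hypothetical helper names, and video encoding is left out.

```python
def crop(frame, area):
    """Cut the target area out of one panoramic image frame."""
    x, y, width, height = area
    return [row[x:x + width] for row in frame[y:y + height]]


def make_second_video(frames, area):
    """Apply the same target area to every frame, so that all local image
    frames share one size and one position in the panoramic frame."""
    return [crop(frame, area) for frame in frames]


# Tiny 4 x 3 example frame whose pixel values encode (row, column).
frame = [[r * 10 + c for c in range(4)] for r in range(3)]
local = crop(frame, (1, 1, 2, 2))
```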

It should be noted that, in a specific implementation, one panoramic video may include a plurality of independently moving video objects, for example, a plurality of instructors. In this case, the motion range of each independently moving video object may be identified separately; correspondingly, there may be a plurality of target areas, and a plurality of corresponding second videos may be generated.

S203: and generating a panoramic picture by extracting a target image frame from the plurality of panoramic image frames.

In addition to generating the aforementioned second video, a target image frame, for example any one of the panoramic image frames, may be extracted from the original plurality of panoramic image frames. After such a panoramic image frame is extracted, it may be converted into a panoramic picture in a picture format.

In this way, the original panoramic video can be converted into the second video and the static panoramic picture, and the position information of the target area can be additionally saved. The position information about the target area may be saved in the form of a separate document or the like, or may be saved in a data volume corresponding to the second video as extension information or the like of the second video, and so on.
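As one possible form of the separate document mentioned above, the position information could be stored as a small JSON file alongside the second video and the panoramic picture. The field names and file names in this sketch are assumptions made for illustration; the application does not define a file format.

```python
import json


def build_manifest(second_video, panoramic_picture, area):
    """Serialize the converted result's metadata as JSON text.

    area: (x, y, width, height) of the target area in frame coordinates.
    """
    x, y, width, height = area
    manifest = {
        "second_video": second_video,            # hypothetical file name
        "panoramic_picture": panoramic_picture,  # hypothetical file name
        "target_area": {"x": x, "y": y, "width": width, "height": height},
    }
    return json.dumps(manifest, indent=2)


def read_target_area(manifest_text):
    """Recover the target area tuple from the JSON text."""
    area = json.loads(manifest_text)["target_area"]
    return (area["x"], area["y"], area["width"], area["height"])


text = build_manifest("second.mp4", "panorama.jpg", (100, 120, 210, 230))
```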

The second video and the panoramic picture may be generated directly from the original panoramic video, or the panoramic video may first be compressed and the second video and the panoramic picture then generated from the compressed result. Of course, the compression performed here may use a low compression ratio, so that it has little influence on the image quality.

S204: and transmitting the second video, the position information of the target area and the panoramic picture to the client for assembling into a second panoramic video and playing.

After the original first panoramic video has been converted into the second video and the static panoramic picture, the second video, the position information of the target area, and the panoramic picture can be transmitted to a client when the client needs to access the panoramic video. The information may be compressed to some degree for transmission; however, since the data volume before transmission is already much smaller than that of the original panoramic video, the compression ratio used can be relatively low, which avoids a significant impact on the image quality at the playback end.

After receiving the information, the client can assemble a second panoramic video according to the information and play the second panoramic video. For example, in a specific implementation, the client may first copy a plurality of panoramic pictures according to the number of local image frames included in the second video, and map the panoramic pictures to the inner surfaces of the spherical geometric bodies respectively. Then, according to the position information of the target area, each local image frame in the second video is respectively mapped to the inner surface of the spherical geometric body and the upper layer of the panoramic picture so as to be assembled into a panoramic video for playing. That is, the local image frame may be superimposed on the upper layer of the panoramic image according to the position of the target area, so as to complete the restoration of the panoramic video. For a viewer user, the video can still be adjusted to be watched up, down, left and right at will, but the image quality of the panoramic video can be ensured because the compression rate of the video in the transmission process is low.
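The superimposition step can be sketched in flat frame coordinates, leaving out the mapping onto the inner surface of the spherical geometry, which would normally be handled by a rendering engine. The sketch assumes images are 2D lists and the target area is an `(x, y, width, height)` tuple; `overlay` and `assemble` are hypothetical names.

```python
def overlay(picture, local_frame, area):
    """Return a copy of the panoramic picture with one local image frame
    pasted on top of it at the target area."""
    x, y, width, height = area
    out = [row[:] for row in picture]   # copy, so the picture can be reused
    for dy in range(height):
        out[y + dy][x:x + width] = local_frame[dy]
    return out


def assemble(picture, second_video, area):
    """One restored panoramic frame per local image frame."""
    return [overlay(picture, frame, area) for frame in second_video]


# Tiny example: 4 x 3 static picture, 2 x 2 local frame placed at (1, 1).
picture = [[0] * 4 for _ in range(3)]
restored = overlay(picture, [[1, 2], [3, 4]], (1, 1, 2, 2))
```

Copying the picture for every frame mirrors the description of duplicating the panoramic picture according to the number of local image frames; an actual player might instead keep one background layer and update only the overlay.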

For example, as shown in FIG. 4, assume that 41 shows one of the panoramic image frames and that, by analyzing a plurality of the panoramic image frames, the moving video object is determined to be an instructor whose motion range lies within the rectangular area shown at 42. A plurality of local image contents may therefore be cut out from the plurality of panoramic image frames according to the position of the rectangular area, and a second video (the local video) may then be generated. In addition, a target image frame may be extracted from the plurality of panoramic image frames by frame extraction to generate a panoramic picture. After the second video, the position information of the target area, and the panoramic picture are provided to the client, the panoramic picture may be duplicated into multiple copies, each mapped onto the inner surface of the spherical geometry, and each image frame of the second video may then be superimposed on top of the panoramic picture according to the position information of the target area, as shown at 43 in FIG. 4, so that the second panoramic video is assembled, as shown at 44 (the result of mapping the panoramic picture onto the spherical geometry is not shown in the figure).

The scheme provided by the embodiment of the application is introduced above, and particularly in practical application, the scheme can be used in various application scenes to realize transmission of panoramic videos. For example, in a specific scenario of the product information service system, the first video may be a panoramic interpretation video corresponding to the target product, and at this time, after the second video and the panoramic picture are generated, the corresponding relationship between the target product and the second video, the position information of the target area, and the panoramic picture may be further stored. In this way, after a request for accessing the panoramic video of the target commodity submitted by the client is received, the second video corresponding to the target commodity, the position information of the target area and the panoramic picture can be transmitted to the client, so that the client can assemble and play the panoramic video according to the received information. Of course, the method can also be applied in other scenes, and is not described in detail here.
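The stored correspondence between a target commodity and its converted resources can be pictured as a simple keyed lookup. The sketch below uses an in-memory dictionary; the class name, field names, and identifiers are hypothetical, and a real system would presumably use persistent storage.

```python
class ResourceStore:
    """Maps a commodity identifier to its converted panoramic resources."""

    def __init__(self):
        self._by_commodity = {}

    def save(self, commodity_id, second_video, target_area, panoramic_picture):
        """Record the correspondence after the conversion has been done."""
        self._by_commodity[commodity_id] = {
            "second_video": second_video,
            "target_area": target_area,
            "panoramic_picture": panoramic_picture,
        }

    def lookup(self, commodity_id):
        """Resources to send when a client requests this commodity's
        panoramic video, or None if nothing has been stored for it."""
        return self._by_commodity.get(commodity_id)


store = ResourceStore()
store.save("sofa-001", "second.mp4", (100, 120, 210, 230), "panorama.jpg")
resources = store.lookup("sofa-001")
```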

According to the embodiments of the present application, for a first panoramic video shot from a fixed camera position, a target area related to the motion range of the video object moving in the first panoramic video can be determined, and the local image content in the target area is intercepted from the plurality of panoramic image frames included in the first panoramic video to generate the second video. In addition, a panoramic picture may be generated by extracting a target image frame from the plurality of panoramic image frames. The second video, the position information of the target area, and the panoramic picture can then be transmitted to the client, so that the client can assemble the second panoramic video from the received information and play it. Because only the second video, the position information of the target area, and the panoramic picture need to be transmitted, the client's requirements on data transmission delay and the like can be met without compressing the data at a high compression ratio. Accordingly, the playback end can display the panoramic video with higher image quality, which improves the user's browsing experience.

Example two

The second embodiment corresponds to the first embodiment, and provides a panoramic video display method from the perspective of the client, with reference to fig. 5, where the method may specifically include:

S501: in response to a panoramic video playing request of a user, requesting a server to acquire corresponding video resources, wherein the video resources comprise a second video, position information of a target area and a panoramic picture, the second video and the panoramic picture are generated according to a first video, the first video is a first panoramic video generated by panoramic shooting from a fixed machine position, and the target area is determined according to a motion range of a video object which moves in the first panoramic video; the second video is generated by respectively intercepting local image content located in the target area from a plurality of panoramic image frames, and the panoramic picture is generated from a target image frame extracted from the plurality of panoramic image frames.

In a specific implementation, the client may request the server to acquire the video resource corresponding to the target commodity when receiving a request of a user for accessing the panoramic video of the target commodity.

S502: assembling the second video and the panoramic picture into a second panoramic video according to the position information of the target area, and playing the second panoramic video.

Specifically, the panoramic picture may be copied into a plurality of copies according to the number of local image frames included in the second video, and the copies may be respectively mapped to the inner surface of a spherical geometric body; then, according to the position information of the target area, each local image frame in the second video is mapped to the inner surface of the spherical geometric body, on the layer above the panoramic picture, so as to assemble the second panoramic video for playing.
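The layering described above can be sketched in flat image space, before any mapping onto the sphere. This is only an illustration under assumptions: frames are modelled as nested lists of pixel values, the position information is assumed to be an `(x, y, width, height)` rectangle in panorama pixels, and the names `composite` and `assemble` are hypothetical. An actual player would more likely do the layering in its renderer on the spherical geometry.

```python
from typing import List, Tuple

Frame = List[List[int]]  # rows of pixel values; a stand-in for a decoded image
Area = Tuple[int, int, int, int]  # assumed position format: (x, y, width, height)


def composite(panorama: Frame, local_frame: Frame, area: Area) -> Frame:
    """Return one copy of the panoramic picture with a local frame laid on top."""
    x, y, width, height = area
    if len(local_frame) != height or any(len(row) != width for row in local_frame):
        raise ValueError("local frame does not match the target area size")
    if x < 0 or y < 0 or y + height > len(panorama) or x + width > len(panorama[0]):
        raise ValueError("target area lies outside the panoramic picture")
    frame = [row[:] for row in panorama]  # copy, so the original stays untouched
    for dy, local_row in enumerate(local_frame):
        frame[y + dy][x:x + width] = local_row  # local content is the upper layer
    return frame


def assemble(panorama: Frame, local_frames: List[Frame], area: Area) -> List[Frame]:
    """One panoramic picture copy per local image frame of the second video."""
    return [composite(panorama, local, area) for local in local_frames]


panorama = [[0] * 6 for _ in range(4)]
local_frames = [[[1, 1], [1, 1]], [[2, 2], [2, 2]]]
second_panoramic_video = assemble(panorama, local_frames, (3, 1, 2, 2))
```

Each output frame reuses the same static background, which is why a single panoramic picture is sufficient on the wire.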

The video resource may comprise a plurality of second videos, which respectively correspond to different target areas; in this case, each local image frame in the plurality of second videos may be mapped to the inner surface of the spherical geometric body, on the layer above the panoramic picture, according to the position information of the target area corresponding to each of the plurality of second videos.
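When several second videos are present, the same layering can be repeated per target area for each frame index. The sketch below is an assumption-laden illustration: it works on nested lists in flat panorama pixel space, assumes an `(x, y, width, height)` position format, assumes each local frame matches its area size, and truncates to the shortest second video; the names `overlay_all` and `assemble_multi` are hypothetical.

```python
from typing import List, Sequence, Tuple

Frame = List[List[int]]
Area = Tuple[int, int, int, int]  # assumed position format: (x, y, width, height)


def overlay_all(panorama: Frame, layers: Sequence[Tuple[Frame, Area]]) -> Frame:
    """Lay every local frame over a copy of the panoramic picture."""
    frame = [row[:] for row in panorama]
    for local_frame, (x, y, width, height) in layers:
        for dy in range(height):
            # Assumes the local frame is exactly width x height.
            frame[y + dy][x:x + width] = local_frame[dy][:width]
    return frame


def assemble_multi(panorama: Frame,
                   second_videos: Sequence[Tuple[List[Frame], Area]]) -> List[Frame]:
    """second_videos holds (local_frames, target_area) pairs."""
    if not second_videos:
        return []
    frame_count = min(len(frames) for frames, _ in second_videos)
    return [
        overlay_all(panorama, [(frames[i], area) for frames, area in second_videos])
        for i in range(frame_count)
    ]


panorama = [[0] * 8 for _ in range(3)]
video_a = ([[[1]], [[2]]], (0, 0, 1, 1))
video_b = ([[[7, 7]], [[8, 8]]], (5, 2, 2, 1))
result = assemble_multi(panorama, [video_a, video_b])
```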

For the parts of the second embodiment not described in detail, reference may be made to the descriptions of the first embodiment and other parts of the present specification, which are not described herein again.

It should be noted that the embodiments of the present application may involve the use of user data. In practical applications, user-specific personal data may be used in the scheme described herein only within the scope permitted by the applicable laws and regulations of the country concerned, and subject to their requirements (for example, with the user's explicit consent and after the user has been informed).

Corresponding to the first embodiment, an embodiment of the present application further provides a panoramic video processing apparatus, and referring to fig. 6, the apparatus may include:

a first video determining unit 601, configured to determine a first video to be processed, where the first video is a first panoramic video generated in a manner of performing panoramic shooting through a fixed station;

a second video generating unit 602 configured to intercept local image content in a target region from a plurality of panoramic image frames included in the first panoramic video and generate a second video; the target area is related to a motion range of a video object which moves in the first panoramic video;

a panoramic picture generation unit 603 configured to generate a panoramic picture by extracting a target image frame from the plurality of panoramic image frames;

a data transmission unit 604, configured to transmit the second video, the position information of the target area, and the panoramic picture to the client, so that the client assembles them into a second panoramic video and plays the second panoramic video.

Specifically, the second video generation unit may be specifically configured to:

analyzing a plurality of panoramic image frames included in the panoramic video, and determining position information of a target area corresponding to a motion range of a video object in which motion occurs in the panoramic image frames;

according to the position information of the target area, respectively intercepting the local image content in the target area from the plurality of panoramic image frames to obtain a plurality of local image frames, and generating a second video according to the plurality of local image frames.
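The two operations of the second video generation unit can be sketched as follows. The application does not say how the motion range is detected; this sketch swaps in simple frame differencing against the first frame, which is an assumption, as are the grayscale nested-list frames, the `threshold` parameter, the `(x, y, width, height)` position format, and the function names.

```python
from typing import List, Optional, Tuple

Frame = List[List[int]]  # grayscale rows; a stand-in for a decoded panoramic frame
Area = Tuple[int, int, int, int]  # assumed position format: (x, y, width, height)


def motion_target_area(frames: List[Frame], threshold: int = 10) -> Optional[Area]:
    """Bounding box of all pixels that differ from the first frame.

    Frame differencing is an assumed technique, not the application's method.
    Expects a non-empty list of equally sized frames.
    """
    reference = frames[0]
    xs: List[int] = []
    ys: List[int] = []
    for frame in frames[1:]:
        for y, (row, ref_row) in enumerate(zip(frame, reference)):
            for x, (pixel, ref_pixel) in enumerate(zip(row, ref_row)):
                if abs(pixel - ref_pixel) > threshold:
                    xs.append(x)
                    ys.append(y)
    if not xs:
        return None  # nothing moved
    return (min(xs), min(ys), max(xs) - min(xs) + 1, max(ys) - min(ys) + 1)


def crop(frame: Frame, area: Area) -> Frame:
    """Intercept the local image content inside the target area."""
    x, y, width, height = area
    return [row[x:x + width] for row in frame[y:y + height]]


def make_second_video(frames: List[Frame], area: Area) -> List[Frame]:
    """One local image frame per panoramic image frame."""
    return [crop(frame, area) for frame in frames]


frames = [[[0] * 6 for _ in range(4)] for _ in range(3)]
frames[1][1][2] = 255  # the object appears at x=2, y=1
frames[2][2][3] = 255  # then at x=3, y=2
area = motion_target_area(frames)
second_video = make_second_video(frames, area)
```

Because the same rectangle is used for every frame, the position information only needs to be sent once alongside the second video.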

Wherein, the device can also include:

a first compression unit, configured to perform compression processing on the second video before transmitting the second video, the location information of the target area, and the panoramic picture to the client.

or a second compression unit, configured to perform compression processing on the panoramic video before the second video and the panoramic picture are generated.

If there are a plurality of video objects that move independently, there are a plurality of target areas, and a plurality of second videos are generated.
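One way to obtain several target areas is to group the moving pixels into connected regions and take one bounding box per region. The application does not prescribe this; the sketch below assumes a binary motion mask as input and uses 4-connected component labelling by breadth-first search, and the function name and the `(x, y, width, height)` output format are hypothetical.

```python
from collections import deque
from typing import List, Tuple

Area = Tuple[int, int, int, int]  # assumed position format: (x, y, width, height)


def target_areas_from_mask(mask: List[List[int]]) -> List[Area]:
    """One bounding box per 4-connected region of non-zero mask pixels."""
    height = len(mask)
    width = len(mask[0]) if mask else 0
    seen = [[False] * width for _ in range(height)]
    areas: List[Area] = []
    for y in range(height):
        for x in range(width):
            if not mask[y][x] or seen[y][x]:
                continue
            queue = deque([(x, y)])
            seen[y][x] = True
            min_x = max_x = x
            min_y = max_y = y
            while queue:
                cx, cy = queue.popleft()
                min_x, max_x = min(min_x, cx), max(max_x, cx)
                min_y, max_y = min(min_y, cy), max(max_y, cy)
                for nx, ny in ((cx + 1, cy), (cx - 1, cy), (cx, cy + 1), (cx, cy - 1)):
                    if 0 <= nx < width and 0 <= ny < height \
                            and mask[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((nx, ny))
            areas.append((min_x, min_y, max_x - min_x + 1, max_y - min_y + 1))
    return areas


mask = [
    [1, 1, 0, 0, 0, 0],
    [1, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 1, 1],
]
areas = target_areas_from_mask(mask)
```

Each returned rectangle would then be cropped into its own second video.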

Specifically, the first video comprises a panoramic explanation video corresponding to a target commodity;

at this time, the apparatus may further include:

a correspondence storage unit, configured to store the correspondence between the target commodity and the second video, the position information of the target area, and the panoramic picture after the second video and the panoramic picture are generated;

the data transmission unit may specifically be configured to:

and responding to a request for accessing the panoramic video of the target commodity submitted by a client, and transmitting a second video corresponding to the target commodity, the position information of the target area and the panoramic picture to the client.

Corresponding to the second embodiment, an embodiment of the present application further provides a panoramic video display apparatus, and referring to fig. 7, the apparatus may include:

a video resource obtaining unit 701, configured to respond to a panoramic video playing request of a user, and request a server to obtain a corresponding video resource, where the video resource includes a second video, position information of a target area, and a panoramic picture, the second video and the panoramic picture are generated according to a first video, the first video is a first panoramic video generated by panoramic shooting from a fixed machine position, and the target area is related to a motion range of a video object that moves in the first panoramic video; the second video is generated by respectively intercepting local image content located in the target area from a plurality of panoramic image frames, and the panoramic picture is generated according to a target image frame extracted from the plurality of panoramic image frames;

a panoramic video assembling unit 702, configured to assemble the second video and the panoramic picture into a second panoramic video according to the position information of the target area, and play the second panoramic video.

The panoramic video assembling unit may specifically include:

a panoramic picture mapping subunit, configured to make multiple copies of the panoramic picture according to the number of local image frames included in the second video, and map the copies to the inner surface of a spherical geometric body respectively;

and the local image frame mapping subunit is configured to map, according to the position information of the target area, each local image frame in the second video to the inner surface of the spherical geometric body and the upper layer of the panoramic picture, respectively, so as to assemble a panoramic video for playing.
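To place a local image frame on the inner surface of the sphere, the pixel position of the target area has to be converted into directions on the sphere. The sketch below assumes the panoramic picture uses an equirectangular layout and a y-up, z-forward axis convention; neither is stated in the application, and the function names are hypothetical.

```python
import math
from typing import List, Tuple

Vector = Tuple[float, float, float]


def pixel_to_direction(u: float, v: float, pano_width: int, pano_height: int) -> Vector:
    """Map an equirectangular pixel (u, v) to a unit vector on the viewing sphere."""
    longitude = (u / pano_width) * 2.0 * math.pi - math.pi   # -pi at left edge, +pi at right
    latitude = math.pi / 2.0 - (v / pano_height) * math.pi   # +pi/2 at top, -pi/2 at bottom
    x = math.cos(latitude) * math.sin(longitude)
    y = math.sin(latitude)
    z = math.cos(latitude) * math.cos(longitude)
    return (x, y, z)


def area_corner_directions(area: Tuple[int, int, int, int],
                           pano_width: int, pano_height: int) -> List[Vector]:
    """Directions of the four corners of an (x, y, width, height) target area."""
    x, y, width, height = area
    corners = [(x, y), (x + width, y), (x + width, y + height), (x, y + height)]
    return [pixel_to_direction(u, v, pano_width, pano_height) for u, v in corners]


centre = pixel_to_direction(2048, 1024, 4096, 2048)
corners = area_corner_directions((1200, 400, 640, 480), 4096, 2048)
```

A renderer could use the corner directions to position a textured patch for the second video just inside the sphere that carries the panoramic picture.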

Specifically, the video resource includes a plurality of second videos, and the second videos respectively correspond to different target areas;

at this time, the local image frame mapping subunit may specifically be configured to:

and according to the position information of the target areas corresponding to the plurality of second videos respectively, mapping each local image frame in the plurality of second videos to the inner surface of the spherical geometric body and the upper layer of the panoramic picture respectively.

Specifically, the video resource obtaining unit may be specifically configured to:

responding to a request of a user for accessing the panoramic video of the target commodity, and requesting a server to acquire the video resource corresponding to the target commodity.

In addition, the present application also provides a computer readable storage medium, on which a computer program is stored, where the computer program is used to implement the steps of the method described in any one of the foregoing method embodiments when executed by a processor.

And an electronic device comprising:

one or more processors; and

a memory associated with the one or more processors for storing program instructions that, when read and executed by the one or more processors, perform the steps of the method of any of the preceding method embodiments.

Fig. 8 illustrates an architecture of an electronic device, which may include, in particular, a processor 810, a video display adapter 811, a disk drive 812, an input/output interface 813, a network interface 814, and a memory 820. The processor 810, the video display adapter 811, the disk drive 812, the input/output interface 813, the network interface 814, and the memory 820 may be communicatively connected by a communication bus 830.

The processor 810 may be implemented by a general-purpose CPU (Central Processing Unit), a microprocessor, an Application Specific Integrated Circuit (ASIC), or one or more integrated circuits, and is configured to execute related programs to implement the technical solution provided by the present application.

The memory 820 may be implemented in the form of a ROM (Read Only Memory), a RAM (Random Access Memory), a static storage device, a dynamic storage device, or the like. The memory 820 may store an operating system 821 for controlling operation of the electronic device 800, and a Basic Input Output System (BIOS) for controlling low-level operation of the electronic device 800. In addition, a web browser 823, a data storage management system 824, a panoramic video processing system 825, and the like may also be stored. The panoramic video processing system 825 may be an application program that implements the operations of the foregoing steps in the embodiments of the present application. In summary, when the technical solution provided by the present application is implemented by software or firmware, the relevant program code is stored in the memory 820 and called for execution by the processor 810.

The input/output interface 813 is used for connecting an input/output module to realize information input and output. The input/output module may be configured as a component in the device (not shown in the figure) or may be external to the device to provide a corresponding function. The input devices may include a keyboard, a mouse, a touch screen, a microphone, various sensors, etc., and the output devices may include a display, a speaker, a vibrator, an indicator light, etc.

The network interface 814 is used for connecting a communication module (not shown in the figure) to realize communication interaction between the device and other devices. The communication module can communicate in a wired mode (such as USB, network cable and the like) or in a wireless mode (such as mobile network, Wi-Fi, Bluetooth and the like).

Bus 830 includes a pathway for communicating information between various components of the device, such as processor 810, video display adapter 811, disk drive 812, input/output interface 813, network interface 814, and memory 820.

It should be noted that although only the processor 810, the video display adapter 811, the disk drive 812, the input/output interface 813, the network interface 814, the memory 820, the bus 830, and the like are shown for the above device, in a specific implementation the device may also include other components necessary for normal operation. Furthermore, those skilled in the art will understand that the above device may also include only the components necessary to implement the solution of the present application, and not necessarily all of the components shown in the figure.

From the above description of the embodiments, it is clear to those skilled in the art that the present application can be implemented by software plus a necessary general-purpose hardware platform. Based on such understanding, the technical solutions of the present application may be implemented, in essence or in part, in the form of a software product, which may be stored in a storage medium, such as a ROM/RAM, a magnetic disk, an optical disk, etc., and which includes several instructions for enabling a computer device (which may be a personal computer, a server, a network device, etc.) to execute the method according to the embodiments, or some parts of the embodiments, of the present application.

The embodiments in the present specification are described in a progressive manner; for the same and similar parts, the embodiments may be referred to each other, and each embodiment focuses on its differences from the other embodiments. In particular, the apparatus or system embodiments are substantially similar to the method embodiments and are therefore described relatively briefly; for related points, reference may be made to the corresponding descriptions of the method embodiments. The above-described apparatus and system embodiments are only illustrative, wherein the units described as separate parts may or may not be physically separate, and the parts displayed as units may or may not be physical units, and may be located in one place or distributed over a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment. One of ordinary skill in the art can understand and implement the solution without inventive effort.

The panoramic video processing method, the panoramic video processing device and the electronic device provided by the present application have been described in detail above. Specific examples are used herein to explain the principle and the implementation of the present application, and the description of the embodiments is only intended to help in understanding the method and the core idea of the present application. Meanwhile, a person skilled in the art may, according to the idea of the present application, make changes to the specific embodiments and the application range. In view of the above, this description should not be construed as limiting the present application.

Claims

1. A panoramic video processing method, comprising:

2. The method of claim 1,

the method for respectively intercepting local image content in a target area from a plurality of panoramic image frames included in the first panoramic video and generating a second video comprises the following steps:

3. The method of claim 1, further comprising:

compressing the second video before transmitting the second video, the position information of the target area and the panoramic picture to the client.

4. The method of claim 1, further comprising:

and compressing the panoramic video before generating the second video and the panoramic picture.

5. The method of claim 1,

if the number of the video objects which independently move is multiple, the number of the target areas is multiple, and multiple second videos are generated.

6. The method according to any one of claims 1 to 5,

the first video comprises a panoramic explanation video corresponding to a target commodity;

the method further comprises the following steps:

after the second video and the panoramic picture are generated, storing the corresponding relation between the target commodity and the second video, the position information of the target area and the panoramic picture;

the transmitting the second video, the position information of the target area, and the panoramic picture to the client includes:

7. A panoramic video display method is characterized by comprising the following steps:

8. The method of claim 7,

the restoring the second video and the panoramic picture into a second panoramic video according to the position information of the target area and playing the second panoramic video comprises the following steps:

copying a plurality of panoramic pictures according to the number of local image frames included in the second video, and respectively mapping the panoramic pictures to the inner surfaces of spherical geometric bodies;

and according to the position information of the target area, mapping each local image frame in the second video to the inner surface of the spherical geometric body and the upper layer of the panoramic picture respectively so as to assemble the second panoramic video for playing.

9. The method of claim 8,

the video resources comprise a plurality of second videos and respectively correspond to different target areas;

the mapping, according to the position information of the target region, each local image frame in the second video to the inner surface of the spherical geometry body and the upper layer of the panoramic picture, respectively, includes:

10. The method according to any one of claims 7 to 9,

the responding to the panoramic video playing request of the user, requesting the server to acquire the corresponding video resource, including:

11. A panoramic video processing apparatus, comprising:

12. A panoramic video presentation apparatus, comprising:

13. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method of any one of claims 1 to 10.

14. An electronic device, comprising:

one or more processors; and

memory associated with the one or more processors for storing program instructions which, when read and executed by the one or more processors, perform the steps of the method of any one of claims 1 to 10.