WO2018192276A1

WO2018192276A1 - Video recommendation method, server, and client

Info

Publication number: WO2018192276A1
Application number: PCT/CN2018/071527
Authority: WO
Inventors: 李东晖; 刘俊成; 徐昊; 李捷; 杨伟东; 潘柏宇; 王冀
Original assignee: 传线网络科技（上海）有限公司
Priority date: 2017-04-21
Filing date: 2018-01-05
Publication date: 2018-10-25
Also published as: TW201840196A; CN108737850A; TWI702841B; CN108737850B

Abstract

Disclosed by the embodiments of the present application are a video recommendation method, server, and client, said method comprising: receiving a video load request sent by a client, said video load request referring to panoramic video data; the panoramic video characterized by said panoramic video data comprising at least two video areas displaying different content; feeding said panoramic video data back to said client, so that the client obtains the panoramic video according to the rendering of the panoramic video data; receiving view area information sent by the client, said view area information being used for characterizing the target video area currently being viewed by a user; on the basis of said view area information, feeding recommended video data back to the client; the content displayed in the video characterized by the recommended video data corresponds to the content displayed in the target video area. The technical solution provided by the present application provides to the user a video meeting said user's actual requirements, thereby improving the effectiveness of video recommendation.

Description

Video recommendation method, server and client

The present application claims priority to Chinese Patent Application No. 201710267151.7, entitled "A Video Recommendation Method, Server and Client", filed on April 21, 2017, the entire contents of which are incorporated herein by reference.

Technical field

The present application relates to the field of Internet technologies, and in particular, to a video recommendation method, a server, and a client.

Background art

With the continuous development of Internet technology and the increasing speed of network transmission, people are more and more accustomed to watching videos online through video websites (and video applications corresponding to video websites). At present, due to the huge amount of video on the network, video websites usually provide users with video search and video recommendation functions, so that users can find videos of their own interest from numerous videos.

Currently, when making video recommendations, video websites usually label videos in advance with tags that indicate the content of each video. For example, these tags may include labels such as "stunning" and "speech" indicating the subject matter of the video, and may also include labels such as "Zhang San" and "Wang Er" indicating the actors in the video. After the user has watched some videos, the business server of the video website can collect statistics on the tags of those videos to derive the type of video the user is interested in. In this way, different videos can be recommended to different users based on the statistical results.

However, with the popularity of panoramic video, the above video recommendation method may have the following problems:

Compared with traditional flat video, a panoramic video displays far more information. For example, in a panoramic video of a football match, the video directly in front of the user may show the match itself, while the video to the left of the user's line of sight may show a cheerleading performance. When watching the panoramic video, the user may be interested only in the cheerleading performance and not in the football match. With the prior-art video recommendation method, however, the video website usually recommends a series of football match videos after the user has viewed the panoramic video, so the recommended videos may fail to meet the real needs of the user. It can be seen from the above that, when applied to panoramic video, the prior-art video recommendation method may fail to accurately recommend videos of interest to the user.

Summary of the invention

The purpose of the embodiments of the present application is to provide a video recommendation method, a server, and a client, so as to be able to provide a video to the user that meets its real needs, thereby improving the effectiveness of the video recommendation.

To achieve the above objective, an embodiment of the present application provides a video recommendation method, where the method includes: receiving a video loading request sent by a client, where the video loading request is directed to panoramic video data, and the panoramic video characterized by the panoramic video data includes at least two video regions displaying different content; feeding the panoramic video data back to the client, so that the client obtains the panoramic video by rendering the panoramic video data; receiving viewing area information sent by the client, where the viewing area information is used to represent the target video area currently viewed by the user; and feeding recommended video data back to the client based on the viewing area information, where the content displayed in the video characterized by the recommended video data is adapted to the content displayed in the target video area.

To achieve the above objective, an embodiment of the present application further provides a server, where the server includes a network communication port, a memory, and a processor. The network communication port is configured to receive a video loading request and viewing area information sent by a client, and to feed panoramic video data and recommended video data back to the client. The memory is configured to store the panoramic video data. The processor is configured to: receive, through the network communication port, the video loading request sent by the client, where the video loading request is directed to the panoramic video data in the memory, and the panoramic video characterized by the panoramic video data includes at least two video regions displaying different content; control the network communication port to feed the panoramic video data back to the client, so that the client obtains the panoramic video by rendering the panoramic video data; receive, through the network communication port, the viewing area information sent by the client, where the viewing area information is used to represent the target video area currently viewed by the user; and, based on the viewing area information, control the network communication port to feed the recommended video data back to the client, where the content displayed in the video characterized by the recommended video data is adapted to the content displayed in the target video area.

To achieve the above objective, an embodiment of the present application further provides a video recommendation method, where the method includes: sending a video loading request to a server, where the video loading request is directed to panoramic video data in the server; receiving the panoramic video data fed back by the server, and rendering the panoramic video data to obtain a panoramic video, where the panoramic video includes at least two video regions displaying different content; acquiring current viewing area information of the user, and sending the viewing area information to the server, where the viewing area information is used to represent the target video area currently viewed by the user; and receiving recommended video data fed back by the server, where the content displayed in the video characterized by the recommended video data is adapted to the content displayed in the target video area.

To achieve the above objective, an embodiment of the present application further provides a client, where the client includes a network communication port, a display, and a processor. The network communication port is configured to send a video loading request and viewing area information to a server, and to receive panoramic video data and recommended video data fed back by the server. The display is configured to display the panoramic video characterized by the panoramic video data and the video characterized by the recommended video data. The processor is configured to: control the network communication port to send the video loading request to the server, where the video loading request is directed to the panoramic video data in the server; receive, through the network communication port, the panoramic video data fed back by the server, and render the panoramic video data to obtain the panoramic video, where the panoramic video includes at least two video regions displaying different content; acquire current viewing area information of the user, and control the network communication port to send the viewing area information to the server, where the viewing area information is used to represent the target video area currently viewed by the user; and receive, through the network communication port, the recommended video data fed back by the server, where the content displayed in the video characterized by the recommended video data is adapted to the content displayed in the target video area.

As can be seen from the above, the present application can receive viewing area information sent by the client, and the viewing area information can be used to represent the target video area that the user is currently viewing in the panoramic video. Then, when recommending videos to the user, videos adapted to the content displayed in the target video area can be recommended. In this way, although a panoramic video contains more information, the technical solution of the present application can recommend videos that the user is really interested in according to the video area the user actually views, thereby improving the effectiveness of the video recommendation.

Description of the drawings

In order to illustrate the embodiments of the present application or the technical solutions in the prior art more clearly, the drawings used in the description of the embodiments or of the prior art are briefly described below. Obviously, the drawings in the following description show only some of the embodiments described in the present application, and those skilled in the art can obtain other drawings from them without any inventive labor.

FIG. 1 is a schematic diagram of interaction between a server and a client in an embodiment of the present application;

FIG. 2 is a schematic diagram of a video area in an embodiment of the present application;

FIG. 3 is a schematic diagram of association between a video tag and a video area in an embodiment of the present application;

FIG. 4 is a flowchart of a method for performing video recommendation based on a video tag in an embodiment of the present application;

FIG. 5 is a schematic structural diagram of a server according to an embodiment of the present application;

FIG. 6 is a schematic structural diagram of a client in an embodiment of the present application.

Detailed description

In order to enable those skilled in the art to better understand the technical solutions in the present application, the technical solutions in the embodiments of the present application are described clearly and completely below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments are only a part of the embodiments of the present application, and not all of them. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present application without inventive labor shall fall within the scope of protection of the present application.

The embodiment of the present application provides a video recommendation method, which can be applied to a system architecture of a server and a client. The server may be a device that stores panoramic video data. Specifically, the server may be a background service server of a website capable of providing a video service. The website may be, for example, iQiyi, Sohu video, Acfun, and the like. In this embodiment, the server may be an electronic device having data computation, storage, and network interaction functions; or it may be software running in such an electronic device to provide support for data processing, storage, and network interaction. The number of servers is not specifically limited in the present embodiment. The server may be a single server, several servers, or a server cluster formed by several servers.

In this embodiment, the client may be an electronic device for rendering panoramic video data. Specifically, the client may be, for example, a desktop computer, a tablet computer, a notebook computer, a smart phone, a digital assistant, a smart wearable device, a shopping guide terminal, a television set with network access function, or the like. Alternatively, the client may also be software capable of running in the above electronic device. Specifically, the client may be a browser in an electronic device, and an access portal provided by the video website platform may be loaded in the browser. The video website platform may be, for example, iQiyi, Sohu video, Acfun, etc., and the access portal may be the first page of the website platform. The client may also be an application running in the smart terminal provided by the video website platform.

An embodiment of the present application provides a video recommendation method, which may be applied to a server. Referring to FIG. 1, the method may include the following steps.

S1: Receive a video loading request sent by the client, where the video loading request is directed to the panoramic video data; wherein the panoramic video characterized by the panoramic video data includes at least two video regions displaying different content.

In this embodiment, a panoramic video database may be provided in the server. The panoramic video database may be a data set that stores panoramic video data. The panoramic video database may adopt any one of database formats such as MySQL, Oracle, DB2, and Sybase. The panoramic video database can be deployed on a storage medium in the server.

In this embodiment, various panoramic video data may be stored in the panoramic video database. Each piece of panoramic video data in the panoramic video database may have its own data identifier. The data identifier and the panoramic video data may be stored as a key-value pair, so that the corresponding panoramic video data can be obtained from the panoramic video database by providing the data identifier.
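The key-value lookup described above can be illustrated with a minimal sketch. It uses an in-memory dictionary as a stand-in for the panoramic video database; the identifiers `pv-0001` and `pv-0002`, the placeholder byte strings, and the function name `get_panoramic_video` are assumptions made for illustration and do not appear in the application.

```python
# Hypothetical in-memory stand-in for the panoramic video database.
# Keys are data identifiers; values are the stored panoramic video data.
panoramic_video_db = {
    "pv-0001": b"<panoramic video bytes: beach volleyball match>",
    "pv-0002": b"<panoramic video bytes: football match>",
}


def get_panoramic_video(data_id):
    """Return the panoramic video data stored under data_id, or None if absent."""
    return panoramic_video_db.get(data_id)


video = get_panoramic_video("pv-0001")
missing = get_panoramic_video("pv-9999")
```

A real deployment would more likely keep the identifier in a database table and the video data in file or object storage; the dictionary only shows the identifier-to-data relationship.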

In this embodiment, the panoramic video data can be drawn as a panoramic video according to a drawing standard. The drawing standard may be, for example, the OpenGL standard, the WebGL standard, or the like. The panoramic video may be a sphere video centered at a specified observation point. The specified observation point may be the center point of the panoramic video, and the picture observed from the specified observation point may be displayed on a display for viewing by a user. In one aspect, the user can change the line of sight of the specified observation point by interacting with the display or with an input device of the display. For example, when the user swipes to the right on the display, the line of sight of the specified observation point can be deflected to the left accordingly, so that the video content on the left side of the panoramic video is displayed on the display. In another aspect, the panoramic video data may be rendered by an electronic device with a display to obtain the panoramic video characterized by the panoramic video data. The electronic device may be provided with a gyroscope. When the electronic device is turned, the line of sight of the specified observation point may, under the action of the gyroscope, be deflected in step with the turning of the electronic device. For example, when the user views the panoramic video through head-mounted VR glasses and lowers his or her head, the line of sight of the specified observation point can be deflected downward under the action of the gyroscope in the VR glasses, so that the VR glasses display the video picture below the previously displayed picture.

In this embodiment, a link of the panoramic video may be displayed in the client. The link can be a text link, a picture link, or a video link. The user can send a video load request to the server by clicking on the link. In addition, the user may further input a Uniform Resource Locator (URL) pointing to the panoramic video data in the client, so as to send a video loading request to the server according to the URL.

In this embodiment, the video loading request may be a character string written according to a preset rule. The preset rule may be a network communication protocol followed by the client and the server. For example, the video loading request may be a string written in accordance with the HTTP protocol. The preset rule may define the components of the video loading request and the order in which they are arranged. For example, the video loading request may include a request identifier field, a source IP address field, and a destination IP address field. The request identifier field may be filled with the identifier of the panoramic video data. The source IP address field may be filled with the IP address of the client, and the destination IP address field may be filled with the IP address of the server. In this way, the video loading request can be sent from the client to the server.

In this embodiment, the statement that the video loading request is directed to the panoramic video data may mean that the video loading request carries the identifier of the panoramic video data. In this way, after receiving the video loading request, the server may extract the identifier of the panoramic video data from the video loading request and then read the panoramic video data having that identifier from the panoramic video database.
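As a rough sketch of how a client might write such a request and how a server might extract the identifier, the example below builds a bare HTTP/1.1 request string and parses it with Python's standard `urllib.parse` module. The path `/panorama`, the query parameter name `id`, and the host `video.example.com` are assumptions chosen for illustration; the application does not specify them. In HTTP the source and destination IP addresses are carried by the underlying network layer rather than by the request string, so they are not shown here.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def build_load_request(host, video_id):
    """Client side: write a minimal HTTP/1.1 request carrying the video identifier."""
    query = urlencode({"id": video_id})
    return f"GET /panorama?{query} HTTP/1.1\r\nHost: {host}\r\n\r\n"


def extract_video_id(raw_request):
    """Server side: pull the identifier out of the request line, or return None."""
    request_line = raw_request.split("\r\n", 1)[0]
    _method, target, _version = request_line.split(" ")
    params = parse_qs(urlsplit(target).query)
    values = params.get("id")
    return values[0] if values else None


request = build_load_request("video.example.com", "pv-0001")
video_id = extract_video_id(request)
```

The extracted identifier can then be used as the key for the database lookup described earlier.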

In this embodiment, the statement that the panoramic video data characterizes the panoramic video may mean that the panoramic video can be obtained by rendering the panoramic video data. Specifically, a panoramic video usually carries a large amount of information, and different video content can be displayed in different areas of the panoramic video. For example, in a panoramic video of a beach volleyball game, the match is played in front of the user, a cheerleading performance at the side of the court is on the user's left, and the sea and the beach are behind the user. A panoramic video may therefore include at least two video regions displaying different content. For example, the above panoramic video of a beach volleyball game may include at least three video areas, which respectively show the beach volleyball match, the cheerleading performance, and the sea and beach scenery.

In this embodiment, the video areas included in the panoramic video may be designated in advance by the uploader of the panoramic video. For example, when uploading the panoramic video of the beach volleyball game, the uploader may designate a total of three video regions in the panoramic video, and may also define the theme of the video content displayed in each video region. For example, the themes of the video content displayed in the three video regions of the panoramic video may be "volleyball game", "performance", and "seascape", respectively.

In this embodiment, the video areas included in the panoramic video may also be identified by image recognition after the panoramic video is uploaded. Specifically, an image material library is usually provided for image recognition. The image material library may store materials of various scenes that may appear in videos, together with the theme corresponding to each material. The materials of these scenes can be stored in the form of feature vectors. The closer the contents of two materials are, the smaller the distance between their feature vectors. Therefore, when image recognition is performed, the feature vectors of the respective regions in the panoramic video can be extracted in real time, and the distance between each extracted feature vector and the feature vectors in the image material library can be calculated. When the calculated distance is less than a specified threshold, the content displayed in the region is similar to a material in the image material library. The region can then be used as a video area, and the theme of the content displayed in the video area can be the theme of the similar material. For example, if image recognition of a panoramic video shows that the content displayed in a certain region is a song and dance performance, the region can be used as a video area, and the theme corresponding to the video area can be “song and dance performance”. Thus, image recognition can determine the number of video regions included in the panoramic video and the theme corresponding to each video region.
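The threshold test on feature-vector distance can be sketched as follows. The three-dimensional vectors, the Euclidean metric, and the threshold of 0.5 are illustrative assumptions: the application does not state how feature vectors are extracted, which distance is used, or what threshold applies.

```python
import math

# Hypothetical image material library: theme -> feature vector.
MATERIAL_LIBRARY = {
    "volleyball game": [0.9, 0.1, 0.0],
    "song and dance performance": [0.1, 0.9, 0.1],
    "seascape": [0.0, 0.2, 0.9],
}


def match_theme(region_vector, threshold=0.5):
    """Return the theme of the nearest material if it is closer than threshold."""
    best_theme, best_distance = None, math.inf
    for theme, material_vector in MATERIAL_LIBRARY.items():
        distance = math.dist(region_vector, material_vector)  # Euclidean distance
        if distance < best_distance:
            best_theme, best_distance = theme, distance
    # A region counts as a video area only when a material is close enough.
    return best_theme if best_distance < threshold else None


theme = match_theme([0.15, 0.85, 0.1])
no_theme = match_theme([0.5, 0.5, 0.5])
```

A region whose vector is not close to any material (the second call) yields no theme and would not be treated as a video area under this sketch.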

S2: Feed the panoramic video data back to the client, so that the client obtains the panoramic video by rendering the panoramic video data.

In this embodiment, after the server obtains the panoramic video data pointed to by the video loading request from the panoramic video database, the panoramic video data may be fed back to the client.

In this embodiment, after receiving the panoramic video data, the client may render the panoramic video data into a corresponding panoramic video based on a drawing standard. Specifically, the panoramic video data may be rendered in a spatial coordinate system to obtain a sphere video characterized by the panoramic video data. The center of the sphere video can have a specified observation point. By adjusting the viewing angle of the specified observation point, different portions of the sphere video can be displayed on the display of the client.

It should be noted that, although rendering the panoramic video data according to the drawing standard produces a sphere video in space, the user views the sphere video through the display. Because the radius of curvature of the sphere video is large, and the area actually shown on the display is small relative to the entire sphere video, the portion of the sphere video that the user sees on the display can be approximated as a flat video.

In this embodiment, in order to improve the loading speed of the video and reduce the load on the client, only a part of the panoramic video may be rendered, where the rendered part may be the video that can be seen in the user's current viewing area. In this way, the video data inside the user's current field of view is rendered normally, and the video data outside the user's current field of view is not rendered, which reduces the amount of data that the client needs to render and therefore the load on the client. In addition, because less data is rendered, the client can display the rendered video to the user sooner, thereby increasing the loading speed of the video.
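One possible way to decide what to render is sketched below, under assumptions that are not in the application: the sphere is split into tiles identified by the azimuth of their centers, the display covers a 90-degree horizontal field of view, and only tiles whose centers fall within half that field of view from the current line of sight are rendered.

```python
def angular_difference(a, b):
    """Smallest absolute difference between two azimuths, in degrees."""
    return abs((a - b + 180) % 360 - 180)


def tiles_to_render(tile_centers, view_azimuth, fov=90):
    """Keep only tiles whose center lies inside the current field of view."""
    return [
        center
        for center in tile_centers
        if angular_difference(center, view_azimuth) <= fov / 2
    ]


# Eight hypothetical tiles, one every 45 degrees around the sphere.
tile_centers = [0, 45, 90, 135, 180, 225, 270, 315]
visible = tiles_to_render(tile_centers, view_azimuth=0)
```

With the line of sight at 0 degrees, only three of the eight tiles are rendered; the rest are skipped until the user turns toward them.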

S3: Receive viewing area information sent by the client, where the viewing area information is used to represent a target video area currently viewed by the user.

In this embodiment, after the client renders the panoramic video, the user can view the panoramic video through the display of the client. Specifically, the user can change the line of sight of the specified observation point of the panoramic video by flipping the client or interacting with the display of the client, so that the video content displayed on the display can be changed.

In this embodiment, the video picture currently displayed on the display of the client may be the content that the user is interested in. The video area currently displayed on the display can therefore be used as the target video area currently viewed by the user. In practical applications, when only one video area is displayed on the display, that video area may be used as the target video area. When at least two video areas are displayed on the display, it is necessary to determine which video area the user is really interested in. Specifically, the video area occupying the largest area of the display may be determined and used as the target video area. For example, if part of the picture shown on the display is a volleyball match and the rest is a cheerleading performance, and the cheerleading performance occupies 90% of the display, it can be judged that the user is really interested in the video area showing the cheerleading performance. In addition, when the user is using VR glasses, the video area at which the user's line of sight is directed can be determined by tracking the user's pupils, and that video area can be used as the target video area.
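The largest-area rule can be written as a short sketch. The mapping from region identifier to the fraction of the display it occupies is assumed to be computed elsewhere by the client; the identifiers and the function name are illustrative.

```python
def pick_target_region(visible_areas):
    """Return the region occupying the largest share of the display.

    visible_areas maps a region identifier to the fraction of the display
    that the region currently covers. Returns None if nothing is visible.
    """
    if not visible_areas:
        return None
    return max(visible_areas, key=visible_areas.get)


# Volleyball match covers 10% of the display, cheerleading performance 90%.
target = pick_target_region({"volleyball": 0.10, "cheerleading": 0.90})
```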

In this embodiment, a video region in the panoramic video may also be characterized by angle information, so that each video region is associated with angle information. Specifically, since the panoramic video is a sphere video, the angle information associated with a video region may be represented by an azimuth and/or a pitch angle, or by longitude and/or latitude. In this embodiment, the angle information may include start angle information and end angle information, so that the angle interval of the video area is jointly defined by the start angle information and the end angle information. Referring to FIG. 2, assuming that the start angle information is 30 degrees east longitude and 20 degrees north latitude, and the end angle information is 60 degrees east longitude and 20 degrees south latitude, the area on the sphere video bounded by the two meridians and the two parallels may be the video area.
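A video region bounded by start and end angle information can be modeled as in the sketch below. It assumes a signed convention (east longitude and north latitude positive, west and south negative) and does not handle regions that cross the 180-degree meridian; the class and field names are illustrative and not taken from the application.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VideoRegion:
    """A region bounded by two meridians and two parallels (degrees)."""

    region_id: str
    start_lon: float
    start_lat: float
    end_lon: float
    end_lat: float

    def contains(self, lon, lat):
        """True if the point lies inside the bounded region."""
        lon_lo, lon_hi = sorted((self.start_lon, self.end_lon))
        lat_lo, lat_hi = sorted((self.start_lat, self.end_lat))
        return lon_lo <= lon <= lon_hi and lat_lo <= lat <= lat_hi


# Start: 30 degrees east, 20 degrees north. End: 60 degrees east, 20 degrees south.
region = VideoRegion("A001", start_lon=30, start_lat=20, end_lon=60, end_lat=-20)
inside = region.contains(45, 0)
outside = region.contains(10, 0)
```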

It should be noted that, in some application scenarios, the start angle information or the end angle information may be a preset default angle and may therefore be omitted. Thus, the angle information associated with a video region in the panoramic video may include only the start angle information or only the end angle information. For example, in the case of polar coordinates, the start angle information may default to (0, 0), so only the end angle information is needed to define the range of the video area, and the angle information associated with the video area may include only the end angle information.

In the present embodiment, the specified observation point of the panoramic video may generally have an initial observation line of sight, which may be specified in advance in the panoramic video data. In the present embodiment, the initial observation line of sight can be used as a reference line of sight. When watching the panoramic video, the user can change the line of sight of the specified observation point by turning the client or by interacting with the display of the client. The user's current line of sight can be characterized by the deviation between the changed line of sight and the initial observation line of sight. For example, the azimuth corresponding to the initial observation line of sight may be 0 degrees. When the user turns the line of sight of the specified observation point left or right, an angle exists between the current line of sight and the initial observation line of sight, and the size of this angle can be used to characterize the position of the user's current line of sight. For example, an angle of 30 degrees may indicate that the line of sight has been deflected 30 degrees to the right of the initial observation line of sight, and an angle of -30 degrees may indicate that it has been deflected 30 degrees to the left. It should be noted that the angle interval corresponding to each video area in the panoramic video and the angle representing the user's current line of sight can all use the same reference line of sight, which keeps the angles consistent. In this way, the target video area that the user is currently viewing can be determined from the angle interval in which the angle of the user's current line of sight falls. For example, if the angle corresponding to the current line of sight of the user is 45 degrees, and the angle interval corresponding to a certain video area is -50 degrees to 50 degrees, it can be determined that the user is currently viewing that video area.
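The interval test in the 45-degree example can be sketched as follows. The sketch assumes azimuth only, a signed deviation normalized to the range from -180 (inclusive) to 180 (exclusive) degrees, and half-open intervals so that adjacent regions do not overlap at their shared boundary; the second and third intervals are invented for illustration.

```python
def signed_deviation(current_azimuth, reference_azimuth=0.0):
    """Deviation from the reference line of sight, normalized to [-180, 180)."""
    return (current_azimuth - reference_azimuth + 180) % 360 - 180


def find_region_by_angle(angle, intervals):
    """Return the region whose [start, end) interval contains the angle."""
    for region_id, (start, end) in intervals.items():
        if start <= angle < end:
            return region_id
    return None


# Hypothetical angle intervals, all measured from the same reference line of sight.
intervals = {"A001": (-50, 50), "A002": (50, 180), "A003": (-180, -50)}

target = find_region_by_angle(signed_deviation(45), intervals)
left_turn = signed_deviation(330)  # 30 degrees to the left of the reference
```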

In this embodiment, after the client determines the target video area currently viewed by the user, the viewing area information may be sent to the server, where the viewing area information may be used to represent the target video area currently viewed by the user. Specifically, the viewing area information may take various forms. For example, the viewing area information may include the angle corresponding to the video picture currently viewed by the user. Then, after the server receives the viewing area information sent by the client, it can extract the angle and match it against the angle intervals of the video areas to determine the target video area currently viewed by the user.

In addition, the viewing area information may further include an identifier of the video area. The identification of the video area may be a character string capable of uniquely characterizing the video area. For example, if a total of three video regions are included in one panoramic video, the identifiers of the three video regions may be “A001”, “A002”, and “A003”, respectively. In this embodiment, when the server feeds back the panoramic video data to the client, the panoramic video data may include an identifier of each video area. Then, after the client determines the target video area currently viewed by the user, the identifier of the target video area may be added to the viewing area information. In this way, the server may extract an identifier of the video area carried in the viewing area information, and according to the identifier, the target video area currently viewed by the user may be determined.
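The two forms of viewing area information, an angle or a region identifier, could be handled on the server roughly as in the sketch below. The JSON encoding, the field names `region_id` and `angle`, and the angle intervals are assumptions for illustration; the application does not prescribe a wire format.

```python
import json

# Hypothetical angle intervals for the three regions "A001" to "A003".
REGION_INTERVALS = {"A001": (-50, 50), "A002": (50, 180), "A003": (-180, -50)}


def resolve_target_region(raw_info):
    """Determine the target video area from viewing area information."""
    info = json.loads(raw_info)
    # Form 1: the client already determined the region and sent its identifier.
    if "region_id" in info:
        region_id = info["region_id"]
        return region_id if region_id in REGION_INTERVALS else None
    # Form 2: the client sent an angle; match it against the region intervals.
    angle = info.get("angle")
    if angle is None:
        return None
    for region_id, (start, end) in REGION_INTERVALS.items():
        if start <= angle < end:
            return region_id
    return None


by_id = resolve_target_region('{"region_id": "A002"}')
by_angle = resolve_target_region('{"angle": 45}')
```

Sending the identifier moves the matching work to the client, while sending the angle keeps the region layout on the server; the application allows either.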

S4: Feeding recommended video data to the client based on the viewing area information; wherein content displayed in the video represented by the recommended video data is adapted to content displayed in the target video area.

In this embodiment, after determining the target video area currently viewed by the user, the server may feed recommended video data back to the client according to the content displayed in the target video area. Specifically, the video displayed in the target video area usually has several tags, and each tag can reflect one aspect of the video. For example, a video of a football match between China and South Korea may have tags such as "Football", "National Football", "World Cup", "Korea", and "Oriental Gymnasium". As another example, a movie video may have tags such as “Quentin Tarantino”, “Oscar Best Movie”, “Leonardo DiCaprio”, “Plot”, and “English Soundtrack”. In this embodiment, each piece of video data stored in the server may be associated with a series of tags. In this way, videos that share a tag can be considered associated videos. The more tags two videos share, the higher the similarity between them. Therefore, when recommended video data is fed back to the client, a video that shares at least one tag with the video displayed in the target video area may be fed back to the client. For example, after the user views a film directed by Quentin Tarantino in a video area of a panoramic video, the server can recommend other films directed by Quentin Tarantino to the client.
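A minimal sketch of tag-based recommendation follows. It ranks candidate videos by the number of tags they share with the video in the target video area and keeps only those with at least one shared tag. The catalog contents, the video identifiers, and the tie-breaking by identifier are illustrative assumptions.

```python
def recommend_by_tags(target_tags, catalog, limit=3):
    """Rank catalog videos by the number of tags shared with the target video."""
    target = set(target_tags)
    scored = []
    for video_id, tags in catalog.items():
        shared = len(target & set(tags))
        if shared >= 1:  # at least one identical tag is required
            scored.append((shared, video_id))
    # More shared tags first; ties broken by identifier for a stable order.
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [video_id for _, video_id in scored[:limit]]


# Hypothetical catalog: video identifier -> tags.
catalog = {
    "v1": {"Football", "World Cup"},
    "v2": {"Football"},
    "v3": {"Seascape"},
}
recommended = recommend_by_tags({"Football", "National Football", "World Cup"}, catalog)
```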

Further, in this embodiment, the recommended video data corresponding to each video region in the panoramic video may be pre-specified. When a panoramic video is uploaded, the number of video regions it contains and the content displayed in each video region can be determined. Specifically, the image recognition technique described in step S1 may be used to determine the number of video regions in the panoramic video and the theme of each video region. The server can then assign associated recommended video data to each video region, giving each video region recommended video data with the same theme as that region. For example, if the panoramic video includes three video areas whose themes are "volleyball game", "performance", and "seascape" respectively, each video area can be assigned recommended video data on its own theme. In this embodiment, the association between the identifier of each video area and its recommended video data may be stored in the server. In this way, after the server receives the viewing area information, it may extract the identifier of the target video area from the viewing area information, query the recommended video data associated with that identifier according to the locally stored association, and feed the recommended video data back to the client.
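As an illustration of the stored association, the following sketch keeps a table from video area identifier to pre-assigned recommended video data and looks it up from the viewing area information. The table contents, the `region_id` field name, and the function name are hypothetical; the application only states that such an association may be stored in the server.

```python
from typing import Dict, List

# Hypothetical association kept on the server: video area id -> recommended video ids.
REGION_RECOMMENDATIONS: Dict[str, List[str]] = {
    "A001": ["volleyball_highlights"],
    "A002": ["dance_performance"],
    "A003": ["seascape_documentary"],
}


def lookup_recommendations(viewing_area_info: Dict[str, str]) -> List[str]:
    """Extract the video area identifier and return its pre-assigned recommendations."""
    region_id = viewing_area_info.get("region_id", "")
    # Unknown or missing identifiers yield an empty list rather than an error.
    return list(REGION_RECOMMENDATIONS.get(region_id, []))
```

Returning a copy of the stored list keeps callers from accidentally modifying the server-side table.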

It can be seen that the content displayed in the video represented by the recommended video data being adapted to the content displayed in the target video area may mean that the two are identical or similar in at least one respect. For example, both videos have the same actors, the same scenery, the same director, the same theme, the same year, and so on.

It should be noted that the overall content displayed by the video represented by the recommended video data may be adapted to the content displayed in the target video area. For example, if the content displayed in the target video area is a movie directed by a director, then the overall content displayed by the video represented by the recommended video data may be another movie directed by the director.

In addition, the video represented by the recommended video data may also be a panoramic video, and the content displayed in one area of the panoramic video is adapted to the content displayed in the target video area. For example, if the content displayed in the target video area is a performance of a cheerleader, the server may feed back to the client a panoramic video about the basketball game, but the cheerleading performance may also be displayed in an area of the panoramic video. In this scenario, after the client receives the recommended video data, the video content that is adapted to the content displayed in the target video region may be directly displayed in the current line of sight of the user. For example, after receiving a panoramic video about a basketball game, the client can directly display the video area of the cheerleading performance to the user instead of presenting the video area of the basketball game to the user.

In a specific application scenario, the user views a panoramic video of real-time travel in Dubai on a video website through head-mounted VR glasses. The panoramic video may include three video regions, located directly in front of the user, on the user's left, and above the user. The identifiers of the three video areas may be "A001", "A002", and "A003", respectively. A music and dance performance in a hotel is displayed in front of the user, a fountain in the hotel is displayed on the user's left, and a famous painting by a well-known artist is displayed above the user. While watching the panoramic video, the user becomes very interested in the painting displayed above and looks up at it. At this time, according to the degree of tilt of the VR glasses, the gyroscope in the VR glasses can report that the angle between the user's current line of sight and the reference line of sight is a pitch angle of 80 degrees and an azimuth angle of 10 degrees. In the panoramic video, the angle range of the video area where the painting is located is an azimuth angle from -40 degrees to 40 degrees and a pitch angle from 50 degrees to 90 degrees. The VR glasses can therefore determine that the user is currently watching the video area where the painting is located, and can feed back the identifier "A003" of that video area to the server of the video website. The association between video area identifiers and video tags can be stored in the server of the video website, so that tags such as "mural painting", "Michelangelo", and "Renaissance" can be found according to "A003". The server may then query the locally stored video data for video data carrying any one of these three tags and feed the queried video data back to the VR glasses. In this way, after the user finishes viewing the panoramic video, the VR glasses can render the video data fed back by the server on the current page for the user to select and view.
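The region determination in this scenario can be sketched as a range check on azimuth and pitch. Only the angle range of "A003" is taken from the scenario; the ranges given for "A001" and "A002", the `Region` type, and the function name `find_region` are assumptions introduced for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Region:
    region_id: str
    azimuth: Tuple[float, float]  # (min, max) in degrees, relative to the reference line of sight
    pitch: Tuple[float, float]    # (min, max) in degrees


REGIONS: List[Region] = [
    Region("A001", (-40.0, 40.0), (-30.0, 30.0)),    # in front of the user (assumed range)
    Region("A002", (-130.0, -50.0), (-30.0, 30.0)),  # on the user's left (assumed range)
    Region("A003", (-40.0, 40.0), (50.0, 90.0)),     # above the user (range from the scenario)
]


def find_region(azimuth: float, pitch: float, regions: List[Region]) -> Optional[str]:
    """Return the id of the first region whose angle range contains the line of sight."""
    for region in regions:
        in_azimuth = region.azimuth[0] <= azimuth <= region.azimuth[1]
        in_pitch = region.pitch[0] <= pitch <= region.pitch[1]
        if in_azimuth and in_pitch:
            return region.region_id
    return None  # the line of sight falls outside every defined region
```

With the gyroscope reading from the scenario (azimuth 10 degrees, pitch 80 degrees), the lookup yields the identifier "A003".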

In an embodiment of the present application, each of the video regions in the panoramic video may be bound to at least one video tag. The video tag can be determined based on the video displayed in the video area. In particular, each video tag can reflect one aspect of the video displayed in the video area. Please refer to FIG. 3. If one video area displays a video of a football match between China and South Korea, that video area can have video tags such as "soccer", "national football", "World Cup", and "Bird's Nest". As another example, another video area may have video tags such as "beauty", "cheerleading squad", and "competition performance".

In this embodiment, the video tag bound to the video area may be pre-specified by the uploader of the panoramic video. For example, when uploading a panoramic video of a beach volleyball game, the uploader may specify in advance that the panoramic video contains three video regions and that the tags of the video content displayed in the three video regions are "volleyball match", "performance", and "sea view", respectively.

In this embodiment, the video tag bound to the video area may also be obtained by image recognition technology. Specifically, when image recognition is performed, an image material library can usually be provided. The image material library may store materials of various scenes that may appear in videos, together with the video tag corresponding to each material. The materials of these scenes can be stored in the form of feature vectors. The closer the contents of two materials are, the smaller the distance between their corresponding feature vectors. Therefore, when image recognition is performed, the feature vector of each video region in the panoramic video can be analyzed in real time, and the distance between the analyzed feature vector and each feature vector in the image material library can be calculated. When the calculated distance is less than a specified threshold, the content displayed in the video area is similar to a material in the image material library, and the tag corresponding to that similar material can be used as the video tag bound to the video area.
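The threshold comparison against the image material library can be sketched as follows. The use of Euclidean distance, the three-dimensional vectors, the library contents, and the function name `tag_for_region` are assumptions for illustration; the application does not specify the distance metric or how feature vectors are extracted.

```python
import math
from typing import List, Optional, Sequence, Tuple

# Hypothetical material library: (feature vector, video tag). Values are illustrative.
MATERIAL_LIBRARY: List[Tuple[Sequence[float], str]] = [
    ((1.0, 0.0, 0.0), "volleyball match"),
    ((0.0, 1.0, 0.0), "performance"),
    ((0.0, 0.0, 1.0), "sea view"),
]


def tag_for_region(feature: Sequence[float],
                   library: List[Tuple[Sequence[float], str]],
                   threshold: float) -> Optional[str]:
    """Return the tag of the closest material whose distance is below the threshold."""
    best_tag: Optional[str] = None
    best_distance = threshold
    for vector, tag in library:
        distance = math.dist(feature, vector)  # Euclidean distance (assumed metric)
        if distance < best_distance:
            best_distance = distance
            best_tag = tag
    return best_tag  # None when no material is similar enough
```

When several materials fall below the threshold, this sketch keeps only the closest one; binding the tags of every sufficiently similar material would be an equally plausible reading of the text.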

In this embodiment, after the server receives the viewing area information sent by the client, the server may determine at least one target video tag bound to the target video area represented by the viewing area information, and feed back the recommended video data to the client based on the at least one target video tag. Specifically, the video data stored in the server can each be associated with a series of tags. In this way, videos sharing a tag can be considered associated videos, and the more tags two videos share, the higher the similarity between them. Thus, when the recommended video data is to be fed back to the client, at least one target video tag bound to the target video area can first be determined. The method of determining the at least one target video tag may include randomly extracting a specified number of non-repeating video tags from the video tags bound to the target video region, and using the extracted video tags as the target video tags. Alternatively, the method may include counting the number of times each video tag has been used as a search term, and using a specified number of the video tags with the highest counts as the target video tags.
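The two ways of choosing target video tags can be sketched as two small helpers. The function names, the search-count table, and the tie-breaking behavior are assumptions for illustration only.

```python
import random
from typing import Dict, List, Optional, Sequence


def pick_random_tags(tags: Sequence[str], count: int,
                     rng: Optional[random.Random] = None) -> List[str]:
    """Randomly extract up to `count` non-repeating tags."""
    unique = list(dict.fromkeys(tags))  # drop duplicates while keeping order
    if rng is None:
        rng = random.Random()
    return rng.sample(unique, min(count, len(unique)))


def pick_popular_tags(tags: Sequence[str], search_counts: Dict[str, int],
                      count: int) -> List[str]:
    """Keep the `count` tags most often used as search terms."""
    unique = list(dict.fromkeys(tags))
    # Tags missing from the table count as never searched; ties keep input order.
    ranked = sorted(unique, key=lambda tag: search_counts.get(tag, 0), reverse=True)
    return ranked[:count]


# Hypothetical tags bound to a video area and hypothetical search statistics.
BOUND_TAGS = ["soccer", "World Cup", "Bird's Nest", "soccer"]
SEARCH_COUNTS = {"soccer": 120, "World Cup": 300, "Bird's Nest": 15}
POPULAR = pick_popular_tags(BOUND_TAGS, SEARCH_COUNTS, 2)
```

Random extraction spreads recommendations across all aspects of the video, while the search-count variant favors the aspects users look for most often.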

In this embodiment, after the at least one target video tag bound to the target video region is determined, the recommended video data may be fed back to the client based on the at least one target video tag. Specifically, the content displayed in the video represented by the recommended video data may be adapted to the content displayed in the target video region. Here, being adapted may mean that the video represented by the recommended video data carries at least one of the target video tags. For example, after the user views a film directed by Quentin Tarantino in a video area of the panoramic video, the server can determine that the target video tags bound to the target video area are "Quentin Tarantino", "drama", "black liberation", and so on. At this point, based on the determined video tags, the server can recommend to the client other videos directed by Quentin Tarantino, all of which carry the video tag "Quentin Tarantino".

In one embodiment of the present application, a video region in the panoramic video may be associated with angle information that may be used to characterize the location of the video region in the panoramic video. Specifically, since the panoramic video is a spherical video, the angle information associated with a video region in the panoramic video may be represented by an azimuth angle and/or a pitch angle, and may also be represented by longitude and/or latitude. In this embodiment, the angle information may include start angle information and end angle information, so that the angle interval of the video area is jointly defined by the start angle information and the end angle information. For example, assuming that the start angle information is 30 degrees east longitude and 20 degrees north latitude, and the end angle information is 60 degrees east longitude and 20 degrees south latitude, the area bounded on the spherical video by these two meridians and two parallels can be the video area.

In this embodiment, the designated observation point of the panoramic video may generally have an initial observation line of sight, which may be pre-specified in the panoramic video data. The initial observation line of sight can be used as a reference line of sight. When the user is watching the panoramic video, the line of sight of the designated observation point can be changed by flipping the client or interacting with the display of the client. In this embodiment, the deviation between the changed line of sight and the initial observation line of sight can then be used to characterize the user's current line of sight. For example, the azimuth angle corresponding to the initial observation line of sight may be 0 degrees; when the user turns the line of sight of the designated observation point left or right, an angle exists between the current line of sight and the initial observation line of sight, and the size of this angle can be used to characterize the position of the user's current line of sight. For example, an angle of 30 degrees may indicate that the designated observation point has been deflected 30 degrees to the right of the initial observation line of sight, and an angle of -30 degrees may indicate that it has been deflected 30 degrees to the left. It should be noted that the angle interval corresponding to each video area in the panoramic video and the angle used to represent the user's current line of sight should all use the same reference line of sight, so that the angles are consistent.
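A signed deviation of this kind can be computed from raw headings as in the sketch below. It assumes, purely for illustration, that the client reports azimuth headings in degrees on a 0 to 360 scale; the function name `signed_deviation` is hypothetical.

```python
def signed_deviation(current_azimuth: float, reference_azimuth: float = 0.0) -> float:
    """Return the deviation from the reference line of sight in [-180, 180).

    Positive values mean the line of sight is deflected to the right of the
    reference line of sight, negative values mean it is deflected to the left.
    """
    return (current_azimuth - reference_azimuth + 180.0) % 360.0 - 180.0
```

For example, a heading of 330 degrees against a reference of 0 degrees gives -30 degrees, that is, a deflection of 30 degrees to the left. Wrapping the result into a single interval keeps it directly comparable with the angle intervals of the video areas, provided those intervals use the same reference line of sight.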

In this embodiment, the viewing area information received by the server may include an angle between the current line of sight of the user and the reference line of sight. In this way, after the server receives the viewing area information, the angle interval in which the angle in the viewing area information is located can be determined, so that the target video area that the user is currently viewing can be determined. Specifically, the determined video region corresponding to the angle interval may be determined as the target video region currently viewed by the user. For example, if the current line of sight of the user corresponds to an angle of 45 degrees, and a certain video area corresponds to an angle range of -50 degrees to 50 degrees, then it can be determined that the user is currently viewing the video area.

In an embodiment of the present application, when the server feeds back the panoramic video data to the client, the panoramic video data may include video tag data, where the video tag data is used to represent the at least one video tag bound to each video area. The video tag data may include an association between the identifier of each video area and its video tags. In this way, after receiving the panoramic video data, the client can store the video tag data contained in the panoramic video data in a local cache, and can obtain the corresponding video tags according to the identifier of a video area.

Referring to FIG. 4, in the embodiment, the method may further include the following steps.

S31: Receive at least one target video tag sent by the client, where the target video tag is determined according to a target video area viewed by the user.

In this embodiment, after receiving the panoramic video data, the client may render the panoramic video data as a panoramic video, and may store the video tag data in the panoramic video data in a local cache. The client may determine the target video area currently viewed by the user by using the scheme described in step S3. After determining the target video area, the client may query the video tag corresponding to the target video area according to the video tag data in the local cache. After querying the video tag corresponding to the target video area, the client may send the whole or a part of the queried video tag as the target video tag to the server. Specifically, when extracting a part of the tags from the queried video tags, a specified number of non-repeating video tags may be randomly extracted, and the extracted video tags are used as the target video tags. In addition, it is also possible to count the number of times each video tag is used as a search term, and use a specified number of video tags with a higher number of statistics as the target video tag. It can be seen that the target video tag sent by the client to the server can be determined according to the target video area viewed by the user.

S41: Feeding recommended video data to the client based on the at least one target video tag; wherein the video rendered according to the recommended video data is provided with at least one target video tag.

In this embodiment, the video data stored in the server may each be provided with a series of video tags. In this way, videos sharing a tag can be considered associated videos, and the more tags two videos share, the higher the similarity between them. The server can therefore feed back the recommended video data to the client based on the at least one target video tag. Specifically, the recommended video data fed back by the server to the client may be such that the video rendered from the recommended video data carries at least one of the target video tags. For example, after the user views a film directed by Quentin Tarantino in a video area of the panoramic video, the target video tags determined for the target video area may be "Quentin Tarantino", "drama", "black liberation", and so on. At this point, based on these video tags, the server can recommend to the client other videos directed by Quentin Tarantino, all of which carry the video tag "Quentin Tarantino".

In one embodiment of the present application, it is considered that when viewing a panoramic video, the user usually browses the content in each video area first and then selects the content of interest. Therefore, in order to ensure the accuracy of the video recommendation, the viewing area information may further include the duration for which the user has viewed the target video area. Specifically, the duration may be obtained by counting the total time that the user's line of sight stays in the target video area. In this way, before the recommended video data is fed back to the client, it can be determined whether the user has viewed the target video area for long enough. Specifically, the recommended video data is fed back to the client when the duration for which the user has viewed the target video area reaches a specified duration. For example, the specified duration may be 1 minute: once the user has viewed the target video area for 1 minute, recommended video data adapted to the content displayed in the target video area can be fed back to the user's client. If the duration for which the user has viewed the target video area does not reach 1 minute, no recommended video data is fed back to the client based on that target video area.
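The duration check can be sketched as an accumulation over time-stamped line-of-sight samples. The sample format, the function names, and the rule that each sample's region is credited with the time until the next sample are assumptions for illustration; only the 1 minute threshold comes from the example above.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Optional, Tuple

SPECIFIED_DURATION_S = 60.0  # the 1 minute used in the example

# A sample is (timestamp in seconds, id of the region being viewed, or None).
Sample = Tuple[float, Optional[str]]


def total_dwell(samples: Iterable[Sample]) -> Dict[str, float]:
    """Sum, per region, the time the line of sight stayed in that region.

    Samples must be in time order; each sample's region is credited with the
    time elapsed until the next sample.
    """
    totals: Dict[str, float] = defaultdict(float)
    previous: Optional[Sample] = None
    for timestamp, region_id in samples:
        if previous is not None and previous[1] is not None:
            totals[previous[1]] += timestamp - previous[0]
        previous = (timestamp, region_id)
    return dict(totals)


def regions_to_recommend(samples: Iterable[Sample],
                         specified: float = SPECIFIED_DURATION_S) -> List[str]:
    """Return the regions whose total viewing duration reaches the threshold."""
    return sorted(r for r, t in total_dwell(samples).items() if t >= specified)


# Hypothetical viewing session: the user alternates between two regions.
SESSION = [(0.0, "A001"), (20.0, "A003"), (50.0, "A001"), (60.0, "A003"), (95.0, None)]
```

In this session the user spends 30 seconds in total on "A001" and 65 seconds on "A003", so only "A003" would trigger a recommendation.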

As can be seen from the above, in the present application a video area can be associated both with angle information and with video tags, so the angle information can also be associated with video tags. In this way, the angle interval represented by the angle information can be associated with at least one video tag, and the association can be stored in the server. Then, when the client obtains the angle corresponding to the user's current line of sight, it can send the angle to the server, and the server can determine the video tag corresponding to the user's current line of sight according to the angle interval in which the angle falls. Alternatively, the server may send the association to the client, so that after determining the angle, the client can locally look up the angle interval in which the angle falls, obtain the video tag corresponding to the user's current line of sight, and feed the obtained video tag directly back to the server.

Referring to FIG. 5, the present application further provides a server including a network communication port 100, a memory 200, and a processor 300.

The network communication port 100 is configured to receive the video loading request and the viewing area information sent by the client, and to feed back the panoramic video data and the recommended video data to the client.

The memory 200 is configured to store panoramic video data.

The processor 300 is configured to: receive, through the network communication port, a video loading request sent by the client, where the video loading request points to the panoramic video data in the memory, and the panoramic video represented by the panoramic video data includes at least two video regions displaying different content; control the network communication port to feed back the panoramic video data to the client, so that the client renders the panoramic video data to obtain the panoramic video; receive, through the network communication port, viewing area information sent by the client, where the viewing area information is used to represent a target video area currently viewed by the user; and control the network communication port to feed back recommended video data to the client based on the viewing area information, where the content displayed in the video represented by the recommended video data is adapted to the content displayed in the target video region.

In this embodiment, the network communication port 100 may be a virtual port that is bound to a particular communication protocol so that different kinds of data can be transmitted or received. For example, the network communication port may be port 80 responsible for web data communication, port 21 responsible for FTP data communication, or port 25 responsible for mail data communication. Furthermore, the network communication port can also be a physical communication interface or a communication chip. For example, it may be a wireless mobile network communication chip, such as a GSM or CDMA chip; it may also be a Wi-Fi chip; it may also be a Bluetooth chip.

In this embodiment, the memory 200 may be a memory device for storing information. In a digital system, a device capable of storing binary data may be a memory; in an integrated circuit, a circuit that has a storage function but no physical form, such as a RAM or a FIFO, may also be a memory; and in a system, a storage device having a physical form, such as a memory stick or a TF card, may also be called a memory.

The processor 300 can be implemented in any suitable manner. For example, the processor may take the form of a microprocessor or processor together with a computer-readable medium storing computer-readable program code (e.g., software or firmware) executable by the (micro)processor, logic gates, switches, an application-specific integrated circuit (ASIC), a programmable logic controller, an embedded microcontroller, and the like. The present application does not limit this.

The specific functions implemented by the network communication port 100, the memory 200, and the processor 300 of the server disclosed in the above embodiment may be understood with reference to the server-side embodiments of the video recommendation method in the present application; the server can implement those method embodiments and achieve their technical effects.

The application also provides a video recommendation method, which can be applied to the client. Referring to FIG. 1, the method may include the following steps.

S10: Send a video loading request to the server, the video loading request pointing to the panoramic video data in the server.

In this embodiment, the video loading request pointing to the panoramic video data in the server may mean that the video loading request carries an identifier of the panoramic video data. In this way, after receiving the video loading request, the server may extract the identifier of the panoramic video data from the video loading request, and may then read the panoramic video data having that identifier from the panoramic video database.

S20: Receive the panoramic video data fed back by the server, and render the panoramic video data to obtain a panoramic video; the panoramic video includes at least two video regions that display different content.

In this embodiment, the panoramic video usually carries a large amount of information, and different video content can be displayed in different areas of the panoramic video. For example, in a panoramic video showing a beach volleyball game, a beach volleyball match is being played in front of the user, a cheerleading performance at the side of the court is on the user's left, and the sea and the beach are behind the user. A panoramic video may therefore include at least two video regions displaying different content. For example, the panoramic video of the beach volleyball game described above may include at least three video areas, which show the beach volleyball match, the cheerleading performance, and the sea and beach scenery, respectively.

S30: Acquire current viewing area information of the user, and send the viewing area information to the server; the viewing area information is used to represent a target video area currently viewed by the user.

In this embodiment, the video picture currently displayed on the display of the client may be content that is of interest to the user. In this way, the video area currently displayed on the display can be used as the target video area currently viewed by the user. In practical applications, when only one video area is displayed on the display, that video area may be used as the target video area. When at least two video regions are displayed on the display, it is necessary to determine which video region the user is really interested in. Specifically, the video area occupying the largest display area on the display may be determined and used as the target video area. For example, if part of the video picture displayed on the display is a volleyball match and the other part is a cheerleading performance, and the cheerleading performance occupies 90% of the entire display, it can be judged that the user is really interested in the video area showing the cheerleading performance. In addition, when the user is using VR glasses, the user's pupils can also be tracked to determine the video area at which the user's current line of sight is directed, so that this video area can be used as the target video area.
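The largest-display-area rule can be sketched by intersecting each video area with the viewport. The sketch assumes, for illustration only, that the video areas have already been projected into axis-aligned rectangles in display coordinates; the `Rect` layout and the function names are hypothetical.

```python
from typing import Dict, Optional, Tuple

# Rectangle in display coordinates: (left, top, right, bottom).
Rect = Tuple[float, float, float, float]


def overlap_area(a: Rect, b: Rect) -> float:
    """Area of the intersection of two axis-aligned rectangles (0 if disjoint)."""
    width = min(a[2], b[2]) - max(a[0], b[0])
    height = min(a[3], b[3]) - max(a[1], b[1])
    return max(width, 0.0) * max(height, 0.0)


def dominant_region(viewport: Rect, regions: Dict[str, Rect]) -> Optional[str]:
    """Return the id of the region covering the largest part of the viewport."""
    best_id: Optional[str] = None
    best_area = 0.0
    for region_id, rect in regions.items():
        area = overlap_area(viewport, rect)
        if area > best_area:
            best_id, best_area = region_id, area
    return best_id  # None when no region is visible


# Hypothetical layout: cheerleading fills 90% of a 100 x 100 viewport.
VIEWPORT: Rect = (0.0, 0.0, 100.0, 100.0)
VISIBLE = {
    "volleyball": (-100.0, 0.0, 10.0, 100.0),
    "cheerleading": (10.0, 0.0, 200.0, 100.0),
}
```

With this layout the cheerleading area is selected as the target video area, matching the 90% example above.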

In this embodiment, a video region in the panoramic video may also be characterized by angle information, such that each of the video regions may be associated with angle information. In particular, since the panoramic video is a spherical video, the angle information associated with a video region in the panoramic video may be represented by an azimuth angle and/or a pitch angle, and may also be represented by longitude and/or latitude. In this embodiment, the angle information may include start angle information and end angle information, so that the angle interval of the video area is jointly defined by the start angle information and the end angle information. Assuming that the start angle information is 30 degrees east longitude and 20 degrees north latitude, and the end angle information is 60 degrees east longitude and 20 degrees south latitude, the area bounded on the spherical video by these two meridians and two parallels can be the video area.

In this embodiment, after the client determines the target video area currently viewed by the user, the viewing area information may be sent to the server, where the viewing area information may be used to represent the target video area currently viewed by the user. Specifically, the viewing area information may have various forms. For example, the viewing area information may include an angle corresponding to a video currently viewed by the user. Then, after the server receives the viewing area information sent by the client, the angle can be extracted, and then the angle can be matched with the angle interval of each video area, so that the target video area currently viewed by the user can be determined.

In addition, the viewing area information may further include an identifier of the video area. The identification of the video area may be a character string capable of uniquely characterizing the video area. For example, if a total of three video regions are included in one panoramic video, the identifiers of the three video regions may be “A001”, “A002”, and “A003”, respectively. In this embodiment, when the server feeds back the panoramic video data to the client, the panoramic video data may include an identifier of each video region. Then, after the client determines the target video area currently viewed by the user, the identifier of the target video area may be added to the viewing area information. In this way, the server may extract an identifier of the video area carried in the viewing area information, and according to the identifier, the target video area currently viewed by the user may be determined.

S40: Receive recommended video data fed back by the server; content displayed in the video represented by the recommended video data is adapted to content displayed in the target video area.

In this embodiment, after determining the target video area currently viewed by the user, the server may feed back recommended video data to the client according to the content displayed in the target video area. Specifically, the video displayed in the target video area usually has multiple tags, and each tag reflects one aspect of the video. For example, a video of a football match between China and South Korea may have tags such as "Football", "National Football", "World Cup Qualifier", "Korea", and "Oriental Gymnasium". As another example, a movie video may have tags such as "Quentin Tarantino", "Oscar Best Movie", "Leonardo DiCaprio", "Plot", and "English Soundtrack". In this embodiment, the video data stored in the server may each be associated with a series of tags. In this way, videos sharing a tag can be considered associated videos, and the more tags two videos share, the higher the similarity between them. Thus, when feeding back recommended video data to the client, the server may feed back videos that share at least one tag with the video displayed in the target video area. For example, after the user views a film directed by Quentin Tarantino in a video area of a panoramic video, the server can recommend other films directed by Quentin Tarantino to the client.

In an embodiment of the present application, when the current viewing area information of the user is acquired, the angle between the user's current line of sight and the reference line of sight may be determined, and this angle may be used as the current viewing area information of the user. Specifically, the designated observation point of the panoramic video may generally have an initial observation line of sight, which may be pre-specified in the panoramic video data. In this embodiment, the initial observation line of sight can be used as the reference line of sight. When the user is watching the panoramic video, the line of sight of the designated observation point can be changed by flipping the client or interacting with the display of the client. The angle between the changed line of sight and the initial observation line of sight can be used to characterize the user's current line of sight. For example, the azimuth angle corresponding to the initial observation line of sight may be 0 degrees; when the user turns the line of sight of the designated observation point left or right, an angle exists between the current line of sight and the initial observation line of sight, and the size of this angle can be used to characterize the position of the user's current line of sight. For example, an angle of 30 degrees may indicate that the designated observation point has been deflected 30 degrees to the right of the initial observation line of sight, and an angle of -30 degrees may indicate that it has been deflected 30 degrees to the left. In this way, after the angle between the user's current line of sight and the reference line of sight is determined, the angle may be used as the current viewing area information of the user.

In addition, in an embodiment of the present application, the panoramic video data may include an identifier of each video area. In that case, when the current viewing area information of the user is acquired, the video area occupying the largest display area in the user's current field of view may be determined, and the identifier of that video area is used as the current viewing area information of the user. The current field of view of the user refers to the area that the display of the client presents to the user. Specifically, when only one video area is shown on the display, that video area can be used as the target video area. When at least two video areas are shown on the display, it is necessary to determine which video area the user is actually interested in. To make this determination, the video area with the largest display area on the display may be identified and used as the target video area. For example, suppose part of the picture shown on the display is a volleyball match and the other part is a cheerleading performance. If the cheerleading performance occupies 90% of the entire display, it can be judged that the user is actually interested in the video area showing the cheerleading performance. In this embodiment, when the server feeds the panoramic video data back to the client, the panoramic video data may include an identifier of each video area. The identifier of a video area may be a character string that uniquely characterizes the video area. For example, if one panoramic video includes three video areas in total, their identifiers may be "A001", "A002", and "A003", respectively. After the client determines the target video area currently viewed by the user, the identifier of the target video area may be used as the current viewing area information of the user.

In this embodiment, when at least two video areas are tied for the largest display area, the line of sight of the user may be determined, and the identifier of the video area on which the user's line of sight falls is used as the current viewing area information of the user. In particular, when the user is wearing VR glasses, the video area on which the user's current line of sight falls can be determined by tracking the user's pupils, and that video area can then be used as the target video area.
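The selection of the target video area by display area, with the line-of-sight tie-break, might be sketched as follows. This is an illustrative Python sketch only; the function name `pick_target_region`, the `visible_area` mapping (region identifier to the fraction of the display it occupies), and the fallback used when no gaze information is available are all assumptions that do not come from the present application.

```python
from typing import Dict, Optional


def pick_target_region(visible_area: Dict[str, float],
                       gaze_region: Optional[str] = None) -> Optional[str]:
    """Return the identifier of the region the user is taken to be viewing.

    `visible_area` maps a region identifier (e.g. "A001") to the share of the
    display it currently occupies. `gaze_region` is the region hit by the
    user's line of sight, if it is known (e.g. from pupil tracking).
    """
    if not visible_area:
        return None
    largest = max(visible_area.values())
    candidates = sorted(r for r, area in visible_area.items() if area == largest)
    if len(candidates) == 1:
        return candidates[0]
    # Several regions tie for the largest display area: prefer the gazed one.
    if gaze_region in candidates:
        return gaze_region
    # Assumed fallback when gaze data is unavailable: lowest identifier.
    return candidates[0]
```

In the volleyball and cheerleading example, a mapping such as `{"A001": 0.1, "A002": 0.9}` selects "A002" without consulting the line of sight.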

In an embodiment of the present application, the panoramic video data may further include an angular interval for each video area. In particular, the video areas in the panoramic video may be characterized by angle information, so that each video area is associated with angle information. Specifically, since the panoramic video is a spherical video, the angle information associated with a video area may be expressed as an azimuth angle and/or a pitch angle, or as a longitude and/or a latitude. In this embodiment, the angle information may include start angle information and end angle information, which together define the angular interval of the video area. For example, if the start angle information is 30 degrees east longitude and 20 degrees north latitude, and the end angle information is 60 degrees east longitude and 20 degrees south latitude, then the area bounded on the sphere by these two meridians and two parallels can be the video area. In this way, when the current viewing area information of the user is acquired, the angle between the current line of sight of the user and the reference line of sight can be determined, the angular interval in which that angle lies can be identified, and the identified angular interval can be used as the current viewing area information of the user. For example, if the current line of sight of the user corresponds to an angle of 45 degrees, and a certain video area corresponds to an angular interval of -50 degrees to 50 degrees, it can be determined that the user is currently viewing that video area. The client can then send the angular interval of -50 degrees to 50 degrees to the server as the viewing area information.
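The interval lookup described above can be sketched in Python as follows. The sketch considers a single angular dimension (for example, azimuth) and treats each interval as half-open so that adjacent video areas do not overlap at their shared boundary; both simplifications, as well as the function name and the `intervals` mapping, are assumptions for illustration.

```python
from typing import Dict, Optional, Tuple

Interval = Tuple[float, float]  # (start angle, end angle) in degrees


def find_region_interval(angle_deg: float,
                         intervals: Dict[str, Interval]
                         ) -> Optional[Tuple[str, Interval]]:
    """Return (region identifier, interval) for the interval containing the angle.

    Intervals are treated as half-open [start, end), an assumed convention.
    Returns None when the angle falls in no known interval.
    """
    for region_id, (start, end) in intervals.items():
        if start <= angle_deg < end:
            return region_id, (start, end)
    return None
```

For the example above, an angle of 45 degrees and an interval of -50 to 50 degrees for one video area yields that video area and its interval, which the client could then send to the server as the viewing area information.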

In an embodiment of the present application, the panoramic video data may further include video tag data, where the video tag data may be used to represent at least one video tag to which each of the video regions is bound. Specifically, in the video tag data, an association relationship between the identifier of the video area and the video tag may be included. In this way, after receiving the panoramic video data, the client can store the video tag data in the panoramic video data in the local cache, and can obtain the corresponding video tag according to the identifier of the video area.

In this embodiment, after receiving the panoramic video data, the client may render the panoramic video data into a panoramic video and may store the video tag data contained in the panoramic video data in a local cache. The client may determine the target video area currently viewed by the user by using the scheme described in step S30. After determining the target video area, the client may look up the video tags corresponding to the target video area in the video tag data held in the local cache. The client may then send all or some of the retrieved video tags to the server as target video tags. Specifically, when only some of the retrieved video tags are to be sent, a specified number of distinct video tags may be selected at random and used as the target video tags. Alternatively, the number of times each video tag has been used as a search term may be counted, and a specified number of video tags with the highest counts may be used as the target video tags. In this way, the client can determine at least one target video tag bound to the target video area currently viewed by the user and send the at least one target video tag to the server. In this embodiment, the statement that the content displayed in the video represented by the recommended video data is adapted to the content displayed in the target video area may mean that the video represented by the recommended video data carries at least one of the target video tags.
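The two ways of choosing a subset of tags described above (random selection of distinct tags, or selection by search-term frequency) can be sketched in Python as follows. The function names, the `search_counts` mapping, and the handling of tags with no recorded count are hypothetical and serve only as an illustration.

```python
import random
from typing import Dict, List, Optional, Sequence


def pick_random_tags(tags: Sequence[str], count: int,
                     rng: Optional[random.Random] = None) -> List[str]:
    """Randomly pick up to `count` distinct tags."""
    unique = list(dict.fromkeys(tags))  # drop duplicates, keep original order
    rng = rng or random.Random()
    return rng.sample(unique, min(count, len(unique)))


def pick_popular_tags(tags: Sequence[str],
                      search_counts: Dict[str, int],
                      count: int) -> List[str]:
    """Pick up to `count` distinct tags most often used as search terms.

    Tags missing from `search_counts` are treated as never searched
    (an assumption of this sketch).
    """
    unique = list(dict.fromkeys(tags))
    # The sort is stable, so tags with equal counts keep their original order.
    unique.sort(key=lambda tag: -search_counts.get(tag, 0))
    return unique[:count]
```

Either function returns the list that the client would send to the server as the target video tags; sending all retrieved tags corresponds to setting `count` to the number of distinct tags.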

Referring to FIG. 6, the present application further provides a client, which includes a network communication port 110, a display 210, and a processor 310.

The network communication port 110 is configured to send a video loading request and viewing area information to the server, and receive the panoramic video data and the recommended video data fed back by the server;

The display 210 is configured to display a panoramic video represented by the panoramic video data and display a video represented by the recommended video data;

The processor 310 is configured to: control the network communication port to send a video loading request to a server, where the video loading request is directed to the panoramic video data in the server; receive, through the network communication port, the panoramic video data fed back by the server, and render the panoramic video data to obtain a panoramic video, where the panoramic video includes at least two video regions displaying different contents; acquire current viewing area information of the user, and control the network communication port to send the viewing area information to the server, where the viewing area information is used to represent a target video area currently viewed by the user; and receive, through the network communication port, the recommended video data fed back by the server, where the content displayed in the video represented by the recommended video data is adapted to the content displayed in the target video area.

In this embodiment, the network communication port 110 may be a virtual port that is bound to a particular communication protocol, so that different ports can transmit or receive different kinds of data. For example, the network communication port may be port 80 responsible for web data communication, port 21 responsible for FTP data communication, or port 25 responsible for mail data communication. Furthermore, the network communication port can also be a physical communication interface or a communication chip. For example, it may be a wireless mobile network communication chip, such as a GSM or CDMA chip; it may also be a Wi-Fi chip; it may also be a Bluetooth chip.

In this embodiment, the display 210 may be a display tool that displays a certain electronic file to a screen through a specific transmission device and then reflects it to the human eye. The display may include a liquid crystal display (LCD), a cathode ray tube (CRT) display, a light emitting diode (LED) display, and the like.

The processor 310 can be implemented in any suitable manner. For example, the processor may take the form of a microprocessor or processor together with a computer-readable medium storing computer-readable program code (e.g., software or firmware) executable by the (micro)processor, or the form of logic gates, switches, an Application Specific Integrated Circuit (ASIC), a programmable logic controller, or an embedded microcontroller. The present application is not limited in this respect.

For the specific functions implemented by the network communication port 110, the display 210, and the processor 310 of the client disclosed in the foregoing embodiments, reference may be made to the client-side embodiments of the video recommendation method of the present application. The client can implement those method embodiments and achieve their technical effects.

In the 1990s, an improvement to a technology could be clearly classified as either a hardware improvement (e.g., an improvement to a circuit structure such as a diode, transistor, or switch) or a software improvement (an improvement to a method flow). However, as technology has advanced, many of today's method flow improvements can be regarded as direct improvements to a hardware circuit structure. Designers almost always obtain the corresponding hardware circuit structure by programming the improved method flow into a hardware circuit. Therefore, it cannot be said that an improvement to a method flow cannot be implemented by a hardware entity module. For example, a Programmable Logic Device (PLD), such as a Field Programmable Gate Array (FPGA), is an integrated circuit whose logic function is determined by the user programming the device. Designers program a digital system onto a single PLD themselves, without having to ask a chip manufacturer to design and fabricate a dedicated integrated circuit chip. Moreover, instead of manually making integrated circuit chips, this programming is nowadays mostly carried out with "logic compiler" software, which is similar to the software compilers used in program development; the source code to be compiled must likewise be written in a specific programming language, called a Hardware Description Language (HDL). There is not just one HDL but many, such as ABEL (Advanced Boolean Expression Language), AHDL (Altera Hardware Description Language), Confluence, CUPL (Cornell University Programming Language), HDCal, JHDL (Java Hardware Description Language), Lava, Lola, MyHDL, PALASM, and RHDL (Ruby Hardware Description Language), of which VHDL (Very-High-Speed Integrated Circuit Hardware Description Language) and Verilog are currently the most commonly used. It should also be apparent to those skilled in the art that a hardware circuit implementing a logical method flow can easily be obtained simply by logically programming the method flow in one of the hardware description languages above and programming it into an integrated circuit.

Those skilled in the art also know that, in addition to implementing the client and the server purely as computer-readable program code, the method steps can be logically programmed so that the client and the server implement the same functions in the form of logic gates, switches, ASICs, programmable logic controllers, embedded microcontrollers, and the like. Such a client or server can therefore be regarded as a hardware component, and the means included in it for implementing various functions can be regarded as structures within the hardware component. Alternatively, the means for implementing various functions can be regarded both as software modules that implement a method and as structures within a hardware component.

It will be apparent to those skilled in the art from the above description of the embodiments that the present application can be implemented by means of software plus a necessary general-purpose hardware platform. Based on this understanding, the technical solution of the present application may in essence be embodied in the form of a software product, which may be stored in a storage medium such as a ROM/RAM, a magnetic disk, or an optical disk, and which includes instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) to perform the methods described in the embodiments of the present application or in portions of the embodiments.

The embodiments in this specification are described in a progressive manner. For parts that are the same or similar among the embodiments, the embodiments may be referred to one another, and each embodiment focuses on its differences from the others. In particular, for the embodiments of the server and the client, reference can be made to the description of the foregoing method embodiments.

The application can be described in the general context of computer-executable instructions executed by a computer, such as a program module. Generally, program modules include routines, programs, objects, components, data structures, and the like that perform particular tasks or implement particular abstract data types. The present application can also be practiced in distributed computing environments where tasks are performed by remote processing devices that are connected through a communication network. In a distributed computing environment, program modules can be located in both local and remote computer storage media including storage devices.

While the present invention has been described by the embodiments of the present invention, it will be understood by those skilled in the art

Claims

A video recommendation method, the method comprising:

Receiving a video loading request sent by a client, where the video loading request is directed to panoramic video data; wherein the panoramic video represented by the panoramic video data includes at least two video regions displaying different content;

Feeding back the panoramic video data to the client, so that the client renders the panoramic video data to obtain the panoramic video;

Receiving viewing area information sent by the client, where the viewing area information is used to represent a target video area currently viewed by the user;

Feeding back recommended video data to the client based on the viewing area information; wherein the content displayed in the video represented by the recommended video data is adapted to the content displayed in the target video area.
The method according to claim 1, wherein each of the video regions is bound to at least one video tag; and correspondingly, the feeding back recommended video data to the client based on the viewing area information comprises:

Determining at least one target video tag bound to the target video area represented by the viewing area information, and feeding back the recommended video data to the client based on the at least one target video tag.
The method according to claim 2, wherein the content displayed in the video represented by the recommended video data is adapted to the content displayed in the target video area, including:

The video characterized by the recommended video data is provided with at least one of the target video tags.
The method of claim 1, wherein each of said video regions is associated with angle information; said angle information being used to characterize a location of said video region in said panoramic video; said angle information includes start angle information and end angle information, wherein the start angle information and the end angle information together define an angular interval of the video region.
The method according to claim 4, wherein the viewing area information includes an angle between a current line of sight of the user and a reference line of sight; and correspondingly, the viewing area information being used to represent the target video area currently viewed by the user includes:

Determining the angular interval in which the angle lies, and determining the video region corresponding to the determined angular interval as the target video area currently viewed by the user.
The method according to claim 1, wherein the panoramic video data fed back to the client further comprises video tag data, and the video tag data is used to represent at least one video tag to which each of the video regions is bound; correspondingly, the method further includes:

Receiving at least one target video tag sent by the client, where the target video tag is determined according to the target video area viewed by the user;

Feeding back the recommended video data to the client based on the at least one target video tag; wherein the video represented by the recommended video data is provided with at least one of the target video tags.
The method according to claim 1, wherein the viewing area information further includes the duration for which the user views the target video area; and correspondingly, the feeding back recommended video data to the client based on the viewing area information includes:

When the duration for which the user views the target video area reaches a specified length of time, feeding back the recommended video data to the client.
A server, comprising: a network communication port, a memory, and a processor, wherein:

The network communication port is configured to receive a video loading request and viewing area information sent by a client, and to feed back panoramic video data and recommended video data to the client;

The memory is configured to store the panoramic video data;

The processor is configured to: receive, through the network communication port, the video loading request sent by the client, where the video loading request is directed to the panoramic video data in the memory, and the panoramic video represented by the panoramic video data includes at least two video regions displaying different contents; control the network communication port to feed back the panoramic video data to the client, so that the client renders the panoramic video data to obtain the panoramic video; receive, through the network communication port, the viewing area information sent by the client, where the viewing area information is used to represent a target video area currently viewed by the user; and control the network communication port to feed back the recommended video data to the client based on the viewing area information; wherein the content displayed in the video represented by the recommended video data is adapted to the content displayed in the target video area.
A video recommendation method, the method comprising:

Sending a video loading request to the server, the video loading request pointing to the panoramic video data in the server;

Receiving the panoramic video data fed back by the server, and rendering the panoramic video data to obtain a panoramic video; the panoramic video includes at least two video regions displaying different contents;

Obtaining current viewing area information of the user, and sending the viewing area information to the server; the viewing area information is used to represent a target video area currently viewed by the user;

Receiving recommended video data fed back by the server; content displayed in the video characterized by the recommended video data is adapted to content displayed in the target video area.
The method according to claim 9, wherein the obtaining current viewing area information of the user comprises:

Determining the angle between the current line of sight of the user and a reference line of sight, and using the angle as the current viewing area information of the user.
The method according to claim 9, wherein the panoramic video data includes an identifier of each of the video regions; and correspondingly, the obtaining current viewing area information of the user includes:

Determining the video region with the largest display area in the current field of view of the user, and using the identifier of the video region with the largest display area as the current viewing area information of the user.
The method of claim 11, wherein the method further comprises:

When at least two video regions have the largest display area, using the identifier of the video region on which the user's line of sight falls as the current viewing area information of the user.
The method according to claim 9, wherein the panoramic video data includes an angular interval of each of the video regions; and correspondingly, the obtaining current viewing area information of the user includes:

Determining an angle between a current line of sight of the user and a reference line of sight, and determining the angular interval in which the angle lies;

Using the determined angular interval as the current viewing area information of the user.
The method according to claim 9, wherein the panoramic video data further comprises video tag data, wherein the video tag data is used to represent at least one video tag to which each of the video regions is bound; correspondingly, the method further includes:

Determining at least one target video tag bound to a target video area currently viewed by the user, and transmitting the at least one target video tag to the server.
The method according to claim 14, wherein the content displayed in the video represented by the recommended video data is adapted to the content displayed in the target video area, including:

The video characterized by the recommended video data is provided with at least one of the target video tags.
A client, characterized in that the client comprises a network communication port, a display and a processor, wherein:

The network communication port is configured to send a video loading request and viewing area information to the server, and receive the panoramic video data and the recommended video data fed back by the server;

The display is configured to display a panoramic video characterized by the panoramic video data and display a video characterized by the recommended video data;

The processor is configured to: control the network communication port to send a video loading request to a server, where the video loading request is directed to the panoramic video data in the server; receive, through the network communication port, the panoramic video data fed back by the server, and render the panoramic video data to obtain a panoramic video, where the panoramic video includes at least two video regions displaying different content; acquire current viewing area information of the user, and control the network communication port to send the viewing area information to the server, where the viewing area information is used to represent the target video area currently viewed by the user; and receive, through the network communication port, the recommended video data fed back by the server, where the content displayed in the video represented by the recommended video data is adapted to the content displayed in the target video area.