CN109274999A - Video playback control method, device, equipment, and medium - Google Patents

Video playback control method, device, equipment, and medium

Info

Publication number
CN109274999A
Authority
CN
China
Prior art keywords
video
information
instruction information
markup information
video data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811169061.5A
Other languages
Chinese (zh)
Inventor
陈姿
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd
Priority to CN201811169061.5A
Publication of CN109274999A
Legal status: Pending

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41 Structure of client; Structure of client peripherals
    • H04N21/4104 Peripherals receiving signals from specially adapted client devices
    • H04N21/4126 The peripheral being portable, e.g. PDAs or mobile phones
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/437 Interfacing the upstream path of the transmission network, e.g. for transmitting client requests to a VOD server
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/472 End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47205 End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for manipulating displayed content, e.g. interacting with MPEG-4 objects, editing locally
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60 Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
    • H04N21/65 Transmission of management data between client and server
    • H04N21/658 Transmission by the client directed to the server
    • H04N21/6587 Control parameters, e.g. trick play commands, viewpoint selection
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80 Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83 Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/84 Generation or processing of descriptive data, e.g. content descriptors

Abstract

This application discloses a video playback control method, including: sending an acquisition request to a server; obtaining the video data of a specified video, and the instruction information corresponding to it, that the server provides according to the acquisition request; obtaining the markup information of a specified object; and then playing the video data of the specified video and displaying the markup information of the specified object according to the instruction information. In this way, a user watching the video can identify the objects in it from their markup information and no longer needs to exit playback and consult the film synopsis to work out which object is which. The method gives the user a more convenient and more intuitive way to obtain information, which reduces frequent interaction between the user and the network, improves the viewing experience, and reduces the occupation and waste of video platform resources. This application also discloses a video playback control device, equipment, and storage medium.

Description

Video playback control method, device, equipment, and medium
Technical field
This application relates to the field of video technology, and in particular to a video playback control method, device, equipment, and computer-readable storage medium.
Background
At present, users watching a video often find it hard to tell apart the objects that appear in it. For example, when a user watches a foreign-language video, the actors may look alike to that user, so the user often cannot remember which character each actor plays. Similarly, when a user watches a cartoon, the cartoon characters may be so similar that the user confuses different animal characters.
Typically, a user who cannot tell objects apart while watching a video exits playback, checks the film synopsis of the video, and then identifies the specific object from the synopsis. On the one hand, this adds extra interactions between the user and the network and occupies the network resources of the video platform. On the other hand, obtaining the information indirectly through the synopsis forces the user to interrupt playback frequently, which harms the viewing experience, and some users abandon the video because the interaction is inconvenient.
This can significantly limit the development of a video platform. How to use technical means to reduce frequent interaction between the user and the network and to reduce the occupation and waste of video platform resources, and thereby help users distinguish objects while watching a video, is therefore a major problem that video platforms urgently need to solve.
Summary of the invention
The embodiments of this application provide a video playback control method that can reduce frequent interaction between the user and the network and reduce the occupation and waste of video platform resources. On this basis, the embodiments of this application also provide a corresponding device, equipment, and computer storage medium.
In view of this, in one aspect this application provides a video playback control method, the method comprising:
sending an acquisition request to a server, the acquisition request being used to request the video data of a specified video and its corresponding instruction information;
obtaining the video data of the specified video and its corresponding instruction information provided by the server according to the acquisition request, the instruction information being used to instruct the client, when playing the video data of the specified video, to display the markup information of a specified object at the specified position corresponding to a specified time;
obtaining the markup information of the specified object; and
playing the video data of the specified video and displaying the markup information of the specified object according to the instruction information.
In another aspect, this application provides a video playback control method, the method comprising:
receiving an acquisition request sent by a client, the acquisition request being used to request the video data of a specified video and its corresponding instruction information; and
providing the client, according to the acquisition request, with the video data of the specified video and its corresponding instruction information, the instruction information being used to instruct the client, when playing the video data of the specified video, to display the markup information of a specified object at the specified position corresponding to a specified time.
In another aspect, this application provides a video playback control device, the device comprising:
a sending module, configured to send an acquisition request to a server, the acquisition request being used to request the video data of a specified video and its corresponding instruction information;
a first obtaining module, configured to obtain the video data of the specified video and its corresponding instruction information provided by the server according to the acquisition request, the instruction information being used to instruct the client, when playing the video data of the specified video, to display the markup information of a specified object at the specified position corresponding to a specified time;
a second obtaining module, configured to obtain the markup information of the specified object; and
a control module, configured to play the video data of the specified video and display the markup information of the specified object according to the instruction information.
In another aspect, this application provides a video playback control device, the device comprising:
a receiving module, configured to receive an acquisition request sent by a client, the acquisition request being used to request the video data of a specified video and its corresponding instruction information; and
a providing module, configured to provide the client, according to the acquisition request, with the video data of the specified video and its corresponding instruction information, the instruction information being used to instruct the client, when playing the video data of the specified video, to display the markup information of a specified object at the specified position corresponding to a specified time.
In another aspect, this application provides video playback control equipment, the equipment comprising a processor and a memory:
the memory is configured to store program code and transmit the program code to the processor; and
the processor is configured to execute the steps of the above video playback control method according to the instructions in the program code.
In another aspect, this application provides a computer-readable storage medium, the computer-readable storage medium being configured to store program code, and the program code being used to execute the above video playback control method.
As can be seen from the above technical solutions, the embodiments of this application have the following advantages:
The embodiments of this application provide a video playback control method in which the client obtains from the server the video data and the instruction information corresponding to the video data, where the instruction information instructs the client to display the markup information of a specified object at the specified position corresponding to a specified time. When the client plays the video data, it displays the markup information of the specified object according to the instruction information. A user watching the video can thus identify the objects in it from their markup information and no longer needs to exit playback and consult the film synopsis to look them up, which reduces frequent interaction between the user and the network and reduces the occupation and waste of video platform resources.
Brief description of the drawings
Fig. 1 is a scene architecture diagram of a video playback control method in an embodiment of this application;
Fig. 2 is a flowchart of a video playback control method in an embodiment of this application;
Fig. 3 is a flowchart of a video playback control method in an embodiment of this application;
Fig. 4 is a flowchart of a video playback control method in an embodiment of this application;
Fig. 5 is a flowchart of a video playback control method in an embodiment of this application;
Fig. 6 is an interaction flowchart of a video playback control method in an embodiment of this application in a practical application scenario;
Fig. 7 is a schematic diagram of the interface in which clicking the annotation component triggers an object marking operation in an embodiment of this application;
Fig. 8 is a schematic diagram of the interface in which face images are highlighted with bounding boxes in an embodiment of this application;
Fig. 9 is a schematic diagram of the interface that displays the markup information input control in an embodiment of this application;
Fig. 10 is a structural schematic diagram of a video playback control device in an embodiment of this application;
Fig. 11 is a structural schematic diagram of a video playback control device in an embodiment of this application;
Fig. 12 is a structural schematic diagram of a video playback control device in an embodiment of this application;
Fig. 13 is a structural schematic diagram of a video playback control device in an embodiment of this application;
Fig. 14 is a structural schematic diagram of a video playback control device in an embodiment of this application;
Fig. 15 is a structural schematic diagram of video playback control equipment in an embodiment of this application;
Fig. 16 is a structural schematic diagram of video playback control equipment in an embodiment of this application.
Detailed description of the embodiments
To help those skilled in the art better understand the solution of this application, the technical solutions in the embodiments of this application are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some of the embodiments of this application, not all of them. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments in this application without creative effort shall fall within the protection scope of this application.
The terms "first", "second", "third", "fourth", and the like (if present) in the description, the claims, and the above drawings of this application are used to distinguish similar objects and do not necessarily describe a particular order or sequence. It should be understood that data so used are interchangeable where appropriate, so that the embodiments described here can, for example, be implemented in an order other than those illustrated or described here. In addition, the terms "include" and "have" and any variations of them are intended to cover non-exclusive inclusion; for example, a process, method, system, product, or equipment that contains a series of steps or units is not necessarily limited to the steps or units expressly listed, but may include other steps or units that are not expressly listed or that are inherent to the process, method, product, or equipment.
To address the technical problem of how to reduce frequent interaction between the user and the network and reduce the occupation of video platform resources, this application provides a video playback control method. In this method, the client obtains from the server the video data and the instruction information corresponding to the video data, where the instruction information instructs the client to display the markup information of a specified object at the specified position corresponding to a specified time. When the client plays the video data, it displays the markup information of the specified object according to the instruction information.
In this way, a user watching the video can identify the objects in it from the markup information of the specified objects and no longer needs to exit playback and consult the film synopsis to look them up. The method gives the user a more convenient and more intuitive way to obtain information, which reduces frequent interaction between the user and the network, improves the viewing experience, and reduces the occupation and waste of video platform resources.
It can be understood that the video playback control method provided by this application can be applied to a terminal device on which a client is installed. By running the client, the terminal device interacts with the server to provide a video playback control service for the user, so that a user watching a video can identify the objects in it from the markup information of the specified objects, without exiting playback and consulting the film synopsis to look them up. The terminal device may be any computing device with data-processing capability, such as a desktop computer, a laptop, or a smartphone. The client installed on the terminal device may be a standalone application, or a functional module, plug-in, or component integrated into another application; for example, the client may be a plug-in integrated into a video platform that provides the video playback control service.
To make the technical solution of this application easier to understand, a video playback control method in an embodiment of this application is introduced below in a concrete scenario.
Fig. 1 is a scene architecture diagram of a video playback control method in an embodiment of this application. Referring to Fig. 1, the scene includes a terminal device 10 and a server 20. A client is installed on the terminal device 10; when the client runs, it interacts with the server 20 and executes the video playback control method of the embodiments of this application to provide a video playback control service for the user. The specific implementation process is described below.
The client sends an acquisition request to the server 20 through the terminal device 10 and receives the video data of the specified video and its corresponding instruction information returned by the server 20. The instruction information instructs the client, when playing the video data of the specified video, to display the markup information of a specified object at the specified position corresponding to a specified time. The client then obtains the markup information of the specified object, plays the video data of the specified video, and displays the markup information of the specified object according to the instruction information.
In this way, a user watching the video can identify the objects in it from the markup information of the specified objects and no longer needs to exit playback and consult the film synopsis to look them up. The method therefore gives the user a more convenient and more intuitive way to obtain information, which reduces frequent interaction between the user and the network and reduces the occupation and waste of video platform resources.
Next, the video playback control method in the embodiments of this application is described in detail from the perspective of the client and of the server in turn.
First, the method is described from the perspective of the client with reference to Fig. 2.
Fig. 2 is a flowchart of a video playback control method in an embodiment of this application, applied to a client. Referring to Fig. 2, the method includes:
S201: Send an acquisition request to a server.
The acquisition request is used to request the video data of a specified video and its corresponding instruction information. The user can select a video of interest from the videos provided by the video platform as the specified video and play it. Specifically, when the user triggers a play operation for the specified video, the client responds to the user's operation by sending the acquisition request to the server, in order to obtain the video data of the specified video and the instruction information corresponding to that video data.
In a specific implementation, the acquisition request carries the video identifier of the specified video, and the video identifier uniquely identifies the specified video. As a specific example of this application, every video on the video platform has a video number and no two videos share a number, so the video number can be used as the video identifier. In practical applications the video identifier can also be represented in other ways, for example by using the video title as the video identifier.
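As a rough illustration of S201, the sketch below builds an acquisition request that carries a video identifier. The JSON encoding and the field names `video_id` and `want` are assumptions made for this example; the text only states that the request carries an identifier that uniquely identifies the specified video.

```python
import json


def build_acquisition_request(video_id: str) -> str:
    """Serialize a request for a video's data and its instruction information.

    The wire format (JSON) and the field names are hypothetical.
    """
    if not video_id:
        raise ValueError("video_id must be a non-empty identifier")
    return json.dumps(
        {"video_id": video_id, "want": ["video_data", "instruction_info"]}
    )


# "v-0001" is a made-up video number used as the video identifier.
request = build_acquisition_request("v-0001")
```

A video title could be passed in place of the number, as the text notes, provided the platform guarantees that it is unique.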
S202: Obtain the video data of the specified video and its corresponding instruction information provided by the server according to the acquisition request.
The instruction information instructs the client, when playing the video data of the specified video, to display the markup information of a specified object at the specified position corresponding to a specified time. In this embodiment, the server stores the video data of videos and the instruction information corresponding to the video data, so the client can obtain the video data of the specified video and its corresponding instruction information that the server provides according to the acquisition request.
The client can obtain the video data of the specified video and its corresponding instruction information from the server in several ways. In one implementation, the client receives the video data of the specified video and its corresponding instruction information that the server returns in response to the acquisition request, and thereby has both. In another implementation, the client actively retrieves them from the server: the client receives the storage address of the video data of the specified video and of its instruction information, which the server returns in response to the acquisition request, and then accesses the server according to that storage address to obtain the video data of the specified video and its corresponding instruction information.
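The two retrieval modes can be sketched as a single client-side helper. This is only an illustration: the `VideoPayload` and `StorageAddress` types and the injected `fetch` callable are hypothetical names, and the real transport is not specified in the text.

```python
from dataclasses import dataclass
from typing import Callable, Union


@dataclass
class VideoPayload:
    video_data: bytes
    instruction_info: list


@dataclass
class StorageAddress:
    url: str


def obtain_payload(
    response: Union[VideoPayload, StorageAddress],
    fetch: Callable[[str], VideoPayload],
) -> VideoPayload:
    """Return the payload, following a storage address if the server sent one."""
    if isinstance(response, StorageAddress):
        # Second mode: the client actively retrieves the data from the address.
        return fetch(response.url)
    # First mode: the server returned the data directly.
    return response
```

Passing `fetch` in as a parameter keeps the sketch independent of any particular network library.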
In this embodiment, the instruction information can be expressed as a specified object together with a specific time point and the specific display position corresponding to it. For example, the server recognizes faces in the video using face recognition technology, determines the times at which the specified object appears in the video, and from these determines the display time points of the markup information of the specified object, where a time point can be accurate to the second. The server can then generate the instruction information from the display time points, for example from the display time point corresponding to the markup information of the specified object and the display position corresponding to that display time point. On this basis, the instruction information includes the display time point corresponding to the markup information of the specified object and the display position corresponding to that display time point.
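One way a client might hold time-based instruction information is sketched below. The `TimedInstruction` record, the normalized coordinates, and the per-second index are assumptions for illustration; the text only requires a display time point (accurate to the second) and a matching display position.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class TimedInstruction:
    object_id: str
    time_s: int  # display time point, in whole seconds from the start
    x: float     # display position; normalized 0..1 coordinates are an assumption
    y: float


def index_by_second(items: List[TimedInstruction]) -> Dict[int, List[TimedInstruction]]:
    """Group instruction entries by their display second for quick lookup."""
    index = defaultdict(list)
    for item in items:
        index[item.time_s].append(item)
    return dict(index)


def due_at(index: Dict[int, List[TimedInstruction]], playback_s: float) -> List[TimedInstruction]:
    """Entries whose display time point matches the current playback second."""
    return index.get(int(playback_s), [])
```

Indexing once and looking up by second avoids scanning the whole list on every playback tick.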
Because each second of video data contains multiple frames of image data, the instruction information can also be expressed as a specified object together with a specific video frame number and the specific display position corresponding to it. In some possible implementations of the embodiments of this application, the instruction information may therefore include the video frame number corresponding to the markup information of the specified object and the display position corresponding to that video frame number.
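The relationship between the frame-based and time-based forms can be sketched as follows, assuming a constant integer frame rate and zero-based frame numbers. Neither assumption comes from the text, and variable-frame-rate video would need a timestamp table instead.

```python
def frame_to_second(frame_number: int, fps: int) -> int:
    """Whole second of playback that contains the given zero-based frame."""
    return frame_number // fps


def frames_in_second(second: int, fps: int) -> range:
    """Zero-based frame numbers that fall within the given whole second."""
    return range(second * fps, (second + 1) * fps)
```

Frame-level entries allow the display position to follow an object as it moves within a single second, at the cost of more entries per object.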
It should be noted that video data is generally large. To improve the responsiveness of video playback, the server can split the video data into segments, obtaining multiple video segments for one video. On this basis, when playing a video the client can obtain the corresponding video segments from the server, and the video data returned by the server is then a video segment. For example, a 90-minute film can be divided into multiple segments, which reduces the loading time before playback, saves bandwidth, and relieves pressure on the server. In some possible implementations the video data can instead be the complete video data; for example, for a video shorter than 5 minutes, the server can return the entire video data without dividing it into segments.
When the video data is a video segment, the instruction information may include the display time point corresponding to the markup information of the specified object in that video segment and the display position corresponding to that display time point. Alternatively, the instruction information may include the video frame number corresponding to the markup information of the specified object in that video segment and the display position corresponding to that video frame number.
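A minimal sketch of fixed-length segmentation follows. The 600-second segment length used in the usage note is an arbitrary value chosen for the example, and whether per-segment instruction information uses segment-local or whole-video time points is not stated in the text; `locate` shows the conversion a client would need if the times are segment-local.

```python
from typing import List, Tuple


def split_into_segments(duration_s: int, segment_s: int) -> List[Tuple[int, int]]:
    """Return (start, end) second ranges; the last segment may be shorter."""
    if duration_s <= 0 or segment_s <= 0:
        raise ValueError("duration_s and segment_s must be positive")
    return [
        (start, min(start + segment_s, duration_s))
        for start in range(0, duration_s, segment_s)
    ]


def locate(time_s: int, segment_s: int) -> Tuple[int, int]:
    """Map a whole-video time point to (segment index, offset within segment)."""
    return divmod(time_s, segment_s)
```

With these definitions, `split_into_segments(90 * 60, 600)` yields nine segments for a 90-minute film, while a 300-second video yields a single segment covering the whole video.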
S203: Obtain the markup information of the specified object.
Markup information is information that identifies a specified object and helps the user tell objects apart. For example, the markup information can be identity information, such as the name or category of the specified object. If the specified object is a character in the video, the markup information can be the character's name in the video, that is, the role name; it can also be the character's nickname, occupation, and so on. If the specified object is an animal in the video, the markup information can be the animal's category; for example, for an animal that viewers may not readily recognize, such as an otter, its category name "otter" can be used as the markup information.
The client obtains the markup information of the specified object so that it can display that information when playing the video data and thereby help the user distinguish the objects in the video. In some cases, the client stores markup information for objects in the video locally; the specified object is then an object in the video that has been marked, and the client can obtain its markup information directly from local storage. In other cases, markup information for objects in the video is stored on the server in advance; the specified object is again an object in the video that has been marked, and the client can obtain its markup information from the server.
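The two sources of markup information can be combined into one lookup, sketched below. Treating the server as a fallback for the local store, and caching server results locally, are design assumptions of this example; the text presents local and server storage as separate cases. The function and parameter names are hypothetical.

```python
from typing import Callable, Dict, Optional


def get_markup(
    object_id: str,
    local_store: Dict[str, str],
    fetch_from_server: Callable[[str], Optional[str]],
) -> Optional[str]:
    """Look up markup locally first, then fall back to the server."""
    if object_id in local_store:
        return local_store[object_id]
    markup = fetch_from_server(object_id)
    if markup is not None:
        local_store[object_id] = markup  # keep a local copy for later lookups
    return markup
```

Checking local storage first keeps repeated lookups for the same object from generating further network requests.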
It should be noted that S202 and S203 can be executed simultaneously or in a preset order; the order in which S202 and S203 are executed does not affect the specific implementation of this application.
S204: Play the video data of the specified video and display the markup information of the specified object according to the instruction information.
After the client has obtained the video data of the specified video, the instruction information corresponding to the video data, and the markup information, it can play the video data. Because the instruction information carries the time and position at which the markup information of the specified object is to be displayed, the client displays the markup information of the specified object at the specified position corresponding to the specified time, according to the instruction information and the markup information.
In practice, once the user can recognize a specific object from its markup information, the user no longer needs the prompt and would rather see a clean video picture. For this reason, in some possible implementations the client can also hide the markup information so that it does not interfere with viewing. Specifically, in response to an object marking cancel operation, the client hides the markup information of the specified object. In a specific implementation, the user can choose to show or hide the markup information of the specified object as needed: when the user has difficulty distinguishing the objects in the video, the markup information is displayed to help; when the user can distinguish them, it is hidden so that it does not affect the viewing experience.
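S204 and the show/hide behaviour can be sketched together as a small overlay controller. The class name `MarkupOverlay`, its method names, and the tuple layout of an instruction entry are hypothetical; actual rendering onto the video picture is left out.

```python
from typing import Dict, List, Tuple

# (object_id, display second, x, y); this tuple layout is an assumption.
Instruction = Tuple[str, int, float, float]


class MarkupOverlay:
    def __init__(self, instructions: List[Instruction], markup: Dict[str, str]):
        self.instructions = instructions
        self.markup = markup
        self.visible = True  # markup is shown until the user cancels it

    def cancel_marking(self) -> None:
        """Handle an object marking cancel operation by hiding the markup."""
        self.visible = False

    def resume_marking(self) -> None:
        """Show the markup again when the user asks for it."""
        self.visible = True

    def labels_at(self, second: int) -> List[Tuple[str, float, float]]:
        """Labels to draw for the given playback second, or [] when hidden."""
        if not self.visible:
            return []
        return [
            (self.markup[obj], x, y)
            for obj, t, x, y in self.instructions
            if t == second and obj in self.markup
        ]
```

Objects that have an instruction entry but no markup information are skipped, so only marked objects produce labels.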
From the foregoing, it will be observed that the embodiment of the present application provides a kind of video playing control method, in the method, client is from clothes The video data and instruction information corresponding with video data, the instruction information that business device obtains are used to indicate client and are referring to The markup information of specified object is shown on the corresponding designated position of timing machine, then client is when playing the video data, and root The markup information of specified object is shown according to the instruction information.User can pass through the mark of specified object when watching video in this way Object in information identification piece is infused, user, which does not need to exit again to play, goes for corresponding objects, therefore, the party by watching film profile Method provides more convenient information acquiring pattern for user, and user is facilitated more intuitively to obtain information, thus reduce user with The frequent interaction of network reduces the occupancy and waste of video platform resource.
In the embodiment depicted in figure 2, a committed step for realizing video playing control method is to obtain specified object Markup information.And the markup information of object can be and mark generation in the server in advance by video platform operation personnel. But in order to improve the viewing experience that user watches video, in order to meet user to the individual demand of object marking, the application is also Corresponding solution is provided, specifically, providing the service of personalized mark object by client for user, it is possible to understand that , user first can carry out personalized mark to some or certain objects that mark, then client according to actual needs Markup information can be buffered in local, so that subsequent client is from the local markup information for obtaining specified object, and for broadcasting The markup information is shown when putting video.Certainly, client can also will be sent to service for the markup information of marked object Device is saved by server, in case needed for subsequent.The specific implementation of the solution in order to facilitate understanding is carried out below with reference to Fig. 3 Explanation.
Fig. 3 is a flowchart of a video playing control method in an embodiment of the application. This embodiment differs from the embodiment shown in Fig. 2 in that it adds a process for marking markable objects. Only the differences from the Fig. 2 embodiment are described in detail here; for the other steps, see the Fig. 2 embodiment. Referring to Fig. 3, the method includes:
S301: In response to an object marking operation, pause playback of the video data and, according to the region positions of the markable objects obtained from the server, highlight the markable objects in the video frame at which the video is paused.
In response to the object marking operation, the client pauses playback of the video data and, according to the region positions of the markable objects obtained from the server, highlights the markable objects in the video frame at which the video is paused, so that the user can select one or more markable objects to mark.
A markable object is an object in the video that can be marked, or that is allowed to be marked; for example, a person or an animal in the video. The markable objects may include all person objects or animal objects in the video, or only some of them, such as the leading characters. The client can provide a marking control on the playback page. When the user triggers the object marking operation, the client pauses playback of the video data and, according to the region positions of the markable objects obtained from the server, highlights the markable objects in the video frame at which the video is paused, so that the user can select the object to be marked. In some possible implementations, the client highlights markable objects with bounding boxes; specifically, in the video frame at which the video is paused, the client highlights every markable object with a bounding box.
S302: In response to a selection operation on the region of a markable object, display a markup information input control for that markable object.
Once the client displays the markable objects in the paused video frame, the user can select the markable object to be marked by mouse click, touch, voice control or similar means. In response to the selection operation on the region of the markable object, the client displays the markup information input control for that markable object.
It should be noted that when multiple markable objects are displayed in the paused video frame, the user may select only one markable object at a time; in response to the user's selection operation on the region of that markable object, the client displays the markup information input control for it. In other possible implementations of the application, the user may also select multiple markable objects at once; in response to the selection operation on their regions, the client displays the markup information input controls for those markable objects.
S303: In response to a markup information input operation, receive the markup information entered for the markable object.
Using the markup information input control displayed by the client, the user can enter markup information for the markable object. On this basis, the client, in response to the markup information input operation, receives the markup information entered for the markable object.
The client can store the markup information of the markable object locally. When the client later needs the markup information of a specified object, it can obtain it locally, so that during playback the markup information of the specified object is displayed at the designated position corresponding to the specified opportunity.
The client can of course also send the received markup information for the markable object to the server, where it is saved for use in the client's subsequent video playback.
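The two storage options described above (caching locally and, optionally, sending to the server) can be sketched as follows. This is only an illustrative sketch: the class name `AnnotationStore`, the `upload` hook and the use of string object identifiers are assumptions, not part of the application.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Optional


@dataclass
class AnnotationStore:
    """Hypothetical client-side cache of markup information, keyed by object id."""

    # Assumed hook that forwards markup information to the server; optional.
    upload: Optional[Callable[[str, str], None]] = None
    _local: Dict[str, str] = field(default_factory=dict)

    def save(self, object_id: str, text: str) -> None:
        # Always cache locally so later playback can read it without a request.
        self._local[object_id] = text
        # Optionally also hand the markup information to the server.
        if self.upload is not None:
            self.upload(object_id, text)

    def get(self, object_id: str) -> Optional[str]:
        # Returns None when the user has not marked this object.
        return self._local.get(object_id)
```

For example, `AnnotationStore().save("face-1", "Lucy")` keeps the markup information on the client only, while passing an `upload` callable additionally forwards each entry.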
As can be seen from the above, this embodiment of the application provides a video playing control method. In response to an object marking operation, the client pauses playback of the video data and, according to the region positions of the markable objects obtained from the server, highlights the markable objects in the video frame at which the video is paused. In response to a selection operation on a markable object, it then displays the markup information input control for that object, and, in response to a markup information input operation, receives the markup information entered for the markable object. The client can send an acquisition request to the server to obtain the video data and its corresponding instruction information, obtain the markup information of the specified object from the markup information of the markable objects, and then play the video data and display the markup information of the specified object according to the instruction information.
In this way, a user watching the video can identify the objects in the film from the markup information of the specified objects, without exiting playback to look them up in the film profile. The method gives the user a more convenient and more intuitive way to obtain information, which reduces frequent interaction between the user and the network and reduces the occupation and waste of video platform resources.
The embodiments shown in Fig. 2 and Fig. 3 describe the video playing control method of the embodiments of the application in detail from the perspective of the client. Next, the method is described in detail from the perspective of the server, with reference to specific embodiments.
Fig. 4 is a flowchart of a video playing control method in an embodiment of the application, applied to a server. The method includes:
S401: Receive the acquisition request sent by the client.
The acquisition request is used to request the video data of a designated video and its corresponding instruction information. For the specific implementation, see the related description above.
S402: According to the acquisition request, provide the client with the video data of the designated video and its corresponding instruction information.
The instruction information instructs the client, when playing the video data, to display the markup information of the specified object at the designated position corresponding to the specified opportunity. In a specific implementation, the server provides the client with the video data of the designated video and its corresponding instruction information according to the acquisition request. The client can then use the video data, the corresponding instruction information and the markup information to display the markup information of the specified object at the designated position corresponding to the specified opportunity while playing the video data.
The server can provide the video data of the designated video and its corresponding instruction information to the client in several ways. In one implementation, the server responds to the acquisition request by returning the video data of the designated video and its corresponding instruction information directly to the client. In another implementation, the server responds to the acquisition request by returning the storage addresses of the video data of the designated video and of its corresponding instruction information, so that the client obtains both from those storage addresses.
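The two ways of answering an acquisition request can be illustrated with a small sketch. Everything here is hypothetical: the in-memory `CATALOGUE`, the field names and the example addresses are assumptions made for illustration and do not come from the application.

```python
from typing import Any, Dict

# Hypothetical in-memory catalogue; a real server would use its own storage.
CATALOGUE: Dict[str, Dict[str, Any]] = {
    "v1": {
        "video_data": b"<video bytes>",
        "instruction_info": [
            {"object_id": "face-1", "time": 12.0, "pos": (100, 50)},
        ],
        "video_url": "https://example.invalid/v1.mp4",
        "instruction_url": "https://example.invalid/v1.json",
    }
}


def handle_acquisition_request(video_id: str, by_address: bool = False) -> Dict[str, Any]:
    """Answer an acquisition request inline or with storage addresses."""
    record = CATALOGUE[video_id]
    if by_address:
        # Second way: return only the addresses the client should fetch from.
        return {
            "video_url": record["video_url"],
            "instruction_url": record["instruction_url"],
        }
    # First way: return the video data and instruction information directly.
    return {
        "video_data": record["video_data"],
        "instruction_info": record["instruction_info"],
    }
```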
As can be seen from the above, this embodiment of the application provides a video playing control method in which the server, in response to the acquisition request sent by the client, provides the client with the video data of the designated video and the instruction information corresponding to the video data. When playing the video data, the client can then display the markup information of the specified object at the designated position corresponding to the specified opportunity, according to the instruction information. A user watching the video can thus identify the objects in the film from the markup information of the specified objects, without exiting playback to look them up in the film profile. The method therefore gives the user a more convenient and more intuitive way to obtain information, which reduces frequent interaction between the user and the network and reduces the occupation and waste of video platform resources.
In the embodiment shown in Fig. 4, before the server returns instruction information to the client, it first needs to determine the specified opportunity and the designated position related to the specified object, and thereby determine the instruction information. On this basis, the embodiments of the application also provide a specific implementation of the video playing control method, which is described in detail below.
Fig. 5 is a flowchart of a video playing control method in an embodiment of the application. This embodiment differs from the embodiment shown in Fig. 4 in that it adds a step of preprocessing the video. For a video on the video platform, the server can recognize in advance the markable objects involved in the video and determine the instruction information corresponding to the video. This provides data support for the client, so that when playing the designated video the client displays the markup information of the specified object according to the instruction information of the designated video. It should be noted that the server may perform this preprocessing on every video, or only on some videos according to business needs, and that the preprocessing is the same for every video. For ease of understanding, the application therefore describes the preprocessing for a single video only. As for the step of determining the instruction information, only the differences from the Fig. 4 embodiment are described in detail here; for the other steps, see the Fig. 4 embodiment. Referring to Fig. 5, the method includes:
S501: Obtain the material data corresponding to the video.
The material data includes the video data of the video and images of the markable objects.
An image of a markable object may be extracted from stills or from the video itself. Taking a person in the video as the markable object, the image of the markable object may be a facial image of that person. In some possible implementations, the operation personnel of the video platform can log in to a material data upload platform and upload the video and the facial images of its main characters to the server. The server thus obtains the material data corresponding to the video, which includes the video data and the facial images of the main characters in the video.
It should be noted that markable objects are not limited to main characters; they may also be minor characters in the video or non-human roles such as animals. The embodiments of the application place no limitation on this.
S502: Recognize the video data of the video according to the images of the markable objects, and obtain the region positions of the markable objects in the video frames.
The video data includes multiple frames of image data. Using image recognition technology, the server compares the image of a markable object with the frames of image data in the video data, obtains the video frames that contain the markable object, and further obtains the region position of the markable object in those video frames. If the markable object is a person, the corresponding region position may be the position of the person's head region. If the markable object is an animal, the corresponding region position may be the position of the animal's whole body.
In some cases a markable object appears only briefly, and displaying markup information at that point would not help the user distinguish the object. The server may therefore obtain region positions only for markable objects whose continuous appearance reaches a certain duration. Specifically, for each video segment of the video data, the server recognizes the video segment according to the image of the markable object and obtains the region position of the markable object in the video frame at the display time point. The display time point is the earliest appearance time of the first continuous appearance of the markable object in the video segment that reaches the time threshold.
This is illustrated with a specific example. In this example, the video data is divided into 10-second segments to obtain the video segments corresponding to the video data, and the time threshold is set to 2 seconds based on experience. The server recognizes a video segment according to the facial image of person A. When the facial image of person A first appears continuously for 2 seconds in the video segment, the earliest appearance time of that continuous appearance is taken as the display time point, and the server obtains the region position of person A in the video frame at the display time point.
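The display time point rule can be sketched as a small function. The sketch assumes that recognition has already produced, for one video segment, a list of (start, end) intervals in seconds during which the object appears continuously; that input format and the function name `first_display_time` are assumptions for illustration, not something the application specifies.

```python
from typing import List, Optional, Tuple


def first_display_time(
    intervals: List[Tuple[float, float]],
    threshold: float = 2.0,
) -> Optional[float]:
    """Return the start of the first continuous appearance lasting at least
    `threshold` seconds, or None if no appearance in the segment is long enough.

    `intervals` holds (start, end) times in seconds, relative to the segment.
    """
    for start, end in sorted(intervals):
        if end - start >= threshold:
            # Earliest appearance time of the first sufficiently long run.
            return start
    return None
```

A segment in which no appearance reaches the threshold yields `None`, meaning no markup information would be shown for that object in that segment.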
S503: According to the region position of the markable object in the video frame, determine the display position in the video frame of the markup information of the markable object.
Markup information helps the user identify the objects in the video. When the markup information of a markable object is displayed, its display position should therefore be associated with the region position of the markable object in the video frame, so that the user can tell from the markup information which object it refers to. The display position of the markup information in the video frame may be on the periphery of the region position of the markable object in the video frame. In one possible implementation, the display position is the upper-left corner of the region position of the markable object in the video frame. The display position can be set as needed; in other possible implementations of the embodiments of the application, the server may use the top center of the region position of the markable object in the video frame as the display position of the markup information.
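As a minimal sketch of the two placements mentioned above, assume the region position is an axis-aligned box `(x, y, width, height)` in pixels with the origin at the top-left of the frame. That representation, the function name and the `anchor` parameter are assumptions for illustration.

```python
from typing import Tuple

Region = Tuple[int, int, int, int]  # assumed layout: (x, y, width, height)


def annotation_position(region: Region, anchor: str = "top_left") -> Tuple[int, int]:
    """Derive a display position for markup information from a region position."""
    x, y, width, _height = region
    if anchor == "top_left":
        # Upper-left corner of the region.
        return (x, y)
    if anchor == "top_center":
        # Horizontal midpoint of the region's top edge.
        return (x + width // 2, y)
    raise ValueError(f"unsupported anchor: {anchor}")
```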
S504: According to the display position in the video frame of the markup information of the markable object, determine the instruction information corresponding to the video data of the video.
The instruction information corresponding to the video data of the video instructs the client, when playing the video data of the video, to display the markup information of the specified object at the designated position corresponding to the specified opportunity. The specified opportunity characterizes the display timing of the markup information of the markable object, and the designated position characterizes the display position of that markup information at the specified opportunity. Because the display position corresponds to a specific video frame, the server can determine from that video frame the display timing in the video of the markup information of the markable object, and then determine the instruction information corresponding to the video data from the display timing and the display position.
In some possible implementations, the server can determine the display time point from the video frame. The server can then determine the instruction information corresponding to the video data from the display position in the video frame of the markup information of the markable object and the display time point corresponding to that video frame. In this implementation, the instruction information includes the display time point corresponding to the markup information of the markable object and the display position corresponding to that display time point.
In other possible implementations, the server can determine the instruction information corresponding to the video data from the display position in the video frame of the markup information of the markable object and the video frame number of that video frame. In this implementation, the instruction information includes the video frame number corresponding to the markup information of the markable object and the display position corresponding to that video frame number.
In this embodiment, determining the instruction information from the display time point corresponding to the video frame, rather than from the frame number corresponding to the video frame, can reduce the data volume and the computational load on the server.
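The two forms of instruction information can be modelled with one small record type. This is a sketch under assumptions: the class name `InstructionEntry`, its field names and the rule that exactly one of the two keys is set are illustrative choices, not the format used by the application.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class InstructionEntry:
    """One hypothetical instruction-information entry for a markable object."""

    object_id: str
    position: Tuple[int, int]             # display position in the video frame
    display_time: Optional[float] = None  # seconds; time-point variant
    frame_number: Optional[int] = None    # frame-number variant

    def __post_init__(self) -> None:
        # Exactly one of the two keys identifies when to show the markup.
        if (self.display_time is None) == (self.frame_number is None):
            raise ValueError("set exactly one of display_time or frame_number")
```

An entry built with `display_time=12.0` corresponds to the time-point variant, and one built with `frame_number=300` to the frame-number variant.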
As can be seen from the above, this embodiment of the application provides a video playing control method. The server obtains the material data corresponding to the video, recognizes the video data in the material data according to the images of the markable objects in the material data, and obtains the region positions of the markable objects in the video frames. From each region position it determines the display position of the markup information of the markable object, and from that display position in the video frame it determines the instruction information corresponding to the video data. When the server returns the instruction information to the client, the client can, while playing the video data, display the markup information of the specified object at the designated position corresponding to the specified opportunity according to the instruction information. A user watching the video can thus identify the objects in the film from the markup information of the specified objects, without exiting playback to look them up in the film profile. The method gives the user a more convenient and more intuitive way to obtain information, which reduces frequent interaction between the user and the network and reduces the occupation and waste of video platform resources.
The embodiments shown in Fig. 2 to Fig. 5 introduce the video playing control method of the embodiments of the application from the perspective of the client or of the server. Next, the method is illustrated from the perspective of their interaction, with reference to a concrete application scenario.
Fig. 6 is an interaction diagram of a video playing control method of the embodiments of the application in a practical application scenario. Referring to Fig. 6, the application scenario includes a first client 100, a second client 200 and a server 300. The first client 100 is the client installed on the terminal device of the video platform's operation personnel and is used to manage the video data on the server 300, for example by uploading video data to the server 300. The second client 200 is the client installed on the terminal device of a user of the video platform and is used to obtain from the server 300 the video data of the designated video and the instruction information corresponding to the video data, and to display the markup information of the specified object according to the instruction information when playing the video data. As shown in Fig. 6, the method specifically includes the following steps:
S601: The first client 100 uploads the film video data and the facial images of the film's main characters to the server 300.
The film video data and the facial images of the film's main characters form the material data corresponding to the film.
S602: The server 300 recognizes the film video data according to the facial images, obtains the region position of each facial image in the video frame at its display time point, and determines the display position of the markup information in the video frame based on that region position.
The display time point is the earliest appearance time of the first continuous appearance of the facial image in a video segment of the film that reaches the time threshold. If the video segment duration is 10 seconds and the time threshold is 2 seconds, the display time point is the earliest appearance time of the first continuous appearance of the facial image in a video segment of the film that reaches 2 seconds. The server 300 recognizes the region position of the facial image in the video frame at the display time point, and then uses a point on the periphery of that region position, such as its upper-left corner, as the display position of the markup information in the video frame.
The display time point is illustrated below with specific examples. For example, if in one video segment a facial image appears continuously from the 2nd to the 5th second and from the 6th to the 8th second, the earliest appearance time of the first continuous appearance that reaches 2 seconds, that is, the 2nd second, is taken as the display time point. As another example, if in another video segment the facial image appears only at the 3rd second, its continuous appearance time in that video segment does not reach the time threshold, so the video segment has no display time point and no markup information is displayed in it.
It should be noted that the server 300 can recognize the facial images one by one to obtain the display time point corresponding to each facial image. Based on the display time point of each facial image, it obtains the region position of that facial image in the video frame at its display time point and, based on the region position, the display position of the markup information in the video frame corresponding to that display time point.
S603: In response to a video play operation triggered by the user, the second client 200 sends a first acquisition request to the server 300.
S604: The server 300 returns to the second client 200 the video data of the designated video, together with the display time point corresponding to each facial image in the video data and the region position of each facial image in the video frame at its display time point.
The designated video is the video identified by the video play operation triggered by the user. As some specific examples of the application, the designated video may be a video that the user selects in the film list of the second client 200, or a video that the user selects from the search results after searching in the search box of the second client 200.
S605: In response to an object marking operation, the second client 200 pauses playback of the video data and, according to the region positions of the facial images obtained from the server 300, highlights the facial images with bounding boxes in the video frame at which the video is paused.
As shown in Fig. 7, the playback interface of the video platform's client contains a marking component. When the user clicks the marking component with the mouse, the object marking operation is triggered, and the second client 200 pauses playback of the video data in response. According to the region positions of the facial images obtained from the server, the second client 200 highlights the facial images with bounding boxes in the video frame at which the video is paused. As shown in Fig. 8, the paused video frame contains five facial images, and the second client 200 highlights each of them with a bounding box.
S606: In response to a selection operation on the region of a facial image, the second client 200 displays a markup information input control for that facial image.
S607: In response to a markup information input operation, the second client 200 receives the markup information entered for the facial image.
As shown in Fig. 9, the user can perform the selection operation on the region of a facial image by selecting its bounding box. When the region of a facial image is selected, the second client 200 displays the markup information input control on the periphery of the region position of the facial image, for example at its upper-left corner. The user can enter markup information, such as "Lucy", in the markup information input control; in response to the markup information input operation, the second client 200 receives the markup information entered for the facial image.
S608: The second client 200 sends a second acquisition request to the server 300.
S609: The server 300 returns the video data of the designated video and the corresponding instruction information to the second client 200.
The instruction information instructs the client to display the markup information of the specified facial image at the designated position corresponding to the specified opportunity. The specified opportunity includes the display time point of the specified facial image, and the designated position includes the display position of the markup information of the specified facial image in the video frame at the display time point.
S610: The second client 200 obtains the markup information of the specified facial image.
S611: The second client 200 plays the video data and displays the markup information of the specified facial image according to the instruction information.
For the specific playback effect, see Fig. 9: when playing the video data, the second client displays the markup information of the specified facial image according to the instruction information.
It should be noted that S601 and S602 form the preprocessing carried out for the video data. For any video, once it has been uploaded to the server and the server has recognized its main characters, obtained the region position of each character's facial image in the video frame at its display time point, and determined the display position of the markup information in the video frame from that region position, a user requesting the video data from the server can obtain the corresponding data directly, without S601 and S602 being executed again.
In this application scenario, the server preprocesses the video data uploaded by the operation personnel: it recognizes the video data according to the facial images of the film's main characters, obtains the display time point of each facial image and the region position of each facial image in the video frame at its display time point, and determines the display position of the markup information on that basis. When the user triggers the object marking operation, the user's second client responds by pausing playback of the video data and providing a way to enter markup information, so that the user can mark the facial images. The second client can then use the video data, the display time point corresponding to the specified facial image, the display position of the markup information of the specified facial image, and the markup information itself to display the markup information of the specified facial image at the designated position corresponding to the specified opportunity while playing the video data.
In this way, a user watching the video can identify the main characters in the film from the markup information of the specified facial images, without exiting playback to look up the corresponding characters in the film profile. The method therefore gives the user a more convenient and more intuitive way to obtain information, which reduces frequent interaction between the user and the network and reduces the occupation and waste of video platform resources.
The above are some specific implementations of the video playing control method provided by the embodiments of the application. On this basis, the embodiments of the application also provide a video playing control device. Next, the device is introduced from the perspective of functional modularization, with reference to the drawings.
Figure 10 is a structural schematic diagram of a video playing control device in an embodiment of the application. Referring to Figure 10, the device 1000 includes:
a sending module 1010, configured to send an acquisition request to the server, the acquisition request being used to request the video data of a designated video and its corresponding instruction information;
a first obtaining module 1020, configured to obtain the video data of the designated video and its corresponding instruction information provided by the server according to the acquisition request, the instruction information instructing the client, when playing the video data of the designated video, to display the markup information of a specified object at the designated position corresponding to a specified opportunity;
a second obtaining module 1030, configured to obtain the markup information of the specified object;
a control module 1040, configured to play the video data of the designated video and display the markup information of the specified object according to the instruction information.
Optionally, referring to Figure 11, which is a structural schematic diagram of a video playing control device provided by an embodiment of the application, on the basis of the structure shown in Figure 10 the device further includes a first display module 1050, a second display module 1060 and a receiving module 1070, wherein:
the first display module 1050 is configured to, in response to an object marking operation, pause playback of the video data and, according to the region positions of the markable objects obtained from the server, highlight the markable objects in the video frame at which the video is paused;
the second display module 1060 is configured to, in response to a selection operation on the region of a markable object, display a markup information input control for that markable object;
the receiving module 1070 is configured to, in response to a markup information input operation, receive the markup information entered for the markable object.
Optionally, the first display module 1050 is specifically configured to:
highlight the markable objects with bounding boxes.
Optionally, the instruction information includes the display time point corresponding to the markup information of the specified object and the display position corresponding to that display time point;
the control module 1040 is specifically configured to:
at the display time point corresponding to the markup information of the specified object, display the markup information of the specified object at the display position corresponding to that display time point.
Optionally, the instruction information includes the video frame number corresponding to the markup information of the specified object and the display position corresponding to that video frame number;
the control module 1040 is specifically configured to:
display the markup information of the specified object in the video frame corresponding to the video frame number, at the display position corresponding to that video frame number.
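The behaviour of the control module in the time-point variant can be sketched as a lookup performed at each playback position. The application does not say how long markup information stays on screen, so the `hold` duration below is purely an assumption, as are the function name and the tuple layout of the entries.

```python
from typing import Dict, List, Tuple

# Assumed entry layout: (object_id, display_time_seconds, (x, y) display position).
Entry = Tuple[str, float, Tuple[int, int]]


def annotations_to_show(
    entries: List[Entry],
    annotations: Dict[str, str],
    now: float,
    hold: float = 3.0,  # hypothetical on-screen duration, not from the source
) -> List[Tuple[str, Tuple[int, int]]]:
    """Return (markup text, position) pairs to draw at playback time `now`."""
    visible = []
    for object_id, display_time, position in entries:
        text = annotations.get(object_id)
        # Skip objects the user has not marked.
        if text is None:
            continue
        # Show from the display time point until the assumed hold expires.
        if display_time <= now < display_time + hold:
            visible.append((text, position))
    return visible
```

A player could call this on each time update and draw the returned pairs over the current frame; a frame-number variant would compare frame numbers instead of seconds.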
Optionally, referring to Figure 12, which is a schematic structural diagram of a video playing control device provided by an embodiment of the present application, on the basis of the structure shown in Figure 10 the device further includes:
A hiding module 1080, configured to hide the markup information of the specified object in response to an object marking cancellation operation.
As can be seen from the above, the embodiments of the present application provide a video playing control device. The device obtains, from a server, video data and instruction information corresponding to the video data, where the instruction information instructs that the markup information of a specified object be displayed at a specified position corresponding to a specified timing; then, when playing the video data, the device displays the markup information of the specified object according to the instruction information. In this way, a user watching a video can identify objects in the film through the markup information of the specified object, without having to exit playback and look up the corresponding object in the film's synopsis. The device therefore gives the user a more convenient way to obtain information and helps the user obtain it more intuitively, which reduces frequent interaction between the user and the network and reduces the occupation and waste of video platform resources.
Figure 13 is a schematic structural diagram of a video playing control device in an embodiment of the present application. Referring to Figure 13, the device 1300 includes:
A receiving module 1310, configured to receive an acquisition request sent by a client, the acquisition request being used to request the video data of a specified video and its corresponding instruction information;
A providing module 1320, configured to provide, according to the acquisition request, the video data of the specified video and its corresponding instruction information to the client, the instruction information instructing the client, when playing the video data of the specified video, to display the markup information of a specified object at a specified position corresponding to a specified timing.
Optionally, referring to Figure 14, which is a schematic structural diagram of a video playing control device provided by an embodiment of the present application, on the basis of the structure shown in Figure 13 the device further includes:
An obtaining module 1330, configured to obtain material data corresponding to a video, the material data including the video data of the video and an image of a markable object;
A recognition module 1340, configured to recognize the video data of the video according to the image of the markable object, to obtain the region position of the markable object in a video frame;
A first determining module 1350, configured to determine, according to the region position of the markable object in the video frame, the display position of the markup information of the markable object in the video frame, the display position being located at the periphery of the region position;
A second determining module 1360, configured to determine, according to the display position of the markup information of the markable object in the video frame, the instruction information corresponding to the video data of the video, the instruction information instructing the client, when playing the video data of the video, to display the markup information of a specified object at a specified position corresponding to a specified timing.
Optionally, the recognition module 1340 is specifically configured to:
For a video segment corresponding to the video data of the video, recognize the video segment according to the image of the markable object, to obtain the region position of the markable object in the video frame at a display time point, the display time point being the earliest appearance time point of the first continuous appearance of the markable object in the video segment whose duration reaches a time threshold.
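One way to read the display time point rule is: scan the per-frame recognition results of a video segment and return the start of the first uninterrupted appearance that lasts at least the time threshold. The sketch below follows that reading; the function name, the (timestamp, present) input format, and the use of seconds are assumptions, and the object recognition itself is outside the sketch.

```python
from typing import List, Optional, Tuple


def display_time_point(detections: List[Tuple[float, bool]],
                       time_threshold: float) -> Optional[float]:
    """Return the start time of the first continuous appearance that lasts
    at least time_threshold seconds, or None if no appearance qualifies.

    detections: (timestamp_in_seconds, object_present) pairs sorted by time.
    """
    run_start: Optional[float] = None
    for timestamp, present in detections:
        if not present:
            run_start = None        # the continuous appearance was interrupted
            continue
        if run_start is None:
            run_start = timestamp   # a new continuous appearance begins
        if timestamp - run_start >= time_threshold:
            return run_start
    return None
```

Short appearances that end before the threshold is reached are skipped, so the markup information is not attached to an object that only flashes by.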
Optionally, the second determining module 1360 is specifically configured to:
Determine the instruction information corresponding to the video data of the video according to the display position of the markup information of the markable object in the video frame and the video frame number corresponding to the video frame, the instruction information including the video frame number corresponding to the markup information of the markable object and the display position corresponding to the video frame number; or,
Determine the instruction information corresponding to the video data of the video according to the display position of the markup information of the markable object in the video frame and the display time point corresponding to the video frame, the instruction information including the display time point corresponding to the markup information of the markable object and the display position corresponding to the display time point.
Optionally, the display position of the markup information of the markable object in the video frame is at the upper-left corner of the region position of the markable object in the video frame.
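A minimal sketch of deriving the display position from the region position is shown below. It assumes the region is given as (x, y, width, height) with the origin at the top-left of the frame, and it places the markup information just above the region's upper-left corner so that it stays on the periphery of the region; the label_height parameter and the clamping to the frame edge are illustrative assumptions.

```python
from typing import Tuple


def markup_display_position(region: Tuple[int, int, int, int],
                            label_height: int = 20) -> Tuple[int, int]:
    """Return an (x, y) display position anchored at the region's upper-left corner."""
    x, y, _width, _height = region
    # Shift the label upward by its own height, but never above the frame edge.
    return (x, max(0, y - label_height))
```

For a region at (100, 50) with the default label height, the markup information would be drawn at (100, 30).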
As can be seen from the above, the embodiments of the present application provide a video playing control device that receives an acquisition request sent by a client and then, according to the acquisition request, provides the client with the video data of a specified video and the instruction information corresponding to the video data. The client can then display the markup information of a specified object at a specified position corresponding to a specified timing according to the instruction information. A user watching a video can thus identify objects in the film through the markup information of the specified object, without having to exit playback and look up the corresponding object in the film's synopsis. The device therefore gives the user a more convenient way to obtain information and helps the user obtain it more intuitively, which reduces frequent interaction between the user and the network and reduces the occupation and waste of video platform resources.
The embodiments shown in Figures 10 to 14 describe the video playing control device in the embodiments of the present application from the perspective of functional modularization. On this basis, the present application further provides video playing control equipment, which is described next from the perspective of a hardware entity.
The embodiments of the present application provide video playing control equipment, which may be a terminal device, as shown in Figure 15. For ease of description, only the parts relevant to the embodiments of the present application are shown; for specific technical details that are not disclosed, refer to the method part of the embodiments of the present application. The terminal device may be any terminal device, including a mobile phone, a tablet computer, a personal digital assistant (PDA), a vehicle-mounted computer, and the like. The following takes a mobile phone as an example:
Figure 15 is a block diagram of part of the structure of a mobile phone related to the terminal device provided by the embodiments of the present application. Referring to Figure 15, the mobile phone includes components such as a radio frequency (RF) circuit 1510, a memory 1520, an input unit 1530, a display unit 1540, a sensor 1550, an audio circuit 1560, a wireless fidelity (WiFi) module 1570, a processor 1580, and a power supply 1590. Those skilled in the art will understand that the mobile phone structure shown in Figure 15 does not limit the mobile phone, which may include more or fewer components than shown, combine certain components, or use a different component arrangement.
The memory 1520 may be used to store software programs and modules. The processor 1580 performs the various functional applications and data processing of the mobile phone by running the software programs and modules stored in the memory 1520. The memory 1520 may mainly include a program storage area and a data storage area, where the program storage area may store the operating system and the application programs required by at least one function (such as a sound playing function or an image playing function), and the data storage area may store data created through use of the mobile phone (such as audio data or a phone book). In addition, the memory 1520 may include high-speed random access memory and may also include non-volatile memory, for example at least one magnetic disk storage device, a flash memory device, or another volatile solid-state storage device.
The processor 1580 is the control center of the mobile phone. It connects all parts of the mobile phone through various interfaces and lines, and performs the various functions of the mobile phone and processes data by running or executing the software programs and/or modules stored in the memory 1520 and calling the data stored in the memory 1520, thereby monitoring the mobile phone as a whole. Optionally, the processor 1580 may include one or more processing units. Preferably, the processor 1580 may integrate an application processor and a modem processor, where the application processor mainly handles the operating system, the user interface, application programs, and the like, and the modem processor mainly handles wireless communication. It can be understood that the modem processor may also not be integrated into the processor 1580.
Although not shown, the mobile phone may also include a camera, a Bluetooth module, and the like, which are not described here.
In the embodiments of the present application, the processor 1580 included in the terminal device also has the following functions:
Sending an acquisition request to a server, and receiving the video data of a specified video and the instruction information corresponding to the video data returned by the server, the instruction information instructing the client to display the markup information of a specified object at a specified position corresponding to a specified timing;
Obtaining the markup information of the specified object;
Playing the video data and displaying the markup information of the specified object according to the instruction information.
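The three functions above can be sketched end to end as follows. The sketch uses the frame-number variant of the instruction information and a caller-supplied fetch function in place of a real network request. The function name play_with_markup, the dictionary field names, and the assumption that the markup information arrives in the same response as the instruction information are all illustrative choices, not details given in the application.

```python
from typing import Callable, Dict, Iterator, List, Tuple

Overlay = Tuple[str, Tuple[int, int]]  # (markup text, display position)


def play_with_markup(video_id: str,
                     fetch: Callable[[dict], dict],
                     frame_count: int) -> Iterator[Tuple[int, List[Overlay]]]:
    # Step 1: send the acquisition request; receive the instruction information
    # (the video data itself is omitted from this sketch).
    response = fetch({"video_id": video_id})
    instruction_info = response["instruction_info"]
    # Step 2: obtain the markup information of the specified objects
    # (assumed here to be delivered in the same response).
    markup: Dict[str, str] = response["markup"]
    # Step 3: "play" frame by frame, attaching overlays where instructed.
    for frame_number in range(frame_count):
        overlays = [(markup[e["object_id"]], tuple(e["position"]))
                    for e in instruction_info
                    if e["frame_number"] == frame_number]
        yield frame_number, overlays
```

Each yielded pair tells a renderer which markup texts to draw, and where, on the given frame; frames with no matching entry get an empty overlay list.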
Optionally, the processor 1580 included in the terminal device may also be used to execute any implementation of the video playing control method in the embodiments of the present application.
The embodiments of the present application also provide another kind of video playing control equipment. Figure 16 is a schematic structural diagram of video playing control equipment provided by an embodiment of the present application. The equipment may be a server. The server 1600 may vary considerably with configuration or performance, and may include one or more central processing units (CPUs) 1622 (for example, one or more processors), memory 1632, and one or more storage media 1630 (for example, one or more mass storage devices) storing application programs 1642 or data 1644. The memory 1632 and the storage medium 1630 may provide transient or persistent storage. A program stored in the storage medium 1630 may include one or more modules (not marked in the figure), each of which may include a series of instruction operations on the server. Further, the central processing unit 1622 may be configured to communicate with the storage medium 1630 and execute, on the server 1600, the series of instruction operations in the storage medium 1630.
The server 1600 may also include one or more power supplies 1626, one or more wired or wireless network interfaces 1650, one or more input/output interfaces 1658, and/or one or more operating systems 1641, such as Windows Server™, Mac OS X™, Unix™, Linux™, FreeBSD™, and the like.
The steps performed by the server in the above embodiments may be based on the server structure shown in Figure 16.
The CPU 1622 is configured to perform the following steps:
Receiving an acquisition request sent by a client;
Returning, according to the acquisition request, the video data of a specified video and the instruction information corresponding to the video data to the client, the instruction information instructing the client, when playing the video data, to display the markup information of a specified object at a specified position corresponding to a specified timing.
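The two server-side steps can be sketched as a single request handler. The in-memory store, the VideoRecord type, the "video_id" request field, and the status values are assumptions for illustration; the application does not define a request or response format.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class VideoRecord:
    """Hypothetical stored pairing of video data and its instruction information."""
    video_data: bytes
    instruction_info: List[dict]


def handle_acquisition_request(request: dict,
                               store: Dict[str, VideoRecord]) -> dict:
    # Step 1: receive the acquisition request and read which video is requested.
    record = store.get(request.get("video_id", ""))
    if record is None:
        return {"status": "not_found"}
    # Step 2: return the video data together with its instruction information.
    return {
        "status": "ok",
        "video_data": record.video_data,
        "instruction_info": record.instruction_info,
    }
```

Returning both items in one response is a design choice of this sketch; the text only requires that the client ends up with the video data and the corresponding instruction information.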
The embodiments of the present application also provide a computer-readable storage medium for storing program code, the program code being used to execute any implementation of the video playing control method described in the foregoing embodiments.
The embodiments of the present application also provide a computer program product including instructions which, when run on a computer, cause the computer to execute any implementation of the video playing control method described in the foregoing embodiments.
Those skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the systems, devices, and units described above may refer to the corresponding processes in the foregoing method embodiments, and are not repeated here.
In the several embodiments provided in the present application, it should be understood that the disclosed systems, devices, and methods may be implemented in other ways. For example, the device embodiments described above are merely illustrative. The division into units is only a division by logical function, and other divisions are possible in an actual implementation; for example, multiple units or components may be combined or integrated into another system, or some features may be omitted or not executed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be indirect coupling or communication connection through some interfaces, devices, or units, and may be electrical, mechanical, or in other forms.
The units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units; that is, they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, the functional units in the embodiments of the present application may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application in essence, or the part that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to execute all or some of the steps of the methods of the embodiments of the present application. The aforementioned storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, read-only memory (ROM), random access memory (RAM), a magnetic disk, or an optical disc.
It should be understood that in the present application "at least one (item)" means one or more, and "multiple" means two or more. "And/or" describes an association between associated objects and indicates that three relationships may exist; for example, "A and/or B" may mean: only A exists, only B exists, or both A and B exist, where A and B may each be singular or plural. The character "/" generally indicates an "or" relationship between the objects before and after it. "At least one of the following (items)" or a similar expression refers to any combination of these items, including any combination of single items or plural items. For example, at least one of a, b, or c may mean: a, b, c, "a and b", "a and c", "b and c", or "a and b and c", where each of a, b, and c may be single or multiple.
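The meaning of "at least one of a, b, or c" can be checked by enumerating every non-empty combination of the listed items. The helper name at_least_one_of is hypothetical; the sketch only illustrates the definition above.

```python
from itertools import combinations
from typing import List, Sequence, Tuple


def at_least_one_of(items: Sequence[str]) -> List[Tuple[str, ...]]:
    """Enumerate every non-empty combination of the given items."""
    return [combo
            for size in range(1, len(items) + 1)
            for combo in combinations(items, size)]
```

For the items a, b, and c this yields seven combinations, matching the seven cases listed in the text.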
Finally, the above embodiments are merely intended to illustrate the technical solutions of the present application, not to limit them. Although the present application has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they may still modify the technical solutions recorded in the foregoing embodiments, or make equivalent replacements of some of the technical features therein; such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the spirit and scope of the technical solutions of the embodiments of the present application.

Claims (15)

1. A video playing control method, characterized by comprising:
Sending an acquisition request to a server, the acquisition request being used to request the video data of a specified video and its corresponding instruction information;
Obtaining the video data of the specified video and its corresponding instruction information provided by the server according to the acquisition request, the instruction information instructing a client, when playing the video data of the specified video, to display the markup information of a specified object at a specified position corresponding to a specified timing;
Obtaining the markup information of the specified object;
Playing the video data of the specified video and displaying the markup information of the specified object according to the instruction information.
2. The method according to claim 1, characterized in that the method further comprises:
In response to an object marking operation, pausing playback of the video data and, according to the region position of a markable object obtained from the server, highlighting the markable object in the video frame at which the video is paused;
In response to a selection operation on the region of the markable object, displaying a markup information input control for the markable object;
In response to a markup information input operation, receiving the markup information entered for the markable object.
3. The method according to claim 2, characterized in that highlighting the markable object comprises:
Highlighting the markable object in the form of a bounding box.
4. The method according to claim 1, characterized in that the instruction information includes a display time point corresponding to the markup information of the specified object and a display position corresponding to the display time point;
Displaying the markup information of the specified object according to the instruction information then comprises:
At the display time point corresponding to the markup information of the specified object, displaying the markup information of the specified object at the display position corresponding to the display time point.
5. The method according to claim 1, characterized in that the instruction information includes a video frame number corresponding to the markup information of the specified object and a display position corresponding to the video frame number;
Displaying the markup information of the specified object according to the instruction information then comprises:
In the video frame corresponding to the video frame number, displaying the markup information of the specified object at the display position corresponding to the video frame number.
6. The method according to claim 1, characterized in that the method further comprises:
In response to an object marking cancellation operation, hiding the markup information of the specified object.
7. A video playing control method, characterized by comprising:
Receiving an acquisition request sent by a client, the acquisition request being used to request the video data of a specified video and its corresponding instruction information;
Providing, according to the acquisition request, the video data of the specified video and its corresponding instruction information to the client, the instruction information instructing the client, when playing the video data of the specified video, to display the markup information of a specified object at a specified position corresponding to a specified timing.
8. The method according to claim 7, characterized in that the method further comprises:
Obtaining material data corresponding to a video, the material data including the video data of the video and an image of a markable object;
Recognizing the video data of the video according to the image of the markable object, to obtain the region position of the markable object in a video frame;
Determining, according to the region position of the markable object in the video frame, the display position of the markup information of the markable object in the video frame, the display position being located at the periphery of the region position;
Determining, according to the display position of the markup information of the markable object in the video frame, the instruction information corresponding to the video data of the video, the instruction information instructing the client, when playing the video data of the video, to display the markup information of a specified object at a specified position corresponding to a specified timing.
9. The method according to claim 8, characterized in that recognizing the video data of the video according to the image of the markable object, to obtain the region position of the markable object in a video frame, comprises:
For a video segment corresponding to the video data of the video, recognizing the video segment according to the image of the markable object, to obtain the region position of the markable object in the video frame at a display time point, the display time point being the earliest appearance time point of the first continuous appearance of the markable object in the video segment whose duration reaches a time threshold.
10. The method according to claim 8 or 9, characterized in that determining, according to the display position of the markup information of the markable object in the video frame, the instruction information corresponding to the video data of the video comprises:
Determining the instruction information corresponding to the video data of the video according to the display position of the markup information of the markable object in the video frame and the video frame number corresponding to the video frame, the instruction information including the video frame number corresponding to the markup information of the markable object and the display position corresponding to the video frame number; or,
Determining the instruction information corresponding to the video data of the video according to the display position of the markup information of the markable object in the video frame and the display time point corresponding to the video frame, the instruction information including the display time point corresponding to the markup information of the markable object and the display position corresponding to the display time point.
11. The method according to claim 8 or 9, characterized in that the display position of the markup information of the markable object in the video frame is at the upper-left corner of the region position of the markable object in the video frame.
12. A video playing control device, characterized by comprising:
A sending module, configured to send an acquisition request to a server, the acquisition request being used to request the video data of a specified video and its corresponding instruction information;
A first obtaining module, configured to obtain the video data of the specified video and its corresponding instruction information provided by the server according to the acquisition request, the instruction information instructing a client, when playing the video data of the specified video, to display the markup information of a specified object at a specified position corresponding to a specified timing;
A second obtaining module, configured to obtain the markup information of the specified object;
A control module, configured to play the video data of the specified video and display the markup information of the specified object according to the instruction information.
13. A video playing control device, characterized by comprising:
A receiving module, configured to receive an acquisition request sent by a client, the acquisition request being used to request the video data of a specified video and its corresponding instruction information;
A providing module, configured to provide, according to the acquisition request, the video data of the specified video and its corresponding instruction information to the client, the instruction information instructing the client, when playing the video data of the specified video, to display the markup information of a specified object at a specified position corresponding to a specified timing.
14. Video playing control equipment, characterized in that the equipment includes a processor and a memory:
The memory is configured to store program code and transmit the program code to the processor;
The processor is configured to execute, according to the instructions in the program code, the video playing control method according to any one of claims 1 to 6, or the video playing control method according to any one of claims 7 to 11.
15. A computer-readable storage medium, characterized in that the computer-readable storage medium is configured to store program code, the program code being used to execute the video playing control method according to any one of claims 1 to 6, or the video playing control method according to any one of claims 7 to 11.
CN201811169061.5A 2018-10-08 2018-10-08 A kind of video playing control method, device, equipment and medium Pending CN109274999A (en)

Publications (1)

Publication Number Publication Date
CN109274999A true CN109274999A (en) 2019-01-25

Family

ID=65195994

Country Status (1)

Country Link
CN (1) CN109274999A (en)

CN105204718B (en) Information processing method and electronic equipment
CN111611030A (en) Data processing method and device and data processing device
CN110825243A (en) Shortcut phrase input method, terminal device and computer-readable storage medium
CN110929122A (en) Data processing method and device and data processing device
CN111625740A (en) Image display method, image display device and electronic equipment
CN111857467B (en) File processing method and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication (application publication date: 2019-01-25)