WO2021047181A1 - Video type-based playback control implementation method and apparatus, and computer device - Google Patents

Video type-based playback control implementation method and apparatus, and computer device Download PDF

Info

Publication number
WO2021047181A1
WO2021047181A1 PCT/CN2020/087026 CN2020087026W WO2021047181A1 WO 2021047181 A1 WO2021047181 A1 WO 2021047181A1 CN 2020087026 W CN2020087026 W CN 2020087026W WO 2021047181 A1 WO2021047181 A1 WO 2021047181A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
currently
played
video file
playback
Prior art date
Application number
PCT/CN2020/087026
Other languages
French (fr)
Chinese (zh)
Inventor
齐燕
Original Assignee
深圳壹账通智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳壹账通智能科技有限公司 filed Critical 深圳壹账通智能科技有限公司
Publication of WO2021047181A1 publication Critical patent/WO2021047181A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/23418Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/266Channel or content management, e.g. generation and management of keys and entitlement messages in a conditional access system, merging a VOD unicast channel into a multicast channel
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/443OS processes, e.g. booting an STB, implementing a Java virtual machine in an STB or power management in an STB
    • H04N21/4436Power management, e.g. shutting down unused components of the receiver

Definitions

  • This application relates to the field of image recognition technology, and in particular to a method, device, computer equipment, and storage medium for implementing video type-based playback control.
  • the video server sends the streaming data of the video to the local buffer of the smart terminal for playback.
  • Videos can be divided into different video types according to their video content. (Such as news programs, sports programs, variety shows, etc.).
  • the inventor realized that when general video software plays video content, regardless of any video type, it will run on the front end of the smart terminal and perform video playback.
  • the display screen of the smart terminal needs to be bright for playback, resulting in the video server being unable to display the video on the smart terminal. Perform power saving control.
  • the embodiments of the present application provide a method, device, computer equipment and storage medium for implementing playback control based on video types, aiming to solve the problem that the video software of the smart terminal in the prior art will play video content regardless of any video type.
  • the display of the smart terminal needs to be on for a long time to play, which causes the problem that the video server cannot perform power saving control on the smart terminal.
  • an embodiment of the present application provides a method for implementing playback control based on a video type, which includes:
  • the currently to-be-played video file does not include the program type tag, input the selected multiple frames of video images in the currently-to-be-played video file to a pre-trained convolutional neural network network to obtain the video program type;
  • the currently to-be-played video file includes a program type tag
  • the play mode information corresponding to the currently-to-be-played video file is obtained and sent to the corresponding user terminal; wherein, the video type play strategy Including a variety of video program types, and one-to-one corresponding playback mode information for each video program type; and
  • the currently-to-be-played video file is decomposed into corresponding audio-visual files and/or video files to assemble into corresponding playback data And sent to the user terminal.
  • an embodiment of the present application provides a video type-based playback control implementation device, which includes:
  • the video selection unit is configured to, if a video selection instruction is detected, obtain a current to-be-played video file corresponding to the video selection instruction;
  • a video tag determining unit configured to determine whether the currently to-be-played video file includes a video program type tag
  • the first program type acquiring unit is configured to, if the currently to-be-played video file does not include a program type tag, input the selected multiple frames of video images in the currently-to-be-played video file to a pre-trained convolutional neural network, Get the type of video program;
  • the second program type acquiring unit is configured to, if the currently to-be-played video file includes a program type tag, acquire the video program type corresponding to the currently-to-be-played video file according to the program type tag;
  • the playing mode information obtaining unit is configured to obtain the playing mode information corresponding to the current to-be-played video file according to the video program type corresponding to the current to-be-played video file and the preset video type playing strategy, and send it to the corresponding user terminal;
  • the video type play strategy includes multiple video program types, and play mode information corresponding to each video program type one-to-one;
  • the playback data sending unit is configured to, if an agreed operation instruction corresponding to the playback mode information of the currently to-be-played video file is detected, decompose the currently-to-be-played video file into corresponding audio-visual files and/or video files, It can be assembled into corresponding playback data and sent to the user terminal.
  • an embodiment of the present application provides a computer device, which includes a memory, a processor, and a computer program stored on the memory and running on the processor, and the processor executes the computer
  • a video type-based playback control implementation method is implemented during the program.
  • the method includes: if a video selection instruction is detected, obtaining a current to-be-played video file corresponding to the video selection instruction; and judging whether the currently-to-be-played video file includes a video Program type tag; if the currently to-be-played video file does not include a program type tag, input the selected multiple frames of video images in the currently-to-be-played video file to a pre-trained convolutional neural network to obtain the video program type; If the currently to-be-played video file includes a program type tag, obtain the video program type corresponding to the currently-to-be-played video file according to the program type tag; play according to the video program type corresponding to the current to-be-played video file and the preset video type Strategy, to obtain the playback mode information corresponding to the currently to-be-played video file and send it to the corresponding user terminal; wherein, the video type playback strategy includes multiple video program types and one-to-one playback corresponding to each
  • the embodiments of the present application also provide a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the processor executes a video-based Type of playback control implementation method, the method includes: if a video selection instruction is detected, obtaining a current to-be-played video file corresponding to the video selection instruction; judging whether the current to-be-played video file includes a video program type tag; if If the currently to-be-played video file does not include the program type tag, the selected multi-frame video image in the currently-to-be-played video file is input to a pre-trained convolutional neural network network to obtain the video program type;
  • the video file to be played includes a program type tag, and the video program type corresponding to the current to-be-played video file is obtained according to the program type tag; according to the video program type corresponding to the currently-to-be-played video file and the preset video
  • the embodiments of the present application provide a method, a device, a computer device, and a storage medium for implementing playback control based on a video type.
  • the method realizes that based on the video type of the current to-be-played video file, the playing mode information and the corresponding playing data are obtained, and the power saving control of the user terminal is realized according to the playing mode information.
  • FIG. 1 is a schematic diagram of an application scenario of a method for implementing playback control based on a video type provided by an embodiment of the application;
  • FIG. 2 is a schematic flowchart of a method for implementing video type-based playback control provided by an embodiment of the application
  • FIG. 3 is a schematic diagram of a sub-flow of a method for implementing video type-based playback control provided by an embodiment of the application;
  • FIG. 4 is a schematic diagram of another sub-flow of the method for implementing video type-based playback control according to an embodiment of the application;
  • FIG. 5 is a schematic block diagram of a device for implementing playback control based on video types according to an embodiment of the application
  • FIG. 6 is a schematic block diagram of subunits of the device for implementing video type-based playback control according to an embodiment of the application
  • FIG. 7 is a schematic block diagram of another subunit of the device for implementing video type-based playback control according to an embodiment of the application.
  • FIG. 8 is a schematic block diagram of a computer device provided by an embodiment of the application.
  • Figure 1 is a schematic diagram of an application scenario of a method for implementing video type-based playback control provided by an embodiment of this application
  • Figure 2 is a schematic flowchart of a method for implementing video type-based playback control provided by an embodiment of this application
  • the method for implementing playback control based on video types is applied to a server, and the method is executed by application software installed in the server.
  • the method includes steps S110 to S160.
  • the video file to be played when the user opens the user interaction interface of the video software on the user terminal (such as a smart phone, a tablet computer, etc.), a video file to be played needs to be selected. Once the video file to be played is selected, the user terminal will This video selection instruction is sent to the video server. The video server obtains the video selection instruction, and correspondingly obtains the current to-be-played video file.
  • the video file to be played can be a video file with a limited duration (such as a movie, a short video, etc., and the video file with a limited duration can be a video file that includes a single video program type, or it can include multiple video programs. Types of video files), or live video (for example, the online program of TV A, the live video broadcasts different types of video programs in different time periods).
  • S120 Determine whether the currently to-be-played video file includes a video program type tag.
  • the currently to-be-played video file selected by the user may have been set with a video program type tag, or it may not have been set, in order to determine the playback of the currently-to-be-played video file in the subsequent steps Method, you need to determine the video program type first according to the video program type tag or video content recognition.
  • the video server is required to identify the video program type of the currently-to-be-played video file.
  • the video server obtains the currently to-be-played video file, in order to determine the type of its video program (such as news programs, real-time review programs, recitation programs, sports programs, variety shows, movie programs, etc.), it is necessary to set the currently-to-be-played video files
  • the video file is split into video to obtain multiple frames of video images. Then, according to the pre-trained convolutional neural network network, the multi-frame video images obtained by the split can be identified, so as to obtain the corresponding video program type.
  • step S130 includes:
  • S131 Acquire multiple frames of video images in the video image set corresponding to the currently to-be-played video file at a preset interval number, as a target image set;
  • S134 Locate the separation time point corresponding to each separator inserted in the program type sequence, and divide the current to-be-played video file into corresponding play segments through each separation time point;
  • the preset number of intervals may be set to 12.
  • a 1 second video can be split into 24 frames of images, and the current video file to be played with a duration of N minutes can be split into 24N video images.
  • the first frame of video image can be used as the starting point, and the 13th frame can also be selected.
  • 25,..., 1+12n (where 1+12n ⁇ 24N) frames of video images form the target image set.
  • another implementation manner may be adopted, that is, multiple frames of video images in the video image set corresponding to the current to-be-played video file are randomly obtained as the target image set.
  • the target image set is input to the convolutional neural network
  • the program type corresponding to each frame of the video image in the target image set can be obtained through the convolutional neural network, and then the convolutional neural network obtains all The total number of types in the program types corresponding to each frame of video image in the target image set.
  • the total number of types is greater than 1, it means that there are two or more program types in the current to-be-played video file.
  • you can locate the program type sequence at this time For the separation time point corresponding to each of the inserted separators, the current to-be-played video file is divided into corresponding play segments through each separation time point;
  • the program type sequence is [News Program News Program News Program Sports Program Sports Program Sports Program Variety Program Variety Program Variety Program Variety Program Variety Program Variety Program...Variety Program]
  • the program type sequence after inserting the separator is [News Program News Program News Program News Program
  • the first separator is between the news program in the fourth place and the sports program in the fifth place
  • the second separator is between the sports program in the eighth place and the variety show in the ninth place.
  • the video program types corresponding to each playback segment in the current to-be-played video file are sequentially acquired.
  • the convolutional neural network can effectively identify the video program type included in the currently to-be-played video file, so as to facilitate subsequent control of the video file playback mode according to the video program type.
  • the method before step S130 or step S110, the method further includes:
  • the image feature of the movie program is that the image feature in the movie scene is the upper and lower or left and right black borders in the movie scene, so the right of the image in the image can be increased.
  • the upper and lower or left and right black borders are used as image feature data;
  • the image characteristics of news programs are the upper left corner of the station logo and the lower subtitles in the news scene, so the station logo and subtitles can be extracted as image feature data.
  • step S132 the method further includes:
  • the total number of types is equal to 1, it means that there is only one program type in the current to-be-played video file, and there is no need to divide the current to-be-played video file to identify the corresponding video program type.
  • S140 If the currently to-be-played video file includes a program type tag, obtain the video program type corresponding to the currently-to-be-played video file according to the program type tag.
  • the corresponding video program type can be directly extracted, without the need to use the convolutional neural network as a recognition model to obtain the current to be played
  • the program type corresponding to each picture in the target image set in the video file is, if the program type tag of the current video file to be played has been set.
  • the video program type corresponding to the currently to-be-played video file and the preset video type playback strategy obtain the playback mode information corresponding to the currently-to-be-played video file and send it to the corresponding user terminal; wherein, the video type
  • the play strategy includes a variety of video program types, and the play mode information corresponding to each video program type one-to-one.
  • the video type play strategy can be preset as the following Table 1:
  • the video type playback strategy can be preset as the following Table 2:
  • Video program type Play mode information 1 News program Audio playback + background operation 2 Real-time comment on the show Audio playback + background operation 3 Recitation program Audio playback + background operation 4 Sports programs Video playback 5 variety show Video playback 6 Movie show Video playback ... ... ...
  • the difference between the above two specific embodiments of the video playback strategy is that the audio playback type video program shown in Table 1 is played when the user terminal screen is turned off to save power.
  • the audio playback type video program is as follows: The second method in Table 2 is to suspend playback in the background of the user terminal so that the user can operate the user terminal to perform other operations (such as browsing web pages).
  • the video server sends the information about the play mode of the currently to-be-played video file to the user terminal
  • the consent operation instruction is triggered, and the consent operation instruction is sent by the user terminal To the video server.
  • the video server detects the consent operation instruction, it decomposes the currently to-be-played video file into corresponding audio-visual files and/or video files and sends them to the user terminal.
  • step S160 includes:
  • the audio playback control parameter is used It is used to control the display screen of the user terminal to turn on or off, or it is used to control the player corresponding to the current video file to be played to run in the front end or hang in the background;
  • the audio data of the first playback segment and the audio playback control parameters, and the video data of the second playback segment are assembled into playback data.
  • the audio data of the first playback segment and the audio playback control parameters (such as controlling the user The display screen of the terminal is turned off or the player is suspended in the background); if the second playback segment of the video playback mode exists in the playback mode information corresponding to the currently to-be-played video file, the video data of the second playback segment is acquired; and finally The audio data of the first playback segment, the audio playback control parameters, and the video data of the second playback segment are assembled into playback data and sent to the user terminal.
  • the audio data and audio playback control parameters of the first playback segment are not null values; if the currently to-be-played If the first playback segment of the audio playback mode does not exist in the playback mode information corresponding to the video file, the audio data and audio playback control parameters of the first playback segment are null values.
  • the video data corresponding to the second playback segment is not a null value; if the currently-to-be-played video file corresponds to If the second playback segment of the video playback mode does not exist in the playback mode information, the video data corresponding to the second playback segment is a null value.
  • the audio data of the first playback segment and the audio playback control parameters, and the video data of the second playback segment are assembled into playback data and sent to the user terminal, it can be based on whether there is a second playback mode information corresponding to the current video file to be played. Both the one playback terminal and/or the second playback terminal can combine the playback data corresponding to the currently to-be-played video file, and there is no error.
  • a smart playback can be realized, which can save power immediately when the display screen is turned off or the background hangs in response to audio playback, without the need to keep on It consumes electricity.
  • the user terminal after sending the playback data to the user terminal for playback, you can also determine whether the video needs to be played (turn off the screen) according to the location of the user terminal. For example, it is detected that the user puts the user terminal in a pocket or a suitcase, and then pauses the video playback. ,
  • the initiator of this control method is the user terminal itself, not the video server. In this way, the user terminal can also be effectively controlled to save power.
  • the method realizes that based on the video type of the current to-be-played video file, the playing mode information and the corresponding playing data are obtained, and the power saving control of the user terminal is realized according to the playing mode information.
  • the embodiment of the present application also provides a device for implementing playback control based on a video type, and the device for implementing playback control based on a video type is used to execute any embodiment of the foregoing method for implementing playback control based on a video type.
  • FIG. 5 is a schematic block diagram of a video type-based playback control implementation device provided by an embodiment of the present application.
  • the device 100 for implementing playback control based on video types may be configured in a server.
  • the device 100 for implementing playback control based on video type includes a video selection unit 110, a video tag determination unit 120, a first program type acquisition unit 130, a second program type acquisition unit 140, a playback mode information acquisition unit 150, Play data sending unit 160.
  • the video selection unit 110 is configured to, if a video selection instruction is detected, obtain a current to-be-played video file corresponding to the video selection instruction.
  • the video file to be played when the user opens the user interaction interface of the video software on the user terminal (such as a smart phone, a tablet computer, etc.), a video file to be played needs to be selected. Once the video file to be played is selected, the user terminal will This video selection instruction is sent to the video server. The video server obtains the video selection instruction, and correspondingly obtains the current to-be-played video file.
  • the video file to be played can be a video file with a limited duration (such as a movie, a short video, etc., and the video file with a limited duration can be a video file that includes a single video program type, or it can include multiple video programs. Types of video files), or live video (for example, the online program of TV A, the live video broadcasts different types of video programs in different time periods).
  • the video tag determining unit 120 is configured to determine whether the currently to-be-played video file includes a video program type tag.
  • the currently to-be-played video file selected by the user may have been set with a video program type tag, or it may not have been set, in order to determine the playback of the currently-to-be-played video file in the subsequent steps Method, you need to determine the video program type first according to the video program type tag or video content recognition.
  • the first program type acquiring unit 130 is configured to, if the currently to-be-played video file does not include a program type tag, input the selected multiple frames of video images in the currently-to-be-played video file to a pre-trained convolutional neural network network , Get the video program type.
  • the video server is required to identify the video program type of the currently-to-be-played video file.
  • the video server obtains the currently to-be-played video file, in order to determine the type of its video program (such as news programs, real-time review programs, recitation programs, sports programs, variety shows, movie programs, etc.), it is necessary to set the currently-to-be-played video files
  • the video file is split into video to obtain multiple frames of video images. Then, according to the pre-trained convolutional neural network, the split multi-frame video images can be identified, and the corresponding video program type can be obtained.
  • the first program type acquiring unit 130 includes:
  • the target image set obtaining unit 131 is configured to obtain multiple frames of video images in the video image set corresponding to the current to-be-played video file according to a preset number of intervals, as a target image set;
  • the program type identification unit 132 is configured to input the target image set to the convolutional neural network to obtain the program type corresponding to each frame of video image in the target image set;
  • the separating unit 133 is configured to, if the total number of types is greater than 1, sequentially obtain the program type corresponding to each frame of the video image in the target image set to form a program type sequence, and divide the two adjacent program types in the program type sequence Insert separators between different program types;
  • the playback segment dividing unit 134 is configured to locate the separation time point corresponding to each separator inserted in the program type sequence, and divide the current to-be-played video file into corresponding playback segments through each separation time point;
  • the playback segment type acquiring unit 135 is configured to sequentially acquire the video program type corresponding to each playback segment in the currently to-be-played video file.
  • the preset number of intervals may be set to 12.
  • a 1 second video can be split into 24 frames of images, and the current video file to be played with a duration of N minutes can be split into 24N video images.
  • the first frame of video image can be used as the starting point, and the 13th frame can also be selected.
  • 25,..., 1+12n (where 1+12n ⁇ 24N) frames of video images form the target image set.
  • another implementation manner may be adopted, that is, multiple frames of video images in the video image set corresponding to the current to-be-played video file are randomly obtained as the target image set.
  • the target image set is input to the convolutional neural network
  • the program type corresponding to each frame of the video image in the target image set can be obtained through the convolutional neural network, and then the convolutional neural network obtains all The total number of types in the program types corresponding to each frame of video image in the target image set.
  • the total number of types is greater than 1, it means that there are two or more program types in the current to-be-played video file.
  • you can locate the program type sequence at this time For the separation time point corresponding to each of the inserted separators, the current to-be-played video file is divided into corresponding play segments through each separation time point;
  • the program type sequence is [News Program News Program News Program Sports Program Sports Program Sports Program Variety Program Variety Program Variety Program Variety Program Variety Program Variety Program...Variety Program]
  • the program type sequence after inserting the separator is [News Program News Program News Program News Program
  • the first separator is between the news program in the fourth place and the sports program in the fifth place
  • the second separator is between the sports program in the eighth place and the variety show in the ninth place.
  • the video program types corresponding to each playback segment in the current to-be-played video file are sequentially acquired.
  • the convolutional neural network can effectively identify the video program type included in the currently to-be-played video file, so as to facilitate subsequent control of the video file playback mode according to the video program type.
  • the device 100 for implementing video type-based playback control further includes:
  • the video program type dividing unit 1101 is configured to divide the video program into multiple video program types
  • the video sample obtaining unit 1102 is used to obtain video samples of each video program type
  • the training set obtaining unit 1103 is configured to extract image feature data of each frame of image in the video sample as a training set
  • the model training unit 1104 is configured to input the training set to the convolutional neural network to be trained for training to obtain the convolutional neural network.
  • the image feature of the movie program is that the image feature in the movie scene is the upper and lower or left and right black borders in the movie scene, so the right of the image in the image can be increased.
  • the upper and lower or left and right black borders are used as image feature data;
  • the image characteristics of news programs are the upper left corner of the station logo and the lower subtitles in the news scene, so the station logo and subtitles can be extracted as image feature data.
  • the first program type acquiring unit 130 further includes:
  • the current program type obtaining unit 136 is configured to obtain the video program type corresponding to the current video file to be played if the total number of types is equal to one.
  • the total number of types is equal to 1, it means that there is only one program type in the current to-be-played video file, and there is no need to divide the current to-be-played video file to identify the corresponding video program type.
  • the second program type obtaining unit 140 is configured to, if the currently to-be-played video file includes a program type tag, obtain the video program type corresponding to the currently-to-be-played video file according to the program type tag.
  • the corresponding video program type can be directly extracted, without the need to use the convolutional neural network as a recognition model to obtain the current to be played
  • the program type corresponding to each picture in the target image set in the video file is, if the program type tag of the current video file to be played has been set.
  • the playing mode information obtaining unit 150 is configured to obtain the playing mode information corresponding to the current to-be-played video file according to the video program type corresponding to the current to-be-played video file and the preset video type playing strategy and send it to the corresponding user terminal ;
  • the video type play strategy includes multiple video program types, and play mode information corresponding to each video program type one-to-one.
  • the video type play strategy may be preset as shown in Table 1 above: as another implementation manner, the video type play strategy may be preset as shown in Table 2 above.
  • the difference between the above two specific embodiments of the video playback strategy is that the audio playback type video program shown in Table 1 is played when the user terminal screen is turned off to save power.
  • the audio playback type video program is as follows: The second method in Table 2 is to suspend playback in the background of the user terminal so that the user can operate the user terminal to perform other operations (such as browsing web pages).
  • the play data sending unit 160 is configured to, if an agreed operation instruction corresponding to the play mode information of the currently to-be-played video file is detected, decompose the currently-to-be-played video file into corresponding audio-visual files and/or video files , To assemble the corresponding playback data and send it to the user terminal.
  • the video server sends the information about the play mode of the currently to-be-played video file to the user terminal
  • the consent operation instruction is triggered, and the consent operation instruction is sent by the user terminal To the video server.
  • the video server detects the consent operation instruction, it decomposes the currently to-be-played video file into corresponding audio-visual files and/or video files and sends them to the user terminal.
  • the play data sending unit 160 includes:
  • the first acquiring unit is configured to acquire audio data of the first playback segment and corresponding audio playback control parameters if there is a first playback segment of the audio playback mode in the playback mode corresponding to the playback mode information of the currently to-be-played video file;
  • the audio playback control parameter is used to control the display screen of the user terminal to turn on or off, or to control the player corresponding to the video file currently to be played to run in the front end or hang in the background;
  • the second obtaining unit is configured to obtain the video data of the second playing section if there is a second playing section of the video playing mode in the playing mode corresponding to the playing mode information of the currently to-be-played video file;
  • the data combination unit is used to assemble the audio data of the first playback segment, the audio playback control parameters, and the video data of the second playback segment into playback data.
  • the audio data of the first playback segment and the audio playback control parameters (such as controlling the user The display screen of the terminal is turned off or the player is suspended in the background); if the second playback segment of the video playback mode exists in the playback mode information corresponding to the currently to-be-played video file, the video data of the second playback segment is acquired; and finally The audio data of the first playback segment, the audio playback control parameters, and the video data of the second playback segment are assembled into playback data and sent to the user terminal.
  • the audio data and audio playback control parameters of the first playback segment are not null values; if the currently to-be-played If the first playback segment of the audio playback mode does not exist in the playback mode information corresponding to the video file, the audio data and audio playback control parameters of the first playback segment are null values.
  • the video data corresponding to the second playback segment is not a null value; if the currently-to-be-played video file corresponds to If the second playback segment of the video playback mode does not exist in the playback mode information, the video data corresponding to the second playback segment is a null value.
  • the audio data of the first playback segment and the audio playback control parameters, and the video data of the second playback segment are assembled into playback data and sent to the user terminal, it can be based on whether there is a second playback mode information corresponding to the current video file to be played. Both the one playback terminal and/or the second playback terminal can combine the playback data corresponding to the currently to-be-played video file, and there is no error.
  • a smart playback can be realized, which can save power immediately when the display screen is turned off or the background hangs in response to audio playback, without the need to keep on It consumes electricity.
  • the user terminal after sending the playback data to the user terminal for playback, you can also determine whether the video needs to be played (turn off the screen) according to the location of the user terminal. For example, it is detected that the user puts the user terminal in a pocket or a suitcase, and then pauses the video playback. ,
  • the initiator of this control method is the user terminal itself, not the video server. In this way, the user terminal can also be effectively controlled to save power.
  • the device realizes that based on the video type of the current to-be-played video file, it obtains its playing mode information and corresponding playing data, and realizes power-saving control of the user terminal according to the playing mode information.
  • the foregoing device for implementing playback control based on the video type may be implemented in the form of a computer program, and the computer program may run on the computer device as shown in FIG. 8.
  • FIG. 8 is a schematic block diagram of a computer device according to an embodiment of the present application.
  • the computer device 500 is a server.
  • the server can be an independent server or a server cluster composed of multiple servers to implement a video type-based playback control method, where the method includes: if a video selection instruction is detected, acquiring and The current to-be-played video file corresponding to the video selection instruction; determine whether the current to-be-played video file includes a video program type tag; if the current to-be-played video file does not include a program type tag, the current to-be-played video file The selected multi-frame video images are input to the pre-trained convolutional neural network to obtain the video program type; if the currently to-be-played video file includes a program type tag, the video file to be played is obtained according to the program type tag.
  • Corresponding video program type according to the video program type corresponding to the current to-be-played video file and the preset video type playback strategy, obtain the playback mode information corresponding to the currently-to-be-played video file and send it to the corresponding user terminal; wherein, The video type play strategy includes multiple video program types, and play mode information corresponding to each video program type one-to-one; and if an agreed operation instruction corresponding to the play mode information of the currently to-be-played video file is detected, the The current to-be-played video file is decomposed into corresponding audio-visual files and/or video files to be assembled into corresponding play data and sent to the user terminal.
  • the computer device 500 includes a processor 502, a memory, and a network interface 505 connected through a system bus 501, where the memory may include a non-volatile storage medium 503 and an internal memory 504.
  • the non-volatile storage medium 503 can store an operating system 5031 and a computer program 5032.
  • the processor 502 can execute the method for implementing playback control based on the video type.
  • the processor 502 is used to provide computing and control capabilities, and support the operation of the entire computer device 500.
  • the internal memory 504 provides an environment for the operation of the computer program 5032 in the non-volatile storage medium 503.
  • the processor 502 can make the processor 502 execute a video type-based playback control implementation method.
  • the network interface 505 is used for network communication, such as providing data information transmission.
  • the structure shown in FIG. 8 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device 500 to which the solution of the present application is applied.
  • the specific computer device 500 may include more or fewer components than shown in the figure, or combine certain components, or have a different component arrangement.
  • the processor 502 is configured to run a computer program 5032 stored in a memory to implement the video type-based playback control implementation method disclosed in the embodiment of the present application, where the method at least includes:
  • the embodiment of the computer device shown in FIG. 8 does not constitute a limitation on the specific configuration of the computer device.
  • the computer device may include more or less components than those shown in the figure. Or some parts are combined, or different parts are arranged.
  • the computer device may only include a memory and a processor. In such an embodiment, the structures and functions of the memory and the processor are consistent with the embodiment shown in FIG. 8 and will not be repeated here.
  • the processor 502 may be a central processing unit (Central Processing Unit, CPU), and the processor 502 may also be other general-purpose processors, digital signal processors (Digital Signal Processors, DSPs), and special purpose processors.
  • Integrated circuit Application Specific Integrated Circuit, ASIC
  • off-the-shelf programmable gate array Field-Programmable Gate Array, FPGA
  • the general-purpose processor may be a microprocessor or the processor may also be any conventional processor.
  • a computer-readable storage medium may be a volatile medium or a non-volatile computer-readable storage medium.
  • the computer-readable storage medium stores a computer program, where the computer program is executed by a processor to implement the video type-based playback control implementation method disclosed in the embodiments of the present application, wherein the method at least includes: if a video selection instruction is detected, Obtain the current to-be-played video file corresponding to the video selection instruction; determine whether the current to-be-played video file includes a video program type tag; if the current to-be-played video file does not include a program type tag, set the currently to be played
  • the selected multi-frame video images in the video file are input to the pre-trained convolutional neural network network to obtain the video program type; if the currently to-be-played video file includes a program type tag, it is obtained according to the program type tag and the current to-be-played
  • the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a storage medium.
  • the technical solution of this application is essentially or the part that contributes to the existing technology, or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium. It includes several instructions to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the methods described in the various embodiments of the present application.
  • the aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), magnetic disk or optical disk and other media that can store program codes.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Databases & Information Systems (AREA)
  • Television Signal Processing For Recording (AREA)

Abstract

Disclosed in the present invention are a video type-based playback control implementation method and apparatus, a computer device and a storage medium. Said method comprises: acquiring a video file to be played back currently corresponding to a video selection instruction; if said video file does not comprise a program type tag, inputting a plurality of selected video images into a convolutional neural network to obtain a video program type; according to the video program type corresponding to said video file and a video type playback strategy, acquiring corresponding playback mode information and sending same to a corresponding user terminal; and if a corresponding operation approval instruction is detected, decomposing said video file into a corresponding audio-video file and/or video file, assembling same into corresponding playback data and sending same to the user terminal. Said method achieves acquiring, on the basis of the video type of a video file to be played back currently, playback mode information and corresponding playback data, and achieves power saving control on the user terminal according to the playback mode information.

Description

基于视频类型的播放控制实现方法、装置及计算机设备Method, device and computer equipment for implementing playback control based on video type
本申请要求于2019年9月11日提交中国专利局、申请号为201910858320.3,发明名称为“基于视频类型的播放控制实现方法、装置及计算机设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on September 11, 2019, the application number is 201910858320.3, and the invention title is "Video type-based playback control implementation methods, devices, and computer equipment". The entire content is approved The reference is incorporated in this application.
技术领域Technical field
本申请涉及图像识别技术领域,尤其涉及一种基于视频类型的播放控制实现方法、装置、计算机设备及存储介质。This application relates to the field of image recognition technology, and in particular to a method, device, computer equipment, and storage medium for implementing video type-based playback control.
背景技术Background technique
目前,用户越来越多的使用智能终端的视频软件在线观看视频(即视频服务器将视频的流数据发送至智能终端的本地缓存以进行播放),视频根据其视频内容可分为不同的视频类型(如新闻节目、体育节目、综艺节目等)。发明人意识到,一般视频软件在播放视频内容时是无论任何视频类型,均会在智能终端的前端运行并进行视频播放,智能终端的显示屏需长亮进行播放,导致视频服务器无法对智能终端进行节电控制。At present, more and more users use the video software of smart terminals to watch videos online (that is, the video server sends the streaming data of the video to the local buffer of the smart terminal for playback). Videos can be divided into different video types according to their video content. (Such as news programs, sports programs, variety shows, etc.). The inventor realized that when general video software plays video content, regardless of any video type, it will run on the front end of the smart terminal and perform video playback. The display screen of the smart terminal needs to be bright for playback, resulting in the video server being unable to display the video on the smart terminal. Perform power saving control.
发明内容Summary of the invention
本申请实施例提供了一种基于视频类型的播放控制实现方法、装置、计算机设备及存储介质,旨在解决现有技术中智能终端的视频软件在播放视频内容时是无论任何视频类型,均会在智能终端的前端运行并进行视频播放,智能终端的显示屏需长亮进行播放,导致视频服务器无法对智能终端进行节电控制的问题。The embodiments of the present application provide a method, device, computer equipment and storage medium for implementing playback control based on video types, aiming to solve the problem that the video software of the smart terminal in the prior art will play video content regardless of any video type. When running and playing video at the front end of the smart terminal, the display of the smart terminal needs to be on for a long time to play, which causes the problem that the video server cannot perform power saving control on the smart terminal.
第一方面,本申请实施例提供了一种基于视频类型的播放控制实现方法,其包括:In the first aspect, an embodiment of the present application provides a method for implementing playback control based on a video type, which includes:
若检测到视频选择指令,获取与所述视频选择指令对应的当前待播放视频文件;If a video selection instruction is detected, obtain the current to-be-played video file corresponding to the video selection instruction;
判断所述当前待播放视频文件是否包括视频节目类型标签;Judging whether the currently to-be-played video file includes a video program type tag;
若所述当前待播放视频文件不包括节目类型标签,将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型;If the currently to-be-played video file does not include the program type tag, input the selected multiple frames of video images in the currently-to-be-played video file to a pre-trained convolutional neural network network to obtain the video program type;
若所述当前待播放视频文件包括节目类型标签,根据节目类型标签获取与所述当前待播放视频文件对应的视频节目类型;If the currently to-be-played video file includes a program type tag, obtain the video program type corresponding to the currently-to-be-played video file according to the program type tag;
根据当前待播放视频文件对应的视频节目类型及预先设置的视频类型播放策略,获取与所述当前待播放视频文件对应的播放方式信息并发送至对应的用户终端;其中,所述视频类型播放策略包括多种视频节目类型,及与各视频节目类型一一对应的播放方式信息;以及According to the video program type corresponding to the currently to-be-played video file and the preset video type play strategy, the play mode information corresponding to the currently-to-be-played video file is obtained and sent to the corresponding user terminal; wherein, the video type play strategy Including a variety of video program types, and one-to-one corresponding playback mode information for each video program type; and
若检测到与所述当前待播放视频文件的播放方式信息对应的同意操作指令,将与所述当前待播放视频文件分解为对应的音视文件和/或视频文件,以组装成对应的播放数据并发送至用户终端。If the consent operation instruction corresponding to the playing mode information of the currently to-be-played video file is detected, the currently-to-be-played video file is decomposed into corresponding audio-visual files and/or video files to assemble into corresponding playback data And sent to the user terminal.
第二方面,本申请实施例提供了一种基于视频类型的播放控制实现装置,其包括:In the second aspect, an embodiment of the present application provides a video type-based playback control implementation device, which includes:
视频选择单元,用于若检测到视频选择指令,获取与所述视频选择指令对应的当前待播放视频文件;The video selection unit is configured to, if a video selection instruction is detected, obtain a current to-be-played video file corresponding to the video selection instruction;
视频标签判断单元,用于判断所述当前待播放视频文件是否包括视频节目类型标签;A video tag determining unit, configured to determine whether the currently to-be-played video file includes a video program type tag;
第一节目类型获取单元,用于若所述当前待播放视频文件不包括节目类型标签,将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型;The first program type acquiring unit is configured to, if the currently to-be-played video file does not include a program type tag, input the selected multiple frames of video images in the currently-to-be-played video file to a pre-trained convolutional neural network, Get the type of video program;
第二节目类型获取单元,用于若所述当前待播放视频文件包括节目类型标签,根据节目类型标签获取与所述当前待播放视频文件对应的视频节目类型;The second program type acquiring unit is configured to, if the currently to-be-played video file includes a program type tag, acquire the video program type corresponding to the currently-to-be-played video file according to the program type tag;
播放方式信息获取单元,用于根据当前待播放视频文件对应的视频节目类型及预先设置 的视频类型播放策略,获取与所述当前待播放视频文件对应的播放方式信息并发送至对应的用户终端;其中,所述视频类型播放策略包括多种视频节目类型,及与各视频节目类型一一对应的播放方式信息;以及The playing mode information obtaining unit is configured to obtain the playing mode information corresponding to the current to-be-played video file according to the video program type corresponding to the current to-be-played video file and the preset video type playing strategy, and send it to the corresponding user terminal; Wherein, the video type play strategy includes multiple video program types, and play mode information corresponding to each video program type one-to-one; and
播放数据发送单元,用于若检测到与所述当前待播放视频文件的播放方式信息对应的同意操作指令,将与所述当前待播放视频文件分解为对应的音视文件和/或视频文件,以组装成对应的播放数据并发送至用户终端。The playback data sending unit is configured to, if an agreed operation instruction corresponding to the playback mode information of the currently to-be-played video file is detected, decompose the currently-to-be-played video file into corresponding audio-visual files and/or video files, It can be assembled into corresponding playback data and sent to the user terminal.
第三方面,本申请实施例又提供了一种计算机设备,其包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现基于视频类型的播放控制实现方法,所述方法包括:若检测到视频选择指令,获取与所述视频选择指令对应的当前待播放视频文件;判断所述当前待播放视频文件是否包括视频节目类型标签;若所述当前待播放视频文件不包括节目类型标签,将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型;若所述当前待播放视频文件包括节目类型标签,根据节目类型标签获取与所述当前待播放视频文件对应的视频节目类型;根据当前待播放视频文件对应的视频节目类型及预先设置的视频类型播放策略,获取与所述当前待播放视频文件对应的播放方式信息并发送至对应的用户终端;其中,所述视频类型播放策略包括多种视频节目类型,及与各视频节目类型一一对应的播放方式信息;以及若检测到与所述当前待播放视频文件的播放方式信息对应的同意操作指令,将与所述当前待播放视频文件分解为对应的音视文件和/或视频文件,以组装成对应的播放数据并发送至用户终端。In a third aspect, an embodiment of the present application provides a computer device, which includes a memory, a processor, and a computer program stored on the memory and running on the processor, and the processor executes the computer A video type-based playback control implementation method is implemented during the program. The method includes: if a video selection instruction is detected, obtaining a current to-be-played video file corresponding to the video selection instruction; and judging whether the currently-to-be-played video file includes a video Program type tag; if the currently to-be-played video file does not include a program type tag, input the selected multiple frames of video images in the currently-to-be-played video file to a pre-trained convolutional neural network to obtain the video program type; If the currently to-be-played video file includes a program type tag, obtain the video program type corresponding to the currently-to-be-played video file according to the program type tag; play according to the video program type corresponding to the current to-be-played video file and the preset video type Strategy, to obtain the playback mode information corresponding to the currently to-be-played video file and send it to the corresponding user terminal; wherein, the video type playback strategy includes multiple video program types and one-to-one playback corresponding to each video program type Mode information; and if an agreed operation instruction corresponding to the playback mode information of the current to-be-played video file is detected, the current to-be-played video file is decomposed into corresponding audio-visual files and/or video files to be assembled into The corresponding playback data is sent to the user terminal.
第四方面,本申请实施例还提供了一种计算机可读存储介质,其中所述计算机可读存储介质存储有计算机程序,所述计算机程序当被处理器执行时使所述处理器执行基于视频类型的播放控制实现方法,所述方法包括:若检测到视频选择指令,获取与所述视频选择指令对应的当前待播放视频文件;判断所述当前待播放视频文件是否包括视频节目类型标签;若所述当前待播放视频文件不包括节目类型标签,将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型;若所述当前待播放视频文件包括节目类型标签,根据节目类型标签获取与所述当前待播放视频文件对应的视频节目类型;根据当前待播放视频文件对应的视频节目类型及预先设置的视频类型播放策略,获取与所述当前待播放视频文件对应的播放方式信息并发送至对应的用户终端;其中,所述视频类型播放策略包括多种视频节目类型,及与各视频节目类型一一对应的播放方式信息;以及若检测到与所述当前待播放视频文件的播放方式信息对应的同意操作指令,将与所述当前待播放视频文件分解为对应的音视文件和/或视频文件,以组装成对应的播放数据并发送至用户终端。In a fourth aspect, the embodiments of the present application also provide a computer-readable storage medium, wherein the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, the processor executes a video-based Type of playback control implementation method, the method includes: if a video selection instruction is detected, obtaining a current to-be-played video file corresponding to the video selection instruction; judging whether the current to-be-played video file includes a video program type tag; if If the currently to-be-played video file does not include the program type tag, the selected multi-frame video image in the currently-to-be-played video file is input to a pre-trained convolutional neural network network to obtain the video program type; The video file to be played includes a program type tag, and the video program type corresponding to the current to-be-played video file is obtained according to the program type tag; according to the video program type corresponding to the currently-to-be-played video file and the preset video type playback strategy, the data is obtained The playing mode information corresponding to the current video file to be played is sent to the corresponding user terminal; wherein, the video type playing strategy includes multiple video program types, and the playing mode information corresponding to each video program type one-to-one; and if The consent operation instruction corresponding to the playing mode information of the currently to-be-played video file is detected, and the currently-to-be-played video file is decomposed into corresponding audio-visual files and/or video files to assemble into corresponding playback data and Send to the user terminal.
本申请实施例提供了一种基于视频类型的播放控制实现方法、装置、计算机设备及存储介质。该方法实现了基于当前待播放视频文件的视频类型而获取其播放方式信息和对应的播放数据,根据播放方式信息实现对用户终端的节电控制。The embodiments of the present application provide a method, a device, a computer device, and a storage medium for implementing playback control based on a video type. The method realizes that based on the video type of the current to-be-played video file, the playing mode information and the corresponding playing data are obtained, and the power saving control of the user terminal is realized according to the playing mode information.
附图说明Description of the drawings
图1为本申请实施例提供的基于视频类型的播放控制实现方法的应用场景示意图;FIG. 1 is a schematic diagram of an application scenario of a method for implementing playback control based on a video type provided by an embodiment of the application;
图2为本申请实施例提供的基于视频类型的播放控制实现方法的流程示意图;2 is a schematic flowchart of a method for implementing video type-based playback control provided by an embodiment of the application;
图3为本申请实施例提供的基于视频类型的播放控制实现方法的子流程示意图;3 is a schematic diagram of a sub-flow of a method for implementing video type-based playback control provided by an embodiment of the application;
图4为本申请实施例提供的基于视频类型的播放控制实现方法的另一子流程示意图;4 is a schematic diagram of another sub-flow of the method for implementing video type-based playback control according to an embodiment of the application;
图5为本申请实施例提供的基于视频类型的播放控制实现装置的示意性框图;FIG. 5 is a schematic block diagram of a device for implementing playback control based on video types according to an embodiment of the application;
图6为本申请实施例提供的基于视频类型的播放控制实现装置的子单元示意性框图;6 is a schematic block diagram of subunits of the device for implementing video type-based playback control according to an embodiment of the application;
图7为本申请实施例提供的基于视频类型的播放控制实现装置的另一子单元示意性框图;FIG. 7 is a schematic block diagram of another subunit of the device for implementing video type-based playback control according to an embodiment of the application; FIG.
图8为本申请实施例提供的计算机设备的示意性框图。FIG. 8 is a schematic block diagram of a computer device provided by an embodiment of the application.
具体实施方式detailed description
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描 述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be described clearly and completely in conjunction with the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, rather than all of them. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of this application.
请参阅图1和图2,图1为本申请实施例提供的基于视频类型的播放控制实现方法的应用场景示意图;图2为本申请实施例提供的基于视频类型的播放控制实现方法的流程示意图,该基于视频类型的播放控制实现方法应用于服务器中,该方法通过安装于服务器中的应用软件进行执行。Please refer to Figures 1 and 2. Figure 1 is a schematic diagram of an application scenario of a method for implementing video type-based playback control provided by an embodiment of this application; Figure 2 is a schematic flowchart of a method for implementing video type-based playback control provided by an embodiment of this application The method for implementing playback control based on video types is applied to a server, and the method is executed by application software installed in the server.
如图2所示,该方法包括步骤S110~S160。As shown in Figure 2, the method includes steps S110 to S160.
S110、若检测到视频选择指令,获取与所述视频选择指令对应的当前待播放视频文件。S110: If a video selection instruction is detected, obtain a current to-be-played video file corresponding to the video selection instruction.
在本实施例中,当用户在用户终端(如智能手机、平板电脑等)打开视频软件的用户交互界面时,需选定一个待播放视频文件,一旦选定了待播放视频文件,用户终端将此视频选择指令发送至视频服务器。视频服务器获取了所述视频选择指令,则对应的获取当前待播放视频文件。其中,待播放视频文件既可以是限定了时长的视频文件(如一部电影、一段短视频等,限定了时长的视频文件可以是包括单一视频节目类型的视频文件,也可以是包括多个视频节目类型的视频文件),也可以是直播视频(例如A电视台的在线节目,直播视频在不同的时间段播放不同类型的视频节目)。In this embodiment, when the user opens the user interaction interface of the video software on the user terminal (such as a smart phone, a tablet computer, etc.), a video file to be played needs to be selected. Once the video file to be played is selected, the user terminal will This video selection instruction is sent to the video server. The video server obtains the video selection instruction, and correspondingly obtains the current to-be-played video file. Among them, the video file to be played can be a video file with a limited duration (such as a movie, a short video, etc., and the video file with a limited duration can be a video file that includes a single video program type, or it can include multiple video programs. Types of video files), or live video (for example, the online program of TV A, the live video broadcasts different types of video programs in different time periods).
S120、判断所述当前待播放视频文件是否包括视频节目类型标签。S120: Determine whether the currently to-be-played video file includes a video program type tag.
在本实施例中,由于用户所选定的当前待播放视频文件可能已被设置有视频节目类型标签,也有可能还未被设置视频节目类型标签,为了后续步骤中确定当前待播放视频文件的播放方式,需先根据视频节目类型标签或视频内容识别来确定视频节目类型。In this embodiment, since the currently to-be-played video file selected by the user may have been set with a video program type tag, or it may not have been set, in order to determine the playback of the currently-to-be-played video file in the subsequent steps Method, you need to determine the video program type first according to the video program type tag or video content recognition.
S130、若所述当前待播放视频文件不包括节目类型标签,将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型。S130: If the currently to-be-played video file does not include the program type tag, input the selected multiple frames of video images in the currently-to-be-played video file to a pre-trained convolutional neural network network to obtain the video program type.
在本实施例中,若所述当前待播放视频文件不包括节目类型标签,表示需要视频服务器对当前待播放视频文件进行视频节目类型的识别。当视频服务器获取了所述当前待播放视频文件后,为了判断其视频节目类型(如新闻节目、实时点评节目、朗诵节目、体育节目、综艺节目、电影节目等),需要将所述当前待播放视频文件进行视频拆分,得到多帧视频图像。之后根据预先训练的卷积神经网络网络即可对拆分得到的多帧视频图像进行识别,从而得到对应的视频节目类型。In this embodiment, if the currently to-be-played video file does not include the program type tag, it means that the video server is required to identify the video program type of the currently-to-be-played video file. After the video server obtains the currently to-be-played video file, in order to determine the type of its video program (such as news programs, real-time review programs, recitation programs, sports programs, variety shows, movie programs, etc.), it is necessary to set the currently-to-be-played video files The video file is split into video to obtain multiple frames of video images. Then, according to the pre-trained convolutional neural network network, the multi-frame video images obtained by the split can be identified, so as to obtain the corresponding video program type.
在一实施例中,如图3所示,步骤S130包括:In an embodiment, as shown in FIG. 3, step S130 includes:
S131、按预设的间隔数获取所述当前待播放视频文件对应的视频图像集合中的多帧视频图像,以作为目标图像集;S131: Acquire multiple frames of video images in the video image set corresponding to the currently to-be-played video file at a preset interval number, as a target image set;
S132、将所述目标图像集输入至所述卷积神经网络,得到所述目标图像集中每一帧视频图像对应的节目类型;S132. Input the target image set to the convolutional neural network to obtain the program type corresponding to each frame of video image in the target image set;
S133、若所述类型总数量大于1,依序获取所述目标图像集中每一帧视频图像对应节目类型以组成节目类型序列,将所述节目类型序列中相邻两个节目类型存在不同的节目类型之间插入分隔符;S133. If the total number of types is greater than 1, sequentially obtain the program type corresponding to each frame of video image in the target image set to form a program type sequence, and divide two adjacent program types in the program type sequence into different programs Insert a separator between the types;
S134、定位节目类型序列中所插入的每一分隔符对应的分隔时间点,通过各分隔时间点将所述当前待播放视频文件划分为对应播放段;S134: Locate the separation time point corresponding to each separator inserted in the program type sequence, and divide the current to-be-played video file into corresponding play segments through each separation time point;
S135、依序获取所述当前待播放视频文件中各播放段所对应的视频节目类型。S135. Obtain video program types corresponding to each playback segment in the currently to-be-played video file in sequence.
在本实施例中,按预设的间隔数获取所述当前待播放视频文件对应的视频图像集合中的多帧视频图像时,可以将预设的间隔数设置为12。一般1秒钟的视频可以拆分为24帧图像,则N分钟时长的当前待播放视频文件可以拆分为24N张视频图像,此时可以第一帧视频图像为起始点,还选择第13,25,……,1+12n(其中1+12n≤24N)帧视频图像组成目标图像集。步骤S131在具体实施时,可以采用另一种实施方式,即随机获取所述当前待播放视频文件对应的视频图像集合中的多帧视频图像,以作为目标图像集。In this embodiment, when acquiring multiple frames of video images in the video image set corresponding to the currently to-be-played video file according to a preset number of intervals, the preset number of intervals may be set to 12. Generally, a 1 second video can be split into 24 frames of images, and the current video file to be played with a duration of N minutes can be split into 24N video images. At this time, the first frame of video image can be used as the starting point, and the 13th frame can also be selected. 25,..., 1+12n (where 1+12n≤24N) frames of video images form the target image set. In the specific implementation of step S131, another implementation manner may be adopted, that is, multiple frames of video images in the video image set corresponding to the current to-be-played video file are randomly obtained as the target image set.
之后将所述目标图像集输入至所述卷积神经网络,可通过所述卷积神经网络获取所述目 标图像集中每一帧视频图像对应的节目类型,然后统计所述卷积神经网络获取所述目标图像集中每一帧视频图像对应的节目类型中类型总数量。After that, the target image set is input to the convolutional neural network, the program type corresponding to each frame of the video image in the target image set can be obtained through the convolutional neural network, and then the convolutional neural network obtains all The total number of types in the program types corresponding to each frame of video image in the target image set.
当类型总数量大于1时,表示所述当前待播放视频文件中存在两个及以上的节目类型,此时为了将所述当前待播放视频文件进行有效的划分,此时可定位节目类型序列中所插入的每一分隔符对应的分隔时间点,通过各分隔时间点将所述当前待播放视频文件划分为对应播放段;When the total number of types is greater than 1, it means that there are two or more program types in the current to-be-played video file. At this time, in order to effectively divide the current to-be-played video file, you can locate the program type sequence at this time For the separation time point corresponding to each of the inserted separators, the current to-be-played video file is divided into corresponding play segments through each separation time point;
例如节目类型序列为[新闻节目 新闻节目 新闻节目 新闻节目 体育节目 体育节目 体育节目 体育节目 综艺节目 综艺节目 综艺节目 综艺节目 综艺节目……综艺节目]For example, the program type sequence is [News Program News Program News Program News Program Sports Program Sports Program Sports Program Sports Program Variety Program Variety Program Variety Program Variety Program Variety Program Variety Program...Variety Program]
插入分隔符后的节目类型序列为[新闻节目 新闻节目 新闻节目 新闻节目|体育节目 体育节目 体育节目 体育节目|综艺节目 综艺节目 综艺节目 综艺节目 综艺节目……综艺节目]。The program type sequence after inserting the separator is [News Program News Program News Program News Program|Sports Program Sports Program Sports Program Sports Program|Variety Program Variety Program Variety Program Variety Program Variety Program Variety Program Variety Program...Variety Program].
上述节目类型序列中第一个分隔符在第四位的新闻节目和第五位的体育节目之间,第二个分隔符在第八位的体育节目和第九位的综艺节目之间,此时分别获取第一分隔符对应的第一分隔时间点、及第二分隔符对应的第二分隔时间点(其中第一分隔时间点记作第五位的体育节目对应的起始时间,其中第二分隔时间点记作第九位的综艺节目对应的起始时间)。则第一位的新闻节目对应的起始时间点与第一分隔时间点之间记为第一播放段,第一分隔时间点与第二分隔时间点之间记为第二播放段,第二分隔时间点与最末位的综艺节目对应的结束时间点之间记为第三播放段。完成上述划分后,依序获取所述当前待播放视频文件中各播放段所对应的视频节目类型。通过所述卷积神经网络有效的对所述当前待播放视频文件中包括的视频节目类型进行有效识别,便于后续根据视频节目类型控制视频文件的播放方式。In the above program type sequence, the first separator is between the news program in the fourth place and the sports program in the fifth place, and the second separator is between the sports program in the eighth place and the variety show in the ninth place. Obtain the first separation time point corresponding to the first separator and the second separation time point corresponding to the second separator (where the first separation time point is recorded as the start time corresponding to the fifth place sports program, where the The two separate time points are recorded as the start time corresponding to the ninth variety show). Then the time between the start time point corresponding to the first news program and the first separation time point is recorded as the first play segment, and the time between the first separation time point and the second separation time point is recorded as the second play segment, and the second The interval between the separation time point and the end time point corresponding to the last variety show is recorded as the third play segment. After the above division is completed, the video program types corresponding to each playback segment in the current to-be-played video file are sequentially acquired. The convolutional neural network can effectively identify the video program type included in the currently to-be-played video file, so as to facilitate subsequent control of the video file playback mode according to the video program type.
在一实施例中,如图4所示,步骤S130或步骤S110之前还包括:In an embodiment, as shown in FIG. 4, before step S130 or step S110, the method further includes:
S1101、对视频节目划分多个视频节目类型;S1101: Divide the video program into multiple video program types;
S1102、获取每一个视频节目类型的视频样本;S1102: Obtain video samples of each video program type;
S1103、提取所述视频样本中的每帧图像的图像特征数据作为训练集;S1103: Extract image feature data of each frame of image in the video sample as a training set;
S1104、将所述训练集输入至待训练卷积神经网络进行训练,得到卷积神经网络。S1104. Input the training set to the convolutional neural network to be trained for training to obtain the convolutional neural network.
在本实施例中,在预先训练所述卷积神经网络时,例如,电影节目的图像特征是电影场景中的图像特征是电影场景中上下或者左右的黑边,因此可以提权该图像中的上下或左右的黑边作为图像特征数据;新闻节目的图像特征是新闻场景中的左上角的台标和下边的字幕,因此可以提取台标和字幕作为图像特征数据。通过上述方式训练出卷积神经网络后,即可用于识别视频图像对应的视频节目类型。In this embodiment, when the convolutional neural network is trained in advance, for example, the image feature of the movie program is that the image feature in the movie scene is the upper and lower or left and right black borders in the movie scene, so the right of the image in the image can be increased. The upper and lower or left and right black borders are used as image feature data; the image characteristics of news programs are the upper left corner of the station logo and the lower subtitles in the news scene, so the station logo and subtitles can be extracted as image feature data. After the convolutional neural network is trained in the above manner, it can be used to identify the video program type corresponding to the video image.
在一实施例中,步骤S132之后还包括:In an embodiment, after step S132, the method further includes:
S136、若所述类型总数量等于1,获取所述当前待播放视频文件对应的视频节目类型。S136: If the total number of types is equal to 1, obtain the video program type corresponding to the currently to-be-played video file.
在本实施例中,若类型总数量等于1时,表示所述当前待播放视频文件中仅存在一种节目类型,此时无需对所述当前待播放视频文件进行划分来识别对应的视频节目类型,直接以所述当前待播放视频文件的当前节目类型作为对应的视频节目类型。In this embodiment, if the total number of types is equal to 1, it means that there is only one program type in the current to-be-played video file, and there is no need to divide the current to-be-played video file to identify the corresponding video program type. , Directly use the current program type of the currently to-be-played video file as the corresponding video program type.
S140、若所述当前待播放视频文件包括节目类型标签,根据节目类型标签获取与所述当前待播放视频文件对应的视频节目类型。S140: If the currently to-be-played video file includes a program type tag, obtain the video program type corresponding to the currently-to-be-played video file according to the program type tag.
在本实施例中,即若当前待播放视频文件已被设置了节目类型标签,则直接提取对应的视频节目类型即可,无需通过所述卷积神经网络作为识别模型来获取所述当前待播放视频文件中目标图像集各图片对应的节目类型。In this embodiment, that is, if the program type tag of the current video file to be played has been set, the corresponding video program type can be directly extracted, without the need to use the convolutional neural network as a recognition model to obtain the current to be played The program type corresponding to each picture in the target image set in the video file.
S150、根据当前待播放视频文件对应的视频节目类型及预先设置的视频类型播放策略,获取与所述当前待播放视频文件对应的播放方式信息并发送至对应的用户终端;其中,所述视频类型播放策略包括多种视频节目类型,及与各视频节目类型一一对应的播放方式信息。S150. According to the video program type corresponding to the currently to-be-played video file and the preset video type playback strategy, obtain the playback mode information corresponding to the currently-to-be-played video file and send it to the corresponding user terminal; wherein, the video type The play strategy includes a variety of video program types, and the play mode information corresponding to each video program type one-to-one.
在本实施例中,所述视频类型播放策略可预先设置的为如下表1:In this embodiment, the video type play strategy can be preset as the following Table 1:
Figure PCTCN2020087026-appb-000001
Figure PCTCN2020087026-appb-000001
Figure PCTCN2020087026-appb-000002
Figure PCTCN2020087026-appb-000002
表1Table 1
作为另一种实施方式的视频类型播放策略可预先设置的为如下表2:As another implementation manner, the video type playback strategy can be preset as the following Table 2:
序号Serial number 视频节目类型Video program type 播放方式信息Play mode information
11 新闻节目News program 音频播放+后台运行Audio playback + background operation
22 实时点评节目Real-time comment on the show 音频播放+后台运行Audio playback + background operation
33 朗诵节目Recitation program 音频播放+后台运行Audio playback + background operation
44 体育节目Sports programs 视频播放Video playback
55 综艺节目variety show 视频播放Video playback
66 电影节目Movie show 视频播放Video playback
……... ……... ……...
表2Table 2
视频播放策略的上述两具体实施例不同之处在于,音频播放类的视频节目如表1的第一种方式是在用户终端屏幕关闭的情况下播放以实现省电,音频播放类的视频节目如表2的第二种方式是在用户终端的后台挂起播放以便于用户操作用户终端进行其他操作(如浏览网页页面)。The difference between the above two specific embodiments of the video playback strategy is that the audio playback type video program shown in Table 1 is played when the user terminal screen is turned off to save power. The audio playback type video program is as follows: The second method in Table 2 is to suspend playback in the background of the user terminal so that the user can operate the user terminal to perform other operations (such as browsing web pages).
S160、若检测到与所述当前待播放视频文件的播放方式信息对应的同意操作指令,将与所述当前待播放视频文件分解为对应的音视文件和/或视频文件,以组装成对应的播放数据并发送至用户终端。S160. If the consent operation instruction corresponding to the playing mode information of the currently to-be-played video file is detected, decompose the currently-to-be-played video file into corresponding audio-visual files and/or video files to assemble into corresponding Play the data and send it to the user terminal.
在本实施例中,当视频服务器将与所述当前待播放视频文件的播放方式信息发送至用户终端后,若用户选择同意该播放方式则触发同意操作指令,且该同意操作指令由用户终端发送至视频服务器。若视频服务器检测到同意操作指令,将所述当前待播放视频文件分解为对应的音视文件和/或视频文件发送至用户终端。In this embodiment, after the video server sends the information about the play mode of the currently to-be-played video file to the user terminal, if the user chooses to agree to the play mode, the consent operation instruction is triggered, and the consent operation instruction is sent by the user terminal To the video server. If the video server detects the consent operation instruction, it decomposes the currently to-be-played video file into corresponding audio-visual files and/or video files and sends them to the user terminal.
在一实施例中,步骤S160包括:In an embodiment, step S160 includes:
若所述当前待播放视频文件的播放方式信息所对应的播放方式中存在音频播放方式的第一播放段,获取第一播放段的音频数据以及对应音频播放控制参数;其中,音频播放控制参数用于控制用户终端的显示屏开启或关闭,或者是用于控制当前待播放视频文件对应的播放器在前端运行或后台挂起;If there is a first playback segment of the audio playback mode in the playback mode corresponding to the playback mode information of the currently to-be-played video file, the audio data of the first playback segment and the corresponding audio playback control parameters are acquired; wherein, the audio playback control parameter is used It is used to control the display screen of the user terminal to turn on or off, or it is used to control the player corresponding to the current video file to be played to run in the front end or hang in the background;
若所述当前待播放视频文件的播放方式信息所对应的播放方式中存在视频播放方式的第二播放段,获取第二播放段的视频数据;If there is a second play segment of the video play mode in the play mode corresponding to the play mode information of the currently to-be-played video file, acquiring the video data of the second play segment;
将第一播放段的音频数据以及音频播放控制参数、第二播放段的视频数据组装成播放数据。The audio data of the first playback segment and the audio playback control parameters, and the video data of the second playback segment are assembled into playback data.
在本实施例中,例如若所述当前待播放视频文件对应的播放方式信息中存在音频播放方式的第一播放段,则获取该第一播放段的音频数据以及音频播放控制参数(如控制用户终端的显示屏关闭或是播放器后台挂起);若所述当前待播放视频文件对应的播放方式信息中存在视频播放方式的第二播放段,则获取该第二播放段的视频数据;最后将第一播放段的音频数据以及音频播放控制参数、第二播放段的视频数据组装成播放数据发送至用户终端。In this embodiment, for example, if the first playback segment of the audio playback mode exists in the playback mode information corresponding to the currently to-be-played video file, the audio data of the first playback segment and the audio playback control parameters (such as controlling the user The display screen of the terminal is turned off or the player is suspended in the background); if the second playback segment of the video playback mode exists in the playback mode information corresponding to the currently to-be-played video file, the video data of the second playback segment is acquired; and finally The audio data of the first playback segment, the audio playback control parameters, and the video data of the second playback segment are assembled into playback data and sent to the user terminal.
其中,若所述当前待播放视频文件对应的播放方式信息中存在音频播放方式的第一播放段,则第一播放段的音频数据以及音频播放控制参数不为空值;若所述当前待播放视频文件对应的播放方式信息中不存在音频播放方式的第一播放段,则第一播放段的音频数据以及音频播放控制参数为空值。同样的,若所述当前待播放视频文件对应的播放方式信息中存在视频播放方式的第二播放段,则第二播放段对应的视频数据不为空值;若所述当前待播放视频文件对应的播放方式信息中不存在视频播放方式的第二播放段,则第二播放段对应的视频数据为空值。这样将第一播放段的音频数据以及音频播放控制参数、第二播放段的视频数据组装成播放数据发送至用户终端时,可以根据所述当前待播放视频文件对应的播放方式信息中是存在第一播放端和/或第二播放端,均能组合出与所述当前待播放视频文件对应的播放数据,也不会出现误差。Wherein, if the first playback segment of the audio playback mode exists in the playback mode information corresponding to the currently to-be-played video file, the audio data and audio playback control parameters of the first playback segment are not null values; if the currently to-be-played If the first playback segment of the audio playback mode does not exist in the playback mode information corresponding to the video file, the audio data and audio playback control parameters of the first playback segment are null values. Similarly, if the second playback segment of the video playback mode exists in the playback mode information corresponding to the currently to-be-played video file, the video data corresponding to the second playback segment is not a null value; if the currently-to-be-played video file corresponds to If the second playback segment of the video playback mode does not exist in the playback mode information, the video data corresponding to the second playback segment is a null value. In this way, when the audio data of the first playback segment and the audio playback control parameters, and the video data of the second playback segment are assembled into playback data and sent to the user terminal, it can be based on whether there is a second playback mode information corresponding to the current video file to be played. Both the one playback terminal and/or the second playback terminal can combine the playback data corresponding to the currently to-be-played video file, and there is no error.
在之后,通过在用户终端上的视频播放器中导入播放数据而进行播放,即可实现一种进行智能播放,在应音频播放而关闭显示屏或后台挂起时即刻进行节电,无需长亮耗费电能。After that, by importing the playback data into the video player on the user terminal for playback, a smart playback can be realized, which can save power immediately when the display screen is turned off or the background hangs in response to audio playback, without the need to keep on It consumes electricity.
此时,将播放数据发送到用户终端进行播放后,也可以根据用户终端的位置判断是否需要播放视频(关闭屏幕),比如检测到用户将用户终端放入口袋中或者箱包中等时就暂停播放视频,这一控制方式的发起方在于用户终端本身,而非视频服务器。通过这一方式,也能有效控制用户终端进行节电。At this point, after sending the playback data to the user terminal for playback, you can also determine whether the video needs to be played (turn off the screen) according to the location of the user terminal. For example, it is detected that the user puts the user terminal in a pocket or a suitcase, and then pauses the video playback. , The initiator of this control method is the user terminal itself, not the video server. In this way, the user terminal can also be effectively controlled to save power.
该方法实现了基于当前待播放视频文件的视频类型而获取其播放方式信息和对应的播放数据,根据播放方式信息实现对用户终端的节电控制。The method realizes that based on the video type of the current to-be-played video file, the playing mode information and the corresponding playing data are obtained, and the power saving control of the user terminal is realized according to the playing mode information.
本申请实施例还提供一种基于视频类型的播放控制实现装置,该基于视频类型的播放控制实现装置用于执行前述基于视频类型的播放控制实现方法的任一实施例。具体地,请参阅图5,图5是本申请实施例提供的基于视频类型的播放控制实现装置的示意性框图。该基于视频类型的播放控制实现装置100可以配置于服务器中。The embodiment of the present application also provides a device for implementing playback control based on a video type, and the device for implementing playback control based on a video type is used to execute any embodiment of the foregoing method for implementing playback control based on a video type. Specifically, please refer to FIG. 5, which is a schematic block diagram of a video type-based playback control implementation device provided by an embodiment of the present application. The device 100 for implementing playback control based on video types may be configured in a server.
如图5所示,基于视频类型的播放控制实现装置100包括视频选择单元110、视频标签判断单元120、第一节目类型获取单元130、第二节目类型获取单元140、播放方式信息获取单元150、播放数据发送单元160。As shown in FIG. 5, the device 100 for implementing playback control based on video type includes a video selection unit 110, a video tag determination unit 120, a first program type acquisition unit 130, a second program type acquisition unit 140, a playback mode information acquisition unit 150, Play data sending unit 160.
视频选择单元110,用于若检测到视频选择指令,获取与所述视频选择指令对应的当前待播放视频文件。The video selection unit 110 is configured to, if a video selection instruction is detected, obtain a current to-be-played video file corresponding to the video selection instruction.
在本实施例中,当用户在用户终端(如智能手机、平板电脑等)打开视频软件的用户交互界面时,需选定一个待播放视频文件,一旦选定了待播放视频文件,用户终端将此视频选择指令发送至视频服务器。视频服务器获取了所述视频选择指令,则对应的获取当前待播放视频文件。其中,待播放视频文件既可以是限定了时长的视频文件(如一部电影、一段短视频等,限定了时长的视频文件可以是包括单一视频节目类型的视频文件,也可以是包括多个视频节目类型的视频文件),也可以是直播视频(例如A电视台的在线节目,直播视频在不同的时间段播放不同类型的视频节目)。In this embodiment, when the user opens the user interaction interface of the video software on the user terminal (such as a smart phone, a tablet computer, etc.), a video file to be played needs to be selected. Once the video file to be played is selected, the user terminal will This video selection instruction is sent to the video server. The video server obtains the video selection instruction, and correspondingly obtains the current to-be-played video file. Among them, the video file to be played can be a video file with a limited duration (such as a movie, a short video, etc., and the video file with a limited duration can be a video file that includes a single video program type, or it can include multiple video programs. Types of video files), or live video (for example, the online program of TV A, the live video broadcasts different types of video programs in different time periods).
视频标签判断单元120,用于判断所述当前待播放视频文件是否包括视频节目类型标签。The video tag determining unit 120 is configured to determine whether the currently to-be-played video file includes a video program type tag.
在本实施例中,由于用户所选定的当前待播放视频文件可能已被设置有视频节目类型标签,也有可能还未被设置视频节目类型标签,为了后续步骤中确定当前待播放视频文件的播放方式,需先根据视频节目类型标签或视频内容识别来确定视频节目类型。In this embodiment, since the currently to-be-played video file selected by the user may have been set with a video program type tag, or it may not have been set, in order to determine the playback of the currently-to-be-played video file in the subsequent steps Method, you need to determine the video program type first according to the video program type tag or video content recognition.
第一节目类型获取单元130,用于若所述当前待播放视频文件不包括节目类型标签,将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型。The first program type acquiring unit 130 is configured to, if the currently to-be-played video file does not include a program type tag, input the selected multiple frames of video images in the currently-to-be-played video file to a pre-trained convolutional neural network network , Get the video program type.
在本实施例中,若所述当前待播放视频文件不包括节目类型标签,表示需要视频服务器对当前待播放视频文件进行视频节目类型的识别。当视频服务器获取了所述当前待播放视频文件后,为了判断其视频节目类型(如新闻节目、实时点评节目、朗诵节目、体育节目、综艺节目、电影节目等),需要将所述当前待播放视频文件进行视频拆分,得到多帧视频图像。之后根据预先训练的卷积神经网络网络即可对拆分得到的多帧视频图像进行识别,从而得到 对应的视频节目类型。In this embodiment, if the currently to-be-played video file does not include the program type tag, it means that the video server is required to identify the video program type of the currently-to-be-played video file. After the video server obtains the currently to-be-played video file, in order to determine the type of its video program (such as news programs, real-time review programs, recitation programs, sports programs, variety shows, movie programs, etc.), it is necessary to set the currently-to-be-played video files The video file is split into video to obtain multiple frames of video images. Then, according to the pre-trained convolutional neural network, the split multi-frame video images can be identified, and the corresponding video program type can be obtained.
在一实施例中,如图6所示,第一节目类型获取单元130包括:In an embodiment, as shown in FIG. 6, the first program type acquiring unit 130 includes:
目标图像集获取单元131,用于按预设的间隔数获取所述当前待播放视频文件对应的视频图像集合中的多帧视频图像,以作为目标图像集;The target image set obtaining unit 131 is configured to obtain multiple frames of video images in the video image set corresponding to the current to-be-played video file according to a preset number of intervals, as a target image set;
节目类型识别单元132,用于将所述目标图像集输入至所述卷积神经网络,得到所述目标图像集中每一帧视频图像对应的节目类型;The program type identification unit 132 is configured to input the target image set to the convolutional neural network to obtain the program type corresponding to each frame of video image in the target image set;
分隔单元133,用于若所述类型总数量大于1,依序获取所述目标图像集中每一帧视频图像对应节目类型以组成节目类型序列,将所述节目类型序列中相邻两个节目类型存在不同的节目类型之间插入分隔符;The separating unit 133 is configured to, if the total number of types is greater than 1, sequentially obtain the program type corresponding to each frame of the video image in the target image set to form a program type sequence, and divide the two adjacent program types in the program type sequence Insert separators between different program types;
播放段划分单元134,用于定位节目类型序列中所插入的每一分隔符对应的分隔时间点,通过各分隔时间点将所述当前待播放视频文件划分为对应播放段;The playback segment dividing unit 134 is configured to locate the separation time point corresponding to each separator inserted in the program type sequence, and divide the current to-be-played video file into corresponding playback segments through each separation time point;
播放段类型获取单元135,用于依序获取所述当前待播放视频文件中各播放段所对应的视频节目类型。The playback segment type acquiring unit 135 is configured to sequentially acquire the video program type corresponding to each playback segment in the currently to-be-played video file.
在本实施例中,按预设的间隔数获取所述当前待播放视频文件对应的视频图像集合中的多帧视频图像时,可以将预设的间隔数设置为12。一般1秒钟的视频可以拆分为24帧图像,则N分钟时长的当前待播放视频文件可以拆分为24N张视频图像,此时可以第一帧视频图像为起始点,还选择第13,25,……,1+12n(其中1+12n≤24N)帧视频图像组成目标图像集。步骤S131在具体实施时,可以采用另一种实施方式,即随机获取所述当前待播放视频文件对应的视频图像集合中的多帧视频图像,以作为目标图像集。In this embodiment, when acquiring multiple frames of video images in the video image set corresponding to the currently to-be-played video file according to a preset number of intervals, the preset number of intervals may be set to 12. Generally, a 1 second video can be split into 24 frames of images, and the current video file to be played with a duration of N minutes can be split into 24N video images. At this time, the first frame of video image can be used as the starting point, and the 13th frame can also be selected. 25,..., 1+12n (where 1+12n≤24N) frames of video images form the target image set. In the specific implementation of step S131, another implementation manner may be adopted, that is, multiple frames of video images in the video image set corresponding to the current to-be-played video file are randomly obtained as the target image set.
之后将所述目标图像集输入至所述卷积神经网络,可通过所述卷积神经网络获取所述目标图像集中每一帧视频图像对应的节目类型,然后统计所述卷积神经网络获取所述目标图像集中每一帧视频图像对应的节目类型中类型总数量。After that, the target image set is input to the convolutional neural network, the program type corresponding to each frame of the video image in the target image set can be obtained through the convolutional neural network, and then the convolutional neural network obtains all The total number of types in the program types corresponding to each frame of video image in the target image set.
当类型总数量大于1时,表示所述当前待播放视频文件中存在两个及以上的节目类型,此时为了将所述当前待播放视频文件进行有效的划分,此时可定位节目类型序列中所插入的每一分隔符对应的分隔时间点,通过各分隔时间点将所述当前待播放视频文件划分为对应播放段;When the total number of types is greater than 1, it means that there are two or more program types in the current to-be-played video file. At this time, in order to effectively divide the current to-be-played video file, you can locate the program type sequence at this time For the separation time point corresponding to each of the inserted separators, the current to-be-played video file is divided into corresponding play segments through each separation time point;
例如节目类型序列为[新闻节目 新闻节目 新闻节目 新闻节目 体育节目 体育节目 体育节目 体育节目 综艺节目 综艺节目 综艺节目 综艺节目 综艺节目……综艺节目]For example, the program type sequence is [News Program News Program News Program News Program Sports Program Sports Program Sports Program Sports Program Variety Program Variety Program Variety Program Variety Program Variety Program Variety Program...Variety Program]
插入分隔符后的节目类型序列为[新闻节目 新闻节目 新闻节目 新闻节目|体育节目 体育节目 体育节目 体育节目|综艺节目 综艺节目 综艺节目 综艺节目 综艺节目……综艺节目]。The program type sequence after inserting the separator is [News Program News Program News Program News Program|Sports Program Sports Program Sports Program Sports Program|Variety Program Variety Program Variety Program Variety Program Variety Program Variety Program Variety Program...Variety Program].
上述节目类型序列中第一个分隔符在第四位的新闻节目和第五位的体育节目之间,第二个分隔符在第八位的体育节目和第九位的综艺节目之间,此时分别获取第一分隔符对应的第一分隔时间点、及第二分隔符对应的第二分隔时间点(其中第一分隔时间点记作第五位的体育节目对应的起始时间,其中第二分隔时间点记作第九位的综艺节目对应的起始时间)。则第一位的新闻节目对应的起始时间点与第一分隔时间点之间记为第一播放段,第一分隔时间点与第二分隔时间点之间记为第二播放段,第二分隔时间点与最末位的综艺节目对应的结束时间点之间记为第三播放段。完成上述划分后,依序获取所述当前待播放视频文件中各播放段所对应的视频节目类型。通过所述卷积神经网络有效的对所述当前待播放视频文件中包括的视频节目类型进行有效识别,便于后续根据视频节目类型控制视频文件的播放方式。In the above program type sequence, the first separator is between the news program in the fourth place and the sports program in the fifth place, and the second separator is between the sports program in the eighth place and the variety show in the ninth place. Obtain the first separation time point corresponding to the first separator and the second separation time point corresponding to the second separator (where the first separation time point is recorded as the start time corresponding to the fifth place sports program, where the The two separate time points are recorded as the start time corresponding to the ninth variety show). Then the time between the start time point corresponding to the first news program and the first separation time point is recorded as the first play segment, and the time between the first separation time point and the second separation time point is recorded as the second play segment, and the second The interval between the separation time point and the end time point corresponding to the last variety show is recorded as the third play segment. After the above division is completed, the video program types corresponding to each playback segment in the current to-be-played video file are sequentially acquired. The convolutional neural network can effectively identify the video program type included in the currently to-be-played video file, so as to facilitate subsequent control of the video file playback mode according to the video program type.
在一实施例中,如图7所示,基于视频类型的播放控制实现装置100还包括:In an embodiment, as shown in FIG. 7, the device 100 for implementing video type-based playback control further includes:
视频节目类型划分单元1101,用于对视频节目划分多个视频节目类型;The video program type dividing unit 1101 is configured to divide the video program into multiple video program types;
视频样本获取单元1102,用于获取每一个视频节目类型的视频样本;The video sample obtaining unit 1102 is used to obtain video samples of each video program type;
训练集获取单元1103,用于提取所述视频样本中的每帧图像的图像特征数据作为训练集;The training set obtaining unit 1103 is configured to extract image feature data of each frame of image in the video sample as a training set;
模型训练单元1104,用于将所述训练集输入至待训练卷积神经网络进行训练,得到卷积 神经网络。The model training unit 1104 is configured to input the training set to the convolutional neural network to be trained for training to obtain the convolutional neural network.
在本实施例中,在预先训练所述卷积神经网络时,例如,电影节目的图像特征是电影场景中的图像特征是电影场景中上下或者左右的黑边,因此可以提权该图像中的上下或左右的黑边作为图像特征数据;新闻节目的图像特征是新闻场景中的左上角的台标和下边的字幕,因此可以提取台标和字幕作为图像特征数据。通过上述方式训练出卷积神经网络后,即可用于识别视频图像对应的视频节目类型。In this embodiment, when the convolutional neural network is trained in advance, for example, the image feature of the movie program is that the image feature in the movie scene is the upper and lower or left and right black borders in the movie scene, so the right of the image in the image can be increased. The upper and lower or left and right black borders are used as image feature data; the image characteristics of news programs are the upper left corner of the station logo and the lower subtitles in the news scene, so the station logo and subtitles can be extracted as image feature data. After the convolutional neural network is trained in the above manner, it can be used to identify the video program type corresponding to the video image.
在一实施例中,第一节目类型获取单元130还包括:In an embodiment, the first program type acquiring unit 130 further includes:
当前节目类型获取单元136,用于若所述类型总数量等于1,获取所述当前待播放视频文件对应的视频节目类型。The current program type obtaining unit 136 is configured to obtain the video program type corresponding to the current video file to be played if the total number of types is equal to one.
在本实施例中,若类型总数量等于1时,表示所述当前待播放视频文件中仅存在一种节目类型,此时无需对所述当前待播放视频文件进行划分来识别对应的视频节目类型,直接以所述当前待播放视频文件的当前节目类型作为对应的视频节目类型。In this embodiment, if the total number of types is equal to 1, it means that there is only one program type in the current to-be-played video file, and there is no need to divide the current to-be-played video file to identify the corresponding video program type. , Directly use the current program type of the currently to-be-played video file as the corresponding video program type.
第二节目类型获取单元140,用于若所述当前待播放视频文件包括节目类型标签,根据节目类型标签获取与所述当前待播放视频文件对应的视频节目类型。The second program type obtaining unit 140 is configured to, if the currently to-be-played video file includes a program type tag, obtain the video program type corresponding to the currently-to-be-played video file according to the program type tag.
在本实施例中,即若当前待播放视频文件已被设置了节目类型标签,则直接提取对应的视频节目类型即可,无需通过所述卷积神经网络作为识别模型来获取所述当前待播放视频文件中目标图像集各图片对应的节目类型。In this embodiment, that is, if the program type tag of the current video file to be played has been set, the corresponding video program type can be directly extracted, without the need to use the convolutional neural network as a recognition model to obtain the current to be played The program type corresponding to each picture in the target image set in the video file.
播放方式信息获取单元150,用于根据当前待播放视频文件对应的视频节目类型及预先设置的视频类型播放策略,获取与所述当前待播放视频文件对应的播放方式信息并发送至对应的用户终端;其中,所述视频类型播放策略包括多种视频节目类型,及与各视频节目类型一一对应的播放方式信息。The playing mode information obtaining unit 150 is configured to obtain the playing mode information corresponding to the current to-be-played video file according to the video program type corresponding to the current to-be-played video file and the preset video type playing strategy and send it to the corresponding user terminal ; Wherein, the video type play strategy includes multiple video program types, and play mode information corresponding to each video program type one-to-one.
在本实施例中,所述视频类型播放策略可预先设置的为如上表1:作为另一种实施方式的视频类型播放策略可预先设置的为如上表2。In this embodiment, the video type play strategy may be preset as shown in Table 1 above: as another implementation manner, the video type play strategy may be preset as shown in Table 2 above.
视频播放策略的上述两具体实施例不同之处在于,音频播放类的视频节目如表1的第一种方式是在用户终端屏幕关闭的情况下播放以实现省电,音频播放类的视频节目如表2的第二种方式是在用户终端的后台挂起播放以便于用户操作用户终端进行其他操作(如浏览网页页面)。The difference between the above two specific embodiments of the video playback strategy is that the audio playback type video program shown in Table 1 is played when the user terminal screen is turned off to save power. The audio playback type video program is as follows: The second method in Table 2 is to suspend playback in the background of the user terminal so that the user can operate the user terminal to perform other operations (such as browsing web pages).
播放数据发送单元160,用于若检测到与所述当前待播放视频文件的播放方式信息对应的同意操作指令,将与所述当前待播放视频文件分解为对应的音视文件和/或视频文件,以组装成对应的播放数据并发送至用户终端。The play data sending unit 160 is configured to, if an agreed operation instruction corresponding to the play mode information of the currently to-be-played video file is detected, decompose the currently-to-be-played video file into corresponding audio-visual files and/or video files , To assemble the corresponding playback data and send it to the user terminal.
在本实施例中,当视频服务器将与所述当前待播放视频文件的播放方式信息发送至用户终端后,若用户选择同意该播放方式则触发同意操作指令,且该同意操作指令由用户终端发送至视频服务器。若视频服务器检测到同意操作指令,将所述当前待播放视频文件分解为对应的音视文件和/或视频文件发送至用户终端。In this embodiment, after the video server sends the information about the play mode of the currently to-be-played video file to the user terminal, if the user chooses to agree to the play mode, the consent operation instruction is triggered, and the consent operation instruction is sent by the user terminal To the video server. If the video server detects the consent operation instruction, it decomposes the currently to-be-played video file into corresponding audio-visual files and/or video files and sends them to the user terminal.
在一实施例中,播放数据发送单元160包括:In an embodiment, the play data sending unit 160 includes:
第一获取单元,用于若所述当前待播放视频文件的播放方式信息所对应的播放方式中存在音频播放方式的第一播放段,获取第一播放段的音频数据以及对应音频播放控制参数;其中,音频播放控制参数用于控制用户终端的显示屏开启或关闭,或者是用于控制当前待播放视频文件对应的播放器在前端运行或后台挂起;The first acquiring unit is configured to acquire audio data of the first playback segment and corresponding audio playback control parameters if there is a first playback segment of the audio playback mode in the playback mode corresponding to the playback mode information of the currently to-be-played video file; Among them, the audio playback control parameter is used to control the display screen of the user terminal to turn on or off, or to control the player corresponding to the video file currently to be played to run in the front end or hang in the background;
第二获取单元,用于若所述当前待播放视频文件的播放方式信息所对应的播放方式中存在视频播放方式的第二播放段,获取第二播放段的视频数据;The second obtaining unit is configured to obtain the video data of the second playing section if there is a second playing section of the video playing mode in the playing mode corresponding to the playing mode information of the currently to-be-played video file;
数据组合单元,用于将第一播放段的音频数据以及音频播放控制参数、第二播放段的视频数据组装成播放数据。The data combination unit is used to assemble the audio data of the first playback segment, the audio playback control parameters, and the video data of the second playback segment into playback data.
在本实施例中,例如若所述当前待播放视频文件对应的播放方式信息中存在音频播放方式的第一播放段,则获取该第一播放段的音频数据以及音频播放控制参数(如控制用户终端的显示屏关闭或是播放器后台挂起);若所述当前待播放视频文件对应的播放方式信息中存 在视频播放方式的第二播放段,则获取该第二播放段的视频数据;最后将第一播放段的音频数据以及音频播放控制参数、第二播放段的视频数据组装成播放数据发送至用户终端。In this embodiment, for example, if the first playback segment of the audio playback mode exists in the playback mode information corresponding to the currently to-be-played video file, the audio data of the first playback segment and the audio playback control parameters (such as controlling the user The display screen of the terminal is turned off or the player is suspended in the background); if the second playback segment of the video playback mode exists in the playback mode information corresponding to the currently to-be-played video file, the video data of the second playback segment is acquired; and finally The audio data of the first playback segment, the audio playback control parameters, and the video data of the second playback segment are assembled into playback data and sent to the user terminal.
其中,若所述当前待播放视频文件对应的播放方式信息中存在音频播放方式的第一播放段,则第一播放段的音频数据以及音频播放控制参数不为空值;若所述当前待播放视频文件对应的播放方式信息中不存在音频播放方式的第一播放段,则第一播放段的音频数据以及音频播放控制参数为空值。同样的,若所述当前待播放视频文件对应的播放方式信息中存在视频播放方式的第二播放段,则第二播放段对应的视频数据不为空值;若所述当前待播放视频文件对应的播放方式信息中不存在视频播放方式的第二播放段,则第二播放段对应的视频数据为空值。这样将第一播放段的音频数据以及音频播放控制参数、第二播放段的视频数据组装成播放数据发送至用户终端时,可以根据所述当前待播放视频文件对应的播放方式信息中是存在第一播放端和/或第二播放端,均能组合出与所述当前待播放视频文件对应的播放数据,也不会出现误差。Wherein, if the first playback segment of the audio playback mode exists in the playback mode information corresponding to the currently to-be-played video file, the audio data and audio playback control parameters of the first playback segment are not null values; if the currently to-be-played If the first playback segment of the audio playback mode does not exist in the playback mode information corresponding to the video file, the audio data and audio playback control parameters of the first playback segment are null values. Similarly, if the second playback segment of the video playback mode exists in the playback mode information corresponding to the currently to-be-played video file, the video data corresponding to the second playback segment is not a null value; if the currently-to-be-played video file corresponds to If the second playback segment of the video playback mode does not exist in the playback mode information, the video data corresponding to the second playback segment is a null value. In this way, when the audio data of the first playback segment and the audio playback control parameters, and the video data of the second playback segment are assembled into playback data and sent to the user terminal, it can be based on whether there is a second playback mode information corresponding to the current video file to be played. Both the one playback terminal and/or the second playback terminal can combine the playback data corresponding to the currently to-be-played video file, and there is no error.
在之后,通过在用户终端上的视频播放器中导入播放数据而进行播放,即可实现一种进行智能播放,在应音频播放而关闭显示屏或后台挂起时即刻进行节电,无需长亮耗费电能。After that, by importing the playback data into the video player on the user terminal for playback, a smart playback can be realized, which can save power immediately when the display screen is turned off or the background hangs in response to audio playback, without the need to keep on It consumes electricity.
此时,将播放数据发送到用户终端进行播放后,也可以根据用户终端的位置判断是否需要播放视频(关闭屏幕),比如检测到用户将用户终端放入口袋中或者箱包中等时就暂停播放视频,这一控制方式的发起方在于用户终端本身,而非视频服务器。通过这一方式,也能有效控制用户终端进行节电。At this point, after sending the playback data to the user terminal for playback, you can also determine whether the video needs to be played (turn off the screen) according to the location of the user terminal. For example, it is detected that the user puts the user terminal in a pocket or a suitcase, and then pauses the video playback. , The initiator of this control method is the user terminal itself, not the video server. In this way, the user terminal can also be effectively controlled to save power.
该装置实现了基于当前待播放视频文件的视频类型而获取其播放方式信息和对应的播放数据,根据播放方式信息实现对用户终端的节电控制。The device realizes that based on the video type of the current to-be-played video file, it obtains its playing mode information and corresponding playing data, and realizes power-saving control of the user terminal according to the playing mode information.
上述基于视频类型的播放控制实现装置可以实现为计算机程序的形式,该计算机程序可以在如图8所示的计算机设备上运行。The foregoing device for implementing playback control based on the video type may be implemented in the form of a computer program, and the computer program may run on the computer device as shown in FIG. 8.
请参阅图8,图8是本申请实施例提供的计算机设备的示意性框图。该计算机设备500是服务器,服务器可以是独立的服务器,也可以是多个服务器组成的服务器集群,实现基于视频类型的播放控制方法,其中,所述方法包括:若检测到视频选择指令,获取与所述视频选择指令对应的当前待播放视频文件;判断所述当前待播放视频文件是否包括视频节目类型标签;若所述当前待播放视频文件不包括节目类型标签,将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型;若所述当前待播放视频文件包括节目类型标签,根据节目类型标签获取与所述当前待播放视频文件对应的视频节目类型;根据当前待播放视频文件对应的视频节目类型及预先设置的视频类型播放策略,获取与所述当前待播放视频文件对应的播放方式信息并发送至对应的用户终端;其中,所述视频类型播放策略包括多种视频节目类型,及与各视频节目类型一一对应的播放方式信息;以及若检测到与所述当前待播放视频文件的播放方式信息对应的同意操作指令,将与所述当前待播放视频文件分解为对应的音视文件和/或视频文件,以组装成对应的播放数据并发送至用户终端。Please refer to FIG. 8. FIG. 8 is a schematic block diagram of a computer device according to an embodiment of the present application. The computer device 500 is a server. The server can be an independent server or a server cluster composed of multiple servers to implement a video type-based playback control method, where the method includes: if a video selection instruction is detected, acquiring and The current to-be-played video file corresponding to the video selection instruction; determine whether the current to-be-played video file includes a video program type tag; if the current to-be-played video file does not include a program type tag, the current to-be-played video file The selected multi-frame video images are input to the pre-trained convolutional neural network to obtain the video program type; if the currently to-be-played video file includes a program type tag, the video file to be played is obtained according to the program type tag. Corresponding video program type; according to the video program type corresponding to the current to-be-played video file and the preset video type playback strategy, obtain the playback mode information corresponding to the currently-to-be-played video file and send it to the corresponding user terminal; wherein, The video type play strategy includes multiple video program types, and play mode information corresponding to each video program type one-to-one; and if an agreed operation instruction corresponding to the play mode information of the currently to-be-played video file is detected, the The current to-be-played video file is decomposed into corresponding audio-visual files and/or video files to be assembled into corresponding play data and sent to the user terminal.
参阅图8,该计算机设备500包括通过系统总线501连接的处理器502、存储器和网络接口505,其中,存储器可以包括非易失性存储介质503和内存储器504。Referring to FIG. 8, the computer device 500 includes a processor 502, a memory, and a network interface 505 connected through a system bus 501, where the memory may include a non-volatile storage medium 503 and an internal memory 504.
该非易失性存储介质503可存储操作系统5031和计算机程序5032。该计算机程序5032被执行时,可使得处理器502执行基于视频类型的播放控制实现方法。The non-volatile storage medium 503 can store an operating system 5031 and a computer program 5032. When the computer program 5032 is executed, the processor 502 can execute the method for implementing playback control based on the video type.
该处理器502用于提供计算和控制能力,支撑整个计算机设备500的运行。The processor 502 is used to provide computing and control capabilities, and support the operation of the entire computer device 500.
该内存储器504为非易失性存储介质503中的计算机程序5032的运行提供环境,该计算机程序5032被处理器502执行时,可使得处理器502执行基于视频类型的播放控制实现方法。The internal memory 504 provides an environment for the operation of the computer program 5032 in the non-volatile storage medium 503. When the computer program 5032 is executed by the processor 502, the processor 502 can make the processor 502 execute a video type-based playback control implementation method.
该网络接口505用于进行网络通信,如提供数据信息的传输等。本领域技术人员可以理解,图8中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备500的限定,具体的计算机设备500可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。The network interface 505 is used for network communication, such as providing data information transmission. Those skilled in the art can understand that the structure shown in FIG. 8 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the computer device 500 to which the solution of the present application is applied. The specific computer device 500 may include more or fewer components than shown in the figure, or combine certain components, or have a different component arrangement.
其中,所述处理器502用于运行存储在存储器中的计算机程序5032,以实现本申请实施例公开的基于视频类型的播放控制实现方法,其中,所述方法至少包括:。Wherein, the processor 502 is configured to run a computer program 5032 stored in a memory to implement the video type-based playback control implementation method disclosed in the embodiment of the present application, where the method at least includes:
本领域技术人员可以理解,图8中示出的计算机设备的实施例并不构成对计算机设备具体构成的限定,在其他实施例中,计算机设备可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。例如,在一些实施例中,计算机设备可以仅包括存储器及处理器,在这样的实施例中,存储器及处理器的结构及功能与图8所示实施例一致,在此不再赘述。Those skilled in the art can understand that the embodiment of the computer device shown in FIG. 8 does not constitute a limitation on the specific configuration of the computer device. In other embodiments, the computer device may include more or less components than those shown in the figure. Or some parts are combined, or different parts are arranged. For example, in some embodiments, the computer device may only include a memory and a processor. In such an embodiment, the structures and functions of the memory and the processor are consistent with the embodiment shown in FIG. 8 and will not be repeated here.
应当理解,在本申请实施例中,处理器502可以是中央处理单元(Central ProcessingUnit,CPU),该处理器502还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable GateArray,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。其中,通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。It should be understood that in the embodiment of the present application, the processor 502 may be a central processing unit (Central Processing Unit, CPU), and the processor 502 may also be other general-purpose processors, digital signal processors (Digital Signal Processors, DSPs), and special purpose processors. Integrated circuit (Application Specific Integrated Circuit, ASIC), off-the-shelf programmable gate array (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gates or transistor logic devices, discrete hardware components, etc. Among them, the general-purpose processor may be a microprocessor or the processor may also be any conventional processor.
在本申请的另一实施例中提供计算机可读存储介质。该计算机可读存储介质可以为易失性介质,也可以为非易失性的计算机可读存储介质。该计算机可读存储介质存储有计算机程序,其中计算机程序被处理器执行时实现本申请实施例公开的基于视频类型的播放控制实现方法,其中,所述方法至少包括:若检测到视频选择指令,获取与所述视频选择指令对应的当前待播放视频文件;判断所述当前待播放视频文件是否包括视频节目类型标签;若所述当前待播放视频文件不包括节目类型标签,将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型;若所述当前待播放视频文件包括节目类型标签,根据节目类型标签获取与所述当前待播放视频文件对应的视频节目类型;根据当前待播放视频文件对应的视频节目类型及预先设置的视频类型播放策略,获取与所述当前待播放视频文件对应的播放方式信息并发送至对应的用户终端;其中,所述视频类型播放策略包括多种视频节目类型,及与各视频节目类型一一对应的播放方式信息;以及若检测到与所述当前待播放视频文件的播放方式信息对应的同意操作指令,将与所述当前待播放视频文件分解为对应的音视文件和/或视频文件,以组装成对应的播放数据并发送至用户终端。所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分,或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、磁碟或者光盘等各种可以存储程序代码的介质。In another embodiment of the present application, a computer-readable storage medium is provided. The computer-readable storage medium may be a volatile medium or a non-volatile computer-readable storage medium. The computer-readable storage medium stores a computer program, where the computer program is executed by a processor to implement the video type-based playback control implementation method disclosed in the embodiments of the present application, wherein the method at least includes: if a video selection instruction is detected, Obtain the current to-be-played video file corresponding to the video selection instruction; determine whether the current to-be-played video file includes a video program type tag; if the current to-be-played video file does not include a program type tag, set the currently to be played The selected multi-frame video images in the video file are input to the pre-trained convolutional neural network network to obtain the video program type; if the currently to-be-played video file includes a program type tag, it is obtained according to the program type tag and the current to-be-played The video program type corresponding to the video file; according to the video program type corresponding to the current to-be-played video file and the preset video type play strategy, obtain the play mode information corresponding to the currently-to-be-played video file and send it to the corresponding user terminal; Wherein, the video type play strategy includes multiple video program types, and play mode information corresponding to each video program type one-to-one; and if an agreed operation instruction corresponding to the play mode information of the currently to-be-played video file is detected , Decompose the currently to-be-played video file into corresponding audio-visual files and/or video files to assemble into corresponding play data and send them to the user terminal. If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a storage medium. Based on this understanding, the technical solution of this application is essentially or the part that contributes to the existing technology, or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium. It includes several instructions to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the methods described in the various embodiments of the present application. The aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), magnetic disk or optical disk and other media that can store program codes.
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,因此,本申请的保护范围应以权利要求的保护范围为准。The above are only specific implementations of this application, but the protection scope of this application is not limited thereto. Therefore, the protection scope of this application shall be subject to the protection scope of the claims.

Claims (20)

  1. 一种基于视频类型的播放控制实现方法,其中,包括:A method for implementing playback control based on video types, including:
    若检测到视频选择指令,获取与所述视频选择指令对应的当前待播放视频文件;If a video selection instruction is detected, obtain the current to-be-played video file corresponding to the video selection instruction;
    判断所述当前待播放视频文件是否包括视频节目类型标签;Judging whether the currently to-be-played video file includes a video program type tag;
    若所述当前待播放视频文件不包括节目类型标签,将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型;If the currently to-be-played video file does not include the program type tag, input the selected multiple frames of video images in the currently-to-be-played video file to a pre-trained convolutional neural network network to obtain the video program type;
    若所述当前待播放视频文件包括节目类型标签,根据节目类型标签获取与所述当前待播放视频文件对应的视频节目类型;If the currently to-be-played video file includes a program type tag, obtain the video program type corresponding to the currently-to-be-played video file according to the program type tag;
    根据当前待播放视频文件对应的视频节目类型及预先设置的视频类型播放策略,获取与所述当前待播放视频文件对应的播放方式信息并发送至对应的用户终端;其中,所述视频类型播放策略包括多种视频节目类型,及与各视频节目类型一一对应的播放方式信息;以及According to the video program type corresponding to the currently to-be-played video file and the preset video type play strategy, the play mode information corresponding to the currently-to-be-played video file is obtained and sent to the corresponding user terminal; wherein, the video type play strategy Including a variety of video program types, and one-to-one corresponding playback mode information for each video program type; and
    若检测到与所述当前待播放视频文件的播放方式信息对应的同意操作指令,将与所述当前待播放视频文件分解为对应的音视文件和/或视频文件,以组装成对应的播放数据并发送至用户终端。If the consent operation instruction corresponding to the playing mode information of the currently to-be-played video file is detected, the currently-to-be-played video file is decomposed into corresponding audio-visual files and/or video files to assemble into corresponding playback data And sent to the user terminal.
  2. 根据权利要求1所述的基于视频类型的播放控制实现方法,其中,所述将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型,包括:The method for implementing playback control based on video types according to claim 1, wherein said inputting the selected multi-frame video images in the currently to-be-played video file into a pre-trained convolutional neural network network to obtain a video program Types, including:
    按预设的间隔数获取所述当前待播放视频文件对应的视频图像集合中的多帧视频图像,以作为目标图像集;Acquiring multiple frames of video images in the video image set corresponding to the currently to-be-played video file according to a preset number of intervals, as a target image set;
    将所述目标图像集输入至所述卷积神经网络,得到所述目标图像集中每一帧视频图像对应的节目类型;Inputting the target image set to the convolutional neural network to obtain the program type corresponding to each frame of video image in the target image set;
    若所述类型总数量大于1,依序获取所述目标图像集中每一帧视频图像对应节目类型以组成节目类型序列,将所述节目类型序列中相邻两个节目类型存在不同的节目类型之间插入分隔符;If the total number of types is greater than 1, the program types corresponding to each frame of the video image in the target image set are sequentially obtained to form a program type sequence, and two adjacent program types in the program type sequence are divided into two different program types. Insert a separator between;
    定位节目类型序列中所插入的每一分隔符对应的分隔时间点,通过各分隔时间点将所述当前待播放视频文件划分为对应播放段;Locate the separation time point corresponding to each separator inserted in the program type sequence, and divide the current to-be-played video file into corresponding play segments through each separation time point;
    依序获取所述当前待播放视频文件中各播放段所对应的视频节目类型。Obtain the video program type corresponding to each playback segment in the currently to-be-played video file in sequence.
  3. 根据权利要求2所述的基于视频类型的播放控制实现方法,其中,所述将所述目标图像集输入至所述卷积神经网络,得到所述目标图像集中每一帧视频图像对应的节目类型之后,还包括:The method for implementing playback control based on video types according to claim 2, wherein said inputting said target image set to said convolutional neural network obtains the program type corresponding to each frame of video image in said target image set After that, it also includes:
    若所述类型总数量等于1,获取所述当前待播放视频文件对应的视频节目类型。If the total number of types is equal to 1, the video program type corresponding to the currently to-be-played video file is acquired.
  4. 根据权利要求1所述的基于视频类型的播放控制实现方法,其中,所述若所述当前待播放视频文件不包括节目类型标签,将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型之前,或者实时若检测到视频选择指令,获取与所述视频选择指令对应的当前待播放视频文件之前,还包括:The method for implementing video type-based playback control according to claim 1, wherein if the current to-be-played video file does not include a program type tag, the selected multi-frame video image in the current to-be-played video file Before inputting to a pre-trained convolutional neural network to obtain the video program type, or if a video selection instruction is detected in real time, before obtaining the current to-be-played video file corresponding to the video selection instruction, it also includes:
    对视频节目划分多个视频节目类型;Divide multiple video program types into video programs;
    获取每一个视频节目类型的视频样本;Obtain video samples of each video program type;
    提取所述视频样本中的每帧图像的图像特征数据作为训练集;Extracting image feature data of each frame of image in the video sample as a training set;
    将所述训练集输入至待训练卷积神经网络进行训练,得到卷积神经网络。The training set is input to the convolutional neural network to be trained for training to obtain the convolutional neural network.
  5. 根据权利要求2所述的基于视频类型的播放控制实现方法,其中,所述定位节目类型序列中所插入的每一分隔符对应的分隔时间点,包括:The method for implementing video type-based playback control according to claim 2, wherein the positioning time point corresponding to each separator inserted in the sequence of program types includes:
    获取每一分隔符之后排序第一位的节目类型对应视频段的起始播放时间,以每一分隔符之后排序第一位的节目类型对应视频段的起始播放时间为相应分隔符所对应的分隔时间点。The first program type sorted after each separator corresponds to the start playback time of the video segment, and the first program type sorted after each separator corresponds to the start playback time of the video segment corresponding to the corresponding separator Separate time points.
  6. 根据权利要求1所述的基于视频类型的播放控制实现方法,其中,所述将与所述当前待播放视频文件分解为对应的音视文件和/或视频文件,以组装成对应的播放数据,包括:The method for implementing playback control based on video types according to claim 1, wherein said decomposing said currently to-be-played video file into corresponding audiovisual files and/or video files to assemble into corresponding playback data, include:
    若所述当前待播放视频文件的播放方式信息所对应的播放方式中存在音频播放方式的第一播放段,获取第一播放段的音频数据以及对应音频播放控制参数;其中,音频播放控制参数用于控制用户终端的显示屏开启或关闭,或者是用于控制当前待播放视频文件对应的播放器在前端运行或后台挂起;If there is a first playback segment of the audio playback mode in the playback mode corresponding to the playback mode information of the currently to-be-played video file, the audio data of the first playback segment and the corresponding audio playback control parameters are acquired; wherein, the audio playback control parameter is used It is used to control the display screen of the user terminal to turn on or off, or it is used to control the player corresponding to the current video file to be played to run in the front end or hang in the background;
    若所述当前待播放视频文件的播放方式信息所对应的播放方式中存在视频播放方式的第二播放段,获取第二播放段的视频数据;If there is a second play segment of the video play mode in the play mode corresponding to the play mode information of the currently to-be-played video file, acquiring the video data of the second play segment;
    将第一播放段的音频数据以及音频播放控制参数、第二播放段的视频数据组装成播放数据。The audio data of the first playback segment and the audio playback control parameters, and the video data of the second playback segment are assembled into playback data.
  7. 一种基于视频类型的播放控制实现装置,其中,包括:A video type-based playback control implementation device, which includes:
    视频选择单元,用于若检测到视频选择指令,获取与所述视频选择指令对应的当前待播放视频文件;The video selection unit is configured to, if a video selection instruction is detected, obtain a current to-be-played video file corresponding to the video selection instruction;
    视频标签判断单元,用于判断所述当前待播放视频文件是否包括视频节目类型标签;A video tag determining unit, configured to determine whether the currently to-be-played video file includes a video program type tag;
    第一节目类型获取单元,用于若所述当前待播放视频文件不包括节目类型标签,将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型;The first program type acquiring unit is configured to, if the currently to-be-played video file does not include a program type tag, input the selected multiple frames of video images in the currently-to-be-played video file to a pre-trained convolutional neural network, Get the type of video program;
    第二节目类型获取单元,用于若所述当前待播放视频文件包括节目类型标签,根据节目类型标签获取与所述当前待播放视频文件对应的视频节目类型;The second program type acquiring unit is configured to, if the currently to-be-played video file includes a program type tag, acquire the video program type corresponding to the currently-to-be-played video file according to the program type tag;
    播放方式信息获取单元,用于根据当前待播放视频文件对应的视频节目类型及预先设置的视频类型播放策略,获取与所述当前待播放视频文件对应的播放方式信息并发送至对应的用户终端;其中,所述视频类型播放策略包括多种视频节目类型,及与各视频节目类型一一对应的播放方式信息;以及The playing mode information obtaining unit is configured to obtain the playing mode information corresponding to the current to-be-played video file according to the video program type corresponding to the current to-be-played video file and the preset video type playing strategy, and send it to the corresponding user terminal; Wherein, the video type play strategy includes multiple video program types, and play mode information corresponding to each video program type one-to-one; and
    播放数据发送单元,用于若检测到与所述当前待播放视频文件的播放方式信息对应的同意操作指令,将与所述当前待播放视频文件分解为对应的音视文件和/或视频文件,以组装成对应的播放数据并发送至用户终端。The playback data sending unit is configured to, if an agreed operation instruction corresponding to the playback mode information of the currently to-be-played video file is detected, decompose the currently-to-be-played video file into corresponding audio-visual files and/or video files, It can be assembled into corresponding playback data and sent to the user terminal.
  8. 根据权利要求7所述的基于视频类型的播放控制实现装置,其中,所述第一节目类型获取单元,包括:The device for implementing playback control based on video type according to claim 7, wherein the first program type acquiring unit comprises:
    目标图像集获取单元,用于按预设的间隔数获取所述当前待播放视频文件对应的视频图像集合中的多帧视频图像,以作为目标图像集;The target image set acquiring unit is configured to acquire multiple frames of video images in the video image set corresponding to the current to-be-played video file according to a preset number of intervals, as a target image set;
    节目类型识别单元,用于将所述目标图像集输入至所述卷积神经网络,得到所述目标图像集中每一帧视频图像对应的节目类型;A program type identification unit, configured to input the target image set to the convolutional neural network to obtain the program type corresponding to each frame of the video image in the target image set;
    分隔单元,用于若所述类型总数量大于1,依序获取所述目标图像集中每一帧视频图像对应节目类型以组成节目类型序列,将所述节目类型序列中相邻两个节目类型存在不同的节目类型之间插入分隔符;The separating unit is used to obtain the program type corresponding to each frame of the video image in the target image set in sequence if the total number of types is greater than 1, to form a program type sequence, and to store two adjacent program types in the program type sequence Insert separators between different program types;
    播放段划分单元,用于定位节目类型序列中所插入的每一分隔符对应的分隔时间点,通过各分隔时间点将所述当前待播放视频文件划分为对应播放段;The playback segment dividing unit is used to locate the separation time point corresponding to each separator inserted in the program type sequence, and divide the current to-be-played video file into corresponding playback segments through each separation time point;
    播放段类型获取单元,用于依序获取所述当前待播放视频文件中各播放段所对应的视频节目类型。The playback segment type acquiring unit is configured to sequentially acquire the video program type corresponding to each playback segment in the currently to-be-played video file.
  9. 一种计算机设备,包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,其中,所述处理器执行所述计算机程序时实现基于视频类型的播放控制实现方法,其中,包括:A computer device, including a memory, a processor, and a computer program stored on the memory and capable of running on the processor, wherein the processor implements video type-based playback control when the processor executes the computer program Methods, including:
    若检测到视频选择指令,获取与所述视频选择指令对应的当前待播放视频文件;If a video selection instruction is detected, obtain the current to-be-played video file corresponding to the video selection instruction;
    判断所述当前待播放视频文件是否包括视频节目类型标签;Judging whether the currently to-be-played video file includes a video program type tag;
    若所述当前待播放视频文件不包括节目类型标签,将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型;If the currently to-be-played video file does not include the program type tag, input the selected multiple frames of video images in the currently-to-be-played video file to a pre-trained convolutional neural network network to obtain the video program type;
    若所述当前待播放视频文件包括节目类型标签,根据节目类型标签获取与所述当前待播放视频文件对应的视频节目类型;If the currently to-be-played video file includes a program type tag, obtain the video program type corresponding to the currently-to-be-played video file according to the program type tag;
    根据当前待播放视频文件对应的视频节目类型及预先设置的视频类型播放策略,获取与所述当前待播放视频文件对应的播放方式信息并发送至对应的用户终端;其中,所述视频类型播放策略包括多种视频节目类型,及与各视频节目类型一一对应的播放方式信息;以及According to the video program type corresponding to the currently to-be-played video file and the preset video type play strategy, the play mode information corresponding to the currently-to-be-played video file is obtained and sent to the corresponding user terminal; wherein, the video type play strategy Including a variety of video program types, and one-to-one corresponding playback mode information for each video program type; and
    若检测到与所述当前待播放视频文件的播放方式信息对应的同意操作指令,将与所述当前待播放视频文件分解为对应的音视文件和/或视频文件,以组装成对应的播放数据并发送至用户终端。If the consent operation instruction corresponding to the playing mode information of the currently to-be-played video file is detected, the currently-to-be-played video file is decomposed into corresponding audio-visual files and/or video files to assemble into corresponding playback data And sent to the user terminal.
  10. 根据权利要求9所述的计算机设备,其中,所述将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型,包括:9. The computer device according to claim 9, wherein the inputting the selected multi-frame video images in the currently to-be-played video file to a pre-trained convolutional neural network network to obtain the video program type comprises:
    按预设的间隔数获取所述当前待播放视频文件对应的视频图像集合中的多帧视频图像,以作为目标图像集;Acquiring multiple frames of video images in the video image set corresponding to the currently to-be-played video file according to a preset number of intervals, as a target image set;
    将所述目标图像集输入至所述卷积神经网络,得到所述目标图像集中每一帧视频图像对应的节目类型;Inputting the target image set to the convolutional neural network to obtain the program type corresponding to each frame of video image in the target image set;
    若所述类型总数量大于1,依序获取所述目标图像集中每一帧视频图像对应节目类型以组成节目类型序列,将所述节目类型序列中相邻两个节目类型存在不同的节目类型之间插入分隔符;If the total number of types is greater than 1, the program types corresponding to each frame of the video image in the target image set are sequentially obtained to form a program type sequence, and two adjacent program types in the program type sequence are divided into two different program types. Insert a separator between;
    定位节目类型序列中所插入的每一分隔符对应的分隔时间点,通过各分隔时间点将所述当前待播放视频文件划分为对应播放段;Locate the separation time point corresponding to each separator inserted in the program type sequence, and divide the current to-be-played video file into corresponding play segments through each separation time point;
    依序获取所述当前待播放视频文件中各播放段所对应的视频节目类型。Obtain the video program type corresponding to each playback segment in the currently to-be-played video file in sequence.
  11. 根据权利要求10所述的计算机设备,其中,所述将所述目标图像集输入至所述卷积神经网络,得到所述目标图像集中每一帧视频图像对应的节目类型之后,还包括:The computer device according to claim 10, wherein after the input of the target image set to the convolutional neural network to obtain the program type corresponding to each frame of the video image in the target image set, the method further comprises:
    若所述类型总数量等于1,获取所述当前待播放视频文件对应的视频节目类型。If the total number of types is equal to 1, the video program type corresponding to the currently to-be-played video file is acquired.
  12. 根据权利要求9所述的计算机设备,其中,所述若所述当前待播放视频文件不包括节目类型标签,将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型之前,或者实时若检测到视频选择指令,获取与所述视频选择指令对应的当前待播放视频文件之前,还包括:The computer device according to claim 9, wherein, if the currently to-be-played video file does not include a program type tag, input the selected multi-frame video image in the currently-to-be-played video file to a pre-trained volume The product neural network network, before obtaining the video program type, or if a video selection instruction is detected in real time, before obtaining the currently to-be-played video file corresponding to the video selection instruction, it also includes:
    对视频节目划分多个视频节目类型;Divide multiple video program types into video programs;
    获取每一个视频节目类型的视频样本;Obtain video samples of each video program type;
    提取所述视频样本中的每帧图像的图像特征数据作为训练集;Extracting image feature data of each frame of image in the video sample as a training set;
    将所述训练集输入至待训练卷积神经网络进行训练,得到卷积神经网络。The training set is input to the convolutional neural network to be trained for training to obtain the convolutional neural network.
  13. 根据权利要求10所述的计算机设备,其中,所述定位节目类型序列中所插入的每一分隔符对应的分隔时间点,包括:The computer device according to claim 10, wherein the separation time point corresponding to each separator inserted in the sequence of positioning program types comprises:
    获取每一分隔符之后排序第一位的节目类型对应视频段的起始播放时间,以每一分隔符之后排序第一位的节目类型对应视频段的起始播放时间为相应分隔符所对应的分隔时间点。The first program type sorted after each separator corresponds to the start playback time of the video segment, and the first program type sorted after each separator corresponds to the start playback time of the video segment corresponding to the corresponding separator Separate time points.
  14. 根据权利要求9所述的计算机设备,其中,所述将与所述当前待播放视频文件分解为对应的音视文件和/或视频文件,以组装成对应的播放数据,包括:8. The computer device according to claim 9, wherein the decomposing the currently to-be-played video file into corresponding audio-visual files and/or video files to assemble into corresponding playback data comprises:
    若所述当前待播放视频文件的播放方式信息所对应的播放方式中存在音频播放方式的第一播放段,获取第一播放段的音频数据以及对应音频播放控制参数;其中,音频播放控制参数用于控制用户终端的显示屏开启或关闭,或者是用于控制当前待播放视频文件对应的播放器在前端运行或后台挂起;If there is a first playback segment of the audio playback mode in the playback mode corresponding to the playback mode information of the currently to-be-played video file, the audio data of the first playback segment and the corresponding audio playback control parameters are acquired; wherein, the audio playback control parameter is used It is used to control the display screen of the user terminal to turn on or off, or it is used to control the player corresponding to the current video file to be played to run in the front end or hang in the background;
    若所述当前待播放视频文件的播放方式信息所对应的播放方式中存在视频播放方式的第二播放段,获取第二播放段的视频数据;If there is a second play segment of the video play mode in the play mode corresponding to the play mode information of the currently to-be-played video file, acquiring the video data of the second play segment;
    将第一播放段的音频数据以及音频播放控制参数、第二播放段的视频数据组装成播放数据。The audio data of the first playback segment and the audio playback control parameters, and the video data of the second playback segment are assembled into playback data.
  15. 一种计算机可读存储介质,其中,所述计算机可读存储介质存储有计算机程序,所述计算机程序当被处理器执行时使所述处理器执行基于视频类型的播放控制实现方法,其中,包括:A computer-readable storage medium, wherein the computer-readable storage medium stores a computer program that, when executed by a processor, causes the processor to execute a video type-based playback control implementation method, which includes :
    若检测到视频选择指令,获取与所述视频选择指令对应的当前待播放视频文件;If a video selection instruction is detected, obtain the current to-be-played video file corresponding to the video selection instruction;
    判断所述当前待播放视频文件是否包括视频节目类型标签;Judging whether the currently to-be-played video file includes a video program type tag;
    若所述当前待播放视频文件不包括节目类型标签,将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型;If the currently to-be-played video file does not include the program type tag, input the selected multiple frames of video images in the currently-to-be-played video file to a pre-trained convolutional neural network network to obtain the video program type;
    若所述当前待播放视频文件包括节目类型标签,根据节目类型标签获取与所述当前待播放视频文件对应的视频节目类型;If the currently to-be-played video file includes a program type tag, obtain the video program type corresponding to the currently-to-be-played video file according to the program type tag;
    根据当前待播放视频文件对应的视频节目类型及预先设置的视频类型播放策略,获取与所述当前待播放视频文件对应的播放方式信息并发送至对应的用户终端;其中,所述视频类型播放策略包括多种视频节目类型,及与各视频节目类型一一对应的播放方式信息;以及According to the video program type corresponding to the currently to-be-played video file and the preset video type play strategy, the play mode information corresponding to the currently-to-be-played video file is obtained and sent to the corresponding user terminal; wherein, the video type play strategy Including a variety of video program types, and one-to-one corresponding playback mode information for each video program type; and
    若检测到与所述当前待播放视频文件的播放方式信息对应的同意操作指令,将与所述当前待播放视频文件分解为对应的音视文件和/或视频文件,以组装成对应的播放数据并发送至用户终端。If the consent operation instruction corresponding to the playing mode information of the currently to-be-played video file is detected, the currently-to-be-played video file is decomposed into corresponding audio-visual files and/or video files to assemble into corresponding playback data And sent to the user terminal.
  16. 根据权利要求15所述的介质,其中,所述将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型,包括:The medium according to claim 15, wherein said inputting the selected multi-frame video images in the currently to-be-played video file into a pre-trained convolutional neural network to obtain the video program type comprises:
    按预设的间隔数获取所述当前待播放视频文件对应的视频图像集合中的多帧视频图像,以作为目标图像集;Acquiring multiple frames of video images in the video image set corresponding to the currently to-be-played video file according to a preset number of intervals, as a target image set;
    将所述目标图像集输入至所述卷积神经网络,得到所述目标图像集中每一帧视频图像对应的节目类型;Inputting the target image set to the convolutional neural network to obtain the program type corresponding to each frame of video image in the target image set;
    若所述类型总数量大于1,依序获取所述目标图像集中每一帧视频图像对应节目类型以组成节目类型序列,将所述节目类型序列中相邻两个节目类型存在不同的节目类型之间插入分隔符;If the total number of types is greater than 1, the program types corresponding to each frame of the video image in the target image set are sequentially obtained to form a program type sequence, and two adjacent program types in the program type sequence are divided into two different program types. Insert a separator between;
    定位节目类型序列中所插入的每一分隔符对应的分隔时间点,通过各分隔时间点将所述当前待播放视频文件划分为对应播放段;Locate the separation time point corresponding to each separator inserted in the program type sequence, and divide the current to-be-played video file into corresponding play segments through each separation time point;
    依序获取所述当前待播放视频文件中各播放段所对应的视频节目类型。Obtain the video program type corresponding to each playback segment in the currently to-be-played video file in sequence.
  17. 根据权利要求16所述的介质,其中,所述将所述目标图像集输入至所述卷积神经网络,得到所述目标图像集中每一帧视频图像对应的节目类型之后,还包括:The medium according to claim 16, wherein said inputting said target image set to said convolutional neural network to obtain the program type corresponding to each frame of video image in said target image set, further comprising:
    若所述类型总数量等于1,获取所述当前待播放视频文件对应的视频节目类型。If the total number of types is equal to 1, the video program type corresponding to the currently to-be-played video file is acquired.
  18. 根据权利要求15所述的介质,其中,所述若所述当前待播放视频文件不包括节目类型标签,将所述当前待播放视频文件中选定的多帧视频图像输入至预先训练的卷积神经网络网络,得到视频节目类型之前,或者实时若检测到视频选择指令,获取与所述视频选择指令对应的当前待播放视频文件之前,还包括:The medium according to claim 15, wherein, if the currently to-be-played video file does not include a program type tag, input the selected multi-frame video image in the currently-to-be-played video file to a pre-trained convolution The neural network network, before obtaining the video program type, or if a video selection instruction is detected in real time, before obtaining the currently to-be-played video file corresponding to the video selection instruction, it also includes:
    对视频节目划分多个视频节目类型;Divide multiple video program types into video programs;
    获取每一个视频节目类型的视频样本;Obtain video samples of each video program type;
    提取所述视频样本中的每帧图像的图像特征数据作为训练集;Extracting image feature data of each frame of image in the video sample as a training set;
    将所述训练集输入至待训练卷积神经网络进行训练,得到卷积神经网络。The training set is input to the convolutional neural network to be trained for training to obtain the convolutional neural network.
  19. 根据权利要求16所述的介质,其中,所述定位节目类型序列中所插入的每一分隔符对应的分隔时间点,包括:The medium according to claim 16, wherein the separation time point corresponding to each separator inserted in the sequence of positioning program types comprises:
    获取每一分隔符之后排序第一位的节目类型对应视频段的起始播放时间,以每一分隔符之后排序第一位的节目类型对应视频段的起始播放时间为相应分隔符所对应的分隔时间点。The first program type sorted after each separator corresponds to the start playback time of the video segment, and the first program type sorted after each separator corresponds to the start playback time of the video segment corresponding to the corresponding separator Separate time points.
  20. 根据权利要求15所述的介质,其中,所述将与所述当前待播放视频文件分解为对应的音视文件和/或视频文件,以组装成对应的播放数据,包括:The medium according to claim 15, wherein the decomposing the currently to-be-played video file into corresponding audio-visual files and/or video files to assemble into corresponding playback data comprises:
    若所述当前待播放视频文件的播放方式信息所对应的播放方式中存在音频播放方式的第一播放段,获取第一播放段的音频数据以及对应音频播放控制参数;其中,音频播放控制参数用于控制用户终端的显示屏开启或关闭,或者是用于控制当前待播放视频文件对应的播放器在前端运行或后台挂起;If there is a first playback segment of the audio playback mode in the playback mode corresponding to the playback mode information of the currently to-be-played video file, the audio data of the first playback segment and the corresponding audio playback control parameters are acquired; wherein, the audio playback control parameter is used It is used to control the display screen of the user terminal to turn on or off, or it is used to control the player corresponding to the current video file to be played to run in the front end or hang in the background;
    若所述当前待播放视频文件的播放方式信息所对应的播放方式中存在视频播放方式的第 二播放段,获取第二播放段的视频数据;If there is a second play segment of the video play mode in the play mode corresponding to the play mode information of the currently to-be-played video file, acquiring the video data of the second play segment;
    将第一播放段的音频数据以及音频播放控制参数、第二播放段的视频数据组装成播放数据。The audio data of the first playback segment and the audio playback control parameters, and the video data of the second playback segment are assembled into playback data.
PCT/CN2020/087026 2019-09-11 2020-04-26 Video type-based playback control implementation method and apparatus, and computer device WO2021047181A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910858320.3 2019-09-11
CN201910858320.3A CN110740343B (en) 2019-09-11 2019-09-11 Video type-based play control implementation method and device and computer equipment

Publications (1)

Publication Number Publication Date
WO2021047181A1 true WO2021047181A1 (en) 2021-03-18

Family

ID=69267913

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/087026 WO2021047181A1 (en) 2019-09-11 2020-04-26 Video type-based playback control implementation method and apparatus, and computer device

Country Status (2)

Country Link
CN (1) CN110740343B (en)
WO (1) WO2021047181A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110740343B (en) * 2019-09-11 2022-08-26 深圳壹账通智能科技有限公司 Video type-based play control implementation method and device and computer equipment

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103024490A (en) * 2012-12-26 2013-04-03 北京奇艺世纪科技有限公司 Method and device supporting independent playing of audio and video
CN104331442A (en) * 2014-10-24 2015-02-04 华为技术有限公司 Video classification method and device
US20160127793A1 (en) * 2014-02-14 2016-05-05 Pluto Inc. Methods and systems for generating and providing program guides and content
CN105979355A (en) * 2015-12-10 2016-09-28 乐视网信息技术(北京)股份有限公司 Method and device for playing video
CN107094263A (en) * 2017-03-13 2017-08-25 华为技术有限公司 A kind of video broadcasting method, user terminal and server
CN107291416A (en) * 2017-06-20 2017-10-24 广东小天才科技有限公司 A kind of audio frequency playing method, system and terminal device
CN108647571A (en) * 2018-03-30 2018-10-12 国信优易数据有限公司 Video actions disaggregated model training method, device and video actions sorting technique
CN109257622A (en) * 2018-11-01 2019-01-22 广州市百果园信息技术有限公司 A kind of audio/video processing method, device, equipment and medium
CN110740343A (en) * 2019-09-11 2020-01-31 深圳壹账通智能科技有限公司 Video type-based play control implementation method and device and computer equipment

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080133696A1 (en) * 2006-12-04 2008-06-05 Hanebeck Hanns-Christian Leemo Personal multi-media playing system
CN101695123A (en) * 2009-01-23 2010-04-14 李冬 Media broadcasting system and method for respectively customizing broadcast strategy for media segment
KR20160045441A (en) * 2014-10-17 2016-04-27 삼성전자주식회사 Method and apparatus for playing video
CN106534962A (en) * 2016-10-11 2017-03-22 腾讯科技(北京)有限公司 Television content playing method and device
CN106973317A (en) * 2016-12-30 2017-07-21 华为软件技术有限公司 Multimedia data processing method, multimedia data providing method, apparatus and system
CN107995523B (en) * 2017-12-21 2019-09-03 Oppo广东移动通信有限公司 Video broadcasting method, device, terminal and storage medium
CN109361949B (en) * 2018-11-27 2020-08-25 Oppo广东移动通信有限公司 Video processing method, video processing device, electronic equipment and storage medium

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103024490A (en) * 2012-12-26 2013-04-03 北京奇艺世纪科技有限公司 Method and device supporting independent playing of audio and video
US20160127793A1 (en) * 2014-02-14 2016-05-05 Pluto Inc. Methods and systems for generating and providing program guides and content
CN104331442A (en) * 2014-10-24 2015-02-04 华为技术有限公司 Video classification method and device
CN105979355A (en) * 2015-12-10 2016-09-28 乐视网信息技术(北京)股份有限公司 Method and device for playing video
CN107094263A (en) * 2017-03-13 2017-08-25 华为技术有限公司 A kind of video broadcasting method, user terminal and server
CN107291416A (en) * 2017-06-20 2017-10-24 广东小天才科技有限公司 A kind of audio frequency playing method, system and terminal device
CN108647571A (en) * 2018-03-30 2018-10-12 国信优易数据有限公司 Video actions disaggregated model training method, device and video actions sorting technique
CN109257622A (en) * 2018-11-01 2019-01-22 广州市百果园信息技术有限公司 A kind of audio/video processing method, device, equipment and medium
CN110740343A (en) * 2019-09-11 2020-01-31 深圳壹账通智能科技有限公司 Video type-based play control implementation method and device and computer equipment

Also Published As

Publication number Publication date
CN110740343B (en) 2022-08-26
CN110740343A (en) 2020-01-31

Similar Documents

Publication Publication Date Title
US9378770B2 (en) Systems and methods of facilitating installment-by-installment consumption of discrete installments of a unitary media program
CN108024079B (en) Screen recording method, device, terminal and storage medium
WO2018000624A1 (en) Video playing control method and device
US10897637B1 (en) Synchronize and present multiple live content streams
US20150256885A1 (en) Method for determining content for a personal channel
US20150156227A1 (en) Synchronize Tape Delay and Social Networking Experience
US10904616B2 (en) Filtering of content in near real time
WO2017101510A1 (en) Video processing method and apparatus
CN109729429B (en) Video playing method, device, equipment and medium
US20130325963A1 (en) Information processing device, information processing method, and program
CN107333163A (en) A kind of method for processing video frequency and device, a kind of terminal and storage medium
US20230289622A1 (en) Volume recommendation method and apparatus, device and storage medium
US9930412B2 (en) Network set-top box and its operating method
WO2021047181A1 (en) Video type-based playback control implementation method and apparatus, and computer device
CN111698261B (en) Video playing method, device, equipment and storage medium based on streaming media
US10110951B2 (en) Coordinated background TV recognition
WO2023060759A1 (en) Video pushing method, device, and storage medium
US20170285912A1 (en) Methods, systems, and media for media guidance
CN106205654B (en) Audio-video data processing method and device
CN111343391A (en) Video capture method and electronic device using same
US11490148B2 (en) Systems and methods to determine when to rejoin a live media broadcast after an interruption
JP6363015B2 (en) Electronic device and display method
US10911822B2 (en) Device-based detection of ambient media to be used by a server to selectively provide secondary content to the device
US11729480B2 (en) Systems and methods to enhance interactive program watching
TWI669960B (en) Television and method for filtering television program

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20862887

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 20/07/2022)

122 Ep: pct application non-entry in european phase

Ref document number: 20862887

Country of ref document: EP

Kind code of ref document: A1