WO2017101510A1 - Video processing method and apparatus - Google Patents

Video processing method and apparatus Download PDF

Info

Publication number
WO2017101510A1
WO2017101510A1 PCT/CN2016/097217 CN2016097217W WO2017101510A1 WO 2017101510 A1 WO2017101510 A1 WO 2017101510A1 CN 2016097217 W CN2016097217 W CN 2016097217W WO 2017101510 A1 WO2017101510 A1 WO 2017101510A1
Authority
WO
WIPO (PCT)
Prior art keywords
video stream
advertisement content
file
video
player
Prior art date
Application number
PCT/CN2016/097217
Other languages
French (fr)
Chinese (zh)
Inventor
董春
Original Assignee
乐视控股(北京)有限公司
乐视致新电子科技(天津)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 乐视控股(北京)有限公司, 乐视致新电子科技(天津)有限公司 filed Critical 乐视控股(北京)有限公司
Publication of WO2017101510A1 publication Critical patent/WO2017101510A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • H04N21/4396Processing of audio elementary streams by muting the audio signal
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/442Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/442Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
    • H04N21/44204Monitoring of content usage, e.g. the number of times a movie has been viewed, copied or the amount which has been watched
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/458Scheduling content for creating a personalised stream, e.g. by combining a locally stored advertisement with an incoming stream; Updating operations, e.g. for OS modules ; time-related management operations
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/458Scheduling content for creating a personalised stream, e.g. by combining a locally stored advertisement with an incoming stream; Updating operations, e.g. for OS modules ; time-related management operations
    • H04N21/4586Content update operation triggered locally, e.g. by comparing the version of software modules in a DVB carousel to the version stored locally
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • H04N21/812Monomedia components thereof involving advertisement data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments

Definitions

  • the present invention relates to the field of Internet technologies, and in particular, to a video processing method and apparatus.
  • the videos that general users watch are mainly divided into two categories: on-demand and live.
  • on-demand the user can perform fast-forward or fast-rewind operations on the viewed video according to his own needs during viewing, and for the live-type video, the user cannot perform any operation on the video while watching.
  • some advertisements are inserted during video playback, such as insertion before the video is played or inserted during the video playback.
  • users can filter out some of the ads in the video by fast-forwarding, or filter out all the ads in the video through some existing software that filters ads.
  • the embodiments of the present invention provide a video processing method and device, which are used to solve the problem that the existing users have poor flexibility in watching video.
  • An embodiment of the present invention provides a video processing method, which is applied to a client, and includes:
  • the advertisement content exists, opening a local audio file when the advertisement content is played, the local audio file being an audio file stored in a local storage system;
  • the player is tuned to a silent mode, and the player is used to play the video stream.
  • An embodiment of the present invention provides an apparatus for video processing, including:
  • a determining unit configured to determine whether an advertisement content exists in the video stream
  • An opening unit configured to: when the advertisement content is present, open a local audio file when the advertisement content is played, where the local audio file is an audio file stored in a local storage system;
  • an adjustment unit configured to adjust the player to a silent mode, the player is configured to play the video stream.
  • Embodiments of the present invention also provide an electronic device, including: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor The instructions are executed by the at least one processor to enable the at least one processor to perform the method of video processing described above herein.
  • Embodiments of the present invention also provide a non-transitory computer readable storage medium storing computer instructions for causing the computer to perform the above video processing method of the present application .
  • Embodiments of the present invention also provide a computer program product, the computer program product comprising a computing program stored on a non-transitory computer readable storage medium, the computer program comprising program instructions, when the program instructions are executed by a computer
  • the computer is caused to perform the above-described video processing method of the present application.
  • the method and device for video processing provided by the embodiment of the present invention can first determine whether there is advertisement content in the currently played video stream, and if there is advertisement content, when the advertisement content is played, the local audio file is turned on, and the video is played. The streamed player is muted to silent mode.
  • the embodiment of the present invention can turn on a local audio file when playing an advertisement, and does not affect the normal display of the advertisement content, and realizes the effect of replacing the advertisement sound with the local audio sound. This method allows users to enjoy the content of their favorite audio files while watching an ad. This increases the flexibility of the user when watching the video.
  • FIG. 1 is a flowchart of a method for video processing according to an embodiment of the present invention
  • FIG. 2 is a flowchart of another method for video processing according to an embodiment of the present invention.
  • FIG. 3 is a flowchart of still another method for video processing according to an embodiment of the present invention.
  • FIG. 4 is a block diagram of a device for video processing according to an embodiment of the present invention.
  • FIG. 5 is a structural block diagram of another apparatus for video processing according to an embodiment of the present invention.
  • FIG. 6 is a schematic structural diagram of a physical structure of a video processing apparatus according to an embodiment of the present disclosure.
  • FIG. 7 is a schematic structural diagram of hardware of an electronic device for performing a video processing method according to an embodiment of the present invention.
  • the embodiment of the invention provides a method for video processing. As shown in FIG. 1 , the method includes:
  • the video stream in this embodiment refers to a live video stream that conforms to the HLS (HTTP Live Streaming, HLS) protocol.
  • HLS is a dynamic rate adaptive technology, mainly used for audio and video services of computer PCs and Apple Apple terminals.
  • the HLS mainly includes an M3U8 playlist file, the M3U8 file is an index file, and a TS (Transport Stream, TS) media slice file.
  • TS is also a form of video encapsulation that is suitable for live broadcast.
  • the advertisement in the live video stream is processed. Therefore, it is first determined whether there is an advertisement in the video stream, and preparation for the subsequent steps of processing the advertisement.
  • the local audio file is opened when the advertising content is played.
  • the local audio file is opened when the advertisement content is played, and the local audio file is opened to replace the sound of the advertisement content in the video stream, so that the user can enjoy the audio file that he likes.
  • the local audio file is an audio file stored in a local storage system, wherein the recorded content is an audio file added by the user according to his or her favorite, and may be a song, a crosstalk, a recording, or the like.
  • the currently displayed advertisement content will not be deleted, but the sound of the advertisement is replaced, which is equivalent to the replacement of the background music, and the advertisement will continue to be displayed.
  • the player When the local audio file is turned on, the player's sound mode is muted, which is also a necessary step to replace the sound in the local audio file with the commercial sound. Otherwise it will affect the appreciation of the local audio file.
  • the player is a player for playing a video stream.
  • the player can be a network television, a mobile phone, or various video playing software installed on a computer.
  • the method for video processing provided by the embodiment of the present invention can first determine whether there is advertisement content in the currently played video stream, and if there is advertisement content, when the advertisement content is played, the local audio file is turned on, and the video stream is played. The player is muted to silent mode.
  • the embodiment of the present invention can turn on a local audio file when playing an advertisement, and does not affect the normal display of the advertisement content, and realizes the effect of replacing the advertisement sound with the local audio sound. This method allows users to enjoy the content of their favorite audio files while watching an ad. This increases the flexibility of the user when watching the video.
  • the present invention also provides another embodiment.
  • the method for video processing in this embodiment includes:
  • the playlist file is an M3U8 file, which is an index file when the video stream is played.
  • the index of the video file to be played is recorded in the M3U8 file.
  • the index of the new video file is reloaded until all the video files are played.
  • the content in the M3U8 file is dynamically changed, so it is necessary to monitor the updated M3U8 file.
  • the video file therein refers to the TS fragment file.
  • EXT-X-DISCONTINUITY When the coded discontinuity identifier EXT-X-DISCONTINUITY appears in the M3U8 file, the content before and after the EXT-X-DISCONTINUITY is discontinuous, so it can be guessed that EXT-X-DISCONTINUITY is followed by other content, possibly advertising content. , so monitor the EXT-X-DISCONTINUITY logo.
  • EXT-X-DISCONTINUITY After monitoring the EXT-X-DISCONTINUITY flag, in order to verify whether the advertisement content is added after EXT-X-DISCONTINUITY, it is necessary to find out the corresponding first video file after the EXT-X-DISCONTINUITY, that is, the TS file, and then extract Keyframes in the TS file to facilitate comparison with frame data in the ad database.
  • the specific filtering principle is to remove the non-information frames in all key frames.
  • the no-information frame refers to a frame without image content, such as a black frame.
  • the key frames obtained in step 202 are compared with the frame data in the advertisement database, and there are two ways to compare, as follows:
  • the key frame is compared with the key frame in the advertisement database by one frame and one frame.
  • a preset comparison number is set, and the preset comparison number is an experience value obtained after repeated experiments. Assume that the preset number of comparisons is N, and N is a positive integer greater than 0.
  • the continuous M key frames are not the same as the frame data content after the N times, it is determined that there is no advertisement content in the video stream, that is, the first TS file after the EXT-X-DISCONTINUITY is the non-advertising content.
  • the design principle of the comparison model is basically the same as the principle of the first method. The difference is that the comparison model dynamically acquires an N according to the input key frame and frame data, and the value of the N can be an artificial intelligence algorithm. Learned by training.
  • the local audio file is turned on when the first video file after the coded discontinuous identifier is played.
  • the local audio file is opened when the first TS file after EXT-X-DISCONTINUITY is played.
  • the implementation manner of the local audio file is the same as that of the step 102 in FIG. 1 , and details are not described herein again.
  • the present invention also provides another embodiment.
  • the method for video processing in this embodiment includes:
  • step 201 in FIG. 2 The implementation of this step is the same as the implementation of step 201 in FIG. 2, and details are not described herein again.
  • the coded discontinuity identifier in this embodiment is EXT-X-DISCONTINUITY, and the ID corresponding to the video file after EXT-X-DISCONTINUITY is acquired. This ID is a unique identifier that distinguishes different video content.
  • the ID obtained by step 302 is compared with a preset identification list.
  • the ID of all advertisements is recorded in the preset ID.
  • the ID can be found in the preset identifier list, it is determined that the advertisement content exists in the video stream, that is, the video file after the EXT-X-DISCONTINUITY is the advertisement content;
  • the title of the advertisement is also recorded in the identification list, wherein the title and the ID are in a one-to-one correspondence, and the record of the title is for readability when the user modifies the identification list.
  • the local audio file is turned on when the first video file after the coded discontinuous identifier is played.
  • step 204 The implementation of this step is the same as the implementation of step 204 in FIG. 2, and details are not described herein again.
  • step 205 The implementation of this step is the same as the implementation of step 205 in FIG. 2, and details are not described herein again.
  • step 206 The implementation of this step is the same as the implementation of step 206 in FIG. 2, and details are not described herein again.
  • the specific replacement method is: Set the list of identifiers to update, that is, add videos that you don't like in the list of preset identifiers.
  • the specific way of adding is: when the user is watching the video, if he encounters the video that he does not like, he can add the corresponding title and ID to the preset identification list by selecting the option added to the identification list in the menu.
  • the options added to the list are provided by the system.
  • other implementations are the same as those of FIG. The implementation is the same and will not be described here.
  • the preset identifier list in FIG. 3 is set by the user, so the user can add the non-advertising video or advertisement video that is not like to the preset identifier list, and can also use the favorite non-advertising video.
  • the advertisement video is deleted from the preset identification list.
  • the specific deletion method is to find the title of the favorite video in the identification list, and then delete it, and the deleted video will not be replaced by the local audio file when the next time the video is played. .
  • FIG. 4 another embodiment of the present invention further provides a video processing apparatus.
  • the apparatus includes: determining The unit 41, the opening unit 42, and the adjusting unit 43.
  • the determining unit 41 is configured to determine whether the advertisement content exists in the video stream.
  • the video stream in this embodiment refers to a live video stream that conforms to the HLS protocol.
  • HLS is a dynamic rate adaptive technology, mainly used for audio and video services of computer PCs and Apple Apple terminals.
  • the HLS mainly includes an M3U8 playlist file, the M3U8 file is an index file, and a TS media slice file.
  • TS is also a form of video encapsulation that is suitable for live broadcast.
  • the advertisement in the live video stream is processed. Therefore, it is first determined whether there is an advertisement in the video stream, and preparation for the subsequent steps of processing the advertisement.
  • the opening unit 42 is configured to open a local audio file when the advertisement content is played if the advertisement content exists, and the local audio file is an audio file stored in the local storage system.
  • the local audio file is opened when the advertisement content is played, and the local audio file is opened to replace the sound of the advertisement content in the video stream, so that the user can enjoy the audio file that he likes.
  • the local audio file is an audio file stored in a local storage system, wherein the recorded content is an audio file added by the user according to his or her favorite, and may be a song, a crosstalk, a recording, or the like.
  • the adjusting unit 43 is configured to adjust the player to the silent mode, and the player is used to play the video stream.
  • the player When the local audio file is turned on, the player's sound mode is muted, which is also a necessary step to replace the sound in the local audio file with the commercial sound. Otherwise it will affect the appreciation of the local audio file.
  • the player is a player for playing a video stream.
  • the player can be a network television, a mobile phone, or various video playing software installed on a computer.
  • the determining unit 41 includes:
  • the monitoring module 411 is configured to monitor the coded discontinuity identifier in the updated playlist file.
  • the audio file list is a playlist file corresponding to the video stream.
  • the content in the M3U8 file is dynamically changed, so the updated M3U8 file needs to be monitored.
  • the video file therein refers to the TS fragment file.
  • EXT-X-DISCONTINUITY When the coded discontinuity identifier EXT-X-DISCONTINUITY appears in the M3U8 file, the content before and after the EXT-X-DISCONTINUITY is discontinuous, so it can be guessed that EXT-X-DISCONTINUITY is followed by other content, possibly advertising content. , so monitor the EXT-X-DISCONTINUITY logo.
  • the extracting module 412 is configured to extract key frames in the first video file corresponding to the encoded discontinuous identifier.
  • EXT-X-DISCONTINUITY After monitoring the EXT-X-DISCONTINUITY flag, in order to verify whether the advertisement content is added after EXT-X-DISCONTINUITY, it is necessary to find out the corresponding first video file after the EXT-X-DISCONTINUITY, that is, the TS file, and then extract Keyframes in the TS file to facilitate comparison with frame data in the ad database.
  • the determining module 413 is configured to determine whether the advertisement content exists in the video stream by comparing the frame data in the key frame and the advertisement database.
  • the determining module 413 is configured to:
  • the determining module 413 is configured to:
  • the key frame and the frame data are input into the comparison model for comparison, and the similarity is obtained, and the similarity is used to describe the degree of similarity between the key frame and the frame data;
  • Whether the advertisement content exists in the video stream is judged according to the similarity.
  • the apparatus further includes:
  • the removing unit 44 is configured to remove the non-information frame in the key frame before determining whether the advertisement content exists in the video stream by comparing the frame data in the key frame and the advertisement database, and the no information frame is a frame without image content.
  • the determining unit 41 includes:
  • the monitoring module 411 is configured to monitor the coded discontinuity identifier in the updated playlist file.
  • the audio file list is a playlist file corresponding to the video stream.
  • the content in the M3U8 file is dynamically changed, so the updated M3U8 file needs to be monitored.
  • the video file therein refers to the TS fragment file.
  • EXT-X-DISCONTINUITY When the coded discontinuity identifier EXT-X-DISCONTINUITY appears in the M3U8 file, the content before and after the EXT-X-DISCONTINUITY is discontinuous, so it can be guessed that EXT-X-DISCONTINUITY is followed by other content, possibly advertising content. , so monitor the EXT-X-DISCONTINUITY logo.
  • the obtaining module 414 is configured to obtain an ID corresponding to the video file after the coded discontinuity identifier.
  • the coded discontinuity identifier in this embodiment is EXT-X-DISCONTINUITY, and the ID corresponding to the video file after EXT-X-DISCONTINUITY is acquired. This ID is a unique identifier that distinguishes different video content.
  • the comparison module 415 is configured to compare the ID with a preset identifier list, and the preset identifier list is used to record IDs of all advertisements.
  • the determining module 416 is configured to determine that the advertisement content exists in the video stream if the ID is found in the preset identifier list; if the ID cannot be found in the preset identifier list, it is determined that the advertisement content does not exist in the video stream.
  • the ID can be found in the preset identifier list, it is determined that the advertisement content exists in the video stream, that is, the video file after the EXT-X-DISCONTINUITY is the advertisement content;
  • the opening unit 42 is configured to: when playing the first video file after the coded discontinuous identifier, turn on the local audio file.
  • the local audio file is opened when the first TS file after EXT-X-DISCONTINUITY is played.
  • the adjusting unit 43 is configured to: send a silence request to the player to adjust the player to the silent mode.
  • the apparatus further includes:
  • the closing unit 45 is configured to: after adjusting the player to the silent mode, after detecting the next coded discontinuity identifier in the playlist file, closing the local audio file;
  • the unit 46 is turned on for turning on the sound of the player.
  • the device for video processing according to the embodiment of the present invention can first determine whether there is advertisement content in the currently played video stream, and if there is advertisement content, when the advertisement content is played, the local audio file is turned on, and the video stream is played. The player is muted to silent mode.
  • the embodiment of the present invention can turn on a local audio file when playing an advertisement, and does not affect the normal display of the advertisement content, and realizes the effect of replacing the advertisement sound with the local audio sound. This method allows users to enjoy the content of their favorite audio files while watching an ad. This increases the flexibility of the user when watching the video.
  • FIG. 6 is a schematic diagram showing the physical structure of a video processing apparatus according to an embodiment of the present invention.
  • the physical structure may include a processor 61 and a communications interface. 62.
  • Communication interface 62 can be used for information transfer between the server and the client.
  • the processor 61 may call the logic instructions in the memory 63 to perform a method of determining whether there is advertising content in the video stream; if the advertising content is present, turning on the local audio file when the advertising content is played, the local audio
  • the file is an audio file stored in a local storage system; and the player is tuned to a silent mode, the player is for playing the video stream.
  • the logic instructions in the memory 63 described above may be implemented in the form of a software functional unit and sold or used as a stand-alone product, and may be stored in a computer readable storage medium.
  • the computer software product is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention.
  • the foregoing storage medium includes: a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like. .
  • FIG. 7 is a schematic structural diagram of hardware of an electronic device for performing a video processing method according to an embodiment of the present disclosure. As shown in FIG. 7, the device includes:
  • One or more processors 71 and a memory 72 are exemplified by a processor 71 in FIG.
  • the apparatus that performs the method of video processing may further include: an input device 73 and an output device 74.
  • the processor 71, the memory 72, the input device 73, and the output device 74 may be connected by a bus or other means, as exemplified by a bus connection in FIG.
  • the memory 72 is a non-volatile computer readable storage medium, and can be used for storing a non-volatile software program, a non-volatile computer executable program, and a module, such as a program corresponding to the video processing method in the embodiment of the present application.
  • An instruction/module (for example, the determination unit 41, the opening unit 42, and the adjustment unit 43 shown in FIG. 4).
  • the processor 71 executes various functional applications of the server and data processing by executing non-volatile software programs, instructions, and modules stored in the memory 72, that is, a method of implementing video processing of the above-described method embodiments.
  • the memory 72 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function; the storage data area may store data created according to use of the video processing device, and the like.
  • memory 72 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device.
  • memory 72 can optionally include memory remotely located relative to processor 71, which can be connected to the video processing device over a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • Input device 73 can receive the input digital or character information and generate key signal inputs related to user settings and function control of the video processing device.
  • Output device 74 can include a display device such as a display screen.
  • the one or more modules are stored in the memory 72, and when executed by the one or more processors 71, perform the method of video processing in any of the above method embodiments.
  • the electronic device of the embodiment of the present application exists in various forms, including but not limited to:
  • Mobile communication devices These devices are characterized by mobile communication functions and are mainly aimed at providing voice and data communication.
  • Such terminals include: smart phones (such as iPhone), multimedia phones, functional phones, and low-end phones.
  • Ultra-mobile personal computer equipment This type of equipment belongs to the category of personal computers, has computing and processing functions, and generally has mobile Internet access.
  • Such terminals include: PDAs, MIDs, and UMPC devices, such as the iPad.
  • Portable entertainment devices These devices can display and play multimedia content. Such devices include: audio, video processing devices (such as iPod), handheld game consoles, e-books, and smart toys and portable car navigation devices.
  • the server consists of a processor, a hard disk, a memory, a system bus, etc.
  • the server is similar to a general-purpose computer architecture, but because of the need to provide highly reliable services, processing power and stability High reliability in terms of reliability, security, scalability, and manageability.
  • the embodiment of the present application further provides a non-transitory computer storage medium, where the computer storage medium stores computer executable instructions, which can perform the video processing method in any of the foregoing method embodiments.
  • the storage medium may be a magnetic disk, an optical disk, a read only memory (ROM), or a random access memory (RAM).
  • the device embodiments described above are merely illustrative, wherein the units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, ie may be located A place, or it can be distributed to multiple network units. Some or all of the modules can be selected according to actual needs. The purpose of the solution of this embodiment. Those of ordinary skill in the art can understand and implement without deliberate labor.

Abstract

Provided in the embodiments of the present invention are a video processing method and apparatus, the method of the present invention primarily comprising: determining whether advertising content is present in a video stream; if advertising content is present, then opening a local audio file when playing said advertising content, said local audio file being an audio file stored in the local storage system; adjusting the player to a silent mode, said player being used for playing the video stream. Compared to the prior art, the present invention can improve the flexibility of a user when watching videos.

Description

视频处理的方法及装置Video processing method and device
本申请基于申请号为201510959280.3、申请日为2015年12月18日的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的全部内容在此引入本申请作为参考。The present application is based on a Chinese patent application filed on Jan. 18, 2015, the entire disclosure of which is hereby incorporated by reference.
技术领域Technical field
本发明涉及互联网技术领域,尤其涉及一种视频处理的方法及装置。The present invention relates to the field of Internet technologies, and in particular, to a video processing method and apparatus.
背景技术Background technique
随着互联网技术的发展,越来越多的用户习惯通过电脑、手机等观看视频,通过本地安装的客户端的播放器或者网页上植入的播放器观看。一般用户观看的视频主要分为两类:点播类和直播类。对于点播类的视频,用户在观看的时候可以根据自己需求,对观看的视频进行快进或快退等操作;而对于直播类的视频,用户在观看的时候不能对视频进行任何操作。通常处于商业目的考虑,在视频播放过程中会插入一些广告,例如在视频播放前插入或者在视频播放的过程中插入。With the development of Internet technology, more and more users are accustomed to watching videos through computers, mobile phones, etc., through a locally installed client player or a player embedded on a web page. The videos that general users watch are mainly divided into two categories: on-demand and live. For the video of the on-demand type, the user can perform fast-forward or fast-rewind operations on the viewed video according to his own needs during viewing, and for the live-type video, the user cannot perform any operation on the video while watching. Often for commercial purposes, some advertisements are inserted during video playback, such as insertion before the video is played or inserted during the video playback.
对于点播类视频,用户可以通过快进的方式过滤掉视频中的一些广告,也可以通过现有的一些过滤广告的软件将视频中的所有广告过滤掉。For on-demand video, users can filter out some of the ads in the video by fast-forwarding, or filter out all the ads in the video through some existing software that filters ads.
但是对于直播类的视频,目前还没有对其中的广告进行处理的方法。因此当用户观看时,不能对观看的视频进行任何操作,灵活度较差,从而影响用户的观看效果。因此如何处理直播视频中插入的广告,提高用户的观看的灵活度成为急需解决的问题。However, for live video, there is currently no way to process the ads. Therefore, when the user views, no operation can be performed on the viewed video, and the flexibility is poor, thereby affecting the user's viewing effect. Therefore, how to deal with the advertisement inserted in the live video and improve the flexibility of the user's viewing become an urgent problem to be solved.
发明内容Summary of the invention
本发明实施例提供一种视频处理的方法及装置,用以解决现有用户观看视频的灵活度差的问题。The embodiments of the present invention provide a video processing method and device, which are used to solve the problem that the existing users have poor flexibility in watching video.
本发明实施例提供一种视频处理的方法,应用于客户端,包括: An embodiment of the present invention provides a video processing method, which is applied to a client, and includes:
判断视频流中是否存在广告内容;Determining whether there is advertising content in the video stream;
若存在所述广告内容,则在播放所述广告内容时开启本地音频文件,所述本地音频文件为存储在本地存储系统中的音频文件;并且,If the advertisement content exists, opening a local audio file when the advertisement content is played, the local audio file being an audio file stored in a local storage system;
将播放器调为静音模式,所述播放器用于播放所述视频流。The player is tuned to a silent mode, and the player is used to play the video stream.
本发明实施例提供一种视频处理的装置,包括:An embodiment of the present invention provides an apparatus for video processing, including:
判断单元,用于判断视频流中是否存在广告内容;a determining unit, configured to determine whether an advertisement content exists in the video stream;
开启单元,用于若存在所述广告内容,则在播放所述广告内容时开启本地音频文件,所述本地音频文件为存储在本地存储系统中的音频文件;An opening unit, configured to: when the advertisement content is present, open a local audio file when the advertisement content is played, where the local audio file is an audio file stored in a local storage system;
调节单元,用于将播放器调为静音模式,所述播放器用于播放所述视频流。And an adjustment unit, configured to adjust the player to a silent mode, the player is configured to play the video stream.
本发明实施例还提供了一种电子设备,包括:至少一个处理器;以及与所述至少一个处理器通信连接的存储器;其中,所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行本申请上述视频处理的方法。Embodiments of the present invention also provide an electronic device, including: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor The instructions are executed by the at least one processor to enable the at least one processor to perform the method of video processing described above herein.
本发明实施例还提供了一种非暂态计算机可读存储介质,所述非暂态计算机可读存储介质存储计算机指令,所述计算机指令用于使所述计算机执行本申请上述视频处理的方法。Embodiments of the present invention also provide a non-transitory computer readable storage medium storing computer instructions for causing the computer to perform the above video processing method of the present application .
本发明实施例还提供了一种计算机程序产品,所述计算机程序产品包括存储在非暂态计算机可读存储介质上的计算程序,所述计算机程序包括程序指令,当所述程序指令被计算机执行时,使所述计算机执行本申请上述视频处理的方法。Embodiments of the present invention also provide a computer program product, the computer program product comprising a computing program stored on a non-transitory computer readable storage medium, the computer program comprising program instructions, when the program instructions are executed by a computer The computer is caused to perform the above-described video processing method of the present application.
本发明实施例提供的视频处理的方法及装置,能够首先判断当前播放的视频流中是否存在广告内容,若存在广告内容,则在播放该广告内容时,开启本地的音频文件,并且将播放视频流的播放器调为静音模式。与现有技术相比,本发明实施例能够在播放广告时,将本地的音频文件开启,并且不影响广告内容的正常显示,实现将广告声音替换为本地音频声音的效果。该方法使用户在观看广告时可以欣赏自己喜欢的音频文件中的内容。因此提高了用户在观看视频时的灵活度。 The method and device for video processing provided by the embodiment of the present invention can first determine whether there is advertisement content in the currently played video stream, and if there is advertisement content, when the advertisement content is played, the local audio file is turned on, and the video is played. The streamed player is muted to silent mode. Compared with the prior art, the embodiment of the present invention can turn on a local audio file when playing an advertisement, and does not affect the normal display of the advertisement content, and realizes the effect of replacing the advertisement sound with the local audio sound. This method allows users to enjoy the content of their favorite audio files while watching an ad. This increases the flexibility of the user when watching the video.
附图说明DRAWINGS
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作一简单地介绍,显而易见地,下面描述中的附图是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, a brief description of the drawings used in the embodiments or the prior art description will be briefly described below. Obviously, the drawings in the following description It is a certain embodiment of the present invention, and other drawings can be obtained from those skilled in the art without any creative work.
图1为本发明实施例提供的一种视频处理的方法的流程图;FIG. 1 is a flowchart of a method for video processing according to an embodiment of the present invention;
图2为本发明实施例提供的另一种视频处理的方法的流程图;2 is a flowchart of another method for video processing according to an embodiment of the present invention;
图3为本发明实施例提供的又一种视频处理的方法的流程图;FIG. 3 is a flowchart of still another method for video processing according to an embodiment of the present invention;
图4为本发明实施例提供的一种视频处理的装置的组成框图;4 is a block diagram of a device for video processing according to an embodiment of the present invention;
图5为本发明实施例提供的另一种视频处理的装置的组成框图;FIG. 5 is a structural block diagram of another apparatus for video processing according to an embodiment of the present invention;
图6为本发明实施例提供的一种视频处理装置的实体结构示意图;FIG. 6 is a schematic structural diagram of a physical structure of a video processing apparatus according to an embodiment of the present disclosure;
图7为本发明实施例提供的执行视频处理的方法的电子设备的硬件结构示意图。FIG. 7 is a schematic structural diagram of hardware of an electronic device for performing a video processing method according to an embodiment of the present invention.
具体实施方式detailed description
为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described in conjunction with the drawings in the embodiments of the present invention. It is a partial embodiment of the invention, and not all of the embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
本发明实施例提供了一种视频处理的方法,如图1所示,该方法包括:The embodiment of the invention provides a method for video processing. As shown in FIG. 1 , the method includes:
101、判断视频流中是否存在广告内容。101. Determine whether an advertisement content exists in the video stream.
本实施例中的视频流是指遵循HLS(HTTP Live Streaming,简称HLS)协议的直播视频流。HLS是动态码率自适应技术,主要用于电脑PC和苹果Apple终端的音视频服务。HLS主要包括一个M3U8播放列表文件,M3U8文件是索引文件;以及TS(Transport Stream,简称TS)媒体分片文件等。TS也是一种视频的封装的形式,它适用于直播。The video stream in this embodiment refers to a live video stream that conforms to the HLS (HTTP Live Streaming, HLS) protocol. HLS is a dynamic rate adaptive technology, mainly used for audio and video services of computer PCs and Apple Apple terminals. The HLS mainly includes an M3U8 playlist file, the M3U8 file is an index file, and a TS (Transport Stream, TS) media slice file. TS is also a form of video encapsulation that is suitable for live broadcast.
本实施例是对直播视频流中的广告进行处理,因此首先要判断视频流中是否存在广告,为后续的处理广告的步骤做准备工作。 In this embodiment, the advertisement in the live video stream is processed. Therefore, it is first determined whether there is an advertisement in the video stream, and preparation for the subsequent steps of processing the advertisement.
102、若存在广告内容,则在播放广告内容时开启本地音频文件。102. If there is advertising content, the local audio file is opened when the advertising content is played.
若确定视频流中存在广告内容,则在播放广告内容时开启本地音频文件,开启本地音频文件是为了替换视频流中广告内容的声音,使用户可以欣赏自己喜欢的音频文件。其中,本地音频文件是存储在本地存储系统中的音频文件,其中记录的内容是用户根据自己的喜欢添加的音频文件,可以是歌曲、相声、录音等。If it is determined that the advertisement content exists in the video stream, the local audio file is opened when the advertisement content is played, and the local audio file is opened to replace the sound of the advertisement content in the video stream, so that the user can enjoy the audio file that he likes. The local audio file is an audio file stored in a local storage system, wherein the recorded content is an audio file added by the user according to his or her favorite, and may be a song, a crosstalk, a recording, or the like.
需要说明的是,当开启本地音频文件时,不会将当前展示的广告内容删除,只是替换了广告的声音,相当于背景音乐的替换,广告还是会继续展示。It should be noted that when the local audio file is opened, the currently displayed advertisement content will not be deleted, but the sound of the advertisement is replaced, which is equivalent to the replacement of the background music, and the advertisement will continue to be displayed.
103、将播放器调为静音模式。103. Turn the player to silent mode.
在开启本地音频文件的同时,会将播放器的声音模式调为静音模式,这也是将本地音频文件中的声音替换广告声音的必要的步骤。否则就会影响本地音频文件的欣赏效果。其中,播放器是用于播放视频流的播放器。该播放器可以是网络电视、手机、或电脑上安装的各种视频播放软件等。When the local audio file is turned on, the player's sound mode is muted, which is also a necessary step to replace the sound in the local audio file with the commercial sound. Otherwise it will affect the appreciation of the local audio file. Among them, the player is a player for playing a video stream. The player can be a network television, a mobile phone, or various video playing software installed on a computer.
本发明实施例提供的视频处理的方法,能够首先判断当前播放的视频流中是否存在广告内容,若存在广告内容,则在播放该广告内容时,开启本地的音频文件,并且将播放视频流的播放器调为静音模式。与现有技术相比,本发明实施例能够在播放广告时,将本地的音频文件开启,并且不影响广告内容的正常显示,实现将广告声音替换为本地音频声音的效果。该方法使用户在观看广告时可以欣赏自己喜欢的音频文件中的内容。因此提高了用户在观看视频时的灵活度。The method for video processing provided by the embodiment of the present invention can first determine whether there is advertisement content in the currently played video stream, and if there is advertisement content, when the advertisement content is played, the local audio file is turned on, and the video stream is played. The player is muted to silent mode. Compared with the prior art, the embodiment of the present invention can turn on a local audio file when playing an advertisement, and does not affect the normal display of the advertisement content, and realizes the effect of replacing the advertisement sound with the local audio sound. This method allows users to enjoy the content of their favorite audio files while watching an ad. This increases the flexibility of the user when watching the video.
进一步的,作为对图1所示实施例的细化及扩展,本发明还提供了另一实施例。如图2所示,该实施例中视频处理的方法包括:Further, as a refinement and expansion of the embodiment shown in FIG. 1, the present invention also provides another embodiment. As shown in FIG. 2, the method for video processing in this embodiment includes:
201、监测更新后的播放列表文件中的编码间断标识。201. Monitor the coded discontinuity identifier in the updated playlist file.
本实施例中,播放列表文件是M3U8文件,它是视频流播放时的一个索引文件。M3U8文件中记录有将要播放的视频文件的索引。通常M3U8文件中只允许存放三个视频文件的索引,当播完当前的三个视频文件后,重新加载新的视频文件的索引,直到所有的视频文件被播放完毕为止。由上述可以看出,M3U8文件中的内容是动态变化的,因此需要监测更新后的M3U8文件。需要说明的是其中的视频文件是指TS分片文件。 In this embodiment, the playlist file is an M3U8 file, which is an index file when the video stream is played. The index of the video file to be played is recorded in the M3U8 file. Usually, only the index of three video files is allowed in the M3U8 file. After the current three video files are broadcast, the index of the new video file is reloaded until all the video files are played. As can be seen from the above, the content in the M3U8 file is dynamically changed, so it is necessary to monitor the updated M3U8 file. It should be noted that the video file therein refers to the TS fragment file.
当M3U8文件中出现编码间断标识EXT-X-DISCONTINUITY时,表示EXT-X-DISCONTINUITY前后的内容是不连续的,因此可以猜想EXT-X-DISCONTINUITY之后是添加了别的内容,有可能是广告内容,所以要监测EXT-X-DISCONTINUITY标识。When the coded discontinuity identifier EXT-X-DISCONTINUITY appears in the M3U8 file, the content before and after the EXT-X-DISCONTINUITY is discontinuous, so it can be guessed that EXT-X-DISCONTINUITY is followed by other content, possibly advertising content. , so monitor the EXT-X-DISCONTINUITY logo.
202、提取编码间断标识之后对应的第一个视频文件中的关键帧。202. Extract a key frame in the first video file corresponding to the coded discontinuous identifier.
监测到EXT-X-DISCONTINUITY标识之后,为了验证在EXT-X-DISCONTINUITY之后是否添加了广告内容,就需要将EXT-X-DISCONTINUITY之后的对应的第一个视频文件即TS文件找出来,然后提取TS文件中的关键帧,以便于其与广告数据库中的帧数据作比对。After monitoring the EXT-X-DISCONTINUITY flag, in order to verify whether the advertisement content is added after EXT-X-DISCONTINUITY, it is necessary to find out the corresponding first video file after the EXT-X-DISCONTINUITY, that is, the TS file, and then extract Keyframes in the TS file to facilitate comparison with frame data in the ad database.
需要说明的是,为了提高后面进行比对的准确性,需要将提取出的关键帧进行过滤。具体的过滤原则是:去除所有关键帧中的无信息帧。其中无信息帧指无图像内容的帧,比如黑帧等。It should be noted that in order to improve the accuracy of the subsequent comparison, it is necessary to filter the extracted key frames. The specific filtering principle is to remove the non-information frames in all key frames. The no-information frame refers to a frame without image content, such as a black frame.
203、通过比对关键帧与广告数据库中的帧数据来判断视频流中是否存在广告内容。203. Determine whether the advertisement content exists in the video stream by comparing the frame data in the key frame and the advertisement database.
将由步骤202得到的关键帧与广告数据库中的帧数据进行比对,比对的方式有两种,如下:The key frames obtained in step 202 are compared with the frame data in the advertisement database, and there are two ways to compare, as follows:
方式一method one
将关键帧与广告数据库中的关键帧,进行一帧一帧的比对,首先设置一个预设的比对次数,该预设比对次数是经过多次重复实验得到的经验值。假设预设比对次数为N,N为大于0正整数。The key frame is compared with the key frame in the advertisement database by one frame and one frame. First, a preset comparison number is set, and the preset comparison number is an experience value obtained after repeated experiments. Assume that the preset number of comparisons is N, and N is a positive integer greater than 0.
若在比对N次的范围内,存在连续M个关键帧与广告数据库中的帧数据内容对应相同,则确定视频流中存在广告内容,即EXT-X-DISCONTINUITY之后第一个TS文件中是广告内容。其中M为小于预设比对次数的正整数,实际应用中N与M具体的比例关系,可以根据实际的需求确定。If there are consecutive M key frames corresponding to the frame data content in the advertisement database within the range of N times of comparison, it is determined that the advertisement content exists in the video stream, that is, the first TS file after EXT-X-DISCONTINUITY is Advertising content. Where M is a positive integer less than the preset number of comparisons, and the specific proportional relationship between N and M in actual application can be determined according to actual needs.
若比对N次后,依然没有出现连续M个关键帧与帧数据内容对应相同,则确定视频流中不存在广告内容,即EXT-X-DISCONTINUITY之后第一个TS文件中是非广告内容。If the continuous M key frames are not the same as the frame data content after the N times, it is determined that there is no advertisement content in the video stream, that is, the first TS file after the EXT-X-DISCONTINUITY is the non-advertising content.
方式二Way two
提前设计一个用于帧比对的模型,然后将关键帧与帧数据输入到比对 模型中进行比对,得到相似度,相似度用于描述关键帧与帧数据的相似程度,然后根据相似度判断视频流中是否存在广告内容,具体在判断时会设置一个相似度判断的标准,当达到标准的相似度结果,才可以判定提取的关键帧与广告数据库中的帧数据是对应相同的,才可以进一步确定视频流中存在广告内容,即EXT-X-DISCONTINUITY之后的第一个TS文件中是广告内容。Design a model for frame alignment in advance, then input keyframe and frame data into the comparison In the model, the similarity is obtained. The similarity is used to describe the similarity between the key frame and the frame data. Then, according to the similarity, it is judged whether there is advertising content in the video stream. Specifically, a similarity judgment standard is set in the judgment. When the standard similarity result is reached, it can be determined that the extracted key frame is the same as the frame data in the advertisement database, and then it can be further determined that the advertisement content exists in the video stream, that is, the first TS after the EXT-X-DISCONTINUITY The file is the content of the advertisement.
该比对模型的设计原理与方式一的原理基本是一致的,不同的是比对模型中会根据输入的关键帧与帧数据来动态的获取一个N,该N的值可以是通过人工智能算法进行学习训练得到的。The design principle of the comparison model is basically the same as the principle of the first method. The difference is that the comparison model dynamically acquires an N according to the input key frame and frame data, and the value of the N can be an artificial intelligence algorithm. Learned by training.
204、若存在广告内容,在播放编码间断标识之后的第一个视频文件时,开启本地音频文件。204. If the advertisement content exists, the local audio file is turned on when the first video file after the coded discontinuous identifier is played.
对于由步骤203结果中确定EXT-X-DISCONTINUITY之后的第一个TS文件中是广告内容的情况,执行在播放EXT-X-DISCONTINUITY之后的第一个TS文件时,开启本地音频文件。For the case where it is determined in step 203 that the first TS file after EXT-X-DISCONTINUITY is the advertisement content, the local audio file is opened when the first TS file after EXT-X-DISCONTINUITY is played.
具体的开启本地音频文件的实现方式与图1步骤102的实现方式相同,此处不再赘述。The implementation manner of the local audio file is the same as that of the step 102 in FIG. 1 , and details are not described herein again.
205、向播放器发送静音请求,以使得播放器调为静音模式。205. Send a mute request to the player to adjust the player to the silent mode.
在开启本地音频文件的同时,为了保证本地音频文件的播放效果,需要向播放视频流的播放器发送一个静音的请求,当该播放器收到静音的请求后会主动地将声音模式调为静音模式。While the local audio file is being turned on, in order to ensure the playback effect of the local audio file, a silent request is sent to the player playing the video stream, and when the player receives the request for mute, the sound mode is actively muted. mode.
206、当监测到播放列表文件中的下一个编码间断标识后,关闭本地音频文件。206. When the next coded discontinuity identifier in the playlist file is detected, the local audio file is closed.
开启本地音频文件后,还需要继续监测M3U8文件中的下一个EXT-X-DISCONTINUITY,当监测到下一个EXT-X-DISCONTINUITY后,表示该广告已经播放结束,这时需要恢复播放器的声音,使用户继续观看视频流中的非广告内容。After opening the local audio file, you need to continue to monitor the next EXT-X-DISCONTINUITY in the M3U8 file. When the next EXT-X-DISCONTINUITY is detected, it indicates that the advertisement has finished playing. In this case, you need to restore the player's sound. Let users continue to watch non-advertising content in the video stream.
进一步的,作为对图2所示实施例的细化及扩展,本发明还提供了另一实施例。如图3所示,该实施例中视频处理的方法包括:Further, as a refinement and expansion of the embodiment shown in FIG. 2, the present invention also provides another embodiment. As shown in FIG. 3, the method for video processing in this embodiment includes:
301、监测更新后的播放列表文件中的编码间断标识。301. Monitor the coded discontinuity identifier in the updated playlist file.
本步骤的实现方式与图2步骤201的实现方式相同,此处不再赘述。 The implementation of this step is the same as the implementation of step 201 in FIG. 2, and details are not described herein again.
302、获取编码间断标识之后的视频文件对应的身份标识(Identity,简称ID)。302. Obtain an identity (ID) corresponding to the video file after the coded discontinuity identifier.
本实施例中的编码间断标识为EXT-X-DISCONTINUITY,获取EXT-X-DISCONTINUITY之后的视频文件对应的ID。该ID是区别不同的视频内容的唯一标识。The coded discontinuity identifier in this embodiment is EXT-X-DISCONTINUITY, and the ID corresponding to the video file after EXT-X-DISCONTINUITY is acquired. This ID is a unique identifier that distinguishes different video content.
303、将身份标识ID与预设标识列表进行比对。303. Compare the identity ID with the preset identifier list.
将由步骤302获取到的ID与预设标识列表进行比对。其中预设ID中记录有所有广告的ID。The ID obtained by step 302 is compared with a preset identification list. The ID of all advertisements is recorded in the preset ID.
在进行比对时,若能在预设标识列表中找到ID,则确定视频流中存在广告内容,即EXT-X-DISCONTINUITY之后的视频文件中是广告内容;In the comparison, if the ID can be found in the preset identifier list, it is determined that the advertisement content exists in the video stream, that is, the video file after the EXT-X-DISCONTINUITY is the advertisement content;
在进行比对时,若不能在预设标识列表中找到ID,则确定视频流中不存在广告内容,即EXT-X-DISCONTINUITY之后的视频文件中是非广告内容。In the comparison, if the ID cannot be found in the preset identifier list, it is determined that there is no advertisement content in the video stream, that is, the video file after EXT-X-DISCONTINUITY is non-advertising content.
另外标识列表中还记录有广告的标题,其中标题与ID是一一对应的关系,标题的记录是为了是用户对标识列表进行修改时的可读性。In addition, the title of the advertisement is also recorded in the identification list, wherein the title and the ID are in a one-to-one correspondence, and the record of the title is for readability when the user modifies the identification list.
304、若存在广告内容,在播放编码间断标识之后的第一个视频文件时,开启本地音频文件。304. If the advertisement content exists, the local audio file is turned on when the first video file after the coded discontinuous identifier is played.
本步骤的实现方式与图2步骤204的实现方式相同,此处不再赘述。The implementation of this step is the same as the implementation of step 204 in FIG. 2, and details are not described herein again.
305、向播放器发送静音请求,以使得播放器调为静音模式。305. Send a mute request to the player to adjust the player to the silent mode.
本步骤的实现方式与图2步骤205的实现方式相同,此处不再赘述。The implementation of this step is the same as the implementation of step 205 in FIG. 2, and details are not described herein again.
306、当监测到播放列表文件中的下一个编码间断标识后,关闭本地音频文件。306. After monitoring the next coded discontinuity identifier in the playlist file, close the local audio file.
本步骤的实现方式与图2步骤206的实现方式相同,此处不再赘述。The implementation of this step is the same as the implementation of step 206 in FIG. 2, and details are not described herein again.
进一步的,对于图3的方法,不仅可以将视频流中的广告声音替换为本地音频文件中的声音,也可以将用户不喜欢的非广告视频进行声音的替换,具体的替换方法是:将预设标识列表进行更新,即在预设标识列表中添加自己不喜欢的视频。具体的添加方式为:当用户在观看视频时,若遇到自己不喜欢的视频可以通过选择菜单中的添加至标识列表的选项就可以实现将其对应的标题和ID添加到预设标识列表中,其中添加至列表的选项是由系统提供的。除了预设标识列表的不同外,其他的实现方式与图3的 实现方式是相同的,此处不再赘述。Further, for the method of FIG. 3, not only the advertisement sound in the video stream may be replaced with the sound in the local audio file, but also the non-advertising video that the user does not like may be replaced by a sound. The specific replacement method is: Set the list of identifiers to update, that is, add videos that you don't like in the list of preset identifiers. The specific way of adding is: when the user is watching the video, if he encounters the video that he does not like, he can add the corresponding title and ID to the preset identification list by selecting the option added to the identification list in the menu. The options added to the list are provided by the system. In addition to the difference in the preset identification list, other implementations are the same as those of FIG. The implementation is the same and will not be described here.
进一步的,对于图3中的预设标识列表是用户可以自己设定的,所以用户即可以将不喜欢的非广告视频或者广告视频添加到预设标识列表中,又可以将喜欢的非广告视频或者广告视频从预设标识列表中删除,具体的删除方法是在标识列表中找到喜欢的视频的标题,然后将其删除,删除后的视频在下次播放时就不会被本地音频文件替换背景声音。Further, the preset identifier list in FIG. 3 is set by the user, so the user can add the non-advertising video or advertisement video that is not like to the preset identifier list, and can also use the favorite non-advertising video. Or the advertisement video is deleted from the preset identification list. The specific deletion method is to find the title of the favorite video in the identification list, and then delete it, and the deleted video will not be replaced by the local audio file when the next time the video is played. .
进一步的,作为对上述图1、图2以及图3所示方法的实现,本发明实施例的另一个实施例还提供了一种视频处理的装置,如图4所示,该装置包括:判断单元41、开启单元42、调节单元43。Further, as an implementation of the method shown in FIG. 1, FIG. 2 and FIG. 3, another embodiment of the present invention further provides a video processing apparatus. As shown in FIG. 4, the apparatus includes: determining The unit 41, the opening unit 42, and the adjusting unit 43.
判断单元41,用于判断视频流中是否存在广告内容。The determining unit 41 is configured to determine whether the advertisement content exists in the video stream.
本实施例中的视频流是指遵循HLS协议的直播视频流。HLS是动态码率自适应技术,主要用于电脑PC和苹果Apple终端的音视频服务。HLS主要包括一个M3U8播放列表文件,M3U8文件是索引文件;以及TS媒体分片文件等。TS也是一种视频的封装的形式,它适用于直播。The video stream in this embodiment refers to a live video stream that conforms to the HLS protocol. HLS is a dynamic rate adaptive technology, mainly used for audio and video services of computer PCs and Apple Apple terminals. The HLS mainly includes an M3U8 playlist file, the M3U8 file is an index file, and a TS media slice file. TS is also a form of video encapsulation that is suitable for live broadcast.
本实施例是对直播视频流中的广告进行处理,因此首先要判断视频流中是否存在广告,为后续的处理广告的步骤做准备工作。In this embodiment, the advertisement in the live video stream is processed. Therefore, it is first determined whether there is an advertisement in the video stream, and preparation for the subsequent steps of processing the advertisement.
开启单元42,用于若存在广告内容,则在播放广告内容时开启本地音频文件,本地音频文件为存储在本地存储系统中的音频文件。The opening unit 42 is configured to open a local audio file when the advertisement content is played if the advertisement content exists, and the local audio file is an audio file stored in the local storage system.
若确定视频流中存在广告内容,则在播放广告内容时开启本地音频文件,开启本地音频文件是为了替换视频流中广告内容的声音,使用户可以欣赏自己喜欢的音频文件。其中,本地音频文件是存储在本地存储系统中的音频文件,其中记录的内容是用户根据自己的喜欢添加的音频文件,可以是歌曲、相声、录音等。If it is determined that the advertisement content exists in the video stream, the local audio file is opened when the advertisement content is played, and the local audio file is opened to replace the sound of the advertisement content in the video stream, so that the user can enjoy the audio file that he likes. The local audio file is an audio file stored in a local storage system, wherein the recorded content is an audio file added by the user according to his or her favorite, and may be a song, a crosstalk, a recording, or the like.
调节单元43,用于将播放器调为静音模式,播放器用于播放视频流。The adjusting unit 43 is configured to adjust the player to the silent mode, and the player is used to play the video stream.
在开启本地音频文件的同时,会将播放器的声音模式调为静音模式,这也是将本地音频文件中的声音替换广告声音的必要的步骤。否则就会影响本地音频文件的欣赏效果。其中,播放器是用于播放视频流的播放器。该播放器可以是网络电视、手机、或电脑上安装的各种视频播放软件等。When the local audio file is turned on, the player's sound mode is muted, which is also a necessary step to replace the sound in the local audio file with the commercial sound. Otherwise it will affect the appreciation of the local audio file. Among them, the player is a player for playing a video stream. The player can be a network television, a mobile phone, or various video playing software installed on a computer.
进一步的,如图5所示,判断单元41,包括:Further, as shown in FIG. 5, the determining unit 41 includes:
监测模块411,用于监测更新后的播放列表文件中的编码间断标识, 音频文件列表为视频流对应的播放列表文件。The monitoring module 411 is configured to monitor the coded discontinuity identifier in the updated playlist file. The audio file list is a playlist file corresponding to the video stream.
M3U8文件中的内容是动态变化的,因此需要监测更新后的M3U8文件。需要说明的是其中的视频文件是指TS分片文件。The content in the M3U8 file is dynamically changed, so the updated M3U8 file needs to be monitored. It should be noted that the video file therein refers to the TS fragment file.
当M3U8文件中出现编码间断标识EXT-X-DISCONTINUITY时,表示EXT-X-DISCONTINUITY前后的内容是不连续的,因此可以猜想EXT-X-DISCONTINUITY之后是添加了别的内容,有可能是广告内容,所以要监测EXT-X-DISCONTINUITY标识。When the coded discontinuity identifier EXT-X-DISCONTINUITY appears in the M3U8 file, the content before and after the EXT-X-DISCONTINUITY is discontinuous, so it can be guessed that EXT-X-DISCONTINUITY is followed by other content, possibly advertising content. , so monitor the EXT-X-DISCONTINUITY logo.
提取模块412,用于提取编码间断标识之后对应的第一个视频文件中的关键帧。The extracting module 412 is configured to extract key frames in the first video file corresponding to the encoded discontinuous identifier.
监测到EXT-X-DISCONTINUITY标识之后,为了验证在EXT-X-DISCONTINUITY之后是否添加了广告内容,就需要将EXT-X-DISCONTINUITY之后的对应的第一个视频文件即TS文件找出来,然后提取TS文件中的关键帧,以便于其与广告数据库中的帧数据作比对。After monitoring the EXT-X-DISCONTINUITY flag, in order to verify whether the advertisement content is added after EXT-X-DISCONTINUITY, it is necessary to find out the corresponding first video file after the EXT-X-DISCONTINUITY, that is, the TS file, and then extract Keyframes in the TS file to facilitate comparison with frame data in the ad database.
判断模块413,用于通过比对关键帧与广告数据库中的帧数据来判断视频流中是否存在广告内容。The determining module 413 is configured to determine whether the advertisement content exists in the video stream by comparing the frame data in the key frame and the advertisement database.
进一步的,判断模块413,用于:Further, the determining module 413 is configured to:
在预设比对次数内,若存在连续M个关键帧与帧数据内容对应相同,则确定视频流中存在广告内容,其中M为小于预设比对次数的正整数;Within the preset number of comparisons, if there are consecutive M key frames corresponding to the frame data content, it is determined that there is advertising content in the video stream, where M is a positive integer smaller than the preset comparison number;
在预设比对次数内,若不存在连续M个关键帧与帧数据内容对应相同,则确定视频流中不存在广告内容。Within the preset number of comparisons, if there are no consecutive M key frames corresponding to the frame data content, it is determined that there is no advertisement content in the video stream.
进一步的,判断模块413,用于:Further, the determining module 413 is configured to:
将关键帧与帧数据输入到比对模型中进行比对,得到相似度,相似度用于描述关键帧与帧数据的相似程度;The key frame and the frame data are input into the comparison model for comparison, and the similarity is obtained, and the similarity is used to describe the degree of similarity between the key frame and the frame data;
根据相似度判断视频流中是否存在广告内容。Whether the advertisement content exists in the video stream is judged according to the similarity.
进一步的,如图5所示,装置进一步包括:Further, as shown in FIG. 5, the apparatus further includes:
去除单元44,用于在通过比对关键帧与广告数据库中的帧数据来判断视频流中是否存在广告内容之前,将关键帧中的无信息帧去除,无信息帧为无图像内容的帧。The removing unit 44 is configured to remove the non-information frame in the key frame before determining whether the advertisement content exists in the video stream by comparing the frame data in the key frame and the advertisement database, and the no information frame is a frame without image content.
进一步的,如图5所示,判断单元41,包括:Further, as shown in FIG. 5, the determining unit 41 includes:
监测模块411,用于监测更新后的播放列表文件中的编码间断标识, 音频文件列表为视频流对应的播放列表文件。The monitoring module 411 is configured to monitor the coded discontinuity identifier in the updated playlist file. The audio file list is a playlist file corresponding to the video stream.
M3U8文件中的内容是动态变化的,因此需要监测更新后的M3U8文件。需要说明的是其中的视频文件是指TS分片文件。The content in the M3U8 file is dynamically changed, so the updated M3U8 file needs to be monitored. It should be noted that the video file therein refers to the TS fragment file.
当M3U8文件中出现编码间断标识EXT-X-DISCONTINUITY时,表示EXT-X-DISCONTINUITY前后的内容是不连续的,因此可以猜想EXT-X-DISCONTINUITY之后是添加了别的内容,有可能是广告内容,所以要监测EXT-X-DISCONTINUITY标识。When the coded discontinuity identifier EXT-X-DISCONTINUITY appears in the M3U8 file, the content before and after the EXT-X-DISCONTINUITY is discontinuous, so it can be guessed that EXT-X-DISCONTINUITY is followed by other content, possibly advertising content. , so monitor the EXT-X-DISCONTINUITY logo.
获取模块414,用于获取编码间断标识之后的视频文件对应的ID。The obtaining module 414 is configured to obtain an ID corresponding to the video file after the coded discontinuity identifier.
本实施例中的编码间断标识为EXT-X-DISCONTINUITY,获取EXT-X-DISCONTINUITY之后的视频文件对应的ID。该ID是区别不同的视频内容的唯一标识。The coded discontinuity identifier in this embodiment is EXT-X-DISCONTINUITY, and the ID corresponding to the video file after EXT-X-DISCONTINUITY is acquired. This ID is a unique identifier that distinguishes different video content.
比对模块415,用于将ID与预设标识列表进行比对,预设标识列表用于记录所有广告的ID。The comparison module 415 is configured to compare the ID with a preset identifier list, and the preset identifier list is used to record IDs of all advertisements.
确定模块416,用于若能在预设标识列表中找到ID,则确定视频流中存在广告内容;若不能在预设标识列表中找到ID,则确定视频流中不存在广告内容。The determining module 416 is configured to determine that the advertisement content exists in the video stream if the ID is found in the preset identifier list; if the ID cannot be found in the preset identifier list, it is determined that the advertisement content does not exist in the video stream.
在进行比对时,若能在预设标识列表中找到ID,则确定视频流中存在广告内容,即EXT-X-DISCONTINUITY之后的视频文件中是广告内容;In the comparison, if the ID can be found in the preset identifier list, it is determined that the advertisement content exists in the video stream, that is, the video file after the EXT-X-DISCONTINUITY is the advertisement content;
在进行比对时,若不能在预设标识列表中找到ID,则确定视频流中不存在广告内容,即EXT-X-DISCONTINUITY之后的视频文件中是非广告内容。In the comparison, if the ID cannot be found in the preset identifier list, it is determined that there is no advertisement content in the video stream, that is, the video file after EXT-X-DISCONTINUITY is non-advertising content.
进一步的,开启单元42,用于:在播放编码间断标识之后的第一个视频文件时,开启本地音频文件。Further, the opening unit 42 is configured to: when playing the first video file after the coded discontinuous identifier, turn on the local audio file.
在播放EXT-X-DISCONTINUITY之后的第一个TS文件时,开启本地音频文件。The local audio file is opened when the first TS file after EXT-X-DISCONTINUITY is played.
进一步的,调节单元43,用于:向播放器发送静音请求,以使得播放器调为静音模式。Further, the adjusting unit 43 is configured to: send a silence request to the player to adjust the player to the silent mode.
在开启本地音频文件的同时,为了保证本地音频文件的播放效果,需要向播放视频流的播放器发送一个静音的请求,当该播放器收到静音的请求后会主动地将声音模式调为静音模式。 While the local audio file is being turned on, in order to ensure the playback effect of the local audio file, a silent request is sent to the player playing the video stream, and when the player receives the request for mute, the sound mode is actively muted. mode.
进一步的,如图5所示,装置进一步包括:Further, as shown in FIG. 5, the apparatus further includes:
关闭单元45,用于在将播放器调为静音模式之后,当监测到播放列表文件中的下一个编码间断标识后,关闭本地音频文件;The closing unit 45 is configured to: after adjusting the player to the silent mode, after detecting the next coded discontinuity identifier in the playlist file, closing the local audio file;
打开单元46,用于打开播放器的声音。The unit 46 is turned on for turning on the sound of the player.
开启本地音频文件后,还需要继续监测M3U8文件中的下一个EXT-X-DISCONTINUITY,当监测到后,表示该段广告已经播放结束,这时需要恢复播放器的声音,使用户继续观看视频流中的非广告内容。After the local audio file is enabled, you need to continue to monitor the next EXT-X-DISCONTINUITY in the M3U8 file. When it is detected, it indicates that the advertisement has finished playing. At this time, you need to restore the player's sound, so that the user can continue watching the video stream. Non-advertising content in .
本发明实施例提供的视频处理的装置,能够首先判断当前播放的视频流中是否存在广告内容,若存在广告内容,则在播放该广告内容时,开启本地的音频文件,并且将播放视频流的播放器调为静音模式。与现有技术相比,本发明实施例能够在播放广告时,将本地的音频文件开启,并且不影响广告内容的正常显示,实现将广告声音替换为本地音频声音的效果。该方法使用户在观看广告时可以欣赏自己喜欢的音频文件中的内容。因此提高了用户在观看视频时的灵活度。The device for video processing according to the embodiment of the present invention can first determine whether there is advertisement content in the currently played video stream, and if there is advertisement content, when the advertisement content is played, the local audio file is turned on, and the video stream is played. The player is muted to silent mode. Compared with the prior art, the embodiment of the present invention can turn on a local audio file when playing an advertisement, and does not affect the normal display of the advertisement content, and realizes the effect of replacing the advertisement sound with the local audio sound. This method allows users to enjoy the content of their favorite audio files while watching an ad. This increases the flexibility of the user when watching the video.
需要说明的是,针对上述视频处理的装置,凡是本发明实施例中使用到的各个单元模块的功能都可以通过硬件处理器(hardware processor)来实现。It should be noted that, for the above-mentioned video processing device, the functions of the respective unit modules used in the embodiments of the present invention can be implemented by a hardware processor.
示例性的,如图6所示,图6示出了本发明实施例提供的一种视频处理装置的实体结构示意图,该实体结构可以包括:处理器(processor)61、通信接口(Communications Interface)62、存储器(memory)63和总线64,其中,处理器61、通信接口62、存储器63通过总线64完成相互间的通信。通信接口62可以用于服务器与客户端之间的信息传输。处理器61可以调用存储器63中的逻辑指令,以执行如下方法:判断视频流中是否存在广告内容;若存在所述广告内容,则在播放所述广告内容时开启本地音频文件,所述本地音频文件为存储在本地存储系统中的音频文件;并且,将播放器调为静音模式,所述播放器用于播放所述视频流。Illustratively, as shown in FIG. 6, FIG. 6 is a schematic diagram showing the physical structure of a video processing apparatus according to an embodiment of the present invention. The physical structure may include a processor 61 and a communications interface. 62. A memory 63 and a bus 64, wherein the processor 61, the communication interface 62, and the memory 63 complete communication with each other via the bus 64. Communication interface 62 can be used for information transfer between the server and the client. The processor 61 may call the logic instructions in the memory 63 to perform a method of determining whether there is advertising content in the video stream; if the advertising content is present, turning on the local audio file when the advertising content is played, the local audio The file is an audio file stored in a local storage system; and the player is tuned to a silent mode, the player is for playing the video stream.
此外,上述的存储器63中的逻辑指令可以通过软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计 算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。Furthermore, the logic instructions in the memory 63 described above may be implemented in the form of a software functional unit and sold or used as a stand-alone product, and may be stored in a computer readable storage medium. Based on such understanding, the part of the technical solution of the present invention that contributes in essence or to the prior art or the part of the technical solution can be embodied in the form of a software product. The computer software product is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present invention. The foregoing storage medium includes: a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like. .
图7是本申请实施例提供的执行视频处理的方法的电子设备的硬件结构示意图,如图7所示,该设备包括:FIG. 7 is a schematic structural diagram of hardware of an electronic device for performing a video processing method according to an embodiment of the present disclosure. As shown in FIG. 7, the device includes:
一个或多个处理器71以及存储器72,图7中以一个处理器71为例。One or more processors 71 and a memory 72 are exemplified by a processor 71 in FIG.
执行视频处理的方法的设备还可以包括:输入装置73和输出装置74。The apparatus that performs the method of video processing may further include: an input device 73 and an output device 74.
处理器71、存储器72、输入装置73和输出装置74可以通过总线或者其他方式连接,图7中以通过总线连接为例。The processor 71, the memory 72, the input device 73, and the output device 74 may be connected by a bus or other means, as exemplified by a bus connection in FIG.
存储器72作为一种非易失性计算机可读存储介质,可用于存储非易失性软件程序、非易失性计算机可执行程序以及模块,如本申请实施例中的视频处理的方法对应的程序指令/模块(例如,附图4所示的判断单元41、开启单元42和调节单元43)。处理器71通过运行存储在存储器72中的非易失性软件程序、指令以及模块,从而执行服务器的各种功能应用以及数据处理,即实现上述方法实施例视频处理的方法。The memory 72 is a non-volatile computer readable storage medium, and can be used for storing a non-volatile software program, a non-volatile computer executable program, and a module, such as a program corresponding to the video processing method in the embodiment of the present application. An instruction/module (for example, the determination unit 41, the opening unit 42, and the adjustment unit 43 shown in FIG. 4). The processor 71 executes various functional applications of the server and data processing by executing non-volatile software programs, instructions, and modules stored in the memory 72, that is, a method of implementing video processing of the above-described method embodiments.
存储器72可以包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需要的应用程序;存储数据区可存储根据视频处理的装置的使用所创建的数据等。此外,存储器72可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他非易失性固态存储器件。在一些实施例中,存储器72可选包括相对于处理器71远程设置的存储器,这些远程存储器可以通过网络连接至视频处理的装置。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 72 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function; the storage data area may store data created according to use of the video processing device, and the like. Moreover, memory 72 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device. In some embodiments, memory 72 can optionally include memory remotely located relative to processor 71, which can be connected to the video processing device over a network. Examples of such networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
输入装置73可接收输入的数字或字符信息,以及产生与视频处理的装置的用户设置以及功能控制有关的键信号输入。输出装置74可包括显示屏等显示设备。 Input device 73 can receive the input digital or character information and generate key signal inputs related to user settings and function control of the video processing device. Output device 74 can include a display device such as a display screen.
所述一个或者多个模块存储在所述存储器72中,当被所述一个或者多个处理器71执行时,执行上述任意方法实施例中的视频处理的方法。 The one or more modules are stored in the memory 72, and when executed by the one or more processors 71, perform the method of video processing in any of the above method embodiments.
上述产品可执行本申请实施例所提供的方法,具备执行方法相应的功能模块和有益效果。未在本实施例中详尽描述的技术细节,可参见本申请实施例所提供的方法。The above products can perform the methods provided by the embodiments of the present application, and have the corresponding functional modules and beneficial effects of the execution method. For technical details that are not described in detail in this embodiment, reference may be made to the method provided by the embodiments of the present application.
本申请实施例的电子设备以多种形式存在,包括但不限于:The electronic device of the embodiment of the present application exists in various forms, including but not limited to:
(1)移动通信设备:这类设备的特点是具备移动通信功能,并且以提供话音、数据通信为主要目标。这类终端包括:智能手机(例如iPhone)、多媒体手机、功能性手机,以及低端手机等。(1) Mobile communication devices: These devices are characterized by mobile communication functions and are mainly aimed at providing voice and data communication. Such terminals include: smart phones (such as iPhone), multimedia phones, functional phones, and low-end phones.
(2)超移动个人计算机设备:这类设备属于个人计算机的范畴,有计算和处理功能,一般也具备移动上网特性。这类终端包括:PDA、MID和UMPC设备等,例如iPad。(2) Ultra-mobile personal computer equipment: This type of equipment belongs to the category of personal computers, has computing and processing functions, and generally has mobile Internet access. Such terminals include: PDAs, MIDs, and UMPC devices, such as the iPad.
(3)便携式娱乐设备:这类设备可以显示和播放多媒体内容。该类设备包括:音频、视频处理的器(例如iPod),掌上游戏机,电子书,以及智能玩具和便携式车载导航设备。(3) Portable entertainment devices: These devices can display and play multimedia content. Such devices include: audio, video processing devices (such as iPod), handheld game consoles, e-books, and smart toys and portable car navigation devices.
(4)服务器:提供计算服务的设备,服务器的构成包括处理器、硬盘、内存、系统总线等,服务器和通用的计算机架构类似,但是由于需要提供高可靠的服务,因此在处理能力、稳定性、可靠性、安全性、可扩展性、可管理性等方面要求较高。(4) Server: A device that provides computing services. The server consists of a processor, a hard disk, a memory, a system bus, etc. The server is similar to a general-purpose computer architecture, but because of the need to provide highly reliable services, processing power and stability High reliability in terms of reliability, security, scalability, and manageability.
(5)其他具有数据交互功能的电子装置。(5) Other electronic devices with data interaction functions.
本申请实施例还提供了一种非暂态计算机存储介质,所述计算机存储介质存储有计算机可执行指令,该计算机可执行指令可执行上述任意方法实施例中的视频处理的方法。The embodiment of the present application further provides a non-transitory computer storage medium, where the computer storage medium stores computer executable instructions, which can perform the video processing method in any of the foregoing method embodiments.
最后需要说明的是,本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成,所述的程序可存储于一计算机可读取存储介质中,该程序在执行时,可包括如上述各方法的实施例的流程。其中,所述的存储介质可为磁碟、光盘、只读存储记忆体(ROM)或随机存储记忆体(RAM)等。Finally, it should be understood that those skilled in the art can understand that all or part of the process of implementing the above embodiments can be completed by a computer program to instruct related hardware, and the program can be stored in a computer readable. In the storage medium, the program, when executed, may include the flow of an embodiment of the methods as described above. The storage medium may be a magnetic disk, an optical disk, a read only memory (ROM), or a random access memory (RAM).
以上所描述的装置实施例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现 本实施例方案的目的。本领域普通技术人员在不付出创造性的劳动的情况下,即可以理解并实施。The device embodiments described above are merely illustrative, wherein the units described as separate components may or may not be physically separate, and the components displayed as units may or may not be physical units, ie may be located A place, or it can be distributed to multiple network units. Some or all of the modules can be selected according to actual needs. The purpose of the solution of this embodiment. Those of ordinary skill in the art can understand and implement without deliberate labor.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到各实施方式可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件。基于这样的理解,上述技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品可以存储在计算机可读存储介质中,如ROM/RAM、磁碟、光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行各个实施例或者实施例的某些部分所述的方法。Through the description of the above embodiments, those skilled in the art can clearly understand that the various embodiments can be implemented by means of software plus a necessary general hardware platform, and of course, by hardware. Based on such understanding, the above-described technical solutions may be embodied in the form of software products in essence or in the form of software products, which may be stored in a computer readable storage medium such as ROM/RAM, magnetic Discs, optical discs, etc., include instructions for causing a computer device (which may be a personal computer, server, or network device, etc.) to perform the methods described in various embodiments or portions of the embodiments.
最后应说明的是:以上实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围。 It should be noted that the above embodiments are only used to illustrate the technical solutions of the present invention, and are not limited thereto; although the present invention has been described in detail with reference to the foregoing embodiments, those skilled in the art should understand that The technical solutions described in the foregoing embodiments are modified, or the equivalents of the technical features are replaced. The modifications and substitutions do not depart from the spirit and scope of the technical solutions of the embodiments of the present invention.

Claims (21)

  1. 一种视频处理的方法,其特征在于,应用于客户端,所述方法包括:A method for video processing, characterized in that it is applied to a client, and the method includes:
    判断视频流中是否存在广告内容;Determining whether there is advertising content in the video stream;
    若存在所述广告内容,则在播放所述广告内容时开启本地音频文件,所述本地音频文件为存储在本地存储系统中的音频文件;并且,If the advertisement content exists, opening a local audio file when the advertisement content is played, the local audio file being an audio file stored in a local storage system;
    将播放器调为静音模式,所述播放器用于播放所述视频流。The player is tuned to a silent mode, and the player is used to play the video stream.
  2. 根据权利要求1所述的方法,其特征在于,所述判断视频流中是否存在广告内容,包括:The method according to claim 1, wherein the determining whether the advertisement content exists in the video stream comprises:
    监测更新后的播放列表文件中的编码间断标识,所述播放列表文件为所述视频流对应的播放列表文件;Monitoring a coded discontinuity identifier in the updated playlist file, where the playlist file is a playlist file corresponding to the video stream;
    提取所述编码间断标识之后对应的第一个视频文件中的关键帧;Extracting a key frame in the first video file corresponding to the coded discontinuous identifier;
    通过比对所述关键帧与广告数据库中的帧数据来判断所述视频流中是否存在所述广告内容。The presence or absence of the advertisement content in the video stream is determined by comparing the frame data in the key frame and the advertisement database.
  3. 根据权利要求2所述的方法,其特征在于,所述通过比对所述关键帧与广告数据库中的帧数据来判断所述视频流中是否存在广告内容,包括:The method according to claim 2, wherein the determining whether the advertisement content exists in the video stream by comparing the frame data in the key frame and the advertisement database comprises:
    在预设比对次数内,若存在连续M个关键帧与所述帧数据内容对应相同,则确定所述视频流中存在所述广告内容,其中M为小于所述预设比对次数的正整数;In the preset number of comparisons, if there are consecutive M key frames corresponding to the frame data content, determining that the advertisement content exists in the video stream, where M is less than the preset comparison number Integer
    在预设比对次数内,若不存在连续M个关键帧与所述帧数据内容对应相同,则确定所述视频流中不存在所述广告内容。Within the preset number of comparisons, if there are no consecutive M key frames corresponding to the frame data content, it is determined that the advertisement content does not exist in the video stream.
  4. 根据权利要求2所述的方法,其特征在于,所述通过比对所述关键帧与广告数据库中的帧数据来判断所述视频流中是否存在广告内容,包括:The method according to claim 2, wherein the determining whether the advertisement content exists in the video stream by comparing the frame data in the key frame and the advertisement database comprises:
    将所述关键帧与所述帧数据输入到比对模型中进行比对,得到相似度,所述相似度用于描述所述关键帧与所述帧数据的相似程度;Comparing the key frame and the frame data into a comparison model to obtain a similarity, wherein the similarity is used to describe a degree of similarity between the key frame and the frame data;
    根据所述相似度判断所述视频流中是否存在所述广告内容。Determining whether the advertisement content exists in the video stream according to the similarity.
  5. 根据权利要求2至4中任意一项方法,其特征在于,在所述通过比对所述关键帧与广告数据库中的帧数据来判断所述视频流中是否存在广告内容之前,所述方法进一步包括:A method according to any one of claims 2 to 4, wherein said method further determines whether said advertisement content is present in said video stream by comparing frame data in said key frame with said advertisement database include:
    将所述关键帧中的无信息帧去除,所述无信息帧为无图像内容的帧。The non-information frame in the key frame is removed, and the non-information frame is a frame without image content.
  6. 根据权利要求1所述的方法,其特征在于,所述判断判断视频流中是否存在广告内容,包括: The method according to claim 1, wherein the determining whether the advertisement content exists in the video stream comprises:
    监测更新后的播放列表文件中的编码间断标识,所述播放列表文件为所述视频流对应的播放列表文件;Monitoring a coded discontinuity identifier in the updated playlist file, where the playlist file is a playlist file corresponding to the video stream;
    获取所述编码间断标识之后的视频文件对应的身份标识号ID;Obtaining an identity identification ID corresponding to the video file after the coding discontinuity identifier;
    将所述ID与预设标识列表进行比对,所述预设标识列表用于记录所有广告的ID;Comparing the ID with a preset identifier list, where the preset identifier list is used to record IDs of all advertisements;
    若能在所述预设标识列表中找到所述ID,则确定所述视频流中存在广告内容;If the ID is found in the preset identifier list, determining that the advertisement content exists in the video stream;
    若不能在所述预设标识列表中找到所述ID,则确定所述视频流中不存在广告内容。If the ID cannot be found in the preset identifier list, it is determined that there is no advertisement content in the video stream.
  7. 根据权利要求2所述的方法,其特征在于,所述在播放所述广告内容时开启本地音频文件,包括:The method according to claim 2, wherein the opening of the local audio file when the advertisement content is played comprises:
    在播放所述编码间断标识之后的第一个视频文件时,开启本地音频文件。The local audio file is opened when the first video file after the encoding of the discontinuous identifier is played.
  8. 根据权利要求1所述的方法,其特征在于,所述将播放器调为静音模式,包括:The method according to claim 1, wherein said adjusting said player to a silent mode comprises:
    向所述播放器发送静音请求,以使得所述播放器调为静音模式。A mute request is sent to the player to cause the player to be in a silent mode.
  9. 根据权利要求2所述的方法,其特征在于,在所述将播放器调为静音模式之后,所述方法进一步包括:The method of claim 2, wherein after the player is tuned to the silent mode, the method further comprises:
    当监测到所述播放列表文件中的下一个编码间断标识后,关闭所述本地音频文件;并且,After monitoring the next coded discontinuity identifier in the playlist file, closing the local audio file; and,
    打开所述播放器的声音。Turn on the sound of the player.
  10. 一种视频处理的装置,其特征在于,所述装置包括:A device for video processing, characterized in that the device comprises:
    判断单元,用于判断视频流中是否存在广告内容;a determining unit, configured to determine whether an advertisement content exists in the video stream;
    开启单元,用于若存在所述广告内容,则在播放所述广告内容时开启本地音频文件,所述本地音频文件为存储在本地存储系统中的音频文件;An opening unit, configured to: when the advertisement content is present, open a local audio file when the advertisement content is played, where the local audio file is an audio file stored in a local storage system;
    调节单元,用于将播放器调为静音模式,所述播放器用于播放所述视频流。And an adjustment unit, configured to adjust the player to a silent mode, the player is configured to play the video stream.
  11. 根据权利要求10所述的装置,其特征在于,所述判断单元,包括:The device according to claim 10, wherein the determining unit comprises:
    监测模块,用于监测更新后的播放列表文件中的编码间断标识,所述播放列表文件为所述视频流对应的播放列表文件;a monitoring module, configured to monitor an encoded discontinuity identifier in the updated playlist file, where the playlist file is a playlist file corresponding to the video stream;
    提取模块,用于提取所述编码间断标识之后对应的第一个视频文件中的关键帧;An extraction module, configured to extract a key frame in the first video file corresponding to the coded discontinuous identifier;
    判断模块,用于通过比对所述关键帧与广告数据库中的帧数据来判断所述视频流中是否存在所述广告内容。 The determining module is configured to determine whether the advertisement content exists in the video stream by comparing the frame data in the key frame and the advertisement database.
  12. 根据权利要求11所述的装置,其特征在于,所述判断模块,用于:The device according to claim 11, wherein the determining module is configured to:
    在预设比对次数内,若存在连续M个关键帧与所述帧数据内容对应相同,则确定所述视频流中存在所述广告内容,其中M为小于所述预设比对次数的正整数;In the preset number of comparisons, if there are consecutive M key frames corresponding to the frame data content, determining that the advertisement content exists in the video stream, where M is less than the preset comparison number Integer
    在预设比对次数内,若不存在连续M个关键帧与所述帧数据内容对应相同,则确定所述视频流中不存在所述广告内容。Within the preset number of comparisons, if there are no consecutive M key frames corresponding to the frame data content, it is determined that the advertisement content does not exist in the video stream.
  13. 根据权利要求11所述的装置,其特征在于,所述判断模块,用于:The device according to claim 11, wherein the determining module is configured to:
    将所述关键帧与所述帧数据输入到比对模型中进行比对,得到相似度,所述相似度用于描述所述关键帧与所述帧数据的相似程度;Comparing the key frame and the frame data into a comparison model to obtain a similarity, wherein the similarity is used to describe a degree of similarity between the key frame and the frame data;
    根据所述相似度判断所述视频流中是否存在所述广告内容。Determining whether the advertisement content exists in the video stream according to the similarity.
  14. 根据权利要求11至13中任意一项装置,其特征在于,所述装置进一步包括:Apparatus according to any one of claims 11 to 13 wherein said apparatus further comprises:
    去除单元,用于在所述通过比对所述关键帧与广告数据库中的帧数据来判断所述视频流中是否存在广告内容之前,将所述关键帧中的无信息帧去除,所述无信息帧为无图像内容的帧。a removing unit, configured to remove a non-information frame in the key frame before determining whether the advertisement content exists in the video stream by comparing the frame data in the key frame and the advertisement database, the none Information frames are frames with no image content.
  15. 根据权利要求10所述的装置,其特征在于,所述判断单元,包括:The device according to claim 10, wherein the determining unit comprises:
    监测模块,用于监测更新后的播放列表文件中的编码间断标识,所述播放列表文件为所述视频流对应的播放列表文件;a monitoring module, configured to monitor an encoded discontinuity identifier in the updated playlist file, where the playlist file is a playlist file corresponding to the video stream;
    获取模块,用于获取所述编码间断标识之后的视频文件对应的身份标识ID;An obtaining module, configured to acquire an identity ID corresponding to the video file after the coded discontinuity identifier;
    比对模块,用于将所述ID与预设标识列表进行比对,所述预设标识列表用于记录所有广告的ID;a comparison module, configured to compare the ID with a preset identifier list, where the preset identifier list is used to record IDs of all advertisements;
    确定模块,用于若能在所述预设标识列表中找到所述ID,则确定所述视频流中存在广告内容;若不能在所述预设标识列表中找到所述ID,则确定所述视频流中不存在广告内容。Determining a module, if the ID is found in the preset identifier list, determining that the advertisement content exists in the video stream; if the ID cannot be found in the preset identifier list, determining the There is no advertising content in the video stream.
  16. 根据权利要求11所述的装置,其特征在于,所述开启单元,用于:The device according to claim 11, wherein the opening unit is configured to:
    在播放所述编码间断标识之后的第一个视频文件时,开启本地音频文件。The local audio file is opened when the first video file after the encoding of the discontinuous identifier is played.
  17. 根据权利要求10所述的装置,其特征在于,所述调节单元,用于:The device according to claim 10, wherein the adjusting unit is configured to:
    向所述播放器发送静音请求,以使得所述播放器调为静音模式。A mute request is sent to the player to cause the player to be in a silent mode.
  18. 根据权利要求11所述的装置,其特征在于,所述装置进一步包括:The device according to claim 11, wherein the device further comprises:
    关闭单元,用于在所述将播放器调为静音模式之后,当监测到所述播放列表文件中的下一个编码间断标识后,关闭所述本地音频文件; a closing unit, configured to close the local audio file after monitoring the next coded discontinuity identifier in the playlist file after the player is adjusted to the silent mode;
    打开单元,用于打开所述播放器的声音。Open the unit to open the sound of the player.
  19. 一种电子设备,包括:An electronic device comprising:
    至少一个处理器;以及,At least one processor; and,
    与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein
    所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够:The memory stores instructions executable by the at least one processor, the instructions being executed by the at least one processor to enable the at least one processor to:
    判断视频流中是否存在广告内容;Determining whether there is advertising content in the video stream;
    若存在所述广告内容,则在播放所述广告内容时开启本地音频文件,所述本地音频文件为存储在本地存储系统中的音频文件;并且,If the advertisement content exists, opening a local audio file when the advertisement content is played, the local audio file being an audio file stored in a local storage system;
    将播放器调为静音模式,所述播放器用于播放所述视频流。The player is tuned to a silent mode, and the player is used to play the video stream.
  20. 一种非暂态计算机可读存储介质,其特征在于,所述非暂态计算机可读存储介质存储计算机指令,所述计算机指令用于使所述计算机执行权利要求1-9任一项所述的方法。A non-transitory computer readable storage medium storing computer instructions for causing the computer to perform any of claims 1-9 Methods.
  21. 一种计算机程序产品,所述计算机程序产品包括存储在非暂态计算机可读存储介质上的计算程序,所述计算机程序包括程序指令,当所述程序指令被计算机执行时,使所述计算机执行权利要求1-9任一项所述的方法。 A computer program product comprising a computing program stored on a non-transitory computer readable storage medium, the computer program comprising program instructions that, when executed by a computer, cause the computer to execute The method of any of claims 1-9.
PCT/CN2016/097217 2015-12-18 2016-08-29 Video processing method and apparatus WO2017101510A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510959280.3A CN105872749A (en) 2015-12-18 2015-12-18 Video processing method and device
CN201510959280.3 2015-12-18

Publications (1)

Publication Number Publication Date
WO2017101510A1 true WO2017101510A1 (en) 2017-06-22

Family

ID=56623791

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/097217 WO2017101510A1 (en) 2015-12-18 2016-08-29 Video processing method and apparatus

Country Status (2)

Country Link
CN (1) CN105872749A (en)
WO (1) WO2017101510A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112256565A (en) * 2020-09-24 2021-01-22 北京达佳互联信息技术有限公司 Verification method and device for open screen advertisement in application program and electronic equipment
CN113593608A (en) * 2021-06-29 2021-11-02 荣耀终端有限公司 Object recognition-based voice beautifying method, electronic device and storage medium
CN115484494A (en) * 2022-09-15 2022-12-16 云控智行科技有限公司 Method, device and equipment for processing digital twin video stream

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105872749A (en) * 2015-12-18 2016-08-17 乐视致新电子科技(天津)有限公司 Video processing method and device
CN107864400B (en) * 2016-09-22 2020-11-10 青岛海尔多媒体有限公司 Playing method and device
CN106528042A (en) * 2016-11-22 2017-03-22 上海与德信息技术有限公司 Volume control method and device based on earphone playing
CN109151496B (en) * 2018-07-25 2021-04-23 维沃移动通信有限公司 Music playing method and mobile terminal
CN111246283B (en) * 2020-01-17 2022-09-30 北京达佳互联信息技术有限公司 Video playing method and device, electronic equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101755453A (en) * 2007-05-31 2010-06-23 索尼计算机娱乐美国公司 System and method for taking control of a system during a commercial break
US20120169938A1 (en) * 2011-01-05 2012-07-05 Ray Harvey Automatic Mute Control
CN103235956A (en) * 2013-03-28 2013-08-07 天脉聚源(北京)传媒科技有限公司 Method and device for detecting advertisements
CN103945272A (en) * 2013-01-23 2014-07-23 腾讯科技(北京)有限公司 Video interaction method, apparatus and system
CN104113780A (en) * 2014-06-25 2014-10-22 小米科技有限责任公司 Advertisement processing method and apparatus
CN105872749A (en) * 2015-12-18 2016-08-17 乐视致新电子科技(天津)有限公司 Video processing method and device

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101755453A (en) * 2007-05-31 2010-06-23 索尼计算机娱乐美国公司 System and method for taking control of a system during a commercial break
US20120169938A1 (en) * 2011-01-05 2012-07-05 Ray Harvey Automatic Mute Control
CN103945272A (en) * 2013-01-23 2014-07-23 腾讯科技(北京)有限公司 Video interaction method, apparatus and system
CN103235956A (en) * 2013-03-28 2013-08-07 天脉聚源(北京)传媒科技有限公司 Method and device for detecting advertisements
CN104113780A (en) * 2014-06-25 2014-10-22 小米科技有限责任公司 Advertisement processing method and apparatus
CN105872749A (en) * 2015-12-18 2016-08-17 乐视致新电子科技(天津)有限公司 Video processing method and device

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112256565A (en) * 2020-09-24 2021-01-22 北京达佳互联信息技术有限公司 Verification method and device for open screen advertisement in application program and electronic equipment
CN113593608A (en) * 2021-06-29 2021-11-02 荣耀终端有限公司 Object recognition-based voice beautifying method, electronic device and storage medium
CN115484494A (en) * 2022-09-15 2022-12-16 云控智行科技有限公司 Method, device and equipment for processing digital twin video stream
CN115484494B (en) * 2022-09-15 2024-04-02 云控智行科技有限公司 Digital twin video stream processing method, device and equipment

Also Published As

Publication number Publication date
CN105872749A (en) 2016-08-17

Similar Documents

Publication Publication Date Title
WO2017101510A1 (en) Video processing method and apparatus
US10476925B2 (en) Media stream cue point creation with automated content recognition
US11308159B2 (en) Dynamic detection of custom linear video clip boundaries
US9369780B2 (en) Methods and systems for detecting one or more advertisement breaks in a media content stream
CN105208463B (en) The method and system of frame determination is carried out for m3u8 files
US9215496B1 (en) Determining the location of a point of interest in a media stream that includes caption data
US9723374B2 (en) Programmatically determining when credits appear during a video in order to provide supplemental information
CN112753227A (en) Audio processing for detecting the occurrence of crowd noise in a sporting event television program
US9832493B2 (en) Method and apparatus for processing audio/video file
US9137560B2 (en) Methods and systems for providing access to content during a presentation of a media content instance
US11849181B2 (en) Systems and methods for applying behavioral-based parental controls for media assets
US20210065719A1 (en) Methods and systems for intelligent content controls
US11727687B2 (en) Processing content based on natural language queries
US9197920B2 (en) Shared media experience distribution and playback
US20200159759A1 (en) Systems and methods for indexing a content asset
CN113170228A (en) Audio processing for extracting variable length disjoint segments from audiovisual content
WO2021047181A1 (en) Video type-based playback control implementation method and apparatus, and computer device
US20160127807A1 (en) Dynamically determined audiovisual content guidebook
EP3362913A1 (en) Methods, systems, and media for media guidance
US20220394323A1 (en) Supplmental audio generation system in an audio-only mode
US20240022791A1 (en) Systems and methods to adapt a schedule to be played by a media player
US11606606B1 (en) Systems and methods for detecting and analyzing audio in a media presentation environment to determine whether to replay a portion of the media
US20230319346A1 (en) Systems and methods for automatically generating content items from identified events
US11729480B2 (en) Systems and methods to enhance interactive program watching
US20220417600A1 (en) Gesture-based parental control system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16874570

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16874570

Country of ref document: EP

Kind code of ref document: A1