WO2022237461A1 - Procédé et appareil de pré-chargement de vidéo, dispositif, et support de stockage - Google Patents

Procédé et appareil de pré-chargement de vidéo, dispositif, et support de stockage Download PDF

Info

Publication number
WO2022237461A1
WO2022237461A1 PCT/CN2022/087441 CN2022087441W WO2022237461A1 WO 2022237461 A1 WO2022237461 A1 WO 2022237461A1 CN 2022087441 W CN2022087441 W CN 2022087441W WO 2022237461 A1 WO2022237461 A1 WO 2022237461A1
Authority
WO
WIPO (PCT)
Prior art keywords
candidate
duration
video
candidate video
sequence
Prior art date
Application number
PCT/CN2022/087441
Other languages
English (en)
Chinese (zh)
Inventor
黄胜兰
李小成
严冰
马超
钟振东
黄清
姜建华
Original Assignee
北京字节跳动网络技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京字节跳动网络技术有限公司 filed Critical 北京字节跳动网络技术有限公司
Publication of WO2022237461A1 publication Critical patent/WO2022237461A1/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/433Content storage operation, e.g. storage operation in response to a pause request, caching operations
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/482End-user interface for program selection
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments

Definitions

  • the present disclosure relates to the technical field of video processing, for example, to a video preloading method, device, device, and storage medium.
  • the user will play the videos in the video queue in sequence. Each of these videos will have a different bitrate resolution available for download.
  • the video is usually downloaded in pieces. When the downloaded segments are continuous, the user can play the video smoothly.
  • the user will receive a video stream composed of multiple videos recommended based on the user's historical viewing records, and each video is of different lengths. Since the videos are delivered based on the recommendation algorithm, compared with long videos, users have a high probability of not watching all the videos in the recommendation list in turn. Users are likely to skip some disliked videos and watch them directly backwards.
  • the viewing behavior of the above users has brought new challenges to the user's playback experience: how to use limited bandwidth resources to maximize the download of videos that meet the user's needs.
  • the present disclosure provides a video preloading method, device, equipment, and storage medium, which can maximize the preloading of videos that meet user needs and improve the utilization rate of bandwidth resources.
  • the present disclosure provides a video preloading method, including:
  • the candidate segment sequence with the highest playback performance is determined as the target segment sequence, and the candidate video segment in the first time window in the target segment sequence is preloaded.
  • the present disclosure also provides a video preloading device, including:
  • Candidate video fragment determination module is configured to determine multiple candidate video fragments according to the current video to be downloaded and bit rate type
  • the time window division module is configured to divide the set duration after the current moment into a set number of time windows
  • the candidate slice sequence determination module is configured to determine a plurality of candidate slice sequences according to the plurality of candidate video slices and the set number of time windows; wherein, the candidate slice sequence includes the set number of candidate video segments;
  • a playback performance determination module configured to determine the playback performance of each candidate segment sequence within the set duration
  • the target segment sequence determination module is configured to determine the candidate segment sequence with the highest playback performance as the target segment sequence, and preload the candidate video segments in the first time window in the target segment sequence.
  • the present disclosure also provides an electronic device, the electronic device comprising:
  • a storage device configured to store one or more programs
  • the one or more processing devices are made to implement the above video preloading method.
  • the present disclosure discloses a computer-readable medium on which a computer program is stored, and when the program is executed by a processing device, the above-mentioned video preloading method is realized.
  • FIG. 1 is a flow chart of a video preloading method provided by an embodiment of the present disclosure
  • FIG. 2 is a schematic structural diagram of a video preloading device provided by an embodiment of the present disclosure
  • Fig. 3 is a schematic structural diagram of an electronic device provided by an embodiment of the present disclosure.
  • the term “comprise” and its variations are open-ended, ie “including but not limited to”.
  • the term “based on” is “based at least in part on”.
  • the term “one embodiment” means “at least one embodiment”; the term “another embodiment” means “at least one further embodiment”; the term “some embodiments” means “at least some embodiments.” Relevant definitions of other terms will be given in the description below.
  • the preloading behavior adopted in the industry can be summarized as an idle time preloading task: that is, when the currently playing video has been completely downloaded, the current network is considered to be idle, and the current video is downloaded serially during the idle time of the network. Play the task of the video following the video.
  • the business end can uniformly set a preload size, such as 800KB, and the number N of videos to be downloaded. After the current video download is completed, the service end will sequentially preload the 800KB bytes of the next N videos to ensure that the data stream of the first few seconds has been cached locally when the broadcast starts, and it can be started without spending extra time for network requests. broadcast.
  • Fig. 1 is a flow chart of a video preloading method provided by an embodiment of the present disclosure. This embodiment is applicable to the situation of pre-downloading a video to be played, and the method can be executed by a video preloading device.
  • the device can be composed of hardware and/or software, and can generally be integrated into a device with a video preloading function, which can be an electronic device such as a server, a mobile terminal, or a server cluster.
  • the method includes:
  • the videos to be downloaded may include currently playing videos and unplayed videos pushed into the video queue.
  • the type of bit rate can be understood as the bit rate available for each video to be downloaded.
  • a video segment can be understood as a video segment with a fixed duration S t , for example, a video segment with a duration of 5 seconds.
  • V represents the total number of videos to be downloaded
  • L represents the available bit rate types
  • the total number of candidate video fragments is V*L.
  • X represents video fragments
  • X vl represents the vth video to be downloaded
  • the sequence of candidate slices includes a set number of candidate video slices.
  • the way to determine multiple candidate slice sequences according to multiple candidate video slices and a set number of time windows may be: for each time window, select a candidate video slice from multiple candidate video slices to obtain the set A number of candidate video slices; a set number of candidate video slices are formed into a sequence of candidate slices in time order.
  • the sequence of candidate slices contains K candidate video slices.
  • Playback performance is determined by terminal revenue and system revenue.
  • the process of determining the playback performance of each candidate segment sequence within the set duration may be: for each candidate segment sequence, determine the terminal revenue and playback probability of each candidate video segment in the candidate segment sequence; The terminal revenue and playback probability of the candidate video fragments determine the total terminal revenue of the candidate fragment sequence; determine the system revenue of the candidate fragment sequence; sum the total terminal revenue and the system revenue to obtain the candidate fragment sequence within the set duration playback performance.
  • the playback probability can be counted according to the user's historical viewing behavior, and updated according to the transition equation of the Markov probability model.
  • the method of determining the total terminal revenue of the candidate segment sequence according to the terminal revenue and playback probability of each candidate video segment may be: respectively multiply the terminal revenue and playback probability of multiple candidate video segments and accumulate , to obtain the total terminal revenue.
  • p kv is the probability of playing video v in the kth time window, Indicates the terminal revenue of playing video v in the kth time window.
  • the playback probability satisfies the following conditions:
  • System benefits can be derived from Indicates that the calculation formula for the playback performance of the candidate segment sequence within the set duration is:
  • the method of determining the terminal revenue of each candidate video segment in the candidate segment sequence may be: for each candidate video segment in the candidate segment sequence, determine the image quality, freeze loss and first frame estimation of the candidate video segment Time-consuming; determine the terminal revenue of candidate video segments based on image quality, freeze loss, and estimated time-consuming for the first frame.
  • Image quality can be characterized by setting quality evaluation indicators, such as Peak Signal-to-Noise Ratio (PSNR) or Structural SIMilarity (SSIM).
  • PSNR Peak Signal-to-Noise Ratio
  • SSIM Structural SIMilarity
  • Caton loss can be understood as the Caton effect on the currently playing video.
  • the time-consuming estimation of the first frame can be understood as the loss of start-up time.
  • the calculation formula for determining the terminal revenue of candidate video segments based on image quality, freeze loss, and first frame estimation time consumption is: Among them, Q kv is the image quality on the video v in the kth time window, R kv is the freeze loss on the video v in the kth time window, ST kv is the video quality in the kth time window It takes time to estimate the first frame of video v, ⁇ , ⁇ are the weight coefficients of different parts, which can be adjusted.
  • the process of determining the freeze loss of the candidate video segment may be: obtaining the first time length required for downloading the candidate video segment, the second time length required for receiving the set data amount of the candidate video segment, and the time length of the candidate video segment Cache playback duration; determine the stuttering loss based on the first duration, the second duration, and the cache playback duration.
  • the first duration is determined by the data volume and predicted bandwidth of the candidate video segments.
  • the set data amount may be 1 byte, and the second duration may represent the duration required from sending the download request to receiving the first byte of data.
  • the buffer playback duration can be understood as the duration corresponding to the amount of remaining playable data of the candidate video segment in the buffer.
  • the way to obtain the first duration required to download the candidate video fragments may be: obtain the data volume and predicted bandwidth of the candidate video fragments; divide the data volume of the candidate video fragments by the predicted bandwidth to obtain the download candidate The first duration required for video fragmentation.
  • the predicted bandwidth may be to predict an average bandwidth within a certain period of time in the future.
  • the formula for calculating the first duration is: Wherein, S kv is the data volume of the video slice corresponding to the video v in the kth time window, and C represents the prediction bandwidth.
  • the way to obtain the cached playback duration of the candidate video segment may be: obtain the buffered playback duration of the previous time window of the candidate video segment and the buffer increase duration from the previous time window to the current time window; according to the above
  • the buffered playback duration, first duration, buffer increase duration and second duration of a time window determine the buffered playback duration of the candidate video segment at the current moment.
  • the calculation formula for determining the cached playback duration of the candidate video segment at the current moment according to the cached playback duration, first duration, cache increase duration, and second duration of the previous time window is: Wherein, D (k-1)v is the buffer increase time length from the previous time window to the current time window, and ⁇ t is the second time length.
  • the calculation formula for determining the freeze loss according to the first duration, the second duration, and the buffer playback duration is:
  • B l is the warning value of Buffer, which is a set constant.
  • the method for determining the estimated time-consuming of the first frame of the candidate video segment may be as follows: if the cached playback duration of the candidate video segment is greater than or equal to the duration required for starting the broadcast, then the estimated time-consuming of the first frame is 0; If the cached playback duration of the video segment is less than the duration required for the start of playback, the estimated time spent on the first frame is the set value.
  • the calculation formula for determining the time-consuming estimation of the first frame of the candidate video segment is: Among them, r is the time required for broadcasting, which is a constant, is the set value.
  • the way to determine the system revenue of the candidate slice sequence may be: sum the data volumes corresponding to multiple candidate video slices in the candidate slice sequence to obtain the total data volume; The expected value of multiple candidate video fragments; determine the system benefit according to the total data volume and the expected value.
  • the calculation formula for determining the system benefit based on the total data volume and expected value is: Among them, E v is the expectation value of the user watching candidate video segments in multiple time windows, and ⁇ is a weight parameter, which can be adjusted.
  • S150 Determine the candidate segment sequence with the highest playback performance as the target segment sequence, and preload the candidate video segments in the first time window in the target segment sequence.
  • the candidate segment sequence with the highest playback performance is determined as the target segment sequence, and the candidate video segment in the first time window in the target segment sequence is preloaded.
  • the process of S110-S150 is repeated to calculate the candidate video segment that needs to be preloaded at the next moment.
  • a plurality of candidate video segments are determined according to the current video to be downloaded and the code rate type; the set duration after the current moment is divided into a set number of time windows; according to the multiple candidate video segments and Set the number of time windows to determine multiple candidate fragment sequences; determine the playback performance of each candidate fragment sequence within a set duration; determine the candidate fragment sequence with the highest playback performance as the target fragment sequence, and preload the target The candidate video slice in the first time window in the slice sequence.
  • the video preloading method provided by the embodiment of the present disclosure determines the candidate segment sequence with the highest playback performance as the target segment sequence, and preloads the candidate video segment in the first time window in the target segment sequence, which can maximize Videos that meet user needs can be pre-downloaded efficiently, improving the utilization of bandwidth resources.
  • Fig. 2 is a schematic structural diagram of a video preloading device provided by an embodiment of the present disclosure. As shown in Figure 2, the device includes:
  • the candidate video segment determination module 210 is configured to determine a plurality of candidate video segments according to the current video to be downloaded and the code rate type; the time window division module 220 is configured to divide the set duration after the current moment into a set amount of time Window; the candidate slice sequence determination module 230 is configured to determine a plurality of candidate slice sequences according to a plurality of candidate video slices and a set number of time windows; wherein, the candidate slice sequence includes a set number of candidate video slices;
  • the playback performance determination module 240 is configured to determine the playback performance of each candidate segment sequence within the set duration; the target segment sequence determination module 250 is configured to determine the candidate segment sequence with the highest playback performance as the target segment sequence, And preload the candidate video segments in the first time window in the target segment sequence.
  • the candidate slice sequence determination module 230 is set to:
  • a candidate video slice is selected from multiple candidate video slices to obtain a set number of candidate video slices; and a set number of candidate video slices are formed into a sequence of candidate slices in time order.
  • the playback performance determination module 240 is set to:
  • For each candidate segment sequence determine the terminal revenue and playback probability of each candidate video segment in the candidate segment sequence; determine the total terminal revenue of the candidate segment sequence according to the terminal revenue and playback probability of each candidate video segment; Determine the system revenue of the candidate slice sequence; sum the total terminal revenue and the system revenue to obtain the playback performance of the candidate slice sequence within the set duration.
  • the playback performance determination module 240 is configured to determine the terminal revenue of each candidate video segment in the candidate segment sequence in the following manner:
  • the playback performance determination module 240 is configured to determine the freeze loss of the candidate video fragments in the following manner:
  • the playback duration determines the stutter loss.
  • the playback performance determination module 240 is configured to determine the time-consuming estimation of the first frame of the candidate video segment in the following manner:
  • the estimated time-consuming of the first frame is 0; if the buffered playback duration of the candidate video segment is less than the duration required for playback, the estimated duration of the first frame is set value.
  • the playback performance determination module 240 is configured to obtain the first duration required for downloading the candidate video segments in the following manner:
  • the playback performance determination module 240 is configured to obtain the cached playback duration of the candidate video fragments in the following manner:
  • the playback performance determination module 240 is configured to determine the system revenue of the candidate fragment sequence in the following manner:
  • the above-mentioned device can execute the methods provided by all the foregoing embodiments of the present disclosure, and has corresponding functional modules and effects for executing the above-mentioned methods.
  • the above-mentioned device can execute the methods provided by all the foregoing embodiments of the present disclosure, and has corresponding functional modules and effects for executing the above-mentioned methods.
  • FIG. 3 it shows a schematic structural diagram of an electronic device 300 suitable for implementing an embodiment of the present disclosure.
  • the electronic device 300 in the embodiment of the present disclosure may include but not limited to mobile phones, notebook computers, digital broadcast receivers, personal digital assistants (Personal Digital Assistant, PDA), tablet computers (PAD), portable multimedia players (Portable Media Player, PMP), mobile terminals such as vehicle terminals (such as vehicle navigation terminals), fixed terminals such as digital TV (Television, TV), desktop computers, etc., or various forms of servers, such as independent servers or server clusters.
  • PDA Personal Digital Assistant
  • PMP portable multimedia players
  • mobile terminals such as vehicle terminals (such as vehicle navigation terminals), fixed terminals such as digital TV (Television, TV), desktop computers, etc.
  • servers such as independent servers or server clusters.
  • the electronic device shown in FIG. 3 is only an example, and should not limit the functions and scope of use of the embodiments of the present disclosure.
  • an electronic device 300 may include a processing device (such as a central processing unit, a graphics processing unit, etc.)
  • the device 308 loads programs in the random access storage device (Random Access Memory, RAM) 303 to perform various appropriate actions and processes.
  • RAM Random Access Memory
  • various programs and data necessary for the operation of the electronic device 300 are also stored.
  • the processing device 301, ROM 302, and RAM 303 are connected to each other through a bus 304.
  • An input/output (Input/Output, I/O) interface 305 is also connected to the bus 304 .
  • an input device 306 including, for example, a touch screen, a touchpad, a keyboard, a mouse, a camera, a microphone, an accelerometer, a gyroscope, etc.; including, for example, a liquid crystal display (Liquid Crystal Display, LCD) , an output device 307 such as a speaker, a vibrator, etc.; a storage device 308 including, for example, a magnetic tape, a hard disk, etc.; and a communication device 309.
  • the communication means 309 may allow the electronic device 300 to perform wireless or wired communication with other devices to exchange data.
  • FIG. 3 shows electronic device 300 having various means, it is not a requirement to implement or possess all of the means shown. More or fewer means may alternatively be implemented or provided.
  • embodiments of the present disclosure include a computer program product comprising a computer program carried on a computer readable medium, the computer program comprising program code for performing a word recommendation method.
  • the computer program may be downloaded and installed from a network via communication means 309, or from storage means 308, or from ROM 302.
  • the processing device 301 When the computer program is executed by the processing device 301, the above-mentioned functions defined in the methods of the embodiments of the present disclosure are performed.
  • the computer-readable medium mentioned above in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two.
  • a computer readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination thereof.
  • Examples of computer readable storage media may include, but are not limited to: electrical connections with one or more wires, portable computer disks, hard disks, RAM, ROM, Erasable Programmable Read-Only Memory (EPROM) or flash memory), optical fiber, portable compact disk read-only memory (Compact Disc Read-Only Memory, CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal in baseband or propagated as part of a carrier wave carrying computer-readable program code therein. Such propagated data signals may take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • a computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium, which can transmit, propagate, or transmit a program for use by or in conjunction with an instruction execution system, apparatus, or device .
  • the program code contained on the computer readable medium can be transmitted by any appropriate medium, including but not limited to: electric wire, optical cable, radio frequency (Radio Frequency, RF), etc., or any suitable combination of the above.
  • the client and the server can communicate using any currently known or future network protocols such as Hypertext Transfer Protocol (HyperText Transfer Protocol, HTTP), and can communicate with digital data in any form or medium
  • the communication eg, communication network
  • Examples of communication networks include local area networks (Local Area Network, LAN), wide area networks (Wide Area Network, WAN), internetworks (e.g., the Internet), and peer-to-peer networks (e.g., ad hoc peer-to-peer networks), as well as any currently existing networks that are known or developed in the future.
  • the above-mentioned computer-readable medium may be included in the above-mentioned electronic device, or may exist independently without being incorporated into the electronic device.
  • the above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the electronic device, the electronic device: determines a plurality of candidate video segments according to the current video to be downloaded and the type of code rate; The set duration after the current moment is divided into a set number of time windows; multiple candidate slice sequences are determined according to the multiple candidate video slices and the set number of time windows; wherein, the candidate slice sequences Including the set number of candidate video fragments; determining the playback performance of each candidate fragment sequence within the set duration; determining the candidate fragment sequence with the highest playback performance as the target fragment sequence, and preloading all Candidate video slices in the first time window in the target slice sequence.
  • Computer program code for carrying out operations of the present disclosure may be written in one or more programming languages, or combinations thereof, including but not limited to object-oriented programming languages—such as Java, Smalltalk, C++, and Includes conventional procedural programming languages - such as the "C" language or similar programming languages.
  • the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server.
  • the remote computer can be connected to the user computer through any kind of network, including a LAN or WAN, or it can be connected to an external computer (eg via the Internet using an Internet Service Provider).
  • each block in a flowchart or block diagram may represent a module, program segment, or portion of code that contains one or more logical functions for implementing specified executable instructions.
  • the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or they may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations can be implemented by a dedicated hardware-based system that performs the specified functions or operations , or may be implemented by a combination of dedicated hardware and computer instructions.
  • the units involved in the embodiments described in the present disclosure may be implemented by software or by hardware. Wherein, the name of the unit does not constitute a limitation of the unit itself in one case.
  • exemplary types of hardware logic components include: Field Programmable Gate Arrays (Field Programmable Gate Arrays, FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (Application Specific Standard Parts, ASSP), System on Chip (System on Chip, SOC), Complex Programmable Logic Device (Complex Programmable Logic Device, CPLD) and so on.
  • a machine-readable medium may be a tangible medium that may contain or store a program for use by or in conjunction with an instruction execution system, apparatus, or device.
  • a machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium.
  • a machine-readable medium may include, but is not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, apparatus, or devices, or any suitable combination of the foregoing. Examples of machine-readable storage media would include one or more wire-based electrical connections, portable computer disks, hard disks, RAM, ROM, EPROM or flash memory, optical fiber, CD-ROM, optical storage devices, magnetic storage devices, or Any suitable combination of the above.
  • the embodiments of the present disclosure disclose a video preloading method, including:
  • the candidate segment sequence with the highest playback performance is determined as the target segment sequence, and the candidate video segment in the first time window in the target segment sequence is preloaded.
  • the determining multiple candidate slice sequences according to the multiple candidate video slices and the set number of time windows includes:
  • For each time window select a candidate video slice from the plurality of candidate video slices to obtain a set number of candidate video slices;
  • the determination of the playback performance of each candidate segment sequence within the set duration includes:
  • For each candidate segment sequence determine the terminal revenue and playback probability of each candidate video segment in the candidate segment sequence
  • the determining the terminal revenue of each candidate video segment in the candidate segment sequence includes:
  • For each candidate video slice in the candidate slice sequence determine the image quality, freeze loss, and first frame estimation time-consuming of the candidate video slice;
  • the determination of the freezing loss of the candidate video fragments includes:
  • the stuttering loss is determined according to the first duration, the second duration, and the cache playback duration.
  • the time-consuming determination of the first frame of the candidate video segment includes:
  • the estimated time-consuming of the first frame is 0;
  • the estimated time-consuming of the first frame is a set value.
  • the acquisition of the first duration required for downloading the candidate video segments includes:
  • the acquisition of the cached playback duration of the candidate video fragments includes:
  • the buffered playback duration of the candidate video segment at the current moment is determined according to the cached playback duration of the previous time window, the first duration, the cache increase duration, and the second duration.
  • the determining the system income of the candidate fragmentation sequence includes:
  • the system benefit is determined according to the total data volume and the expected value.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

Sont divulgués un procédé et un appareil de pré-chargement de vidéo, un dispositif, et un support de stockage. Le procédé de pré-chargement de vidéo comprend les étapes suivantes : sur la base d'une vidéo courante à télécharger vers l'aval et d'un type de débit de code, détermination d'une pluralité de fragments vidéo candidats (S110) ; division d'une durée définie après le moment courant en un nombre défini de fenêtres temporelles (S120) ; sur la base de la pluralité de fragments vidéo candidats et du nombre défini de fenêtres temporelles, détermination d'une pluralité de séquences de fragments candidats (S130) ; détermination des performances de lecture de chaque séquence de fragments candidats à l'intérieur de la durée définie (S140) ; détermination de la séquence de fragments candidats présentant la performance de lecture la plus élevée en tant que séquence de fragments cibles, et pré-chargement des fragments vidéo candidats dans la première fenêtre temporelle dans la séquence de fragments cibles (S150).
PCT/CN2022/087441 2021-05-13 2022-04-18 Procédé et appareil de pré-chargement de vidéo, dispositif, et support de stockage WO2022237461A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110523764.9A CN115348460B (zh) 2021-05-13 2021-05-13 视频的预加载方法、装置、设备及存储介质
CN202110523764.9 2021-05-13

Publications (1)

Publication Number Publication Date
WO2022237461A1 true WO2022237461A1 (fr) 2022-11-17

Family

ID=83977858

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/087441 WO2022237461A1 (fr) 2021-05-13 2022-04-18 Procédé et appareil de pré-chargement de vidéo, dispositif, et support de stockage

Country Status (2)

Country Link
CN (1) CN115348460B (fr)
WO (1) WO2022237461A1 (fr)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2012160796A (ja) * 2011-01-31 2012-08-23 Mitsubishi Electric Corp プレイリスト作成装置、プレイリスト編集装置
US20160308994A1 (en) * 2015-04-19 2016-10-20 Carlos Manuel Gonzalez Pre-Load of Video Content to Optimize Internet Usage
CN106550284A (zh) * 2015-09-21 2017-03-29 北京国双科技有限公司 一种播放分片视频的方法及装置
US20180035151A1 (en) * 2016-08-01 2018-02-01 Microsoft Technology Licensing, Llc Video segment playlist generation in a video management system
WO2018201746A1 (fr) * 2017-05-05 2018-11-08 广州优视网络科技有限公司 Procédé de préchargement vidéo, dispositif, lecteur vidéo, et dispositif électronique
CN110072145A (zh) * 2019-04-03 2019-07-30 北京字节跳动网络技术有限公司 用于终端设备的信息播放方法、装置和终端设备
US20200351564A1 (en) * 2017-10-30 2020-11-05 Guangzhou Huya Information Technology Co., Ltd. Video Playback Control Method, Apparatus, and Terminal
CN112423127A (zh) * 2020-11-20 2021-02-26 上海哔哩哔哩科技有限公司 视频加载方法及装置

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201615022A (en) * 2014-10-14 2016-04-16 Hon Hai Prec Ind Co Ltd Video preloading system and method of video preloading
CN104683857B (zh) * 2015-01-23 2018-05-29 华为技术有限公司 用于数据预加载的可视化呈现的方法和设备
CN105657523B (zh) * 2016-01-28 2019-11-08 腾讯科技(深圳)有限公司 视频预加载的方法和装置
US20180109827A1 (en) * 2016-10-13 2018-04-19 International Business Machines Corporation User affinity for video content and video content recommendations
CN106791898B (zh) * 2016-12-12 2020-02-14 广州华多网络科技有限公司 一种直播视频加载方法和装置
CN108024145B (zh) * 2017-12-07 2020-12-11 北京百度网讯科技有限公司 视频推荐方法、装置、计算机设备和存储介质
US20190200051A1 (en) * 2017-12-27 2019-06-27 Facebook, Inc. Live Media-Item Transitions
CN109040801B (zh) * 2018-07-19 2019-07-09 北京达佳互联信息技术有限公司 媒体码率自适应方法、装置、计算机设备及存储介质
CN111385660B (zh) * 2018-12-28 2022-07-12 广州市百果园信息技术有限公司 视频的点播方法、装置、设备及存储介质
CN111866549B (zh) * 2019-04-29 2023-03-24 腾讯科技(深圳)有限公司 一种视频处理方法及装置、终端、存储介质
CN110868626B (zh) * 2019-11-06 2021-06-11 北京达佳互联信息技术有限公司 一种内容数据预加载的方法及装置
CN112423125A (zh) * 2020-11-20 2021-02-26 上海哔哩哔哩科技有限公司 视频加载方法及装置
CN112672186B (zh) * 2020-12-09 2023-03-24 北京达佳互联信息技术有限公司 视频预加载的方法和装置

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2012160796A (ja) * 2011-01-31 2012-08-23 Mitsubishi Electric Corp プレイリスト作成装置、プレイリスト編集装置
US20160308994A1 (en) * 2015-04-19 2016-10-20 Carlos Manuel Gonzalez Pre-Load of Video Content to Optimize Internet Usage
CN106550284A (zh) * 2015-09-21 2017-03-29 北京国双科技有限公司 一种播放分片视频的方法及装置
US20180035151A1 (en) * 2016-08-01 2018-02-01 Microsoft Technology Licensing, Llc Video segment playlist generation in a video management system
WO2018201746A1 (fr) * 2017-05-05 2018-11-08 广州优视网络科技有限公司 Procédé de préchargement vidéo, dispositif, lecteur vidéo, et dispositif électronique
US20200351564A1 (en) * 2017-10-30 2020-11-05 Guangzhou Huya Information Technology Co., Ltd. Video Playback Control Method, Apparatus, and Terminal
CN110072145A (zh) * 2019-04-03 2019-07-30 北京字节跳动网络技术有限公司 用于终端设备的信息播放方法、装置和终端设备
CN112423127A (zh) * 2020-11-20 2021-02-26 上海哔哩哔哩科技有限公司 视频加载方法及装置

Also Published As

Publication number Publication date
CN115348460B (zh) 2024-06-07
CN115348460A (zh) 2022-11-15

Similar Documents

Publication Publication Date Title
US10110694B1 (en) Adaptive transfer rate for retrieving content from a server
US10659832B1 (en) Dynamic bitrate selection for streaming media
CN112135169B (zh) 一种媒体内容加载方法、装置、设备和介质
CN106998485B (zh) 视频直播方法及装置
WO2023116233A1 (fr) Procédé et appareil de prédiction de saccades d'une vidéo, dispositif et support
WO2023051243A1 (fr) Procédé et appareil de commutation de débit binaire vidéo, dispositif électronique, et support de stockage
CN112954354B (zh) 视频的转码方法、装置、设备和介质
CN111147606A (zh) 数据传输的方法、装置、终端及存储介质
WO2023165371A1 (fr) Procédé et appareil de lecture audio, ainsi que dispositif électronique et support de stockage
CN112887795A (zh) 视频播放方法、装置、设备和介质
WO2023035879A1 (fr) Procédé de commutation d'angle de vue, appareil et système pour vidéo à angle de vue libre, dispositif et support
WO2022228390A1 (fr) Procédé, appareil et dispositif de traitement de contenu multimédia, et support de stockage
CN110636367A (zh) 一种视频加载方法、装置、终端设备及介质
CN113891132A (zh) 一种音视频同步监控方法、装置、电子设备及存储介质
WO2022134997A1 (fr) Procédé et appareil de lecture de saut vidéo, dispositif terminal et support de stockage
CN114786055A (zh) 一种预加载方法、装置、电子设备及介质
WO2023197811A1 (fr) Procédé et appareil de téléchargement de vidéo, procédé et appareil de transmission de vidéo, dispositif terminal, serveur et support
CN113542856A (zh) 在线录像的倒放方法、装置、设备和计算机可读介质
WO2022237461A1 (fr) Procédé et appareil de pré-chargement de vidéo, dispositif, et support de stockage
WO2023179404A1 (fr) Procédé de démarrage de diffusion en continu en direct, et dispositif et produit de programme
CN112153322B (zh) 数据分发方法、装置、设备及存储介质
WO2022188618A1 (fr) Procédé, appareil et dispositif de préchargement de ressources, et support de stockage
CN113242446B (zh) 视频帧的缓存方法、转发方法、通信服务器及程序产品
CN114760506B (zh) 视频转码的评估方法、装置、设备及存储介质
WO2022242498A1 (fr) Procédé et appareil de planification de cdn, dispositif et support de stockage

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22806431

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 23.02.2024)