WO2023029237A1 - Procédé de prévisualisation vidéo et terminal - Google Patents

Procédé de prévisualisation vidéo et terminal Download PDF

Info

Publication number
WO2023029237A1
WO2023029237A1 PCT/CN2021/133221 CN2021133221W WO2023029237A1 WO 2023029237 A1 WO2023029237 A1 WO 2023029237A1 CN 2021133221 W CN2021133221 W CN 2021133221W WO 2023029237 A1 WO2023029237 A1 WO 2023029237A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
preview
multimedia resources
frames
frame
Prior art date
Application number
PCT/CN2021/133221
Other languages
English (en)
Chinese (zh)
Inventor
李芳�
Original Assignee
游艺星际(北京)科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 游艺星际(北京)科技有限公司 filed Critical 游艺星际(北京)科技有限公司
Publication of WO2023029237A1 publication Critical patent/WO2023029237A1/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/431Generation of visual interfaces for content selection or interaction; Content or additional data rendering
    • H04N21/4312Generation of visual interfaces for content selection or interaction; Content or additional data rendering involving specific graphical features, e.g. screen layout, special fonts or colors, blinking icons, highlights or animations
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/443OS processes, e.g. booting an STB, implementing a Java virtual machine in an STB or power management in an STB
    • H04N21/4438Window management, e.g. event handling following interaction with the user interface
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments

Definitions

  • the present disclosure relates to the technical field of the Internet, and in particular to a video preview method and terminal.
  • video websites or video applications include a large number of videos.
  • video websites or video applications will generate preview information for videos. Users can judge whether the video content is of interest by viewing the preview information of the video. content, and then choose the video you are interested in.
  • preview information is usually displayed in the form of a video preview image displayed on a video list page, and the preview image is generally generated by intercepting one or more video frames in a video.
  • the disclosure provides a video preview method and a terminal.
  • the disclosed technical scheme is as follows:
  • a video preview method executed by a terminal including:
  • the video preview interface includes previewable videos
  • multiple multimedia resources are simultaneously previewed and displayed on the video preview interface.
  • the video frame of the video can reflect the content of the video
  • multiple multimedia resources generated based on multiple video frames of the video can fuse the content of the video.
  • the user can preview the video content provided by multiple multimedia resources before watching the video, so as to have a pre-understanding of the whole picture of the video, thereby improving the video preview effect.
  • a video preview method executed by a terminal including:
  • a first preview frame corresponding to the video is displayed in the video preview interface, the first preview frame includes a plurality of multimedia resources, and each multimedia resource consists of at least one video in the video frame generation.
  • a video preview device including:
  • the first display unit is configured to display a video preview interface, and the video preview interface includes previewable videos;
  • a generating unit configured to extract multiple video frames from the video, and generate multiple multimedia resources from the multiple video frames
  • the first display unit is configured to simultaneously preview and display the multiple multimedia resources in the video preview interface based on the display order of the multiple multimedia resources.
  • a video preview device including:
  • the second display unit is configured to display a video preview interface, and the video preview interface displays a cover of the video;
  • the second presentation unit is configured to display a first preview frame corresponding to the video in the video preview interface in response to the video preview operation based on the cover, the first preview frame includes a plurality of multimedia resources, and each multimedia Assets are generated from at least one video frame in the video.
  • a terminal includes a processor and a memory for storing instructions executable by the processor; wherein the processor is configured to execute the instructions, so as to implement the above-mentioned first A video preview method in one hand.
  • a terminal includes a processor and a memory for storing instructions executable by the processor; wherein the processor is configured to execute the instructions, so as to implement the above-mentioned first The video preview method in the second aspect.
  • a non-volatile computer-readable storage medium When instructions in the non-volatile computer-readable storage medium are executed by a processor of a terminal, the terminal can execute The video preview method in the first aspect above.
  • a non-volatile computer-readable storage medium When instructions in the non-volatile computer-readable storage medium are executed by a processor of a terminal, the terminal can execute The video preview method in the second aspect above.
  • a computer program product including a computer program/instruction, and when the computer program/instruction is executed by a processor, the video preview method in the first aspect above is implemented.
  • a computer program product including a computer program/instruction, and when the computer program/instruction is executed by a processor, the video preview method in the second aspect above is implemented.
  • Fig. 1 is an implementation environment diagram of a video preview method shown according to an exemplary embodiment
  • Fig. 2 is a flow chart of a video preview method shown according to an exemplary embodiment
  • Fig. 3 is a flowchart of a video preview method shown according to an exemplary embodiment
  • Fig. 4 is a schematic diagram of a video cover according to an exemplary embodiment
  • Fig. 5 is a flowchart of a video preview method shown according to an exemplary embodiment
  • Fig. 6 is a flowchart of a video preview method according to an exemplary embodiment
  • Fig. 7 is a flowchart of a video preview method shown according to an exemplary embodiment
  • Fig. 8 is a block diagram of a video preview device according to an exemplary embodiment
  • Fig. 9 is a block diagram of a video preview device according to an exemplary embodiment
  • Fig. 10 is a block diagram of a terminal according to an exemplary embodiment.
  • the user information involved in this disclosure is information authorized by the user or fully authorized by all parties.
  • FIG. 1 is an implementation environment diagram of a video preview method according to an exemplary embodiment.
  • the implementation environment includes: a terminal 10 and a server 20 .
  • the terminal 10 is connected to the server 20 through a wireless or wired network.
  • a target application program provided by the server 20 is installed on the terminal 10, and the terminal 10 realizes data interaction with the server 20 through the target application program.
  • the target application program is an application program in the operating system of the terminal or an application program provided by a third party.
  • the target application program is any application program capable of playing video; for example, the target application program is a video playback application program or a browser.
  • This target application also has a video preview feature.
  • the target application program can also have other functions, such as a sharing function, a live broadcast function, and the like.
  • the terminal 10 is at least one of devices such as a mobile phone, a tablet computer, and a PC (Personal Computer) device.
  • the server 20 is at least one of a server, a server cluster composed of multiple servers, a cloud server, a cloud computing platform, and a virtualization center.
  • the terminal 10 displays a previewable video on the video preview interface of the target application program, generates multiple multimedia resources for the video, and displays multiple multimedia resources of the video.
  • the multimedia resource is a static picture, a dynamic preview picture or a video segment.
  • the terminal 10 obtains the video from the server 20, generates multiple multimedia resources of the video through the video preview method provided by the embodiments of the present disclosure, and previews and displays the multiple multimedia resources in the video preview interface. In other embodiments, the terminal 10 generates multiple multimedia resources of the video by means of the server 20, the server 20 returns the multiple multimedia resources to the terminal 10, and the terminal 10 previews and displays the multiple multimedia resources on the video preview interface.
  • the video preview method provided by the embodiments of the present disclosure is mainly applied to videos with a playing time longer than 5 minutes, that is, long videos; the video preview method can be applied in the following two scenarios:
  • the video on the video preview interface in the target application is a video uploaded by the user.
  • the terminal 10 previews multiple multimedia resources displaying the video through the method provided by the embodiment of the present disclosure, so that other users can know the video in advance based on multiple multimedia resources before playing the video. of video content.
  • the video in the target application program is a film and television work, that is, a video produced by a technician with certain professional knowledge.
  • the terminal 10 previews multiple multimedia resources displaying the video through the method provided by the embodiment of the present disclosure, so that the user can know the video in advance based on multiple multimedia resources before playing the video. of video content.
  • Fig. 2 is a flowchart of a video preview method according to an exemplary embodiment. As shown in Fig. 2, the video preview method is executed by a terminal, and includes the following steps.
  • Step S21 The terminal displays a video preview interface, where the video preview interface includes previewable videos.
  • Step S22 the terminal extracts multiple video frames from the video, and generates multiple multimedia resources from the multiple video frames.
  • Step S23 Based on the presentation order of the multiple multimedia resources, the terminal simultaneously previews and displays the multiple multimedia resources on the video preview interface.
  • a plurality of video frames are extracted from the video, including:
  • multiple video frames matching the tag information are extracted from the sequence of video frames.
  • extracting a plurality of video frames matching the tag information from the sequence of video frames includes:
  • a first number of video frames with the highest matching degree are extracted from the sequence of video frames.
  • determining a video tag for each video frame in the sequence of video frames comprises:
  • the video tag of the video frame is determined.
  • a plurality of video frames are extracted from the video, including:
  • a target video frame is extracted every interval of the second number of video frames to obtain a plurality of video frames.
  • converting the video to a sequence of video frames comprises:
  • the video is divided into multiple video segments
  • the video frame sequence is composed of video frames extracted from each video segment.
  • multiple multimedia resources are simultaneously previewed and displayed, including:
  • the number of multiple multimedia resources is M ⁇ N
  • the first preview frame includes M ⁇ N preview frames
  • each preview frame is used to display a multimedia resource
  • both M and N are integers greater than 1.
  • the method also includes:
  • the size of the preview frame for displaying the multimedia resource is set.
  • the method also includes:
  • the video preview interface includes multiple target videos that can be previewed, and the multiple target videos include the video; based on the display sequence of multiple multimedia resources, in the video preview interface, multiple multimedia resources are simultaneously previewed and displayed ,include:
  • the second preview frame includes a plurality of first preview frames, and each first preview frame is used to preview a multimedia resource showing a target video.
  • multiple multimedia resources are simultaneously previewed and displayed, including:
  • multiple multimedia resources are played in turn in the video preview interface, and thumbnails of unplayed multimedia resources are displayed.
  • the cover of the video is displayed in the video preview interface, and the cover is used to trigger a video preview request; the method also includes:
  • a step of extracting multiple video frames from the video and generating multiple multimedia resources from the multiple video frames is performed.
  • receiving the video preview request includes:
  • the video preview interface includes a preview entry, and in response to a trigger operation based on the preview entry, a video preview request based on the trigger operation is received.
  • the method also includes at least one of the following implementations:
  • the display order of the multiple multimedia resources is determined, wherein the display positions of the multiple multimedia resources whose similarity is smaller than the similarity threshold are not adjacent to each other in the video preview interface.
  • Fig. 3 is a flow chart of a video preview method according to an exemplary embodiment. As shown in Fig. 3, the video preview method is executed by a terminal. In the embodiment of the present disclosure, the terminal directly previews and displays the video in the video preview interface. Using multiple multimedia resources as an example, the method includes the following steps.
  • Step 301 The terminal displays a video preview interface, where the video preview interface includes previewable videos.
  • Step 302 the terminal converts the video into a sequence of video frames.
  • Step 303 the terminal extracts multiple video frames from the video frame sequence.
  • Step 304 The terminal generates multiple multimedia resources from multiple video frames.
  • Step 305 Based on the display sequence of the multiple multimedia resources, the terminal simultaneously previews and displays the multiple multimedia resources in the video preview interface.
  • a target application program is installed in the terminal.
  • the user triggers the terminal to run the target application; the terminal displays a video preview interface of the target application in response to the target application being triggered; the user previews the video through the video preview interface.
  • the video preview interface also includes a video acquisition entry of previewable videos, and the video acquisition entry of any video is used to trigger playing the video, that is, the video acquisition entry is also called a video playback entry.
  • the video acquisition entry includes a play button of the video.
  • the terminal sets the video acquisition entry on the cover of the video. The user triggers the video acquisition entry, and the terminal jumps from the video preview interface to the video playback interface in response to the video acquisition entry being triggered, and plays the video on the video playback interface.
  • the target application program is a video playing application program or a browser.
  • the implementation of the terminal displaying the video preview interface of the target application includes the following two situations:
  • the terminal displays a main interface of the video playback application program, and the main interface is a video preview interface.
  • the terminal displays the main interface of the browser, and based on the Uniform Resource Locator (Uniform Resource Locator, URL) input by the user on the main interface, the terminal displays the URL The video preview interface of the corresponding video website.
  • Uniform Resource Locator Uniform Resource Locator
  • the terminal extracts multiple video frames of the video before previewing and displaying the video.
  • the terminal converts the video into a sequence of video frames, and extracts a plurality of video frames from the sequence of video frames, that is, performs the operations of steps 302-303.
  • the terminal samples the video based on the sampling duration to obtain the video frame sequence
  • the implementation of step 302 includes the following process: the terminal divides the video into multiple video segments based on the sampling duration; Extracting a third number of video frames from the video segment; and composing the video frame sequence from the video frames extracted from each video segment.
  • the playback duration of each video segment is the sampling duration.
  • the sampling duration can be set and changed according to requirements, which is not specifically limited in the present disclosure; for example, the sampling duration is 0.5 seconds, 1 second, or 2 seconds.
  • the ratio of the third quantity to the sampling duration is not greater than the frame rate of the video. For example, if the frame rate of the video is 30 frames per second and the sampling duration is 1 second, then the third number is not greater than 30 frames.
  • the playback time of the video is 10 minutes
  • the sampling time is 1 second
  • the third number is 24 frames
  • a video frame sequence is obtained by extracting multiple video frames from a video, which provides data support for subsequent generation of multimedia resources.
  • the terminal does not need to display all the video frames in the video frame sequence, so the terminal extracts some video frames from the video frame sequence.
  • the terminal directly extracts multiple video frames from the sequence of video frames, that is, the first case below; or the terminal extracts multiple video frames from the sequence of video frames based on the tag information of the currently logged-in account, that is, realizes Personalized recommendations for the currently logged in account, which is the second case below.
  • the first case the terminal extracts a target video frame every interval of the second number of video frames in the sequence of video frames to obtain multiple video frames.
  • the second number can be set and changed according to requirements, and this disclosure does not specifically limit it; taking the number of video frames in the video frame sequence as 864,000 frames as an example, the second number is 24 frames, and the terminal in the video frame In the sequence, a target video frame is extracted every 24 frames, and 36000 video frames are obtained.
  • the video by converting the video into a video frame sequence, according to the set extraction frequency, that is, every interval of the second number, extracting a target video frame from the video frame sequence to obtain a plurality of video frames, realizing It extracts the content that needs to be previewed and displayed in the video, thereby saving display resources.
  • Case 2 The terminal extracts multiple video frames from the sequence of video frames based on the tag information of the currently logged-in account, including the following process: the terminal obtains the tag information of the currently logged-in account; Multiple video frames matching the tag information are extracted from the sequence.
  • the tag information includes at least one tag and a score corresponding to each tag.
  • This tag indicates the style of the video, for example, two-dimensional, landscape, or funny.
  • the score corresponding to the tag can represent the account's preference for the style indicated by the tag. In some embodiments, the score is positively correlated with the preference. For example, the higher the score, the more the account likes the style of video, that is, the deeper the preference.
  • the terminal determines the tag information of the currently logged-in account based on the stored tag information, which is the following first implementation mode; Label information, that is, the second implementation method below.
  • the historically watched videos refer to the videos watched by the current login account during the historical video watching process.
  • the terminal has locally stored the tag information of the currently logged-in account, and the terminal directly obtains the tag information.
  • the server has stored the tag information of the currently logged-in account, and the terminal acquires the tag information by means of the server.
  • the terminal sends a tag acquisition request to the server, and the tag acquisition request includes the account identifier of the currently logged-in account; the server receives the tag acquisition request; based on the tag acquisition request, obtains the tag information of the currently logged-in account, and sends the tag information to the terminal ; The terminal receives the tag information.
  • the server has stored correspondences between account identifiers and label information of multiple accounts.
  • the server acquires the tag information of the currently logged-in account based on the tag acquisition request, including: the server acquires the account ID from the correspondence between the account ID and the tag information based on the account ID included in the tag acquisition request corresponding label information.
  • the second implementation manner the terminal determines the tag information of the currently logged-in account based on the tags of the historically watched videos of the currently logged-in account.
  • the terminal sends the account ID of the currently logged-in account to the server; the server receives the account ID, and based on the account ID, determines at least A tag of historically watched videos, sending at least one tag to the terminal; the terminal receives at least one tag; the terminal determines the ratio of the number of videos corresponding to each tag to the number of at least one historically watched video, and the first number range in which the ratio is located , determining the score corresponding to the label based on the first quantity range; and composing the label and the score corresponding to the label into label information.
  • the terminal sends a tag acquisition request to the server, and the tag acquisition request includes the account identifier of the currently logged-in account; the server receives the tag acquisition request, and based on the account identifier included in the tag acquisition request, views from the account identifier and history In the corresponding relationship of video tags, determine the tag of at least one historically watched video corresponding to the account identifier, determine the ratio of the number of videos corresponding to each tag to the number of at least one historically watched video, and the first location where the ratio is located.
  • Quantity range Determine the score corresponding to the label based on the first quantity range; combine the label and the score corresponding to the label into label information; send the label information to the terminal; and the terminal receives the label information.
  • the server has stored the corresponding relationship between the account identifiers of multiple accounts and the tags of historically watched videos, and the corresponding relationship between the first number range and the score.
  • the server determines the score corresponding to the label based on the first number range, including: the server determines the first number range from the correspondence between the first number range and the score based on the first number range corresponding score. For example, if the label is a two-dimensional element, the ratio is 0.52, and the first quantity range to which the ratio belongs is 0.5-0.6, then the score corresponding to the first quantity range is 6, and the score corresponding to the label is 6.
  • the terminal will form the tag information of the current login account with the target tag and the score corresponding to the target tag; for example , the target label is landscape, and the score corresponding to the target label is 5.
  • the terminal obtains the registration information, and determines tag information corresponding to the registration information.
  • the registration information includes age, gender or other information input by the user during the account registration process.
  • the terminal determines and stores the corresponding relationship between the registration information and the tag information.
  • the corresponding relationship between registration information and label information can be set and changed according to requirements, and this disclosure does not specifically limit it; for example, if the age is 20, the label information corresponding to this age includes the label "funny" and the score "5". ".
  • the terminal by screening the video frames of the video based on the user's tag information, the video frames conforming to the user's tag information are obtained, so that the terminal can display different preview information for different users during the process of previewing and displaying the video. In this way, the personalized recommendation to the user is realized, and the preview effect of the video is improved.
  • the terminal based on the tag information, extracts a plurality of video frames that match the tag information from the video frame sequence, including the following process: the terminal determines the video tag of each video frame in the video frame sequence; The matching degree between the video label of each video frame and the label information; Based on the matching degree between each video frame and the label information, extract the first number of video frames with the highest matching degree from the video frame sequence.
  • the method for the terminal to determine the video tag of each video frame in the video frame sequence includes: the terminal recognizes the image of each video frame, and determines the style information of the video frame; based on the style information of the video frame, Determines the video tag for this video frame.
  • the style information includes at least one style identifier and a score corresponding to the style identifier; each style identifier indicates a video style.
  • the style identification is two-dimensional, landscape, funny, or other types of style identifications.
  • the score corresponding to the style identifier can represent the degree of correlation between the video frame and the video style indicated by the style identifier; the score is positively correlated with the degree of correlation. For example, the higher the score, the more relevant the video frame is to the video style, that is, the deeper the degree of correlation.
  • the video tag includes at least one tag and a score corresponding to the tag.
  • the terminal determines the video tag of the video frame based on the style information of the video frame, including: the terminal determines a style identifier with a score higher than a style threshold, and combines the style identifier with the score corresponding to the style identifier.
  • Video tags for video frames including: the terminal determines a style identifier with a score higher than a style threshold, and combines the style identifier with the score corresponding to the style identifier.
  • the embodiment of the present disclosure does not specifically limit the value of the style threshold; It should be noted that when the number of style identifiers in the style information is one, the terminal uses the style identifier and the score corresponding to the style identifier as the video tag of the video frame.
  • the method for the terminal to determine the matching degree between the video tag of the video frame and the tag information includes: the terminal acquires the same tag as the video tag of the video frame from the tag information; based on the score of the tag and the The difference between the scores of the video tags determines the second quantitative range to which the difference belongs; and determines the matching degree based on the second quantitative range.
  • the terminal has stored the correspondence between the second number range and the matching degree; in some embodiments, the implementation of the terminal determining the matching degree based on the second number range includes: the terminal based on the second number range, from the second number range In the corresponding relationship with the matching degree, the matching degree corresponding to the second quantity range is determined.
  • the video frame conforming to the tag information of the user is determined, so that the terminal can display the video for different users during the preview display process.
  • Different preview information is used to realize personalized recommendations for users, thereby improving the preview effect of videos.
  • the terminal can execute step 303 according to any of the above implementation methods; or, the terminal can also execute step 303 in different implementation methods based on different video preview interfaces; in some embodiments, in the video preview interface If there is no currently playing video, the terminal extracts multiple video frames from the sequence of video frames based on the tag information of the currently logged in account; Extract multiple video frames in a sequence of frames.
  • a first video matching model is pre-deployed in the terminal, and the terminal extracts video frames by means of the first video matching model based on tag information and a sequence of video frames. Then, based on the tag information, the terminal extracts a plurality of video frames that match the tag information from the video frame sequence, including: the terminal inputs the tag information and the video frame sequence into the first video matching model, and through the first video matching The model outputs a plurality of video frames, and the first video matching model is used to extract a plurality of video frames matching the label information from a sequence of video frames.
  • the first video matching model is trained by the terminal, or is deployed to the terminal after being trained by the server; the embodiment of the present disclosure does not specifically limit the source of the first video matching model.
  • the first video matching model since the first video matching model is trained based on a large amount of sample data, the matching accuracy of the first video matching model is relatively high; the video frame is extracted through the first video matching model, which can improve the matching accuracy. accuracy.
  • the function of the first video matching model is to extract video frames. Therefore, before extracting multiple video frames matching the tag information from the video frame sequence based on the tag information, the terminal converts the video to sequence of video frames.
  • the second video matching model is pre-deployed in the terminal, and the second video matching model is used to extract multiple video frames from the video based on the video tag, that is, the function of the second video matching model is not only to extract video frame, and the video can also be converted into a video frame sequence, then step 302 and step 303 can be replaced by: the terminal inputs tag information and video into the second video matching model, and outputs multiple video frames through the second video matching model.
  • the terminal directly generates the same number of multimedia resources from multiple video frames. Wherein, the terminal generates a fourth number of multimedia resources from multiple video frames; the present disclosure does not specifically limit the setting of the fourth number. In some other embodiments, for different terminal types, the terminal generates multiple video frames with different numbers of multimedia resources; then the implementation of step 304 includes the following process: the terminal determines the terminal type of the terminal; based on the terminal type, the terminal type is acquired The number of matching multimedia resources; multiple video frames are generated to generate this number of multimedia resources.
  • the terminal can determine the quantity of multimedia resources based on the terminal type of the current terminal.
  • the terminal type of the current terminal is a personal computer (Personal Computer, PC) device or a mobile phone.
  • the terminal stores the corresponding relationship between the terminal type and the number of multimedia resources; in some embodiments, the terminal is based on the terminal type, and the implementation of obtaining the number of multimedia resources matched by the terminal type includes: the terminal is based on the terminal type, from In the corresponding relationship between the terminal type and the quantity of multimedia resources, the quantity of multimedia resources matching the terminal type is determined.
  • the corresponding relationship between the terminal type and the quantity of multimedia resources can be set and changed according to requirements, which is not specifically limited in the present disclosure. For example, if the terminal type is a mobile phone, then the number of multimedia resources is 9; for another example, if the terminal type is a PC device, then the number of matching multimedia resources is 100.
  • the terminal sequentially splices the fifth number of video frames to obtain multiple multimedia resources.
  • the fifth number is a ratio of the number of video frames included in the plurality of video frames to the number of multimedia resources. For example, if the number of multiple video frames is 36000 frames and the number of multimedia resources is 100, then the fifth number is 360, that is, the terminal sequentially stitches every 360 video frames to obtain 100 multimedia resources.
  • the manner of video preview can be changed as the user's terminal type changes, thereby improving the diversity of video preview manners.
  • the multimedia resources include any type of static pictures, dynamic preview pictures and video clips.
  • the terminal obtains the number of multimedia resources that match the terminal type in any of the following ways: the terminal generates multiple still pictures from multiple video frames; or, generates multiple video frames from multiple video frames Multiple dynamic preview images; or, generate multiple video clips from multiple video frames.
  • the user can set the type of the multimedia resource through the terminal; or, the terminal uses the target type as the type of the multimedia resource.
  • the target type is a type set in advance by the terminal, and the setting of the target type is not specifically limited in the present disclosure; for example, the target type is a dynamic preview image.
  • the multimedia resource includes multiple types, so that the user can flexibly select the type, thereby improving the selectivity of the video preview mode.
  • the terminal after generating multiple multimedia resources from multiple video frames, the terminal first determines the presentation order of the multiple multimedia resources, and then performs step 305, wherein the presentation order is used to determine the presentation order of the multimedia resources in the video preview interface. placement.
  • the method for the terminal to determine the display order of the multiple multimedia resources includes: the terminal determines the display sequence of the multiple multimedia resources based on the position of the video frame corresponding to each multimedia resource in the video, that is, the multiple multimedia resources Resources are displayed sequentially.
  • the three multimedia resources are respectively multimedia resource 1, multimedia resource 2 and multimedia resource 3.
  • Multimedia resource 1 is generated from the first video frame to the third video frame of the video
  • multimedia resource 2 is generated from the tenth video frame of the video.
  • One video frame to the thirteenth video frame is generated
  • the multimedia resource 3 is generated from the twenty-first video frame to the twenty-third video frame of the video, so the display order of the three multimedia resources is multimedia resource 1 - multimedia resource 2 - multimedia resource 3.
  • the terminal determines the display sequence of the multiple multimedia resources based on the positions of the multiple multimedia resources in the video, so that the display sequence of the multiple multimedia resources is consistent with the playback sequence of the original content of the video.
  • the method for the terminal to determine the display order of the multiple multimedia resources includes: the terminal determines the display order of the multiple multimedia resources based on the similarity of the multiple multimedia resources; The display positions of multimedia resources in this video preview interface are not adjacent.
  • the similarity threshold can be set and changed according to requirements, which is not specifically limited in the present disclosure. In the embodiment of the present disclosure, homogenization of video content is avoided by displaying similar multimedia resources separately.
  • the terminal determines the display order of multiple multimedia resources.
  • the location of the multiple multimedia resources is determined to determine the display order of the multiple multimedia resources; based on the similarity of the multiple multimedia resources, the display sequence of the multiple multimedia resources is adjusted to avoid homogenization of video content.
  • the way of previewing and displaying includes at least one of playing and displaying thumbnails; in the case that the way of previewing and showing includes playing, the implementation of step 305 includes: based on the display order of a plurality of multimedia resources, the terminal displays in the video preview interface The display position of each multimedia resource is determined in , and multiple multimedia resources are played simultaneously at the display position of each multimedia resource.
  • the number of multiple multimedia resources is M ⁇ N
  • the implementation of step 305 includes: based on the display sequence of multiple multimedia resources, the terminal simultaneously previews and presents in the first preview frame in the video preview interface A plurality of multimedia resources, wherein the first preview frame includes M ⁇ N preview frames, each preview frame is used to display a multimedia resource, and both M and N are integers greater than 1.
  • the terminal determines the values of M and N respectively based on the quantities of multiple multimedia resources. For example, if the number of multimedia resources is 100, then M and N are 10 and 10 respectively; for another example, if the number of multimedia resources is 35, then M and N are 5 and 7 respectively.
  • Figure 4. The above picture is a schematic diagram of the cover of the video before the video is previewed and displayed.
  • the text introduction of the video is "[Mixed Cut] Animation Mixed Cut";
  • the first preview frame set includes 8 ⁇ 8 preview frames, and 8 ⁇ 8 multimedia resources are displayed in each of the 8 ⁇ 8 preview frames, so that multiple multimedia resources can be previewed and displayed at the same time.
  • a schematic diagram of a multimedia resource A schematic diagram of a multimedia resource.
  • the terminal can flexibly set the size of the first preview frame based on the number of multimedia resources, thereby improving the flexibility of the video preview method.
  • the terminal can flexibly set the size of the first preview frame based on the number of multimedia resources, thereby improving the flexibility of the video preview method.
  • the first preview frame includes a preview frame for previewing and displaying multiple multimedia resources
  • the size of the multiple preview frames can be a fixed size, or the terminal can also flexibly set the size of the preview frame; in some embodiments, the terminal
  • the implementation of setting the size of the preview frame includes: the terminal determines the matching degree between each multimedia resource and the label information of the current login account, and based on the matching degree, sets the size of the preview frame used to display the multimedia resource, wherein the preview frame Size is positively correlated with fit.
  • the terminal before step 305, the terminal generates a first preview frame.
  • the terminal generating the first preview frame includes: the terminal determining the first layout information, and determining the first preview frame matching the first layout information.
  • the first layout information is the layout information of the terminal system, or user-defined layout information, that is, in the embodiment of the present disclosure, the user can customize the first layout information, so as to realize the customization of the first preview frame, thereby Improved flexibility.
  • the implementation manner for the terminal to generate the first preview frame includes: the terminal determines the video type of the video, the matching first preview frame, and different video types correspond to first preview frames of different structures.
  • video types include animation, entertainment, life, games, etc.
  • animation, entertainment, life, and games respectively correspond to first preview frames of different structures.
  • the selection manner of the first preview frame is enriched.
  • the terminal generating the first preview frame includes: the terminal determining the first preview frame matching the terminal type of the terminal, and different terminal types correspond to first preview frames of different structures.
  • the first preview frame can be adapted to the current terminal, thereby improving the adaptability of the first preview frame.
  • the method for the terminal to generate the first preview frame includes: the terminal determines the first preview frame that matches the display ratio of the video preview interface, where the display ratio is the ratio of the video preview interface to the display screen, and different display ratios correspond to First preview frames of different structures.
  • the first preview frame including a larger number of preview frames is set, so that the first preview frame can display more multimedia resources, so that the overall picture of the video can be viewed through the multimedia resources.
  • the video preview interface in the non-full-screen state is small, and the first preview frame including a small number of preview frames is set to make the interface more tidy.
  • the terminal previews and displays the multiple multimedia resources in turn on the video preview interface based on the presentation order of the multiple multimedia resources.
  • the implementation of step 305 includes: the terminal plays multiple multimedia resources in turn in the video preview interface based on the display sequence of multiple multimedia resources, and displays unplayed Thumbnails for multimedia assets.
  • the terminal when the terminal is playing a multimedia resource, the unplayed multimedia resource is in a non-playing state, and the terminal displays thumbnails of the unplayed multimedia resource in the video preview interface.
  • the three multimedia resources are respectively multimedia resource 1, multimedia resource 2 and multimedia resource 3.
  • the terminal displays thumbnails of unplayed multimedia resources on the video preview interface.
  • the realization method for the terminal to play multiple multimedia resources in turn in the video preview interface includes: based on the display order of multiple multimedia resources, the terminal determines each The display position of the multimedia resource is to display the corresponding multimedia resource at each display position of the multimedia resource; determine the playing sequence of the multiple multimedia resources, and play the multiple multimedia resources in turn based on the playing sequence.
  • the implementation method for the terminal to determine the playing order of multiple multimedia resources includes: the terminal directly determines the display order as the playing order; or, based on the matching degree between each multimedia resource and the label information of the current login account, determine the The playback order of the assets.
  • the higher the matching degree is, the higher the position of the multimedia resource is in the playing order
  • the lower the matching degree is, the lower the position of the multimedia resource is in the playing order, so that the multimedia resource that the user is interested in is played preferentially.
  • step 305 when the way of previewing and displaying includes displaying thumbnails, the implementation of step 305 includes: the terminal determines the display position of each multimedia resource in the video preview interface based on the sequence of displaying multiple multimedia resources , on the display position of each multimedia resource, simultaneously display the thumbnails of multiple multimedia resources.
  • the terminal can preview and display multiple multimedia resources in turn.
  • the operation of previewing and displaying multiple multimedia resources in turn can be understood as quick display, that is, the operation also belongs to the scope of "simultaneous display" in the embodiments of the present disclosure.
  • the display resources of the video preview interface are saved by previewing and displaying multiple multimedia resources in turn.
  • the terminal can directly preview and display multiple multimedia resources in the video preview interface, or the terminal can also preview and display multiple multimedia resources with audio added in the video preview interface.
  • the terminal adds audio information of the video to the multimedia resource of the video, and previews and displays the multimedia resource with the added audio information on the video preview interface.
  • the implementation of previewing and displaying multiple multimedia resources by the terminal includes the following process: the terminal obtains the audio information corresponding to each multimedia resource; adds the audio information to the multimedia resource; in the video preview interface, previews and displays Multimedia resources after adding audio information.
  • the terminal obtains the audio information corresponding to each multimedia resource.
  • the time point corresponding to the video frame in the multimedia resource is determined from the audio corresponding to the video, and the audio frame corresponding to the time point is determined; multiple audio frames corresponding to the same multimedia resource are spliced to obtain the audio information corresponding to the multimedia resource.
  • the terminal synthesizes the audio information and the multimedia resource to obtain the multimedia resource with the audio information added.
  • the terminal previews and displays a preview play button of the multimedia resource in the video preview interface; in response to the preview play button being triggered, the terminal plays the multimedia resource with audio information added.
  • the user can not only preview the screen content of the video, but also preview the audio content corresponding to the video, thereby allowing the user to combine the screen content in the multimedia resource Content and audio content improve video preview by determining whether a video is of interest.
  • the terminal can directly preview and display multiple multimedia resources on the cover of the video at the same time, or the terminal can also preview and display multiple multimedia resources in the floating window of the video preview interface.
  • a floating window is displayed above the cover of the video, and multiple multimedia resources are simultaneously previewed and displayed in the floating window.
  • different display modes are provided, thereby improving the selectivity of the display modes.
  • the number of previewable videos included in the video preview interface is one or more. Take the preview of multiple target videos as an example.
  • the multiple target videos include videos.
  • the terminal For each target video, the terminal generates multiple multimedia resources of the target video according to the above steps 302-305, and simultaneously previews and displays them in the video preview interface. There are multiple multimedia resources of the target video. Since one target video corresponds to multiple multimedia resources, the terminal simultaneously displays multiple multimedia resources of the target video on the video preview interface.
  • the terminal can also layout the multimedia resources of multiple target videos, then the terminal determines the second layout information of the multiple target videos in the video preview interface; determines the second preview frame matched by the second layout information; The multimedia resource displaying multiple target videos is simultaneously previewed in the second preview frame.
  • the second preview frame is a global preview frame.
  • the second preview frame includes a plurality of first preview frames, and each first preview frame is used to display a multimedia resource of a target video.
  • the second layout information is used to indicate that multiple target videos of the video preview interface are arranged side by side, and then the plurality of first preview frames included in the second preview frame are also arranged side by side, so that the second preview frame and the first preview frame are arranged side by side.
  • layout information matching that is, in the process of previewing and displaying multimedia resources, the original layout of the video preview interface is not changed, thereby improving the adaptability of the interface display.
  • Fig. 5 is a flow chart of a video preview method according to an exemplary embodiment. As shown in Fig. 5, the video preview method is executed by a terminal. In the embodiment of the present disclosure, the terminal is based on a video preview request, and in the video preview interface , preview multiple multimedia resources displaying video as an example for illustration, the method includes the following steps.
  • Step 501 The terminal displays a video preview interface, the video preview interface includes a previewable video and a cover of the video, and the cover is used to trigger a video preview request.
  • Step 502 the terminal receives the video preview request.
  • Step 503 The terminal converts the video corresponding to the video preview request into a sequence of video frames.
  • Step 504 the terminal extracts multiple video frames from the video frame sequence.
  • Step 505 The terminal generates multiple multimedia resources from multiple video frames.
  • Step 506 Based on the display sequence of the multiple multimedia resources, the terminal simultaneously previews and displays the multiple multimedia resources in the video preview interface.
  • the video is displayed in the video preview interface in the form of displaying a cover.
  • the terminal executes the operation of step 502 in response to the video preview request triggered based on the cover art.
  • the implementation manner of step 501 is the same as the implementation manner of step 301, which will not be repeated here.
  • the implementation of the terminal receiving the video preview request includes at least one of the following implementations: first implementation: the terminal responds to a mouse hovering operation detected on the cover, and receives the video based on the hovering operation Preview request. Wherein, once the terminal detects the hovering operation, it determines to receive the video preview request, or, when the terminal detects that the hovering duration of the hovering operation reaches the target duration, it determines to receive the video preview request.
  • the target duration can be set and changed according to requirements, which is not specifically limited in the embodiment of the present disclosure; for example, if the target duration is 5 seconds, then the mouse pointer hovers over the cover of the video for 5 seconds Next, the terminal determines that the video preview request is received.
  • the hovering operation on a video indicates that the user intends to watch the video, and the terminal can preview and display the video, thereby improving the accuracy of the video preview.
  • the video preview interface includes a preview entry
  • the terminal receives a video preview request based on the trigger operation in response to the trigger operation based on the preview entry.
  • the preview entry includes a preview button. The user triggers the preview button, and the terminal determines that a video preview request is received.
  • the terminal can set a preview button on the video preview interface; for example, the preview button is set below or above the cover of the video; or, the preview button is set on the cover of the video, and the position of the play button is different from that of the video .
  • the user triggers a preview button through the terminal, and the terminal determines that a video preview request is received.
  • the operation of the user triggering the video preview entry through the terminal indicates that the user has an intention to watch the video, so that the terminal can determine that a video preview request has been received, which improves the accuracy of the video preview.
  • the implementation manner of step 503 is the same as the implementation manner of the terminal converting the video into a video frame sequence in step 302, which will not be repeated here.
  • the terminal converts the video in the video preview interface into a video frame sequence only after receiving the video preview request, thereby avoiding the need to store video frame sequences of multiple videos and causing the storage space of the terminal to be reduced. The occurrence of excessive occupancy further reduces the storage pressure of the terminal.
  • steps 504-506 is the same as the implementation of steps 303-305, which will not be repeated here.
  • the video frame of the video can reflect the content of the video
  • multiple multimedia resources generated based on multiple video frames of the video can fuse the content of the video.
  • the user can preview the video content provided by multiple multimedia resources before watching the video, so as to have a pre-understanding of the whole picture of the video, thereby improving the video preview effect.
  • Fig. 6 is a flow chart showing a video preview method according to an exemplary embodiment. As shown in Fig. 6, the video preview method is executed by a terminal, and the method includes the following steps.
  • Step 601 The terminal displays a video preview interface, and the video preview interface displays a cover of the video.
  • Step 602 In response to the video preview operation based on the cover, the terminal displays a first preview frame corresponding to the video on the video preview interface.
  • the first preview frame includes a plurality of multimedia resources, and each multimedia resource is represented by the video in the video. At least one video frame of is generated.
  • the method also includes:
  • the video preview interface includes a preview entry, and in response to a trigger operation based on the preview entry, it is determined that the video preview operation is detected.
  • Fig. 7 is a flowchart of a video preview method according to an exemplary embodiment. As shown in Fig. 7, the video preview method is executed by a terminal. In the embodiment of the present disclosure, the terminal is based on the video preview operation, and in the video preview interface Taking multiple multimedia resources displaying videos as an example, the method includes the following steps.
  • Step 701 The terminal displays a video preview interface, and the video preview interface displays a cover of the video.
  • Step 702 In response to the video preview operation based on the cover, the terminal displays the first preview frame corresponding to the video on the video preview interface, the first preview frame includes multiple multimedia resources, and each multimedia resource consists of At least one video frame is generated.
  • step 701 is the same as the implementation manner of step 501, which will not be repeated here.
  • the first preview frame includes M ⁇ N preview frames, each preview frame is used to display a multimedia resource, and both M and N are integers greater than 1.
  • steps 302-304 for the process of generating the first preview frame and multiple multimedia resources by the terminal which will not be repeated here.
  • the implementation manner of the terminal detection video preview operation is the same as the implementation manner of step 502, and will not be repeated here.
  • the multiple multimedia resources displayed in the first preview frame can be integrated with the content of the video.
  • the user can Before watching a video, the video content provided by multiple multimedia resources can be previewed, so as to know the whole picture of the video in advance, thereby improving the video preview effect.
  • Fig. 8 is a block diagram of a video preview device 800 according to an exemplary embodiment.
  • the apparatus 800 includes a first presentation unit 801 and a generation unit 802 .
  • the first display unit 801 is configured to display a video preview interface, where the video preview interface includes previewable videos;
  • the generating unit 802 is configured to extract multiple video frames from the video, and generate multiple multimedia resources from the multiple video frames;
  • the first display unit 801 is configured to simultaneously preview and display multiple multimedia resources in the video preview interface based on the display sequence of the multiple multimedia resources.
  • generating unit 802 includes:
  • a conversion subunit configured to convert the video into a sequence of video frames
  • the obtaining subunit is configured to obtain the label information of the current login account
  • the extracting subunit is configured to extract a plurality of video frames matching the tag information from the sequence of video frames based on the tag information.
  • the extraction subunit is configured to determine the video label of each video frame in the video sequence; determine the matching degree between the video label of each video frame and the label information; based on each video frame According to the matching degree between the label information and the video frame sequence, the first number of video frames with the highest matching degree are extracted.
  • the extraction subunit is configured to recognize each video frame image, determine the style information of the video frame; and determine the video tag of the video frame based on the style information of the video frame.
  • generating unit 802 includes:
  • a conversion subunit configured to convert the video into a sequence of video frames
  • the second extracting subunit is configured to extract a target video frame every interval of a second number of video frames in the sequence of video frames to obtain a plurality of video frames.
  • the conversion subunit is configured to divide the video into a plurality of video segments based on the sampling duration; extract a third number of video frames from each video segment; extract the video frames from each video segment Video frames make up the sequence of video frames.
  • the first display unit 801 is configured to simultaneously preview and display multiple multimedia resources in the first preview frame in the video preview interface based on the display sequence of the multiple multimedia resources;
  • the number of multiple multimedia resources is M ⁇ N
  • the first preview frame includes M ⁇ N preview frames
  • each preview frame is used to display a multimedia resource
  • both M and N are integers greater than 1.
  • the device also includes:
  • the first determining unit is configured to determine a degree of matching between each multimedia resource and the label information of the current login account; based on the degree of matching, set the size of a preview frame for displaying the multimedia resource.
  • the device also includes:
  • the first determining unit is configured to determine the first preview frame that matches the video type of the video, and different video types correspond to the first preview frame of different structures; or,
  • the first determining unit is configured to determine the first preview frame matching the terminal type of the terminal, and different terminal types correspond to the first preview frame of different structures; or,
  • the first determination unit is configured to determine the first preview frame that matches the display ratio of the video preview interface, where the display ratio is the ratio of the preview interface to the display screen, and different display ratios correspond to different structures of the first preview frame.
  • the video preview interface includes multiple target videos that can be previewed, and the multiple target videos include the video;
  • the first presentation unit 801 is configured to determine the layout information of the multiple target videos in the video preview interface ; Determine the second preview frame that the layout information matches; preview the multimedia resources showing multiple target videos simultaneously in the second preview frame;
  • the second preview frame includes a plurality of first preview frames, and each first preview frame is used to preview a multimedia resource showing a target video.
  • the first display unit 801 is configured to determine the display position of each multimedia resource in the video preview interface based on the display sequence of the multiple multimedia resources, and at the display position of each multimedia resource, at the same time play multiple multimedia resources; or,
  • the first display unit 801 is configured to play multiple multimedia resources in turn on the video preview interface based on the display sequence of the multiple multimedia resources, and display thumbnails of unplayed multimedia resources.
  • the cover of the video is displayed in the video preview interface, and the cover is used to trigger a video preview request; the device also includes:
  • a receiving unit configured to receive the video preview request
  • the generating unit 802 is configured to extract multiple video frames from the video based on the video preview request, and generate multiple multimedia resources from the multiple video frames.
  • the receiving unit is configured to receive the video preview request based on the hovering operation in response to detecting a mouse hovering operation on the cover; or,
  • the video preview interface includes a preview entry, and the receiving unit is configured to, in response to a trigger operation based on the preview entry, receive the video preview request based on the trigger operation.
  • the device also includes:
  • the first determining unit is configured to determine the display order of multiple multimedia resources based on the position of the video frame corresponding to each multimedia resource in the video; or,
  • the first determination unit is configured to determine the display sequence of multiple multimedia resources based on the similarity between multiple multimedia resources; wherein, the multiple multimedia resources whose similarity is smaller than the similarity threshold are displayed in the video preview interface The locations are not adjacent.
  • the video frame of the video can reflect the content of the video
  • multiple multimedia resources generated based on multiple video frames of the video can fuse the content of the video.
  • the user can preview the video content provided by multiple multimedia resources before watching the video, so as to have a pre-understanding of the whole picture of the video, thereby improving the video preview effect.
  • Fig. 9 is a block diagram of a video preview device 900 according to an exemplary embodiment.
  • the device 900 includes a second display unit 901 .
  • the second display unit 901 is configured to display a video preview interface, and the video preview interface displays a cover of the video;
  • the second presentation unit 901 is configured to display a first preview frame corresponding to the video in the video preview interface in response to the video preview operation based on the cover, the first preview frame includes a plurality of multimedia resources, and each The multimedia resource is generated from at least one video frame in the video.
  • the device also includes:
  • the second determining unit is configured to determine that the video preview operation is detected in response to detecting a mouse hovering operation on the cover; or,
  • the preview interface includes a preview entry, and the second determining unit is configured to determine that the video preview operation is detected in response to a trigger operation based on the preview entry.
  • the multiple multimedia resources displayed in the first preview frame can be integrated with the content of the video.
  • the user can Before watching a video, the video content provided by multiple multimedia resources can be previewed, so as to know the whole picture of the video in advance, thereby improving the video preview effect.
  • Fig. 10 is a block diagram showing a terminal 10 according to an exemplary embodiment.
  • the terminal 10 is: a smart phone, a tablet computer, a notebook computer or a desktop computer, and the like.
  • the terminal 10 is called user equipment, portable terminal, laptop terminal, desktop terminal and other names.
  • the terminal 10 includes: a processor 101 and a memory 102 .
  • the processor 101 includes one or more processing cores, such as a 4-core processor, an 8-core processor, and the like.
  • the processor 101 adopts at least one of DSP (Digital Signal Processing, digital signal processing), FPGA (Field-Programmable Gate Array, field programmable gate array), PLA (Programmable Logic Array, programmable logic array) A form of hardware to achieve.
  • the processor 101 also includes a main processor and a coprocessor, and the main processor is a processor for processing data in a wake-up state, also called a CPU (Central Processing Unit, central processing unit) ;
  • the coprocessor is a low-power processor used to process data in the standby state.
  • the processor 101 is integrated with a GPU (Graphics Processing Unit, image processor), and the GPU is used for rendering and drawing the content that needs to be displayed on the display screen.
  • the processor 101 further includes an AI (Artificial Intelligence, artificial intelligence) processor, where the AI processor is configured to process computing operations related to machine learning.
  • AI Artificial Intelligence, artificial intelligence
  • memory 102 includes one or more non-volatile computer-readable storage media that are non-transitory.
  • the memory 102 also includes high-speed random access memory and non-volatile memory, such as one or more disk storage devices and flash memory storage devices.
  • the non-transitory non-volatile computer-readable storage medium in the memory 102 is used to store at least one instruction, and the at least one instruction is used to be executed by the processor 101 to implement the method embodiments of the present disclosure The provided video preview method.
  • the terminal 10 may optionally further include: a peripheral device interface 103 and at least one peripheral device.
  • the processor 101 , the memory 102 and the peripheral device interface 103 are connected through buses or signal lines.
  • each peripheral device is connected to the peripheral device interface 103 through a bus, a signal line or a circuit board.
  • the peripheral device includes: at least one of a radio frequency circuit 104 , a display screen 105 , a camera component 106 , an audio circuit 107 , a positioning component 108 and a power supply 109 .
  • the peripheral device interface 103 may be used to connect at least one peripheral device related to I/O (Input/Output, input/output) to the processor 101 and the memory 102 .
  • the processor 101, memory 102 and peripheral device interface 103 are integrated on the same chip or circuit board; in some other embodiments, any one of the processor 101, memory 102 and peripheral device interface 103 or The two are implemented on a separate chip or circuit board, which is not limited in this embodiment.
  • the radio frequency circuit 104 is used to receive and transmit RF (Radio Frequency, radio frequency) signals, also called electromagnetic signals.
  • Radio frequency circuitry 104 communicates with communication networks and other communication devices via electromagnetic signals.
  • the radio frequency circuit 104 converts electrical signals into electromagnetic signals for transmission, or converts received electromagnetic signals into electrical signals.
  • the radio frequency circuit 104 includes: an antenna system, an RF transceiver, one or more amplifiers, a tuner, an oscillator, a digital signal processor, a codec chipset, a subscriber identity module card, and the like.
  • the radio frequency circuit 104 communicates with other terminals through at least one wireless communication protocol.
  • the wireless communication protocol includes but is not limited to: World Wide Web, Metropolitan Area Network, Intranet, various generations of mobile communication networks (2G, 3G, 4G and 5G), wireless local area network and/or WiFi (Wireless Fidelity, Wireless Fidelity) network.
  • the radio frequency circuit 104 also includes circuits related to NFC (Near Field Communication, short-range wireless communication), which is not limited in the present disclosure.
  • the display screen 105 is used to display a UI (User Interface, user interface).
  • the UI includes graphics, text, icons, video, and any combination thereof.
  • the display screen 105 also has the ability to collect touch signals on or above the surface of the display screen 105 .
  • the touch signal is input to the processor 101 as a control signal for processing.
  • the display screen 105 is also used to provide virtual buttons and/or virtual keyboards, also called soft buttons and/or soft keyboards.
  • the display screen 105 there is one display screen 105, which is arranged on the front panel of the terminal 10; in other embodiments, there are at least two display screens 105, which are respectively arranged on different surfaces of the terminal 10 or in a folding design; In some embodiments, the display screen 105 is a flexible display screen, and is disposed on a curved surface or a folded surface of the terminal 10 . Even, the display screen 105 is set as a non-rectangular irregular figure, that is, a special-shaped screen. In some embodiments, the display screen 105 is made of LCD (Liquid Crystal Display, liquid crystal display), OLED (Organic Light-Emitting Diode, organic light-emitting diode) and other materials.
  • LCD Liquid Crystal Display, liquid crystal display
  • OLED Organic Light-Emitting Diode, organic light-emitting diode
  • the camera assembly 106 is used to capture images or videos.
  • the camera assembly 106 includes a front camera and a rear camera.
  • the front camera is set on the front panel of the terminal, and the rear camera is set on the back of the terminal.
  • there are at least two rear cameras which are any one of the main camera, depth-of-field camera, wide-angle camera, and telephoto camera, so as to realize the fusion of the main camera and the depth-of-field camera to realize the background blur function.
  • camera assembly 106 also includes a flash.
  • the flash is a single-color temperature flash, and in some embodiments, the flash is a dual-color temperature flash. Dual color temperature flash refers to the combination of warm light flash and cold light flash, which is used for light compensation under different color temperatures.
  • audio circuitry 107 includes a microphone and a speaker.
  • the microphone is used to collect sound waves of the user and the environment, and convert the sound waves into electrical signals and input them to the processor 101 for processing, or input them to the radio frequency circuit 104 to realize voice communication.
  • there are multiple microphones which are respectively arranged in different parts of the terminal 10 .
  • the microphone is an array microphone or an omnidirectional collection microphone.
  • the speaker is used to convert the electrical signal from the processor 101 or the radio frequency circuit 104 into sound waves.
  • the speaker is a conventional membrane speaker, and in some embodiments, the speaker is a piezoelectric ceramic speaker.
  • audio circuitry 107 also includes a headphone jack.
  • the positioning component 108 is used to locate the current geographic location of the terminal 10 to implement navigation or LBS (Location Based Service, location-based service).
  • LBS Location Based Service, location-based service
  • the positioning component 107 is a positioning component based on the GPS (Global Positioning System) of the United States, the Beidou positioning system of China, the Greinus positioning system of Russia or the Galileo positioning system of the European Union.
  • the power supply 109 is used to supply power to various components in the terminal 10 .
  • the power source 109 is alternating current, direct current, disposable batteries, or rechargeable batteries.
  • the rechargeable battery is a wired rechargeable battery or a wireless rechargeable battery.
  • a wired rechargeable battery is a battery charged through a wired line
  • a wireless rechargeable battery is a battery charged through a wireless coil.
  • the rechargeable battery is also used to support fast charging technology.
  • the terminal 10 further includes one or more sensors 110 .
  • the one or more sensors 110 include, but are not limited to: an acceleration sensor 111 , a gyro sensor 112 , a pressure sensor 113 , a fingerprint sensor 114 , an optical sensor 115 and a proximity sensor 116 .
  • the acceleration sensor 111 detects the acceleration on the three coordinate axes of the coordinate system established by the terminal 10 .
  • the acceleration sensor 111 is used to detect the components of the gravitational acceleration on the three coordinate axes.
  • the processor 101 controls the display screen 105 to display the user interface in a landscape view or a portrait view according to the gravitational acceleration signal collected by the acceleration sensor 111 .
  • the acceleration sensor 111 is also used for collecting game or user's motion data.
  • the gyro sensor 112 detects the body orientation and rotation angle of the terminal 10 , and the gyro sensor 112 and the acceleration sensor 111 cooperate to collect 3D actions of the user on the terminal 10 .
  • the processor 101 can realize the following functions: motion sensing (such as changing the UI according to the user's tilt operation), image stabilization during shooting, game control and inertial navigation.
  • the pressure sensor 113 is disposed on a side frame of the terminal 10 and/or a lower layer of the display screen 105 .
  • the pressure sensor 113 can detect the user's grip signal on the terminal 10 , and the processor 101 performs left and right hand recognition or shortcut operation according to the grip signal collected by the pressure sensor 113 .
  • the processor 101 controls the operable controls on the UI interface according to the user's pressure operation on the display screen 105.
  • the operable controls include at least one of button controls, scroll bar controls, icon controls, and menu controls.
  • the fingerprint sensor 114 is used to collect the user's fingerprint, and the processor 101 identifies the identity of the user according to the fingerprint collected by the fingerprint sensor 114, or, the fingerprint sensor 114 identifies the user's identity according to the collected fingerprint.
  • the processor 101 authorizes the user to perform related sensitive operations, such sensitive operations include unlocking the screen, viewing encrypted information, downloading software, making payment, and changing settings.
  • the fingerprint sensor 114 is disposed on the front, back or side of the terminal 10 . When the terminal 10 is provided with a physical button or a manufacturer's logo, the fingerprint sensor 114 is integrated with the physical button or the manufacturer's Logo.
  • the optical sensor 115 is used to collect ambient light intensity.
  • the processor 101 controls the display brightness of the display screen 105 according to the ambient light intensity collected by the optical sensor 115 . Specifically, when the ambient light intensity is high, the display brightness of the display screen 105 is increased; when the ambient light intensity is low, the display brightness of the display screen 105 is decreased.
  • the processor 101 also dynamically adjusts shooting parameters of the camera assembly 106 according to the ambient light intensity collected by the optical sensor 115 .
  • the proximity sensor 116 also called a distance sensor, is usually arranged on the front panel of the terminal 10 .
  • the proximity sensor 116 is used to collect the distance between the user and the front of the terminal 10 .
  • the processor 101 controls the display screen 105 to switch from the bright screen state to the off-screen state; when the proximity sensor 116 detects When the distance between the user and the front of the terminal 10 gradually increases, the processor 101 controls the display screen 105 to switch from the off-screen state to the on-screen state.
  • FIG. 10 does not constitute a limitation on the terminal 10, and may include more or less components than shown in the figure, or combine certain components, or adopt a different component arrangement.
  • a non-volatile computer-readable storage medium including instructions, such as a memory 102 including instructions, the above instructions are executed by the processor 101 of the terminal 10 to complete the video in the above embodiments preview method.
  • the computer-readable storage medium is ROM, random access memory (RAM), CD-ROM, magnetic tape, floppy disk, optical data storage device, and the like.
  • a computer program product including a computer program/instruction, and when the computer program/instruction is executed by the processor 101, the video preview method in the foregoing embodiment is implemented.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

La présente invention, qui relève du domaine technique de l'internet, concerne un procédé de prévisualisation vidéo et un terminal. Le procédé comprend : l'affichage d'une interface de prévisualisation vidéo, l'interface de prévisualisation vidéo comprenant une vidéo pouvant être prévisualisée (S21) ; le fait d'extraire, de la vidéo, une pluralité de trames vidéo, et la génération d'une pluralité de ressources multimédias à partir de la pluralité de trames vidéo (S22) ; et sur la base d'une séquence d'affichage de la pluralité de ressources multimédias, la prévisualisation et l'affichage de manière simultanée de la pluralité de ressources multimédias dans l'interface de prévisualisation vidéo (S23).
PCT/CN2021/133221 2021-08-31 2021-11-25 Procédé de prévisualisation vidéo et terminal WO2023029237A1 (fr)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN202111016537.3 2021-08-31
CN202111016537 2021-08-31
CN202111241441.7A CN114125531B (zh) 2021-08-31 2021-10-25 视频预览方法、装置、终端及存储介质
CN202111241441.7 2021-10-25

Publications (1)

Publication Number Publication Date
WO2023029237A1 true WO2023029237A1 (fr) 2023-03-09

Family

ID=80376564

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/133221 WO2023029237A1 (fr) 2021-08-31 2021-11-25 Procédé de prévisualisation vidéo et terminal

Country Status (2)

Country Link
CN (1) CN114125531B (fr)
WO (1) WO2023029237A1 (fr)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117097954A (zh) * 2023-09-13 2023-11-21 北京饼干科技有限公司 一种视频处理方法、装置、介质及设备

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080005128A1 (en) * 2006-06-30 2008-01-03 Samsung Electronics., Ltd. Method and system for addition of video thumbnail
CN102917270A (zh) * 2011-08-04 2013-02-06 形山科技(深圳)有限公司 一种多视频动态预览方法、装置及系统
CN102915194A (zh) * 2012-11-13 2013-02-06 北京奇艺世纪科技有限公司 基于移动设备的视频预览的实现方法、装置和移动终端
CN103024567A (zh) * 2012-12-06 2013-04-03 广东欧珀移动通信有限公司 一种移动终端视频预览的方法及系统
CN104780417A (zh) * 2015-03-20 2015-07-15 广东欧珀移动通信有限公司 一种预览视频文件的展示方法及移动终端和系统
CN109257611A (zh) * 2017-07-12 2019-01-22 阿里巴巴集团控股有限公司 一种视频播放方法、装置、终端设备和服务器
CN109756767A (zh) * 2017-11-06 2019-05-14 腾讯科技(深圳)有限公司 预览数据播放方法、装置及存储介质
CN111182359A (zh) * 2019-12-30 2020-05-19 咪咕视讯科技有限公司 视频预览方法、视频抽帧方法、视频处理装置及存储介质
CN111641868A (zh) * 2020-05-27 2020-09-08 维沃移动通信有限公司 预览视频生成方法、装置及电子设备

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104238893B (zh) * 2013-06-08 2018-08-03 腾讯科技(深圳)有限公司 一种对视频预览图片进行显示的方法和装置
CN108924626B (zh) * 2018-08-17 2021-02-23 腾讯科技(深圳)有限公司 图片生成方法、装置、设备及存储介质
CN110225390B (zh) * 2019-06-20 2021-07-23 广州酷狗计算机科技有限公司 视频预览的方法、装置、终端及计算机可读存储介质
CN110337011A (zh) * 2019-07-17 2019-10-15 百度在线网络技术(北京)有限公司 视频处理方法、装置及设备

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080005128A1 (en) * 2006-06-30 2008-01-03 Samsung Electronics., Ltd. Method and system for addition of video thumbnail
CN102917270A (zh) * 2011-08-04 2013-02-06 形山科技(深圳)有限公司 一种多视频动态预览方法、装置及系统
CN102915194A (zh) * 2012-11-13 2013-02-06 北京奇艺世纪科技有限公司 基于移动设备的视频预览的实现方法、装置和移动终端
CN103024567A (zh) * 2012-12-06 2013-04-03 广东欧珀移动通信有限公司 一种移动终端视频预览的方法及系统
CN104780417A (zh) * 2015-03-20 2015-07-15 广东欧珀移动通信有限公司 一种预览视频文件的展示方法及移动终端和系统
CN109257611A (zh) * 2017-07-12 2019-01-22 阿里巴巴集团控股有限公司 一种视频播放方法、装置、终端设备和服务器
CN109756767A (zh) * 2017-11-06 2019-05-14 腾讯科技(深圳)有限公司 预览数据播放方法、装置及存储介质
CN111182359A (zh) * 2019-12-30 2020-05-19 咪咕视讯科技有限公司 视频预览方法、视频抽帧方法、视频处理装置及存储介质
CN111641868A (zh) * 2020-05-27 2020-09-08 维沃移动通信有限公司 预览视频生成方法、装置及电子设备

Also Published As

Publication number Publication date
CN114125531A (zh) 2022-03-01
CN114125531B (zh) 2023-02-17

Similar Documents

Publication Publication Date Title
CN111079012B (zh) 直播间推荐方法、装置、存储介质及终端
CN109982102B (zh) 直播间的界面显示方法和系统、以及直播服务器和主播端
CN109729372B (zh) 直播间切换方法、装置、终端、服务器及存储介质
CN112118477B (zh) 虚拟礼物展示方法、装置、设备以及存储介质
CN109327608B (zh) 歌曲分享的方法、终端、服务器和系统
CN109451343A (zh) 视频分享方法、装置、终端及存储介质
CN113411680B (zh) 多媒体资源播放方法、装置、终端及存储介质
CN111026992B (zh) 多媒体资源预览方法、装置、终端、服务器及存储介质
WO2023000677A1 (fr) Procédé et appareil d'affichage d'élément de contenu
CN111711838B (zh) 视频切换方法、装置、终端、服务器及存储介质
CN113204672B (zh) 资源展示方法、装置、计算机设备及介质
CN114245218B (zh) 音视频播放方法、装置、计算机设备及存储介质
CN109618192B (zh) 播放视频的方法、装置、系统和存储介质
CN111368114A (zh) 信息展示方法、装置、设备及存储介质
CN114302160B (zh) 信息显示方法、装置、计算机设备及介质
CN112257006A (zh) 页面信息的配置方法、装置、设备及计算机可读存储介质
CN112131422A (zh) 表情图片生成方法、装置、设备及介质
CN112004134B (zh) 多媒体数据的展示方法、装置、设备及存储介质
CN113609358B (zh) 内容分享方法、装置、电子设备以及存储介质
CN112511889B (zh) 视频播放方法、装置、终端及存储介质
CN112822544B (zh) 视频素材文件生成方法、视频合成方法、设备及介质
CN114186083A (zh) 信息显示方法、装置、终端、服务器及存储介质
WO2023029237A1 (fr) Procédé de prévisualisation vidéo et terminal
CN113190302A (zh) 信息显示方法、装置、电子设备及存储介质
CN112492331B (zh) 直播方法、装置、系统及存储介质

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21955765

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 24/06/2024)