CN115209233A

CN115209233A - Video playing method and related device and equipment

Info

Publication number: CN115209233A
Application number: CN202210731907.XA
Authority: CN
Inventors: 杨博研
Original assignee: Ping An Bank Co Ltd
Current assignee: Ping An Bank Co Ltd
Priority date: 2022-06-25
Filing date: 2022-06-25
Publication date: 2022-10-18
Anticipated expiration: 2042-06-25
Also published as: CN115209233B

Abstract

The application discloses a video playing method, a related device and equipment. The video playing method comprises the following steps: acquiring a target video comprising a video directory, wherein the video directory comprises a plurality of directory abstracts corresponding to video segments of the target video; and responding to a trigger instruction of receiving the target directory abstract in the video directory, and playing the video segment corresponding to the target directory abstract. According to the scheme, the acquisition efficiency of the user on the target video content information and the convenience for the user to watch the target video can be improved.

Description

Video playing method and related device and equipment

Technical Field

The present application relates to the field of video playing technologies, and in particular, to a video playing method, and a related apparatus and device.

Background

With the continuous development of internet technology, people can publish various types of videos through various internet platforms, for example: the video of the electronic equipment, the video of food, the video of makeup and make up, the video of teaching etc.

Video content currently serves as the most popular information carrier, and has the advantage of information expression, and compared with single information such as characters, pictures, sounds and the like, the expression form of the video enables the brain to more easily acquire the information.

However, compared with a text carrier, the video has the problem that people cannot quickly determine the content position required by themselves as the image is to the text content when facing a video of a period of several minutes to tens of minutes, and need to frequently and blindly adjust the progress bar for searching.

Disclosure of Invention

The application provides a video playing method, a related device and equipment, and aims to solve the problem that the position of target content is difficult to determine quickly by a video.

The application provides a video playing method, which comprises the following steps: acquiring a target video comprising a video directory, wherein the video directory comprises a plurality of directory abstracts corresponding to video segments of the target video; and responding to a trigger instruction of receiving the target directory abstract in the video directory, and playing the video segment corresponding to the target directory abstract.

The method for acquiring the target video comprising the video directory, wherein the video directory comprises a plurality of directory abstracts corresponding to video segments of the target video, comprises the following steps: acquiring a target video; identifying a target video, and determining a plurality of keywords and a plurality of key pictures of the target video; determining a plurality of division nodes of the target video based on the plurality of key pictures and the plurality of keywords; and determining the video between each two adjacent division nodes as each video segment of the target video, and determining the directory abstract of the corresponding video segment based on the key words corresponding to each two adjacent division nodes.

The obtaining of the target video comprises: acquiring a target video and determining the type of the target video; acquiring a plurality of preset keywords and a plurality of preset key pictures of a target video based on the type of the target video; identifying a target video, and determining a plurality of keywords and a plurality of key pictures of the target video, wherein the steps of: performing image recognition on each image frame of the target video, and determining a plurality of key pictures corresponding to the target video in a plurality of preset key pictures; and performing voice recognition on the audio of the target video, and determining a plurality of keywords corresponding to the target video in a plurality of preset keywords.

The method for determining the directory abstract of the corresponding video segment based on the keywords corresponding to the two adjacent division nodes comprises the following steps: performing word frequency statistical analysis on the video between every two adjacent division nodes, and determining high-frequency words of the video between every two adjacent division nodes; and determining the directory abstract of the corresponding video segment based on the keywords corresponding to the two adjacent division nodes and the semantics of the high-frequency words.

Wherein, in response to receiving a playing instruction of a target directory abstract in the video directory, playing a video segment corresponding to the target directory abstract, further comprising: playing the video segment corresponding to the target directory abstract, and determining whether an adjustment playing instruction for the video segment is received or not within a preset time; the method comprises the steps of adjusting a playing instruction, wherein the step of adjusting the playing instruction comprises adjusting the playing progress of a target video; and after responding to the preset playing of the video segments corresponding to the target directory abstracts, receiving an adjusting playing instruction with the number exceeding the set number, and adjusting the target directory abstracts corresponding to the video segments based on the keywords corresponding to the video segments and the semantics of the high-frequency words.

The method for playing the video segments corresponding to the target directory abstract in response to receiving the trigger instruction of the target directory abstract in the video directory comprises the following steps: in response to a received search instruction of the target directory abstract, displaying a target video and a video directory corresponding to the target directory abstract; and in response to receiving a trigger operation on the target video or the target directory abstract of the video directory, determining the trigger operation as a trigger instruction of the target directory abstract, and playing the video segment corresponding to the target directory abstract.

The method for playing the video segments corresponding to the target directory abstract in the video directory comprises the following steps of in response to a received trigger instruction of the target directory abstract in the video directory, before playing the video segments corresponding to the target directory abstract: and responding to the obtained playing instruction of the target video, playing the target video and displaying the video directory of the target video.

The video directory for playing the target video and displaying the target video comprises: and displaying the video directory in a preset area, wherein the preset area comprises one or more of an inner edge of a playing window of the target video, a playing progress bar of the target video and a directory display window outside the playing window of the target video.

The video catalog for playing the target video and displaying the target video comprises: and in response to the received hovering operation on the target directory abstract of the video directory, displaying the thumbnail of the video segment corresponding to the target directory abstract.

Wherein, in response to receiving a playing instruction of the target directory abstract in the video directory, playing the video segment corresponding to the target directory abstract further comprises: counting the playing times or the playing frequency of the video segments corresponding to the directory abstracts in the video directory; and sequencing all the video segments according to the sequence of the playing times or the playing frequency from high to low to obtain the heat feedback data of the target video.

The present application further provides an electronic device, which includes a memory and a processor coupled to each other, wherein the processor is configured to execute program instructions stored in the memory to implement any one of the above video playing methods.

The present application also provides a computer readable storage medium having stored thereon program instructions that, when executed by a processor, implement any of the video playback methods described above.

According to the scheme, the target video comprising the video directory is obtained firstly, wherein the video directory comprises a plurality of directory abstracts corresponding to all video segments of the target video, the video segments corresponding to the target directory abstracts are played in response to the received trigger instruction of the target directory abstracts in the video directory, and the contents of all the video segments of the target video can be visually displayed to a user by using the video directory comprising the plurality of directory abstracts corresponding to all the video segments of the target video, so that the user can conveniently and quickly position the required video segments, and the obtaining efficiency of the user on the target video content information and the convenience of the user in watching the target video are greatly improved.

Drawings

Fig. 1 is a schematic flowchart of an embodiment of a video playing method of the present application;

FIG. 2 is a schematic flowchart of another embodiment of a video playing method of the present application;

FIG. 3 is a diagram illustrating an embodiment of a video directory showing target videos in the embodiment of FIG. 2;

FIG. 4 is a schematic diagram of an embodiment of the embodiment of FIG. 2 showing thumbnails;

FIG. 5 is a flow chart of an embodiment of constructing a video directory according to any of the above embodiments;

FIG. 6 is a block diagram of an embodiment of an electronic device of the present application;

FIG. 7 is a block diagram of an embodiment of a computer-readable storage medium of the present application.

Detailed Description

The following describes in detail the embodiments of the present application with reference to the drawings attached hereto.

In the following description, for purposes of explanation and not limitation, specific details are set forth such as particular system structures, interfaces, techniques, etc. in order to provide a thorough understanding of the present application.

The terms "system" and "network" are often used interchangeably herein. The term "and/or" herein is merely an association describing an associated object, and there may be three relationships, e.g., a and/or B, and: a exists alone, A and B exist simultaneously, and B exists alone. In addition, the character "/" in this document, generally, the former and latter associated objects are in an "or" relationship. Further, herein, "more" than two or more than two.

The video playing method, the related device and the equipment provided by the embodiment of the application can be applied to various video platforms and applied to various fields. Such as cooking, handcrafting, software, dance, make-up, etc. In the embodiment of the present application, no limitation is imposed on a video platform, a video field, and contents related to a played video.

The embodiment of the application provides a video playing method and a related device and equipment, and relates to the field of artificial intelligence. Artificial Intelligence (AI) is a theory, method, technique and application system that uses a digital computer or a machine controlled by a digital computer to simulate, extend and expand human Intelligence, perceive the environment, acquire knowledge and use the knowledge to obtain the best results. In other words, artificial intelligence is a comprehensive technique of computer science that attempts to understand the essence of intelligence and produce a new intelligent machine that can react in a manner similar to human intelligence. Artificial intelligence is the research of the design principle and the implementation method of various intelligent machines, so that the machines have the functions of perception, reasoning and decision making.

The artificial intelligence technology is a comprehensive subject, and relates to the field of extensive technology, namely the technology of a hardware level and the technology of a software level. The artificial intelligence infrastructure generally includes technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, big data processing technologies, operation/interaction systems, mechatronics, and the like. The artificial intelligence software technology mainly comprises a computer vision technology, a voice processing technology, a natural language processing technology, machine learning/deep learning, automatic driving, intelligent traffic and the like.

Embodiments of the present application relate to Computer Vision technology (CV) and Natural Language Processing (NLP). Computer vision is a science for researching how to make a machine "see", and further, it means that a camera and a computer are used to replace human eyes to perform machine vision such as identification, tracking and measurement on a target, and further image processing is performed, so that the computer processing becomes an image more suitable for human eyes to observe or transmitted to an instrument to detect. As a scientific discipline, computer vision research-related theories and techniques attempt to build artificial intelligence systems that can capture information from images or multidimensional data. The computer vision technology generally includes image processing, image recognition, image semantic understanding, image retrieval, OCR, video processing, video semantic understanding, video content/behavior recognition, three-dimensional object reconstruction, 3D technology, virtual reality, augmented reality, synchronous positioning and map building, automatic driving, intelligent transportation and other technologies, and also includes common biometric identification technologies such as face recognition and fingerprint recognition.

Natural language processing is an important direction in the fields of computer science and artificial intelligence. It studies various theories and methods that enable efficient communication between humans and computers using natural language. Natural language processing is a science integrating linguistics, computer science and mathematics. Therefore, the research in this field will involve natural language, i.e. the language that people use everyday, so it is closely related to the research of linguistics. Natural language processing techniques typically include text processing, semantic understanding, machine translation, robotic question and answer, knowledge mapping, and the like.

Referring to fig. 1, fig. 1 is a schematic flowchart illustrating a video playing method according to an embodiment of the present application.

Step S11: the method comprises the steps of obtaining a target video comprising a video directory, wherein the video directory comprises a plurality of directory abstracts corresponding to video segments of the target video.

A target video comprising a video directory is acquired first. The video directory of the target video may be pre-constructed before the video playing method of this embodiment.

The video directory includes a plurality of directory summaries corresponding to respective video segments of the target video. The video segment refers to a certain segment of video in the target video, and is not video outside the target video. The video directory includes a plurality of directory digests, each directory digest having a different meaning and corresponding one-to-one to the video segments within the target video, and each directory digest is used to express the content of the corresponding video segment. All the video segments are combined according to the playing sequence to form the complete target video.

For example: when the video directory includes: when the starting part, the equipment display, the equipment performance analysis and the summarized directory abstract are adopted, the starting directory abstract corresponds to the video segment of the starting part in the target video; the catalog abstract displayed by the equipment corresponds to the video segment of the equipment display part in the target video; the catalog abstract of the equipment performance analysis corresponds to the video segment of the equipment performance analysis part in the target video; the summarized catalog summary corresponds to a video segment of the summarized portion of the target video.

Step S12: and responding to a trigger instruction of receiving the target directory abstract in the video directory, and playing the video segment corresponding to the target directory abstract.

And responding to a trigger instruction of receiving the target directory abstract in the video directory, and playing the video segment corresponding to the target directory abstract.

In a specific application scenario, after a target video including a video directory is acquired, in response to a play instruction for acquiring the target video, the target video is played and the video directory of the target video is displayed, and in response to a trigger instruction for receiving a target directory abstract in the video directory, a video segment corresponding to the target directory abstract is played. The triggering instruction of the target directory abstract may include receiving a triggering operation on the target directory abstract.

Because the video directory of the embodiment shows the directory abstract of each video segment of the target video, the user can know the content of each video segment in the target video based on the directory abstract, and then select to play the video content required to be played, so that the information acquisition efficiency of the user on the target video content is improved, and the convenience of the user in watching the target video is improved.

In another specific application scenario, after a target video including a video directory is acquired, a target video and the video directory corresponding to a target directory abstract can be displayed in response to a search instruction of the received target directory abstract; and in response to receiving a trigger operation on the target video or the target directory abstract of the video directory, determining the trigger operation as a trigger instruction of the target directory abstract, and playing the video segment corresponding to the target directory abstract.

According to the application scene, when a user searches for content related to the target directory abstract, the target video and the video directory corresponding to the target directory abstract are directly displayed, and when the user triggers the target video or the target directory of the video directory, the video segment corresponding to the target directory abstract is directly played.

The triggering operation of the target directory abstract may include clicking, double-clicking, long-pressing, and the like of the target directory abstract by the mouse, and may be specifically set based on an actual situation, which is not limited herein.

Through the steps, the video playing method of the embodiment firstly obtains the target video comprising the video directory, wherein the video directory comprises a plurality of directory abstracts corresponding to each video segment of the target video, and then plays the video segments corresponding to the target directory abstracts in response to the received trigger instruction of the target directory abstracts in the video directory, so that the contents of each video segment of the target video can be visually displayed to a user by using the video directory comprising the plurality of directory abstracts corresponding to each video segment of the target video, the user can conveniently and quickly position the required video segments, and the acquisition efficiency of the user on the target video content information and the convenience for the user to watch the target video are greatly improved.

Referring to fig. 2, fig. 2 is a schematic flowchart illustrating a video playing method according to another embodiment of the present application.

Step S21: the method comprises the steps of acquiring a target video comprising a video directory, wherein the video directory comprises a plurality of directory abstracts corresponding to video segments of the target video.

A target video comprising a video directory is acquired first. The video directory of the target video may be pre-constructed before the video playing method of the embodiment. The method for constructing the video directory can refer to the embodiment shown in fig. 5, and details are not repeated herein.

Here, this step is the same as step S11 in the previous embodiment, please refer to the foregoing, and is not described herein again.

Step S22: and responding to the obtained playing instruction of the target video, playing the target video and displaying the video catalog of the target video.

And responding to the obtained playing instruction of the target video, playing the target video and displaying the video directory of the target video.

In a specific application scene, a target video is displayed on a screen of the intelligent terminal, a user clicks, double clicks or other playing operations on a cover or a title of the target video and other related positions, then, the user jumps to a playing website of the target video from a current website to play the target video, and a video directory of the target video is displayed.

When the target video is played, the video directory is displayed in a preset area at the same time, wherein the preset area comprises one or more of the inner edge of a playing window of the target video, a playing progress bar of the target video and a directory display window outside the playing window of the target video.

Referring to fig. 3, fig. 3 is a schematic diagram illustrating an embodiment of a video directory of a target video shown in fig. 2.

The play window of the video play interface 330 is playing the target video 320. Meanwhile, the video catalog 300 can be presented at any one or more of an edge within the play window of the target video 320, the play progress bar 310 of the target video 320, and a catalog presentation window outside the play window of the target video 320. The schematic diagram only illustrates the preset positions, and does not limit the number of positions for displaying the video directory 300 when the target video is played.

The video directory 300 includes a plurality of directory summaries 301, and the plurality of directory summaries 301 are ordered at a preset position according to the playing sequence of each corresponding video segment.

When the video directory 300 is displayed at any one or more of the inner edge of the playing window of the target video 320, the playing progress bar 310 of the target video 320, and the directory display window outside the playing window of the target video 320, the user may trigger the directory summary 301 in the video directory 300 at the above-mentioned position, determine the video segment to be viewed, and then the playing progress bar jumps to the beginning of the video segment for playing.

In a specific embodiment, when the target video is played and the video directory of the target video is displayed, in response to receiving the hovering operation on the target directory abstract of the video directory, the thumbnail of the video segment corresponding to the target directory abstract is displayed. Wherein, the thumbnail can be a certain image frame or a certain motion picture with more content meaning in the corresponding video segment. For example: when a certain target directory abstract of the food type target video is a starching picture, the thumbnail is an image frame or a continuous dynamic picture of the starching picture of the target video.

When the video directory is displayed on the playing progress bar of the target video, the thumbnail of the image frame at the playing progress can be displayed based on the corresponding position of the hovering operation on the playing progress bar, and when the user moves the hovering position on the playing progress bar, the thumbnail corresponding to the playing progress is also displayed corresponding to the hovering position.

When the user suspends on the target directory abstract, the thumbnails of the video segments corresponding to the target directory abstract are displayed, so that the user can further judge whether the video segments corresponding to the target directory abstract are the content required by the user, and the acquisition efficiency of the user on the target video content information and the convenience of the user in watching the target video are improved.

Referring to fig. 4, fig. 4 is a diagram illustrating an embodiment of a thumbnail image shown in the embodiment of fig. 2.

The video directory 400 of the present embodiment is displayed on the play progress bar 410, wherein a plurality of directory summaries 401 are sequentially displayed in sequence according to the play order of the video, and when a user hovers at a certain directory summary, a thumbnail 402 of a video segment corresponding to the directory summary is displayed, so that the user can further determine whether the video segment corresponding to the directory summary is a content required by the user.

Step S23: and responding to a trigger instruction of the target directory abstract in the received video directory, and playing the video segment corresponding to the target directory abstract.

And responding to a trigger instruction of receiving a target directory abstract in the video directory, increasing a playing progress bar to the beginning of the video segment corresponding to the target directory abstract, and playing the video segment corresponding to the target directory abstract.

In a specific application scenario, after a target video including a video directory is acquired, the video directory of the target video is played and displayed in response to an acquisition instruction of the target video, and a video segment corresponding to a target directory abstract is played in response to a received trigger instruction of the target directory abstract in the video directory. The triggering instruction of the target directory abstract may include receiving a triggering operation on the target directory abstract.

The triggering operation of the target directory abstract may include operations such as clicking, double clicking, long pressing, and the like on the target directory abstract, which may be specifically set based on an actual situation, and is not limited herein.

In other embodiments, when a user searches for a target video in a retrieval manner, the target video including a video directory can be obtained, and then the target video and the video directory corresponding to the target directory abstract are directly displayed in response to receiving a search instruction of the target directory abstract; and in response to receiving a trigger operation on the target video or the target directory abstract of the video directory, determining the trigger operation as a trigger instruction of the target directory abstract, and playing the video segment corresponding to the target directory abstract. That is, the present embodiment may not perform step S22.

According to the method and the device, when a user searches for the content which is the same as the target directory abstract, the target video and the video directory corresponding to the target directory abstract are directly displayed, and when the user triggers the target video or the target directory of the video directory, the video segment corresponding to the target directory abstract in the target video is directly played, so that the video segment corresponding to the target directory abstract in the target video can be directly played based on the user requirement in the user searching stage, and therefore the obtaining efficiency of the user on the target video content information and the convenience of the user in watching the target video are greatly improved.

In a specific embodiment, the playing times or the playing frequencies of the video segments corresponding to the summary of each directory in the video directory may be counted, and the video segments are sorted according to the sequence of the playing times or the playing frequencies from high to low, so as to obtain the heat feedback data of the target video. The heat feedback data can be fed back to a video producer, so that the video producer can analyze the market feedback condition of the target video, and the preparation of the subsequent video is facilitated.

In a specific embodiment, when a video segment corresponding to a target directory abstract is played, whether an adjustment playing instruction for the video segment is received or not can be determined within a preset time; and adjusting the playing instruction comprises adjusting the playing progress of the target video. The preset time may include a short time such as 1, 2, 3 seconds, and may be specifically set based on actual conditions.

And after responding to the preset playing of the video segments corresponding to the target directory abstract, receiving a playing adjusting instruction exceeding the set number, and adjusting the target directory abstract corresponding to the video segments. The number of times of the preset times and the number of the set number may be set based on actual conditions, and are not limited herein. For example: when the video segment corresponding to the target directory abstract is played for 100 times, and the adjustment playing instruction is received for more than 80 times, the target directory abstract corresponding to the video segment can be adjusted.

When the user selects to play the video segment corresponding to the target directory abstract, the playing window starts to play the video segment, but the user adjusts the playing progress within the preset time, which indicates that the content of the target directory abstract may not correspond to the content of the corresponding video segment. When the probability of the occurrence of the above situation is too large, the target directory abstract corresponding to the video segment is adjusted to improve the accuracy of each directory abstract.

In a specific application scenario, the adjustment may be performed based on the keywords corresponding to the video segments corresponding to the target directory abstract and the semantics of the high-frequency words, specifically, the image recognition is performed on the video segments to obtain the keywords corresponding to the video segments, the word-frequency statistical analysis is performed on the video segments to obtain the high-frequency words corresponding to the video segments, and the directory abstract corresponding to the video segments is adjusted based on the semantics of the keywords and the high-frequency words, so as to improve the accuracy of each directory abstract.

Through the steps, the video playing method of the embodiment can be used for visually displaying the content of each video segment of the target video to a user by utilizing the video directory comprising the plurality of directory abstracts corresponding to each video segment of the target video through firstly obtaining the target video comprising the video directory, then responding to the playing instruction of obtaining the target video, playing the target video and displaying the video directory of the target video, and finally responding to the received triggering instruction of the target directory abstract in the video directory and playing the video segment corresponding to the target directory abstract, so that the user can conveniently and quickly locate the video segment required by the user, and the obtaining efficiency of the user on the content information of the target video and the convenience of the user watching the target video are greatly improved.

In other embodiments, the video directory may be constructed as follows:

referring to fig. 5, fig. 5 is a schematic flow chart illustrating an implementation of constructing a video directory according to any of the above embodiments.

Step S51: and acquiring the target video.

The target video is acquired first. In a specific application scenario, the type of the target video may be determined, and then a plurality of preset keywords and a plurality of preset key pictures of the target video may be obtained based on the type of the target video.

The type of the target video can be determined by receiving the label of the video producer on the target video or identifying the target video. The type of the target video may include various types such as cooking, handmade, software, dance, beauty and make-up, and electronic equipment, which are not limited herein.

Since in practice, structural frameworks of videos of the same type have certain similarity, the embodiment may determine a plurality of preset keywords and a plurality of preset key pictures of videos of various types in advance. The preset key picture may include a summarized picture, and the preset keyword may include a feature word. The preset key picture and the preset key words can be set by an operator based on the commonality of videos in a certain field, or can be determined by an intelligent algorithm according to the content of a large number of videos in the same field.

For example: when the type of the target video is electronic equipment, the preset keywords of the target video can comprise performance, battery life, characteristics and the like, and the preset keywords of the target video can comprise equipment surrounding display, equipment close-up, scoring tables and the like. When the type of the target video is makeup, the preset keywords may include pressed powder, shadow, highlight, makeup fixation, and the like, and the preset key picture may include partial close-up of the face, makeup style, makeup display, and the like. When the type of the target video is a food, the preset keywords thereof may include start, taste, price, etc., and the preset key pictures thereof may include close-up of food, empty dish, etc. When the type of the target video is a conversational speech class such as a story, a current affairs, a history, etc., the preset keyword may be set based on the specific domain content of each conversational speech class, and the preset key screen may be set based on the lecture posture of the presenter. And the like, will not be described in detail.

Step S52: the method comprises the steps of identifying a target video, and determining a plurality of keywords and a plurality of key pictures of the target video.

The method comprises the steps of identifying a target video, and determining a plurality of keywords and a plurality of key pictures of the target video.

In a specific application scene, image recognition can be performed on each image frame of a target video, and a plurality of key pictures corresponding to the target video in a plurality of preset key pictures are determined; and extracting characters from each image frame of the target video, and then performing semantic analysis to determine a plurality of keywords corresponding to the target video in a plurality of preset keywords. The character extraction, semantic analysis and image recognition can be operated through a deep neural network with corresponding functions or performed manually.

In another specific application scenario, image recognition can be performed on each image frame of a target video, and a plurality of key pictures corresponding to the target video in a plurality of preset key pictures are determined; and performing voice recognition on the audio of the target video, and determining a plurality of keywords corresponding to the target video in a plurality of preset keywords. The voice recognition and the image recognition can be operated through a deep neural network of a corresponding function or performed manually.

The main body of the implementation of this embodiment may be a video background, which may be a terminal device or a server. The video background can perform image recognition, voice recognition or character recognition by calling a mature interface for recognition. In recognition, the semantics in the image are read, so that a plurality of key words and a plurality of key pictures of the target video are determined. The image recognition, voice recognition or character recognition of the video content belongs to a relatively mature technology, and can be realized by applying various algorithms, and detailed description is not provided herein. The recognition technique of the present embodiment may be performed by a computer vision technique.

The preset keywords and the preset key pictures correspond to a template for identifying the target video, so that the target video is divided and the content is determined based on the template.

Step S53: a plurality of division nodes of the target video are determined based on the plurality of key pictures and the plurality of keywords.

After a plurality of key pictures and a plurality of keywords of a target video are obtained, the target video is divided based on the expression contents of the key pictures and the keywords, and a plurality of division nodes of the target video are determined. Video contents before and after each division node are different.

When the key picture of a certain video does not accord with the expression content of the keyword, the expression content of the key picture is taken as the reference to analyze the target video.

Step S54: and determining the video between each two adjacent division nodes as each video segment of the target video, and determining the directory abstract of the corresponding video segment based on the key words corresponding to each two adjacent division nodes.

And determining the video between every two adjacent division nodes as each video segment of the target video according to the playing sequence of the target video, and determining the directory abstract of the corresponding video segment based on the key words corresponding to every two adjacent division nodes. And determining the expression content of the corresponding directory abstract based on the semantics of the keywords corresponding to the two adjacent division nodes.

Specifically, according to the playing sequence of the video, the former division node of two adjacent division nodes is used as the beginning of the corresponding video segment, and the latter division node is used as the end of the corresponding video segment.

In a specific application scene, word frequency statistical analysis can be further performed on the video between every two adjacent division nodes, and high-frequency words of the video between every two adjacent division nodes are determined; and determining the directory abstract of the corresponding video segment based on the keywords and the high-frequency words corresponding to the two adjacent division nodes. For example: when the key word corresponding to the video between two adjacent division nodes is identified as gorgon euryale juice and the high-frequency word obtained through the word frequency statistical analysis is starch in the target video of the food type, the catalogue abstract can be determined as the gorgon euryale. The step of determining the directory abstract of the corresponding video segment based on the keywords and the high-frequency words corresponding to the two adjacent division nodes can be performed in a natural language processing mode.

The method comprises the steps of obtaining a high-frequency word corresponding to a video between two adjacent division nodes, determining a directory abstract of a corresponding video segment based on the high-frequency word and the high-frequency word corresponding to the two adjacent division nodes, possibly generating a plurality of directory abstracts with different semantics and different confidence degrees, and determining the directory abstract with the highest confidence degree as the directory abstract of the corresponding video segment.

In step S23 in the embodiment of fig. 2, when an adjustment playing instruction exceeding the set number is received and the target directory summary corresponding to the video segment is adjusted based on the keyword corresponding to the video segment and the high-frequency word, the directory summary with the second highest confidence may be determined as the directory summary of the corresponding video segment to adjust the directory summary. Or adjust the catalog abstract manually, which is not limited herein.

The steps of the embodiment can automatically generate the video catalog through an artificial intelligence technology, so that the efficiency of generating the video catalog is improved.

Through the steps, the embodiment can identify the target video and determine a plurality of keywords and a plurality of key pictures of the target video; determining a plurality of division nodes of the target video based on the plurality of key pictures and the plurality of key words; and finally, determining the video between each two adjacent division nodes as each video segment of the target video, determining the directory abstract of the corresponding video segment based on the key words corresponding to each two adjacent division nodes, obtaining the directory abstract corresponding to each video segment one by one, and forming a video directory, so that the playing of the target video is assisted through the video directory, and the contents of each video segment of the target video are visually displayed to a user by using the video directory comprising a plurality of directory abstracts corresponding to each video segment of the target video, so that the user can conveniently and quickly position the required video segment, and the acquisition efficiency of the user on the content information of the target video and the convenience for the user to watch the target video are greatly improved.

In other embodiments, the partitioning and labeling of the target video by the video producer or other human may also be accepted, and each directory summary and its corresponding video segment in the video directory of the target video are determined. And is not limited thereto.

Referring to fig. 6, fig. 6 is a schematic frame diagram of an embodiment of an electronic device according to the present application. The electronic device 60 comprises a memory 61 and a processor 62 coupled to each other, and the processor 62 is configured to execute program instructions stored in the memory 61 to implement the steps of the video playing method according to any of the embodiments described above. In one particular implementation scenario, the electronic device 60 may include, but is not limited to: a microcomputer, a server, and in addition, the electronic device 60 may also include a mobile device such as a notebook computer, a tablet computer, and the like, which is not limited herein.

In particular, the processor 62 is configured to control itself and the memory 61 to implement the steps of any of the above-described embodiments of the video playing method. The processor 62 may also be referred to as a CPU (Central Processing Unit). The processor 62 may be an integrated circuit chip having signal processing capabilities. The Processor 62 may also be a general purpose Processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other Programmable logic device, discrete Gate or transistor logic, discrete hardware components. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. In addition, the processor 62 may be collectively implemented by an integrated circuit chip.

By the scheme, the acquisition efficiency of the user on the content information of the target video and the convenience for watching the target video by the user can be improved.

Referring to fig. 7, fig. 7 is a block diagram illustrating an embodiment of a computer readable storage medium according to the present application. The computer readable storage medium 70 stores program instructions 701 executable by the processor, the program instructions 701 being for implementing the video playback method and the steps of the video playback method of any of the embodiments described above.

In the several embodiments provided in the present application, it should be understood that the disclosed method and apparatus may be implemented in other manners. For example, the above-described apparatus embodiments are merely illustrative, e.g., a division of modules or units into only one type of logical division, and additional divisions may be achieved, e.g., units or components may be combined or integrated into another system, or some features may be omitted, or not performed. In addition, the shown or discussed coupling or direct coupling or communication connection between each other may be through some interfaces, indirect coupling or communication connection between devices or units, and may be in an electrical, mechanical or other form.

Units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on network elements. Some or all of the units can be selected according to the needs to achieve the purpose of the embodiment.

In addition, functional units in the embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit may be implemented in the form of hardware, or may also be implemented in the form of a software functional unit.

The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solutions of the present application, which are essential or contributing to the prior art, or all or part of the technical solutions may be embodied in the form of a software product, which is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) or a processor (processor) to execute all or part of the steps of the methods of the embodiments of the present application. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk, and various media capable of storing program codes.

Claims

1. A video playing method, the video playing method comprising:

acquiring a target video comprising a video directory, wherein the video directory comprises a plurality of directory abstracts corresponding to video segments of the target video;

and responding to a trigger instruction of receiving a target directory abstract in the video directory, and playing a video segment corresponding to the target directory abstract.

2. The method of claim 1, wherein the obtaining a target video comprising a video directory, wherein the video directory comprises a plurality of directory summaries corresponding to video segments of the target video, comprises:

acquiring the target video;

identifying the target video, and determining a plurality of keywords and a plurality of key pictures of the target video;

determining a plurality of division nodes of the target video based on the plurality of key pictures and the plurality of keywords;

and respectively determining the video between each two adjacent division nodes as each video segment of the target video, and determining the directory abstract of the corresponding video segment based on the key words corresponding to each two adjacent division nodes.

3. The video playing method according to claim 2, wherein the obtaining the target video includes:

acquiring the target video and determining the type of the target video;

acquiring a plurality of preset keywords and a plurality of preset key pictures of the target video based on the type of the target video;

the identifying the target video and determining a plurality of keywords and a plurality of key pictures of the target video comprise:

performing image recognition on each image frame of the target video, and determining a plurality of key pictures corresponding to the target video in the plurality of preset key pictures; and

and performing voice recognition on the audio of the target video, and determining a plurality of keywords corresponding to the target video in the plurality of preset keywords.

4. The video playing method according to claim 2, wherein the determining the directory abstract of the corresponding video segment based on the keywords corresponding to the two adjacent division nodes comprises:

performing word frequency statistical analysis on the video between every two adjacent division nodes, and determining high-frequency words of the video between every two adjacent division nodes;

and determining the directory abstract of the corresponding video segment based on the keywords corresponding to the two adjacent division nodes and the semantics of the high-frequency words.

5. The video playing method according to claim 4, wherein said playing the video segment corresponding to the target directory abstract in response to receiving the playing instruction of the target directory abstract in the video directory, further comprises:

playing the video segment corresponding to the target directory abstract, and determining whether an adjustment playing instruction for the video segment is received or not within a preset time; the adjustment of the playing instruction comprises adjustment of the playing progress of the target video;

and after responding to the preset playing of the video segments corresponding to the target directory abstracts, receiving a playing adjusting instruction exceeding a set number, and adjusting the target directory abstracts corresponding to the video segments based on the semantics of the keywords corresponding to the video segments and the high-frequency words.

6. The video playing method according to claim 1, wherein said playing the video segment corresponding to the target directory abstract in response to receiving the trigger instruction of the target directory abstract in the video directory comprises:

in response to receiving a search instruction of the target directory abstract, displaying a target video and a video directory corresponding to the target directory abstract;

and in response to receiving a trigger operation on the target video or a target directory abstract of the video directory, determining the trigger operation as a trigger instruction of the target directory abstract, and playing a video segment corresponding to the target directory abstract.

7. The video playing method according to claim 1, wherein before playing the video segment corresponding to the target directory abstract in response to receiving the trigger instruction of the target directory abstract in the video directory, the method includes:

8. The video playing method according to claim 7, wherein said playing the target video and presenting the video directory of the target video comprises:

displaying the video catalog in a preset area, wherein the preset area comprises one or more of an inner edge of a playing window of the target video, a playing progress bar of the target video and a catalog display window outside the playing window of the target video.

9. The video playing method according to claim 7, wherein said playing the target video and presenting the video directory of the target video comprises:

in response to receiving a hover operation for a target directory summary of the video directory, thumbnails of video segments corresponding to the target directory summary are presented.

10. The video playing method according to claim 1, wherein said playing the video segment corresponding to the target directory abstract in response to receiving the playing instruction of the target directory abstract in the video directory further comprises:

counting the playing times or playing frequency of the video segments corresponding to the directory abstracts in the video directory;

and sequencing all the video segments according to the sequence of the playing times or the playing frequency from high to low to obtain the heat feedback data of the target video.

11. An electronic device comprising a memory and a processor coupled to each other, the processor being configured to execute program instructions stored in the memory to implement the video playback method of any one of claims 1 to 10.

12. A computer-readable storage medium having stored thereon program instructions, characterized in that said program instructions, when executed by a processor, implement the video playback method according to any one of claims 1 to 10.