CN115209233B

CN115209233B - Video playing method, related device and equipment

Info

Publication number: CN115209233B
Application number: CN202210731907.XA
Authority: CN
Inventors: 杨博研
Original assignee: Ping An Bank Co Ltd
Current assignee: Ping An Bank Co Ltd
Priority date: 2022-06-25
Filing date: 2022-06-25
Publication date: 2023-08-25
Anticipated expiration: 2042-06-25
Also published as: CN115209233A

Abstract

The application discloses a video playing method, a related device and equipment. The video playing method comprises the following steps: obtaining a target video comprising a video catalog, wherein the video catalog comprises a plurality of catalog summaries corresponding to each video segment of the target video; and responding to the received triggering instruction of the target catalog abstract in the video catalog, and playing the video segment corresponding to the target catalog abstract. By the aid of the scheme, the acquisition efficiency of the user on the target video content information and the convenience of the user for watching the target video can be improved.

Description

Video playing method, related device and equipment

Technical Field

The present application relates to the field of video playing technologies, and in particular, to a video playing method, and related devices and equipment.

Background

With the continuous development of internet technology, people can release various types of video through various internet platforms, for example: electronic device video, food video, make-up video, teaching video, and the like.

Video content is currently used as the most popular information carrier, and has the advantages of various information expressions, and compared with single information such as characters, pictures, sounds and the like, the expression form of the video enables the brain to acquire the information more easily.

However, compared with a text carrier, the video has the problem that when facing a video of several minutes to tens of minutes, people cannot quickly determine the required content position just like facing text content, and need to frequently and blindly adjust a progress bar to search.

Disclosure of Invention

The application provides a video playing method, a related device and equipment, which are used for solving the problem that the position of target content is difficult to quickly determine by video.

The application provides a video playing method, which comprises the following steps: obtaining a target video comprising a video catalog, wherein the video catalog comprises a plurality of catalog summaries corresponding to each video segment of the target video; and responding to the received triggering instruction of the target catalog abstract in the video catalog, and playing the video segment corresponding to the target catalog abstract.

The method comprises the steps of obtaining a target video comprising a video catalog, wherein the video catalog comprises a plurality of catalog summaries corresponding to video segments of the target video, and the method comprises the following steps: acquiring a target video; identifying a target video, and determining a plurality of keywords and a plurality of key pictures of the target video; determining a plurality of dividing nodes of the target video based on the plurality of key pictures and the plurality of keywords; and respectively determining videos between every two adjacent dividing nodes as each video segment of the target video, and determining the catalog abstract of the corresponding video segment based on the keywords corresponding to the two adjacent dividing nodes.

Wherein, obtain the target video, include: acquiring a target video and determining the type of the target video; acquiring a plurality of preset keywords and a plurality of preset key pictures of the target video based on the type of the target video; identifying the target video, determining a plurality of keywords and a plurality of key pictures of the target video, including: performing image recognition on each image frame of the target video, and determining a plurality of key pictures corresponding to the target video in a plurality of preset key pictures; and performing voice recognition on the audio of the target video, and determining a plurality of keywords corresponding to the target video in a plurality of preset keywords.

The method for determining the catalog abstracts of the corresponding video segments based on the keywords corresponding to each two adjacent dividing nodes comprises the following steps: performing word frequency statistical analysis on the video between every two adjacent dividing nodes to determine high-frequency words of the video between every two adjacent dividing nodes; and determining the catalog abstract of the corresponding video segment based on the keywords corresponding to each two adjacent dividing nodes and the semantics of the high-frequency word.

Wherein, in response to receiving a play instruction of a target catalog abstract in the video catalog, playing a video segment corresponding to the target catalog abstract, further comprising: playing the video segment corresponding to the target catalog abstract, and determining whether an adjustment playing instruction of the video segment is received or not in preset time; the method comprises the steps that a playing instruction is adjusted, wherein the playing instruction comprises the step of adjusting the playing progress of a target video; and after the video segments corresponding to the target catalog abstracts are played for a preset time, receiving an adjusting play instruction exceeding a set number, and adjusting the target catalog abstracts corresponding to the video segments based on keywords corresponding to the video segments and semantics of the high-frequency words.

The method for playing the video segments corresponding to the target catalog abstract in response to receiving the triggering instruction of the target catalog abstract in the video catalog comprises the following steps: responding to a search instruction of the received target catalog abstract, and displaying target videos and video catalogs corresponding to the target catalog abstract; and in response to receiving the trigger operation of the target video or the target catalog abstract of the video catalog, determining the trigger operation as a trigger instruction of the target catalog abstract, and playing the video segment corresponding to the target catalog abstract.

The method for playing the video segments corresponding to the target catalog abstract comprises the following steps of: and playing the target video and displaying a video catalog of the target video in response to the obtained playing instruction of the target video.

The method for playing the target video and displaying the video catalog of the target video comprises the following steps: and displaying the video catalogue in a preset area, wherein the preset area comprises one or more of the inner edge of a playing window of the target video, a playing progress bar of the target video and a catalogue display window outside the playing window of the target video.

The method for playing the target video and displaying the video catalog of the target video comprises the following steps: and responding to the received hovering operation of the target catalog abstract of the video catalog, and displaying the thumbnail of the video segment corresponding to the target catalog abstract.

Wherein, in response to receiving a play instruction of the target catalog abstract in the video catalog, playing the video segment corresponding to the target catalog abstract further comprises: counting the playing times or playing frequencies of the video segments corresponding to each catalog abstract in the video catalog; and sequencing the video segments according to the sequence of the playing times or the playing frequency from high to low to obtain the heat feedback data of the target video.

The application also provides an electronic device, which comprises a memory and a processor which are mutually coupled, wherein the processor is used for executing program instructions stored in the memory so as to realize any video playing method.

The application also provides a computer readable storage medium having program instructions stored thereon, which when executed by a processor, implement any of the video playing methods described above.

According to the scheme, the target video comprising the video catalogs is obtained firstly, wherein the video catalogs comprise a plurality of catalogs corresponding to each video segment of the target video, then the video segments corresponding to the target catalogs are played in response to receiving the triggering instruction of the target catalogs in the video catalogs, and the content of each video segment of the target video can be intuitively displayed to a user by utilizing the video catalogs comprising the plurality of catalogs corresponding to each video segment of the target video, so that the user can conveniently and quickly locate the video segments required by the user, and the obtaining efficiency of the user on the content information of the target video and the convenience of the user for watching the target video are greatly improved.

Drawings

FIG. 1 is a flowchart of a video playing method according to an embodiment of the present application;

FIG. 2 is a flowchart of another embodiment of a video playing method according to the present application;

FIG. 3 is a schematic diagram of an implementation of the video catalog of FIG. 2 illustrating target video;

FIG. 4 is a schematic diagram of an embodiment of FIG. 2 showing one implementation of a thumbnail;

FIG. 5 is a flow diagram of an implementation of constructing a video catalog in accordance with any of the above embodiments;

FIG. 6 is a schematic diagram of a frame of an embodiment of an electronic device of the present application;

FIG. 7 is a block diagram of a computer readable storage medium according to an embodiment of the present application.

Detailed Description

The following describes embodiments of the present application in detail with reference to the drawings.

In the following description, for purposes of explanation and not limitation, specific details are set forth such as the particular system architecture, interfaces, techniques, etc., in order to provide a thorough understanding of the present application.

The terms "system" and "network" are often used interchangeably herein. The term "and/or" is merely one association relationship describing the associated object, and three relationships may exist, for example, a and/or B may: a exists alone, A and B exist together, and B exists alone. In addition, the character "/" herein is generally an or relationship between the front and rear related objects. Further, "more" than two or more than two herein.

The video playing method, the related device and the equipment provided by the embodiment of the application can be applied to various video platforms and various fields. For example, in the fields of cooking, hand-making, software, dancing, cosmetic, etc. The embodiment of the application does not limit the video platform, the video field and the content related to the played video.

The embodiment of the application provides a video playing method, a related device and equipment, and relates to the field of artificial intelligence. Artificial intelligence (Artificial Intelligence, AI) is the theory, method, technique and application system that uses a digital computer or a machine controlled by a digital computer to simulate, extend and extend human intelligence, sense the environment, acquire knowledge and use the knowledge to obtain optimal results. In other words, artificial intelligence is an integrated technology of computer science that attempts to understand the essence of intelligence and to produce a new intelligent machine that can react in a similar way to human intelligence. Artificial intelligence, i.e. research on design principles and implementation methods of various intelligent machines, enables the machines to have functions of sensing, reasoning and decision.

The artificial intelligence technology is a comprehensive subject, and relates to the technology with wide fields, namely the technology with a hardware level and the technology with a software level. Artificial intelligence infrastructure technologies generally include technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, big data processing technologies, operation/interaction systems, mechatronics, and the like. The artificial intelligence software technology mainly comprises a computer vision technology, a voice processing technology, a natural language processing technology, machine learning/deep learning, automatic driving, intelligent traffic and other directions.

Embodiments of the present application relate to Computer Vision (CV) and natural language processing (Nature Language processing, NLP). The computer vision is a science for researching how to make a machine "see", and more specifically, a camera and a computer are used to replace human eyes to identify, track and measure targets, and the like, and further, graphic processing is performed, so that the computer is processed into images which are more suitable for human eyes to observe or transmit to an instrument to detect. As a scientific discipline, computer vision research-related theory and technology has attempted to build artificial intelligence systems that can acquire information from images or multidimensional data. Computer vision technologies typically include image processing, image recognition, image semantic understanding, image retrieval, OCR, video processing, video semantic understanding, video content/behavior recognition, three-dimensional object reconstruction, 3D technology, virtual reality, augmented reality, synchronous positioning and mapping, autopilot, intelligent transportation, etc., as well as common biometric technologies such as face recognition, fingerprint recognition, etc.

Natural language processing is an important direction in the fields of computer science and artificial intelligence. It is studying various theories and methods that enable effective communication between a person and a computer in natural language. Natural language processing is a science that integrates linguistics, computer science, and mathematics. Thus, the research in this field will involve natural language, i.e. language that people use daily, so it has a close relationship with the research in linguistics. Natural language processing techniques typically include text processing, semantic understanding, machine translation, robotic questions and answers, knowledge graph techniques, and the like.

Referring to fig. 1, fig. 1 is a flowchart illustrating an embodiment of a video playing method according to the present application.

Step S11: a target video including a video catalog is obtained, wherein the video catalog includes a plurality of catalog summaries corresponding to each video segment of the target video.

Target video including a video catalog is first acquired. The video catalog of the target video may be pre-built before the video playing method of the present embodiment.

The video catalog includes a plurality of catalog summaries corresponding to each video segment of the target video. Where video segmentation refers to a certain segment of video within a target video, not a video other than the target video. The video catalog includes a plurality of catalog summaries, each catalog summary having a different meaning, in one-to-one correspondence with a video segment within the target video, and each catalog summary for expressing the content of the corresponding video segment. All the video segments are combined according to the playing sequence to obtain the complete target video.

For example: when the video directory includes: when the catalog abstracts are started, displayed by equipment and analyzed and summarized by equipment, the catalog abstracts at the beginning correspond to the video segments at the beginning part in the target video; the catalog abstract displayed by the equipment corresponds to the video segment of the equipment display part in the target video; the catalog abstract of the equipment performance analysis corresponds to the video segment of the equipment performance analysis part in the target video; the summarized catalog abstract corresponds to the video segment of the summarized part in the target video.

Step S12: and responding to the received triggering instruction of the target catalog abstract in the video catalog, and playing the video segment corresponding to the target catalog abstract.

And responding to the received triggering instruction of the target catalog abstract in the video catalog, and playing the video segment corresponding to the target catalog abstract.

In a specific application scenario, after a target video including a video catalog is acquired, a video catalog of the target video is played and displayed in response to an instruction for playing the target video, and a video segment corresponding to the target catalog abstract is played in response to an instruction for triggering the target catalog abstract in the video catalog. The triggering instruction of the target catalog abstract may include receiving a triggering operation on the target catalog abstract.

Because the video catalog of the embodiment shows the catalog abstract of each video segment of the target video, the user can know the content of each video segment in the target video based on the catalog abstract, and then select to play the video content required to be played by the user, so that the information acquisition efficiency of the user on the target video content is improved, and the convenience of the user for watching the target video is improved.

In another specific application scenario, after a target video including a video catalog is acquired, a target video corresponding to the target catalog abstract and the video catalog are displayed in response to receiving a search instruction of the target catalog abstract; and in response to receiving the trigger operation of the target video or the target catalog abstract of the video catalog, determining the trigger operation as a trigger instruction of the target catalog abstract, and playing the video segment corresponding to the target catalog abstract.

The application scene directly displays the target video and the video catalogue corresponding to the target catalogue abstract when the user searches the content related to the target catalogue abstract, and directly plays the video segment corresponding to the target catalogue abstract when the user triggers the target video or the target catalogue of the video catalogue, so that the video segment corresponding to the target catalogue abstract in the target video can be directly played based on the user requirement in the user searching stage, thereby greatly improving the acquisition efficiency of the user on the content information of the target video and the convenience of the user for watching the target video.

The triggering operation of the target catalog abstract may include operations such as clicking, double clicking, long pressing, etc. of the mouse on the target catalog abstract, and may specifically be set based on actual situations, which is not limited herein.

Through the steps, the video playing method of the embodiment firstly obtains the target video comprising the video catalogue, wherein the video catalogue comprises a plurality of catalogue summaries corresponding to all video segments of the target video, then plays the video segments corresponding to the target catalogue summaries in response to receiving the triggering instruction of the target catalogue summaries in the video catalogue, and can intuitively display the content of all video segments of the target video to a user by utilizing the video catalogue comprising the plurality of catalogue summaries corresponding to all video segments of the target video, thereby being convenient for the user to quickly locate the video segments required by the user, and greatly improving the obtaining efficiency of the content information of the target video for the user and the convenience of the user for watching the target video.

Referring to fig. 2, fig. 2 is a flowchart of another embodiment of a video playing method according to the present application.

Step S21: a target video including a video catalog is obtained, wherein the video catalog includes a plurality of catalog summaries corresponding to each video segment of the target video.

Target video including a video catalog is first acquired. The video catalog of the target video may be pre-constructed before the video playing method of the present embodiment. The method for constructing the video directory may refer to the embodiment of fig. 5, and will not be described herein.

The step is the same as the step S11 in the foregoing embodiment, please refer to the foregoing, and the description is omitted herein.

Step S22: and playing the target video and displaying a video catalog of the target video in response to the obtained playing instruction of the target video.

And playing the target video and displaying a video catalog of the target video in response to the obtained playing instruction of the target video.

In a specific application scenario, a target video is displayed on a screen of the intelligent terminal, a user clicks, double clicks or other playing operations on relevant positions such as a cover or a title of the target video, and then jumps to a playing website of the target video from a current website, plays the target video, and displays a video catalog of the target video.

And when the target video is played, simultaneously displaying the video catalogue in a preset area, wherein the preset area comprises one or more of the inner edge of a playing window of the target video, a playing progress bar of the target video and a catalogue display window outside the playing window of the target video.

Referring to fig. 3, fig. 3 is a schematic diagram illustrating an implementation of a video catalog of the target video according to the embodiment of fig. 2.

The playback window of video playback interface 330 is playing target video 320. The video catalog 300 may be presented at any one or more of the inner edge of the play window of the target video 320, the play progress bar 310 of the target video 320, and the catalog presentation window outside the play window of the target video 320. The present schematic diagram only illustrates the preset positions, and does not limit the number of positions of the video catalog 300 displayed when the target video is played.

The video catalog 300 includes a plurality of catalog summaries 301, and the catalog summaries 301 are ordered at preset positions according to the playing sequence of the corresponding video segments.

When the video catalog 300 is displayed at any one or more positions in the inner edge of the playing window of the target video 320, the playing progress bar 310 of the target video 320 and the catalog display window outside the playing window of the target video 320, the user can trigger the catalog abstract 301 in the video catalog 300 at the above positions to determine the video segment to be watched, and the playing progress bar jumps to the beginning of the video segment for playing.

In a specific embodiment, when playing the target video and displaying the video catalog of the target video, in response to receiving a hover operation on the target catalog digest of the video catalog, displaying the thumbnail of the video segment corresponding to the target catalog digest. The thumbnail may be a certain image frame or a certain moving picture of the corresponding video segment, which has a meaning of content. For example: when a certain target catalog abstract of the target video of the food type is thickening, the thumbnail is an image frame or a continuous moving picture of a thickening picture of the target video.

When the video catalogue is displayed on the playing progress bar of the target video, the thumbnail of the image frame at the playing progress can be displayed based on the corresponding position of the hovering operation on the playing progress bar, and when the user moves the hovering position on the playing progress bar, the thumbnail corresponding to the playing progress is displayed corresponding to the hovering position.

When a user hovers over the target catalog abstract, thumbnail images of the video segments corresponding to the target catalog abstract are displayed, so that the user can conveniently further judge whether the video segments corresponding to the target catalog abstract are required contents or not, and the acquisition efficiency of the user on the content information of the target video and the convenience of the user for watching the target video are improved.

Referring to fig. 4, fig. 4 is a schematic diagram showing an implementation of the thumbnail image according to the embodiment of fig. 2.

The video catalog 400 of the present embodiment is displayed at the playing progress bar 410, in which a plurality of catalog summaries 401 are sequentially arranged and displayed according to the playing order of the video, and when a user hovers at a certain catalog summary, the thumbnail 402 of the video segment corresponding to the catalog summary is displayed, so that the user can further determine whether the video segment corresponding to the catalog summary is the content required by the user.

Step S23: and responding to the received triggering instruction of the target catalog abstract in the video catalog, and playing the video segment corresponding to the target catalog abstract.

And in response to receiving a trigger instruction of the target catalog abstract in the video catalog, adjusting and increasing a playing progress bar to the beginning of the video segment corresponding to the target catalog abstract, and playing the video segment corresponding to the target catalog abstract.

The triggering operation of the target catalog abstract may include operations such as clicking, double clicking, long pressing, etc. on the target catalog abstract, and may specifically be set based on actual situations, which is not limited herein.

In other embodiments, when the user searches for the target video through the search mode, after the target video including the video catalog is obtained, the target video and the video catalog corresponding to the target catalog abstract may be directly displayed in response to receiving a search instruction of the target catalog abstract; and in response to receiving the trigger operation of the target video or the target catalog abstract of the video catalog, determining the trigger operation as a trigger instruction of the target catalog abstract, and playing the video segment corresponding to the target catalog abstract. I.e. the present embodiment may not perform step S22.

According to the embodiment, when the user searches the content which is the same as the target catalog abstract, the target video and the video catalog corresponding to the target catalog abstract are directly displayed, and when the user triggers the target video or the target catalog of the video catalog, the video segments corresponding to the target catalog abstract are directly played, and in the user searching stage, the video segments corresponding to the target catalog abstract in the target video can be directly played based on the user requirement, so that the acquisition efficiency of the user on the content information of the target video and the convenience of the user for watching the target video are greatly improved.

In a specific embodiment, the playing times or playing frequencies of the video segments corresponding to the summaries of the directories in the video directory can be counted, and the video segments are ordered according to the order from high to low of the playing times or the playing frequencies, so that the heat feedback data of the target video is obtained. The heat feedback data can be fed back to a video producer so that the video producer can analyze market feedback conditions of target videos, and the preparation of subsequent videos is facilitated.

In a specific embodiment, when playing the video segment corresponding to the target catalog abstract, determining whether an adjustment playing instruction of the video segment is received within a preset time; the adjusting playing instruction comprises adjusting the playing progress of the target video. The preset time may include a shorter time of 1, 2, 3 seconds, etc., and may be specifically set based on actual conditions.

And after receiving the adjusting play instruction exceeding the set number after the video segments corresponding to the target catalog abstracts are played for a preset time, adjusting the target catalog abstracts corresponding to the video segments. The number of times of the preset times and the number of the set numbers may be set based on actual conditions, and are not limited herein. For example: after the video segments corresponding to the target catalog abstracts are played for 100 times, the target catalog abstracts corresponding to the video segments can be adjusted after receiving the play adjusting instruction for more than 80 times.

When the user selects to play the video segment corresponding to the target catalog abstract, the play window starts to play the video segment, but the user adjusts the play progress within the preset time, so that the situation that the content of the target catalog abstract and the content of the corresponding video segment possibly do not correspond is indicated. When the probability of occurrence of the situation is too large, the target catalog abstracts corresponding to the video segments are adjusted so as to improve the accuracy of each catalog abstract.

In a specific application scenario, the keyword corresponding to the video segment corresponding to the target catalog abstract and the semantic meaning of the high-frequency word can be adjusted, specifically, the image recognition is performed on the video segment to obtain the keyword corresponding to the video segment, the word frequency statistical analysis is performed on the video segment to obtain the high-frequency word corresponding to the video segment, and the catalog abstract corresponding to the video segment is adjusted based on the keyword and the semantic meaning of the high-frequency word, so that the accuracy of each catalog abstract is improved.

Through the steps, the video playing method of the embodiment firstly obtains the target video comprising the video catalogue, then plays the target video and displays the video catalogue of the target video in response to the obtained playing instruction of the target video, finally plays the video segments corresponding to the target catalogue abstract in response to the received triggering instruction of the target catalogue abstract in the video catalogue, and can intuitively display the content of each video segment of the target video to the user by utilizing the video catalogue comprising a plurality of catalogues corresponding to each video segment of the target video, thereby being convenient for the user to quickly position the video segments required by the user, and greatly improving the obtaining efficiency of the user on the content information of the target video and the convenience of the user for watching the target video.

In other embodiments, the video directory may be constructed as follows:

referring to fig. 5, fig. 5 is a schematic flow chart of an implementation of constructing a video catalog according to any of the above embodiments.

Step S51: and obtaining the target video.

The target video is acquired first. In a specific application scenario, the type of the target video may be determined first, and then a plurality of preset keywords and a plurality of preset key pictures of the target video may be obtained based on the type of the target video.

The type of the target video can be determined by receiving the label of the target video by a video producer or identifying the target video. The types of the target video may include various types of cooking, hand-made, software, dance, make-up, electronic equipment, and the like, which are not limited herein.

Since in practice, there is a certain similarity between the structural frames of the same type of video, the present embodiment may determine a plurality of preset keywords and a plurality of preset key pictures of various types of video in advance. The preset key picture may include a summary picture, and the preset key word may include a feature word. The preset key picture and the preset keyword can be set by operators based on the commonality of videos in a certain field, or can be determined by an intelligent algorithm for the content of a large number of videos in the same field.

For example: when the type of the target video is an electronic device, the preset keywords of the target video may include performance, battery duration, characteristics, and the like, and the preset key pictures of the target video may include device surrounding display, device close-up, scoring tables, and the like. When the type of the target video is makeup, the preset keywords can comprise a cake, shadows, highlights, make-up and the like, and the preset key pictures can comprise facial local close-up, makeup style, make-up display and the like. When the type of the target video is a food, its preset keywords may include start, taste, price, etc., and its preset key picture may include food feature, empty dish, etc. When the type of the target video is a talk lecture class of story, time administration, history, etc., a preset keyword may be set based on the specific domain content of each talk lecture class, and a preset key picture may be set based on the lecturer lecture gesture. And so on, will not be described in detail.

Step S52: and identifying the target video, and determining a plurality of keywords and a plurality of key pictures of the target video.

And identifying the target video, and determining a plurality of keywords and a plurality of key pictures of the target video.

In a specific application scene, image recognition can be performed on each image frame of the target video, and a plurality of key pictures corresponding to the target video in a plurality of preset key pictures are determined; and extracting the characters of each image frame of the target video, and then carrying out semantic analysis to determine a plurality of keywords corresponding to the target video in a plurality of preset keywords. The text extraction, the semantic analysis and the image recognition can be operated through a deep neural network with corresponding functions or manually operated.

In another specific application scene, image recognition can be performed on each image frame of the target video, and a plurality of key pictures corresponding to the target video in a plurality of preset key pictures are determined; and performing voice recognition on the audio of the target video, and determining a plurality of keywords corresponding to the target video in a plurality of preset keywords. The voice recognition and the image recognition can be performed through a deep neural network with corresponding functions or manually.

The implementation main body of the embodiment may be a video background, and the video background may be a terminal device or a server. The video background can perform image recognition, voice recognition or text recognition by calling a mature interface for recognition with the called interface. Upon recognition, semantics in the image are read, thereby determining a plurality of keywords and a plurality of key pictures with the target video. Among them, image recognition, voice recognition or text recognition of video content belongs to a relatively mature technology, and can be implemented by applying various algorithms, which are not described in detail herein. The identification technique of the present embodiment can be performed by a computer vision technique.

The preset keywords and the preset key pictures in the embodiment are equivalent to templates for identifying the target video, so that the target video is divided and the content is determined based on the templates.

Step S53: a plurality of partitioning nodes of the target video is determined based on the plurality of key pictures and the plurality of keywords.

After a plurality of key pictures and a plurality of keywords of the target video are acquired, dividing the target video based on the expression contents of the plurality of key pictures and the plurality of keywords, and determining a plurality of dividing nodes of the target video. There is a certain difference in video content before and after each dividing node.

And when the key picture of a certain section of video does not accord with the expression content of the key word, analyzing the target video by taking the expression content of the key picture as the reference.

Step S54: and respectively determining videos between every two adjacent dividing nodes as each video segment of the target video, and determining the catalog abstract of the corresponding video segment based on the keywords corresponding to the two adjacent dividing nodes.

And respectively determining videos between every two adjacent dividing nodes as video segments of the target video according to the playing sequence of the target video, and determining the catalog abstracts of the corresponding video segments based on the keywords corresponding to the two adjacent dividing nodes. And determining the expression content of the corresponding catalog abstract based on the semantics of the keywords corresponding to each two adjacent dividing nodes.

Specifically, the former partition node of the two adjacent partition nodes is used as the beginning of the corresponding video segment according to the playing sequence of the video, and the latter partition nodes are used as the ending of the corresponding video segment.

In a specific application scene, word frequency statistical analysis can be performed on the video between every two adjacent dividing nodes, and high-frequency words of the video between every two adjacent dividing nodes are determined; and determining the catalog abstract of the corresponding video segment based on the keywords and the high-frequency words corresponding to each two adjacent dividing nodes. For example: when the keyword corresponding to the video between two adjacent dividing nodes is identified as Gorgon juice in the target video of the food type, and the high-frequency word obtained through word frequency statistics analysis is starch, the directory abstract can be determined to be thickening. The step of determining the catalog abstract of the corresponding video segment based on the keywords and the high-frequency words corresponding to each two adjacent dividing nodes can be performed in a natural language processing mode.

If the number of the high-frequency words and the keywords corresponding to the video between each two adjacent partition nodes may be multiple, the directory digests of the corresponding video segments are determined based on the keywords and the high-frequency words corresponding to each two adjacent partition nodes, a plurality of directory digests with different semantics and different confidence degrees may be generated, and the directory digest with the highest confidence degree is determined as the directory digest of the corresponding video segment.

In step S23 of the embodiment of fig. 2, when receiving more than the set number of play adjustment commands, the target catalog digest corresponding to the video segment is adjusted based on the keyword corresponding to the video segment and the high-frequency word, the catalog digest with the second highest confidence level may be determined as the catalog digest of the corresponding video segment, so as to adjust the catalog digest. Or to receive manual adjustments to the catalog digest, without limitation.

The steps of the embodiment can automatically generate the video catalogue through an artificial intelligence technology, so that the efficiency of generating the video catalogue is improved.

Through the steps, the embodiment can determine a plurality of keywords and a plurality of key pictures of the target video by identifying the target video; determining a plurality of dividing nodes of the target video based on the plurality of key pictures and the plurality of keywords; and finally, respectively determining videos between every two adjacent dividing nodes as each video segment of the target video, determining the catalog abstracts of the corresponding video segments based on the keywords corresponding to the two adjacent dividing nodes, obtaining catalog abstracts corresponding to each video segment one by one, and forming a video catalog, so that playing of the target video is assisted by the video catalog, the contents of each video segment of the target video are intuitively displayed to a user by utilizing the video catalog comprising a plurality of catalog abstracts corresponding to each video segment of the target video, and the user can conveniently and quickly position the video segments required by the user, thereby greatly improving the acquisition efficiency of the user on the content information of the target video and the convenience of the user for watching the target video.

In other embodiments, the division and labeling of the target video by the video producer or other personnel may also be accepted, and each catalog abstract and its corresponding video segment in the video catalog of the target video may be determined. And are not limited herein.

Referring to fig. 6, fig. 6 is a schematic diagram of a frame of an electronic device according to an embodiment of the application. The electronic device 60 comprises a memory 61 and a processor 62 coupled to each other, the processor 62 being adapted to execute program instructions stored in the memory 61 for implementing the steps of the video playback method of any of the embodiments described above. In one particular implementation scenario, electronic device 60 may include, but is not limited to: the microcomputer and the server, and the electronic device 60 may also include a mobile device such as a notebook computer and a tablet computer, which is not limited herein.

In particular, the processor 62 is configured to control itself and the memory 61 to implement the steps of any of the video playback method embodiments described above. The processor 62 may also be referred to as a CPU (Central Processing Unit ). The processor 62 may be an integrated circuit chip having signal processing capabilities. The processor 62 may also be a general purpose processor, a digital signal processor (Digital Signal Processor, DSP), an application specific integrated circuit (Application Specific Integrated Circuit, ASIC), a Field programmable gate array (Field-Programmable Gate Array, FPGA) or other programmable logic device, discrete gate or transistor logic device, discrete hardware components. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. In addition, the processor 62 may be commonly implemented by an integrated circuit chip.

By the aid of the scheme, the acquisition efficiency of the user on the target video content information and the convenience of the user for watching the target video can be improved.

Referring to fig. 7, fig. 7 is a schematic diagram of a frame of an embodiment of a computer readable storage medium according to the present application. The computer-readable storage medium 70 stores program instructions 701 capable of being executed by a processor, the program instructions 701 being for implementing the video playing method and the steps of the video playing method of any of the above embodiments.

In the several embodiments provided in the present application, it should be understood that the disclosed method and apparatus may be implemented in other manners. For example, the apparatus embodiments described above are merely illustrative, e.g., the division of modules or units is merely a logical functional division, and may be implemented in other ways, e.g., the units or components may be combined or integrated into another system, or some features may be omitted, or not performed. Alternatively, the coupling or direct coupling or communication connection shown or discussed with each other may be an indirect coupling or communication connection via some interfaces, devices or units, which may be in electrical, mechanical, or other forms.

The elements illustrated as separate elements may or may not be physically separate, and elements shown as elements may or may not be physical elements, may be located in one place, or may be distributed over network elements. Some or all of the units may be selected as needed to achieve the object of the embodiment.

In addition, each functional unit in the embodiments of the present application may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit. The integrated units may be implemented in hardware or in software functional units.

The integrated units, if implemented in the form of software functional units and sold or used as stand-alone products, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present application may be embodied in essence or a part contributing to the prior art or all or part of the technical solution in the form of a software product stored in a storage medium, including several instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) or a processor (processor) to execute all or part of the steps of the methods of the embodiments of the present application. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), a magnetic disk, or an optical disk, or other various media capable of storing program codes.

Claims

1. A video playing method, characterized in that the video playing method comprises:

obtaining a target video comprising a video catalog, wherein the video catalog comprises a plurality of catalog summaries corresponding to each video segment of the target video;

responding to a trigger instruction of receiving a target catalog abstract in the video catalog, and playing a video segment corresponding to the target catalog abstract;

the obtaining a target video including a video catalog, wherein the video catalog includes a plurality of catalog summaries corresponding to each video segment of the target video, and the method includes:

identifying the target video, and determining a plurality of keywords and a plurality of key pictures of the target video; determining a plurality of partition nodes of the target video based on the plurality of key pictures and the plurality of keywords; respectively determining videos between every two adjacent dividing nodes as video segments of the target video, performing word frequency statistical analysis on the videos between every two adjacent dividing nodes, and determining a plurality of high-frequency words of the videos between every two adjacent dividing nodes; determining a plurality of catalog summaries with different confidence degrees of the corresponding video segments based on the keywords corresponding to each two adjacent dividing nodes and the semantics of the plurality of high-frequency words, and determining the catalog summary with the highest confidence degree as the catalog summary of the corresponding video segment;

the responding to the receiving of the triggering instruction of the target catalog abstract in the video catalog, playing the video segment corresponding to the target catalog abstract comprises the following steps:

responding to the received search instruction of the target catalog abstract, displaying target video and video catalog corresponding to the target catalog abstract, determining the triggering operation as the triggering instruction of the target catalog abstract when the triggering operation of the target video or the target catalog abstract of the video catalog is received, and playing video segments corresponding to the target catalog abstract;

and in response to receiving the adjusting play instruction exceeding the set number, determining the directory abstract with the second highest confidence as the directory abstract of the corresponding video segment when the target directory abstract corresponding to the video segment is adjusted based on the keyword corresponding to the video segment and the high-frequency word.

2. The video playing method according to claim 1, wherein the obtaining the target video includes:

acquiring the target video and determining the type of the target video;

acquiring a plurality of preset keywords and a plurality of preset key pictures of the target video based on the type of the target video;

the identifying the target video, determining a plurality of keywords and a plurality of key pictures of the target video, includes:

performing image recognition on each image frame of the target video, and determining a plurality of key pictures corresponding to the target video in the plurality of preset key pictures; and

and carrying out voice recognition on the audio of the target video, and determining a plurality of keywords corresponding to the target video from the plurality of preset keywords.

3. The method for playing video according to claim 1, wherein the playing the video segment corresponding to the target catalog abstract in response to receiving the trigger instruction of the target catalog abstract in the video catalog, further comprises:

playing the video segment corresponding to the target catalog abstract, and determining whether an adjusting playing instruction for the video segment is received or not in preset time; the play adjusting instruction comprises adjusting the play progress of the target video;

and after the video segments corresponding to the target catalog abstracts are played for a preset time, receiving an adjustment playing instruction exceeding a set number, and determining the catalog abstracts with the second highest confidence as the catalog abstracts of the corresponding video segments when the target catalog abstracts corresponding to the video segments are adjusted based on the keywords corresponding to the video segments and the high-frequency words.

4. The video playing method according to claim 1, wherein the displaying the target video and the video directory corresponding to the target directory summary includes:

and displaying the video catalogue in a preset area, wherein the preset area comprises one or more of the inner edge of a playing window of the target video, a playing progress bar of the target video and a catalogue display window outside the playing window of the target video.

5. The video playing method according to claim 1, wherein the displaying the target video and the video directory corresponding to the target directory summary includes:

and responding to the received hovering operation of the target catalog abstract of the video catalog, and displaying the thumbnail of the video segment corresponding to the target catalog abstract.

6. The method of claim 1, wherein, in response to receiving a trigger instruction for a target catalog digest in the video catalog, playing a video segment corresponding to the target catalog digest further comprises:

counting the playing times or playing frequencies of the video segments corresponding to each catalog abstract in the video catalog;

and sequencing the video segments according to the sequence of the playing times or the playing frequency from high to low to obtain the heat feedback data of the target video.

7. An electronic device comprising a memory and a processor coupled to each other, the processor being configured to execute program instructions stored in the memory to implement the video playback method of any one of claims 1 to 6.

8. A computer readable storage medium having stored thereon program instructions, which when executed by a processor, implement the video playback method of any one of claims 1 to 6.